Query psy16240
Match_columns 284
No_of_seqs 223 out of 467
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 16:23:55 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16240.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16240hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00257 Fascin Fascin-like dom 99.9 5.1E-26 1.1E-30 182.8 16.2 118 108-227 1-119 (119)
2 PF06268 Fascin: Fascin domain 99.9 1.2E-23 2.7E-28 167.6 11.2 111 116-227 1-111 (111)
3 PF06268 Fascin: Fascin domain 99.9 1.1E-21 2.5E-26 156.3 12.2 108 21-144 2-111 (111)
4 cd00257 Fascin Fascin-like dom 99.8 7.1E-19 1.5E-23 141.2 15.1 117 53-186 2-119 (119)
5 PF06229 FRG1: FRG1-like famil 99.4 1.6E-12 3.5E-17 112.8 12.6 109 120-229 3-117 (191)
6 PF06229 FRG1: FRG1-like famil 99.3 3.5E-11 7.5E-16 104.5 11.9 93 94-188 22-117 (191)
7 KOG3962|consensus 98.7 1.5E-07 3.3E-12 82.3 10.6 111 110-222 49-164 (246)
8 smart00791 Agglutinin Amaranth 98.6 2.2E-06 4.9E-11 69.7 13.8 116 110-227 7-137 (139)
9 PF00167 FGF: Fibroblast growt 98.4 7E-06 1.5E-10 66.3 13.4 105 110-216 2-110 (122)
10 KOG3962|consensus 98.4 2E-06 4.4E-11 75.3 9.5 77 95-173 79-156 (246)
11 PF07468 Agglutinin: Agglutini 98.2 4.2E-05 9.1E-10 63.8 13.6 117 111-227 8-151 (153)
12 PF00167 FGF: Fibroblast growt 98.2 6.5E-05 1.4E-09 60.6 14.3 96 153-249 3-103 (122)
13 cd00058 FGF Acidic and basic f 98.1 0.00012 2.7E-09 59.4 12.9 103 112-216 2-108 (123)
14 PF04601 DUF569: Protein of un 98.1 9.3E-05 2E-09 61.3 12.2 99 108-207 7-116 (142)
15 smart00442 FGF Acidic and basi 98.0 0.0003 6.5E-09 57.3 14.1 105 110-216 4-112 (126)
16 cd00058 FGF Acidic and basic f 97.8 0.00033 7.2E-09 56.8 11.6 94 155-249 3-101 (123)
17 smart00442 FGF Acidic and basi 97.7 0.00076 1.6E-08 55.0 12.1 86 153-239 5-91 (126)
18 PF04601 DUF569: Protein of un 97.6 0.0025 5.4E-08 52.9 12.7 100 51-165 6-115 (142)
19 smart00791 Agglutinin Amaranth 97.4 0.005 1.1E-07 50.4 12.3 97 152-249 7-114 (139)
20 KOG3885|consensus 97.3 0.0021 4.6E-08 54.1 9.6 85 153-238 28-114 (155)
21 PF07468 Agglutinin: Agglutini 96.9 0.029 6.3E-07 47.0 12.2 115 153-269 8-152 (153)
22 KOG3885|consensus 96.8 0.017 3.7E-07 48.6 10.5 102 111-214 28-136 (155)
23 PF14200 RicinB_lectin_2: Rici 95.7 0.26 5.7E-06 38.0 11.2 68 140-207 3-78 (105)
24 PF14200 RicinB_lectin_2: Rici 94.8 0.47 1E-05 36.6 10.0 66 100-166 3-78 (105)
25 PF05270 AbfB: Alpha-L-arabino 92.7 2.3 5E-05 35.4 11.0 103 111-215 3-123 (142)
26 PF05270 AbfB: Alpha-L-arabino 91.4 5.2 0.00011 33.2 11.6 75 97-173 36-122 (142)
27 KOG0276|consensus 85.4 12 0.00025 38.4 11.4 131 34-177 330-481 (794)
28 PF03498 CDtoxinA: Cytolethal 81.1 23 0.00051 29.6 10.1 100 111-211 2-115 (150)
29 KOG1446|consensus 80.9 20 0.00044 33.5 10.4 77 136-214 174-258 (311)
30 COG3377 Uncharacterized conser 78.1 2 4.4E-05 32.8 2.5 25 224-248 16-40 (95)
31 KOG1446|consensus 71.9 88 0.0019 29.3 12.0 43 17-59 85-131 (311)
32 cd00161 RICIN Ricin-type beta- 71.2 40 0.00087 25.1 10.3 84 117-207 9-100 (124)
33 PF14913 DPCD: DPCD protein fa 69.4 78 0.0017 27.7 10.8 78 88-165 45-135 (194)
34 PF00652 Ricin_B_lectin: Ricin 63.9 56 0.0012 24.7 8.0 93 110-208 4-104 (124)
35 PF03498 CDtoxinA: Cytolethal 51.6 1.3E+02 0.0028 25.2 8.6 59 108-167 51-112 (150)
36 KOG0276|consensus 50.1 3.2E+02 0.0069 28.5 15.0 100 109-211 302-418 (794)
37 PF14913 DPCD: DPCD protein fa 49.0 56 0.0012 28.5 6.1 63 4-66 67-134 (194)
38 cd00028 B_lectin Bulb-type man 47.3 1.2E+02 0.0027 23.4 7.5 22 183-204 88-109 (116)
39 PF14870 PSII_BNR: Photosynthe 43.6 2.8E+02 0.006 25.9 21.5 127 114-243 152-297 (302)
40 PF05588 Botulinum_HA-17: Clos 41.5 1.6E+02 0.0035 24.0 7.2 79 149-231 9-94 (146)
41 PF01690 PLRV_ORF5: Potato lea 39.1 4E+02 0.0088 26.5 14.3 40 94-133 25-65 (465)
42 smart00458 RICIN Ricin-type be 38.0 1.7E+02 0.0036 21.8 9.6 47 116-167 5-53 (117)
43 KOG0649|consensus 34.6 3.7E+02 0.0081 24.8 10.3 89 116-206 117-213 (325)
44 smart00108 B_lectin Bulb-type 30.8 2.5E+02 0.0054 21.6 7.4 20 183-202 87-106 (114)
45 KOG0973|consensus 26.6 5.7E+02 0.012 27.9 10.3 34 138-171 59-92 (942)
46 PF11197 DUF2835: Protein of u 25.4 1.6E+02 0.0034 21.3 4.3 15 109-123 21-35 (68)
47 PF04790 Sarcoglycan_1: Sarcog 24.4 5.5E+02 0.012 23.5 16.7 16 111-126 182-197 (264)
48 cd00161 RICIN Ricin-type beta- 23.2 3.1E+02 0.0066 20.1 7.2 46 180-228 34-83 (124)
49 PLN00033 photosystem II stabil 23.1 7E+02 0.015 24.2 15.1 61 114-176 246-308 (398)
50 KOG2394|consensus 22.6 6.6E+02 0.014 25.7 9.3 41 197-240 329-371 (636)
51 PF08709 Ins145_P3_rec: Inosit 22.3 3.6E+02 0.0077 23.6 6.9 76 152-230 103-208 (214)
52 PF11250 DUF3049: Protein of u 21.1 1.5E+02 0.0033 20.6 3.4 29 168-197 26-54 (56)
53 PLN00115 pollen allergen group 21.0 2.1E+02 0.0045 23.0 4.7 30 100-129 15-47 (118)
No 1
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.94 E-value=5.1e-26 Score=182.78 Aligned_cols=118 Identities=27% Similarity=0.498 Sum_probs=111.6
Q ss_pred CCeEEEEECCCCeEEeecCc-eEEeecCCCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEecCCCCCCCceEEE
Q psy16240 108 DKPSKFKIYSVPLSFLSSGV-DVTANQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQASGDKKSSNALYDL 186 (284)
Q Consensus 108 ~~~V~Lra~nGkyvs~~~~g-~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~l 186 (284)
++||+||+.|||||++.+++ .|.|+++.++++|+|+++. +++|+++||+.+||||+++.+|.|.++++ ++++|.|.+
T Consensus 1 ~p~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e~F~l~~-~~~g~v~Lrs~~G~yls~~~~g~l~~~~~-~~~~e~F~~ 78 (119)
T cd00257 1 RPQVVLRSVNGRYLSAEAGGDKVDANRDSLKGDETFTLEF-DNTGKYALRSHDGKYLSADSDGGVQLEGH-PNADCRFTL 78 (119)
T ss_pred CcEEEEEEcCCCEEEEeccCCEEEEcCccCCCceEEEEEE-CCCCeEEEEECCCcEEEEECCCCEEecCC-CCCCcEEEE
Confidence 48999999999999999998 8999999999999999995 67899999999999999999999999988 899999999
Q ss_pred EEeCCCeEEEEeCCCCeEEEccCceEEEcccCCCCCceEEE
Q psy16240 187 VWQGDGSVALRANNGKFIATKRSGHLYANSDSADDSCRYYF 227 (284)
Q Consensus 187 e~~~~g~v~lra~nGkyl~~~~~G~L~A~~~~~~~~e~F~~ 227 (284)
|++++|+++||++|||||+++++|.|.++++.++++|.|++
T Consensus 79 e~~~~g~~al~~~~G~yl~~~~~g~l~~~~~~~~~~e~f~~ 119 (119)
T cd00257 79 EFHGDGKWALRAENGRYLGGDGSGTLKASSETVGPDELFEL 119 (119)
T ss_pred EECCCCeEEEEcCCCCEEeecCCCeEEEecCCCCccceecC
Confidence 99888999999999999999999999999999999999975
No 2
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.90 E-value=1.2e-23 Score=167.56 Aligned_cols=111 Identities=31% Similarity=0.573 Sum_probs=103.1
Q ss_pred CCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEE
Q psy16240 116 YSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVA 195 (284)
Q Consensus 116 ~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~ 195 (284)
.|+++++.+.+..|.|++...+++|+|+|+++.+...++||+.+||||+++++|.|++++++++++++|+|++++ ++++
T Consensus 1 ~~~~~~~~k~~~~l~an~~~~~~~e~f~le~~~~~~~v~lrs~~GkYls~~~~G~v~~~~~~~~~~~~F~i~~~~-~~~~ 79 (111)
T PF06268_consen 1 ANGYLVSEKFGAHLNANRASLSDWETFQLEFDDGSYKVALRSHNGKYLSVDSDGSVVADSETPGPDEFFEIEWHG-GKVA 79 (111)
T ss_dssp TTEEEEETTCTCBEEEEESSSSCGGSEEEEEETTEEEEEEECTTSEEEEEETTSEEEEEESSSSGGGCBEEEEET-TEEE
T ss_pred CCcEEEEEEcCCEEECChhcCcccEEEEEEEECCCCEEEEEcCCCCEEEEcCCCeEEecCCCCCCCcEEEEEECC-CEEE
Confidence 367788888899999999999999999999876556677999999999999999999999989999999999996 9999
Q ss_pred EEeCCCCeEEEccCceEEEcccCCCCCceEEE
Q psy16240 196 LRANNGKFIATKRSGHLYANSDSADDSCRYYF 227 (284)
Q Consensus 196 lra~nGkyl~~~~~G~L~A~~~~~~~~e~F~~ 227 (284)
++++|||||+++++|+|.|+++.|+++|+|++
T Consensus 80 ~~~~nGkYl~~~~~g~l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 80 LRASNGKYLSAGPNGQLKANATSPGKDELFEY 111 (111)
T ss_dssp EECTTSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred EECCCCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence 99999999999999999999999999999986
No 3
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.87 E-value=1.1e-21 Score=156.26 Aligned_cols=108 Identities=20% Similarity=0.307 Sum_probs=97.9
Q ss_pred cceeee--eccEEEEcccCCCCceeEEEEEeCCCCeEEEEECCCceEEEecCCceeeeccccccCCcceeeeeeecccCc
Q psy16240 21 PLSFLS--SGVDVTANQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGMIEEESRLFSNPSKVRTVLQYLEVKI 98 (284)
Q Consensus 21 ~~~~~~--~g~~v~An~~e~~~~etFqLe~d~~~~r~~LRt~~gky~tl~~~gg~~a~~g~~~s~~~~~~~~~~~~~~~~ 98 (284)
|+.++. ++++|.||+.+++++|+||||++.+..+++||+.+||||+++.+|.+++. ++++++
T Consensus 2 ~~~~~~~k~~~~l~an~~~~~~~e~f~le~~~~~~~v~lrs~~GkYls~~~~G~v~~~----------------~~~~~~ 65 (111)
T PF06268_consen 2 NGYLVSEKFGAHLNANRASLSDWETFQLEFDDGSYKVALRSHNGKYLSVDSDGSVVAD----------------SETPGP 65 (111)
T ss_dssp TEEEEETTCTCBEEEEESSSSCGGSEEEEEETTEEEEEEECTTSEEEEEETTSEEEEE----------------ESSSSG
T ss_pred CcEEEEEEcCCEEECChhcCcccEEEEEEEECCCCEEEEEcCCCCEEEEcCCCeEEec----------------CCCCCC
Confidence 445555 89999999999999999999998776778899999999999988666665 578999
Q ss_pred CeeEEEEEcCCeEEEEECCCCeEEeecCceEEeecCCCCCCceEEE
Q psy16240 99 QYFYGLYCTDKPSKFKIYSVPLSFLSSGVDVTANQEEISDHETFQL 144 (284)
Q Consensus 99 ~E~F~l~~~~~~V~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l 144 (284)
+++|.|++.+.++.|++.|||||++.++|.|.|+.+.++++|.|++
T Consensus 66 ~~~F~i~~~~~~~~~~~~nGkYl~~~~~g~l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 66 DEFFEIEWHGGKVALRASNGKYLSAGPNGQLKANATSPGKDELFEY 111 (111)
T ss_dssp GGCBEEEEETTEEEEECTTSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred CcEEEEEECCCEEEEECCCCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence 9999999999999999999999999999999999999999999986
No 4
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.81 E-value=7.1e-19 Score=141.18 Aligned_cols=117 Identities=20% Similarity=0.313 Sum_probs=102.0
Q ss_pred CeEEEEECCCceEEEecCCceeeeccccccCCcceeeeeeecccCcCeeEEEEEc-CCeEEEEECCCCeEEeecCceEEe
Q psy16240 53 KRWYIRTMQDRYWTLETGGGMIEEESRLFSNPSKVRTVLQYLEVKIQYFYGLYCT-DKPSKFKIYSVPLSFLSSGVDVTA 131 (284)
Q Consensus 53 ~r~~LRt~~gky~tl~~~gg~~a~~g~~~s~~~~~~~~~~~~~~~~~E~F~l~~~-~~~V~Lra~nGkyvs~~~~g~l~A 131 (284)
.++.||+.+|+|++++..+..+.+. ++.++.+|.|.++.. .+.|+||+.|||||++.++|.|.+
T Consensus 2 p~v~Lrs~~gkyl~~~~~g~~v~a~---------------~~~~~~~e~F~l~~~~~g~v~Lrs~~G~yls~~~~g~l~~ 66 (119)
T cd00257 2 PQVVLRSVNGRYLSAEAGGDKVDAN---------------RDSLKGDETFTLEFDNTGKYALRSHDGKYLSADSDGGVQL 66 (119)
T ss_pred cEEEEEEcCCCEEEEeccCCEEEEc---------------CccCCCceEEEEEECCCCeEEEEECCCcEEEEECCCCEEe
Confidence 3689999999999998875344432 467788999998764 788999999999999999899999
Q ss_pred ecCCCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEecCCCCCCCceEEE
Q psy16240 132 NQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQASGDKKSSNALYDL 186 (284)
Q Consensus 132 ~~~~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~l 186 (284)
+.+ ++++|.|+++.. ++|+++||+.+|||++++.+|.|.++++.+++++.|.+
T Consensus 67 ~~~-~~~~e~F~~e~~-~~g~~al~~~~G~yl~~~~~g~l~~~~~~~~~~e~f~~ 119 (119)
T cd00257 67 EGH-PNADCRFTLEFH-GDGKWALRAENGRYLGGDGSGTLKASSETVGPDELFEL 119 (119)
T ss_pred cCC-CCCCcEEEEEEC-CCCeEEEEcCCCCEEeecCCCeEEEecCCCCccceecC
Confidence 998 899999999974 67999999999999999999999999998899998864
No 5
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.44 E-value=1.6e-12 Score=112.84 Aligned_cols=109 Identities=17% Similarity=0.227 Sum_probs=68.8
Q ss_pred eEEeecCceEEeecC-----CCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeE
Q psy16240 120 LSFLSSGVDVTANQE-----EISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSV 194 (284)
Q Consensus 120 yvs~~~~g~l~A~~~-----~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v 194 (284)
||.+-.+|.++...+ .+.+.|+|......++.+++||+++|||++++.+|.|+|+++++++.|.|++.+.+ ++.
T Consensus 3 yi~a~d~G~~t~~ePhd~~~~p~p~qV~va~~v~~~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~~-~~~ 81 (191)
T PF06229_consen 3 YIEALDNGLFTTGEPHDVGEGPDPRQVWVATRVPGDEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQD-GKP 81 (191)
T ss_dssp BEEE-TTS-EEE----SSS----TTT-EEEEE--SSS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-ST-T--
T ss_pred eeeeeccCCccccCCCcCCCCCChhHeEEEEEecCCCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEECC-CCe
Confidence 566666665544432 35578999998765778999999999999999999999999999999999999987 889
Q ss_pred EEEe-CCCCeEEEccCceEEEcccCCCCCceEEEEE
Q psy16240 195 ALRA-NNGKFIATKRSGHLYANSDSADDSCRYYFYL 229 (284)
Q Consensus 195 ~lra-~nGkyl~~~~~G~L~A~~~~~~~~e~F~~~l 229 (284)
++.+ .|++||++++.+.|.|+++.+++.+.+.++.
T Consensus 82 a~~~~~~~~FLs~~~~~~i~a~s~~a~~~e~~~iR~ 117 (191)
T PF06229_consen 82 ALFSSSNNKFLSVDEEGDIRADSKTAGENEMIKIRS 117 (191)
T ss_dssp EEEE-TTS-BEEE-SSS-EEE--S---TTT--EEEE
T ss_pred EEEecCCCeEEEEecccCeeeccccCCCCceEEEEE
Confidence 9998 9999999999888999999999888887764
No 6
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.29 E-value=3.5e-11 Score=104.51 Aligned_cols=93 Identities=14% Similarity=0.144 Sum_probs=60.2
Q ss_pred cccCcCeeEE-EEE-cCCeEEEEECCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCc
Q psy16240 94 LEVKIQYFYG-LYC-TDKPSKFKIYSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGG 170 (284)
Q Consensus 94 ~~~~~~E~F~-l~~-~~~~V~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~ 170 (284)
..|.+.|.|. ... +..+|+||+++||||++...|.|+|++++++++|.|++.+ .+++.++.+ .+++|+++++++.
T Consensus 22 ~~p~p~qV~va~~v~~~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~--~~~~~a~~~~~~~~FLs~~~~~~ 99 (191)
T PF06229_consen 22 EGPDPRQVWVATRVPGDEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVF--QDGKPALFSSSNNKFLSVDEEGD 99 (191)
T ss_dssp ----TTT-EEEEE--SSS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE---STT--EEEE-TTS-BEEE-SSS-
T ss_pred CCCChhHeEEEEEecCCCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEE--CCCCeEEEecCCCeEEEEecccC
Confidence 3577889999 444 5789999999999999999999999999999999999987 567888887 9999999999888
Q ss_pred EEecCCCCCCCceEEEEE
Q psy16240 171 IQASGDKKSSNALYDLVW 188 (284)
Q Consensus 171 v~a~~~~~~~~~~F~le~ 188 (284)
|++++.+++..+.+.|--
T Consensus 100 i~a~s~~a~~~e~~~iR~ 117 (191)
T PF06229_consen 100 IRADSKTAGENEMIKIRS 117 (191)
T ss_dssp EEE--S---TTT--EEEE
T ss_pred eeeccccCCCCceEEEEE
Confidence 999999888888887754
No 7
>KOG3962|consensus
Probab=98.69 E-value=1.5e-07 Score=82.26 Aligned_cols=111 Identities=19% Similarity=0.187 Sum_probs=93.3
Q ss_pred eEEEEECCCCeEEeecCceEEeecC-----CCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEecCCCCCCCceE
Q psy16240 110 PSKFKIYSVPLSFLSSGVDVTANQE-----EISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQASGDKKSSNALY 184 (284)
Q Consensus 110 ~V~Lra~nGkyvs~~~~g~l~A~~~-----~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F 184 (284)
.|.|.-..|-||.+..++.++...+ ++.+.|+|.+... .+.+++|++..|||+++++||.|++.+.++++.++|
T Consensus 49 ~v~ie~~~~~yl~a~dng~ft~g~ph~~~~gp~p~e~f~avki-~dsrIaLKsGyGKYlsinsdglvvg~qeAvG~~EQw 127 (246)
T KOG3962|consen 49 TVAIEIDDGTYLGAMDNGLFTLGAPHDEVDGPEPEEQFMAVKI-SDSRIALKSGYGKYLSINSDGLVVGRQEAVGSREQW 127 (246)
T ss_pred EEEEEecCceEEEEEecCceeeccCCccccCCCchhhEEEEEc-cCceEEecccccceeeecCCccEEEehhhcCcHhhc
Confidence 7888877799999998887655443 3556788888765 457899999999999999999999999999999999
Q ss_pred EEEEeCCCeEEEEeCCCCeEEEccCceEEEcccCCCCC
Q psy16240 185 DLVWQGDGSVALRANNGKFIATKRSGHLYANSDSADDS 222 (284)
Q Consensus 185 ~le~~~~g~v~lra~nGkyl~~~~~G~L~A~~~~~~~~ 222 (284)
..++.. +++++.+.||.|...++.|...+...++...
T Consensus 128 ~~vFq~-~r~a~~as~s~~~~~~e~~d~~va~kt~t~~ 164 (246)
T KOG3962|consen 128 EPVFQE-GRMALLASNSCFIRCNEAGDAEVASKTATEE 164 (246)
T ss_pred hhhhhc-cceEEeeccceeEEechhhchhhhcccCCCC
Confidence 999999 9999999999999998888866665555443
No 8
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=98.58 E-value=2.2e-06 Score=69.74 Aligned_cols=116 Identities=16% Similarity=0.111 Sum_probs=86.7
Q ss_pred eEEEEECCCCeEEeecCc---eEEeecCCCC-CCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEecCCCC----CC
Q psy16240 110 PSKFKIYSVPLSFLSSGV---DVTANQEEIS-DHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQASGDKK----SS 180 (284)
Q Consensus 110 ~V~Lra~nGkyvs~~~~g---~l~A~~~~~~-~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~~~~~----~~ 180 (284)
.|+||+.||+||...... -|-...+.+. +...|+++.. .+|.++||+ .+||||... +..|.|+++.+ ++
T Consensus 7 ~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~-~dg~V~ik~~~~nKfWr~s-~~WI~a~s~d~~e~~ss 84 (139)
T smart00791 7 YVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPT-YNGLVHIKSNYTNKFWRLS-HYWITADANDPDENKSA 84 (139)
T ss_pred EEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEc-CCCcEEEEecCCCceEccC-CCEEEecCCCCccCCCc
Confidence 589999999999987533 3666666654 5677888864 689999996 899999876 56677766654 36
Q ss_pred CceEEEEEeCCCeEEEEe-CCCCeEEEc-cCc----eEEEcccCCCCCceEEE
Q psy16240 181 NALYDLVWQGDGSVALRA-NNGKFIATK-RSG----HLYANSDSADDSCRYYF 227 (284)
Q Consensus 181 ~~~F~le~~~~g~v~lra-~nGkyl~~~-~~G----~L~A~~~~~~~~e~F~~ 227 (284)
.+.|.....+++.++||. .||.|+... ..| -|.|...+++....+.+
T Consensus 85 cTLF~Pv~~d~~~i~lr~vq~~~~~~r~t~~~~~~~CL~A~~~~i~~~a~l~v 137 (139)
T smart00791 85 CTLFRPLYVEMKKIRLLNVQLGHYTKRYTVGKSFVSCLNAASSQIDTKAVLHV 137 (139)
T ss_pred ccEEeEEeccCceEEEEEecCCceEEeecCCCCcccceeeccccccceeeEEe
Confidence 778988887779999997 789988853 222 37777777766555444
No 9
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=98.44 E-value=7e-06 Score=66.26 Aligned_cols=105 Identities=13% Similarity=0.138 Sum_probs=88.5
Q ss_pred eEEEEECCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEecCCCCCCCceEEEEE
Q psy16240 110 PSKFKIYSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQASGDKKSSNALYDLVW 188 (284)
Q Consensus 110 ~V~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~ 188 (284)
.+.|-+.+|+||.+.++|.+.+..+..++...|+++... .|.+.||+ .+++|++++..|.|.++.. ....+.|.-++
T Consensus 2 ~~~Ly~~~~~~L~i~~~g~V~gt~~~~~~~s~~~i~~~~-~g~V~i~~~~s~~YLcmn~~G~ly~~~~-~~~~C~F~e~~ 79 (122)
T PF00167_consen 2 HVQLYCRTGYFLQINPNGTVDGTGDDNSPYSVFEIHSVG-FGVVRIRGVKSCRYLCMNKCGRLYGSKN-FNKDCVFREEL 79 (122)
T ss_dssp EEEEEETTSEEEEEETTSBEEEESSTTSTTGEEEEEEEE-TTEEEEEETTTTEEEEEBTTSBEEEESS-BTGGGEEEEEE
T ss_pred CEEEEECCCeEEEECCCCeEeCCCCcCcceeEEEEEecc-ceEEEEEEecceEEEEECCCCeEccccc-cCCCceEEEEE
Confidence 366778789999999999999999988899999998764 48999997 7899999999999998755 45699999887
Q ss_pred eCCCeEEEEeC---CCCeEEEccCceEEEcc
Q psy16240 189 QGDGSVALRAN---NGKFIATKRSGHLYANS 216 (284)
Q Consensus 189 ~~~g~v~lra~---nGkyl~~~~~G~L~A~~ 216 (284)
.+++-..+.+. .+.||..+++|...-..
T Consensus 80 ~~n~y~~~~s~~~~~~~yla~~~~G~~~~~~ 110 (122)
T PF00167_consen 80 LENGYNTYESAKYGRGWYLAFNRRGKPRRGK 110 (122)
T ss_dssp ETTSEEEEEESTTGTTEBCEBCTTSBBEBGG
T ss_pred ccCCEEEEEeccCCccEEEEECCCCCCCCCC
Confidence 77677777764 58899999999977644
No 10
>KOG3962|consensus
Probab=98.39 E-value=2e-06 Score=75.33 Aligned_cols=77 Identities=13% Similarity=0.088 Sum_probs=67.4
Q ss_pred ccCcCeeEE-EEEcCCeEEEEECCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcEEe
Q psy16240 95 EVKIQYFYG-LYCTDKPSKFKIYSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGIQA 173 (284)
Q Consensus 95 ~~~~~E~F~-l~~~~~~V~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v~a 173 (284)
-|.+.|+|. +.....+|+|++.-||||++.++|.|++.++++++.|+|..++ ..++.+|.+.++.|..++..+.+.+
T Consensus 79 gp~p~e~f~avki~dsrIaLKsGyGKYlsinsdglvvg~qeAvG~~EQw~~vF--q~~r~a~~as~s~~~~~~e~~d~~v 156 (246)
T KOG3962|consen 79 GPEPEEQFMAVKISDSRIALKSGYGKYLSINSDGLVVGRQEAVGSREQWEPVF--QEGRMALLASNSCFIRCNEAGDAEV 156 (246)
T ss_pred CCCchhhEEEEEccCceEEecccccceeeecCCccEEEehhhcCcHhhchhhh--hccceEEeeccceeEEechhhchhh
Confidence 467778877 7778899999999999999999999999999999999999997 5788999999999888876666543
No 11
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=98.24 E-value=4.2e-05 Score=63.77 Aligned_cols=117 Identities=21% Similarity=0.229 Sum_probs=77.8
Q ss_pred EEEEECCCCeEEeecC-c----eEEeecCC-CCCCceEEEEEe-CCCCeEEEEE-cCceeEEEeCCC--cEEecCCCC--
Q psy16240 111 SKFKIYSVPLSFLSSG-V----DVTANQEE-ISDHETFQLEFD-PSTKRWYIRT-MQDRYWTLETGG--GIQASGDKK-- 178 (284)
Q Consensus 111 V~Lra~nGkyvs~~~~-g----~l~A~~~~-~~~~e~F~l~~d-~~~g~~~Lrs-~nGkY~~v~~~G--~v~a~~~~~-- 178 (284)
|.++..|||||++..+ . -|..+.+. ..+...|+++.. ..+|.++||. .+||||...+.+ .|.|++.++
T Consensus 8 V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~~~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada~~p~e 87 (153)
T PF07468_consen 8 VAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSKTHDGLVHIRCCYNNKFWRRSSPNDYWIWADADDPDE 87 (153)
T ss_dssp EEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-SSTTT-EEEEETTTTEEEEESCCC--BEEEEESSHHH
T ss_pred EEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEcccCCCeEEEEeccCCceeEeCCCCCcEEEecCCCccc
Confidence 4555699999998862 2 36666655 456899999972 2579999997 899999975556 788865532
Q ss_pred ----CCCceEEEEEeC---CCeEEEEe-CCCCeEEEc----c-CceEEEcccCCCCCc--eEEE
Q psy16240 179 ----SSNALYDLVWQG---DGSVALRA-NNGKFIATK----R-SGHLYANSDSADDSC--RYYF 227 (284)
Q Consensus 179 ----~~~~~F~le~~~---~g~v~lra-~nGkyl~~~----~-~G~L~A~~~~~~~~e--~F~~ 227 (284)
.+.+.|+....+ ...++|+. .||.|..-- + -.-|.|...+|+... +|++
T Consensus 88 d~s~~~cTLF~Pv~vd~~~~~~i~l~~~~n~~~~~r~t~~~~~~sCL~A~~~~i~~~~~~~~~~ 151 (153)
T PF07468_consen 88 DQSKPSCTLFEPVKVDVKDFNVIALRNMQNGHFCKRLTYGGKFVSCLNAASSSIDREAKDVFTI 151 (153)
T ss_dssp -TCSTCGG-EEEEESCCCETTEEEEEETTTTEEEEEE--STTBSSEEEEEESS--CCGTT-EEE
T ss_pred ccCCCCceEEEEEEecCCCccEEEEEecCCceEEEEEccCCcceeeEeeccCcccccccceEEE
Confidence 356679987653 36789997 799998843 2 345999998887655 6665
No 12
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=98.23 E-value=6.5e-05 Score=60.61 Aligned_cols=96 Identities=16% Similarity=0.412 Sum_probs=79.7
Q ss_pred EEEEEcCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEEEEe-CCCCeEEEccCceEEEcccCCCCCceEEEEEec
Q psy16240 153 WYIRTMQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVALRA-NNGKFIATKRSGHLYANSDSADDSCRYYFYLIN 231 (284)
Q Consensus 153 ~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~L~A~~~~~~~~e~F~~~lvn 231 (284)
++|-+.+|+|+.+.++|.|.++.+..+..+.|+++-.+.+.|.|++ ..++||++++.|.|.+... +.+++.|..++..
T Consensus 3 ~~Ly~~~~~~L~i~~~g~V~gt~~~~~~~s~~~i~~~~~g~V~i~~~~s~~YLcmn~~G~ly~~~~-~~~~C~F~e~~~~ 81 (122)
T PF00167_consen 3 VQLYCRTGYFLQINPNGTVDGTGDDNSPYSVFEIHSVGFGVVRIRGVKSCRYLCMNKCGRLYGSKN-FNKDCVFREELLE 81 (122)
T ss_dssp EEEEETTSEEEEEETTSBEEEESSTTSTTGEEEEEEEETTEEEEEETTTTEEEEEBTTSBEEEESS-BTGGGEEEEEEET
T ss_pred EEEEECCCeEEEECCCCeEeCCCCcCcceeEEEEEeccceEEEEEEecceEEEEECCCCeEccccc-cCCCceEEEEEcc
Confidence 5666777999999999999999887788999999988779999997 5799999999999999754 5569999988776
Q ss_pred CCeEEEeec-C--C-eeeeecC
Q psy16240 232 RPILVLKCE-Q--G-FVGFKTG 249 (284)
Q Consensus 232 rp~lvL~g~-~--g-fV~~~~~ 249 (284)
.-...+... + + ||+-.+.
T Consensus 82 n~y~~~~s~~~~~~~yla~~~~ 103 (122)
T PF00167_consen 82 NGYNTYESAKYGRGWYLAFNRR 103 (122)
T ss_dssp TSEEEEEESTTGTTEBCEBCTT
T ss_pred CCEEEEEeccCCccEEEEECCC
Confidence 666677664 3 3 7887766
No 13
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=98.07 E-value=0.00012 Score=59.36 Aligned_cols=103 Identities=10% Similarity=0.049 Sum_probs=85.2
Q ss_pred EEEECCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEecCCCCCCCceEEEEEeC
Q psy16240 112 KFKIYSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQASGDKKSSNALYDLVWQG 190 (284)
Q Consensus 112 ~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~ 190 (284)
.|-..+|.||.+.++|.+.+..+..++...|++... ..|.+.||+ .+++|++++..|.|-++. .....|.|.-++.+
T Consensus 2 qLy~~~~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~-~~g~v~i~~v~s~~YLCmn~~G~ly~s~-~~~~dC~F~E~~~~ 79 (123)
T cd00058 2 QLYCRTGFHLQILPDGTVDGTRDDSSSYTILERIAV-AVGVVSIKGVASCRYLCMNKCGKLYGSK-GFTEECLFREELLE 79 (123)
T ss_pred eEEEcCCeEEEEcCCCcEecccCCCCCCceEEEEEC-CCCEEEEEEcccceEEEECCCCCEEECC-CCCCCCEEEEEEcc
Confidence 344556999999999999999988778999999865 578999996 799999999999998876 57889999877777
Q ss_pred CCeEEEEeC---CCCeEEEccCceEEEcc
Q psy16240 191 DGSVALRAN---NGKFIATKRSGHLYANS 216 (284)
Q Consensus 191 ~g~v~lra~---nGkyl~~~~~G~L~A~~ 216 (284)
.+-..+.+. .+-||..+.+|...-..
T Consensus 80 n~Y~~y~S~~~~~~~ylal~~~G~~rr~~ 108 (123)
T cd00058 80 NNYNTYASAKYRRRWYLALNKKGRPRRGQ 108 (123)
T ss_pred CCcEEEEEcccCCCcEEEECCCCCCcccc
Confidence 677777763 47899999999866543
No 14
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=98.06 E-value=9.3e-05 Score=61.35 Aligned_cols=99 Identities=17% Similarity=0.224 Sum_probs=66.9
Q ss_pred CCeEEEEECCCCeEEeecCc-eEEeecCCCCCCceEEEEEeCC-CCeEEEEEcCceeEEEeCC-------CcE--EecCC
Q psy16240 108 DKPSKFKIYSVPLSFLSSGV-DVTANQEEISDHETFQLEFDPS-TKRWYIRTMQDRYWTLETG-------GGI--QASGD 176 (284)
Q Consensus 108 ~~~V~Lra~nGkyvs~~~~g-~l~A~~~~~~~~e~F~l~~d~~-~g~~~Lrs~nGkY~~v~~~-------G~v--~a~~~ 176 (284)
..-|+||+..|+||.+..++ .++-.....+-....++++..+ ...+-||++.|+|++.... |.. +....
T Consensus 7 ~~~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~~~~~v~L~saYGrYL~as~~~~~lG~~G~~v~Q~~~~ 86 (142)
T PF04601_consen 7 GKHVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPGSPNYVRLRSAYGRYLAASDEPALLGHTGRRVVQTDPD 86 (142)
T ss_pred CCEEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecCCCCEEEEeeccCceEeccCCcCCCCCCCCEEEecCCc
Confidence 45699999999999999776 4666666666677888876533 3577888999999998643 222 22222
Q ss_pred CCCCCceEEEEEeCCCeEEEEeCCCCeEEEc
Q psy16240 177 KKSSNALYDLVWQGDGSVALRANNGKFIATK 207 (284)
Q Consensus 177 ~~~~~~~F~le~~~~g~v~lra~nGkyl~~~ 207 (284)
.......++... +...+.||..+|+||.+.
T Consensus 87 ~~d~~~~Wepvr-~g~~V~Lr~~~gr~LRAN 116 (142)
T PF04601_consen 87 RLDSSVEWEPVR-DGFYVKLRHRSGRYLRAN 116 (142)
T ss_pred cCCCCceEEEec-CCCEEEEEecCCceEEcC
Confidence 222222222222 336799999999999864
No 15
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=98.00 E-value=0.0003 Score=57.32 Aligned_cols=105 Identities=10% Similarity=0.110 Sum_probs=85.7
Q ss_pred eEEEEECCCCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEecCCCCCCCceEEEEE
Q psy16240 110 PSKFKIYSVPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQASGDKKSSNALYDLVW 188 (284)
Q Consensus 110 ~V~Lra~nGkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~ 188 (284)
++.|-..+|.||.+.++|.+.+..++.....+|++.-. ..|.+.||+ .+++|++++..|.|-++.. ....|.|.=++
T Consensus 4 ~~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~-~~g~V~ik~~~s~~YLCmn~~G~ly~s~~-~~~dC~F~E~~ 81 (126)
T smart00442 4 LRQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAV-AVGVVAIKGVASCRYLCMNKCGKLYGSKN-FTEDCVFREEM 81 (126)
T ss_pred EEEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEec-cCCEEEEEEcccceEEEECCCCCEEEccc-CCCCcEEEEEe
Confidence 46677788899999999999999888888999999864 468999996 8999999999999988665 66889997777
Q ss_pred eCCCeEEEEe-CCC--CeEEEccCceEEEcc
Q psy16240 189 QGDGSVALRA-NNG--KFIATKRSGHLYANS 216 (284)
Q Consensus 189 ~~~g~v~lra-~nG--kyl~~~~~G~L~A~~ 216 (284)
.+.+-..+.+ .+. -||..+.+|...-..
T Consensus 82 ~~n~y~~y~S~~~~~~~ylal~~~G~~r~~~ 112 (126)
T smart00442 82 EENGYNTYASAKYRKRWYVALNKKGRPRRGQ 112 (126)
T ss_pred ccCCeEEEEEcccCCceEEEECCCCCEeEcc
Confidence 6656666665 333 699999999977654
No 16
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=97.84 E-value=0.00033 Score=56.82 Aligned_cols=94 Identities=12% Similarity=0.275 Sum_probs=75.8
Q ss_pred EEEcCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEEEEe-CCCCeEEEccCceEEEcccCCCCCceEEEEEecCC
Q psy16240 155 IRTMQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVALRA-NNGKFIATKRSGHLYANSDSADDSCRYYFYLINRP 233 (284)
Q Consensus 155 Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~L~A~~~~~~~~e~F~~~lvnrp 233 (284)
|=..+|.|+.+.+||.|..+.+...+.+.|+++-.+.|.|.||+ ..++||++++.|.|.+.. .+.+++.|.-++...-
T Consensus 3 Ly~~~~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~-~~~~dC~F~E~~~~n~ 81 (123)
T cd00058 3 LYCRTGFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSK-GFTEECLFREELLENN 81 (123)
T ss_pred EEEcCCeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECC-CCCCCCEEEEEEccCC
Confidence 33445999999999999998887667899999987789999998 599999999999999987 7889999998877555
Q ss_pred eEEEeec-C--C-eeeeecC
Q psy16240 234 ILVLKCE-Q--G-FVGFKTG 249 (284)
Q Consensus 234 ~lvL~g~-~--g-fV~~~~~ 249 (284)
.-+.... | + ||+-...
T Consensus 82 Y~~y~S~~~~~~~ylal~~~ 101 (123)
T cd00058 82 YNTYASAKYRRRWYLALNKK 101 (123)
T ss_pred cEEEEEcccCCCcEEEECCC
Confidence 5555553 3 3 6776554
No 17
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=97.74 E-value=0.00076 Score=54.97 Aligned_cols=86 Identities=13% Similarity=0.285 Sum_probs=70.0
Q ss_pred EEEEEcCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEEEEe-CCCCeEEEccCceEEEcccCCCCCceEEEEEec
Q psy16240 153 WYIRTMQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVALRA-NNGKFIATKRSGHLYANSDSADDSCRYYFYLIN 231 (284)
Q Consensus 153 ~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~L~A~~~~~~~~e~F~~~lvn 231 (284)
.+|=..+|.|+.+.+||.|..+.+...+...|++.-.+.|.|.||+ ..++||++++.|.|.+... +.+++.|.-++..
T Consensus 5 ~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~-~~~dC~F~E~~~~ 83 (126)
T smart00442 5 RQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKN-FTEDCVFREEMEE 83 (126)
T ss_pred EEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEccc-CCCCcEEEEEecc
Confidence 5566777899999999999988877677888999877779999997 6899999999999999766 6789999888764
Q ss_pred CCeEEEee
Q psy16240 232 RPILVLKC 239 (284)
Q Consensus 232 rp~lvL~g 239 (284)
.-.-+...
T Consensus 84 n~y~~y~S 91 (126)
T smart00442 84 NGYNTYAS 91 (126)
T ss_pred CCeEEEEE
Confidence 44444444
No 18
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=97.55 E-value=0.0025 Score=52.89 Aligned_cols=100 Identities=10% Similarity=0.048 Sum_probs=62.2
Q ss_pred CCCeEEEEECCCceEEEecCCceeeeccccccCCcceeeeeeecccCcCeeEEEEEc---CCeEEEEECCCCeEEeecCc
Q psy16240 51 STKRWYIRTMQDRYWTLETGGGMIEEESRLFSNPSKVRTVLQYLEVKIQYFYGLYCT---DKPSKFKIYSVPLSFLSSGV 127 (284)
Q Consensus 51 ~~~r~~LRt~~gky~tl~~~gg~~a~~g~~~s~~~~~~~~~~~~~~~~~E~F~l~~~---~~~V~Lra~nGkyvs~~~~g 127 (284)
+.+-+.||+..|+|+.+..++--|.-. .......+.+.+|.- .+.|.|+++.||||++....
T Consensus 6 d~~~VRLRS~~~kYL~ADeDg~~Vs~~---------------~~~~s~na~W~Ve~v~~~~~~v~L~saYGrYL~as~~~ 70 (142)
T PF04601_consen 6 DGKHVRLRSHHGKYLHADEDGEGVSQD---------------RRGASLNAAWTVERVPGSPNYVRLRSAYGRYLAASDEP 70 (142)
T ss_pred CCCEEEEEecCCCEEEEcCCCCeEEEC---------------CCCCCCcceEEEEEecCCCCEEEEeeccCceEeccCCc
Confidence 567899999999999998775444311 122234566665543 45799999999999987532
Q ss_pred eE------EeecCCC-CCCceEEEEEeCCCCeEEEEEcCceeEEE
Q psy16240 128 DV------TANQEEI-SDHETFQLEFDPSTKRWYIRTMQDRYWTL 165 (284)
Q Consensus 128 ~l------~A~~~~~-~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v 165 (284)
.+ .+.+... ....-+..+...+...+.||+.+|+|+..
T Consensus 71 ~~lG~~G~~v~Q~~~~~~d~~~~Wepvr~g~~V~Lr~~~gr~LRA 115 (142)
T PF04601_consen 71 ALLGHTGRRVVQTDPDRLDSSVEWEPVRDGFYVKLRHRSGRYLRA 115 (142)
T ss_pred CCCCCCCCEEEecCCccCCCCceEEEecCCCEEEEEecCCceEEc
Confidence 11 1111111 11122233444456678899999999874
No 19
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.40 E-value=0.005 Score=50.38 Aligned_cols=97 Identities=14% Similarity=0.118 Sum_probs=73.7
Q ss_pred eEEEEEcCceeEEEe---CCCcEEecCCCCC-CCceEEEEEeCCCeEEEEe-CCCCeEEEccCceEEEcccCCC----CC
Q psy16240 152 RWYIRTMQDRYWTLE---TGGGIQASGDKKS-SNALYDLVWQGDGSVALRA-NNGKFIATKRSGHLYANSDSAD----DS 222 (284)
Q Consensus 152 ~~~Lrs~nGkY~~v~---~~G~v~a~~~~~~-~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~L~A~~~~~~----~~ 222 (284)
.++||+.||||+... ..+-|+..++.+. +...|+++...+|.|.||. .+|||.... .--+.|.++.++ ..
T Consensus 7 ~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~dg~V~ik~~~~nKfWr~s-~~WI~a~s~d~~e~~ssc 85 (139)
T smart00791 7 YVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTYNGLVHIKSNYTNKFWRLS-HYWITADANDPDENKSAC 85 (139)
T ss_pred EEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcCCCcEEEEecCCCceEccC-CCEEEecCCCCccCCCcc
Confidence 578999999999864 3456888887765 6778999987779999997 799999876 345777777663 56
Q ss_pred ceEEEEEecCCeEEEe--ecCCeeeeecC
Q psy16240 223 CRYYFYLINRPILVLK--CEQGFVGFKTG 249 (284)
Q Consensus 223 e~F~~~lvnrp~lvL~--g~~gfV~~~~~ 249 (284)
++|.-..+..-.+.|+ |+..|+..-..
T Consensus 86 TLF~Pv~~d~~~i~lr~vq~~~~~~r~t~ 114 (139)
T smart00791 86 TLFRPLYVEMKKIRLLNVQLGHYTKRYTV 114 (139)
T ss_pred cEEeEEeccCceEEEEEecCCceEEeecC
Confidence 7998877777778887 45567665544
No 20
>KOG3885|consensus
Probab=97.30 E-value=0.0021 Score=54.05 Aligned_cols=85 Identities=13% Similarity=0.377 Sum_probs=68.3
Q ss_pred EEEEEcC-ceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEEEEe-CCCCeEEEccCceEEEcccCCCCCceEEEEEe
Q psy16240 153 WYIRTMQ-DRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVALRA-NNGKFIATKRSGHLYANSDSADDSCRYYFYLI 230 (284)
Q Consensus 153 ~~Lrs~n-GkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~L~A~~~~~~~~e~F~~~lv 230 (284)
.+|=+.+ |.++.+.++|.|..+++...+...++|.-.+-|.|+||. ..++||++++.|.|.+. ....+++.|.-++.
T Consensus 28 ~~LY~~t~g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~LygS-~~~t~eC~F~E~~~ 106 (155)
T KOG3885|consen 28 KLLYCRNGGHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLYAS-KEFTEECKFKELVL 106 (155)
T ss_pred eEEEEcCCCEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEecC-CCCCccceeEEEee
Confidence 3444444 999999999999999887677778888877779999997 68999999999999999 66678899988877
Q ss_pred cCCeEEEe
Q psy16240 231 NRPILVLK 238 (284)
Q Consensus 231 nrp~lvL~ 238 (284)
..-.-+..
T Consensus 107 EN~YntY~ 114 (155)
T KOG3885|consen 107 ENYYNTYA 114 (155)
T ss_pred cCCchhee
Confidence 54333333
No 21
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=96.87 E-value=0.029 Score=46.95 Aligned_cols=115 Identities=18% Similarity=0.278 Sum_probs=70.6
Q ss_pred EEEEEcCceeEEEeC-----CCcEEecCCCCC-CCceEEEEE--eCCCeEEEEe-CCCCeEEEccCc--eEEEcccCCC-
Q psy16240 153 WYIRTMQDRYWTLET-----GGGIQASGDKKS-SNALYDLVW--QGDGSVALRA-NNGKFIATKRSG--HLYANSDSAD- 220 (284)
Q Consensus 153 ~~Lrs~nGkY~~v~~-----~G~v~a~~~~~~-~~~~F~le~--~~~g~v~lra-~nGkyl~~~~~G--~L~A~~~~~~- 220 (284)
+.++..||||++... .+-|+.+++.+. +...|+++. ..+|.|+||. .+|||......+ -+.|.++.+.
T Consensus 8 V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~~~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada~~p~e 87 (153)
T PF07468_consen 8 VAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSKTHDGLVHIRCCYNNKFWRRSSPNDYWIWADADDPDE 87 (153)
T ss_dssp EEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-SSTTT-EEEEETTTTEEEEESCCC--BEEEEESSHHH
T ss_pred EEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEcccCCCeEEEEeccCCceeEeCCCCCcEEEecCCCccc
Confidence 456678999998754 244888777654 788999998 5569999998 799999975444 5788766543
Q ss_pred -----CCceEEEEEec---CCeEEEee-cCC-eeeeecCCCCCCceeec--------ceeceeEEEE
Q psy16240 221 -----DSCRYYFYLIN---RPILVLKC-EQG-FVGFKTGANNSLRLECN--------KANYETIQVE 269 (284)
Q Consensus 221 -----~~e~F~~~lvn---rp~lvL~g-~~g-fV~~~~~~~~~~~l~~n--------r~~~~~~~l~ 269 (284)
..++|.-.-+. .-.|.|+- .++ |+..-..+ .....|- +..+|+|++.
T Consensus 88 d~s~~~cTLF~Pv~vd~~~~~~i~l~~~~n~~~~~r~t~~--~~~~sCL~A~~~~i~~~~~~~~~~i 152 (153)
T PF07468_consen 88 DQSKPSCTLFEPVKVDVKDFNVIALRNMQNGHFCKRLTYG--GKFVSCLNAASSSIDREAKDVFTII 152 (153)
T ss_dssp -TCSTCGG-EEEEESCCCETTEEEEEETTTTEEEEEE--S--TTBSSEEEEEESS--CCGTT-EEEE
T ss_pred ccCCCCceEEEEEEecCCCccEEEEEecCCceEEEEEccC--CcceeeEeeccCcccccccceEEEc
Confidence 56789876553 24677775 344 43332221 1224453 4456677664
No 22
>KOG3885|consensus
Probab=96.80 E-value=0.017 Score=48.63 Aligned_cols=102 Identities=13% Similarity=0.128 Sum_probs=79.6
Q ss_pred EEEEECC-CCeEEeecCceEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEecCCCCCCCceEEEEE
Q psy16240 111 SKFKIYS-VPLSFLSSGVDVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQASGDKKSSNALYDLVW 188 (284)
Q Consensus 111 V~Lra~n-Gkyvs~~~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~ 188 (284)
-.|-..+ |.+|.+-++|.+.+.+++-++..+++++-+ .-|.++||+ .+++|++.+.+|.|-++ ......|.|.=..
T Consensus 28 ~~LY~~t~g~hLqi~p~g~V~Gt~~~~s~~siLei~sv-~~GvV~IkGV~s~~YL~Mn~~G~LygS-~~~t~eC~F~E~~ 105 (155)
T KOG3885|consen 28 KLLYCRNGGHFLRILPDGTVDGTRDRSDQHTIFEIITV-AVGVVAIKGVESELYLAMNKEGKLYAS-KEFTEECKFKELV 105 (155)
T ss_pred eEEEEcCCCEEEEEcCCCccccccccCCCceeEEEEEe-eecEEEEEEeeceeEEEECCCCcEecC-CCCCccceeEEEe
Confidence 3344555 999999999999999988888999999864 568999995 89999999999999998 5577889996555
Q ss_pred eCCCeEEEEeC----CC-CeEEEccCceEEE
Q psy16240 189 QGDGSVALRAN----NG-KFIATKRSGHLYA 214 (284)
Q Consensus 189 ~~~g~v~lra~----nG-kyl~~~~~G~L~A 214 (284)
.+++-.+..+. .+ .||...++|+-.-
T Consensus 106 ~EN~YntY~S~~y~~~~~~yvaL~k~G~pr~ 136 (155)
T KOG3885|consen 106 LENYYNTYASAKYRHNGEWFVALNKKGIPRR 136 (155)
T ss_pred ecCCchheeehhhcccccEEEEECCCCCCCC
Confidence 55465555541 23 7888888877443
No 23
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=95.68 E-value=0.26 Score=38.00 Aligned_cols=68 Identities=22% Similarity=0.341 Sum_probs=49.6
Q ss_pred ceEEEEEe-CCCCeEEEEE-cCceeEEEeCC----Cc-EEecCCCCCCCceEEEEEeCCCeEEEEe-CCCCeEEEc
Q psy16240 140 ETFQLEFD-PSTKRWYIRT-MQDRYWTLETG----GG-IQASGDKKSSNALYDLVWQGDGSVALRA-NNGKFIATK 207 (284)
Q Consensus 140 e~F~l~~d-~~~g~~~Lrs-~nGkY~~v~~~----G~-v~a~~~~~~~~~~F~le~~~~g~v~lra-~nGkyl~~~ 207 (284)
..+++... ..+|.|.|+. .+|+|+.+... |. |+.......+...|.|+..++|.+.|+. .+|++|-+.
T Consensus 3 Q~W~~~~~~~~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~ 78 (105)
T PF14200_consen 3 QQWTFTPVGDSDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVA 78 (105)
T ss_dssp GEEEEEEEETTTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEG
T ss_pred CEEEEEEecCCCCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEEC
Confidence 45677652 2478999997 89999999753 22 4433222357889999998889899997 458999885
No 24
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=94.75 E-value=0.47 Score=36.56 Aligned_cols=66 Identities=14% Similarity=0.096 Sum_probs=49.0
Q ss_pred eeEEEEE---cCCeEEEEE-CCCCeEEeecCc-----eEEeecCCCCCCceEEEEEeCCCCeEEEEE-cCceeEEEe
Q psy16240 100 YFYGLYC---TDKPSKFKI-YSVPLSFLSSGV-----DVTANQEEISDHETFQLEFDPSTKRWYIRT-MQDRYWTLE 166 (284)
Q Consensus 100 E~F~l~~---~~~~V~Lra-~nGkyvs~~~~g-----~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~ 166 (284)
..|.+.. ..+...|++ .+|+||.+..+. .|..-.....+...|.++. ..+|.|.|+. .+|++|.+.
T Consensus 3 Q~W~~~~~~~~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~-~~~g~y~I~n~~s~~~Ldv~ 78 (105)
T PF14200_consen 3 QQWTFTPVGDSDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEP-VGDGYYRIRNKNSGKVLDVA 78 (105)
T ss_dssp GEEEEEEEETTTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEE-STTSEEEEEETSTTEEEEEG
T ss_pred CEEEEEEecCCCCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEE-ecCCeEEEEECCCCcEEEEC
Confidence 3455444 257899997 799999997543 3554444457889999996 4788999997 679999985
No 25
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=92.69 E-value=2.3 Score=35.36 Aligned_cols=103 Identities=14% Similarity=0.157 Sum_probs=60.7
Q ss_pred EEEEEC--CCCeEEeecCc-eEEeecCCCC----CCceEEEEEeC-CCCeEEEEEcC--ceeEEEeCCCcEEecCCCCC-
Q psy16240 111 SKFKIY--SVPLSFLSSGV-DVTANQEEIS----DHETFQLEFDP-STKRWYIRTMQ--DRYWTLETGGGIQASGDKKS- 179 (284)
Q Consensus 111 V~Lra~--nGkyvs~~~~g-~l~A~~~~~~----~~e~F~l~~d~-~~g~~~Lrs~n--GkY~~v~~~G~v~a~~~~~~- 179 (284)
+++++. .++||.-.... .+.+.....+ ...+|.+..-- +.+.+.|.+.+ |.|+... ++.|........
T Consensus 3 ~~~~s~~~~~ryirh~~~~~~~~~v~~~s~~~~r~da~f~vvpGLa~~~~vSfES~~~PG~yLrh~-~~~v~l~~~d~s~ 81 (142)
T PF05270_consen 3 LRLTSPNYPDRYIRHRGSLVRLDPVSSSSSALDRADATFRVVPGLADSSCVSFESVNYPGYYLRHS-NFRVRLEKNDGSA 81 (142)
T ss_dssp EEEEESSSTTEEEEEETTEEEEEES-SSGGHHHHHGG-EEEEE-SS-TTCEEEEESSSTTEEEEEE-TTEEEEEE--SSH
T ss_pred EEEECCCCCCeEEEEcCceEEEeeccCCcchhhccCceEEEEEccCCCCEEEEEECCCCCcEEEEE-CCEEEEeecCCCc
Confidence 456663 58888765432 2222222221 15789997521 24579999977 9999974 677765443322
Q ss_pred ---CCceEEEEEe--CCCeEEEEeCC--CCeEEEccCceEEEc
Q psy16240 180 ---SNALYDLVWQ--GDGSVALRANN--GKFIATKRSGHLYAN 215 (284)
Q Consensus 180 ---~~~~F~le~~--~~g~v~lra~n--Gkyl~~~~~G~L~A~ 215 (284)
...+|..+.- +.|.+.|++.| |+||+-.. +.|.+.
T Consensus 82 ~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~~-~~l~l~ 123 (142)
T PF05270_consen 82 LFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHYN-GELYLA 123 (142)
T ss_dssp HHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEET-TEEEEE
T ss_pred cccCCceEEEecCCCCCCcceEEEecCCCeEEEEEC-CEEEEe
Confidence 3456877753 45778999865 89999655 444444
No 26
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=91.37 E-value=5.2 Score=33.24 Aligned_cols=75 Identities=12% Similarity=0.047 Sum_probs=49.0
Q ss_pred CcCeeEEEEEc---CCeEEEEECC--CCeEEeecCceEEeecCCCC----CCceEEEEEe-CCCCeEEEEE--cCceeEE
Q psy16240 97 KIQYFYGLYCT---DKPSKFKIYS--VPLSFLSSGVDVTANQEEIS----DHETFQLEFD-PSTKRWYIRT--MQDRYWT 164 (284)
Q Consensus 97 ~~~E~F~l~~~---~~~V~Lra~n--Gkyvs~~~~g~l~A~~~~~~----~~e~F~l~~d-~~~g~~~Lrs--~nGkY~~ 164 (284)
..+..|.|..+ ..-|.|.+.| |.||.. .++.|.......+ ...+|..+.. +..|..+|++ .-|+|+.
T Consensus 36 r~da~f~vvpGLa~~~~vSfES~~~PG~yLrh-~~~~v~l~~~d~s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylr 114 (142)
T PF05270_consen 36 RADATFRVVPGLADSSCVSFESVNYPGYYLRH-SNFRVRLEKNDGSALFREDATFCPRPGLAGPGYVSFESYNYPGRYLR 114 (142)
T ss_dssp HHGG-EEEEE-SS-TTCEEEEESSSTTEEEEE-ETTEEEEEE--SSHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEE
T ss_pred ccCceEEEEEccCCCCEEEEEECCCCCcEEEE-ECCEEEEeecCCCccccCCceEEEecCCCCCCcceEEEecCCCeEEE
Confidence 35788999888 3569999977 999987 5555555544333 3688998864 3457779997 5689999
Q ss_pred EeCCCcEEe
Q psy16240 165 LETGGGIQA 173 (284)
Q Consensus 165 v~~~G~v~a 173 (284)
... +.|..
T Consensus 115 h~~-~~l~l 122 (142)
T PF05270_consen 115 HYN-GELYL 122 (142)
T ss_dssp EET-TEEEE
T ss_pred EEC-CEEEE
Confidence 764 44443
No 27
>KOG0276|consensus
Probab=85.37 E-value=12 Score=38.40 Aligned_cols=131 Identities=13% Similarity=0.135 Sum_probs=74.3
Q ss_pred cccCCCCceeEEEEEeCCCCeEEE------EECCCceEEEecCCceeeeccccccCCcceeeeeeecccCcCeeEEEEEc
Q psy16240 34 NQEEISDHETFQLEFDPSTKRWYI------RTMQDRYWTLETGGGMIEEESRLFSNPSKVRTVLQYLEVKIQYFYGLYCT 107 (284)
Q Consensus 34 n~~e~~~~etFqLe~d~~~~r~~L------Rt~~gky~tl~~~gg~~a~~g~~~s~~~~~~~~~~~~~~~~~E~F~l~~~ 107 (284)
.+.|+.|.|...|-.- +-+.+-| .+.||||+.+-++ |+|. ..+.+.||...-|..-.|.--..
T Consensus 330 ~~~ev~DgErL~LsvK-eLgs~eiyPq~L~hsPNGrfV~Vcgd-------GEyi---IyTala~RnK~fG~~~eFvw~~d 398 (794)
T KOG0276|consen 330 AQKEVTDGERLPLSVK-ELGSVEIYPQTLAHSPNGRFVVVCGD-------GEYI---IYTALALRNKAFGSGLEFVWAAD 398 (794)
T ss_pred CcccccCCccccchhh-hccccccchHHhccCCCCcEEEEecC-------ccEE---EEEeeehhhcccccceeEEEcCC
Confidence 3558889999888652 2233333 2579999987544 3333 23445667666666555553333
Q ss_pred CCeEEEEECCCCeEEeecCceEEeecCC-----CCC-------CceEEEEEeCCCCeEEEE---EcCceeEEEeCCCcEE
Q psy16240 108 DKPSKFKIYSVPLSFLSSGVDVTANQEE-----ISD-------HETFQLEFDPSTKRWYIR---TMQDRYWTLETGGGIQ 172 (284)
Q Consensus 108 ~~~V~Lra~nGkyvs~~~~g~l~A~~~~-----~~~-------~e~F~l~~d~~~g~~~Lr---s~nGkY~~v~~~G~v~ 172 (284)
.+..++|-+|+.-+.-+...+..--+.. +-. ..-|.-.+||+++.+.=| +..+-||+ .+|.++
T Consensus 399 sne~avRes~~~vki~knfke~ksi~~~~~~e~i~gg~Llg~~ss~~~~fydW~~~~lVrrI~v~~k~v~w~--d~g~lV 476 (794)
T KOG0276|consen 399 SNEFAVRESNGNVKIFKNFKEHKSIRPDMSAEGIFGGPLLGVRSSDFLCFYDWESGELVRRIEVTSKHVYWS--DNGELV 476 (794)
T ss_pred CCeEEEEecCCceEEEecceeccccccccceeeecCCceEEEEeCCeEEEEEcccceEEEEEeeccceeEEe--cCCCEE
Confidence 4566677666666655432222111111 000 233555668888877544 47788997 677887
Q ss_pred ecCCC
Q psy16240 173 ASGDK 177 (284)
Q Consensus 173 a~~~~ 177 (284)
+.+..
T Consensus 477 ai~~d 481 (794)
T KOG0276|consen 477 AIAGD 481 (794)
T ss_pred EEEec
Confidence 76553
No 28
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=81.06 E-value=23 Score=29.64 Aligned_cols=100 Identities=8% Similarity=0.040 Sum_probs=59.4
Q ss_pred EEEEE-CCCCeEEee--cCce-E---EeecC----CCCCCceEEEEEeCCCCeEEEEE-cCceeEEEeCCCcEEec-CCC
Q psy16240 111 SKFKI-YSVPLSFLS--SGVD-V---TANQE----EISDHETFQLEFDPSTKRWYIRT-MQDRYWTLETGGGIQAS-GDK 177 (284)
Q Consensus 111 V~Lra-~nGkyvs~~--~~g~-l---~A~~~----~~~~~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~~G~v~a~-~~~ 177 (284)
|+||+ .+|--|++- ..+. + -+-.+ ..+..-. ++......|.++|+. .++..+.+..+|.+... -+.
T Consensus 2 vsIrsl~tG~~Lt~~~~~~~~~~~~nW~~~~~~s~~~~~~r~-~~~~~~~~g~Vqf~n~~~~~CL~~~~~G~~~~~~C~~ 80 (150)
T PF03498_consen 2 VSIRSLQTGAVLTNWALDPGNWLDQNWGYSEIDSNDFGKARD-RLIEFFPFGYVQFVNPKTGTCLAAYGNGVFHYKSCDQ 80 (150)
T ss_dssp EEEEETTTSEEEEESSSSTTSCCGG-EEEECGGTGCCGCCGT-EEEE-STTCEEEEEETTTSEEEEEETTCEEEE--TTT
T ss_pred EEEEcCCCCcEEEEeecCCCCeeEeeeEeEeecCchHhhhhe-eecccCCCCEEEEEcCCCCcceeecCCCeEeecccCC
Confidence 67777 778777772 1121 2 11111 2223333 565444559999997 67888887778877633 222
Q ss_pred CCCCceEEEEEeCCCeEEEEe-CCCCeEEEccCce
Q psy16240 178 KSSNALYDLVWQGDGSVALRA-NNGKFIATKRSGH 211 (284)
Q Consensus 178 ~~~~~~F~le~~~~g~v~lra-~nGkyl~~~~~G~ 211 (284)
......|.|..-..|.|-||+ .+|+-|++..+..
T Consensus 81 ~~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~~~ 115 (150)
T PF03498_consen 81 DNLEQVFSIIPTTTGAVQIKSLSTGECLQTFNNSR 115 (150)
T ss_dssp CHGHH-EEEEEBTTS-EEEEETTT--EEEE-STTS
T ss_pred CChhceEEEEEcCCCcEEEEecCCCceEEecCCCc
Confidence 235678999998889999998 6788888765543
No 29
>KOG1446|consensus
Probab=80.86 E-value=20 Score=33.48 Aligned_cols=77 Identities=13% Similarity=0.220 Sum_probs=41.7
Q ss_pred CCCCceEEEEEeCCCCeE-EEE-EcCceeEEEeCCCc--EEecCCCCCCCceEEEEEeCCCeEEEE---eCCCCeEEEc-
Q psy16240 136 ISDHETFQLEFDPSTKRW-YIR-TMQDRYWTLETGGG--IQASGDKKSSNALYDLVWQGDGSVALR---ANNGKFIATK- 207 (284)
Q Consensus 136 ~~~~e~F~l~~d~~~g~~-~Lr-s~nGkY~~v~~~G~--v~a~~~~~~~~~~F~le~~~~g~v~lr---a~nGkyl~~~- 207 (284)
.+|-++|.+.. ...-.| .|+ +.+|||+-+..++. .+.+|=.-.....|..+.+. +.+.+- .++|+||-..
T Consensus 174 kgPF~tf~i~~-~~~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~-~~~~~~a~ftPds~Fvl~gs 251 (311)
T KOG1446|consen 174 KGPFTTFSITD-NDEAEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNA-GNLPLSATFTPDSKFVLSGS 251 (311)
T ss_pred CCCceeEccCC-CCccceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCC-CCcceeEEECCCCcEEEEec
Confidence 45667777652 122223 355 67777777654444 22233222245557766655 544444 4788888765
Q ss_pred cCceEEE
Q psy16240 208 RSGHLYA 214 (284)
Q Consensus 208 ~~G~L~A 214 (284)
.+|.|-+
T Consensus 252 ~dg~i~v 258 (311)
T KOG1446|consen 252 DDGTIHV 258 (311)
T ss_pred CCCcEEE
Confidence 4466544
No 30
>COG3377 Uncharacterized conserved protein [Function unknown]
Probab=78.08 E-value=2 Score=32.78 Aligned_cols=25 Identities=32% Similarity=0.376 Sum_probs=21.2
Q ss_pred eEEEEEecCCeEEEeecCCeeeeec
Q psy16240 224 RYYFYLINRPILVLKCEQGFVGFKT 248 (284)
Q Consensus 224 ~F~~~lvnrp~lvL~g~~gfV~~~~ 248 (284)
-|.+++-|-|++++++++||+.|.-
T Consensus 16 ~~~~~L~napll~~~~ekgy~mCGy 40 (95)
T COG3377 16 GLKVDLPNAPLLALKGEKGYAMCGY 40 (95)
T ss_pred EEEecCCCCcEEEEEccccEEEecc
Confidence 4667888899999999999988763
No 31
>KOG1446|consensus
Probab=71.93 E-value=88 Score=29.33 Aligned_cols=43 Identities=26% Similarity=0.460 Sum_probs=22.2
Q ss_pred eeeecceeee--eccEEEEcccCCCCc-eeEEEE-EeCCCCeEEEEE
Q psy16240 17 VSLFPLSFLS--SGVDVTANQEEISDH-ETFQLE-FDPSTKRWYIRT 59 (284)
Q Consensus 17 ~~~~~~~~~~--~g~~v~An~~e~~~~-etFqLe-~d~~~~r~~LRt 59 (284)
+++..+|||| +|=.-..|.-++++. ++|-=- .|....=|-||+
T Consensus 85 Lsl~dNkylRYF~GH~~~V~sL~~sP~~d~FlS~S~D~tvrLWDlR~ 131 (311)
T KOG1446|consen 85 LSLHDNKYLRYFPGHKKRVNSLSVSPKDDTFLSSSLDKTVRLWDLRV 131 (311)
T ss_pred EEeecCceEEEcCCCCceEEEEEecCCCCeEEecccCCeEEeeEecC
Confidence 8999999999 563333333333221 444321 232223455665
No 32
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=71.22 E-value=40 Score=25.13 Aligned_cols=84 Identities=12% Similarity=0.094 Sum_probs=43.2
Q ss_pred CCCeEEeec---CceEEeecCCCC-CCceEEEEEeCCCCeEEEEEcCceeEEEeCC---CcEEec-CCCCCCCceEEEEE
Q psy16240 117 SVPLSFLSS---GVDVTANQEEIS-DHETFQLEFDPSTKRWYIRTMQDRYWTLETG---GGIQAS-GDKKSSNALYDLVW 188 (284)
Q Consensus 117 nGkyvs~~~---~g~l~A~~~~~~-~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~---G~v~a~-~~~~~~~~~F~le~ 188 (284)
.|+.|.+.. +..|.......+ ....|.+.. .|.|.++. .++++.+... ..|+.. -........|.++.
T Consensus 9 ~~~cL~~~~~~~~~~v~~~~c~~~~~~Q~W~~~~---~g~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~~ 84 (124)
T cd00161 9 TGLCLDVNGGSDGGPVQLYPCHGNGNNQKWTLTS---DGTIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNK 84 (124)
T ss_pred CCeEEECCCCCCCCEEEEEECCCCCccCCEEEeC---CCeEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEECC
Confidence 577777665 244444444333 566777762 67777776 6777776543 233321 11113455666642
Q ss_pred eCCCeEEEEeCCCCeEEEc
Q psy16240 189 QGDGSVALRANNGKFIATK 207 (284)
Q Consensus 189 ~~~g~v~lra~nGkyl~~~ 207 (284)
. +++..++ .++-|.+.
T Consensus 85 ~--~~i~~~~-~~~cl~~~ 100 (124)
T cd00161 85 D--GTIRNLK-SGKCLDVK 100 (124)
T ss_pred C--cEEEECC-CCeEEeCC
Confidence 2 4443333 44555443
No 33
>PF14913 DPCD: DPCD protein family
Probab=69.37 E-value=78 Score=27.67 Aligned_cols=78 Identities=6% Similarity=0.040 Sum_probs=49.6
Q ss_pred eeeee-ecccCcCeeEEEEEcCCeEEEEECCCCeEEeec-CceEEeecCC-----------CCCCceEEEEEeCCCCeEE
Q psy16240 88 RTVLQ-YLEVKIQYFYGLYCTDKPSKFKIYSVPLSFLSS-GVDVTANQEE-----------ISDHETFQLEFDPSTKRWY 154 (284)
Q Consensus 88 ~~~~~-~~~~~~~E~F~l~~~~~~V~Lra~nGkyvs~~~-~g~l~A~~~~-----------~~~~e~F~l~~d~~~g~~~ 154 (284)
++.|| ..+.|....|.++.+.+.......-+..+-..+ ...+...++. +-+.++|.+.++.+...+.
T Consensus 45 ~RkwR~k~~lG~~~~We~EvGep~~~~~~~~~~~~i~ESs~nP~~~r~dTk~~fqWRIRNLPYP~dvYsVtvd~~~r~iv 124 (194)
T PF14913_consen 45 VRKWRRKSTLGGEGQWEIEVGEPEPTQNNNLESELIKESSSNPIFVRRDTKTSFQWRIRNLPYPKDVYSVTVDEDERCIV 124 (194)
T ss_pred EEeeccccccCCCCCcEEEeCCCCccccCCCCcceeeecCCCCEEEEEcCccceEEEEccCCCCccceEEEEcCCCcEEE
Confidence 34466 467888888999998755443322123322222 2223333332 4567999999887777899
Q ss_pred EEEcCceeEEE
Q psy16240 155 IRTMQDRYWTL 165 (284)
Q Consensus 155 Lrs~nGkY~~v 165 (284)
+||.|.||-..
T Consensus 125 vRTtNKKYyKk 135 (194)
T PF14913_consen 125 VRTTNKKYYKK 135 (194)
T ss_pred EECcCccceeE
Confidence 99999999874
No 34
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=63.89 E-value=56 Score=24.69 Aligned_cols=93 Identities=15% Similarity=0.097 Sum_probs=49.1
Q ss_pred eEEEEECCCCeEEee----cCceEEeecCCCCCCceEEEEEeCCCCeEEEEEcCceeEEEeCCC---cEEe-cCCCCCCC
Q psy16240 110 PSKFKIYSVPLSFLS----SGVDVTANQEEISDHETFQLEFDPSTKRWYIRTMQDRYWTLETGG---GIQA-SGDKKSSN 181 (284)
Q Consensus 110 ~V~Lra~nGkyvs~~----~~g~l~A~~~~~~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G---~v~a-~~~~~~~~ 181 (284)
+|++++.+++.|.+. .+..|........+...|.+. ..+.+.........+.+..+. .|.. .-......
T Consensus 4 ~i~~~~~~~~cl~~~~~~~~~~~v~l~~c~~~~~Q~w~~~---~~~~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~~~ 80 (124)
T PF00652_consen 4 YIRNVNKSGLCLDVQGSTKNGSPVVLYPCDGSDNQLWRFD---PDGQIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNSSN 80 (124)
T ss_dssp EEEEEEGGGEEEEEGGSSSTTTBEEEEE--SSGGGEEEEE---TTSBEEETTETTEEEEESSSSTTEBEEEEETTTTGGG
T ss_pred EEEEeeCCCCeEEEcCCCCCCCEEEEEECCCCCceeEEEc---CCCceeeccCcceEEEeeccCCCceEEEeeccCCccC
Confidence 455655588888887 344555555555566777666 234333333344467666544 2332 22222233
Q ss_pred ceEEEEEeCCCeEEEEeCCCCeEEEcc
Q psy16240 182 ALYDLVWQGDGSVALRANNGKFIATKR 208 (284)
Q Consensus 182 ~~F~le~~~~g~v~lra~nGkyl~~~~ 208 (284)
..|.+.- .+++.-+. +|+.|.+..
T Consensus 81 Q~W~~~~--~~~i~n~~-s~~cL~~~~ 104 (124)
T PF00652_consen 81 QRWKFDP--DGRIRNKN-SGLCLDVKG 104 (124)
T ss_dssp GBEEEET--TSBEEETT-TTEEEEEGG
T ss_pred CeEEEcC--CeeEEeCC-CCEEEEecC
Confidence 6677654 25444444 677777753
No 35
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=51.59 E-value=1.3e+02 Score=25.17 Aligned_cols=59 Identities=7% Similarity=-0.109 Sum_probs=37.7
Q ss_pred CCeEEEEE-CCCCeEEeecCceEEeecCCCCC-CceEEEEEeCCCCeEEEEE-cCceeEEEeC
Q psy16240 108 DKPSKFKI-YSVPLSFLSSGVDVTANQEEISD-HETFQLEFDPSTKRWYIRT-MQDRYWTLET 167 (284)
Q Consensus 108 ~~~V~Lra-~nGkyvs~~~~g~l~A~~~~~~~-~e~F~l~~d~~~g~~~Lrs-~nGkY~~v~~ 167 (284)
.+.|.|+. .++.-+.+-.+|-+....+..+. ...|++..- .+|.++||+ .+|+.+....
T Consensus 51 ~g~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~~~~q~F~iiPt-ttgAVQIks~~~~~Cl~~~~ 112 (150)
T PF03498_consen 51 FGYVQFVNPKTGTCLAAYGNGVFHYKSCDQDNLEQVFSIIPT-TTGAVQIKSLSTGECLQTFN 112 (150)
T ss_dssp TCEEEEEETTTSEEEEEETTCEEEE--TTTCHGHH-EEEEEB-TTS-EEEEETTT--EEEE-S
T ss_pred CCEEEEEcCCCCcceeecCCCeEeecccCCCChhceEEEEEc-CCCcEEEEecCCCceEEecC
Confidence 47899997 66777777667755533244434 577999875 789999998 7888887653
No 36
>KOG0276|consensus
Probab=50.12 E-value=3.2e+02 Score=28.46 Aligned_cols=100 Identities=15% Similarity=0.175 Sum_probs=52.8
Q ss_pred CeEEEEECCCCeEEeecCceEEeecC------CCCCCceEEEEEeCCCCeE-----EEE-EcCceeEEEeCCCcE-EecC
Q psy16240 109 KPSKFKIYSVPLSFLSSGVDVTANQE------EISDHETFQLEFDPSTKRW-----YIR-TMQDRYWTLETGGGI-QASG 175 (284)
Q Consensus 109 ~~V~Lra~nGkyvs~~~~g~l~A~~~------~~~~~e~F~l~~d~~~g~~-----~Lr-s~nGkY~~v~~~G~v-~a~~ 175 (284)
|-|++ ..+||.+=++.+.-..++-. +..+.|...|.. .+=|.+ +|. +.||||+.+-+||-- +-++
T Consensus 302 P~vsM-d~~gKIiwa~~~ei~~~~~ks~~~~~ev~DgErL~Lsv-KeLgs~eiyPq~L~hsPNGrfV~VcgdGEyiIyTa 379 (794)
T KOG0276|consen 302 PAVSM-DSNGKIIWAVHSEIQAVNLKSVGAQKEVTDGERLPLSV-KELGSVEIYPQTLAHSPNGRFVVVCGDGEYIIYTA 379 (794)
T ss_pred Cceee-cCCccEEEEcCceeeeeeceeccCcccccCCccccchh-hhccccccchHHhccCCCCcEEEEecCccEEEEEe
Confidence 43433 36677666655443333322 333444444432 122332 344 689999999888863 2222
Q ss_pred CC---CCCCceEEEEEeCC-CeEEEEeCCCCeEEEccCce
Q psy16240 176 DK---KSSNALYDLVWQGD-GSVALRANNGKFIATKRSGH 211 (284)
Q Consensus 176 ~~---~~~~~~F~le~~~~-g~v~lra~nGkyl~~~~~G~ 211 (284)
-. ..--.--++.|..+ +..++|.+||+-+.- +|+.
T Consensus 380 la~RnK~fG~~~eFvw~~dsne~avRes~~~vki~-knfk 418 (794)
T KOG0276|consen 380 LALRNKAFGSGLEFVWAADSNEFAVRESNGNVKIF-KNFK 418 (794)
T ss_pred eehhhcccccceeEEEcCCCCeEEEEecCCceEEE-ecce
Confidence 11 01111236777644 889999888876555 5544
No 37
>PF14913 DPCD: DPCD protein family
Probab=49.03 E-value=56 Score=28.53 Aligned_cols=63 Identities=14% Similarity=0.261 Sum_probs=40.2
Q ss_pred CCCCCCCCcccee--eeeecceeeeeccEEEEc---ccCCCCceeEEEEEeCCCCeEEEEECCCceEE
Q psy16240 4 PPPSPQENPKYAT--VSLFPLSFLSSGVDVTAN---QEEISDHETFQLEFDPSTKRWYIRTMQDRYWT 66 (284)
Q Consensus 4 ~~~~~~~~~~~~~--~~~~~~~~~~~g~~v~An---~~e~~~~etFqLe~d~~~~r~~LRt~~gky~t 66 (284)
|+|..+.|..... -|--|=-|+|....-+-- |.=+=+-++|++-+|.+.+.+-+||.|.||+-
T Consensus 67 p~~~~~~~~~~~~i~ESs~nP~~~r~dTk~~fqWRIRNLPYP~dvYsVtvd~~~r~ivvRTtNKKYyK 134 (194)
T PF14913_consen 67 PEPTQNNNLESELIKESSSNPIFVRRDTKTSFQWRIRNLPYPKDVYSVTVDEDERCIVVRTTNKKYYK 134 (194)
T ss_pred CCccccCCCCcceeeecCCCCEEEEEcCccceEEEEccCCCCccceEEEEcCCCcEEEEECcCcccee
Confidence 5556655554443 333444555544332211 24455688999999877788999999999984
No 38
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=47.27 E-value=1.2e+02 Score=23.44 Aligned_cols=22 Identities=36% Similarity=0.658 Sum_probs=14.8
Q ss_pred eEEEEEeCCCeEEEEeCCCCeE
Q psy16240 183 LYDLVWQGDGSVALRANNGKFI 204 (284)
Q Consensus 183 ~F~le~~~~g~v~lra~nGkyl 204 (284)
.+.+.++++|.+.|...+|+.|
T Consensus 88 ~~~~~L~ddGnlvl~~~~~~~~ 109 (116)
T cd00028 88 NYVLVLLDDGNLVLYDSDGNFL 109 (116)
T ss_pred ceEEEEeCCCCEEEECCCCCEE
Confidence 3566677778888877666543
No 39
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=43.58 E-value=2.8e+02 Score=25.91 Aligned_cols=127 Identities=16% Similarity=0.219 Sum_probs=57.6
Q ss_pred EECCCCeEEeecCceEEeecC-CCCCCceEEEEEeCCCCeEEEE--EcCceeEEEeCCCcEEecC--CC-----------
Q psy16240 114 KIYSVPLSFLSSGVDVTANQE-EISDHETFQLEFDPSTKRWYIR--TMQDRYWTLETGGGIQASG--DK----------- 177 (284)
Q Consensus 114 ra~nGkyvs~~~~g~l~A~~~-~~~~~e~F~l~~d~~~g~~~Lr--s~nGkY~~v~~~G~v~a~~--~~----------- 177 (284)
++.+|+||.+.+.|.+....+ ....|+.+... ....++-- +.++..|-+...|.|..+. ..
T Consensus 152 r~~dG~~vavs~~G~~~~s~~~G~~~w~~~~r~---~~~riq~~gf~~~~~lw~~~~Gg~~~~s~~~~~~~~w~~~~~~~ 228 (302)
T PF14870_consen 152 RSSDGRYVAVSSRGNFYSSWDPGQTTWQPHNRN---SSRRIQSMGFSPDGNLWMLARGGQIQFSDDPDDGETWSEPIIPI 228 (302)
T ss_dssp E-TTS-EEEEETTSSEEEEE-TT-SS-EEEE-----SSS-EEEEEE-TTS-EEEEETTTEEEEEE-TTEEEEE---B-TT
T ss_pred ECCCCcEEEEECcccEEEEecCCCccceEEccC---ccceehhceecCCCCEEEEeCCcEEEEccCCCCccccccccCCc
Confidence 458999999998887655433 23335555443 12233222 3567777776666666543 10
Q ss_pred -CCCCceEEEEEeCCCeEEEEeCCCCeEEEccCce-EEEc-ccCCCCCceEEEEEecCCeEEEeecCCe
Q psy16240 178 -KSSNALYDLVWQGDGSVALRANNGKFIATKRSGH-LYAN-SDSADDSCRYYFYLINRPILVLKCEQGF 243 (284)
Q Consensus 178 -~~~~~~F~le~~~~g~v~lra~nGkyl~~~~~G~-L~A~-~~~~~~~e~F~~~lvnrp~lvL~g~~gf 243 (284)
......+.+-++.++.+-+-..+|..+....+|. -... ....-++-++.|++++...--+.|+.|.
T Consensus 229 ~~~~~~~ld~a~~~~~~~wa~gg~G~l~~S~DgGktW~~~~~~~~~~~n~~~i~f~~~~~gf~lG~~G~ 297 (302)
T PF14870_consen 229 KTNGYGILDLAYRPPNEIWAVGGSGTLLVSTDGGKTWQKDRVGENVPSNLYRIVFVNPDKGFVLGQDGV 297 (302)
T ss_dssp SS--S-EEEEEESSSS-EEEEESTT-EEEESSTTSS-EE-GGGTTSSS---EEEEEETTEEEEE-STTE
T ss_pred ccCceeeEEEEecCCCCEEEEeCCccEEEeCCCCccceECccccCCCCceEEEEEcCCCceEEECCCcE
Confidence 1112234555655555555566666666666665 1111 1111134466677766444444455554
No 40
>PF05588 Botulinum_HA-17: Clostridium botulinum HA-17 protein; InterPro: IPR008903 This family consists of several Clostridium botulinum haemagglutinin (HA) subcomponents. C. botulinum type D strain 4947 produces two different sizes of progenitor toxins (M and L) as intact forms without proteolytic processing. The M toxin is composed of neurotoxin (NT) and nontoxic-nonhaemagglutinin (NTNHA), whereas the L toxin is composed of the M toxin and haemagglutinin (HA) subcomponents (HA-70, HA-17, and HA-33) [].; PDB: 2E4M_C.
Probab=41.48 E-value=1.6e+02 Score=24.03 Aligned_cols=79 Identities=16% Similarity=0.320 Sum_probs=46.2
Q ss_pred CCCeEEEEE--cCceeEEEeCCCcEEecCCCCCCCceEEEEEeCCCeEEEE----eCCCCeEEEccCceEEEcccCCCCC
Q psy16240 149 STKRWYIRT--MQDRYWTLETGGGIQASGDKKSSNALYDLVWQGDGSVALR----ANNGKFIATKRSGHLYANSDSADDS 222 (284)
Q Consensus 149 ~~g~~~Lrs--~nGkY~~v~~~G~v~a~~~~~~~~~~F~le~~~~g~v~lr----a~nGkyl~~~~~G~L~A~~~~~~~~ 222 (284)
+.|.|.+++ .+.-|++.- .|.|.-...+..-.+.+.++... .+-++| +.-.|||+-+.-|.+.+++. ...
T Consensus 9 p~gny~iks~fs~sLyl~~v-Sg~lSfsn~ss~~nQkWkL~Y~s-~k~ayKisnIa~pNkyLtyN~~~fIsldS~--a~n 84 (146)
T PF05588_consen 9 PNGNYKIKSIFSTSLYLTPV-SGNLSFSNQSSNNNQKWKLEYDS-EKNAYKISNIAEPNKYLTYNAPGFISLDSN--ANN 84 (146)
T ss_dssp -SEEEEEEESS-SSEEEEE--SS-EEEEE--SSGGG-EEEEE-S-TT-SEEEEETTEEEEEEEE-TTSBEEEES---SS-
T ss_pred CCCCeeEeeeecCcEEEEee-cceEEeccccccccceeEEEEee-cCCceEeeeccCccceEEecCCceEEeccC--CCc
Confidence 468899998 677888865 57777666656678889999876 444444 23579999999999887433 334
Q ss_pred ceEE-EEEec
Q psy16240 223 CRYY-FYLIN 231 (284)
Q Consensus 223 e~F~-~~lvn 231 (284)
..|. +++.+
T Consensus 85 ~yW~piKIa~ 94 (146)
T PF05588_consen 85 QYWLPIKIAN 94 (146)
T ss_dssp --EEEEEEET
T ss_pred ceeEEEEEec
Confidence 4444 34333
No 41
>PF01690 PLRV_ORF5: Potato leaf roll virus readthrough protein; InterPro: IPR002929 This family consists mainly of the Potato leafroll virus (PLrV) read through protein otherwise known as the minor capsid protein. This is generated via a readthrough of open reading frame 3, the coat protein, allowing transcription of open reading frame 5 to give an extended coat protein with a large C-terminal addition or read through domain []. The read through protein is essential for the circulative aphid transmission of PLrV [] and Beet western yellows virus []. The N-terminal region of the luteovirus readthrough domain determines virus binding to Buchnera GroEL and is essential for virus persistence in the aphid [].; GO: 0019028 viral capsid
Probab=39.12 E-value=4e+02 Score=26.52 Aligned_cols=40 Identities=8% Similarity=-0.232 Sum_probs=30.5
Q ss_pred cccCcCeeEEEEEcCCeEEEEE-CCCCeEEeecCceEEeec
Q psy16240 94 LEVKIQYFYGLYCTDKPSKFKI-YSVPLSFLSSGVDVTANQ 133 (284)
Q Consensus 94 ~~~~~~E~F~l~~~~~~V~Lra-~nGkyvs~~~~g~l~A~~ 133 (284)
.+|.+.+-|....+.++..|.+ .|-.++++++=+......
T Consensus 25 PtP~~~~RF~~Y~G~p~~~I~tr~n~d~I~v~~l~~q~~~y 65 (465)
T PF01690_consen 25 PTPAKHERFIGYEGVPQTKISTRENDDSISVRSLNSQRMRY 65 (465)
T ss_pred CcccCccceEEEecccceeeeccccccceEeeccCceeEEE
Confidence 3455688999999999999997 899999987655444333
No 42
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=38.01 E-value=1.7e+02 Score=21.77 Aligned_cols=47 Identities=13% Similarity=-0.029 Sum_probs=23.8
Q ss_pred CCCCeEEeecCc-eEEeecCCC-CCCceEEEEEeCCCCeEEEEEcCceeEEEeC
Q psy16240 116 YSVPLSFLSSGV-DVTANQEEI-SDHETFQLEFDPSTKRWYIRTMQDRYWTLET 167 (284)
Q Consensus 116 ~nGkyvs~~~~g-~l~A~~~~~-~~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~ 167 (284)
..++.|.+..++ .+....... +....|.+. ..|.+.++ .++.+.+..
T Consensus 5 ~~~~Cl~~~~~~~~v~l~~c~~~~~~Q~w~~~---~~g~~~~~--~~~Cl~~~~ 53 (117)
T smart00458 5 NTGKCLDVNGNSNPVGLFDCHGTGGNQLWKLT---SDGAIRIA--TDLCLTANG 53 (117)
T ss_pred cCCccEecCCCCceEEEEeCCCCCccceEEEe---CCCeEEec--CCccCccCC
Confidence 345555554432 332222222 445677766 35666666 566665543
No 43
>KOG0649|consensus
Probab=34.63 E-value=3.7e+02 Score=24.80 Aligned_cols=89 Identities=15% Similarity=0.064 Sum_probs=51.4
Q ss_pred CCCCeEEeecCceEEeecCC-----CCCCceEEEEEeCCC---CeEEEEEcCceeEEEeCCCcEEecCCCCCCCceEEEE
Q psy16240 116 YSVPLSFLSSGVDVTANQEE-----ISDHETFQLEFDPST---KRWYIRTMQDRYWTLETGGGIQASGDKKSSNALYDLV 187 (284)
Q Consensus 116 ~nGkyvs~~~~g~l~A~~~~-----~~~~e~F~l~~d~~~---g~~~Lrs~nGkY~~v~~~G~v~a~~~~~~~~~~F~le 187 (284)
.|-+.|--+.+..+.|..|. .-+.-+|++++...+ ..+..|.+++.-++=..||+++....+ ...+.=.||
T Consensus 117 INam~ldP~enSi~~AgGD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~k-t~k~v~~ie 195 (325)
T KOG0649|consen 117 INAMWLDPSENSILFAGGDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTK-TQKHVSMIE 195 (325)
T ss_pred cceeEeccCCCcEEEecCCeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEecc-ccceeEEec
Confidence 44444444444444444321 113446777763112 234566788888887778888775442 234444566
Q ss_pred EeCCCeEEEEeCCCCeEEE
Q psy16240 188 WQGDGSVALRANNGKFIAT 206 (284)
Q Consensus 188 ~~~~g~v~lra~nGkyl~~ 206 (284)
... ..-.+|-.+||++.+
T Consensus 196 ~yk-~~~~lRp~~g~wiga 213 (325)
T KOG0649|consen 196 PYK-NPNLLRPDWGKWIGA 213 (325)
T ss_pred ccc-ChhhcCcccCceeEE
Confidence 555 667888889998774
No 44
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=30.77 E-value=2.5e+02 Score=21.56 Aligned_cols=20 Identities=30% Similarity=0.541 Sum_probs=13.3
Q ss_pred eEEEEEeCCCeEEEEeCCCC
Q psy16240 183 LYDLVWQGDGSVALRANNGK 202 (284)
Q Consensus 183 ~F~le~~~~g~v~lra~nGk 202 (284)
.+.+.++++|.+.|...+|+
T Consensus 87 ~~~~~L~ddGnlvl~~~~~~ 106 (114)
T smart00108 87 NYVLVLLDDGNLVIYDSDGN 106 (114)
T ss_pred ceEEEEeCCCCEEEECCCCC
Confidence 35666677677777766655
No 45
>KOG0973|consensus
Probab=26.61 E-value=5.7e+02 Score=27.89 Aligned_cols=34 Identities=3% Similarity=0.002 Sum_probs=23.4
Q ss_pred CCceEEEEEeCCCCeEEEEEcCceeEEEeCCCcE
Q psy16240 138 DHETFQLEFDPSTKRWYIRTMQDRYWTLETGGGI 171 (284)
Q Consensus 138 ~~e~F~l~~d~~~g~~~Lrs~nGkY~~v~~~G~v 171 (284)
+...++|.-..+...|..-+.||+|+...+|..+
T Consensus 59 ~k~l~~m~~h~~sv~CVR~S~dG~~lAsGSDD~~ 92 (942)
T KOG0973|consen 59 PKHLCTMDDHDGSVNCVRFSPDGSYLASGSDDRL 92 (942)
T ss_pred chhheeeccccCceeEEEECCCCCeEeeccCcce
Confidence 5677888744334444334999999999877665
No 46
>PF11197 DUF2835: Protein of unknown function (DUF2835); InterPro: IPR021363 This is a bacterial family of uncharacterised proteins. One member of this family (A4VM42 from SWISSPROT) is annotated as the A subunit of Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV).
Probab=25.44 E-value=1.6e+02 Score=21.33 Aligned_cols=15 Identities=0% Similarity=-0.274 Sum_probs=10.6
Q ss_pred CeEEEEECCCCeEEe
Q psy16240 109 KPSKFKIYSVPLSFL 123 (284)
Q Consensus 109 ~~V~Lra~nGkyvs~ 123 (284)
..|..++.+|+-|..
T Consensus 21 ~~V~v~s~~Gr~v~~ 35 (68)
T PF11197_consen 21 SKVVVRSDDGRRVQF 35 (68)
T ss_pred cEEEEEecCCcEEEE
Confidence 557677777777765
No 47
>PF04790 Sarcoglycan_1: Sarcoglycan complex subunit protein; InterPro: IPR006875 The dystrophin glycoprotein complex (DGC) is a membrane-spanning complex that links the interior cytoskeleton to the extracellular matrix in muscle. The sarcoglycan complex is a subcomplex within the DGC and is composed of several muscle-specific, transmembrane proteins (alpha-, beta-, gamma-, delta- and zeta-sarcoglycan). The sarcoglycans are asparagine-linked glycosylated proteins with single transmembrane domains. This family contains beta, gamma and delta members [, ].; GO: 0007010 cytoskeleton organization, 0016012 sarcoglycan complex, 0016021 integral to membrane
Probab=24.36 E-value=5.5e+02 Score=23.48 Aligned_cols=16 Identities=6% Similarity=-0.153 Sum_probs=10.1
Q ss_pred EEEEECCCCeEEeecC
Q psy16240 111 SKFKIYSVPLSFLSSG 126 (284)
Q Consensus 111 V~Lra~nGkyvs~~~~ 126 (284)
+.+++..|-++.++.+
T Consensus 182 L~~~a~egV~i~a~ag 197 (264)
T PF04790_consen 182 LSMRAPEGVHIEAKAG 197 (264)
T ss_pred EEEECCCCeEEEeccC
Confidence 6677777766665443
No 48
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=23.19 E-value=3.1e+02 Score=20.07 Aligned_cols=46 Identities=17% Similarity=0.320 Sum_probs=25.2
Q ss_pred CCceEEEEEeCCCeEEEEeCCCCeEEEccC---ceEEEcccCC-CCCceEEEE
Q psy16240 180 SNALYDLVWQGDGSVALRANNGKFIATKRS---GHLYANSDSA-DDSCRYYFY 228 (284)
Q Consensus 180 ~~~~F~le~~~~g~v~lra~nGkyl~~~~~---G~L~A~~~~~-~~~e~F~~~ 228 (284)
....|.+.. .|.+.++. +++.|.+... ..+....-.. .+...|.+.
T Consensus 34 ~~Q~W~~~~--~g~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~ 83 (124)
T cd00161 34 NNQKWTLTS--DGTIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFN 83 (124)
T ss_pred ccCCEEEeC--CCeEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEEC
Confidence 455566654 36676666 6777776543 2333322222 456677764
No 49
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=23.13 E-value=7e+02 Score=24.19 Aligned_cols=61 Identities=15% Similarity=0.117 Sum_probs=27.9
Q ss_pred EECCCCeEEeecCceEEeecCCCCC-CceEEEEEeCCCCeEEEE-EcCceeEEEeCCCcEEecCC
Q psy16240 114 KIYSVPLSFLSSGVDVTANQEEISD-HETFQLEFDPSTKRWYIR-TMQDRYWTLETGGGIQASGD 176 (284)
Q Consensus 114 ra~nGkyvs~~~~g~l~A~~~~~~~-~e~F~l~~d~~~g~~~Lr-s~nGkY~~v~~~G~v~a~~~ 176 (284)
+..+|+++-+...|.+.-..+.-.. |+.-.... ......+- ..+|..|-+...|.|..+.+
T Consensus 246 ~~~dG~~~~vg~~G~~~~s~d~G~~~W~~~~~~~--~~~l~~v~~~~dg~l~l~g~~G~l~~S~d 308 (398)
T PLN00033 246 RSPDGDYVAVSSRGNFYLTWEPGQPYWQPHNRAS--ARRIQNMGWRADGGLWLLTRGGGLYVSKG 308 (398)
T ss_pred EcCCCCEEEEECCccEEEecCCCCcceEEecCCC--ccceeeeeEcCCCCEEEEeCCceEEEecC
Confidence 3577777777776655443322111 22222110 11111111 35666666666676655444
No 50
>KOG2394|consensus
Probab=22.56 E-value=6.6e+02 Score=25.70 Aligned_cols=41 Identities=12% Similarity=0.175 Sum_probs=20.5
Q ss_pred EeCCCCeEEE--ccCceEEEcccCCCCCceEEEEEecCCeEEEeec
Q psy16240 197 RANNGKFIAT--KRSGHLYANSDSADDSCRYYFYLINRPILVLKCE 240 (284)
Q Consensus 197 ra~nGkyl~~--~~~G~L~A~~~~~~~~e~F~~~lvnrp~lvL~g~ 240 (284)
|+.-|-+||+ -++|.+++.+.+.+=-+.|.++. --||.||+
T Consensus 329 kSYFGGLLCvcWSPDGKyIvtGGEDDLVtVwSf~e---rRVVARGq 371 (636)
T KOG2394|consen 329 KSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSFEE---RRVVARGQ 371 (636)
T ss_pred HhhccceEEEEEcCCccEEEecCCcceEEEEEecc---ceEEEecc
Confidence 4444555554 25555555554443345565543 23566664
No 51
>PF08709 Ins145_P3_rec: Inositol 1,4,5-trisphosphate/ryanodine receptor; InterPro: IPR014821 This domain corresponds to the ligand binding region on inositol 1,4,5-trisphosphate receptor, and the N-terminal region of the ryanodine receptor. Both receptors are involved in Ca2+ release. They can couple to the activation of neurotransmitter-gated receptors and voltage-gated Ca2+ channels on the plasma membrane, thus allowing the endoplasmic reticulum to discriminate between different types of neuronal activity []. ; PDB: 3UJ4_B 3UJ0_B 3T8S_B 3JRR_B 1XZZ_A 3ILA_B 3HSM_A 2XOA_A 3IM6_A 3IM7_A ....
Probab=22.28 E-value=3.6e+02 Score=23.59 Aligned_cols=76 Identities=9% Similarity=0.131 Sum_probs=38.9
Q ss_pred eEEEE-EcCceeEEEeCCC---------cEEecCCCCC-CCceEEEEEe----C-------CCeEEEEe-CCCCeEEEc-
Q psy16240 152 RWYIR-TMQDRYWTLETGG---------GIQASGDKKS-SNALYDLVWQ----G-------DGSVALRA-NNGKFIATK- 207 (284)
Q Consensus 152 ~~~Lr-s~nGkY~~v~~~G---------~v~a~~~~~~-~~~~F~le~~----~-------~g~v~lra-~nGkyl~~~- 207 (284)
.++|+ ..++||+++..+- .|.-... .. ..++|.|... . ...|.|.+ ..++||-+.
T Consensus 103 ~IQL~H~~S~kyL~~~~~~~s~~e~~~~~v~L~~~-~~~e~s~F~i~P~~k~r~~Gd~V~~gD~i~l~~~~~~~~Lh~s~ 181 (214)
T PF08709_consen 103 AIQLLHVKSNKYLTCNSTEPSEYEKNNFKVSLQEF-SSGENSWFRIHPAYKQRSEGDPVRYGDQIILISVSTEQYLHVSS 181 (214)
T ss_dssp EEEEEETTTTEEEEEEEEEESSSSTTSEEEEEESS-SSSGGGEEEEEESSTTS-TTSB-BTT-EEEEEETTT-SEEEEEE
T ss_pred eEEEeEeCccEEEEEeCCCCCcccccceEEEeccC-CCcccEEEEEEcchheEcCCCeeeeCCEEEEEECCCCCcccccC
Confidence 45666 4788888875322 1222222 23 5688988743 1 13467766 668888843
Q ss_pred ------cCceEEEcccCCCCCceEEEEEe
Q psy16240 208 ------RSGHLYANSDSADDSCRYYFYLI 230 (284)
Q Consensus 208 ------~~G~L~A~~~~~~~~e~F~~~lv 230 (284)
..+.+..++. ...+.|.+.+.
T Consensus 182 ~~~~~~~~~~~eVn~~--~~~T~w~i~~~ 208 (214)
T PF08709_consen 182 NKSLSDNKGCKEVNAS--FQQTSWKINPF 208 (214)
T ss_dssp EEEESSSSSCEEEEES--SS--EEEEEEE
T ss_pred ccccccCCCceEEEEE--ccccceEEEEE
Confidence 2333333332 24456666543
No 52
>PF11250 DUF3049: Protein of unknown function (DUF3049); InterPro: IPR021410 This eukaryotic family of proteins has no known function.
Probab=21.12 E-value=1.5e+02 Score=20.56 Aligned_cols=29 Identities=14% Similarity=0.264 Sum_probs=13.9
Q ss_pred CCcEEecCCCCCCCceEEEEEeCCCeEEEE
Q psy16240 168 GGGIQASGDKKSSNALYDLVWQGDGSVALR 197 (284)
Q Consensus 168 ~G~v~a~~~~~~~~~~F~le~~~~g~v~lr 197 (284)
||.++..+-.+...+.|+.+..+ |++.|.
T Consensus 26 dGRLvl~~v~v~~~~~~~A~R~~-GRL~L~ 54 (56)
T PF11250_consen 26 DGRLVLEEVRVPSHEYFHAERED-GRLRLQ 54 (56)
T ss_pred CCEEEEEEEEcCCcceEEEEccC-CEEEEE
Confidence 34433333333345556665555 666553
No 53
>PLN00115 pollen allergen group 3; Provisional
Probab=21.00 E-value=2.1e+02 Score=22.99 Aligned_cols=30 Identities=17% Similarity=0.190 Sum_probs=19.1
Q ss_pred eeEEEEEcCCeEEEEE---CCCCeEEeecCceE
Q psy16240 100 YFYGLYCTDKPSKFKI---YSVPLSFLSSGVDV 129 (284)
Q Consensus 100 E~F~l~~~~~~V~Lra---~nGkyvs~~~~g~l 129 (284)
++|.+-+++..|+|+- +|-.|+.+-.++++
T Consensus 15 ~l~~~~~~g~~v~F~V~~gSnp~yL~ll~~~dI 47 (118)
T PLN00115 15 ALFAVGSCATEVTFKVGKGSSSTSLELVTNVAI 47 (118)
T ss_pred HHhhhhhcCCceEEEECCCCCcceEEEEEeCCE
Confidence 5667777777787774 44557766554443
Done!