Query psy2222
Match_columns 264
No_of_seqs 174 out of 322
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 19:18:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2222.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2222hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3962|consensus 100.0 8.1E-66 1.8E-70 446.1 16.5 246 8-263 1-246 (246)
2 PF06229 FRG1: FRG1-like famil 100.0 9.4E-61 2E-65 414.1 12.9 188 71-262 2-191 (191)
3 cd00257 Fascin Fascin-like dom 99.8 3E-20 6.6E-25 148.5 13.6 113 65-183 4-119 (119)
4 PF06268 Fascin: Fascin domain 99.8 8.7E-18 1.9E-22 133.6 13.8 109 70-183 2-111 (111)
5 cd00257 Fascin Fascin-like dom 99.4 1.9E-12 4E-17 103.4 10.6 78 108-187 3-82 (119)
6 PF06229 FRG1: FRG1-like famil 99.4 9.2E-13 2E-17 114.6 9.0 144 44-224 24-174 (191)
7 PF06268 Fascin: Fascin domain 99.1 2.5E-10 5.5E-15 90.6 9.5 91 44-142 21-111 (111)
8 PF04601 DUF569: Protein of un 98.7 5.2E-07 1.1E-11 75.1 13.9 97 64-164 9-118 (142)
9 PF00167 FGF: Fibroblast growt 98.0 0.00022 4.7E-09 57.5 12.6 98 63-166 3-105 (122)
10 cd00058 FGF Acidic and basic f 97.7 0.0013 2.9E-08 53.4 13.7 115 64-184 2-122 (123)
11 KOG3962|consensus 97.7 0.00015 3.2E-09 64.3 7.9 80 108-188 49-134 (246)
12 smart00442 FGF Acidic and basi 97.7 0.0024 5.2E-08 52.1 14.2 115 63-183 5-125 (126)
13 PF04601 DUF569: Protein of un 97.6 0.00024 5.2E-09 59.3 7.4 58 108-165 9-70 (142)
14 PF14200 RicinB_lectin_2: Rici 97.4 0.004 8.6E-08 48.3 11.8 89 96-184 2-105 (105)
15 PF00167 FGF: Fibroblast growt 97.4 0.0034 7.4E-08 50.4 11.6 76 109-186 3-80 (122)
16 smart00791 Agglutinin Amaranth 97.4 0.0038 8.3E-08 51.5 11.8 114 64-182 7-136 (139)
17 cd00058 FGF Acidic and basic f 97.1 0.0058 1.3E-07 49.7 9.5 72 113-186 5-78 (123)
18 smart00442 FGF Acidic and basi 96.9 0.013 2.9E-07 47.7 10.0 77 108-186 4-82 (126)
19 PF07468 Agglutinin: Agglutini 96.8 0.048 1E-06 46.0 12.9 116 64-183 7-151 (153)
20 KOG3885|consensus 95.8 0.14 3.1E-06 43.4 10.6 100 61-166 26-133 (155)
21 KOG3885|consensus 95.7 0.064 1.4E-06 45.4 8.1 74 108-184 29-104 (155)
22 smart00791 Agglutinin Amaranth 95.5 0.096 2.1E-06 43.3 8.2 66 108-175 7-78 (139)
23 PF07468 Agglutinin: Agglutini 94.9 0.21 4.5E-06 42.2 8.8 68 108-175 7-85 (153)
24 PF08555 DUF1754: Eukaryotic f 94.4 0.021 4.5E-07 44.3 1.5 16 3-18 1-16 (98)
25 PF03498 CDtoxinA: Cytolethal 93.2 0.34 7.4E-06 40.8 6.8 62 61-126 52-115 (150)
26 PF14200 RicinB_lectin_2: Rici 83.9 17 0.00036 27.7 10.3 72 47-123 2-79 (105)
27 PF03664 Glyco_hydro_62: Glyco 81.2 15 0.00033 33.6 9.5 106 40-161 105-240 (271)
28 PF05270 AbfB: Alpha-L-arabino 80.0 13 0.00028 31.0 8.1 69 96-165 38-119 (142)
29 cd00161 RICIN Ricin-type beta- 73.4 35 0.00075 25.3 10.9 65 94-163 33-101 (124)
30 PF05270 AbfB: Alpha-L-arabino 70.5 20 0.00043 29.9 6.9 78 109-186 3-94 (142)
31 KOG3410|consensus 66.0 1.2 2.7E-05 36.0 -1.1 19 1-19 2-20 (120)
32 KOG1446|consensus 56.2 1.3E+02 0.0028 28.4 10.0 100 64-164 137-252 (311)
33 KOG4226|consensus 49.2 54 0.0012 30.7 6.3 114 42-162 220-351 (379)
34 COG2706 3-carboxymuconate cycl 40.1 1.3E+02 0.0028 28.8 7.5 71 113-183 39-118 (346)
35 PF03498 CDtoxinA: Cytolethal 35.4 2.3E+02 0.0049 23.8 7.6 65 100-164 44-113 (150)
36 PF14870 PSII_BNR: Photosynthe 34.3 3.9E+02 0.0085 24.9 13.5 42 110-153 150-191 (302)
37 PF15017 AF1Q: Drug resistance 30.1 24 0.00053 27.1 0.8 19 41-59 66-84 (87)
38 KOG1691|consensus 29.7 2.9E+02 0.0063 24.6 7.6 60 146-214 82-142 (210)
39 PF02061 Lambda_CIII: Lambda P 27.7 29 0.00062 23.0 0.7 10 245-254 15-24 (45)
40 cd00161 RICIN Ricin-type beta- 27.7 2.4E+02 0.0053 20.5 8.2 26 136-163 34-59 (124)
41 PF00340 IL1: Interleukin-1 / 27.1 2.6E+02 0.0056 22.4 6.4 64 62-125 27-99 (120)
42 PF11250 DUF3049: Protein of u 26.0 1.3E+02 0.0028 21.1 3.9 30 123-152 25-54 (56)
43 PF08758 Cadherin_pro: Cadheri 25.1 1.7E+02 0.0037 22.3 4.7 37 108-145 38-76 (90)
44 smart00458 RICIN Ricin-type be 24.4 2.9E+02 0.0063 20.3 10.7 64 94-163 28-94 (117)
45 TIGR03422 mito_frataxin fratax 24.3 98 0.0021 24.0 3.3 26 93-120 54-80 (97)
46 PF01491 Frataxin_Cyay: Fratax 23.8 67 0.0014 25.3 2.3 26 93-120 57-83 (109)
47 COG2747 FlgM Negative regulato 23.3 48 0.001 25.8 1.4 36 211-246 39-75 (93)
48 PF14913 DPCD: DPCD protein fa 22.1 4.3E+02 0.0093 23.3 7.1 48 70-121 87-135 (194)
49 PF11138 DUF2911: Protein of u 21.3 2.4E+02 0.0053 23.6 5.3 70 62-146 47-124 (145)
50 COG2706 3-carboxymuconate cycl 21.2 5.4E+02 0.012 24.7 8.1 10 113-122 252-261 (346)
51 PF10467 Inhibitor_I48: Peptid 21.0 4.4E+02 0.0095 22.0 6.6 65 95-164 45-125 (149)
No 1
>KOG3962|consensus
Probab=100.00 E-value=8.1e-66 Score=446.10 Aligned_cols=246 Identities=46% Similarity=0.824 Sum_probs=226.3
Q ss_pred cccceeeecCCCCccccccccccccCCCCCCCCCCCcCCCCCeeeccccCCCCCCEEEEeCCCcEEEEcCCCceEeeccC
Q psy2222 8 KRGKLVLKGEKPAKKKKKNKSDTQAHGSNSKQVDPDIENYAGWTQVKHIKDIVGAISIEFGKRTFVSALDNGLFILGAIH 87 (264)
Q Consensus 8 k~~kL~fKGdk~~kkkkk~~~~~~~~~~~~~~~~~d~~~~~~W~~~~~~~dI~Gp~~i~~~s~~yL~a~~~G~f~~~~~~ 87 (264)
+++.|+|||+|+.++++++++++.. ..+++.++|++||.+....||.|.+.|.+.+++||.|++||+|++++||
T Consensus 1 ~~~~l~lk~~Kk~k~kkkk~kk~~~------~~~~~~~~~g~w~~~~~~~~~~g~v~ie~~~~~yl~a~dng~ft~g~ph 74 (246)
T KOG3962|consen 1 KSMGLKLKGDKKIKKKKKKNKKSKE------SSDEDAVNDGGWWDANELNDIEGTVAIEIDDGTYLGAMDNGLFTLGAPH 74 (246)
T ss_pred CCCcceeecccccchhccccchhhh------hcchhhhcCCcceeccccceeeEEEEEEecCceEEEEEecCceeeccCC
Confidence 4789999999997765555444332 3578889999999999999999999999999999999999999999999
Q ss_pred CCCCCCCccccEEEEEecCCeEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEeCCceEEeccCceeEEEecCCCe
Q psy2222 88 EDKEGPSPEEIFTAILVNDSKVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFEDDNMALLGANQCFMSVSEQDDS 167 (264)
Q Consensus 88 ~~~e~~~~~evf~~~~v~dt~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~~g~~alq~~ng~fLs~~~~~~~ 167 (264)
+..++|+|.++|+++.++|++|+|||++|+||+|+++|.|+|+++|+|++|+|+++|+.|++++++.|++|+.+++.++
T Consensus 75 ~~~~gp~p~e~f~avki~dsrIaLKsGyGKYlsinsdglvvg~qeAvG~~EQw~~vFq~~r~a~~as~s~~~~~~e~~d- 153 (246)
T KOG3962|consen 75 DEVDGPEPEEQFMAVKISDSRIALKSGYGKYLSINSDGLVVGRQEAVGSREQWEPVFQEGRMALLASNSCFIRCNEAGD- 153 (246)
T ss_pred ccccCCCchhhEEEEEccCceEEecccccceeeecCCccEEEehhhcCcHhhchhhhhccceEEeeccceeEEechhhc-
Confidence 9888999999999999999999999999999999999999999999999999999999999999999999999988876
Q ss_pred eEEecccCCCcceEEEEEcccccccCCCCCCCChhhhcCChhHHHHHHHHhhhhccCCceecCccchHHHHHHHhhchhH
Q psy2222 168 IVATSRKAGKNEMVKLRSNSLHLKNLDKDDDIPEEERTNKLSDIELNYVKKFQKFQDKKIKLNSDNIAVLKEAKAQGYLH 247 (264)
Q Consensus 168 l~a~s~~~g~~e~~~iR~~a~~~~~~~~~~~~~~e~~~~~~~~~E~~~vk~~Q~~~~~~~~l~~~~~~~LkkA~~eG~lh 247 (264)
+.+-+.+++.+++..||.++......+. ++|.|++| .+.+||.|||++||+|||++++++.+|++.||+|+++|.||
T Consensus 154 ~~va~kt~t~~~~ik~r~~~~~~~~~kd--~~~~edk~-~~~evE~nyvkkfq~fqd~kl~i~~~d~K~lkkA~kdG~lH 230 (246)
T KOG3962|consen 154 AEVASKTATEEPEIKIRSCAAQRDKKKD--TIPEEDKG-NVKEVEINYVKKFQSFQDRKLKIDKEDKKILKKAQKDGSLH 230 (246)
T ss_pred hhhhcccCCCCceeecccchhhcccccC--CCcccccc-ccchhhHHHHHHHhhhhhcccccchhHHHHHHHHHhcchHH
Confidence 6667788999999999997765555443 77899999 99999999999999999999999999999999999999999
Q ss_pred HHhhhhhhccccccCC
Q psy2222 248 ETMLDRRSKMKADRYC 263 (264)
Q Consensus 248 E~lLDrR~K~K~Dryc 263 (264)
|+|||||+||||||||
T Consensus 231 EtLLDrR~k~ksDrYc 246 (246)
T KOG3962|consen 231 ETLLDRRVKMKSDRYC 246 (246)
T ss_pred HHHHhhhhhhcccccC
Confidence 9999999999999999
No 2
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=100.00 E-value=9.4e-61 Score=414.08 Aligned_cols=188 Identities=51% Similarity=0.805 Sum_probs=78.8
Q ss_pred cEEEEcCCCceEeeccCCCCCCCCccccEEEEEecC-CeEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEeCCce
Q psy2222 71 TFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVND-SKVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFEDDNM 149 (264)
Q Consensus 71 ~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~d-t~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~~g~~ 149 (264)
+||.|++||.|++++||+++|+|+|+|||+++++++ ++|+|||+||+||+||++|.|+|+++||||+|+|+|||++|++
T Consensus 2 syi~a~d~G~~t~~ePhd~~~~p~p~qV~va~~v~~~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~~~~~ 81 (191)
T PF06229_consen 2 SYIEALDNGLFTTGEPHDVGEGPDPRQVWVATRVPGDEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQDGKP 81 (191)
T ss_dssp -BEEE-TTS-EEE----SSS----TTT-EEEEE--SSS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-STT--
T ss_pred ceeeeeccCCccccCCCcCCCCCChhHeEEEEEecCCCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEECCCCe
Confidence 799999999999999999999999999999999954 4999999999999999999999999999999999999999999
Q ss_pred EEec-cCceeEEEecCCCeeEEecccCCCcceEEEEEcccccccCCCCCCCChhhhcCChhHHHHHHHHhhhhccCCcee
Q psy2222 150 ALLG-ANQCFMSVSEQDDSIVATSRKAGKNEMVKLRSNSLHLKNLDKDDDIPEEERTNKLSDIELNYVKKFQKFQDKKIK 228 (264)
Q Consensus 150 alq~-~ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~~a~~~~~~~~~~~~~~e~~~~~~~~~E~~~vk~~Q~~~~~~~~ 228 (264)
++++ .|++||++++++. |+|++.+++++|+|.||.++.++..... +++.++++ +++++|.+|+|+||+||+++++
T Consensus 82 a~~~~~~~~FLs~~~~~~-i~a~s~~a~~~e~~~iR~~~~~~~~~~~--~~~~~~r~-~~~~~e~~~~k~~q~~~~~~l~ 157 (191)
T PF06229_consen 82 ALFSSSNNKFLSVDEEGD-IRADSKTAGENEMIKIRSDAEREFKTTD--RIRMQDRF-KPKQCESNYVKKFQKISDKELR 157 (191)
T ss_dssp EEEE-TTS-BEEE-SSS--EEE--S---TTT--EEEE-S-----------------------------------------
T ss_pred EEEecCCCeEEEEecccC-eeeccccCCCCceEEEEEeccccCCccc--cccccccC-CHHHHhhHHHHHHhhhhccccc
Confidence 9999 9999999999988 9999999999999999999998655443 55788899 9999999999999999999999
Q ss_pred cCccchHHHHHHHhhchhHHHhhhhhhccccccC
Q psy2222 229 LNSDNIAVLKEAKAQGYLHETMLDRRSKMKADRY 262 (264)
Q Consensus 229 l~~~~~~~LkkA~~eG~lhE~lLDrR~K~K~Dry 262 (264)
|+++|++.|||||+||+|||+|||||+|||||||
T Consensus 158 l~~~e~~~LkkAr~eG~lhE~LLDrR~K~KsDry 191 (191)
T PF06229_consen 158 LSDDEVKELKKARKEGNLHEALLDRRSKMKSDRY 191 (191)
T ss_dssp ----------------------------------
T ss_pred cCHHHHHHHHHHHhcChHHHHHHHHHhhhcccCC
Confidence 9999999999999999999999999999999999
No 3
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.84 E-value=3e-20 Score=148.47 Aligned_cols=113 Identities=25% Similarity=0.315 Sum_probs=99.1
Q ss_pred EEeC--CCcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEecCCeEEEEecCCcEEEeccCCCCCCceEE
Q psy2222 65 IEFG--KRTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEP 142 (264)
Q Consensus 65 i~~~--s~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~i 142 (264)
|.|. .|.||++.+.| ..+.++ ++.+.++|+|.+++.+++.|+||+++|+||+++.+|.|+|+++ ++++|.|++
T Consensus 4 v~Lrs~~gkyl~~~~~g-~~v~a~---~~~~~~~e~F~l~~~~~g~v~Lrs~~G~yls~~~~g~l~~~~~-~~~~e~F~~ 78 (119)
T cd00257 4 VVLRSVNGRYLSAEAGG-DKVDAN---RDSLKGDETFTLEFDNTGKYALRSHDGKYLSADSDGGVQLEGH-PNADCRFTL 78 (119)
T ss_pred EEEEEcCCCEEEEeccC-CEEEEc---CccCCCceEEEEEECCCCeEEEEECCCcEEEEECCCCEEecCC-CCCCcEEEE
Confidence 4444 49999999988 344333 4467779999999988889999999999999999999999999 999999999
Q ss_pred EEe-CCceEEeccCceeEEEecCCCeeEEecccCCCcceEEE
Q psy2222 143 VFE-DDNMALLGANQCFMSVSEQDDSIVATSRKAGKNEMVKL 183 (264)
Q Consensus 143 v~~-~g~~alq~~ng~fLs~~~~~~~l~a~s~~~g~~e~~~i 183 (264)
++. +|.|+|++.||+||++++.+. |.|++..++++|.|.+
T Consensus 79 e~~~~g~~al~~~~G~yl~~~~~g~-l~~~~~~~~~~e~f~~ 119 (119)
T cd00257 79 EFHGDGKWALRAENGRYLGGDGSGT-LKASSETVGPDELFEL 119 (119)
T ss_pred EECCCCeEEEEcCCCCEEeecCCCe-EEEecCCCCccceecC
Confidence 998 599999999999999998876 9999999999999964
No 4
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.77 E-value=8.7e-18 Score=133.60 Aligned_cols=109 Identities=28% Similarity=0.403 Sum_probs=93.7
Q ss_pred CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCC-eEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEeCCc
Q psy2222 70 RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDS-KVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFEDDN 148 (264)
Q Consensus 70 ~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt-~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~~g~ 148 (264)
+.|+.+...|. .+.++ ....+.+|+|+|+...++ .|+|||++|+||+++.+|.|+|+++++++++.|+|+++++.
T Consensus 2 ~~~~~~~k~~~-~l~an---~~~~~~~e~f~le~~~~~~~v~lrs~~GkYls~~~~G~v~~~~~~~~~~~~F~i~~~~~~ 77 (111)
T PF06268_consen 2 NGYLVSEKFGA-HLNAN---RASLSDWETFQLEFDDGSYKVALRSHNGKYLSVDSDGSVVADSETPGPDEFFEIEWHGGK 77 (111)
T ss_dssp TEEEEETTCTC-BEEEE---ESSSSCGGSEEEEEETTEEEEEEECTTSEEEEEETTSEEEEEESSSSGGGCBEEEEETTE
T ss_pred CcEEEEEEcCC-EEECC---hhcCcccEEEEEEEECCCCEEEEEcCCCCEEEEcCCCeEEecCCCCCCCcEEEEEECCCE
Confidence 35677776662 22222 234667899999988887 88999999999999999999999999999999999999888
Q ss_pred eEEeccCceeEEEecCCCeeEEecccCCCcceEEE
Q psy2222 149 MALLGANQCFMSVSEQDDSIVATSRKAGKNEMVKL 183 (264)
Q Consensus 149 ~alq~~ng~fLs~~~~~~~l~a~s~~~g~~e~~~i 183 (264)
+++++.||+||++...+. |.|+++.++.+|.|.+
T Consensus 78 ~~~~~~nGkYl~~~~~g~-l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 78 VALRASNGKYLSAGPNGQ-LKANATSPGKDELFEY 111 (111)
T ss_dssp EEEECTTSCEEEEETTTE-EEEEESSSSGGGEEEE
T ss_pred EEEECCCCCEEeeCCCCe-EEEcCCCCCcceEEeC
Confidence 889999999999999987 9999999999999974
No 5
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.41 E-value=1.9e-12 Score=103.35 Aligned_cols=78 Identities=18% Similarity=0.294 Sum_probs=72.5
Q ss_pred eEEEEecCCeEEEEecCC-cEEEeccCCCCCCceEEEEe-CCceEEeccCceeEEEecCCCeeEEecccCCCcceEEEEE
Q psy2222 108 KVAFKSGFDKYLSIDKNG-RVTGRSDAVGPLEQWEPVFE-DDNMALLGANQCFMSVSEQDDSIVATSRKAGKNEMVKLRS 185 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G-~v~a~a~a~g~~E~f~iv~~-~g~~alq~~ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~ 185 (264)
.++||+++|+||++..+| .|.|+++.++++|.|+++.. +|.++|++.||+||++++++. |.+++. ++++|.|.+..
T Consensus 3 ~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e~F~l~~~~~g~v~Lrs~~G~yls~~~~g~-l~~~~~-~~~~e~F~~e~ 80 (119)
T cd00257 3 QVVLRSVNGRYLSAEAGGDKVDANRDSLKGDETFTLEFDNTGKYALRSHDGKYLSADSDGG-VQLEGH-PNADCRFTLEF 80 (119)
T ss_pred EEEEEEcCCCEEEEeccCCEEEEcCccCCCceEEEEEECCCCeEEEEECCCcEEEEECCCC-EEecCC-CCCCcEEEEEE
Confidence 589999999999999999 99999999999999999877 899999999999999998886 899888 99999999987
Q ss_pred cc
Q psy2222 186 NS 187 (264)
Q Consensus 186 ~a 187 (264)
+.
T Consensus 81 ~~ 82 (119)
T cd00257 81 HG 82 (119)
T ss_pred CC
Confidence 64
No 6
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.40 E-value=9.2e-13 Score=114.58 Aligned_cols=144 Identities=19% Similarity=0.269 Sum_probs=54.3
Q ss_pred cCCCCCeeeccccCCCCCCEEEEeCC--CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEE
Q psy2222 44 IENYAGWTQVKHIKDIVGAISIEFGK--RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLS 120 (264)
Q Consensus 44 ~~~~~~W~~~~~~~dI~Gp~~i~~~s--~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs 120 (264)
+...|.|+....+++ ..|.|++ |+||+|+.+|.+++ .+++.+|+|.|+++...++ .++.+ .+++||+
T Consensus 24 p~p~qV~va~~v~~~----~~iafKs~~GkYLs~Dk~G~v~a-----~sdAiGp~E~f~~V~~~~~-~a~~~~~~~~FLs 93 (191)
T PF06229_consen 24 PDPRQVWVATRVPGD----EKIAFKSGHGKYLSCDKDGIVSA-----RSDAIGPQEQFEPVFQDGK-PALFSSSNNKFLS 93 (191)
T ss_dssp --TTT-EEEEE--SS----S-EEEEETTS-BEEE-SSSBEEE-------SS--TTTBEEEE-STT---EEEE-TTS-BEE
T ss_pred CChhHeEEEEEecCC----CceEeeccCccEEEEcCCCcEEE-----EeecCCCceEEEEEECCCC-eEEEecCCCeEEE
Confidence 345789997776544 4677777 99999999999888 6789999999999886554 55555 9999999
Q ss_pred EecCCcEEEeccCCCCCCceEEEEeCCceEEeccCceeEEEecCCCeeEEecccCCCcceEEEEEcccccccCCCCCCC-
Q psy2222 121 IDKNGRVTGRSDAVGPLEQWEPVFEDDNMALLGANQCFMSVSEQDDSIVATSRKAGKNEMVKLRSNSLHLKNLDKDDDI- 199 (264)
Q Consensus 121 ~d~~G~v~a~a~a~g~~E~f~iv~~~g~~alq~~ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~~a~~~~~~~~~~~~- 199 (264)
++..+.|+|.+...++.|.|.|-- + +.. .+.+.+.||||+++.++..+.+-.
T Consensus 94 ~~~~~~i~a~s~~a~~~e~~~iR~-------------------~-------~~~-~~~~~~~~~~~~r~~~~~~e~~~~k 146 (191)
T PF06229_consen 94 VDEEGDIRADSKTAGENEMIKIRS-------------------D-------AER-EFKTTDRIRMQDRFKPKQCESNYVK 146 (191)
T ss_dssp E-SSS-EEE--S---TTT--EEEE---------------------------S----------------------------
T ss_pred EecccCeeeccccCCCCceEEEEE-------------------e-------ccc-cCCccccccccccCCHHHHhhHHHH
Confidence 999989999999999988666421 1 111 345566677776665554321000
Q ss_pred ---ChhhhcCChhHHHHHHHHhhhhccC
Q psy2222 200 ---PEEERTNKLSDIELNYVKKFQKFQD 224 (264)
Q Consensus 200 ---~~e~~~~~~~~~E~~~vk~~Q~~~~ 224 (264)
....+...+.+.|+..||+.+..|+
T Consensus 147 ~~q~~~~~~l~l~~~e~~~LkkAr~eG~ 174 (191)
T PF06229_consen 147 KFQKISDKELRLSDDEVKELKKARKEGN 174 (191)
T ss_dssp ----------------------------
T ss_pred HHhhhhccccccCHHHHHHHHHHHhcCh
Confidence 1111223678889999999999887
No 7
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.15 E-value=2.5e-10 Score=90.63 Aligned_cols=91 Identities=22% Similarity=0.350 Sum_probs=72.7
Q ss_pred cCCCCCeeeccccCCCCCCEEEEeCCCcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEecCCeEEEEec
Q psy2222 44 IENYAGWTQVKHIKDIVGAISIEFGKRTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKSGFDKYLSIDK 123 (264)
Q Consensus 44 ~~~~~~W~~~~~~~dI~Gp~~i~~~s~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs~~GkYLs~d~ 123 (264)
..+++.|....... ..-++|....|+||+++.+|..++ .++.+++++.|.++.. ++.+.|+.++|+||+++.
T Consensus 21 ~~~~e~f~le~~~~--~~~v~lrs~~GkYls~~~~G~v~~-----~~~~~~~~~~F~i~~~-~~~~~~~~~nGkYl~~~~ 92 (111)
T PF06268_consen 21 LSDWETFQLEFDDG--SYKVALRSHNGKYLSVDSDGSVVA-----DSETPGPDEFFEIEWH-GGKVALRASNGKYLSAGP 92 (111)
T ss_dssp SSCGGSEEEEEETT--EEEEEEECTTSEEEEEETTSEEEE-----EESSSSGGGCBEEEEE-TTEEEEECTTSCEEEEET
T ss_pred CcccEEEEEEEECC--CCEEEEEcCCCCEEEEcCCCeEEe-----cCCCCCCCcEEEEEEC-CCEEEEECCCCCEEeeCC
Confidence 34567888764443 122234433499999999998777 3456889999999988 678999999999999999
Q ss_pred CCcEEEeccCCCCCCceEE
Q psy2222 124 NGRVTGRSDAVGPLEQWEP 142 (264)
Q Consensus 124 ~G~v~a~a~a~g~~E~f~i 142 (264)
.|.|.|++++++++|.|++
T Consensus 93 ~g~l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 93 NGQLKANATSPGKDELFEY 111 (111)
T ss_dssp TTEEEEEESSSSGGGEEEE
T ss_pred CCeEEEcCCCCCcceEEeC
Confidence 9999999999999999985
No 8
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=98.71 E-value=5.2e-07 Score=75.08 Aligned_cols=97 Identities=20% Similarity=0.312 Sum_probs=71.3
Q ss_pred EEEeCC--CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCC--eEEEEecCCeEEEEecCC-------c--EEEe
Q psy2222 64 SIEFGK--RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDS--KVAFKSGFDKYLSIDKNG-------R--VTGR 130 (264)
Q Consensus 64 ~i~~~s--~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt--~~~lKs~~GkYLs~d~~G-------~--v~a~ 130 (264)
.+-|.| ++||.||++|.-+.-.+ .+.+..-+|++..+.+. -|.|||+||+||++.... . +...
T Consensus 9 ~VRLRS~~~kYL~ADeDg~~Vs~~~----~~~s~na~W~Ve~v~~~~~~v~L~saYGrYL~as~~~~~lG~~G~~v~Q~~ 84 (142)
T PF04601_consen 9 HVRLRSHHGKYLHADEDGEGVSQDR----RGASLNAAWTVERVPGSPNYVRLRSAYGRYLAASDEPALLGHTGRRVVQTD 84 (142)
T ss_pred EEEEEecCCCEEEEcCCCCeEEECC----CCCCCcceEEEEEecCCCCEEEEeeccCceEeccCCcCCCCCCCCEEEecC
Confidence 666777 99999999995433222 23456678999888874 899999999999997653 2 2222
Q ss_pred ccCCCCCCceEEEEeCCceEEeccCceeEEEecC
Q psy2222 131 SDAVGPLEQWEPVFEDDNMALLGANQCFMSVSEQ 164 (264)
Q Consensus 131 a~a~g~~E~f~iv~~~g~~alq~~ng~fLs~~~~ 164 (264)
.+.....-.|+++..++.+-|++.+|+||-+++.
T Consensus 85 ~~~~d~~~~Wepvr~g~~V~Lr~~~gr~LRANG~ 118 (142)
T PF04601_consen 85 PDRLDSSVEWEPVRDGFYVKLRHRSGRYLRANGG 118 (142)
T ss_pred CccCCCCceEEEecCCCEEEEEecCCceEEcCCC
Confidence 2334445578888777788899999999999775
No 9
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.97 E-value=0.00022 Score=57.47 Aligned_cols=98 Identities=15% Similarity=0.212 Sum_probs=77.9
Q ss_pred EEEEeCCCcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCcEEEeccCCCCCCceE
Q psy2222 63 ISIEFGKRTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGRVTGRSDAVGPLEQWE 141 (264)
Q Consensus 63 ~~i~~~s~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~v~a~a~a~g~~E~f~ 141 (264)
+.|.+.++.||....+|.... ..+..++..+|.++.+..+.|.|++ ..++||+.|..|.|.+.. ..++...|.
T Consensus 3 ~~Ly~~~~~~L~i~~~g~V~g-----t~~~~~~~s~~~i~~~~~g~V~i~~~~s~~YLcmn~~G~ly~~~-~~~~~C~F~ 76 (122)
T PF00167_consen 3 VQLYCRTGYFLQINPNGTVDG-----TGDDNSPYSVFEIHSVGFGVVRIRGVKSCRYLCMNKCGRLYGSK-NFNKDCVFR 76 (122)
T ss_dssp EEEEETTSEEEEEETTSBEEE-----ESSTTSTTGEEEEEEEETTEEEEEETTTTEEEEEBTTSBEEEES-SBTGGGEEE
T ss_pred EEEEECCCeEEEECCCCeEeC-----CCCcCcceeEEEEEeccceEEEEEEecceEEEEECCCCeEcccc-ccCCCceEE
Confidence 567888899999999996554 2344677889999999888999999 579999999999999964 466688888
Q ss_pred EEEeCCce-EEecc---CceeEEEecCCC
Q psy2222 142 PVFEDDNM-ALLGA---NQCFMSVSEQDD 166 (264)
Q Consensus 142 iv~~~g~~-alq~~---ng~fLs~~~~~~ 166 (264)
-.+.++.| .+.+. ++.||+++..|.
T Consensus 77 e~~~~n~y~~~~s~~~~~~~yla~~~~G~ 105 (122)
T PF00167_consen 77 EELLENGYNTYESAKYGRGWYLAFNRRGK 105 (122)
T ss_dssp EEEETTSEEEEEESTTGTTEBCEBCTTSB
T ss_pred EEEccCCEEEEEeccCCccEEEEECCCCC
Confidence 77765444 45554 589999999886
No 10
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=97.75 E-value=0.0013 Score=53.42 Aligned_cols=115 Identities=16% Similarity=0.219 Sum_probs=81.6
Q ss_pred EEEeCCCcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCcEEEeccCCCCCCceEE
Q psy2222 64 SIEFGKRTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGRVTGRSDAVGPLEQWEP 142 (264)
Q Consensus 64 ~i~~~s~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~v~a~a~a~g~~E~f~i 142 (264)
.|.+.++.||..+.+|...- ..+..++.-+|.+..+..+.|.||+ ..++||+.|+.|.|.+.. ..+....|.-
T Consensus 2 qLy~~~~~~L~I~~dG~V~G-----t~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~-~~~~dC~F~E 75 (123)
T cd00058 2 QLYCRTGFHLQILPDGTVDG-----TRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSK-GFTEECLFRE 75 (123)
T ss_pred eEEEcCCeEEEEcCCCcEec-----ccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECC-CCCCCCEEEE
Confidence 45566688899998885443 2334567788888888888999999 499999999999999977 8888889977
Q ss_pred EEeCCce-EEecc---CceeEEEecCCCeeEEe-cccCCCcceEEEE
Q psy2222 143 VFEDDNM-ALLGA---NQCFMSVSEQDDSIVAT-SRKAGKNEMVKLR 184 (264)
Q Consensus 143 v~~~g~~-alq~~---ng~fLs~~~~~~~l~a~-s~~~g~~e~~~iR 184 (264)
...+..| .+.+. .+.||+++..|..-.+. .........|-+|
T Consensus 76 ~~~~n~Y~~y~S~~~~~~~ylal~~~G~~rr~~~~~~~~~~~~Flp~ 122 (123)
T cd00058 76 ELLENNYNTYASAKYRRRWYLALNKKGRPRRGQLTKKNQKASHFLPR 122 (123)
T ss_pred EEccCCcEEEEEcccCCCcEEEECCCCCCccccccCCCCcceEEEeC
Confidence 6664333 44444 47899999888523332 2223344555554
No 11
>KOG3962|consensus
Probab=97.70 E-value=0.00015 Score=64.32 Aligned_cols=80 Identities=24% Similarity=0.229 Sum_probs=64.8
Q ss_pred eEEEEecCCeEEEEecCCcEEEec-----cCCCCCCceEEEEe-CCceEEeccCceeEEEecCCCeeEEecccCCCcceE
Q psy2222 108 KVAFKSGFDKYLSIDKNGRVTGRS-----DAVGPLEQWEPVFE-DDNMALLGANQCFMSVSEQDDSIVATSRKAGKNEMV 181 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G~v~a~a-----~a~g~~E~f~iv~~-~g~~alq~~ng~fLs~~~~~~~l~a~s~~~g~~e~~ 181 (264)
.|++.-..|.||.+-.+|..+--+ +.+.|.|+|.++.- |.+++|++.+|+||+++.++. |.|.+..||..|.|
T Consensus 49 ~v~ie~~~~~yl~a~dng~ft~g~ph~~~~gp~p~e~f~avki~dsrIaLKsGyGKYlsinsdgl-vvg~qeAvG~~EQw 127 (246)
T KOG3962|consen 49 TVAIEIDDGTYLGAMDNGLFTLGAPHDEVDGPEPEEQFMAVKISDSRIALKSGYGKYLSINSDGL-VVGRQEAVGSREQW 127 (246)
T ss_pred EEEEEecCceEEEEEecCceeeccCCccccCCCchhhEEEEEccCceEEecccccceeeecCCcc-EEEehhhcCcHhhc
Confidence 566666668899998887654432 36778899988865 899999999999999999998 99999999999999
Q ss_pred EEEEccc
Q psy2222 182 KLRSNSL 188 (264)
Q Consensus 182 ~iR~~a~ 188 (264)
.+=.|..
T Consensus 128 ~~vFq~~ 134 (246)
T KOG3962|consen 128 EPVFQEG 134 (246)
T ss_pred hhhhhcc
Confidence 8754433
No 12
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=97.67 E-value=0.0024 Score=52.11 Aligned_cols=115 Identities=15% Similarity=0.192 Sum_probs=79.2
Q ss_pred EEEEeCCCcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCcEEEeccCCCCCCceE
Q psy2222 63 ISIEFGKRTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGRVTGRSDAVGPLEQWE 141 (264)
Q Consensus 63 ~~i~~~s~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~v~a~a~a~g~~E~f~ 141 (264)
+.|.+..+.||..+.+|...- ..+..++.-+|.+..+..+.|.||+ ..++||+.|+.|.|.+... .+....|.
T Consensus 5 ~~Ly~~~~~~L~I~~~G~V~G-----t~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~-~~~dC~F~ 78 (126)
T smart00442 5 RQLYCRNGQHLQILPDGTVDG-----TRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKN-FTEDCVFR 78 (126)
T ss_pred EEEEeCCCeEEEEcCCceEec-----ccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEccc-CCCCcEEE
Confidence 567777788899988884332 1234566778888888778999999 5899999999999999766 67888886
Q ss_pred EEEeCCce-EEecc--Cc-eeEEEecCCCeeEEec-ccCCCcceEEE
Q psy2222 142 PVFEDDNM-ALLGA--NQ-CFMSVSEQDDSIVATS-RKAGKNEMVKL 183 (264)
Q Consensus 142 iv~~~g~~-alq~~--ng-~fLs~~~~~~~l~a~s-~~~g~~e~~~i 183 (264)
=...+..| .+.+. .+ -||+++..|..-.+.. ..-.....|-+
T Consensus 79 E~~~~n~y~~y~S~~~~~~~ylal~~~G~~r~~~~t~~~~~~~~Flp 125 (126)
T smart00442 79 EEMEENGYNTYASAKYRKRWYVALNKKGRPRRGQKTKPLQKASHFLP 125 (126)
T ss_pred EEeccCCeEEEEEcccCCceEEEECCCCCEeEcccCCCCCcccEeee
Confidence 65554333 33332 33 8999998886334432 22334455543
No 13
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=97.60 E-value=0.00024 Score=59.25 Aligned_cols=58 Identities=21% Similarity=0.335 Sum_probs=46.5
Q ss_pred eEEEEecCCeEEEEecCCc-EEEeccCCCCCCceEEEEeC---CceEEeccCceeEEEecCC
Q psy2222 108 KVAFKSGFDKYLSIDKNGR-VTGRSDAVGPLEQWEPVFED---DNMALLGANQCFMSVSEQD 165 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G~-v~a~a~a~g~~E~f~iv~~~---g~~alq~~ng~fLs~~~~~ 165 (264)
.|.|||++|+||.+|.||. |+=.....+....|.++.-. .-+-|++++|+||++.+..
T Consensus 9 ~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~~~~~v~L~saYGrYL~as~~~ 70 (142)
T PF04601_consen 9 HVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPGSPNYVRLRSAYGRYLAASDEP 70 (142)
T ss_pred EEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecCCCCEEEEeeccCceEeccCCc
Confidence 7999999999999999986 44444555678889887542 4678999999999997654
No 14
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=97.41 E-value=0.004 Score=48.27 Aligned_cols=89 Identities=16% Similarity=0.177 Sum_probs=65.0
Q ss_pred cccEEEEEec--CCeEEEEe-cCCeEEEEecCC-----cEEEeccCCCCCCceEEEEe-CCceEEecc-CceeEEEecCC
Q psy2222 96 EEIFTAILVN--DSKVAFKS-GFDKYLSIDKNG-----RVTGRSDAVGPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQD 165 (264)
Q Consensus 96 ~evf~~~~v~--dt~~~lKs-~~GkYLs~d~~G-----~v~a~a~a~g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~ 165 (264)
-|.|++.++. ++.|.|++ +.|+||.+...+ .|+.....-.+..+|.|+.. +|.|.|++. .|++|.+.+.+
T Consensus 2 nQ~W~~~~~~~~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~~~ 81 (105)
T PF14200_consen 2 NQQWTFTPVGDSDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVAGGS 81 (105)
T ss_dssp GGEEEEEEEETTTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEGGGS
T ss_pred CCEEEEEEecCCCCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEECCCC
Confidence 4789999987 56899998 799999998643 34444444589999999988 688999987 58999996543
Q ss_pred ----CeeEEec-ccCCCcceEEEE
Q psy2222 166 ----DSIVATS-RKAGKNEMVKLR 184 (264)
Q Consensus 166 ----~~l~a~s-~~~g~~e~~~iR 184 (264)
..|..-. +.-+.+..|+|+
T Consensus 82 ~~~g~~v~~~~~~~~~~~Q~W~l~ 105 (105)
T PF14200_consen 82 TANGTNVQQWEYDNGSDNQQWKLE 105 (105)
T ss_dssp SSTTEBEEEEE-STSSGGGEEEEE
T ss_pred CCCCCEEEEEeCCCCCccCEEEeC
Confidence 1233322 255677778774
No 15
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.40 E-value=0.0034 Score=50.44 Aligned_cols=76 Identities=16% Similarity=0.229 Sum_probs=64.2
Q ss_pred EEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEe-CCceEEecc-CceeEEEecCCCeeEEecccCCCcceEEEEEc
Q psy2222 109 VAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQDDSIVATSRKAGKNEMVKLRSN 186 (264)
Q Consensus 109 ~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~~ 186 (264)
++|=+.+|.||.+.++|.|.++.+..++...|+++.. .|.+.|++. .++||+++..|. |.+. ...+....|.-.+.
T Consensus 3 ~~Ly~~~~~~L~i~~~g~V~gt~~~~~~~s~~~i~~~~~g~V~i~~~~s~~YLcmn~~G~-ly~~-~~~~~~C~F~e~~~ 80 (122)
T PF00167_consen 3 VQLYCRTGYFLQINPNGTVDGTGDDNSPYSVFEIHSVGFGVVRIRGVKSCRYLCMNKCGR-LYGS-KNFNKDCVFREELL 80 (122)
T ss_dssp EEEEETTSEEEEEETTSBEEEESSTTSTTGEEEEEEEETTEEEEEETTTTEEEEEBTTSB-EEEE-SSBTGGGEEEEEEE
T ss_pred EEEEECCCeEEEECCCCeEeCCCCcCcceeEEEEEeccceEEEEEEecceEEEEECCCCe-Eccc-cccCCCceEEEEEc
Confidence 5666767999999999999999999999999999987 689999986 799999999887 7774 44555888887763
No 16
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.40 E-value=0.0038 Score=51.50 Aligned_cols=114 Identities=13% Similarity=0.165 Sum_probs=76.0
Q ss_pred EEEeCC--CcEEEEcCCCc---eEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCcEEEeccCC---
Q psy2222 64 SIEFGK--RTFVSALDNGL---FILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGRVTGRSDAV--- 134 (264)
Q Consensus 64 ~i~~~s--~~yL~a~~~G~---f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~v~a~a~a~--- 134 (264)
.+.|++ +.||.+...+. ..++.. +-.+|.-.|.+....++.|.+|+ +.|||+....+ =+.|.++.+
T Consensus 7 ~V~FKg~n~kYLry~~~~~~~~lqf~~d----dI~dp~v~~ev~~~~dg~V~ik~~~~nKfWr~s~~-WI~a~s~d~~e~ 81 (139)
T smart00791 7 YVLFKGNNQKYLRYQSIQQYGLLQFSAD----KILDPLVQFEVFPTYNGLVHIKSNYTNKFWRLSHY-WITADANDPDEN 81 (139)
T ss_pred EEEEEcCCCceEEEEeecccceeEeccc----ccCCcceeEEEEEcCCCcEEEEecCCCceEccCCC-EEEecCCCCccC
Confidence 677777 88988776443 233222 12344445555556777999999 79999988743 677877655
Q ss_pred -CCCCceEEEEe-CCceEEecc-CceeEEEecCCCe----eEEecccCCCcceEE
Q psy2222 135 -GPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQDDS----IVATSRKAGKNEMVK 182 (264)
Q Consensus 135 -g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~~~----l~a~s~~~g~~e~~~ 182 (264)
++-..|.|+.. ++.++||.. +|+|+.-...+.. |.|.+.+|.....+.
T Consensus 82 ~sscTLF~Pv~~d~~~i~lr~vq~~~~~~r~t~~~~~~~CL~A~~~~i~~~a~l~ 136 (139)
T smart00791 82 KSACTLFRPLYVEMKKIRLLNVQLGHYTKRYTVGKSFVSCLNAASSQIDTKAVLH 136 (139)
T ss_pred CCcccEEeEEeccCceEEEEEecCCceEEeecCCCCcccceeeccccccceeeEE
Confidence 46678999985 688999986 8888877554322 444555555544443
No 17
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=97.05 E-value=0.0058 Score=49.66 Aligned_cols=72 Identities=11% Similarity=0.114 Sum_probs=61.0
Q ss_pred ecCCeEEEEecCCcEEEeccCCCCCCceEEEEe-CCceEEecc-CceeEEEecCCCeeEEecccCCCcceEEEEEc
Q psy2222 113 SGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQDDSIVATSRKAGKNEMVKLRSN 186 (264)
Q Consensus 113 s~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~~ 186 (264)
+..|.||-++++|.|.++.+.-.+...|++... .|.+.|++. .++||+++..|. |.+.. ..+....|+-++.
T Consensus 5 ~~~~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~-ly~s~-~~~~dC~F~E~~~ 78 (123)
T cd00058 5 CRTGFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGK-LYGSK-GFTEECLFREELL 78 (123)
T ss_pred EcCCeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCC-EEECC-CCCCCCEEEEEEc
Confidence 444999999999999999998889999999876 699999997 899999999987 77655 6777777777664
No 18
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=96.85 E-value=0.013 Score=47.74 Aligned_cols=77 Identities=12% Similarity=0.139 Sum_probs=62.5
Q ss_pred eEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEe-CCceEEecc-CceeEEEecCCCeeEEecccCCCcceEEEEE
Q psy2222 108 KVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQDDSIVATSRKAGKNEMVKLRS 185 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~ 185 (264)
.++|=+.+|.||-+.++|.|.++.+...+...|++... .|.+.|++. .++||+++..|. |.+... .+.+..|+=.+
T Consensus 4 ~~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~-ly~s~~-~~~dC~F~E~~ 81 (126)
T smart00442 4 LRQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGK-LYGSKN-FTEDCVFREEM 81 (126)
T ss_pred EEEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCC-EEEccc-CCCCcEEEEEe
Confidence 35666778899999999999999999999999999876 599999986 799999999997 766443 55666665554
Q ss_pred c
Q psy2222 186 N 186 (264)
Q Consensus 186 ~ 186 (264)
.
T Consensus 82 ~ 82 (126)
T smart00442 82 E 82 (126)
T ss_pred c
Confidence 3
No 19
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=96.77 E-value=0.048 Score=46.00 Aligned_cols=116 Identities=15% Similarity=0.181 Sum_probs=70.2
Q ss_pred EEEeCC--CcEEEEcCCCc-----eEeeccCCCCCCCCccccEEEEE--ecCCeEEEEe-cCCeEEEEecCC--cEEEec
Q psy2222 64 SIEFGK--RTFVSALDNGL-----FILGAIHEDKEGPSPEEIFTAIL--VNDSKVAFKS-GFDKYLSIDKNG--RVTGRS 131 (264)
Q Consensus 64 ~i~~~s--~~yL~a~~~G~-----f~~~~~~~~~e~~~~~evf~~~~--v~dt~~~lKs-~~GkYLs~d~~G--~v~a~a 131 (264)
.+.++. +.||.+...+. +.++. .+..+|.-.|.+++ ..++.|.+|+ ++|||+.-...+ =+.|.+
T Consensus 7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~----edi~dP~v~fev~~~~~~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada 82 (153)
T PF07468_consen 7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSG----EDIGDPYVKFEVEPSKTHDGLVHIRCCYNNKFWRRSSPNDYWIWADA 82 (153)
T ss_dssp CEEEETTTS-EEEEEESSCTTCCEEEEEE----SSTT-CCG-EEEEE-SSTTT-EEEEETTTTEEEEESCCC--BEEEEE
T ss_pred EEEEEcCCCcEEEEEecccccceeEEecC----CcCCCCceeEEEEEcccCCCeEEEEeccCCceeEeCCCCCcEEEecC
Confidence 345554 88988776221 23322 24567888899988 6777999999 799999975555 588886
Q ss_pred cCCC------CCCceEEEEeC----CceEEecc-CceeEEEecC----CCeeEEecccCCCcc--eEEE
Q psy2222 132 DAVG------PLEQWEPVFED----DNMALLGA-NQCFMSVSEQ----DDSIVATSRKAGKNE--MVKL 183 (264)
Q Consensus 132 ~a~g------~~E~f~iv~~~----g~~alq~~-ng~fLs~~~~----~~~l~a~s~~~g~~e--~~~i 183 (264)
+.+. +-..|+|+.-+ ..++|+.. ||.|..--.. .+-|.|.+.+|..+. .|+|
T Consensus 83 ~~p~ed~s~~~cTLF~Pv~vd~~~~~~i~l~~~~n~~~~~r~t~~~~~~sCL~A~~~~i~~~~~~~~~~ 151 (153)
T PF07468_consen 83 DDPDEDQSKPSCTLFEPVKVDVKDFNVIALRNMQNGHFCKRLTYGGKFVSCLNAASSSIDREAKDVFTI 151 (153)
T ss_dssp SSHHH-TCSTCGG-EEEEESCCCETTEEEEEETTTTEEEEEE--STTBSSEEEEEESS--CCGTT-EEE
T ss_pred CCcccccCCCCceEEEEEEecCCCccEEEEEecCCceEEEEEccCCcceeeEeeccCcccccccceEEE
Confidence 6443 34459998653 56889986 7888766433 233666666665544 5544
No 20
>KOG3885|consensus
Probab=95.79 E-value=0.14 Score=43.35 Aligned_cols=100 Identities=19% Similarity=0.263 Sum_probs=71.9
Q ss_pred CCEEEEeCC-CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCcEEEeccCCCCCC
Q psy2222 61 GAISIEFGK-RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGRVTGRSDAVGPLE 138 (264)
Q Consensus 61 Gp~~i~~~s-~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~v~a~a~a~g~~E 138 (264)
.+..+.+.+ |-+|..+.+|...- .+|..++.-++.++.+.=+.|++|+ ..++||+-|++|.|.+. +--++..
T Consensus 26 r~~~LY~~t~g~hLqi~p~g~V~G-----t~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~LygS-~~~t~eC 99 (155)
T KOG3885|consen 26 RPKLLYCRNGGHFLRILPDGTVDG-----TRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLYAS-KEFTEEC 99 (155)
T ss_pred eeeEEEEcCCCEEEEEcCCCcccc-----ccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEecC-CCCCccc
Confidence 456788888 88899998885433 2344566677777777767999999 69999999999999998 7777778
Q ss_pred ceEE-EEeCCceEEecc----Cc-eeEEEecCCC
Q psy2222 139 QWEP-VFEDDNMALLGA----NQ-CFMSVSEQDD 166 (264)
Q Consensus 139 ~f~i-v~~~g~~alq~~----ng-~fLs~~~~~~ 166 (264)
.|.= +..++=-+.++. ++ -||+.+..|.
T Consensus 100 ~F~E~~~EN~YntY~S~~y~~~~~~yvaL~k~G~ 133 (155)
T KOG3885|consen 100 KFKELVLENYYNTYASAKYRHNGEWFVALNKKGI 133 (155)
T ss_pred eeEEEeecCCchheeehhhcccccEEEEECCCCC
Confidence 8843 344433233332 22 7888888775
No 21
>KOG3885|consensus
Probab=95.67 E-value=0.064 Score=45.44 Aligned_cols=74 Identities=16% Similarity=0.290 Sum_probs=59.7
Q ss_pred eEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEeC-CceEEecc-CceeEEEecCCCeeEEecccCCCcceEEEE
Q psy2222 108 KVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFED-DNMALLGA-NQCFMSVSEQDDSIVATSRKAGKNEMVKLR 184 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~~-g~~alq~~-ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR 184 (264)
.+..+|. |.||-+.++|.|.++++--++.-.++|+.-+ |.++|+.. .+.||+.+..|. |.+. ..-+++..|+=+
T Consensus 29 ~LY~~t~-g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~-LygS-~~~t~eC~F~E~ 104 (155)
T KOG3885|consen 29 LLYCRNG-GHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGK-LYAS-KEFTEECKFKEL 104 (155)
T ss_pred EEEEcCC-CEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCc-EecC-CCCCccceeEEE
Confidence 3455555 9999999999999999999999999998774 89999986 899999999997 8776 444455555443
No 22
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=95.46 E-value=0.096 Score=43.34 Aligned_cols=66 Identities=24% Similarity=0.268 Sum_probs=52.9
Q ss_pred eEEEEecCCeEEEEec---CCcEEEeccCCC-CCCceEEEEe-CCceEEecc-CceeEEEecCCCeeEEecccC
Q psy2222 108 KVAFKSGFDKYLSIDK---NGRVTGRSDAVG-PLEQWEPVFE-DDNMALLGA-NQCFMSVSEQDDSIVATSRKA 175 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~---~G~v~a~a~a~g-~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~~~~l~a~s~~~ 175 (264)
.|+||+-+|+||..-. +|-|.+.++-|+ |.-.|++... +|.++|++. +|+|....+. = |.|+++.+
T Consensus 7 ~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~dg~V~ik~~~~nKfWr~s~~-W-I~a~s~d~ 78 (139)
T smart00791 7 YVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTYNGLVHIKSNYTNKFWRLSHY-W-ITADANDP 78 (139)
T ss_pred EEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcCCCcEEEEecCCCceEccCCC-E-EEecCCCC
Confidence 4899999999998865 466999999887 8888998877 799999986 8999876532 3 77776655
No 23
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=94.94 E-value=0.21 Score=42.16 Aligned_cols=68 Identities=26% Similarity=0.289 Sum_probs=49.2
Q ss_pred eEEEEecCCeEEEEec-----CCcEEEeccCCC-CCCceEEEE--e-CCceEEecc-CceeEEEec-CCCeeEEecccC
Q psy2222 108 KVAFKSGFDKYLSIDK-----NGRVTGRSDAVG-PLEQWEPVF--E-DDNMALLGA-NQCFMSVSE-QDDSIVATSRKA 175 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~-----~G~v~a~a~a~g-~~E~f~iv~--~-~g~~alq~~-ng~fLs~~~-~~~~l~a~s~~~ 175 (264)
-|.||..+|+||..-. .|-|.+.++-++ |.-.|++++ . +|.++|++. +|+|..... ++.=|.|+++..
T Consensus 7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~~~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada~~p 85 (153)
T PF07468_consen 7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSKTHDGLVHIRCCYNNKFWRRSSPNDYWIWADADDP 85 (153)
T ss_dssp CEEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-SSTTT-EEEEETTTTEEEEESCCC--BEEEEESSH
T ss_pred EEEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEcccCCCeEEEEeccCCceeEeCCCCCcEEEecCCCc
Confidence 4788889999999865 355888887666 999999998 3 899999996 899999633 322377776543
No 24
>PF08555 DUF1754: Eukaryotic family of unknown function (DUF1754); InterPro: IPR013865 This is a eukaryotic protein of unknown function.
Probab=94.40 E-value=0.021 Score=44.31 Aligned_cols=16 Identities=63% Similarity=0.839 Sum_probs=15.2
Q ss_pred ccccccccceeeecCC
Q psy2222 3 EYQAVKRGKLVLKGEK 18 (264)
Q Consensus 3 ~y~~vk~~kL~fKGdk 18 (264)
+|+.|-.|+|+|||.+
T Consensus 1 eY~~v~gG~LKLKG~~ 16 (98)
T PF08555_consen 1 EYSNVVGGKLKLKGGK 16 (98)
T ss_pred CccccCCCCceeCCcc
Confidence 6999999999999988
No 25
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=93.17 E-value=0.34 Score=40.82 Aligned_cols=62 Identities=26% Similarity=0.329 Sum_probs=42.7
Q ss_pred CCEEEEeCC-CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEEEecCCc
Q psy2222 61 GAISIEFGK-RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLSIDKNGR 126 (264)
Q Consensus 61 Gp~~i~~~s-~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs~d~~G~ 126 (264)
|-|+|.-+. ++||++..+|.+.... ++.-...++|.+++...+.|+||+ ..|+-|.......
T Consensus 52 g~Vqf~n~~~~~CL~~~~~G~~~~~~----C~~~~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~~~ 115 (150)
T PF03498_consen 52 GYVQFVNPKTGTCLAAYGNGVFHYKS----CDQDNLEQVFSIIPTTTGAVQIKSLSTGECLQTFNNSR 115 (150)
T ss_dssp CEEEEEETTTSEEEEEETTCEEEE------TTTCHGHH-EEEEEBTTS-EEEEETTT--EEEE-STTS
T ss_pred CEEEEEcCCCCcceeecCCCeEeecc----cCCCChhceEEEEEcCCCcEEEEecCCCceEEecCCCc
Confidence 556666665 8899999999877510 344467899999999888999999 6788887766543
No 26
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=83.94 E-value=17 Score=27.68 Aligned_cols=72 Identities=11% Similarity=0.055 Sum_probs=45.6
Q ss_pred CCCeeeccccCCCCCCEEEEeC-CCcEEEEcCCC----ceEeeccCCCCCCCCccccEEEEEecCCeEEEEe-cCCeEEE
Q psy2222 47 YAGWTQVKHIKDIVGAISIEFG-KRTFVSALDNG----LFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKS-GFDKYLS 120 (264)
Q Consensus 47 ~~~W~~~~~~~dI~Gp~~i~~~-s~~yL~a~~~G----~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs-~~GkYLs 120 (264)
.+-|......+ -.|...|... ++.||.+..++ .....-+ .. -.+.|.|.+++..++.|.|++ ..|++|.
T Consensus 2 nQ~W~~~~~~~-~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~---~~-~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ld 76 (105)
T PF14200_consen 2 NQQWTFTPVGD-SDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWT---CN-GNDNQQWKIEPVGDGYYRIRNKNSGKVLD 76 (105)
T ss_dssp GGEEEEEEEET-TTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEE---SS-SSGGGEEEEEESTTSEEEEEETSTTEEEE
T ss_pred CCEEEEEEecC-CCCEEEEEECCCCCEEEeCCCCcCCCcEEEEec---CC-CCcCcEEEEEEecCCeEEEEECCCCcEEE
Confidence 35677655432 1344445443 49999987654 1111111 11 167899999999988899999 5699998
Q ss_pred Eec
Q psy2222 121 IDK 123 (264)
Q Consensus 121 ~d~ 123 (264)
+..
T Consensus 77 v~~ 79 (105)
T PF14200_consen 77 VAG 79 (105)
T ss_dssp EGG
T ss_pred ECC
Confidence 853
No 27
>PF03664 Glyco_hydro_62: Glycosyl hydrolase family 62 ; InterPro: IPR005193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha -L-arabinofuranosidases (3.2.1.55 from EC) which are all members of glycoside hydrolase family 62 (GH62 from CAZY). This enzyme hydrolyzed aryl alpha-L-arabinofuranosides and cleaves arabinosyl side chains from arabinoxylan and arabinan.; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process
Probab=81.24 E-value=15 Score=33.62 Aligned_cols=106 Identities=24% Similarity=0.339 Sum_probs=58.1
Q ss_pred CCCCcCCCCCeeecccc-----CCC-CCCE--EEEeCC-CcEEE-EcCCCc-eEeeccCCCCCCCCc-------------
Q psy2222 40 VDPDIENYAGWTQVKHI-----KDI-VGAI--SIEFGK-RTFVS-ALDNGL-FILGAIHEDKEGPSP------------- 95 (264)
Q Consensus 40 ~~~d~~~~~~W~~~~~~-----~dI-~Gp~--~i~~~s-~~yL~-a~~~G~-f~~~~~~~~~e~~~~------------- 95 (264)
...|+.+..||..|+.+ .+. .||+ -|++.+ .+||- |++||. +...-| .+.=|.-
T Consensus 105 Ts~DptnpngWSapq~lf~g~i~~~~~g~iD~~vI~D~~n~yLFfa~DnGkiYRs~~~--i~nFP~~fgs~~~vvmsd~~ 182 (271)
T PF03664_consen 105 TSSDPTNPNGWSAPQPLFSGSISGSGTGPIDQWVICDDTNMYLFFAGDNGKIYRSSMP--IGNFPGGFGSSYTVVMSDTR 182 (271)
T ss_pred cCCCCCCCccCCCCcccccccccCCCCCceeeEEEecCCceEEEEcCCCCcEEEeccc--hhhCCCCCCCceEEEEecCc
Confidence 45678889999998776 222 3565 355555 88976 889996 332211 1111111
Q ss_pred cccEEEEEecCCeEEEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEe----CCceEEecc--CceeEEE
Q psy2222 96 EEIFTAILVNDSKVAFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFE----DDNMALLGA--NQCFMSV 161 (264)
Q Consensus 96 ~evf~~~~v~dt~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~----~g~~alq~~--ng~fLs~ 161 (264)
.-.|.+..| +.++ +.++||-+ -||+|..++|-=-|. +|.|.-++. .+-|-..
T Consensus 183 ~nLFEA~~V----Ykv~-G~~~YLmi---------VEaiGs~gRyFRSfTa~sL~G~Wtp~A~tEsnPFAg~ 240 (271)
T PF03664_consen 183 NNLFEAVQV----YKVK-GQNQYLMI---------VEAIGSDGRYFRSFTATSLDGPWTPQAATESNPFAGK 240 (271)
T ss_pred cceeeeeEE----EEEc-CCceEEEE---------EEEecCCCcEEEeeecCCCCCcceeccccccCCcccc
Confidence 112322222 3343 36788876 255666555544443 577777764 3344443
No 28
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=80.02 E-value=13 Score=31.05 Aligned_cols=69 Identities=14% Similarity=0.080 Sum_probs=45.9
Q ss_pred cccEEEEEecCC--eEEEEecC--CeEEEEecCCcEEEeccCCC----CCCceEEEEe---CCceEEecc--CceeEEEe
Q psy2222 96 EEIFTAILVNDS--KVAFKSGF--DKYLSIDKNGRVTGRSDAVG----PLEQWEPVFE---DDNMALLGA--NQCFMSVS 162 (264)
Q Consensus 96 ~evf~~~~v~dt--~~~lKs~~--GkYLs~d~~G~v~a~a~a~g----~~E~f~iv~~---~g~~alq~~--ng~fLs~~ 162 (264)
..+|++++--.. .|+|.|.+ |.||.. .++.|.-.....+ ...+|.++.. .|.++|++. -|+||...
T Consensus 38 da~f~vvpGLa~~~~vSfES~~~PG~yLrh-~~~~v~l~~~d~s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~ 116 (142)
T PF05270_consen 38 DATFRVVPGLADSSCVSFESVNYPGYYLRH-SNFRVRLEKNDGSALFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHY 116 (142)
T ss_dssp GG-EEEEE-SS-TTCEEEEESSSTTEEEEE-ETTEEEEEE--SSHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEE
T ss_pred CceEEEEEccCCCCEEEEEECCCCCcEEEE-ECCEEEEeecCCCccccCCceEEEecCCCCCCcceEEEecCCCeEEEEE
Confidence 468999874433 79999987 999998 4556555554333 4568988875 466688886 58999987
Q ss_pred cCC
Q psy2222 163 EQD 165 (264)
Q Consensus 163 ~~~ 165 (264)
+..
T Consensus 117 ~~~ 119 (142)
T PF05270_consen 117 NGE 119 (142)
T ss_dssp TTE
T ss_pred CCE
Confidence 754
No 29
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=73.40 E-value=35 Score=25.31 Aligned_cols=65 Identities=14% Similarity=0.174 Sum_probs=38.7
Q ss_pred CccccEEEEEecCCeEEEEecCCeEEEEecC---CcEEE-eccCCCCCCceEEEEeCCceEEeccCceeEEEec
Q psy2222 94 SPEEIFTAILVNDSKVAFKSGFDKYLSIDKN---GRVTG-RSDAVGPLEQWEPVFEDDNMALLGANQCFMSVSE 163 (264)
Q Consensus 94 ~~~evf~~~~v~dt~~~lKs~~GkYLs~d~~---G~v~a-~a~a~g~~E~f~iv~~~g~~alq~~ng~fLs~~~ 163 (264)
...|.|++.. ++.+.+.. .+.+|.+... ..|+. .....++...|.+... +.+.... .+.-|.+..
T Consensus 33 ~~~Q~W~~~~--~g~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~~~-~~i~~~~-~~~cl~~~~ 101 (124)
T cd00161 33 GNNQKWTLTS--DGTIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNKD-GTIRNLK-SGKCLDVKG 101 (124)
T ss_pred CccCCEEEeC--CCeEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEECCC-cEEEECC-CCeEEeCCC
Confidence 4578998876 55777776 7788887653 22333 3333367788887644 4443333 455555543
No 30
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=70.48 E-value=20 Score=29.90 Aligned_cols=78 Identities=9% Similarity=0.043 Sum_probs=43.8
Q ss_pred EEEEec--CCeEEEEecCCcEEEeccCCCC-----CCceEEEEe---CCceEEeccC--ceeEEEecCCCeeEEeccc--
Q psy2222 109 VAFKSG--FDKYLSIDKNGRVTGRSDAVGP-----LEQWEPVFE---DDNMALLGAN--QCFMSVSEQDDSIVATSRK-- 174 (264)
Q Consensus 109 ~~lKs~--~GkYLs~d~~G~v~a~a~a~g~-----~E~f~iv~~---~g~~alq~~n--g~fLs~~~~~~~l~a~s~~-- 174 (264)
++|.|. .++||.-...-..+..-.+.+. ..+|.|++- .+-+.|++.| |.||...+....|..+..+
T Consensus 3 ~~~~s~~~~~ryirh~~~~~~~~~v~~~s~~~~r~da~f~vvpGLa~~~~vSfES~~~PG~yLrh~~~~v~l~~~d~s~~ 82 (142)
T PF05270_consen 3 LRLTSPNYPDRYIRHRGSLVRLDPVSSSSSALDRADATFRVVPGLADSSCVSFESVNYPGYYLRHSNFRVRLEKNDGSAL 82 (142)
T ss_dssp EEEEESSSTTEEEEEETTEEEEEES-SSGGHHHHHGG-EEEEE-SS-TTCEEEEESSSTTEEEEEETTEEEEEE--SSHH
T ss_pred EEEECCCCCCeEEEEcCceEEEeeccCCcchhhccCceEEEEEccCCCCEEEEEECCCCCcEEEEECCEEEEeecCCCcc
Confidence 456664 6899988754333332333333 688999986 4779999987 9999985443223332111
Q ss_pred CCCcceEEEEEc
Q psy2222 175 AGKNEMVKLRSN 186 (264)
Q Consensus 175 ~g~~e~~~iR~~ 186 (264)
-....+|+++--
T Consensus 83 F~~dATF~~~~G 94 (142)
T PF05270_consen 83 FREDATFCPRPG 94 (142)
T ss_dssp HHHHT-EEEEE-
T ss_pred ccCCceEEEecC
Confidence 123466776643
No 31
>KOG3410|consensus
Probab=66.02 E-value=1.2 Score=35.95 Aligned_cols=19 Identities=37% Similarity=0.485 Sum_probs=16.7
Q ss_pred CcccccccccceeeecCCC
Q psy2222 1 MSEYQAVKRGKLVLKGEKP 19 (264)
Q Consensus 1 m~~y~~vk~~kL~fKGdk~ 19 (264)
|+.|.-|-.|+|++||...
T Consensus 2 ~e~y~~v~kg~LKLK~v~~ 20 (120)
T KOG3410|consen 2 MEAYCGVVKGSLKLKKVSE 20 (120)
T ss_pred chHHHHHhhchhhhhchhh
Confidence 6779999999999999765
No 32
>KOG1446|consensus
Probab=56.17 E-value=1.3e+02 Score=28.42 Aligned_cols=100 Identities=15% Similarity=0.130 Sum_probs=55.5
Q ss_pred EEEeCCCcEEEEcCCCc-eEeeccCCC-------CCCCCccccEEEEEecCC---eEEEEecCCeEEEEecCCc--EEEe
Q psy2222 64 SIEFGKRTFVSALDNGL-FILGAIHED-------KEGPSPEEIFTAILVNDS---KVAFKSGFDKYLSIDKNGR--VTGR 130 (264)
Q Consensus 64 ~i~~~s~~yL~a~~~G~-f~~~~~~~~-------~e~~~~~evf~~~~v~dt---~~~lKs~~GkYLs~d~~G~--v~a~ 130 (264)
.|.+.++...+-|..|+ |+++.+... .=.-+|.++|.+.....- .+.| |..|+||-+...+. .+-+
T Consensus 137 ~l~~~~~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~F-S~dGK~iLlsT~~s~~~~lD 215 (311)
T KOG1446|consen 137 LLNLSGRPIAAFDPEGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEF-SPDGKSILLSTNASFIYLLD 215 (311)
T ss_pred EEecCCCcceeECCCCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEE-cCCCCEEEEEeCCCcEEEEE
Confidence 34444456667777785 566555421 112456777777642221 4555 67999988776655 2222
Q ss_pred ccCCCCCCceEEEEeCCceEE---eccCceeEEEecC
Q psy2222 131 SDAVGPLEQWEPVFEDDNMAL---LGANQCFMSVSEQ 164 (264)
Q Consensus 131 a~a~g~~E~f~iv~~~g~~al---q~~ng~fLs~~~~ 164 (264)
|-.-.....|+..++.+...+ -++.|.||-+-.+
T Consensus 216 Af~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs~ 252 (311)
T KOG1446|consen 216 AFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGSD 252 (311)
T ss_pred ccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEecC
Confidence 222236677777766664433 3457776655443
No 33
>KOG4226|consensus
Probab=49.16 E-value=54 Score=30.65 Aligned_cols=114 Identities=18% Similarity=0.283 Sum_probs=71.0
Q ss_pred CCcCCCCCeeeccccCCCCCCEEEEeCCCcEEEEcCCCceEeeccCC------CCCCCCc-----cccEEEEEec----C
Q psy2222 42 PDIENYAGWTQVKHIKDIVGAISIEFGKRTFVSALDNGLFILGAIHE------DKEGPSP-----EEIFTAILVN----D 106 (264)
Q Consensus 42 ~d~~~~~~W~~~~~~~dI~Gp~~i~~~s~~yL~a~~~G~f~~~~~~~------~~e~~~~-----~evf~~~~v~----d 106 (264)
+-++.+..||.+-...+..| ++ ...||.-+.+|..+.++-|+ +..+|++ -+-|..=.+. +
T Consensus 220 d~Pe~DPdWwkarn~~G~vG---LV--PrNYv~vl~d~~~ts~~~~~s~~pq~sgn~p~~sg~~ag~~WYyG~itR~qae 294 (379)
T KOG4226|consen 220 DKPENDPDWWKARNARGQVG---LV--PRNYVVVLSDGPSTSKALHPSHAPQISGNGPSSSGRFAGRPWYYGNITRHQAE 294 (379)
T ss_pred cCCCCCchHHhhcccCCccc---ee--ecceEEEeccCccccccCCccccccccCCCCCccccccCCcceeccccHHHHH
Confidence 34567789998876655443 33 34588888888766444332 2334443 2334433332 2
Q ss_pred CeEEEEecCCeEEEEecC---CcEEEeccCCCCCCceEEEEeCCceEEeccCceeEEEe
Q psy2222 107 SKVAFKSGFDKYLSIDKN---GRVTGRSDAVGPLEQWEPVFEDDNMALLGANQCFMSVS 162 (264)
Q Consensus 107 t~~~lKs~~GkYLs~d~~---G~v~a~a~a~g~~E~f~iv~~~g~~alq~~ng~fLs~~ 162 (264)
+....+++.|-||--|+. |-++..-.|+|-.-.|.+...++.|+|- +-+|.+.+
T Consensus 295 ~~Ln~hG~eGdFLiRDSEsnpgD~SvSlka~grNKHFkVq~~d~~ycIG--qRkF~tmd 351 (379)
T KOG4226|consen 295 CALNEHGHEGDFLIRDSESNPGDFSVSLKASGRNKHFKVQLVDNVYCIG--QRKFHTMD 351 (379)
T ss_pred HHHhccCccCceEEecCCCCCcceeEEeeccCCCcceEEEEecceEEec--cceeccHH
Confidence 356788999999999876 5566667788888888887777766653 23455443
No 34
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=40.07 E-value=1.3e+02 Score=28.85 Aligned_cols=71 Identities=18% Similarity=0.375 Sum_probs=41.0
Q ss_pred ecCCeEEEEecCCc-EEEeccC--CCCCCceEEEEeCCceEEecc-----Cc-eeEEEecCCCeeEEecccCCCcceEEE
Q psy2222 113 SGFDKYLSIDKNGR-VTGRSDA--VGPLEQWEPVFEDDNMALLGA-----NQ-CFMSVSEQDDSIVATSRKAGKNEMVKL 183 (264)
Q Consensus 113 s~~GkYLs~d~~G~-v~a~a~a--~g~~E~f~iv~~~g~~alq~~-----ng-~fLs~~~~~~~l~a~s~~~g~~e~~~i 183 (264)
..+=+||..++++. |.+-... -|--..|.|-..+|+..+.+. ++ |||++++.+.-|.+.+-..+.-..+.|
T Consensus 39 ~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~ 118 (346)
T COG2706 39 LGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSVSVYPL 118 (346)
T ss_pred cCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccCceEEEEEc
Confidence 45667777777762 3332222 455556666666788877653 44 999999998644443333333333333
No 35
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=35.41 E-value=2.3e+02 Score=23.79 Aligned_cols=65 Identities=11% Similarity=0.019 Sum_probs=42.4
Q ss_pred EEEEecC-CeEEEEec-CCeEEEEecCCcEEEe-ccCCCCCCceEEEEe-CCceEEecc-CceeEEEecC
Q psy2222 100 TAILVND-SKVAFKSG-FDKYLSIDKNGRVTGR-SDAVGPLEQWEPVFE-DDNMALLGA-NQCFMSVSEQ 164 (264)
Q Consensus 100 ~~~~v~d-t~~~lKs~-~GkYLs~d~~G~v~a~-a~a~g~~E~f~iv~~-~g~~alq~~-ng~fLs~~~~ 164 (264)
.++.-.. +.|+|+.. .|.-|.+..+|.+... .+.--....|.+++. .|-+.||+. +|.=|++...
T Consensus 44 ~~~~~~~~g~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~ 113 (150)
T PF03498_consen 44 RLIEFFPFGYVQFVNPKTGTCLAAYGNGVFHYKSCDQDNLEQVFSIIPTTTGAVQIKSLSTGECLQTFNN 113 (150)
T ss_dssp EEEE-STTCEEEEEETTTSEEEEEETTCEEEE--TTTCHGHH-EEEEEBTTS-EEEEETTT--EEEE-ST
T ss_pred eecccCCCCEEEEEcCCCCcceeecCCCeEeecccCCCChhceEEEEEcCCCcEEEEecCCCceEEecCC
Confidence 3444444 68999994 7889999999988744 344455678999998 599999986 5555555443
No 36
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=34.33 E-value=3.9e+02 Score=24.93 Aligned_cols=42 Identities=14% Similarity=0.309 Sum_probs=23.3
Q ss_pred EEEecCCeEEEEecCCcEEEeccCCCCCCceEEEEeCCceEEec
Q psy2222 110 AFKSGFDKYLSIDKNGRVTGRSDAVGPLEQWEPVFEDDNMALLG 153 (264)
Q Consensus 110 ~lKs~~GkYLs~d~~G~v~a~a~a~g~~E~f~iv~~~g~~alq~ 153 (264)
..++..|+||.+...|.+.-..+ .-+..|+++.....-.||+
T Consensus 150 ~~r~~dG~~vavs~~G~~~~s~~--~G~~~w~~~~r~~~~riq~ 191 (302)
T PF14870_consen 150 ITRSSDGRYVAVSSRGNFYSSWD--PGQTTWQPHNRNSSRRIQS 191 (302)
T ss_dssp EEE-TTS-EEEEETTSSEEEEE---TT-SS-EEEE--SSS-EEE
T ss_pred EEECCCCcEEEEECcccEEEEec--CCCccceEEccCccceehh
Confidence 34678999999999998876554 2234588876644344443
No 37
>PF15017 AF1Q: Drug resistance and apoptosis regulator
Probab=30.08 E-value=24 Score=27.07 Aligned_cols=19 Identities=26% Similarity=0.602 Sum_probs=13.7
Q ss_pred CCCcCCCCCeeeccccCCC
Q psy2222 41 DPDIENYAGWTQVKHIKDI 59 (264)
Q Consensus 41 ~~d~~~~~~W~~~~~~~dI 59 (264)
+++..+.+|||.|+....|
T Consensus 66 e~~ddD~gGWITPsNIkqi 84 (87)
T PF15017_consen 66 EEEDDDGGGWITPSNIKQI 84 (87)
T ss_pred ccccCCCCccccchhhhhh
Confidence 3445568899999887654
No 38
>KOG1691|consensus
Probab=29.73 E-value=2.9e+02 Score=24.63 Aligned_cols=60 Identities=8% Similarity=0.008 Sum_probs=31.3
Q ss_pred CCceEEecc-CceeEEEecCCCeeEEecccCCCcceEEEEEcccccccCCCCCCCChhhhcCChhHHHHH
Q psy2222 146 DDNMALLGA-NQCFMSVSEQDDSIVATSRKAGKNEMVKLRSNSLHLKNLDKDDDIPEEERTNKLSDIELN 214 (264)
Q Consensus 146 ~g~~alq~~-ng~fLs~~~~~~~l~a~s~~~g~~e~~~iR~~a~~~~~~~~~~~~~~e~~~~~~~~~E~~ 214 (264)
.|+||+-+. +|.|+.|=. ++...=++...+.|-+.++--..++. +.+.-.+. .++-+|+.
T Consensus 82 ~gqFaFta~e~~~y~~Cf~------~~~~~~~p~~~~~I~ld~k~Gv~akd--w~~IAKke-klep~E~e 142 (210)
T KOG1691|consen 82 KGQFAFTAEESGMYEACFT------ADVPGHKPETKRSIDLDWKTGVEAKD--WDSIAKKE-KLEPLEVE 142 (210)
T ss_pred cceEEEEeccCCcEEEEEe------cccCCCCCCcceEEEEEeeccccccc--hHHHHhhh-cCcHHHHH
Confidence 688888885 777877732 22222345556666666553333332 33332233 44444444
No 39
>PF02061 Lambda_CIII: Lambda Phage CIII; InterPro: IPR013056 Bacteriophage lambda regulatory protein CIII is a small protein that plays a role in stabilising the CII transcriptional activator, via a mechanism that is not yet fully understood [, ]. Stabilised CII activates CI, the gene for the repressor protein that prevents transcription of proteins required for lytic development. The central portion of the protein is well conserved and is both necessary and sufficient for the activity of the protein []. Comparative analysis of the CIII sequence in lambda, Bacteriophage HK022 and the lambdoid Enterobacteria phage P22 has led to the suggestion that this central region assumes an amphipathic alpha-helical structure []. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=27.68 E-value=29 Score=23.05 Aligned_cols=10 Identities=50% Similarity=0.700 Sum_probs=7.9
Q ss_pred hhHHHhhhhh
Q psy2222 245 YLHETMLDRR 254 (264)
Q Consensus 245 ~lhE~lLDrR 254 (264)
.|||.||||=
T Consensus 15 ql~ESLLdrI 24 (45)
T PF02061_consen 15 QLSESLLDRI 24 (45)
T ss_pred hhhHHHHHHH
Confidence 3799999973
No 40
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=27.67 E-value=2.4e+02 Score=20.51 Aligned_cols=26 Identities=12% Similarity=0.286 Sum_probs=16.9
Q ss_pred CCCceEEEEeCCceEEeccCceeEEEec
Q psy2222 136 PLEQWEPVFEDDNMALLGANQCFMSVSE 163 (264)
Q Consensus 136 ~~E~f~iv~~~g~~alq~~ng~fLs~~~ 163 (264)
+...|.+.. +|.+.+.. .+..|.+..
T Consensus 34 ~~Q~W~~~~-~g~~~~~~-~~~Cl~~~~ 59 (124)
T cd00161 34 NNQKWTLTS-DGTIRIKS-SNLCLDVGG 59 (124)
T ss_pred ccCCEEEeC-CCeEEEcC-CCeEEcccC
Confidence 566777665 67777766 566666654
No 41
>PF00340 IL1: Interleukin-1 / 18; InterPro: IPR000975 Interleukin-1 alpha and interleukin-1 beta (IL-1 alpha and IL-1 beta) are cytokines that participate in the regulation of immune responses, inflammatory reactions, and hematopoiesis []. Two types of IL-1 receptor, each with three extracellular immunoglobulin (Ig)-like domains, limited sequence similarity (28%) and different pharmacological characteristics have been cloned from mouse and human cell lines: these have been termed type I and type II receptors []. The receptors both exist in transmembrane (TM) and soluble forms: the soluble IL-1 receptor is thought to be post-translationally derived from cleavage of the extracellular portion of the membrane receptors. Both IL-1 receptors appear to be well conserved in evolution, and map to the same chromosomal location []. The receptors can both bind all three forms of IL-1 (IL-1 alpha, IL-1 beta and IL-1RA). The crystal structures of IL1A and IL1B [] have been solved, showing them to share the same 12-stranded beta-sheet structure as both the heparin binding growth factors and the Kunitz-type soybean trypsin inhibitors []. The beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel. Several regions, especially the loop between strands 4 and 5, have been implicated in receptor binding. The Vaccinia virus genes B15R and B18R each encode proteins with N-terminal hydrophobic sequences, possible sites for attachment of N-linked carbohydrate and a short C-terminal hydrophobic domain []. These properties are consistent with the mature proteins being either virion, cell surface or secretory glycoproteins. Protein sequence comparisons reveal that the gene products are related to each other (20% identity) and to the Ig superfamily. The highest degree of similarity is to the human and murine interleukin-1 receptors, although both proteins are related to a wide range of Ig superfamily members, including the interleukin-6 receptor. A novel method for virus immune evasion has been proposed in which the product of one or both of these proteins may bind interleukin-1 and/or interleukin-6, preventing these cytokines reaching their natural receptors []. A similar gene product from Cowpox virus (CPV) has also been shown to specifically bind murine IL-1 beta []. This entry represents Interleukin-1. ; GO: 0005615 extracellular space; PDB: 1J0S_A 3F62_B 2VXT_I 1MD6_A 2KKI_A 2L5X_D 2WRY_A 3NJ5_A 8I1B_A 2MIB_A ....
Probab=27.13 E-value=2.6e+02 Score=22.35 Aligned_cols=64 Identities=13% Similarity=0.160 Sum_probs=38.4
Q ss_pred CEEEEeCC-CcEEEEcCCCc---eEeeccCC--CCCCCCccccEEEEE-ecCCeEEEEe--cCCeEEEEecCC
Q psy2222 62 AISIEFGK-RTFVSALDNGL---FILGAIHE--DKEGPSPEEIFTAIL-VNDSKVAFKS--GFDKYLSIDKNG 125 (264)
Q Consensus 62 p~~i~~~s-~~yL~a~~~G~---f~~~~~~~--~~e~~~~~evf~~~~-v~dt~~~lKs--~~GkYLs~d~~G 125 (264)
||+|-+.. ..||+|.+.|. +.+-.... .-......-.|+..+ .+++...|.| +-|-||+...+.
T Consensus 27 pV~L~I~~~~l~LsC~~~~~~P~L~lee~~~~~~i~~~~~~~~F~F~~~~~g~~~~FESaa~PgwFIaT~~~~ 99 (120)
T PF00340_consen 27 PVILGIKNTNLCLSCVKEGGQPTLQLEEVEIPKLIDGKESDKRFLFYKKDSGSTSTFESAAYPGWFIATSPED 99 (120)
T ss_dssp EEEEEETTSTEEEEEEECTTCEEEEEEESSHTSEESSSSCGGGGEEEEEEETTEEEEEESSSTTEEEEBESSS
T ss_pred cEEEEEeCCcEEEEEeccCCeEEEEeccccChhhccCCccceEEEEEEecCCCceEEEEccCCCeEEEecccC
Confidence 78888877 78999999864 22211110 011122234564443 4455666999 578899988765
No 42
>PF11250 DUF3049: Protein of unknown function (DUF3049); InterPro: IPR021410 This eukaryotic family of proteins has no known function.
Probab=26.04 E-value=1.3e+02 Score=21.12 Aligned_cols=30 Identities=23% Similarity=0.267 Sum_probs=21.9
Q ss_pred cCCcEEEeccCCCCCCceEEEEeCCceEEe
Q psy2222 123 KNGRVTGRSDAVGPLEQWEPVFEDDNMALL 152 (264)
Q Consensus 123 ~~G~v~a~a~a~g~~E~f~iv~~~g~~alq 152 (264)
.+|.++-.+..+.+.|.|....++|+..|+
T Consensus 25 ~dGRLvl~~v~v~~~~~~~A~R~~GRL~L~ 54 (56)
T PF11250_consen 25 EDGRLVLEEVRVPSHEYFHAEREDGRLRLQ 54 (56)
T ss_pred cCCEEEEEEEEcCCcceEEEEccCCEEEEE
Confidence 456666666667778888888888887765
No 43
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=25.14 E-value=1.7e+02 Score=22.32 Aligned_cols=37 Identities=16% Similarity=0.110 Sum_probs=23.2
Q ss_pred eEEEEecCCeEEEEecCCcEEEeccCCCCCC--ceEEEEe
Q psy2222 108 KVAFKSGFDKYLSIDKNGRVTGRSDAVGPLE--QWEPVFE 145 (264)
Q Consensus 108 ~~~lKs~~GkYLs~d~~G~v~a~a~a~g~~E--~f~iv~~ 145 (264)
.+.|.|..- .+.+..||.|.+.++-.-+.+ +|.|..+
T Consensus 38 ~~~~~ssDp-dF~V~~DGsVy~~r~v~l~~~~~~F~V~a~ 76 (90)
T PF08758_consen 38 RVIFESSDP-DFRVLEDGSVYAKRPVQLSSEQRSFTVHAW 76 (90)
T ss_dssp -EEEE---S-EEEEETTTEEEEES--S-SSS-EEEEEEEE
T ss_pred ceEEecCCC-CEEEcCCCeEEEeeeEecCCCceEEEEEEE
Confidence 577778766 888999999999987555555 6776544
No 44
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=24.42 E-value=2.9e+02 Score=20.30 Aligned_cols=64 Identities=16% Similarity=0.160 Sum_probs=33.6
Q ss_pred CccccEEEEEecCCeEEEEecCCeEEEEecCC--cEEEeccCC-CCCCceEEEEeCCceEEeccCceeEEEec
Q psy2222 94 SPEEIFTAILVNDSKVAFKSGFDKYLSIDKNG--RVTGRSDAV-GPLEQWEPVFEDDNMALLGANQCFMSVSE 163 (264)
Q Consensus 94 ~~~evf~~~~v~dt~~~lKs~~GkYLs~d~~G--~v~a~a~a~-g~~E~f~iv~~~g~~alq~~ng~fLs~~~ 163 (264)
...|.|.+. +++.+.++ .+..|.++... .|.-..-.. ++...|.+.. ++.+... ..+..|.+..
T Consensus 28 ~~~Q~w~~~--~~g~~~~~--~~~Cl~~~~~~~~~v~l~~c~~~~~~Q~W~~~~-~~~i~~~-~~~~cl~~~~ 94 (117)
T smart00458 28 GGNQLWKLT--SDGAIRIA--TDLCLTANGNTGSTVTLYSCDGDADNQYWTVNK-DGTIRNP-DSGLCLDVKD 94 (117)
T ss_pred CccceEEEe--CCCeEEec--CCccCccCCCCCCEEEEEECCCCCcCCEEEECC-CeeEEeC-CCCEEEecCC
Confidence 456888887 44566666 66677665532 222222222 5677777754 2333222 3455555543
No 45
>TIGR03422 mito_frataxin frataxin. Frataxin is a mitochondrial protein, mutation of which leads to the disease Friedreich's ataxia. Its orthologs are widely distributed in the bacteria, associated with the ISC system for iron-sulfur cluster assembly, and designated CyaY. This exception-type model allows those examples of frataxin per se that score above the trusted cutoff to the CyaY equivalog-type model (TIGR03421) to be named appropriately.
Probab=24.29 E-value=98 Score=24.05 Aligned_cols=26 Identities=12% Similarity=0.335 Sum_probs=18.2
Q ss_pred CCccccEEEEEecCC-eEEEEecCCeEEE
Q psy2222 93 PSPEEIFTAILVNDS-KVAFKSGFDKYLS 120 (264)
Q Consensus 93 ~~~~evf~~~~v~dt-~~~lKs~~GkYLs 120 (264)
+...|+|.+.++++. .|.+. .|++++
T Consensus 54 ~p~~QIWlsSp~sGp~hfd~~--~~~Wi~ 80 (97)
T TIGR03422 54 PPNKQIWLSSPVSGPKRYDYV--NGEWIY 80 (97)
T ss_pred ChhhHHheecCCCCCcceeec--CCEEEE
Confidence 345899999998876 67664 455444
No 46
>PF01491 Frataxin_Cyay: Frataxin-like domain; InterPro: IPR002908 The eukaryotic proteins in this entry include frataxin, the protein that is mutated in Friedreich's ataxia [], and related sequences. Friedreich's ataxia is a progressive neurodegenerative disorder caused by loss of function mutations in the gene encoding frataxin (FRDA). Frataxin mRNA is predominantly expressed in tissues with a high metabolic rate (including liver, kidney, brown fat and heart). Mouse and yeast frataxin homologues contain a potential N-terminal mitochondrial targeting sequence, and human frataxin has been observed to co-localise with a mitochondrial protein. Furthermore, disruption of the yeast gene has been shown to result in mitochondrial dysfunction. Friedreich's ataxia is thus believed to be a mitochondrial disease caused by a mutation in the nuclear genome (specifically, expansion of an intronic GAA triplet repeat) [, , ]. The bacterial proteins in this entry are iron-sulphur cluster (FeS) metabolism CyaY proteins hmologous to eukaryotic frataxin. Partial Phylogenetic Profiling [] suggests that CyaY most likely functions as part of the ISC system for FeS cluster biosynthesis, and is supported by expermimental data in some species [, ]. ; PDB: 1EW4_A 2P1X_A 1SOY_A 2EFF_A 3T3T_B 3S4M_A 3T3K_A 3S5D_A 1LY7_A 3T3X_B ....
Probab=23.76 E-value=67 Score=25.32 Aligned_cols=26 Identities=12% Similarity=0.376 Sum_probs=18.8
Q ss_pred CCccccEEEEEecCC-eEEEEecCCeEEE
Q psy2222 93 PSPEEIFTAILVNDS-KVAFKSGFDKYLS 120 (264)
Q Consensus 93 ~~~~evf~~~~v~dt-~~~lKs~~GkYLs 120 (264)
+...|+|.+.++++. +|.+. .|+.++
T Consensus 57 ~p~~QIWlsSpisG~~hf~~~--~~~W~~ 83 (109)
T PF01491_consen 57 PPNRQIWLSSPISGPFHFDYD--DGKWID 83 (109)
T ss_dssp CCCTEEEEEETTTEEEEEEEE--SSSEEE
T ss_pred CHHHHHHHhcccCCceEEEEc--CCEEEE
Confidence 345799999998877 77776 455443
No 47
>COG2747 FlgM Negative regulator of flagellin synthesis (anti-sigma28 factor) [Transcription / Cell motility and secretion / Intracellular trafficking and secretion]
Probab=23.30 E-value=48 Score=25.76 Aligned_cols=36 Identities=17% Similarity=0.281 Sum_probs=25.0
Q ss_pred HHHHHHHhhhhccCCcee-cCccchHHHHHHHhhchh
Q psy2222 211 IELNYVKKFQKFQDKKIK-LNSDNIAVLKEAKAQGYL 246 (264)
Q Consensus 211 ~E~~~vk~~Q~~~~~~~~-l~~~~~~~LkkA~~eG~l 246 (264)
+|+...-.++.++..... ++.+-|.+||.|..+|.|
T Consensus 39 ~evs~s~~~~~~~~~~~~~~~~~kVeeiK~aI~~G~y 75 (93)
T COG2747 39 VEVSLSAQQAKLMQPGSSDIREEKVEELKQAIENGEY 75 (93)
T ss_pred ccccHHHHHHHhcCcchhhhhHHHHHHHHHHHHcCCe
Confidence 455555445554444444 488889999999999986
No 48
>PF14913 DPCD: DPCD protein family
Probab=22.07 E-value=4.3e+02 Score=23.29 Aligned_cols=48 Identities=17% Similarity=0.311 Sum_probs=33.2
Q ss_pred CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCC-eEEEEecCCeEEEE
Q psy2222 70 RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDS-KVAFKSGFDKYLSI 121 (264)
Q Consensus 70 ~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt-~~~lKs~~GkYLs~ 121 (264)
|.++..|....|..-. ..-|-|.++|.+....+. .+-+||.|.+|.--
T Consensus 87 P~~~r~dTk~~fqWRI----RNLPYP~dvYsVtvd~~~r~ivvRTtNKKYyKk 135 (194)
T PF14913_consen 87 PIFVRRDTKTSFQWRI----RNLPYPKDVYSVTVDEDERCIVVRTTNKKYYKK 135 (194)
T ss_pred CEEEEEcCccceEEEE----ccCCCCccceEEEEcCCCcEEEEECcCccceeE
Confidence 6666666665555422 234778888888877766 78888888888644
No 49
>PF11138 DUF2911: Protein of unknown function (DUF2911); InterPro: IPR021314 This bacterial family of proteins has no known function.
Probab=21.32 E-value=2.4e+02 Score=23.57 Aligned_cols=70 Identities=30% Similarity=0.419 Sum_probs=39.8
Q ss_pred CEEEEeCC-----CcEEEEcCCCceEeeccCCCCCCCCccccEEEEEecCCeEEEEecCCe--EEEEecCCcEEEec-cC
Q psy2222 62 AISIEFGK-----RTFVSALDNGLFILGAIHEDKEGPSPEEIFTAILVNDSKVAFKSGFDK--YLSIDKNGRVTGRS-DA 133 (264)
Q Consensus 62 p~~i~~~s-----~~yL~a~~~G~f~~~~~~~~~e~~~~~evf~~~~v~dt~~~lKs~~Gk--YLs~d~~G~v~a~a-~a 133 (264)
++.|.|.. |.-|.|-.+++|++ |...+ |+++-..++ ...|- |=-.+.-..+.... ..
T Consensus 47 aT~i~f~~dv~igGk~l~AG~Ysl~ti---------P~~~~-WtvI~n~~~-----~~wG~~~Y~~~~Dv~rv~v~~~~~ 111 (145)
T PF11138_consen 47 ATTITFSKDVTIGGKKLKAGTYSLFTI---------PGEDE-WTVIFNKDT-----DQWGAYNYDPSKDVLRVTVPPQKL 111 (145)
T ss_pred ceEEEECCCeEECCEEcCCeeEEEEEe---------cCCCe-EEEEEECCC-----CccCccccCchheEEEEEeeeecC
Confidence 46777765 77777777777887 33334 888776665 12333 21111112233333 34
Q ss_pred CCCCCceEEEEeC
Q psy2222 134 VGPLEQWEPVFED 146 (264)
Q Consensus 134 ~g~~E~f~iv~~~ 146 (264)
..+.|.|++.|.+
T Consensus 112 ~~~~E~fti~f~~ 124 (145)
T PF11138_consen 112 PEPVETFTISFED 124 (145)
T ss_pred CCceEEEEEEEEe
Confidence 4589999998873
No 50
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=21.22 E-value=5.4e+02 Score=24.74 Aligned_cols=10 Identities=20% Similarity=0.195 Sum_probs=4.9
Q ss_pred ecCCeEEEEe
Q psy2222 113 SGFDKYLSID 122 (264)
Q Consensus 113 s~~GkYLs~d 122 (264)
|+.||||.+.
T Consensus 252 s~dGrFLYas 261 (346)
T COG2706 252 SPDGRFLYAS 261 (346)
T ss_pred CCCCCEEEEe
Confidence 3455555544
No 51
>PF10467 Inhibitor_I48: Peptidase inhibitor clitocypin; InterPro: IPR019508 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Clitocypin binds and inhibits cysteine proteinases. It has no similarity to any other known cysteine proteinase inhibitors but bears some similarity to a lectin-like family of proteins from mushrooms []. ; PDB: 3H6R_B 3H6S_H.
Probab=20.97 E-value=4.4e+02 Score=22.01 Aligned_cols=65 Identities=12% Similarity=0.234 Sum_probs=34.2
Q ss_pred ccccEEEEEecCC--eEEEEecC---CeEEEEec---CCcEEEeccCCCCCCceEE--EEe-CCceEEecc-----Ccee
Q psy2222 95 PEEIFTAILVNDS--KVAFKSGF---DKYLSIDK---NGRVTGRSDAVGPLEQWEP--VFE-DDNMALLGA-----NQCF 158 (264)
Q Consensus 95 ~~evf~~~~v~dt--~~~lKs~~---GkYLs~d~---~G~v~a~a~a~g~~E~f~i--v~~-~g~~alq~~-----ng~f 158 (264)
-+|+|.+....++ ++.+.+++ |--++.|. ++=|++..+ .| |.+ +++ ...+.||+. .+-|
T Consensus 45 erQ~WkI~~~~~~~YTI~~h~~~~~fg~~fs~de~~P~~PVIt~e~----~E-~i~~~~p~~p~t~~IqA~tk~iG~~~~ 119 (149)
T PF10467_consen 45 ERQVWKITRGKKGTYTITLHTHGIPFGGSFSLDELIPEGPVITSEP----SE-WIIKLTPSVPDTYTIQAPTKRIGVDLY 119 (149)
T ss_dssp GGS-EEEEE-TTS-EEEEESSSTTSS-EEEE-SSE-TT-B-EEES--------BEEEE-TT-SSEEEEEES-CGCTEEEE
T ss_pred ceeEEEEEecCCCcEEEEEeccCCccccccccCccCCCCCeecCCc----ee-EEeecCCCCCceEEEeCCccccceeeE
Confidence 3799999988877 66666543 55566652 444553322 24 422 222 368999986 3457
Q ss_pred EEEecC
Q psy2222 159 MSVSEQ 164 (264)
Q Consensus 159 Ls~~~~ 164 (264)
+..+++
T Consensus 120 vGt~~n 125 (149)
T PF10467_consen 120 VGTNEN 125 (149)
T ss_dssp EEEETT
T ss_pred eecCCC
Confidence 777653
Done!