*Experimental: mutation (blue); functional region (cyan); located in the long functional sequence (lightblue)
*NES patterns: region with experimental evidence (red-orange-yellow); false positive region (gray)
# | candidates | id | start# | sequence | secondary | class | multi-pattern | diso | spotd | iup | loc_DISO | loc_CDD | beta |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | fp_beta_D | P04406 | 1 | XMGKVKVGVNGFGR                | CCCEEEEECCCCCC | c3-4-Nt | uniq | 0.104 | 0.146 | 0.128 | boundary | boundary|Gp_dh_N superfamily; | 0.714 |
2 | fp_O | P04406 | 29 | DIVAINDPFIDLNY                | EEEEECCCCCCCCC | c3-4 | multi-selected | 0.009 | 0.014 | 0.05 | ORD | MID|Gp_dh_N superfamily; | 0.286 |
3 | fp_O | P04406 | 33 | INDPFIDLNYMVYMFQ                  | ECCCCCCCCCEEEEEE | c1d-4 | multi | 0.006 | 0.014 | 0.04 | ORD | MID|Gp_dh_N superfamily; | 0.143 |
4 | fp_beta_O | P04406 | 36 | PFIDLNYMVYMFQYDS                  | CCCCCCCEEEEEECCC | c1aR-4 | multi-selected | 0.006 | 0.018 | 0.046 | ORD | MID|Gp_dh_N superfamily; | 0.571 |
5 | fp_O | P04406 | 36 | PFIDLNYMVYMFQY                | CCCCCCCEEEEEEC | c3-4 | multi-selected | 0.005 | 0.015 | 0.041 | ORD | MID|Gp_dh_N superfamily; | 0.429 |
6 | fp_O | P04406 | 108 | AGAHLQGGAKRVIISA                  | HHHHCCCCCCEEEEEC | c1a-AT-4 | uniq | 0.015 | 0.043 | 0.296 | ORD | MID|Gp_dh_N superfamily; | 0.143 |
7 | fp_O | P04406 | 122 | SAPSADAPMFVMGVNH                  | ECCCCCCCEEEEECCC | c1a-AT-4 | multi-selected | 0.01 | 0.027 | 0.241 | ORD | MID|Gp_dh_N superfamily; | 0.429 |
8 | fp_beta_O | P04406 | 124 | PSADAPMFVMGVNH                | CCCCCCEEEEECCC | c2-AT-4 | multi | 0.01 | 0.028 | 0.245 | ORD | MID|Gp_dh_N superfamily; | 0.571 |
9 | fp_D | P04406 | 156 | CLAPLAKVIHDNFGIVE                   | CCHHHHCCCCCCCCCCC | c1c-5 | multi-selected | 0.01 | 0.046 | 0.151 | boundary | MID|Gp_dh_N superfamily; | 0.0 |
10 | fp_D | P04406 | 157 | LAPLAKVIHDNFGIVE                  | CHHHHCCCCCCCCCCC | c1d-AT-4 | multi | 0.01 | 0.046 | 0.152 | boundary | MID|Gp_dh_N superfamily; | 0.0 |
11 | fp_D | P04406 | 166 | DNFGIVEGLMTTVHAIT                   | CCCCCCCCCCEEEEECC | c1cR-5 | multi | 0.01 | 0.067 | 0.224 | boundary | MID|Gp_dh_N superfamily; | 0.25 |
12 | fp_D | P04406 | 167 | NFGIVEGLMTTVHA                | CCCCCCCCCEEEEE | c3-4 | multi-selected | 0.01 | 0.068 | 0.204 | boundary | MID|Gp_dh_N superfamily; | 0.143 |
13 | fp_beta_D | P04406 | 170 | IVEGLMTTVHAITA                | CCCCCCEEEEECCC | c3-4 | multi-selected | 0.01 | 0.073 | 0.263 | boundary | MID|Gp_dh_N superfamily; | 0.571 |
14 | fp_D | P04406 | 220 | VIPELNGKLTGMAFRV                  | CCCCCCCCCCCEEEEC | c1a-5 | uniq | 0.015 | 0.041 | 0.246 | boundary | MID|Gp_dh_N superfamily; | 0.0 |
15 | fp_D | P04406 | 229 | TGMAFRVPTANVSVVD                  | CCEEEECCCCCCEEEE | c1a-AT-4 | multi-selected | 0.013 | 0.025 | 0.215 | boundary | MID|Gp_dh_N superfamily; | 0.286 |
16 | fp_D | P04406 | 231 | MAFRVPTANVSVVD                | EEEECCCCCCEEEE | c2-AT-4 | multi | 0.012 | 0.024 | 0.209 | boundary | MID|Gp_dh_N superfamily; | 0.143 |
17 | fp_D | P04406 | 233 | FRVPTANVSVVDLTC                 | EECCCCCCEEEEEEE | c1b-AT-4 | multi | 0.011 | 0.023 | 0.208 | boundary | MID|Gp_dh_N superfamily; | 0.429 |
18 | fp_D | P04406 | 234 | RVPTANVSVVDLTC                | ECCCCCCEEEEEEE | c2-AT-5 | multi | 0.01 | 0.024 | 0.207 | boundary | MID|Gp_dh_N superfamily; | 0.429 |
19 | fp_D | P04406 | 234 | RVPTANVSVVDLTC                | ECCCCCCEEEEEEE | c3-AT | multi | 0.01 | 0.024 | 0.207 | boundary | MID|Gp_dh_N superfamily; | 0.429 |
*candidates: NES candidates and false positives annotated with "cand" and "fp", respectively;
                if the segment is located in the disordered or boundary region, flagged with "D"; if the segment is located in the ordered region, flagged with "O";
                if the segment's beta-strand content is over 0.5, flagged with "beta".
*sequence: Hydrophobic positions are colored in red. The positions with the experimental evidence is marked with '*' (mutation) and '+' (functional sequence in NESdb or sites in validNES).
                The positions with '.' are for the region annotated in the long (more than 25 residues) functional sequence or site.
*multi-pattern: the consensus pattern is unique or multiple within the region (if the start# difference is less than 5, the segments are considered to be the same region)
*diso: average DISOPRED3-predicted disorder propensity for the segment
*spotd: average SPOT-Disorder-predicted disorder propensity for the segment
*iup: average IUPRED2A-predicted disorder propensity for the segment
*loc_DISO: location of the segment with respect to the ordered/disordered regions
*loc_CDD: location of the segment with respect to the conserved region annotated in the Conserved Domain Database (CDD)
*beta: beta-strand content in the middle of the segment