Query psy7091
Match_columns 205
No_of_seqs 117 out of 132
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 17:57:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7091.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7091hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF10274 ParcG: Parkin co-regu 100.0 9E-66 1.9E-70 434.0 10.4 138 67-204 1-139 (183)
2 KOG3961|consensus 100.0 1.3E-65 2.9E-70 445.9 9.5 167 38-204 47-214 (262)
3 PF12460 MMS19_C: RNAPII trans 92.0 0.2 4.4E-06 46.4 4.3 90 102-194 321-411 (415)
4 PF08064 UME: UME (NUC010) dom 89.6 0.54 1.2E-05 36.3 4.0 70 119-192 30-99 (107)
5 PF12719 Cnd3: Nuclear condens 89.0 1.4 3.1E-05 39.0 6.8 78 103-180 63-149 (298)
6 PF08623 TIP120: TATA-binding 83.1 1.3 2.8E-05 37.6 3.3 99 56-172 30-145 (169)
7 PF12348 CLASP_N: CLASP N term 83.1 0.82 1.8E-05 37.9 2.0 94 103-201 6-107 (228)
8 smart00802 UME Domain in UVSB 82.2 2.9 6.4E-05 32.8 4.7 70 118-191 27-98 (107)
9 KOG2171|consensus 81.0 2.5 5.3E-05 45.0 5.0 98 103-203 158-262 (1075)
10 KOG0212|consensus 77.4 6.7 0.00015 39.7 6.5 99 98-201 330-428 (675)
11 PF12755 Vac14_Fab1_bd: Vacuol 73.4 4.5 9.7E-05 31.0 3.4 66 135-203 17-83 (97)
12 PF13513 HEAT_EZ: HEAT-like re 68.3 2.2 4.8E-05 28.1 0.6 48 124-171 7-54 (55)
13 cd00020 ARM Armadillo/beta-cat 67.9 11 0.00023 26.9 4.2 72 105-176 8-80 (120)
14 KOG2023|consensus 66.9 11 0.00023 39.2 5.3 91 103-194 214-304 (885)
15 smart00185 ARM Armadillo/beta- 63.9 14 0.0003 22.1 3.6 28 146-173 13-40 (41)
16 PF12717 Cnd1: non-SMC mitotic 63.2 35 0.00075 28.0 6.9 90 103-197 24-113 (178)
17 KOG2171|consensus 62.0 8.7 0.00019 41.1 3.8 95 103-200 429-527 (1075)
18 PF13646 HEAT_2: HEAT repeats; 58.9 21 0.00046 24.7 4.3 71 106-191 1-71 (88)
19 PF12755 Vac14_Fab1_bd: Vacuol 56.6 11 0.00024 28.9 2.6 71 102-173 25-95 (97)
20 PF12348 CLASP_N: CLASP N term 54.7 69 0.0015 26.4 7.3 90 103-197 93-185 (228)
21 KOG1837|consensus 54.4 13 0.00029 41.2 3.7 79 105-186 1539-1620(1621)
22 PF11865 DUF3385: Domain of un 54.0 26 0.00056 28.8 4.6 50 124-173 107-156 (160)
23 PF12333 Ipi1_N: Rix1 complex 52.3 15 0.00032 28.2 2.7 58 137-197 3-61 (102)
24 PF02985 HEAT: HEAT repeat; I 52.1 33 0.00072 20.4 3.8 29 146-174 1-29 (31)
25 KOG1242|consensus 51.0 17 0.00038 36.4 3.7 88 102-189 370-460 (569)
26 PF10363 DUF2435: Protein of u 48.9 6.1 0.00013 30.1 0.1 44 99-143 38-82 (92)
27 PF00514 Arm: Armadillo/beta-c 45.6 48 0.001 20.6 3.9 27 147-173 14-40 (41)
28 KOG0212|consensus 44.9 29 0.00064 35.3 4.2 99 90-191 187-293 (675)
29 COG4899 Uncharacterized protei 44.4 4.7 0.0001 34.3 -1.2 55 70-126 48-104 (166)
30 KOG2956|consensus 43.7 25 0.00054 34.9 3.4 82 116-200 418-505 (516)
31 PF01602 Adaptin_N: Adaptin N 42.0 37 0.0008 31.3 4.2 83 103-191 41-123 (526)
32 KOG2023|consensus 41.4 28 0.00062 36.2 3.5 46 147-195 478-523 (885)
33 KOG2021|consensus 40.5 52 0.0011 34.8 5.2 73 117-195 710-782 (980)
34 PLN03076 ARF guanine nucleotid 38.8 26 0.00057 39.4 3.1 38 145-182 86-123 (1780)
35 PF06953 ArsD: Arsenical resis 38.3 14 0.00031 29.8 0.8 28 116-143 49-76 (123)
36 PF10363 DUF2435: Protein of u 37.0 1.1E+02 0.0025 23.1 5.5 70 118-188 17-87 (92)
37 cd00020 ARM Armadillo/beta-cat 36.1 56 0.0012 23.1 3.6 70 104-173 49-119 (120)
38 PF03378 CAS_CSE1: CAS/CSE pro 36.0 29 0.00062 33.3 2.5 56 138-195 110-165 (435)
39 PF04118 Dopey_N: Dopey, N-ter 34.8 28 0.00061 32.0 2.2 77 102-182 91-172 (307)
40 cd03569 VHS_Hrs_Vps27p VHS dom 31.4 73 0.0016 25.8 3.9 31 145-175 41-71 (142)
41 cd03568 VHS_STAM VHS domain fa 30.9 80 0.0017 25.7 4.0 32 145-176 37-68 (144)
42 PF09324 DUF1981: Domain of un 30.4 51 0.0011 24.5 2.6 51 121-171 35-85 (86)
43 cd03567 VHS_GGA VHS domain fam 29.9 82 0.0018 25.7 3.9 30 146-175 39-68 (139)
44 cd00197 VHS_ENTH_ANTH VHS, ENT 29.6 88 0.0019 23.6 3.9 45 143-190 35-79 (115)
45 cd03561 VHS VHS domain family; 28.8 1.5E+02 0.0032 23.4 5.1 48 125-177 21-69 (133)
46 PF13646 HEAT_2: HEAT repeats; 28.2 99 0.0022 21.2 3.7 39 147-191 1-40 (88)
47 PF00790 VHS: VHS domain; Int 27.7 99 0.0022 24.5 4.0 49 124-176 25-73 (140)
48 PF10508 Proteasom_PSMB: Prote 26.7 1.2E+02 0.0027 29.1 5.1 75 102-177 75-151 (503)
49 smart00288 VHS Domain present 26.6 1E+02 0.0022 24.4 3.9 30 146-175 38-67 (133)
50 PF11698 V-ATPase_H_C: V-ATPas 26.5 70 0.0015 25.9 2.9 23 152-174 93-115 (119)
51 PF12397 U3snoRNP10: U3 small 26.2 1.2E+02 0.0026 23.1 4.1 52 142-193 3-55 (121)
52 PF10508 Proteasom_PSMB: Prote 25.8 1.3E+02 0.0029 28.9 5.2 36 143-178 75-110 (503)
53 KOG1062|consensus 24.3 50 0.0011 34.8 2.0 53 138-194 172-224 (866)
54 KOG4653|consensus 24.0 75 0.0016 33.9 3.3 83 102-190 845-931 (982)
55 PF11865 DUF3385: Domain of un 23.1 1.5E+02 0.0032 24.3 4.3 70 109-202 71-142 (160)
56 cd08819 CARD_MDA5_2 Caspase ac 23.0 22 0.00048 27.5 -0.5 16 103-118 66-81 (88)
57 PF13251 DUF4042: Domain of un 22.8 2E+02 0.0044 24.5 5.2 65 114-178 112-178 (182)
58 KOG2149|consensus 22.2 1.4E+02 0.0031 28.8 4.6 89 109-200 100-193 (393)
59 PRK15356 type III secretion sy 22.1 44 0.00095 25.4 0.9 12 137-148 14-25 (75)
60 cd03572 ENTH_epsin_related ENT 21.7 1.3E+02 0.0028 24.3 3.6 46 150-198 43-88 (122)
61 PF01602 Adaptin_N: Adaptin N 20.5 1.1E+02 0.0024 28.2 3.4 27 146-172 153-179 (526)
62 KOG1824|consensus 20.4 1.2E+02 0.0027 33.0 4.0 92 97-190 1098-1203(1233)
No 1
>PF10274 ParcG: Parkin co-regulated protein; InterPro: IPR019399 This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system []. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
Probab=100.00 E-value=9e-66 Score=433.99 Aligned_cols=138 Identities=62% Similarity=1.097 Sum_probs=136.3
Q ss_pred ChhHHHHhcCCCCEEEeeCCCCCeeeeeccCCCCCCCCchhhhhcccccccchhHhHHHhHHHHHHHc-CCCceeccccc
Q psy7091 67 SMFRTYYSRGDFPIAMEFKSVGNKISWKEDIKTLNYQYYLPIFFDGLRETEYPYKFFASKGIHDLLEN-GKDKIVPVLPH 145 (205)
Q Consensus 67 T~Fr~~Y~rGdlP~~i~h~s~~~~L~Wkv~~e~LDy~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~-~~~KIlPvlPq 145 (205)
|+||++|+||||||+++||+.+++|+|+|+||+||||||||+||||||||+|||+|+|++|++|||++ |++||+|+|||
T Consensus 1 t~Fr~~Y~rG~lP~~i~h~~~~~~l~W~~~~e~Ldy~~~Lpif~dGL~Et~~Py~flA~~g~~dll~~~~~~kilPvlPq 80 (183)
T PF10274_consen 1 TQFRRFYERGDLPCRIDHGSVKNKLQWKVDPEKLDYHHYLPIFFDGLRETEHPYRFLARQGIKDLLERGGGEKILPVLPQ 80 (183)
T ss_pred ChHHHHHhcCCCCEEEECCCCCceeEEecChhhcchhhHHHHHHhhhhccCccHHHHHHHHHHHHHHhcchhHHHHHHHH
Confidence 78999999999999999999999999999999999999999999999999999999999999999999 88999999999
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhhHhhhccccC
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLNMYKDLNNRY 204 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlNlfk~~n~n~ 204 (205)
||+|||+||||||++|+|++|++||+||+++++||||||||||||||+||+||++|.|.
T Consensus 81 LI~plk~AL~tr~~~V~~~~L~~Lq~Lv~~~~~vG~aLvPyyrqLLp~ln~f~~k~~n~ 139 (183)
T PF10274_consen 81 LIIPLKRALNTRDPEVFCATLKALQQLVTSSDMVGEALVPYYRQLLPVLNLFKNKNVNL 139 (183)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHhcccCC
Confidence 99999999999999999999999999999999999999999999999999999999875
No 2
>KOG3961|consensus
Probab=100.00 E-value=1.3e-65 Score=445.91 Aligned_cols=167 Identities=51% Similarity=0.857 Sum_probs=164.6
Q ss_pred CccccccccccccC-CCCccCCCCCCCCCCChhHHHHhcCCCCEEEeeCCCCCeeeeeccCCCCCCCCchhhhhcccccc
Q psy7091 38 AFSLQTHQKNTITG-APPKCVPKDKTYDYTSMFRTYYSRGDFPIAMEFKSVGNKISWKEDIKTLNYQYYLPIFFDGLRET 116 (205)
Q Consensus 38 ~Ft~~s~q~~t~~~-~pp~~~~f~~r~~~~T~Fr~~Y~rGdlP~~i~h~s~~~~L~Wkv~~e~LDy~~yLPiFfdGLrEt 116 (205)
+||.++.++++.+. .||++++|++++..+|+||.+|+||||||+.+|++++++|+|+|+||+|||++|||+|||||+|+
T Consensus 47 p~s~~~~~kr~~v~~~pp~~~~~~~~~~~ps~fr~~~~Rgdlpis~~~~~k~~~l~W~~~pe~Ldy~~yLp~F~dGL~e~ 126 (262)
T KOG3961|consen 47 PFSARKLHKRPVVRLGPPTINAFRERPPVPSAFRAIYSRGDLPISLEHGSKGRKLQWECDPEKLDYCPYLPLFFDGLAET 126 (262)
T ss_pred chhHhhcCCCceeccCCCCCccccccCCCccHHHHHHccCCCceecccCCCCCccccccCHHhccchHHHHHHhhhhhhc
Confidence 79999999999998 89999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhhH
Q psy7091 117 EYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLNM 196 (205)
Q Consensus 117 ~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlNl 196 (205)
+|||+|+|+||+.|||.++|+||+|||||||+|||+||+|||.||+|.+|++||+||.++++||.|||||||||||+||+
T Consensus 127 ~hpyrf~A~~Gi~DLLl~~g~kilpVLPqLI~plK~al~trd~ev~~~~Lkvlq~lv~~~~~vG~aLVPfYRQlLp~~n~ 206 (262)
T KOG3961|consen 127 DHPYRFVARQGITDLLLAGGEKILPVLPQLILPLKAALVTRDDEVICRTLKVLQQLVVSVGCVGAALVPFYRQLLPVLNT 206 (262)
T ss_pred CCCcchhhhhcHHHHHHhcccccccccHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHhccccchhhhhHHHHhhhhhhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhccccC
Q psy7091 197 YKDLNNRY 204 (205)
Q Consensus 197 fk~~n~n~ 204 (205)
||++|.|.
T Consensus 207 ~k~~n~n~ 214 (262)
T KOG3961|consen 207 FKNSNVNR 214 (262)
T ss_pred hccccccc
Confidence 99999884
No 3
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=92.03 E-value=0.2 Score=46.42 Aligned_cols=90 Identities=22% Similarity=0.417 Sum_probs=73.8
Q ss_pred CCCchhhhhcccccccchhHhHHHhHHHHHHHcCC-CceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccc
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASKGIHDLLENGK-DKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVG 180 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~-~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG 180 (205)
|...+|.+.+|.++....=+-..-.++--+|++-+ +-+++=+|+|++=+=.||...|+++...+|++|..++..+ -
T Consensus 321 F~~~~p~L~~~~~~~~~~~k~~yL~ALs~ll~~vP~~vl~~~l~~LlPLLlqsL~~~~~~v~~s~L~tL~~~l~~~---~ 397 (415)
T PF12460_consen 321 FTQVLPKLLEGFKEADDEIKSNYLTALSHLLKNVPKSVLLPELPTLLPLLLQSLSLPDADVLLSSLETLKMILEEA---P 397 (415)
T ss_pred HHHHHHHHHHHHhhcChhhHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHcC---H
Confidence 67889999999999876422223557777888755 6788999999999999999999999999999999998765 6
Q ss_pred cchhhHHHhhhhhh
Q psy7091 181 EALVPYYKQILPHL 194 (205)
Q Consensus 181 ~ALvPyyRqLLPvl 194 (205)
+++-+|...|.|.|
T Consensus 398 ~~i~~hl~sLI~~L 411 (415)
T PF12460_consen 398 ELISEHLSSLIPRL 411 (415)
T ss_pred HHHHHHHHHHHHHH
Confidence 77777888888764
No 4
>PF08064 UME: UME (NUC010) domain; InterPro: IPR012993 This domain is characteristic of UVSB PI-3 kinase, MEI-41 and ESR1 [].; GO: 0004674 protein serine/threonine kinase activity
Probab=89.56 E-value=0.54 Score=36.34 Aligned_cols=70 Identities=26% Similarity=0.370 Sum_probs=55.7
Q ss_pred hhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhh
Q psy7091 119 PYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILP 192 (205)
Q Consensus 119 PY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLP 192 (205)
+.+--|..|+.+|++.++.-|-++.||+..=|++||... +.=..|+++...+++.=+ .+.|.|..-|+.-
T Consensus 30 ~ek~~~l~si~~lI~~~~~~i~~~~pQI~a~L~sal~~~--~l~~~al~~W~~fi~~L~--~~~l~~ll~~~~~ 99 (107)
T PF08064_consen 30 PEKKRALRSIEELIKLGGSHISSARPQIMACLQSALEIP--ELREEALSCWNCFIKTLD--EEDLGPLLDQIFA 99 (107)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhCCh--hhHHHHHHHHHHHHHHCC--HHHHHHHHHHHHH
Confidence 345668889999999999999999999999999999955 777778888888887533 3666666666543
No 5
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=89.03 E-value=1.4 Score=39.00 Aligned_cols=78 Identities=24% Similarity=0.317 Sum_probs=68.0
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceeccccc---------chHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPH---------LILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPq---------LI~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
..++++|+..+.+.+++-+-.|-+++-|++..-|..++..-.. ++--+..+|.+.|+++-..+.+.+-+|.
T Consensus 63 ~~~l~l~~~~~~~~~~~v~~~al~~l~Dll~~~g~~~~~~~~~~~~~~~~~~l~~~l~~~l~~~~~~~~~~a~EGl~KLl 142 (298)
T PF12719_consen 63 KEHLPLFLQALQKDDEEVKITALKALFDLLLTHGIDIFDSESDNDESVDSKSLLKILTKFLDSENPELQAIAVEGLCKLL 142 (298)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHcCchhccchhccCccchHhHHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Confidence 5789999999977788899999999999999977777777775 8888999999999999999999999998
Q ss_pred hcCCccc
Q psy7091 174 TSSSMVG 180 (205)
Q Consensus 174 ~~~~~VG 180 (205)
.++-...
T Consensus 143 L~~~i~~ 149 (298)
T PF12719_consen 143 LSGRISD 149 (298)
T ss_pred hcCCCCc
Confidence 7765555
No 6
>PF08623 TIP120: TATA-binding protein interacting (TIP20); InterPro: IPR013932 TIP120 (also known as cullin-associated and neddylation-dissociated protein 1) is a TATA binding protein interacting protein that enhances transcription []. ; PDB: 4A0C_A 1U6G_C.
Probab=83.14 E-value=1.3 Score=37.65 Aligned_cols=99 Identities=18% Similarity=0.308 Sum_probs=75.6
Q ss_pred cCCCCCCC-----CCCChhHHHHhcCCCCEEEeeCCCCCeeeeeccCCCCCCCCchhhhhcccccccchhHhHHHhHHHH
Q psy7091 56 CVPKDKTY-----DYTSMFRTYYSRGDFPIAMEFKSVGNKISWKEDIKTLNYQYYLPIFFDGLRETEYPYKFFASKGIHD 130 (205)
Q Consensus 56 ~~~f~~r~-----~~~T~Fr~~Y~rGdlP~~i~h~s~~~~L~Wkv~~e~LDy~~yLPiFfdGLrEt~hPY~FlA~~G~~d 130 (205)
-|||+.+- ..+++|.-+|.==| .-.+.+|...++.....||.+ +|--+.++-.-+.-
T Consensus 30 mGPFKh~vDDGLelRK~ayE~lytlLd-----------------~~~~~~~~~~~~~~v~~GL~D-~~DIk~L~~~~l~k 91 (169)
T PF08623_consen 30 MGPFKHKVDDGLELRKAAYECLYTLLD-----------------TCLSRIDISEFLDRVEAGLKD-EHDIKMLCHLMLSK 91 (169)
T ss_dssp ETTCEEEEEGGGHHHHHHHHHHHHHHH-----------------STCSSS-HHHHHHHHHHTTSS--HHHHHHHHHHHHH
T ss_pred cCCceeeecCcHHHHHHHHHHHHHHHH-----------------HHHHhCCHHHHHHHHHhhcCC-cHHHHHHHHHHHHH
Confidence 37777653 23566766665433 234677888899999999999 88889999999999
Q ss_pred HHHcCCCceecccccchHHHHHhhcCC------------CHHHHHHHHHHHHHH
Q psy7091 131 LLENGKDKIVPVLPHLILPIKNALSTR------------NPEVICETLKVLQHL 172 (205)
Q Consensus 131 LL~~~~~KIlPvlPqLI~PiK~AL~Tr------------d~~V~~~~L~~Lq~L 172 (205)
|...++.-++.-|.+|+.|+|+-|+++ ..|.+..+|+++..|
T Consensus 92 l~~~~p~~v~~~Ld~l~~~l~~~L~~k~k~~AvkQE~Ek~~E~~rs~lr~~~~l 145 (169)
T PF08623_consen 92 LAQLAPEEVLQRLDSLVEPLRKTLSKKLKENAVKQEIEKQQELIRSVLRAVKAL 145 (169)
T ss_dssp HHHS-HHHHHHCCTTTHHHHHHHHH----TTS-HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHhCHHHHHHHHHHHHHHHHHHhhccCCCCcccccHHHHHHHHHHHHHHHHHH
Confidence 988899999999999999999999764 567788899888877
No 7
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=83.05 E-value=0.82 Score=37.89 Aligned_cols=94 Identities=22% Similarity=0.339 Sum_probs=57.8
Q ss_pred CCchhhhhcccccccchhH--hHHHhHHHHHHHcC-----CCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhc
Q psy7091 103 QYYLPIFFDGLRETEYPYK--FFASKGIHDLLENG-----KDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTS 175 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~--FlA~~G~~dLL~~~-----~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~ 175 (205)
+..+..| ++ +|+++-+. .=|.+-+..++..+ .+.++..+.+++..|-..+++.+-.|...++..|+.|+
T Consensus 6 ~~~~~~l-~~-~~~~~~W~~r~~al~~L~~l~~~~~~~~~~~~~~~~l~~~~~~i~~~l~d~Rs~v~~~A~~~l~~l~-- 81 (228)
T PF12348_consen 6 EEILAAL-EK-KESESDWEERVEALQKLRSLIKGNAPEDFPPDFVECLRQLLDAIIKQLSDLRSKVSKTACQLLSDLA-- 81 (228)
T ss_dssp GGS-TTH-HH-HHT-SSHHHHHHHHHHHHHHHHH-B-----HHHHHHHH---HHHHH-S-HH---HHHHHHHHHHHHH--
T ss_pred HHHHHHH-hc-cCCccCHHHHHHHHHHHHHHHHcCCccccHHHHHHHHHHhHHHHHHHHhhhHHHHHHHHHHHHHHHH--
Confidence 3444433 33 66666663 34455666666555 35677778888889999999999999999999999885
Q ss_pred CCccccchhhHHHhhhhhh-hHhhhcc
Q psy7091 176 SSMVGEALVPYYKQILPHL-NMYKDLN 201 (205)
Q Consensus 176 ~~~VG~ALvPyyRqLLPvl-Nlfk~~n 201 (205)
...|..+-||..+++|.| ....+.+
T Consensus 82 -~~l~~~~~~~~~~~l~~Ll~~~~~~~ 107 (228)
T PF12348_consen 82 -RQLGSHFEPYADILLPPLLKKLGDSK 107 (228)
T ss_dssp -HHHGGGGHHHHHHHHHHHHHGGG---
T ss_pred -HHHhHhHHHHHHHHHHHHHHHHcccc
Confidence 578999999999999965 4544443
No 8
>smart00802 UME Domain in UVSB PI-3 kinase, MEI-41 and ESR-1. Characteristic domain in UVSP PI-3 kinase, MEI-41 and ESR-1. Found in nucleolar proteins. Associated with FAT, FATC, PI3_PI4_kinase modules.
Probab=82.18 E-value=2.9 Score=32.80 Aligned_cols=70 Identities=29% Similarity=0.397 Sum_probs=55.5
Q ss_pred chh--HhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhh
Q psy7091 118 YPY--KFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQIL 191 (205)
Q Consensus 118 hPY--~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLL 191 (205)
.|| +--|..++.+|++.+|.-|-.++||+..=|++||+ ++|.=..++++...+++. +=.+.|-|-.-|++
T Consensus 27 ~~~~ek~~~i~ai~~lI~~~g~~i~~a~pQI~acL~saL~--~~eL~~~al~~W~~~i~~--L~~~~l~~ll~~~~ 98 (107)
T smart00802 27 KPYNEKKRALRSIGFLIKLMGKHISSALPQIMACLQSALE--IPELRSLALRCWHVLIKT--LKEEELGPLLDQIF 98 (107)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--chhHHHHHHHHHHHHHHh--CCHHHHHHHHHHHH
Confidence 466 77788999999999999999999999999999998 777888888888888765 22344555444443
No 9
>KOG2171|consensus
Probab=81.02 E-value=2.5 Score=45.01 Aligned_cols=98 Identities=19% Similarity=0.287 Sum_probs=81.9
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcC------CCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcC
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENG------KDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSS 176 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~------~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~ 176 (205)
+.++++|..|+.+-.+|-++.|.+++.-+...- .++.-++||.+|.-+..-++..|.++-+.+|++|--|+
T Consensus 158 ~~l~~lf~q~~~d~s~~vr~~a~rA~~a~~~~~~~~~~~~~~~~~llP~~l~vl~~~i~~~d~~~a~~~l~~l~El~--- 234 (1075)
T KOG2171|consen 158 DDLLRLFSQTMTDPSSPVRVAAVRALGAFAEYLENNKSEVDKFRDLLPSLLNVLQEVIQDGDDDAAKSALEALIELL--- 234 (1075)
T ss_pred HHHHHHHHHhccCCcchHHHHHHHHHHHHHHHhccchHHHHHHHHHhHHHHHHhHhhhhccchHHHHHHHHHHHHHH---
Confidence 367899999999999999999999877666543 26888999999999999999999999999999988885
Q ss_pred CccccchhhHHHhhhhh-hhHhhhcccc
Q psy7091 177 SMVGEALVPYYKQILPH-LNMYKDLNNR 203 (205)
Q Consensus 177 ~~VG~ALvPyyRqLLPv-lNlfk~~n~n 203 (205)
+..+.-|-||+-+++.+ +-+.+|+..|
T Consensus 235 e~~pk~l~~~l~~ii~~~l~Ia~n~~l~ 262 (1075)
T KOG2171|consen 235 ESEPKLLRPHLSQIIQFSLEIAKNKELE 262 (1075)
T ss_pred hhchHHHHHHHHHHHHHHHHHhhccccc
Confidence 56678888999999875 4466666543
No 10
>KOG0212|consensus
Probab=77.41 E-value=6.7 Score=39.69 Aligned_cols=99 Identities=19% Similarity=0.255 Sum_probs=69.4
Q ss_pred CCCCCCCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCC
Q psy7091 98 KTLNYQYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSS 177 (205)
Q Consensus 98 e~LDy~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~ 177 (205)
+++||...+-..--=|.--.+--+.-|-+=+.-|..+.+.+++--.-++-.-+=++|.-++.+|+..+|.+|+.+++++.
T Consensus 330 ~~id~~~ii~vl~~~l~~~~~~tri~~L~Wi~~l~~~~p~ql~~h~~~if~tLL~tLsd~sd~vvl~~L~lla~i~~s~~ 409 (675)
T KOG0212|consen 330 EEIDYGSIIEVLTKYLSDDREETRIAVLNWIILLYHKAPGQLLVHNDSIFLTLLKTLSDRSDEVVLLALSLLASICSSSN 409 (675)
T ss_pred cccchHHHHHHHHHHhhcchHHHHHHHHHHHHHHHhhCcchhhhhccHHHHHHHHhhcCchhHHHHHHHHHHHHHhcCcc
Confidence 34666554444333333333333444555555566668899999999999999999999999999999999999999887
Q ss_pred ccccchhhHHHhhhhhhhHhhhcc
Q psy7091 178 MVGEALVPYYKQILPHLNMYKDLN 201 (205)
Q Consensus 178 ~VG~ALvPyyRqLLPvlNlfk~~n 201 (205)
-. . |.+=++..|++|+..+
T Consensus 410 ~~--~---~~~fl~sLL~~f~e~~ 428 (675)
T KOG0212|consen 410 SP--N---LRKFLLSLLEMFKEDT 428 (675)
T ss_pred cc--c---HHHHHHHHHHHHhhhh
Confidence 76 2 2333455678887755
No 11
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=73.44 E-value=4.5 Score=31.01 Aligned_cols=66 Identities=15% Similarity=0.329 Sum_probs=53.1
Q ss_pred CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhh-Hhhhcccc
Q psy7091 135 GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLN-MYKDLNNR 203 (205)
Q Consensus 135 ~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlN-lfk~~n~n 203 (205)
=+..+-..+++||.|+=+.++..|..|=-.+.++|-.++ ...++++.+|+-+|...|- +..+.+.|
T Consensus 17 l~~~~~~~l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~---k~~~~~~l~~f~~IF~~L~kl~~D~d~~ 83 (97)
T PF12755_consen 17 LGKDISKYLDEILPPVLKCFDDQDSRVRYYACEALYNIS---KVARGEILPYFNEIFDALCKLSADPDEN 83 (97)
T ss_pred chHhHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHcCCchh
Confidence 344588899999999999999999999999999988885 4567888899999998764 44444433
No 12
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=68.31 E-value=2.2 Score=28.10 Aligned_cols=48 Identities=21% Similarity=0.258 Sum_probs=37.1
Q ss_pred HHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHH
Q psy7091 124 ASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQH 171 (205)
Q Consensus 124 A~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~ 171 (205)
|-.++-.+.+..++.+-|.+|+++..|-..|+..+++|=..+..+|..
T Consensus 7 A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~ 54 (55)
T PF13513_consen 7 AAWALGRLAEGCPELLQPYLPELLPALIPLLQDDDDSVRAAAAWALGN 54 (55)
T ss_dssp HHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSSSHHHHHHHHHHHHC
T ss_pred HHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHhc
Confidence 333444444556678889999999999999999999998888887754
No 13
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=67.86 E-value=11 Score=26.93 Aligned_cols=72 Identities=19% Similarity=0.227 Sum_probs=43.8
Q ss_pred chhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccc-cchHHHHHhhcCCCHHHHHHHHHHHHHHHhcC
Q psy7091 105 YLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLP-HLILPIKNALSTRNPEVICETLKVLQHLVTSS 176 (205)
Q Consensus 105 yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlP-qLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~ 176 (205)
.++.+.+=|.+.....+.-|-..+..|-....+..-.++. ..|..+-..|++.|+++...++.+|..|+...
T Consensus 8 ~i~~l~~~l~~~~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~~ 80 (120)
T cd00020 8 GLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAGP 80 (120)
T ss_pred ChHHHHHHHHcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHccCc
Confidence 3444444444444444555555555554432222222222 66777778889999999999999999997643
No 14
>KOG2023|consensus
Probab=66.92 E-value=11 Score=39.17 Aligned_cols=91 Identities=22% Similarity=0.332 Sum_probs=72.9
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccc
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEA 182 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~A 182 (205)
|.||-++|-=--..++=-|=-.=+++.-||+.-.+|++|-++.+|-=+=.--.+-|.+|-..|=+-.-.++.. +.--+.
T Consensus 214 D~Fle~lFalanD~~~eVRk~vC~alv~Llevr~dkl~phl~~IveyML~~tqd~dE~VALEACEFwla~aeq-pi~~~~ 292 (885)
T KOG2023|consen 214 DKFLEILFALANDEDPEVRKNVCRALVFLLEVRPDKLVPHLDNIVEYMLQRTQDVDENVALEACEFWLALAEQ-PICKEV 292 (885)
T ss_pred HHHHHHHHHHccCCCHHHHHHHHHHHHHHHHhcHHhcccchHHHHHHHHHHccCcchhHHHHHHHHHHHHhcC-cCcHHH
Confidence 5677766654444444445555668888999999999999999999988888888999887777777777775 488899
Q ss_pred hhhHHHhhhhhh
Q psy7091 183 LVPYYKQILPHL 194 (205)
Q Consensus 183 LvPyyRqLLPvl 194 (205)
|.||..+|.|+|
T Consensus 293 L~p~l~kliPvL 304 (885)
T KOG2023|consen 293 LQPYLDKLIPVL 304 (885)
T ss_pred HHHHHHHHHHHH
Confidence 999999999997
No 15
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=63.90 E-value=14 Score=22.11 Aligned_cols=28 Identities=21% Similarity=0.242 Sum_probs=23.3
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
.|.+|...|.+.|+++...++.+|..|+
T Consensus 13 ~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 13 GLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred CHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 4666677788999999999999998874
No 16
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=63.23 E-value=35 Score=27.96 Aligned_cols=90 Identities=19% Similarity=0.245 Sum_probs=69.2
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccc
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEA 182 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~A 182 (205)
++|+|.++..|+..+..-|--|...+..|+..+--|+= +++..-+=.+|.-.|++|-..|...+..+... .-+..
T Consensus 24 e~~~~~l~~~L~D~~~~VR~~al~~Ls~Li~~d~ik~k---~~l~~~~l~~l~D~~~~Ir~~A~~~~~e~~~~--~~~~~ 98 (178)
T PF12717_consen 24 EPYLPNLYKCLRDEDPLVRKTALLVLSHLILEDMIKVK---GQLFSRILKLLVDENPEIRSLARSFFSELLKK--RNPNI 98 (178)
T ss_pred HhHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCceeeh---hhhhHHHHHHHcCCCHHHHHHHHHHHHHHHHh--ccchH
Confidence 68999999999999888888889999999977655544 55555555788999999999999999888543 24555
Q ss_pred hhhHHHhhhhhhhHh
Q psy7091 183 LVPYYKQILPHLNMY 197 (205)
Q Consensus 183 LvPyyRqLLPvlNlf 197 (205)
+..++-.++-.||-.
T Consensus 99 i~~~~~e~i~~l~~~ 113 (178)
T PF12717_consen 99 IYNNFPELISSLNNC 113 (178)
T ss_pred HHHHHHHHHHHHhCc
Confidence 666666777666654
No 17
>KOG2171|consensus
Probab=62.03 E-value=8.7 Score=41.07 Aligned_cols=95 Identities=19% Similarity=0.306 Sum_probs=70.1
Q ss_pred CCchh-hhhcccccccchhHhH-HHhHHHHHHHc-CCCceecccccchH-HHHHhhcCCCHHHHHHHHHHHHHHHhcCCc
Q psy7091 103 QYYLP-IFFDGLRETEYPYKFF-ASKGIHDLLEN-GKDKIVPVLPHLIL-PIKNALSTRNPEVICETLKVLQHLVTSSSM 178 (205)
Q Consensus 103 ~~yLP-iFfdGLrEt~hPY~Fl-A~~G~~dLL~~-~~~KIlPvlPqLI~-PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~ 178 (205)
|-+|| ...+=+--+++|=.-- |--..-.+.+. .++.|-|.||.|.- =+..+++++.+.|-+.++.+| ...++.
T Consensus 429 ~e~l~~aL~~~ld~~~~~rV~ahAa~al~nf~E~~~~~~l~pYLd~lm~~~l~~L~~~~~~~v~e~vvtaI---asvA~A 505 (1075)
T KOG2171|consen 429 HERLPPALIALLDSTQNVRVQAHAAAALVNFSEECDKSILEPYLDGLMEKKLLLLLQSSKPYVQEQAVTAI---ASVADA 505 (1075)
T ss_pred HHhccHHHHHHhcccCchHHHHHHHHHHHHHHHhCcHHHHHHHHHHHHHHHHHHHhcCCchhHHHHHHHHH---HHHHHH
Confidence 44444 5666666666654221 22233334444 66899999999999 788999999999999999886 555789
Q ss_pred cccchhhHHHhhhhhhhHhhhc
Q psy7091 179 VGEALVPYYKQILPHLNMYKDL 200 (205)
Q Consensus 179 VG~ALvPyyRqLLPvlNlfk~~ 200 (205)
.|++.+|||..++|.|--|-..
T Consensus 506 A~~~F~pY~d~~Mp~L~~~L~n 527 (1075)
T KOG2171|consen 506 AQEKFIPYFDRLMPLLKNFLQN 527 (1075)
T ss_pred HhhhhHhHHHHHHHHHHHHHhC
Confidence 9999999999999998755443
No 18
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=58.95 E-value=21 Score=24.66 Aligned_cols=71 Identities=17% Similarity=0.244 Sum_probs=43.7
Q ss_pred hhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhh
Q psy7091 106 LPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVP 185 (205)
Q Consensus 106 LPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvP 185 (205)
+|.+++-|.+.++|+ .+.-+-+.|..-+ -++.+..|..+|++.|+.|-..+..+|-.+ + ++..++
T Consensus 1 i~~L~~~l~~~~~~~---vr~~a~~~L~~~~------~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i---~---~~~~~~ 65 (88)
T PF13646_consen 1 IPALLQLLQNDPDPQ---VRAEAARALGELG------DPEAIPALIELLKDEDPMVRRAAARALGRI---G---DPEAIP 65 (88)
T ss_dssp HHHHHHHHHTSSSHH---HHHHHHHHHHCCT------HHHHHHHHHHHHTSSSHHHHHHHHHHHHCC---H---HHHTHH
T ss_pred CHHHHHHHhcCCCHH---HHHHHHHHHHHcC------CHhHHHHHHHHHcCCCHHHHHHHHHHHHHh---C---CHHHHH
Confidence 355666664444454 3444444554322 236788889999999999999999887544 1 344555
Q ss_pred HHHhhh
Q psy7091 186 YYKQIL 191 (205)
Q Consensus 186 yyRqLL 191 (205)
.+.++|
T Consensus 66 ~L~~~l 71 (88)
T PF13646_consen 66 ALIKLL 71 (88)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 555533
No 19
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=56.56 E-value=11 Score=28.85 Aligned_cols=71 Identities=11% Similarity=0.305 Sum_probs=60.8
Q ss_pred CCCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
++..+|..+.++...+.=-|+.|-+.+.++.+.+++.+++..+++.-.|-+-+.-.|+.|-..+ +.|-+|+
T Consensus 25 l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~k~~~~~~l~~f~~IF~~L~kl~~D~d~~Vr~~a-~~Ld~ll 95 (97)
T PF12755_consen 25 LDEILPPVLKCFDDQDSRVRYYACEALYNISKVARGEILPYFNEIFDALCKLSADPDENVRSAA-ELLDRLL 95 (97)
T ss_pred HHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchhHHHHH-HHHHHHh
Confidence 3567788889999999999999999999999999999999999999999999988888876665 5555554
No 20
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=54.66 E-value=69 Score=26.40 Aligned_cols=90 Identities=18% Similarity=0.268 Sum_probs=53.5
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccc-hHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCcccc
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHL-ILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGE 181 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqL-I~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ 181 (205)
+.++|.+++=+-++..-.+--|.+.+..|++..+ +.+++ +.-+..+++++|+.+=..+++.|..++.....-..
T Consensus 93 ~~~l~~Ll~~~~~~~~~i~~~a~~~L~~i~~~~~-----~~~~~~~~~l~~~~~~Kn~~vR~~~~~~l~~~l~~~~~~~~ 167 (228)
T PF12348_consen 93 DILLPPLLKKLGDSKKFIREAANNALDAIIESCS-----YSPKILLEILSQGLKSKNPQVREECAEWLAIILEKWGSDSS 167 (228)
T ss_dssp HHHHHHHHHGGG---HHHHHHHHHHHHHHHTTS------H--HHHHHHHHHHTT-S-HHHHHHHHHHHHHHHTT-----G
T ss_pred HHHHHHHHHHHccccHHHHHHHHHHHHHHHHHCC-----cHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHccchHh
Confidence 3455666666666666666666777777776644 34455 67788899999999999999999888765442224
Q ss_pred chhh--HHHhhhhhhhHh
Q psy7091 182 ALVP--YYKQILPHLNMY 197 (205)
Q Consensus 182 ALvP--yyRqLLPvlNlf 197 (205)
.+-. .+.++.+.+.-+
T Consensus 168 ~l~~~~~~~~l~~~l~~~ 185 (228)
T PF12348_consen 168 VLQKSAFLKQLVKALVKL 185 (228)
T ss_dssp GG--HHHHHHHHHHHHHH
T ss_pred hhcccchHHHHHHHHHHH
Confidence 4433 457777776544
No 21
>KOG1837|consensus
Probab=54.38 E-value=13 Score=41.17 Aligned_cols=79 Identities=22% Similarity=0.372 Sum_probs=70.6
Q ss_pred chhhhhcccccccchh---HhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCcccc
Q psy7091 105 YLPIFFDGLRETEYPY---KFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGE 181 (205)
Q Consensus 105 yLPiFfdGLrEt~hPY---~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ 181 (205)
|=|+--+||.-|..-| ||.|..-++.|...=|+-.++.+||+|+-|-.-+.--|.+|=|.+-++++++ -++.||
T Consensus 1539 ~~pl~~k~l~~trss~~~~r~~ai~~~~~l~~~lge~~~~lL~q~iPfLaEL~ED~~~~Ve~~~q~li~q~---e~~lGE 1615 (1621)
T KOG1837|consen 1539 LKPLNQKILKKTRSSSRKARYLAIIQVKLLYTKLGENVIVLLPQSIPFLAELMEDEDDEVECLCQKLIRQL---EEVLGE 1615 (1621)
T ss_pred hHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHhcchhHHhhhhhhHHHHHHHhhhHHHHHHHHHHHHHHH---HHHhch
Confidence 7788889998888886 7999999999999999999999999999999999989999999998888877 578999
Q ss_pred chhhH
Q psy7091 182 ALVPY 186 (205)
Q Consensus 182 ALvPy 186 (205)
-|..|
T Consensus 1616 ~l~~y 1620 (1621)
T KOG1837|consen 1616 PLQSY 1620 (1621)
T ss_pred hhhhc
Confidence 87766
No 22
>PF11865 DUF3385: Domain of unknown function (DUF3385); InterPro: IPR024585 This uncharacterised domain is is typically between 160 to 172 amino acids in length. It is found in the phosphatidylinositol kinase-related protein kinases TOR (target of rapamycin). In Saccharomyces cerevisiae the TOR proteins, TOR1 and TOR2, regulate growth in a rapamycin-sensitive manner [].
Probab=54.03 E-value=26 Score=28.81 Aligned_cols=50 Identities=22% Similarity=0.196 Sum_probs=39.8
Q ss_pred HHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 124 ASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 124 A~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
+.+.+..+++.-+.|.+|.+||+|+++=+.+.+-++..-+..++-|..||
T Consensus 107 vv~ai~~If~~l~~~cv~~L~~viP~~l~~i~~~~~~~~e~~~~qL~~lv 156 (160)
T PF11865_consen 107 VVQAIMYIFKSLGLKCVPYLPQVIPIFLRVIRTCPDSLREFYFQQLADLV 156 (160)
T ss_pred HHHHHHHHHHhcCcCchhHHHHHhHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 45555666666777889999999999999999888888888777666665
No 23
>PF12333 Ipi1_N: Rix1 complex component involved in 60S ribosome maturation; InterPro: IPR024679 This domain is found in IPI1, which is a component of the Rix1 complex involved in pre-rRNA-processing [, ]. It is also found in testis-expressed sequence 10 protein, a nuclear membrane protein, which is a component of the MLL1/MLL complex [].
Probab=52.27 E-value=15 Score=28.18 Aligned_cols=58 Identities=26% Similarity=0.451 Sum_probs=47.0
Q ss_pred CceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchh-hHHHhhhhhhhHh
Q psy7091 137 DKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALV-PYYKQILPHLNMY 197 (205)
Q Consensus 137 ~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALv-PyyRqLLPvlNlf 197 (205)
+.+-|.++.++.-|+.|+-+=+++|=..+|+.|..|+. ..++.++ -+..++|+-+--+
T Consensus 3 ~~l~p~~~~l~~~i~sAMTHi~~~Ir~dsl~~L~~lL~---~~p~~~~~~~~~kil~~f~~l 61 (102)
T PF12333_consen 3 ELLSPFFPLLMLYISSAMTHISPDIREDSLKFLDLLLE---HAPDELCSGGWVKILPNFLDL 61 (102)
T ss_pred HHHHhHHHHHHHHHHHHHHhCCHHHHHhHHHHHHHHHH---HCChHhHhhhHHHHHHHHHHH
Confidence 45678899999999999999999999999999999984 5566644 4777777765533
No 24
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=52.10 E-value=33 Score=20.44 Aligned_cols=29 Identities=21% Similarity=0.259 Sum_probs=24.2
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHHHh
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHLVT 174 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~LV~ 174 (205)
|++.+.+.|+..|++|=.++.+.|..+++
T Consensus 1 llp~l~~~l~D~~~~VR~~a~~~l~~i~~ 29 (31)
T PF02985_consen 1 LLPILLQLLNDPSPEVRQAAAECLGAIAE 29 (31)
T ss_dssp HHHHHHHHHT-SSHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHHh
Confidence 46778899999999999999999988864
No 25
>KOG1242|consensus
Probab=50.99 E-value=17 Score=36.41 Aligned_cols=88 Identities=24% Similarity=0.348 Sum_probs=65.8
Q ss_pred CCCchhhhhcccccccchhHhHHHhHHHHHHHc--CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCcc
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASKGIHDLLEN--GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMV 179 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~--~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~V 179 (205)
.+-..|+.--||.|..-.-.=-+.+=+..|-.. .+.-+-|.||+|++-+|.++.|-+|||=..+.++|-.|+.--..+
T Consensus 370 LalmvpiL~R~l~eRst~~kr~t~~IidNm~~LveDp~~lapfl~~Llp~lk~~~~d~~PEvR~vaarAL~~l~e~~g~~ 449 (569)
T KOG1242|consen 370 LALMVPILKRGLAERSTSIKRKTAIIIDNMCKLVEDPKDLAPFLPSLLPGLKENLDDAVPEVRAVAARALGALLERLGEV 449 (569)
T ss_pred HHHHHHHHHHHHhhccchhhhhHHHHHHHHHHhhcCHHHHhhhHHHHhhHHHHHhcCCChhHHHHHHHHHHHHHHHHHhh
Confidence 345778999999998888775555555556544 467899999999999999999999999999988886665543334
Q ss_pred c-cchhhHHHh
Q psy7091 180 G-EALVPYYKQ 189 (205)
Q Consensus 180 G-~ALvPyyRq 189 (205)
+ +.++|-.-.
T Consensus 450 ~f~d~~p~l~e 460 (569)
T KOG1242|consen 450 SFDDLIPELSE 460 (569)
T ss_pred cccccccHHHH
Confidence 4 555554433
No 26
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=48.89 E-value=6.1 Score=30.09 Aligned_cols=44 Identities=25% Similarity=0.587 Sum_probs=35.7
Q ss_pred CCCCCCchhhhhcccccccchhHhH-HHhHHHHHHHcCCCceeccc
Q psy7091 99 TLNYQYYLPIFFDGLRETEYPYKFF-ASKGIHDLLENGKDKIVPVL 143 (205)
Q Consensus 99 ~LDy~~yLPiFfdGLrEt~hPY~Fl-A~~G~~dLL~~~~~KIlPvl 143 (205)
..+.+..|.+|.+.|++. ++|.|+ |.+|+..|....++.++|.|
T Consensus 38 ~~~~~~il~l~l~~L~d~-DsyVYL~aI~~L~~La~~~p~~vl~~L 82 (92)
T PF10363_consen 38 VIDIPKILDLFLSQLKDE-DSYVYLNAIKGLAALADRHPDEVLPIL 82 (92)
T ss_pred hhhHHHHHHHHHHHcCCC-CchHHHHHHHHHHHHHHHChHHHHHHH
Confidence 356678899999988775 899998 58999999988888777654
No 27
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=45.60 E-value=48 Score=20.60 Aligned_cols=27 Identities=33% Similarity=0.426 Sum_probs=23.2
Q ss_pred hHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 147 ILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 147 I~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
|.+|-.-|.+.|++|.+.++-+|.-|+
T Consensus 14 i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 14 IPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp HHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 666777888999999999999998874
No 28
>KOG0212|consensus
Probab=44.90 E-value=29 Score=35.31 Aligned_cols=99 Identities=22% Similarity=0.381 Sum_probs=69.4
Q ss_pred eeeeecc---CCCCCCCCchhhhhccc----ccccchhHhHHHhHHHHHHHcCCCceec-ccccchHHHHHhhcCCCHHH
Q psy7091 90 KISWKED---IKTLNYQYYLPIFFDGL----RETEYPYKFFASKGIHDLLENGKDKIVP-VLPHLILPIKNALSTRNPEV 161 (205)
Q Consensus 90 ~L~Wkv~---~e~LDy~~yLPiFfdGL----rEt~hPY~FlA~~G~~dLL~~~~~KIlP-vlPqLI~PiK~AL~Trd~~V 161 (205)
-+.|-.- ...+++-.|||-|+||| .+..+--+-++..-..|.|..-..+=.- =-++.|.-+-.-+.+.++++
T Consensus 187 lv~Wl~~Lds~P~~~m~~yl~~~ldGLf~~LsD~s~eVr~~~~t~l~~fL~eI~s~P~s~d~~~~i~vlv~~l~ss~~~i 266 (675)
T KOG0212|consen 187 LVSWLYVLDSVPDLEMISYLPSLLDGLFNMLSDSSDEVRTLTDTLLSEFLAEIRSSPSSMDYDDMINVLVPHLQSSEPEI 266 (675)
T ss_pred HHHHHHHHhcCCcHHHHhcchHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHhcCccccCcccchhhccccccCCcHHH
Confidence 4677643 34677899999999996 4555566777778888888661111000 12333333346788999999
Q ss_pred HHHHHHHHHHHHhcCCccccchhhHHHhhh
Q psy7091 162 ICETLKVLQHLVTSSSMVGEALVPYYKQIL 191 (205)
Q Consensus 162 ~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLL 191 (205)
-..+|.-||..|. .-|.+++||+-.+|
T Consensus 267 q~~al~Wi~efV~---i~g~~~l~~~s~il 293 (675)
T KOG0212|consen 267 QLKALTWIQEFVK---IPGRDLLLYLSGIL 293 (675)
T ss_pred HHHHHHHHHHHhc---CCCcchhhhhhhhh
Confidence 9999999999984 67899999998754
No 29
>COG4899 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=44.39 E-value=4.7 Score=34.27 Aligned_cols=55 Identities=15% Similarity=0.436 Sum_probs=46.4
Q ss_pred HHHHhcCCCCEEEeeCCCCCeeeee--ccCCCCCCCCchhhhhcccccccchhHhHHHh
Q psy7091 70 RTYYSRGDFPIAMEFKSVGNKISWK--EDIKTLNYQYYLPIFFDGLRETEYPYKFFASK 126 (205)
Q Consensus 70 r~~Y~rGdlP~~i~h~s~~~~L~Wk--v~~e~LDy~~yLPiFfdGLrEt~hPY~FlA~~ 126 (205)
.+.+-|.-|||-|. +...++.|- |+.|.=||..||-.|.||.-|.-++=.|+++.
T Consensus 48 ~r~FIr~~lpIPi~--G~~~~l~yG~WveVe~~~f~~Yl~~yydn~e~~v~~~~~~nrl 104 (166)
T COG4899 48 KRHFIRSNLPIPID--GGKKKLDYGGWVEVEHSDFMTYLNLYYDNVEKKVRIPGFLNRL 104 (166)
T ss_pred cceeeeecccceec--CccceeccceEEEEeHHHHHHHHHHHhcCcccccchhhHhhcc
Confidence 35678889999874 556788887 67888899999999999999999999888875
No 30
>KOG2956|consensus
Probab=43.72 E-value=25 Score=34.89 Aligned_cols=82 Identities=21% Similarity=0.329 Sum_probs=63.7
Q ss_pred ccchhHhHHHhHHHHHHHc-CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccc-cchhhHHHhh---
Q psy7091 116 TEYPYKFFASKGIHDLLEN-GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVG-EALVPYYKQI--- 190 (205)
Q Consensus 116 t~hPY~FlA~~G~~dLL~~-~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG-~ALvPyyRqL--- 190 (205)
-++|-.-++......|.+. ..+-++++||-+++-+=.|-++-.-.|=..+.=.|=.+ -..|| +++-||++||
T Consensus 418 ~D~~~~~~~iKm~Tkl~e~l~~EeL~~ll~diaP~~iqay~S~SS~VRKtaVfCLVam---v~~vG~~~mePhL~~Lt~s 494 (516)
T KOG2956|consen 418 ADEPRAVAVIKMLTKLFERLSAEELLNLLPDIAPCVIQAYDSTSSTVRKTAVFCLVAM---VNRVGMEEMEPHLEQLTSS 494 (516)
T ss_pred CcchHHHHHHHHHHHHHhhcCHHHHHHhhhhhhhHHHHHhcCchHHhhhhHHHhHHHH---HHHHhHHhhhhHhhhccHH
Confidence 5677777777788888877 55889999999999999999987777666655544344 46899 9999999998
Q ss_pred -hhhhhHhhhc
Q psy7091 191 -LPHLNMYKDL 200 (205)
Q Consensus 191 -LPvlNlfk~~ 200 (205)
+-.+++|.++
T Consensus 495 k~~LlqlYinR 505 (516)
T KOG2956|consen 495 KLNLLQLYINR 505 (516)
T ss_pred HHHHHHHHHHH
Confidence 4456677664
No 31
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=41.99 E-value=37 Score=31.32 Aligned_cols=83 Identities=16% Similarity=0.250 Sum_probs=45.0
Q ss_pred CCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccc
Q psy7091 103 QYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEA 182 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~A 182 (205)
+.+++-...=+.-+++-.+=++.-.+..+....++ ++.-.|-.|++.|++.|+.+.+.||+.|..+. .+-+.+.
T Consensus 41 ~~~~~~vi~l~~s~~~~~Krl~yl~l~~~~~~~~~----~~~l~~n~l~kdl~~~n~~~~~lAL~~l~~i~--~~~~~~~ 114 (526)
T PF01602_consen 41 SFLFMEVIKLISSKDLELKRLGYLYLSLYLHEDPE----LLILIINSLQKDLNSPNPYIRGLALRTLSNIR--TPEMAEP 114 (526)
T ss_dssp GSTHHHHHCTCSSSSHHHHHHHHHHHHHHTTTSHH----HHHHHHHHHHHHHCSSSHHHHHHHHHHHHHH---SHHHHHH
T ss_pred chHHHHHHHHhCCCCHHHHHHHHHHHHHHhhcchh----HHHHHHHHHHHhhcCCCHHHHHHHHhhhhhhc--ccchhhH
Confidence 34444444444444444455555555555544444 44446667777777777777777777776654 3344455
Q ss_pred hhhHHHhhh
Q psy7091 183 LVPYYKQIL 191 (205)
Q Consensus 183 LvPyyRqLL 191 (205)
|.|...++|
T Consensus 115 l~~~v~~ll 123 (526)
T PF01602_consen 115 LIPDVIKLL 123 (526)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHh
Confidence 554444433
No 32
>KOG2023|consensus
Probab=41.39 E-value=28 Score=36.20 Aligned_cols=46 Identities=28% Similarity=0.338 Sum_probs=34.1
Q ss_pred hHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhh
Q psy7091 147 ILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLN 195 (205)
Q Consensus 147 I~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlN 195 (205)
..+|.+-|.-.|..|=++|-.+. ++.-+-.|+.|||||-+||.-|-
T Consensus 478 L~~ll~~llD~NK~VQEAAcsAf---AtleE~A~~eLVp~l~~IL~~l~ 523 (885)
T KOG2023|consen 478 LEGLLRRLLDSNKKVQEAACSAF---ATLEEEAGEELVPYLEYILDQLV 523 (885)
T ss_pred HHHHHHHHhcccHHHHHHHHHHH---HHHHHhccchhHHHHHHHHHHHH
Confidence 34555556667888877777664 44578999999999999998654
No 33
>KOG2021|consensus
Probab=40.51 E-value=52 Score=34.78 Aligned_cols=73 Identities=21% Similarity=0.272 Sum_probs=54.2
Q ss_pred cchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhh
Q psy7091 117 EYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLN 195 (205)
Q Consensus 117 ~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlN 195 (205)
.++-|=-.+--++-|+.--|++++|.||++|--| |++-|+.=+..-|-.|.+|. -+--.++-+.+-|+||++-
T Consensus 710 ~E~iRsavrft~hRmI~~lg~~vlPfipklie~l---L~s~d~kEmvdfl~flsQLi---hkfk~~~~~ilnqmlppll 782 (980)
T KOG2021|consen 710 FENIRSAVRFTFHRMIPILGNKVLPFIPKLIELL---LSSTDLKEMVDFLGFLSQLI---HKFKTDCYQILNQMLPPLL 782 (980)
T ss_pred cchhHHHHHHHHHHHHHhcchhhhcchHHHHHHH---HhcCCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence 3455555666677888888999999999998765 77777776666676666664 3444788899999998763
No 34
>PLN03076 ARF guanine nucleotide exchange factor (ARF-GEF); Provisional
Probab=38.79 E-value=26 Score=39.45 Aligned_cols=38 Identities=26% Similarity=0.457 Sum_probs=33.8
Q ss_pred cchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccc
Q psy7091 145 HLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEA 182 (205)
Q Consensus 145 qLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~A 182 (205)
.++.|++.|..|+++.++..+|+.||+|..-+-..|+.
T Consensus 86 ~i~~pl~lac~s~~~ki~~~ALdcl~KLis~~~i~~~~ 123 (1780)
T PLN03076 86 LILSPLINACGTGSAKIVDPALDCIQKLIAHGYLRGEA 123 (1780)
T ss_pred HHHHHHHHHHcCCCccHHHHHHHHHHHHHHHhhcCCCC
Confidence 38899999999999999999999999999877776644
No 35
>PF06953 ArsD: Arsenical resistance operon trans-acting repressor ArsD; InterPro: IPR010712 This family consists of several bacterial arsenical resistance operon trans-acting repressor ArsD proteins. ArsD is a trans-acting repressor of the arsRDABC operon that confers resistance to arsenicals and antimonials in Escherichia coli. It possesses two-pairs of vicinal cysteine residues, Cys(12)-Cys(13) and Cys(112)-Cys(113), that potentially form separate binding sites for the metalloids that trigger dissociation of ArsD from the operon. However, as a homodimer it has four vicinal cysteine pairs [].; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent, 0046685 response to arsenic-containing substance; PDB: 3MWH_A 3KGK_A 3KTB_B.
Probab=38.31 E-value=14 Score=29.81 Aligned_cols=28 Identities=18% Similarity=0.451 Sum_probs=23.0
Q ss_pred ccchhHhHHHhHHHHHHHcCCCceeccc
Q psy7091 116 TEYPYKFFASKGIHDLLENGKDKIVPVL 143 (205)
Q Consensus 116 t~hPY~FlA~~G~~dLL~~~~~KIlPvl 143 (205)
+++|-.|+.-+.+.++|+..|..+||++
T Consensus 49 ~~~P~aF~~n~~V~~~L~~~G~e~LPit 76 (123)
T PF06953_consen 49 AQNPQAFVENPEVNQLLQTEGAEALPIT 76 (123)
T ss_dssp TT-TTHHHHSHHHHHHHHHH-GGG-SEE
T ss_pred ccCHHHHHhCHHHHHHHHHcCcccCCEE
Confidence 6799999999999999999999999975
No 36
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=37.04 E-value=1.1e+02 Score=23.09 Aligned_cols=70 Identities=19% Similarity=0.183 Sum_probs=48.4
Q ss_pred chhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCc-cccchhhHHH
Q psy7091 118 YPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSM-VGEALVPYYK 188 (205)
Q Consensus 118 hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~-VG~ALvPyyR 188 (205)
-|-|--|-.....|++... -.+--+|+++.=+...|...|+=|.-++.+.|-.|+...+- |=+.|+-.|.
T Consensus 17 ~PvRa~gL~~L~~Li~~~~-~~~~~~~~il~l~l~~L~d~DsyVYL~aI~~L~~La~~~p~~vl~~L~~~y~ 87 (92)
T PF10363_consen 17 PPVRAHGLVLLRKLIESKS-EPVIDIPKILDLFLSQLKDEDSYVYLNAIKGLAALADRHPDEVLPILLDEYA 87 (92)
T ss_pred cchHHHHHHHHHHHHHcCC-cchhhHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHHHChHHHHHHHHHHHh
Confidence 3444445555666676655 33444688888889999999999999999998888765543 5555555553
No 37
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=36.13 E-value=56 Score=23.09 Aligned_cols=70 Identities=16% Similarity=0.126 Sum_probs=46.1
Q ss_pred CchhhhhcccccccchhHhHHHhHHHHHHHcCCCce-ecccccchHHHHHhhcCCCHHHHHHHHHHHHHHH
Q psy7091 104 YYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKI-VPVLPHLILPIKNALSTRNPEVICETLKVLQHLV 173 (205)
Q Consensus 104 ~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KI-lPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV 173 (205)
..+|.+.+=|...++.-+.-|...+..|....+... .-+-..++.-+-..|++.|.++...++.+|..|+
T Consensus 49 ~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~~~~~~~~~~~~g~l~~l~~~l~~~~~~~~~~a~~~l~~l~ 119 (120)
T cd00020 49 GGLPALVQLLKSEDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119 (120)
T ss_pred CChHHHHHHHhCCCHHHHHHHHHHHHHHccCcHHHHHHHHHCCChHHHHHHHhcCCHHHHHHHHHHHHHhh
Confidence 556666666666555566666666666665543221 1222346777778888889999999999988775
No 38
>PF03378 CAS_CSE1: CAS/CSE protein, C-terminus; InterPro: IPR005043 Mammalian cellular apoptosis susceptibility (CAS) proteins and the yeast chromosome-segregation protein, CSE1 are homologous []. CAS is involved in both cellular apoptosis and proliferation [, ]. Apoptosis is inhibited in CAS-depleted cells, while the expression of CAS correlates to the degree of cellular proliferation. Like CSE1, it is essential for the mitotic checkpoint in the cell cycle (CAS depletion blocks the cell in the G2 phase), and has been shown to be associated with the microtubule network and the mitotic spindle [], as is the protein MEK, which is thought to regulate the intracellular localization (predominantly nuclear vs. predominantly cytosolic) of CAS. In the nucleus, CAS acts as a nuclear transport factor in the importin pathway []. The importin pathway mediates the nuclear transport of several proteins that are necessary for mitosis and further progression. CAS is therefore thought to affect the cell cycle through its effect on the nuclear transport of these proteins []. Since apoptosis also requires the nuclear import of several proteins (such as P53 and transcription factors), it has been suggested that CAS also enables apoptosis by facilitating the nuclear import of at least a subset of these essential proteins []. This entry represents the C-terminal portion of these proteins. Structural studies of the yeast CSE1 protein indicate that this domain binds to both the transport-orchestrating protein RanGTP and the cargo molecule that is being exported [].; GO: 0005515 protein binding; PDB: 1Z3H_B 1WA5_C.
Probab=35.97 E-value=29 Score=33.28 Aligned_cols=56 Identities=23% Similarity=0.331 Sum_probs=42.6
Q ss_pred ceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhh
Q psy7091 138 KIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLN 195 (205)
Q Consensus 138 KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlN 195 (205)
-+--+-..|.+++..-|+..=.|.+--++++|..|+..++ ++.+-+.|.+|+|+|-
T Consensus 110 ~v~~~E~~L~P~f~~ILq~dV~EF~PYvfQIla~Lle~~~--~~~~p~~y~~L~~~Ll 165 (435)
T PF03378_consen 110 AVSQFEEALFPPFQEILQQDVQEFIPYVFQILAQLLELRP--SSPLPDAYKQLFPPLL 165 (435)
T ss_dssp ---HHHHHHHHHHHHHHHTT-TTTHHHHHHHHHHHHHHSS----S--TTTGGGHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC--CCCCcHHHHHHHHHHc
Confidence 3445667788899999998888999999999999999887 8899999999999874
No 39
>PF04118 Dopey_N: Dopey, N-terminal; InterPro: IPR007249 DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organisation of multicellular structures in the filamentous fungus Emericella nidulans (Aspergillus nidulans). DopA homologues are found in mammals. Saccharomyces cerevisiae DOP1 is essential for viability and, affects cellular morphogenesis [].
Probab=34.76 E-value=28 Score=32.03 Aligned_cols=77 Identities=30% Similarity=0.394 Sum_probs=54.0
Q ss_pred CCCchhhhhcccccccchhHhHHHh-HHHHHHHc----CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcC
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASK-GIHDLLEN----GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSS 176 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~-G~~dLL~~----~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~ 176 (205)
+..-|+++.-||-..- +|.-+... -+-|+++. -+..+.|+++-+|..|=.+|...+-|++..+++.|..+ .
T Consensus 91 L~~dl~i~~~GLfpl~-~~asi~Vkp~lL~i~e~~~lpL~~~L~p~l~~li~slLpGLede~sE~~~~~~~ll~~l---~ 166 (307)
T PF04118_consen 91 LAQDLPIYSPGLFPLF-SYASIQVKPQLLDIYEKYYLPLGPALRPCLKGLILSLLPGLEDEGSEFFDRTLKLLDKL---K 166 (307)
T ss_pred HHhhcHHHHHHHHHHH-HHHHHhhHHHHHHHHHHHhcCccHHHHHHHHHHHHHhccccccCCchHHHHHHHHHHHH---H
Confidence 3455666666664321 22222222 34455555 33499999999999999999999999999999999888 4
Q ss_pred Cccccc
Q psy7091 177 SMVGEA 182 (205)
Q Consensus 177 ~~VG~A 182 (205)
..||+.
T Consensus 167 ~~v~~~ 172 (307)
T PF04118_consen 167 EAVGDK 172 (307)
T ss_pred HhcChh
Confidence 668887
No 40
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=31.42 E-value=73 Score=25.79 Aligned_cols=31 Identities=26% Similarity=0.343 Sum_probs=28.4
Q ss_pred cchHHHHHhhcCCCHHHHHHHHHHHHHHHhc
Q psy7091 145 HLILPIKNALSTRNPEVICETLKVLQHLVTS 175 (205)
Q Consensus 145 qLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~ 175 (205)
..+..||+=|+++|+.|...||.+|..+|.-
T Consensus 41 ~a~ral~krl~~~n~~vql~AL~LLe~~vkN 71 (142)
T cd03569 41 YAMRALKKRLLSKNPNVQLYALLLLESCVKN 71 (142)
T ss_pred HHHHHHHHHHcCCChHHHHHHHHHHHHHHHH
Confidence 4678899999999999999999999999875
No 41
>cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries.
Probab=30.91 E-value=80 Score=25.72 Aligned_cols=32 Identities=19% Similarity=0.326 Sum_probs=28.4
Q ss_pred cchHHHHHhhcCCCHHHHHHHHHHHHHHHhcC
Q psy7091 145 HLILPIKNALSTRNPEVICETLKVLQHLVTSS 176 (205)
Q Consensus 145 qLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~ 176 (205)
..+..||+=|+++|+.|...||.+|..+|.-+
T Consensus 37 ~a~ral~KRl~~~n~~v~l~AL~LLe~~vkNC 68 (144)
T cd03568 37 DCLKAIMKRLNHKDPNVQLRALTLLDACAENC 68 (144)
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHC
Confidence 46788999999999999999999999998753
No 42
>PF09324 DUF1981: Domain of unknown function (DUF1981); InterPro: IPR015403 This domain is functionally uncharacterised and found in various plant and yeast protein transport proteins. It is noramlly associated with and C-termianl to the SEC7 domain. The SEC7 domain was named after the first protein found to contain such a region []. It has been shown to be linked with guanine nucleotide exchange function [, ].
Probab=30.44 E-value=51 Score=24.46 Aligned_cols=51 Identities=10% Similarity=0.162 Sum_probs=43.1
Q ss_pred HhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHH
Q psy7091 121 KFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQH 171 (205)
Q Consensus 121 ~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~ 171 (205)
+=+..+.+..|++..+++|-.--+.++.-|+.|.+..+.+++..|.+.++.
T Consensus 35 re~il~ci~qil~~~~~~i~SGW~~if~il~~aa~~~~e~lv~~af~~~~~ 85 (86)
T PF09324_consen 35 RELILECILQILQSRGENIKSGWKVIFSILRAAAKDNDESLVRLAFQIVQL 85 (86)
T ss_pred HHHHHHHHHHHHHHhHHHHHhccHHHHHHHHHHHhCCCccHHHHHHHHHhh
Confidence 334466788888888888888889999999999999999999999998864
No 43
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=29.88 E-value=82 Score=25.66 Aligned_cols=30 Identities=10% Similarity=0.193 Sum_probs=27.2
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHHHhc
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHLVTS 175 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~ 175 (205)
-+..||+=|+++|+.|...||.+|..+|+-
T Consensus 39 a~rai~krl~~~n~~v~l~AL~LLe~~vkN 68 (139)
T cd03567 39 AVRLLAHKIQSPQEKEALQALTVLEACMKN 68 (139)
T ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence 577899999999999999999999999873
No 44
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=29.64 E-value=88 Score=23.58 Aligned_cols=45 Identities=22% Similarity=0.324 Sum_probs=34.6
Q ss_pred cccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhh
Q psy7091 143 LPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQI 190 (205)
Q Consensus 143 lPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqL 190 (205)
....+.-|+.-|+++|+.++..||.+|-.|+.- .|+....+.+.-
T Consensus 35 ~~~~~~~l~kRl~~~~~~~~lkaL~lLe~lvkN---~g~~f~~~i~~~ 79 (115)
T cd00197 35 PKEAVDAIKKRINNKNPHVVLKALTLLEYCVKN---CGERFHQEVASN 79 (115)
T ss_pred HHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHH---ccHHHHHHHHHh
Confidence 345778899999999999999999999999864 455555544443
No 45
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=28.82 E-value=1.5e+02 Score=23.38 Aligned_cols=48 Identities=25% Similarity=0.337 Sum_probs=35.8
Q ss_pred HhHHHHHHHcCCCceeccccc-chHHHHHhhcCCCHHHHHHHHHHHHHHHhcCC
Q psy7091 125 SKGIHDLLENGKDKIVPVLPH-LILPIKNALSTRNPEVICETLKVLQHLVTSSS 177 (205)
Q Consensus 125 ~~G~~dLL~~~~~KIlPvlPq-LI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~ 177 (205)
...+-|++... +.-|+ .+..||.=|+++|+.+...||.+|..++.-+.
T Consensus 21 il~icd~I~~~-----~~~~k~a~raL~krl~~~n~~vql~AL~lLd~~vkNcg 69 (133)
T cd03561 21 NLELCDLINLK-----PNGPKEAARAIRKKIKYGNPHVQLLALTLLELLVKNCG 69 (133)
T ss_pred HHHHHHHHhCC-----CCCHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHhCC
Confidence 34566666554 23332 46789999999999999999999999987543
No 46
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=28.25 E-value=99 Score=21.21 Aligned_cols=39 Identities=21% Similarity=0.352 Sum_probs=27.8
Q ss_pred hHHHHHhh-cCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhh
Q psy7091 147 ILPIKNAL-STRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQIL 191 (205)
Q Consensus 147 I~PiK~AL-~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLL 191 (205)
|..|-..| ++.|+.|-..++++|. ..-.+..+|.+.++|
T Consensus 1 i~~L~~~l~~~~~~~vr~~a~~~L~------~~~~~~~~~~L~~~l 40 (88)
T PF13646_consen 1 IPALLQLLQNDPDPQVRAEAARALG------ELGDPEAIPALIELL 40 (88)
T ss_dssp HHHHHHHHHTSSSHHHHHHHHHHHH------CCTHHHHHHHHHHHH
T ss_pred CHHHHHHHhcCCCHHHHHHHHHHHH------HcCCHhHHHHHHHHH
Confidence 44566777 8899999998888764 333556777776665
No 47
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=27.69 E-value=99 Score=24.49 Aligned_cols=49 Identities=24% Similarity=0.312 Sum_probs=35.7
Q ss_pred HHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcC
Q psy7091 124 ASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSS 176 (205)
Q Consensus 124 A~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~ 176 (205)
+...+-|++.....-. --.+..||+-|.++|+.+...||.+|..+|.-+
T Consensus 25 ~~l~icD~i~~~~~~~----kea~~~l~krl~~~~~~vq~~aL~lld~lvkNc 73 (140)
T PF00790_consen 25 LILEICDLINSSPDGA----KEAARALRKRLKHGNPNVQLLALTLLDALVKNC 73 (140)
T ss_dssp HHHHHHHHHHTSTTHH----HHHHHHHHHHHTTSSHHHHHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHcCCccH----HHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHcC
Confidence 3345667776542111 235678999999999999999999999998743
No 48
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=26.67 E-value=1.2e+02 Score=29.13 Aligned_cols=75 Identities=19% Similarity=0.266 Sum_probs=52.6
Q ss_pred CCCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceeccc--ccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCC
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVL--PHLILPIKNALSTRNPEVICETLKVLQHLVTSSS 177 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvl--PqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~ 177 (205)
...|.+.+..||..-..--+-+|-.-+..++.+... .+..+ ..++.-|=.+|...|.+|-..+.++|..|+....
T Consensus 75 ~~~~~~~L~~gL~h~~~~Vr~l~l~~l~~~~~~~~~-~~~~~~~~~l~~~i~~~L~~~d~~Va~~A~~~L~~l~~~~~ 151 (503)
T PF10508_consen 75 LPQYQPFLQRGLTHPSPKVRRLALKQLGRIARHSEG-AAQLLVDNELLPLIIQCLRDPDLSVAKAAIKALKKLASHPE 151 (503)
T ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHHHhcCCHH-HHHHhcCccHHHHHHHHHcCCcHHHHHHHHHHHHHHhCCch
Confidence 567889999999865554555554444444443222 22222 5677778899999999999999999999987543
No 49
>smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
Probab=26.62 E-value=1e+02 Score=24.39 Aligned_cols=30 Identities=27% Similarity=0.393 Sum_probs=26.9
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHHHhc
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHLVTS 175 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~ 175 (205)
.+..||.=|+++|+.+...||.+|..+|.-
T Consensus 38 a~r~l~krl~~~n~~v~l~AL~lLe~~vkN 67 (133)
T smart00288 38 AVRLLKKRLNNKNPHVALLALTLLDACVKN 67 (133)
T ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence 356799999999999999999999999875
No 50
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=26.47 E-value=70 Score=25.86 Aligned_cols=23 Identities=30% Similarity=0.498 Sum_probs=19.4
Q ss_pred HhhcCCCHHHHHHHHHHHHHHHh
Q psy7091 152 NALSTRNPEVICETLKVLQHLVT 174 (205)
Q Consensus 152 ~AL~Trd~~V~~~~L~~Lq~LV~ 174 (205)
.-+++.|++|--.||.++|+|+.
T Consensus 93 ~Lm~h~d~eVr~eAL~avQklm~ 115 (119)
T PF11698_consen 93 ELMNHEDPEVRYEALLAVQKLMV 115 (119)
T ss_dssp HHTS-SSHHHHHHHHHHHHHHHH
T ss_pred HHhcCCCHHHHHHHHHHHHHHHH
Confidence 45678999999999999999975
No 51
>PF12397 U3snoRNP10: U3 small nucleolar RNA-associated protein 10 ; InterPro: IPR022125 This domain family is found in eukaryotes, and is approximately 120 amino acids in length. The family is found in association with PF08146 from PFAM. This family is the protein associated with U3 snoRNA which is involved in the processing of pre-rRNA.
Probab=26.18 E-value=1.2e+02 Score=23.11 Aligned_cols=52 Identities=21% Similarity=0.399 Sum_probs=42.8
Q ss_pred ccccchHHHHHhhc-CCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhh
Q psy7091 142 VLPHLILPIKNALS-TRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPH 193 (205)
Q Consensus 142 vlPqLI~PiK~AL~-Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPv 193 (205)
+++++.+-+-.+|. +..+|....+.=++-.|+...++--+.+..-.+.++..
T Consensus 3 ~l~~lLP~l~~~L~~s~~~d~~~a~ymIl~~La~k~~L~~~~l~~l~~~i~~~ 55 (121)
T PF12397_consen 3 ILPRLLPFLLKGLKSSSSPDLQAAAYMILSVLASKVPLSDEVLNALMESILKN 55 (121)
T ss_pred HHHHHHHHHHHHHccCCcHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHhc
Confidence 46788888899999 88999999999999999988887777777666666543
No 52
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=25.80 E-value=1.3e+02 Score=28.89 Aligned_cols=36 Identities=25% Similarity=0.290 Sum_probs=33.3
Q ss_pred cccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCc
Q psy7091 143 LPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSM 178 (205)
Q Consensus 143 lPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~ 178 (205)
+|+...-|..+|.+.++.|-.-+|+.|++++..++.
T Consensus 75 ~~~~~~~L~~gL~h~~~~Vr~l~l~~l~~~~~~~~~ 110 (503)
T PF10508_consen 75 LPQYQPFLQRGLTHPSPKVRRLALKQLGRIARHSEG 110 (503)
T ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHHHhcCCHH
Confidence 999999999999999999999999999999876654
No 53
>KOG1062|consensus
Probab=24.29 E-value=50 Score=34.78 Aligned_cols=53 Identities=17% Similarity=0.531 Sum_probs=43.5
Q ss_pred ceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhh
Q psy7091 138 KIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHL 194 (205)
Q Consensus 138 KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvl 194 (205)
|+-...--.+++-|+.|..+|+.|+.+++..++.|.. +++..+-|||.+.|-|
T Consensus 172 K~P~l~e~f~~~~~~lL~ek~hGVL~~~l~l~~e~c~----~~~~~l~~fr~l~~~l 224 (866)
T KOG1062|consen 172 KVPDLVEHFVIAFRKLLCEKHHGVLIAGLHLITELCK----ISPDALSYFRDLVPSL 224 (866)
T ss_pred cCchHHHHhhHHHHHHHhhcCCceeeeHHHHHHHHHh----cCHHHHHHHHHHHHHH
Confidence 3334455567899999999999999999999999974 5668899999988755
No 54
>KOG4653|consensus
Probab=24.02 E-value=75 Score=33.88 Aligned_cols=83 Identities=18% Similarity=0.256 Sum_probs=56.8
Q ss_pred CCCchhhhhcccccccchhHhHHHhHHHHHHHc----CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCC
Q psy7091 102 YQYYLPIFFDGLRETEYPYKFFASKGIHDLLEN----GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSS 177 (205)
Q Consensus 102 y~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~----~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~ 177 (205)
|+.++=+|.-|+||-+|=+|.=+-.-+-+|++. +++-..-|+-.++.-++. .+.+-+=.+|..+|..|. .
T Consensus 845 ~~~Li~tfl~gvrepd~~~RaSS~a~lg~Lcq~~a~~vsd~~~ev~~~Il~l~~~---d~s~~vRRaAv~li~~lL---~ 918 (982)
T KOG4653|consen 845 KAVLINTFLSGVREPDHEFRASSLANLGQLCQLLAFQVSDFFHEVLQLILSLETT---DGSVLVRRAAVHLLAELL---N 918 (982)
T ss_pred HHHHHHHHHHhcCCchHHHHHhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHcc---CCchhhHHHHHHHHHHHH---h
Confidence 366778999999999999988877777777766 345444444444443333 344555567787777765 5
Q ss_pred ccccchhhHHHhh
Q psy7091 178 MVGEALVPYYKQI 190 (205)
Q Consensus 178 ~VG~ALvPyyRqL 190 (205)
+.|+-|.|.||-.
T Consensus 919 ~tg~dlLpilr~~ 931 (982)
T KOG4653|consen 919 GTGEDLLPILRLL 931 (982)
T ss_pred ccchhhHHHHHHH
Confidence 6788888877644
No 55
>PF11865 DUF3385: Domain of unknown function (DUF3385); InterPro: IPR024585 This uncharacterised domain is is typically between 160 to 172 amino acids in length. It is found in the phosphatidylinositol kinase-related protein kinases TOR (target of rapamycin). In Saccharomyces cerevisiae the TOR proteins, TOR1 and TOR2, regulate growth in a rapamycin-sensitive manner [].
Probab=23.13 E-value=1.5e+02 Score=24.33 Aligned_cols=70 Identities=19% Similarity=0.386 Sum_probs=45.5
Q ss_pred hhcccc-cccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHH
Q psy7091 109 FFDGLR-ETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYY 187 (205)
Q Consensus 109 FfdGLr-Et~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyy 187 (205)
+..|.. ..++=|--+|......+|+. ..|.+-...++.+.+.+++. .|..-|||+
T Consensus 71 ~~~~~~~~~ee~y~~vvi~~L~~iL~D-----------------~sLs~~h~~vv~ai~~If~~-------l~~~cv~~L 126 (160)
T PF11865_consen 71 PMMGISPSSEEYYPTVVINALMRILRD-----------------PSLSSHHTAVVQAIMYIFKS-------LGLKCVPYL 126 (160)
T ss_pred hhccCCCchHHHHHHHHHHHHHHHHHh-----------------hhhHHHHHHHHHHHHHHHHh-------cCcCchhHH
Confidence 345554 57777888888888777754 34555555566666666533 455569999
Q ss_pred Hhhhhhh-hHhhhccc
Q psy7091 188 KQILPHL-NMYKDLNN 202 (205)
Q Consensus 188 RqLLPvl-Nlfk~~n~ 202 (205)
.|++|.+ +..++...
T Consensus 127 ~~viP~~l~~i~~~~~ 142 (160)
T PF11865_consen 127 PQVIPIFLRVIRTCPD 142 (160)
T ss_pred HHHhHHHHHHHHhCCH
Confidence 9999965 45554443
No 56
>cd08819 CARD_MDA5_2 Caspase activation and recruitment domain found in MDA5, second repeat. Caspase activation and recruitment domain (CARD) found in MDA5 (melanoma-differentiation-associated gene 5), second repeat. MDA5, also known as IFIH1, contains two N-terminal CARD domains and a C-terminal RNA helicase domain. MDA5 is a cytoplasmic DEAD box RNA helicase that plays an important role in host antiviral response by sensing incoming viral RNA. Upon activation, the signal is transferred to downstream pathways via the adaptor molecule IPS-1 (MAVS, VISA, CARDIF), leading to the induction of type I interferons. Although very similar in sequence, MDA5 recognizes different sets of viruses compared to RIG-I, a related RNA helicase. MDA5 associates with IPS-1 through a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protei
Probab=23.03 E-value=22 Score=27.55 Aligned_cols=16 Identities=38% Similarity=0.598 Sum_probs=14.5
Q ss_pred CCchhhhhcccccccc
Q psy7091 103 QYYLPIFFDGLRETEY 118 (205)
Q Consensus 103 ~~yLPiFfdGLrEt~h 118 (205)
..+.|.|.++||||+|
T Consensus 66 ~~aF~~Fl~aLreT~~ 81 (88)
T cd08819 66 EGWFSKFLQALRETEH 81 (88)
T ss_pred CcHHHHHHHHHHHcCc
Confidence 5689999999999987
No 57
>PF13251 DUF4042: Domain of unknown function (DUF4042)
Probab=22.78 E-value=2e+02 Score=24.55 Aligned_cols=65 Identities=18% Similarity=0.255 Sum_probs=47.9
Q ss_pred ccccchhHhHHHhHHHHHHHcCC-Cce-ecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCc
Q psy7091 114 RETEYPYKFFASKGIHDLLENGK-DKI-VPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSM 178 (205)
Q Consensus 114 rEt~hPY~FlA~~G~~dLL~~~~-~KI-lPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~ 178 (205)
+|+..+--.-.-+.+.-|++..+ +++ -.+++.+|..+|.-+.++|++|-..+|-++-.|+...+.
T Consensus 112 ~E~~~~~l~q~lK~la~Lv~~tPY~rL~~~ll~~~v~~v~~~l~~~d~~v~v~~l~~~~~l~s~~~~ 178 (182)
T PF13251_consen 112 AEKSPPVLTQLLKCLAVLVQATPYHRLPPGLLTEVVTQVRPLLRHRDPNVRVAALSCLGALLSVQPP 178 (182)
T ss_pred cccccHHHHHHHHHHHHHHccCChhhcCHhHHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHcCCCC
Confidence 56666655555555566666644 222 368899999999999999999999999999888766543
No 58
>KOG2149|consensus
Probab=22.21 E-value=1.4e+02 Score=28.80 Aligned_cols=89 Identities=21% Similarity=0.312 Sum_probs=73.2
Q ss_pred hhcccccccchhHhHHHhHHHHHHHc-----CCCceecccccchHHHHHhhcCCCHHHHHHHHHHHHHHHhcCCccccch
Q psy7091 109 FFDGLRETEYPYKFFASKGIHDLLEN-----GKDKIVPVLPHLILPIKNALSTRNPEVICETLKVLQHLVTSSSMVGEAL 183 (205)
Q Consensus 109 FfdGLrEt~hPY~FlA~~G~~dLL~~-----~~~KIlPvlPqLI~PiK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~AL 183 (205)
.+.+++|.-+-=.+.++.+..++++. +.+.+=|.++-+..=|..|+-+--++|=...++.|..|+ ...++.+
T Consensus 100 ll~~~~~~i~D~~~~vR~~~~qll~~~i~~~~~e~~sp~~~l~~~yi~~AMThit~~i~~dslkfL~~Ll---~~~~p~~ 176 (393)
T KOG2149|consen 100 LLQKLRELILDDDSLVRDALYQLLDSLILPACKEDQSPMVSLLMPYISSAMTHITPEIQEDSLKFLSLLL---ERYPDTF 176 (393)
T ss_pred HHHHhhhhhcCccccHHHHHHHHHHHHHhhcchhhhcchHHHHHHHHHHHHhhccHHHHHhhHHHHHHHH---HHcChHH
Confidence 46788888888888999999999976 344477888999999999999999999999999999886 6788888
Q ss_pred hhHHHhhhhhhhHhhhc
Q psy7091 184 VPYYKQILPHLNMYKDL 200 (205)
Q Consensus 184 vPyyRqLLPvlNlfk~~ 200 (205)
.-|--+||.-++=+++.
T Consensus 177 ~~~~~~il~n~~d~i~~ 193 (393)
T KOG2149|consen 177 SRYASKILENFKDVISK 193 (393)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 88888877666655543
No 59
>PRK15356 type III secretion system protein SsaH; Provisional
Probab=22.09 E-value=44 Score=25.36 Aligned_cols=12 Identities=17% Similarity=0.492 Sum_probs=10.5
Q ss_pred CceecccccchH
Q psy7091 137 DKIVPVLPHLIL 148 (205)
Q Consensus 137 ~KIlPvlPqLI~ 148 (205)
.-|++++||||+
T Consensus 14 ~aiLnvlPqLIp 25 (75)
T PRK15356 14 HAMLPALTVIVP 25 (75)
T ss_pred HHHHHhhhhhcC
Confidence 568999999996
No 60
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=21.73 E-value=1.3e+02 Score=24.28 Aligned_cols=46 Identities=15% Similarity=0.265 Sum_probs=32.8
Q ss_pred HHHhhcCCCHHHHHHHHHHHHHHHhcCCccccchhhHHHhhhhhhhHhh
Q psy7091 150 IKNALSTRNPEVICETLKVLQHLVTSSSMVGEALVPYYKQILPHLNMYK 198 (205)
Q Consensus 150 iK~AL~Trd~~V~~~~L~~Lq~LV~~~~~VG~ALvPyyRqLLPvlNlfk 198 (205)
|=+=|+.+++-|-..+|++|..|+.-| .+..+-..++-+.+...++
T Consensus 43 L~kRL~~~~~hVK~K~Lrilk~l~~~G---~~~f~~~~~~~~~~Ik~~~ 88 (122)
T cd03572 43 LLKRLKRSSPHVKLKVLKIIKHLCEKG---NSDFKRELQRNSAQIRECA 88 (122)
T ss_pred HHHHhcCCCCcchHHHHHHHHHHHhhC---CHHHHHHHHHhHHHHHHHH
Confidence 346688899999999999999998764 3556666665555544433
No 61
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=20.51 E-value=1.1e+02 Score=28.19 Aligned_cols=27 Identities=33% Similarity=0.618 Sum_probs=14.2
Q ss_pred chHHHHHhhcCCCHHHHHHHHHHHHHH
Q psy7091 146 LILPIKNALSTRNPEVICETLKVLQHL 172 (205)
Q Consensus 146 LI~PiK~AL~Trd~~V~~~~L~~Lq~L 172 (205)
++..|..+|...|+.|...|+.++..+
T Consensus 153 ~~~~l~~lL~d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 153 LIPKLKQLLSDKDPSVVSAALSLLSEI 179 (526)
T ss_dssp HHHHHHHHTTHSSHHHHHHHHHHHHHH
T ss_pred HHHHHhhhccCCcchhHHHHHHHHHHH
Confidence 345555555555555555555555444
No 62
>KOG1824|consensus
Probab=20.37 E-value=1.2e+02 Score=32.95 Aligned_cols=92 Identities=20% Similarity=0.317 Sum_probs=68.3
Q ss_pred CCCCCCCCchhhhhcccccccchhHhHHHhHHHHHHHcCCCceecccccchHHHHHhhcCC------------CHHHHHH
Q psy7091 97 IKTLNYQYYLPIFFDGLRETEYPYKFFASKGIHDLLENGKDKIVPVLPHLILPIKNALSTR------------NPEVICE 164 (205)
Q Consensus 97 ~e~LDy~~yLPiFfdGLrEt~hPY~FlA~~G~~dLL~~~~~KIlPvlPqLI~PiK~AL~Tr------------d~~V~~~ 164 (205)
.+++|..+||-..-+||..- |--+.+..--..-|-...+..|+-.+=+++-|+|+-++.| ..|....
T Consensus 1098 ld~~dit~Fl~~~~~GL~Dh-ydiKmlt~l~l~rLa~lcPs~VlqrlD~l~EpLr~t~~~k~k~~svKqE~ek~~eLkRS 1176 (1233)
T KOG1824|consen 1098 LDRLDITEFLNHVEDGLEDH-YDIKMLTFLMLARLADLCPSAVLQRLDRLVEPLRKTCTLKVKANSVKQEFEKQDELKRS 1176 (1233)
T ss_pred hhhccHHHHHHHHHhhcchh-hHHHHHHHHHHHHHHhhCcHHHHHHHHHHHHHHHHHhhcccccchHhHhHHHHHHHHHH
Confidence 56778888999888898754 5555555444444445589999999999999999999875 5688899
Q ss_pred HHHHHHHHHhcCCcc--ccchhhHHHhh
Q psy7091 165 TLKVLQHLVTSSSMV--GEALVPYYKQI 190 (205)
Q Consensus 165 ~L~~Lq~LV~~~~~V--G~ALvPyyRqL 190 (205)
+|+++-.|.+. +.| -+.+..++.||
T Consensus 1177 AlRav~~L~~i-p~v~~np~~~df~sqi 1203 (1233)
T KOG1824|consen 1177 ALRAVAALLTI-PEVEKNPQFSDFESQI 1203 (1233)
T ss_pred HHHHHHHHhcc-cccccChHHHHHHHHh
Confidence 99999888554 333 35566677766
Done!