Query psy4488
Match_columns 194
No_of_seqs 214 out of 520
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 18:23:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4488.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4488hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2056|consensus 100.0 3.5E-40 7.6E-45 294.6 10.4 108 86-193 3-110 (336)
2 cd03571 ENTH_epsin ENTH domain 100.0 8.5E-39 1.8E-43 251.2 9.3 90 104-193 1-90 (123)
3 PF01417 ENTH: ENTH domain; I 100.0 4.8E-34 1E-38 222.1 10.3 92 102-193 1-93 (125)
4 KOG2057|consensus 100.0 7.7E-32 1.7E-36 240.9 9.0 107 86-192 5-114 (499)
5 smart00273 ENTH Epsin N-termin 99.9 9E-27 1.9E-31 181.4 9.5 90 103-193 1-91 (127)
6 KOG2056|consensus 99.8 1.4E-21 3.1E-26 175.1 6.6 68 1-68 94-162 (336)
7 cd03572 ENTH_epsin_related ENT 99.8 4.2E-20 9E-25 145.3 7.4 82 108-192 5-88 (122)
8 cd00197 VHS_ENTH_ANTH VHS, ENT 99.8 2.8E-19 6E-24 136.6 8.4 86 105-192 1-87 (115)
9 KOG2057|consensus 99.7 5.5E-18 1.2E-22 152.3 4.7 77 1-77 99-177 (499)
10 cd03564 ANTH_AP180_CALM ANTH d 99.7 3.6E-17 7.9E-22 126.1 8.6 84 105-190 1-85 (117)
11 cd03571 ENTH_epsin ENTH domain 99.6 5.3E-16 1.2E-20 122.0 3.4 49 1-49 74-123 (123)
12 PF01417 ENTH: ENTH domain; I 99.2 2.3E-11 5E-16 94.5 4.2 48 1-48 77-125 (125)
13 smart00273 ENTH Epsin N-termin 99.0 3.2E-10 7E-15 88.2 3.1 52 1-52 75-127 (127)
14 PF07651 ANTH: ANTH domain; I 98.9 1.1E-08 2.3E-13 88.4 8.8 87 103-190 2-88 (280)
15 KOG0251|consensus 98.5 4.1E-07 8.9E-12 86.1 8.0 84 101-185 20-104 (491)
16 cd03572 ENTH_epsin_related ENT 98.0 1.9E-06 4.1E-11 68.0 1.8 45 1-45 73-120 (122)
17 PF00790 VHS: VHS domain; Int 97.8 0.00012 2.5E-09 58.0 8.6 78 101-180 2-80 (140)
18 cd03565 VHS_Tom1 VHS domain fa 97.5 0.00057 1.2E-08 54.7 8.6 76 104-180 1-77 (141)
19 smart00288 VHS Domain present 97.5 0.00079 1.7E-08 53.1 8.9 73 106-180 2-75 (133)
20 cd03568 VHS_STAM VHS domain fa 97.4 0.0012 2.7E-08 53.1 8.9 74 106-181 2-76 (144)
21 cd03569 VHS_Hrs_Vps27p VHS dom 97.4 0.0013 2.8E-08 52.8 8.5 77 102-180 2-79 (142)
22 cd03561 VHS VHS domain family; 97.2 0.0024 5.2E-08 50.2 8.8 75 106-182 2-77 (133)
23 cd03567 VHS_GGA VHS domain fam 97.2 0.0023 4.9E-08 51.4 8.3 74 105-180 2-76 (139)
24 cd00197 VHS_ENTH_ANTH VHS, ENT 96.3 0.0034 7.3E-08 47.6 2.7 42 2-44 73-115 (115)
25 KOG1087|consensus 93.7 0.26 5.6E-06 47.0 7.7 72 105-178 2-74 (470)
26 KOG2199|consensus 90.3 1.4 3E-05 41.6 7.9 78 102-181 6-84 (462)
27 cd03564 ANTH_AP180_CALM ANTH d 86.4 0.49 1.1E-05 36.2 2.0 41 1-42 72-114 (117)
28 KOG3208|consensus 81.0 11 0.00025 32.8 8.3 57 66-131 35-94 (231)
29 KOG0980|consensus 77.3 3.5 7.5E-05 42.3 4.6 84 107-192 9-92 (980)
30 KOG1086|consensus 76.5 6.5 0.00014 37.8 5.9 30 152-182 56-85 (594)
31 PF01418 HTH_6: Helix-turn-hel 54.5 20 0.00043 25.3 3.6 61 89-152 1-61 (77)
32 PF08513 LisH: LisH; InterPro 46.3 25 0.00055 20.3 2.5 20 164-183 6-25 (27)
33 smart00667 LisH Lissencephaly 45.4 22 0.00048 20.1 2.2 21 164-184 9-29 (34)
34 PF07571 DUF1546: Protein of u 43.5 90 0.002 23.0 5.7 53 121-177 26-81 (92)
35 KOG0037|consensus 41.4 35 0.00076 29.8 3.6 48 107-162 77-132 (221)
36 COG5602 SIN3 Histone deacetyla 38.4 95 0.0021 32.6 6.6 72 91-163 576-658 (1163)
37 COG1777 Predicted transcriptio 36.7 1.5E+02 0.0032 25.9 6.7 104 66-185 87-191 (217)
38 PF13606 Ank_3: Ankyrin repeat 35.9 24 0.00053 20.5 1.3 16 160-175 12-27 (30)
39 PF12755 Vac14_Fab1_bd: Vacuol 35.8 65 0.0014 24.1 4.0 65 96-171 30-96 (97)
40 smart00503 SynN Syntaxin N-ter 33.8 78 0.0017 23.1 4.1 22 90-111 93-114 (117)
41 PF11802 CENP-K: Centromere-as 31.6 73 0.0016 28.6 4.2 49 126-174 199-250 (268)
42 PF02847 MA3: MA3 domain; Int 31.6 2E+02 0.0044 20.8 6.1 19 152-173 92-110 (113)
43 cd03562 CID CID (CTD-Interacti 31.0 2.2E+02 0.0047 21.0 6.2 73 106-182 5-77 (114)
44 COG2307 Uncharacterized protei 30.8 3.6E+02 0.0078 24.8 8.5 143 24-186 56-245 (313)
45 PF04904 NCD1: NAB conserved r 29.8 73 0.0016 23.7 3.2 39 122-164 33-71 (82)
46 PF06743 FAST_1: FAST kinase-l 27.9 1.1E+02 0.0025 21.2 3.9 47 123-169 3-54 (71)
47 PF07276 PSGP: Apopolysialogly 26.3 25 0.00054 17.4 0.2 12 111-122 2-13 (13)
48 KOG2160|consensus 24.8 4.1E+02 0.009 24.6 7.9 103 72-174 135-243 (342)
49 PF08467 Luteo_P1-P2: Luteovir 23.9 1.6E+02 0.0034 27.3 4.9 44 24-67 137-180 (361)
50 PF00046 Homeobox: Homeobox do 23.6 62 0.0013 20.8 1.8 32 101-132 6-37 (57)
51 PF05178 Kri1: KRI1-like famil 23.2 3.4E+02 0.0073 20.6 6.3 37 108-147 32-68 (101)
52 PF06336 Corona_5a: Coronaviru 22.8 58 0.0013 22.8 1.5 33 151-183 6-44 (65)
53 smart00389 HOX Homeodomain. DN 22.3 1.4E+02 0.003 18.8 3.3 32 102-133 7-38 (56)
54 PF08542 Rep_fac_C: Replicatio 21.9 2.3E+02 0.005 19.7 4.7 41 139-179 38-81 (89)
55 PF00023 Ank: Ankyrin repeat H 21.4 61 0.0013 18.6 1.3 14 162-175 14-27 (33)
56 cd00086 homeodomain Homeodomai 20.9 1.5E+02 0.0032 18.7 3.3 33 101-133 6-38 (59)
No 1
>KOG2056|consensus
Probab=100.00 E-value=3.5e-40 Score=294.58 Aligned_cols=108 Identities=58% Similarity=0.951 Sum_probs=106.0
Q ss_pred hhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHH
Q psy4488 86 EGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSL 165 (194)
Q Consensus 86 ~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~L 165 (194)
++++.+.|+|+|+|++|+++|++|++||+||||||++++|.+||++||++.+|.+||.||||||++.|++||||||||+|
T Consensus 3 ~~~~~l~Rqakn~v~~y~~~e~kVrdAT~nd~wGPs~~lm~eIA~~ty~~~e~~eIm~vi~kRl~d~gknWR~VyKaLtl 82 (336)
T KOG2056|consen 3 MSFRDLKRQAKNFIKNYSEAELKVRDATSNDPWGPSGTLMAEIAQATYNFVEYQEIMDVLWKRLNDSGKNWRHVYKALTL 82 (336)
T ss_pred ccHHHHHHHHHHHHhcchHHHHHHHhccccccCCCchHHHHHHHHHhcCHHHHHHHHHHHHHHHhhccchHHHHHHHHHH
Confidence 56789999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhcCChhHHHhhhhhhhhcccccC
Q psy4488 166 LEYLIKTGSDKVTRIGGIFTHLIKTRSD 193 (194)
Q Consensus 166 LeyLlknGse~vv~~~r~~~~~i~~L~d 193 (194)
|||||+||||+||+||+.|+++|++|++
T Consensus 83 leyLl~~GSErv~~~~ren~~~I~tL~~ 110 (336)
T KOG2056|consen 83 LEYLLKNGSERVVDETRENIYTIETLKD 110 (336)
T ss_pred HHHHHhcCcHHHHHHHHhhhHHHHHHhh
Confidence 9999999999999999999999999986
No 2
>cd03571 ENTH_epsin ENTH domain, Epsin family; The epsin (Eps15 interactor) N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that E/ANTH domains are univ
Probab=100.00 E-value=8.5e-39 Score=251.16 Aligned_cols=90 Identities=64% Similarity=1.052 Sum_probs=88.2
Q ss_pred HHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhhh
Q psy4488 104 DAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGGI 183 (194)
Q Consensus 104 ~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r~ 183 (194)
|+|++|++|||||||||++++|++||++||++++|.+||++||+||++++++||||||||+||||||+|||++||+|+++
T Consensus 1 ~~e~~vreATs~d~wGp~~~~m~eIa~~t~~~~~~~~Im~~l~kRL~~~~k~WR~vyKaL~lleyLl~nGse~vv~~~r~ 80 (123)
T cd03571 1 EAELKVREATSNDPWGPSGTLMAEIARATYNYVEFQEIMSMLWKRLNDKGKNWRHVYKALTLLEYLLKNGSERVVDDARE 80 (123)
T ss_pred CHHHHHHHHcCCCCCCCCHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhcccccC
Q psy4488 184 FTHLIKTRSD 193 (194)
Q Consensus 184 ~~~~i~~L~d 193 (194)
|+++|++|++
T Consensus 81 ~~~~i~~L~~ 90 (123)
T cd03571 81 NLYIIRTLKD 90 (123)
T ss_pred hHHHHHhhcc
Confidence 9999999975
No 3
>PF01417 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (Epsin N-terminal homology) domain is approximately 150 amino acids in length and is always found located at the N-termini of proteins. The domain forms a compact globular structure, composed of 9 alpha-helices connected by loops of varying length. The general topology is determined by three helical hairpins that are stacked consecutively with a right hand twist []. An N-terminal helix folds back, forming a deep basic groove that forms the binding pocket for the Ins(1,4,5)P3 ligand []. The ligand is coordinated by residues from surrounding alpha-helices and all three phosphates are multiply coordinated. The coordination of Ins(1,4,5)P3 suggests that ENTH is specific for particular head groups. Proteins containing this domain have been found to bind PtdIns(4,5)P2 and PtdIns(1,4,5)P3 suggesting that the domain may be a membrane interacting module. The main function of proteins containing this domain appears to be to act as accessory clathrin adaptors in endocytosis, Epsin is able to recruit and promote clathrin polymerisation on a lipid monolayer, but may have additional roles in signalling and actin regulation []. Epsin causes a strong degree of membrane curvature and tubulation, even fragmentation of membranes with a high PtdIns(4,5)P2 content. Epsin binding to membranes facilitates their deformation by insertion of the N-terminal helix into the outer leaflet of the bilayer, pushing the head groups apart. This would reduce the energy needed to curve the membrane into a vesicle, making it easier for the clathrin cage to fix and stabilise the curved membrane. This points to a pioneering role for epsin in vesicle budding as it provides both a driving force and a link between membrane invagination and clathrin polymerisation. ; PDB: 1H0A_A 1EYH_A 1EDU_A 2QY7_B 1XGW_A 2V8S_E 1VDY_A 2DCP_A 1INZ_A 3ONL_B ....
Probab=100.00 E-value=4.8e-34 Score=222.07 Aligned_cols=92 Identities=55% Similarity=0.956 Sum_probs=85.4
Q ss_pred chHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHh-CcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 102 YSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRL-NDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 102 ~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL-~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
||++|++|++|||+|||||++++|++||++||++.+|..||++||+|| ..++++||++||||+||+||++|||++|++|
T Consensus 1 ys~~e~~v~eAT~~d~~gp~~~~l~eIa~~t~~~~~~~~I~~~l~kRL~~~~~k~wr~~~KaL~ll~yLl~nG~~~~~~~ 80 (125)
T PF01417_consen 1 YSELELKVREATSNDPWGPPGKLLAEIAQLTYNSKDCQEIMDVLWKRLSKSDGKNWRHVYKALTLLEYLLKNGSERFVDE 80 (125)
T ss_dssp --HHHHHHHHHTSSSSSS--HHHHHHHHHHTTSCHHHHHHHHHHHHHHHSSTSSGHHHHHHHHHHHHHHHHHS-HHHHHH
T ss_pred CCHHHHHHHHHcCCCCCCcCHHHHHHHHHHHhccccHHHHHHHHHHHHHhcCCcchhHHHHHHHHHHHHHHHCCHHHHHH
Confidence 899999999999999999999999999999999999999999999999 8899999999999999999999999999999
Q ss_pred hhhhhhhcccccC
Q psy4488 181 GGIFTHLIKTRSD 193 (194)
Q Consensus 181 ~r~~~~~i~~L~d 193 (194)
++.|.+.|+.|++
T Consensus 81 ~~~~~~~I~~l~~ 93 (125)
T PF01417_consen 81 LRDHIDIIRELQD 93 (125)
T ss_dssp HHHTHHHHHGGGG
T ss_pred HHHHHHHHhhcce
Confidence 9999999999975
No 4
>KOG2057|consensus
Probab=99.97 E-value=7.7e-32 Score=240.91 Aligned_cols=107 Identities=40% Similarity=0.723 Sum_probs=102.1
Q ss_pred hhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccC--hhhHHHHHHHHHHHh-CcCCCchHHHHHH
Q psy4488 86 EGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYN--VVAFTEIMAMIWKRL-NDSGKNWRHVYKA 162 (194)
Q Consensus 86 ~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~--~~~~~~Im~~l~kRL-~~~~k~WR~vyKs 162 (194)
..++++..++.++++||+++|.+||||||+|||||++.+|.+||.+||. +++|+++|.+||.|+ .++.++||+|||+
T Consensus 5 ~kVRel~dkandaiMNY~e~e~~VREATNdDPWGPsG~lMgeIaeaTfmry~EdFpelmnmL~qRMLedNK~~WRRVYKS 84 (499)
T KOG2057|consen 5 EKVRELTDKANDAIMNYPEAEMDVREATNDDPWGPSGPLMGEIAEATFMRYMEDFPELMNMLFQRMLEDNKDAWRRVYKS 84 (499)
T ss_pred HHHHHHHHHHhHHHhcchHHHHHHHhhccCCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 4578999999999999999999999999999999999999999999999 999999999999995 5677899999999
Q ss_pred HHHHHHHHhcCChhHHHhhhhhhhhccccc
Q psy4488 163 LSLLEYLIKTGSDKVTRIGGIFTHLIKTRS 192 (194)
Q Consensus 163 L~LLeyLlknGse~vv~~~r~~~~~i~~L~ 192 (194)
|+||.|||+||||+||+++|+|+|-++.|.
T Consensus 85 LiLLaYLikNGSER~VqeAREh~YdLR~LE 114 (499)
T KOG2057|consen 85 LILLAYLIKNGSERFVQEAREHAYDLRRLE 114 (499)
T ss_pred HHHHHHHHhcccHHHHHHHHHHHHHHHhhh
Confidence 999999999999999999999999998875
No 5
>smart00273 ENTH Epsin N-terminal homology (ENTH) domain.
Probab=99.94 E-value=9e-27 Score=181.43 Aligned_cols=90 Identities=50% Similarity=0.757 Sum_probs=86.6
Q ss_pred hHHHHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhh
Q psy4488 103 SDAQVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIG 181 (194)
Q Consensus 103 t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~ 181 (194)
|++|+.|++|||+|+|||+++.|.+|+.+||++ .+|..||.+||+||.+++ +|+++||||+||+|||+|||++|+.++
T Consensus 1 ~~~e~~V~kAT~~~~~~p~~k~~~~I~~~t~~~~~~~~~i~~~l~~Rl~~~~-~w~~v~KsL~llh~ll~~G~~~~~~~~ 79 (127)
T smart00273 1 SDLEVKVRKATNNDEWGPKGKHLREIIQGTHNEKSSFAEIMAVLWRRLNDTK-NWRVVYKALILLHYLLRNGSPRVILEA 79 (127)
T ss_pred CHHHHHHHHhcCCCCCCcCHHHHHHHHHHHccCHhhHHHHHHHHHHHHhCcc-cHHHHHHHHHHHHHHHhcCCHHHHHHH
Confidence 589999999999999999999999999999999 999999999999999987 899999999999999999999999999
Q ss_pred hhhhhhcccccC
Q psy4488 182 GIFTHLIKTRSD 193 (194)
Q Consensus 182 r~~~~~i~~L~d 193 (194)
+.|.+.|..|.+
T Consensus 80 ~~~~~~i~~L~~ 91 (127)
T smart00273 80 LRNRNRILNLSD 91 (127)
T ss_pred HHhhHHHhhHhh
Confidence 999999988864
No 6
>KOG2056|consensus
Probab=99.84 E-value=1.4e-21 Score=175.14 Aligned_cols=68 Identities=51% Similarity=0.827 Sum_probs=65.3
Q ss_pred ChhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCCHHHHHHHHHHHHHhhhhhccCCCC
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGDEERLRNERARQLKARERFARSASS 68 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D~~~lr~eR~~a~~~r~k~~~~~~~ 68 (194)
|+.|||+|+|+|++|++|+||| +|+|||+|||+|||+|++||+|+++|++||++|+++|+||.+.+++
T Consensus 94 v~~~~ren~~~I~tL~~Fq~iD~~G~dqG~nVRkkak~l~~LL~D~erLkeeR~~a~~~r~k~~~~~~~ 162 (336)
T KOG2056|consen 94 VVDETRENIYTIETLKDFQYIDEDGKDQGLNVRKKAKELLSLLEDDERLKEERKKARKTRTKFAGFGPG 162 (336)
T ss_pred HHHHHHhhhHHHHHHhhceeeCCCCccchHHHHHHHHHHHHHhccHHHHHHHHHHHHHHhhhhcccCCC
Confidence 6899999999999999999998 9999999999999999999999999999999999999999987653
No 7
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=99.81 E-value=4.2e-20 Score=145.29 Aligned_cols=82 Identities=28% Similarity=0.320 Sum_probs=78.6
Q ss_pred HHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHH-HHHHHHHHHHHhcCChhHHHhhhhhh
Q psy4488 108 KVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHV-YKALSLLEYLIKTGSDKVTRIGGIFT 185 (194)
Q Consensus 108 ~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~v-yKsL~LLeyLlknGse~vv~~~r~~~ 185 (194)
+|+.||+||+|||+|-+|+|||+.||+. .+|.+||++|.+||++++ +|| +|+|.||.||+.+||+.|+.|++.|.
T Consensus 5 ll~~ATsdd~~p~pgy~~~Eia~~t~~s~~~~~ei~d~L~kRL~~~~---~hVK~K~Lrilk~l~~~G~~~f~~~~~~~~ 81 (122)
T cd03572 5 LLSKATSDDDEPTPGYLYEEIAKLTRKSVGSCQELLEYLLKRLKRSS---PHVKLKVLKIIKHLCEKGNSDFKRELQRNS 81 (122)
T ss_pred HHHHHhcCCCCCCchHHHHHHHHHHHcCHHHHHHHHHHHHHHhcCCC---CcchHHHHHHHHHHHhhCCHHHHHHHHHhH
Confidence 6899999999999999999999999996 899999999999999876 899 99999999999999999999999999
Q ss_pred hhccccc
Q psy4488 186 HLIKTRS 192 (194)
Q Consensus 186 ~~i~~L~ 192 (194)
++|+++.
T Consensus 82 ~~Ik~~~ 88 (122)
T cd03572 82 AQIRECA 88 (122)
T ss_pred HHHHHHH
Confidence 9999875
No 8
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=99.79 E-value=2.8e-19 Score=136.55 Aligned_cols=86 Identities=33% Similarity=0.385 Sum_probs=79.8
Q ss_pred HHHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhhh
Q psy4488 105 AQVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGGI 183 (194)
Q Consensus 105 ~E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r~ 183 (194)
.|..|.+|||+++|||+.+.|.+|++.+++. ..+.+||+.|++||.++ +|+++||||+|||||++||+++|..++..
T Consensus 1 ~~~~v~~AT~~~~~~p~~~~i~~i~d~~~~~~~~~~~~~~~l~kRl~~~--~~~~~lkaL~lLe~lvkN~g~~f~~~i~~ 78 (115)
T cd00197 1 FEKTVEKATSNENMGPDWPLIMEICDLINETNVGPKEAVDAIKKRINNK--NPHVVLKALTLLEYCVKNCGERFHQEVAS 78 (115)
T ss_pred ChHHHHHHcCCCCCCCCHHHHHHHHHHHHCCCccHHHHHHHHHHHhcCC--cHHHHHHHHHHHHHHHHHccHHHHHHHHH
Confidence 3789999999999999999999999999987 66799999999999886 99999999999999999999999999999
Q ss_pred hhhhccccc
Q psy4488 184 FTHLIKTRS 192 (194)
Q Consensus 184 ~~~~i~~L~ 192 (194)
+..+++.++
T Consensus 79 ~~~~~~l~~ 87 (115)
T cd00197 79 NDFAVELLK 87 (115)
T ss_pred hHHHHHHHH
Confidence 998887643
No 9
>KOG2057|consensus
Probab=99.71 E-value=5.5e-18 Score=152.26 Aligned_cols=77 Identities=25% Similarity=0.539 Sum_probs=70.6
Q ss_pred ChhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCCHHHHHHHHHHHHHh-hhhhccCCCCCCCCCCccc
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGDEERLRNERARQLKA-RERFARSASSGFGSEGSAM 77 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D~~~lr~eR~~a~~~-r~k~~~~~~~~~g~~~~~~ 77 (194)
||+++|||+|.++.|++|+||| .|+|||||||+|.|+|+++.+||++|++||++|++. ++||.|+++..+|+-+++.
T Consensus 99 ~VqeAREh~YdLR~LEnYhfiDEhGKDQGINIR~kVKeilEfanDDd~Lq~ERkKAkkddKdKY~G~~qd~mggf~~Se 177 (499)
T KOG2057|consen 99 FVQEAREHAYDLRRLENYHFIDEHGKDQGINIRHKVKEILEFANDDDLLQAERKKAKKDDKDKYRGFDQDDMGGFGSSE 177 (499)
T ss_pred HHHHHHHHHHHHHhhhhccchhhhCccccccHHHHHHHHHHHhccHHHHHHHHHhhccccchhccccchhhcccccccc
Confidence 6899999999999999999997 899999999999999999999999999999999998 9999999876666544433
No 10
>cd03564 ANTH_AP180_CALM ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machine
Probab=99.71 E-value=3.6e-17 Score=126.13 Aligned_cols=84 Identities=25% Similarity=0.277 Sum_probs=78.7
Q ss_pred HHHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhhh
Q psy4488 105 AQVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGGI 183 (194)
Q Consensus 105 ~E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r~ 183 (194)
.|+.|..|||+++|||..+.|.+|..+|++ +..+..+|..|++|+.+ ++|+++||||+||++||++|++.|++++..
T Consensus 1 ~~~aV~kAT~~~~~~pk~k~v~~ii~~t~~~~~~~~~~~~~l~~Rl~~--~~w~v~~K~LillH~llr~G~~~~~~~~~~ 78 (117)
T cd03564 1 LEKAVKKATSHDEAPPKDKHVRKIIAGTSSSPASIPSFASALSRRLLD--RNWVVVLKALILLHRLLREGHPSFLQELLS 78 (117)
T ss_pred CchHHHhhcCCCCCCCChHHHHHHHHHHcCCCCCHHHHHHHHHHHHcc--CcHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Confidence 367899999999999999999999999999 89999999999999998 789999999999999999999999999999
Q ss_pred hhhhccc
Q psy4488 184 FTHLIKT 190 (194)
Q Consensus 184 ~~~~i~~ 190 (194)
|...|+.
T Consensus 79 ~~~~l~l 85 (117)
T cd03564 79 RRGWLNL 85 (117)
T ss_pred ccCeeec
Confidence 8766654
No 11
>cd03571 ENTH_epsin ENTH domain, Epsin family; The epsin (Eps15 interactor) N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that E/ANTH domains are univ
Probab=99.59 E-value=5.3e-16 Score=122.03 Aligned_cols=49 Identities=59% Similarity=0.974 Sum_probs=47.2
Q ss_pred ChhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCCHHHHH
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGDEERLR 49 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D~~~lr 49 (194)
||.+||+|+++|++|++|+|+| +|+|+|+|||+|||+|++||+|+++||
T Consensus 74 vv~~~r~~~~~i~~L~~F~~~d~~g~d~G~~VR~ka~~i~~Ll~D~~~L~ 123 (123)
T cd03571 74 VVDDARENLYIIRTLKDFQYIDENGKDQGINVREKAKEILELLEDDERLR 123 (123)
T ss_pred HHHHHHHhHHHHHhhccceeeCCCCCchhHHHHHHHHHHHHHhCCHhhcC
Confidence 6899999999999999999998 899999999999999999999999985
No 12
>PF01417 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (Epsin N-terminal homology) domain is approximately 150 amino acids in length and is always found located at the N-termini of proteins. The domain forms a compact globular structure, composed of 9 alpha-helices connected by loops of varying length. The general topology is determined by three helical hairpins that are stacked consecutively with a right hand twist []. An N-terminal helix folds back, forming a deep basic groove that forms the binding pocket for the Ins(1,4,5)P3 ligand []. The ligand is coordinated by residues from surrounding alpha-helices and all three phosphates are multiply coordinated. The coordination of Ins(1,4,5)P3 suggests that ENTH is specific for particular head groups. Proteins containing this domain have been found to bind PtdIns(4,5)P2 and PtdIns(1,4,5)P3 suggesting that the domain may be a membrane interacting module. The main function of proteins containing this domain appears to be to act as accessory clathrin adaptors in endocytosis, Epsin is able to recruit and promote clathrin polymerisation on a lipid monolayer, but may have additional roles in signalling and actin regulation []. Epsin causes a strong degree of membrane curvature and tubulation, even fragmentation of membranes with a high PtdIns(4,5)P2 content. Epsin binding to membranes facilitates their deformation by insertion of the N-terminal helix into the outer leaflet of the bilayer, pushing the head groups apart. This would reduce the energy needed to curve the membrane into a vesicle, making it easier for the clathrin cage to fix and stabilise the curved membrane. This points to a pioneering role for epsin in vesicle budding as it provides both a driving force and a link between membrane invagination and clathrin polymerisation. ; PDB: 1H0A_A 1EYH_A 1EDU_A 2QY7_B 1XGW_A 2V8S_E 1VDY_A 2DCP_A 1INZ_A 3ONL_B ....
Probab=99.17 E-value=2.3e-11 Score=94.53 Aligned_cols=48 Identities=52% Similarity=0.812 Sum_probs=42.8
Q ss_pred ChhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCCHHHH
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGDEERL 48 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D~~~l 48 (194)
|+.+||+|++.|+.|.+|+|+| .|.|+|.+||++|++|++||.|+++|
T Consensus 77 ~~~~~~~~~~~I~~l~~f~~~d~~g~d~~~~VR~~A~~i~~lL~d~~~L 125 (125)
T PF01417_consen 77 FVDELRDHIDIIRELQDFQYVDPKGKDQGQNVREKAKEILELLNDDERL 125 (125)
T ss_dssp HHHHHHHTHHHHHGGGG---BBTTSTBHHHHHHHHHHHHHHHHTSHHHH
T ss_pred HHHHHHHHHHHHhhcceeeccCCCCccHHHHHHHHHHHHHHHhCCcccC
Confidence 4789999999999999999998 79999999999999999999999986
No 13
>smart00273 ENTH Epsin N-terminal homology (ENTH) domain.
Probab=98.97 E-value=3.2e-10 Score=88.20 Aligned_cols=52 Identities=44% Similarity=0.567 Sum_probs=49.6
Q ss_pred ChhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCCHHHHHHHH
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGDEERLRNER 52 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D~~~lr~eR 52 (194)
|+.+++.|.+.|..|.+|+|++ .|.|+|.+||++|+.|.++|.|.++|++||
T Consensus 75 ~~~~~~~~~~~i~~L~~f~~~~~~~~d~g~~VR~ya~~L~~~l~~~~~l~~er 127 (127)
T smart00273 75 VILEALRNRNRILNLSDFQDIDSRGKDQGANIRTYAKYLLERLEDDRRLKEER 127 (127)
T ss_pred HHHHHHHhhHHHhhHhhCeecCCCCeeCcHHHHHHHHHHHHHHcCHHHHhccC
Confidence 4689999999999999999986 999999999999999999999999999987
No 14
>PF07651 ANTH: ANTH domain; InterPro: IPR011417 AP180 is an endocytotic accessory protein that has been implicated in the formation of clathrin-coated pits. The domain is involved in phosphatidylinositol 4,5-bisphosphate binding and is a universal adaptor for nucleation of clathrin coats [, ].; GO: 0005543 phospholipid binding; PDB: 1HX8_A 3ZYM_A 1HFA_A 1HG2_A 3ZYL_B 3ZYK_A 1HG5_A 1HF8_A.
Probab=98.86 E-value=1.1e-08 Score=88.41 Aligned_cols=87 Identities=25% Similarity=0.250 Sum_probs=68.7
Q ss_pred hHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhh
Q psy4488 103 SDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGG 182 (194)
Q Consensus 103 t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r 182 (194)
++.|+-|..||+.++.+|..+.+..|..+|+++.....++..|.+|+.. ..+|..++|+|+|++.|++-|++.+..++.
T Consensus 2 ~~l~~av~KAT~~~~~ppk~Khv~~il~~t~~~~~~~~~~~~l~~Rl~~-~~~w~V~~K~Lil~H~llr~G~~~~~~~~~ 80 (280)
T PF07651_consen 2 SDLEKAVIKATSHDEAPPKEKHVREILSATSSPESVAFLFWALSRRLPL-TRNWIVALKALILLHRLLRDGHPSFLQELL 80 (280)
T ss_dssp -HHHHHHHHHT-SSS---HHHHHHHHHHHCSTTS-HHHHHHHHHHHCTS-S-SHHHHHHHHHHHHHHHHHS-CHHHHHHH
T ss_pred hHHHHHHHHHcCCCCCCCCHHHHHHHHHHhcCCccHHHHHHHHHHHccc-cccHHHHHHHHHHHHHHHHcCchHHHHHHH
Confidence 5788999999999999999999999999999978889999999999987 457999999999999999999999998877
Q ss_pred hhhhhccc
Q psy4488 183 IFTHLIKT 190 (194)
Q Consensus 183 ~~~~~i~~ 190 (194)
.+...+..
T Consensus 81 ~~~~~~~~ 88 (280)
T PF07651_consen 81 RYNRRLFD 88 (280)
T ss_dssp HTT-----
T ss_pred Hcccchhh
Confidence 66554443
No 15
>KOG0251|consensus
Probab=98.48 E-value=4.1e-07 Score=86.11 Aligned_cols=84 Identities=27% Similarity=0.325 Sum_probs=76.8
Q ss_pred cchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHH
Q psy4488 101 NYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTR 179 (194)
Q Consensus 101 ~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~ 179 (194)
.++.+++.|..||+.++.+|..+.+..|-.+|-+. ...+.++..|..||+.. ++|-.+||||+|+++|+++|.+.+..
T Consensus 20 ~~~~l~~AV~KATsh~~~ppk~k~l~~Il~~ts~~~~~i~~~v~aLs~Rl~~T-rnW~VAlKsLIliH~ll~~G~~~f~~ 98 (491)
T KOG0251|consen 20 AGSDLEKAVVKATSHDDMPPKDKYLDEILSATSSSPASIPSCVHALSERLNKT-RNWTVALKALILIHRLLKEGDPSFEQ 98 (491)
T ss_pred hhhhHHHHHHhhccCCCCCccHHHHHHHHHHhcCCcccHHHHHHHHHHHhCCC-cceeehHHHHHHHHHHHhcCcHHHHH
Confidence 57899999999999999999999999999998776 78889999999999985 57999999999999999999999998
Q ss_pred hhhhhh
Q psy4488 180 IGGIFT 185 (194)
Q Consensus 180 ~~r~~~ 185 (194)
.+-..-
T Consensus 99 ~l~~~~ 104 (491)
T KOG0251|consen 99 ELLSRN 104 (491)
T ss_pred HHHhcc
Confidence 766554
No 16
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=98.05 E-value=1.9e-06 Score=68.05 Aligned_cols=45 Identities=18% Similarity=0.197 Sum_probs=41.3
Q ss_pred ChhHhhhhhhhhhhccccee-cc--CCccchhhHHHHHHHHHHhhCCH
Q psy4488 1 VAFQCKENIFLIQTLRDFQY-TE--EGKDQGFNVREKAKALATLLGDE 45 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y-~d--~g~D~G~~VR~kak~l~~LL~D~ 45 (194)
++.|+|.|.++|+++++|++ .| .|.|.|..||++|++++++|..+
T Consensus 73 f~~~~~~~~~~Ik~~~~f~g~~Dp~~Gd~~~~~VR~~A~El~~~if~~ 120 (122)
T cd03572 73 FKRELQRNSAQIRECANYKGPPDPLKGDSLNEKVREEAQELIKAIFSY 120 (122)
T ss_pred HHHHHHHhHHHHHHHHHcCCCCCcccCcchhHHHHHHHHHHHHHHhcc
Confidence 46899999999999999999 56 89999999999999999998764
No 17
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=97.83 E-value=0.00012 Score=58.04 Aligned_cols=78 Identities=29% Similarity=0.254 Sum_probs=63.7
Q ss_pred cchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHH
Q psy4488 101 NYSDAQVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTR 179 (194)
Q Consensus 101 ~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~ 179 (194)
..|+++..|..||+...-.|.-....+|++.--. +..-.+++..|.|||.. .+-..++-||+|||+|++||.+.|-.
T Consensus 2 ~~~~~~~li~kATs~~~~~~Dw~~~l~icD~i~~~~~~~kea~~~l~krl~~--~~~~vq~~aL~lld~lvkNcg~~f~~ 79 (140)
T PF00790_consen 2 PSSSITELIEKATSESLPSPDWSLILEICDLINSSPDGAKEAARALRKRLKH--GNPNVQLLALTLLDALVKNCGPRFHR 79 (140)
T ss_dssp CCSHHHHHHHHHT-TTSSS--HHHHHHHHHHHHTSTTHHHHHHHHHHHHHTT--SSHHHHHHHHHHHHHHHHHSHHHHHH
T ss_pred CCChHHHHHHHHhCcCCCCCCHHHHHHHHHHHHcCCccHHHHHHHHHHHHhC--CCHHHHHHHHHHHHHHHHcCCHHHHH
Confidence 3589999999999999888888888888887443 55667899999999998 47899999999999999999887665
Q ss_pred h
Q psy4488 180 I 180 (194)
Q Consensus 180 ~ 180 (194)
.
T Consensus 80 e 80 (140)
T PF00790_consen 80 E 80 (140)
T ss_dssp H
T ss_pred H
Confidence 5
No 18
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=97.54 E-value=0.00057 Score=54.75 Aligned_cols=76 Identities=22% Similarity=0.253 Sum_probs=61.6
Q ss_pred HHHHHHHHhhcCCCCCCCHHHHHHHHHHccChh-hHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 104 DAQVKVREATSNDPWGPSSSLMSEIADLTYNVV-AFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 104 ~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~-~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
|.+..|..||+...-.|.-...-+|++.--+.+ .-.+.|..|.|||.. +++-+.++-||+|||-|++||..+|-.+
T Consensus 1 ~~~~~IekATse~l~~~dw~~ileicD~In~~~~~~k~a~ralkkRl~~-~~n~~v~l~aL~LLe~~vkNCG~~fh~e 77 (141)
T cd03565 1 PVGQLIEKATDGSLQSEDWGLNMEICDIINETEDGPKDAVRALKKRLNG-NKNHKEVMLTLTVLETCVKNCGHRFHVL 77 (141)
T ss_pred CHhHHHHHHcCcCCCCcCHHHHHHHHHHHhCCCCcHHHHHHHHHHHHcc-CCCHHHHHHHHHHHHHHHHHccHHHHHH
Confidence 457889999999877888888888887754443 346899999999973 4578889999999999999999876654
No 19
>smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
Probab=97.51 E-value=0.00079 Score=53.14 Aligned_cols=73 Identities=27% Similarity=0.256 Sum_probs=60.2
Q ss_pred HHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 106 QVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 106 E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
|.+|..||+...-.|.-..+.+|++.--+. ..-.+.+..|.|||.. ++-..++.||+|||+|++|+...|-.+
T Consensus 2 ~~~i~kATs~~l~~~dw~~~l~icD~i~~~~~~~k~a~r~l~krl~~--~n~~v~l~AL~lLe~~vkNcg~~f~~e 75 (133)
T smart00288 2 ERLIDKATSPSLLEEDWELILEICDLINSTPDGPKDAVRLLKKRLNN--KNPHVALLALTLLDACVKNCGSKFHLE 75 (133)
T ss_pred hhHHHHHcCcCCCCcCHHHHHHHHHHHhCCCccHHHHHHHHHHHHcC--CCHHHHHHHHHHHHHHHHHCCHHHHHH
Confidence 678999999888788888888888875433 3456899999999996 467889999999999999988776554
No 20
>cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries.
Probab=97.40 E-value=0.0012 Score=53.08 Aligned_cols=74 Identities=19% Similarity=0.173 Sum_probs=61.4
Q ss_pred HHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhh
Q psy4488 106 QVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIG 181 (194)
Q Consensus 106 E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~ 181 (194)
|..|..||+...-.|.-..+-+|++.--+ ...-.+.|..|.|||... +-...+.||+|||-|++|+...|-.++
T Consensus 2 e~~iekATse~l~~~dw~~il~icD~I~~~~~~~k~a~ral~KRl~~~--n~~v~l~AL~LLe~~vkNCG~~fh~ev 76 (144)
T cd03568 2 DDLVEKATDEKLTSENWGLILDVCDKVKSDENGAKDCLKAIMKRLNHK--DPNVQLRALTLLDACAENCGKRFHQEV 76 (144)
T ss_pred hHHHHHHcCccCCCcCHHHHHHHHHHHhcCCccHHHHHHHHHHHHcCC--CHHHHHHHHHHHHHHHHHCCHHHHHHH
Confidence 67899999998888888888898887433 445568999999999864 688999999999999999998766543
No 21
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=97.36 E-value=0.0013 Score=52.77 Aligned_cols=77 Identities=27% Similarity=0.158 Sum_probs=62.3
Q ss_pred chHHHHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 102 YSDAQVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 102 ~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
.|+.+..|..||+...-.|.-..+.+|++.--+ ...-.+.+..|.|||.. ++-...+-||.|||.|++|+...|-.+
T Consensus 2 ~~~~~~~I~kATs~~l~~~dw~~ileicD~In~~~~~~k~a~ral~krl~~--~n~~vql~AL~LLe~~vkNCG~~fh~e 79 (142)
T cd03569 2 VSEFDELIEKATSELLGEPDLASILEICDMIRSKDVQPKYAMRALKKRLLS--KNPNVQLYALLLLESCVKNCGTHFHDE 79 (142)
T ss_pred cchHHHHHHHHcCcccCccCHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcC--CChHHHHHHHHHHHHHHHHCCHHHHHH
Confidence 368899999999988778888887888776433 33446899999999987 478999999999999999987765443
No 22
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=97.24 E-value=0.0024 Score=50.18 Aligned_cols=75 Identities=25% Similarity=0.157 Sum_probs=60.6
Q ss_pred HHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhh
Q psy4488 106 QVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGG 182 (194)
Q Consensus 106 E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r 182 (194)
+..|..||+...-.|.-..+.+|++.-.+. ..-.+.+..|.|||... +-...+-||.|||.|++||.+.|-..+-
T Consensus 2 ~~~I~kATs~~~~~~D~~~il~icd~I~~~~~~~k~a~raL~krl~~~--n~~vql~AL~lLd~~vkNcg~~f~~~i~ 77 (133)
T cd03561 2 TSLIERATSPSLEEPDWALNLELCDLINLKPNGPKEAARAIRKKIKYG--NPHVQLLALTLLELLVKNCGKPFHLQVA 77 (133)
T ss_pred hHHHHHHcCcccCCccHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCC--CHHHHHHHHHHHHHHHHhCChHHHHHHh
Confidence 457899999877777888888888775443 45578999999999975 6789999999999999999987655443
No 23
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=97.20 E-value=0.0023 Score=51.39 Aligned_cols=74 Identities=9% Similarity=0.110 Sum_probs=61.0
Q ss_pred HHHHHHHhhcCCCCCCCHHHHHHHHHHccCh-hhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 105 AQVKVREATSNDPWGPSSSLMSEIADLTYNV-VAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 105 ~E~~VreATsnd~wGps~~~m~eIa~~T~~~-~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
+|..|..||+...-.|.-....+|++.--+. ..-.+.+..|.+||... +-...+-||.|||.|++|+...|-.+
T Consensus 2 ~~~~iekAT~~~l~~~dw~~ileicD~In~~~~~~k~a~rai~krl~~~--n~~v~l~AL~LLe~~vkNCG~~fh~e 76 (139)
T cd03567 2 LEAWLNKATNPSNREEDWEAIQAFCEQINKEPEGPQLAVRLLAHKIQSP--QEKEALQALTVLEACMKNCGERFHSE 76 (139)
T ss_pred HHHHHHHHcCccCCCCCHHHHHHHHHHHHcCCccHHHHHHHHHHHHcCC--CHHHHHHHHHHHHHHHHHcCHHHHHH
Confidence 6889999999888888888888888775443 33457899999999865 57789999999999999999876554
No 24
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=96.27 E-value=0.0034 Score=47.65 Aligned_cols=42 Identities=33% Similarity=0.443 Sum_probs=36.5
Q ss_pred hhHhhhhhhhhhhcccceecc-CCccchhhHHHHHHHHHHhhCC
Q psy4488 2 AFQCKENIFLIQTLRDFQYTE-EGKDQGFNVREKAKALATLLGD 44 (194)
Q Consensus 2 ~~~~r~~~~~i~~L~~F~y~d-~g~D~G~~VR~kak~l~~LL~D 44 (194)
..+++.+.+.+..++ |+|.+ .|.|+|.+||+|+++|+.+..|
T Consensus 73 ~~~i~~~~~~~~l~~-~~~~~~~~~~~~~~Vr~k~~~l~~~w~~ 115 (115)
T cd00197 73 HQEVASNDFAVELLK-FDKSKLLGDDVSTNVREKAIELVQLWAS 115 (115)
T ss_pred HHHHHHhHHHHHHHH-hhccccccCCCChHHHHHHHHHHHHHhC
Confidence 567889999998775 68887 9999999999999999998754
No 25
>KOG1087|consensus
Probab=93.71 E-value=0.26 Score=47.03 Aligned_cols=72 Identities=28% Similarity=0.227 Sum_probs=57.5
Q ss_pred HHHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHH
Q psy4488 105 AQVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVT 178 (194)
Q Consensus 105 ~E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv 178 (194)
+..+|..||+..--+|.=.+-.+||+.--. ...-.+++..|.|||+.+.. +..|=||+|||-|++|+-+.|=
T Consensus 2 v~~~IdkAT~~~l~~pDWa~NleIcD~IN~~~~~~~eAvralkKRi~~k~s--~vq~lALtlLE~cvkNCG~~fh 74 (470)
T KOG1087|consen 2 VGKLIDKATSESLAEPDWALNLEICDLINSTEGGPKEAVRALKKRLNSKNS--KVQLLALTLLETCVKNCGYSFH 74 (470)
T ss_pred hHHHHHHhhcccccCccHHHHHHHHHHHhcCccCcHHHHHHHHHHhccCCc--HHHHHHHHHHHHHHHhhhHHHH
Confidence 457899999998888888888888887533 23445899999999998644 7777799999999999987643
No 26
>KOG2199|consensus
Probab=90.33 E-value=1.4 Score=41.62 Aligned_cols=78 Identities=21% Similarity=0.178 Sum_probs=57.4
Q ss_pred chHHHHHHHHhhcCCCCCCCHHHHHHHHHH-ccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHh
Q psy4488 102 YSDAQVKVREATSNDPWGPSSSLMSEIADL-TYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRI 180 (194)
Q Consensus 102 ~t~~E~~VreATsnd~wGps~~~m~eIa~~-T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~ 180 (194)
-+++|..|..||+..+..-.=.+.-++++. +-+++--...|+.|.|||+.. .--.++-+|+||+-|+.|+-.++..+
T Consensus 6 ~n~~e~~v~KAT~e~nT~enW~~IlDvCD~v~~~~~~~kd~lk~i~KRln~~--dphV~L~AlTLlda~~~NCg~~~r~E 83 (462)
T KOG2199|consen 6 ANPFEQDVEKATDEKNTSENWSLILDVCDKVGSDPDGGKDCLKAIMKRLNHK--DPHVVLQALTLLDACVANCGKRFRLE 83 (462)
T ss_pred cchHHHHHHHhcCcccccccHHHHHHHHHhhcCCCcccHHHHHHHHHHhcCC--CcchHHHHHHHHHHHHHhcchHHHHH
Confidence 578999999999754322222344455544 455566678899999999975 45689999999999999999887765
Q ss_pred h
Q psy4488 181 G 181 (194)
Q Consensus 181 ~ 181 (194)
+
T Consensus 84 V 84 (462)
T KOG2199|consen 84 V 84 (462)
T ss_pred H
Confidence 3
No 27
>cd03564 ANTH_AP180_CALM ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machine
Probab=86.37 E-value=0.49 Score=36.19 Aligned_cols=41 Identities=17% Similarity=0.067 Sum_probs=31.8
Q ss_pred ChhHhhhhhhhhhhcccceecc--CCccchhhHHHHHHHHHHhh
Q psy4488 1 VAFQCKENIFLIQTLRDFQYTE--EGKDQGFNVREKAKALATLL 42 (194)
Q Consensus 1 v~~~~r~~~~~i~~L~~F~y~d--~g~D~G~~VR~kak~l~~LL 42 (194)
|+++++.+...+ .|.+|.+.. .|.|+|.+||..|+-|.+-|
T Consensus 72 ~~~~~~~~~~~l-~l~~~~~~~~~~~~~~~~~Vr~Ya~yL~~rl 114 (117)
T cd03564 72 FLQELLSRRGWL-NLSNFLDKSSSLGYGYSAFIRAYARYLDERL 114 (117)
T ss_pred HHHHHHHccCee-eccccccCCCCCchhhhHHHHHHHHHHHHHH
Confidence 356777774444 677899986 59999999999999886544
No 28
>KOG3208|consensus
Probab=80.96 E-value=11 Score=32.84 Aligned_cols=57 Identities=12% Similarity=0.311 Sum_probs=35.9
Q ss_pred CCCCCCCCCccchh---hhhhHHhhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHH
Q psy4488 66 ASSGFGSEGSAMRR---QREDIQEGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADL 131 (194)
Q Consensus 66 ~~~~~g~~~~~~rR---~~~d~~~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~ 131 (194)
+-+||+.++++.-. +-.+.-.-+.++.++..++.-.+++ +...|+++...|..|.++
T Consensus 35 ~~gg~~~~~s~~~~~~~s~ks~~~eie~LLeql~~vndsm~~---------~~~s~a~~aa~~htL~RH 94 (231)
T KOG3208|consen 35 THGGYDIDTSPLSGSDRSFKSLENEIEGLLEQLQDVNDSMND---------CASSPANSAAVMHTLQRH 94 (231)
T ss_pred cCCCCCCCcccccCcCcchhhhHHHHHHHHHHHHHHHHHHHh---------hccCCCCcHHHHHHHHHH
Confidence 44678877776655 3333444455566666554332222 456899999999988876
No 29
>KOG0980|consensus
Probab=77.26 E-value=3.5 Score=42.32 Aligned_cols=84 Identities=18% Similarity=0.114 Sum_probs=67.0
Q ss_pred HHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhhhhhh
Q psy4488 107 VKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGGIFTH 186 (194)
Q Consensus 107 ~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r~~~~ 186 (194)
.-|+.|+|.+..+|..+...-|.-.|++...-...+.+| +|+.-. .+-...+|-+.||+-+|+-|.+.++.++..|-.
T Consensus 9 ~av~KAis~~Et~~K~KH~Rt~I~gTh~eksa~~FWt~i-k~~PL~-~~~VltwKfchllHKvLreGHpsal~es~r~r~ 86 (980)
T KOG0980|consen 9 EAVQKAISKDETPPKRKHVRTIIVGTHDEKSSKIFWTTI-KRQPLE-NHEVLTWKFCHLLHKVLREGHPSALEESQRYKK 86 (980)
T ss_pred HHHHHHhccccCCCchhhhhheeeeecccccchhHHHHh-hccccc-cchHHHHHHHHHHHHHHHcCCcchhHHHHHHHH
Confidence 357899999999999999999999999986543333333 354432 346778899999999999999999999999988
Q ss_pred hccccc
Q psy4488 187 LIKTRS 192 (194)
Q Consensus 187 ~i~~L~ 192 (194)
.|..+.
T Consensus 87 ~i~~l~ 92 (980)
T KOG0980|consen 87 WITQLG 92 (980)
T ss_pred HHHHHH
Confidence 887653
No 30
>KOG1086|consensus
Probab=76.47 E-value=6.5 Score=37.81 Aligned_cols=30 Identities=33% Similarity=0.576 Sum_probs=25.1
Q ss_pred CCCchHHHHHHHHHHHHHHhcCChhHHHhhh
Q psy4488 152 SGKNWRHVYKALSLLEYLIKTGSDKVTRIGG 182 (194)
Q Consensus 152 ~~k~WR~vyKsL~LLeyLlknGse~vv~~~r 182 (194)
++.+|--+| +|++|||+++||-++|=+++-
T Consensus 56 SPqe~EAl~-altvLe~cmkncGekfH~evg 85 (594)
T KOG1086|consen 56 SPQEWEALQ-ALTVLEYCMKNCGEKFHEEVG 85 (594)
T ss_pred ChhHHHHHH-HHHHHHHHHHhhhHHHHHHHH
Confidence 367798776 899999999999999877643
No 31
>PF01418 HTH_6: Helix-turn-helix domain, rpiR family; InterPro: IPR000281 This domain contains a helix-turn-helix motif []. Every member of this family is N-terminal to a SIS domain IPR001347 from INTERPRO. Members of this family are probably regulators of genes involved in phosphosugar metobolism.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2O3F_B 3IWF_B.
Probab=54.45 E-value=20 Score=25.29 Aligned_cols=61 Identities=18% Similarity=0.140 Sum_probs=41.1
Q ss_pred HHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcC
Q psy4488 89 AGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDS 152 (194)
Q Consensus 89 ~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~ 152 (194)
|++...++....++|+.|.+|-+..-+.|.-.....+.+||+.++-.. --+-.+.++|.-+
T Consensus 1 M~l~~~i~~~~~~ls~~e~~Ia~yil~~~~~~~~~si~elA~~~~vS~---sti~Rf~kkLG~~ 61 (77)
T PF01418_consen 1 MNLLEKIRSQYNSLSPTEKKIADYILENPDEIAFMSISELAEKAGVSP---STIVRFCKKLGFS 61 (77)
T ss_dssp --HHHHHHHHGGGS-HHHHHHHHHHHH-HHHHCT--HHHHHHHCTS-H---HHHHHHHHHCTTT
T ss_pred CcHHHHHHHHHhhCCHHHHHHHHHHHhCHHHHHHccHHHHHHHcCCCH---HHHHHHHHHhCCC
Confidence 457778888888999999999999888877666777899999987763 2334445566655
No 32
>PF08513 LisH: LisH; InterPro: IPR013720 The LisH motif is found in a large number of eukaryotic proteins, from metazoa, fungi and plants that have a wide range of functions. The recently solved structure of the LisH domain in the N-terminal region of LIS1 depicted it as a novel dimerization motif, and that other structural elements are likely to play an important role in dimerisation [, , ]. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex []. ; PDB: 2XTE_L 2XTC_B 2XTD_A 1UUJ_B.
Probab=46.31 E-value=25 Score=20.32 Aligned_cols=20 Identities=15% Similarity=0.370 Sum_probs=15.0
Q ss_pred HHHHHHHhcCChhHHHhhhh
Q psy4488 164 SLLEYLIKTGSDKVTRIGGI 183 (194)
Q Consensus 164 ~LLeyLlknGse~vv~~~r~ 183 (194)
.|.+||+++|-.....-+..
T Consensus 6 lI~~YL~~~Gy~~tA~~f~~ 25 (27)
T PF08513_consen 6 LIYDYLVENGYKETAKAFAK 25 (27)
T ss_dssp HHHHHHHHCT-HHHHHHHHH
T ss_pred HHHHHHHHCCcHHHHHHHHh
Confidence 57899999999887766553
No 33
>smart00667 LisH Lissencephaly type-1-like homology motif. Alpha-helical motif present in Lis1, treacle, Nopp140, some katanin p60 subunits, muskelin, tonneau, LEUNIG and numerous WD40 repeat-containing proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerisation, or else by binding cytoplasmic dynein heavy chain or microtubules directly.
Probab=45.41 E-value=22 Score=20.06 Aligned_cols=21 Identities=24% Similarity=0.377 Sum_probs=16.7
Q ss_pred HHHHHHHhcCChhHHHhhhhh
Q psy4488 164 SLLEYLIKTGSDKVTRIGGIF 184 (194)
Q Consensus 164 ~LLeyLlknGse~vv~~~r~~ 184 (194)
.|++||+.+|-......+...
T Consensus 9 lI~~yL~~~g~~~ta~~l~~e 29 (34)
T smart00667 9 LILEYLLRNGYEETAETLQKE 29 (34)
T ss_pred HHHHHHHHcCHHHHHHHHHHH
Confidence 489999999998877666543
No 34
>PF07571 DUF1546: Protein of unknown function (DUF1546); InterPro: IPR011442 These proteins are associated with IPR004823 from INTERPRO in transcription initiation factor TFIID subunit 6 (TAF6).; GO: 0051090 regulation of sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=43.46 E-value=90 Score=22.96 Aligned_cols=53 Identities=26% Similarity=0.331 Sum_probs=40.8
Q ss_pred CHHHHHHHHHHccC---hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhH
Q psy4488 121 SSSLMSEIADLTYN---VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKV 177 (194)
Q Consensus 121 s~~~m~eIa~~T~~---~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~v 177 (194)
+.++|..|++. |. +.=.++|++.+.+-|.|+.+.+-..|=|+.-|..| |++-|
T Consensus 26 AA~lL~~I~~~-~~~~~~~L~~Ri~~tl~k~l~d~~~~~~t~YGAi~gL~~l---G~~~v 81 (92)
T PF07571_consen 26 AASLLAQICRK-FSSSYPTLQPRITRTLLKALLDPKKPLGTHYGAIVGLSAL---GPEAV 81 (92)
T ss_pred HHHHHHHHHHH-hccccchHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHH---HHHHH
Confidence 34566666654 33 33456899999999999999999999999999988 66543
No 35
>KOG0037|consensus
Probab=41.38 E-value=35 Score=29.77 Aligned_cols=48 Identities=27% Similarity=0.650 Sum_probs=28.5
Q ss_pred HHHHHhhcCCCCCCCHHHHHHHHHHccC--------hhhHHHHHHHHHHHhCcCCCchHHHHHH
Q psy4488 107 VKVREATSNDPWGPSSSLMSEIADLTYN--------VVAFTEIMAMIWKRLNDSGKNWRHVYKA 162 (194)
Q Consensus 107 ~~VreATsnd~wGps~~~m~eIa~~T~~--------~~~~~~Im~~l~kRL~~~~k~WR~vyKs 162 (194)
+-+..|.+|..|.|=+-.=-.+--..|+ ..+| ..||+.|+ .||.||++
T Consensus 77 ~eLq~aLsn~~~~~Fs~~TcrlmI~mfd~~~~G~i~f~EF----~~Lw~~i~----~Wr~vF~~ 132 (221)
T KOG0037|consen 77 KELQQALSNGTWSPFSIETCRLMISMFDRDNSGTIGFKEF----KALWKYIN----QWRNVFRT 132 (221)
T ss_pred HHHHHHhhcCCCCCCCHHHHHHHHHHhcCCCCCccCHHHH----HHHHHHHH----HHHHHHHh
Confidence 4467899988887643322222222222 3333 57888876 49999874
No 36
>COG5602 SIN3 Histone deacetylase complex, SIN3 component [Chromatin structure and dynamics]
Probab=38.39 E-value=95 Score=32.63 Aligned_cols=72 Identities=17% Similarity=0.282 Sum_probs=50.1
Q ss_pred HHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHH-----------HHHHHhCcCCCchHHH
Q psy4488 91 LRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMA-----------MIWKRLNDSGKNWRHV 159 (194)
Q Consensus 91 ~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~-----------~l~kRL~~~~k~WR~v 159 (194)
+..++-|.+.+.++.|+....--.+ -=+|+-+...++...-|+-+.-++|++ ++.+||..+.++||.+
T Consensus 576 ~Le~i~d~~~~~~e~Eka~~~Lp~g-lg~~S~sIyKkvik~VY~KEhA~eile~L~k~P~vt~PiVlkRLk~Kd~EWR~~ 654 (1163)
T COG5602 576 ALEQIIDKIKDMEESEKANKTLPGG-LGLPSKSIYKKVIKKVYDKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSC 654 (1163)
T ss_pred HHHHHHHHhhcchhhHHHHhcCCCc-CCCccHHHHHHHHHHHhchhhHHHHHHHHHhCCCcchHHHHHHHHHhhHHHHHH
Confidence 4455555566667666654432221 237888999999999999998888887 4556888888888877
Q ss_pred HHHH
Q psy4488 160 YKAL 163 (194)
Q Consensus 160 yKsL 163 (194)
....
T Consensus 655 kRew 658 (1163)
T COG5602 655 KREW 658 (1163)
T ss_pred HHHH
Confidence 6544
No 37
>COG1777 Predicted transcriptional regulators [Transcription]
Probab=36.73 E-value=1.5e+02 Score=25.92 Aligned_cols=104 Identities=19% Similarity=0.220 Sum_probs=53.5
Q ss_pred CCCCCCCCCccchhhhhhHHhhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccC-hhhHHHHHHH
Q psy4488 66 ASSGFGSEGSAMRRQREDIQEGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYN-VVAFTEIMAM 144 (194)
Q Consensus 66 ~~~~~g~~~~~~rR~~~d~~~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~-~~~~~~Im~~ 144 (194)
+|..||..+ .-...|.+.+.+.-..+..+.-.++++.....++|- ....+|.++--. ...+.++|..
T Consensus 87 sp~~f~~~~---~~~~~~~l~~~r~~~~~~~~s~~~~~~l~srl~~~~---------~~~e~l~~~~~~L~~~~~el~~r 154 (217)
T COG1777 87 SPNFFGAER---FDLEEDDLESERSEVSKLFKSPEGISELISRLLEIN---------REIEELSRAQTELQKQLNELMDR 154 (217)
T ss_pred cCcccceec---cCccccchhhhhcchhhhcccchhHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 466676644 222333344444443333333333344444444432 123333333211 2466788888
Q ss_pred HHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhhhhh
Q psy4488 145 IWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGGIFT 185 (194)
Q Consensus 145 l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r~~~ 185 (194)
|...+.+....-- -..|++||++||.-+|-..++.+.
T Consensus 155 ik~~ied~~~~~~----~~~vl~~l~~n~~~~v~E~~r~~~ 191 (217)
T COG1777 155 IKEEIEDKDGDMT----ERIVLEYLLKNGAADVEETSRRTV 191 (217)
T ss_pred HHHHHhcccchHH----HHHHHHHHHhhhhhHHHHHHhccc
Confidence 8888776433211 245899999999655655555543
No 38
>PF13606 Ank_3: Ankyrin repeat
Probab=35.86 E-value=24 Score=20.52 Aligned_cols=16 Identities=25% Similarity=0.706 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHhcCCh
Q psy4488 160 YKALSLLEYLIKTGSD 175 (194)
Q Consensus 160 yKsL~LLeyLlknGse 175 (194)
+...-+++|||.+|+.
T Consensus 12 ~g~~e~v~~Ll~~gad 27 (30)
T PF13606_consen 12 NGNIEIVKYLLEHGAD 27 (30)
T ss_pred hCCHHHHHHHHHcCCC
Confidence 3456789999999963
No 39
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=35.76 E-value=65 Score=24.05 Aligned_cols=65 Identities=25% Similarity=0.354 Sum_probs=45.5
Q ss_pred HhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccC--hhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHh
Q psy4488 96 KNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYN--VVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIK 171 (194)
Q Consensus 96 kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~--~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlk 171 (194)
+-++..+++.+..||.+... .|..|+..+-+ ...|++|+..|++.+.|...+ |-.+-.+|+-|||
T Consensus 30 ~pVL~~~~D~d~rVRy~AcE--------aL~ni~k~~~~~~l~~f~~IF~~L~kl~~D~d~~---Vr~~a~~Ld~llk 96 (97)
T PF12755_consen 30 PPVLKCFDDQDSRVRYYACE--------ALYNISKVARGEILPYFNEIFDALCKLSADPDEN---VRSAAELLDRLLK 96 (97)
T ss_pred HHHHHHcCCCcHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchh---HHHHHHHHHHHhc
Confidence 44455567777788776543 34445544333 247899999999999998665 6668889999886
No 40
>smart00503 SynN Syntaxin N-terminal domain. Three-helix domain that (in Sso1p) slows the rate of its reaction with the SNAP-25 homologue Sec9p
Probab=33.78 E-value=78 Score=23.14 Aligned_cols=22 Identities=32% Similarity=0.520 Sum_probs=17.3
Q ss_pred HHHHHHHhhhccchHHHHHHHH
Q psy4488 90 GLRRNIKNLAHNYSDAQVKVRE 111 (194)
Q Consensus 90 ~~~r~~kn~v~~~t~~E~~Vre 111 (194)
.+.+..+.++..|-.+|...++
T Consensus 93 ~L~~~f~~~m~~fq~~Q~~~~~ 114 (117)
T smart00503 93 KLRKKFKEVMNEFQRLQRKYRE 114 (117)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4677888888888888887664
No 41
>PF11802 CENP-K: Centromere-associated protein K; InterPro: IPR020993 Cenp-K is one of seven new Cenp-A-nucleosome distal (CAD) centromere components (the others being Cenp-L, Cenp-O, Cenp-P, Cenp-Q, Cenp-R and Cenp-S) that are identified as assembling on the Cenp-A nucleosome associated complex, NAC []. The Cenp-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. Cenp-K is centromere-associated through its interaction with one or more components of the Cenp-A NAC.; GO: 0005634 nucleus
Probab=31.63 E-value=73 Score=28.58 Aligned_cols=49 Identities=14% Similarity=0.380 Sum_probs=34.7
Q ss_pred HHHHHHccChhhHHHHHHHHHHHhCc-CCCchHHHHHHH--HHHHHHHhcCC
Q psy4488 126 SEIADLTYNVVAFTEIMAMIWKRLND-SGKNWRHVYKAL--SLLEYLIKTGS 174 (194)
Q Consensus 126 ~eIa~~T~~~~~~~~Im~~l~kRL~~-~~k~WR~vyKsL--~LLeyLlknGs 174 (194)
......+.+...+.+|++.|.+|+-+ ++.-+..|-.+- --+|+||++|-
T Consensus 199 ~~~~e~~~~~~~l~eilE~LmN~l~~~p~DpYv~i~~~~WPpyie~LlR~GI 250 (268)
T PF11802_consen 199 KGEDEPSAQLITLREILEILMNKLLDSPHDPYVKIDDSFWPPYIELLLRSGI 250 (268)
T ss_pred ccccccchhhhHHHHHHHHHHHHhcCCCCCCceecCcccChHHHHHHHHcCC
Confidence 44667788888889999999999874 323333333333 37899999994
No 42
>PF02847 MA3: MA3 domain; InterPro: IPR003891 This entry represents the MI domain (after MA-3 and eIF4G), it is a protein-protein interaction module of ~130 amino acids [, , ]. It appears in several translation factors and is found in: One copy in plant and animal eIF4G 1 and 2 (DAP-5/NAT1/p97) Two copies in the animal programmed cell death protein 4 (PDCD4) or MA-3 that is induced during programmed cell death and inhibits neoplastic transformation Four tandem-repeated copies in a group of uncharacterised plant proteins The MI domain consists of seven alpha-helices, which pack into a globular form. The packing arrangement consists of repeating pairs of antiparallel helices packed one upon the other such that a superhelical axis is generated perpendicular to the alpha-helical axes []. The MI domain has also been named MA3 domain.; PDB: 2ION_A 2IOL_B 2NSZ_A 3EIQ_C 2HM8_A 2KZT_B 2IOS_A 2RG8_B 2ZU6_E 3EIJ_A ....
Probab=31.58 E-value=2e+02 Score=20.85 Aligned_cols=19 Identities=32% Similarity=0.245 Sum_probs=10.7
Q ss_pred CCCchHHHHHHHHHHHHHHhcC
Q psy4488 152 SGKNWRHVYKALSLLEYLIKTG 173 (194)
Q Consensus 152 ~~k~WR~vyKsL~LLeyLlknG 173 (194)
.|+.|.++.| ++.+++..|
T Consensus 92 ~P~~~~~la~---~~~~~i~~~ 110 (113)
T PF02847_consen 92 IPKAPEYLAK---FLARLIADG 110 (113)
T ss_dssp STTHHHHHHH---HHHHHHHTT
T ss_pred chHHHHHHHH---HHHHHHHcC
Confidence 4556666555 555555554
No 43
>cd03562 CID CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1. Pcf11 is a conserved and essential subunit of the yeast cleavage factor IA, which is required for polyadenylation-dependent 3'-RNA processing and transcription termination. Nrd1 is implicated in polyadenylation-independent 3'-RNA processing. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II. During transcription, Pol II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr 1-Ser 2-Pro 3-Thr 4-Ser 5-Pro 6-Ser 7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.
Probab=30.98 E-value=2.2e+02 Score=21.02 Aligned_cols=73 Identities=21% Similarity=0.181 Sum_probs=47.5
Q ss_pred HHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCChhHHHhhh
Q psy4488 106 QVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSDKVTRIGG 182 (194)
Q Consensus 106 E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse~vv~~~r 182 (194)
+..+++=|.+...+|+-..+..+|.. +......|++.+.+++...+. .+-+-.|-|++-+++|........+.
T Consensus 5 ~~~l~~L~~~~~S~~~I~~lt~~a~~--~~~~a~~iv~~i~~~i~~~~~--~~KL~~LYL~dsIvkn~~~~~~~~~~ 77 (114)
T cd03562 5 NALLEKLTFNKNSQPSIQTLTKLAIE--NRKHAKEIVEIIEKHIKKCPP--EQKLPLLYLLDSIVKNVGRKYKEFFS 77 (114)
T ss_pred HHHHHHHHcCcccHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhCCc--ccchHHHHHHHHHHHHcccchHHHHH
Confidence 34455556666555554444444443 345667899999999876543 56667899999999997665544433
No 44
>COG2307 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.76 E-value=3.6e+02 Score=24.78 Aligned_cols=143 Identities=16% Similarity=0.143 Sum_probs=77.5
Q ss_pred CccchhhHHHHHHHHHHhhCCH-------HHHHHHHHHHHHhhhhhccC---------------CCCCC-----------
Q psy4488 24 GKDQGFNVREKAKALATLLGDE-------ERLRNERARQLKARERFARS---------------ASSGF----------- 70 (194)
Q Consensus 24 g~D~G~~VR~kak~l~~LL~D~-------~~lr~eR~~a~~~r~k~~~~---------------~~~~~----------- 70 (194)
+.++|..+=..+-.|--|+.|+ .-|...|..|+..|+++..- .+.+.
T Consensus 56 ~~~~~~g~~t~~~~~~~l~~D~~~P~Sl~ssl~~ar~nAr~VR~~ls~etWe~LN~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (313)
T COG2307 56 DYLAGYGVLTAADVLDFLTRDRDNPSSLVSSLEAARENARAVRDRLSSETWEALNELYLALKRPAARAAGDAGLHEFLAR 135 (313)
T ss_pred cccccccccchhHHHHHHHhCCCCcHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHhhhhccCccccchHHHHHH
Confidence 3445555555577777788887 56888999999999887430 11111
Q ss_pred ---------CCCCccchhhhhhHHhhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHH
Q psy4488 71 ---------GSEGSAMRRQREDIQEGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVVAFTEI 141 (194)
Q Consensus 71 ---------g~~~~~~rR~~~d~~~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~I 141 (194)
|.-...|-| |..--++.+.+-+.-.-+ |.-=+.++-.+...||.|++ ...-|.+..-.++.|..
T Consensus 136 i~~~~~~f~G~a~~tM~R---d~gwrFm~iGr~iER~d~--~aRlL~~~~~~~~~~~a~~~-~w~slLr~~sa~~~Yrr- 208 (313)
T COG2307 136 IKRRLALFAGLAHETMLR---DDGWRFMRIGRRIERADN--TARLLDVKLYVLAVPAAGDG-AWESLLRSGSAHETYRR- 208 (313)
T ss_pred HHHHHHhhccccccccch---hhhHHHHHHHHHHHHHHH--HHHHHHHHHhcccCccCcch-HHHHHHHHhhchhhhhh-
Confidence 222346666 555555666665554432 22222222222357888888 44444444444444432
Q ss_pred HHHHHHHhCcCCCchHHHHHHHHHHHHHHhcCCh-h----HHHhhhhhhh
Q psy4488 142 MAMIWKRLNDSGKNWRHVYKALSLLEYLIKTGSD-K----VTRIGGIFTH 186 (194)
Q Consensus 142 m~~l~kRL~~~~k~WR~vyKsL~LLeyLlknGse-~----vv~~~r~~~~ 186 (194)
..+-.++.-.|.|+|+.++-. + |.+.++.|+.
T Consensus 209 -------------~y~a~~~~~~VaelLlLd~~~PRSl~~~l~~l~~~L~ 245 (313)
T COG2307 209 -------------RYRAVIDPANVAELLLLDELNPRSLLYQLEELEANLS 245 (313)
T ss_pred -------------eeecccCchhHHHHHhcCCCCchhHHHHHHHHHHHHH
Confidence 223334556667777766642 1 4444554444
No 45
>PF04904 NCD1: NAB conserved region 1 (NCD1); InterPro: IPR006988 Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of eukaryotic (metazoa) transcription factors []. This entry represents the N-terminal NAB domain, which interacts with the EGR1 inhibitory domain (R1) []. It may also mediate multimerisation.; GO: 0045892 negative regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=29.82 E-value=73 Score=23.65 Aligned_cols=39 Identities=23% Similarity=0.304 Sum_probs=29.1
Q ss_pred HHHHHHHHHHccChhhHHHHHHHHHHHhCcCCCchHHHHHHHH
Q psy4488 122 SSLMSEIADLTYNVVAFTEIMAMIWKRLNDSGKNWRHVYKALS 164 (194)
Q Consensus 122 ~~~m~eIa~~T~~~~~~~~Im~~l~kRL~~~~k~WR~vyKsL~ 164 (194)
|...++++.+. .++|.+||+++- +..++-.-|+..|||.
T Consensus 33 gDDvqQL~~~~--e~eF~eim~lvG--M~sKPLHVrRlqKAL~ 71 (82)
T PF04904_consen 33 GDDVQQLCEAG--EEEFLEIMALVG--MASKPLHVRRLQKALQ 71 (82)
T ss_pred ChhHHHHHhcC--hHHHHHHHHHhC--ccCccHHHHHHHHHHH
Confidence 34466666654 379999999885 6777888899999875
No 46
>PF06743 FAST_1: FAST kinase-like protein, subdomain 1; InterPro: IPR010622 This entry represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis []. Note that many family members are hypothetical proteins.; GO: 0004672 protein kinase activity
Probab=27.85 E-value=1.1e+02 Score=21.20 Aligned_cols=47 Identities=23% Similarity=0.327 Sum_probs=33.4
Q ss_pred HHHHHHHHHccChhhHHHHHHHHHHHhCc-----CCCchHHHHHHHHHHHHH
Q psy4488 123 SLMSEIADLTYNVVAFTEIMAMIWKRLND-----SGKNWRHVYKALSLLEYL 169 (194)
Q Consensus 123 ~~m~eIa~~T~~~~~~~~Im~~l~kRL~~-----~~k~WR~vyKsL~LLeyL 169 (194)
+++.--|..-|.|..-.+..+.+.++|.. ++..|-.+.=||.+++|.
T Consensus 3 ~il~~fa~LNy~P~~~~~f~~~~~~~L~~~l~~~~p~~ll~~v~Sl~~l~~~ 54 (71)
T PF06743_consen 3 SILLPFARLNYQPPNAEEFFEKLIERLESYLDEFSPEDLLDLVWSLCLLQRF 54 (71)
T ss_pred HHHHHHHhcCCCCCCHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHhhC
Confidence 34555677778887766666666665553 477899999999998874
No 47
>PF07276 PSGP: Apopolysialoglycoprotein (PSGP); InterPro: IPR009900 This entry represents a series of 13 residue repeats found in the apopolysialoglycoprotein of Oncorhynchus mykiss (Rainbow trout) and Oncorhynchus masou (Cherry salmon). Polysialoglycoprotein (PSGP) of unfertilised eggs of rainbow trout consists of tandem repeats of a glycotridecapeptide, Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser- Gly (* denotes the attachment site of a polysialoglycan chain). In response to egg activation, PSGP is discharged by exocytosis into the space between the vitelline envelope and the plasma membrane, i.e. the perivitelline space, where the 200 kDa PSGP molecules undergo rapid and dramatic depolymerisation by proteolysis into glycotridecapeptides [].
Probab=26.27 E-value=25 Score=17.35 Aligned_cols=12 Identities=50% Similarity=0.667 Sum_probs=9.5
Q ss_pred HhhcCCCCCCCH
Q psy4488 111 EATSNDPWGPSS 122 (194)
Q Consensus 111 eATsnd~wGps~ 122 (194)
+|||...-||++
T Consensus 2 datseaatgpsg 13 (13)
T PF07276_consen 2 DATSEAATGPSG 13 (13)
T ss_pred ccccccccCCCC
Confidence 588888888874
No 48
>KOG2160|consensus
Probab=24.78 E-value=4.1e+02 Score=24.65 Aligned_cols=103 Identities=15% Similarity=0.171 Sum_probs=76.0
Q ss_pred CCCccchhhhhhHHhhhHHHHHHHHhhhccchHHHHHHHHhhcCCCCCCCHHHHHHHHHHccChh----hHHHH--HHHH
Q psy4488 72 SEGSAMRRQREDIQEGVAGLRRNIKNLAHNYSDAQVKVREATSNDPWGPSSSLMSEIADLTYNVV----AFTEI--MAMI 145 (194)
Q Consensus 72 ~~~~~~rR~~~d~~~s~~~~~r~~kn~v~~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~~~~----~~~~I--m~~l 145 (194)
+..+..|..+-+...+.-+=-=.+++.|+.+.-.+.++.-.++++|-...++.+-.|+-.-.+.. .|..| ..+|
T Consensus 135 ~~~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~~ls~~~~~~~r~kaL~AissLIRn~~~g~~~fl~~~G~~~L 214 (342)
T KOG2160|consen 135 NSDAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLKILSSDDPNTVRTKALFAISSLIRNNKPGQDEFLKLNGYQVL 214 (342)
T ss_pred CCcHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHHHHccCCCchHHHHHHHHHHHHHhcCcHHHHHHHhcCCHHHH
Confidence 45567777777776655544445677777777888888888988898888899998887766642 22222 3577
Q ss_pred HHHhCcCCCchHHHHHHHHHHHHHHhcCC
Q psy4488 146 WKRLNDSGKNWRHVYKALSLLEYLIKTGS 174 (194)
Q Consensus 146 ~kRL~~~~k~WR~vyKsL~LLeyLlknGs 174 (194)
..-|..++..-+-.-|++.|+.||+.--.
T Consensus 215 ~~vl~~~~~~~~lkrK~~~Ll~~Ll~~~~ 243 (342)
T KOG2160|consen 215 RDVLQSNNTSVKLKRKALFLLSLLLQEDK 243 (342)
T ss_pred HHHHHcCCcchHHHHHHHHHHHHHHHhhh
Confidence 77788888889999999999999997544
No 49
>PF08467 Luteo_P1-P2: Luteovirus RNA polymerase P1-P2/replicase; InterPro: IPR013674 This domain is found in RNA-dependent RNA polymerase P1-P2 fusion/replicase proteins in plant Luteoviruses. ; GO: 0003968 RNA-directed RNA polymerase activity
Probab=23.90 E-value=1.6e+02 Score=27.33 Aligned_cols=44 Identities=11% Similarity=0.123 Sum_probs=38.4
Q ss_pred CccchhhHHHHHHHHHHhhCCHHHHHHHHHHHHHhhhhhccCCC
Q psy4488 24 GKDQGFNVREKAKALATLLGDEERLRNERARQLKARERFARSAS 67 (194)
Q Consensus 24 g~D~G~~VR~kak~l~~LL~D~~~lr~eR~~a~~~r~k~~~~~~ 67 (194)
..|+-..-+.|++.+..||++.+.+..-+.++++...+-.|-++
T Consensus 137 ~~~~~~~~~~~~ek~~~Ll~~~~~f~~~e~ri~~~Y~ee~G~~p 180 (361)
T PF08467_consen 137 CEPKSLENPARVEKFINLLEKKHVFSATEVRIKKVYEEEIGRGP 180 (361)
T ss_pred CCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 45677788999999999999999999999999999888877654
No 50
>PF00046 Homeobox: Homeobox domain not present here.; InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=23.61 E-value=62 Score=20.81 Aligned_cols=32 Identities=25% Similarity=0.207 Sum_probs=27.3
Q ss_pred cchHHHHHHHHhhcCCCCCCCHHHHHHHHHHc
Q psy4488 101 NYSDAQVKVREATSNDPWGPSSSLMSEIADLT 132 (194)
Q Consensus 101 ~~t~~E~~VreATsnd~wGps~~~m~eIa~~T 132 (194)
.+|+.+..+.++.-....-|+...+.+||..+
T Consensus 6 ~~t~~q~~~L~~~f~~~~~p~~~~~~~la~~l 37 (57)
T PF00046_consen 6 RFTKEQLKVLEEYFQENPYPSKEEREELAKEL 37 (57)
T ss_dssp SSSHHHHHHHHHHHHHSSSCHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHhccccccccccccccc
Confidence 57888888888887777789999999999876
No 51
>PF05178 Kri1: KRI1-like family; InterPro: IPR018034 The Kri1 protein is also known as KRR1-interacting protein 1. The Saccharomyces cerevisiae member of this family is found to be required for the assembly of preribosomal 40S subunits in the nucleolus []. KRR1 is highly expressed in dividing cells and its expression ceases almost completely when cells enter the stationary phase.
Probab=23.18 E-value=3.4e+02 Score=20.58 Aligned_cols=37 Identities=11% Similarity=0.216 Sum_probs=28.8
Q ss_pred HHHHhhcCCCCCCCHHHHHHHHHHccChhhHHHHHHHHHH
Q psy4488 108 KVREATSNDPWGPSSSLMSEIADLTYNVVAFTEIMAMIWK 147 (194)
Q Consensus 108 ~VreATsnd~wGps~~~m~eIa~~T~~~~~~~~Im~~l~k 147 (194)
+|++++.+..-+-+...+. ..-|+|..|-..|.-|..
T Consensus 32 kik~~~G~~~~~~~~~dl~---e~dfDp~~~D~~M~~~F~ 68 (101)
T PF05178_consen 32 KIKEVAGLEDIDLDEEDLD---EGDFDPEEWDRKMQKIFN 68 (101)
T ss_pred HHHHHhCCCccCCCHHHhh---cccCChHHHHHHHHHHhh
Confidence 5688888876666655554 789999999999988764
No 52
>PF06336 Corona_5a: Coronavirus 5a protein; InterPro: IPR009404 This family consists of several Coronavirus 5a proteins. The function of this family is unknown [].
Probab=22.77 E-value=58 Score=22.84 Aligned_cols=33 Identities=18% Similarity=0.371 Sum_probs=23.3
Q ss_pred cCCCchHHHHHHHHHHH------HHHhcCChhHHHhhhh
Q psy4488 151 DSGKNWRHVYKALSLLE------YLIKTGSDKVTRIGGI 183 (194)
Q Consensus 151 ~~~k~WR~vyKsL~LLe------yLlknGse~vv~~~r~ 183 (194)
+-|+..-..||+|.|.. .++-||+.++....|.
T Consensus 6 sfgra~iscyksllltqlrvldrlil~hgp~r~ltc~rr 44 (65)
T PF06336_consen 6 SFGRAFISCYKSLLLTQLRVLDRLILDHGPKRTLTCARR 44 (65)
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHhcCCcceehhhHH
Confidence 33556777899987643 3567999987777664
No 53
>smart00389 HOX Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes
Probab=22.30 E-value=1.4e+02 Score=18.83 Aligned_cols=32 Identities=19% Similarity=0.062 Sum_probs=25.7
Q ss_pred chHHHHHHHHhhcCCCCCCCHHHHHHHHHHcc
Q psy4488 102 YSDAQVKVREATSNDPWGPSSSLMSEIADLTY 133 (194)
Q Consensus 102 ~t~~E~~VreATsnd~wGps~~~m~eIa~~T~ 133 (194)
+|+.++.+.++.-....-|+...+.+||..+-
T Consensus 7 ~~~~~~~~L~~~f~~~~~P~~~~~~~la~~~~ 38 (56)
T smart00389 7 FTPEQLEELEKEFQKNPYPSREEREELAAKLG 38 (56)
T ss_pred CCHHHHHHHHHHHHhCCCCCHHHHHHHHHHHC
Confidence 67888888777766655799999999998873
No 54
>PF08542 Rep_fac_C: Replication factor C C-terminal domain; InterPro: IPR013748 Replication factor C (RFC) is a multimeric AAA+ protein complex that loads the DNA polymerase processivity clamp PCNA (Proliferating Cell Nuclear Antigen) onto DNA using ATP to drive the reaction []. PCNA functions at multiple levels in directing DNA metabolic pathways []. When bound to DNA, PCNA organises various proteins involved in DNA replication, DNA repair, DNA modification, and chromatin modelling. Replication factor C consists of five subunits in a spiral arrangement: Rfc1, Rfc2, Rfc3, Rfc4, and Rfc5 subunits. Rfc1 and Rfc2 load the PCNA sliding clamp onto DNA, while Rfc3 binds ATP and also acts as a checkpoint sensor. The RFC complex contains four ATP sites (sites A, B, C, and D) located at subunit interfaces. In each ATP site, an arginine residue from one subunit is located near the gamma-phosphate of ATP bound in the adjacent subunit. These arginine residues act as "arginine fingers" that can potentially perform two functions: sensing that ATP is bound and catalyzing ATP hydrolysis []. This entry represents the core domain found in Rfc1-5.; GO: 0003689 DNA clamp loader activity, 0005524 ATP binding, 0006260 DNA replication, 0005663 DNA replication factor C complex; PDB: 1SXJ_B 2CHG_B 2CHV_F 2CHQ_C 1IQP_A.
Probab=21.91 E-value=2.3e+02 Score=19.71 Aligned_cols=41 Identities=17% Similarity=0.394 Sum_probs=27.5
Q ss_pred HHHHHHHHHHhCc---CCCchHHHHHHHHHHHHHHhcCChhHHH
Q psy4488 139 TEIMAMIWKRLND---SGKNWRHVYKALSLLEYLIKTGSDKVTR 179 (194)
Q Consensus 139 ~~Im~~l~kRL~~---~~k~WR~vyKsL~LLeyLlknGse~vv~ 179 (194)
..|+.-|...+.. +......+.+-+--.||=+..|+...++
T Consensus 38 ~~Il~~l~~~l~~~~~~~~~k~~i~~~la~~e~rl~~G~~e~lQ 81 (89)
T PF08542_consen 38 SDILKQLHEVLVESDIPDSQKAEILKILAEIEYRLSDGASEILQ 81 (89)
T ss_dssp HHHHHHHHHHHHTSTSSHHHHHHHHHHHHHHHHHHHTTC-HHHH
T ss_pred HHHHHHHHHHHHHhhccHHHHHHHHHHHHHHhHHHHCCCCHHHH
Confidence 3555555555443 3455778888888889999998877654
No 55
>PF00023 Ank: Ankyrin repeat Hereditary spherocytosis; InterPro: IPR002110 The ankyrin repeat is one of the most common protein-protein interaction motifs in nature. Ankyrin repeats are tandemly repeated modules of about 33 amino acids. They occur in a large number of functionally diverse proteins mainly from eukaryotes. The few known examples from prokaryotes and viruses may be the result of horizontal gene transfers []. The repeat has been found in proteins of diverse function such as transcriptional initiators, cell-cycle regulators, cytoskeletal, ion transporters and signal transducers. The ankyrin fold appears to be defined by its structure rather than its function since there is no specific sequence or structure which is universally recognised by it. The conserved fold of the ankyrin repeat unit is known from several crystal and solution structures [, , , ]. Each repeat folds into a helix-loop-helix structure with a beta-hairpin/loop region projecting out from the helices at a 90o angle. The repeats stack together to form an L-shaped structure [, ].; GO: 0005515 protein binding; PDB: 1D9S_A 1NFI_F 1IKN_D 1WDY_A 1OT8_C 1QYM_A 1TR4_A 1UOH_A 1N11_A 1K1A_A ....
Probab=21.40 E-value=61 Score=18.60 Aligned_cols=14 Identities=29% Similarity=0.603 Sum_probs=11.4
Q ss_pred HHHHHHHHHhcCCh
Q psy4488 162 ALSLLEYLIKTGSD 175 (194)
Q Consensus 162 sL~LLeyLlknGse 175 (194)
..-++++||.+|+.
T Consensus 14 ~~~~v~~Ll~~ga~ 27 (33)
T PF00023_consen 14 HPDIVKLLLKHGAD 27 (33)
T ss_dssp CHHHHHHHHHTTSC
T ss_pred HHHHHHHHHHCcCC
Confidence 56688999999875
No 56
>cd00086 homeodomain Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Probab=20.95 E-value=1.5e+02 Score=18.70 Aligned_cols=33 Identities=18% Similarity=0.066 Sum_probs=25.7
Q ss_pred cchHHHHHHHHhhcCCCCCCCHHHHHHHHHHcc
Q psy4488 101 NYSDAQVKVREATSNDPWGPSSSLMSEIADLTY 133 (194)
Q Consensus 101 ~~t~~E~~VreATsnd~wGps~~~m~eIa~~T~ 133 (194)
.+|+.+..+.++.-...--|+...+.+||..+-
T Consensus 6 ~~~~~~~~~Le~~f~~~~~P~~~~~~~la~~~~ 38 (59)
T cd00086 6 RFTPEQLEELEKEFEKNPYPSREEREELAKELG 38 (59)
T ss_pred cCCHHHHHHHHHHHHhCCCCCHHHHHHHHHHHC
Confidence 467777777777766655889999999999873
Done!