Query psy4701
Match_columns 124
No_of_seqs 120 out of 196
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 23:40:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4701.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4701hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4151|consensus 99.6 1.1E-15 2.4E-20 137.2 6.3 88 2-96 516-603 (748)
2 PF11701 UNC45-central: Myosin 97.6 5.5E-05 1.2E-09 56.5 3.4 47 77-123 72-120 (157)
3 cd00020 ARM Armadillo/beta-cat 97.3 0.0026 5.6E-08 42.4 8.2 70 34-104 7-81 (120)
4 cd00020 ARM Armadillo/beta-cat 97.0 0.0033 7.2E-08 41.9 6.6 84 3-89 20-103 (120)
5 PLN03200 cellulose synthase-in 95.9 0.044 9.5E-07 55.4 9.2 83 3-88 202-285 (2102)
6 PLN03200 cellulose synthase-in 94.0 0.27 5.8E-06 50.1 8.5 84 35-119 447-558 (2102)
7 PF00514 Arm: Armadillo/beta-c 94.0 0.08 1.7E-06 30.5 3.1 31 33-64 11-41 (41)
8 KOG0166|consensus 93.9 0.12 2.6E-06 46.0 5.4 97 7-119 83-180 (514)
9 KOG4224|consensus 93.7 0.055 1.2E-06 47.3 2.9 89 35-123 334-449 (550)
10 PF10508 Proteasom_PSMB: Prote 91.9 1.2 2.6E-05 38.9 8.8 93 8-120 137-231 (503)
11 smart00185 ARM Armadillo/beta- 90.9 0.36 7.8E-06 26.7 3.0 30 34-64 12-41 (41)
12 KOG0166|consensus 89.7 0.84 1.8E-05 40.8 5.7 75 49-123 287-396 (514)
13 COG5064 SRP1 Karyopherin (impo 88.6 0.15 3.3E-06 44.3 0.4 83 35-117 115-206 (526)
14 PF13646 HEAT_2: HEAT repeats; 86.6 2.9 6.4E-05 26.6 5.5 27 89-115 60-87 (88)
15 PF04826 Arm_2: Armadillo-like 84.6 3.1 6.8E-05 33.6 5.9 89 5-100 110-200 (254)
16 PF04826 Arm_2: Armadillo-like 83.8 10 0.00022 30.6 8.5 59 35-94 13-71 (254)
17 PF13646 HEAT_2: HEAT repeats; 82.7 4.9 0.00011 25.5 5.3 26 91-116 31-56 (88)
18 PF00514 Arm: Armadillo/beta-c 82.3 3.5 7.6E-05 23.4 4.0 31 89-119 10-40 (41)
19 KOG2171|consensus 81.4 9.9 0.00021 36.9 8.6 84 6-96 325-408 (1075)
20 PF05804 KAP: Kinesin-associat 80.9 7 0.00015 36.2 7.4 56 32-89 329-384 (708)
21 PRK09687 putative lyase; Provi 75.5 26 0.00056 28.4 8.5 30 33-63 158-187 (280)
22 PF13513 HEAT_EZ: HEAT-like re 74.6 0.93 2E-05 27.2 -0.1 54 5-62 2-55 (55)
23 KOG4224|consensus 72.9 23 0.00049 31.4 7.9 82 35-118 252-361 (550)
24 COG5064 SRP1 Karyopherin (impo 71.3 13 0.00028 32.7 6.0 93 7-105 132-234 (526)
25 KOG4151|consensus 64.1 3.9 8.5E-05 38.1 1.5 48 75-123 254-301 (748)
26 PF10508 Proteasom_PSMB: Prote 62.7 54 0.0012 28.7 8.3 112 5-120 175-319 (503)
27 PRK09687 putative lyase; Provi 59.7 71 0.0015 25.8 8.0 50 5-62 69-118 (280)
28 PF11701 UNC45-central: Myosin 59.0 22 0.00047 26.3 4.5 54 46-100 98-155 (157)
29 PF05004 IFRD: Interferon-rela 58.8 49 0.0011 27.3 7.0 28 91-118 228-255 (309)
30 PRK13800 putative oxidoreducta 56.0 59 0.0013 30.5 7.8 95 4-117 789-896 (897)
31 PF14500 MMS19_N: Dos2-interac 53.8 26 0.00057 28.3 4.6 71 4-75 13-83 (262)
32 PRK13800 putative oxidoreducta 52.3 94 0.002 29.2 8.5 45 4-62 635-679 (897)
33 COG1017 Hmp Hemoglobin-like fl 52.0 31 0.00068 26.3 4.4 46 76-121 19-67 (150)
34 PF05804 KAP: Kinesin-associat 43.1 71 0.0015 29.8 6.1 88 20-114 277-373 (708)
35 PF10165 Ric8: Guanine nucleot 43.0 71 0.0015 27.7 5.8 92 25-119 233-336 (446)
36 PF03224 V-ATPase_H_N: V-ATPas 39.3 85 0.0018 25.3 5.4 78 5-86 120-200 (312)
37 COG1560 HtrB Lauroyl/myristoyl 37.8 29 0.00063 28.9 2.5 33 55-88 55-91 (308)
38 PF02985 HEAT: HEAT repeat; I 36.2 71 0.0015 16.9 4.1 28 92-119 1-28 (31)
39 PTZ00429 beta-adaptin; Provisi 35.8 3.1E+02 0.0067 25.8 9.0 57 4-66 154-210 (746)
40 PF07539 DRIM: Down-regulated 34.2 66 0.0014 23.7 3.7 31 87-117 13-43 (141)
41 PF07208 DUF1414: Protein of u 33.0 47 0.001 20.4 2.3 21 55-75 8-34 (44)
42 KOG2122|consensus 33.0 57 0.0012 33.7 4.0 26 48-74 365-390 (2195)
43 PF12031 DUF3518: Domain of un 31.2 1.7E+02 0.0038 24.2 5.9 72 5-76 139-214 (257)
44 PF01602 Adaptin_N: Adaptin N 28.3 2.9E+02 0.0062 23.2 7.1 52 4-62 128-179 (526)
45 KOG4500|consensus 26.0 98 0.0021 28.1 3.9 83 5-90 57-146 (604)
46 PF07571 DUF1546: Protein of u 23.7 1.1E+02 0.0023 20.8 3.0 66 4-74 20-86 (92)
47 PF03130 HEAT_PBS: PBS lyase H 23.1 27 0.00058 18.4 -0.1 24 7-43 2-25 (27)
48 PF04063 DUF383: Domain of unk 22.7 2.7E+02 0.0058 21.5 5.5 51 59-120 82-132 (192)
49 PF03278 IpaB_EvcA: IpaB/EvcA 21.7 3.7E+02 0.008 20.4 6.7 62 60-121 57-130 (152)
50 smart00567 EZ_HEAT E-Z type HE 21.5 1.1E+02 0.0024 15.8 2.3 26 6-44 3-28 (30)
51 PF12755 Vac14_Fab1_bd: Vacuol 20.5 2.5E+02 0.0054 19.2 4.4 85 8-111 4-88 (97)
No 1
>KOG4151|consensus
Probab=99.60 E-value=1.1e-15 Score=137.24 Aligned_cols=88 Identities=43% Similarity=0.585 Sum_probs=83.9
Q ss_pred CchhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhh
Q psy4701 2 GTVKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLE 81 (124)
Q Consensus 2 ~~~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~ 81 (124)
+|+.|++++.|||| + .+-|++++++++++|+.+++|+|+.++++||+|+|||||||++++.|.+|+++++...
T Consensus 516 q~e~akl~~~~aL~--~-----~i~f~~~~~~~v~~~~~s~~~~d~~~~en~E~L~altnLas~s~s~r~~i~ke~~~~~ 588 (748)
T KOG4151|consen 516 QFEEAKLKWYHALA--G-----KIDFPGERSYEVVKPLDSALHNDEKGLENFEALEALTNLASISESDRQKILKEKALGK 588 (748)
T ss_pred hchHHHHHHHHHHh--h-----hcCCCCCchhhhhhhhcchhhhhHHHHHHHHHHHHhhcccCcchhhHHHHHHHhcchh
Confidence 47889999999999 2 8899999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhhccchHHHHH
Q psy4701 82 VGSSIMAKEGILEMV 96 (124)
Q Consensus 82 IEnlLls~n~~Lqra 96 (124)
||+++.++|..+||+
T Consensus 589 ie~~~~ee~~~lqra 603 (748)
T KOG4151|consen 589 IEELMTEENPALQRA 603 (748)
T ss_pred hHHHhhcccHHHHHH
Confidence 999999999999887
No 2
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=97.62 E-value=5.5e-05 Score=56.53 Aligned_cols=47 Identities=38% Similarity=0.526 Sum_probs=44.2
Q ss_pred cchhhhhhhhhccchHHHHHHHHhC--CcchHHHHHHHHHHHHHhhhcc
Q psy4701 77 KGPLEVGSSIMAKEGILEMVLVMAN--TEDILQQKVACECIIAVASKKD 123 (124)
Q Consensus 77 ~G~s~IEnlLls~n~~Lqral~M~~--S~d~~~Q~~a~e~ii~a~sk~~ 123 (124)
+|++++++.+|..++.++.++.|++ +++...|.+++|+|.+||+|++
T Consensus 72 p~~~dv~~~l~~~eg~~~~l~~~~~~~~~~~~~~~~~lell~aAc~d~~ 120 (157)
T PF11701_consen 72 PGPPDVGSELFLSEGFLESLLPLASRKSKDRKVQKAALELLSAACIDKS 120 (157)
T ss_dssp TTTHHHHHHHCCTTTHHHHHHHHHH-CTS-HHHHHHHHHHHHHHTTSHH
T ss_pred CCCHHHHHHHHhhhhHHHHHHHHHhcccCCHHHHHHHHHHHHHHHccHH
Confidence 7889999999999999999999999 9999999999999999999986
No 3
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=97.29 E-value=0.0026 Score=42.42 Aligned_cols=70 Identities=19% Similarity=0.146 Sum_probs=55.6
Q ss_pred cchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhccchHHHHH-----HHHhCCcc
Q psy4701 34 EVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMAKEGILEMV-----LVMANTED 104 (124)
Q Consensus 34 ~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~n~~Lqra-----l~M~~S~d 104 (124)
..|++|..+|. +.+.-....++.+|.|+++.+++.++.+++.++++.+-+++-+++..+++. .+++.++.
T Consensus 7 ~~i~~l~~~l~-~~~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~~~ 81 (120)
T cd00020 7 GGLPALVSLLS-SSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAGPE 81 (120)
T ss_pred CChHHHHHHHH-cCCHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHccCcH
Confidence 47899999997 444667899999999999999999999999899999998888766655554 44555443
No 4
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=97.02 E-value=0.0033 Score=41.88 Aligned_cols=84 Identities=24% Similarity=0.177 Sum_probs=57.9
Q ss_pred chhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhh
Q psy4701 3 TVKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEV 82 (124)
Q Consensus 3 ~~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~I 82 (124)
.+..|..|..+|+.+....++.. ..- ...+.+++|..+|..+ +.--...++.+|+||+..+++.+..+++.++.+.+
T Consensus 20 ~~~~~~~a~~~l~~l~~~~~~~~-~~~-~~~~~i~~l~~~l~~~-~~~v~~~a~~~L~~l~~~~~~~~~~~~~~g~l~~l 96 (120)
T cd00020 20 DENVQREAAWALSNLSAGNNDNI-QAV-VEAGGLPALVQLLKSE-DEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKL 96 (120)
T ss_pred CHHHHHHHHHHHHHHhcCCHHHH-HHH-HHCCChHHHHHHHhCC-CHHHHHHHHHHHHHHccCcHHHHHHHHHCCChHHH
Confidence 35678889999999877533222 111 1236899999999744 44555799999999999888888888875555555
Q ss_pred hhhhhcc
Q psy4701 83 GSSIMAK 89 (124)
Q Consensus 83 EnlLls~ 89 (124)
-.++-+.
T Consensus 97 ~~~l~~~ 103 (120)
T cd00020 97 VNLLDSS 103 (120)
T ss_pred HHHHhcC
Confidence 5554443
No 5
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=95.94 E-value=0.044 Score=55.41 Aligned_cols=83 Identities=23% Similarity=0.189 Sum_probs=62.3
Q ss_pred chhhHHHHHHHHHHHhHhh-CCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhh
Q psy4701 3 TVKGKRQAAQGLARLGITL-NPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLE 81 (124)
Q Consensus 3 ~~~~~~~A~qaLAkilIs~-nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~ 81 (124)
.+.-+..|+.+|++++-+. +|.-..- ...+||+|+.+|..+.+.--+++|..||+|||+-+++.|+.|++.+|.+.
T Consensus 202 d~~lQ~eAa~aLa~Lass~ee~~~aVI---eaGaVP~LV~LL~sg~~~~VRE~AA~AL~nLAs~s~e~r~~Iv~aGgIp~ 278 (2102)
T PLN03200 202 NSDAQANAASLLARLMMAFESSISKVL---DAGAVKQLLKLLGQGNEVSVRAEAAGALEALSSQSKEAKQAIADAGGIPA 278 (2102)
T ss_pred CHHHHHHHHHHHHHHHcCChHHHHHHH---HCCCHHHHHHHHccCCChHHHHHHHHHHHHHhcCCHHHHHHHHHCCCHHH
Confidence 4456778888888885442 2321111 24589999999975555556899999999999999999999999899887
Q ss_pred hhhhhhc
Q psy4701 82 VGSSIMA 88 (124)
Q Consensus 82 IEnlLls 88 (124)
+=+++-+
T Consensus 279 LI~lL~s 285 (2102)
T PLN03200 279 LINATVA 285 (2102)
T ss_pred HHHHHhC
Confidence 7777653
No 6
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=94.01 E-value=0.27 Score=50.05 Aligned_cols=84 Identities=15% Similarity=0.135 Sum_probs=64.2
Q ss_pred chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhc--------------------------
Q psy4701 35 VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMA-------------------------- 88 (124)
Q Consensus 35 ~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls-------------------------- 88 (124)
.|++|+.+|.. .+.-.+.+|+-+|-||+.-+++.|..|+..+|++.+-+++-+
T Consensus 447 gIp~LV~LL~s-~s~~iQ~~A~~~L~nLa~~ndenr~aIieaGaIP~LV~LL~s~~~~iqeeAawAL~NLa~~~~qir~i 525 (2102)
T PLN03200 447 GVQLLISLLGL-SSEQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETGSQKAKEDSATVLWNLCCHSEDIRAC 525 (2102)
T ss_pred cHHHHHHHHcC-CCHHHHHHHHHHHHHHHcCCHHHHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHHhCCcHHHHHH
Confidence 79999999973 456777889999999999889999999988888887777743
Q ss_pred --cchHHHHHHHHhCCcchHHHHHHHHHHHHHh
Q psy4701 89 --KEGILEMVLVMANTEDILQQKVACECIIAVA 119 (124)
Q Consensus 89 --~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~ 119 (124)
+.+.+.-.+.+..+.+...|..|+.+|-+.+
T Consensus 526 V~~aGAIppLV~LL~sgd~~~q~~Aa~AL~nLi 558 (2102)
T PLN03200 526 VESAGAVPALLWLLKNGGPKGQEIAAKTLTKLV 558 (2102)
T ss_pred HHHCCCHHHHHHHHhCCCHHHHHHHHHHHHHHH
Confidence 1223334466777777778888887776654
No 7
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=93.99 E-value=0.08 Score=30.54 Aligned_cols=31 Identities=35% Similarity=0.326 Sum_probs=26.6
Q ss_pred CcchHhhHhhcCCCCchhhhhHHHHHHHhhcC
Q psy4701 33 LEVVRPLLSLLHPEATALETFEGLMALCNLAA 64 (124)
Q Consensus 33 ~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs 64 (124)
..+|+||+.||+ +.+.--+.+|+-||.||++
T Consensus 11 ~g~i~~Lv~ll~-~~~~~v~~~a~~al~nl~~ 41 (41)
T PF00514_consen 11 AGGIPPLVQLLK-SPDPEVQEEAAWALGNLAA 41 (41)
T ss_dssp TTHHHHHHHHTT-SSSHHHHHHHHHHHHHHHT
T ss_pred cccHHHHHHHHc-CCCHHHHHHHHHHHHHHhC
Confidence 358999999998 6677778899999999985
No 8
>KOG0166|consensus
Probab=93.91 E-value=0.12 Score=46.01 Aligned_cols=97 Identities=19% Similarity=0.159 Sum_probs=60.9
Q ss_pred HHHHHHHHHHHhHh-hCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhh
Q psy4701 7 KRQAAQGLARLGIT-LNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSS 85 (124)
Q Consensus 7 ~~~A~qaLAkilIs-~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnl 85 (124)
+..|.+++=|++-+ .||-+-..=. ..+|+=|+..|+.+.+.--+|||.-||||+||.+.+.-+.++..+..+-
T Consensus 83 q~~a~~~~rkllS~~~~ppi~~vi~--~G~v~~lV~~l~~~~~~~lq~eAAWaLTnIAsgtse~T~~vv~agavp~---- 156 (514)
T KOG0166|consen 83 QLTATQAFRKLLSKERNPPIDEVIQ--SGVVPRLVEFLSRDDNPTLQFEAAWALTNIASGTSEQTKVVVDAGAVPI---- 156 (514)
T ss_pred HHHHHHHHHHHHccCCCCCHHHHHH--cCcHHHHHHHHccCCChhHHHHHHHHHHHHhcCchhhccccccCCchHH----
Confidence 34455555555432 2343322211 1478889999976776677999999999999998888777777555444
Q ss_pred hhccchHHHHHHHHhCCcchHHHHHHHHHHHHHh
Q psy4701 86 IMAKEGILEMVLVMANTEDILQQKVACECIIAVA 119 (124)
Q Consensus 86 Lls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~ 119 (124)
-+.|.+|.+...+..|+=+|=+.+
T Consensus 157 ----------fi~Ll~s~~~~v~eQavWALgNIa 180 (514)
T KOG0166|consen 157 ----------FIQLLSSPSADVREQAVWALGNIA 180 (514)
T ss_pred ----------HHHHhcCCcHHHHHHHHHHHhccc
Confidence 445555555555555554444444
No 9
>KOG4224|consensus
Probab=93.71 E-value=0.055 Score=47.29 Aligned_cols=89 Identities=20% Similarity=0.262 Sum_probs=70.4
Q ss_pred chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhc--------------------------
Q psy4701 35 VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMA-------------------------- 88 (124)
Q Consensus 35 ~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls-------------------------- 88 (124)
..|||+.||.-..+.--+.-|-=-|-|||.-++.-++.|...+.++++-+++++
T Consensus 334 fl~pLVrlL~~~dnEeiqchAvstLrnLAasse~n~~~i~esgAi~kl~eL~lD~pvsvqseisac~a~Lal~d~~k~~l 413 (550)
T KOG4224|consen 334 FLRPLVRLLRAGDNEEIQCHAVSTLRNLAASSEHNVSVIRESGAIPKLIELLLDGPVSVQSEISACIAQLALNDNDKEAL 413 (550)
T ss_pred chhHHHHHHhcCCchhhhhhHHHHHHHHhhhhhhhhHHHhhcCchHHHHHHHhcCChhHHHHHHHHHHHHHhccccHHHH
Confidence 589999999655565556677788999999888777777765777777776665
Q ss_pred -cchHHHHHHHHhCCcchHHHHHHHHHHHHHhhhcc
Q psy4701 89 -KEGILEMVLVMANTEDILQQKVACECIIAVASKKD 123 (124)
Q Consensus 89 -~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~sk~~ 123 (124)
+-+++...+.|..|+.+-.|.-|++++++-+|+.+
T Consensus 414 ld~gi~~iLIp~t~s~s~Ev~gNaAaAL~Nlss~v~ 449 (550)
T KOG4224|consen 414 LDSGIIPILIPWTGSESEEVRGNAAAALINLSSDVE 449 (550)
T ss_pred hhcCCcceeecccCccchhhcccHHHHHHhhhhhhH
Confidence 45566666889999999999999999999888754
No 10
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=91.94 E-value=1.2 Score=38.86 Aligned_cols=93 Identities=19% Similarity=0.293 Sum_probs=60.3
Q ss_pred HHHHHHHHHHhHhhCCCc--ccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhh
Q psy4701 8 RQAAQGLARLGITLNPEV--AFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSS 85 (124)
Q Consensus 8 ~~A~qaLAkilIs~nP~l--~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnl 85 (124)
..|+.+|.++.= ..+.+ +|++. .++-|..++.. ++..-+|..+-.++++++.+++....++..+
T Consensus 137 ~~A~~~L~~l~~-~~~~~~~l~~~~----~~~~L~~l~~~-~~~~vR~Rv~el~v~i~~~S~~~~~~~~~sg-------- 202 (503)
T PF10508_consen 137 KAAIKALKKLAS-HPEGLEQLFDSN----LLSKLKSLMSQ-SSDIVRCRVYELLVEIASHSPEAAEAVVNSG-------- 202 (503)
T ss_pred HHHHHHHHHHhC-CchhHHHHhCcc----hHHHHHHHHhc-cCHHHHHHHHHHHHHHHhcCHHHHHHHHhcc--------
Confidence 456777777762 33333 66642 36667777753 3444456666678899999998888777633
Q ss_pred hhccchHHHHHHHHhCCcchHHHHHHHHHHHHHhh
Q psy4701 86 IMAKEGILEMVLVMANTEDILQQKVACECIIAVAS 120 (124)
Q Consensus 86 Lls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~s 120 (124)
++..++.-..++|.+.|.-++|.+..-+.
T Consensus 203 ------ll~~ll~eL~~dDiLvqlnalell~~La~ 231 (503)
T PF10508_consen 203 ------LLDLLLKELDSDDILVQLNALELLSELAE 231 (503)
T ss_pred ------HHHHHHHHhcCccHHHHHHHHHHHHHHHc
Confidence 55555665666777777777777666554
No 11
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=90.86 E-value=0.36 Score=26.74 Aligned_cols=30 Identities=27% Similarity=0.250 Sum_probs=24.3
Q ss_pred cchHhhHhhcCCCCchhhhhHHHHHHHhhcC
Q psy4701 34 EVVRPLLSLLHPEATALETFEGLMALCNLAA 64 (124)
Q Consensus 34 ~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs 64 (124)
..|+||+.||+ +.+.--+.+++.||.||+.
T Consensus 12 g~i~~L~~ll~-~~~~~i~~~a~~aL~nl~~ 41 (41)
T smart00185 12 GGLPALVELLK-SEDEEVVKEAAWALSNLSS 41 (41)
T ss_pred CCHHHHHHHHc-CCCHHHHHHHHHHHHHHcC
Confidence 37999999998 4455567889999999974
No 12
>KOG0166|consensus
Probab=89.69 E-value=0.84 Score=40.76 Aligned_cols=75 Identities=15% Similarity=0.174 Sum_probs=58.8
Q ss_pred hhhhhH------HHHHHHhhcCCChHHHHHHHHccchhhhhhhhhc-----------------------------cchHH
Q psy4701 49 ALETFE------GLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMA-----------------------------KEGIL 93 (124)
Q Consensus 49 ~l~~fE------aLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls-----------------------------~n~~L 93 (124)
.|.+++ ||.|+-|++.-+|+..+.++..++.+.+-+++.. +.+++
T Consensus 287 lL~~~~~~v~~PaLRaiGNIvtG~d~QTq~vi~~~~L~~l~~ll~~s~~~~ikkEAcW~iSNItAG~~~qiqaVida~l~ 366 (514)
T KOG0166|consen 287 LLGHSSPKVVTPALRAIGNIVTGSDEQTQVVINSGALPVLSNLLSSSPKESIKKEACWTISNITAGNQEQIQAVIDANLI 366 (514)
T ss_pred HHcCCCcccccHHHhhccceeeccHHHHHHHHhcChHHHHHHHhccCcchhHHHHHHHHHHHhhcCCHHHHHHHHHcccH
Confidence 455555 8889999888888888888877778887777773 23456
Q ss_pred HHHHHHhCCcchHHHHHHHHHHHHHhhhcc
Q psy4701 94 EMVLVMANTEDILQQKVACECIIAVASKKD 123 (124)
Q Consensus 94 qral~M~~S~d~~~Q~~a~e~ii~a~sk~~ 123 (124)
+..+.+..+.|...|+-|+=+|-|+++..+
T Consensus 367 p~Li~~l~~~ef~~rKEAawaIsN~ts~g~ 396 (514)
T KOG0166|consen 367 PVLINLLQTAEFDIRKEAAWAISNLTSSGT 396 (514)
T ss_pred HHHHHHHhccchHHHHHHHHHHHhhcccCC
Confidence 666888889999999999999999987643
No 13
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=88.57 E-value=0.15 Score=44.27 Aligned_cols=83 Identities=19% Similarity=0.199 Sum_probs=52.5
Q ss_pred chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhccc-hHHHHH-HHHhC-------Ccch
Q psy4701 35 VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMAKE-GILEMV-LVMAN-------TEDI 105 (124)
Q Consensus 35 ~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~n-~~Lqra-l~M~~-------S~d~ 105 (124)
+||-+++++......+.+|||.-||||+||-....-+..|...+.+-+=.++.+.+ .+.+.+ -+|-| .-|.
T Consensus 115 vVpRfvefm~~~q~~mlqfEAaWalTNiaSGtt~QTkvVvd~~AVPlfiqlL~s~~~~V~eQavWALGNiAGDS~~~RD~ 194 (526)
T COG5064 115 VVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGAVPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDY 194 (526)
T ss_pred ccHHHHHHHHhcchhHHHHHHHHHHhhhccCcccceEEEEeCCchHHHHHHHcCchHHHHHHHHHHhccccCCchhHHHH
Confidence 78889999965667889999999999999976544444444455555555555533 333333 22221 2356
Q ss_pred HHHHHHHHHHHH
Q psy4701 106 LQQKVACECIIA 117 (124)
Q Consensus 106 ~~Q~~a~e~ii~ 117 (124)
+.|+-+.|-+++
T Consensus 195 vL~~galeplL~ 206 (526)
T COG5064 195 VLQCGALEPLLG 206 (526)
T ss_pred HHhcCchHHHHH
Confidence 667666665543
No 14
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=86.57 E-value=2.9 Score=26.57 Aligned_cols=27 Identities=11% Similarity=0.283 Sum_probs=15.1
Q ss_pred cchHHHHHHHHhCCc-chHHHHHHHHHH
Q psy4701 89 KEGILEMVLVMANTE-DILQQKVACECI 115 (124)
Q Consensus 89 ~n~~Lqral~M~~S~-d~~~Q~~a~e~i 115 (124)
++..++....++.++ +...+..|+++|
T Consensus 60 ~~~~~~~L~~~l~~~~~~~vr~~a~~aL 87 (88)
T PF13646_consen 60 DPEAIPALIKLLQDDDDEVVREAAAEAL 87 (88)
T ss_dssp HHHTHHHHHHHHTC-SSHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHcCCCcHHHHHHHHhhc
Confidence 445566666666554 444466666654
No 15
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=84.62 E-value=3.1 Score=33.57 Aligned_cols=89 Identities=19% Similarity=0.130 Sum_probs=59.7
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhh
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGS 84 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEn 84 (124)
..+..+.++|..+.++-+=..++ .+.|+.|+.||.. .+.--++.+|-.|.|||+-++- .+.++..++++.+-.
T Consensus 110 ~~Q~agLrlL~nLtv~~~~~~~l-----~~~i~~ll~LL~~-G~~~~k~~vLk~L~nLS~np~~-~~~Ll~~q~~~~~~~ 182 (254)
T PF04826_consen 110 EVQLAGLRLLTNLTVTNDYHHML-----ANYIPDLLSLLSS-GSEKTKVQVLKVLVNLSENPDM-TRELLSAQVLSSFLS 182 (254)
T ss_pred HHHHHHHHHHHccCCCcchhhhH-----HhhHHHHHHHHHc-CChHHHHHHHHHHHHhccCHHH-HHHHHhccchhHHHH
Confidence 34556677777775443333333 2368889999973 4556679999999999987555 455565588888775
Q ss_pred hhhc--cchHHHHHHHHh
Q psy4701 85 SIMA--KEGILEMVLVMA 100 (124)
Q Consensus 85 lLls--~n~~Lqral~M~ 100 (124)
+.=. +.+++.+++.+.
T Consensus 183 Lf~~~~~~~~l~~~l~~~ 200 (254)
T PF04826_consen 183 LFNSSESKENLLRVLTFF 200 (254)
T ss_pred HHccCCccHHHHHHHHHH
Confidence 4443 356788887654
No 16
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=83.76 E-value=10 Score=30.64 Aligned_cols=59 Identities=24% Similarity=0.205 Sum_probs=45.4
Q ss_pred chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhccchHHH
Q psy4701 35 VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMAKEGILE 94 (124)
Q Consensus 35 ~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~n~~Lq 94 (124)
.+++|+.+|....+..-+-.++.||.|.|..+ .-+.-|-.-+|.+-|++++=+.+.-++
T Consensus 13 ~l~~Ll~lL~~t~dp~i~e~al~al~n~aaf~-~nq~~Ir~~Ggi~lI~~lL~~p~~~vr 71 (254)
T PF04826_consen 13 ELQKLLCLLESTEDPFIQEKALIALGNSAAFP-FNQDIIRDLGGISLIGSLLNDPNPSVR 71 (254)
T ss_pred HHHHHHHHHhcCCChHHHHHHHHHHHhhccCh-hHHHHHHHcCCHHHHHHHcCCCChHHH
Confidence 57889999986777888889999999998875 334444444899999999888666555
No 17
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=82.75 E-value=4.9 Score=25.51 Aligned_cols=26 Identities=12% Similarity=0.239 Sum_probs=16.8
Q ss_pred hHHHHHHHHhCCcchHHHHHHHHHHH
Q psy4701 91 GILEMVLVMANTEDILQQKVACECII 116 (124)
Q Consensus 91 ~~Lqral~M~~S~d~~~Q~~a~e~ii 116 (124)
..+...+.+++++|...+..|+.++-
T Consensus 31 ~~~~~L~~~l~d~~~~vr~~a~~aL~ 56 (88)
T PF13646_consen 31 EAIPALIELLKDEDPMVRRAAARALG 56 (88)
T ss_dssp HHHHHHHHHHTSSSHHHHHHHHHHHH
T ss_pred hHHHHHHHHHcCCCHHHHHHHHHHHH
Confidence 45666666667777777776666654
No 18
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=82.27 E-value=3.5 Score=23.42 Aligned_cols=31 Identities=16% Similarity=0.108 Sum_probs=26.6
Q ss_pred cchHHHHHHHHhCCcchHHHHHHHHHHHHHh
Q psy4701 89 KEGILEMVLVMANTEDILQQKVACECIIAVA 119 (124)
Q Consensus 89 ~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~ 119 (124)
+.+.++..+.|..++|...|..|+.+|-+.+
T Consensus 10 ~~g~i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 10 EAGGIPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp HTTHHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred HcccHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 4567888899999999999999999987765
No 19
>KOG2171|consensus
Probab=81.42 E-value=9.9 Score=36.93 Aligned_cols=84 Identities=21% Similarity=0.157 Sum_probs=59.4
Q ss_pred hHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhh
Q psy4701 6 GKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSS 85 (124)
Q Consensus 6 ~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnl 85 (124)
....|.|+|.|+....+|+.+||. ..+-+..+|. +.+--+++=|||||.=++--..+.-..++. +-.+.+=|+
T Consensus 325 ~~~~A~~~lDrlA~~L~g~~v~p~-----~~~~l~~~l~-S~~w~~R~AaL~Als~i~EGc~~~m~~~l~-~Il~~Vl~~ 397 (1075)
T KOG2171|consen 325 PYRAAEQALDRLALHLGGKQVLPP-----LFEALEAMLQ-STEWKERHAALLALSVIAEGCSDVMIGNLP-KILPIVLNG 397 (1075)
T ss_pred cHHHHHHHHHHHHhcCChhhehHH-----HHHHHHHHhc-CCCHHHHHHHHHHHHHHHcccHHHHHHHHH-HHHHHHHhh
Confidence 467899999999999999999984 3444555665 778899999999998777554443333333 444555555
Q ss_pred hhccchHHHHH
Q psy4701 86 IMAKEGILEMV 96 (124)
Q Consensus 86 Lls~n~~Lqra 96 (124)
+=+.|.++|-+
T Consensus 398 l~DphprVr~A 408 (1075)
T KOG2171|consen 398 LNDPHPRVRYA 408 (1075)
T ss_pred cCCCCHHHHHH
Confidence 55666666665
No 20
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=80.93 E-value=7 Score=36.21 Aligned_cols=56 Identities=21% Similarity=0.165 Sum_probs=43.7
Q ss_pred CCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhcc
Q psy4701 32 SLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMAK 89 (124)
Q Consensus 32 ~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~ 89 (124)
..++|++|..+++.+...+ .=.+|..|-||+. +++.|.++++.+.++++-.++-++
T Consensus 329 ~~giV~kL~kLl~s~~~~l-~~~aLrlL~NLSf-d~~~R~~mV~~GlIPkLv~LL~d~ 384 (708)
T PF05804_consen 329 ESGIVEKLLKLLPSENEDL-VNVALRLLFNLSF-DPELRSQMVSLGLIPKLVELLKDP 384 (708)
T ss_pred HcCCHHHHHHHhcCCCHHH-HHHHHHHHHHhCc-CHHHHHHHHHCCCcHHHHHHhCCC
Confidence 3459999999997544434 4459999999964 588899999988888888877654
No 21
>PRK09687 putative lyase; Provisional
Probab=75.47 E-value=26 Score=28.41 Aligned_cols=30 Identities=17% Similarity=0.080 Sum_probs=20.4
Q ss_pred CcchHhhHhhcCCCCchhhhhHHHHHHHhhc
Q psy4701 33 LEVVRPLLSLLHPEATALETFEGLMALCNLA 63 (124)
Q Consensus 33 ~~~v~pL~~LL~~~~~~l~~fEaLlALTNLA 63 (124)
-++|++|..+|. |.+.-=..+|..||-++.
T Consensus 158 ~~ai~~L~~~L~-d~~~~VR~~A~~aLg~~~ 187 (280)
T PRK09687 158 EAAIPLLINLLK-DPNGDVRNWAAFALNSNK 187 (280)
T ss_pred HHHHHHHHHHhc-CCCHHHHHHHHHHHhcCC
Confidence 457888888886 333346677777777774
No 22
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=74.58 E-value=0.93 Score=27.21 Aligned_cols=54 Identities=30% Similarity=0.329 Sum_probs=35.0
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhh
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNL 62 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNL 62 (124)
.-|..|+.+|++ +....|...-+ ..-++++.|..+|.++++.. +-.|.-||.||
T Consensus 2 ~vR~~A~~aLg~-l~~~~~~~~~~--~~~~~~~~L~~~L~d~~~~V-R~~A~~aLg~l 55 (55)
T PF13513_consen 2 RVRRAAAWALGR-LAEGCPELLQP--YLPELLPALIPLLQDDDDSV-RAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHC-TTTTTHHHHHH--HHHHHHHHHHHHTTSSSHHH-HHHHHHHHHCH
T ss_pred HHHHHHHHHHhh-HhcccHHHHHH--HHHHHHHHHHHHHcCCCHHH-HHHHHHHHhcC
Confidence 457888999988 33333443222 23458888999997555533 77788888775
No 23
>KOG4224|consensus
Probab=72.85 E-value=23 Score=31.45 Aligned_cols=82 Identities=16% Similarity=0.175 Sum_probs=53.1
Q ss_pred chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhhc--------------------------
Q psy4701 35 VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIMA-------------------------- 88 (124)
Q Consensus 35 ~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls-------------------------- 88 (124)
.||-|+.|.. +.+.--+.-|-+||-||||- .+....|+..++.+..-.++=+
T Consensus 252 lv~~Lv~Lmd-~~s~kvkcqA~lALrnlasd-t~Yq~eiv~ag~lP~lv~Llqs~~~plilasVaCIrnisihplNe~lI 329 (550)
T KOG4224|consen 252 LVPALVDLMD-DGSDKVKCQAGLALRNLASD-TEYQREIVEAGSLPLLVELLQSPMGPLILASVACIRNISIHPLNEVLI 329 (550)
T ss_pred hHHHHHHHHh-CCChHHHHHHHHHHhhhccc-chhhhHHHhcCCchHHHHHHhCcchhHHHHHHHHHhhcccccCcccce
Confidence 4888999997 55666677889999999986 4455556665655444333322
Q ss_pred -cchHHHHHHHHh-CCcchHHHHHHHHHHHHH
Q psy4701 89 -KEGILEMVLVMA-NTEDILQQKVACECIIAV 118 (124)
Q Consensus 89 -~n~~Lqral~M~-~S~d~~~Q~~a~e~ii~a 118 (124)
+.+-++-.+-+. ..+++--|+.|+-.+-+-
T Consensus 330 ~dagfl~pLVrlL~~~dnEeiqchAvstLrnL 361 (550)
T KOG4224|consen 330 ADAGFLRPLVRLLRAGDNEEIQCHAVSTLRNL 361 (550)
T ss_pred ecccchhHHHHHHhcCCchhhhhhHHHHHHHH
Confidence 334444444444 455566899998877543
No 24
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=71.32 E-value=13 Score=32.72 Aligned_cols=93 Identities=18% Similarity=0.189 Sum_probs=63.0
Q ss_pred HHHHHHHHHHHhHhhCCC--cccCCCCCCcchHhhHhhcC-CCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhh
Q psy4701 7 KRQAAQGLARLGITLNPE--VAFPGERSLEVVRPLLSLLH-PEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVG 83 (124)
Q Consensus 7 ~~~A~qaLAkilIs~nP~--l~F~~~~~~~~v~pL~~LL~-~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IE 83 (124)
+..|+-||..|.--|.-. ++- -..+||-++.||. ++.+.++ .+.-||-|+|+-++..|..+...+...-+=
T Consensus 132 qfEAaWalTNiaSGtt~QTkvVv----d~~AVPlfiqlL~s~~~~V~e--QavWALGNiAGDS~~~RD~vL~~galeplL 205 (526)
T COG5064 132 QFEAAWALTNIASGTTQQTKVVV----DAGAVPLFIQLLSSTEDDVRE--QAVWALGNIAGDSEGCRDYVLQCGALEPLL 205 (526)
T ss_pred HHHHHHHHhhhccCcccceEEEE----eCCchHHHHHHHcCchHHHHH--HHHHHhccccCCchhHHHHHHhcCchHHHH
Confidence 345666666665432221 111 2458999999985 2333444 478999999999999999999878888787
Q ss_pred hhhhccc---hHHHHH----HHHhCCcch
Q psy4701 84 SSIMAKE---GILEMV----LVMANTEDI 105 (124)
Q Consensus 84 nlLls~n---~~Lqra----l~M~~S~d~ 105 (124)
+++.++. .|+|.+ -+||-++++
T Consensus 206 ~ll~ss~~~ismlRn~TWtLSNlcRGknP 234 (526)
T COG5064 206 GLLLSSAIHISMLRNATWTLSNLCRGKNP 234 (526)
T ss_pred HHHHhccchHHHHHHhHHHHHHhhCCCCC
Confidence 7777753 366665 677777654
No 25
>KOG4151|consensus
Probab=64.07 E-value=3.9 Score=38.14 Aligned_cols=48 Identities=21% Similarity=0.266 Sum_probs=42.0
Q ss_pred HccchhhhhhhhhccchHHHHHHHHhCCcchHHHHHHHHHHHHHhhhcc
Q psy4701 75 KEKGPLEVGSSIMAKEGILEMVLVMANTEDILQQKVACECIIAVASKKD 123 (124)
Q Consensus 75 ~~~G~s~IEnlLls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~sk~~ 123 (124)
...++-.+.....+..++++++++|..+.++.+|...+++++|. +|+.
T Consensus 254 ~~~~~v~~~~~~~~~e~l~~~~l~~~~~~~e~~~~~~~~~~v~~-~~i~ 301 (748)
T KOG4151|consen 254 AQEDPVSDTVKVMDKEQLLEEILAVGGSQDELAQVWRTLKLVYG-DDIR 301 (748)
T ss_pred hcCCcccchhhhhhHHHHHHHhhcccccchHHhhhhhhhhhhhc-cHHH
Confidence 34677777888888899999999999999999999999999999 6754
No 26
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=62.65 E-value=54 Score=28.65 Aligned_cols=112 Identities=17% Similarity=0.142 Sum_probs=70.4
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhh
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGS 84 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEn 84 (124)
..|......+.+| -+.+|.. |..-.....+..+...|. +.+.|-+--++--|+.||. .+.-.+.+.+.+.++++-+
T Consensus 175 ~vR~Rv~el~v~i-~~~S~~~-~~~~~~sgll~~ll~eL~-~dDiLvqlnalell~~La~-~~~g~~yL~~~gi~~~L~~ 250 (503)
T PF10508_consen 175 IVRCRVYELLVEI-ASHSPEA-AEAVVNSGLLDLLLKELD-SDDILVQLNALELLSELAE-TPHGLQYLEQQGIFDKLSN 250 (503)
T ss_pred HHHHHHHHHHHHH-HhcCHHH-HHHHHhccHHHHHHHHhc-CccHHHHHHHHHHHHHHHc-ChhHHHHHHhCCHHHHHHH
Confidence 3455555555555 2344443 221112236777777787 4777888889999999999 4555556655444444443
Q ss_pred hhhc---------------------------------cchHHHHHHHHhCCcchHHHHHHHHHHHHHhh
Q psy4701 85 SIMA---------------------------------KEGILEMVLVMANTEDILQQKVACECIIAVAS 120 (124)
Q Consensus 85 lLls---------------------------------~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~s 120 (124)
.+.+ -...+.+...|..|+|...+.+|.|++-.-+|
T Consensus 251 ~l~~~~~dp~~~~~~l~g~~~f~g~la~~~~~~v~~~~p~~~~~l~~~~~s~d~~~~~~A~dtlg~igs 319 (503)
T PF10508_consen 251 LLQDSEEDPRLSSLLLPGRMKFFGNLARVSPQEVLELYPAFLERLFSMLESQDPTIREVAFDTLGQIGS 319 (503)
T ss_pred HHhccccCCcccchhhhhHHHHHHHHHhcChHHHHHHHHHHHHHHHHHhCCCChhHHHHHHHHHHHHhC
Confidence 3322 13355666888999999999999998865554
No 27
>PRK09687 putative lyase; Provisional
Probab=59.70 E-value=71 Score=25.84 Aligned_cols=50 Identities=18% Similarity=0.086 Sum_probs=22.7
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhh
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNL 62 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNL 62 (124)
.-|+.|+.+|..|.-+-.+ .-++++.|..++..|.+..=..+|.-||-++
T Consensus 69 ~vR~~A~~aLg~lg~~~~~--------~~~a~~~L~~l~~~D~d~~VR~~A~~aLG~~ 118 (280)
T PRK09687 69 IERDIGADILSQLGMAKRC--------QDNVFNILNNLALEDKSACVRASAINATGHR 118 (280)
T ss_pred HHHHHHHHHHHhcCCCccc--------hHHHHHHHHHHHhcCCCHHHHHHHHHHHhcc
Confidence 3456666666665432111 1234555555532244444444444444444
No 28
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=59.02 E-value=22 Score=26.30 Aligned_cols=54 Identities=13% Similarity=0.043 Sum_probs=38.2
Q ss_pred CCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhh-hccchH---HHHHHHHh
Q psy4701 46 EATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSI-MAKEGI---LEMVLVMA 100 (124)
Q Consensus 46 ~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlL-ls~n~~---Lqral~M~ 100 (124)
.........+++=|=|.|.++++.|..|.+ +|.+-++... .+++.. ++-++.+|
T Consensus 98 ~~~~~~~~~~~lell~aAc~d~~~r~~I~~-~~~~~L~~~~~~~~~~~~ir~~A~v~L~ 155 (157)
T PF11701_consen 98 KSKDRKVQKAALELLSAACIDKSCRTFISK-NYVSWLKELYKNSKDDSEIRVLAAVGLC 155 (157)
T ss_dssp CTS-HHHHHHHHHHHHHHTTSHHHHHCCHH-HCHHHHHHHTTTCC-HH-CHHHHHHHHH
T ss_pred ccCCHHHHHHHHHHHHHHHccHHHHHHHHH-HHHHHHHHHHccccchHHHHHHHHHHHh
Confidence 344455566677777999999999999999 9999999988 455443 55555554
No 29
>PF05004 IFRD: Interferon-related developmental regulator (IFRD); InterPro: IPR007701 Interferon-related developmental regulator (IFRD1) is the human homologue of the Rattus norvegicus early response protein PC4 and its murine homologue TIS7 []. The exact function of IFRD1 is unknown but it has been shown that PC4 is necessary for muscle differentiation and that it might have a role in signal transduction. This entry also contains IFRD2 and its murine equivalent SKMc15, which are highly expressed soon after gastrulation and in the hepatic primordium, suggesting an involvement in early hematopoiesis [].
Probab=58.84 E-value=49 Score=27.25 Aligned_cols=28 Identities=18% Similarity=0.205 Sum_probs=23.8
Q ss_pred hHHHHHHHHhCCcchHHHHHHHHHHHHH
Q psy4701 91 GILEMVLVMANTEDILQQKVACECIIAV 118 (124)
Q Consensus 91 ~~Lqral~M~~S~d~~~Q~~a~e~ii~a 118 (124)
..+.+...+..|+|...|.+|.|+|.-.
T Consensus 228 ~~~~~l~~lL~s~d~~VRiAAGEaiAll 255 (309)
T PF05004_consen 228 EALPALSELLDSDDVDVRIAAGEAIALL 255 (309)
T ss_pred HHHHHHHHHhcCCCHHHHHHHHHHHHHH
Confidence 3567778999999999999999998644
No 30
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=56.01 E-value=59 Score=30.50 Aligned_cols=95 Identities=16% Similarity=0.046 Sum_probs=57.5
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCC-------------ChHHH
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAI-------------GEKQR 70 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~-------------~~~~r 70 (124)
+..|..|+++|+++.- | ..++.++...|. |.+..=..+|+-||-.+.+- +.++|
T Consensus 789 ~~VR~aA~~aLg~~g~---~---------~~~~~~l~~aL~-d~d~~VR~~Aa~aL~~l~~~~a~~~L~~~L~D~~~~VR 855 (897)
T PRK13800 789 PLVRAAALAALAELGC---P---------PDDVAAATAALR-ASAWQVRQGAARALAGAAADVAVPALVEALTDPHLDVR 855 (897)
T ss_pred HHHHHHHHHHHHhcCC---c---------chhHHHHHHHhc-CCChHHHHHHHHHHHhccccchHHHHHHHhcCCCHHHH
Confidence 3567788888888721 1 223466777776 45566667777777666532 12555
Q ss_pred HHHHHccchhhhhhhhhccchHHHHHHHHhCCcchHHHHHHHHHHHH
Q psy4701 71 QRILKEKGPLEVGSSIMAKEGILEMVLVMANTEDILQQKVACECIIA 117 (124)
Q Consensus 71 ~~Iv~~~G~s~IEnlLls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~ 117 (124)
+.-+.. ++.+ -.+...........+.+|...+..|.++|-+
T Consensus 856 ~~A~~a-----L~~~-~~~~~a~~~L~~al~D~d~~Vr~~A~~aL~~ 896 (897)
T PRK13800 856 KAAVLA-----LTRW-PGDPAARDALTTALTDSDADVRAYARRALAH 896 (897)
T ss_pred HHHHHH-----Hhcc-CCCHHHHHHHHHHHhCCCHHHHHHHHHHHhh
Confidence 555542 2221 1133455555677788899999999988755
No 31
>PF14500 MMS19_N: Dos2-interacting transcription regulator of RNA-Pol-II
Probab=53.77 E-value=26 Score=28.28 Aligned_cols=71 Identities=18% Similarity=0.238 Sum_probs=52.1
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHH
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILK 75 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~ 75 (124)
+..|..|.+.|+.++-...|.. +..+.+.-.+.++..=|.+.........||.||.+...++.+.-.+|++
T Consensus 13 ~~~R~ka~~~Ls~vL~~lp~~~-L~~~ev~~L~~F~~~rl~D~~~~~~~l~gl~~L~~~~~~~~~~~~~i~~ 83 (262)
T PF14500_consen 13 PIIRAKALELLSEVLERLPPDF-LSRQEVQVLLDFFCSRLDDHACVQPALKGLLALVKMKNFSPESAVKILR 83 (262)
T ss_pred HHHHHHHHHHHHHHHHhCCHhh-ccHHHHHHHHHHHHHHhccHhhHHHHHHHHHHHHhCcCCChhhHHHHHH
Confidence 3567789999999999999886 4544433345566666655556777799999999999888766655554
No 32
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=52.28 E-value=94 Score=29.20 Aligned_cols=45 Identities=18% Similarity=0.068 Sum_probs=26.7
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhh
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNL 62 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNL 62 (124)
+.-|+.|+++|+++. ..+++++|..+|. |.+.-=...|+-||..+
T Consensus 635 ~~VR~~Av~~L~~~~-------------~~~~~~~L~~aL~-D~d~~VR~~Aa~aL~~l 679 (897)
T PRK13800 635 PGVRRTAVAVLTETT-------------PPGFGPALVAALG-DGAAAVRRAAAEGLREL 679 (897)
T ss_pred HHHHHHHHHHHhhhc-------------chhHHHHHHHHHc-CCCHHHHHHHHHHHHHH
Confidence 346788888888864 1336788888885 33322333444444433
No 33
>COG1017 Hmp Hemoglobin-like flavoprotein [Energy production and conversion]
Probab=52.01 E-value=31 Score=26.27 Aligned_cols=46 Identities=26% Similarity=0.276 Sum_probs=39.1
Q ss_pred ccch---hhhhhhhhccchHHHHHHHHhCCcchHHHHHHHHHHHHHhhh
Q psy4701 76 EKGP---LEVGSSIMAKEGILEMVLVMANTEDILQQKVACECIIAVASK 121 (124)
Q Consensus 76 ~~G~---s~IEnlLls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~sk 121 (124)
++|. ..+-..||++|.-+.-+.+|++-+.-.+|.+-+.+|++.|.-
T Consensus 19 ~~G~~iT~~FY~~MF~~hPEl~niFN~~nQ~~G~Q~~aLA~ai~ayA~n 67 (150)
T COG1017 19 EHGETITAHFYKRMFAHHPELKNIFNMANQKNGDQPKALANAILAYAKN 67 (150)
T ss_pred hcchHHHHHHHHHHHhhCHHHHHHHhHhhhcccccHHHHHHHHHHHHHh
Confidence 4555 455668999999999999999999999999999999988753
No 34
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=43.05 E-value=71 Score=29.78 Aligned_cols=88 Identities=15% Similarity=0.125 Sum_probs=52.1
Q ss_pred hhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHH----HHHHhhcCCChHHHHHHHHccchhhhhhhhhccchHH--
Q psy4701 20 TLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGL----MALCNLAAIGEKQRQRILKEKGPLEVGSSIMAKEGIL-- 93 (124)
Q Consensus 20 s~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaL----lALTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~n~~L-- 93 (124)
+.||..-..= .-.+.|++|+++|..+ ++|.+ -.|.+|+-.. +-+..|...+.+..+--++=++|..+
T Consensus 277 Aed~~ve~kM-~~~~iV~~Lv~~Ldr~-----n~ellil~v~fLkkLSi~~-ENK~~m~~~giV~kL~kLl~s~~~~l~~ 349 (708)
T PF05804_consen 277 AEDPRVELKM-VNKGIVSLLVKCLDRE-----NEELLILAVTFLKKLSIFK-ENKDEMAESGIVEKLLKLLPSENEDLVN 349 (708)
T ss_pred hcChHHHHHH-HhcCCHHHHHHHHcCC-----CHHHHHHHHHHHHHHcCCH-HHHHHHHHcCCHHHHHHHhcCCCHHHHH
Confidence 4455533222 2356899999999744 34444 4667776664 45778877666677666666555422
Q ss_pred ---HHHHHHhCCcchHHHHHHHHH
Q psy4701 94 ---EMVLVMANTEDILQQKVACEC 114 (124)
Q Consensus 94 ---qral~M~~S~d~~~Q~~a~e~ 114 (124)
+...+|+..++...|.+.+++
T Consensus 350 ~aLrlL~NLSfd~~~R~~mV~~Gl 373 (708)
T PF05804_consen 350 VALRLLFNLSFDPELRSQMVSLGL 373 (708)
T ss_pred HHHHHHHHhCcCHHHHHHHHHCCC
Confidence 222566666666666655544
No 35
>PF10165 Ric8: Guanine nucleotide exchange factor synembryn; InterPro: IPR019318 Ric8 is involved in the EGL-30 neurotransmitter signalling pathway []. It is a guanine nucleotide exchange factor [] that regulates neurotransmitter secretion.
Probab=43.04 E-value=71 Score=27.65 Aligned_cols=92 Identities=20% Similarity=0.283 Sum_probs=63.3
Q ss_pred cccCCCCCCcchHhhHhhcCCCCchhhhhH----------HHHHHHhhcCCChHHHHHHHHccchhhhh--hhhhccchH
Q psy4701 25 VAFPGERSLEVVRPLLSLLHPEATALETFE----------GLMALCNLAAIGEKQRQRILKEKGPLEVG--SSIMAKEGI 92 (124)
Q Consensus 25 l~F~~~~~~~~v~pL~~LL~~~~~~l~~fE----------aLlALTNLAs~~~~~r~~Iv~~~G~s~IE--nlLls~n~~ 92 (124)
-+|+......+|.-|+.+|+ ..+.+|+ -|+.||+++..++.+|+.+-..--|+.-+ .-+=..+.+
T Consensus 233 ~~~~~~~~~~~v~~Ll~~Ld---~~l~~~~~~~l~~~l~PlL~lL~~~~~~~~~~Rk~lr~~lLP~~~Dr~~~~e~~~tL 309 (446)
T PF10165_consen 233 SLFPEGDNMDVVERLLDFLD---KRLDKYEALKLDELLTPLLTLLTRLARAAREVRKYLRARLLPPDKDRKKPPEKGDTL 309 (446)
T ss_pred cccCCCCChHHHHHHHHHHH---HHHHhcCcccchhhHhhHHHHHHHHHHhcHHHHHHHHHHhCCChhhcccCCCCCcch
Confidence 34555566667777888875 3444443 26789999999988888876644443322 222234567
Q ss_pred HHHHHHHhCCcchHHHHHHHHHHHHHh
Q psy4701 93 LEMVLVMANTEDILQQKVACECIIAVA 119 (124)
Q Consensus 93 Lqral~M~~S~d~~~Q~~a~e~ii~a~ 119 (124)
--|.+.+..+-+.....+++|++-.-+
T Consensus 310 ~~rLlrLmt~~~~~~k~~vaellf~Lc 336 (446)
T PF10165_consen 310 RSRLLRLMTSPDPQLKDAVAELLFVLC 336 (446)
T ss_pred HHHHHHHhCCCCchHHHHHHHHHHHHH
Confidence 788888888888888899999987665
No 36
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=39.29 E-value=85 Score=25.34 Aligned_cols=78 Identities=19% Similarity=0.069 Sum_probs=53.3
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhh---hhHHHHHHHhhcCCChHHHHHHHHccchhh
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALE---TFEGLMALCNLAAIGEKQRQRILKEKGPLE 81 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~---~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~ 81 (124)
.....|++-|++++..-++.-.... -+.+++++..|....+.-+ ++-++-+|-||... ++.|..+++.+|.+.
T Consensus 120 ~i~~~a~~iLt~Ll~~~~~~~~~~~---~~~l~~ll~~L~~~l~~~~~~~~~~av~~L~~LL~~-~~~R~~f~~~~~v~~ 195 (312)
T PF03224_consen 120 FIQLKAAFILTSLLSQGPKRSEKLV---KEALPKLLQWLSSQLSSSDSELQYIAVQCLQNLLRS-KEYRQVFWKSNGVSP 195 (312)
T ss_dssp HHHHHHHHHHHHHHTSTTT--HHHH---HHHHHHHHHHHH-TT-HHHH---HHHHHHHHHHHTS-HHHHHHHHTHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCccccchH---HHHHHHHHHHHHHhhcCCCcchHHHHHHHHHHHhCc-chhHHHHHhcCcHHH
Confidence 4467789999998877666542211 3578888888753332222 28889999999865 889999999888888
Q ss_pred hhhhh
Q psy4701 82 VGSSI 86 (124)
Q Consensus 82 IEnlL 86 (124)
+-+.+
T Consensus 196 l~~iL 200 (312)
T PF03224_consen 196 LFDIL 200 (312)
T ss_dssp HHHHH
T ss_pred HHHHH
Confidence 87755
No 37
>COG1560 HtrB Lauroyl/myristoyl acyltransferase [Cell envelope biogenesis, outer membrane]
Probab=37.81 E-value=29 Score=28.92 Aligned_cols=33 Identities=27% Similarity=0.409 Sum_probs=23.8
Q ss_pred HHHHHHhhc----CCChHHHHHHHHccchhhhhhhhhc
Q psy4701 55 GLMALCNLA----AIGEKQRQRILKEKGPLEVGSSIMA 88 (124)
Q Consensus 55 aLlALTNLA----s~~~~~r~~Iv~~~G~s~IEnlLls 88 (124)
.=.|++||+ ..+++.|++|++ +.|...+-.+++
T Consensus 55 ~~~a~~NL~~~FPe~se~ere~i~~-~~~~~~~r~~~E 91 (308)
T COG1560 55 RKIARRNLALCFPEKSEAEREKIVK-ESFASMGRALLE 91 (308)
T ss_pred HHHHHHHHHHhCCCCCHHHHHHHHH-HHHHHHHHHHHH
Confidence 446899999 778899999998 555554444443
No 38
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=36.16 E-value=71 Score=16.95 Aligned_cols=28 Identities=29% Similarity=0.464 Sum_probs=21.7
Q ss_pred HHHHHHHHhCCcchHHHHHHHHHHHHHh
Q psy4701 92 ILEMVLVMANTEDILQQKVACECIIAVA 119 (124)
Q Consensus 92 ~Lqral~M~~S~d~~~Q~~a~e~ii~a~ 119 (124)
++...++++++++..-...|+++|-.-+
T Consensus 1 llp~l~~~l~D~~~~VR~~a~~~l~~i~ 28 (31)
T PF02985_consen 1 LLPILLQLLNDPSPEVRQAAAECLGAIA 28 (31)
T ss_dssp HHHHHHHHHT-SSHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHH
Confidence 3567788889999999999998887655
No 39
>PTZ00429 beta-adaptin; Provisional
Probab=35.77 E-value=3.1e+02 Score=25.76 Aligned_cols=57 Identities=18% Similarity=0.300 Sum_probs=39.9
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCC
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIG 66 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~ 66 (124)
+--|..|+-+++|+ ...+|+++ .. .+.+.-|.+||. |.+..=..-|+.+|.-+...+
T Consensus 154 pYVRKtAalai~Kl-y~~~pelv-~~---~~~~~~L~~LL~-D~dp~Vv~nAl~aL~eI~~~~ 210 (746)
T PTZ00429 154 PYVRKTAAMGLGKL-FHDDMQLF-YQ---QDFKKDLVELLN-DNNPVVASNAAAIVCEVNDYG 210 (746)
T ss_pred HHHHHHHHHHHHHH-HhhCcccc-cc---cchHHHHHHHhc-CCCccHHHHHHHHHHHHHHhC
Confidence 34688899999997 56888864 32 224555778886 666666777888888776543
No 40
>PF07539 DRIM: Down-regulated in metastasis; InterPro: IPR011430 These eukaryotic proteins include DRIM (Down-Regulated In Metastasis) (O75691 from SWISSPROT), which is differentially expressed in metastatic and non-metastatic human breast carcinoma cells []. It is believed to be involved in processing of non-coding RNA [].
Probab=34.16 E-value=66 Score=23.70 Aligned_cols=31 Identities=19% Similarity=0.442 Sum_probs=26.4
Q ss_pred hccchHHHHHHHHhCCcchHHHHHHHHHHHH
Q psy4701 87 MAKEGILEMVLVMANTEDILQQKVACECIIA 117 (124)
Q Consensus 87 ls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~ 117 (124)
+...++-+....++.++|...|+.|.+||..
T Consensus 13 ~~~~~l~~~~~~LL~~~d~~vQklAL~cll~ 43 (141)
T PF07539_consen 13 YRSDELYDALLRLLSSRDPEVQKLALDCLLT 43 (141)
T ss_pred hhHHHHHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 4456677788999999999999999999975
No 41
>PF07208 DUF1414: Protein of unknown function (DUF1414); InterPro: IPR009857 This family consists of several hypothetical bacterial proteins of around 70 residues in length. Members of this family are often referred to as YejL. The function of this family is unknown.; PDB: 2JPQ_A 2JUZ_B 2JUW_B 2QTI_A 2OTA_A 2JR2_A 2JRX_A.
Probab=33.01 E-value=47 Score=20.38 Aligned_cols=21 Identities=38% Similarity=0.520 Sum_probs=16.2
Q ss_pred HHHHHHhhc------CCChHHHHHHHH
Q psy4701 55 GLMALCNLA------AIGEKQRQRILK 75 (124)
Q Consensus 55 aLlALTNLA------s~~~~~r~~Iv~ 75 (124)
+||.|-|.+ +++++.|+.|..
T Consensus 8 sLMvLGN~vTniln~~V~~~qR~~iAe 34 (44)
T PF07208_consen 8 SLMVLGNMVTNILNTSVPPAQRQAIAE 34 (44)
T ss_dssp HHHHHHHHHHHHHHHCS-HHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhcCCHHHHHHHHH
Confidence 789887775 788888988876
No 42
>KOG2122|consensus
Probab=32.97 E-value=57 Score=33.65 Aligned_cols=26 Identities=27% Similarity=0.286 Sum_probs=17.4
Q ss_pred chhhhhHHHHHHHhhcCCChHHHHHHH
Q psy4701 48 TALETFEGLMALCNLAAIGEKQRQRIL 74 (124)
Q Consensus 48 ~~l~~fEaLlALTNLAs~~~~~r~~Iv 74 (124)
..|-.| |+||||||.=-+..-+.++.
T Consensus 365 ~aLRrY-a~MALTNLTFGDv~NKa~LC 390 (2195)
T KOG2122|consen 365 NALRRY-AGMALTNLTFGDVANKATLC 390 (2195)
T ss_pred HHHHHH-HHHHhhccccccccchhhhh
Confidence 467777 78999999866554344443
No 43
>PF12031 DUF3518: Domain of unknown function (DUF3518); InterPro: IPR021906 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 260 amino acids in length. This domain is found associated with PF01388 from PFAM.
Probab=31.23 E-value=1.7e+02 Score=24.15 Aligned_cols=72 Identities=25% Similarity=0.244 Sum_probs=49.0
Q ss_pred hhHHHHHHHHHHHhHhhCCC---cccCCCCCCc-chHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHc
Q psy4701 5 KGKRQAAQGLARLGITLNPE---VAFPGERSLE-VVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKE 76 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~---l~F~~~~~~~-~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~ 76 (124)
-.|+.|..+|+|+.|.=|=- +.-|+.+-++ .+.-|+.+|....+..-.==|+.=|-|||.-++..++-|..+
T Consensus 139 SPqrlaLEaLcKLsV~e~NVDliLaTpp~sRlE~l~~~L~r~l~~~e~~v~REfAvvlL~~La~~~~~~~r~iA~q 214 (257)
T PF12031_consen 139 SPQRLALEALCKLSVIENNVDLILATPPFSRLERLFHTLVRLLGMREDQVCREFAVVLLSNLAQGDEAAARAIAMQ 214 (257)
T ss_pred CHHHHHHHHHHHhheeccCcceeeeCCCHHHHHHHHHHHHHHhccccchhHHHHHHHHHHHHhcccHHHHHHHHHh
Confidence 46899999999999874433 3333322122 556678888655554444447788999999999888887764
No 44
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=28.27 E-value=2.9e+02 Score=23.21 Aligned_cols=52 Identities=17% Similarity=0.202 Sum_probs=34.6
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhh
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNL 62 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNL 62 (124)
+.-|..|+-++.|+.=. +|.++-. . .++++..+|. |.+.-=..-|+.++..+
T Consensus 128 ~~VRk~A~~~l~~i~~~-~p~~~~~----~-~~~~l~~lL~-d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 128 PYVRKKAALALLKIYRK-DPDLVED----E-LIPKLKQLLS-DKDPSVVSAALSLLSEI 179 (526)
T ss_dssp HHHHHHHHHHHHHHHHH-CHCCHHG----G-HHHHHHHHTT-HSSHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHhcc-CHHHHHH----H-HHHHHhhhcc-CCcchhHHHHHHHHHHH
Confidence 45678888888887654 6664322 1 4778888885 55555556677777777
No 45
>KOG4500|consensus
Probab=26.01 E-value=98 Score=28.10 Aligned_cols=83 Identities=13% Similarity=-0.004 Sum_probs=55.1
Q ss_pred hhHHHHHHHHHHHhHhhCCCcccCCCCC-CcchHhhHhhcCCCCc--hhhhhHHHHHHHhhcCCChHHHHHHHHccc---
Q psy4701 5 KGKRQAAQGLARLGITLNPEVAFPGERS-LEVVRPLLSLLHPEAT--ALETFEGLMALCNLAAIGEKQRQRILKEKG--- 78 (124)
Q Consensus 5 ~~~~~A~qaLAkilIs~nP~l~F~~~~~-~~~v~pL~~LL~~~~~--~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G--- 78 (124)
.-+..++.++|++.++-=-.-.|..+-+ .+++.||..+...+++ .+|.|.| |-|+++.+++-|..+..-+|
T Consensus 57 tv~~~qssC~A~~sk~ev~r~~F~~~~I~a~~le~Lrq~psS~d~ev~~Q~~Ra---LgNiCydn~E~R~a~~~lgGaqi 133 (604)
T KOG4500|consen 57 TVYLFQSSCLADRSKNEVERSLFRNYCIDAEALELLRQTPSSPDTEVHEQCFRA---LGNICYDNNENRAAFFNLGGAQI 133 (604)
T ss_pred hhhhhhHHHHHHHhhhHHHHHHHHHHhhHHHHHHHHHhCCCCCcccHHHHHHHH---HhhhhccCchhHHHHHhcCCcee
Confidence 3456677778988743222345777632 4577888887753433 5778875 68999999999999998777
Q ss_pred -hhhhhhhhhccc
Q psy4701 79 -PLEVGSSIMAKE 90 (124)
Q Consensus 79 -~s~IEnlLls~n 90 (124)
+.-+..+...++
T Consensus 134 vid~L~~~cs~d~ 146 (604)
T KOG4500|consen 134 VIDVLKPYCSKDN 146 (604)
T ss_pred hHhhhccccccCC
Confidence 444444444444
No 46
>PF07571 DUF1546: Protein of unknown function (DUF1546); InterPro: IPR011442 These proteins are associated with IPR004823 from INTERPRO in transcription initiation factor TFIID subunit 6 (TAF6).; GO: 0051090 regulation of sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=23.73 E-value=1.1e+02 Score=20.81 Aligned_cols=66 Identities=24% Similarity=0.286 Sum_probs=40.8
Q ss_pred hhhHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhh-cCCCCchhhhhHHHHHHHhhcCCChHHHHHHH
Q psy4701 4 VKGKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSL-LHPEATALETFEGLMALCNLAAIGEKQRQRIL 74 (124)
Q Consensus 4 ~~~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~L-L~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv 74 (124)
+.=|..|++-|+.|+-.-+.+ ++.-+ ...++-|..- +.|..+.-..|=|+++|..| -++-+|.-|+
T Consensus 20 ~~LRd~AA~lL~~I~~~~~~~--~~~L~-~Ri~~tl~k~l~d~~~~~~t~YGAi~gL~~l--G~~~vr~~il 86 (92)
T PF07571_consen 20 WALRDFAASLLAQICRKFSSS--YPTLQ-PRITRTLLKALLDPKKPLGTHYGAIVGLSAL--GPEAVRALIL 86 (92)
T ss_pred HHHHHHHHHHHHHHHHHhccc--cchHH-HHHHHHHHHHHcCCCCCHHHHHHHHHHHHHH--HHHHHHHhhc
Confidence 345788999999988763322 00000 0134444444 46777788999999999998 2355555554
No 47
>PF03130 HEAT_PBS: PBS lyase HEAT-like repeat; InterPro: IPR004155 These proteins contain a short bi-helical repeat that is related to HEAT. Cyanobacteria and red algae harvest light energy using macromolecular complexes known as phycobilisomes (PBS), peripherally attached to the photosynthetic membrane. The major components of PBS are the phycobiliproteins. These heterodimeric proteins are covalently attached to phycobilins: open-chain tetrapyrrole chromophores, which function as the photosynthetic light-harvesting pigments. Phycobiliproteins differ in sequence and in the nature and number of attached phycobilins to each of their subunits. These proteins include the lyase enzymes that specifically attach particular phycobilins to apophycobiliprotein subunits. The most comprehensively studied of these is the CpcE/Flyase P31967 from SWISSPROT, P31968 from SWISSPROT, which attaches phycocyanobilin (PCB) to the alpha subunit of apophycocyanin []. Similarly, MpeU/V attaches phycoerythrobilin to phycoerythrin II, while CpeY/Z is thought to be involved in phycoerythrobilin (PEB) attachment to phycoerythrin (PE) I (PEs I and II differ in sequence and in the number of attached molecules of PEB: PE I has five, PE II has six) []. All the reactions of the above lyases involve an apoprotein cysteine SH addition to a terminal delta 3,3'-double bond. Such a reaction is not possible in the case of phycoviolobilin (PVB), the phycobilin of alpha-phycoerythrocyanin (alpha-PEC). It is thought that in this case, PCB, not PVB, is first added to apo-alpha-PEC, and is then isomerized to PVB. The addition reaction has been shown to occur in the presence of either of the components of alpha-PEC-PVB lyase PecE or PecF (or both). The isomerisation reaction occurs only when both PecE and PecF components are present, i.e. the PecE/F phycobiliprotein lyase is also a phycobilin isomerase []. Another member of this family is the NblB protein, whose similarity to the phycobiliprotein lyases was previously noted []. This constitutively expressed protein is not known to have any lyase activity. It is thought to be involved in the coordination of PBS degradation with environmental nutrient limitation. It has been suggested that the similarity of NblB to the phycobiliprotein lyases is due to the ability to bind tetrapyrrole phycobilins via the common repeated motif [].; PDB: 1TE4_A.
Probab=23.05 E-value=27 Score=18.40 Aligned_cols=24 Identities=29% Similarity=0.502 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhc
Q psy4701 7 KRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLL 43 (124)
Q Consensus 7 ~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL 43 (124)
|..|+.+|.+|.- -++|+||.+.|
T Consensus 2 R~~Aa~aLg~igd-------------~~ai~~L~~~L 25 (27)
T PF03130_consen 2 RRAAARALGQIGD-------------PRAIPALIEAL 25 (27)
T ss_dssp HHHHHHHHGGG-S-------------HHHHHHHHHHH
T ss_pred HHHHHHHHHHcCC-------------HHHHHHHHHHh
Confidence 5566777766543 34677777665
No 48
>PF04063 DUF383: Domain of unknown function (DUF383); InterPro: IPR007205 This is a protein of unknown function. It is found N-terminal to another domain of unknown function (IPR007206 from INTERPRO).
Probab=22.65 E-value=2.7e+02 Score=21.53 Aligned_cols=51 Identities=18% Similarity=0.201 Sum_probs=35.5
Q ss_pred HHhhcCCChHHHHHHHHccchhhhhhhhhccchHHHHHHHHhCCcchHHHHHHHHHHHHHhh
Q psy4701 59 LCNLAAIGEKQRQRILKEKGPLEVGSSIMAKEGILEMVLVMANTEDILQQKVACECIIAVAS 120 (124)
Q Consensus 59 LTNLAs~~~~~r~~Iv~~~G~s~IEnlLls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~s 120 (124)
|.|++++ ++.|+.++..+-... .+++.+....+.+.+--.-++.+|-|.|=
T Consensus 82 l~NlS~~-~~gR~~~l~~~~~~~----------~l~kLl~ft~~~s~iRR~Gva~~IrNccF 132 (192)
T PF04063_consen 82 LANLSQL-PEGRQFFLDPQRYDG----------PLQKLLPFTEHKSVIRRGGVAGTIRNCCF 132 (192)
T ss_pred HHHhcCC-HHHHHHHhCchhhhh----------HHHHHHHHhccCcHHHHHHHHHHHHHhhc
Confidence 6799888 677777775321111 67777888888888877778877777663
No 49
>PF03278 IpaB_EvcA: IpaB/EvcA family; InterPro: IPR004959 The IpgB family includes the invasion plasmid antigen from Shigella flexneri [], as well as related proteins from Escherichia coli species. Members of this family seem to be involved in pathogenicity of some enterobacteria. However the exact function of these proteins is unclear.; GO: 0009405 pathogenesis; PDB: 3LYQ_A 3LW8_G 3LWN_F 3LXR_F 3GCG_B.
Probab=21.67 E-value=3.7e+02 Score=20.45 Aligned_cols=62 Identities=16% Similarity=0.107 Sum_probs=43.8
Q ss_pred HhhcCCChHHHHHHHHc----cch--------hhhhhhhhccchHHHHHHHHhCCcchHHHHHHHHHHHHHhhh
Q psy4701 60 CNLAAIGEKQRQRILKE----KGP--------LEVGSSIMAKEGILEMVLVMANTEDILQQKVACECIIAVASK 121 (124)
Q Consensus 60 TNLAs~~~~~r~~Iv~~----~G~--------s~IEnlLls~n~~Lqral~M~~S~d~~~Q~~a~e~ii~a~sk 121 (124)
-|=..++.+.|.+|++. .+. +.|+..+.++...-..+=.+|.|.|...+-...+.+++..++
T Consensus 57 ~n~~~Is~e~k~~IF~~i~~~~~~~LD~naAQsSI~H~l~~n~~f~kk~d~lc~~~d~~~kn~~~~~i~n~iad 130 (152)
T PF03278_consen 57 QNNSNISKETKQRIFKLIEKKFGIKLDVNAAQSSINHFLQSNKYFNKKMDELCKGMDRDVKNQTQTYIINQIAD 130 (152)
T ss_dssp HCT----HHHHHHHHHHHHHHHT----TT----HHHHHHHT-HHHHHHHHHHCTTB-HHHHHHHHHHHHHHHHH
T ss_pred HcCCcCCHHHHHHHHHHHHHHhCCccchhHHHhhHHHHHhcCHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHH
Confidence 35556788999999862 122 778888888888888888999999999988888888887765
No 50
>smart00567 EZ_HEAT E-Z type HEAT repeats. Present in subunits of cyanobacterial phycocyanin lyase, and other proteins. Probable scaffolding role.
Probab=21.48 E-value=1.1e+02 Score=15.83 Aligned_cols=26 Identities=35% Similarity=0.454 Sum_probs=18.2
Q ss_pred hHHHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcC
Q psy4701 6 GKRQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLH 44 (124)
Q Consensus 6 ~~~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~ 44 (124)
-|..|+.+|.+|. .-+++++|...|.
T Consensus 3 vR~~aa~aLg~~~-------------~~~a~~~L~~~l~ 28 (30)
T smart00567 3 VRHEAAFALGQLG-------------DEEAVPALIKALE 28 (30)
T ss_pred HHHHHHHHHHHcC-------------CHhHHHHHHHHhc
Confidence 4677888888872 1346778888775
No 51
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=20.49 E-value=2.5e+02 Score=19.21 Aligned_cols=85 Identities=20% Similarity=0.290 Sum_probs=52.4
Q ss_pred HHHHHHHHHHhHhhCCCcccCCCCCCcchHhhHhhcCCCCchhhhhHHHHHHHhhcCCChHHHHHHHHccchhhhhhhhh
Q psy4701 8 RQAAQGLARLGITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAAIGEKQRQRILKEKGPLEVGSSIM 87 (124)
Q Consensus 8 ~~A~qaLAkilIs~nP~l~F~~~~~~~~v~pL~~LL~~~~~~l~~fEaLlALTNLAs~~~~~r~~Iv~~~G~s~IEnlLl 87 (124)
.-+.-+||.+.|..-..+ .. ..-+.++|+...+. |.+.-=+|=|.=||-|++-. .|..++.
T Consensus 4 ~ggli~Laa~ai~l~~~~--~~-~l~~Il~pVL~~~~-D~d~rVRy~AcEaL~ni~k~---~~~~~l~------------ 64 (97)
T PF12755_consen 4 KGGLIGLAAVAIALGKDI--SK-YLDEILPPVLKCFD-DQDSRVRYYACEALYNISKV---ARGEILP------------ 64 (97)
T ss_pred hHHHHHHHHHHHHchHhH--HH-HHHHHHHHHHHHcC-CCcHHHHHHHHHHHHHHHHH---HHHHHHH------------
Confidence 345667777777766552 21 23448999999997 55555588888888888743 2333322
Q ss_pred ccchHHHHHHHHhCCcchHHHHHH
Q psy4701 88 AKEGILEMVLVMANTEDILQQKVA 111 (124)
Q Consensus 88 s~n~~Lqral~M~~S~d~~~Q~~a 111 (124)
.=+++....+.+++..|+.-|..|
T Consensus 65 ~f~~IF~~L~kl~~D~d~~Vr~~a 88 (97)
T PF12755_consen 65 YFNEIFDALCKLSADPDENVRSAA 88 (97)
T ss_pred HHHHHHHHHHHHHcCCchhHHHHH
Confidence 124455666666666666666555
Done!