Query psy14464
Match_columns 175
No_of_seqs 116 out of 121
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 20:32:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14464.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14464hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4151|consensus 99.5 3.1E-15 6.8E-20 143.4 4.0 161 7-175 302-465 (748)
2 cd00020 ARM Armadillo/beta-cat 96.6 0.0048 1E-07 43.8 4.8 113 14-133 7-119 (120)
3 PRK09687 putative lyase; Provi 95.6 0.25 5.4E-06 42.9 11.6 104 17-132 57-186 (280)
4 PF13646 HEAT_2: HEAT repeats; 94.4 0.26 5.6E-06 33.8 6.9 72 18-118 3-75 (88)
5 PLN03200 cellulose synthase-in 92.9 0.24 5.2E-06 53.9 6.6 120 10-137 647-767 (2102)
6 cd00020 ARM Armadillo/beta-cat 92.2 0.28 6.1E-06 34.5 4.4 73 62-138 8-81 (120)
7 PF13646 HEAT_2: HEAT repeats; 91.5 0.59 1.3E-05 32.0 5.2 58 14-88 31-88 (88)
8 PF04826 Arm_2: Armadillo-like 91.4 1.4 3.1E-05 38.0 8.6 109 14-133 54-162 (254)
9 PRK09687 putative lyase; Provi 91.3 2.6 5.7E-05 36.6 10.3 93 17-133 26-119 (280)
10 PF03224 V-ATPase_H_N: V-ATPas 89.3 0.79 1.7E-05 39.8 5.3 110 16-133 108-226 (312)
11 PF04826 Arm_2: Armadillo-like 88.6 2.2 4.8E-05 36.8 7.5 110 13-133 11-123 (254)
12 PF12719 Cnd3: Nuclear condens 87.2 2.6 5.7E-05 36.4 7.2 80 12-93 62-144 (298)
13 PF05804 KAP: Kinesin-associat 86.3 1.9 4.1E-05 42.6 6.4 111 12-133 288-398 (708)
14 PF00514 Arm: Armadillo/beta-c 83.9 0.65 1.4E-05 28.4 1.4 39 94-133 2-40 (41)
15 PF01602 Adaptin_N: Adaptin N 83.7 1.9 4.2E-05 38.8 4.8 103 11-132 76-179 (526)
16 KOG4224|consensus 81.9 4.7 0.0001 38.0 6.7 133 6-149 239-385 (550)
17 TIGR02270 conserved hypothetic 81.1 17 0.00036 33.6 10.0 113 16-152 149-290 (410)
18 PRK13800 putative oxidoreducta 79.3 9.1 0.0002 38.3 8.2 27 17-43 745-771 (897)
19 PLN03200 cellulose synthase-in 79.2 3.9 8.6E-05 45.0 5.9 130 10-148 441-572 (2102)
20 KOG2973|consensus 78.1 17 0.00036 33.3 8.7 101 21-133 10-110 (353)
21 PTZ00429 beta-adaptin; Provisi 71.4 15 0.00032 36.7 7.3 99 17-133 108-207 (746)
22 PF03224 V-ATPase_H_N: V-ATPas 70.5 6.6 0.00014 34.1 4.2 130 10-147 146-281 (312)
23 cd00256 VATPase_H VATPase_H, r 69.5 57 0.0012 30.6 10.3 110 16-133 104-215 (429)
24 KOG0166|consensus 68.2 51 0.0011 31.8 9.8 113 17-137 112-226 (514)
25 PRK13800 putative oxidoreducta 66.9 23 0.00049 35.6 7.6 57 16-90 623-679 (897)
26 PF10508 Proteasom_PSMB: Prote 63.0 19 0.00041 33.7 5.9 116 16-140 162-283 (503)
27 KOG1077|consensus 61.4 31 0.00067 35.0 7.2 112 9-132 324-472 (938)
28 KOG2062|consensus 60.1 24 0.00051 35.9 6.2 50 75-133 567-617 (929)
29 PF05804 KAP: Kinesin-associat 59.3 40 0.00087 33.6 7.6 117 9-138 325-444 (708)
30 smart00185 ARM Armadillo/beta- 57.6 12 0.00026 21.7 2.5 38 95-133 3-40 (41)
31 PF12719 Cnd3: Nuclear condens 55.8 1.3E+02 0.0029 25.8 12.1 107 17-133 117-232 (298)
32 PF05947 DUF879: Bacterial pro 55.5 22 0.00047 34.3 5.0 40 73-115 36-76 (602)
33 TIGR03359 VI_chp_6 type VI sec 54.9 24 0.00053 33.9 5.3 39 73-114 36-75 (598)
34 PF15596 Imm34: Immunity prote 53.5 16 0.00035 28.3 3.1 28 80-115 32-59 (110)
35 PF02985 HEAT: HEAT repeat; I 52.6 28 0.0006 20.0 3.4 27 16-42 2-28 (31)
36 KOG0166|consensus 51.2 96 0.0021 30.0 8.5 58 58-117 234-291 (514)
37 KOG1062|consensus 46.0 1E+02 0.0022 31.6 8.0 77 16-116 515-595 (866)
38 PF12031 DUF3518: Domain of un 45.7 45 0.00098 29.4 5.0 88 30-118 140-229 (257)
39 PF03408 Foamy_virus_ENV: Foam 43.8 10 0.00022 38.6 0.8 44 128-172 173-221 (981)
40 PF13513 HEAT_EZ: HEAT-like re 42.8 57 0.0012 20.5 4.1 34 8-41 22-55 (55)
41 PF10508 Proteasom_PSMB: Prote 42.2 1.3E+02 0.0029 28.1 7.9 115 11-133 74-188 (503)
42 PTZ00429 beta-adaptin; Provisi 39.5 2.6E+02 0.0057 28.1 9.8 35 10-44 136-170 (746)
43 PF01602 Adaptin_N: Adaptin N 38.0 1.4E+02 0.0031 26.9 7.2 112 10-140 110-223 (526)
44 KOG4596|consensus 35.0 34 0.00074 36.7 3.0 76 13-91 841-918 (1936)
45 COG4333 Uncharacterized protei 34.5 31 0.00068 28.4 2.1 68 87-170 39-106 (167)
46 KOG1061|consensus 34.0 1.9E+02 0.0041 29.2 7.8 116 10-140 117-233 (734)
47 KOG1062|consensus 32.5 1.6E+02 0.0034 30.3 7.0 60 14-87 313-374 (866)
48 PF06685 DUF1186: Protein of u 31.9 78 0.0017 27.6 4.3 136 26-171 54-212 (249)
49 KOG2759|consensus 31.0 3.9E+02 0.0085 25.5 9.0 54 83-138 220-274 (442)
50 PF03914 CBF: CBF/Mak21 family 30.0 37 0.0008 26.7 1.9 35 103-137 64-98 (164)
51 PF10363 DUF2435: Protein of u 29.6 2.3E+02 0.0049 20.7 6.1 64 18-91 8-71 (92)
52 PF11701 UNC45-central: Myosin 29.2 1E+02 0.0022 24.3 4.3 105 18-133 8-116 (157)
53 KOG2171|consensus 28.9 3.5E+02 0.0076 28.6 8.9 111 15-143 37-155 (1075)
54 PF05004 IFRD: Interferon-rela 27.5 4.3E+02 0.0093 23.3 10.1 142 6-157 121-290 (309)
55 PF12348 CLASP_N: CLASP N term 27.3 1.4E+02 0.003 23.9 4.9 71 16-90 133-204 (228)
56 COG1413 FOG: HEAT repeat [Ener 27.3 2.4E+02 0.0051 24.0 6.5 103 14-133 180-285 (335)
57 PF00607 Gag_p24: gag gene pro 27.3 2.1E+02 0.0045 23.9 6.0 96 30-125 104-204 (206)
58 COG1413 FOG: HEAT repeat [Ener 26.4 2.5E+02 0.0055 23.9 6.6 76 15-91 44-134 (335)
59 PRK00034 gatC aspartyl/glutamy 26.3 1.2E+02 0.0026 21.7 3.9 58 84-142 11-68 (95)
60 PF08167 RIX1: rRNA processing 26.3 2.9E+02 0.0063 21.9 6.5 62 9-70 58-125 (165)
61 KOG2003|consensus 25.9 3.9E+02 0.0084 26.4 8.1 109 16-133 439-572 (840)
62 PF10691 DUF2497: Protein of u 25.5 77 0.0017 22.7 2.7 32 81-112 32-64 (73)
63 COG3519 Type VI protein secret 25.5 1.1E+02 0.0023 30.4 4.4 37 76-114 42-79 (621)
64 PF11841 DUF3361: Domain of un 25.3 3.8E+02 0.0083 21.9 8.8 119 9-133 6-131 (160)
65 PF12755 Vac14_Fab1_bd: Vacuol 24.5 2.9E+02 0.0063 20.3 6.5 89 31-132 3-94 (97)
66 KOG2973|consensus 24.0 1.4E+02 0.0031 27.5 4.7 77 68-152 10-87 (353)
67 KOG4224|consensus 23.8 60 0.0013 30.9 2.3 60 73-133 177-236 (550)
68 KOG1104|consensus 22.6 2.1E+02 0.0045 29.1 5.8 75 56-134 21-97 (759)
69 KOG1048|consensus 22.5 1.6E+02 0.0034 29.7 5.0 90 24-117 529-618 (717)
70 PF07106 TBPIP: Tat binding pr 21.7 68 0.0015 25.4 2.0 41 108-150 4-44 (169)
71 PF09838 DUF2065: Uncharacteri 21.6 67 0.0015 21.8 1.7 19 82-100 9-27 (57)
72 PRK07418 acetolactate synthase 21.5 4.1E+02 0.009 25.3 7.5 103 53-163 130-272 (616)
73 TIGR02270 conserved hypothetic 21.4 6.6E+02 0.014 23.3 8.7 108 15-133 179-295 (410)
74 PF04063 DUF383: Domain of unk 20.9 4.5E+02 0.0097 21.8 6.8 57 78-134 73-132 (192)
75 PF03489 SapB_2: Saposin-like 20.6 1.1E+02 0.0025 17.8 2.4 34 5-38 1-34 (35)
76 COG1088 RfbB dTDP-D-glucose 4, 20.4 49 0.0011 30.3 1.0 35 122-157 169-207 (340)
77 PF14496 NEL: C-terminal novel 20.4 1.7E+02 0.0038 24.6 4.3 40 94-133 54-93 (221)
No 1
>KOG4151|consensus
Probab=99.54 E-value=3.1e-15 Score=143.42 Aligned_cols=161 Identities=20% Similarity=0.156 Sum_probs=147.6
Q ss_pred hHHHHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhh
Q psy14464 7 SITSIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEG 86 (175)
Q Consensus 7 ~~~~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEG 86 (175)
+|.-.+..|+.+|+.+|.+.++-+++++.+++|+.+..++++....+++++....|++.||.|++..+++.+.|
T Consensus 302 laql~~~~~~~~lR~ii~~~~~~~~v~ik~~~~~~~~vtit~~a~l~l~~sa~i~l~~~~r~~~ve~s~~qd~r------ 375 (748)
T KOG4151|consen 302 LAQLPLNCTVKILREIIHSFPSLRAVLIKYRDTEGDLVTITTTAELRLAESAVIKLGKTLRFYVVEVSPEQDPR------ 375 (748)
T ss_pred HHhcchhhchHHHHHHHhcccchhhheeeeccCCcceeeeeehhhhhhhhHHHHHHHHHhhhheeecCccccch------
Confidence 45556788999999999998888899999999999999888888889999999999999999999887776666
Q ss_pred hhhhccCHHHHHHHhhCHHHHHHHHHHhhcCC-CCcchh-hhHHHHhcccCCCCC-CcchHHHHHHhhhcCCCCCCCCCC
Q psy14464 87 LAYLTLDAEVKEALIEDKPALAALVDVASSGK-PACTYG-VVTTLVNLCNAYDKQ-EIIPEMIELAKFAKHHIPEDHELD 163 (175)
Q Consensus 87 LAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d-~s~~YG-~lsIl~NLTn~Y~~~-e~~~~l~~L~~ya~~~~pe~~pld 163 (175)
|+|+++++||++++|..+++++..+...++ +++.|+ +.|++.|.+|.|..+ |..++|-||++|++||+|+.||+|
T Consensus 376 --y~s~~a~~ke~~i~d~~s~~aln~l~~~~~~~~~~~~~lg~~l~s~a~~~~~~sE~~qel~~la~~a~qev~~~~~~n 453 (748)
T KOG4151|consen 376 --YLSIDAELKELKIEDSSSSKALNGLKGDAVESYLDLHELGMKLYSEAMEEEVTSEKAQELFDLAAFAFQEVAALSPLN 453 (748)
T ss_pred --hhccccchhhhhccccchHHHHHhhhcccccccccHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhccccc
Confidence 999999999999999999999999998874 489999 999999999999988 889999999999999999999999
Q ss_pred ChHHHHHHhhhC
Q psy14464 164 DPDFVTKRLLVL 175 (175)
Q Consensus 164 ~d~~V~~R~~~L 175 (175)
.-+++..|++++
T Consensus 454 ~~~v~~~r~rk~ 465 (748)
T KOG4151|consen 454 WGNVHMSRARKR 465 (748)
T ss_pred cchHHHHHHHHh
Confidence 999999998863
No 2
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=96.56 E-value=0.0048 Score=43.79 Aligned_cols=113 Identities=20% Similarity=0.240 Sum_probs=83.4
Q ss_pred hhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccC
Q psy14464 14 QGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLD 93 (175)
Q Consensus 14 ~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~ 93 (175)
.+++.|-.+++..+..++.-|+.+|+.+.... +.....+.+ .+....+.++|.+ .+...+..|+--|..++..
T Consensus 7 ~~i~~l~~~l~~~~~~~~~~a~~~l~~l~~~~--~~~~~~~~~---~~~i~~l~~~l~~--~~~~v~~~a~~~L~~l~~~ 79 (120)
T cd00020 7 GGLPALVSLLSSSDENVQREAAWALSNLSAGN--NDNIQAVVE---AGGLPALVQLLKS--EDEEVVKAALWALRNLAAG 79 (120)
T ss_pred CChHHHHHHHHcCCHHHHHHHHHHHHHHhcCC--HHHHHHHHH---CCChHHHHHHHhC--CCHHHHHHHHHHHHHHccC
Confidence 36788888888888899999999999998752 111111111 1344555566644 3667889999999999988
Q ss_pred HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 94 AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
+........+..++..|++++...+..+.--.+.+|.||+
T Consensus 80 ~~~~~~~~~~~g~l~~l~~~l~~~~~~~~~~a~~~l~~l~ 119 (120)
T cd00020 80 PEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119 (120)
T ss_pred cHHHHHHHHHCCChHHHHHHHhcCCHHHHHHHHHHHHHhh
Confidence 8655555556679999999998888888888899999987
No 3
>PRK09687 putative lyase; Provisional
Probab=95.58 E-value=0.25 Score=42.92 Aligned_cols=104 Identities=15% Similarity=0.096 Sum_probs=54.4
Q ss_pred HHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhcc----
Q psy14464 17 DILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTL---- 92 (175)
Q Consensus 17 ~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL---- 92 (175)
+.+..+.++.+..+|..|+-+|+.|+.... ...+-...+..++.++ .+...|..|+++|.|+.-
T Consensus 57 ~~l~~ll~~~d~~vR~~A~~aLg~lg~~~~-----------~~~~a~~~L~~l~~~D-~d~~VR~~A~~aLG~~~~~~~~ 124 (280)
T PRK09687 57 RLAIELCSSKNPIERDIGADILSQLGMAKR-----------CQDNVFNILNNLALED-KSACVRASAINATGHRCKKNPL 124 (280)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHhcCCCcc-----------chHHHHHHHHHHHhcC-CCHHHHHHHHHHHhcccccccc
Confidence 334444444455555555555555543210 1234455666665554 466788899999999731
Q ss_pred ----------------CHHHHHHHh------hCHHHHHHHHHHhhcCCCCcchhhhHHHHhc
Q psy14464 93 ----------------DAEVKEALI------EDKPALAALVDVASSGKPACTYGVVTTLVNL 132 (175)
Q Consensus 93 ----------------~p~VKE~L~------~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NL 132 (175)
+|.||...+ .|+..+..|+.++++.+..+-+..+..|.++
T Consensus 125 ~~~~a~~~l~~~~~D~~~~VR~~a~~aLg~~~~~~ai~~L~~~L~d~~~~VR~~A~~aLg~~ 186 (280)
T PRK09687 125 YSPKIVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLINLLKDPNGDVRNWAAFALNSN 186 (280)
T ss_pred cchHHHHHHHHHhhCCCHHHHHHHHHHHhccCCHHHHHHHHHHhcCCCHHHHHHHHHHHhcC
Confidence 223333322 2455555555555555445555555555554
No 4
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=94.35 E-value=0.26 Score=33.77 Aligned_cols=72 Identities=28% Similarity=0.382 Sum_probs=54.7
Q ss_pred HHHHHH-hcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHH
Q psy14464 18 ILKRLY-MSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEV 96 (175)
Q Consensus 18 ~Lk~l~-~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~V 96 (175)
.|-+.+ ++.++.+|..|+-.|++++. .+....+..++.+ .+...|..|++.|..+
T Consensus 3 ~L~~~l~~~~~~~vr~~a~~~L~~~~~----------------~~~~~~L~~~l~d--~~~~vr~~a~~aL~~i------ 58 (88)
T PF13646_consen 3 ALLQLLQNDPDPQVRAEAARALGELGD----------------PEAIPALIELLKD--EDPMVRRAAARALGRI------ 58 (88)
T ss_dssp HHHHHHHTSSSHHHHHHHHHHHHCCTH----------------HHHHHHHHHHHTS--SSHHHHHHHHHHHHCC------
T ss_pred HHHHHHhcCCCHHHHHHHHHHHHHcCC----------------HhHHHHHHHHHcC--CCHHHHHHHHHHHHHh------
Confidence 344444 78889999999999996653 3678888888833 5778999999999988
Q ss_pred HHHHhhCHHHHHHHHHHhhcCC
Q psy14464 97 KEALIEDKPALAALVDVASSGK 118 (175)
Q Consensus 97 KE~L~~D~~~Lk~L~~llk~~d 118 (175)
.++..+..|.+++.+.+
T Consensus 59 -----~~~~~~~~L~~~l~~~~ 75 (88)
T PF13646_consen 59 -----GDPEAIPALIKLLQDDD 75 (88)
T ss_dssp -----HHHHTHHHHHHHHTC-S
T ss_pred -----CCHHHHHHHHHHHcCCC
Confidence 36778888888887643
No 5
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=92.88 E-value=0.24 Score=53.87 Aligned_cols=120 Identities=14% Similarity=0.146 Sum_probs=84.8
Q ss_pred HHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCC-CCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhh
Q psy14464 10 SIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRP-FADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLA 88 (175)
Q Consensus 10 ~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~-~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLA 88 (175)
.+...|++-|=+++++++..++--|+.+|.-+-.. +.+..+.. -.. .....+-++|.. .+.+.+..|.++|+
T Consensus 647 vv~agaIpPLV~LLss~~~~v~keAA~AL~nL~~~-~~~~q~~~~v~~----GaV~pL~~LL~~--~d~~v~e~Al~ALa 719 (2102)
T PLN03200 647 LATDEIINPCIKLLTNNTEAVATQSARALAALSRS-IKENRKVSYAAE----DAIKPLIKLAKS--SSIEVAEQAVCALA 719 (2102)
T ss_pred HHHcCCHHHHHHHHhcCChHHHHHHHHHHHHHHhC-CCHHHHHHHHHc----CCHHHHHHHHhC--CChHHHHHHHHHHH
Confidence 45567778888888888888999999999888863 22211111 111 234456666644 36678999999999
Q ss_pred hhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCC
Q psy14464 89 YLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYD 137 (175)
Q Consensus 89 YlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~ 137 (175)
.+.-+++++..+... ..+..|++++++.....-=..+..+.+|++.++
T Consensus 720 nLl~~~e~~~ei~~~-~~I~~Lv~lLr~G~~~~k~~Aa~AL~~L~~~~~ 767 (2102)
T PLN03200 720 NLLSDPEVAAEALAE-DIILPLTRVLREGTLEGKRNAARALAQLLKHFP 767 (2102)
T ss_pred HHHcCchHHHHHHhc-CcHHHHHHHHHhCChHHHHHHHHHHHHHHhCCC
Confidence 999999999999854 558999999987644444555666777775544
No 6
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=92.24 E-value=0.28 Score=34.54 Aligned_cols=73 Identities=26% Similarity=0.241 Sum_probs=59.8
Q ss_pred HHHHHHHhhcCCCCCchhhhHHHhhhhhhccC-HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCC
Q psy14464 62 LAEACRRFLVNPARDPDIRRWAAEGLAYLTLD-AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDK 138 (175)
Q Consensus 62 La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~-p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~ 138 (175)
+...+-++|.+. +.+.+..|+-.|+.++.. |+.+..+.. ..+++.|++++.+.+..+....+.++.||+ ...+
T Consensus 8 ~i~~l~~~l~~~--~~~~~~~a~~~l~~l~~~~~~~~~~~~~-~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~-~~~~ 81 (120)
T cd00020 8 GLPALVSLLSSS--DENVQREAAWALSNLSAGNNDNIQAVVE-AGGLPALVQLLKSEDEEVVKAALWALRNLA-AGPE 81 (120)
T ss_pred ChHHHHHHHHcC--CHHHHHHHHHHHHHHhcCCHHHHHHHHH-CCChHHHHHHHhCCCHHHHHHHHHHHHHHc-cCcH
Confidence 455666777543 457889999999999998 888888876 488999999999888899999999999999 5443
No 7
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=91.51 E-value=0.59 Score=31.95 Aligned_cols=58 Identities=24% Similarity=0.308 Sum_probs=44.7
Q ss_pred hhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhh
Q psy14464 14 QGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLA 88 (175)
Q Consensus 14 ~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLA 88 (175)
..++.|..+++++++.||..|+.+|.++|. .+....+.+++.++ .+...|.+|++.|+
T Consensus 31 ~~~~~L~~~l~d~~~~vr~~a~~aL~~i~~----------------~~~~~~L~~~l~~~-~~~~vr~~a~~aL~ 88 (88)
T PF13646_consen 31 EAIPALIELLKDEDPMVRRAAARALGRIGD----------------PEAIPALIKLLQDD-DDEVVREAAAEALG 88 (88)
T ss_dssp HHHHHHHHHHTSSSHHHHHHHHHHHHCCHH----------------HHTHHHHHHHHTC--SSHHHHHHHHHHHH
T ss_pred hHHHHHHHHHcCCCHHHHHHHHHHHHHhCC----------------HHHHHHHHHHHcCC-CcHHHHHHHHhhcC
Confidence 346888888899999999999999999863 45667788888664 34445788998875
No 8
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=91.38 E-value=1.4 Score=38.01 Aligned_cols=109 Identities=20% Similarity=0.307 Sum_probs=81.7
Q ss_pred hhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccC
Q psy14464 14 QGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLD 93 (175)
Q Consensus 14 ~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~ 93 (175)
.|++++..+..+.++.+|..|+-+|.=+....... .+ -..-+...|+..+.. .-+.+....+.--|.-+|+.
T Consensus 54 Ggi~lI~~lL~~p~~~vr~~AL~aL~Nls~~~en~---~~----Ik~~i~~Vc~~~~s~-~lns~~Q~agLrlL~nLtv~ 125 (254)
T PF04826_consen 54 GGISLIGSLLNDPNPSVREKALNALNNLSVNDENQ---EQ----IKMYIPQVCEETVSS-PLNSEVQLAGLRLLTNLTVT 125 (254)
T ss_pred CCHHHHHHHcCCCChHHHHHHHHHHHhcCCChhhH---HH----HHHHHHHHHHHHhcC-CCCCHHHHHHHHHHHccCCC
Confidence 47799999999999999999999998776532111 11 123455555555543 34666667788889999999
Q ss_pred HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 94 AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
.+..+.++. .+..|+.++.+++.....=++-+++||+
T Consensus 126 ~~~~~~l~~---~i~~ll~LL~~G~~~~k~~vLk~L~nLS 162 (254)
T PF04826_consen 126 NDYHHMLAN---YIPDLLSLLSSGSEKTKVQVLKVLVNLS 162 (254)
T ss_pred cchhhhHHh---hHHHHHHHHHcCChHHHHHHHHHHHHhc
Confidence 999998864 6888899998887777788888999998
No 9
>PRK09687 putative lyase; Provisional
Probab=91.35 E-value=2.6 Score=36.56 Aligned_cols=93 Identities=17% Similarity=0.191 Sum_probs=67.8
Q ss_pred HHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHH
Q psy14464 17 DILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEV 96 (175)
Q Consensus 17 ~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~V 96 (175)
+.|..++...+..+|..|+-.|.+++.. +....+.+++.+ .+...|.+|+.+|.++.-....
T Consensus 26 ~~L~~~L~d~d~~vR~~A~~aL~~~~~~----------------~~~~~l~~ll~~--~d~~vR~~A~~aLg~lg~~~~~ 87 (280)
T PRK09687 26 DELFRLLDDHNSLKRISSIRVLQLRGGQ----------------DVFRLAIELCSS--KNPIERDIGADILSQLGMAKRC 87 (280)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHhcCcc----------------hHHHHHHHHHhC--CCHHHHHHHHHHHHhcCCCccc
Confidence 4566777888899999999999988752 444556666544 3668899999999998764432
Q ss_pred HHHHhhCHHHHHHHHHHh-hcCCCCcchhhhHHHHhcc
Q psy14464 97 KEALIEDKPALAALVDVA-SSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 97 KE~L~~D~~~Lk~L~~ll-k~~d~s~~YG~lsIl~NLT 133 (175)
+...+..|..++ ++.+..+....+..+.+++
T Consensus 88 ------~~~a~~~L~~l~~~D~d~~VR~~A~~aLG~~~ 119 (280)
T PRK09687 88 ------QDNVFNILNNLALEDKSACVRASAINATGHRC 119 (280)
T ss_pred ------hHHHHHHHHHHHhcCCCHHHHHHHHHHHhccc
Confidence 446667777663 5566677778888888886
No 10
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=89.26 E-value=0.79 Score=39.76 Aligned_cols=110 Identities=19% Similarity=0.218 Sum_probs=79.4
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCC--CCchhhhHHHhhhhhhccC
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPA--RDPDIRRWAAEGLAYLTLD 93 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~--~~~~~~~~aiEGLAYlSL~ 93 (175)
-++|+ +..+++.-|+..|+-.|+++-+.++... .....+....+-.||.+.- .+.+....++..|+-+.-.
T Consensus 108 ~~fl~-ll~~~D~~i~~~a~~iLt~Ll~~~~~~~------~~~~~~~l~~ll~~L~~~l~~~~~~~~~~av~~L~~LL~~ 180 (312)
T PF03224_consen 108 SPFLK-LLDRNDSFIQLKAAFILTSLLSQGPKRS------EKLVKEALPKLLQWLSSQLSSSDSELQYIAVQCLQNLLRS 180 (312)
T ss_dssp HHHHH-H-S-SSHHHHHHHHHHHHHHHTSTTT--------HHHHHHHHHHHHHHHH-TT-HHHH---HHHHHHHHHHHTS
T ss_pred HHHHH-HhcCCCHHHHHHHHHHHHHHHHcCCccc------cchHHHHHHHHHHHHHHhhcCCCcchHHHHHHHHHHHhCc
Confidence 45666 6778899999999999999998754221 1111366678888887532 2233458999999999999
Q ss_pred HHHHHHHhhCHHHHHHHHHHh------hc-CCCCcchhhhHHHHhcc
Q psy14464 94 AEVKEALIEDKPALAALVDVA------SS-GKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE~L~~D~~~Lk~L~~ll------k~-~d~s~~YG~lsIl~NLT 133 (175)
+..|..+.+ ...++.|++++ .. .+-.++|-++.++--||
T Consensus 181 ~~~R~~f~~-~~~v~~l~~iL~~~~~~~~~~~~Ql~Y~~ll~lWlLS 226 (312)
T PF03224_consen 181 KEYRQVFWK-SNGVSPLFDILRKQATNSNSSGIQLQYQALLCLWLLS 226 (312)
T ss_dssp HHHHHHHHT-HHHHHHHHHHHH---------HHHHHHHHHHHHHHHT
T ss_pred chhHHHHHh-cCcHHHHHHHHHhhcccCCCCchhHHHHHHHHHHHHh
Confidence 999999988 99999999999 22 25667899999999998
No 11
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=88.60 E-value=2.2 Score=36.84 Aligned_cols=110 Identities=20% Similarity=0.231 Sum_probs=78.0
Q ss_pred HhhHHHHHHHHh-cCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhc
Q psy14464 13 KQGVDILKRLYM-SKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLT 91 (175)
Q Consensus 13 ~~g~~~Lk~l~~-s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlS 91 (175)
.+.++-|-.+++ +.++.|+..|+.+|+...++..+..-.+.. | -...+.++|-+ ++...+..|+..|+-+|
T Consensus 11 ~~~l~~Ll~lL~~t~dp~i~e~al~al~n~aaf~~nq~~Ir~~--G----gi~lI~~lL~~--p~~~vr~~AL~aL~Nls 82 (254)
T PF04826_consen 11 AQELQKLLCLLESTEDPFIQEKALIALGNSAAFPFNQDIIRDL--G----GISLIGSLLND--PNPSVREKALNALNNLS 82 (254)
T ss_pred HHHHHHHHHHHhcCCChHHHHHHHHHHHhhccChhHHHHHHHc--C----CHHHHHHHcCC--CChHHHHHHHHHHHhcC
Confidence 445566666666 778999999999999987653221111111 1 12345566644 36678889999999999
Q ss_pred cCHHHHHHHhhCHHHHHHHHHHhhc--CCCCcchhhhHHHHhcc
Q psy14464 92 LDAEVKEALIEDKPALAALVDVASS--GKPACTYGVVTTLVNLC 133 (175)
Q Consensus 92 L~p~VKE~L~~D~~~Lk~L~~llk~--~d~s~~YG~lsIl~NLT 133 (175)
...+=.+.+. ..+..+.+...+ -++.++=.++-.+.|||
T Consensus 83 ~~~en~~~Ik---~~i~~Vc~~~~s~~lns~~Q~agLrlL~nLt 123 (254)
T PF04826_consen 83 VNDENQEQIK---MYIPQVCEETVSSPLNSEVQLAGLRLLTNLT 123 (254)
T ss_pred CChhhHHHHH---HHHHHHHHHHhcCCCCCHHHHHHHHHHHccC
Confidence 9999888874 466666665544 38899999999999998
No 12
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=87.19 E-value=2.6 Score=36.35 Aligned_cols=80 Identities=23% Similarity=0.331 Sum_probs=58.5
Q ss_pred HHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCC---CCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhh
Q psy14464 12 IKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDA---SIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLA 88 (175)
Q Consensus 12 ~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~---~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLA 88 (175)
-.+.+.++-+.++.+++.+|+.|+=+++-+--.-|.+. ............+.+.+.++|.+. +.+.+--|+||+|
T Consensus 62 a~~~l~l~~~~~~~~~~~v~~~al~~l~Dll~~~g~~~~~~~~~~~~~~~~~~l~~~l~~~l~~~--~~~~~~~a~EGl~ 139 (298)
T PF12719_consen 62 AKEHLPLFLQALQKDDEEVKITALKALFDLLLTHGIDIFDSESDNDESVDSKSLLKILTKFLDSE--NPELQAIAVEGLC 139 (298)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHcCchhccchhccCccchHhHHHHHHHHHHhcC--CHHHHHHHHHHHH
Confidence 34556777777777789999999999988876544322 111124556789999999999775 6678899999999
Q ss_pred hhccC
Q psy14464 89 YLTLD 93 (175)
Q Consensus 89 YlSL~ 93 (175)
=|=+.
T Consensus 140 KLlL~ 144 (298)
T PF12719_consen 140 KLLLS 144 (298)
T ss_pred HHHhc
Confidence 76554
No 13
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=86.28 E-value=1.9 Score=42.62 Aligned_cols=111 Identities=21% Similarity=0.219 Sum_probs=81.7
Q ss_pred HHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhc
Q psy14464 12 IKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLT 91 (175)
Q Consensus 12 ~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlS 91 (175)
-.+++..|-+.+.++|.++.+++...|-||.-.+.+ +....+ ......+.+++-. ++.+.++.++--|-=||
T Consensus 288 ~~~iV~~Lv~~Ldr~n~ellil~v~fLkkLSi~~EN---K~~m~~---~giV~kL~kLl~s--~~~~l~~~aLrlL~NLS 359 (708)
T PF05804_consen 288 NKGIVSLLVKCLDRENEELLILAVTFLKKLSIFKEN---KDEMAE---SGIVEKLLKLLPS--ENEDLVNVALRLLFNLS 359 (708)
T ss_pred hcCCHHHHHHHHcCCCHHHHHHHHHHHHHHcCCHHH---HHHHHH---cCCHHHHHHHhcC--CCHHHHHHHHHHHHHhC
Confidence 356678898999999999999999999999976332 222221 1234445555543 35678899999999999
Q ss_pred cCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 92 LDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 92 L~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
.++++|..+++- .++..|+.+++.. ....-++.|+-||+
T Consensus 360 fd~~~R~~mV~~-GlIPkLv~LL~d~--~~~~val~iLy~LS 398 (708)
T PF05804_consen 360 FDPELRSQMVSL-GLIPKLVELLKDP--NFREVALKILYNLS 398 (708)
T ss_pred cCHHHHHHHHHC-CCcHHHHHHhCCC--chHHHHHHHHHHhc
Confidence 999999988764 5677899999753 34344789999999
No 14
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=83.90 E-value=0.65 Score=28.44 Aligned_cols=39 Identities=26% Similarity=0.343 Sum_probs=33.5
Q ss_pred HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 94 AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
|+-|..++ +...++.|++++++.+..+..-.+..|.||+
T Consensus 2 ~~~~~~i~-~~g~i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 2 PENKQAIV-EAGGIPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp HHHHHHHH-HTTHHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHH-HcccHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 56666666 6678999999999989999999999999997
No 15
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=83.65 E-value=1.9 Score=38.85 Aligned_cols=103 Identities=23% Similarity=0.314 Sum_probs=76.2
Q ss_pred HHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhh
Q psy14464 11 IIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYL 90 (175)
Q Consensus 11 i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYl 90 (175)
.+.-.++-|.+=+++.|+.+|.+|+=.||.++.. .-.+.+.....+.+.++ ++..|+.|+-++.-+
T Consensus 76 ~~~l~~n~l~kdl~~~n~~~~~lAL~~l~~i~~~------------~~~~~l~~~v~~ll~~~--~~~VRk~A~~~l~~i 141 (526)
T PF01602_consen 76 LLILIINSLQKDLNSPNPYIRGLALRTLSNIRTP------------EMAEPLIPDVIKLLSDP--SPYVRKKAALALLKI 141 (526)
T ss_dssp HHHHHHHHHHHHHCSSSHHHHHHHHHHHHHH-SH------------HHHHHHHHHHHHHHHSS--SHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhcCCCHHHHHHHHhhhhhhccc------------chhhHHHHHHHHHhcCC--chHHHHHHHHHHHHH
Confidence 3334455555557789999999999999999953 12467777888888654 557899999999877
Q ss_pred c-cCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhc
Q psy14464 91 T-LDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNL 132 (175)
Q Consensus 91 S-L~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NL 132 (175)
. .+|+ +..+. +++.|.+++.+.+.++.+.++..+..+
T Consensus 142 ~~~~p~----~~~~~-~~~~l~~lL~d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 142 YRKDPD----LVEDE-LIPKLKQLLSDKDPSVVSAALSLLSEI 179 (526)
T ss_dssp HHHCHC----CHHGG-HHHHHHHHTTHSSHHHHHHHHHHHHHH
T ss_pred hccCHH----HHHHH-HHHHHhhhccCCcchhHHHHHHHHHHH
Confidence 4 4444 33344 788888888888888999999999888
No 16
>KOG4224|consensus
Probab=81.94 E-value=4.7 Score=37.99 Aligned_cols=133 Identities=26% Similarity=0.321 Sum_probs=88.9
Q ss_pred hhHHHHHHhh----HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhh-
Q psy14464 6 HSITSIIKQG----VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIR- 80 (175)
Q Consensus 6 ~~~~~i~~~g----~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~- 80 (175)
.|++.++.+. ++-|-++|.+++|++|-.|..+|--|.+. +..-+.--..|+.--|++.++. + ..+...
T Consensus 239 ~~~Rk~Laqaep~lv~~Lv~Lmd~~s~kvkcqA~lALrnlasd--t~Yq~eiv~ag~lP~lv~Llqs----~-~~plila 311 (550)
T KOG4224|consen 239 RRARKILAQAEPKLVPALVDLMDDGSDKVKCQAGLALRNLASD--TEYQREIVEAGSLPLLVELLQS----P-MGPLILA 311 (550)
T ss_pred HHHHHHHHhcccchHHHHHHHHhCCChHHHHHHHHHHhhhccc--chhhhHHHhcCCchHHHHHHhC----c-chhHHHH
Confidence 4566666553 57788889999999999999999888874 2221112234555666665422 1 111111
Q ss_pred -hHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCC-cchhhhHHHHhcccCCCCCC-------cchHHHHHH
Q psy14464 81 -RWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPA-CTYGVVTTLVNLCNAYDKQE-------IIPEMIELA 149 (175)
Q Consensus 81 -~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s-~~YG~lsIl~NLTn~Y~~~e-------~~~~l~~L~ 149 (175)
-.+|-.++ +.| --|.|..|..||+-|+.++.-.|.- ++-...+++-||.-+|+... ..|++.+|.
T Consensus 312 sVaCIrnis---ihp-lNe~lI~dagfl~pLVrlL~~~dnEeiqchAvstLrnLAasse~n~~~i~esgAi~kl~eL~ 385 (550)
T KOG4224|consen 312 SVACIRNIS---IHP-LNEVLIADAGFLRPLVRLLRAGDNEEIQCHAVSTLRNLAASSEHNVSVIRESGAIPKLIELL 385 (550)
T ss_pred HHHHHhhcc---ccc-CcccceecccchhHHHHHHhcCCchhhhhhHHHHHHHHhhhhhhhhHHHhhcCchHHHHHHH
Confidence 12333333 332 3467899999999999999887655 88899999999997798842 367777775
No 17
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=81.13 E-value=17 Score=33.63 Aligned_cols=113 Identities=19% Similarity=0.107 Sum_probs=68.8
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHH
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAE 95 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~ 95 (175)
.+.|..+++..+..++..|+-+|.+++.. +....++..+.+ .+++.|.+|++||+.+--..-
T Consensus 149 ~~~L~~~L~d~d~~Vra~A~raLG~l~~~----------------~a~~~L~~al~d--~~~~VR~aA~~al~~lG~~~A 210 (410)
T TIGR02270 149 GPALEAALTHEDALVRAAALRALGELPRR----------------LSESTLRLYLRD--SDPEVRFAALEAGLLAGSRLA 210 (410)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHhhccc----------------cchHHHHHHHcC--CCHHHHHHHHHHHHHcCCHhH
Confidence 34556666666677777777777777653 122234555543 477899999999987754221
Q ss_pred HHHHH---------------------hhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCC--------cchHHH
Q psy14464 96 VKEAL---------------------IEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQE--------IIPEMI 146 (175)
Q Consensus 96 VKE~L---------------------~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e--------~~~~l~ 146 (175)
+ +.+ ..++..+..|..+++.. .+...++..+..+- +|.. .++.+.
T Consensus 211 ~-~~l~~~~~~~g~~~~~~l~~~lal~~~~~a~~~L~~ll~d~--~vr~~a~~AlG~lg---~p~av~~L~~~l~d~~~a 284 (410)
T TIGR02270 211 W-GVCRRFQVLEGGPHRQRLLVLLAVAGGPDAQAWLRELLQAA--ATRREALRAVGLVG---DVEAAPWCLEAMREPPWA 284 (410)
T ss_pred H-HHHHHHHhccCccHHHHHHHHHHhCCchhHHHHHHHHhcCh--hhHHHHHHHHHHcC---CcchHHHHHHHhcCcHHH
Confidence 1 221 23445555666666543 47889999999888 5642 234466
Q ss_pred HHHhhh
Q psy14464 147 ELAKFA 152 (175)
Q Consensus 147 ~L~~ya 152 (175)
.++.-|
T Consensus 285 R~A~eA 290 (410)
T TIGR02270 285 RLAGEA 290 (410)
T ss_pred HHHHHH
Confidence 666444
No 18
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=79.28 E-value=9.1 Score=38.32 Aligned_cols=27 Identities=22% Similarity=0.427 Sum_probs=18.9
Q ss_pred HHHHHHHhcCCchhHHHHHHHHHHhcC
Q psy14464 17 DILKRLYMSKNENIRVRALVGLCKLGS 43 (175)
Q Consensus 17 ~~Lk~l~~s~~d~ir~~A~V~L~KL~~ 43 (175)
+.|..++...+..||..|+-+|..++.
T Consensus 745 ~~l~~~l~D~~~~VR~~aa~aL~~~~~ 771 (897)
T PRK13800 745 ESVAGAATDENREVRIAVAKGLATLGA 771 (897)
T ss_pred HHHHHHhcCCCHHHHHHHHHHHHHhcc
Confidence 455566667777777777777777765
No 19
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=79.23 E-value=3.9 Score=44.96 Aligned_cols=130 Identities=22% Similarity=0.173 Sum_probs=87.1
Q ss_pred HHHHh-hHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCC-CCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhh
Q psy14464 10 SIIKQ-GVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRP-FADGSTRKLAEACRRFLVNPARDPDIRRWAAEGL 87 (175)
Q Consensus 10 ~i~~~-g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~-~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGL 87 (175)
.|... |++.|-.++++++..+|..|+-.|.-|... ++.++.. -..|-+. .+-++|.++ +...++.|+-.|
T Consensus 441 aIi~~ggIp~LV~LL~s~s~~iQ~~A~~~L~nLa~~--ndenr~aIieaGaIP----~LV~LL~s~--~~~iqeeAawAL 512 (2102)
T PLN03200 441 ALGGREGVQLLISLLGLSSEQQQEYAVALLAILTDE--VDESKWAITAAGGIP----PLVQLLETG--SQKAKEDSATVL 512 (2102)
T ss_pred HHHHcCcHHHHHHHHcCCCHHHHHHHHHHHHHHHcC--CHHHHHHHHHCCCHH----HHHHHHcCC--CHHHHHHHHHHH
Confidence 34443 679999999988888998887777776542 2211110 1223344 445566443 557788888888
Q ss_pred hhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchHHHHH
Q psy14464 88 AYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPEMIEL 148 (175)
Q Consensus 88 AYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~l~~L 148 (175)
.=++-+.+-+..++.+...++.|++++++.+.-+.--.+-++.||+ ....++..+.+.+|
T Consensus 513 ~NLa~~~~qir~iV~~aGAIppLV~LL~sgd~~~q~~Aa~AL~nLi-~~~d~~~I~~Lv~L 572 (2102)
T PLN03200 513 WNLCCHSEDIRACVESAGAVPALLWLLKNGGPKGQEIAAKTLTKLV-RTADAATISQLTAL 572 (2102)
T ss_pred HHHhCCcHHHHHHHHHCCCHHHHHHHHhCCCHHHHHHHHHHHHHHH-hccchhHHHHHHHH
Confidence 8777776666667777789999999999887777788889999998 33333334444444
No 20
>KOG2973|consensus
Probab=78.13 E-value=17 Score=33.34 Aligned_cols=101 Identities=22% Similarity=0.266 Sum_probs=70.9
Q ss_pred HHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHHHHHH
Q psy14464 21 RLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKEAL 100 (175)
Q Consensus 21 ~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L 100 (175)
+++.+-++.+|-.|.-.|.=+-+.+ .+.+..+ ...+.+-++.++... .. .+.|+=.|-=+|=++.|++.|
T Consensus 10 ~ll~~~sP~v~~~AV~~l~~lt~~~-----~~~~~~~-~~~~lk~l~qL~~~~-~~---~~~a~~alVnlsq~~~l~~~l 79 (353)
T KOG2973|consen 10 ELLHSLSPPVRKAAVEHLLGLTGRG-----LQSLSKY-SEALLKDLTQLLKDL-DP---AEPAATALVNLSQKEELRKKL 79 (353)
T ss_pred HHhccCChHHHHHHHHHHhhccccc-----hhhhccc-hhhhHHHHHHHccCc-cc---ccHHHHHHHHHHhhHHHHHHH
Confidence 4556778877776655554444321 1223333 367778788887433 22 568888999999999999999
Q ss_pred hhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 101 IEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 101 ~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
..| |+|.++.++-.-.+++.=-..+.++|||
T Consensus 80 l~~--~~k~l~~~~~~p~~~lad~~cmlL~NLs 110 (353)
T KOG2973|consen 80 LQD--LLKVLMDMLTDPQSPLADLICMLLSNLS 110 (353)
T ss_pred HHH--HHHHHHHHhcCcccchHHHHHHHHHHhc
Confidence 999 9999999986554555566778889998
No 21
>PTZ00429 beta-adaptin; Provisional
Probab=71.44 E-value=15 Score=36.66 Aligned_cols=99 Identities=19% Similarity=0.237 Sum_probs=73.2
Q ss_pred HHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhc-cCHH
Q psy14464 17 DILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLT-LDAE 95 (175)
Q Consensus 17 ~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlS-L~p~ 95 (175)
+-|.+=+++.|+-||.+|+=.||+++... -.+.+...+++-|.+. ++-.|+.|+=+++-+. ++|+
T Consensus 108 Ntl~KDl~d~Np~IRaLALRtLs~Ir~~~------------i~e~l~~~lkk~L~D~--~pYVRKtAalai~Kly~~~pe 173 (746)
T PTZ00429 108 NTFLQDTTNSSPVVRALAVRTMMCIRVSS------------VLEYTLEPLRRAVADP--DPYVRKTAAMGLGKLFHDDMQ 173 (746)
T ss_pred HHHHHHcCCCCHHHHHHHHHHHHcCCcHH------------HHHHHHHHHHHHhcCC--CHHHHHHHHHHHHHHHhhCcc
Confidence 33444555788999999999999988641 2356777778877543 5668888888888763 4553
Q ss_pred HHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 96 VKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 96 VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
++.+..|+..|.+++.+.+..+++..+..|..+.
T Consensus 174 ----lv~~~~~~~~L~~LL~D~dp~Vv~nAl~aL~eI~ 207 (746)
T PTZ00429 174 ----LFYQQDFKKDLVELLNDNNPVVASNAAAIVCEVN 207 (746)
T ss_pred ----cccccchHHHHHHHhcCCCccHHHHHHHHHHHHH
Confidence 4556678888888888888888898888888776
No 22
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=70.49 E-value=6.6 Score=34.07 Aligned_cols=130 Identities=15% Similarity=0.092 Sum_probs=70.8
Q ss_pred HHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCC--CCCchhhhHHHhhh
Q psy14464 10 SIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNP--ARDPDIRRWAAEGL 87 (175)
Q Consensus 10 ~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~--~~~~~~~~~aiEGL 87 (175)
.++...++||+...++.+.+.+..|.-+|+-|-... .-+..+-+ .+....+...|... .......+-.=|.+
T Consensus 146 ~~l~~ll~~L~~~l~~~~~~~~~~av~~L~~LL~~~---~~R~~f~~---~~~v~~l~~iL~~~~~~~~~~~~Ql~Y~~l 219 (312)
T PF03224_consen 146 EALPKLLQWLSSQLSSSDSELQYIAVQCLQNLLRSK---EYRQVFWK---SNGVSPLFDILRKQATNSNSSGIQLQYQAL 219 (312)
T ss_dssp HHHHHHHHHHH-TT-HHHH---HHHHHHHHHHHTSH---HHHHHHHT---HHHHHHHHHHHH---------HHHHHHHHH
T ss_pred HHHHHHHHHHHHhhcCCCcchHHHHHHHHHHHhCcc---hhHHHHHh---cCcHHHHHHHHHhhcccCCCCchhHHHHHH
Confidence 567778899998777655544444444444443220 01112222 34455555555110 01112233333332
Q ss_pred ---hhhccCHHHHHHHhhCHHHHHHHHHHhhcCCC-CcchhhhHHHHhcccCCCCCCcchHHHH
Q psy14464 88 ---AYLTLDAEVKEALIEDKPALAALVDVASSGKP-ACTYGVVTTLVNLCNAYDKQEIIPEMIE 147 (175)
Q Consensus 88 ---AYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~-s~~YG~lsIl~NLTn~Y~~~e~~~~l~~ 147 (175)
==||..+++-+.+.... .+..|+++++.+.+ =++==+++||.||. .+.++.-.+.|.+
T Consensus 220 l~lWlLSF~~~~~~~~~~~~-~i~~L~~i~~~~~KEKvvRv~la~l~Nl~-~~~~~~~~~~mv~ 281 (312)
T PF03224_consen 220 LCLWLLSFEPEIAEELNKKY-LIPLLADILKDSIKEKVVRVSLAILRNLL-SKAPKSNIELMVL 281 (312)
T ss_dssp HHHHHHTTSHHHHHHHHTTS-HHHHHHHHHHH--SHHHHHHHHHHHHHTT-SSSSTTHHHHHHH
T ss_pred HHHHHHhcCHHHHHHHhccc-hHHHHHHHHHhcccchHHHHHHHHHHHHH-hccHHHHHHHHHH
Confidence 23899999999998888 99999999987633 34444589999999 6676655555444
No 23
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=69.47 E-value=57 Score=30.58 Aligned_cols=110 Identities=18% Similarity=0.128 Sum_probs=79.5
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHH
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAE 95 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~ 95 (175)
.+++. ++.+.++-|...|+-.|+++-+.+... ........+...+...|.++ .+.+.+..+|-.|+-+---+.
T Consensus 104 ~~fl~-lL~~~d~~i~~~a~~iLt~l~~~~~~~-----~~~~~l~~~~~~l~~~l~~~-~~~~~~~~~v~~L~~LL~~~~ 176 (429)
T cd00256 104 EPFFN-LLNRQDQFIVHMSFSILAKLACFGLAK-----MEGSDLDYYFNWLKEQLNNI-TNNDYVQTAARCLQMLLRVDE 176 (429)
T ss_pred HHHHH-HHcCCchhHHHHHHHHHHHHHhcCccc-----cchhHHHHHHHHHHHHhhcc-CCcchHHHHHHHHHHHhCCch
Confidence 35555 556778899999999999998764321 11112233555666666543 345678899999999988999
Q ss_pred HHHHHhhCHHHHHHHHHHhhcC--CCCcchhhhHHHHhcc
Q psy14464 96 VKEALIEDKPALAALVDVASSG--KPACTYGVVTTLVNLC 133 (175)
Q Consensus 96 VKE~L~~D~~~Lk~L~~llk~~--d~s~~YG~lsIl~NLT 133 (175)
+|..+.+.. .++.|+.+++.. +-.++|-++-++-=||
T Consensus 177 ~R~~f~~~~-~v~~L~~~L~~~~~~~Ql~Y~~ll~lWlLS 215 (429)
T cd00256 177 YRFAFVLAD-GVPTLVKLLSNATLGFQLQYQSIFCIWLLT 215 (429)
T ss_pred HHHHHHHcc-CHHHHHHHHhhccccHHHHHHHHHHHHHHh
Confidence 999887666 899999999753 5577888888888777
No 24
>KOG0166|consensus
Probab=68.15 E-value=51 Score=31.79 Aligned_cols=113 Identities=19% Similarity=0.198 Sum_probs=79.9
Q ss_pred HHHHHHHh-cCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHH
Q psy14464 17 DILKRLYM-SKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAE 95 (175)
Q Consensus 17 ~~Lk~l~~-s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~ 95 (175)
+-|-..+. ..++.++.-|+=+|..+.+- .++....--..| -+-.|..+|.++ +.+.++.||=.|+=.--|..
T Consensus 112 ~~lV~~l~~~~~~~lq~eAAWaLTnIAsg-tse~T~~vv~ag----avp~fi~Ll~s~--~~~v~eQavWALgNIagds~ 184 (514)
T KOG0166|consen 112 PRLVEFLSRDDNPTLQFEAAWALTNIASG-TSEQTKVVVDAG----AVPIFIQLLSSP--SADVREQAVWALGNIAGDSP 184 (514)
T ss_pred HHHHHHHccCCChhHHHHHHHHHHHHhcC-chhhccccccCC----chHHHHHHhcCC--cHHHHHHHHHHHhccccCCh
Confidence 44444454 56689999999999999873 333222211112 123478888654 56788888888888888877
Q ss_pred HHHHHhhCHHHHHHHHHHhhcCCC-CcchhhhHHHHhcccCCC
Q psy14464 96 VKEALIEDKPALAALVDVASSGKP-ACTYGVVTTLVNLCNAYD 137 (175)
Q Consensus 96 VKE~L~~D~~~Lk~L~~llk~~d~-s~~YG~lsIl~NLTn~Y~ 137 (175)
.=-.++-+...+.-|+.++...++ +.+=-+.-+++||| +..
T Consensus 185 ~~Rd~vl~~g~l~pLl~~l~~~~~~~~lRn~tW~LsNlc-rgk 226 (514)
T KOG0166|consen 185 DCRDYVLSCGALDPLLRLLNKSDKLSMLRNATWTLSNLC-RGK 226 (514)
T ss_pred HHHHHHHhhcchHHHHHHhccccchHHHHHHHHHHHHHH-cCC
Confidence 777788899999999999976654 66677788999999 443
No 25
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=66.86 E-value=23 Score=35.55 Aligned_cols=57 Identities=26% Similarity=0.226 Sum_probs=42.1
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhh
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYL 90 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYl 90 (175)
+++|...++..++.||..|+-.|.+++.. .....+...|.+ .+.+.|..|+++|..+
T Consensus 623 ~~~L~~~L~D~d~~VR~~Av~~L~~~~~~----------------~~~~~L~~aL~D--~d~~VR~~Aa~aL~~l 679 (897)
T PRK13800 623 VAELAPYLADPDPGVRRTAVAVLTETTPP----------------GFGPALVAALGD--GAAAVRRAAAEGLREL 679 (897)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHhhhcch----------------hHHHHHHHHHcC--CCHHHHHHHHHHHHHH
Confidence 47888888899999999999999998742 234455566633 3667788888888766
No 26
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=63.03 E-value=19 Score=33.74 Aligned_cols=116 Identities=19% Similarity=0.118 Sum_probs=79.6
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHH
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAE 95 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~ 95 (175)
..-|+.+..+.++.+|.|..=.++++.+.+..-. .......+-+.+-+.|.+ .|.=.+..|+|=|.-+...+.
T Consensus 162 ~~~L~~l~~~~~~~vR~Rv~el~v~i~~~S~~~~-----~~~~~sgll~~ll~eL~~--dDiLvqlnalell~~La~~~~ 234 (503)
T PF10508_consen 162 LSKLKSLMSQSSDIVRCRVYELLVEIASHSPEAA-----EAVVNSGLLDLLLKELDS--DDILVQLNALELLSELAETPH 234 (503)
T ss_pred HHHHHHHHhccCHHHHHHHHHHHHHHHhcCHHHH-----HHHHhccHHHHHHHHhcC--ccHHHHHHHHHHHHHHHcChh
Confidence 5667777777788899998888899976522110 000001244444444533 354467799999999999999
Q ss_pred HHHHHhhCHHHHHHHHHHhhcC-CCC-----cchhhhHHHHhcccCCCCCC
Q psy14464 96 VKEALIEDKPALAALVDVASSG-KPA-----CTYGVVTTLVNLCNAYDKQE 140 (175)
Q Consensus 96 VKE~L~~D~~~Lk~L~~llk~~-d~s-----~~YG~lsIl~NLTn~Y~~~e 140 (175)
--+.|.. ...++.|++++... ..+ .++|.+--|.|+. .+.|++
T Consensus 235 g~~yL~~-~gi~~~L~~~l~~~~~dp~~~~~~l~g~~~f~g~la-~~~~~~ 283 (503)
T PF10508_consen 235 GLQYLEQ-QGIFDKLSNLLQDSEEDPRLSSLLLPGRMKFFGNLA-RVSPQE 283 (503)
T ss_pred HHHHHHh-CCHHHHHHHHHhccccCCcccchhhhhHHHHHHHHH-hcChHH
Confidence 9988855 58999999999654 233 5688889999999 655543
No 27
>KOG1077|consensus
Probab=61.44 E-value=31 Score=34.99 Aligned_cols=112 Identities=30% Similarity=0.416 Sum_probs=72.7
Q ss_pred HHHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhc---CCCCCchhhhH---
Q psy14464 9 TSIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLV---NPARDPDIRRW--- 82 (175)
Q Consensus 9 ~~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~---~~~~~~~~~~~--- 82 (175)
-.++.+.++.|-.+++....+||.+|+=.+|+|-+++. +++ -++.-..++. +...|...|+.
T Consensus 324 ~~ll~~~~~~Lg~fls~rE~NiRYLaLEsm~~L~ss~~-----------s~d-avK~h~d~Ii~sLkterDvSirrravD 391 (938)
T KOG1077|consen 324 PELLSRAVNQLGQFLSHRETNIRYLALESMCKLASSEF-----------SID-AVKKHQDTIINSLKTERDVSIRRRAVD 391 (938)
T ss_pred HHHHHHHHHHHHHHhhcccccchhhhHHHHHHHHhccc-----------hHH-HHHHHHHHHHHHhccccchHHHHHHHH
Confidence 45788999999999999999999999999999998732 111 1111122211 11223334433
Q ss_pred --------------HHhhhhhh-ccCHHHHHHHhhCHHHHH---------------HHHHHhhc-CCCCcchhhhHHHHh
Q psy14464 83 --------------AAEGLAYL-TLDAEVKEALIEDKPALA---------------ALVDVASS-GKPACTYGVVTTLVN 131 (175)
Q Consensus 83 --------------aiEGLAYl-SL~p~VKE~L~~D~~~Lk---------------~L~~llk~-~d~s~~YG~lsIl~N 131 (175)
.-|=|.|| |.++.+||+++--.+.|- .|+.++-+ .+.-+-|-++-|++|
T Consensus 392 LLY~mcD~~Nak~IV~elLqYL~tAd~sireeivlKvAILaEKyAtDy~WyVdviLqLiriagd~vsdeVW~RvvQiVvN 471 (938)
T KOG1077|consen 392 LLYAMCDVSNAKQIVAELLQYLETADYSIREEIVLKVAILAEKYATDYSWYVDVILQLIRIAGDYVSDEVWYRVVQIVVN 471 (938)
T ss_pred HHHHHhchhhHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHhcCCcchhHHHHHHHHHHhcccccHHHHHHhheeEec
Confidence 34567887 689999999987655443 23333322 366677888888777
Q ss_pred c
Q psy14464 132 L 132 (175)
Q Consensus 132 L 132 (175)
=
T Consensus 472 n 472 (938)
T KOG1077|consen 472 N 472 (938)
T ss_pred c
Confidence 4
No 28
>KOG2062|consensus
Probab=60.13 E-value=24 Score=35.86 Aligned_cols=50 Identities=30% Similarity=0.502 Sum_probs=42.0
Q ss_pred CCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcC-CCCcchhhhHHHHhcc
Q psy14464 75 RDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSG-KPACTYGVVTTLVNLC 133 (175)
Q Consensus 75 ~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~-d~s~~YG~lsIl~NLT 133 (175)
.+.|.|+.||.+|-|. |..||+.+-..++++..+ +..+-||.++.+.=.|
T Consensus 567 ~nDDVrRaAVialGFV---------l~~dp~~~~s~V~lLses~N~HVRyGaA~ALGIaC 617 (929)
T KOG2062|consen 567 VNDDVRRAAVIALGFV---------LFRDPEQLPSTVSLLSESYNPHVRYGAAMALGIAC 617 (929)
T ss_pred cchHHHHHHHHHheee---------EecChhhchHHHHHHhhhcChhhhhhHHHHHhhhh
Confidence 3557899999999997 567999999999999766 8899999998876554
No 29
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=59.33 E-value=40 Score=33.55 Aligned_cols=117 Identities=21% Similarity=0.275 Sum_probs=81.3
Q ss_pred HHHHHhhH-HHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCC-CCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhh
Q psy14464 9 TSIIKQGV-DILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRP-FADGSTRKLAEACRRFLVNPARDPDIRRWAAEG 86 (175)
Q Consensus 9 ~~i~~~g~-~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~-~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEG 86 (175)
..|...|+ +-|..++.+++......|+=.|+-|..-... +.+ -..|-...|+ .+|.+ ++.+..++--
T Consensus 325 ~~m~~~giV~kL~kLl~s~~~~l~~~aLrlL~NLSfd~~~---R~~mV~~GlIPkLv----~LL~d----~~~~~val~i 393 (708)
T PF05804_consen 325 DEMAESGIVEKLLKLLPSENEDLVNVALRLLFNLSFDPEL---RSQMVSLGLIPKLV----ELLKD----PNFREVALKI 393 (708)
T ss_pred HHHHHcCCHHHHHHHhcCCCHHHHHHHHHHHHHhCcCHHH---HHHHHHCCCcHHHH----HHhCC----CchHHHHHHH
Confidence 34445544 7788888888888888888888888763211 111 1122234444 44532 3456778888
Q ss_pred hhhhccCHHHHHHHhhCHHHHHHHHHHhhc-CCCCcchhhhHHHHhcccCCCC
Q psy14464 87 LAYLTLDAEVKEALIEDKPALAALVDVASS-GKPACTYGVVTTLVNLCNAYDK 138 (175)
Q Consensus 87 LAYlSL~p~VKE~L~~D~~~Lk~L~~llk~-~d~s~~YG~lsIl~NLTn~Y~~ 138 (175)
|.=+|.+.+.|..+..- ..++.|++++-+ .+..+.-.++..+.||+ .+++
T Consensus 394 Ly~LS~dd~~r~~f~~T-dcIp~L~~~Ll~~~~~~v~~eliaL~iNLa-~~~r 444 (708)
T PF05804_consen 394 LYNLSMDDEARSMFAYT-DCIPQLMQMLLENSEEEVQLELIALLINLA-LNKR 444 (708)
T ss_pred HHHhccCHhhHHHHhhc-chHHHHHHHHHhCCCccccHHHHHHHHHHh-cCHH
Confidence 99999999999988765 478999988754 46666778899999999 7666
No 30
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=57.59 E-value=12 Score=21.71 Aligned_cols=38 Identities=29% Similarity=0.350 Sum_probs=28.4
Q ss_pred HHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 95 EVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 95 ~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
+-++.+++ ...++.|++++++.+..++.-.+..|.||+
T Consensus 3 ~~~~~i~~-~g~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 3 EQKQAVVD-AGGLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred HHHHHHHH-CCCHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 34444443 347788889888778888888889999987
No 31
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=55.79 E-value=1.3e+02 Score=25.79 Aligned_cols=107 Identities=22% Similarity=0.291 Sum_probs=68.0
Q ss_pred HHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhh--hhhccCH
Q psy14464 17 DILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGL--AYLTLDA 94 (175)
Q Consensus 17 ~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGL--AYlSL~p 94 (175)
+++.+.+.+.+++++..|+-|+|||--.|--.. ...-|+..+..|.--...+....+...--. +|.+..+
T Consensus 117 ~~l~~~l~~~~~~~~~~a~EGl~KLlL~~~i~~--------~~~vL~~Lll~yF~p~t~~~~~LrQ~L~~Ffp~y~~s~~ 188 (298)
T PF12719_consen 117 KILTKFLDSENPELQAIAVEGLCKLLLSGRISD--------PPKVLSRLLLLYFNPSTEDNQRLRQCLSVFFPVYASSSP 188 (298)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHhcCCCCc--------HHHHHHHHHHHHcCcccCCcHHHHHHHHHHHHHHHcCCH
Confidence 566666678899999999999999987643221 245666667777733322222222222211 4666777
Q ss_pred HHHHHHhhCHHHHHHHHHHhhcCCC--C-----cchhhhHHHHhcc
Q psy14464 95 EVKEALIEDKPALAALVDVASSGKP--A-----CTYGVVTTLVNLC 133 (175)
Q Consensus 95 ~VKE~L~~D~~~Lk~L~~llk~~d~--s-----~~YG~lsIl~NLT 133 (175)
+-++.+. +.|+..+-.+++..+. + -...++..++.+|
T Consensus 189 ~~Q~~l~--~~f~~~l~~~~~~~~~~~~~~~~v~~~~v~~~lv~lt 232 (298)
T PF12719_consen 189 ENQERLA--EAFLPTLRTLSNAPDELDSPLAMVSPSQVASFLVDLT 232 (298)
T ss_pred HHHHHHH--HHHHHHHHHHHhCcccccCchhhCCHHHHHHHHHHHC
Confidence 7777775 6777777777764322 1 2567888899999
No 32
>PF05947 DUF879: Bacterial protein of unknown function (DUF879); InterPro: IPR010272 This entry represents a protein family associated with type VI secretion in a number of pathogenic bacteria [, ]. Mutation is associated with impaired virulence, such as impaired infection of plants by Rhizobium leguminosarum.
Probab=55.49 E-value=22 Score=34.27 Aligned_cols=40 Identities=28% Similarity=0.447 Sum_probs=33.0
Q ss_pred CCCCchhhhHHHhhhhhhccCHHHHHHHhhC-HHHHHHHHHHhh
Q psy14464 73 PARDPDIRRWAAEGLAYLTLDAEVKEALIED-KPALAALVDVAS 115 (175)
Q Consensus 73 ~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D-~~~Lk~L~~llk 115 (175)
...|+ .++.-+||.|||| +.+++.|-.| |++.++|++++-
T Consensus 36 ~~~DP-~VERLlEgfAfLt--Ari~~kLDd~~Pe~t~~LL~~L~ 76 (602)
T PF05947_consen 36 ESEDP-HVERLLEGFAFLT--ARIRQKLDDEFPELTESLLEVLY 76 (602)
T ss_pred CCCCc-hHHHHHHHHHHHH--HHHHHHHhccCHHHHHHHHHhhh
Confidence 33454 4789999999998 7788889777 999999999883
No 33
>TIGR03359 VI_chp_6 type VI secretion protein, VC_A0110 family. This protein family is associated with type VI secretion in a number of pathogenic bacteria. Mutation is associated with impaired virulence, such as impaired infection of plants by Rhizobium leguminosarum.
Probab=54.90 E-value=24 Score=33.93 Aligned_cols=39 Identities=33% Similarity=0.503 Sum_probs=32.7
Q ss_pred CCCCchhhhHHHhhhhhhccCHHHHHHHhhC-HHHHHHHHHHh
Q psy14464 73 PARDPDIRRWAAEGLAYLTLDAEVKEALIED-KPALAALVDVA 114 (175)
Q Consensus 73 ~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D-~~~Lk~L~~ll 114 (175)
...|+ .++.-+||.|||| +.+++.|-.+ |+|.++|++++
T Consensus 36 ~~~DP-~VERLlEgfAfLt--ARi~~kLDd~~Pe~t~~Ll~~L 75 (598)
T TIGR03359 36 ESEDP-HVERLLEGFAFLT--ARIRQKLDDDFPELTEALLEVL 75 (598)
T ss_pred CCCCc-hHHHHHHHHHHHH--HHHHhHhhcccHHHHHHHHHhh
Confidence 34454 4789999999998 7889999777 99999999998
No 34
>PF15596 Imm34: Immunity protein 34
Probab=53.53 E-value=16 Score=28.30 Aligned_cols=28 Identities=32% Similarity=0.291 Sum_probs=24.1
Q ss_pred hhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhh
Q psy14464 80 RRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVAS 115 (175)
Q Consensus 80 ~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk 115 (175)
+..|+|||+|++-.- .++|++.|++++.
T Consensus 32 ~~~aelgl~~l~~s~--------s~~~~~~LvnL~r 59 (110)
T PF15596_consen 32 AVIAELGLAALGPSV--------SDAALDHLVNLMR 59 (110)
T ss_pred hHHHHHHHHHhCccc--------CHHHHHHHHHHHh
Confidence 457999999998654 8999999999996
No 35
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=52.56 E-value=28 Score=20.03 Aligned_cols=27 Identities=22% Similarity=0.241 Sum_probs=21.3
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhc
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLG 42 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~ 42 (175)
++.+.++++..++.||.-|+-+|..+.
T Consensus 2 lp~l~~~l~D~~~~VR~~a~~~l~~i~ 28 (31)
T PF02985_consen 2 LPILLQLLNDPSPEVRQAAAECLGAIA 28 (31)
T ss_dssp HHHHHHHHT-SSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHcCCCCHHHHHHHHHHHHHHH
Confidence 466778888899999999998887764
No 36
>KOG0166|consensus
Probab=51.16 E-value=96 Score=29.96 Aligned_cols=58 Identities=24% Similarity=0.261 Sum_probs=47.2
Q ss_pred CHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcC
Q psy14464 58 STRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSG 117 (175)
Q Consensus 58 s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~ 117 (175)
.+..+-..+-..|-+ .|.+....|.=.|+|||=-|.-+-+++-|...+..|++++...
T Consensus 234 ~v~~iLp~L~~ll~~--~D~~Vl~Da~WAlsyLsdg~ne~iq~vi~~gvv~~LV~lL~~~ 291 (514)
T KOG0166|consen 234 VVAPILPALLRLLHS--TDEEVLTDACWALSYLTDGSNEKIQMVIDAGVVPRLVDLLGHS 291 (514)
T ss_pred HHHHHHHHHHHHHhc--CCHHHHHHHHHHHHHHhcCChHHHHHHHHccchHHHHHHHcCC
Confidence 345555566666644 4888888999999999999999999999999999999999654
No 37
>KOG1062|consensus
Probab=46.03 E-value=1e+02 Score=31.56 Aligned_cols=77 Identities=27% Similarity=0.329 Sum_probs=50.0
Q ss_pred HHHHHHHHh--cCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcC--CCCCchhhhHHHhhhhhhc
Q psy14464 16 VDILKRLYM--SKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVN--PARDPDIRRWAAEGLAYLT 91 (175)
Q Consensus 16 ~~~Lk~l~~--s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~--~~~~~~~~~~aiEGLAYlS 91 (175)
++.|.+.+. +.+..++..|+++|.||.+.-++ + +.++++++.. .+.+-+.++.|+|==+.
T Consensus 515 vd~l~~v~~~~~s~~~tk~yal~Al~KLSsr~~s----------~----~~ri~~lI~~~~~s~~~elQQRa~E~~~l-- 578 (866)
T KOG1062|consen 515 VDKLEKVLMSHSSDSTTKGYALTALLKLSSRFHS----------S----SERIKQLISSYKSSLDTELQQRAVEYNAL-- 578 (866)
T ss_pred HHHHHHHHHhccchHHHHHHHHHHHHHHHhhccc----------c----HHHHHHHHHHhcccccHHHHHHHHHHHHH--
Confidence 455666665 45578999999999999985221 1 4445554432 34577889999995443
Q ss_pred cCHHHHHHHhhCHHHHHHHHHHhhc
Q psy14464 92 LDAEVKEALIEDKPALAALVDVASS 116 (175)
Q Consensus 92 L~p~VKE~L~~D~~~Lk~L~~llk~ 116 (175)
+..|...-+.|++..+.
T Consensus 579 --------~~~~~~lr~siLe~mp~ 595 (866)
T KOG1062|consen 579 --------FAKDKHLRKSILERMPS 595 (866)
T ss_pred --------HHHHHHHHHHhcccCcc
Confidence 44555666666666654
No 38
>PF12031 DUF3518: Domain of unknown function (DUF3518); InterPro: IPR021906 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 260 amino acids in length. This domain is found associated with PF01388 from PFAM.
Probab=45.74 E-value=45 Score=29.44 Aligned_cols=88 Identities=23% Similarity=0.255 Sum_probs=59.8
Q ss_pred hHHHHHHHHHHhcCCCC-CCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHH-HHHHHhhCHHHH
Q psy14464 30 IRVRALVGLCKLGSMGG-SDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAE-VKEALIEDKPAL 107 (175)
Q Consensus 30 ir~~A~V~L~KL~~~~~-~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~-VKE~L~~D~~~L 107 (175)
=|-+|+=+||||.-.+. -|.=..+..-...+.|...+.++|.. .++.-.|+.||==|+|+..--+ +=-.++..+..+
T Consensus 140 PqrlaLEaLcKLsV~e~NVDliLaTpp~sRlE~l~~~L~r~l~~-~e~~v~REfAvvlL~~La~~~~~~~r~iA~q~~~i 218 (257)
T PF12031_consen 140 PQRLALEALCKLSVIENNVDLILATPPFSRLERLFHTLVRLLGM-REDQVCREFAVVLLSNLAQGDEAAARAIAMQKPCI 218 (257)
T ss_pred HHHHHHHHHHHhheeccCcceeeeCCCHHHHHHHHHHHHHHhcc-ccchhHHHHHHHHHHHHhcccHHHHHHHHHhhchH
Confidence 46689999999996532 12211122223478999999999965 4677899999999999976433 333466667777
Q ss_pred HHHHHHhhcCC
Q psy14464 108 AALVDVASSGK 118 (175)
Q Consensus 108 k~L~~llk~~d 118 (175)
-.|+..+..++
T Consensus 219 ~~Li~FiE~a~ 229 (257)
T PF12031_consen 219 SHLIAFIEDAE 229 (257)
T ss_pred HHHHHHHHHHH
Confidence 77777776543
No 39
>PF03408 Foamy_virus_ENV: Foamy virus envelope protein ; InterPro: IPR005070 Expression of the envelope (Env) glycoprotein is essential for viral particle egress. This feature is unique to the Spumavirinae, a subclass of the Retroviridae. ; GO: 0019031 viral envelope
Probab=43.76 E-value=10 Score=38.59 Aligned_cols=44 Identities=23% Similarity=0.268 Sum_probs=33.1
Q ss_pred HHHhcccCCCCC----CcchHHHHHHhhh-cCCCCCCCCCCChHHHHHHh
Q psy14464 128 TLVNLCNAYDKQ----EIIPEMIELAKFA-KHHIPEDHELDDPDFVTKRL 172 (175)
Q Consensus 128 Il~NLTn~Y~~~----e~~~~l~~L~~ya-~~~~pe~~pld~d~~V~~R~ 172 (175)
--+||+ .+.+. -..+||.+|.+-- +=.+|=.||+|.++|+.+||
T Consensus 173 ~~~nl~-qe~K~lL~~mINEEm~~L~~vmldFeLPlgDP~tQ~qYIh~rC 221 (981)
T PF03408_consen 173 NSANLT-QETKVLLTDMINEEMQSLQNVMLDFELPLGDPRTQKQYIHKRC 221 (981)
T ss_pred hhhhhh-HHHHHHHHHHHHHHHHHHhhceecccccCCCCccHHHHHHHHH
Confidence 346777 54442 1468899998766 55578789999999999999
No 40
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=42.80 E-value=57 Score=20.51 Aligned_cols=34 Identities=18% Similarity=0.259 Sum_probs=25.5
Q ss_pred HHHHHHhhHHHHHHHHhcCCchhHHHHHHHHHHh
Q psy14464 8 ITSIIKQGVDILKRLYMSKNENIRVRALVGLCKL 41 (175)
Q Consensus 8 ~~~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL 41 (175)
........++.|-.++++.++.||..|+-+|..|
T Consensus 22 ~~~~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 22 LQPYLPELLPALIPLLQDDDDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence 3445667778888888888888998888887643
No 41
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=42.18 E-value=1.3e+02 Score=28.10 Aligned_cols=115 Identities=21% Similarity=0.224 Sum_probs=78.7
Q ss_pred HHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhh
Q psy14464 11 IIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYL 90 (175)
Q Consensus 11 i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYl 90 (175)
+..+..+.|...+++.++.||.+|+=.|.++-.... ...... ...++...+...|..+ |.+..+.|+.-|.-+
T Consensus 74 l~~~~~~~L~~gL~h~~~~Vr~l~l~~l~~~~~~~~--~~~~~~---~~~~l~~~i~~~L~~~--d~~Va~~A~~~L~~l 146 (503)
T PF10508_consen 74 LLPQYQPFLQRGLTHPSPKVRRLALKQLGRIARHSE--GAAQLL---VDNELLPLIIQCLRDP--DLSVAKAAIKALKKL 146 (503)
T ss_pred HHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHhcCCH--HHHHHh---cCccHHHHHHHHHcCC--cHHHHHHHHHHHHHH
Confidence 466777889999999999999999888888753211 100011 1235666666667543 666778889888888
Q ss_pred ccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 91 TLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 91 SL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
+-.+..=+.|. ++..+..|.+++...+..+.+=++.+++++.
T Consensus 147 ~~~~~~~~~l~-~~~~~~~L~~l~~~~~~~vR~Rv~el~v~i~ 188 (503)
T PF10508_consen 147 ASHPEGLEQLF-DSNLLSKLKSLMSQSSDIVRCRVYELLVEIA 188 (503)
T ss_pred hCCchhHHHHh-CcchHHHHHHHHhccCHHHHHHHHHHHHHHH
Confidence 76665555443 4455777888887666667788888888887
No 42
>PTZ00429 beta-adaptin; Provisional
Probab=39.50 E-value=2.6e+02 Score=28.06 Aligned_cols=35 Identities=26% Similarity=0.317 Sum_probs=28.5
Q ss_pred HHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCC
Q psy14464 10 SIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSM 44 (175)
Q Consensus 10 ~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~ 44 (175)
.|+..-+.-++..+...++-||=-|++++.|+-..
T Consensus 136 ~i~e~l~~~lkk~L~D~~pYVRKtAalai~Kly~~ 170 (746)
T PTZ00429 136 SVLEYTLEPLRRAVADPDPYVRKTAAMGLGKLFHD 170 (746)
T ss_pred HHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHhh
Confidence 45555567778888888999999999999999764
No 43
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=37.98 E-value=1.4e+02 Score=26.85 Aligned_cols=112 Identities=14% Similarity=0.169 Sum_probs=69.1
Q ss_pred HHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHH-HHHHHHHhhcCCCCCchhhhHHHhhhh
Q psy14464 10 SIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRK-LAEACRRFLVNPARDPDIRRWAAEGLA 88 (175)
Q Consensus 10 ~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~-La~~~~~~L~~~~~~~~~~~~aiEGLA 88 (175)
.+...-.+-+..++.+.++-||--|+.+++|+-...+. .... +.+.+.++|.+ .+...+..|+--|.
T Consensus 110 ~~~~~l~~~v~~ll~~~~~~VRk~A~~~l~~i~~~~p~----------~~~~~~~~~l~~lL~d--~~~~V~~~a~~~l~ 177 (526)
T PF01602_consen 110 EMAEPLIPDVIKLLSDPSPYVRKKAALALLKIYRKDPD----------LVEDELIPKLKQLLSD--KDPSVVSAALSLLS 177 (526)
T ss_dssp HHHHHHHHHHHHHHHSSSHHHHHHHHHHHHHHHHHCHC----------CHHGGHHHHHHHHTTH--SSHHHHHHHHHHHH
T ss_pred chhhHHHHHHHHHhcCCchHHHHHHHHHHHHHhccCHH----------HHHHHHHHHHhhhccC--CcchhHHHHHHHHH
Confidence 45556667888888999999999999999999865321 2233 78888998843 35566777776665
Q ss_pred hhccCHHHHHHHhhCHHHHHHHHHHhhcC-CCCcchhhhHHHHhcccCCCCCC
Q psy14464 89 YLTLDAEVKEALIEDKPALAALVDVASSG-KPACTYGVVTTLVNLCNAYDKQE 140 (175)
Q Consensus 89 YlSL~p~VKE~L~~D~~~Lk~L~~llk~~-d~s~~YG~lsIl~NLTn~Y~~~e 140 (175)
.+.-+++.-. .++..++..+... ...--++...++--|+ .|-+.+
T Consensus 178 ~i~~~~~~~~------~~~~~~~~~L~~~l~~~~~~~q~~il~~l~-~~~~~~ 223 (526)
T PF01602_consen 178 EIKCNDDSYK------SLIPKLIRILCQLLSDPDPWLQIKILRLLR-RYAPME 223 (526)
T ss_dssp HHHCTHHHHT------THHHHHHHHHHHHHTCCSHHHHHHHHHHHT-TSTSSS
T ss_pred HHccCcchhh------hhHHHHHHHhhhcccccchHHHHHHHHHHH-hcccCC
Confidence 5512222211 3344444444321 3344466666666666 565643
No 44
>KOG4596|consensus
Probab=35.04 E-value=34 Score=36.72 Aligned_cols=76 Identities=14% Similarity=0.128 Sum_probs=55.4
Q ss_pred HhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCC--CCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhh
Q psy14464 13 KQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDA--SIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYL 90 (175)
Q Consensus 13 ~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~--~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYl 90 (175)
+|-++||.++++|.....+++-.-+||-.--+...+. +.....++-.-.|...+++++.+...+ .+-.+|-|+|.
T Consensus 841 ~qSmpwl~dlvqsseg~~~iLpvqClcefLlfda~~i~~s~~~d~~~~~a~l~~~Lk~l~~~~~~~---~qitce~ldyi 917 (1936)
T KOG4596|consen 841 EQSMPWLRDLVQSSEGTAKILPVQCLCEFLLFDAQRIRDSDQNDTVVVKAVLYFYLKKLSKSIMKT---WQITCENLDYI 917 (1936)
T ss_pred ccccHHHHHHHhcCCCccccchHHHHHHHHhhhhhhhhcccccchHHHHHHHHHHHHHHhcccccc---ceeehHHHHHH
Confidence 4678999999999888899999999998765422111 222333445578889999999775443 46789999996
Q ss_pred c
Q psy14464 91 T 91 (175)
Q Consensus 91 S 91 (175)
-
T Consensus 918 f 918 (1936)
T KOG4596|consen 918 F 918 (1936)
T ss_pred H
Confidence 3
No 45
>COG4333 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.46 E-value=31 Score=28.35 Aligned_cols=68 Identities=19% Similarity=0.319 Sum_probs=44.5
Q ss_pred hhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchHHHHHHhhhcCCCCCCCCCCChH
Q psy14464 87 LAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPEMIELAKFAKHHIPEDHELDDPD 166 (175)
Q Consensus 87 LAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~l~~L~~ya~~~~pe~~pld~d~ 166 (175)
-++++|.|..-..+ .|...+..++.+++ |--||++ +++||- ++..... ++| .|.|+|=-.|.|-
T Consensus 39 ~~fi~lnPs~ada~-~Dd~Ti~rci~fA~----swgyGgv-y~~NLF-a~~~T~~-~~L--------~~~~dPigkdnD~ 102 (167)
T COG4333 39 TLFIGLNPSYADAE-KDDRTLSRCISFAK----SWGYGGV-YMANLF-AFVHTQR-HEL--------MKASDPIGKDNDS 102 (167)
T ss_pred eEEEEeccchhhhh-hcchHHHHHHHHHh----hcccCcE-Eeeehh-hhccCCH-HHH--------hcCCCCCCCCchH
Confidence 47899999999988 78888889999998 4457754 567776 4443221 111 2355555566666
Q ss_pred HHHH
Q psy14464 167 FVTK 170 (175)
Q Consensus 167 ~V~~ 170 (175)
|.-+
T Consensus 103 HL~~ 106 (167)
T COG4333 103 HLIR 106 (167)
T ss_pred HHHH
Confidence 6543
No 46
>KOG1061|consensus
Probab=34.04 E-value=1.9e+02 Score=29.24 Aligned_cols=116 Identities=17% Similarity=0.191 Sum_probs=73.4
Q ss_pred HHHHhhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhh
Q psy14464 10 SIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAY 89 (175)
Q Consensus 10 ~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAY 89 (175)
.|....++=|....+.+++-+|.-|+++.+|+..-++.- .-...+.+.++.++..+ ++..+..|+=.|.=
T Consensus 117 ~i~ey~~~Pl~~~l~d~~~yvRktaa~~vakl~~~~~~~--------~~~~gl~~~L~~ll~D~--~p~VVAnAlaaL~e 186 (734)
T KOG1061|consen 117 KITEYLCDPLLKCLKDDDPYVRKTAAVCVAKLFDIDPDL--------VEDSGLVDALKDLLSDS--NPMVVANALAALSE 186 (734)
T ss_pred HHHHHHHHHHHHhccCCChhHHHHHHHHHHHhhcCChhh--------ccccchhHHHHHHhcCC--CchHHHHHHHHHHH
Confidence 355666777888888999999999999999999753311 11235777888888632 44455555544332
Q ss_pred hc-cCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCC
Q psy14464 90 LT-LDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQE 140 (175)
Q Consensus 90 lS-L~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e 140 (175)
+. .+|+ ...+.-...++..|+..+.. -..+|-+.|+-+|+ .|.|+.
T Consensus 187 I~e~~~~-~~~~~l~~~~~~~lL~al~e---c~EW~qi~IL~~l~-~y~p~d 233 (734)
T KOG1061|consen 187 IHESHPS-VNLLELNPQLINKLLEALNE---CTEWGQIFILDCLA-EYVPKD 233 (734)
T ss_pred HHHhCCC-CCcccccHHHHHHHHHHHHH---hhhhhHHHHHHHHH-hcCCCC
Confidence 11 1111 12233445566666666553 33499999999999 777753
No 47
>KOG1062|consensus
Probab=32.53 E-value=1.6e+02 Score=30.27 Aligned_cols=60 Identities=23% Similarity=0.300 Sum_probs=29.3
Q ss_pred hhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcC--CCCCchhhhHHHhhh
Q psy14464 14 QGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVN--PARDPDIRRWAAEGL 87 (175)
Q Consensus 14 ~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~--~~~~~~~~~~aiEGL 87 (175)
-|+++|.+++.|.+-++|..|+=.|-|.-.+. ...+.+=|..+.. ...|...++.|.|=+
T Consensus 313 lainiLgkFL~n~d~NirYvaLn~L~r~V~~d--------------~~avqrHr~tIleCL~DpD~SIkrralELs 374 (866)
T KOG1062|consen 313 LAINILGKFLLNRDNNIRYVALNMLLRVVQQD--------------PTAVQRHRSTILECLKDPDVSIKRRALELS 374 (866)
T ss_pred HHHHHHHHHhcCCccceeeeehhhHHhhhcCC--------------cHHHHHHHHHHHHHhcCCcHHHHHHHHHHH
Confidence 34555555555555555555555555544331 2445555554432 123444556666633
No 48
>PF06685 DUF1186: Protein of unknown function (DUF1186); InterPro: IPR010602 This family consists of several hypothetical bacterial proteins of around 250 residues in length and is found in several Chlamydia and Anabaena species. The function of this family is unknown.
Probab=31.90 E-value=78 Score=27.55 Aligned_cols=136 Identities=21% Similarity=0.327 Sum_probs=75.5
Q ss_pred CCchhHHHHHHHHHHhcCCCC-----------CCCCCCCCCCCCHHHHHH-----------HHHHhhcCCCCCchhhhHH
Q psy14464 26 KNENIRVRALVGLCKLGSMGG-----------SDASIRPFADGSTRKLAE-----------ACRRFLVNPARDPDIRRWA 83 (175)
Q Consensus 26 ~~d~ir~~A~V~L~KL~~~~~-----------~d~~~~~~~~~s~~~La~-----------~~~~~L~~~~~~~~~~~~a 83 (175)
.+....+-|...|+-++.... .+.--.-+.+..+++|.. .+++++.++.-+.=.|-.|
T Consensus 54 ~~~~~~~~a~~LLaq~re~~A~~~li~l~~~~~~~~~~l~GD~~tE~l~~ilasv~~G~~~~L~~li~~~~~~~yvR~aa 133 (249)
T PF06685_consen 54 EEYNLHFYALYLLAQFREERALPPLIRLFSQDDDFLEDLFGDFITEDLPRILASVGDGDIEPLKELIEDPDADEYVRMAA 133 (249)
T ss_pred cchHHHHHHHHHHHHHhhhhhHHHHHHHHcCCcchHHHHHcchhHhHHHHHHHHHhCCCHHHHHHHHhCCcHHHHHHHHH
Confidence 334456777777777775311 010000133444555554 4556666665555578899
Q ss_pred HhhhhhhccC-HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchHHHHHHhhhcCCCCCCCCC
Q psy14464 84 AEGLAYLTLD-AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPEMIELAKFAKHHIPEDHEL 162 (175)
Q Consensus 84 iEGLAYlSL~-p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~l~~L~~ya~~~~pe~~pl 162 (175)
+.+|+++..- |.=||.++ ..++.++.-.-..+.+.++|.+. .+++ .-.|.|..++++++ |...-++... .
T Consensus 134 ~~aL~~l~~~~~~~Re~vi---~~f~~ll~~~l~~~~~~~~~~Lv--~~~~-dL~~~EL~~~I~~~--f~~~lVd~~~-i 204 (249)
T PF06685_consen 134 ISALAFLVHEGPISREEVI---QYFRELLNYFLERNPSFLWGSLV--ADIC-DLYPEELLPEIRKA--FEDGLVDPSF-I 204 (249)
T ss_pred HHHHHHHHHcCCCCHHHHH---HHHHHHHHHHhccCchHHHHHHH--HHHH-hcCHHHhHHHHHHH--HHcCCCCccc-c
Confidence 9999999864 44477774 35566665422334455565544 6777 44555666666654 5543344222 2
Q ss_pred CChHHHHHH
Q psy14464 163 DDPDFVTKR 171 (175)
Q Consensus 163 d~d~~V~~R 171 (175)
+-+.|+..
T Consensus 205 -~~e~ve~~ 212 (249)
T PF06685_consen 205 -DLEDVEEA 212 (249)
T ss_pred -CHHHHHHH
Confidence 44555544
No 49
>KOG2759|consensus
Probab=31.02 E-value=3.9e+02 Score=25.47 Aligned_cols=54 Identities=20% Similarity=0.178 Sum_probs=41.1
Q ss_pred HHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcC-CCCcchhhhHHHHhcccCCCC
Q psy14464 83 AAEGLAYLTLDAEVKEALIEDKPALAALVDVASSG-KPACTYGVVTTLVNLCNAYDK 138 (175)
Q Consensus 83 aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~-d~s~~YG~lsIl~NLTn~Y~~ 138 (175)
+++-.=-||..|..=|.+ .--..++.|.++++.. ..-+.=-+++||+||. .-.|
T Consensus 220 sifciWlLtFn~~~ae~~-~~~~li~~L~~Ivk~~~KEKV~Rivlai~~Nll-~k~~ 274 (442)
T KOG2759|consen 220 SIFCIWLLTFNPHAAEKL-KRFDLIQDLSDIVKESTKEKVTRIVLAIFRNLL-DKGP 274 (442)
T ss_pred HHHHHHHhhcCHHHHHHH-hhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCc
Confidence 445555678888888877 6678899999999875 5556667899999999 4444
No 50
>PF03914 CBF: CBF/Mak21 family; InterPro: IPR005612 This domain is present in the CAATT-binding protein which is essential for growth and necessary for 60S ribosomal subunit biogenesis. Other proteins containing this domain stimulate transcription from the HSP70 promoter.
Probab=29.97 E-value=37 Score=26.73 Aligned_cols=35 Identities=9% Similarity=0.130 Sum_probs=29.7
Q ss_pred CHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCC
Q psy14464 103 DKPALAALVDVASSGKPACTYGVVTTLVNLCNAYD 137 (175)
Q Consensus 103 D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~ 137 (175)
=.+|+|+|..++-....+..-|++.++.||.++|+
T Consensus 64 vaAFiKRLl~~sl~~~~~~~~~~L~~i~~ll~~~p 98 (164)
T PF03914_consen 64 VAAFIKRLLQLSLHLPPSFALAILALIRKLLKRHP 98 (164)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHCH
Confidence 46899999999877788899999999999995544
No 51
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=29.61 E-value=2.3e+02 Score=20.73 Aligned_cols=64 Identities=25% Similarity=0.124 Sum_probs=44.4
Q ss_pred HHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhc
Q psy14464 18 ILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLT 91 (175)
Q Consensus 18 ~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlS 91 (175)
+++.+ .+....||+.|++-|.||=.... ........+...|...|.++ |+=.--.||=||+-+.
T Consensus 8 al~~L-~dp~~PvRa~gL~~L~~Li~~~~-------~~~~~~~~il~l~l~~L~d~--DsyVYL~aI~~L~~La 71 (92)
T PF10363_consen 8 ALSDL-NDPLPPVRAHGLVLLRKLIESKS-------EPVIDIPKILDLFLSQLKDE--DSYVYLNAIKGLAALA 71 (92)
T ss_pred HHHHc-cCCCcchHHHHHHHHHHHHHcCC-------cchhhHHHHHHHHHHHcCCC--CchHHHHHHHHHHHHH
Confidence 44444 34557899999999999987522 34456778888899988654 4334458898988653
No 52
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=29.16 E-value=1e+02 Score=24.35 Aligned_cols=105 Identities=24% Similarity=0.243 Sum_probs=65.5
Q ss_pred HHHHHHh-cCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhh-hhccCHH
Q psy14464 18 ILKRLYM-SKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLA-YLTLDAE 95 (175)
Q Consensus 18 ~Lk~l~~-s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLA-YlSL~p~ 95 (175)
+|+.+-. ..++.+|-.|+|+++|+-.. ..+...+.+.+.+...+.....+ + .-.++.-|+ -.-..|+
T Consensus 8 lL~~L~~~~~~~~~r~~a~v~l~k~l~~---------~~~~~~~~~~~~i~~~~~~~~~d-~-~i~~~~~l~~lfp~~~d 76 (157)
T PF11701_consen 8 LLTSLDMLRQPEEVRSHALVILSKLLDA---------AREEFKEKISDFIESLLDEGEMD-S-LIIAFSALTALFPGPPD 76 (157)
T ss_dssp HHHHHHCTTTSCCHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHCCHHCC-H-HHHHHHHHHHHCTTTHH
T ss_pred HHHHhcccCCCHhHHHHHHHHHHHHHHH---------hHHHHHHHHHHHHHHHHccccch-h-HHHHHHHHHHHhCCCHH
Confidence 5666665 77889999999999999421 11112344555555555433222 3 335555553 3567899
Q ss_pred HHHHHhhCHHHHHHHHHHhh--cCCCCcchhhhHHHHhcc
Q psy14464 96 VKEALIEDKPALAALVDVAS--SGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 96 VKE~L~~D~~~Lk~L~~llk--~~d~s~~YG~lsIl~NLT 133 (175)
|=-.|-..+.|++.|+.++. ..+..+.-.++=.|.--+
T Consensus 77 v~~~l~~~eg~~~~l~~~~~~~~~~~~~~~~~lell~aAc 116 (157)
T PF11701_consen 77 VGSELFLSEGFLESLLPLASRKSKDRKVQKAALELLSAAC 116 (157)
T ss_dssp HHHHHCCTTTHHHHHHHHHH-CTS-HHHHHHHHHHHHHHT
T ss_pred HHHHHHhhhhHHHHHHHHHhcccCCHHHHHHHHHHHHHHH
Confidence 99899988899988888887 445555555554444333
No 53
>KOG2171|consensus
Probab=28.87 E-value=3.5e+02 Score=28.63 Aligned_cols=111 Identities=20% Similarity=0.115 Sum_probs=66.5
Q ss_pred hHHHHHHHHh-cCCchhHHHHHHHHHHh----cCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHh---h
Q psy14464 15 GVDILKRLYM-SKNENIRVRALVGLCKL----GSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAE---G 86 (175)
Q Consensus 15 g~~~Lk~l~~-s~~d~ir~~A~V~L~KL----~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiE---G 86 (175)
-...|-++.. +.++.+|-+|+|=+=|+ |+.- .++....-.+..+.-++... .+..|+--.. -
T Consensus 37 ~l~~L~~i~~~~~~p~~Rq~aaVl~Rkl~~~~w~~l--------~~e~~~siks~lL~~~~~E~--~~~vr~k~~dviAe 106 (1075)
T KOG2171|consen 37 LLPALAHILATSADPQVRQLAAVLLRKLLTKHWSRL--------SAEVQQSIKSSLLEIIQSET--EPSVRHKLADVIAE 106 (1075)
T ss_pred hHHHHHHHHhcCCChHHHHHHHHHHHHHHHHHhhcC--------CHHHHHHHHHHHHHHHHhcc--chHHHHHHHHHHHH
Confidence 3467777776 78899999999976554 4431 12222223333444444332 2223322222 1
Q ss_pred hhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcch
Q psy14464 87 LAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIP 143 (175)
Q Consensus 87 LAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~ 143 (175)
+|=--++. .=|+.|+-|++..++.+.+..+.++-||.++. .+-+....+
T Consensus 107 ia~~~l~e-------~WPell~~L~q~~~S~~~~~rE~al~il~s~~-~~~~~~~~~ 155 (1075)
T KOG2171|consen 107 IARNDLPE-------KWPELLQFLFQSTKSPNPSLRESALLILSSLP-ETFGNTLQP 155 (1075)
T ss_pred HHHhcccc-------chHHHHHHHHHHhcCCCcchhHHHHHHHHhhh-hhhccccch
Confidence 22111221 24899999999999999999999999999998 665544333
No 54
>PF05004 IFRD: Interferon-related developmental regulator (IFRD); InterPro: IPR007701 Interferon-related developmental regulator (IFRD1) is the human homologue of the Rattus norvegicus early response protein PC4 and its murine homologue TIS7 []. The exact function of IFRD1 is unknown but it has been shown that PC4 is necessary for muscle differentiation and that it might have a role in signal transduction. This entry also contains IFRD2 and its murine equivalent SKMc15, which are highly expressed soon after gastrulation and in the hepatic primordium, suggesting an involvement in early hematopoiesis [].
Probab=27.50 E-value=4.3e+02 Score=23.26 Aligned_cols=142 Identities=23% Similarity=0.318 Sum_probs=74.8
Q ss_pred hhHHHHHHhhHHHHHHHHhcCCchhHHHHHH--HHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHh-----hcCCC----
Q psy14464 6 HSITSIIKQGVDILKRLYMSKNENIRVRALV--GLCKLGSMGGSDASIRPFADGSTRKLAEACRRF-----LVNPA---- 74 (175)
Q Consensus 6 ~~~~~i~~~g~~~Lk~l~~s~~d~ir~~A~V--~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~-----L~~~~---- 74 (175)
.-+..|...-.+.|+.+++.+...+++|+++ +|+=+--.++.+ ...++++.+.|..+ +..++
T Consensus 121 ~~~~ei~~~~~~~L~~~l~d~s~~~~~R~~~~~aLai~~fv~~~d-------~~~~~~~~~~le~if~~~~~~~~~~~~~ 193 (309)
T PF05004_consen 121 EDSEEIFEELKPVLKRILTDSSASPKARAACLEALAICTFVGGSD-------EEETEELMESLESIFLLSILKSDGNAPV 193 (309)
T ss_pred ccHHHHHHHHHHHHHHHHhCCccchHHHHHHHHHHHHHHHhhcCC-------hhHHHHHHHHHHHHHHHHhcCcCCCccc
Confidence 3456677788899999998666555554322 333222222222 12345444555522 21111
Q ss_pred ----CCchhhhHHHhhhhhh--ccCH-HHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHH---hcc----cCC--CC
Q psy14464 75 ----RDPDIRRWAAEGLAYL--TLDA-EVKEALIEDKPALAALVDVASSGKPACTYGVVTTLV---NLC----NAY--DK 138 (175)
Q Consensus 75 ----~~~~~~~~aiEGLAYl--SL~p-~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~---NLT----n~Y--~~ 138 (175)
.+...+--|++|-++| ++++ .+.+.+ ...+..|.+++.+.+..+.=...=.|+ -+. +.| +.
T Consensus 194 ~~~~~~~~l~~aAL~aW~lLlt~~~~~~~~~~~---~~~~~~l~~lL~s~d~~VRiAAGEaiAll~E~~~~~~~~~~~~~ 270 (309)
T PF05004_consen 194 VAAEDDAALVAAALSAWALLLTTLPDSKLEDLL---EEALPALSELLDSDDVDVRIAAGEAIALLYELARDHEEDFLYED 270 (309)
T ss_pred ccCCCccHHHHHHHHHHHHHHhcCCHHHHHHHH---HHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHhhcccccccccC
Confidence 1234667888887776 4555 245433 456788888887765555433322222 221 112 11
Q ss_pred -CCcchHHHHHHhhhcCCCC
Q psy14464 139 -QEIIPEMIELAKFAKHHIP 157 (175)
Q Consensus 139 -~e~~~~l~~L~~ya~~~~p 157 (175)
++....|++|+.-++++..
T Consensus 271 ~~~l~~~l~~La~dS~K~~s 290 (309)
T PF05004_consen 271 MEELLEQLRELATDSSKSRS 290 (309)
T ss_pred HHHHHHHHHHHHHhccCccc
Confidence 1234557777777766555
No 55
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=27.30 E-value=1.4e+02 Score=23.94 Aligned_cols=71 Identities=28% Similarity=0.321 Sum_probs=42.9
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCC-CCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhh
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFA-DGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYL 90 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~-~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYl 90 (175)
+..+....+++|+.+|..++-.|.-+-...+.+. .... ......+.+.+.+.+.+. +.+.|+.|-+.+..+
T Consensus 133 ~~~l~~~~~~Kn~~vR~~~~~~l~~~l~~~~~~~--~~l~~~~~~~~l~~~l~~~l~D~--~~~VR~~Ar~~~~~l 204 (228)
T PF12348_consen 133 LEILSQGLKSKNPQVREECAEWLAIILEKWGSDS--SVLQKSAFLKQLVKALVKLLSDA--DPEVREAARECLWAL 204 (228)
T ss_dssp HHHHHHHTT-S-HHHHHHHHHHHHHHHTT-------GGG--HHHHHHHHHHHHHHHTSS---HHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHHHccchH--hhhcccchHHHHHHHHHHHCCCC--CHHHHHHHHHHHHHH
Confidence 4666667778888888888877766644322110 0111 112467888888888654 677898888888766
No 56
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=27.27 E-value=2.4e+02 Score=24.05 Aligned_cols=103 Identities=21% Similarity=0.234 Sum_probs=66.2
Q ss_pred hhHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccC
Q psy14464 14 QGVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLD 93 (175)
Q Consensus 14 ~g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~ 93 (175)
+..+.|..+....++.||..|+.+|+.++... ..++..|-+++..+ +...|..+++.|.+.-..
T Consensus 180 ~~~~~l~~~l~~~~~~vr~~Aa~aL~~~~~~~--------------~~~~~~l~~~~~~~--~~~vr~~~~~~l~~~~~~ 243 (335)
T COG1413 180 EAIPLLIELLEDEDADVRRAAASALGQLGSEN--------------VEAADLLVKALSDE--SLEVRKAALLALGEIGDE 243 (335)
T ss_pred hhhHHHHHHHhCchHHHHHHHHHHHHHhhcch--------------hhHHHHHHHHhcCC--CHHHHHHHHHHhcccCcc
Confidence 34577888888888899999999999998742 57788888888654 556788899999988766
Q ss_pred HHHHH---HHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 94 AEVKE---ALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE---~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
..+.- .+..+...+......... .....+-.......++
T Consensus 244 ~~~~~l~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~ 285 (335)
T COG1413 244 EAVDALAKALEDEDVILALLAAAALG-ALDLAEAALPLLLLLI 285 (335)
T ss_pred hhHHHHHHHHhccchHHHHHHHHHhc-ccCchhhHHHHHHHhh
Confidence 54432 233333333333333332 2333444444444444
No 57
>PF00607 Gag_p24: gag gene protein p24 (core nucleocapsid protein); InterPro: IPR000721 The Gag protein from retroviruses, also known as p24, forms the inner protein layer of the nucleocapsid. This protein performs highly complex orchestrated tasks during the assembly, budding, maturation and infection stages of the viral replication cycle. During viral assembly, the proteins form membrane associations and self-associations that ultimately result in budding of an immature virion from the infected cell. Gag precursors also function during viral assembly to selectively bind and package two plus strands of genomic RNA. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro.; GO: 0016032 viral reproduction; PDB: 1BMX_A 1SJH_C 1SJE_C 1U57_A 1FGL_B 1G03_A 2XT1_A 2JO0_A 2L6E_A 2HJL_C ....
Probab=27.27 E-value=2.1e+02 Score=23.89 Aligned_cols=96 Identities=13% Similarity=0.152 Sum_probs=68.1
Q ss_pred hHHHHHHHHHHhcCC---CCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHHHHHHhhC--H
Q psy14464 30 IRVRALVGLCKLGSM---GGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKEALIED--K 104 (175)
Q Consensus 30 ir~~A~V~L~KL~~~---~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D--~ 104 (175)
++..|.-++.|+... ++.=...+|-...+-.|.+.+|-+-+.....+.+...|-++-|+|==-+|++|..|..= .
T Consensus 104 ~~~~ai~Aw~~l~~~~~~~~~~~~I~QGp~Epf~dFv~rl~~a~~~~~~~~~~~~~~~~~L~~eNAN~~C~~~~~~l~~~ 183 (206)
T PF00607_consen 104 IRAWAIKAWRKLPRKGSPGESFTKIKQGPKEPFADFVDRLQKAIRREQGENEVKNILIRQLAYENANPDCRRIIRPLGKD 183 (206)
T ss_dssp HHHHHHHHHHHHHHHHSSSSTGGGH-S-TTSHHHHHHHHHHHHHHCSSSTHHHHHHHHHHHHHHTS-HHHHHHHHHH-TT
T ss_pred HHHHHHHhhhcccccccccccHHHhhhccccchHHHHHHHHHHHhhcccccchhhHHHHHhhhccchHHHHHHHHccCCC
Confidence 444455566666543 21212455777778889999999999887777788899999999999999999988653 3
Q ss_pred HHHHHHHHHhhcCCCCcchhh
Q psy14464 105 PALAALVDVASSGKPACTYGV 125 (175)
Q Consensus 105 ~~Lk~L~~llk~~d~s~~YG~ 125 (175)
..|..++..+...+++..|+-
T Consensus 184 ~~lee~~~~C~~vg~~~~k~~ 204 (206)
T PF00607_consen 184 APLEEMIRACQGVGGPSHKAQ 204 (206)
T ss_dssp STHHHHHHHTTTTSSTTSSSB
T ss_pred CCHHHHHHHhhccCCHhhhhh
Confidence 467778887777777776653
No 58
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=26.37 E-value=2.5e+02 Score=23.86 Aligned_cols=76 Identities=25% Similarity=0.307 Sum_probs=38.9
Q ss_pred hHHHHHHHHhcCCchhHHHHHHHHHHhcCCC----------CCCCCCC-----CCCCCCHHHHHHHHHHhhcCCCCCchh
Q psy14464 15 GVDILKRLYMSKNENIRVRALVGLCKLGSMG----------GSDASIR-----PFADGSTRKLAEACRRFLVNPARDPDI 79 (175)
Q Consensus 15 g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~----------~~d~~~~-----~~~~~s~~~La~~~~~~L~~~~~~~~~ 79 (175)
.++.+-..|.+.+..+|..|++.|+.++... ..+...+ .........-...+.++|.+ ..+...
T Consensus 44 ~~~~~~~~l~~~~~~vr~~aa~~l~~~~~~~av~~l~~~l~d~~~~vr~~a~~aLg~~~~~~a~~~li~~l~~-d~~~~v 122 (335)
T COG1413 44 AADELLKLLEDEDLLVRLSAAVALGELGSEEAVPLLRELLSDEDPRVRDAAADALGELGDPEAVPPLVELLEN-DENEGV 122 (335)
T ss_pred hHHHHHHHHcCCCHHHHHHHHHHHhhhchHHHHHHHHHHhcCCCHHHHHHHHHHHHccCChhHHHHHHHHHHc-CCcHhH
Confidence 4455556666666666666666666665320 0000000 01111223445556666654 235567
Q ss_pred hhHHHhhhhhhc
Q psy14464 80 RRWAAEGLAYLT 91 (175)
Q Consensus 80 ~~~aiEGLAYlS 91 (175)
|..|+..|+++-
T Consensus 123 R~~aa~aL~~~~ 134 (335)
T COG1413 123 RAAAARALGKLG 134 (335)
T ss_pred HHHHHHHHHhcC
Confidence 778888887764
No 59
>PRK00034 gatC aspartyl/glutamyl-tRNA amidotransferase subunit C; Reviewed
Probab=26.34 E-value=1.2e+02 Score=21.70 Aligned_cols=58 Identities=16% Similarity=0.102 Sum_probs=40.7
Q ss_pred HhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcc
Q psy14464 84 AEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEII 142 (175)
Q Consensus 84 iEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~ 142 (175)
+..||.+.+..+.++.+..+=.-+-..++.+...|++-+--...++. .+|..|+++..
T Consensus 11 la~La~l~l~~ee~~~~~~~l~~il~~~~~l~~vd~~~vep~~~~~~-~~~~lReD~~~ 68 (95)
T PRK00034 11 LAKLARLELSEEELEKFAGQLNKILDFVEQLNEVDTEGVEPTTHPLD-MKNVLREDVVT 68 (95)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccCccc-ccCCCCCCCCC
Confidence 56799999999999999999888888888887765443322222222 44567776543
No 60
>PF08167 RIX1: rRNA processing/ribosome biogenesis
Probab=26.26 E-value=2.9e+02 Score=21.85 Aligned_cols=62 Identities=18% Similarity=0.242 Sum_probs=36.5
Q ss_pred HHHH-HhhHHHHHHHHh--c--CCchhHHHHHHHHHHhcCCCCCCCC-CCCCCCCCHHHHHHHHHHhh
Q psy14464 9 TSII-KQGVDILKRLYM--S--KNENIRVRALVGLCKLGSMGGSDAS-IRPFADGSTRKLAEACRRFL 70 (175)
Q Consensus 9 ~~i~-~~g~~~Lk~l~~--s--~~d~ir~~A~V~L~KL~~~~~~d~~-~~~~~~~s~~~La~~~~~~L 70 (175)
..++ ..|..|+..+++ . .+..+...|...|.+|-.....-.+ .|+......-.+...|-+.+
T Consensus 58 ~e~l~~~~~~W~~~Ll~~L~~~~~~~~~~~ai~~L~~l~~~~~~~p~l~Rei~tp~l~~~i~~ll~l~ 125 (165)
T PF08167_consen 58 WEILLSHGSQWLRALLSILEKPDPPSVLEAAIITLTRLFDLIRGKPTLTREIATPNLPKFIQSLLQLL 125 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcCCCchHHHHhhccHHHHHHHHHHHH
Confidence 3444 899999999997 3 3346667777888888754222111 23333444445555555544
No 61
>KOG2003|consensus
Probab=25.88 E-value=3.9e+02 Score=26.42 Aligned_cols=109 Identities=26% Similarity=0.278 Sum_probs=68.3
Q ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhcCC-CCCCC------------------------CCCCCCCCCHHHHHHHHHHhh
Q psy14464 16 VDILKRLYMSKNENIRVRALVGLCKLGSM-GGSDA------------------------SIRPFADGSTRKLAEACRRFL 70 (175)
Q Consensus 16 ~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~-~~~d~------------------------~~~~~~~~s~~~La~~~~~~L 70 (175)
+++||- +.+++.++...|+-.||-|+-- ||.|+ +-..++.|+-++-++..+.-|
T Consensus 439 ieilkv-~~~kdnk~~saaa~nl~~l~flqggk~~~~aqqyad~aln~dryn~~a~~nkgn~~f~ngd~dka~~~ykeal 517 (840)
T KOG2003|consen 439 IEILKV-FEKKDNKTASAAANNLCALRFLQGGKDFADAQQYADIALNIDRYNAAALTNKGNIAFANGDLDKAAEFYKEAL 517 (840)
T ss_pred HHHHHH-HHhccchhhHHHhhhhHHHHHHhcccchhHHHHHHHHHhcccccCHHHhhcCCceeeecCcHHHHHHHHHHHH
Confidence 577764 4566777888999999999843 22221 222478888888888888888
Q ss_pred cCCCCCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 71 VNPARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 71 ~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
.|+ ..++|.|--+-|-++.--.|-+--++.-.|-.++.. +-.++|-+++|.--|-
T Consensus 518 ~nd-------asc~ealfniglt~e~~~~ldeald~f~klh~il~n-n~evl~qianiye~le 572 (840)
T KOG2003|consen 518 NND-------ASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLN-NAEVLVQIANIYELLE 572 (840)
T ss_pred cCc-------hHHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHHhh
Confidence 653 257787777777776655553333344444444432 2355666666655554
No 62
>PF10691 DUF2497: Protein of unknown function (DUF2497) ; InterPro: IPR019632 Members of this family belong to the Alphaproteobacteria. The function of the family is not known.
Probab=25.52 E-value=77 Score=22.71 Aligned_cols=32 Identities=28% Similarity=0.182 Sum_probs=26.9
Q ss_pred hHHHhhhhhhccCHHHHHHHhhC-HHHHHHHHH
Q psy14464 81 RWAAEGLAYLTLDAEVKEALIED-KPALAALVD 112 (175)
Q Consensus 81 ~~aiEGLAYlSL~p~VKE~L~~D-~~~Lk~L~~ 112 (175)
..++|+|+-=-|+|-.|+||-.+ |..++.++.
T Consensus 32 ~~TlE~lvremLRPmLkeWLD~nLP~lVErlVr 64 (73)
T PF10691_consen 32 GRTLEDLVREMLRPMLKEWLDENLPGLVERLVR 64 (73)
T ss_pred cccHHHHHHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 46899999999999999999777 777777764
No 63
>COG3519 Type VI protein secretion system component VasA [Intracellular trafficking, secretion, and vesicular transport]
Probab=25.46 E-value=1.1e+02 Score=30.37 Aligned_cols=37 Identities=24% Similarity=0.353 Sum_probs=30.9
Q ss_pred CchhhhHHHhhhhhhccCHHHHHHHhhC-HHHHHHHHHHh
Q psy14464 76 DPDIRRWAAEGLAYLTLDAEVKEALIED-KPALAALVDVA 114 (175)
Q Consensus 76 ~~~~~~~aiEGLAYlSL~p~VKE~L~~D-~~~Lk~L~~ll 114 (175)
.+..++--+||.|||| +.+++.|=.| |+|-.+|++++
T Consensus 42 ~DP~VERLlEgfAfLs--aR~r~KlDdE~PefT~~LL~~L 79 (621)
T COG3519 42 ADPYVERLLEGFAFLS--ARLRQKLDDEFPEFTHALLEVL 79 (621)
T ss_pred CChhHHHHHHHHHHHH--HHHHHhhhccchHHHHHHHHHh
Confidence 4456899999999998 6677777655 99999999999
No 64
>PF11841 DUF3361: Domain of unknown function (DUF3361)
Probab=25.26 E-value=3.8e+02 Score=21.93 Aligned_cols=119 Identities=17% Similarity=0.189 Sum_probs=80.1
Q ss_pred HHHHHhhHHHHHHHHhcCCc------hhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhH
Q psy14464 9 TSIIKQGVDILKRLYMSKNE------NIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRW 82 (175)
Q Consensus 9 ~~i~~~g~~~Lk~l~~s~~d------~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~ 82 (175)
.||-.+|+++|-.+..++.+ ++=..++-++.-|.--|- .+. +.=.....+++-.|+.....|.+..+-
T Consensus 6 EFI~~~Gl~~L~~~iE~g~~~~~~~~~~La~~L~af~eLMeHg~--vsW----d~l~~~FI~Kia~~Vn~~~~d~~i~q~ 79 (160)
T PF11841_consen 6 EFISRDGLTLLIKMIEEGTEIQPCKGEILAYALTAFVELMEHGI--VSW----DTLSDSFIKKIASYVNSSAMDASILQR 79 (160)
T ss_pred HHHhccCHHHHHHHHHcCCccCcchHHHHHHHHHHHHHHHhcCc--Cch----hhccHHHHHHHHHHHccccccchHHHH
Confidence 47888999999999987663 454556667777775421 111 112356777778888666567777777
Q ss_pred HHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcc-hhhhHHHHhcc
Q psy14464 83 AAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACT-YGVVTTLVNLC 133 (175)
Q Consensus 83 aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~-YG~lsIl~NLT 133 (175)
|..=|-=+-+...-.=.+++.+-.+..|+..+...+..++ |.++-|-+=+-
T Consensus 80 sLaILEs~Vl~S~~ly~~V~~evt~~~Li~hLq~~~~~iq~naiaLinAL~~ 131 (160)
T PF11841_consen 80 SLAILESIVLNSPKLYQLVEQEVTLESLIRHLQVSNQEIQTNAIALINALFL 131 (160)
T ss_pred HHHHHHHHHhCCHHHHHHHhccCCHHHHHHHHHcCCHHHHHHHHHHHHHHHh
Confidence 7766666666666666677777788889988887654443 66666555444
No 65
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=24.46 E-value=2.9e+02 Score=20.27 Aligned_cols=89 Identities=18% Similarity=0.267 Sum_probs=50.9
Q ss_pred HHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcC--CCCCchhhhHHHhhhhhhccCHHHHHHHhh-CHHHH
Q psy14464 31 RVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVN--PARDPDIRRWAAEGLAYLTLDAEVKEALIE-DKPAL 107 (175)
Q Consensus 31 r~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~--~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~-D~~~L 107 (175)
|--+++||+...-+-+.+ +.+..+.+.+.+.+ ...|...|-.|+|.|--++ +. .++.+.. =++.+
T Consensus 3 R~ggli~Laa~ai~l~~~----------~~~~l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~-k~-~~~~~l~~f~~IF 70 (97)
T PF12755_consen 3 RKGGLIGLAAVAIALGKD----------ISKYLDEILPPVLKCFDDQDSRVRYYACEALYNIS-KV-ARGEILPYFNEIF 70 (97)
T ss_pred hhHHHHHHHHHHHHchHh----------HHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHH-HH-HHHHHHHHHHHHH
Confidence 344666776665432211 33333433333332 2246678899999999886 22 2444433 25566
Q ss_pred HHHHHHhhcCCCCcchhhhHHHHhc
Q psy14464 108 AALVDVASSGKPACTYGVVTTLVNL 132 (175)
Q Consensus 108 k~L~~llk~~d~s~~YG~lsIl~NL 132 (175)
..|+.++...|.++..|. ..+-+|
T Consensus 71 ~~L~kl~~D~d~~Vr~~a-~~Ld~l 94 (97)
T PF12755_consen 71 DALCKLSADPDENVRSAA-ELLDRL 94 (97)
T ss_pred HHHHHHHcCCchhHHHHH-HHHHHH
Confidence 666777777788888775 444443
No 66
>KOG2973|consensus
Probab=24.00 E-value=1.4e+02 Score=27.47 Aligned_cols=77 Identities=26% Similarity=0.283 Sum_probs=54.6
Q ss_pred HhhcCCCCCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchH-HH
Q psy14464 68 RFLVNPARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPE-MI 146 (175)
Q Consensus 68 ~~L~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~-l~ 146 (175)
+||-.. .+..++.|||-|-=+|-. --+-++..+...+|.++++++..+. .=-+++++||+. .+++.-+. |.
T Consensus 10 ~ll~~~--sP~v~~~AV~~l~~lt~~-~~~~~~~~~~~~lk~l~qL~~~~~~--~~~a~~alVnls---q~~~l~~~ll~ 81 (353)
T KOG2973|consen 10 ELLHSL--SPPVRKAAVEHLLGLTGR-GLQSLSKYSEALLKDLTQLLKDLDP--AEPAATALVNLS---QKEELRKKLLQ 81 (353)
T ss_pred HHhccC--ChHHHHHHHHHHhhcccc-chhhhccchhhhHHHHHHHccCccc--ccHHHHHHHHHH---hhHHHHHHHHH
Confidence 455443 346789999999888888 7788888889999999999987654 344688999999 34433333 33
Q ss_pred HHHhhh
Q psy14464 147 ELAKFA 152 (175)
Q Consensus 147 ~L~~ya 152 (175)
.|-++.
T Consensus 82 ~~~k~l 87 (353)
T KOG2973|consen 82 DLLKVL 87 (353)
T ss_pred HHHHHH
Confidence 344444
No 67
>KOG4224|consensus
Probab=23.81 E-value=60 Score=30.87 Aligned_cols=60 Identities=25% Similarity=0.359 Sum_probs=53.0
Q ss_pred CCCCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 73 PARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 73 ~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
.++|...+++|-|.|.-+|---+-|..|+.- ..++-|++++++.|..++|=..+-+.|+.
T Consensus 177 kskdirvqrnatgaLlnmThs~EnRr~LV~a-G~lpvLVsll~s~d~dvqyycttaisnIa 236 (550)
T KOG4224|consen 177 KSKDIRVQRNATGALLNMTHSRENRRVLVHA-GGLPVLVSLLKSGDLDVQYYCTTAISNIA 236 (550)
T ss_pred ccchhhHHHHHHHHHHHhhhhhhhhhhhhcc-CCchhhhhhhccCChhHHHHHHHHhhhhh
Confidence 3467677889999999999999999999854 56899999999999999999999999997
No 68
>KOG1104|consensus
Probab=22.63 E-value=2.1e+02 Score=29.06 Aligned_cols=75 Identities=15% Similarity=0.059 Sum_probs=50.2
Q ss_pred CCCHHHHHHHHHHhhcCCC-CCchhhhHHHhhhhhhccCHHHHHHHhhCHHHHHHHHHHhhc-CCCCcchhhhHHHHhcc
Q psy14464 56 DGSTRKLAEACRRFLVNPA-RDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASS-GKPACTYGVVTTLVNLC 133 (175)
Q Consensus 56 ~~s~~~La~~~~~~L~~~~-~~~~~~~~aiEGLAYlSL~p~VKE~L~~D~~~Lk~L~~llk~-~d~s~~YG~lsIl~NLT 133 (175)
..+.+.|++++.+.+..-+ ......+.=+|||.++..+---+ .....|..|..++-. -.+-..||-|++++|+-
T Consensus 21 ~~~~e~l~krl~~~i~~vg~~s~ss~e~~l~~l~~~l~~~~~~----~~~~iL~~L~~ca~~lP~K~~~yaTLvgllN~k 96 (759)
T KOG1104|consen 21 ISPAETLEKRLESLIREVGEPSGSSVEDNLENLVAVLEADLEN----FKSKILDILNTCAVYLPEKITAYATLVGLLNLK 96 (759)
T ss_pred CCcHHHHHHHHHHHHHhhcCCCCCcHHHhHHHHHHHHHhhHHH----HHHHHHHHHHHHHHHcccchhHHHHHHHHHhcc
Confidence 3457899999998876532 22334678899999987654333 233344444444433 36778899999999998
Q ss_pred c
Q psy14464 134 N 134 (175)
Q Consensus 134 n 134 (175)
|
T Consensus 97 n 97 (759)
T KOG1104|consen 97 N 97 (759)
T ss_pred c
Confidence 5
No 69
>KOG1048|consensus
Probab=22.49 E-value=1.6e+02 Score=29.71 Aligned_cols=90 Identities=27% Similarity=0.115 Sum_probs=58.4
Q ss_pred hcCCchhHHHHHHHHHHhcCCCCCCCCCCCCCCCCHHHHHHHHHHhhcCCCCCchhhhHHHhhhhhhccCHHHHHHHhhC
Q psy14464 24 MSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKEALIED 103 (175)
Q Consensus 24 ~s~~d~ir~~A~V~L~KL~~~~~~d~~~~~~~~~s~~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlSL~p~VKE~L~~D 103 (175)
.+.++.+-.-++=+|=-|.+..+.-..-....-+-.++--..+.++|.++ +++..+.++=-|-=+|+|+.-||.+.
T Consensus 529 ~s~n~~TlEasaGaLQNltA~~~~~~~~~~~~v~~kekgl~~l~~ll~~~--~~~vv~s~a~~LrNls~d~rnk~lig-- 604 (717)
T KOG1048|consen 529 LSKNDNTLEASAGALQNLTAGLWTWSEYMRGAVFRKEKGLPPLVELLRND--DSDVVRSAAGALRNLSRDIRNKELIG-- 604 (717)
T ss_pred HhcchHHHHHhhhhHhhhhccCCcchhHHHhhhhhhccCccHHHHHHhcC--CchHHHHHHHHHhhhccCchhhhhhh--
Confidence 46777777766666666655433221100000001122234566777664 66778888889999999999999997
Q ss_pred HHHHHHHHHHhhcC
Q psy14464 104 KPALAALVDVASSG 117 (175)
Q Consensus 104 ~~~Lk~L~~llk~~ 117 (175)
...++.|++.+...
T Consensus 605 k~a~~~lv~~Lp~~ 618 (717)
T KOG1048|consen 605 KYAIPDLVRCLPGS 618 (717)
T ss_pred cchHHHHHHhCcCC
Confidence 88999999999763
No 70
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=21.66 E-value=68 Score=25.44 Aligned_cols=41 Identities=17% Similarity=0.242 Sum_probs=32.2
Q ss_pred HHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchHHHHHHh
Q psy14464 108 AALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPEMIELAK 150 (175)
Q Consensus 108 k~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~l~~L~~ 150 (175)
..+++-++..+.| |++-.|+.||-|.|.+...++-|.+|..
T Consensus 4 ~~Il~y~~~qNRP--ys~~di~~nL~~~~~K~~v~k~Ld~L~~ 44 (169)
T PF07106_consen 4 DAILEYMKEQNRP--YSAQDIFDNLHNKVGKTAVQKALDSLVE 44 (169)
T ss_pred HHHHHHHHHcCCC--CcHHHHHHHHHhhccHHHHHHHHHHHHh
Confidence 3456677666554 9999999999999999887887888863
No 71
>PF09838 DUF2065: Uncharacterized protein conserved in bacteria (DUF2065); InterPro: IPR019201 This entry represents a protein found in various prokaryotic proteins, and has no known function.
Probab=21.59 E-value=67 Score=21.81 Aligned_cols=19 Identities=21% Similarity=0.188 Sum_probs=14.8
Q ss_pred HHHhhhhhhccCHHHHHHH
Q psy14464 82 WAAEGLAYLTLDAEVKEAL 100 (175)
Q Consensus 82 ~aiEGLAYlSL~p~VKE~L 100 (175)
-.+|||-|.-.+...|+.+
T Consensus 9 liiEGl~~~l~P~~~r~~l 27 (57)
T PF09838_consen 9 LIIEGLLPFLAPERWRRML 27 (57)
T ss_pred HHHHhhHHHhCHHHHHHHH
Confidence 4689999998877777654
No 72
>PRK07418 acetolactate synthase 3 catalytic subunit; Reviewed
Probab=21.49 E-value=4.1e+02 Score=25.31 Aligned_cols=103 Identities=15% Similarity=0.093 Sum_probs=57.0
Q ss_pred CCCCCCHHHHHHHHHHhhcCCCCCc---hhhhHHHh-------hhhhhccCHHHHHHH--------------------hh
Q psy14464 53 PFADGSTRKLAEACRRFLVNPARDP---DIRRWAAE-------GLAYLTLDAEVKEAL--------------------IE 102 (175)
Q Consensus 53 ~~~~~s~~~La~~~~~~L~~~~~~~---~~~~~aiE-------GLAYlSL~p~VKE~L--------------------~~ 102 (175)
.+.+.+...+.+-+.+|-..-...+ .....|+. |=+|++++-||=++- .-
T Consensus 130 ~~Qe~d~~~~~~~vtk~~~~v~~~~~i~~~l~~A~~~A~~~~~GPv~l~iP~Dv~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (616)
T PRK07418 130 AFQETDIFGITLPIVKHSYVVRDPSDMARIVAEAFHIASSGRPGPVLIDIPKDVGQEEFDYVPVEPGSVKPPGYRPTVKG 209 (616)
T ss_pred CcccccHHHHhhhcceeEEEeCCHHHHHHHHHHHHHHHhcCCCCcEEEecchhhhhchhcccccCccccccCCCCCCCCC
Confidence 3555556666666655532111111 12223333 667888887763211 12
Q ss_pred CHHHHHHHHHHhhcCCCCcchhhhHHHHhcccCCCCCCcchHHHHHHhhh----------cCCCCCCCCCC
Q psy14464 103 DKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQEIIPEMIELAKFA----------KHHIPEDHELD 163 (175)
Q Consensus 103 D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLTn~Y~~~e~~~~l~~L~~ya----------~~~~pe~~pld 163 (175)
|++.+.++.++++.+..+++|.+--+ . -....+++.+|+.-- +.-+|+.||+-
T Consensus 210 ~~~~v~~~~~~L~~AkrPvI~~G~g~------~--~~~a~~~l~~lae~l~~pV~tt~~gkg~~p~~hpl~ 272 (616)
T PRK07418 210 NPRQINAALKLIEEAERPLLYVGGGA------I--SAGAHAELKELAERFQIPVTTTLMGKGAFDEHHPLS 272 (616)
T ss_pred CHHHHHHHHHHHHhCCCCEEEECCCc------C--cccHHHHHHHHHHHHCCCEEEccCCCcCCCCCCccc
Confidence 67789999999998888888744333 1 012344555555311 34588889974
No 73
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=21.40 E-value=6.6e+02 Score=23.25 Aligned_cols=108 Identities=15% Similarity=0.046 Sum_probs=59.0
Q ss_pred hHHHHHHHHhcCCchhHHHHHHHHHHhcCCCCCCCCCC-CCCCCCH--HHHHHHHHHhhcCCCCCchhhhHHHhhhhhhc
Q psy14464 15 GVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIR-PFADGST--RKLAEACRRFLVNPARDPDIRRWAAEGLAYLT 91 (175)
Q Consensus 15 g~~~Lk~l~~s~~d~ir~~A~V~L~KL~~~~~~d~~~~-~~~~~s~--~~La~~~~~~L~~~~~~~~~~~~aiEGLAYlS 91 (175)
..+.|...+.+.|+.||.-|+-+|..+|.....+.=.. ....|.. ..++..+.. . .+ ..++|-|.-+.
T Consensus 179 a~~~L~~al~d~~~~VR~aA~~al~~lG~~~A~~~l~~~~~~~g~~~~~~l~~~lal-~----~~----~~a~~~L~~ll 249 (410)
T TIGR02270 179 SESTLRLYLRDSDPEVRFAALEAGLLAGSRLAWGVCRRFQVLEGGPHRQRLLVLLAV-A----GG----PDAQAWLRELL 249 (410)
T ss_pred chHHHHHHHcCCCHHHHHHHHHHHHHcCCHhHHHHHHHHHhccCccHHHHHHHHHHh-C----Cc----hhHHHHHHHHh
Confidence 34667788889999999999999999986311000000 0011111 111111111 0 00 13334444444
Q ss_pred cCHHHHHH------HhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 92 LDAEVKEA------LIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 92 L~p~VKE~------L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
=++.|++. ...|+.++..|++.+... .+-.-..-.|..+|
T Consensus 250 ~d~~vr~~a~~AlG~lg~p~av~~L~~~l~d~--~~aR~A~eA~~~It 295 (410)
T TIGR02270 250 QAAATRREALRAVGLVGDVEAAPWCLEAMREP--PWARLAGEAFSLIT 295 (410)
T ss_pred cChhhHHHHHHHHHHcCCcchHHHHHHHhcCc--HHHHHHHHHHHHhh
Confidence 44444432 235888889999888743 36666777888888
No 74
>PF04063 DUF383: Domain of unknown function (DUF383); InterPro: IPR007205 This is a protein of unknown function. It is found N-terminal to another domain of unknown function (IPR007206 from INTERPRO).
Probab=20.92 E-value=4.5e+02 Score=21.78 Aligned_cols=57 Identities=23% Similarity=0.218 Sum_probs=41.5
Q ss_pred hhhhHHHhhhhhhccCHHHHHHHhhCHHH---HHHHHHHhhcCCCCcchhhhHHHHhccc
Q psy14464 78 DIRRWAAEGLAYLTLDAEVKEALIEDKPA---LAALVDVASSGKPACTYGVVTTLVNLCN 134 (175)
Q Consensus 78 ~~~~~aiEGLAYlSL~p~VKE~L~~D~~~---Lk~L~~llk~~d~s~~YG~lsIl~NLTn 134 (175)
+..+|..==|+=+|--|++|+.|...... ++.|+-.....+.---=|+++||=|||.
T Consensus 73 ~~~~yla~vl~NlS~~~~gR~~~l~~~~~~~~l~kLl~ft~~~s~iRR~Gva~~IrNccF 132 (192)
T PF04063_consen 73 DNYDYLASVLANLSQLPEGRQFFLDPQRYDGPLQKLLPFTEHKSVIRRGGVAGTIRNCCF 132 (192)
T ss_pred cchhHHHHHHHHhcCCHHHHHHHhCchhhhhHHHHHHHHhccCcHHHHHHHHHHHHHhhc
Confidence 34567777788889999999999876554 4566655554433334799999999994
No 75
>PF03489 SapB_2: Saposin-like type B, region 2; InterPro: IPR008138 Saposins are small lysosomal proteins that serve as activators of various lysosomal lipid-degrading enzymes []. They probably act by isolating the lipid substrate from the membrane surroundings, thus making it more accessible to the soluble degradative enzymes. All mammalian saposins are synthesized as a single precursor molecule (prosaposin) which contains four Saposin-B domains, yielding the active saposins after proteolytic cleavage, and two Saposin-A domains that are removed in the activation reaction. The Saposin-B domains also occur in other proteins, many of them active in the lysis of membranes [, ].; PDB: 3BQQ_A 2RB3_B 2R0R_A 3BQP_A 2R1Q_A 1NKL_A 1L9L_A 1QDM_C 3RFI_A 4DDJ_A ....
Probab=20.64 E-value=1.1e+02 Score=17.79 Aligned_cols=34 Identities=15% Similarity=0.173 Sum_probs=20.8
Q ss_pred hhhHHHHHHhhHHHHHHHHhcCCchhHHHHHHHH
Q psy14464 5 SHSITSIIKQGVDILKRLYMSKNENIRVRALVGL 38 (175)
Q Consensus 5 ~~~~~~i~~~g~~~Lk~l~~s~~d~ir~~A~V~L 38 (175)
++.|..++.+..+.+-+++.+.-+.-.+=+.+++
T Consensus 1 ~~~C~~~V~~y~~~ii~~l~~~~~p~~iC~~i~~ 34 (35)
T PF03489_consen 1 SDECKNFVDQYGPQIIQLLEKQLDPQQICTKIGL 34 (35)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTSTHHHHHHHTTS
T ss_pred CcHHHHHHHHHHHHHHHHHHhcCChHHHHHHcCC
Confidence 3578888888887766666644444434444433
No 76
>COG1088 RfbB dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]
Probab=20.44 E-value=49 Score=30.28 Aligned_cols=35 Identities=31% Similarity=0.594 Sum_probs=24.9
Q ss_pred chhhhHHHHhcccCCCCCCc----chHHHHHHhhhcCCCC
Q psy14464 122 TYGVVTTLVNLCNAYDKQEI----IPEMIELAKFAKHHIP 157 (175)
Q Consensus 122 ~YG~lsIl~NLTn~Y~~~e~----~~~l~~L~~ya~~~~p 157 (175)
+||+=.+++||+|-|-|-+- +|.|. +.+..+.++|
T Consensus 169 TYglp~~ItrcSNNYGPyqfpEKlIP~~I-~nal~g~~lp 207 (340)
T COG1088 169 TYGLPATITRCSNNYGPYQFPEKLIPLMI-INALLGKPLP 207 (340)
T ss_pred HcCCceEEecCCCCcCCCcCchhhhHHHH-HHHHcCCCCc
Confidence 59999999999999999543 34332 4455566566
No 77
>PF14496 NEL: C-terminal novel E3 ligase, LRR-interacting; PDB: 3CVR_A 3CKD_B 3G06_A 3L3P_A.
Probab=20.43 E-value=1.7e+02 Score=24.56 Aligned_cols=40 Identities=28% Similarity=0.292 Sum_probs=34.3
Q ss_pred HHHHHHHhhCHHHHHHHHHHhhcCCCCcchhhhHHHHhcc
Q psy14464 94 AEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNLC 133 (175)
Q Consensus 94 p~VKE~L~~D~~~Lk~L~~llk~~d~s~~YG~lsIl~NLT 133 (175)
+.|=+.+..|+..-+.+|.++.....+|.=++..+|.||-
T Consensus 54 ~~~L~~~a~d~~LR~~lF~~A~~a~~tC~D~v~l~f~~me 93 (221)
T PF14496_consen 54 WTVLEAAAQDPELREQLFDIAEDATGTCEDRVALTFSNME 93 (221)
T ss_dssp HHHHHHHHH-HHHHHHHHHHHHHTCCSCHCHHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHHhcCCCCCCchHHHHHHHHHH
Confidence 3456778899999999999999998999999999999994
Done!