Query psy7537
Match_columns 224
No_of_seqs 4 out of 6
Neff 1.4
Searched_HMMs 46136
Date Fri Aug 16 20:53:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7537.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7537hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00038 Filament: Intermediat 93.0 0.48 1E-05 39.3 7.2 83 117-201 52-135 (312)
2 KOG0977|consensus 92.6 0.48 1E-05 45.9 7.6 144 61-204 42-218 (546)
3 PF01576 Myosin_tail_1: Myosin 89.3 0.11 2.4E-06 51.0 0.0 54 86-139 126-179 (859)
4 PF01576 Myosin_tail_1: Myosin 88.0 0.15 3.3E-06 50.0 0.0 135 57-195 127-269 (859)
5 PF05529 Bap31: B-cell recepto 86.4 1.5 3.2E-05 35.0 4.7 33 142-180 154-186 (192)
6 PRK09039 hypothetical protein; 82.4 36 0.00078 30.4 13.8 77 139-216 134-221 (343)
7 KOG0977|consensus 81.1 14 0.00031 36.0 9.8 94 63-160 94-194 (546)
8 TIGR02168 SMC_prok_B chromosom 80.3 40 0.00086 31.6 12.0 17 143-159 434-450 (1179)
9 TIGR00606 rad50 rad50. This fa 74.0 1E+02 0.0022 31.8 13.8 25 175-199 873-897 (1311)
10 PRK09039 hypothetical protein; 73.0 36 0.00078 30.4 9.3 40 142-181 116-156 (343)
11 KOG1962|consensus 72.8 8.1 0.00018 33.8 5.1 81 87-180 130-211 (216)
12 PF05667 DUF812: Protein of un 71.5 1.1E+02 0.0023 30.0 14.0 126 53-180 320-458 (594)
13 PF14197 Cep57_CLD_2: Centroso 69.9 12 0.00026 27.1 4.7 62 117-180 3-65 (69)
14 PF11932 DUF3450: Protein of u 69.8 31 0.00068 28.6 7.8 17 113-129 50-66 (251)
15 PF12711 Kinesin-relat_1: Kine 67.6 14 0.0003 28.4 4.9 37 142-178 24-60 (86)
16 PF08317 Spc7: Spc7 kinetochor 65.3 60 0.0013 28.3 8.9 48 162-209 251-304 (325)
17 PF00038 Filament: Intermediat 65.0 22 0.00049 29.5 6.0 88 62-160 55-142 (312)
18 PF05529 Bap31: B-cell recepto 64.0 22 0.00048 28.3 5.6 39 162-200 154-192 (192)
19 PF07106 TBPIP: Tat binding pr 59.8 55 0.0012 25.6 7.1 79 112-199 72-161 (169)
20 PF12711 Kinesin-relat_1: Kine 57.4 16 0.00035 28.0 3.6 68 107-178 19-86 (86)
21 COG1579 Zn-ribbon protein, pos 56.7 1.4E+02 0.0031 26.4 12.6 12 116-127 63-74 (239)
22 PF15058 Speriolin_N: Sperioli 55.7 7.7 0.00017 34.0 1.8 33 86-133 14-46 (200)
23 TIGR03185 DNA_S_dndD DNA sulfu 54.8 1.7E+02 0.0037 27.7 10.5 48 112-159 391-438 (650)
24 PF09730 BicD: Microtubule-ass 53.7 1.8E+02 0.004 29.4 11.1 111 55-170 31-156 (717)
25 PF02403 Seryl_tRNA_N: Seryl-t 53.5 43 0.00093 24.2 5.2 39 141-179 49-91 (108)
26 PF06156 DUF972: Protein of un 53.2 31 0.00067 26.8 4.6 50 145-194 18-84 (107)
27 cd00938 HisRS_RNA HisRS_RNA bi 52.4 8.5 0.00018 26.9 1.3 35 90-128 7-41 (45)
28 PF09744 Jnk-SapK_ap_N: JNK_SA 50.6 1.4E+02 0.0031 24.6 8.5 72 85-158 23-94 (158)
29 PF12325 TMF_TATA_bd: TATA ele 50.4 79 0.0017 25.0 6.6 25 100-124 39-63 (120)
30 PF05557 MAD: Mitotic checkpoi 49.2 63 0.0014 31.0 6.9 83 110-192 515-629 (722)
31 TIGR00606 rad50 rad50. This fa 48.4 3.1E+02 0.0068 28.3 12.0 49 144-192 972-1021(1311)
32 PF13094 CENP-Q: CENP-Q, a CEN 48.4 1.3E+02 0.0028 23.4 7.9 76 43-120 9-84 (160)
33 PRK10929 putative mechanosensi 47.4 3E+02 0.0066 29.1 11.8 33 113-145 280-312 (1109)
34 PF11932 DUF3450: Protein of u 47.3 1.7E+02 0.0036 24.4 11.4 118 84-204 49-167 (251)
35 PRK13676 hypothetical protein; 47.2 89 0.0019 23.2 6.1 68 107-179 31-98 (114)
36 PF11559 ADIP: Afadin- and alp 47.0 1.2E+02 0.0027 23.2 7.1 34 145-178 111-149 (151)
37 PRK11281 hypothetical protein; 46.7 1E+02 0.0022 32.3 8.4 35 113-147 300-334 (1113)
38 PF13166 AAA_13: AAA domain 45.9 2.5E+02 0.0054 26.0 10.1 53 143-195 418-471 (712)
39 PF12325 TMF_TATA_bd: TATA ele 45.2 1.3E+02 0.0028 23.8 7.1 85 62-161 31-115 (120)
40 PF06133 DUF964: Protein of un 44.8 80 0.0017 22.5 5.4 65 105-174 26-90 (108)
41 KOG0161|consensus 44.6 3E+02 0.0065 31.1 11.7 111 85-198 1098-1218(1930)
42 KOG0612|consensus 43.4 1.3E+02 0.0028 32.8 8.6 25 62-88 462-486 (1317)
43 PF15058 Speriolin_N: Sperioli 43.4 23 0.0005 31.2 2.8 15 144-158 14-28 (200)
44 KOG0946|consensus 42.1 1.6E+02 0.0034 31.2 8.8 76 49-129 634-709 (970)
45 PF10168 Nup88: Nuclear pore c 40.7 2.6E+02 0.0055 27.9 9.7 61 113-180 601-664 (717)
46 PF05701 WEMBL: Weak chloropla 40.3 3.2E+02 0.0069 25.7 11.5 66 95-160 113-190 (522)
47 KOG0933|consensus 39.9 3.7E+02 0.0079 29.2 11.1 147 53-203 786-940 (1174)
48 KOG0995|consensus 39.7 3.8E+02 0.0083 27.0 10.7 44 110-159 306-349 (581)
49 PF11559 ADIP: Afadin- and alp 38.7 1.8E+02 0.0039 22.3 10.4 18 61-78 38-55 (151)
50 KOG4674|consensus 37.8 3E+02 0.0064 31.1 10.4 173 7-198 1191-1382(1822)
51 KOG0995|consensus 37.6 4.5E+02 0.0096 26.6 12.6 111 11-130 220-350 (581)
52 PRK10884 SH3 domain-containing 37.4 1.9E+02 0.0041 24.6 7.3 26 107-132 88-113 (206)
53 KOG4674|consensus 35.9 7E+02 0.015 28.3 13.2 92 110-202 1276-1382(1822)
54 PF12128 DUF3584: Protein of u 35.5 5.2E+02 0.011 26.7 11.4 35 100-134 644-678 (1201)
55 PF07889 DUF1664: Protein of u 35.3 61 0.0013 26.1 3.9 41 142-182 50-91 (126)
56 KOG0240|consensus 34.6 5.1E+02 0.011 26.3 10.9 33 98-130 400-432 (607)
57 COG4942 Membrane-bound metallo 34.1 3.8E+02 0.0082 25.8 9.4 45 86-130 54-98 (420)
58 COG1196 Smc Chromosome segrega 33.3 5.5E+02 0.012 26.3 12.0 37 95-131 811-847 (1163)
59 PF00846 Hanta_nucleocap: Hant 31.9 1.1E+02 0.0023 29.8 5.5 103 113-219 3-129 (428)
60 PRK11020 hypothetical protein; 31.6 71 0.0015 26.3 3.7 47 110-160 3-49 (118)
61 PF01519 DUF16: Protein of unk 31.1 1.2E+02 0.0025 24.3 4.8 37 166-202 64-100 (102)
62 PF15070 GOLGA2L5: Putative go 30.7 5.4E+02 0.012 25.4 11.6 27 167-193 179-205 (617)
63 PF02403 Seryl_tRNA_N: Seryl-t 30.1 1.3E+02 0.0029 21.6 4.7 41 84-128 50-90 (108)
64 PF05911 DUF869: Plant protein 30.0 5.6E+02 0.012 26.2 10.3 39 55-98 18-56 (769)
65 PF14662 CCDC155: Coiled-coil 29.8 1.2E+02 0.0026 26.5 5.0 70 114-192 17-91 (193)
66 PF14197 Cep57_CLD_2: Centroso 29.8 1E+02 0.0022 22.3 4.0 20 110-129 45-64 (69)
67 PF08826 DMPK_coil: DMPK coile 29.7 1.2E+02 0.0027 21.8 4.3 55 66-129 2-56 (61)
68 TIGR00219 mreC rod shape-deter 29.6 65 0.0014 28.0 3.4 14 167-180 96-109 (283)
69 PF04420 CHD5: CHD5-like prote 29.2 1.4E+02 0.0029 24.0 5.0 26 191-216 95-120 (161)
70 KOG0240|consensus 29.2 4E+02 0.0087 27.0 9.0 60 122-183 396-456 (607)
71 KOG4603|consensus 29.0 1.2E+02 0.0025 27.0 4.9 46 101-148 68-113 (201)
72 PRK11020 hypothetical protein; 29.0 1.3E+02 0.0029 24.8 4.9 37 142-178 5-47 (118)
73 PF03962 Mnd1: Mnd1 family; I 28.6 2.8E+02 0.0061 22.9 6.8 26 104-129 102-127 (188)
74 PF06133 DUF964: Protein of un 27.8 98 0.0021 22.0 3.6 73 76-151 25-97 (108)
75 PF04111 APG6: Autophagy prote 27.5 1.6E+02 0.0036 26.0 5.6 14 167-180 111-124 (314)
76 PF05667 DUF812: Protein of un 27.1 3.4E+02 0.0074 26.6 8.1 29 171-199 396-424 (594)
77 TIGR03007 pepcterm_ChnLen poly 27.1 4.5E+02 0.0098 23.4 11.7 21 112-132 275-295 (498)
78 PF13851 GAS: Growth-arrest sp 27.0 3.7E+02 0.0081 22.4 7.4 100 84-196 27-134 (201)
79 PRK13922 rod shape-determining 26.7 93 0.002 25.9 3.8 37 142-181 76-112 (276)
80 PRK11281 hypothetical protein; 26.3 3.9E+02 0.0085 28.3 8.8 52 112-172 278-329 (1113)
81 PF08826 DMPK_coil: DMPK coile 25.9 1.7E+02 0.0036 21.1 4.4 52 126-179 4-56 (61)
82 PRK13729 conjugal transfer pil 25.9 84 0.0018 30.5 3.7 18 161-178 103-120 (475)
83 PF00458 WHEP-TRS: WHEP-TRS do 25.2 1.5E+02 0.0032 20.9 4.0 37 142-178 3-39 (56)
84 KOG1962|consensus 24.7 2E+02 0.0043 25.4 5.5 18 142-159 193-210 (216)
85 KOG0980|consensus 24.5 9E+02 0.02 26.0 11.4 36 122-159 448-483 (980)
86 PF10458 Val_tRNA-synt_C: Valy 24.5 2.4E+02 0.0053 19.4 5.0 21 110-130 2-22 (66)
87 PRK02224 chromosome segregatio 24.0 6.5E+02 0.014 24.1 12.3 10 190-199 436-445 (880)
88 PF13851 GAS: Growth-arrest sp 22.7 4.5E+02 0.0098 21.9 8.4 33 142-180 100-132 (201)
89 PF10354 DUF2431: Domain of un 22.6 36 0.00079 27.3 0.6 30 185-214 32-61 (166)
90 PF04912 Dynamitin: Dynamitin 22.5 1.7E+02 0.0036 26.1 4.8 53 109-161 91-144 (388)
91 PRK13922 rod shape-determining 22.4 1E+02 0.0022 25.7 3.1 12 166-177 80-91 (276)
92 KOG0161|consensus 22.3 8.4E+02 0.018 27.8 10.6 102 102-205 1594-1710(1930)
93 PRK04778 septation ring format 22.0 6.7E+02 0.015 23.6 9.2 130 62-200 257-393 (569)
94 KOG0946|consensus 21.6 8.1E+02 0.017 26.3 9.8 58 8-66 703-760 (970)
95 PF10212 TTKRSYEDQ: Predicted 20.8 3.5E+02 0.0076 26.8 6.8 77 106-184 428-509 (518)
96 KOG1853|consensus 20.8 3.4E+02 0.0073 25.6 6.4 18 110-127 64-81 (333)
97 PF10498 IFT57: Intra-flagella 20.5 4.9E+02 0.011 23.9 7.4 46 114-161 275-320 (359)
98 COG3657 Uncharacterized protei 20.3 52 0.0011 26.5 1.0 39 183-221 23-67 (100)
99 PRK03918 chromosome segregatio 20.2 7.5E+02 0.016 23.5 11.9 32 97-128 199-230 (880)
No 1
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=92.97 E-value=0.48 Score=39.30 Aligned_cols=83 Identities=17% Similarity=0.375 Sum_probs=49.9
Q ss_pred hcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHH
Q psy7537 117 QVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDF 195 (224)
Q Consensus 117 lsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSnAqsKkhlaqryddf 195 (224)
.-.++..+.++||....-+.... .++..+-.++..++.++..- +..+.+-.++..|++.+|.-.-+...|-.+...+
T Consensus 52 ye~el~~lr~~id~~~~eka~l~--~e~~~l~~e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L 129 (312)
T PF00038_consen 52 YEEELRELRRQIDDLSKEKARLE--LEIDNLKEELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSL 129 (312)
T ss_dssp HHHHHHCHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhHHHHhHHhhhhHHHHhhHHh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHH
Confidence 34555666666666666555554 66666666666666666544 4444555566666777766666666666555555
Q ss_pred Hhhhcc
Q psy7537 196 MKELDY 201 (224)
Q Consensus 196 mkeldy 201 (224)
..+|+|
T Consensus 130 ~eEl~f 135 (312)
T PF00038_consen 130 KEELEF 135 (312)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 555544
No 2
>KOG0977|consensus
Probab=92.55 E-value=0.48 Score=45.88 Aligned_cols=144 Identities=19% Similarity=0.301 Sum_probs=96.0
Q ss_pred hhHHHHHHHhhHHHHhhhhhchHHHH-------------------hh-hhcchhhhhhhcchhhhhhhhchhhhhhhcch
Q psy7537 61 DNQAETKKQLESEIETKKQLESEAET-------------------KK-QVDSQGETKKQVDSQAESKKQVDAETKKQVDS 120 (224)
Q Consensus 61 d~~a~~kkQL~s~ies~~ql~adAe~-------------------K~-l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe 120 (224)
.++..|--+|..-|+--.+|+++--+ |- ...+-+.+.+-++..+-.+..+..|+++|.++
T Consensus 42 ~El~~LNDRLA~YIekVR~LEaqN~~L~~di~~lr~~~~~~ts~ik~~ye~El~~ar~~l~e~~~~ra~~e~ei~kl~~e 121 (546)
T KOG0977|consen 42 KELQELNDRLAVYIEKVRFLEAQNRKLEHDINLLRGVVGRETSGIKAKYEAELATARKLLDETARERAKLEIEITKLREE 121 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcchhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 35566667777777766655543211 00 01122345667888888999999999999999
Q ss_pred HHHHHHhhhhhhhhhhccCcH--HHHhhhhchHHHHH------HhhccH-----HHHhhhhchHHHHHHhhchHHHHHHH
Q psy7537 121 QAETKKQLDSQAETKKQFDSQ--AETKKQFDSQAETK------KQFDSQ-----AETKKQLDSQAEMKKQFDSQAEAKKH 187 (224)
Q Consensus 121 ~AalKkQLDSnAqsk~sadsq--AE~KrlsDe~AalK------kQlDSn-----Ae~KrlsDe~AalKKQlDSnAqsKkh 187 (224)
+..+++.++......-.+-.. -.+.++++-.|++. +.|... +|.-|+-++++.+++|+|----.+-.
T Consensus 122 ~~elr~~~~~~~k~~~~~re~~~~~~~~l~~leAe~~~~krr~~~le~e~~~Lk~en~rl~~~l~~~r~~ld~Etllr~d 201 (546)
T KOG0977|consen 122 LKELRKKLEKAEKERRGAREKLDDYLSRLSELEAEINTLKRRIKALEDELKRLKAENSRLREELARARKQLDDETLLRVD 201 (546)
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 999999998876555444321 12444444444321 111111 56667888899999999988888888
Q ss_pred HHHhHHHHHhhhccccc
Q psy7537 188 LAQRYDDFMKELDYKET 204 (224)
Q Consensus 188 laqryddfmkeldyket 204 (224)
+--+-.+.+.+|+|...
T Consensus 202 ~~n~~q~Lleel~f~~~ 218 (546)
T KOG0977|consen 202 LQNRVQTLLEELAFLKR 218 (546)
T ss_pred HHhHHHHHHHHHHHHHh
Confidence 88888899999988764
No 3
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=89.33 E-value=0.11 Score=50.97 Aligned_cols=54 Identities=22% Similarity=0.329 Sum_probs=0.0
Q ss_pred HhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccC
Q psy7537 86 TKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFD 139 (224)
Q Consensus 86 ~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sad 139 (224)
.|+..+...+|..|++.---.+..++.+-..+-.++..|..+||+-...|..+.
T Consensus 126 rkkh~~~~~eL~eqle~lqk~k~~lEK~k~~l~~e~~dL~~~l~~~~k~k~~~E 179 (859)
T PF01576_consen 126 RKKHQDAVAELNEQLEQLQKQKAKLEKEKSQLEAELDDLQAQLDSLQKAKQEAE 179 (859)
T ss_dssp ------------------------------------------------------
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444444444444444444444444444443333
No 4
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=87.96 E-value=0.15 Score=49.97 Aligned_cols=135 Identities=20% Similarity=0.299 Sum_probs=0.0
Q ss_pred hHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhh
Q psy7537 57 NKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKK 136 (224)
Q Consensus 57 ~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~ 136 (224)
.|..+.++++..|++..-..+. ..+-+...+-.+...|..++++-..++..++.-.|.+-..+++++..++..-....
T Consensus 127 kkh~~~~~eL~eqle~lqk~k~--~lEK~k~~l~~e~~dL~~~l~~~~k~k~~~Ek~~K~lE~qL~El~~klee~er~~~ 204 (859)
T PF01576_consen 127 KKHQDAVAELNEQLEQLQKQKA--KLEKEKSQLEAELDDLQAQLDSLQKAKQEAEKKRKQLEAQLNELQAKLEESERQRN 204 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HhHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 5677788888888888777777 55566777778888999999999999999999999999999999999998887777
Q ss_pred ccCcHHHHhhhhchHHHHHHhhccH--------HHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHH
Q psy7537 137 QFDSQAETKKQFDSQAETKKQFDSQ--------AETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDF 195 (224)
Q Consensus 137 sadsqAE~KrlsDe~AalKkQlDSn--------Ae~KrlsDe~AalKKQlDSnAqsKkhlaqryddf 195 (224)
.+. ....|+..++..|.++|..- ..-..+...+..++.+|+....+|..|.......
T Consensus 205 el~--~~k~kL~~E~~eL~~qLee~e~~~~~l~r~k~~L~~qLeelk~~leeEtr~k~~L~~~l~~l 269 (859)
T PF01576_consen 205 ELT--EQKAKLQSENSELTRQLEEAESQLSQLQREKSSLESQLEELKRQLEEETRAKQALEKQLRQL 269 (859)
T ss_dssp -------------------------------------------------------------------
T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhHHHHHHH
Confidence 766 77889999999999998754 1222344556678888888887777766554433
No 5
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=86.35 E-value=1.5 Score=34.98 Aligned_cols=33 Identities=24% Similarity=0.301 Sum_probs=19.4
Q ss_pred HHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhhch
Q psy7537 142 AETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQlDS 180 (224)
.+.++...|++.+|+++.. ...+..+||||.+.
T Consensus 154 ~~~~~~~~ei~~lk~el~~------~~~~~~~LkkQ~~~ 186 (192)
T PF05529_consen 154 EENKKLSEEIEKLKKELEK------KEKEIEALKKQSEG 186 (192)
T ss_pred hhhhhhHHHHHHHHHHHHH------HHHHHHHHHHHHHH
Confidence 4555666666666666654 33455666666543
No 6
>PRK09039 hypothetical protein; Validated
Probab=82.41 E-value=36 Score=30.37 Aligned_cols=77 Identities=14% Similarity=0.239 Sum_probs=40.0
Q ss_pred CcHHHHhhhhchHHHHHHhhccH--------HHHhhhhchHHHHHHhhchHHHH-HHHHHHhHHHHHhhh-cccc-cCcc
Q psy7537 139 DSQAETKKQFDSQAETKKQFDSQ--------AETKKQLDSQAEMKKQFDSQAEA-KKHLAQRYDDFMKEL-DYKE-TSGT 207 (224)
Q Consensus 139 dsqAE~KrlsDe~AalKkQlDSn--------Ae~KrlsDe~AalKKQlDSnAqs-Kkhlaqryddfmkel-dyke-tsgt 207 (224)
+.+.++.+|..+|++|++|+.+- ++.+.+-..++.|+..|..--.. -+.|++--.+|+..| +.-. ..|-
T Consensus 134 e~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L~~a~~~~~~~l~~~~~~~~~~l~~~~~~~~~i 213 (343)
T PRK09039 134 RALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVALAQRVQELNRYRSEFFGRLREILGDREGI 213 (343)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhCCCCCc
Confidence 34477888888888888886543 12222223334444444433222 233444346787666 2433 3466
Q ss_pred ceecceeee
Q psy7537 208 TIHHGIRYR 216 (224)
Q Consensus 208 tihhgiryr 216 (224)
.|. |-|+.
T Consensus 214 ri~-g~~~~ 221 (343)
T PRK09039 214 RIV-GDRFV 221 (343)
T ss_pred EEE-CCEEE
Confidence 666 55554
No 7
>KOG0977|consensus
Probab=81.12 E-value=14 Score=36.03 Aligned_cols=94 Identities=19% Similarity=0.328 Sum_probs=49.9
Q ss_pred HHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhh----hchhhhhhhcchHHHHH---Hhhhhhhhhh
Q psy7537 63 QAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKK----QVDAETKKQVDSQAETK---KQLDSQAETK 135 (224)
Q Consensus 63 ~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~----saDAE~KrlsDe~AalK---kQLDSnAqsk 135 (224)
+++..+-|++..-.+. ..+.++++|.++...+++.+......+- .++-...++++-+|++. .-...--...
T Consensus 94 l~~ar~~l~e~~~~ra--~~e~ei~kl~~e~~elr~~~~~~~k~~~~~re~~~~~~~~l~~leAe~~~~krr~~~le~e~ 171 (546)
T KOG0977|consen 94 LATARKLLDETARERA--KLEIEITKLREELKELRKKLEKAEKERRGAREKLDDYLSRLSELEAEINTLKRRIKALEDEL 171 (546)
T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHhHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH
Confidence 3445555555555555 5567778888888888777776654443 33434445555554432 1111111111
Q ss_pred hccCcHHHHhhhhchHHHHHHhhcc
Q psy7537 136 KQFDSQAETKKQFDSQAETKKQFDS 160 (224)
Q Consensus 136 ~sadsqAE~KrlsDe~AalKkQlDS 160 (224)
+-.- +|.-|+.++++.+++|+|-
T Consensus 172 ~~Lk--~en~rl~~~l~~~r~~ld~ 194 (546)
T KOG0977|consen 172 KRLK--AENSRLREELARARKQLDD 194 (546)
T ss_pred HHHH--HHhhhhHHHHHHHHHHHHH
Confidence 1111 4555666777777776653
No 8
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=80.32 E-value=40 Score=31.63 Aligned_cols=17 Identities=18% Similarity=0.235 Sum_probs=7.0
Q ss_pred HHhhhhchHHHHHHhhc
Q psy7537 143 ETKKQFDSQAETKKQFD 159 (224)
Q Consensus 143 E~KrlsDe~AalKkQlD 159 (224)
++..+-.++..+..+++
T Consensus 434 ~~~~~~~~~~~~~~~~~ 450 (1179)
T TIGR02168 434 ELKELQAELEELEEELE 450 (1179)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444444444444433
No 9
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=73.97 E-value=1e+02 Score=31.76 Aligned_cols=25 Identities=12% Similarity=0.315 Sum_probs=12.2
Q ss_pred HHhhchHHHHHHHHHHhHHHHHhhh
Q psy7537 175 KKQFDSQAEAKKHLAQRYDDFMKEL 199 (224)
Q Consensus 175 KKQlDSnAqsKkhlaqryddfmkel 199 (224)
+-++....+.+..|..+.+.+..++
T Consensus 873 klkl~~~l~~r~~le~~L~el~~el 897 (1311)
T TIGR00606 873 KLQIGTNLQRRQQFEEQLVELSTEV 897 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555555555554443
No 10
>PRK09039 hypothetical protein; Validated
Probab=72.97 E-value=36 Score=30.36 Aligned_cols=40 Identities=10% Similarity=0.077 Sum_probs=30.0
Q ss_pred HHHhhhhchHHHHHHhhc-cHHHHhhhhchHHHHHHhhchH
Q psy7537 142 AETKKQFDSQAETKKQFD-SQAETKKQLDSQAEMKKQFDSQ 181 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlD-SnAe~KrlsDe~AalKKQlDSn 181 (224)
+..-.+.++.++++.... .++++.+|..++++|+.|+.+-
T Consensus 116 ~~~~~l~~~L~~~k~~~se~~~~V~~L~~qI~aLr~Qla~l 156 (343)
T PRK09039 116 GRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAAL 156 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 445556677777776665 5689999999999999997654
No 11
>KOG1962|consensus
Probab=72.80 E-value=8.1 Score=33.80 Aligned_cols=81 Identities=20% Similarity=0.311 Sum_probs=45.3
Q ss_pred hhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH-HHHh
Q psy7537 87 KKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ-AETK 165 (224)
Q Consensus 87 K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn-Ae~K 165 (224)
+..+.+|.++++++.+... -.+|......+..-|+.+|. ++. -+.++.-....+++||++.. -|--
T Consensus 130 ~~~~~~~~~lk~~~~~~~~----~~~~~~~~~~~~~kL~~el~----~~~-----~~Le~~~~~~~al~Kq~e~~~~Eyd 196 (216)
T KOG1962|consen 130 EKAMKENEALKKQLENSSK----LEEENDKLKADLEKLETELE----KKQ-----KKLEKAQKKVDALKKQSEGLQDEYD 196 (216)
T ss_pred HHHHHHHHHHHHhhhcccc----hhhhHHHHHhhHHHHHHHHH----HHH-----HHHHHHHHHHHHHHHHHHHcccHHH
Confidence 5566677777777655322 22222222222222222222 221 22334444567788888775 7888
Q ss_pred hhhchHHHHHHhhch
Q psy7537 166 KQLDSQAEMKKQFDS 180 (224)
Q Consensus 166 rlsDe~AalKKQlDS 180 (224)
|++++...|..|+++
T Consensus 197 rLlee~~~Lq~~i~~ 211 (216)
T KOG1962|consen 197 RLLEEYSKLQEQIES 211 (216)
T ss_pred HHHHHHHHHHHHHhc
Confidence 888888888888765
No 12
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=71.46 E-value=1.1e+02 Score=30.02 Aligned_cols=126 Identities=18% Similarity=0.258 Sum_probs=65.7
Q ss_pred hhhhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhh-------hcc---hHH
Q psy7537 53 ETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKK-------QVD---SQA 122 (224)
Q Consensus 53 eT~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~Kr-------lsD---e~A 122 (224)
+.+...+..+..+++.||+..-..-. ..+++++.+..+-+.+...+.........--.+++. |.| -++
T Consensus 320 ~~~~~~~~~el~~l~~~l~~l~~~i~--~~~~~~~~l~~~~~q~~~e~~~~~~~~~~le~~~~l~~k~~~lL~d~e~ni~ 397 (594)
T PF05667_consen 320 EDEQEEQEQELEELQEQLDELESQIE--ELEAEIKMLKSSLKQLEEELEEKEAENEELEEELKLKKKTVELLPDAEENIA 397 (594)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHH
Confidence 34456777777888888776544433 233445554444333333222211111111111110 111 135
Q ss_pred HHHHhhhhhhhhhhccCcHHHHh--hhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhch
Q psy7537 123 ETKKQLDSQAETKKQFDSQAETK--KQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 123 alKkQLDSnAqsk~sadsqAE~K--rlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDS 180 (224)
.|+..+++.++-.-..-.|=|-. .|.++...|+...+.. .+.++..+++..|+.+...
T Consensus 398 kL~~~v~~s~~rl~~L~~qWe~~R~pL~~e~r~lk~~~~~~~~e~~~~~~~ik~~r~~~k~ 458 (594)
T PF05667_consen 398 KLQALVEASEQRLVELAQQWEKHRAPLIEEYRRLKEKASNRESESKQKLQEIKELREEIKE 458 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHH
Confidence 56666666666555555444443 3667788888766644 6677777777777655543
No 13
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=69.88 E-value=12 Score=27.08 Aligned_cols=62 Identities=16% Similarity=0.273 Sum_probs=39.7
Q ss_pred hcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhch
Q psy7537 117 QVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 117 lsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDS 180 (224)
|-.+++.|...||+--.--+. .+.+.|.|..|....-.+|..- .+.-+|.+++.+|+++|+.
T Consensus 3 Lea~~~~Lr~rLd~~~rk~~~--~~~~~k~L~~ERd~~~~~l~~a~~e~~~Lk~E~e~L~~el~~ 65 (69)
T PF14197_consen 3 LEAEIATLRNRLDSLTRKNSV--HEIENKRLRRERDSAERQLGDAYEENNKLKEENEALRKELEE 65 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344667777777764332222 2477787777776666666544 5666777778888877654
No 14
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=69.78 E-value=31 Score=28.65 Aligned_cols=17 Identities=6% Similarity=0.309 Sum_probs=6.9
Q ss_pred hhhhhcchHHHHHHhhh
Q psy7537 113 ETKKQVDSQAETKKQLD 129 (224)
Q Consensus 113 E~KrlsDe~AalKkQLD 129 (224)
|...+..+...+++|++
T Consensus 50 e~~~L~~e~~~l~~e~e 66 (251)
T PF11932_consen 50 EKQELLAEYRQLEREIE 66 (251)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33344444444444433
No 15
>PF12711 Kinesin-relat_1: Kinesin motor; InterPro: IPR024658 Kinesin [, , ] is a microtubule-associated force-producing protein that may play a role in organelle transport. The kinesin motor activity is directed toward the microtubule's plus end. Kinesin is an oligomeric complex composed of two heavy chains and two light chains. The maintenance of the quaternary structure does not require interchain disulphide bonds. The heavy chain is composed of three structural domains: a large globular N-terminal domain which is responsible for the motor activity of kinesin (it is known to hydrolyse ATP, to bind and move on microtubules), a central alpha-helical coiled coil domain that mediates the heavy chain dimerisation; and a small globular C-terminal domain which interacts with other proteins (such as the kinesin light chains), vesicles and membranous organelles. A number of proteins have been recently found that contain a domain similar to that of the kinesin 'motor' domain [, ]: Drosophila melanogaster claret segregational protein (ncd). Ncd is required for normal chromosomal segregation in meiosis, in females, and in early mitotic divisions of the embryo. The ncd motor activity is directed toward the microtubule's minus end. Homo sapiens CENP-E []. CENP-E is a protein that associates with kinetochores during chromosome congression, relocates to the spindle midzone at anaphase, and is quantitatively discarded at the end of the cell division. CENP-E is probably an important motor molecule in chromosome movement and/or spindle elongation. H. sapiens mitotic kinesin-like protein-1 (MKLP-1), a motor protein whose activity is directed toward the microtubule's plus end. Saccharomyces cerevisiae KAR3 protein, which is essential for nuclear fusion during mating. KAR3 may mediate microtubule sliding during nuclear fusion and possibly mitosis. S. cerevisiae CIN8 and KIP1 proteins which are required for the assembly of the mitotic spindle. Both proteins seem to interact with spindle microtubules to produce an outwardly directed force acting upon the poles. Emericella nidulans (Aspergillus nidulans) bimC, which plays an important role in nuclear division. A. nidulans klpA. Caenorhabditis elegans unc-104, which may be required for the transport of substances needed for neuronal cell differentiation. C. elegans osm-3. Xenopus laevis Eg5, which may be involved in mitosis. Arabidopsis thaliana KatA, KatB and katC. Chlamydomonas reinhardtii FLA10/KHP1 and KLP1. Both proteins seem to play a role in the rotation or twisting of the microtubules of the flagella. C. elegans hypothetical protein T09A5.2. Kinesin-like proteins KLP2 (or KIF15) also contain a kinesin 'motor' domain. They are involved in mitotic spindle assembly, playing a role in positioning spindle poles during mitosis, specifically at prometaphase []. This entry represents a domain of unknown function found in this type of kinesin-like proteins.
Probab=67.57 E-value=14 Score=28.37 Aligned_cols=37 Identities=8% Similarity=0.159 Sum_probs=22.2
Q ss_pred HHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhh
Q psy7537 142 AETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQF 178 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQl 178 (224)
.|.+.|.+||.-|..|+|-|.++-|..-|+-.|+.|+
T Consensus 24 ~e~~~L~eEI~~Lr~qve~nPevtr~A~EN~rL~ee~ 60 (86)
T PF12711_consen 24 EENEALKEEIQLLREQVEHNPEVTRFAMENIRLREEL 60 (86)
T ss_pred HHHHHHHHHHHHHHHHHHhCHHHHHHHHHHHHHHHHH
Confidence 3446666666666666666666666555555555543
No 16
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=65.30 E-value=60 Score=28.34 Aligned_cols=48 Identities=25% Similarity=0.299 Sum_probs=25.3
Q ss_pred HHHhhhhchHHHHHHhhchH----HHHHHHHHHhHHHHHhhhccc--ccCccce
Q psy7537 162 AETKKQLDSQAEMKKQFDSQ----AEAKKHLAQRYDDFMKELDYK--ETSGTTI 209 (224)
Q Consensus 162 Ae~KrlsDe~AalKKQlDSn----AqsKkhlaqryddfmkeldyk--etsgtti 209 (224)
++...+..+|+++++.+.-+ ..--..|-.+|+..-+-.+++ ..+|+++
T Consensus 251 ~~k~~l~~eI~e~~~~~~~~r~~t~~Ev~~Lk~~~~~Le~~~gw~~~~~~~~~l 304 (325)
T PF08317_consen 251 EQKQELLAEIAEAEKIREECRGWTRSEVKRLKAKVDALEKLTGWKIVSISGSTL 304 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHCcEEEEEeCCeE
Confidence 34445566666666655411 122344666777665555544 4455554
No 17
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=65.01 E-value=22 Score=29.50 Aligned_cols=88 Identities=15% Similarity=0.244 Sum_probs=49.7
Q ss_pred hHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcH
Q psy7537 62 NQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQ 141 (224)
Q Consensus 62 ~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsq 141 (224)
.+..+..+|+....-+.++ ..++..+...-..++.++..-. +..+.+-.++..+++.||.-.-.+-.+ +
T Consensus 55 el~~lr~~id~~~~eka~l--~~e~~~l~~e~~~~r~k~e~e~-------~~~~~le~el~~lrk~ld~~~~~r~~l--e 123 (312)
T PF00038_consen 55 ELRELRRQIDDLSKEKARL--ELEIDNLKEELEDLRRKYEEEL-------AERKDLEEELESLRKDLDEETLARVDL--E 123 (312)
T ss_dssp HHHCHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHH--H
T ss_pred HHHHhHHhhhhHHHHhhHH--hhhhhhHHHHHHHHHHHHHHHH-------HHHHHHHHHHhhhhhhhhhhhhhHhHH--H
Confidence 3444555555555544422 2344444445555555554443 334445556677788888555444444 4
Q ss_pred HHHhhhhchHHHHHHhhcc
Q psy7537 142 AETKKQFDSQAETKKQFDS 160 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDS 160 (224)
.++..|-++++-+++..+.
T Consensus 124 ~~i~~L~eEl~fl~~~hee 142 (312)
T PF00038_consen 124 NQIQSLKEELEFLKQNHEE 142 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhhh
Confidence 7777888888777766554
No 18
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=63.97 E-value=22 Score=28.30 Aligned_cols=39 Identities=23% Similarity=0.392 Sum_probs=28.4
Q ss_pred HHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHHHhhhc
Q psy7537 162 AETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKELD 200 (224)
Q Consensus 162 Ae~KrlsDe~AalKKQlDSnAqsKkhlaqryddfmkeld 200 (224)
.+.++...|++.+|+++......-.-|......+-+|.|
T Consensus 154 ~~~~~~~~ei~~lk~el~~~~~~~~~LkkQ~~~l~~eyd 192 (192)
T PF05529_consen 154 EENKKLSEEIEKLKKELEKKEKEIEALKKQSEGLQKEYD 192 (192)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 456677888888998888876666667666666666654
No 19
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=59.76 E-value=55 Score=25.63 Aligned_cols=79 Identities=18% Similarity=0.238 Sum_probs=36.8
Q ss_pred hhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhcc---HHHHhhhhchHHHHHHhhch--------
Q psy7537 112 AETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDS---QAETKKQLDSQAEMKKQFDS-------- 180 (224)
Q Consensus 112 AE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDS---nAe~KrlsDe~AalKKQlDS-------- 180 (224)
.|+..+-.++..|..+|..-. .+++.+..+++.|...+-. ...+..+..++.+|...|..
T Consensus 72 eel~~ld~ei~~L~~el~~l~---------~~~k~l~~eL~~L~~~~t~~el~~~i~~l~~e~~~l~~kL~~l~~~~~~v 142 (169)
T PF07106_consen 72 EELAELDAEIKELREELAELK---------KEVKSLEAELASLSSEPTNEELREEIEELEEEIEELEEKLEKLRSGSKPV 142 (169)
T ss_pred hhHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC
Confidence 344445555555555554433 3344444444444444311 13333444444444444432
Q ss_pred HHHHHHHHHHhHHHHHhhh
Q psy7537 181 QAEAKKHLAQRYDDFMKEL 199 (224)
Q Consensus 181 nAqsKkhlaqryddfmkel 199 (224)
.++-+..+...|..+.++.
T Consensus 143 s~ee~~~~~~~~~~~~k~w 161 (169)
T PF07106_consen 143 SPEEKEKLEKEYKKWRKEW 161 (169)
T ss_pred CHHHHHHHHHHHHHHHHHH
Confidence 2344556666666666553
No 20
>PF12711 Kinesin-relat_1: Kinesin motor; InterPro: IPR024658 Kinesin [, , ] is a microtubule-associated force-producing protein that may play a role in organelle transport. The kinesin motor activity is directed toward the microtubule's plus end. Kinesin is an oligomeric complex composed of two heavy chains and two light chains. The maintenance of the quaternary structure does not require interchain disulphide bonds. The heavy chain is composed of three structural domains: a large globular N-terminal domain which is responsible for the motor activity of kinesin (it is known to hydrolyse ATP, to bind and move on microtubules), a central alpha-helical coiled coil domain that mediates the heavy chain dimerisation; and a small globular C-terminal domain which interacts with other proteins (such as the kinesin light chains), vesicles and membranous organelles. A number of proteins have been recently found that contain a domain similar to that of the kinesin 'motor' domain [, ]: Drosophila melanogaster claret segregational protein (ncd). Ncd is required for normal chromosomal segregation in meiosis, in females, and in early mitotic divisions of the embryo. The ncd motor activity is directed toward the microtubule's minus end. Homo sapiens CENP-E []. CENP-E is a protein that associates with kinetochores during chromosome congression, relocates to the spindle midzone at anaphase, and is quantitatively discarded at the end of the cell division. CENP-E is probably an important motor molecule in chromosome movement and/or spindle elongation. H. sapiens mitotic kinesin-like protein-1 (MKLP-1), a motor protein whose activity is directed toward the microtubule's plus end. Saccharomyces cerevisiae KAR3 protein, which is essential for nuclear fusion during mating. KAR3 may mediate microtubule sliding during nuclear fusion and possibly mitosis. S. cerevisiae CIN8 and KIP1 proteins which are required for the assembly of the mitotic spindle. Both proteins seem to interact with spindle microtubules to produce an outwardly directed force acting upon the poles. Emericella nidulans (Aspergillus nidulans) bimC, which plays an important role in nuclear division. A. nidulans klpA. Caenorhabditis elegans unc-104, which may be required for the transport of substances needed for neuronal cell differentiation. C. elegans osm-3. Xenopus laevis Eg5, which may be involved in mitosis. Arabidopsis thaliana KatA, KatB and katC. Chlamydomonas reinhardtii FLA10/KHP1 and KLP1. Both proteins seem to play a role in the rotation or twisting of the microtubules of the flagella. C. elegans hypothetical protein T09A5.2. Kinesin-like proteins KLP2 (or KIF15) also contain a kinesin 'motor' domain. They are involved in mitotic spindle assembly, playing a role in positioning spindle poles during mitosis, specifically at prometaphase []. This entry represents a domain of unknown function found in this type of kinesin-like proteins.
Probab=57.39 E-value=16 Score=28.02 Aligned_cols=68 Identities=12% Similarity=0.200 Sum_probs=52.6
Q ss_pred hhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhh
Q psy7537 107 KKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQF 178 (224)
Q Consensus 107 k~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQl 178 (224)
.+....|.+.|.+|+.-|..|+|-|.+--.-| .|-.+|-+++-.++.-- -.-+-.-+..+++.|.+||
T Consensus 19 ~~~~~~e~~~L~eEI~~Lr~qve~nPevtr~A---~EN~rL~ee~rrl~~f~-~~gerE~l~~eis~L~~~l 86 (86)
T PF12711_consen 19 ESYLEEENEALKEEIQLLREQVEHNPEVTRFA---MENIRLREELRRLQSFY-VEGEREMLLQEISELRDQL 86 (86)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHhCHHHHHHH---HHHHHHHHHHHHHHHHH-HhhHHHHHHHHHHHHHhhC
Confidence 33444677999999999999999998876655 68888888888776544 5556666788889988876
No 21
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=56.70 E-value=1.4e+02 Score=26.36 Aligned_cols=12 Identities=8% Similarity=0.141 Sum_probs=4.3
Q ss_pred hhcchHHHHHHh
Q psy7537 116 KQVDSQAETKKQ 127 (224)
Q Consensus 116 rlsDe~AalKkQ 127 (224)
++.-++.++.++
T Consensus 63 ~~e~ei~~~r~r 74 (239)
T COG1579 63 QLESEIQEIRER 74 (239)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 22
>PF15058 Speriolin_N: Speriolin N terminus
Probab=55.67 E-value=7.7 Score=34.04 Aligned_cols=33 Identities=24% Similarity=0.299 Sum_probs=20.5
Q ss_pred HhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhh
Q psy7537 86 TKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAE 133 (224)
Q Consensus 86 ~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAq 133 (224)
..|||.+|.+||||| ||..||-+||+-|-..++
T Consensus 14 ierLv~ENeeLKKlV---------------rLirEN~eLksaL~ea~~ 46 (200)
T PF15058_consen 14 IERLVRENEELKKLV---------------RLIRENHELKSALGEACA 46 (200)
T ss_pred HHHHHhhhHHHHHHH---------------HHHHHHHHHHHHHHHhhc
Confidence 456777777777765 556666666665544333
No 23
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=54.81 E-value=1.7e+02 Score=27.69 Aligned_cols=48 Identities=8% Similarity=0.156 Sum_probs=23.6
Q ss_pred hhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhc
Q psy7537 112 AETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFD 159 (224)
Q Consensus 112 AE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlD 159 (224)
.++..+.+++..+..+||.-.+.=..+++..+++.+.++...+.+.++
T Consensus 391 ~~~~~~~~~~~~~e~el~~l~~~l~~~~~~e~i~~l~e~l~~l~~~l~ 438 (650)
T TIGR03185 391 DAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELF 438 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555544444444444455555554444444444
No 24
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=53.69 E-value=1.8e+02 Score=29.43 Aligned_cols=111 Identities=17% Similarity=0.243 Sum_probs=67.1
Q ss_pred hhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhh--------------hhcch
Q psy7537 55 ETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETK--------------KQVDS 120 (224)
Q Consensus 55 ~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~K--------------rlsDe 120 (224)
.+.-++.+...=-+|+...+..-. +|.-||...+-.+++-...--.-|+..-+|+| .|-.|
T Consensus 31 ~~~~~i~~l~~elk~~~~~~~~~~-----~e~~rl~~~~~~~~~~~~~~e~~~~~lr~e~ke~K~rE~rll~dyselEeE 105 (717)
T PF09730_consen 31 YLQQRILELENELKQLRQELSNVQ-----AENERLSQLNQELRKECEDLELERKRLREEIKEYKFREARLLQDYSELEEE 105 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHH
Confidence 333333333333344444443333 66666666666666543333333333344443 24457
Q ss_pred HHHHHHhhhhhhhhhhccCc-HHHHhhhhchHHHHHHhhccHHHHhhhhch
Q psy7537 121 QAETKKQLDSQAETKKQFDS-QAETKKQFDSQAETKKQFDSQAETKKQLDS 170 (224)
Q Consensus 121 ~AalKkQLDSnAqsk~sads-qAE~KrlsDe~AalKkQlDSnAe~KrlsDe 170 (224)
|-.|.||+.+--++---|++ .-|++|+-+++.-|+-|++--...|+.+..
T Consensus 106 NislQKqvs~Lk~sQvefE~~Khei~rl~Ee~~~l~~qlee~~rLk~iae~ 156 (717)
T PF09730_consen 106 NISLQKQVSVLKQSQVEFEGLKHEIKRLEEEIELLNSQLEEAARLKEIAEK 156 (717)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77788888877776666654 358999999999999999988877775543
No 25
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=53.46 E-value=43 Score=24.22 Aligned_cols=39 Identities=26% Similarity=0.291 Sum_probs=23.4
Q ss_pred HHHHhhhhchHHHHHHhh-ccH---HHHhhhhchHHHHHHhhc
Q psy7537 141 QAETKKQFDSQAETKKQF-DSQ---AETKKQLDSQAEMKKQFD 179 (224)
Q Consensus 141 qAE~KrlsDe~AalKkQl-DSn---Ae~KrlsDe~AalKKQlD 179 (224)
+++.+.+|.+++.++++= |-. ++++.+.+++.++..++.
T Consensus 49 r~~rN~~sk~I~~~~~~~~~~~~l~~e~~~lk~~i~~le~~~~ 91 (108)
T PF02403_consen 49 RAERNELSKEIGKLKKAGEDAEELKAEVKELKEEIKELEEQLK 91 (108)
T ss_dssp HHHHHHHHHHHHHHCHTTCCTHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhHHHHHHHHHhhCcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 367777788888777764 111 455555555555555543
No 26
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=53.17 E-value=31 Score=26.77 Aligned_cols=50 Identities=18% Similarity=0.353 Sum_probs=25.0
Q ss_pred hhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchH----------------HHHHHHHHHhHHH
Q psy7537 145 KKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQ----------------AEAKKHLAQRYDD 194 (224)
Q Consensus 145 KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSn----------------AqsKkhlaqrydd 194 (224)
..+.+++.+||+++..- .|-.+|.-|+..|+..|+-- -+++..|+.-|++
T Consensus 18 ~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~~~~~~~~~~~~~~~~~g~~NL~~LY~E 84 (107)
T PF06156_consen 18 GQLLEELEELKKQLQELLEENARLRIENEHLRERLEELEQEEEEKEEKKTKKKLGEGRDNLARLYQE 84 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccchHHHHHHHHhc
Confidence 34445555555554433 33334444555555554432 2466677777654
No 27
>cd00938 HisRS_RNA HisRS_RNA binding domain. This short RNA-binding domain is found at the N-terminus of HisRS in several higher eukaryote aminoacyl-tRNA synthetases (aaRSs). This domain consists of a helix- turn- helix structure, which is similar to other RNA-binding proteins. It is involved in both protein-RNA interactions by binding tRNA and protein-protein interactions, which are important for the formation of aaRSs into multienzyme complexes.
Probab=52.41 E-value=8.5 Score=26.90 Aligned_cols=35 Identities=37% Similarity=0.488 Sum_probs=23.7
Q ss_pred hcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhh
Q psy7537 90 VDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQL 128 (224)
Q Consensus 90 ~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQL 128 (224)
+..+|+.-.++-+.-.++..+|+||++|. +||+||
T Consensus 7 I~~QGe~VRkLKa~KA~k~~i~~eV~~LL----~LKaql 41 (45)
T cd00938 7 VKLQGELVRKLKAEKASKEQIAEEVAKLL----ELKAQL 41 (45)
T ss_pred HHHHHHHHHHHHHccCCHHHHHHHHHHHH----HHHHHh
Confidence 34456666666677777777888888776 466665
No 28
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=50.57 E-value=1.4e+02 Score=24.57 Aligned_cols=72 Identities=15% Similarity=0.169 Sum_probs=38.1
Q ss_pred HHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhh
Q psy7537 85 ETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQF 158 (224)
Q Consensus 85 e~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQl 158 (224)
-+++||..-=..=.-++|-+-.......|+.+|...+..|-.|.-...+.+..+. -++-.+-|..-.-++.|
T Consensus 23 ~v~~LmP~VV~vLE~Le~~~~~n~~~~~e~~~L~~d~e~L~~q~~~ek~~r~~~e--~~l~~~Ed~~~~e~k~L 94 (158)
T PF09744_consen 23 AVKGLMPKVVRVLELLESLASRNQEHEVELELLREDNEQLETQYEREKELRKQAE--EELLELEDQWRQERKDL 94 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH
Confidence 3455555444333335665555556666666666666666666665555555554 33443344444433333
No 29
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=50.44 E-value=79 Score=25.04 Aligned_cols=25 Identities=12% Similarity=0.225 Sum_probs=10.5
Q ss_pred cchhhhhhhhchhhhhhhcchHHHH
Q psy7537 100 VDSQAESKKQVDAETKKQVDSQAET 124 (224)
Q Consensus 100 ~dS~aqSk~saDAE~KrlsDe~Aal 124 (224)
+..-...|..+-+|+-++..++.++
T Consensus 39 l~~l~~~r~~l~~Eiv~l~~~~e~~ 63 (120)
T PF12325_consen 39 LARLEAERDELREEIVKLMEENEEL 63 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333334444444444444444433
No 30
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=49.17 E-value=63 Score=30.96 Aligned_cols=83 Identities=14% Similarity=0.220 Sum_probs=45.0
Q ss_pred chhhhhhhcchHHHHHHhhhh-------------------hhhhhhccCcHHHHhhhhchHHHHHHhh-----c-cH-H-
Q psy7537 110 VDAETKKQVDSQAETKKQLDS-------------------QAETKKQFDSQAETKKQFDSQAETKKQF-----D-SQ-A- 162 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLDS-------------------nAqsk~sadsqAE~KrlsDe~AalKkQl-----D-Sn-A- 162 (224)
.-.|..+|-.+++.|..+|.+ |.-+.-.+-.+.++..|-.|+++|..++ + ++ .
T Consensus 515 Le~e~~~L~~~~~~Le~~l~~~~L~g~~~~~~trVL~lr~NP~~~~~~~k~~~l~~L~~En~~L~~~l~~le~~~~~~~~ 594 (722)
T PF05557_consen 515 LERENERLRQELEELESELEKLTLQGEFNPSKTRVLHLRDNPTSKAEQIKKSTLEALQAENEDLLARLRSLEEGNSQPVD 594 (722)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHCCCT--BTTTEEEEEESS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccccCCCCceeeeeCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcc
Confidence 344556666666777766654 1112222334567888888999999888 1 11 1
Q ss_pred -----HHhhhhchHHHHHHhhchHHHHHHHHHHhH
Q psy7537 163 -----ETKKQLDSQAEMKKQFDSQAEAKKHLAQRY 192 (224)
Q Consensus 163 -----e~KrlsDe~AalKKQlDSnAqsKkhlaqry 192 (224)
.+.....++++|++++.+----...|-+-|
T Consensus 595 ~~p~~~~~~~~~e~~~l~~~~~~~ekr~~RLkevf 629 (722)
T PF05557_consen 595 AVPTSSLESQEKEIAELKAELASAEKRNQRLKEVF 629 (722)
T ss_dssp -----------HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 122344567888888876443333343333
No 31
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=48.38 E-value=3.1e+02 Score=28.31 Aligned_cols=49 Identities=14% Similarity=0.187 Sum_probs=25.6
Q ss_pred HhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchHHHHHHHHHHhH
Q psy7537 144 TKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQAEAKKHLAQRY 192 (224)
Q Consensus 144 ~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSnAqsKkhlaqry 192 (224)
++++..++..+..+++.- ++...+..++.++.+.++..-.-+..+....
T Consensus 972 L~~~e~el~~~~~~ie~le~e~~~l~~~i~~l~kel~~~~~~kr~l~dnL 1021 (1311)
T TIGR00606 972 LKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNL 1021 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444443332 5566666666666666666555555544433
No 32
>PF13094 CENP-Q: CENP-Q, a CENPA-CAD centromere complex subunit
Probab=48.36 E-value=1.3e+02 Score=23.43 Aligned_cols=76 Identities=13% Similarity=0.235 Sum_probs=40.6
Q ss_pred cCCCcchhhhhhhhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcch
Q psy7537 43 IESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDS 120 (224)
Q Consensus 43 iesP~a~~~SeT~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe 120 (224)
+--|..+..+......-++..+.+..||..+..+-..|. +|+.+.......-.+++.+=-..-....+|++++...
T Consensus 9 ~~~Pp~~~~~~~~~e~ll~~~~~LE~qL~~~~~~l~lLq--~e~~~~e~~le~d~~~L~~Le~~~~~~~~e~~~~~~~ 84 (160)
T PF13094_consen 9 LPFPPQKREDSFDYEQLLDRKRALERQLAANLHQLELLQ--EEIEKEEAALERDYEYLQELEKNAKALEREREEEEKK 84 (160)
T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 445666677777777777888888888776666665444 3344443333322222332222223334444444443
No 33
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=47.35 E-value=3e+02 Score=29.13 Aligned_cols=33 Identities=9% Similarity=-0.002 Sum_probs=22.4
Q ss_pred hhhhhcchHHHHHHhhhhhhhhhhccCcHHHHh
Q psy7537 113 ETKKQVDSQAETKKQLDSQAETKKQFDSQAETK 145 (224)
Q Consensus 113 E~KrlsDe~AalKkQLDSnAqsk~sadsqAE~K 145 (224)
++.+++.++...|.++|.-.|+-....-|.++-
T Consensus 280 ~~n~l~~~~~~~~~~l~~~~q~~~~i~eQi~~l 312 (1109)
T PRK10929 280 RMDLIASQQRQAASQTLQVRQALNTLREQSQWL 312 (1109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456677777788888887777766555555443
No 34
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=47.32 E-value=1.7e+02 Score=24.39 Aligned_cols=118 Identities=12% Similarity=0.244 Sum_probs=60.3
Q ss_pred HHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccHHH
Q psy7537 84 AETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQAE 163 (224)
Q Consensus 84 Ae~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSnAe 163 (224)
.+...+..+-..+.+|+++-..-.......+..+-.+++.|+.|+++-..++.... .=|.+..+.+...= .+|.---
T Consensus 49 ~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~--p~m~~m~~~L~~~v-~~d~Pf~ 125 (251)
T PF11932_consen 49 DEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELV--PLMEQMIDELEQFV-ELDLPFL 125 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH-hcCCCCC
Confidence 44445555555555555555555555555666666677777777777777766655 44554444433211 2222211
Q ss_pred HhhhhchHHHHHHhhch-HHHHHHHHHHhHHHHHhhhccccc
Q psy7537 164 TKKQLDSQAEMKKQFDS-QAEAKKHLAQRYDDFMKELDYKET 204 (224)
Q Consensus 164 ~KrlsDe~AalKKQlDS-nAqsKkhlaqryddfmkeldyket 204 (224)
...-...++.|+.-++. +.-.-..+..-..-|.-|.+|.-+
T Consensus 126 ~~eR~~Rl~~L~~~l~~~dv~~~ek~r~vlea~~~E~~yg~~ 167 (251)
T PF11932_consen 126 LEERQERLARLRAMLDDADVSLAEKFRRVLEAYQIEMEYGRT 167 (251)
T ss_pred hHHHHHHHHHHHHhhhccCCCHHHHHHHHHHHHHHHHHhCCc
Confidence 11112233555555532 222233344444556677788544
No 35
>PRK13676 hypothetical protein; Provisional
Probab=47.23 E-value=89 Score=23.17 Aligned_cols=68 Identities=16% Similarity=0.208 Sum_probs=48.1
Q ss_pred hhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhhc
Q psy7537 107 KKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQFD 179 (224)
Q Consensus 107 k~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQlD 179 (224)
+-.+|.+.+.+..+--..+.++ .+++..|..-+++++++ ..++..++..|..++.......++-+-+.
T Consensus 31 ~l~~d~~a~~li~~F~~~q~~~-~~~q~~g~~~~~e~~~~----l~~l~~~i~~n~~i~~y~~Ae~~l~~ll~ 98 (114)
T PRK13676 31 AVKADEEAKKLFDEFRALQLEI-QQKQMTGQEITEEEQQK----AQELGQKIQQNELLSKLMEAEQRLSVYIN 98 (114)
T ss_pred HHHcCHHHHHHHHHHHHHHHHH-HHHHHcCCCCCHHHHHH----HHHHHHHHhcCHHHHHHHHHHHHHHHHHH
Confidence 3346888899999998888888 45566666555555554 45677888888888888776666655443
No 36
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=47.01 E-value=1.2e+02 Score=23.15 Aligned_cols=34 Identities=18% Similarity=0.311 Sum_probs=19.4
Q ss_pred hhhhchHHHHHHhhcc-----HHHHhhhhchHHHHHHhh
Q psy7537 145 KKQFDSQAETKKQFDS-----QAETKKQLDSQAEMKKQF 178 (224)
Q Consensus 145 KrlsDe~AalKkQlDS-----nAe~KrlsDe~AalKKQl 178 (224)
|.+.|+.+.++..+.+ ..|+||---|+..||.+|
T Consensus 111 k~~kee~~klk~~~~~~~tq~~~e~rkke~E~~kLk~rL 149 (151)
T PF11559_consen 111 KQEKEELQKLKNQLQQRKTQYEHELRKKEREIEKLKERL 149 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3344444444443332 267777777777777765
No 37
>PRK11281 hypothetical protein; Provisional
Probab=46.71 E-value=1e+02 Score=32.33 Aligned_cols=35 Identities=14% Similarity=0.199 Sum_probs=23.7
Q ss_pred hhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhh
Q psy7537 113 ETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQ 147 (224)
Q Consensus 113 E~KrlsDe~AalKkQLDSnAqsk~sadsqAE~Krl 147 (224)
.+..++.++...|.+||.-.|+-...+.|.++-+.
T Consensus 300 ~~~~l~~~~~~~~~~l~~~~q~~~~i~eqi~~l~~ 334 (1113)
T PRK11281 300 KLNTLTQQNLRVKNWLDRLTQSERNIKEQISVLKG 334 (1113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 45666777777888888777776666656555443
No 38
>PF13166 AAA_13: AAA domain
Probab=45.91 E-value=2.5e+02 Score=26.03 Aligned_cols=53 Identities=17% Similarity=0.212 Sum_probs=24.8
Q ss_pred HHhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHH
Q psy7537 143 ETKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDF 195 (224)
Q Consensus 143 E~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSnAqsKkhlaqryddf 195 (224)
+++++...+..+++.+... .+...+..++.+++.++-+-..+...+......|
T Consensus 418 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~l~~~~~~~~~~~~~iN~~L~~~ 471 (712)
T PF13166_consen 418 EIKELEKEINSLEKKLKKAKEEIKKIEKEIKELEAQLKNTEPAADRINEELKRL 471 (712)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHh
Confidence 3344444444444433322 3444555566667766665444444444333333
No 39
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=45.21 E-value=1.3e+02 Score=23.85 Aligned_cols=85 Identities=14% Similarity=0.162 Sum_probs=57.6
Q ss_pred hHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcH
Q psy7537 62 NQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQ 141 (224)
Q Consensus 62 ~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsq 141 (224)
+.+.++.++..-...|. ++.+|+-+++.+|.+++. ...++..|-.+.++|....+.--+-=|.-.
T Consensus 31 E~~~l~~el~~l~~~r~--~l~~Eiv~l~~~~e~~~~-----------~~~~~~~L~~el~~l~~ry~t~LellGEK~-- 95 (120)
T PF12325_consen 31 ELASLQEELARLEAERD--ELREEIVKLMEENEELRA-----------LKKEVEELEQELEELQQRYQTLLELLGEKS-- 95 (120)
T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHhcchH--
Confidence 45666777777666777 555677777777766633 345677778888888888888877777766
Q ss_pred HHHhhhhchHHHHHHhhccH
Q psy7537 142 AETKKQFDSQAETKKQFDSQ 161 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSn 161 (224)
.++-.|-..++.+|.-+..+
T Consensus 96 E~veEL~~Dv~DlK~myr~Q 115 (120)
T PF12325_consen 96 EEVEELRADVQDLKEMYREQ 115 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 56666666666666555443
No 40
>PF06133 DUF964: Protein of unknown function (DUF964); InterPro: IPR010368 This entry consists of several relatively short bacterial and archaeal hypothetical sequences. It also includes YlbF and YmcA proteins which are involved in the formation of biofilms []. YlbF regulates sporulation prior to stage II, positively controlling the competence regulator ComK at a post-transcriptional level. It may also modulate the translation, stability or activity of ComS and may work together with YmcA to regulate community development [].; PDB: 2IAZ_C 2OEE_A 2OEQ_D 2PIH_A.
Probab=44.75 E-value=80 Score=22.47 Aligned_cols=65 Identities=15% Similarity=0.268 Sum_probs=40.2
Q ss_pred hhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccHHHHhhhhchHHHH
Q psy7537 105 ESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQAETKKQLDSQAEM 174 (224)
Q Consensus 105 qSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSnAe~KrlsDe~Aal 174 (224)
+..-..|.|.+++.++.-.++.++ -+++..|...+ +..-.+..++.+++..|..++.......++
T Consensus 26 ~~~l~~d~e~~~l~~~f~~~q~~~-~~~q~~g~~~~----~e~~~~l~~~~~~l~~~p~v~~y~~ae~~~ 90 (108)
T PF06133_consen 26 EEALEADPEAQKLIEEFQKLQQEL-QNAQMYGKEPP----KEEIEELQELQEELMQNPVVKEYLQAEQAL 90 (108)
T ss_dssp HHHHHCSHHHHHHHHHHHHHHHHH-HHHHHTTSCHH----HHHHHHHHHHHHHHHTSHHHHHHHHHHHHH
T ss_pred HHHHHhCHHHHHHHHHHHHHHHHH-HHHHhhccCCc----HHHHHHHHHHHHHHHcCHHHHHHHHHHHHH
Confidence 344456778888888888887777 45555555443 333344566677777777766665544443
No 41
>KOG0161|consensus
Probab=44.57 E-value=3e+02 Score=31.06 Aligned_cols=111 Identities=21% Similarity=0.331 Sum_probs=74.8
Q ss_pred HHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHH-hhhhchHHHHHHhhccH--
Q psy7537 85 ETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAET-KKQFDSQAETKKQFDSQ-- 161 (224)
Q Consensus 85 e~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~-KrlsDe~AalKkQlDSn-- 161 (224)
.++.|......|...+.....++.-++.....|..+..+++.+|+-...+. -+|.|+ ++..-+++.+++.+.--
T Consensus 1098 ~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ele~l~~~Lee~~~~t---~~q~e~~~k~e~e~~~l~~~leee~~ 1174 (1930)
T KOG0161|consen 1098 QIKELEARIKELEEELEAERASRAKAERQRRDLSEELEELKEELEEQGGTT---AAQLELNKKREAEVQKLRRDLEEETL 1174 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555677777777778888888888889999999999998874332 233443 34455667776665432
Q ss_pred -------HHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHHHhh
Q psy7537 162 -------AETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKE 198 (224)
Q Consensus 162 -------Ae~KrlsDe~AalKKQlDSnAqsKkhlaqryddfmke 198 (224)
.--++..|+.++|--|++---+.|..|..-+.++-.+
T Consensus 1175 ~~e~~~~~lr~~~~~~~~el~~qle~l~~~k~~lekek~~lq~e 1218 (1930)
T KOG0161|consen 1175 DHEAQIEELRKKHADSLAELQEQLEQLQKDKAKLEKEKSDLQRE 1218 (1930)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2235667888888888888777777766665555443
No 42
>KOG0612|consensus
Probab=43.41 E-value=1.3e+02 Score=32.79 Aligned_cols=25 Identities=36% Similarity=0.478 Sum_probs=15.9
Q ss_pred hHHHHHHHhhHHHHhhhhhchHHHHhh
Q psy7537 62 NQAETKKQLESEIETKKQLESEAETKK 88 (224)
Q Consensus 62 ~~a~~kkQL~s~ies~~ql~adAe~K~ 88 (224)
.-..+.++|++.|+-.. ..+.+.++
T Consensus 462 ~~~~~~keL~e~i~~lk--~~~~el~~ 486 (1317)
T KOG0612|consen 462 ELEEMDKELEETIEKLK--SEESELQR 486 (1317)
T ss_pred hHHHHHHHHHHHHHHHH--HHHHHHHH
Confidence 34467778888887776 44445544
No 43
>PF15058 Speriolin_N: Speriolin N terminus
Probab=43.37 E-value=23 Score=31.18 Aligned_cols=15 Identities=20% Similarity=0.215 Sum_probs=8.8
Q ss_pred HhhhhchHHHHHHhh
Q psy7537 144 TKKQFDSQAETKKQF 158 (224)
Q Consensus 144 ~KrlsDe~AalKkQl 158 (224)
|.||-.|+++||||+
T Consensus 14 ierLv~ENeeLKKlV 28 (200)
T PF15058_consen 14 IERLVRENEELKKLV 28 (200)
T ss_pred HHHHHhhhHHHHHHH
Confidence 455666666666654
No 44
>KOG0946|consensus
Probab=42.08 E-value=1.6e+02 Score=31.22 Aligned_cols=76 Identities=16% Similarity=0.198 Sum_probs=39.2
Q ss_pred hhhhhhhhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhh
Q psy7537 49 AIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQL 128 (224)
Q Consensus 49 ~~~SeT~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQL 128 (224)
++++.|+.++|+..-...-.+|+++++.-+ -+++++-..-.+++.++---..-....++|++-+..+.+.+|+|+
T Consensus 634 ~ls~~~eee~~~~~~~k~~e~l~~~~~kyK-----~lI~~lD~~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~ 708 (970)
T KOG0946|consen 634 LLSSKTEEEEQTQLAEKYHEELDDIQQKYK-----GLIRELDYQIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQL 708 (970)
T ss_pred HhcCCCccchhhHHHHHHHHHHHHHHHHHH-----HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666666665555566666665544 333333333334444333333344455556665555555555554
Q ss_pred h
Q psy7537 129 D 129 (224)
Q Consensus 129 D 129 (224)
+
T Consensus 709 ~ 709 (970)
T KOG0946|consen 709 D 709 (970)
T ss_pred H
Confidence 3
No 45
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=40.66 E-value=2.6e+02 Score=27.92 Aligned_cols=61 Identities=21% Similarity=0.216 Sum_probs=35.0
Q ss_pred hhhhhcchHHHHHHhhhhhhhh---hhccCcHHHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhhch
Q psy7537 113 ETKKQVDSQAETKKQLDSQAET---KKQFDSQAETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 113 E~KrlsDe~AalKkQLDSnAqs---k~sadsqAE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQlDS 180 (224)
-.++..+.++.|-+.++.-.+. ..-.-|.|| ++..+|...++.+++ .+...+..+|++++.
T Consensus 601 R~e~a~d~Qe~L~~R~~~vl~~l~~~~P~LS~AE-r~~~~EL~~~~~~l~------~l~~si~~lk~k~~~ 664 (717)
T PF10168_consen 601 RYEEAKDKQEKLMKRVDRVLQLLNSQLPVLSEAE-REFKKELERMKDQLQ------DLKASIEQLKKKLDY 664 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCHHH-HHHHHHHHHHHHHHH------HHHHHHHHHHHHHHH
Confidence 4666677777777777654332 222355677 666666666666553 344445555555554
No 46
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=40.31 E-value=3.2e+02 Score=25.68 Aligned_cols=66 Identities=20% Similarity=0.170 Sum_probs=41.3
Q ss_pred hhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHH------------HhhhhchHHHHHHhhcc
Q psy7537 95 ETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAE------------TKKQFDSQAETKKQFDS 160 (224)
Q Consensus 95 alkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE------------~KrlsDe~AalKkQlDS 160 (224)
..+.++++--.--..+=+|.....++...++.++++-...|..+-++|+ +.-|+.++.++|.-|++
T Consensus 113 ~~k~ele~~~~q~~~~~~eL~~~k~EL~~lr~e~~~~~~~k~~A~~~aeea~~~a~~~~~kve~L~~Ei~~lke~l~~ 190 (522)
T PF05701_consen 113 AWKAELESAREQYASAVAELDSVKQELEKLRQELASALDAKNAALKQAEEAVSAAEENEEKVEELSKEIIALKESLES 190 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555555555555556666777777777777777777666666655543 34456666666666665
No 47
>KOG0933|consensus
Probab=39.87 E-value=3.7e+02 Score=29.25 Aligned_cols=147 Identities=14% Similarity=0.188 Sum_probs=78.5
Q ss_pred hhhhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhh
Q psy7537 53 ETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQA 132 (224)
Q Consensus 53 eT~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnA 132 (224)
+..++.++.++..++..++...---. ..+-++-++.-+.-++.+...+.-+.-.+.--....+--++..+.--++...
T Consensus 786 e~rlkdl~keik~~k~~~e~~~~~~e--k~~~e~e~l~lE~e~l~~e~~~~k~~l~~~~~~~~~l~~e~~~l~~kv~~~~ 863 (1174)
T KOG0933|consen 786 ERRLKDLEKEIKTAKQRAEESSKELE--KRENEYERLQLEHEELEKEISSLKQQLEQLEKQISSLKSELGNLEAKVDKVE 863 (1174)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 34455555555555555543321111 3345555666666666666666666555555555555566666665555554
Q ss_pred hhhhccCcHHHHhhhhchHHHHHHhhccH-HHHhhhh-------chHHHHHHhhchHHHHHHHHHHhHHHHHhhhcccc
Q psy7537 133 ETKKQFDSQAETKKQFDSQAETKKQFDSQ-AETKKQL-------DSQAEMKKQFDSQAEAKKHLAQRYDDFMKELDYKE 203 (224)
Q Consensus 133 qsk~sadsqAE~KrlsDe~AalKkQlDSn-Ae~Krls-------De~AalKKQlDSnAqsKkhlaqryddfmkeldyke 203 (224)
.....+ |+|++.+.+-+-..-++++.+ ...++.+ .+...|-..++.--+.++....+.+...++.+|-.
T Consensus 864 ~~~~~~--~~el~~~k~k~~~~dt~i~~~~~~~e~~~~e~~~~~l~~kkle~e~~~~~~e~~~~~k~v~~l~~k~~wi~ 940 (1174)
T KOG0933|consen 864 KDVKKA--QAELKDQKAKQRDIDTEISGLLTSQEKCLSEKSDGELERKKLEHEVTKLESEKANARKEVEKLLKKHEWIG 940 (1174)
T ss_pred hHHHHH--HHHHHHHHHHHHhhhHHHhhhhhHHHHHHHHhhcccchHHHHHhHHHHhhhhHHHHHHHHHHHHHhccchh
Confidence 443332 366666666555555555554 2222222 33344455566656666666666666666665543
No 48
>KOG0995|consensus
Probab=39.68 E-value=3.8e+02 Score=27.01 Aligned_cols=44 Identities=20% Similarity=0.263 Sum_probs=29.7
Q ss_pred chhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhc
Q psy7537 110 VDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFD 159 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlD 159 (224)
..-|.+++.-++..||+|++-+-- |-++++|.--|..+|++-++
T Consensus 306 kEeE~e~lq~~~d~Lk~~Ie~Q~i------S~~dve~mn~Er~~l~r~l~ 349 (581)
T KOG0995|consen 306 KEEEIEKLQKENDELKKQIELQGI------SGEDVERMNLERNKLKRELN 349 (581)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCC------CHHHHHHHHHHHHHHHHHHH
Confidence 345778888889999999987732 34666666666655555543
No 49
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=38.67 E-value=1.8e+02 Score=22.29 Aligned_cols=18 Identities=11% Similarity=0.239 Sum_probs=8.7
Q ss_pred hhHHHHHHHhhHHHHhhh
Q psy7537 61 DNQAETKKQLESEIETKK 78 (224)
Q Consensus 61 d~~a~~kkQL~s~ies~~ 78 (224)
.-++.+-.+.+.+.+.+.
T Consensus 38 n~i~~Ll~~~~r~~~~~e 55 (151)
T PF11559_consen 38 NCIYDLLQQRDRDMEQRE 55 (151)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 334455555555544444
No 50
>KOG4674|consensus
Probab=37.83 E-value=3e+02 Score=31.06 Aligned_cols=173 Identities=24% Similarity=0.296 Sum_probs=89.1
Q ss_pred HHHhhhhhhhhhcccCC---hhhhHHHHHhhHHH---HhhhccCCCcchhhhh----hh-------hhHhhhhHHHHHHH
Q psy7537 7 AQVLDKLKHEEAKFNVD---SDKIDSLIKKSEEI---LESKKIESPSAAIESE----TE-------TNKQVDNQAETKKQ 69 (224)
Q Consensus 7 aqvldklkheeakfnvd---sdkidslikkseei---leskkiesP~a~~~Se----T~-------~~KQ~d~~a~~kkQ 69 (224)
..+.+.|..+++.|+++ -+.+..++++.+++ .||.++= |.--... ++ ++.-+.+...-.++
T Consensus 1191 ~dL~~sL~~~r~~~q~~a~s~~e~~~i~~~v~~vNll~EsN~~L--Ree~~~~~~k~qEl~~~i~kl~~el~plq~~l~e 1268 (1822)
T KOG4674|consen 1191 DDLQRSLTAERASSQKSAVSDDEHKEILEKVEEVNLLRESNKVL--REENEANLEKIQELRDKIEKLNFELAPLQNELKE 1268 (1822)
T ss_pred HHHHHHHHHHHHhhccchhhhhhhhHHHHHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHhhHhhHHHHHHH
Confidence 34567888899999544 45666777776653 3444330 0000111 11 12223334444556
Q ss_pred hhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhc-hhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhh
Q psy7537 70 LESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQV-DAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQF 148 (224)
Q Consensus 70 L~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~sa-DAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~Krls 148 (224)
|.+.++... |+.+.+-.++-..|++.+.--+.-+.+ -.+..||.+++-.||..|-+ +.-.. ||++..+
T Consensus 1269 l~~e~~~~~-----ael~~l~~e~~~wK~R~q~L~~k~k~~d~~~~~kL~~ei~~Lk~el~~----ke~~~--~el~~~~ 1337 (1822)
T KOG4674|consen 1269 LKAELQEKV-----AELKKLEEENDRWKQRNQDLLEKYKDSDKNDYEKLKSEISRLKEELEE----KENLI--AELKKEL 1337 (1822)
T ss_pred HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHH----HHHHH--HHHHHHH
Confidence 666666666 777777777766666555544442222 34566666667666666651 11111 3444333
Q ss_pred chHH-HHHHhhccHHHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHHHhh
Q psy7537 149 DSQA-ETKKQFDSQAETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKE 198 (224)
Q Consensus 149 De~A-alKkQlDSnAe~KrlsDe~AalKKQlDSnAqsKkhlaqryddfmke 198 (224)
+... ..++|+|. +--+++.+.+++-.--..+-.|++-|..|.+.
T Consensus 1338 ~~~q~~~k~qld~------l~~e~~~lt~~~~ql~~~~~rL~~~~~e~~~q 1382 (1822)
T KOG4674|consen 1338 NRLQEKIKKQLDE------LNNEKANLTKELEQLEDLKTRLAAALSEKNAQ 1382 (1822)
T ss_pred HHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3322 33344443 22344555555555555566666666655543
No 51
>KOG0995|consensus
Probab=37.62 E-value=4.5e+02 Score=26.56 Aligned_cols=111 Identities=27% Similarity=0.304 Sum_probs=66.4
Q ss_pred hhhhhh-hhcccCChhhhHHHHHhhHHHHhhhc-cCCCcchhhhhhhhhH--------hhhhH--------H--HHHHHh
Q psy7537 11 DKLKHE-EAKFNVDSDKIDSLIKKSEEILESKK-IESPSAAIESETETNK--------QVDNQ--------A--ETKKQL 70 (224)
Q Consensus 11 dklkhe-eakfnvdsdkidslikkseeileskk-iesP~a~~~SeT~~~K--------Q~d~~--------a--~~kkQL 70 (224)
+.|+|- +.-|++....|+.|-+...++.|--. +|--.+.++|.-+.-. --+.- + -.-.+|
T Consensus 220 ~Elk~~l~~~~~~i~~~ie~l~~~n~~l~e~i~e~ek~~~~~eslre~~~~L~~D~nK~~~y~~~~~~k~~~~~~~l~~l 299 (581)
T KOG0995|consen 220 DELKHRLEKYFTSIANEIEDLKKTNRELEEMINEREKDPGKEESLREKKARLQDDVNKFQAYVSQMKSKKQHMEKKLEML 299 (581)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHhHHHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 344443 34567789999999888887765321 2222223333322221 11100 0 011345
Q ss_pred hHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhh
Q psy7537 71 ESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDS 130 (224)
Q Consensus 71 ~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDS 130 (224)
.++|+.+- -|.+++.-.+-.|++|++-+-= --++++|+.-|..+|++-|+-
T Consensus 300 ~~Eie~kE-----eE~e~lq~~~d~Lk~~Ie~Q~i----S~~dve~mn~Er~~l~r~l~~ 350 (581)
T KOG0995|consen 300 KSEIEEKE-----EEIEKLQKENDELKKQIELQGI----SGEDVERMNLERNKLKRELNK 350 (581)
T ss_pred HHHHHHHH-----HHHHHHHHHHHHHHHHHHhcCC----CHHHHHHHHHHHHHHHHHHHH
Confidence 56666554 7888888889999999988733 346788888888888777653
No 52
>PRK10884 SH3 domain-containing protein; Provisional
Probab=37.42 E-value=1.9e+02 Score=24.56 Aligned_cols=26 Identities=4% Similarity=0.063 Sum_probs=16.1
Q ss_pred hhhchhhhhhhcchHHHHHHhhhhhh
Q psy7537 107 KKQVDAETKKQVDSQAETKKQLDSQA 132 (224)
Q Consensus 107 k~saDAE~KrlsDe~AalKkQLDSnA 132 (224)
..++-.-+.++..++++++.+|+.--
T Consensus 88 ~p~~~~rlp~le~el~~l~~~l~~~~ 113 (206)
T PRK10884 88 TPSLRTRVPDLENQVKTLTDKLNNID 113 (206)
T ss_pred CccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555666677777777776543
No 53
>KOG4674|consensus
Probab=35.89 E-value=7e+02 Score=28.35 Aligned_cols=92 Identities=26% Similarity=0.298 Sum_probs=52.0
Q ss_pred chhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH----HHHhhhhch-HHHHHHhhchHHHH
Q psy7537 110 VDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ----AETKKQLDS-QAEMKKQFDSQAEA 184 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn----Ae~KrlsDe-~AalKKQlDSnAqs 184 (224)
.-||++.+--++-..|++-+.--+.-+..| ..+..||.++|-.||..|-+. +|.+..... +..+++|+|.--..
T Consensus 1276 ~~ael~~l~~e~~~wK~R~q~L~~k~k~~d-~~~~~kL~~ei~~Lk~el~~ke~~~~el~~~~~~~q~~~k~qld~l~~e 1354 (1822)
T KOG4674|consen 1276 KVAELKKLEEENDRWKQRNQDLLEKYKDSD-KNDYEKLKSEISRLKEELEEKENLIAELKKELNRLQEKIKKQLDELNNE 1354 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346777787777777777766666644444 356677777777777666544 443332222 13444555554444
Q ss_pred H----------HHHHHhHHHHHhhhccc
Q psy7537 185 K----------KHLAQRYDDFMKELDYK 202 (224)
Q Consensus 185 K----------khlaqryddfmkeldyk 202 (224)
+ +-+-.||-.-..++--+
T Consensus 1355 ~~~lt~~~~ql~~~~~rL~~~~~e~~~q 1382 (1822)
T KOG4674|consen 1355 KANLTKELEQLEDLKTRLAAALSEKNAQ 1382 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3 34445555444444444
No 54
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=35.46 E-value=5.2e+02 Score=26.70 Aligned_cols=35 Identities=11% Similarity=0.316 Sum_probs=17.3
Q ss_pred cchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhh
Q psy7537 100 VDSQAESKKQVDAETKKQVDSQAETKKQLDSQAET 134 (224)
Q Consensus 100 ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqs 134 (224)
+......-.++.....++-++...++.++....+.
T Consensus 644 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 678 (1201)
T PF12128_consen 644 ITQAEQELKQAEQDLQRLKNEREQLKQEIEEAKEE 678 (1201)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333334445555555555555555555544333
No 55
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=35.30 E-value=61 Score=26.11 Aligned_cols=41 Identities=24% Similarity=0.383 Sum_probs=25.9
Q ss_pred HHHhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchHH
Q psy7537 142 AETKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQA 182 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSnA 182 (224)
..+..++++++++|+.|-+. ..+-+-+|++.++-++...+.
T Consensus 50 kql~~vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV 91 (126)
T PF07889_consen 50 KQLEQVSESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEV 91 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 44667777777777777655 445555666666666665543
No 56
>KOG0240|consensus
Probab=34.62 E-value=5.1e+02 Score=26.35 Aligned_cols=33 Identities=18% Similarity=0.160 Sum_probs=17.6
Q ss_pred hhcchhhhhhhhchhhhhhhcchHHHHHHhhhh
Q psy7537 98 KQVDSQAESKKQVDAETKKQVDSQAETKKQLDS 130 (224)
Q Consensus 98 KQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDS 130 (224)
.+..+....+++.+.+-.++..+++-|-+|||+
T Consensus 400 ~~~~~i~~~~~~~~~~~~~~~e~~~~L~qqlD~ 432 (607)
T KOG0240|consen 400 EEEMSITKLKGSLEEEEDILTERIESLYQQLDQ 432 (607)
T ss_pred hhhhhhhhcccchHHHHHHHHHHHHHHHHHHHH
Confidence 344444455555555555555555555555553
No 57
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=34.06 E-value=3.8e+02 Score=25.78 Aligned_cols=45 Identities=20% Similarity=0.300 Sum_probs=24.1
Q ss_pred HhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhh
Q psy7537 86 TKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDS 130 (224)
Q Consensus 86 ~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDS 130 (224)
+...-+....|.+++.+....-.+++++.++..+.+.++.++++.
T Consensus 54 i~~~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~ 98 (420)
T COG4942 54 IREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIAD 98 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHH
Confidence 333344444555555555555555666666555555555555544
No 58
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=33.32 E-value=5.5e+02 Score=26.32 Aligned_cols=37 Identities=14% Similarity=0.335 Sum_probs=20.4
Q ss_pred hhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhh
Q psy7537 95 ETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQ 131 (224)
Q Consensus 95 alkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSn 131 (224)
++..++.+..+.......++..+-+++..++.++++.
T Consensus 811 ~~~~~~~~~~~~~~~~~~ei~~l~~~~~~~~~~~~~l 847 (1163)
T COG1196 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555555555556666655555555555543
No 59
>PF00846 Hanta_nucleocap: Hantavirus nucleocapsid protein; InterPro: IPR002214 Hantaviruses are ssRNA negative-strand viruses. The nucleocapsid protein is an internal protein of the virus particle [, ].; GO: 0019013 viral nucleocapsid; PDB: 2IC9_A 2IC6_A 2K48_A 4FI5_A.
Probab=31.91 E-value=1.1e+02 Score=29.79 Aligned_cols=103 Identities=26% Similarity=0.368 Sum_probs=35.5
Q ss_pred hhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhc-cHHHHhhhhchHHHHHHhhchHHHHHH-----
Q psy7537 113 ETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFD-SQAETKKQLDSQAEMKKQFDSQAEAKK----- 186 (224)
Q Consensus 113 E~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlD-SnAe~KrlsDe~AalKKQlDSnAqsKk----- 186 (224)
.++++-+|+.....||-.--|--+.|..+.|.- -|++. |.-+. ..++++-+-+.+++||+|+--...+-+
T Consensus 3 ~~~elq~e~~~~E~qL~~a~qkl~da~~~~e~d--pD~~n--k~~~~~R~~~v~~~~~Ki~elkr~lAd~v~~~k~~~~~ 78 (428)
T PF00846_consen 3 TLEELQEEITQHEQQLVIARQKLKDAEKQYEKD--PDDVN--KSTLQQRQSVVSALQDKIAELKRQLADRVAAGKQSAKP 78 (428)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH----
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ChHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCC
Confidence 466777777777777776655555555444322 22221 11111 225555666666666666644433333
Q ss_pred ----------HHHH----hHHHHH--hhhcccccCccceecc--eeeeEee
Q psy7537 187 ----------HLAQ----RYDDFM--KELDYKETSGTTIHHG--IRYRVQF 219 (224)
Q Consensus 187 ----------hlaq----ryddfm--keldyketsgttihhg--iryrvqf 219 (224)
||.+ ||-..+ .-+|..|.||-|-... +-|-+.|
T Consensus 79 ~dptG~epdDhl~~kS~LrYGnvlD~N~idleEPsGqtAdW~~i~~Yi~~f 129 (428)
T PF00846_consen 79 VDPTGVEPDDHLKEKSMLRYGNVLDVNPIDLEEPSGQTADWLSIITYIISF 129 (428)
T ss_dssp ---------------------------------------------------
T ss_pred CCCCCCCCchhhhHhhhccccceeccccccccCCCCCCCcHHHHHHHHHHH
Confidence 4444 344432 3578889999875443 3444444
No 60
>PRK11020 hypothetical protein; Provisional
Probab=31.58 E-value=71 Score=26.33 Aligned_cols=47 Identities=13% Similarity=0.148 Sum_probs=30.3
Q ss_pred chhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhcc
Q psy7537 110 VDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDS 160 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDS 160 (224)
.-+|.++|+|...++..+|+. |+.++..+ =+..+-+|++.|-+++-+
T Consensus 3 ~K~Eiq~L~drLD~~~~Klaa-a~~rgd~~---~i~qf~~E~~~l~k~I~~ 49 (118)
T PRK11020 3 EKNEIKRLSDRLDAIRHKLAA-ASLRGDAE---KYAQFEKEKATLEAEIAR 49 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHhcCCHH---HHHHHHHHHHHHHHHHHH
Confidence 457899999999999999875 34444332 334445666666655544
No 61
>PF01519 DUF16: Protein of unknown function DUF16; InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=31.15 E-value=1.2e+02 Score=24.34 Aligned_cols=37 Identities=19% Similarity=0.213 Sum_probs=19.6
Q ss_pred hhhchHHHHHHhhchHHHHHHHHHHhHHHHHhhhccc
Q psy7537 166 KQLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKELDYK 202 (224)
Q Consensus 166 rlsDe~AalKKQlDSnAqsKkhlaqryddfmkeldyk 202 (224)
+|...+.++.-+.+++-+.=..+.++..+|-+.||--
T Consensus 64 ~QGEqIkel~~e~k~qgktL~~I~~~L~~inkRLD~~ 100 (102)
T PF01519_consen 64 KQGEQIKELQVEQKAQGKTLQLILKTLQSINKRLDKM 100 (102)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3444444444444444444455566666666666643
No 62
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=30.67 E-value=5.4e+02 Score=25.42 Aligned_cols=27 Identities=19% Similarity=0.346 Sum_probs=12.1
Q ss_pred hhchHHHHHHhhchHHHHHHHHHHhHH
Q psy7537 167 QLDSQAEMKKQFDSQAEAKKHLAQRYD 193 (224)
Q Consensus 167 lsDe~AalKKQlDSnAqsKkhlaqryd 193 (224)
++.++++|+-.|.+--..++.|..+|.
T Consensus 179 ltne~~elt~~lq~Eq~~~keL~~kl~ 205 (617)
T PF15070_consen 179 LTNENMELTSALQSEQHVKKELQKKLG 205 (617)
T ss_pred HHHhhhHhhHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444443
No 63
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=30.12 E-value=1.3e+02 Score=21.64 Aligned_cols=41 Identities=29% Similarity=0.338 Sum_probs=17.3
Q ss_pred HHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhh
Q psy7537 84 AETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQL 128 (224)
Q Consensus 84 Ae~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQL 128 (224)
++.+.++-.-|.++++- ......-+|++.+.+++.++..++
T Consensus 50 ~~rN~~sk~I~~~~~~~----~~~~~l~~e~~~lk~~i~~le~~~ 90 (108)
T PF02403_consen 50 AERNELSKEIGKLKKAG----EDAEELKAEVKELKEEIKELEEQL 90 (108)
T ss_dssp HHHHHHHHHHHHHCHTT----CCTHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHhHHHHHHHHHhhCc----ccHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445544444444421 222333344444444444444443
No 64
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=29.97 E-value=5.6e+02 Score=26.20 Aligned_cols=39 Identities=26% Similarity=0.231 Sum_probs=28.8
Q ss_pred hhhHhhhhHHHHHHHhhHHHHhhhhhchHHHHhhhhcchhhhhh
Q psy7537 55 ETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKK 98 (224)
Q Consensus 55 ~~~KQ~d~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ngalkK 98 (224)
...|--.+...+|.||+.-++.+. +---|.+-..||||.
T Consensus 18 gwekae~e~~~lk~~l~~~~~~~~-----~~e~r~~hld~aLke 56 (769)
T PF05911_consen 18 GWEKAEAEAASLKQQLEAATQQKL-----ALEDRVSHLDGALKE 56 (769)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHhH-----HHHHHhhhhhHHHHH
Confidence 344556677889999999988887 444566777888885
No 65
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=29.79 E-value=1.2e+02 Score=26.50 Aligned_cols=70 Identities=19% Similarity=0.263 Sum_probs=41.1
Q ss_pred hhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH----HHHhhhhchHHHHHHhhch-HHHHHHHH
Q psy7537 114 TKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ----AETKKQLDSQAEMKKQFDS-QAEAKKHL 188 (224)
Q Consensus 114 ~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn----Ae~KrlsDe~AalKKQlDS-nAqsKkhl 188 (224)
-+||-|+++.|+..++--- ..--+|..++..|++|+-|- -..|-+-+|+..||--+-+ +.+.++-+
T Consensus 17 n~~L~~en~kL~~~ve~~e---------e~na~L~~e~~~L~~q~~s~Qqal~~aK~l~eEledLk~~~~~lEE~~~~L~ 87 (193)
T PF14662_consen 17 NQKLADENAKLQRSVETAE---------EGNAQLAEEITDLRKQLKSLQQALQKAKALEEELEDLKTLAKSLEEENRSLL 87 (193)
T ss_pred hHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3677788888877775422 22346777777888877765 2334445566666655443 33444445
Q ss_pred HHhH
Q psy7537 189 AQRY 192 (224)
Q Consensus 189 aqry 192 (224)
||.|
T Consensus 88 aq~r 91 (193)
T PF14662_consen 88 AQAR 91 (193)
T ss_pred HHHH
Confidence 5554
No 66
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=29.77 E-value=1e+02 Score=22.28 Aligned_cols=20 Identities=20% Similarity=0.313 Sum_probs=8.9
Q ss_pred chhhhhhhcchHHHHHHhhh
Q psy7537 110 VDAETKKQVDSQAETKKQLD 129 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLD 129 (224)
+-.|.-+|.+++.+|+++|+
T Consensus 45 a~~e~~~Lk~E~e~L~~el~ 64 (69)
T PF14197_consen 45 AYEENNKLKEENEALRKELE 64 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444443
No 67
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=29.69 E-value=1.2e+02 Score=21.79 Aligned_cols=55 Identities=16% Similarity=0.279 Sum_probs=26.8
Q ss_pred HHHHhhHHHHhhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhh
Q psy7537 66 TKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLD 129 (224)
Q Consensus 66 ~kkQL~s~ies~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLD 129 (224)
+.-.|+++|.++-.++ -|..+.-..|-++..++.. ++.-...|..++..|+++++
T Consensus 2 lQsaL~~EirakQ~~~--eEL~kvk~~n~~~e~kLqe-------aE~rn~eL~~ei~~L~~e~e 56 (61)
T PF08826_consen 2 LQSALEAEIRAKQAIQ--EELTKVKSANLAFESKLQE-------AEKRNRELEQEIERLKKEME 56 (61)
T ss_dssp HHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH
T ss_pred HHhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH
Confidence 4456777887776333 3334444444444444432 12222334555555555553
No 68
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=29.65 E-value=65 Score=28.02 Aligned_cols=14 Identities=7% Similarity=0.242 Sum_probs=6.9
Q ss_pred hhchHHHHHHhhch
Q psy7537 167 QLDSQAEMKKQFDS 180 (224)
Q Consensus 167 lsDe~AalKKQlDS 180 (224)
+..|++.|++-|+-
T Consensus 96 l~~EN~rLr~LL~~ 109 (283)
T TIGR00219 96 LKQENVRLRELLNS 109 (283)
T ss_pred HHHHHHHHHHHhcC
Confidence 34455555555443
No 69
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=29.22 E-value=1.4e+02 Score=24.00 Aligned_cols=26 Identities=15% Similarity=0.185 Sum_probs=4.6
Q ss_pred hHHHHHhhhcccccCccceecceeee
Q psy7537 191 RYDDFMKELDYKETSGTTIHHGIRYR 216 (224)
Q Consensus 191 ryddfmkeldyketsgttihhgiryr 216 (224)
.++....-+-+--|+|..+-=.+.||
T Consensus 95 ~~~~~~~~~~~~~~~~~~~~l~~~~r 120 (161)
T PF04420_consen 95 SFDKSLSKVLWVLTTLPFFVLRFWYR 120 (161)
T ss_dssp HHHHHHHHH-----------------
T ss_pred HHHHHHHHHHHHHHHHHHhhhheeec
Confidence 34555555555556655554444444
No 70
>KOG0240|consensus
Probab=29.16 E-value=4e+02 Score=27.04 Aligned_cols=60 Identities=20% Similarity=0.172 Sum_probs=41.6
Q ss_pred HHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH-HHHhhhhchHHHHHHhhchHHH
Q psy7537 122 AETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ-AETKKQLDSQAEMKKQFDSQAE 183 (224)
Q Consensus 122 AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn-Ae~KrlsDe~AalKKQlDSnAq 183 (224)
+-++.+.++.-..+++.+ .+-.++..+++-|-+|+|+- -++..++..+..|+-|+--|-+
T Consensus 396 ~~~~~~~~~i~~~~~~~~--~~~~~~~e~~~~L~qqlD~kd~~~n~~sqL~~~lk~q~~~qee 456 (607)
T KOG0240|consen 396 AILSEEEMSITKLKGSLE--EEEDILTERIESLYQQLDQKDDQINKQSQLMEKLKEQLLDQEE 456 (607)
T ss_pred hhhhhhhhhhhhcccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334566666666666644 77778888888888888864 6777777777777776655444
No 71
>KOG4603|consensus
Probab=29.05 E-value=1.2e+02 Score=27.00 Aligned_cols=46 Identities=17% Similarity=0.085 Sum_probs=34.6
Q ss_pred chhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhh
Q psy7537 101 DSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQF 148 (224)
Q Consensus 101 dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~Krls 148 (224)
-.+-|-+--.|+|.-.|--++++|..++.|-+++-++.+ ||+|-|+
T Consensus 68 a~QDqF~~~~~eel~~ld~~i~~l~ek~q~l~~t~s~ve--aEik~L~ 113 (201)
T KOG4603|consen 68 ADQDQFDMVSDEELQVLDGKIVALTEKVQSLQQTCSYVE--AEIKELS 113 (201)
T ss_pred ecHHhhcCCChHHHHHHhHHHHHHHHHHHHHHHHHHHHH--HHHHHHH
Confidence 344555556678888888888888888888888888888 7777654
No 72
>PRK11020 hypothetical protein; Provisional
Probab=28.95 E-value=1.3e+02 Score=24.79 Aligned_cols=37 Identities=11% Similarity=0.177 Sum_probs=20.8
Q ss_pred HHHhhhhchHHHHHHhhccH------HHHhhhhchHHHHHHhh
Q psy7537 142 AETKKQFDSQAETKKQFDSQ------AETKKQLDSQAEMKKQF 178 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSn------Ae~KrlsDe~AalKKQl 178 (224)
+|+++|+|.+.++..+++.- .-+....+|++.|-+++
T Consensus 5 ~Eiq~L~drLD~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I 47 (118)
T PRK11020 5 NEIKRLSDRLDAIRHKLAAASLRGDAEKYAQFEKEKATLEAEI 47 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHH
Confidence 56777777776666666543 22334445555554444
No 73
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=28.56 E-value=2.8e+02 Score=22.94 Aligned_cols=26 Identities=23% Similarity=0.329 Sum_probs=14.5
Q ss_pred hhhhhhchhhhhhhcchHHHHHHhhh
Q psy7537 104 AESKKQVDAETKKQVDSQAETKKQLD 129 (224)
Q Consensus 104 aqSk~saDAE~KrlsDe~AalKkQLD 129 (224)
...|...-++...|..+++.|+++|.
T Consensus 102 ~~eR~~~l~~l~~l~~~~~~l~~el~ 127 (188)
T PF03962_consen 102 SEEREELLEELEELKKELKELKKELE 127 (188)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555556665555555
No 74
>PF06133 DUF964: Protein of unknown function (DUF964); InterPro: IPR010368 This entry consists of several relatively short bacterial and archaeal hypothetical sequences. It also includes YlbF and YmcA proteins which are involved in the formation of biofilms []. YlbF regulates sporulation prior to stage II, positively controlling the competence regulator ComK at a post-transcriptional level. It may also modulate the translation, stability or activity of ComS and may work together with YmcA to regulate community development [].; PDB: 2IAZ_C 2OEE_A 2OEQ_D 2PIH_A.
Probab=27.81 E-value=98 Score=22.02 Aligned_cols=73 Identities=12% Similarity=0.259 Sum_probs=44.4
Q ss_pred hhhhhchHHHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchH
Q psy7537 76 TKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQ 151 (224)
Q Consensus 76 s~~ql~adAe~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~ 151 (224)
++..+.++.+.+++++.--.++.++ -.++..+..-+ +....+..++.++|..|..-+.-...+-++-.+...+
T Consensus 25 a~~~l~~d~e~~~l~~~f~~~q~~~-~~~q~~g~~~~--~e~~~~l~~~~~~l~~~p~v~~y~~ae~~~~~ll~~i 97 (108)
T PF06133_consen 25 AEEALEADPEAQKLIEEFQKLQQEL-QNAQMYGKEPP--KEEIEELQELQEELMQNPVVKEYLQAEQALQDLLQDI 97 (108)
T ss_dssp HHHHHHCSHHHHHHHHHHHHHHHHH-HHHHHTTSCHH--HHHHHHHHHHHHHHHTSHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHhCHHHHHHHHHHHHHHHHH-HHHHhhccCCc--HHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHHH
Confidence 4445566666666666665555555 23333333322 5566677788888888887777777555555555444
No 75
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=27.52 E-value=1.6e+02 Score=26.00 Aligned_cols=14 Identities=14% Similarity=0.370 Sum_probs=6.8
Q ss_pred hhchHHHHHHhhch
Q psy7537 167 QLDSQAEMKKQFDS 180 (224)
Q Consensus 167 lsDe~AalKKQlDS 180 (224)
..++...++.|++.
T Consensus 111 ~~~e~~sl~~q~~~ 124 (314)
T PF04111_consen 111 FQEERDSLKNQYEY 124 (314)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 33445555555544
No 76
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=27.13 E-value=3.4e+02 Score=26.60 Aligned_cols=29 Identities=17% Similarity=0.289 Sum_probs=22.2
Q ss_pred HHHHHHhhchHHHHHHHHHHhHHHHHhhh
Q psy7537 171 QAEMKKQFDSQAEAKKHLAQRYDDFMKEL 199 (224)
Q Consensus 171 ~AalKKQlDSnAqsKkhlaqryddfmkel 199 (224)
++.|+..+++.++--.+|++.|..+-..|
T Consensus 396 i~kL~~~v~~s~~rl~~L~~qWe~~R~pL 424 (594)
T PF05667_consen 396 IAKLQALVEASEQRLVELAQQWEKHRAPL 424 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 46778888888888888888887765554
No 77
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=27.11 E-value=4.5e+02 Score=23.40 Aligned_cols=21 Identities=10% Similarity=0.125 Sum_probs=13.5
Q ss_pred hhhhhhcchHHHHHHhhhhhh
Q psy7537 112 AETKKQVDSQAETKKQLDSQA 132 (224)
Q Consensus 112 AE~KrlsDe~AalKkQLDSnA 132 (224)
..|+.+-.++++++.++.+..
T Consensus 275 P~v~~l~~qi~~l~~~l~~~~ 295 (498)
T TIGR03007 275 PDVIATKREIAQLEEQKEEEG 295 (498)
T ss_pred hHHHHHHHHHHHHHHHHHhhc
Confidence 456666667777777765543
No 78
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=27.00 E-value=3.7e+02 Score=22.38 Aligned_cols=100 Identities=17% Similarity=0.274 Sum_probs=56.3
Q ss_pred HHHhhhhcchhhhhhhcchhhhhhhhchhhhhhhcchHH-------HHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHH
Q psy7537 84 AETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQA-------ETKKQLDSQAETKKQFDSQAETKKQFDSQAETKK 156 (224)
Q Consensus 84 Ae~K~l~d~ngalkKQ~dS~aqSk~saDAE~KrlsDe~A-------alKkQLDSnAqsk~sadsqAE~KrlsDe~AalKk 156 (224)
+-.+.|-+.-+++++....+..--..+.+|-++|+.... +|+++|..-..-|. ..+.+|.
T Consensus 27 ~lIksLKeei~emkk~e~~~~k~m~ei~~eN~~L~epL~~a~~e~~eL~k~L~~y~kdK~-------------~L~~~k~ 93 (201)
T PF13851_consen 27 ELIKSLKEEIAEMKKKEERNEKLMAEISQENKRLSEPLKKAEEEVEELRKQLKNYEKDKQ-------------SLQNLKA 93 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHH
Confidence 445555566667777777666666777777777776544 44444433222111 2223333
Q ss_pred hhcc-HHHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHHH
Q psy7537 157 QFDS-QAETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDFM 196 (224)
Q Consensus 157 QlDS-nAe~KrlsDe~AalKKQlDSnAqsKkhlaqryddfm 196 (224)
.+-. ..+.+.+.-+...|.-+|..--+-+..|-.||..-+
T Consensus 94 rl~~~ek~l~~Lk~e~evL~qr~~kle~ErdeL~~kf~~~i 134 (201)
T PF13851_consen 94 RLKELEKELKDLKWEHEVLEQRFEKLEQERDELYRKFESAI 134 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3322 245566666666677777766666666666665443
No 79
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=26.73 E-value=93 Score=25.91 Aligned_cols=37 Identities=11% Similarity=0.174 Sum_probs=18.8
Q ss_pred HHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhhchH
Q psy7537 142 AETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQFDSQ 181 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQlDSn 181 (224)
+|-++|-.+++.|+.++ ++...+..|+..|++.|+..
T Consensus 76 ~en~~L~~e~~~l~~~~---~~~~~l~~en~~L~~lL~~~ 112 (276)
T PRK13922 76 EENEELKKELLELESRL---QELEQLEAENARLRELLNLK 112 (276)
T ss_pred HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhcCc
Confidence 44444444444444443 23334555666676666543
No 80
>PRK11281 hypothetical protein; Provisional
Probab=26.27 E-value=3.9e+02 Score=28.27 Aligned_cols=52 Identities=10% Similarity=0.182 Sum_probs=29.8
Q ss_pred hhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccHHHHhhhhchHH
Q psy7537 112 AETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQAETKKQLDSQA 172 (224)
Q Consensus 112 AE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSnAe~KrlsDe~A 172 (224)
..++++.+.|.+|-..|-..++ .|..+..++...|.++|.-.++-+-.+|+.
T Consensus 278 p~i~~~~~~N~~Ls~~L~~~t~---------~~~~l~~~~~~~~~~l~~~~q~~~~i~eqi 329 (1113)
T PRK11281 278 PLVAQELEINLQLSQRLLKATE---------KLNTLTQQNLRVKNWLDRLTQSERNIKEQI 329 (1113)
T ss_pred hHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555544433 456667777777777777666655444443
No 81
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=25.91 E-value=1.7e+02 Score=21.14 Aligned_cols=52 Identities=12% Similarity=0.263 Sum_probs=28.3
Q ss_pred HhhhhhhhhhhccCcHHHHhhhhchHHHHHHhh-ccHHHHhhhhchHHHHHHhhc
Q psy7537 126 KQLDSQAETKKQFDSQAETKKQFDSQAETKKQF-DSQAETKKQLDSQAEMKKQFD 179 (224)
Q Consensus 126 kQLDSnAqsk~sadsqAE~KrlsDe~AalKkQl-DSnAe~KrlsDe~AalKKQlD 179 (224)
..|++--..|-..- .|+.+.-..+-++.++| |+-.....|..++..|+++++
T Consensus 4 saL~~EirakQ~~~--eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~e 56 (61)
T PF08826_consen 4 SALEAEIRAKQAIQ--EELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEME 56 (61)
T ss_dssp HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444443 55555554444555444 334555666677777777664
No 82
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=25.89 E-value=84 Score=30.54 Aligned_cols=18 Identities=22% Similarity=0.209 Sum_probs=12.1
Q ss_pred HHHHhhhhchHHHHHHhh
Q psy7537 161 QAETKKQLDSQAEMKKQF 178 (224)
Q Consensus 161 nAe~KrlsDe~AalKKQl 178 (224)
++..+.+.+++++|+.|+
T Consensus 103 e~KIkeLEaE~~~Lk~Ql 120 (475)
T PRK13729 103 QRRIEKLGQDNAALAEQV 120 (475)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 455666777777777776
No 83
>PF00458 WHEP-TRS: WHEP-TRS domain; InterPro: IPR000738 A conserved domain of 46 amino acids, called WHEP-TRS has been shown [] to exist in a number of higher eukaryote aminoacyl-transfer RNA synthetases. This domain is present one to six times in the several enzymes. There are three copies in mammalian multifunctional aminoacyl-tRNA synthetase in a region that separates the N-terminal glutamyl-tRNA synthetase domain from the C-terminal prolyl-tRNA synthetase domain, and six copies in the intercatalytic region of the Drosophila enzyme. The domain is found at the N-terminal extremity of the mammalian tryptophanyl- tRNA synthetase and histidyl-tRNA synthetase, and the mammalian, insect, nematode and plant glycyl- tRNA synthetases []. This domain could contain a central alpha-helical region and may play a role in the association of tRNA-synthetases into multienzyme complexes.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0006418 tRNA aminoacylation for protein translation; PDB: 1R1B_A 1D2D_A 1X59_A 1FYJ_A 2ZT8_A 2ZXF_A 2ZT7_A 2ZT6_A 2PME_A 2ZT5_A ....
Probab=25.19 E-value=1.5e+02 Score=20.91 Aligned_cols=37 Identities=19% Similarity=0.185 Sum_probs=22.5
Q ss_pred HHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhh
Q psy7537 142 AETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQF 178 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQl 178 (224)
+++..+-|.+..||..=.+..+++.-.+..-+||++|
T Consensus 3 ~~I~~QG~~VR~LKa~ka~k~~i~~aV~~Ll~LK~~~ 39 (56)
T PF00458_consen 3 AQIAAQGDKVRKLKAEKADKEEIDAAVAKLLELKAEL 39 (56)
T ss_dssp HHHHHHHHHHHHHHHTT--HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHccCCcHHHHHHHHHHHHHHHHHH
Confidence 4566666777777766555556666666666666665
No 84
>KOG1962|consensus
Probab=24.74 E-value=2e+02 Score=25.37 Aligned_cols=18 Identities=11% Similarity=0.303 Sum_probs=9.3
Q ss_pred HHHhhhhchHHHHHHhhc
Q psy7537 142 AETKKQFDSQAETKKQFD 159 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlD 159 (224)
-|--||.++...|..|++
T Consensus 193 ~EydrLlee~~~Lq~~i~ 210 (216)
T KOG1962|consen 193 DEYDRLLEEYSKLQEQIE 210 (216)
T ss_pred cHHHHHHHHHHHHHHHHh
Confidence 355555555555555544
No 85
>KOG0980|consensus
Probab=24.46 E-value=9e+02 Score=25.99 Aligned_cols=36 Identities=19% Similarity=0.288 Sum_probs=15.2
Q ss_pred HHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhc
Q psy7537 122 AETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFD 159 (224)
Q Consensus 122 AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlD 159 (224)
+..-|||.+.-++.--.. .+...|-|.+.++.++.+
T Consensus 448 ~di~kQle~~~~s~~~~~--~~~~~L~d~le~~~~~~~ 483 (980)
T KOG0980|consen 448 DDIQKQLESAEQSIDDVE--EENTNLNDQLEELQRAAG 483 (980)
T ss_pred HHHHHHHHHHHHhHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 334455555544443222 333334444444444433
No 86
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=24.45 E-value=2.4e+02 Score=19.38 Aligned_cols=21 Identities=19% Similarity=0.355 Sum_probs=14.1
Q ss_pred chhhhhhhcchHHHHHHhhhh
Q psy7537 110 VDAETKKQVDSQAETKKQLDS 130 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQLDS 130 (224)
+++|..||..+++.+.+++.+
T Consensus 2 ~~~E~~rL~Kel~kl~~~i~~ 22 (66)
T PF10458_consen 2 VEAEIERLEKELEKLEKEIER 22 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 356777777777777666654
No 87
>PRK02224 chromosome segregation protein; Provisional
Probab=24.03 E-value=6.5e+02 Score=24.14 Aligned_cols=10 Identities=10% Similarity=0.119 Sum_probs=5.4
Q ss_pred HhHHHHHhhh
Q psy7537 190 QRYDDFMKEL 199 (224)
Q Consensus 190 qryddfmkel 199 (224)
.-|..++.++
T Consensus 436 ~~~~~~l~~~ 445 (880)
T PRK02224 436 RTARERVEEA 445 (880)
T ss_pred HHHHHHHHHH
Confidence 3455565555
No 88
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=22.72 E-value=4.5e+02 Score=21.88 Aligned_cols=33 Identities=21% Similarity=0.296 Sum_probs=17.9
Q ss_pred HHHhhhhchHHHHHHhhccHHHHhhhhchHHHHHHhhch
Q psy7537 142 AETKKQFDSQAETKKQFDSQAETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 142 AE~KrlsDe~AalKkQlDSnAe~KrlsDe~AalKKQlDS 180 (224)
.+++.|.-+...|..+|. ++..+-.+|...|.+
T Consensus 100 k~l~~Lk~e~evL~qr~~------kle~ErdeL~~kf~~ 132 (201)
T PF13851_consen 100 KELKDLKWEHEVLEQRFE------KLEQERDELYRKFES 132 (201)
T ss_pred HHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHH
Confidence 556666666666665554 334444455555543
No 89
>PF10354 DUF2431: Domain of unknown function (DUF2431); InterPro: IPR019446 This entry represents the N-terminal domain of a family of proteins whose function is not known.
Probab=22.60 E-value=36 Score=27.30 Aligned_cols=30 Identities=30% Similarity=0.516 Sum_probs=23.1
Q ss_pred HHHHHHhHHHHHhhhcccccCccceeccee
Q psy7537 185 KKHLAQRYDDFMKELDYKETSGTTIHHGIR 214 (224)
Q Consensus 185 Kkhlaqryddfmkeldyketsgttihhgir 214 (224)
...|.+.|.++..-+++-+..|.+++||+-
T Consensus 32 ~~~l~~kY~~~~~nl~~L~~~g~~V~~~VD 61 (166)
T PF10354_consen 32 EEELLQKYPDAEENLEELRELGVTVLHGVD 61 (166)
T ss_pred hHHHHHhcccHHHHHHHHhhcCCccccCCC
Confidence 355677888776666666999999999973
No 90
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=22.52 E-value=1.7e+02 Score=26.08 Aligned_cols=53 Identities=17% Similarity=0.227 Sum_probs=33.4
Q ss_pred hchhhhhhhcchHHHHHHhhhhhhhhhhcc-CcHHHHhhhhchHHHHHHhhccH
Q psy7537 109 QVDAETKKQVDSQAETKKQLDSQAETKKQF-DSQAETKKQFDSQAETKKQFDSQ 161 (224)
Q Consensus 109 saDAE~KrlsDe~AalKkQLDSnAqsk~sa-dsqAE~KrlsDe~AalKkQlDSn 161 (224)
+.-.-..||--|.++|+..+..-....+.. +.....-.+.+.++.|+++|++-
T Consensus 91 s~~~kl~RL~~Ev~EL~eEl~~~~~~~~~~~~e~~~~~~l~~~~~~L~~~L~~l 144 (388)
T PF04912_consen 91 SPEQKLQRLRREVEELKEELEKRKADSKESDEEKISPEELAQQLEELSKQLDSL 144 (388)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCChhhHHHHHHHHHHHHHHh
Confidence 344556788888888888887654433333 22222334567788888888875
No 91
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=22.40 E-value=1e+02 Score=25.74 Aligned_cols=12 Identities=8% Similarity=0.130 Sum_probs=4.6
Q ss_pred hhhchHHHHHHh
Q psy7537 166 KQLDSQAEMKKQ 177 (224)
Q Consensus 166 rlsDe~AalKKQ 177 (224)
+|..+++.|+.+
T Consensus 80 ~L~~e~~~l~~~ 91 (276)
T PRK13922 80 ELKKELLELESR 91 (276)
T ss_pred HHHHHHHHHHHH
Confidence 333333444333
No 92
>KOG0161|consensus
Probab=22.31 E-value=8.4e+02 Score=27.78 Aligned_cols=102 Identities=16% Similarity=0.222 Sum_probs=65.3
Q ss_pred hhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH---------------HHHhh
Q psy7537 102 SQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ---------------AETKK 166 (224)
Q Consensus 102 S~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn---------------Ae~Kr 166 (224)
+-+-+|+.+-+=-|++.-.+.+|.-+||.- -+...|-+-.++++..-+-.|..++|+- ....-
T Consensus 1594 ~E~r~k~e~~r~KKkle~di~elE~~ld~a--nk~~~d~~K~lkk~q~~~k~lq~~~e~~~~~~~e~~~q~~~aerr~~~ 1671 (1930)
T KOG0161|consen 1594 AETRSKSEALRSKKKLEGDINELEIQLDHA--NKANEDAQKQLKKLQAQLKELQRELEDAQRAREELLEQLAEAERRLAA 1671 (1930)
T ss_pred HHHHHHHHHHhhhhhhhcchHHHHHHHHHH--HHhhHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444333344555555555566532 2344555556666666655555555543 33455
Q ss_pred hhchHHHHHHhhchHHHHHHHHHHhHHHHHhhhcccccC
Q psy7537 167 QLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKELDYKETS 205 (224)
Q Consensus 167 lsDe~AalKKQlDSnAqsKkhlaqryddfmkeldykets 205 (224)
+..++..|+..+++--.+|+++-+-++.+-..+...-+.
T Consensus 1672 l~~E~eeL~~~l~~~~Rarr~aE~e~~E~~e~i~~~~~~ 1710 (1930)
T KOG0161|consen 1672 LQAELEELREKLEALERARRQAELELEELAERVNELNAQ 1710 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhc
Confidence 678899999999999999999999999998887765544
No 93
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=22.03 E-value=6.7e+02 Score=23.61 Aligned_cols=130 Identities=15% Similarity=0.227 Sum_probs=0.0
Q ss_pred hHHHHHHHhhHHHHhhhhhchHHHHhhhhcchh---hhhhhcchhhhhhhhchhhhhhhcchHHHHHHhhhhhhhhhhcc
Q psy7537 62 NQAETKKQLESEIETKKQLESEAETKKQVDSQG---ETKKQVDSQAESKKQVDAETKKQVDSQAETKKQLDSQAETKKQF 138 (224)
Q Consensus 62 ~~a~~kkQL~s~ies~~ql~adAe~K~l~d~ng---alkKQ~dS~aqSk~saDAE~KrlsDe~AalKkQLDSnAqsk~sa 138 (224)
.+..++.++......-+.|.-+.-...+-..+. .|-..+..-...+..++-...++.+....++.++..-.
T Consensus 257 ~i~~l~~~i~~~~~~l~~l~l~~~~~~~~~i~~~Id~Lyd~lekE~~A~~~vek~~~~l~~~l~~~~e~~~~l~------ 330 (569)
T PRK04778 257 EIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELK------ 330 (569)
T ss_pred HHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH------
Q ss_pred CcHHHHhhhhch----HHHHHHhhccHHHHhhhhchHHHHHHhhchHHHHHHHHHHhHHHHHhhhc
Q psy7537 139 DSQAETKKQFDS----QAETKKQFDSQAETKKQLDSQAEMKKQFDSQAEAKKHLAQRYDDFMKELD 200 (224)
Q Consensus 139 dsqAE~KrlsDe----~AalKkQlDSnAe~KrlsDe~AalKKQlDSnAqsKkhlaqryddfmkeld 200 (224)
.|+.++..+ --++...-.-..+.+.+......+...+..++.+-..+..+|++..+.+.
T Consensus 331 ---~Ei~~l~~sY~l~~~e~~~~~~lekeL~~Le~~~~~~~~~i~~~~~~ysel~e~leel~e~le 393 (569)
T PRK04778 331 ---EEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLE 393 (569)
T ss_pred ---HHHHHHHHccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHH
No 94
>KOG0946|consensus
Probab=21.57 E-value=8.1e+02 Score=26.29 Aligned_cols=58 Identities=22% Similarity=0.186 Sum_probs=30.5
Q ss_pred HHhhhhhhhhhcccCChhhhHHHHHhhHHHHhhhccCCCcchhhhhhhhhHhhhhHHHH
Q psy7537 8 QVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAET 66 (224)
Q Consensus 8 qvldklkheeakfnvdsdkidslikkseeileskkiesP~a~~~SeT~~~KQ~d~~a~~ 66 (224)
|..+.+.--+..|..++++-+++..-+| ..+.+.-|--++-.+...-.|+|-..+.++
T Consensus 703 ql~~q~~~Lk~qLg~~~~~~~~~~q~~e-~~~t~~eel~a~~~e~k~l~~~q~~l~~~L 760 (970)
T KOG0946|consen 703 QLKDQLDLLKNQLGIISSKQRDLLQGAE-ASKTQNEELNAALSENKKLENDQELLTKEL 760 (970)
T ss_pred HHHHHHHHHHHHhcccccchhhHHhHHH-hccCChHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444445667777777777665432 233333333334444444556666666665
No 95
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=20.79 E-value=3.5e+02 Score=26.78 Aligned_cols=77 Identities=16% Similarity=0.236 Sum_probs=52.6
Q ss_pred hhhhchhhhhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH-----HHHhhhhchHHHHHHhhch
Q psy7537 106 SKKQVDAETKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ-----AETKKQLDSQAEMKKQFDS 180 (224)
Q Consensus 106 Sk~saDAE~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn-----Ae~KrlsDe~AalKKQlDS 180 (224)
.+.-+|.-....-.|..+|.++|+.--+.+.+-+ .|++.....+..|+..|... .++--+++-+|.|-.||-.
T Consensus 428 qlQ~adSKa~~f~~Ec~aL~~rL~~aE~ek~~l~--eeL~~a~~~i~~LqDEL~TTr~NYE~QLs~MSEHLasmNeqL~~ 505 (518)
T PF10212_consen 428 QLQHADSKAVHFYAECRALQKRLESAEKEKESLE--EELKEANQNISRLQDELETTRRNYEEQLSMMSEHLASMNEQLAK 505 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 3444566666677777888888887766666665 66666666666666666654 5666677788888777766
Q ss_pred HHHH
Q psy7537 181 QAEA 184 (224)
Q Consensus 181 nAqs 184 (224)
+.+-
T Consensus 506 Q~ee 509 (518)
T PF10212_consen 506 QREE 509 (518)
T ss_pred HHHH
Confidence 6543
No 96
>KOG1853|consensus
Probab=20.76 E-value=3.4e+02 Score=25.63 Aligned_cols=18 Identities=11% Similarity=0.340 Sum_probs=9.5
Q ss_pred chhhhhhhcchHHHHHHh
Q psy7537 110 VDAETKKQVDSQAETKKQ 127 (224)
Q Consensus 110 aDAE~KrlsDe~AalKkQ 127 (224)
+-+++.||--|+..+|..
T Consensus 64 l~t~nqrl~~E~e~~Kek 81 (333)
T KOG1853|consen 64 LETRNQRLTTEQERNKEK 81 (333)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344455555555555543
No 97
>PF10498 IFT57: Intra-flagellar transport protein 57 ; InterPro: IPR019530 Eukaryotic cilia and flagella are specialised organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bi-directional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated []. This protein is known as Huntington-interacting protein-1 in humans.
Probab=20.52 E-value=4.9e+02 Score=23.89 Aligned_cols=46 Identities=11% Similarity=0.188 Sum_probs=24.7
Q ss_pred hhhhcchHHHHHHhhhhhhhhhhccCcHHHHhhhhchHHHHHHhhccH
Q psy7537 114 TKKQVDSQAETKKQLDSQAETKKQFDSQAETKKQFDSQAETKKQFDSQ 161 (224)
Q Consensus 114 ~KrlsDe~AalKkQLDSnAqsk~sadsqAE~KrlsDe~AalKkQlDSn 161 (224)
-+...++.++.+.+...-.++-..- ..|+.++++++...|.+++..
T Consensus 275 Yr~~~~~ls~~~~~y~~~s~~V~~~--t~~L~~IseeLe~vK~emeer 320 (359)
T PF10498_consen 275 YRSAQDELSEVQEKYKQASEGVSER--TRELAEISEELEQVKQEMEER 320 (359)
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHH--HHHHHHHHHHHHHHHHHHHHh
Confidence 4444444444444443322222222 256677777777788777765
No 98
>COG3657 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.27 E-value=52 Score=26.45 Aligned_cols=39 Identities=38% Similarity=0.504 Sum_probs=24.6
Q ss_pred HHHHHHHHhHHHHHhhh--cccc----cCccceecceeeeEeecc
Q psy7537 183 EAKKHLAQRYDDFMKEL--DYKE----TSGTTIHHGIRYRVQFNE 221 (224)
Q Consensus 183 qsKkhlaqryddfmkel--dyke----tsgttihhgiryrvqfne 221 (224)
-||..++.|.+..+--. |.+. .+--.||||--|||-|-.
T Consensus 23 ~Aka~I~~Rl~rl~~GN~GD~kpvgeGV~ELRId~GpGyRvY~~~ 67 (100)
T COG3657 23 RAKAKIAARLDRLALGNFGDVKPVGEGVSELRIDHGPGYRVYFQQ 67 (100)
T ss_pred HHHHHHHHHHHHHhcCCCcCccccccchhhheeccCCceEEEEEe
Confidence 34556778877664321 2222 233479999999999864
No 99
>PRK03918 chromosome segregation protein; Provisional
Probab=20.21 E-value=7.5e+02 Score=23.49 Aligned_cols=32 Identities=9% Similarity=0.233 Sum_probs=13.5
Q ss_pred hhhcchhhhhhhhchhhhhhhcchHHHHHHhh
Q psy7537 97 KKQVDSQAESKKQVDAETKKQVDSQAETKKQL 128 (224)
Q Consensus 97 kKQ~dS~aqSk~saDAE~KrlsDe~AalKkQL 128 (224)
++++.+-...-.....+...+.+..+.++++|
T Consensus 199 ~~~~~~l~~ei~~l~~e~~~l~~~~~~~~~~l 230 (880)
T PRK03918 199 EKELEEVLREINEISSELPELREELEKLEKEV 230 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333444444444444444444433
Done!