Query psy17567
Match_columns 362
No_of_seqs 208 out of 398
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 16:51:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17567.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17567hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3618|consensus 99.9 6.4E-23 1.4E-27 212.4 20.3 264 66-362 13-286 (1318)
2 KOG3619|consensus 99.4 4.4E-13 9.6E-18 144.4 7.3 65 298-362 13-77 (867)
3 KOG2422|consensus 93.9 0.017 3.6E-07 60.7 0.2 15 66-80 113-127 (665)
4 KOG0112|consensus 93.9 0.031 6.7E-07 61.1 2.1 10 103-112 827-836 (975)
5 KOG4364|consensus 93.0 0.13 2.9E-06 54.7 4.9 10 86-95 393-402 (811)
6 KOG3619|consensus 92.9 4.6 0.0001 45.1 16.7 105 257-362 519-625 (867)
7 KOG0345|consensus 91.5 0.42 9.1E-06 49.6 6.3 7 66-72 539-545 (567)
8 PF06375 BLVR: Bovine leukaemi 91.2 0.062 1.3E-06 47.8 0.0 6 66-71 108-113 (154)
9 KOG2412|consensus 91.2 0.24 5.2E-06 51.8 4.2 30 2-31 213-242 (591)
10 KOG4364|consensus 90.7 0.36 7.7E-06 51.6 5.0 17 304-320 655-671 (811)
11 PF11600 CAF-1_p150: Chromatin 89.8 1.3 2.8E-05 41.3 7.6 11 86-96 199-209 (216)
12 KOG4055|consensus 87.7 1.2 2.5E-05 40.7 5.4 28 33-60 118-145 (213)
13 PF07946 DUF1682: Protein of u 87.4 0.55 1.2E-05 46.5 3.5 17 10-26 255-271 (321)
14 PF06658 DUF1168: Protein of u 86.9 1 2.2E-05 39.7 4.5 41 5-45 45-85 (142)
15 KOG0338|consensus 86.2 0.17 3.6E-06 53.0 -0.9 31 1-31 102-132 (691)
16 PTZ00121 MAEBL; Provisional 84.4 1.6 3.6E-05 50.6 5.6 7 77-83 1813-1819(2084)
17 PLN02381 valyl-tRNA synthetase 75.5 1.6 3.5E-05 50.0 2.0 19 13-31 16-34 (1066)
18 KOG4727|consensus 75.1 9.1 0.0002 34.7 6.2 7 66-72 174-180 (193)
19 PRK11281 hypothetical protein; 74.2 1.5E+02 0.0032 34.7 17.0 28 315-342 729-759 (1113)
20 KOG1962|consensus 73.2 7.4 0.00016 36.6 5.4 18 63-80 187-204 (216)
21 KOG2357|consensus 72.8 5.5 0.00012 40.7 4.8 8 14-21 389-396 (440)
22 PF15012 DUF4519: Domain of un 72.7 3.9 8.4E-05 30.3 2.7 21 1-21 1-22 (56)
23 KOG1297|consensus 71.3 1.8 3.9E-05 40.2 0.9 16 66-81 232-247 (249)
24 PLN02381 valyl-tRNA synthetase 71.0 3.6 7.8E-05 47.3 3.4 16 112-127 98-113 (1066)
25 KOG2412|consensus 70.8 2.9 6.3E-05 44.0 2.4 24 8-31 212-235 (591)
26 PF09889 DUF2116: Uncharacteri 70.0 14 0.00029 27.8 5.2 33 111-143 16-48 (59)
27 KOG2812|consensus 69.5 5 0.00011 40.3 3.6 15 65-79 276-290 (426)
28 PF11600 CAF-1_p150: Chromatin 67.3 8.6 0.00019 35.8 4.6 9 102-110 201-209 (216)
29 KOG2894|consensus 65.6 12 0.00025 36.6 5.0 26 66-91 127-152 (331)
30 KOG0345|consensus 61.2 11 0.00023 39.6 4.2 15 61-75 537-551 (567)
31 KOG2243|consensus 60.6 5.9 0.00013 46.1 2.4 13 111-123 4595-4607(5019)
32 PRK10929 putative mechanosensi 60.4 3E+02 0.0066 32.2 15.9 32 310-341 709-743 (1109)
33 PF09925 DUF2157: Predicted me 60.4 1.1E+02 0.0024 26.3 10.4 20 115-134 11-30 (145)
34 PF07543 PGA2: Protein traffic 60.3 8.3 0.00018 33.9 2.9 12 65-76 128-139 (140)
35 PF06027 DUF914: Eukaryotic pr 60.2 1.9E+02 0.0041 29.0 13.9 72 251-322 113-184 (334)
36 PF05262 Borrelia_P83: Borreli 60.1 21 0.00046 37.6 6.3 10 69-78 335-344 (489)
37 KOG2457|consensus 58.3 4.6 0.0001 41.1 1.0 29 1-29 16-45 (555)
38 PF14812 PBP1_TM: Transmembran 58.3 3.3 7.1E-05 33.1 0.0 15 32-46 42-56 (81)
39 KOG0163|consensus 57.0 24 0.00052 39.0 6.1 7 250-256 1170-1176(1259)
40 PF08432 Vfa1: AAA-ATPase Vps4 56.6 17 0.00036 33.1 4.3 25 67-91 114-138 (182)
41 PF15437 PGBA_C: Plasminogen-b 56.3 9.9 0.00021 30.3 2.4 16 8-23 45-60 (86)
42 KOG0569|consensus 55.5 2.2E+02 0.0047 30.1 12.9 26 319-344 447-472 (485)
43 KOG3054|consensus 54.9 14 0.00031 35.4 3.6 18 117-134 215-232 (299)
44 KOG4434|consensus 54.4 3.5 7.5E-05 41.7 -0.5 17 116-132 371-387 (520)
45 KOG1230|consensus 54.1 9.3 0.0002 39.4 2.4 18 63-80 33-50 (521)
46 PF14632 SPT6_acidic: Acidic N 51.9 9 0.00019 31.3 1.6 16 66-81 59-74 (92)
47 KOG4819|consensus 51.9 99 0.0021 25.6 7.5 12 66-77 88-99 (106)
48 PF09428 DUF2011: Fungal prote 51.1 32 0.0007 29.8 5.0 13 47-59 115-127 (131)
49 COG5269 ZUO1 Ribosome-associat 49.7 36 0.00079 33.3 5.5 18 67-84 323-340 (379)
50 PF11286 DUF3087: Protein of u 49.6 1.5E+02 0.0034 26.7 9.2 16 124-139 9-24 (165)
51 PTZ00053 methionine aminopepti 49.1 8.4 0.00018 40.3 1.3 16 95-110 120-135 (470)
52 PF05297 Herpes_LMP1: Herpesvi 48.6 5.8 0.00013 38.9 0.0 7 132-138 49-55 (381)
53 KOG1924|consensus 46.8 28 0.0006 38.7 4.7 22 10-31 991-1012(1102)
54 KOG4055|consensus 45.8 25 0.00055 32.3 3.6 29 28-56 117-145 (213)
55 KOG2571|consensus 45.2 36 0.00077 38.3 5.3 43 100-142 560-602 (862)
56 KOG3089|consensus 44.5 32 0.00068 32.8 4.1 18 66-83 79-96 (271)
57 KOG0163|consensus 43.0 66 0.0014 35.8 6.7 9 69-77 1006-1014(1259)
58 KOG2507|consensus 41.6 24 0.00051 36.5 3.0 20 109-128 326-345 (506)
59 PTZ00053 methionine aminopepti 40.8 16 0.00036 38.2 1.9 9 312-320 319-327 (470)
60 TIGR00869 sec62 protein transl 40.3 15 0.00033 34.9 1.4 15 51-65 216-230 (232)
61 PLN02316 synthase/transferase 38.8 24 0.00052 40.5 2.9 9 92-100 390-399 (1036)
62 PF09323 DUF1980: Domain of un 38.7 1.3E+02 0.0029 27.0 7.3 23 164-186 27-49 (182)
63 COG3601 Predicted membrane pro 37.9 3.3E+02 0.0071 25.1 9.6 30 257-286 148-177 (186)
64 PRK09510 tolA cell envelope in 37.6 91 0.002 32.0 6.5 10 67-76 258-267 (387)
65 PF03904 DUF334: Domain of unk 37.3 3.1E+02 0.0066 26.2 9.4 15 119-133 136-150 (230)
66 COG5593 Nucleic-acid-binding p 36.4 20 0.00042 38.2 1.6 12 35-46 779-790 (821)
67 PLN03237 DNA topoisomerase 2; 35.1 6.4 0.00014 46.4 -2.4 25 67-91 1259-1283(1465)
68 PF08496 Peptidase_S49_N: Pept 35.0 66 0.0014 28.7 4.5 13 69-81 119-131 (155)
69 PRK01637 hypothetical protein; 34.5 2.7E+02 0.0059 26.8 9.2 71 258-328 187-262 (286)
70 KOG3054|consensus 33.3 1.1E+02 0.0024 29.5 5.9 6 7-12 105-110 (299)
71 PLN03223 Polycystin cation cha 33.2 9.5E+02 0.021 29.1 14.9 28 305-332 1438-1467(1634)
72 PF14617 CMS1: U3-containing 9 33.2 3.1 6.8E-05 40.0 -4.5 9 69-77 56-64 (252)
73 PF12569 NARP1: NMDA receptor- 33.0 32 0.00069 36.5 2.6 9 125-133 497-505 (517)
74 PF09736 Bud13: Pre-mRNA-splic 32.9 2.6E+02 0.0056 24.6 7.9 19 66-84 68-86 (145)
75 PF08374 Protocadherin: Protoc 32.8 5.1 0.00011 37.6 -3.0 10 10-19 67-76 (221)
76 KOG1015|consensus 32.7 15 0.00032 41.8 -0.0 12 315-326 876-887 (1567)
77 KOG2062|consensus 31.9 20 0.00044 39.5 0.9 17 68-85 852-868 (929)
78 PF05793 TFIIF_alpha: Transcri 31.1 16 0.00035 38.8 0.0 6 68-73 362-367 (527)
79 COG3356 Predicted membrane pro 28.9 3.7E+02 0.008 29.0 9.5 18 116-133 3-20 (578)
80 PF15333 TAF1D: TATA box-bindi 28.8 29 0.00063 32.4 1.2 10 102-111 100-109 (217)
81 PRK11644 sensory histidine kin 28.6 6.9E+02 0.015 26.0 19.3 9 123-131 97-105 (495)
82 PF15049 DUF4534: Protein of u 27.3 4E+02 0.0087 24.1 8.1 44 164-207 83-130 (163)
83 PF13493 DUF4118: Domain of un 27.1 90 0.002 24.8 3.8 26 307-332 80-105 (105)
84 PF01086 Clathrin_lg_ch: Clath 27.0 1.1E+02 0.0024 28.7 4.9 16 83-98 180-197 (225)
85 PF06027 DUF914: Eukaryotic pr 26.7 6.6E+02 0.014 25.2 12.0 66 257-331 242-311 (334)
86 COG3083 Predicted hydrolase of 26.7 6.7E+02 0.014 27.0 10.7 10 228-237 99-108 (600)
87 PF05087 Rota_VP2: Rotavirus V 26.7 37 0.0008 36.8 1.7 18 67-84 89-107 (887)
88 PF01086 Clathrin_lg_ch: Clath 26.2 99 0.0021 29.1 4.4 9 88-96 205-213 (225)
89 KOG2256|consensus 26.2 20 0.00044 38.8 -0.3 8 262-269 439-446 (661)
90 PRK04214 rbn ribonuclease BN/u 25.8 3E+02 0.0065 28.1 8.1 31 258-288 194-227 (412)
91 KOG1070|consensus 25.7 29 0.00063 40.9 0.8 14 66-79 1442-1455(1710)
92 KOG1098|consensus 25.5 37 0.0008 36.9 1.5 25 25-49 340-364 (780)
93 TIGR00927 2A1904 K+-dependent 25.0 15 0.00033 41.6 -1.5 11 87-97 901-911 (1096)
94 KOG0299|consensus 24.9 60 0.0013 33.8 2.8 7 66-72 94-100 (479)
95 KOG0949|consensus 24.8 1.2E+02 0.0025 35.1 5.1 20 101-120 998-1017(1330)
96 PF12757 DUF3812: Protein of u 24.7 1.1E+02 0.0024 26.2 4.0 19 13-31 91-109 (126)
97 PF06645 SPC12: Microsomal sig 24.6 2.1E+02 0.0046 22.3 5.3 12 187-198 55-66 (76)
98 COG4901 Ribosomal protein S25 24.1 55 0.0012 27.3 1.9 23 61-83 37-59 (107)
99 COG5116 RPN2 26S proteasome re 24.0 20 0.00043 38.6 -0.8 34 66-101 849-884 (926)
100 PRK07668 hypothetical protein; 23.7 6.7E+02 0.015 24.2 16.4 17 187-204 129-145 (254)
101 KOG3915|consensus 23.4 1.8E+02 0.0039 30.6 5.8 18 47-64 563-580 (641)
102 KOG3045|consensus 23.4 22 0.00048 34.8 -0.6 14 117-130 146-159 (325)
103 PF13829 DUF4191: Domain of un 23.2 4.2E+02 0.0092 25.1 7.9 20 118-137 9-28 (224)
104 KOG2138|consensus 23.0 42 0.00091 36.8 1.3 7 67-73 851-857 (883)
105 PHA02644 hypothetical protein; 23.0 6.7 0.00015 31.5 -3.5 13 61-73 91-104 (112)
106 PRK09510 tolA cell envelope in 22.2 2.4E+02 0.0051 29.0 6.4 7 70-76 258-264 (387)
107 KOG2507|consensus 22.2 66 0.0014 33.4 2.4 9 108-116 360-368 (506)
108 TIGR00570 cdk7 CDK-activating 21.9 1.4E+02 0.003 29.8 4.5 51 9-59 152-221 (309)
109 PF08243 SPT2: SPT2 chromatin 21.7 1.3E+02 0.0028 25.5 3.8 13 33-45 93-105 (116)
110 PF07701 HNOBA: Heme NO bindin 21.3 2.3E+02 0.0049 26.5 5.8 44 319-362 168-211 (219)
111 PF01698 FLO_LFY: Floricaula / 20.9 55 0.0012 33.4 1.6 22 96-127 236-257 (386)
112 KOG2662|consensus 20.6 9.6E+02 0.021 24.9 11.6 16 114-129 297-312 (414)
113 KOG3142|consensus 20.3 6.9E+02 0.015 23.1 12.1 28 67-94 18-45 (187)
114 smart00441 FF Contains two con 20.2 74 0.0016 22.4 1.8 50 68-127 4-53 (55)
115 KOG1059|consensus 20.1 35 0.00076 37.6 0.0 20 46-65 850-869 (877)
No 1
>KOG3618|consensus
Probab=99.90 E-value=6.4e-23 Score=212.36 Aligned_cols=264 Identities=14% Similarity=0.172 Sum_probs=189.9
Q ss_pred cCHHhHHHHHHHhhhhccCc----cccccccccc-cccccccccccccCccCcCCCcchHHhHHHHHHHHhhhHHHHHHH
Q psy17567 66 QSVKEIEESLKERKEKTLTE----DSQCDTMIPV-FKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNL 140 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~~lt~----~~~~~~~~~~-f~~g~~~~g~~~p~L~~kF~s~~LE~LYq~Y~~r~~q~sL~v~~~ 140 (362)
++.+++...+.|..+.+.++ .+||++++|. |++ +.+| | ++++|+|..||+.|+.-.+.|-|...--- +
T Consensus 13 ~~t~~v~~~~~Psi~s~~Ss~gRR~~~~~~~lP~~FER-Asar-W----wdPkFdS~~LEeqy~~~~Fpq~r~RfR~~-L 85 (1318)
T KOG3618|consen 13 GDTNSVRVKINPSISSSCSSSGRRVGGGGRLLPQLFER-ASAR-W----WDPKFDSVNLEEQYLERCFPQTRRRFRYA-L 85 (1318)
T ss_pred CCCcccceeeCcchhhhcCCCCcccCCCcccchHHHHH-hhhh-h----cCccCCcchHHHHHHHhhCHHHHHHHHHH-H
Confidence 78899999999999998876 4999999996 998 4445 3 78999999999999999888766554322 2
Q ss_pred HHHHHHHHHHHHHHHhcccccccccchhHHHHHHHHHHHHHHHHHh-hhhhhcchhhHHHHHHHHHHHHhhhhhhhhhhh
Q psy17567 141 VDMVLKAVLTVAWISYNEVELSQIDSTKLTWTVYTMMSNLAMCLLG-WWRCFANNYLQWAAVGTWILLNTQGWWRCFANN 219 (362)
Q Consensus 141 l~ill~~~l~~l~l~~~~~~~~~~~~~~~~~~~~~l~~~l~L~vl~-~~~~~~~~~l~~~~~~~wi~l~t~~~~~c~~~~ 219 (362)
..|.+++.++.+|+..+...+ ....+....|.+.+..++++++ +++.+.+.| .|.+++.-+++ |
T Consensus 86 ~YI~~~~l~W~lYfav~~rs~---fi~~~~~slc~lslv~~mf~~ft~~~lY~rhy-~~TS~~~tlLv-------c---- 150 (1318)
T KOG3618|consen 86 FYIGFACLLWSLYFAVHMRSR---FIVMVAPSLCFLSLVCVMFFLFTFTKLYARHY-AWTSLALTLLV-------C---- 150 (1318)
T ss_pred HHHHHHHHHHHHHheeccCce---eeeehHHHHHHHHHHHHHHHHHHHHHHHHHHh-hHHHHHHHHHH-------H----
Confidence 578889999999987665332 2222223333333333444443 677665555 67766522221 2
Q ss_pred hhhhhhhHHHHhhhhhhhhcccc-ccCCccchhHHHHHHHHHHhcccccchHHHHHhhhhhHHHHHHHHHHHHh---ccC
Q psy17567 220 YLQWAAVGTWILLNTQGFISQGI-GLNNKEYLIWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQIKL---HRG 295 (362)
Q Consensus 220 yl~~~~~~~~v~~~~q~~~~~~~-~~~~~~~~~w~~lfiIf~iYtmLPl~l~~ail~g~~~Sl~hliv~~~~~~---~~~ 295 (362)
..++.. +.+++. .+-++-.....++-++++|||.+|+|++.++.+|+..|+.+-++...+.- +..
T Consensus 151 ------~~tLa~-----ltat~r~af~spvgsfa~c~evvlLiYTv~plPLyL~~~~gi~YSilFE~~~~~~ig~~~~~~ 219 (1318)
T KOG3618|consen 151 ------ALTLAN-----LTATARPAFLSPVGSFAMCIEVVLLIYTVMPLPLYLSLCLGIAYSILFETFGYHFIGEACFPS 219 (1318)
T ss_pred ------HHHHHH-----hhhccchhhhCchhHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHhhhhccccccccC
Confidence 111111 111222 22122334555778899999999999999999999999998876432110 001
Q ss_pred CCCchhhhhHHHHHHHHHHhhhhhhhhhhhhhHHhHHHHHHHHHHHHHHHHhhhHHHHHHHHhhhcC
Q psy17567 296 PLSPCSVHEMGALCLVYIGINSAGMYTKYLTDRGQRRAFLETHRSTETRMKTQKENDQQEKLLLSGM 362 (362)
Q Consensus 296 ~~~~~~~~ql~an~ll~lc~NliG~~~~~l~e~~qR~aFLetrq~i~~rl~Le~E~~qQErLLLSVL 362 (362)
+....+-+.++..++++.|+|++|++..+|+.+++|++||+..|++..|..||.|+|-+|++++||+
T Consensus 220 e~aG~~~~~l~~r~~lH~~vHliGVHlfim~qVR~R~TFlKVGQs~l~rkdLE~EkqlKe~MIhSVM 286 (1318)
T KOG3618|consen 220 EGAGALHWELLSRGLLHGCVHLIGVHLFIMSQVRSRSTFLKVGQSILHRKDLEVEKQLKERMIHSVM 286 (1318)
T ss_pred cccCchhHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhHHHhhHHHHhhhhhHHHHHHHHHHHHhhc
Confidence 1222456789999999999999999999999999999999999999999999999999999999996
No 2
>KOG3619|consensus
Probab=99.39 E-value=4.4e-13 Score=144.43 Aligned_cols=65 Identities=40% Similarity=0.654 Sum_probs=62.7
Q ss_pred CchhhhhHHHHHHHHHHhhhhhhhhhhhhhHHhHHHHHHHHHHHHHHHHhhhHHHHHHHHhhhcC
Q psy17567 298 SPCSVHEMGALCLVYIGINSAGMYTKYLTDRGQRRAFLETHRSTETRMKTQKENDQQEKLLLSGM 362 (362)
Q Consensus 298 ~~~~~~ql~an~ll~lc~NliG~~~~~l~e~~qR~aFLetrq~i~~rl~Le~E~~qQErLLLSVL 362 (362)
+...+.|+++|+++|+|+|++|+|++||+|++||++|+|||+||++|+++|.|++||||||+|||
T Consensus 13 ~~~~~~ql~an~~~~~~~nl~G~~~~~~~e~~~r~~f~~~~~~i~~r~~l~~~~~~qerlllsvl 77 (867)
T KOG3619|consen 13 DVLLLNQLLANAVLFLCINLVGIFTKYMMERAQRQAFLETRKCIEVRMELETEKQQQERLLLSVL 77 (867)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 45678899999999999999999999999999999999999999999999999999999999997
No 3
>KOG2422|consensus
Probab=93.94 E-value=0.017 Score=60.72 Aligned_cols=15 Identities=20% Similarity=0.016 Sum_probs=8.3
Q ss_pred cCHHhHHHHHHHhhh
Q psy17567 66 QSVKEIEESLKERKE 80 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~ 80 (362)
.+||+-..+|+-.+.
T Consensus 113 s~deq~~a~l~~~k~ 127 (665)
T KOG2422|consen 113 SDDEQKLAILKYGKT 127 (665)
T ss_pred CchHHHHHHHHhhcc
Confidence 455566666655544
No 4
>KOG0112|consensus
Probab=93.89 E-value=0.031 Score=61.08 Aligned_cols=10 Identities=30% Similarity=0.481 Sum_probs=5.0
Q ss_pred ccccccCccC
Q psy17567 103 KGIYCPSLTN 112 (362)
Q Consensus 103 ~g~~~p~L~~ 112 (362)
+|+..|.|+.
T Consensus 827 k~v~~~ll~e 836 (975)
T KOG0112|consen 827 KGVIISLLKE 836 (975)
T ss_pred cchhhhhhhc
Confidence 4455555554
No 5
>KOG4364|consensus
Probab=93.00 E-value=0.13 Score=54.73 Aligned_cols=10 Identities=10% Similarity=-0.044 Sum_probs=6.0
Q ss_pred cccccccccc
Q psy17567 86 DSQCDTMIPV 95 (362)
Q Consensus 86 ~~~~~~~~~~ 95 (362)
.++|..+.|+
T Consensus 393 ~~s~~~f~pF 402 (811)
T KOG4364|consen 393 TVSDIRFEPF 402 (811)
T ss_pred cccccccccc
Confidence 3566666664
No 6
>KOG3619|consensus
Probab=92.89 E-value=4.6 Score=45.14 Aligned_cols=105 Identities=11% Similarity=-0.008 Sum_probs=61.2
Q ss_pred HHHHHhcccccchHHHHHhhhhhHHHHHHHHHHHHhc--cCCCCchhhhhHHHHHHHHHHhhhhhhhhhhhhhHHhHHHH
Q psy17567 257 IVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQIKLH--RGPLSPCSVHEMGALCLVYIGINSAGMYTKYLTDRGQRRAF 334 (362)
Q Consensus 257 iIf~iYtmLPl~l~~ail~g~~~Sl~hliv~~~~~~~--~~~~~~~~~~ql~an~ll~lc~NliG~~~~~l~e~~qR~aF 334 (362)
.+.++++++++|...-.++-.+....|.++....... ..+..+..-...... ++++-+-++=.++....|+.-|--|
T Consensus 519 ~~i~~~vf~rl~~~~k~llll~~~~~y~~l~~~~~~~~~~~~~~~~~~~k~~~~-i~l~lf~l~l~~~~Rq~E~~~RlDF 597 (867)
T KOG3619|consen 519 GLLSTAVFLRLPFMLKLLLLLIVFAAYGLLIELSYARELDEEPLPYMGLKLMLS-ILLSLFLLALLLHARQVEYTARLDF 597 (867)
T ss_pred HHHHHHheeeccHHHHHHHHHHHHHHHHhheeeeeccccCCCCccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455788889985444433333444444432111010 011111111122222 3333333445667788999999999
Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHhhhcC
Q psy17567 335 LETHRSTETRMKTQKENDQQEKLLLSGM 362 (362)
Q Consensus 335 Letrq~i~~rl~Le~E~~qQErLLLSVL 362 (362)
+=.-|..+++-..|.=.+..++||..||
T Consensus 598 LWk~q~~~E~ee~e~m~~~Nr~LLeNiL 625 (867)
T KOG3619|consen 598 LWKVQAQEEKEEMETMQNLNRLLLENIL 625 (867)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHhcC
Confidence 9999999999888877788888888775
No 7
>KOG0345|consensus
Probab=91.53 E-value=0.42 Score=49.58 Aligned_cols=7 Identities=0% Similarity=0.017 Sum_probs=3.2
Q ss_pred cCHHhHH
Q psy17567 66 QSVKEIE 72 (362)
Q Consensus 66 ~~~~~~~ 72 (362)
|+||++|
T Consensus 539 e~e~~~~ 545 (567)
T KOG0345|consen 539 EEEELAD 545 (567)
T ss_pred hhHHHHH
Confidence 3444444
No 8
>PF06375 BLVR: Bovine leukaemia virus receptor (BLVR); InterPro: IPR010474 Bovine leukemia virus (BLV) is one of the most common infectious cattle viruses, with between 30 and 40% of cows in the United States being infected. It is closely related to the human T-cell leukaemia virus type 1 (HTLV-1) and has highly conserved envelope glycoprotein functional domains []. BLV is an oncogenic C-type retrovirus, which results in the animals developing a malignant lymphoma. BLV, like the human and simian T cell leukaemia viruses, is a deltaretrovirus. 182 residues at the amino-terminal of the BLV envelope glycoprotein surface unit encompass the receptor-binding domain. The metabolic activity in B cells, T cells, and thymocytes is indicated by the expression of the BLV-binding receptor []. A candidate gene of the receptor (BLVR) is related, but unique, to a gene family of the delta subunit of the adaptor protein (AP) complex 3, AP-3 []. The AP-3 complex is not clathrin-associated but is associated with the Golgi region as well as more peripheral structures. It facilitates the budding of vesicles from the Golgi membrane and may be directly involved in trafficking to lysosomes. ; PDB: 4AFI_B.
Probab=91.18 E-value=0.062 Score=47.83 Aligned_cols=6 Identities=33% Similarity=0.495 Sum_probs=0.0
Q ss_pred cCHHhH
Q psy17567 66 QSVKEI 71 (362)
Q Consensus 66 ~~~~~~ 71 (362)
+||||+
T Consensus 108 e~~ed~ 113 (154)
T PF06375_consen 108 ESDEDI 113 (154)
T ss_dssp ------
T ss_pred cccccc
Confidence 566655
No 9
>KOG2412|consensus
Probab=91.16 E-value=0.24 Score=51.81 Aligned_cols=30 Identities=23% Similarity=0.534 Sum_probs=20.7
Q ss_pred cccCCCchhhhhhHHHhhhHHHHHhhHHHH
Q psy17567 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~e~e~~e~~ 31 (362)
++|++..+|++.|||++|++++|++.+.++
T Consensus 213 ~~~~~~~e~kr~Eaerk~~~~qEe~Rqk~d 242 (591)
T KOG2412|consen 213 RERKERSEEKREEAERKRRAHQEELRQKED 242 (591)
T ss_pred HHHHHHHHhhhhhhHHHHHHHHHHHHHHHH
Confidence 456777788888888887776665544443
No 10
>KOG4364|consensus
Probab=90.68 E-value=0.36 Score=51.62 Aligned_cols=17 Identities=6% Similarity=-0.087 Sum_probs=7.6
Q ss_pred hHHHHHHHHHHhhhhhh
Q psy17567 304 EMGALCLVYIGINSAGM 320 (362)
Q Consensus 304 ql~an~ll~lc~NliG~ 320 (362)
+|+++.+=+.-+|..|+
T Consensus 655 alm~ql~pl~hgn~ns~ 671 (811)
T KOG4364|consen 655 ALMVQLFPLSHGNENSI 671 (811)
T ss_pred HHHHHHhhhhcccccch
Confidence 34444444444444443
No 11
>PF11600 CAF-1_p150: Chromatin assembly factor 1 complex p150 subunit, N-terminal; InterPro: IPR021644 P150 is a polypeptide subunit of CAF-1, which functions in depositing newly synthesised and acetylated histones H3/H4 into chromatin during DNA replication and repair [].P150 is the HP1 interaction site of CAF-1 and lies within the N-terminal region of the protein [].
Probab=89.77 E-value=1.3 Score=41.34 Aligned_cols=11 Identities=18% Similarity=0.486 Sum_probs=7.2
Q ss_pred ccccccccccc
Q psy17567 86 DSQCDTMIPVF 96 (362)
Q Consensus 86 ~~~~~~~~~~f 96 (362)
++.|..+.|||
T Consensus 199 ~~~~~~f~pF~ 209 (216)
T PF11600_consen 199 SDYEKKFLPFF 209 (216)
T ss_pred chhhccccCce
Confidence 45566677776
No 12
>KOG4055|consensus
Probab=87.69 E-value=1.2 Score=40.75 Aligned_cols=28 Identities=50% Similarity=0.686 Sum_probs=12.0
Q ss_pred hhhhhHHHHhhhhhhHHHHHHHHHhhhh
Q psy17567 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKK 60 (362)
Q Consensus 33 ~~e~~~~e~~~~~~~~~~~~k~~~~~~~ 60 (362)
+.+...+|+-.++..|++|+|+++++++
T Consensus 118 k~~kaaEeKTaKKRaKRqk~Kq~akkkk 145 (213)
T KOG4055|consen 118 KNQKAAEEKTAKKRAKRQKKKQKAKKKK 145 (213)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3333333443444344555555443333
No 13
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=87.42 E-value=0.55 Score=46.48 Aligned_cols=17 Identities=41% Similarity=0.554 Sum_probs=8.2
Q ss_pred hhhhhHHHhhhHHHHHh
Q psy17567 10 EKKKRIEKRNKEEEKEE 26 (362)
Q Consensus 10 ~~~~~~~~~~~~~~e~e 26 (362)
|-++++++.|+++.++-
T Consensus 255 e~~~K~~k~R~~~~~~~ 271 (321)
T PF07946_consen 255 EAKKKAKKNREEEEEKI 271 (321)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44455555555544433
No 14
>PF06658 DUF1168: Protein of unknown function (DUF1168); InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=86.89 E-value=1 Score=39.67 Aligned_cols=41 Identities=27% Similarity=0.379 Sum_probs=18.0
Q ss_pred CCCchhhhhhHHHhhhHHHHHhhHHHHhhhhhhHHHHhhhh
Q psy17567 5 RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK 45 (362)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~e~e~~e~~~~~e~~~~e~~~~~ 45 (362)
|.++-++-+.-|+.-+++.++++=+.+-++...+++++..+
T Consensus 45 RRrE~~Rl~~me~~~~~e~~~~eF~~kree~~~~~eekTaK 85 (142)
T PF06658_consen 45 RRREYERLEYMEEEAKKEKEDEEFQRKREERKKEAEEKTAK 85 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555555555444444443333333333334444444
No 15
>KOG0338|consensus
Probab=86.23 E-value=0.17 Score=52.98 Aligned_cols=31 Identities=32% Similarity=0.503 Sum_probs=21.2
Q ss_pred CcccCCCchhhhhhHHHhhhHHHHHhhHHHH
Q psy17567 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~e~e~~e~~ 31 (362)
|+++|++++-...+.|+..+|++|.+++|.+
T Consensus 102 ~~~~k~~a~~~~~eeE~~~~E~eE~de~eDe 132 (691)
T KOG0338|consen 102 ARQRKPGAKSVSIEEEEEEKEGEEKDEEEDE 132 (691)
T ss_pred HHhhCCCCCccchhhhhhcccccccccccch
Confidence 4677888777777777776666666655554
No 16
>PTZ00121 MAEBL; Provisional
Probab=84.40 E-value=1.6 Score=50.55 Aligned_cols=7 Identities=43% Similarity=0.401 Sum_probs=2.6
Q ss_pred Hhhhhcc
Q psy17567 77 ERKEKTL 83 (362)
Q Consensus 77 e~~~~~l 83 (362)
.+.+..+
T Consensus 1813 qrIQqNL 1819 (2084)
T PTZ00121 1813 GGKEGNL 1819 (2084)
T ss_pred HHhhhcc
Confidence 3333333
No 17
>PLN02381 valyl-tRNA synthetase
Probab=75.54 E-value=1.6 Score=49.98 Aligned_cols=19 Identities=26% Similarity=0.569 Sum_probs=8.2
Q ss_pred hhHHHhhhHHHHHhhHHHH
Q psy17567 13 KRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 13 ~~~~~~~~~~~e~e~~e~~ 31 (362)
+|-|+++|++|+.+++|.+
T Consensus 16 ~~~~~~~~~~~~~~~~~~~ 34 (1066)
T PLN02381 16 EELERKKKKEEKAKEKELK 34 (1066)
T ss_pred HHHHHHHhhhHHHHHHHHH
Confidence 3444444444444444433
No 18
>KOG4727|consensus
Probab=75.14 E-value=9.1 Score=34.72 Aligned_cols=7 Identities=29% Similarity=0.420 Sum_probs=3.4
Q ss_pred cCHHhHH
Q psy17567 66 QSVKEIE 72 (362)
Q Consensus 66 ~~~~~~~ 72 (362)
+|.||+-
T Consensus 174 es~~~~~ 180 (193)
T KOG4727|consen 174 ESKDEMA 180 (193)
T ss_pred cccHHHH
Confidence 5555443
No 19
>PRK11281 hypothetical protein; Provisional
Probab=74.20 E-value=1.5e+02 Score=34.66 Aligned_cols=28 Identities=14% Similarity=0.032 Sum_probs=19.4
Q ss_pred hhhhhhhhhhhhhHHhHHHH---HHHHHHHH
Q psy17567 315 INSAGMYTKYLTDRGQRRAF---LETHRSTE 342 (362)
Q Consensus 315 ~NliG~~~~~l~e~~qR~aF---Letrq~i~ 342 (362)
.-+-++..|+|.=.+.|-+| .++|+-+.
T Consensus 729 ~l~~~~~~R~l~i~~RRla~~ra~~rR~~~~ 759 (1113)
T PRK11281 729 NLLYQTVLRGLSVAARRLAYRRALAKRQNLV 759 (1113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33466778888888888888 66665553
No 20
>KOG1962|consensus
Probab=73.24 E-value=7.4 Score=36.59 Aligned_cols=18 Identities=22% Similarity=0.250 Sum_probs=13.3
Q ss_pred ccccCHHhHHHHHHHhhh
Q psy17567 63 KEEQSVKEIEESLKERKE 80 (362)
Q Consensus 63 ~~~~~~~~~~~~l~e~~~ 80 (362)
..|+=++|.|+.++|+..
T Consensus 187 q~e~~~~EydrLlee~~~ 204 (216)
T KOG1962|consen 187 QSEGLQDEYDRLLEEYSK 204 (216)
T ss_pred HHHHcccHHHHHHHHHHH
Confidence 345578899999988543
No 21
>KOG2357|consensus
Probab=72.79 E-value=5.5 Score=40.70 Aligned_cols=8 Identities=13% Similarity=0.231 Sum_probs=3.0
Q ss_pred hHHHhhhH
Q psy17567 14 RIEKRNKE 21 (362)
Q Consensus 14 ~~~~~~~~ 21 (362)
||..+|++
T Consensus 389 EaaQ~kk~ 396 (440)
T KOG2357|consen 389 EAAQEKKA 396 (440)
T ss_pred HHHHHHHH
Confidence 33333333
No 22
>PF15012 DUF4519: Domain of unknown function (DUF4519)
Probab=72.70 E-value=3.9 Score=30.33 Aligned_cols=21 Identities=38% Similarity=0.631 Sum_probs=13.7
Q ss_pred CcccC-CCchhhhhhHHHhhhH
Q psy17567 1 MRKRR-GGAGEKKKRIEKRNKE 21 (362)
Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~ 21 (362)
||+.| |+.+|-.|...|||++
T Consensus 1 MRqLkg~~~KET~K~K~ERk~~ 22 (56)
T PF15012_consen 1 MRQLKGGKTKETRKQKKERKKE 22 (56)
T ss_pred CCccCCCCccchHHHHHHHHHH
Confidence 78888 5556666666666655
No 23
>KOG1297|consensus
Probab=71.34 E-value=1.8 Score=40.19 Aligned_cols=16 Identities=19% Similarity=0.135 Sum_probs=11.3
Q ss_pred cCHHhHHHHHHHhhhh
Q psy17567 66 QSVKEIEESLKERKEK 81 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~ 81 (362)
.+|++||..|++.-.+
T Consensus 232 ~dde~fdeyledlfl~ 247 (249)
T KOG1297|consen 232 VDDENFDEYLEDLFLK 247 (249)
T ss_pred cchHHHHHHHHHhhcc
Confidence 4555599999887543
No 24
>PLN02381 valyl-tRNA synthetase
Probab=71.02 E-value=3.6 Score=47.25 Aligned_cols=16 Identities=13% Similarity=0.328 Sum_probs=8.9
Q ss_pred CcCCCcchHHhHHHHH
Q psy17567 112 NSFREPSLEMSYQRYS 127 (362)
Q Consensus 112 ~kF~s~~LE~LYq~Y~ 127 (362)
++++....|.-.+.|-
T Consensus 98 ~~yd~~~iE~~w~~~W 113 (1066)
T PLN02381 98 KQYSPSAVEKSWYAWW 113 (1066)
T ss_pred cCCCHHHHHHHHHHHH
Confidence 4555566666555543
No 25
>KOG2412|consensus
Probab=70.84 E-value=2.9 Score=44.02 Aligned_cols=24 Identities=29% Similarity=0.544 Sum_probs=12.9
Q ss_pred chhhhhhHHHhhhHHHHHhhHHHH
Q psy17567 8 AGEKKKRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 8 ~~~~~~~~~~~~~~~~e~e~~e~~ 31 (362)
..++|+..|++|+|.++.+.++++
T Consensus 212 i~~~~~~~e~kr~Eaerk~~~~qE 235 (591)
T KOG2412|consen 212 IRERKERSEEKREEAERKRRAHQE 235 (591)
T ss_pred HHHHHHHHHhhhhhhHHHHHHHHH
Confidence 345556666666665555444444
No 26
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=69.96 E-value=14 Score=27.79 Aligned_cols=33 Identities=21% Similarity=0.420 Sum_probs=24.2
Q ss_pred cCcCCCcchHHhHHHHHHHHhhhHHHHHHHHHH
Q psy17567 111 TNSFREPSLEMSYQRYSHRQRQKSLIIVNLVDM 143 (362)
Q Consensus 111 ~~kF~s~~LE~LYq~Y~~r~~q~sL~v~~~l~i 143 (362)
+.+|=|++=++.|.+=+.|++++..+++.++.+
T Consensus 16 ~~~fCS~~C~~~~~k~qk~~~~~~~i~~~~~i~ 48 (59)
T PF09889_consen 16 DESFCSPKCREEYRKRQKRMRKTQYIFFGIFIL 48 (59)
T ss_pred chhhhCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777888899998888888777766654333
No 27
>KOG2812|consensus
Probab=69.51 E-value=5 Score=40.28 Aligned_cols=15 Identities=33% Similarity=0.466 Sum_probs=6.3
Q ss_pred ccCHHhHHHHHHHhh
Q psy17567 65 EQSVKEIEESLKERK 79 (362)
Q Consensus 65 ~~~~~~~~~~l~e~~ 79 (362)
|.+-.+.+..+-|..
T Consensus 276 e~s~K~~ke~~~e~~ 290 (426)
T KOG2812|consen 276 ESSLKKSKEMISEAE 290 (426)
T ss_pred HHHHhhHHHhhhhhh
Confidence 334444444443333
No 28
>PF11600 CAF-1_p150: Chromatin assembly factor 1 complex p150 subunit, N-terminal; InterPro: IPR021644 P150 is a polypeptide subunit of CAF-1, which functions in depositing newly synthesised and acetylated histones H3/H4 into chromatin during DNA replication and repair [].P150 is the HP1 interaction site of CAF-1 and lies within the N-terminal region of the protein [].
Probab=67.32 E-value=8.6 Score=35.85 Aligned_cols=9 Identities=22% Similarity=0.600 Sum_probs=4.5
Q ss_pred cccccccCc
Q psy17567 102 YKGIYCPSL 110 (362)
Q Consensus 102 ~~g~~~p~L 110 (362)
|-|+++||.
T Consensus 201 ~~~~f~pF~ 209 (216)
T PF11600_consen 201 YEKKFLPFF 209 (216)
T ss_pred hhccccCce
Confidence 344555554
No 29
>KOG2894|consensus
Probab=65.61 E-value=12 Score=36.57 Aligned_cols=26 Identities=27% Similarity=0.307 Sum_probs=13.8
Q ss_pred cCHHhHHHHHHHhhhhccCccccccc
Q psy17567 66 QSVKEIEESLKERKEKTLTEDSQCDT 91 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~~lt~~~~~~~ 91 (362)
+++||.+...-+.+...+.++++.++
T Consensus 127 ededD~~~k~~~~Kk~klGKdP~VDT 152 (331)
T KOG2894|consen 127 EDEDDAEEKSIPLKKGKLGKDPDVDT 152 (331)
T ss_pred ccccccchhhcchhhhhcCCCCCccc
Confidence 34444443334445556777777665
No 30
>KOG0345|consensus
Probab=61.22 E-value=11 Score=39.61 Aligned_cols=15 Identities=20% Similarity=0.189 Sum_probs=8.2
Q ss_pred hcccccCHHhHHHHH
Q psy17567 61 KKKEEQSVKEIEESL 75 (362)
Q Consensus 61 ~~~~~~~~~~~~~~l 75 (362)
++.+|+.-||++...
T Consensus 537 ~ae~e~~~~d~~l~~ 551 (567)
T KOG0345|consen 537 TAEEEELADDIRLLK 551 (567)
T ss_pred hhhhHHHHHHHHHHH
Confidence 445654555776543
No 31
>KOG2243|consensus
Probab=60.58 E-value=5.9 Score=46.11 Aligned_cols=13 Identities=31% Similarity=0.279 Sum_probs=6.1
Q ss_pred cCcCCCcchHHhH
Q psy17567 111 TNSFREPSLEMSY 123 (362)
Q Consensus 111 ~~kF~s~~LE~LY 123 (362)
.+.|-+.+-|+.-
T Consensus 4595 ~~s~~d~~s~~~~ 4607 (5019)
T KOG2243|consen 4595 KNSFLDEESEEEA 4607 (5019)
T ss_pred hhccccchhhhhh
Confidence 3455555544443
No 32
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=60.41 E-value=3e+02 Score=32.19 Aligned_cols=32 Identities=9% Similarity=0.175 Sum_probs=19.9
Q ss_pred HHHHHhhhhhhhhhhhhhHHhHHHHH---HHHHHH
Q psy17567 310 LVYIGINSAGMYTKYLTDRGQRRAFL---ETHRST 341 (362)
Q Consensus 310 ll~lc~NliG~~~~~l~e~~qR~aFL---etrq~i 341 (362)
+.++..-+-++..|+|.=-+.|-+|- ++|+-+
T Consensus 709 l~~~~~l~y~~~~R~l~i~~RRlA~~Ra~~rR~~~ 743 (1109)
T PRK10929 709 IWFLLLVVYHIIRRWMLIQRRRIAFDRAKQRRAEI 743 (1109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333444567778888777787783 355544
No 33
>PF09925 DUF2157: Predicted membrane protein (DUF2157); InterPro: IPR018677 This family of various hypothetical prokaryotic proteins has no known function.
Probab=60.36 E-value=1.1e+02 Score=26.33 Aligned_cols=20 Identities=20% Similarity=0.205 Sum_probs=14.1
Q ss_pred CCcchHHhHHHHHHHHhhhH
Q psy17567 115 REPSLEMSYQRYSHRQRQKS 134 (362)
Q Consensus 115 ~s~~LE~LYq~Y~~r~~q~s 134 (362)
.++..|++.+.|..+.++++
T Consensus 11 ~~~q~~~i~~~~~~~~~~~~ 30 (145)
T PF09925_consen 11 TPEQAEAILAFYGERPSRSS 30 (145)
T ss_pred CHHHHHHHHHHhhccccchh
Confidence 46678888888887644444
No 34
>PF07543 PGA2: Protein trafficking PGA2; InterPro: IPR011431 A Saccharomyces cerevisiae (Baker's yeast) member of this family (PGA2, P53903 from SWISSPROT) is a single pass membrane protein which has been implicated in protein trafficking [, ].
Probab=60.30 E-value=8.3 Score=33.86 Aligned_cols=12 Identities=42% Similarity=0.595 Sum_probs=9.8
Q ss_pred ccCHHhHHHHHH
Q psy17567 65 EQSVKEIEESLK 76 (362)
Q Consensus 65 ~~~~~~~~~~l~ 76 (362)
++||+||+..|+
T Consensus 128 ~~dD~DI~dlLe 139 (140)
T PF07543_consen 128 EDDDKDIEDLLE 139 (140)
T ss_pred cccHHHHHHHhc
Confidence 379999998875
No 35
>PF06027 DUF914: Eukaryotic protein of unknown function (DUF914); InterPro: IPR009262 This family consists of several hypothetical proteins of unknown function. Some of the sequences in this family are annotated as putative membrane proteins.
Probab=60.22 E-value=1.9e+02 Score=28.95 Aligned_cols=72 Identities=15% Similarity=0.220 Sum_probs=44.2
Q ss_pred hHHHHHHHHHHhcccccchHHHHHhhhhhHHHHHHHHHHHHhccCCCCchhhhhHHHHHHHHHHhhhhhhhh
Q psy17567 251 IWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQIKLHRGPLSPCSVHEMGALCLVYIGINSAGMYT 322 (362)
Q Consensus 251 ~w~~lfiIf~iYtmLPl~l~~ail~g~~~Sl~hliv~~~~~~~~~~~~~~~~~ql~an~ll~lc~NliG~~~ 322 (362)
.|-..|+.++.|.+|--+..+.=++|+++++.=++.........+.++...-+.+..|++.++++-+-|++.
T Consensus 113 ~~~i~~~~~LS~~fL~~ry~~~~~~gv~i~i~Gv~lv~~sD~~~~~~~~~~~~~i~GDll~l~~a~lya~~n 184 (334)
T PF06027_consen 113 CTSIPFVMILSFIFLKRRYSWFHILGVLICIAGVVLVVVSDVLSGSDSSSGSNPILGDLLALLGAILYAVSN 184 (334)
T ss_pred hhhhHHHHHHHHHHHHhhhhHHHHHHHHHHHhhhhheeeecccccccCCCCCccchhHHHHHHHHHHHHHHH
Confidence 455677888889999999999988999988877665432111111111112245666666666665555543
No 36
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=60.15 E-value=21 Score=37.58 Aligned_cols=10 Identities=20% Similarity=0.464 Sum_probs=5.5
Q ss_pred HhHHHHHHHh
Q psy17567 69 KEIEESLKER 78 (362)
Q Consensus 69 ~~~~~~l~e~ 78 (362)
+|.++.+++.
T Consensus 335 kD~qk~~~e~ 344 (489)
T PF05262_consen 335 KDQQKLIEEQ 344 (489)
T ss_pred HHHHHHHhhh
Confidence 3555555554
No 37
>KOG2457|consensus
Probab=58.29 E-value=4.6 Score=41.09 Aligned_cols=29 Identities=34% Similarity=0.480 Sum_probs=14.7
Q ss_pred CcccCC-CchhhhhhHHHhhhHHHHHhhHH
Q psy17567 1 MRKRRG-GAGEKKKRIEKRNKEEEKEENEQ 29 (362)
Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~e~e~~e 29 (362)
|||+++ ++.|+|.++|.++.+++++.++|
T Consensus 16 ~r~~~~~~~~~r~~~~~~e~~~~~~~~~~~ 45 (555)
T KOG2457|consen 16 MRKCREKKEAEREAEREAEREAEEEEKAEE 45 (555)
T ss_pred HhHHhhhchhhhHHHHHhhhhhHhhhhhhh
Confidence 455554 34566666655555544444333
No 38
>PF14812 PBP1_TM: Transmembrane domain of transglycosylase PBP1 at N-terminal; PDB: 3FWL_A 3VMA_A.
Probab=58.29 E-value=3.3 Score=33.06 Aligned_cols=15 Identities=33% Similarity=0.516 Sum_probs=0.0
Q ss_pred hhhhhhHHHHhhhhh
Q psy17567 32 KEKEDVEEEKNKKKK 46 (362)
Q Consensus 32 ~~~e~~~~e~~~~~~ 46 (362)
.+.++.|+|++-++|
T Consensus 42 DD~dDdeeee~m~rK 56 (81)
T PF14812_consen 42 DDDDDDEEEEPMPRK 56 (81)
T ss_dssp ---------------
T ss_pred ccccchhhccccccc
Confidence 334444556666654
No 39
>KOG0163|consensus
Probab=56.99 E-value=24 Score=39.03 Aligned_cols=7 Identities=43% Similarity=1.160 Sum_probs=3.6
Q ss_pred hhHHHHH
Q psy17567 250 LIWYILF 256 (362)
Q Consensus 250 ~~w~~lf 256 (362)
+.||.+|
T Consensus 1170 gmWyaHF 1176 (1259)
T KOG0163|consen 1170 GMWYAHF 1176 (1259)
T ss_pred ceEEEec
Confidence 4555554
No 40
>PF08432 Vfa1: AAA-ATPase Vps4-associated protein 1; InterPro: IPR013640 This is a family of fungal proteins of unknown function.
Probab=56.57 E-value=17 Score=33.13 Aligned_cols=25 Identities=20% Similarity=0.102 Sum_probs=12.5
Q ss_pred CHHhHHHHHHHhhhhccCccccccc
Q psy17567 67 SVKEIEESLKERKEKTLTEDSQCDT 91 (362)
Q Consensus 67 ~~~~~~~~l~e~~~~~lt~~~~~~~ 91 (362)
++++-+..+.+......+..+.-+.
T Consensus 114 ~~k~~~~~~~~~~~~~~~~~~~~~~ 138 (182)
T PF08432_consen 114 DEKEDKDKLKSLQKSYSETLSTVTE 138 (182)
T ss_pred cccchhHHHHHHHHhhccccCCccc
Confidence 3444555566666555544333333
No 41
>PF15437 PGBA_C: Plasminogen-binding protein pgbA C-terminal
Probab=56.30 E-value=9.9 Score=30.33 Aligned_cols=16 Identities=38% Similarity=0.497 Sum_probs=6.3
Q ss_pred chhhhhhHHHhhhHHH
Q psy17567 8 AGEKKKRIEKRNKEEE 23 (362)
Q Consensus 8 ~~~~~~~~~~~~~~~~ 23 (362)
++.+|-+||+|-.|=+
T Consensus 45 eEkKkAKAeqrA~EfE 60 (86)
T PF15437_consen 45 EEKKKAKAEQRAREFE 60 (86)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333344444444433
No 42
>KOG0569|consensus
Probab=55.48 E-value=2.2e+02 Score=30.08 Aligned_cols=26 Identities=19% Similarity=0.111 Sum_probs=13.1
Q ss_pred hhhhhhhhhHHhHHHHHHHHHHHHHH
Q psy17567 319 GMYTKYLTDRGQRRAFLETHRSTETR 344 (362)
Q Consensus 319 G~~~~~l~e~~qR~aFLetrq~i~~r 344 (362)
++|.+....-.-.+++-|-.+.++.+
T Consensus 447 ~i~~~~~lPETkgr~~~eI~~~~~~~ 472 (485)
T KOG0569|consen 447 LIYLYRYLPETKGRTPYEIIEELEKR 472 (485)
T ss_pred HHHHHHhCcccCCCCHHHHHHHHHhC
Confidence 34444444444445666665555544
No 43
>KOG3054|consensus
Probab=54.89 E-value=14 Score=35.38 Aligned_cols=18 Identities=17% Similarity=0.159 Sum_probs=13.2
Q ss_pred cchHHhHHHHHHHHhhhH
Q psy17567 117 PSLEMSYQRYSHRQRQKS 134 (362)
Q Consensus 117 ~~LE~LYq~Y~~r~~q~s 134 (362)
..||.|=-.+-.|.+.+.
T Consensus 215 V~ledLas~f~Lrtqd~i 232 (299)
T KOG3054|consen 215 VPLEDLASEFGLRTQDSI 232 (299)
T ss_pred eeHHHHHHHhCccHHHHH
Confidence 678888877777776644
No 44
>KOG4434|consensus
Probab=54.37 E-value=3.5 Score=41.68 Aligned_cols=17 Identities=0% Similarity=0.143 Sum_probs=8.8
Q ss_pred CcchHHhHHHHHHHHhh
Q psy17567 116 EPSLEMSYQRYSHRQRQ 132 (362)
Q Consensus 116 s~~LE~LYq~Y~~r~~q 132 (362)
+++-+..+++-+...+.
T Consensus 371 ddddd~EWe~lqa~~~k 387 (520)
T KOG4434|consen 371 DDDDDAEWEELQASIQK 387 (520)
T ss_pred ccccHHHHHHHHHHHHH
Confidence 34445556665555543
No 45
>KOG1230|consensus
Probab=54.07 E-value=9.3 Score=39.38 Aligned_cols=18 Identities=22% Similarity=0.309 Sum_probs=10.0
Q ss_pred ccccCHHhHHHHHHHhhh
Q psy17567 63 KEEQSVKEIEESLKERKE 80 (362)
Q Consensus 63 ~~~~~~~~~~~~l~e~~~ 80 (362)
.+|.++++|+.++...-.
T Consensus 33 ~~e~de~~i~~~iq~~ea 50 (521)
T KOG1230|consen 33 NEELDEADIAEIIQSLEA 50 (521)
T ss_pred CcccchHHHHHHHHhhhh
Confidence 344457777766554433
No 46
>PF14632 SPT6_acidic: Acidic N-terminal SPT6
Probab=51.91 E-value=9 Score=31.30 Aligned_cols=16 Identities=6% Similarity=-0.031 Sum_probs=11.4
Q ss_pred cCHHhHHHHHHHhhhh
Q psy17567 66 QSVKEIEESLKERKEK 81 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~ 81 (362)
=++||++.+..-.+..
T Consensus 59 LDedDldLi~EN~g~~ 74 (92)
T PF14632_consen 59 LDEDDLDLIEENLGVK 74 (92)
T ss_pred cCHHHHHHHHHcCCcc
Confidence 6889998877655443
No 47
>KOG4819|consensus
Probab=51.87 E-value=99 Score=25.59 Aligned_cols=12 Identities=17% Similarity=0.191 Sum_probs=6.3
Q ss_pred cCHHhHHHHHHH
Q psy17567 66 QSVKEIEESLKE 77 (362)
Q Consensus 66 ~~~~~~~~~l~e 77 (362)
+=.+..|-.|..
T Consensus 88 nln~qM~~LL~~ 99 (106)
T KOG4819|consen 88 NLNDQMQLLLKQ 99 (106)
T ss_pred cHHHHHHHHHHH
Confidence 345555555543
No 48
>PF09428 DUF2011: Fungal protein of unknown function (DUF2011); InterPro: IPR018555 This is a family of fungal proteins whose function is unknown.
Probab=51.13 E-value=32 Score=29.84 Aligned_cols=13 Identities=54% Similarity=0.690 Sum_probs=4.8
Q ss_pred hHHHHHHHHHhhh
Q psy17567 47 KKKKKKKKKKKKK 59 (362)
Q Consensus 47 ~~~~~~k~~~~~~ 59 (362)
++..+||-+|+++
T Consensus 115 rknr~KK~kkR~~ 127 (131)
T PF09428_consen 115 RKNRKKKFKKRGK 127 (131)
T ss_pred HHHHHHHHHhhcc
Confidence 3333333333333
No 49
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=49.70 E-value=36 Score=33.30 Aligned_cols=18 Identities=6% Similarity=0.156 Sum_probs=11.7
Q ss_pred CHHhHHHHHHHhhhhccC
Q psy17567 67 SVKEIEESLKERKEKTLT 84 (362)
Q Consensus 67 ~~~~~~~~l~e~~~~~lt 84 (362)
=|+|++.+.+.....-+.
T Consensus 323 id~~v~li~~~l~d~el~ 340 (379)
T COG5269 323 IDEDVDLIMDKLGDEELG 340 (379)
T ss_pred cchhhhHHHHhcCcHHHH
Confidence 457777777766665553
No 50
>PF11286 DUF3087: Protein of unknown function (DUF3087); InterPro: IPR021438 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=49.58 E-value=1.5e+02 Score=26.74 Aligned_cols=16 Identities=19% Similarity=0.312 Sum_probs=11.6
Q ss_pred HHHHHHHhhhHHHHHH
Q psy17567 124 QRYSHRQRQKSLIIVN 139 (362)
Q Consensus 124 q~Y~~r~~q~sL~v~~ 139 (362)
++|..++|+..+.+..
T Consensus 9 ~~YRk~~n~v~~~~v~ 24 (165)
T PF11286_consen 9 ERYRKHLNRVIVACVA 24 (165)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4788888887776543
No 51
>PTZ00053 methionine aminopeptidase 2; Provisional
Probab=49.15 E-value=8.4 Score=40.33 Aligned_cols=16 Identities=25% Similarity=0.110 Sum_probs=12.3
Q ss_pred ccccccccccccccCc
Q psy17567 95 VFKKGVLYKGIYCPSL 110 (362)
Q Consensus 95 ~f~~g~~~~g~~~p~L 110 (362)
.|..|...-|-.+.|-
T Consensus 120 ~~~~~~~p~g~~~~~~ 135 (470)
T PTZ00053 120 QFKDGEYPVGEIQEYP 135 (470)
T ss_pred hCCCCCCCcceEEecC
Confidence 6988888888766664
No 52
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=48.61 E-value=5.8 Score=38.89 Aligned_cols=7 Identities=29% Similarity=0.525 Sum_probs=0.0
Q ss_pred hhHHHHH
Q psy17567 132 QKSLIIV 138 (362)
Q Consensus 132 q~sL~v~ 138 (362)
|+.++|+
T Consensus 49 ~~a~~vl 55 (381)
T PF05297_consen 49 QGALTVL 55 (381)
T ss_dssp -------
T ss_pred cchHHHH
Confidence 4444443
No 53
>KOG1924|consensus
Probab=46.81 E-value=28 Score=38.68 Aligned_cols=22 Identities=41% Similarity=0.436 Sum_probs=14.6
Q ss_pred hhhhhHHHhhhHHHHHhhHHHH
Q psy17567 10 EKKKRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 10 ~~~~~~~~~~~~~~e~e~~e~~ 31 (362)
|--+|+++||||+|+++..+.+
T Consensus 991 ea~~en~krRee~Ek~rr~k~a 1012 (1102)
T KOG1924|consen 991 EAVAENEKRREEEEKERRAKLA 1012 (1102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4457788888887666654443
No 54
>KOG4055|consensus
Probab=45.85 E-value=25 Score=32.28 Aligned_cols=29 Identities=45% Similarity=0.614 Sum_probs=14.6
Q ss_pred HHHHhhhhhhHHHHhhhhhhHHHHHHHHH
Q psy17567 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56 (362)
Q Consensus 28 ~e~~~~~e~~~~e~~~~~~~~~~~~k~~~ 56 (362)
|+.++..|+....++++-.|+++++|||+
T Consensus 117 ek~~kaaEeKTaKKRaKRqk~Kq~akkkk 145 (213)
T KOG4055|consen 117 EKNQKAAEEKTAKKRAKRQKKKQKAKKKK 145 (213)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 33344444444555666655555555544
No 55
>KOG2571|consensus
Probab=45.24 E-value=36 Score=38.25 Aligned_cols=43 Identities=23% Similarity=0.213 Sum_probs=32.8
Q ss_pred cccccccccCccCcCCCcchHHhHHHHHHHHhhhHHHHHHHHH
Q psy17567 100 VLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNLVD 142 (362)
Q Consensus 100 ~~~~g~~~p~L~~kF~s~~LE~LYq~Y~~r~~q~sL~v~~~l~ 142 (362)
...||+.+||+.+..-.-+-=..|-+...++|+.....|++.+
T Consensus 560 llskgy~l~Y~a~s~a~t~~Pe~~~efl~QrrRW~~s~f~~~~ 602 (862)
T KOG2571|consen 560 LLSKGYRLKYVAASDAETEAPESFLEFLNQRRRWLNSIFNAAD 602 (862)
T ss_pred HHhccceeeeeccccccccCcHhHHHHHHHhhhhcccchhHHH
Confidence 3456999999999887777777888888888887777666433
No 56
>KOG3089|consensus
Probab=44.53 E-value=32 Score=32.79 Aligned_cols=18 Identities=17% Similarity=0.202 Sum_probs=14.8
Q ss_pred cCHHhHHHHHHHhhhhcc
Q psy17567 66 QSVKEIEESLKERKEKTL 83 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~~l 83 (362)
.|+|+++..|.....+..
T Consensus 79 ~sPe~l~~ll~~yi~s~~ 96 (271)
T KOG3089|consen 79 GSPEDLQKLLKDYISSRR 96 (271)
T ss_pred CChHHHHHHHHHHHHhhc
Confidence 699999999998876543
No 57
>KOG0163|consensus
Probab=42.95 E-value=66 Score=35.83 Aligned_cols=9 Identities=11% Similarity=0.124 Sum_probs=3.7
Q ss_pred HhHHHHHHH
Q psy17567 69 KEIEESLKE 77 (362)
Q Consensus 69 ~~~~~~l~e 77 (362)
|-.|..+.-
T Consensus 1006 er~D~~la~ 1014 (1259)
T KOG0163|consen 1006 ERRDHELAL 1014 (1259)
T ss_pred HHHHHHHHH
Confidence 334444443
No 58
>KOG2507|consensus
Probab=41.60 E-value=24 Score=36.52 Aligned_cols=20 Identities=25% Similarity=0.330 Sum_probs=13.1
Q ss_pred CccCcCCCcchHHhHHHHHH
Q psy17567 109 SLTNSFREPSLEMSYQRYSH 128 (362)
Q Consensus 109 ~L~~kF~s~~LE~LYq~Y~~ 128 (362)
.++++|.+.+.=+.-.-|-.
T Consensus 326 sfte~Fps~~vL~~vr~yvr 345 (506)
T KOG2507|consen 326 SFTEKFPSTSVLRMVRDYVR 345 (506)
T ss_pred chhhcCCcchHHHHHHHHHH
Confidence 47788988776555555544
No 59
>PTZ00053 methionine aminopeptidase 2; Provisional
Probab=40.80 E-value=16 Score=38.19 Aligned_cols=9 Identities=22% Similarity=0.154 Sum_probs=4.1
Q ss_pred HHHhhhhhh
Q psy17567 312 YIGINSAGM 320 (362)
Q Consensus 312 ~lc~NliG~ 320 (362)
.++.|-+|.
T Consensus 319 nltGHgIG~ 327 (470)
T PTZ00053 319 NLNGHSIGP 327 (470)
T ss_pred CCcccCCCC
Confidence 344444444
No 60
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=40.33 E-value=15 Score=34.94 Aligned_cols=15 Identities=60% Similarity=0.592 Sum_probs=7.4
Q ss_pred HHHHHHhhhhhcccc
Q psy17567 51 KKKKKKKKKKKKKEE 65 (362)
Q Consensus 51 ~~k~~~~~~~~~~~~ 65 (362)
|+|+|++++|+++.|
T Consensus 216 ~~k~~~~~~~~~~~~ 230 (232)
T TIGR00869 216 KLKSKKLKKKQAKRE 230 (232)
T ss_pred HhhhhhhhHhhhhhc
Confidence 333334666555544
No 61
>PLN02316 synthase/transferase
Probab=38.81 E-value=24 Score=40.53 Aligned_cols=9 Identities=33% Similarity=0.704 Sum_probs=5.3
Q ss_pred ccc-cccccc
Q psy17567 92 MIP-VFKKGV 100 (362)
Q Consensus 92 ~~~-~f~~g~ 100 (362)
+++ ||..|.
T Consensus 390 ~mDfVFsdg~ 399 (1036)
T PLN02316 390 VLDWVFADGP 399 (1036)
T ss_pred EEEEEEecCC
Confidence 344 677764
No 62
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=38.69 E-value=1.3e+02 Score=26.97 Aligned_cols=23 Identities=9% Similarity=0.216 Sum_probs=11.8
Q ss_pred ccchhHHHHHHHHHHHHHHHHHh
Q psy17567 164 IDSTKLTWTVYTMMSNLAMCLLG 186 (362)
Q Consensus 164 ~~~~~~~~~~~~l~~~l~L~vl~ 186 (362)
+++.+..++..+++++++|.+.-
T Consensus 27 I~P~~~~~~~~a~i~l~ilai~q 49 (182)
T PF09323_consen 27 IHPRYIPLLYFAAILLLILAIVQ 49 (182)
T ss_pred hCccHHHHHHHHHHHHHHHHHHH
Confidence 45555555555555554444443
No 63
>COG3601 Predicted membrane protein [Function unknown]
Probab=37.88 E-value=3.3e+02 Score=25.12 Aligned_cols=30 Identities=23% Similarity=0.296 Sum_probs=25.2
Q ss_pred HHHHHhcccccchHHHHHhhhhhHHHHHHH
Q psy17567 257 IVFVTYAMLPLPLKWCFIGGCTTSVLHVII 286 (362)
Q Consensus 257 iIf~iYtmLPl~l~~ail~g~~~Sl~hliv 286 (362)
.++..|+-+|+.+...++.++.+.+.|...
T Consensus 148 ~~~~~~~ilPFnLikg~i~sivf~~~y~~~ 177 (186)
T COG3601 148 IIYVVTTILPFNLIKGLIFSIVFGLLYKRL 177 (186)
T ss_pred eeeeeeeeccHHHHHHHHHHHHHHHHHHHH
Confidence 388889999999988888888887777765
No 64
>PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional
Probab=37.59 E-value=91 Score=31.98 Aligned_cols=10 Identities=20% Similarity=0.328 Sum_probs=4.4
Q ss_pred CHHhHHHHHH
Q psy17567 67 SVKEIEESLK 76 (362)
Q Consensus 67 ~~~~~~~~l~ 76 (362)
..|||=..|+
T Consensus 258 ~~dd~~~gl~ 267 (387)
T PRK09510 258 EVDDLFGGLD 267 (387)
T ss_pred HHHHHhhccc
Confidence 3444444444
No 65
>PF03904 DUF334: Domain of unknown function (DUF334); InterPro: IPR005602 This is a family of proteins found in Staphylococcus aureus plasmid with no characterised function.
Probab=37.30 E-value=3.1e+02 Score=26.16 Aligned_cols=15 Identities=27% Similarity=0.532 Sum_probs=10.3
Q ss_pred hHHhHHHHHHHHhhh
Q psy17567 119 LEMSYQRYSHRQRQK 133 (362)
Q Consensus 119 LE~LYq~Y~~r~~q~ 133 (362)
.-..+.+|.-++..-
T Consensus 136 vK~~~E~y~k~~k~~ 150 (230)
T PF03904_consen 136 VKQSHEKYQKRQKSM 150 (230)
T ss_pred HHHHHHHHHHHHHHH
Confidence 455778888777653
No 66
>COG5593 Nucleic-acid-binding protein possibly involved in ribosomal biogenesis [Translation, ribosomal structure and biogenesis]
Probab=36.44 E-value=20 Score=38.22 Aligned_cols=12 Identities=50% Similarity=0.645 Sum_probs=5.2
Q ss_pred hhhHHHHhhhhh
Q psy17567 35 EDVEEEKNKKKK 46 (362)
Q Consensus 35 e~~~~e~~~~~~ 46 (362)
++.|+|++++..
T Consensus 779 ~e~eeee~~e~s 790 (821)
T COG5593 779 EEKEEEENKEVS 790 (821)
T ss_pred chhhhhhhhhHH
Confidence 333344555543
No 67
>PLN03237 DNA topoisomerase 2; Provisional
Probab=35.12 E-value=6.4 Score=46.36 Aligned_cols=25 Identities=12% Similarity=0.024 Sum_probs=10.4
Q ss_pred CHHhHHHHHHHhhhhccCccccccc
Q psy17567 67 SVKEIEESLKERKEKTLTEDSQCDT 91 (362)
Q Consensus 67 ~~~~~~~~l~e~~~~~lt~~~~~~~ 91 (362)
+|||+...-+-...=.+++++.-.+
T Consensus 1259 ~~~~~~~~~~~~~~~~~~~~~~~~~ 1283 (1465)
T PLN03237 1259 EEDEILDLKDRLAAYNLDSAPAQSA 1283 (1465)
T ss_pred ccccHHHHHHHHHhccccCCCCccc
Confidence 4444444333333334454443333
No 68
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=34.97 E-value=66 Score=28.70 Aligned_cols=13 Identities=23% Similarity=0.248 Sum_probs=6.6
Q ss_pred HhHHHHHHHhhhh
Q psy17567 69 KEIEESLKERKEK 81 (362)
Q Consensus 69 ~~~~~~l~e~~~~ 81 (362)
|||..+|.-..+.
T Consensus 119 eeisail~~a~~~ 131 (155)
T PF08496_consen 119 EEISAILSVATPE 131 (155)
T ss_pred HHHHHHHHhCCCC
Confidence 4555555544443
No 69
>PRK01637 hypothetical protein; Reviewed
Probab=34.51 E-value=2.7e+02 Score=26.76 Aligned_cols=71 Identities=10% Similarity=0.223 Sum_probs=40.8
Q ss_pred HHHHhcccc---cchHHHHHhhhhhHHHHHHHHHHHHhc--cCCCCchhhhhHHHHHHHHHHhhhhhhhhhhhhhH
Q psy17567 258 VFVTYAMLP---LPLKWCFIGGCTTSVLHVIITAQIKLH--RGPLSPCSVHEMGALCLVYIGINSAGMYTKYLTDR 328 (362)
Q Consensus 258 If~iYtmLP---l~l~~ail~g~~~Sl~hliv~~~~~~~--~~~~~~~~~~ql~an~ll~lc~NliG~~~~~l~e~ 328 (362)
...+|.++| .+.+.+...|+++++...+....+..+ +..+.+..+..+.+=+++++..++.+...-+=.|.
T Consensus 187 f~~lY~~~P~~k~~~r~~~~Ga~~a~~~w~~~~~~f~~Yv~~~~~y~~~YGslg~vi~lllWlyl~~~ilL~Gael 262 (286)
T PRK01637 187 FWLLYSVVPNKKVPFRHALVGALVAALLFELGKKGFALYITTFPSYQLIYGALAVIPILFVWVYLSWCIVLLGAEI 262 (286)
T ss_pred HHHHHhhcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 347799999 455666665555555555543322211 11222234566677777777777777666665544
No 70
>KOG3054|consensus
Probab=33.25 E-value=1.1e+02 Score=29.54 Aligned_cols=6 Identities=33% Similarity=0.213 Sum_probs=2.4
Q ss_pred Cchhhh
Q psy17567 7 GAGEKK 12 (362)
Q Consensus 7 ~~~~~~ 12 (362)
+.|++|
T Consensus 105 kkK~aK 110 (299)
T KOG3054|consen 105 KKKEAK 110 (299)
T ss_pred hHHHHH
Confidence 334444
No 71
>PLN03223 Polycystin cation channel protein; Provisional
Probab=33.20 E-value=9.5e+02 Score=29.09 Aligned_cols=28 Identities=14% Similarity=-0.053 Sum_probs=19.2
Q ss_pred HHHHHHHHHHhhhh-hhhh-hhhhhHHhHH
Q psy17567 305 MGALCLVYIGINSA-GMYT-KYLTDRGQRR 332 (362)
Q Consensus 305 l~an~ll~lc~Nli-G~~~-~~l~e~~qR~ 332 (362)
=+.+.+.--+-+++ |.|. .+..|...|+
T Consensus 1438 EIvDfm~~rfrslL~g~~~~~~i~~~~~~~ 1467 (1634)
T PLN03223 1438 ELFPMLRDKWRSMFKGWFYKNHIPEARVRR 1467 (1634)
T ss_pred HHHHHHHHHHHHHHhhhcccccCCcHHHHH
Confidence 35566677777888 9888 5666666555
No 72
>PF14617 CMS1: U3-containing 90S pre-ribosomal complex subunit
Probab=33.18 E-value=3.1 Score=39.96 Aligned_cols=9 Identities=11% Similarity=0.272 Sum_probs=3.4
Q ss_pred HhHHHHHHH
Q psy17567 69 KEIEESLKE 77 (362)
Q Consensus 69 ~~~~~~l~e 77 (362)
+.+...+..
T Consensus 56 ~~lad~l~~ 64 (252)
T PF14617_consen 56 ELLADYLAQ 64 (252)
T ss_pred HHHHHHHHH
Confidence 333333333
No 73
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=32.99 E-value=32 Score=36.51 Aligned_cols=9 Identities=11% Similarity=0.165 Sum_probs=3.6
Q ss_pred HHHHHHhhh
Q psy17567 125 RYSHRQRQK 133 (362)
Q Consensus 125 ~Y~~r~~q~ 133 (362)
.|-.-.|+.
T Consensus 497 aFeVy~Rk~ 505 (517)
T PF12569_consen 497 AFEVYLRKG 505 (517)
T ss_pred HhHHHHhcC
Confidence 444444433
No 74
>PF09736 Bud13: Pre-mRNA-splicing factor of RES complex; InterPro: IPR018609 Bud site selection protein 13, also known as pre-mRNA-splicing factor CWC26, belongs to the pre-mRNA retention and splicing (RES) complex. May also be involved in positioning the proximal bud pole signal [, , ]. The presence of RES subunit homologues in numerous eukaryotes suggests that its function is evolutionarily conserved [].
Probab=32.93 E-value=2.6e+02 Score=24.58 Aligned_cols=19 Identities=21% Similarity=0.270 Sum_probs=14.9
Q ss_pred cCHHhHHHHHHHhhhhccC
Q psy17567 66 QSVKEIEESLKERKEKTLT 84 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~~lt 84 (362)
.+|+|+|..|.+...-..+
T Consensus 68 ~dD~~~~~~lk~~~r~~DP 86 (145)
T PF09736_consen 68 ADDEDLNEELKEQERWDDP 86 (145)
T ss_pred hhhHHHHHHHHhcccccCc
Confidence 5889999999888765554
No 75
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=32.77 E-value=5.1 Score=37.60 Aligned_cols=10 Identities=20% Similarity=0.199 Sum_probs=4.0
Q ss_pred hhhhhHHHhh
Q psy17567 10 EKKKRIEKRN 19 (362)
Q Consensus 10 ~~~~~~~~~~ 19 (362)
+.|+-+++.+
T Consensus 67 ~~k~g~QagK 76 (221)
T PF08374_consen 67 PHKKGYQAGK 76 (221)
T ss_pred cccchhhhcc
Confidence 3444444333
No 76
>KOG1015|consensus
Probab=32.67 E-value=15 Score=41.82 Aligned_cols=12 Identities=17% Similarity=0.196 Sum_probs=7.5
Q ss_pred hhhhhhhhhhhh
Q psy17567 315 INSAGMYTKYLT 326 (362)
Q Consensus 315 ~NliG~~~~~l~ 326 (362)
-|++|.+.-|++
T Consensus 876 e~lLGs~~EfrN 887 (1567)
T KOG1015|consen 876 ENLLGSIKEFRN 887 (1567)
T ss_pred cccccCcHHHHH
Confidence 477787765543
No 77
>KOG2062|consensus
Probab=31.86 E-value=20 Score=39.48 Aligned_cols=17 Identities=6% Similarity=0.024 Sum_probs=7.5
Q ss_pred HHhHHHHHHHhhhhccCc
Q psy17567 68 VKEIEESLKERKEKTLTE 85 (362)
Q Consensus 68 ~~~~~~~l~e~~~~~lt~ 85 (362)
-|+.-+++ |.+.++.+-
T Consensus 852 l~NpaRV~-paQ~~~is~ 868 (929)
T KOG2062|consen 852 LDNPARVV-PAQLKYISF 868 (929)
T ss_pred hcChhhcc-hhhcceeee
Confidence 34444443 344445443
No 78
>PF05793 TFIIF_alpha: Transcription initiation factor IIF, alpha subunit (TFIIF-alpha); InterPro: IPR008851 Transcription initiation factor IIF, alpha subunit (TFIIF-alpha) or RNA polymerase II-associating protein 74 (RAP74) is the large subunit of transcription factor IIF (TFIIF), which is essential for accurate initiation and stimulates elongation by RNA polymerase II [].; GO: 0003677 DNA binding, 0045893 positive regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1F3U_F 1NHA_A 1I27_A 1J2X_A 2K7L_A 1ONV_A.
Probab=31.13 E-value=16 Score=38.77 Aligned_cols=6 Identities=0% Similarity=0.058 Sum_probs=0.0
Q ss_pred HHhHHH
Q psy17567 68 VKEIEE 73 (362)
Q Consensus 68 ~~~~~~ 73 (362)
+.|+|.
T Consensus 362 ~sD~d~ 367 (527)
T PF05793_consen 362 DSDSDS 367 (527)
T ss_dssp ------
T ss_pred cccccc
Confidence 344444
No 79
>COG3356 Predicted membrane protein [Function unknown]
Probab=28.89 E-value=3.7e+02 Score=29.04 Aligned_cols=18 Identities=28% Similarity=0.416 Sum_probs=14.6
Q ss_pred CcchHHhHHHHHHHHhhh
Q psy17567 116 EPSLEMSYQRYSHRQRQK 133 (362)
Q Consensus 116 s~~LE~LYq~Y~~r~~q~ 133 (362)
.++.|+.|-+|+.++..+
T Consensus 3 t~~~~rsys~~~~~~p~s 20 (578)
T COG3356 3 TRKLERSYSKYFKTLPKS 20 (578)
T ss_pred cccccchHhhhhccCCCc
Confidence 357789999999999933
No 80
>PF15333 TAF1D: TATA box-binding protein-associated factor 1D
Probab=28.81 E-value=29 Score=32.36 Aligned_cols=10 Identities=30% Similarity=0.401 Sum_probs=5.7
Q ss_pred cccccccCcc
Q psy17567 102 YKGIYCPSLT 111 (362)
Q Consensus 102 ~~g~~~p~L~ 111 (362)
-||.-+|++.
T Consensus 100 dkg~~FPf~E 109 (217)
T PF15333_consen 100 DKGFQFPFVE 109 (217)
T ss_pred hcCCCCcchh
Confidence 3566666663
No 81
>PRK11644 sensory histidine kinase UhpB; Provisional
Probab=28.61 E-value=6.9e+02 Score=25.99 Aligned_cols=9 Identities=33% Similarity=0.704 Sum_probs=3.8
Q ss_pred HHHHHHHHh
Q psy17567 123 YQRYSHRQR 131 (362)
Q Consensus 123 Yq~Y~~r~~ 131 (362)
-+||....+
T Consensus 97 ~~~~~~~~~ 105 (495)
T PRK11644 97 ASRYRHQRD 105 (495)
T ss_pred HHHhcchhh
Confidence 344444433
No 82
>PF15049 DUF4534: Protein of unknown function (DUF4534)
Probab=27.32 E-value=4e+02 Score=24.07 Aligned_cols=44 Identities=23% Similarity=0.329 Sum_probs=30.1
Q ss_pred ccchhHHHHHHHHHHHHHHHHHhhh---hhh-cchhhHHHHHHHHHHH
Q psy17567 164 IDSTKLTWTVYTMMSNLAMCLLGWW---RCF-ANNYLQWAAVGTWILL 207 (362)
Q Consensus 164 ~~~~~~~~~~~~l~~~l~L~vl~~~---~~~-~~~~l~~~~~~~wi~l 207 (362)
..+.+++|...-=+++.++-++.+. +.+ --++++|+..++=+++
T Consensus 83 GlliY~iWIi~YE~~n~~i~lLtn~~i~~~f~ev~~l~w~g~iSRm~l 130 (163)
T PF15049_consen 83 GLLIYAIWIIFYELINFSIQLLTNGTIKEQFKEVRYLRWFGQISRMLL 130 (163)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCcccchhHHHHHHHHHHHHHHHH
Confidence 3466777777777778788777763 222 2478899988877764
No 83
>PF13493 DUF4118: Domain of unknown function (DUF4118); PDB: 2KSF_A.
Probab=27.12 E-value=90 Score=24.85 Aligned_cols=26 Identities=23% Similarity=0.319 Sum_probs=19.4
Q ss_pred HHHHHHHHhhhhhhhhhhhhhHHhHH
Q psy17567 307 ALCLVYIGINSAGMYTKYLTDRGQRR 332 (362)
Q Consensus 307 an~ll~lc~NliG~~~~~l~e~~qR~ 332 (362)
.+.+.++...++|+.+-+++|+.+||
T Consensus 80 ~~~~~~~~~l~va~v~g~l~~~~r~~ 105 (105)
T PF13493_consen 80 QDWITFAVFLVVALVTGYLADRYRRQ 105 (105)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH---
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 46777888888999999999988875
No 84
>PF01086 Clathrin_lg_ch: Clathrin light chain; InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=27.02 E-value=1.1e+02 Score=28.75 Aligned_cols=16 Identities=13% Similarity=0.366 Sum_probs=0.0
Q ss_pred cCccccccccccc--ccc
Q psy17567 83 LTEDSQCDTMIPV--FKK 98 (362)
Q Consensus 83 lt~~~~~~~~~~~--f~~ 98 (362)
...+..|.++... |..
T Consensus 180 ~~~~t~WerV~~Lid~~~ 197 (225)
T PF01086_consen 180 LQPGTEWERVAKLIDFNP 197 (225)
T ss_dssp ------------------
T ss_pred CCCCCcHHHHHHHhCCCC
Confidence 3456778888883 654
No 85
>PF06027 DUF914: Eukaryotic protein of unknown function (DUF914); InterPro: IPR009262 This family consists of several hypothetical proteins of unknown function. Some of the sequences in this family are annotated as putative membrane proteins.
Probab=26.75 E-value=6.6e+02 Score=25.15 Aligned_cols=66 Identities=18% Similarity=0.184 Sum_probs=38.0
Q ss_pred HHHHHhcccccchHHH----HHhhhhhHHHHHHHHHHHHhccCCCCchhhhhHHHHHHHHHHhhhhhhhhhhhhhHHhH
Q psy17567 257 IVFVTYAMLPLPLKWC----FIGGCTTSVLHVIITAQIKLHRGPLSPCSVHEMGALCLVYIGINSAGMYTKYLTDRGQR 331 (362)
Q Consensus 257 iIf~iYtmLPl~l~~a----il~g~~~Sl~hliv~~~~~~~~~~~~~~~~~ql~an~ll~lc~NliG~~~~~l~e~~qR 331 (362)
.+|+.|++.|+-++.+ +-+|.++|..|-++...+. +.. ...+..+++- +.-++|.+.+...+....
T Consensus 242 ~lf~~y~l~p~~l~~ssAt~~nLsLLTsd~~ali~~i~~-f~~---~~~~ly~~af-----~lIiiG~vvy~~~~~~~~ 311 (334)
T PF06027_consen 242 CLFLFYSLVPIVLRMSSATFFNLSLLTSDFYALIIDIFF-FGY---KFSWLYILAF-----ALIIIGFVVYNLAESPEE 311 (334)
T ss_pred HHHHHHHHHHHHHHhCccceeehHHHHhhHHHHHHHHHh-cCc---cccHHHHHHH-----HHHHHHhheEEccCCccc
Confidence 3688899999986533 4467777777766644321 111 1122233332 233678888877765443
No 86
>COG3083 Predicted hydrolase of alkaline phosphatase superfamily [General function prediction only]
Probab=26.70 E-value=6.7e+02 Score=27.03 Aligned_cols=10 Identities=30% Similarity=0.650 Sum_probs=5.9
Q ss_pred HHHhhhhhhh
Q psy17567 228 TWILLNTQGF 237 (362)
Q Consensus 228 ~~v~~~~q~~ 237 (362)
+++++|.+++
T Consensus 99 tLLlvD~evf 108 (600)
T COG3083 99 TLLLVDSEVF 108 (600)
T ss_pred HHHhhhHHHH
Confidence 4455676655
No 87
>PF05087 Rota_VP2: Rotavirus VP2 protein; InterPro: IPR007779 Rotavirus particles consist of three concentric proteinaceous capsid layers. The innermost capsid (core) is made of VP2. The genomic RNA and the two minor proteins VP1 and VP3 are encapsidated within this layer []. The N terminus of rotavirus VP2 is necessary for the encapsidation of VP1 and VP3 [].; GO: 0003723 RNA binding, 0019013 viral nucleocapsid; PDB: 3N09_A 3GZU_A 3KZ4_A.
Probab=26.68 E-value=37 Score=36.77 Aligned_cols=18 Identities=28% Similarity=0.218 Sum_probs=9.6
Q ss_pred CHHhHH-HHHHHhhhhccC
Q psy17567 67 SVKEIE-ESLKERKEKTLT 84 (362)
Q Consensus 67 ~~~~~~-~~l~e~~~~~lt 84 (362)
-..||| +++.-..+...+
T Consensus 89 hqKEIqyEIlqKtIPTFep 107 (887)
T PF05087_consen 89 HQKEIQYEILQKTIPTFEP 107 (887)
T ss_dssp HHHHHHHHHHHTTS-B-SS
T ss_pred HHHHHHHHHHHhhCCCCCc
Confidence 556777 666655554443
No 88
>PF01086 Clathrin_lg_ch: Clathrin light chain; InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=26.22 E-value=99 Score=29.09 Aligned_cols=9 Identities=11% Similarity=0.287 Sum_probs=0.0
Q ss_pred ccccccccc
Q psy17567 88 QCDTMIPVF 96 (362)
Q Consensus 88 ~~~~~~~~f 96 (362)
.-++|-.+.
T Consensus 205 D~sRmR~iL 213 (225)
T PF01086_consen 205 DVSRMREIL 213 (225)
T ss_dssp ---------
T ss_pred cHHHHHHHH
Confidence 334444443
No 89
>KOG2256|consensus
Probab=26.15 E-value=20 Score=38.77 Aligned_cols=8 Identities=38% Similarity=0.467 Sum_probs=3.7
Q ss_pred hcccccch
Q psy17567 262 YAMLPLPL 269 (362)
Q Consensus 262 YtmLPl~l 269 (362)
+.++|+++
T Consensus 439 ~qy~PLRl 446 (661)
T KOG2256|consen 439 PQYYPLRL 446 (661)
T ss_pred ccchhHHH
Confidence 34445553
No 90
>PRK04214 rbn ribonuclease BN/unknown domain fusion protein; Reviewed
Probab=25.77 E-value=3e+02 Score=28.09 Aligned_cols=31 Identities=26% Similarity=0.510 Sum_probs=22.7
Q ss_pred HHHHhcccc---cchHHHHHhhhhhHHHHHHHHH
Q psy17567 258 VFVTYAMLP---LPLKWCFIGGCTTSVLHVIITA 288 (362)
Q Consensus 258 If~iYtmLP---l~l~~ail~g~~~Sl~hliv~~ 288 (362)
...+|.++| +++++++..|++.++...+.-+
T Consensus 194 f~~lY~~~Pn~~v~~r~al~Gai~a~vl~~~~~~ 227 (412)
T PRK04214 194 LTLLYRVVPNHFVPLRHALPGALLTAVLLELVKW 227 (412)
T ss_pred HHHHHHHcCCCccchHHhHHHHHHHHHHHHHHHH
Confidence 346799999 5578888888888777766544
No 91
>KOG1070|consensus
Probab=25.68 E-value=29 Score=40.85 Aligned_cols=14 Identities=14% Similarity=0.351 Sum_probs=10.5
Q ss_pred cCHHhHHHHHHHhh
Q psy17567 66 QSVKEIEESLKERK 79 (362)
Q Consensus 66 ~~~~~~~~~l~e~~ 79 (362)
++.|||++.+....
T Consensus 1442 esaeDferlvrssP 1455 (1710)
T KOG1070|consen 1442 ESAEDFERLVRSSP 1455 (1710)
T ss_pred cCHHHHHHHHhcCC
Confidence 58899998876543
No 92
>KOG1098|consensus
Probab=25.49 E-value=37 Score=36.90 Aligned_cols=25 Identities=28% Similarity=0.405 Sum_probs=13.3
Q ss_pred HhhHHHHhhhhhhHHHHhhhhhhHH
Q psy17567 25 EENEQEEKEKEDVEEEKNKKKKKKK 49 (362)
Q Consensus 25 ~e~~e~~~~~e~~~~e~~~~~~~~~ 49 (362)
.|+||.++..++-++.+++.-|+|+
T Consensus 340 teEeeld~~l~~l~E~e~~~lkRkk 364 (780)
T KOG1098|consen 340 TEEEELDKLLAELSEDEKAALKRKK 364 (780)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555666666555555444443
No 93
>TIGR00927 2A1904 K+-dependent Na+/Ca+ exchanger.
Probab=24.99 E-value=15 Score=41.62 Aligned_cols=11 Identities=18% Similarity=0.244 Sum_probs=6.0
Q ss_pred ccccccccccc
Q psy17567 87 SQCDTMIPVFK 97 (362)
Q Consensus 87 ~~~~~~~~~f~ 97 (362)
+.|.....+|.
T Consensus 901 ~~~~~~~wvl~ 911 (1096)
T TIGR00927 901 TRQKQAIYLFL 911 (1096)
T ss_pred cccceeEeEEe
Confidence 44566555554
No 94
>KOG0299|consensus
Probab=24.88 E-value=60 Score=33.79 Aligned_cols=7 Identities=0% Similarity=0.321 Sum_probs=4.1
Q ss_pred cCHHhHH
Q psy17567 66 QSVKEIE 72 (362)
Q Consensus 66 ~~~~~~~ 72 (362)
.|++|+|
T Consensus 94 ~d~~~~~ 100 (479)
T KOG0299|consen 94 FDADDLD 100 (479)
T ss_pred cccchhh
Confidence 4556665
No 95
>KOG0949|consensus
Probab=24.83 E-value=1.2e+02 Score=35.09 Aligned_cols=20 Identities=40% Similarity=0.433 Sum_probs=14.0
Q ss_pred ccccccccCccCcCCCcchH
Q psy17567 101 LYKGIYCPSLTNSFREPSLE 120 (362)
Q Consensus 101 ~~~g~~~p~L~~kF~s~~LE 120 (362)
..-|+++|.=+.-|--++|-
T Consensus 998 LsLGiNMPCrTVvF~gDsLQ 1017 (1330)
T KOG0949|consen 998 LSLGINMPCRTVVFAGDSLQ 1017 (1330)
T ss_pred hhcccCCCceeEEEeccccc
Confidence 45688888877777666654
No 96
>PF12757 DUF3812: Protein of unknown function (DUF3812); InterPro: IPR024527 This family of fungal proteins represents the eisosome 1 family. Eisosome protein 1 is required for normal formation of eisosomes, large cytoplasmic protein assemblies that localize to specialised domains on plasma membrane and mark the site of endocytosis [].
Probab=24.68 E-value=1.1e+02 Score=26.22 Aligned_cols=19 Identities=26% Similarity=0.435 Sum_probs=9.9
Q ss_pred hhHHHhhhHHHHHhhHHHH
Q psy17567 13 KRIEKRNKEEEKEENEQEE 31 (362)
Q Consensus 13 ~~~~~~~~~~~e~e~~e~~ 31 (362)
.++|++|..+++.+.++++
T Consensus 91 erae~qRa~d~e~k~~~~~ 109 (126)
T PF12757_consen 91 ERAEAQRARDEEIKLDEEE 109 (126)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4556666655555444443
No 97
>PF06645 SPC12: Microsomal signal peptidase 12 kDa subunit (SPC12); InterPro: IPR009542 This family consists of several microsomal signal peptidase 12 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains. This family represents the 12 kDa subunit (SPC12).; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=24.58 E-value=2.1e+02 Score=22.27 Aligned_cols=12 Identities=25% Similarity=0.844 Sum_probs=6.7
Q ss_pred hhhhhcchhhHH
Q psy17567 187 WWRCFANNYLQW 198 (362)
Q Consensus 187 ~~~~~~~~~l~~ 198 (362)
.|..+.++.++|
T Consensus 55 ~Wp~y~r~p~~W 66 (76)
T PF06645_consen 55 PWPFYNRHPLKW 66 (76)
T ss_pred CcHhhcCCcccC
Confidence 566555555555
No 98
>COG4901 Ribosomal protein S25 [Translation, ribosomal structure and biogenesis]
Probab=24.08 E-value=55 Score=27.34 Aligned_cols=23 Identities=9% Similarity=0.127 Sum_probs=15.9
Q ss_pred hcccccCHHhHHHHHHHhhhhcc
Q psy17567 61 KKKEEQSVKEIEESLKERKEKTL 83 (362)
Q Consensus 61 ~~~~~~~~~~~~~~l~e~~~~~l 83 (362)
+....=|++-+|++++|..-...
T Consensus 37 ~~av~vdee~~~ki~KEV~~~r~ 59 (107)
T COG4901 37 RRAVTVDEELLDKIRKEVPRERV 59 (107)
T ss_pred hhhhhccHHHHHHHHHhccccee
Confidence 33344577889999999876443
No 99
>COG5116 RPN2 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=23.96 E-value=20 Score=38.57 Aligned_cols=34 Identities=26% Similarity=0.487 Sum_probs=19.2
Q ss_pred cCHHhHHHHHHHhhhhccCccccccccccc--cccccc
Q psy17567 66 QSVKEIEESLKERKEKTLTEDSQCDTMIPV--FKKGVL 101 (362)
Q Consensus 66 ~~~~~~~~~l~e~~~~~lt~~~~~~~~~~~--f~~g~~ 101 (362)
+-.|++.+++- ++..+-+= --=+++.|| |+.|+.
T Consensus 849 ~~vdn~trilp-~q~~yisf-~~d~r~~pvrkf~ggvv 884 (926)
T COG5116 849 YKVDNMTRILP-QQSRYISF-IKDDRFVPVRKFKGGVV 884 (926)
T ss_pred cchhhhhhhcc-ccCceeEE-eeCCceEEeEeecCcEE
Confidence 46788888874 33333321 112567775 777554
No 100
>PRK07668 hypothetical protein; Validated
Probab=23.70 E-value=6.7e+02 Score=24.21 Aligned_cols=17 Identities=12% Similarity=-0.144 Sum_probs=6.7
Q ss_pred hhhhhcchhhHHHHHHHH
Q psy17567 187 WWRCFANNYLQWAAVGTW 204 (362)
Q Consensus 187 ~~~~~~~~~l~~~~~~~w 204 (362)
+...| +....|-..+.|
T Consensus 129 r~~~f-k~~~~~~~~i~~ 145 (254)
T PRK07668 129 RMASF-KSKLTEKWFLII 145 (254)
T ss_pred HHHHH-hccchhHHHHHH
Confidence 44434 333444333333
No 101
>KOG3915|consensus
Probab=23.43 E-value=1.8e+02 Score=30.64 Aligned_cols=18 Identities=44% Similarity=0.523 Sum_probs=7.6
Q ss_pred hHHHHHHHHHhhhhhccc
Q psy17567 47 KKKKKKKKKKKKKKKKKE 64 (362)
Q Consensus 47 ~~~~~~k~~~~~~~~~~~ 64 (362)
|+-||-||-|++++++-+
T Consensus 563 kr~kkEkk~k~k~qe~L~ 580 (641)
T KOG3915|consen 563 KRLKKEKKAKRKLQEALE 580 (641)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 443344443444444433
No 102
>KOG3045|consensus
Probab=23.36 E-value=22 Score=34.75 Aligned_cols=14 Identities=21% Similarity=0.088 Sum_probs=7.6
Q ss_pred cchHHhHHHHHHHH
Q psy17567 117 PSLEMSYQRYSHRQ 130 (362)
Q Consensus 117 ~~LE~LYq~Y~~r~ 130 (362)
+.+=.+|+.=|..|
T Consensus 146 p~afdlYH~gfr~Q 159 (325)
T KOG3045|consen 146 PTAFDLYHAGFRSQ 159 (325)
T ss_pred cHHHHHHHHHHHHH
Confidence 35556676655444
No 103
>PF13829 DUF4191: Domain of unknown function (DUF4191)
Probab=23.23 E-value=4.2e+02 Score=25.14 Aligned_cols=20 Identities=10% Similarity=0.227 Sum_probs=14.0
Q ss_pred chHHhHHHHHHHHhhhHHHH
Q psy17567 118 SLEMSYQRYSHRQRQKSLII 137 (362)
Q Consensus 118 ~LE~LYq~Y~~r~~q~sL~v 137 (362)
-+-.+.|.|....++.+-+.
T Consensus 9 ~~~Qi~q~y~~trk~dp~l~ 28 (224)
T PF13829_consen 9 RRKQIWQAYKMTRKEDPKLP 28 (224)
T ss_pred HHHHHHHHHHHHHHHCcchH
Confidence 45677888988877765443
No 104
>KOG2138|consensus
Probab=23.02 E-value=42 Score=36.80 Aligned_cols=7 Identities=14% Similarity=0.193 Sum_probs=2.8
Q ss_pred CHHhHHH
Q psy17567 67 SVKEIEE 73 (362)
Q Consensus 67 ~~~~~~~ 73 (362)
|-|+=|+
T Consensus 851 Ssd~sdk 857 (883)
T KOG2138|consen 851 SSDSSDK 857 (883)
T ss_pred ccccchh
Confidence 4444343
No 105
>PHA02644 hypothetical protein; Provisional
Probab=22.95 E-value=6.7 Score=31.46 Aligned_cols=13 Identities=38% Similarity=0.547 Sum_probs=5.8
Q ss_pred hcccc-cCHHhHHH
Q psy17567 61 KKKEE-QSVKEIEE 73 (362)
Q Consensus 61 ~~~~~-~~~~~~~~ 73 (362)
+.+++ +++.||+.
T Consensus 91 pekpenedekefed 104 (112)
T PHA02644 91 PEKPENEDEKEFED 104 (112)
T ss_pred CCCCCCcchhhhcc
Confidence 33444 44455543
No 106
>PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional
Probab=22.16 E-value=2.4e+02 Score=29.00 Aligned_cols=7 Identities=14% Similarity=0.520 Sum_probs=2.9
Q ss_pred hHHHHHH
Q psy17567 70 EIEESLK 76 (362)
Q Consensus 70 ~~~~~l~ 76 (362)
++|-.|.
T Consensus 258 ~~dd~~~ 264 (387)
T PRK09510 258 EVDDLFG 264 (387)
T ss_pred HHHHHhh
Confidence 3444443
No 107
>KOG2507|consensus
Probab=22.16 E-value=66 Score=33.40 Aligned_cols=9 Identities=22% Similarity=0.490 Sum_probs=3.5
Q ss_pred cCccCcCCC
Q psy17567 108 PSLTNSFRE 116 (362)
Q Consensus 108 p~L~~kF~s 116 (362)
||=+..|..
T Consensus 360 pyPRReft~ 368 (506)
T KOG2507|consen 360 PYPRREFTD 368 (506)
T ss_pred ccccccccc
Confidence 333334433
No 108
>TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1. All proteins in this family for which functions are known are cyclin dependent protein kinases that are components of TFIIH, a complex that is involved in nucleotide excision repair and transcription initiation. Also known as MAT1 (menage a trois 1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=21.87 E-value=1.4e+02 Score=29.76 Aligned_cols=51 Identities=27% Similarity=0.305 Sum_probs=0.0
Q ss_pred hhhhhhHHHhhhHHHHHhhHHHHhhhhhhHH-------------------HHhhhhhhHHHHHHHHHhhh
Q psy17567 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEE-------------------EKNKKKKKKKKKKKKKKKKK 59 (362)
Q Consensus 9 ~~~~~~~~~~~~~~~e~e~~e~~~~~e~~~~-------------------e~~~~~~~~~~~~k~~~~~~ 59 (362)
+..+..+++||++..+++++|+...+.+.++ .+++..+|++-.+++.-++.
T Consensus 152 ~~E~~~~~~rr~~~~~~e~ee~~~~~~~~~~~ld~L~~s~~~~~~~i~~~~~~~~~~k~~~~r~~~~~~~ 221 (309)
T TIGR00570 152 EFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPN 221 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHhhhhhccccchh
No 109
>PF08243 SPT2: SPT2 chromatin protein; InterPro: IPR013256 This entry includes the Saccharomyces cerevisiae (Baker's yeast) protein SPT2 which is a chromatin protein involved in transcriptional regulation []. These proteins shows conservation of several domains across numerous species, including having a cluster of positively charged amino acids. This cluster probably functions in the binding properties of the proteins []. Sin1p/Spt2p probably modulates the local chromatin structure by binding two strands of double-stranded DNA at their crossover point. Sin1p/Spt2p has sequence similarity to HMG1 and serves as a negative transcriptional regulator of a small family of genes that are activated by the SWI/SNF chromatin-remodelling complex. It is also involved in maintaining the integrity of chromatin during transcription elongation. Sin1p/Spt2 is required for, and is directly involved in, the efficient recruitment of the mRNA cleavage/polyadenylation complex []. Spt2 is also involved in regulating levels of histone H3 over transcribed regions [].
Probab=21.75 E-value=1.3e+02 Score=25.55 Aligned_cols=13 Identities=31% Similarity=0.486 Sum_probs=5.4
Q ss_pred hhhhhHHHHhhhh
Q psy17567 33 EKEDVEEEKNKKK 45 (362)
Q Consensus 33 ~~e~~~~e~~~~~ 45 (362)
..|+.+++...+.
T Consensus 93 ~~ED~~e~~~e~~ 105 (116)
T PF08243_consen 93 RLEDEREERREEE 105 (116)
T ss_pred HhhhHHHHHHHHH
Confidence 4444444444333
No 110
>PF07701 HNOBA: Heme NO binding associated; InterPro: IPR011645 The HNOBA (Haem NO Binding) domain is found associated with the HNOB domain and IPR001054 from INTERPRO in soluble cyclases and signalling proteins. The HNOB domain is predicted to function as a haem-dependent sensor for gaseous ligands, and transduce diverse downstream signals in both bacteria and animals.; GO: 0004383 guanylate cyclase activity, 0006182 cGMP biosynthetic process; PDB: 2P04_B 2P08_A 3HLS_E.
Probab=21.30 E-value=2.3e+02 Score=26.54 Aligned_cols=44 Identities=14% Similarity=0.126 Sum_probs=30.6
Q ss_pred hhhhhhhhhHHhHHHHHHHHHHHHHHHHhhhHHHHHHHHhhhcC
Q psy17567 319 GMYTKYLTDRGQRRAFLETHRSTETRMKTQKENDQQEKLLLSGM 362 (362)
Q Consensus 319 G~~~~~l~e~~qR~aFLetrq~i~~rl~Le~E~~qQErLLLSVL 362 (362)
|....-..+.+.+...-.+.+.=++...++.|+++-++||.|+|
T Consensus 168 ~~q~~a~~~l~~~le~~~~~~Le~~~~~l~~ek~ktd~LL~~ml 211 (219)
T PF07701_consen 168 GQQQSAELKLAKQLEQEKSAELEESMRELEEEKKKTDELLYSML 211 (219)
T ss_dssp HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHTS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 44444444555555545555566778899999999999999986
No 111
>PF01698 FLO_LFY: Floricaula / Leafy protein; InterPro: IPR002910 This family consists of various plant development proteins which are homologues of Floricaula (FLO) and leafy (LFY) proteins which are floral meristem identity proteins. Mutations in the sequences of these proteins affect flower and leaf development.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2VY1_A 2VY2_A.
Probab=20.88 E-value=55 Score=33.39 Aligned_cols=22 Identities=18% Similarity=0.412 Sum_probs=12.0
Q ss_pred cccccccccccccCccCcCCCcchHHhHHHHH
Q psy17567 96 FKKGVLYKGIYCPSLTNSFREPSLEMSYQRYS 127 (362)
Q Consensus 96 f~~g~~~~g~~~p~L~~kF~s~~LE~LYq~Y~ 127 (362)
=+.|-++||. -.+|+=||..|-
T Consensus 236 TEPGEVARGK----------KNGLDYLFHLYE 257 (386)
T PF01698_consen 236 TEPGEVARGK----------KNGLDYLFHLYE 257 (386)
T ss_dssp --TT----SS-----------B-HHHHHHHHH
T ss_pred cccchhhccc----------ccCcchHHHHHH
Confidence 3999999994 259999999996
No 112
>KOG2662|consensus
Probab=20.61 E-value=9.6e+02 Score=24.88 Aligned_cols=16 Identities=25% Similarity=0.191 Sum_probs=13.2
Q ss_pred CCCcchHHhHHHHHHH
Q psy17567 114 FREPSLEMSYQRYSHR 129 (362)
Q Consensus 114 F~s~~LE~LYq~Y~~r 129 (362)
-+-+++|.|-+.|+.+
T Consensus 297 dd~eElEMLLEaYf~q 312 (414)
T KOG2662|consen 297 DDVEELEMLLEAYFMQ 312 (414)
T ss_pred ccHHHHHHHHHHHHHH
Confidence 4557899999999976
No 113
>KOG3142|consensus
Probab=20.27 E-value=6.9e+02 Score=23.06 Aligned_cols=28 Identities=18% Similarity=0.232 Sum_probs=19.5
Q ss_pred CHHhHHHHHHHhhhhccCcccccccccc
Q psy17567 67 SVKEIEESLKERKEKTLTEDSQCDTMIP 94 (362)
Q Consensus 67 ~~~~~~~~l~e~~~~~lt~~~~~~~~~~ 94 (362)
+-+++-....+..+..+++...|..+.+
T Consensus 18 ~~~~~~~~~~~~~~~~lst~RpW~ef~d 45 (187)
T KOG3142|consen 18 SVESISSRAKQTIQSGLSTRRPWSEFFD 45 (187)
T ss_pred chhhHHHHHHHHHHHHHhccCCHHHHHc
Confidence 3455666677777777777766887766
No 114
>smart00441 FF Contains two conserved F residues. A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Probab=20.18 E-value=74 Score=22.41 Aligned_cols=50 Identities=22% Similarity=0.282 Sum_probs=37.1
Q ss_pred HHhHHHHHHHhhhhccCccccccccccccccccccccccccCccCcCCCcchHHhHHHHH
Q psy17567 68 VKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYS 127 (362)
Q Consensus 68 ~~~~~~~l~e~~~~~lt~~~~~~~~~~~f~~g~~~~g~~~p~L~~kF~s~~LE~LYq~Y~ 127 (362)
-++|...|++... .+.++.|+...+.+.....|+.+ ..+..-|.+|+.|.
T Consensus 4 ~~~F~~LL~e~~~--~~~~~~W~~~~~~~~~d~ry~~l--------~~~~~r~~lF~~~i 53 (55)
T smart00441 4 KEAFKELLKEHEV--ITPDTTWSEARKKLKNDPRYKAL--------LSESEREQLFEDHI 53 (55)
T ss_pred HHHHHHHHHhCCC--CCCCCcHHHHHHHHhcChHHHHh--------cChHHHHHHHHHHH
Confidence 4678888988743 45789999999998776666543 44567778888885
No 115
>KOG1059|consensus
Probab=20.06 E-value=35 Score=37.59 Aligned_cols=20 Identities=60% Similarity=0.624 Sum_probs=10.2
Q ss_pred hhHHHHHHHHHhhhhhcccc
Q psy17567 46 KKKKKKKKKKKKKKKKKKEE 65 (362)
Q Consensus 46 ~~~~~~~k~~~~~~~~~~~~ 65 (362)
||||||++||+++.+....+
T Consensus 850 KkKskk~~k~~t~a~~~~~~ 869 (877)
T KOG1059|consen 850 KKKSKKKNKKKTKAKNVPEP 869 (877)
T ss_pred hhhhcccchhhhhhhcccCc
Confidence 34444444555666655543
Done!