Query psy15308
Match_columns 270
No_of_seqs 177 out of 1154
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 17:42:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15308.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15308hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1690|consensus 100.0 1.3E-47 2.9E-52 312.4 20.8 205 63-270 7-213 (215)
2 KOG1692|consensus 100.0 3.4E-44 7.3E-49 291.5 17.3 186 70-270 14-199 (201)
3 KOG1691|consensus 100.0 1.3E-41 2.8E-46 281.7 23.9 202 61-270 7-208 (210)
4 KOG1693|consensus 100.0 1.1E-39 2.5E-44 266.5 19.8 192 65-270 12-205 (209)
5 KOG3287|consensus 100.0 1.5E-34 3.2E-39 239.1 21.4 184 74-270 33-227 (236)
6 PF01105 EMP24_GP25L: emp24/gp 100.0 5.5E-37 1.2E-41 255.5 0.5 180 76-267 1-183 (183)
7 KOG1690|consensus 98.3 1.6E-06 3.4E-11 72.0 7.3 40 24-64 6-45 (215)
8 KOG1693|consensus 95.9 0.74 1.6E-05 38.8 14.9 167 39-251 23-197 (209)
9 PF00630 Filamin: Filamin/ABP2 88.7 4.1 8.9E-05 29.8 8.3 43 122-164 42-91 (101)
10 PF04151 PPC: Bacterial pre-pe 87.6 2.4 5.2E-05 29.2 6.0 61 87-161 3-68 (70)
11 PF01835 A2M_N: MG2 domain; I 86.8 10 0.00022 27.7 11.1 66 95-164 13-86 (99)
12 smart00557 IG_FLMN Filamin-typ 86.4 5.3 0.00012 29.2 7.6 43 122-164 32-77 (93)
13 KOG1692|consensus 84.0 23 0.00051 29.8 10.8 73 39-149 20-92 (201)
14 PF01105 EMP24_GP25L: emp24/gp 80.2 0.54 1.2E-05 38.2 0.0 25 39-63 1-25 (183)
15 PF11589 DUF3244: Domain of un 74.0 12 0.00026 28.1 5.9 44 122-165 48-95 (106)
16 PF05738 Cna_B: Cna protein B- 73.0 8.6 0.00019 26.1 4.5 42 123-164 3-46 (70)
17 PF09315 DUF1973: Domain of un 71.8 53 0.0012 27.3 9.7 43 123-165 42-88 (179)
18 PF13860 FlgD_ig: FlgD Ig-like 71.7 24 0.00053 25.0 6.8 32 97-140 12-43 (81)
19 PF13473 Cupredoxin_1: Cupredo 69.8 13 0.00029 27.6 5.3 16 144-159 75-90 (104)
20 PF04728 LPP: Lipoprotein leuc 69.7 30 0.00066 23.1 6.9 44 197-240 4-47 (56)
21 PF10779 XhlA: Haemolysin XhlA 69.5 35 0.00075 23.7 9.0 19 198-216 8-26 (71)
22 PF13620 CarboxypepD_reg: Carb 66.4 7.8 0.00017 27.1 3.2 42 123-164 16-57 (82)
23 PF13897 GOLD_2: Golgi-dynamic 65.5 9.8 0.00021 30.2 3.8 30 148-180 104-133 (136)
24 PRK15396 murein lipoprotein; P 62.4 50 0.0011 23.6 6.7 45 196-240 25-69 (78)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 59.1 50 0.0011 21.9 7.7 56 122-180 8-65 (66)
26 PRK09973 putative outer membra 56.8 66 0.0014 23.4 6.5 46 196-241 24-69 (85)
27 PRK06655 flgD flagellar basal 55.9 47 0.001 28.7 6.8 42 122-163 126-178 (225)
28 KOG3202|consensus 52.3 1.3E+02 0.0028 26.3 8.9 67 195-261 151-234 (235)
29 TIGR03503 conserved hypothetic 51.0 1.6E+02 0.0034 27.6 9.7 43 121-163 241-286 (374)
30 PRK12813 flgD flagellar basal 51.0 76 0.0016 27.5 7.2 57 96-164 110-174 (223)
31 PF12690 BsuPI: Intracellular 49.1 23 0.0005 25.4 3.2 20 121-140 23-42 (82)
32 COG1723 Uncharacterized conser 46.1 25 0.00055 32.0 3.6 55 200-260 270-324 (331)
33 KOG2861|consensus 45.0 43 0.00092 31.6 5.1 54 201-260 338-391 (399)
34 PF05377 FlaC_arch: Flagella a 44.8 92 0.002 20.7 6.1 32 198-229 2-33 (55)
35 PRK12812 flgD flagellar basal 43.6 98 0.0021 27.4 6.9 41 122-162 141-192 (259)
36 PF06703 SPC25: Microsomal sig 43.1 44 0.00095 27.1 4.4 18 15-32 25-42 (162)
37 PF05739 SNARE: SNARE domain; 42.0 99 0.0021 20.3 6.9 45 196-240 4-48 (63)
38 KOG0518|consensus 39.7 57 0.0012 34.4 5.3 45 121-165 882-929 (1113)
39 COG5415 Predicted integral mem 38.7 2.6E+02 0.0055 24.1 9.4 31 196-226 15-45 (251)
40 PF09753 Use1: Membrane fusion 38.2 2.3E+02 0.005 24.7 8.5 22 242-264 228-249 (251)
41 PRK05842 flgD flagellar basal 38.0 68 0.0015 29.0 5.1 42 122-163 166-220 (295)
42 PF04136 Sec34: Sec34-like fam 38.0 2.1E+02 0.0046 23.1 7.6 55 196-250 28-82 (157)
43 PRK14081 triple tyrosine motif 37.8 1.9E+02 0.0041 29.3 8.6 56 124-181 418-476 (667)
44 PRK12634 flgD flagellar basal 37.0 1.2E+02 0.0025 26.2 6.2 41 122-162 122-173 (221)
45 PF10731 Anophelin: Thrombin i 36.8 22 0.00047 24.0 1.3 20 18-37 3-22 (65)
46 PF01519 DUF16: Protein of unk 36.4 1.9E+02 0.004 21.8 6.8 46 195-240 52-97 (102)
47 PF10528 PA14_2: GLEYA domain; 36.3 1E+02 0.0022 23.5 5.2 46 82-139 55-102 (113)
48 PF07210 DUF1416: Protein of u 35.2 1.8E+02 0.0038 21.2 6.4 41 123-164 23-63 (85)
49 PF07835 COX4_pro_2: Bacterial 34.7 1.1E+02 0.0024 19.3 4.2 26 231-256 16-41 (44)
50 PF10805 DUF2730: Protein of u 34.5 1.4E+02 0.003 22.4 5.7 39 202-240 48-88 (106)
51 KOG3091|consensus 34.4 1.8E+02 0.0039 28.2 7.4 54 195-248 347-400 (508)
52 PF14524 Wzt_C: Wzt C-terminal 32.3 1.3E+02 0.0029 22.7 5.5 42 122-164 52-93 (142)
53 COG4467 Regulator of replicati 31.9 2.3E+02 0.005 21.6 6.2 45 196-240 8-52 (114)
54 PF09323 DUF1980: Domain of un 31.4 62 0.0013 26.8 3.6 33 236-268 27-59 (182)
55 TIGR03096 nitroso_cyanin nitro 31.3 2.7E+02 0.0058 22.1 9.9 79 75-159 25-116 (135)
56 PF08372 PRT_C: Plant phosphor 31.1 2.9E+02 0.0063 22.5 7.3 48 195-242 51-98 (156)
57 PRK12633 flgD flagellar basal 29.5 2.1E+02 0.0046 24.7 6.7 42 122-163 129-181 (230)
58 PF12669 P12: Virus attachment 29.4 34 0.00073 22.9 1.3 9 262-270 17-25 (58)
59 PRK14710 hypothetical protein; 29.4 44 0.00095 23.4 1.9 59 12-71 3-64 (86)
60 PRK14155 heat shock protein Gr 29.4 3.5E+02 0.0076 23.1 7.8 41 196-236 20-60 (208)
61 PF15232 DUF4585: Domain of un 29.0 1.5E+02 0.0032 21.0 4.5 33 78-115 9-41 (75)
62 PF00957 Synaptobrevin: Synapt 28.8 2.2E+02 0.0047 20.3 9.1 27 213-239 34-60 (89)
63 PRK14081 triple tyrosine motif 28.7 2E+02 0.0043 29.2 7.0 41 123-163 224-264 (667)
64 PF03554 Herpes_UL73: UL73 vir 28.4 85 0.0018 22.7 3.3 28 235-262 44-71 (82)
65 PF04678 DUF607: Protein of un 27.9 3.5E+02 0.0075 22.3 9.0 44 203-246 57-100 (180)
66 PF15417 DUF4624: Domain of un 27.5 2.9E+02 0.0063 21.3 8.4 65 84-164 37-109 (132)
67 KOG3385|consensus 26.9 2.2E+02 0.0048 22.0 5.5 27 194-220 34-60 (118)
68 PRK02935 hypothetical protein; 26.9 15 0.00031 27.9 -0.9 29 16-45 34-62 (110)
69 PF06156 DUF972: Protein of un 26.2 2.9E+02 0.0063 20.9 6.6 26 198-223 17-42 (107)
70 cd03441 R_hydratase_like (R)-h 26.0 2.7E+02 0.0058 20.5 6.4 48 86-138 74-121 (127)
71 PF00517 GP41: Retroviral enve 25.7 3.2E+02 0.007 23.1 7.0 61 197-257 102-168 (204)
72 PRK09619 flgD flagellar basal 25.6 2.8E+02 0.0062 23.8 6.7 42 122-164 123-172 (218)
73 PF14584 DUF4446: Protein of u 25.0 1.7E+02 0.0036 23.7 4.9 50 189-238 32-81 (151)
74 PRK14163 heat shock protein Gr 25.0 4.5E+02 0.0097 22.6 7.8 44 196-239 47-90 (214)
75 PF08842 Mfa2: Fimbrillin-A as 24.7 93 0.002 26.9 3.7 43 122-164 29-77 (283)
76 PRK14149 heat shock protein Gr 24.6 4.3E+02 0.0093 22.3 7.8 43 194-236 41-83 (191)
77 PF13464 DUF4115: Domain of un 24.5 2.4E+02 0.0053 19.4 6.3 40 123-164 8-47 (77)
78 PF07680 DoxA: TQO small subun 23.7 3.7E+02 0.0081 21.3 6.5 41 121-162 46-95 (133)
79 PF07125 DUF1378: Protein of u 23.4 1.4E+02 0.0029 19.9 3.2 29 239-269 7-35 (59)
80 PRK14154 heat shock protein Gr 23.3 4.8E+02 0.01 22.3 7.9 45 196-240 59-103 (208)
81 PF09451 ATG27: Autophagy-rela 23.0 55 0.0012 28.9 1.9 37 20-59 204-240 (268)
82 PRK13169 DNA replication intia 22.5 3.6E+02 0.0077 20.6 6.7 20 199-218 11-30 (110)
83 cd03446 MaoC_like MoaC_like 22.1 3.5E+02 0.0076 20.5 6.2 52 86-140 83-135 (140)
84 PRK14143 heat shock protein Gr 22.0 5.4E+02 0.012 22.5 8.0 40 196-235 74-113 (238)
85 cd03451 FkbR2 FkbR2 is a Strep 22.0 3.7E+02 0.008 20.6 7.4 52 87-140 86-137 (146)
86 PF10779 XhlA: Haemolysin XhlA 21.8 2.8E+02 0.006 19.1 9.6 33 200-232 3-35 (71)
87 PF05371 Phage_Coat_Gp8: Phage 21.8 1.1E+02 0.0023 20.1 2.5 22 247-268 30-51 (52)
88 PF14235 DUF4337: Domain of un 21.8 4.4E+02 0.0095 21.4 8.6 65 199-263 69-133 (157)
89 PF09889 DUF2116: Uncharacteri 21.5 2.7E+02 0.0058 18.8 4.8 18 234-251 34-51 (59)
90 COG4856 Uncharacterized protei 21.3 7E+02 0.015 23.5 9.2 39 121-162 69-111 (403)
91 PF05506 DUF756: Domain of unk 21.3 3.1E+02 0.0067 19.5 6.7 48 74-133 31-81 (89)
92 PRK14147 heat shock protein Gr 21.0 4.8E+02 0.01 21.5 8.1 43 196-238 25-67 (172)
93 PF14977 FAM194: FAM194 protei 20.6 5.4E+02 0.012 22.0 13.7 14 128-141 90-103 (208)
94 PHA03163 hypothetical protein; 20.5 2.1E+02 0.0046 21.0 4.1 28 235-262 53-80 (92)
95 PF01166 TSC22: TSC-22/dip/bun 20.4 2.8E+02 0.0062 18.7 4.7 30 195-224 13-42 (59)
96 PF03273 Baculo_gp64: Baculovi 20.4 5.2E+02 0.011 25.0 7.7 18 122-139 159-176 (498)
97 PF05399 EVI2A: Ectropic viral 20.1 1.1E+02 0.0025 26.2 3.0 32 236-267 125-166 (227)
98 COG1382 GimC Prefoldin, chaper 20.1 4.2E+02 0.0092 20.5 6.4 44 193-236 67-110 (119)
99 COG5415 Predicted integral mem 20.1 3.6E+02 0.0077 23.2 6.0 18 199-216 11-28 (251)
100 PF10754 DUF2569: Protein of u 20.0 3.3E+02 0.0071 21.6 5.7 29 242-270 54-82 (149)
No 1
>KOG1690|consensus
Probab=100.00 E-value=1.3e-47 Score=312.41 Aligned_cols=205 Identities=53% Similarity=0.840 Sum_probs=190.1
Q ss_pred HHHHHHHHhhcceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCC--EEEeee
Q psy15308 63 LCVVLLAIIPFSYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSK--IILSRL 140 (270)
Q Consensus 63 ~~~lL~~l~~~a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~--~v~~~~ 140 (270)
.+++|++|+..+.|++|++..++++||++++|+|+.+.|+|.+.-+|+..++|. +.|+.++.+.|.+|.++ +|++++
T Consensus 7 ~~lll~~l~~~~~a~yFy~~~~e~KCF~eelpk~tmv~G~yk~qlyd~~~~~y~-~~p~~gm~VeV~e~fdnnh~Vl~q~ 85 (215)
T KOG1690|consen 7 LLLLLLLLATQVQALYFYIAGTEKKCFIEELPKGTMVTGNYKAQLYDDQLKGYG-SYPNIGMHVEVKETFDNNHVVLSQQ 85 (215)
T ss_pred HHHHHHHHHhhccEEEEEecCCcccchhhhCCCCcEEEeeeeeeeecchhcccc-cCCCceEEEEeecCCCCceEEEeec
Confidence 445666777889999999999999999999999999999999999998888887 56789999999999766 999999
Q ss_pred ecccceEEEEeeCCccEEEEeecCCCCccccceeEEEEEEEeCccccchhhhhhhcchhHHHHHHHHHHHHHHHHHHHHH
Q psy15308 141 YSYEGRISFTSHMPGEHVICLYSNSTKWVGASQLRVHLDIQVGEHAIDYAQVAQKDKLSELQLRVRQLLDQCDQITKEQN 220 (270)
Q Consensus 141 ~~~eg~f~f~a~~~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~~~~~~l~~l~~~l~~L~~~l~~I~~eq~ 220 (270)
..++|+|+|++.++|+|+||+.++++.|+.+.+.+|.+++.+|.++.++.+. +++.+.++.++..|++++.+|+.||+
T Consensus 86 ~ss~G~ftFta~~~GeH~IC~~s~s~awf~~aklRvhld~qvG~~a~l~a~~--ke~~k~l~~Rv~~L~~~~~~IrkEQ~ 163 (215)
T KOG1690|consen 86 YSSEGDFTFTALTPGEHRICIQSNSTAWFNGAKLRVHLDIQVGDHANLDAQI--KETDKLLEGRVRQLNSRLESIRKEQN 163 (215)
T ss_pred CCCCCceEEEccCCCceEEEEecccchhhccceEEEEEEEeeCchhhhhhhh--hhhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999998899998899999999999988766553 46678888999999999999999999
Q ss_pred HHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 221 YQRYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 221 ~~r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
++|.||+++|+++||+|+||+|||++|++||+++|+||+.|||+||.+||
T Consensus 164 ~~R~RE~~FR~tSES~NsRvm~Wsv~Q~vvL~~tc~wQmrhL~~FFvkqK 213 (215)
T KOG1690|consen 164 LQREREETFRDTSESANSRVMWWSVAQLVVLLVTCIWQMRHLKSFFVKQK 213 (215)
T ss_pred HHHHHHHHHHhhhhhhcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 99999999999999999999999999999999999999999999999987
No 2
>KOG1692|consensus
Probab=100.00 E-value=3.4e-44 Score=291.52 Aligned_cols=186 Identities=28% Similarity=0.500 Sum_probs=169.8
Q ss_pred HhhcceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeeecccceEEE
Q psy15308 70 IIPFSYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLYSYEGRISF 149 (270)
Q Consensus 70 l~~~a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~~~eg~f~f 149 (270)
|...+.++.+.+++.+++||+|++.+|+.+.++|+|. .||+ .++++.|.+|+|+.++..++.+.|+|+|
T Consensus 14 L~~~~~~~~is~~ah~eeCf~e~~~~gd~~~vsF~v~-----~gg~------~~vd~~I~gP~~~~i~~~~~~ssgk~tF 82 (201)
T KOG1692|consen 14 LFISAAGYGISLDAHEEECFFENLEEGDKLSVSFEVI-----DGGF------LGVDVEITGPDGKIIHKGKRESSGKYTF 82 (201)
T ss_pred HHHHhhheeEEEccchhhhHhhhhccCCEEEEEEEEe-----cCCc------cceeEEEECCCCchhhhcccccCceEEE
Confidence 3334888999999999999999999999999999998 3554 6899999999999999999999999999
Q ss_pred EeeCCccEEEEeecCCCCccccceeEEEEEEEeCccccchhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy15308 150 TSHMPGEHVICLYSNSTKWVGASQLRVHLDIQVGEHAIDYAQVAQKDKLSELQLRVRQLLDQCDQITKEQNYQRYREERF 229 (270)
Q Consensus 150 ~a~~~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~~~~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~ 229 (270)
+++.+|.|++||+|..+. ..++.|.|+|++|.... +++.+++++.+++++.+++|.+.|..++.||+|+..||.+|
T Consensus 83 ~a~~~G~Y~fCF~N~~s~---mtpk~V~F~ihvg~~~~-~~d~~~d~~~~~L~~~I~eL~~al~~Vk~EQeY~~~Rer~H 158 (201)
T KOG1692|consen 83 TAPKKGTYTFCFSNKMST---MTPKTVMFTIHVGHAPQ-RDDLAKDAHQNKLEEMIRELSEALTSVKHEQEYMEARERIH 158 (201)
T ss_pred EecCCceEEEEecCCCCC---CCceEEEEEEEEeeccc-cchhcccccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 999999999999998542 35789999999987543 45578888999999999999999999999999999999999
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 230 RATSESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 230 r~~~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
|.++|+||+||+|||++|.++||+++++|||||||||+.|+
T Consensus 159 r~~nEntn~RVv~wsife~~vLi~~s~~QVyYLkRfFEvkr 199 (201)
T KOG1692|consen 159 RNTNENTNSRVVLWSIFEALVLIAMSVLQVYYLKRFFEVKR 199 (201)
T ss_pred HHhhhcccceeehHHHHHHHHHHHHHHHHHHHHHHhheeee
Confidence 99999999999999999999999999999999999999885
No 3
>KOG1691|consensus
Probab=100.00 E-value=1.3e-41 Score=281.65 Aligned_cols=202 Identities=32% Similarity=0.542 Sum_probs=181.4
Q ss_pred HHHHHHHHHHhhcceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeee
Q psy15308 61 QLLCVVLLAIIPFSYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRL 140 (270)
Q Consensus 61 ~~~~~lL~~l~~~a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~ 140 (270)
..+++++++++..+.++.|+++++.++|+.|++.+|..+.|.|.++ |+. |+ . ...+++.|+||.|+.+++.+
T Consensus 7 ~~~l~i~~~~~~~~~a~~f~v~~~~~kCi~EeI~~n~lv~g~y~i~--~~~-~~----~-~~~~~~~Vts~~G~~~~~~e 78 (210)
T KOG1691|consen 7 SLLLLIFLLLLPLVHALRFDVPSKTTKCISEEIHENVLVVGDYEII--NPN-GD----H-SHKLSVKVTSPYGNNLHSKE 78 (210)
T ss_pred HHHHHHHHHHhhhhheEEEEecCCCCEeehhhhccCeEEEEEEEEe--cCC-CC----c-cceEEEEEEcCCCceeehhh
Confidence 3445566666788999999999999999999999999999999998 321 11 1 14699999999999999999
Q ss_pred ecccceEEEEeeCCccEEEEeecCCCCccccceeEEEEEEEeCccccchhhhhhhcchhHHHHHHHHHHHHHHHHHHHHH
Q psy15308 141 YSYEGRISFTSHMPGEHVICLYSNSTKWVGASQLRVHLDIQVGEHAIDYAQVAQKDKLSELQLRVRQLLDQCDQITKEQN 220 (270)
Q Consensus 141 ~~~eg~f~f~a~~~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~~~~~~l~~l~~~l~~L~~~l~~I~~eq~ 220 (270)
..++|+|+|++.++|.|.+||.+.....-......|+|+++.|.+++||+++||+++++++|.++++|++.+..|+++..
T Consensus 79 nv~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~~~~I~ld~k~Gv~akdw~~IAKkeklep~E~elrrLed~~~sI~~e~~ 158 (210)
T KOG1691|consen 79 NVTKGQFAFTAEESGMYEACFTADVPGHKPETKRSIDLDWKTGVEAKDWDSIAKKEKLEPLEVELRRLEDLVESIHEEMY 158 (210)
T ss_pred ccccceEEEEeccCCcEEEEEecccCCCCCCcceEEEEEeeccccccchHHHHhhhcCcHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999998432211234578999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 221 YQRYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 221 ~~r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
|+|.||+++|+++|+||+||.|+|++.++++++++.||++||||||++||
T Consensus 159 YLr~REeemr~~nesTNsrv~~fSi~Sl~v~~~va~~QvlyLK~fF~kKK 208 (210)
T KOG1691|consen 159 YLREREEEMRNTNESTNSRVAWFSILSLVVLLSVAGWQVLYLKRFFQKKK 208 (210)
T ss_pred HHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 99999999999999999999999999999999999999999999999998
No 4
>KOG1693|consensus
Probab=100.00 E-value=1.1e-39 Score=266.51 Aligned_cols=192 Identities=28% Similarity=0.471 Sum_probs=168.2
Q ss_pred HHHHHHhhcceeEEEEEcCCCceeEEEEccCCc-EEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeeecc
Q psy15308 65 VVLLAIIPFSYALYFHISETERKCFIEEIPEET-QVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLYSY 143 (270)
Q Consensus 65 ~lL~~l~~~a~~l~f~i~~g~~~Cf~e~v~~~~-~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~~~ 143 (270)
+||+.+++.+..++|++|++.++|||+++++++ .+...|+|. +|| +++|++.|++|+|++++...++.
T Consensus 12 ~lla~~~s~a~elTfeLp~~aKqC~Y~d~~~~~~~~~~~fqV~-----tGG------~fDVD~~I~aPdgkvI~~~~kk~ 80 (209)
T KOG1693|consen 12 LLLALLFSEASELTFELPDNAKQCFYEDLKKDDDTTSFEFQVQ-----TGG------HFDVDYDIEAPDGKVIYSEKKKR 80 (209)
T ss_pred HHHHHHhhhcccEEEEcCCcchhheeeecccCCceEEEEEEEE-----eCC------ceeeEEEEECCCCCEEeeccccc
Confidence 344444555889999999999999999999966 599999998 676 47999999999999999999999
Q ss_pred cceEEEEeeCCccEEEEeecCCCCccccceeEEEEEEEeCccccchhhh-hhhcchhHHHHHHHHHHHHHHHHHHHHHHH
Q psy15308 144 EGRISFTSHMPGEHVICLYSNSTKWVGASQLRVHLDIQVGEHAIDYAQV-AQKDKLSELQLRVRQLLDQCDQITKEQNYQ 222 (270)
Q Consensus 144 eg~f~f~a~~~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~-~~~~~l~~l~~~l~~L~~~l~~I~~eq~~~ 222 (270)
.++|.|++...|+|++||+|+.+ . ..++.+++++..|.+.+..... +++..++.++..+..|++.|+.|.+.|.|.
T Consensus 81 ~~~~~f~ae~~G~Y~fCFsN~fs-t--f~~Kiv~~~~q~~~~~~~~~~~~~~~~~~~~mena~~~I~~~L~~I~~~q~y~ 157 (209)
T KOG1693|consen 81 YDSFLFKAEGKGEYTFCFSNEFS-T--FSHKIVYMDFQVGEEPPLHPAVSNRDTALTQMENAIVEIHRALNKIDDTQTYY 157 (209)
T ss_pred cccEEEEEecceEEEEEecCccc-c--ccceEeeehhhhccccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999743 2 3568899999998654322222 334467899999999999999999999999
Q ss_pred HHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 223 RYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 223 r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
|.||.|.|.+++|+++||+|||++++++++++++.||+.||.||..||
T Consensus 158 R~RE~rn~~tv~st~~Rv~~~Sl~e~~~vv~iSi~Qv~ilk~fFt~~r 205 (209)
T KOG1693|consen 158 RLREARNRSTVESTNSRVTWWSLLEIIAVVVISIAQVFILKFFFTDRR 205 (209)
T ss_pred HHHHhcCccchhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 999999999999999999999999999999999999999999999775
No 5
>KOG3287|consensus
Probab=100.00 E-value=1.5e-34 Score=239.08 Aligned_cols=184 Identities=20% Similarity=0.386 Sum_probs=160.5
Q ss_pred ceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeeecccceEEEEeeC
Q psy15308 74 SYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLYSYEGRISFTSHM 153 (270)
Q Consensus 74 a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~ 153 (270)
...++++||||+++|||+.++.+..+.++|+|++ | +.+.+|++.+.+|.|+++.+.+++.+|.+++.+.+
T Consensus 33 d~dftv~ipAGk~eCf~Q~v~~~~tle~eyQVi~-----G-----~GDl~i~Ftl~~P~G~~lv~~q~k~dg~ht~e~~e 102 (236)
T KOG3287|consen 33 DYDFTVMIPAGKTECFYQPVPQGATLEVEYQVID-----G-----AGDLDIDFTLLNPAGEVLVSDQRKVDGVHTVEVTE 102 (236)
T ss_pred ccceEEEecCCCceeeeeeccCCeEEEEEEEEEe-----c-----CCccceeeEEeCCCccEEeecccccCceeEeeccC
Confidence 7889999999999999999999999999999992 3 12478999999999999999999999999999999
Q ss_pred CccEEEEeecCCCCccccceeEEEEEEEeCcc------ccchhhhhh-----hcchhHHHHHHHHHHHHHHHHHHHHHHH
Q psy15308 154 PGEHVICLYSNSTKWVGASQLRVHLDIQVGEH------AIDYAQVAQ-----KDKLSELQLRVRQLLDQCDQITKEQNYQ 222 (270)
Q Consensus 154 ~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~------~~d~~~~~~-----~~~l~~l~~~l~~L~~~l~~I~~eq~~~ 222 (270)
+|.|++||+|+ .+.+++ +.|+|++....+ +..|++.++ ..+++++++.++++..+|+.+...|..+
T Consensus 103 ~GdY~~CfDNs-FS~fs~--K~Vffeli~~~~g~~~e~~e~w~k~~e~~~~Ld~kl~di~~~i~~i~~nl~k~~~~q~~l 179 (236)
T KOG3287|consen 103 TGDYQVCFDNS-FSTFSR--KLVFFELILDAHGEFYEGDETWHKYKERTEQLDVKLDDIEDSIGTIKNNLNKMWQYQALL 179 (236)
T ss_pred CcceEEEEcCc-cccccc--eEEEEEEEeccccchhccchhHhhhhhhhhhhcccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999987 456654 567777744322 123333222 3478999999999999999999999999
Q ss_pred HHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 223 RYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 223 r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
|.||+|+|.+.+|+..||.|||.+|+.+||+++.+|||+|||+|+.|+
T Consensus 180 Ra~EaRDr~L~esNf~rVN~WS~vq~~vmi~v~~iQVf~lrslFe~~~ 227 (236)
T KOG3287|consen 180 RAREARDRNLQESNFDRVNFWSMVQTLVMILVGIIQVFMLRSLFEVKS 227 (236)
T ss_pred HHHHhhhHHHHhcccchhhHHHHHHHHHHHHHhhhhhhhhHHHhcCCC
Confidence 999999999999999999999999999999999999999999999763
No 6
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=100.00 E-value=5.5e-37 Score=255.46 Aligned_cols=180 Identities=38% Similarity=0.673 Sum_probs=5.5
Q ss_pred eEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEE--CCCCCEEEeeeec-ccceEEEEee
Q psy15308 76 ALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQIS--NPDSKIILSRLYS-YEGRISFTSH 152 (270)
Q Consensus 76 ~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~--dP~g~~v~~~~~~-~eg~f~f~a~ 152 (270)
|++|+|+||+++||++++++|+.+.++|++.+ ++ ++.++++.|+ +|+|+.++.+... ++|+|+|+++
T Consensus 1 a~~f~l~~g~~~Cf~e~v~~~~~i~~~y~v~~-----~~-----~~~~v~~~i~~~~~~~~~i~~~~~~~~~~~f~f~~~ 70 (183)
T PF01105_consen 1 ALTFELEPGETECFYEEVPKGTTIRGSYRVTD-----GG-----GAYDVDFTIRDPDPNGEVIYSKSDKESEGSFSFTAK 70 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CEEEEECCCCcEEEEEEcCCCcEEEEEEEEee-----cc-----ccceEEEEEEecccCCceeeeecccccCCcEEEEec
Confidence 68999999999999999999999999999982 22 1357999999 6666898888555 5579999999
Q ss_pred CCccEEEEeecCCCCccccceeEEEEEEEeCccccchhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH
Q psy15308 153 MPGEHVICLYSNSTKWVGASQLRVHLDIQVGEHAIDYAQVAQKDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRAT 232 (270)
Q Consensus 153 ~~G~Y~iCf~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~~~~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~ 232 (270)
++|+|++||+|..+++. ..+.|+|++..|.++.++++.++++++++++..+++|.+.++.|+++|+|++.|+.+|+++
T Consensus 71 ~~G~y~iCf~n~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~i~~~q~~~~~r~~~~~~~ 148 (183)
T PF01105_consen 71 ESGEYQICFDNSSSSFS--PSKRVSFDIDVGNENKDYKNVAKKEHLDPLEESLEKLESNLKEIKDEQKYLREREERHRQL 148 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cCCCEEEEEEcCCCCcc--ccEEEEEEEEEeecccchhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999855321 1378999999987766777888999999999999999999999999999999999999999
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhc
Q psy15308 233 SESTNQRVFWWSLAQLGILVCMGVWQMRHLKSFFE 267 (270)
Q Consensus 233 ~est~~rV~~~silqi~vli~~s~~Qv~~LKrfF~ 267 (270)
++++++|+++|+++|+++++++++||+++||+||+
T Consensus 149 ~es~~~~i~~~si~~~~vli~~~~~Qv~~lk~~f~ 183 (183)
T PF01105_consen 149 NESTNSRIMWWSIIQIVVLILVSVWQVYYLKKFFK 183 (183)
T ss_dssp -----------------------------HHHHHH
T ss_pred HHHhhheEEhHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 99999999999999999999999999999999995
No 7
>KOG1690|consensus
Probab=98.35 E-value=1.6e-06 Score=71.99 Aligned_cols=40 Identities=50% Similarity=0.808 Sum_probs=31.2
Q ss_pred HHHHHHhhhccccceeeEEeeccccceeeecCCChhhHHHH
Q psy15308 24 RLLCVVLLAVIPFSYALYFHISETERKCFIEEIPEETQLLC 64 (270)
Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~~~~~~~ 64 (270)
+++++.+++...++ |+|||+.+++++||+|+||++|||.+
T Consensus 6 ~~~lll~~l~~~~~-a~yFy~~~~e~KCF~eelpk~tmv~G 45 (215)
T KOG1690|consen 6 RLLLLLLLLATQVQ-ALYFYIAGTEKKCFIEELPKGTMVTG 45 (215)
T ss_pred HHHHHHHHHHhhcc-EEEEEecCCcccchhhhCCCCcEEEe
Confidence 34444455555544 99999999999999999999987754
No 8
>KOG1693|consensus
Probab=95.87 E-value=0.74 Score=38.77 Aligned_cols=167 Identities=16% Similarity=0.200 Sum_probs=85.7
Q ss_pred eeEEeeccccceeeecCCChhhHHHHHHHHHHhhcceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccC
Q psy15308 39 ALYFHISETERKCFIEEIPEETQLLCVVLLAIIPFSYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQS 118 (270)
Q Consensus 39 ~~~~~~~~~~~~~~~~~lp~~~~~~~~lL~~l~~~a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~ 118 (270)
.+-|-|+.+...|||+++|+. +.+--|...|..|..+.+.|.+. +|+
T Consensus 23 elTfeLp~~aKqC~Y~d~~~~------------------------~~~~~~~fqV~tGG~fDVD~~I~--aPd------- 69 (209)
T KOG1693|consen 23 ELTFELPDNAKQCFYEDLKKD------------------------DDTTSFEFQVQTGGHFDVDYDIE--APD------- 69 (209)
T ss_pred cEEEEcCCcchhheeeecccC------------------------CceEEEEEEEEeCCceeeEEEEE--CCC-------
Confidence 788999999999999999998 22222344566777778888876 331
Q ss_pred CCCceeEEEEECCCCCEEEeeeecccceEEEEeeC---CccEEEEe-ecCCCCccccceeEEEEEEEeCccccchhhhhh
Q psy15308 119 SPGVGMHVQISNPDSKIILSRLYSYEGRISFTSHM---PGEHVICL-YSNSTKWVGASQLRVHLDIQVGEHAIDYAQVAQ 194 (270)
Q Consensus 119 ~~~~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~---~G~Y~iCf-~n~~~~~~~~~~~~V~f~i~~g~~~~d~~~~~~ 194 (270)
+..|--.=....+...+. ....|.|+|=-.. +-.+++-. ++.. ++. -...-+.+..++....-
T Consensus 70 --gkvI~~~~kk~~~~~~f~--ae~~G~Y~fCFsN~fstf~~Kiv~~~~q~-----~~~----~~~~~~~~~~~~~~~~m 136 (209)
T KOG1693|consen 70 --GKVIYSEKKKRYDSFLFK--AEGKGEYTFCFSNEFSTFSHKIVYMDFQV-----GEE----PPLHPAVSNRDTALTQM 136 (209)
T ss_pred --CCEEeeccccccccEEEE--EecceEEEEEecCccccccceEeeehhhh-----ccc----cccCccccccchHHHHH
Confidence 111211011122233332 2355666653321 11222221 1110 000 00011111223222222
Q ss_pred hcchhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhHHHHHHHhhH--HHHHHHHHHHHH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQIT--KEQNYQRYREERFRATSESTNQR--VFWWSLAQLGIL 251 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~--~eq~~~r~re~~~r~~~est~~r--V~~~silqi~vl 251 (270)
...+..|...|+.|.+..+..+ +.+.+.+....++|-+.=|.-.- |+..|++|+.++
T Consensus 137 ena~~~I~~~L~~I~~~q~y~R~RE~rn~~tv~st~~Rv~~~Sl~e~~~vv~iSi~Qv~il 197 (209)
T KOG1693|consen 137 ENAIVEIHRALNKIDDTQTYYRLREARNRSTVESTNSRVTWWSLLEIIAVVVISIAQVFIL 197 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCccchhcccchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 3446677888888887776654 45566666666666666555544 444566666554
No 9
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=88.71 E-value=4.1 Score=29.81 Aligned_cols=43 Identities=28% Similarity=0.422 Sum_probs=29.4
Q ss_pred ceeEEEEECCCCC----EEEe-eeecccc--eEEEEeeCCccEEEEeecC
Q psy15308 122 VGMHVQISNPDSK----IILS-RLYSYEG--RISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 122 ~~i~~~V~dP~g~----~v~~-~~~~~eg--~f~f~a~~~G~Y~iCf~n~ 164 (270)
..+.+.|.+|++. .+.- -....+| ..+|+++..|.|++++.-+
T Consensus 42 ~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~G~y~i~V~~~ 91 (101)
T PF00630_consen 42 DEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEPGKYKISVKIN 91 (101)
T ss_dssp SEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSSEEEEEEEEES
T ss_pred ceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCccEeEEEEEEEC
Confidence 4688899999987 3222 2223445 4577888899999988744
No 10
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=87.58 E-value=2.4 Score=29.21 Aligned_cols=61 Identities=18% Similarity=0.273 Sum_probs=40.9
Q ss_pred eeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeeecc-----cceEEEEeeCCccEEEEe
Q psy15308 87 KCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLYSY-----EGRISFTSHMPGEHVICL 161 (270)
Q Consensus 87 ~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~~~-----eg~f~f~a~~~G~Y~iCf 161 (270)
..|..+++++..+.+.- . |+ ..+.++.+.+++|..+......+ .....|+++.+|+|.+=+
T Consensus 3 D~y~f~v~ag~~l~i~l--~------~~------~~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V 68 (70)
T PF04151_consen 3 DYYSFTVPAGGTLTIDL--S------GG------SGDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRV 68 (70)
T ss_dssp EEEEEEESTTEEEEEEE--C------ET------TSSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEE
T ss_pred EEEEEEEcCCCEEEEEE--c------CC------CCCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEE
Confidence 46778888988866543 2 11 12577899999987766633222 235678889999998754
No 11
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=86.82 E-value=10 Score=27.74 Aligned_cols=66 Identities=14% Similarity=0.122 Sum_probs=41.9
Q ss_pred CCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeee---cccceEEEEe--e---CCccEEEEeecC
Q psy15308 95 EETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLY---SYEGRISFTS--H---MPGEHVICLYSN 164 (270)
Q Consensus 95 ~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~---~~eg~f~f~a--~---~~G~Y~iCf~n~ 164 (270)
+|+.+.+.--+...+ +. .....+..+.+.|.||+|+.+..+.. ...|.+++.- + ..|.|++=+...
T Consensus 13 PGetV~~~~~~~~~~---~~-~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 13 PGETVHFRAIVRDLD---ND-FKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp TTSEEEEEEEEEEEC---TT-CSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred CCCEEEEEEEEeccc---cc-cccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 577777766665333 11 11233568999999999999988766 2456544433 2 258999888763
No 12
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=86.40 E-value=5.3 Score=29.18 Aligned_cols=43 Identities=19% Similarity=0.386 Sum_probs=30.9
Q ss_pred ceeEEEEECCCCCEEEeee-ecccc--eEEEEeeCCccEEEEeecC
Q psy15308 122 VGMHVQISNPDSKIILSRL-YSYEG--RISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~-~~~eg--~f~f~a~~~G~Y~iCf~n~ 164 (270)
..+.+.|.+|+|+.+.-+- ....| ..+|++...|.|++.+.-+
T Consensus 32 ~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~ 77 (93)
T smart00557 32 GELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFG 77 (93)
T ss_pred CcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEEC
Confidence 4789999999996544322 22334 4578889999999998755
No 13
>KOG1692|consensus
Probab=83.97 E-value=23 Score=29.77 Aligned_cols=73 Identities=16% Similarity=0.301 Sum_probs=46.0
Q ss_pred eeEEeeccccceeeecCCChhhHHHHHHHHHHhhcceeEEEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCccccC
Q psy15308 39 ALYFHISETERKCFIEEIPEETQLLCVVLLAIIPFSYALYFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGYMQS 118 (270)
Q Consensus 39 ~~~~~~~~~~~~~~~~~lp~~~~~~~~lL~~l~~~a~~l~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~ 118 (270)
++.-.++..++.||+|++...+ ..+++|++-. |....+.+.+.++|
T Consensus 20 ~~~is~~ah~eeCf~e~~~~gd-------------~~~vsF~v~~------------gg~~~vd~~I~gP~--------- 65 (201)
T KOG1692|consen 20 GYGISLDAHEEECFFENLEEGD-------------KLSVSFEVID------------GGFLGVDVEITGPD--------- 65 (201)
T ss_pred heeEEEccchhhhHhhhhccCC-------------EEEEEEEEec------------CCccceeEEEECCC---------
Confidence 6777889999999999998663 4566666654 55566667776432
Q ss_pred CCCceeEEEEECCCCCEEEeeeecccceEEE
Q psy15308 119 SPGVGMHVQISNPDSKIILSRLYSYEGRISF 149 (270)
Q Consensus 119 ~~~~~i~~~V~dP~g~~v~~~~~~~eg~f~f 149 (270)
+..|.=...+..|+.-+.- ..+|.|.|
T Consensus 66 --~~~i~~~~~~ssgk~tF~a--~~~G~Y~f 92 (201)
T KOG1692|consen 66 --GKIIHKGKRESSGKYTFTA--PKKGTYTF 92 (201)
T ss_pred --CchhhhcccccCceEEEEe--cCCceEEE
Confidence 2234334455666654443 34565555
No 14
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=80.21 E-value=0.54 Score=38.24 Aligned_cols=25 Identities=40% Similarity=0.761 Sum_probs=0.0
Q ss_pred eeEEeeccccceeeecCCChhhHHH
Q psy15308 39 ALYFHISETERKCFIEEIPEETQLL 63 (270)
Q Consensus 39 ~~~~~~~~~~~~~~~~~lp~~~~~~ 63 (270)
|++|.+.++++.||++++|+++.+.
T Consensus 1 a~~f~l~~g~~~Cf~e~v~~~~~i~ 25 (183)
T PF01105_consen 1 ALTFELEPGETECFYEEVPKGTTIR 25 (183)
T ss_dssp -------------------------
T ss_pred CEEEEECCCCcEEEEEEcCCCcEEE
Confidence 6899999999999999999986444
No 15
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=73.95 E-value=12 Score=28.09 Aligned_cols=44 Identities=14% Similarity=0.215 Sum_probs=30.6
Q ss_pred ceeEEEEECCCCCEEEeeeecccc--eEEEEe--eCCccEEEEeecCC
Q psy15308 122 VGMHVQISNPDSKIILSRLYSYEG--RISFTS--HMPGEHVICLYSNS 165 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~~~eg--~f~f~a--~~~G~Y~iCf~n~~ 165 (270)
..+.++|.|.+|+++|++...... ..++.. ..+|.|.+=+.+..
T Consensus 48 ~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~ 95 (106)
T PF11589_consen 48 GDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGN 95 (106)
T ss_dssp SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECT
T ss_pred CCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCC
Confidence 368999999999999998654433 456655 46899999998763
No 16
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=72.96 E-value=8.6 Score=26.13 Aligned_cols=42 Identities=14% Similarity=0.268 Sum_probs=34.3
Q ss_pred eeEEEEECCCCCEEEe--eeecccceEEEEeeCCccEEEEeecC
Q psy15308 123 GMHVQISNPDSKIILS--RLYSYEGRISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~--~~~~~eg~f~f~a~~~G~Y~iCf~n~ 164 (270)
++.|.+++.++..+.. ..-...|.+.|.-...|.|.+=....
T Consensus 3 Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y~l~E~~a 46 (70)
T PF05738_consen 3 GATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTYTLKETKA 46 (70)
T ss_dssp TEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEEEEEEEET
T ss_pred CeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEEEEEEEEC
Confidence 4778888988888765 45568899999999999999998763
No 17
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=71.80 E-value=53 Score=27.33 Aligned_cols=43 Identities=9% Similarity=0.120 Sum_probs=27.5
Q ss_pred eeEEEEECCCCCEEEe-eeecccceEEEEe---eCCccEEEEeecCC
Q psy15308 123 GMHVQISNPDSKIILS-RLYSYEGRISFTS---HMPGEHVICLYSNS 165 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~-~~~~~eg~f~f~a---~~~G~Y~iCf~n~~ 165 (270)
...+.+.+|+|+.+.. ..+.......+.. .+.|.+++.+.|..
T Consensus 42 ~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~ 88 (179)
T PF09315_consen 42 PPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTS 88 (179)
T ss_pred CceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCC
Confidence 3567799999999765 1222233333333 46899999987653
No 18
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=71.71 E-value=24 Score=24.99 Aligned_cols=32 Identities=16% Similarity=0.230 Sum_probs=21.7
Q ss_pred cEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeee
Q psy15308 97 TQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRL 140 (270)
Q Consensus 97 ~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~ 140 (270)
....+.|.+. .+...+.+.|+|.+|++|....
T Consensus 12 ~~~~~~~~l~------------~~a~~v~v~I~d~~G~~V~t~~ 43 (81)
T PF13860_consen 12 TKGSIEYTLP------------EDADNVTVTIYDSNGQVVRTIS 43 (81)
T ss_dssp CEEEEEEEEC------------SSCEEEEEEEEETTS-EEEEEE
T ss_pred EEEEEEEeCC------------CcccEEEEEEEcCCCCEEEEEE
Confidence 3667766664 1124689999999999997643
No 19
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=69.84 E-value=13 Score=27.57 Aligned_cols=16 Identities=31% Similarity=0.544 Sum_probs=10.9
Q ss_pred cceEEEEeeCCccEEE
Q psy15308 144 EGRISFTSHMPGEHVI 159 (270)
Q Consensus 144 eg~f~f~a~~~G~Y~i 159 (270)
+..+.|++.++|+|.+
T Consensus 75 ~~~~~f~~~~~G~y~~ 90 (104)
T PF13473_consen 75 TATVTFTPLKPGEYEF 90 (104)
T ss_dssp EEEEEEEE-S-EEEEE
T ss_pred EEEEEEcCCCCEEEEE
Confidence 3467888899999986
No 20
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=69.68 E-value=30 Score=23.09 Aligned_cols=44 Identities=25% Similarity=0.358 Sum_probs=26.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 197 KLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 197 ~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
+++.|...++.|..++..+..+..-+|.--..-.+-....|.|+
T Consensus 4 kid~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRAN~Rl 47 (56)
T PF04728_consen 4 KIDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARANQRL 47 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666677777777777777766666644443444444455554
No 21
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=69.51 E-value=35 Score=23.70 Aligned_cols=19 Identities=11% Similarity=0.365 Sum_probs=8.4
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy15308 198 LSELQLRVRQLLDQCDQIT 216 (270)
Q Consensus 198 l~~l~~~l~~L~~~l~~I~ 216 (270)
+..++..++++.+.++.+.
T Consensus 8 l~~ie~~l~~~~~~i~~lE 26 (71)
T PF10779_consen 8 LNRIETKLDNHEERIDKLE 26 (71)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444443
No 22
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=66.38 E-value=7.8 Score=27.06 Aligned_cols=42 Identities=21% Similarity=0.408 Sum_probs=29.5
Q ss_pred eeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecC
Q psy15308 123 GMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~ 164 (270)
+..+.+.+.++.......-..+|.|.|..-.+|.|.+=+...
T Consensus 16 ~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g~Y~l~v~~~ 57 (82)
T PF13620_consen 16 GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPGTYTLRVSAP 57 (82)
T ss_dssp T-EEEET--TTTECCEEE--TTSEEEEEEE-SEEEEEEEEBT
T ss_pred CEEEEEEEeeCCCEEEEEECCCceEEEEccCCEeEEEEEEEC
Confidence 577888888777766666678999999966779999999754
No 23
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=65.53 E-value=9.8 Score=30.15 Aligned_cols=30 Identities=20% Similarity=0.337 Sum_probs=21.5
Q ss_pred EEEeeCCccEEEEeecCCCCccccceeEEEEEE
Q psy15308 148 SFTSHMPGEHVICLYSNSTKWVGASQLRVHLDI 180 (270)
Q Consensus 148 ~f~a~~~G~Y~iCf~n~~~~~~~~~~~~V~f~i 180 (270)
+++.+.+|.|-++|+|+. +|+.+ +++.+.+
T Consensus 104 s~~c~~~GvYvLkFDNSY-S~~rs--K~l~Y~V 133 (136)
T PF13897_consen 104 SHTCPGPGVYVLKFDNSY-SWFRS--KKLYYRV 133 (136)
T ss_pred EEECCCCeEEEEEeeCcc-eeEEe--eEEEEEE
Confidence 346689999999999874 46543 4566554
No 24
>PRK15396 murein lipoprotein; Provisional
Probab=62.41 E-value=50 Score=23.64 Aligned_cols=45 Identities=24% Similarity=0.352 Sum_probs=32.4
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
.+++.+...++.|......+..+..-.|.--..-.+-.+..|.|+
T Consensus 25 ~kvd~LssqV~~L~~kvdql~~dv~~~~~~~~~a~~eA~raN~Rl 69 (78)
T PRK15396 25 AKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRL 69 (78)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467788888888888888888887777765555555566666665
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=59.12 E-value=50 Score=21.85 Aligned_cols=56 Identities=21% Similarity=0.374 Sum_probs=30.6
Q ss_pred ceeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecCC--CCccccceeEEEEEE
Q psy15308 122 VGMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSNS--TKWVGASQLRVHLDI 180 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~~--~~~~~~~~~~V~f~i 180 (270)
......+.+.+++-+....... .++|+...+|.|++-+.... ..|.. ....+.|.|
T Consensus 8 ~~Y~Y~l~g~d~~W~~~~~~~~--~~~~~~L~~G~Y~l~V~a~~~~~~~~~-~~~~l~i~I 65 (66)
T PF07495_consen 8 IRYRYRLEGFDDEWITLGSYSN--SISYTNLPPGKYTLEVRAKDNNGKWSS-DEKSLTITI 65 (66)
T ss_dssp EEEEEEEETTESSEEEESSTS---EEEEES--SEEEEEEEEEEETTS-B-S-S-EEEEEEE
T ss_pred eEEEEEEECCCCeEEECCCCcE--EEEEEeCCCEEEEEEEEEECCCCCcCc-ccEEEEEEE
Confidence 4455556676666543322111 88999999999999887532 33422 225566655
No 26
>PRK09973 putative outer membrane lipoprotein; Provisional
Probab=56.85 E-value=66 Score=23.45 Aligned_cols=46 Identities=15% Similarity=0.200 Sum_probs=34.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVF 241 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~ 241 (270)
.+++.+...++.|+...+.+..+..-.+.--..-.+..+..|.|+-
T Consensus 24 ~kvdqLss~V~~L~~kvdql~~dv~~a~aaa~aAk~EA~RAN~RiD 69 (85)
T PRK09973 24 QKVNQLASNVQTLNAKIARLEQDMKALRPQIYAAKSEANRANTRLD 69 (85)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 4678888888888888888888877777666666666777777754
No 27
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=55.91 E-value=47 Score=28.71 Aligned_cols=42 Identities=19% Similarity=0.199 Sum_probs=29.8
Q ss_pred ceeEEEEECCCCCEEEeeee--cccceEEEEe---------eCCccEEEEeec
Q psy15308 122 VGMHVQISNPDSKIILSRLY--SYEGRISFTS---------HMPGEHVICLYS 163 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~--~~eg~f~f~a---------~~~G~Y~iCf~n 163 (270)
..+.+.|+|.+|++|+.... ...|.+.|+- ..+|.|++=+..
T Consensus 126 ~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A 178 (225)
T PRK06655 126 DNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASA 178 (225)
T ss_pred cEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEE
Confidence 36889999999999976543 3466666632 346889888853
No 28
>KOG3202|consensus
Probab=52.30 E-value=1.3e+02 Score=26.28 Aligned_cols=67 Identities=15% Similarity=0.104 Sum_probs=33.1
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHhHHHHHH---HhhHHHHHHHHHHHHHHHHhhH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQITKE--------------QNYQRYREERFRATSES---TNQRVFWWSLAQLGILVCMGVW 257 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~~e--------------q~~~r~re~~~r~~~es---t~~rV~~~silqi~vli~~s~~ 257 (270)
++.++.+...+.++...-..+-.| .+....|-++-+..... ..+.--.|+++.+++.++.-++
T Consensus 151 De~Ld~ls~ti~rlk~~a~~~g~EL~~Q~~llDdl~~e~d~t~srl~~~~~~l~~v~~~~s~~~~~~~il~l~~~~~lvv 230 (235)
T KOG3202|consen 151 DEGLDGLSATVQRLKGMALAMGEELEEQGRLLDDLDNEMDRTESRLDRVMKRLAKVNRMASQCSQWCAILLLVGLLLLVV 230 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhHHHHHHHHHHHHH
Confidence 456666666666655544444433 22222222222222222 2222334677776666666667
Q ss_pred HHHH
Q psy15308 258 QMRH 261 (270)
Q Consensus 258 Qv~~ 261 (270)
++|.
T Consensus 231 ~i~~ 234 (235)
T KOG3202|consen 231 IIFI 234 (235)
T ss_pred HHhc
Confidence 7764
No 29
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=51.03 E-value=1.6e+02 Score=27.63 Aligned_cols=43 Identities=12% Similarity=-0.019 Sum_probs=25.5
Q ss_pred CceeEEEEECCCCCEEEeeeecccceEEEEe---eCCccEEEEeec
Q psy15308 121 GVGMHVQISNPDSKIILSRLYSYEGRISFTS---HMPGEHVICLYS 163 (270)
Q Consensus 121 ~~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a---~~~G~Y~iCf~n 163 (270)
...+++.++.|+|..........++.-.... .+.|.|++=..-
T Consensus 241 s~~~~~~~~~P~g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~ 286 (374)
T TIGR03503 241 SLVIHGELVFPNGQIQQFSIELEEPETRVDLPANYEFGKYRVKGTV 286 (374)
T ss_pred cEEEEEEEECCCCceEEecccCccCceEEeccCcCCCeEEEEEEEE
Confidence 4457777889999844333333444433332 357889887553
No 30
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=51.01 E-value=76 Score=27.45 Aligned_cols=57 Identities=11% Similarity=0.082 Sum_probs=37.0
Q ss_pred CcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeeee-cccceEEEEe-------eCCccEEEEeecC
Q psy15308 96 ETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRLY-SYEGRISFTS-------HMPGEHVICLYSN 164 (270)
Q Consensus 96 ~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~~-~~eg~f~f~a-------~~~G~Y~iCf~n~ 164 (270)
+..+.+.|... .+...+.+.|+|.+|++|+.... ...+.|.+.- ..+|.|+|=+.-.
T Consensus 110 g~~~~~~~~l~------------~~a~~v~v~I~D~~G~vV~t~~~~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~ 174 (223)
T PRK12813 110 GTPVTISPNPA------------ADADKAELVVRDAAGAEVARETVPVGAGPVEWAGEDADGNPLPNGAYSFVVESY 174 (223)
T ss_pred CceeEEEEecc------------CCCceEEEEEEcCCCCEEEEEeeCCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence 44667777654 11246899999999999877543 2233455542 2358999888644
No 31
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=49.15 E-value=23 Score=25.44 Aligned_cols=20 Identities=10% Similarity=0.166 Sum_probs=12.9
Q ss_pred CceeEEEEECCCCCEEEeee
Q psy15308 121 GVGMHVQISNPDSKIILSRL 140 (270)
Q Consensus 121 ~~~i~~~V~dP~g~~v~~~~ 140 (270)
+...++.|+|++|+.++.++
T Consensus 23 gq~~D~~v~d~~g~~vwrwS 42 (82)
T PF12690_consen 23 GQRYDFVVKDKEGKEVWRWS 42 (82)
T ss_dssp S--EEEEEE-TT--EEEETT
T ss_pred CCEEEEEEECCCCCEEEEec
Confidence 45788999999999999875
No 32
>COG1723 Uncharacterized conserved protein [Function unknown]
Probab=46.08 E-value=25 Score=31.96 Aligned_cols=55 Identities=16% Similarity=0.305 Sum_probs=43.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHH
Q psy15308 200 ELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMR 260 (270)
Q Consensus 200 ~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~ 260 (270)
++..+++-|+.+++-|.+..+.+.+ .++++...++-||-|+-|++-+++++++++
T Consensus 270 dI~~RvnvLN~Rl~vi~d~l~il~e------~ln~~~s~~lEWivIiLI~~eVllsl~~i~ 324 (331)
T COG1723 270 DINPRVNVLNRRLEVISDLLDILNE------QLNHSHSTRLEWIVIILIGLEVLLSLYNII 324 (331)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHH------HhhhcccceeEEEehhHHHHHHHHHHHHHH
Confidence 4666777788888888777666554 566788889999999999999999997775
No 33
>KOG2861|consensus
Probab=45.02 E-value=43 Score=31.57 Aligned_cols=54 Identities=17% Similarity=0.243 Sum_probs=43.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHH
Q psy15308 201 LQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMR 260 (270)
Q Consensus 201 l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~ 260 (270)
|..+++-|+.+++.+.+..+.+++ .++++...++-||-|+-|.+-++..++|++
T Consensus 338 I~qRv~vLN~kl~~i~~~~~~l~e------~ln~r~~~~LEWiIIiLI~~eV~i~i~~i~ 391 (399)
T KOG2861|consen 338 IGQRVNVLNYKLKVIEDLLDILQE------NLNERHSERLEWIIIILIAFEVAIEIYQIV 391 (399)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHH------HhhhccccceehhhHHHHHHHHHHHHHHHH
Confidence 445667788888888888888776 456777888999999999999999999876
No 34
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=44.80 E-value=92 Score=20.73 Aligned_cols=32 Identities=16% Similarity=0.247 Sum_probs=20.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy15308 198 LSELQLRVRQLLDQCDQITKEQNYQRYREERF 229 (270)
Q Consensus 198 l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~ 229 (270)
++.++..+.++...+..++++..-++..-++.
T Consensus 2 i~elEn~~~~~~~~i~tvk~en~~i~~~ve~i 33 (55)
T PF05377_consen 2 IDELENELPRIESSINTVKKENEEISESVEKI 33 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777777777777777666665444333
No 35
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=43.60 E-value=98 Score=27.44 Aligned_cols=41 Identities=15% Similarity=0.125 Sum_probs=29.5
Q ss_pred ceeEEEEECCCCCEEEeeee--cccceEEEEee---------CCccEEEEee
Q psy15308 122 VGMHVQISNPDSKIILSRLY--SYEGRISFTSH---------MPGEHVICLY 162 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~--~~eg~f~f~a~---------~~G~Y~iCf~ 162 (270)
..+.+.|+|.+|++|+.-.. ...|.+.|.-. ..|.|+|=+.
T Consensus 141 ~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~ 192 (259)
T PRK12812 141 DEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAV 192 (259)
T ss_pred ceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEE
Confidence 36899999999999976543 34565555442 2689999886
No 36
>PF06703 SPC25: Microsomal signal peptidase 25 kDa subunit (SPC25); InterPro: IPR009582 This family consists of several microsomal signal peptidase 25 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains [].; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=43.12 E-value=44 Score=27.07 Aligned_cols=18 Identities=22% Similarity=0.167 Sum_probs=13.8
Q ss_pred CCChhhHHHHHHHHHhhh
Q psy15308 15 NPDSKIILSRLLCVVLLA 32 (270)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~ 32 (270)
-.|+|+.|+.+.|++-..
T Consensus 25 l~d~kL~lg~~a~~iA~~ 42 (162)
T PF06703_consen 25 LTDIKLALGYLAVIIAGF 42 (162)
T ss_pred EEcHHHHHHHHHHHHHHH
Confidence 468999999988875443
No 37
>PF05739 SNARE: SNARE domain; InterPro: IPR000727 The process of vesicular fusion with target membranes depends on a set of SNAREs (SNAP-Receptors), which are associated with the fusing membranes [, ]. Target SNAREs (t-SNAREs) are localised on the target membrane and belong to two different families, the syntaxin-like family and the SNAP-25 like family. One member of each family, together with a v-SNARE localised on the vesicular membrane, are required for fusion. The Syntaxins are type-I transmembrane proteins that contain several regions with coiled-coil propensity in their cytosolic part, the SNARE motif. SNAP-25 (IPR000928 from INTERPRO) is a protein consisting of two coiled-coil regions, which is associated with the membrane by lipid anchors. SNARE motifs assemble into parallel four helix bundles stabilised by the burial of these hydrophobic helix faces in the bundle core. Monomeric SNARE motifs are disordered so this assembly reaction is accompanied by a dramatic increase in alpha-helical secondary structure []. The parallel arrangement of SNARE motifs within complexes bring the transmembrane anchors, and the two membranes, into close proximity. Recently, it was shown that the two coiled-coil regions of SNAP-25 and one of the coiled-coil regions of the syntaxins are related []. This domain is found in both Syntaxin and SNAP-25 families as well as in other proteins.; GO: 0005515 protein binding; PDB: 1URQ_B 3RL0_R 1HVV_B 1SFC_B 1N7S_B 3IPD_B 3C98_B 3HD7_F 3RK2_B 1KIL_B ....
Probab=42.03 E-value=99 Score=20.29 Aligned_cols=45 Identities=20% Similarity=0.327 Sum_probs=30.4
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
+.++.|+..+..|.....+|..+..-+-.--++....++.+..++
T Consensus 4 ~~l~~l~~~i~~l~~~~~~i~~ev~~Q~~~ld~i~~~vd~~~~~l 48 (63)
T PF05739_consen 4 EELDELEQSIQELKQMFQDIGEEVEEQNEMLDRIEDNVDRANENL 48 (63)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCHhhHHHHHHHHHHHHHHH
Confidence 567788888888888777777776666655555555555555544
No 38
>KOG0518|consensus
Probab=39.66 E-value=57 Score=34.40 Aligned_cols=45 Identities=16% Similarity=0.364 Sum_probs=32.4
Q ss_pred CceeEEEEECCCCCEEE---eeeecccceEEEEeeCCccEEEEeecCC
Q psy15308 121 GVGMHVQISNPDSKIIL---SRLYSYEGRISFTSHMPGEHVICLYSNS 165 (270)
Q Consensus 121 ~~~i~~~V~dP~g~~v~---~~~~~~eg~f~f~a~~~G~Y~iCf~n~~ 165 (270)
..++.+.+.||.|+..- ..-....-+..|+.++.|.|++|+.++.
T Consensus 882 ~~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~ 929 (1113)
T KOG0518|consen 882 SQDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKD 929 (1113)
T ss_pred ccceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecC
Confidence 35688888999997622 2222233467899999999999999873
No 39
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=38.68 E-value=2.6e+02 Score=24.09 Aligned_cols=31 Identities=19% Similarity=0.085 Sum_probs=18.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYRE 226 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re 226 (270)
++++.++.+|.+++..+..++...+-.+.|-
T Consensus 15 ~~L~rle~qi~q~~~~~~~~qs~l~~~~~r~ 45 (251)
T COG5415 15 ADLSRLESQIHQLDVALKKSQSILSQWQSRL 45 (251)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677777777766666665555554443
No 40
>PF09753 Use1: Membrane fusion protein Use1; InterPro: IPR019150 This entry represents a family of proteins, approximately 300 residues in length, involved in vesicle transport. They have a single C-terminal transmembrane domain and a SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] domain of approximately 60 residues. The SNARE domains are essential for membrane fusion and are conserved from yeasts to humans. Use1 is one of the three protein subunits that make up the SNARE complex and it is specifically required for Golgi-endoplasmic reticulum retrograde transport [].
Probab=38.20 E-value=2.3e+02 Score=24.66 Aligned_cols=22 Identities=9% Similarity=0.159 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHhhHHHHHHHH
Q psy15308 242 WWSLAQLGILVCMGVWQMRHLKS 264 (270)
Q Consensus 242 ~~silqi~vli~~s~~Qv~~LKr 264 (270)
||..+-++++++ .++-++.+=|
T Consensus 228 ~~~~~~i~~v~~-~Fi~mvl~ir 249 (251)
T PF09753_consen 228 CWTWLMIFVVII-VFIMMVLFIR 249 (251)
T ss_pred HHHHHHHHHHHH-HHHHHHHHhe
Confidence 666654444433 3444444433
No 41
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=37.97 E-value=68 Score=29.00 Aligned_cols=42 Identities=19% Similarity=0.105 Sum_probs=30.0
Q ss_pred ceeEEEEECCCCCEEEeeeec----ccceEEEEe---------eCCccEEEEeec
Q psy15308 122 VGMHVQISNPDSKIILSRLYS----YEGRISFTS---------HMPGEHVICLYS 163 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~~----~eg~f~f~a---------~~~G~Y~iCf~n 163 (270)
..+.+.|+|.+|++|+.-... ..|.+.|+- ...|.|+|=+..
T Consensus 166 ~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a 220 (295)
T PRK05842 166 GVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEY 220 (295)
T ss_pred ceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 368999999999999875432 246666652 235899988864
No 42
>PF04136 Sec34: Sec34-like family ; InterPro: IPR007265 Sec34 and Sec35 form a sub-complex in a seven-protein complex that includes Dor1. This complex is thought to be important for tethering vesicles to the Golgi [].; GO: 0006886 intracellular protein transport, 0005801 cis-Golgi network, 0016020 membrane
Probab=37.97 E-value=2.1e+02 Score=23.14 Aligned_cols=55 Identities=13% Similarity=0.201 Sum_probs=42.5
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHHHHHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLAQLGI 250 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~silqi~v 250 (270)
+.++.+...-..+..+...++..-+-+-..+.+.....++...++.+|.-++-+.
T Consensus 28 ~~l~~l~~~~~~Vs~kT~~l~~~ce~Ll~eq~~L~~~ae~I~~~L~yF~~Ld~it 82 (157)
T PF04136_consen 28 DQLDELQEQYNSVSEKTNSLHEACEQLLEEQTRLEELAEEISEKLQYFEELDPIT 82 (157)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHhhHHHHH
Confidence 3455666666667777777777777777888899999999999999998887654
No 43
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=37.79 E-value=1.9e+02 Score=29.27 Aligned_cols=56 Identities=14% Similarity=0.128 Sum_probs=34.8
Q ss_pred eEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecCC--C-CccccceeEEEEEEE
Q psy15308 124 MHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSNS--T-KWVGASQLRVHLDIQ 181 (270)
Q Consensus 124 i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~~--~-~~~~~~~~~V~f~i~ 181 (270)
..+.|+. +|+.+...+-.....+.|+++.+|.|++=+.... + +.+. .+..|++.+.
T Consensus 418 Y~f~ik~-ng~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD-~~k~v~l~V~ 476 (667)
T PRK14081 418 YSFIIKK-DGKEEEKIDYGKNNWVNFIPEEKGNYELEVRVKDKYSDKEYD-AHTIVYIKVH 476 (667)
T ss_pred EEEEEEE-CCEEEEEeecccccEEEEEECCCeeEEEEEEEecccCchhcc-cceEEEEEEe
Confidence 3344444 5555555444567789999999999998776432 1 2233 2456777664
No 44
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=36.97 E-value=1.2e+02 Score=26.22 Aligned_cols=41 Identities=15% Similarity=0.175 Sum_probs=30.1
Q ss_pred ceeEEEEECCCCCEEEeeee--cccceEEEEee---------CCccEEEEee
Q psy15308 122 VGMHVQISNPDSKIILSRLY--SYEGRISFTSH---------MPGEHVICLY 162 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~--~~eg~f~f~a~---------~~G~Y~iCf~ 162 (270)
..+.+.|+|.+|++|+.... ...|.+.|.-. ..|.|++-+.
T Consensus 122 ~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~ 173 (221)
T PRK12634 122 GFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTAT 173 (221)
T ss_pred CeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEE
Confidence 46889999999999987543 44566665442 3599999985
No 45
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=36.82 E-value=22 Score=24.03 Aligned_cols=20 Identities=45% Similarity=0.785 Sum_probs=15.6
Q ss_pred hhhHHHHHHHHHhhhccccc
Q psy15308 18 SKIILSRLLCVVLLAVIPFS 37 (270)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~ 37 (270)
.|+++.-|||+.|++..+++
T Consensus 3 ~Kl~vialLC~aLva~vQ~A 22 (65)
T PF10731_consen 3 SKLIVIALLCVALVAIVQSA 22 (65)
T ss_pred chhhHHHHHHHHHHHHHhcC
Confidence 58888888999888866654
No 46
>PF01519 DUF16: Protein of unknown function DUF16; InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=36.42 E-value=1.9e+02 Score=21.85 Aligned_cols=46 Identities=22% Similarity=0.241 Sum_probs=34.1
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
-+.++.+.+.++..-+.++++..+++-.-.+-+..-.+..+++.|+
T Consensus 52 geqI~kL~e~V~~QGEqIkel~~e~k~qgktL~~I~~~L~~inkRL 97 (102)
T PF01519_consen 52 GEQINKLTEKVDKQGEQIKELQVEQKAQGKTLQLILKTLQSINKRL 97 (102)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466667777777888888888888866666677777777888775
No 47
>PF10528 PA14_2: GLEYA domain; InterPro: IPR018871 This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=36.27 E-value=1e+02 Score=23.54 Aligned_cols=46 Identities=22% Similarity=0.330 Sum_probs=26.2
Q ss_pred cCCCceeEEEEccCCc--EEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEee
Q psy15308 82 SETERKCFIEEIPEET--QVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSR 139 (270)
Q Consensus 82 ~~g~~~Cf~e~v~~~~--~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~ 139 (270)
..+...++..++.+|. +|++-| +. .++ ...+++.|++|+|..+.+.
T Consensus 55 ~~~~~~~~tv~L~aG~yyPiRi~~-~N-----~~g------~~~~~~~i~~P~G~~~~~~ 102 (113)
T PF10528_consen 55 STGASKSVTVYLTAGTYYPIRIVY-AN-----GGG------PGSFDFSITDPDGTVHTDD 102 (113)
T ss_dssp SS-SEEEEEEEE-TT-BEEEEEEE-EE------SS-------EEEEEEEEETT-S--B--
T ss_pred CCCCceEEEEEEECCcEEEEEEEE-Ec-----CCC------ceEEEEEEECCCCcEEecC
Confidence 4567789999999998 555544 22 122 2478999999999997554
No 48
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=35.23 E-value=1.8e+02 Score=21.19 Aligned_cols=41 Identities=20% Similarity=0.320 Sum_probs=30.0
Q ss_pred eeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecC
Q psy15308 123 GMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~ 164 (270)
.--+.+.|++|+..-+-.-..+|+|.|-+. +|.+++=...+
T Consensus 23 gAyVRLLD~sgEFtaEvvts~~G~FRFfaa-pG~WtvRal~~ 63 (85)
T PF07210_consen 23 GAYVRLLDSSGEFTAEVVTSATGDFRFFAA-PGSWTVRALSR 63 (85)
T ss_pred CeEEEEEcCCCCeEEEEEecCCccEEEEeC-CCceEEEEEcc
Confidence 455678899999855555568899998875 68887776544
No 49
>PF07835 COX4_pro_2: Bacterial aa3 type cytochrome c oxidase subunit IV; InterPro: IPR012422 Bacterial cytochrome c oxidase is found bound to the to the cell membrane, where it is involved in the generation of the transmembrane proton electrochemical gradient. It is composed of four subunits. Subunit IV consists of one transmembrane helix that does not interact directly with the other subunits, but maintains its position by indirect contacts via phospholipid molecules found in the structure. The function of subunit IV is as yet unknown []. ; PDB: 1QLE_D 1M57_J 1M56_J.
Probab=34.73 E-value=1.1e+02 Score=19.30 Aligned_cols=26 Identities=19% Similarity=0.326 Sum_probs=14.3
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHhh
Q psy15308 231 ATSESTNQRVFWWSLAQLGILVCMGV 256 (270)
Q Consensus 231 ~~~est~~rV~~~silqi~vli~~s~ 256 (270)
++-+.--+-..|.+++-+++++.+++
T Consensus 16 ~Ty~gFi~~~k~~~~~~~~~li~lai 41 (44)
T PF07835_consen 16 KTYDGFIKLTKWGTIAIAAILIFLAI 41 (44)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444455556666666666655
No 50
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=34.54 E-value=1.4e+02 Score=22.45 Aligned_cols=39 Identities=10% Similarity=0.211 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 202 QLRVRQLLDQCDQI--TKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 202 ~~~l~~L~~~l~~I--~~eq~~~r~re~~~r~~~est~~rV 240 (270)
+.+++.++..+.++ +++..-++.+-.+.+.....++.++
T Consensus 48 ~~Rl~~lE~~l~~LPt~~dv~~L~l~l~el~G~~~~l~~~l 88 (106)
T PF10805_consen 48 DRRLQALETKLEHLPTRDDVHDLQLELAELRGELKELSARL 88 (106)
T ss_pred HHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 45555555555554 3444444444444444444444443
No 51
>KOG3091|consensus
Probab=34.38 E-value=1.8e+02 Score=28.23 Aligned_cols=54 Identities=9% Similarity=0.188 Sum_probs=41.9
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHHHH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLAQL 248 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~silqi 248 (270)
.+.++.-..+|+.+.+.+.+++.++.-.-.+-++++....+...|++.-.+.+-
T Consensus 347 ~q~~~~~r~ri~~i~e~v~eLqk~~ad~~~KI~~~k~r~~~Ls~RiLRv~ikqe 400 (508)
T KOG3091|consen 347 DQEVKQHRIRINAIGERVTELQKHHADAVAKIEEAKNRHVELSHRILRVMIKQE 400 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666777888888888888877777778888888888999999877665543
No 52
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=32.32 E-value=1.3e+02 Score=22.70 Aligned_cols=42 Identities=10% Similarity=0.059 Sum_probs=23.0
Q ss_pred ceeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecC
Q psy15308 122 VGMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~ 164 (270)
..+.+.|++.+|..++.......+ .......+|.|++++.-+
T Consensus 52 ~~~~~~i~~~~g~~v~~~~t~~~~-~~~~~~~~g~~~~~~~i~ 93 (142)
T PF14524_consen 52 PVFGFAIRDSDGQRVFGTNTYDSG-FPIPLSEGGTYEVTFTIP 93 (142)
T ss_dssp EEEEEEEEETT--EEEEEEHHHHT---EEE-TT-EEEEEEEEE
T ss_pred cEEEEEEEcCCCCEEEEECccccC-ccccccCCCEEEEEEEEc
Confidence 468888999999988874432222 333333367777776543
No 53
>COG4467 Regulator of replication initiation timing [Replication, recombination, and repair]
Probab=31.90 E-value=2.3e+02 Score=21.60 Aligned_cols=45 Identities=16% Similarity=0.248 Sum_probs=29.7
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
+.+..++..+..+...+..|++....+.+--...+--++....|.
T Consensus 8 d~v~~le~~l~~l~~el~~lK~~l~~lvEEN~~L~lENe~LR~RL 52 (114)
T COG4467 8 DQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRERL 52 (114)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHhhHHHHHHHh
Confidence 456777788888888888888777776655555555555554443
No 54
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=31.44 E-value=62 Score=26.78 Aligned_cols=33 Identities=9% Similarity=-0.065 Sum_probs=27.9
Q ss_pred HhhHHHHHHHHHHHHHHHHhhHHHHHHHHhhcc
Q psy15308 236 TNQRVFWWSLAQLGILVCMGVWQMRHLKSFFEA 268 (270)
Q Consensus 236 t~~rV~~~silqi~vli~~s~~Qv~~LKrfF~~ 268 (270)
.+-|..+++.+-++++++++++|++.+-+--++
T Consensus 27 I~P~~~~~~~~a~i~l~ilai~q~~~~~~~~~~ 59 (182)
T PF09323_consen 27 IHPRYIPLLYFAAILLLILAIVQLWRWFRPKRR 59 (182)
T ss_pred hCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 567889999999999999999999987665444
No 55
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=31.26 E-value=2.7e+02 Score=22.12 Aligned_cols=79 Identities=16% Similarity=0.133 Sum_probs=40.1
Q ss_pred eeEEEEEcCCCceeEEEEccCC---cEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCC----EEE----eeeec-
Q psy15308 75 YALYFHISETERKCFIEEIPEE---TQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSK----IIL----SRLYS- 142 (270)
Q Consensus 75 ~~l~f~i~~g~~~Cf~e~v~~~---~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~----~v~----~~~~~- 142 (270)
..+...|++.++.----+++.- ...-++|++.. + ...-..+..+.+.+.+.++- .+. +....
T Consensus 25 ~~f~~~i~~~~~~~~~~~~~~~~~~~i~a~n~~~~P-~-----~I~VkaGD~Vtl~vtN~d~~~H~f~i~~~gis~~I~p 98 (135)
T TIGR03096 25 QSFTVVINAYDTTIPELNVEGVTVKNIRAFNVLNEP-E-----ALVVKKGTPVKVTVENKSPISEGFSIDAYGISEVIKA 98 (135)
T ss_pred ceeEEEEeccccEeeEEEeCCEEEEEEEeeeeEEcC-C-----EEEECCCCEEEEEEEeCCCCccceEECCCCcceEECC
Confidence 4677777776665423333332 23355666541 1 01113345676666665432 111 11111
Q ss_pred -ccceEEEEeeCCccEEE
Q psy15308 143 -YEGRISFTSHMPGEHVI 159 (270)
Q Consensus 143 -~eg~f~f~a~~~G~Y~i 159 (270)
....++|++.++|+|.+
T Consensus 99 Get~TitF~adKpG~Y~y 116 (135)
T TIGR03096 99 GETKTISFKADKAGAFTI 116 (135)
T ss_pred CCeEEEEEECCCCEEEEE
Confidence 22357888899999984
No 56
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=31.09 E-value=2.9e+02 Score=22.48 Aligned_cols=48 Identities=15% Similarity=0.119 Sum_probs=33.8
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFW 242 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~ 242 (270)
+..-+.+..+.++|.+....++.-.......-+|.+.+..=..-+..+
T Consensus 51 ~~~~~~lr~Rydrlr~va~rvQ~vlgd~At~gERl~allsWrdP~aT~ 98 (156)
T PF08372_consen 51 SRPPDSLRMRYDRLRSVAGRVQNVLGDVATQGERLQALLSWRDPRATA 98 (156)
T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccHHH
Confidence 345567778888888888888888888887777777776654444433
No 57
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=29.53 E-value=2.1e+02 Score=24.72 Aligned_cols=42 Identities=12% Similarity=0.173 Sum_probs=30.3
Q ss_pred ceeEEEEECCCCCEEEeeee--cccceEEEEe---------eCCccEEEEeec
Q psy15308 122 VGMHVQISNPDSKIILSRLY--SYEGRISFTS---------HMPGEHVICLYS 163 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~--~~eg~f~f~a---------~~~G~Y~iCf~n 163 (270)
..+.+.|+|.+|++|+..+. ...|.+.|+- ...|.|++=+.-
T Consensus 129 ~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a 181 (230)
T PRK12633 129 TKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSA 181 (230)
T ss_pred cEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 36899999999999987542 3556666643 235899999864
No 58
>PF12669 P12: Virus attachment protein p12 family
Probab=29.42 E-value=34 Score=22.94 Aligned_cols=9 Identities=33% Similarity=0.774 Sum_probs=6.3
Q ss_pred HHHhhcccC
Q psy15308 262 LKSFFEAKK 270 (270)
Q Consensus 262 LKrfF~~KK 270 (270)
+|++++++|
T Consensus 17 ~r~~~k~~K 25 (58)
T PF12669_consen 17 IRKFIKDKK 25 (58)
T ss_pred HHHHHHHhh
Confidence 488887654
No 59
>PRK14710 hypothetical protein; Provisional
Probab=29.39 E-value=44 Score=23.37 Aligned_cols=59 Identities=19% Similarity=0.186 Sum_probs=34.8
Q ss_pred EecCCChhhHHHHHHHHHhhhccccceeeEEeecccc---ceeeecCCChhhHHHHHHHHHHh
Q psy15308 12 QISNPDSKIILSRLLCVVLLAVIPFSYALYFHISETE---RKCFIEEIPEETQLLCVVLLAII 71 (270)
Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~lp~~~~~~~~lL~~l~ 71 (270)
.||..=+|++.++|...+.+..+..+ -+|.|-++.- -.+=|..+|..--+.-+|=-++.
T Consensus 3 aissn~skm~ififaiii~v~lcv~t-ylyl~kDe~LVSk~y~nY~AIPE~~Gv~TWLP~FFP 64 (86)
T PRK14710 3 AISSNLSKMIIFIFAIIIIVVLCVIT-YLYLYKDESLVSKHYINYMAIPENDGVFTWLPDFFP 64 (86)
T ss_pred ccccchhHHHHHHHHHHHHHHHHHhh-heeeecchhHHHHhccceeeccccCCcEEeccccCC
Confidence 46667789999999887777766655 3444433311 11237888887444333333333
No 60
>PRK14155 heat shock protein GrpE; Provisional
Probab=29.35 E-value=3.5e+02 Score=23.12 Aligned_cols=41 Identities=22% Similarity=0.252 Sum_probs=29.2
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSEST 236 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est 236 (270)
..++.++.++..+.+++.....+..-+|.|..+.++.....
T Consensus 20 ~~l~~le~e~~elkd~~lR~~AefeN~RKR~~kE~e~~~~~ 60 (208)
T PRK14155 20 QEIEALKAEVAALKDQALRYAAEAENTKRRAEREMNDARAY 60 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35667777888888888777777777777777766554443
No 61
>PF15232 DUF4585: Domain of unknown function (DUF4585)
Probab=28.99 E-value=1.5e+02 Score=21.05 Aligned_cols=33 Identities=15% Similarity=0.325 Sum_probs=25.0
Q ss_pred EEEEcCCCceeEEEEccCCcEEEEEEEEEEeecCCCcc
Q psy15308 78 YFHISETERKCFIEEIPEETQVIINYKTELFDTRTGGY 115 (270)
Q Consensus 78 ~f~i~~g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~ 115 (270)
++-+++..-+||+.+++..-.+..-| |+++|.|
T Consensus 9 KvL~DP~SG~Yy~vd~P~Qp~~k~lf-----DPETGqY 41 (75)
T PF15232_consen 9 KVLQDPESGQYYVVDAPVQPKTKTLF-----DPETGQY 41 (75)
T ss_pred cEeecCCCCCEEEEecCCCcceeeee-----cCCCCcE
Confidence 56778888889999999876666666 4557766
No 62
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=28.83 E-value=2.2e+02 Score=20.28 Aligned_cols=27 Identities=4% Similarity=0.265 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhH
Q psy15308 213 DQITKEQNYQRYREERFRATSESTNQR 239 (270)
Q Consensus 213 ~~I~~eq~~~r~re~~~r~~~est~~r 239 (270)
+++...-.-+......++..+...+++
T Consensus 34 ~~L~~kt~~L~~~a~~F~k~a~~l~r~ 60 (89)
T PF00957_consen 34 EELEDKTEELSDNAKQFKKNAKKLKRK 60 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 333333334444444444444444433
No 63
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=28.69 E-value=2e+02 Score=29.18 Aligned_cols=41 Identities=15% Similarity=0.125 Sum_probs=31.4
Q ss_pred eeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeec
Q psy15308 123 GMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYS 163 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n 163 (270)
--.+.+.+++|.....+.-.....|++++..+|.|++=...
T Consensus 224 LYKF~~i~~~G~~~~~qdYst~n~~~y~~~~~G~Y~i~~~V 264 (667)
T PRK14081 224 LYKFVKIDSDGKQTCIQDYSTKNIVSYKEKKSGDYKLLCLV 264 (667)
T ss_pred EEEEEEECCCCCEEEecCccccceEEEEeCCCccEEEEEEE
Confidence 34455667788777666667888999999999999985543
No 64
>PF03554 Herpes_UL73: UL73 viral envelope glycoprotein ; InterPro: IPR005211 This entry represents a conserved region found in a number of viral proteins: BLRF1, U46, 53, and UL73, collectively known as glycoprotein N. These UL73-like envelope glycoproteins, which associate in a high molecular mass complex with their counterpart protein gM, induce neutralizing antibody responses in the host. These glycoproteins are highly polymorphic, particularly in the N-terminal region [].; GO: 0019031 viral envelope
Probab=28.38 E-value=85 Score=22.69 Aligned_cols=28 Identities=25% Similarity=0.360 Sum_probs=22.2
Q ss_pred HHhhHHHHHHHHHHHHHHHHhhHHHHHH
Q psy15308 235 STNQRVFWWSLAQLGILVCMGVWQMRHL 262 (270)
Q Consensus 235 st~~rV~~~silqi~vli~~s~~Qv~~L 262 (270)
+..+-..+|.++..+++++.+++=+.|+
T Consensus 44 sl~SFsSIW~iiN~~il~~A~~vyLry~ 71 (82)
T PF03554_consen 44 SLSSFSSIWAIINVVILLCAFCVYLRYL 71 (82)
T ss_pred eehHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677899999999999988766664
No 65
>PF04678 DUF607: Protein of unknown function, DUF607; InterPro: IPR006769 This entry represents the C-terminal domain of coiled-coil domain containing protein 109.
Probab=27.85 E-value=3.5e+02 Score=22.32 Aligned_cols=44 Identities=23% Similarity=0.359 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHH
Q psy15308 203 LRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLA 246 (270)
Q Consensus 203 ~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~sil 246 (270)
.+...+...+..+.++..-+.....+-...++...+|++|-.+.
T Consensus 57 ~~~~~l~~~l~~~~~el~~le~~k~~id~~A~~~~~~~~w~gl~ 100 (180)
T PF04678_consen 57 SRERQLRKRLEELRQELAPLEKIKQEIDEKAEKRARRLLWGGLA 100 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556677777777777777777777777777776777655443
No 66
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=27.49 E-value=2.9e+02 Score=21.28 Aligned_cols=65 Identities=15% Similarity=0.354 Sum_probs=39.2
Q ss_pred CCceeEEEEccC-CcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCC-CEEEeeeec---ccceEEEEe---eCCc
Q psy15308 84 TERKCFIEEIPE-ETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDS-KIILSRLYS---YEGRISFTS---HMPG 155 (270)
Q Consensus 84 g~~~Cf~e~v~~-~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g-~~v~~~~~~---~eg~f~f~a---~~~G 155 (270)
+.-.|..+++.. +. .++|+.. |. ...+.|+|.+- +++++..+. ....|+... +...
T Consensus 37 ~rLFcVs~Die~L~a--Ev~f~mD------Ge--------~~iVEiKd~~~devLWsn~~~~~V~~dt~tisL~nlqk~k 100 (132)
T PF15417_consen 37 GRLFCVSEDIEALDA--EVYFQMD------GE--------SGIVEIKDRKTDEVLWSNTWNGKVSGDTFTISLNNLQKEK 100 (132)
T ss_pred ceEEEEecchheeee--EEEEEEc------Cc--------cceEEeccCCccceeeccccccccccceEEEEhhhcccCc
Confidence 667899988876 33 3456653 22 23467788644 567664332 233455543 4567
Q ss_pred cEEEEeecC
Q psy15308 156 EHVICLYSN 164 (270)
Q Consensus 156 ~Y~iCf~n~ 164 (270)
+|-+||...
T Consensus 101 EY~V~ftGt 109 (132)
T PF15417_consen 101 EYVVCFTGT 109 (132)
T ss_pred eEEEEEecc
Confidence 999999854
No 67
>KOG3385|consensus
Probab=26.93 E-value=2.2e+02 Score=21.99 Aligned_cols=27 Identities=22% Similarity=0.257 Sum_probs=14.6
Q ss_pred hhcchhHHHHHHHHHHHHHHHHHHHHH
Q psy15308 194 QKDKLSELQLRVRQLLDQCDQITKEQN 220 (270)
Q Consensus 194 ~~~~l~~l~~~l~~L~~~l~~I~~eq~ 220 (270)
.++.++.++.++..|...--+|-.|.+
T Consensus 34 Nee~~e~L~~kV~aLKsLs~dIg~Ev~ 60 (118)
T KOG3385|consen 34 NEEAAESLQQKVKALKSLSLDIGDEVR 60 (118)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 344556666666666555544444433
No 68
>PRK02935 hypothetical protein; Provisional
Probab=26.89 E-value=15 Score=27.87 Aligned_cols=29 Identities=34% Similarity=0.560 Sum_probs=16.3
Q ss_pred CChhhHHHHHHHHHhhhccccceeeEEeec
Q psy15308 16 PDSKIILSRLLCVVLLAVIPFSYALYFHIS 45 (270)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 45 (270)
+.+++++.+|+.+.+++...+. ++||.++
T Consensus 34 ~~~~~~m~ifm~~G~l~~l~S~-vvYFwiG 62 (110)
T PRK02935 34 RESIIIMTIFMLLGFLAVIAST-VVYFWIG 62 (110)
T ss_pred cccHHHHHHHHHHHHHHHHHHH-HHHHHHh
Confidence 4555666666666666555444 5555544
No 69
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=26.23 E-value=2.9e+02 Score=20.90 Aligned_cols=26 Identities=23% Similarity=0.312 Sum_probs=10.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15308 198 LSELQLRVRQLLDQCDQITKEQNYQR 223 (270)
Q Consensus 198 l~~l~~~l~~L~~~l~~I~~eq~~~r 223 (270)
+..+...+..|...+..+..+-.-++
T Consensus 17 l~~l~~~~~~LK~~~~~l~EEN~~L~ 42 (107)
T PF06156_consen 17 LGQLLEELEELKKQLQELLEENARLR 42 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444333333
No 70
>cd03441 R_hydratase_like (R)-hydratase [(R)-specific enoyl-CoA hydratase]. Catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway. The structure of the monomer includes a five-strand antiparallel beta-sheet wrapped around a central alpha helix, referred to as a hot dog fold. The active site lies within a substrate-binding tunnel formed by the homodimer. Other enzymes with this fold include MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and the fatty acid synthase beta subunit.
Probab=26.03 E-value=2.7e+02 Score=20.45 Aligned_cols=48 Identities=13% Similarity=0.132 Sum_probs=31.0
Q ss_pred ceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEe
Q psy15308 86 RKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILS 138 (270)
Q Consensus 86 ~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~ 138 (270)
.-.|...+..|+.+++..++...++.. +. .-..+...+.+.+|+++..
T Consensus 74 ~~~f~~Pv~~Gd~l~~~~~v~~~~~~~-~~----~~v~~~~~~~n~~g~~v~~ 121 (127)
T cd03441 74 SVRFLAPVFPGDTLRVEVEVLGKRPSK-GR----GVVTVRTEARNQGGEVVLS 121 (127)
T ss_pred EEEEeCCcCCCCEEEEEEEEEEeeccC-CC----cEEEEEEEEEeCCCCEEEE
Confidence 344556688899999999998654211 10 1134666677888887654
No 71
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=25.66 E-value=3.2e+02 Score=23.09 Aligned_cols=61 Identities=11% Similarity=0.158 Sum_probs=24.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHH------HhhHHHHHHHHHHHHHHHHhhH
Q psy15308 197 KLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSES------TNQRVFWWSLAQLGILVCMGVW 257 (270)
Q Consensus 197 ~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~es------t~~rV~~~silqi~vli~~s~~ 257 (270)
+...-+.+++...+.+..+..+-.-.+++..+.....++ -..-..|...+.++++|+++++
T Consensus 102 Tw~~W~~~i~~~~~~i~~ll~~a~~qqe~n~~~l~~Ld~w~~l~~wfdit~W~~~Iki~i~iv~~iI 168 (204)
T PF00517_consen 102 TWQQWEKEISNYTGNIYNLLEEAQNQQEKNEQDLLKLDSWTNLWSWFDITKWLWYIKIFIMIVIGII 168 (204)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTHHHHHHHCHHHHHHHHHH---------
T ss_pred hHHHHHHHhcccHHHHHHHHHHHHhchhhhhhhhcCCcHHhhhhhHHhHHHHHHHHHHHHHHHHHHH
Confidence 334455555555555555555444444443333333333 2333445556666666655543
No 72
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.61 E-value=2.8e+02 Score=23.77 Aligned_cols=42 Identities=17% Similarity=0.177 Sum_probs=28.3
Q ss_pred ceeEEEEECCCCCEEEeee--ecccceEEEEe------eCCccEEEEeecC
Q psy15308 122 VGMHVQISNPDSKIILSRL--YSYEGRISFTS------HMPGEHVICLYSN 164 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~--~~~eg~f~f~a------~~~G~Y~iCf~n~ 164 (270)
..+.+.|+|.+|++ +... ....|.+.|+- -.+|.|++=+...
T Consensus 123 ~~v~v~I~D~~G~v-~t~~l~~~~aG~~~f~WDG~~~~lp~G~Y~~~V~a~ 172 (218)
T PRK09619 123 PTLTLHITDILGQE-KKIDLGKQPAGPVNFTLDPAALGLQPGQYQLSVVSG 172 (218)
T ss_pred cEEEEEEEeCCCCE-EEEecCCcCCCceeEEECCCCCCCCCceeEEEEEEe
Confidence 36889999999997 4332 23456666653 2468999988644
No 73
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=25.02 E-value=1.7e+02 Score=23.71 Aligned_cols=50 Identities=20% Similarity=0.249 Sum_probs=31.2
Q ss_pred hhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh
Q psy15308 189 YAQVAQKDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQ 238 (270)
Q Consensus 189 ~~~~~~~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~ 238 (270)
|+...+-.+-+++++.+....+.++++..+.+-.+.+-+.........-.
T Consensus 32 Y~~lm~g~~~~~lE~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 81 (151)
T PF14584_consen 32 YDALMRGKDGKNLEDLLNELFDQIDELKEELEELEKRIEELEEKLRNCVQ 81 (151)
T ss_pred HHHHhCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 44443334445777777777788888877777776666555555544433
No 74
>PRK14163 heat shock protein GrpE; Provisional
Probab=24.96 E-value=4.5e+02 Score=22.61 Aligned_cols=44 Identities=18% Similarity=0.194 Sum_probs=30.4
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQR 239 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~r 239 (270)
..++.++.++..+.+.+.....+..-+|.|..+.++........
T Consensus 47 ~~l~~l~~e~~el~d~~lR~~AEfeN~rkR~~kE~e~~~~~a~~ 90 (214)
T PRK14163 47 AQLDQVRTALGERTADLQRLQAEYQNYRRRVERDRVTVKEIAVA 90 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34667777777777777777777777777777776665544433
No 75
>PF08842 Mfa2: Fimbrillin-A associated anchor proteins Mfa1 and Mfa2; InterPro: IPR014941 This family of proteins may be lipoproteins principally from bacilli. They are between 300 and 400 residues. Many Bacteroides-like bacterial species, including Porphyromonas gingivalis, the causal agent of periodontal infection, carry at least two types of fimbriae, namely FimA and Mfa1 fimbriae, following the names of their major subunit proteins []. Normally, FimA fimbriae are long filaments that are easily detached from cells, whereas Mfa1 fimbriae are short filaments that are tightly bound to cells; however, in the absence of Mfa2 protein, the Mfa1 fimbriae are also very long and are not attached. Mfa2 and Mfa1 are associated with each other in whole P. gingivalis cells to the extent that Mfa2 is located on the cell surface and probably associated with Mfa1 fimbriae in such a way that it anchors the Mfa1 fimbriae to the cell surface and regulates Mfa1 filament length [].; PDB: 3PAY_C 3GF8_A.
Probab=24.75 E-value=93 Score=26.86 Aligned_cols=43 Identities=19% Similarity=0.323 Sum_probs=26.5
Q ss_pred ceeEEEEECCCCCEEEeeeecc---c-ceEEE--EeeCCccEEEEeecC
Q psy15308 122 VGMHVQISNPDSKIILSRLYSY---E-GRISF--TSHMPGEHVICLYSN 164 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~~~~~---e-g~f~f--~a~~~G~Y~iCf~n~ 164 (270)
..+++.|+|.+|+.+....... . +.|.. .....|.|+++.-.+
T Consensus 29 ~~v~lyvFd~~g~~v~~~~~~~~~~~~~~y~~~~~~l~~G~Y~~va~~n 77 (283)
T PF08842_consen 29 KRVDLYVFDEDGKLVKQRTIDSEELEGGGYTMFLLDLPPGTYTFVAWGN 77 (283)
T ss_dssp -EEEEEEE-TTSBEEEEEEEECGGCCTTTEEE-CCT--SEEEEEEEEES
T ss_pred eEEEEEEEeCCCeEEEEEEcccccccCCceEEeeccCCCCcEEEEEEEC
Confidence 4699999999999665544321 2 44554 334579999987654
No 76
>PRK14149 heat shock protein GrpE; Provisional
Probab=24.61 E-value=4.3e+02 Score=22.28 Aligned_cols=43 Identities=9% Similarity=0.191 Sum_probs=30.9
Q ss_pred hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Q psy15308 194 QKDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSEST 236 (270)
Q Consensus 194 ~~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est 236 (270)
.++.++.++.++..+.+.+.....+..=+|.|..+-++.....
T Consensus 41 ~~~~~~~l~~e~~elkd~~lR~~AefEN~rKR~~kE~e~~~~~ 83 (191)
T PRK14149 41 EGEIKEDFELKYKEMHEKYLRVHADFENVKKRLERDKSMALEY 83 (191)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456777888888888888888888777777766666554433
No 77
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=24.47 E-value=2.4e+02 Score=19.43 Aligned_cols=40 Identities=15% Similarity=0.192 Sum_probs=29.4
Q ss_pred eeEEEEECCCCCEEEeeeecccceEEEEeeCCccEEEEeecC
Q psy15308 123 GMHVQISNPDSKIILSRLYSYEGRISFTSHMPGEHVICLYSN 164 (270)
Q Consensus 123 ~i~~~V~dP~g~~v~~~~~~~eg~f~f~a~~~G~Y~iCf~n~ 164 (270)
+.=+.|+|.+|+.+++...+.-..+++ +....+++=+-|.
T Consensus 8 ~sWv~V~d~dG~~~~~~~l~~G~~~~~--~~~~~~~i~iGna 47 (77)
T PF13464_consen 8 DSWVEVTDADGKVLFSGTLKAGETKTF--EGKEPFRIRIGNA 47 (77)
T ss_pred CeEEEEEeCCCcEeeeeeeCCCcEEEE--eCCCCEEEEEeCC
Confidence 466788999999999877666666777 4455677777765
No 78
>PF07680 DoxA: TQO small subunit DoxA; InterPro: IPR011636 Thiosulphate:quinone oxidoreductase (TQO) catalyses one of the early steps in elemental sulphur oxidation. A novel TQO enzyme was purified from the thermo-acidophilic archaeon Acidianus ambivalens and shown to consist of a large subunit (DoxD) and a smaller subunit (DoxA). The DoxD- and DoxA-like two subunits are fused together in a single polypeptide in Q8AAF0 from SWISSPROT.
Probab=23.69 E-value=3.7e+02 Score=21.26 Aligned_cols=41 Identities=12% Similarity=0.216 Sum_probs=29.1
Q ss_pred CceeEEEEECCCCCEEEeeeec---------ccceEEEEeeCCccEEEEee
Q psy15308 121 GVGMHVQISNPDSKIILSRLYS---------YEGRISFTSHMPGEHVICLY 162 (270)
Q Consensus 121 ~~~i~~~V~dP~g~~v~~~~~~---------~eg~f~f~a~~~G~Y~iCf~ 162 (270)
..-+.+.+.|++|++++.+... -..+|.-. -.+|.|.+++-
T Consensus 46 sfl~~i~l~d~~g~vv~~~~~~~L~~lP~~~i~N~Yv~~-~~~g~~gl~vp 95 (133)
T PF07680_consen 46 SFLIGIQLKDSTGHVVLNWDQEKLSSLPKSNIKNDYVAK-VKPGKHGLVVP 95 (133)
T ss_pred ceeeEEEEECCCCCEEEEeCHHHhhhCChhHcCccEEcc-ccCCceeEEEE
Confidence 4568899999999999987542 23345322 34799999985
No 79
>PF07125 DUF1378: Protein of unknown function (DUF1378); InterPro: IPR009808 This entry is represented by Bacteriophage 933W, Orf25. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of hypothetical bacterial and phage proteins of around 59 residues in length. Bacterial members of this family seem to be specific to Enterobacteria. The function of this family is unknown.
Probab=23.41 E-value=1.4e+02 Score=19.90 Aligned_cols=29 Identities=10% Similarity=0.427 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHhhccc
Q psy15308 239 RVFWWSLAQLGILVCMGVWQMRHLKSFFEAK 269 (270)
Q Consensus 239 rV~~~silqi~vli~~s~~Qv~~LKrfF~~K 269 (270)
-++||+-+-+++.++.+.|- .+|.||++|
T Consensus 7 ~lLyFctvVcaLYLvsGGyk--~IRnY~r~K 35 (59)
T PF07125_consen 7 ILLYFCTVVCALYLVSGGYK--VIRNYFRRK 35 (59)
T ss_pred HHHHHHHHHHHHHHHhccHH--HHHHHHHHH
Confidence 45667766666666666664 468888765
No 80
>PRK14154 heat shock protein GrpE; Provisional
Probab=23.28 E-value=4.8e+02 Score=22.33 Aligned_cols=45 Identities=20% Similarity=0.186 Sum_probs=30.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRV 240 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV 240 (270)
..++.++.++..+.+.+..+..+.+=+|.|..+.+.........-
T Consensus 59 ~el~~le~e~~elkd~~lRl~ADfeNyRKR~~kE~e~~~~~a~e~ 103 (208)
T PRK14154 59 GQLTRMERKVDEYKTQYLRAQAEMDNLRKRIEREKADIIKFGSKQ 103 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346677777777778777777777777777777766655544433
No 81
>PF09451 ATG27: Autophagy-related protein 27; InterPro: IPR018939 Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. There are more than 25 AuTophaGy-related (ATG) genes that are essential for autophagy, although it is still not known how the autophagosome is made. Atg9 is a potential membrane carrier to deliver lipids that are used to form the vesicle. Atg27 is another transmembrane protein, and is a cycling protein []. It acts as an effector of VPS34 phosphatidylinositol 3-phosphate kinase signalling and regulates the cytoplasm to vacuole transport (Cvt) vesicle formation. It is also required for autophagy-dependent cycling of ATG9.
Probab=22.97 E-value=55 Score=28.94 Aligned_cols=37 Identities=24% Similarity=0.346 Sum_probs=25.0
Q ss_pred hHHHHHHHHHhhhccccceeeEEeeccccceeeecCCChh
Q psy15308 20 IILSRLLCVVLLAVIPFSYALYFHISETERKCFIEEIPEE 59 (270)
Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~~ 59 (270)
.||++++++++++.+.+ +.|+.......+-| |-||+.
T Consensus 204 ~wl~i~~~l~~~~Y~i~--g~~~n~~~~g~~g~-e~iP~~ 240 (268)
T PF09451_consen 204 TWLFIILFLFLAAYLIF--GSWYNYNRYGARGF-ELIPHF 240 (268)
T ss_pred HHHHHHHHHHHHHHhhh--hhheeeccCCCCCc-eecccH
Confidence 57788888877777665 44555555555665 888877
No 82
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=22.47 E-value=3.6e+02 Score=20.60 Aligned_cols=20 Identities=20% Similarity=0.390 Sum_probs=8.3
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy15308 199 SELQLRVRQLLDQCDQITKE 218 (270)
Q Consensus 199 ~~l~~~l~~L~~~l~~I~~e 218 (270)
..++..+..+...+..++..
T Consensus 11 ~~le~~l~~l~~el~~LK~~ 30 (110)
T PRK13169 11 DDLEQNLGVLLKELGALKKQ 30 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444433
No 83
>cd03446 MaoC_like MoaC_like Similar to the MaoC (monoamine oxidase C) dehydratase regulatory protein but without the N-terminal PutA domain. This protein family has a hot-dog fold similar to that of (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit.
Probab=22.06 E-value=3.5e+02 Score=20.52 Aligned_cols=52 Identities=10% Similarity=0.113 Sum_probs=32.2
Q ss_pred cee-EEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeee
Q psy15308 86 RKC-FIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRL 140 (270)
Q Consensus 86 ~~C-f~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~ 140 (270)
..| |...+..|+.+....+|...++.+++- ..-..+...+++.+|+.+.+..
T Consensus 83 ~~~~f~~pv~~GD~l~~~~~v~~~~~~~~~~---~~~v~~~~~~~nq~g~~v~~~~ 135 (140)
T cd03446 83 DNLRFLNPVFIGDTIRAEAEVVEKEEKDGED---AGVVTRRIEVVNQRGEVVQSGE 135 (140)
T ss_pred ceEEEcCCCCCCCEEEEEEEEEEecccCCCC---ceEEEEEEEEEcCCCCEEEEEE
Confidence 345 445677899999999998654322110 0112455667889999876543
No 84
>PRK14143 heat shock protein GrpE; Provisional
Probab=21.98 E-value=5.4e+02 Score=22.47 Aligned_cols=40 Identities=20% Similarity=0.213 Sum_probs=26.0
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHH
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSES 235 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~es 235 (270)
..++.++.++..+.+++..+..+..-+|.|..+.++....
T Consensus 74 ~el~~l~~e~~elkd~~lR~~AdfeN~RKR~~kE~e~~~~ 113 (238)
T PRK14143 74 QELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLRL 113 (238)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666677777777777777777777776666554443
No 85
>cd03451 FkbR2 FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes. FkbR2 has sequence similarity to (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. The function of FkbR2 is unknown.
Probab=21.98 E-value=3.7e+02 Score=20.57 Aligned_cols=52 Identities=12% Similarity=-0.063 Sum_probs=31.5
Q ss_pred eeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCCCEEEeee
Q psy15308 87 KCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDSKIILSRL 140 (270)
Q Consensus 87 ~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g~~v~~~~ 140 (270)
--|...+..|+.+....+|.+..+..+.- ...-..+...+.+.+|+++.+..
T Consensus 86 ~~f~~pv~~GDtl~~~~~v~~~~~~~~~~--~~~~v~~~~~~~nq~g~~V~~~~ 137 (146)
T cd03451 86 VRFPAPVFHGDTLYAESEVLSKRESKSRP--DAGIVTVRTVGYNQDGEPVLSFE 137 (146)
T ss_pred EEecCCCCCCCEEEEEEEEEEEecCCCCC--CCeEEEEEEEEECCCCCEEEEEE
Confidence 33445677899999999998654311110 00112456667888998876543
No 86
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=21.82 E-value=2.8e+02 Score=19.07 Aligned_cols=33 Identities=6% Similarity=0.142 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH
Q psy15308 200 ELQLRVRQLLDQCDQITKEQNYQRYREERFRAT 232 (270)
Q Consensus 200 ~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~ 232 (270)
.+.+++.+++..++++.+..+-+..+.+.+...
T Consensus 3 ~i~e~l~~ie~~l~~~~~~i~~lE~~~~~~e~~ 35 (71)
T PF10779_consen 3 DIKEKLNRIETKLDNHEERIDKLEKRDAANEKD 35 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455566666666665554444444444433333
No 87
>PF05371 Phage_Coat_Gp8: Phage major coat protein, Gp8; InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 1IFK_A 2C0W_A 2HI5_A 1FDM_A 1IFJ_A 2C0X_A 1IFI_A 1IFD_A 1MZT_A 1IFL_A ....
Probab=21.82 E-value=1.1e+02 Score=20.11 Aligned_cols=22 Identities=23% Similarity=0.270 Sum_probs=17.2
Q ss_pred HHHHHHHHhhHHHHHHHHhhcc
Q psy15308 247 QLGILVCMGVWQMRHLKSFFEA 268 (270)
Q Consensus 247 qi~vli~~s~~Qv~~LKrfF~~ 268 (270)
-+++.+..+++=+...|+|+.+
T Consensus 30 ~vvv~v~gafigirlFKKf~sk 51 (52)
T PF05371_consen 30 PVVVLVTGAFIGIRLFKKFASK 51 (52)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhcc
Confidence 3567777888889999999864
No 88
>PF14235 DUF4337: Domain of unknown function (DUF4337)
Probab=21.81 E-value=4.4e+02 Score=21.37 Aligned_cols=65 Identities=11% Similarity=0.110 Sum_probs=43.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHH
Q psy15308 199 SELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQRVFWWSLAQLGILVCMGVWQMRHLK 263 (270)
Q Consensus 199 ~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~rV~~~silqi~vli~~s~~Qv~~LK 263 (270)
+.++.++.+-.....+++.+.+-+..+-.......+..+.+=-.|.+..+++=|++.+.-+-.|.
T Consensus 69 ~~~~~~i~~Y~~~~~~~~~e~~~l~~~A~~~e~~~d~~~~~~~~f~~a~~~lQIaI~Lasit~Lt 133 (157)
T PF14235_consen 69 AAYQKKIARYKKEKARYKSEAEELEAKAKEAEAESDHALHHHHRFDLAVALLQIAIVLASITALT 133 (157)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhcccchhHHHHHHHHHHHHHHHHHHHH
Confidence 45666777777777777777777776666666677777777777777766666665555444443
No 89
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=21.53 E-value=2.7e+02 Score=18.78 Aligned_cols=18 Identities=6% Similarity=0.143 Sum_probs=9.5
Q ss_pred HHHhhHHHHHHHHHHHHH
Q psy15308 234 ESTNQRVFWWSLAQLGIL 251 (270)
Q Consensus 234 est~~rV~~~silqi~vl 251 (270)
+..+.+.++|.++-++++
T Consensus 34 ~~~~~~~i~~~~~i~~l~ 51 (59)
T PF09889_consen 34 RMRKTQYIFFGIFILFLA 51 (59)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 334556666666554333
No 90
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.34 E-value=7e+02 Score=23.53 Aligned_cols=39 Identities=18% Similarity=0.184 Sum_probs=26.1
Q ss_pred CceeEEEEECCCCCEEEeeeecccceEEEEe----eCCccEEEEee
Q psy15308 121 GVGMHVQISNPDSKIILSRLYSYEGRISFTS----HMPGEHVICLY 162 (270)
Q Consensus 121 ~~~i~~~V~dP~g~~v~~~~~~~eg~f~f~a----~~~G~Y~iCf~ 162 (270)
...+.+++.+|++..+..+.... |...+ ...|.|+.=+.
T Consensus 69 ~etV~Vtl~G~ns~~~~~~~~~d---FkV~ADLt~a~~Gt~evkl~ 111 (403)
T COG4856 69 PETVTVTLKGPNSIVLKSEKPED---FKVVADLTHAGVGTHEVKLQ 111 (403)
T ss_pred ceEEEEEEeCCcceeeeeecCcC---eEEEEEhhhcCCCceEeeeE
Confidence 46799999999999887654332 55544 34566665554
No 91
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=21.28 E-value=3.1e+02 Score=19.46 Aligned_cols=48 Identities=15% Similarity=0.352 Sum_probs=30.6
Q ss_pred ceeEEEEEcC---CCceeEEEEccCCcEEEEEEEEEEeecCCCccccCCCCceeEEEEECCCC
Q psy15308 74 SYALYFHISE---TERKCFIEEIPEETQVIINYKTELFDTRTGGYMQSSPGVGMHVQISNPDS 133 (270)
Q Consensus 74 a~~l~f~i~~---g~~~Cf~e~v~~~~~i~~~y~v~~~d~~~g~~~~~~~~~~i~~~V~dP~g 133 (270)
..+..|.|.+ +...+....|+.|+.....+.+.. ++++ -++.|..|+|
T Consensus 31 ~~~~~~~v~~~~y~~~~~~~~~v~ag~~~~~~w~l~~----s~gw--------YDl~v~~~~~ 81 (89)
T PF05506_consen 31 SAAVTFTVYDNAYGGGGPWTYTVAAGQTVSLTWPLAA----SGGW--------YDLTVTGPNG 81 (89)
T ss_pred CCcEEEEEEeCCcCCCCCEEEEECCCCEEEEEEeecC----CCCc--------EEEEEEcCCC
Confidence 4556666665 225577777888887777776631 2333 5677788887
No 92
>PRK14147 heat shock protein GrpE; Provisional
Probab=21.04 E-value=4.8e+02 Score=21.51 Aligned_cols=43 Identities=12% Similarity=0.070 Sum_probs=28.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh
Q psy15308 196 DKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSESTNQ 238 (270)
Q Consensus 196 ~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est~~ 238 (270)
..++.++.++..+.+.+.....+..-+|.|..+.+........
T Consensus 25 ~~l~~l~~e~~elkd~~lR~~Ad~eN~rkR~~kE~e~~~~~a~ 67 (172)
T PRK14147 25 AEVESLRSEIALVKADALRERADLENQRKRIARDVEQARKFAN 67 (172)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466777777777777777777777777777766665544433
No 93
>PF14977 FAM194: FAM194 protein
Probab=20.64 E-value=5.4e+02 Score=21.99 Aligned_cols=14 Identities=0% Similarity=-0.035 Sum_probs=10.2
Q ss_pred EECCCCCEEEeeee
Q psy15308 128 ISNPDSKIILSRLY 141 (270)
Q Consensus 128 V~dP~g~~v~~~~~ 141 (270)
+.|++|.++..|.+
T Consensus 90 ~~D~~G~~~k~W~W 103 (208)
T PF14977_consen 90 YFDQKGNRVKKWNW 103 (208)
T ss_pred EEcCCCCEEEEEec
Confidence 45788888777765
No 94
>PHA03163 hypothetical protein; Provisional
Probab=20.49 E-value=2.1e+02 Score=20.98 Aligned_cols=28 Identities=7% Similarity=0.050 Sum_probs=21.4
Q ss_pred HHhhHHHHHHHHHHHHHHHHhhHHHHHH
Q psy15308 235 STNQRVFWWSLAQLGILVCMGVWQMRHL 262 (270)
Q Consensus 235 st~~rV~~~silqi~vli~~s~~Qv~~L 262 (270)
+..+--.+|+++..+++++.+++=+.|+
T Consensus 53 sL~SFSSIWaliNv~Ivl~A~~iyL~y~ 80 (92)
T PHA03163 53 QLLSFSSIWAILNVLIMLIACIIYCIYM 80 (92)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455667799999999998888766553
No 95
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=20.37 E-value=2.8e+02 Score=18.66 Aligned_cols=30 Identities=17% Similarity=0.371 Sum_probs=25.3
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15308 195 KDKLSELQLRVRQLLDQCDQITKEQNYQRY 224 (270)
Q Consensus 195 ~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~ 224 (270)
.+.++-+.++|..|.++...+..|-.++|.
T Consensus 13 rEEVevLK~~I~eL~~~n~~Le~EN~~Lk~ 42 (59)
T PF01166_consen 13 REEVEVLKEQIAELEERNSQLEEENNLLKQ 42 (59)
T ss_dssp TTSHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 467888999999999999999888887775
No 96
>PF03273 Baculo_gp64: Baculovirus gp64 envelope glycoprotein family; InterPro: IPR004955 Gp64 is the major envelope fusion glycoprotein in some, though not all, baculoviruses [, ]. It is found on the surface of both infected cells and budded virions as a homotrimer, and by determining the viral receptor preferences it defines the host range and infection efficiency. Baculovirus enters its host cells by endocytosis followed by a low-pH-induced fusion of the viral envelope with the endosomal membrane, allowing viral entrance into the cell cytoplasm. This membrane fusion, and also the efficient budding of virions from the infected cell, is dependent on Gp64. Gp75 is a homologous envelope glycoprotein from Thogotoviruses that, like Gp64, forms homotrimers and is involved in both binding and entering the host cell [].; GO: 0019048 virus-host interaction, 0019031 viral envelope; PDB: 3DUZ_A.
Probab=20.35 E-value=5.2e+02 Score=24.97 Aligned_cols=18 Identities=17% Similarity=0.285 Sum_probs=9.0
Q ss_pred ceeEEEEECCCCCEEEee
Q psy15308 122 VGMHVQISNPDSKIILSR 139 (270)
Q Consensus 122 ~~i~~~V~dP~g~~v~~~ 139 (270)
.++.+.+.||+|.+|--.
T Consensus 159 ~~v~v~t~d~NGs~I~~s 176 (498)
T PF03273_consen 159 DDVKVATLDPNGSVINVS 176 (498)
T ss_dssp --EEEEEB-TTS-B----
T ss_pred CceEEEEECCCCCEEecc
Confidence 368888999999876543
No 97
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=20.11 E-value=1.1e+02 Score=26.16 Aligned_cols=32 Identities=16% Similarity=0.198 Sum_probs=19.8
Q ss_pred HhhHHHHHHHHHHHHHHHHhhH----------HHHHHHHhhc
Q psy15308 236 TNQRVFWWSLAQLGILVCMGVW----------QMRHLKSFFE 267 (270)
Q Consensus 236 t~~rV~~~silqi~vli~~s~~----------Qv~~LKrfF~ 267 (270)
++...+.-.+|-|+||+++|.+ +|-+|||==.
T Consensus 125 ~~K~amLIClIIIAVLfLICT~LfLSTVVLANKVS~LKrskQ 166 (227)
T PF05399_consen 125 NNKMAMLICLIIIAVLFLICTLLFLSTVVLANKVSSLKRSKQ 166 (227)
T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555666666666666665532 6777877544
No 98
>COG1382 GimC Prefoldin, chaperonin cofactor [Posttranslational modification, protein turnover, chaperones]
Probab=20.08 E-value=4.2e+02 Score=20.54 Aligned_cols=44 Identities=16% Similarity=0.250 Sum_probs=33.3
Q ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Q psy15308 193 AQKDKLSELQLRVRQLLDQCDQITKEQNYQRYREERFRATSEST 236 (270)
Q Consensus 193 ~~~~~l~~l~~~l~~L~~~l~~I~~eq~~~r~re~~~r~~~est 236 (270)
.+++-++.+++++..|+-++..+.+.++.+++|-.+.+....+.
T Consensus 67 ~k~~~~~eL~er~E~Le~ri~tLekQe~~l~e~l~eLq~~i~~~ 110 (119)
T COG1382 67 SKEEAVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQKA 110 (119)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556678888888888888888888888888877776665544
No 99
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=20.07 E-value=3.6e+02 Score=23.25 Aligned_cols=18 Identities=17% Similarity=0.252 Sum_probs=9.4
Q ss_pred hHHHHHHHHHHHHHHHHH
Q psy15308 199 SELQLRVRQLLDQCDQIT 216 (270)
Q Consensus 199 ~~l~~~l~~L~~~l~~I~ 216 (270)
.+....+.+|+.++.+..
T Consensus 11 ~~~~~~L~rle~qi~q~~ 28 (251)
T COG5415 11 TKYTADLSRLESQIHQLD 28 (251)
T ss_pred ccchhhHHHHHHHHHHHH
Confidence 345555555555555443
No 100
>PF10754 DUF2569: Protein of unknown function (DUF2569); InterPro: IPR019690 This entry represents a protein that is conserved in bacteria. The function is not known, but several members are annotated as being YdgK or a homologue thereof and associated to the inner membrane. This signature also matches proteins that are described as transglutaminase-like enzymes, although this could not be confirmed.
Probab=20.00 E-value=3.3e+02 Score=21.64 Aligned_cols=29 Identities=21% Similarity=0.277 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHhhcccC
Q psy15308 242 WWSLAQLGILVCMGVWQMRHLKSFFEAKK 270 (270)
Q Consensus 242 ~~silqi~vli~~s~~Qv~~LKrfF~~KK 270 (270)
..-.++++..+++.++.++.+-.||++||
T Consensus 54 ~~~~~~~~~~~~~~~~~l~~~~lffkr~~ 82 (149)
T PF10754_consen 54 ALWYFEVAINIAMWLFTLWLLYLFFKRKR 82 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHccc
Confidence 34456777777777888889999998875
Done!