Query psy3528
Match_columns 218
No_of_seqs 127 out of 1101
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 18:57:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3528.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3528hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1692|consensus 100.0 1.6E-49 3.4E-54 302.2 17.9 184 28-212 18-201 (201)
2 KOG1693|consensus 100.0 3.3E-43 7.1E-48 269.1 20.8 186 26-211 19-206 (209)
3 KOG1690|consensus 100.0 1.1E-42 2.4E-47 265.7 18.2 194 17-212 7-215 (215)
4 KOG1691|consensus 100.0 6.8E-41 1.5E-45 259.8 21.1 192 21-212 13-210 (210)
5 KOG3287|consensus 100.0 1.6E-38 3.4E-43 246.1 22.4 189 28-216 33-233 (236)
6 PF01105 EMP24_GP25L: emp24/gp 100.0 1.9E-39 4.2E-44 254.4 0.5 178 30-207 1-183 (183)
7 PF13897 GOLD_2: Golgi-dynamic 93.8 0.1 2.2E-06 38.6 4.0 31 91-121 104-134 (136)
8 PF01835 A2M_N: MG2 domain; I 93.4 1.8 3.9E-05 29.9 11.0 60 49-108 13-87 (99)
9 PF04151 PPC: Bacterial pre-pe 92.4 0.59 1.3E-05 30.3 5.8 61 41-104 3-68 (70)
10 PF13860 FlgD_ig: FlgD Ig-like 89.7 3.1 6.7E-05 27.9 7.4 32 50-82 11-42 (81)
11 smart00557 IG_FLMN Filamin-typ 89.4 2.4 5.1E-05 29.2 6.8 45 63-107 30-77 (93)
12 PRK02710 plastocyanin; Provisi 87.6 8.7 0.00019 27.8 10.1 74 26-104 26-104 (119)
13 PF13473 Cupredoxin_1: Cupredo 86.4 2.3 5.1E-05 29.8 5.4 54 46-103 38-91 (104)
14 PF00630 Filamin: Filamin/ABP2 83.7 9.8 0.00021 26.0 7.6 41 66-106 43-90 (101)
15 PF11589 DUF3244: Domain of un 82.5 11 0.00023 26.6 7.4 58 49-109 35-96 (106)
16 PF05738 Cna_B: Cna protein B- 80.0 5.5 0.00012 25.4 4.8 43 66-108 3-47 (70)
17 PRK06655 flgD flagellar basal 77.8 9.1 0.0002 31.1 6.4 57 50-107 112-179 (225)
18 PRK12813 flgD flagellar basal 75.2 13 0.00029 30.1 6.7 56 50-107 110-174 (223)
19 PF14524 Wzt_C: Wzt C-terminal 75.1 26 0.00057 25.2 7.9 82 39-122 25-107 (142)
20 PF13620 CarboxypepD_reg: Carb 74.6 4.9 0.00011 26.4 3.5 43 66-108 16-58 (82)
21 PF10528 PA14_2: GLEYA domain; 74.3 12 0.00026 26.9 5.6 45 37-82 56-102 (113)
22 PF09315 DUF1973: Domain of un 74.2 40 0.00086 26.4 9.7 67 49-121 29-99 (179)
23 PRK14081 triple tyrosine motif 72.4 77 0.0017 30.1 11.6 71 50-121 401-476 (667)
24 PRK05842 flgD flagellar basal 71.4 17 0.00036 30.9 6.6 58 50-107 148-221 (295)
25 TIGR03503 conserved hypothetic 70.9 74 0.0016 28.0 11.8 45 62-106 239-286 (374)
26 PF15417 DUF4624: Domain of un 69.8 38 0.00083 24.3 7.1 78 38-122 37-122 (132)
27 PRK12812 flgD flagellar basal 67.8 52 0.0011 27.4 8.7 56 51-107 128-194 (259)
28 PRK12633 flgD flagellar basal 63.7 55 0.0012 26.7 8.0 55 52-107 117-182 (230)
29 KOG3202|consensus 63.5 48 0.0011 27.1 7.6 69 133-201 149-234 (235)
30 PF07210 DUF1416: Protein of u 63.4 39 0.00084 23.0 5.8 58 48-107 5-63 (85)
31 PRK10378 inactive ferrous ion 62.0 67 0.0015 28.3 8.7 67 30-103 31-103 (375)
32 PRK12634 flgD flagellar basal 61.1 70 0.0015 25.9 8.1 45 63-107 120-175 (221)
33 PF10779 XhlA: Haemolysin XhlA 59.7 46 0.00099 21.6 8.0 11 183-193 55-65 (71)
34 KOG0518|consensus 59.0 24 0.00051 34.9 5.7 58 52-109 870-930 (1113)
35 PF05753 TRAP_beta: Translocon 58.0 89 0.0019 24.4 10.4 30 44-73 31-61 (181)
36 PF08842 Mfa2: Fimbrillin-A as 57.9 18 0.00038 29.6 4.4 62 46-107 3-77 (283)
37 PF08234 Spindle_Spc25: Chromo 55.2 32 0.0007 22.5 4.4 31 94-124 3-33 (74)
38 PF07495 Y_Y_Y: Y_Y_Y domain; 54.9 49 0.0011 20.5 7.3 39 67-108 10-49 (66)
39 PRK09619 flgD flagellar basal 53.7 54 0.0012 26.5 6.3 57 50-108 109-173 (218)
40 PF10572 UPF0556: Uncharacteri 50.2 1.1E+02 0.0025 23.3 8.9 44 24-67 19-62 (158)
41 CHL00177 ccs1 c-type cytochrom 50.0 1.2E+02 0.0027 27.1 8.5 77 4-80 158-247 (426)
42 PF05739 SNARE: SNARE domain; 48.2 64 0.0014 19.9 5.6 45 136-180 4-48 (63)
43 PRK13211 N-acetylglucosamine-b 47.0 2.3E+02 0.005 25.9 10.8 69 52-120 328-404 (478)
44 PF07125 DUF1378: Protein of u 46.6 25 0.00054 21.8 2.5 30 178-209 6-35 (59)
45 PRK14081 triple tyrosine motif 46.3 75 0.0016 30.2 6.8 59 49-107 205-265 (667)
46 COG4856 Uncharacterized protei 43.9 1.4E+02 0.003 26.4 7.5 53 50-105 51-111 (403)
47 PF09323 DUF1980: Domain of un 42.1 30 0.00066 26.9 3.2 33 176-208 27-59 (182)
48 COG1723 Uncharacterized conser 41.9 52 0.0011 28.2 4.6 56 140-201 270-325 (331)
49 PF12690 BsuPI: Intracellular 40.9 95 0.0021 20.8 5.1 20 64-83 23-42 (82)
50 KOG3878|consensus 40.7 1.3E+02 0.0028 26.1 6.7 27 95-121 440-466 (469)
51 PF10670 DUF4198: Domain of un 40.4 1.2E+02 0.0026 23.6 6.5 21 85-105 190-210 (215)
52 KOG1693|consensus 39.9 1.9E+02 0.0042 23.1 9.3 154 28-191 33-197 (209)
53 KOG2861|consensus 39.2 74 0.0016 28.2 5.3 55 141-201 338-392 (399)
54 PF15432 Sec-ASP3: Accessory S 38.9 1.6E+02 0.0034 21.7 7.8 56 53-109 58-114 (128)
55 COG4549 Uncharacterized protei 38.8 1.8E+02 0.004 22.5 11.2 86 30-125 22-109 (178)
56 PF10754 DUF2569: Protein of u 37.8 85 0.0018 23.6 4.9 33 182-214 54-86 (149)
57 PF14830 Haemocyan_bet_s: Haem 34.9 1.7E+02 0.0036 20.8 6.1 51 22-81 13-91 (103)
58 PF00517 GP41: Retroviral enve 34.6 2.1E+02 0.0045 22.8 6.9 26 177-202 148-173 (204)
59 PF07116 DUF1372: Protein of u 34.0 69 0.0015 22.6 3.5 31 174-204 6-36 (104)
60 PF04728 LPP: Lipoprotein leuc 32.9 1.3E+02 0.0027 18.8 6.2 26 138-163 5-30 (56)
61 KOG0518|consensus 32.1 1.4E+02 0.003 29.9 6.3 55 52-106 377-435 (1113)
62 PF07835 COX4_pro_2: Bacterial 30.8 1E+02 0.0023 18.1 3.5 20 178-197 23-42 (44)
63 PF13464 DUF4115: Domain of un 30.0 1.4E+02 0.0031 19.3 4.5 40 66-107 8-47 (77)
64 TIGR03711 acc_sec_asp3 accesso 29.9 2.4E+02 0.0051 21.1 7.1 43 66-109 82-125 (135)
65 PF12669 P12: Virus attachment 28.7 31 0.00068 21.6 1.0 11 202-212 17-27 (58)
66 PF09753 Use1: Membrane fusion 28.0 3.3E+02 0.0072 22.2 7.9 23 182-205 228-250 (251)
67 PF07680 DoxA: TQO small subun 27.7 2.6E+02 0.0056 20.8 7.1 63 43-106 21-96 (133)
68 PHA02650 hypothetical protein; 27.3 79 0.0017 21.2 2.7 30 176-205 44-73 (81)
69 PF02408 CUB_2: CUB-like domai 27.2 2.3E+02 0.0051 20.1 7.9 42 34-78 37-79 (120)
70 COG5415 Predicted integral mem 27.2 3.4E+02 0.0073 22.0 8.4 31 136-166 15-45 (251)
71 COG4467 Regulator of replicati 26.9 2.4E+02 0.0052 20.2 5.6 45 135-179 7-51 (114)
72 PF05506 DUF756: Domain of unk 25.9 2.1E+02 0.0045 19.1 6.5 9 65-73 32-40 (89)
73 COG5415 Predicted integral mem 25.8 2.5E+02 0.0054 22.7 5.7 42 140-188 12-53 (251)
74 cd05864 Ig2_VEGFR-2 Second imm 25.4 1.6E+02 0.0034 18.7 4.0 27 94-121 43-69 (70)
75 PHA02665 hypothetical protein; 25.2 51 0.0011 26.6 1.8 29 35-63 25-53 (322)
76 PF09889 DUF2116: Uncharacteri 24.7 1.6E+02 0.0036 18.5 3.7 14 176-189 36-49 (59)
77 PF05399 EVI2A: Ectropic viral 24.3 1.1E+02 0.0023 24.7 3.5 32 176-207 125-166 (227)
78 PF07691 PA14: PA14 domain; I 24.1 2.7E+02 0.0059 19.8 7.5 70 52-121 49-133 (145)
79 COG0598 CorA Mg2+ and Co2+ tra 24.0 3.2E+02 0.007 23.2 6.8 32 166-197 248-279 (322)
80 PF13715 DUF4480: Domain of un 23.6 2.2E+02 0.0047 18.6 4.6 37 66-107 17-53 (88)
81 PHA03054 IMV membrane protein; 23.5 1.1E+02 0.0024 20.0 2.8 26 178-203 45-70 (72)
82 KOG3385|consensus 23.2 2.3E+02 0.005 20.5 4.7 26 136-161 36-61 (118)
83 PHA02819 hypothetical protein; 23.0 1.2E+02 0.0026 19.9 2.9 27 178-204 43-69 (71)
84 cd05860 Ig4_SCFR Fourth immuno 22.5 2.8E+02 0.0061 19.4 9.8 26 94-122 74-99 (101)
85 PF06156 DUF972: Protein of un 22.4 2.9E+02 0.0064 19.6 5.9 22 138-159 10-31 (107)
86 PF14686 fn3_3: Polysaccharide 22.2 2.7E+02 0.0059 19.1 6.0 31 76-106 38-68 (95)
87 PHA02955 hypothetical protein; 22.2 80 0.0017 25.4 2.4 25 182-207 180-204 (213)
88 PF05377 FlaC_arch: Flagella a 22.1 2.1E+02 0.0045 17.8 5.7 25 139-163 3-27 (55)
89 PF04234 CopC: CopC domain; I 21.7 2.1E+02 0.0046 19.5 4.3 10 97-106 71-80 (97)
90 TIGR00383 corA magnesium Mg(2+ 21.7 4.8E+02 0.01 21.8 8.3 17 173-189 251-267 (318)
91 PF14109 GldH_lipo: GldH lipop 21.6 3.3E+02 0.0072 19.9 7.1 44 65-108 68-116 (131)
92 KOG2678|consensus 21.1 4.6E+02 0.01 21.4 8.3 31 178-208 212-242 (244)
93 PRK13169 DNA replication intia 21.1 3.2E+02 0.007 19.5 6.0 22 138-159 10-31 (110)
94 cd04976 Ig2_VEGFR Second immun 20.9 1.6E+02 0.0034 18.5 3.3 27 94-121 44-70 (71)
95 PF03554 Herpes_UL73: UL73 vir 20.9 1.3E+02 0.0029 20.3 2.9 27 175-201 44-70 (82)
96 cd03455 SAV4209 SAV4209 is a S 20.5 3.1E+02 0.0068 19.2 6.0 37 46-82 79-118 (123)
97 PF04109 APG9: Autophagy prote 20.5 1.8E+02 0.0038 25.7 4.4 35 167-201 108-142 (370)
98 PF07523 Big_3: Bacterial Ig-l 20.5 2.3E+02 0.005 17.7 4.3 41 66-108 18-58 (67)
99 PLN00115 pollen allergen group 20.3 3.5E+02 0.0075 19.6 6.2 26 14-39 8-34 (118)
100 PF11166 DUF2951: Protein of u 20.1 3.1E+02 0.0068 19.1 7.8 33 172-206 64-96 (98)
No 1
>KOG1692|consensus
Probab=100.00 E-value=1.6e-49 Score=302.24 Aligned_cols=184 Identities=31% Similarity=0.524 Sum_probs=172.9
Q ss_pred ceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 28 AVELTFELADNARECFFEDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 28 ~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
+.++.+.+++.+++||+|++.+|+.+.++|++.+|+..++++.|++|+|+.++..++...|+|+|+|+.+|.|++||+|.
T Consensus 18 ~~~~~is~~ah~eeCf~e~~~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~ssgk~tF~a~~~G~Y~fCF~N~ 97 (201)
T KOG1692|consen 18 AAGYGISLDAHEEECFFENLEEGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRESSGKYTFTAPKKGTYTFCFSNK 97 (201)
T ss_pred hhheeEEEccchhhhHhhhhccCCEEEEEEEEecCCccceeEEEECCCCchhhhcccccCceEEEEecCCceEEEEecCC
Confidence 66788999999999999999999999999999999888999999999999999998889999999999999999999999
Q ss_pred CCCcceEEEEEEEEeccCCCCCCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHhHhHH
Q psy3528 108 FSTYSHKVVYMDFQVGDEPQLPGVGEHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWWSSIE 187 (218)
Q Consensus 108 ~~~~~~k~V~f~~~~g~~~~~~~~~~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~sii~ 187 (218)
.+...||.|.|++++|..++.++. +++.+.+++++.+++|++.+..++.||+|+..||.+||+++|+|++|++|||++|
T Consensus 98 ~s~mtpk~V~F~ihvg~~~~~~d~-~~d~~~~~L~~~I~eL~~al~~Vk~EQeY~~~Rer~Hr~~nEntn~RVv~wsife 176 (201)
T KOG1692|consen 98 MSTMTPKTVMFTIHVGHAPQRDDL-AKDAHQNKLEEMIRELSEALTSVKHEQEYMEARERIHRNTNENTNSRVVLWSIFE 176 (201)
T ss_pred CCCCCceEEEEEEEEeeccccchh-cccccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhhcccceeehHHHHH
Confidence 998889999999999988776543 4555667899999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhhhhHHHHHHhhhcccccc
Q psy3528 188 FVVVLLASTLQVYIVKNFFASRNER 212 (218)
Q Consensus 188 i~vli~~~~~Qv~~lk~fF~~Kk~~ 212 (218)
.+++|+++++|||||||||+.|+.+
T Consensus 177 ~~vLi~~s~~QVyYLkRfFEvkrvv 201 (201)
T KOG1692|consen 177 ALVLIAMSVLQVYYLKRFFEVKRVV 201 (201)
T ss_pred HHHHHHHHHHHHHHHHHhheeeecC
Confidence 9999999999999999999999753
No 2
>KOG1693|consensus
Probab=100.00 E-value=3.3e-43 Score=269.15 Aligned_cols=186 Identities=52% Similarity=0.873 Sum_probs=171.8
Q ss_pred hcceEEEEEEeCCCcceEEEEccCCCE-EEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEE
Q psy3528 26 VQAVELTFELADNARECFFEDIKRNTS-VILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCF 104 (218)
Q Consensus 26 ~~~~~l~~~l~~g~~~Cf~e~v~~~~~-i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf 104 (218)
.++..++|+||++..+|||+++++++. +++.|+|..||+++|++.|.+|+|+++++..++..++|.|++.+.|+|++||
T Consensus 19 s~a~elTfeLp~~aKqC~Y~d~~~~~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~~G~Y~fCF 98 (209)
T KOG1693|consen 19 SEASELTFELPDNAKQCFYEDLKKDDDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRYDSFLFKAEGKGEYTFCF 98 (209)
T ss_pred hhcccEEEEcCCcchhheeeecccCCceEEEEEEEEeCCceeeEEEEECCCCCEEeeccccccccEEEEEecceEEEEEe
Confidence 337899999999999999999998655 9999999999999999999999999999999999999999999999999999
Q ss_pred EecCCCcceEEEEEEEEeccCCCCC-CCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHh
Q psy3528 105 SNEFSTYSHKVVYMDFQVGDEPQLP-GVGEHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWW 183 (218)
Q Consensus 105 ~n~~~~~~~k~V~f~~~~g~~~~~~-~~~~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~ 183 (218)
+|.++.+++|.+.+++.+|.+.... +..+.+..+..+++.+..|++.|+.|.+.|.|+|.||.+.+.+++|+++|++||
T Consensus 99 sN~fstf~~Kiv~~~~q~~~~~~~~~~~~~~~~~~~~mena~~~I~~~L~~I~~~q~y~R~RE~rn~~tv~st~~Rv~~~ 178 (209)
T KOG1693|consen 99 SNEFSTFSHKIVYMDFQVGEEPPLHPAVSNRDTALTQMENAIVEIHRALNKIDDTQTYYRLREARNRSTVESTNSRVTWW 178 (209)
T ss_pred cCccccccceEeeehhhhccccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccchhcccchhhHH
Confidence 9999988899999999999776542 222345567899999999999999999999999999999999999999999999
Q ss_pred HhHHHHHHHhhhhhHHHHHHhhhccccc
Q psy3528 184 SSIEFVVVLLASTLQVYIVKNFFASRNE 211 (218)
Q Consensus 184 sii~i~vli~~~~~Qv~~lk~fF~~Kk~ 211 (218)
|+.++++++++++.||..||.||..|+.
T Consensus 179 Sl~e~~~vv~iSi~Qv~ilk~fFt~~r~ 206 (209)
T KOG1693|consen 179 SLLEIIAVVVISIAQVFILKFFFTDRRK 206 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCc
Confidence 9999999999999999999999998864
No 3
>KOG1690|consensus
Probab=100.00 E-value=1.1e-42 Score=265.75 Aligned_cols=194 Identities=25% Similarity=0.423 Sum_probs=169.8
Q ss_pred HHHHHHhhhhcceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEE----------CCeeeEEEEEECCCCc--EEeecee
Q psy3528 17 FFLLILHQIVQAVELTFELADNARECFFEDIKRNTSVILEFQVVT----------GGKYDVDVSIEAPTKE--IIYRQVK 84 (218)
Q Consensus 17 ~~~~~l~~~~~~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~----------~~~~~i~v~v~~p~g~--~v~~~~~ 84 (218)
++++++.++..+.|++|++..++++||++++|+|+.+.|+|.+.= .++.++.+.|.+|.++ +|.++..
T Consensus 7 ~~lll~~l~~~~~a~yFy~~~~e~KCF~eelpk~tmv~G~yk~qlyd~~~~~y~~~p~~gm~VeV~e~fdnnh~Vl~q~~ 86 (215)
T KOG1690|consen 7 LLLLLLLLATQVQALYFYIAGTEKKCFIEELPKGTMVTGNYKAQLYDDQLKGYGSYPNIGMHVEVKETFDNNHVVLSQQY 86 (215)
T ss_pred HHHHHHHHHhhccEEEEEecCCcccchhhhCCCCcEEEeeeeeeeecchhcccccCCCceEEEEeecCCCCceEEEeecC
Confidence 344555567889999999999999999999999999999998631 1345899999999666 9999889
Q ss_pred cccceEEEEeecceeEEEEEEecCCCc---ceEEEEEEEEeccCCCCCCCcccccchhHHHHHHHHHHHHHHHHHHHHHH
Q psy3528 85 SQFDTHQFVTLVEGTYVVCFSNEFSTY---SHKVVYMDFQVGDEPQLPGVGEHVTVMTQMESSTQEIHKSLNTIIDYQTH 161 (218)
Q Consensus 85 ~~~g~f~f~~~~~G~y~~Cf~n~~~~~---~~k~V~f~~~~g~~~~~~~~~~~~~~l~~l~~~l~~l~~~l~~i~~eq~~ 161 (218)
.++|+|+|++..+|+|+||+....+.| +.-.|.+++.+|+++++++. .+++.+.++.++..|.+++..|..||.+
T Consensus 87 ss~G~ftFta~~~GeH~IC~~s~s~awf~~aklRvhld~qvG~~a~l~a~--~ke~~k~l~~Rv~~L~~~~~~IrkEQ~~ 164 (215)
T KOG1690|consen 87 SSEGDFTFTALTPGEHRICIQSNSTAWFNGAKLRVHLDIQVGDHANLDAQ--IKETDKLLEGRVRQLNSRLESIRKEQNL 164 (215)
T ss_pred CCCCceEEEccCCCceEEEEecccchhhccceEEEEEEEeeCchhhhhhh--hhhhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999997654432 23469999999999888652 4455667888999999999999999999
Q ss_pred HHHHHHhhHHHHHhhhhhhHHhHhHHHHHHHhhhhhHHHHHHhhhcccccc
Q psy3528 162 HRLREAQGRKRAEDLNERVMWWSSIEFVVVLLASTLQVYIVKNFFASRNER 212 (218)
Q Consensus 162 ~~~re~~~~~~~est~~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~Kk~~ 212 (218)
+|.||++.|++.||+|+|++|||++|++++++.|+||+.|||+||.++|++
T Consensus 165 ~R~RE~~FR~tSES~NsRvm~Wsv~Q~vvL~~tc~wQmrhL~~FFvkqKlv 215 (215)
T KOG1690|consen 165 QREREETFRDTSESANSRVMWWSVAQLVVLLVTCIWQMRHLKSFFVKQKLV 215 (215)
T ss_pred HHHHHHHHHhhhhhhcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 999999999999999999999999999999999999999999999999974
No 4
>KOG1691|consensus
Probab=100.00 E-value=6.8e-41 Score=259.80 Aligned_cols=192 Identities=20% Similarity=0.336 Sum_probs=173.2
Q ss_pred HHhhhhcceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEECCe---eeEEEEEECCCCcEEeeceecccceEEEEeecc
Q psy3528 21 ILHQIVQAVELTFELADNARECFFEDIKRNTSVILEFQVVTGGK---YDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVE 97 (218)
Q Consensus 21 ~l~~~~~~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~---~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~ 97 (218)
++++...+.++.|++|++.++|+.|++.+|..+.|.|.+.++.. ..+++.|+||.|+.+++.++..+|+|+|++.+.
T Consensus 13 ~~~~~~~~~a~~f~v~~~~~kCi~EeI~~n~lv~g~y~i~~~~~~~~~~~~~~Vts~~G~~~~~~env~~gqFaFta~e~ 92 (210)
T KOG1691|consen 13 FLLLLPLVHALRFDVPSKTTKCISEEIHENVLVVGDYEIINPNGDHSHKLSVKVTSPYGNNLHSKENVTKGQFAFTAEES 92 (210)
T ss_pred HHHHhhhhheEEEEecCCCCEeehhhhccCeEEEEEEEEecCCCCccceEEEEEEcCCCceeehhhccccceEEEEeccC
Confidence 33445677889999999999999999999999999999987533 469999999999999999999999999999999
Q ss_pred eeEEEEEEec--CCCcc-eEEEEEEEEeccCCCCCCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy3528 98 GTYVVCFSNE--FSTYS-HKVVYMDFQVGDEPQLPGVGEHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAE 174 (218)
Q Consensus 98 G~y~~Cf~n~--~~~~~-~k~V~f~~~~g~~~~~~~~~~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~e 174 (218)
|.|.+||.+. ..... ...|+|++..|.++++.+.-++++.++++|-.+++|.+.+..|.++..|+|.||+++|++++
T Consensus 93 ~~y~~Cf~~~~~~~~p~~~~~I~ld~k~Gv~akdw~~IAKkeklep~E~elrrLed~~~sI~~e~~YLr~REeemr~~ne 172 (210)
T KOG1691|consen 93 GMYEACFTADVPGHKPETKRSIDLDWKTGVEAKDWDSIAKKEKLEPLEVELRRLEDLVESIHEEMYYLREREEEMRNTNE 172 (210)
T ss_pred CcEEEEEecccCCCCCCcceEEEEEeeccccccchHHHHhhhcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 9999999993 33222 35799999999998874323888999999999999999999999999999999999999999
Q ss_pred hhhhhhHHhHhHHHHHHHhhhhhHHHHHHhhhcccccc
Q psy3528 175 DLNERVMWWSSIEFVVVLLASTLQVYIVKNFFASRNER 212 (218)
Q Consensus 175 st~~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~Kk~~ 212 (218)
+||+|+.|+|++.++++++++.||++|||+||++||+.
T Consensus 173 sTNsrv~~fSi~Sl~v~~~va~~QvlyLK~fF~kKKLI 210 (210)
T KOG1691|consen 173 STNSRVAWFSILSLVVLLSVAGWQVLYLKRFFQKKKLI 210 (210)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC
Confidence 99999999999999999999999999999999999974
No 5
>KOG3287|consensus
Probab=100.00 E-value=1.6e-38 Score=246.06 Aligned_cols=189 Identities=26% Similarity=0.521 Sum_probs=169.8
Q ss_pred ceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEEC-CeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEe
Q psy3528 28 AVELTFELADNARECFFEDIKRNTSVILEFQVVTG-GKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSN 106 (218)
Q Consensus 28 ~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~-~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n 106 (218)
...++|.||||+++|||..++++..+.++|+|+.| +..+|++++.+|.|+++.+...+..|.+++...+.|.|++||+|
T Consensus 33 d~dftv~ipAGk~eCf~Q~v~~~~tle~eyQVi~G~GDl~i~Ftl~~P~G~~lv~~q~k~dg~ht~e~~e~GdY~~CfDN 112 (236)
T KOG3287|consen 33 DYDFTVMIPAGKTECFYQPVPQGATLEVEYQVIDGAGDLDIDFTLLNPAGEVLVSDQRKVDGVHTVEVTETGDYQVCFDN 112 (236)
T ss_pred ccceEEEecCCCceeeeeeccCCeEEEEEEEEEecCCccceeeEEeCCCccEEeecccccCceeEeeccCCcceEEEEcC
Confidence 46799999999999999999999999999999998 66789999999999999999999999999999999999999999
Q ss_pred cCCCcceEEEEEEEEeccCCC----CCCC---c----ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHh
Q psy3528 107 EFSTYSHKVVYMDFQVGDEPQ----LPGV---G----EHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAED 175 (218)
Q Consensus 107 ~~~~~~~k~V~f~~~~g~~~~----~~~~---~----~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~es 175 (218)
++|.++.|.|.|++.+...-. +++. + .-..+++++++.+.++..++..+...|.+.|.||.|+|.+.++
T Consensus 113 sFS~fs~K~Vffeli~~~~g~~~e~~e~w~k~~e~~~~Ld~kl~di~~~i~~i~~nl~k~~~~q~~lRa~EaRDr~L~es 192 (236)
T KOG3287|consen 113 SFSTFSRKLVFFELILDAHGEFYEGDETWHKYKERTEQLDVKLDDIEDSIGTIKNNLNKMWQYQALLRAREARDRNLQES 192 (236)
T ss_pred ccccccceEEEEEEEeccccchhccchhHhhhhhhhhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHhc
Confidence 999999999999996532211 1110 1 1134678999999999999999999999999999999999999
Q ss_pred hhhhhHHhHhHHHHHHHhhhhhHHHHHHhhhccccccccCC
Q psy3528 176 LNERVMWWSSIEFVVVLLASTLQVYIVKNFFASRNERQYKP 216 (218)
Q Consensus 176 t~~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~Kk~~~~~~ 216 (218)
+..||.|||+++++++|+++++|+|+||++|+.|+.++.|.
T Consensus 193 Nf~rVN~WS~vq~~vmi~v~~iQVf~lrslFe~~~~~~t~~ 233 (236)
T KOG3287|consen 193 NFDRVNFWSMVQTLVMILVGIIQVFMLRSLFEVKSKSDTKK 233 (236)
T ss_pred ccchhhHHHHHHHHHHHHHhhhhhhhhHHHhcCCCCccccc
Confidence 99999999999999999999999999999999999886653
No 6
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=100.00 E-value=1.9e-39 Score=254.42 Aligned_cols=178 Identities=36% Similarity=0.630 Sum_probs=5.6
Q ss_pred EEEEEEeCCCcceEEEEccCCCEEEEEEEEEECC-eeeEEEEEE--CCCCcEEeeceec-ccceEEEEeecceeEEEEEE
Q psy3528 30 ELTFELADNARECFFEDIKRNTSVILEFQVVTGG-KYDVDVSIE--APTKEIIYRQVKS-QFDTHQFVTLVEGTYVVCFS 105 (218)
Q Consensus 30 ~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~-~~~i~v~v~--~p~g~~v~~~~~~-~~g~f~f~~~~~G~y~~Cf~ 105 (218)
|++|.|+||+++||++++++|+.+.++|.+.+++ ..++++.|+ +|+|+.+++++.. .+|+|+|+++++|+|++||+
T Consensus 1 a~~f~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~~~~v~~~i~~~~~~~~~i~~~~~~~~~~~f~f~~~~~G~y~iCf~ 80 (183)
T PF01105_consen 1 ALTFELEPGETECFYEEVPKGTTIRGSYRVTDGGGAYDVDFTIRDPDPNGEVIYSKSDKESEGSFSFTAKESGEYQICFD 80 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CEEEEECCCCcEEEEEEcCCCcEEEEEEEEeeccccceEEEEEEecccCCceeeeecccccCCcEEEEeccCCCEEEEEE
Confidence 5899999999999999999999999999999887 789999999 5566898888655 45799999999999999999
Q ss_pred ecCCCcce-EEEEEEEEeccCCCCCCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHhH
Q psy3528 106 NEFSTYSH-KVVYMDFQVGDEPQLPGVGEHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWWS 184 (218)
Q Consensus 106 n~~~~~~~-k~V~f~~~~g~~~~~~~~~~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~s 184 (218)
|..+.+++ +.|+|++.+|.++.++...++.+++++++..++++...++.|+++|.|++.|+.+|+++++++++|+++|+
T Consensus 81 n~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~i~~~q~~~~~r~~~~~~~~es~~~~i~~~s 160 (183)
T PF01105_consen 81 NSSSSFSPSKRVSFDIDVGNENKDYKNVAKKEHLDPLEESLEKLESNLKEIKDEQKYLREREERHRQLNESTNSRIMWWS 160 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cCCCCccccEEEEEEEEEeecccchhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheEEhHH
Confidence 99887656 89999999987654422235678899999999999999999999999999999999999999999999999
Q ss_pred hHHHHHHHhhhhhHHHHHHhhhc
Q psy3528 185 SIEFVVVLLASTLQVYIVKNFFA 207 (218)
Q Consensus 185 ii~i~vli~~~~~Qv~~lk~fF~ 207 (218)
++++++++++++||+++||+||+
T Consensus 161 i~~~~vli~~~~~Qv~~lk~~f~ 183 (183)
T PF01105_consen 161 IIQIVVLILVSVWQVYYLKKFFK 183 (183)
T ss_dssp -----------------HHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHhC
Confidence 99999999999999999999996
No 7
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=93.82 E-value=0.1 Score=38.59 Aligned_cols=31 Identities=26% Similarity=0.424 Sum_probs=26.9
Q ss_pred EEEeecceeEEEEEEecCCCcceEEEEEEEE
Q psy3528 91 QFVTLVEGTYVVCFSNEFSTYSHKVVYMDFQ 121 (218)
Q Consensus 91 ~f~~~~~G~y~~Cf~n~~~~~~~k~V~f~~~ 121 (218)
+++++..|.|-++|+|++|.+.+|++.+.+.
T Consensus 104 s~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V~ 134 (136)
T PF13897_consen 104 SHTCPGPGVYVLKFDNSYSWFRSKKLYYRVY 134 (136)
T ss_pred EEECCCCeEEEEEeeCcceeEEeeEEEEEEE
Confidence 5778899999999999999888898887764
No 8
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=93.41 E-value=1.8 Score=29.94 Aligned_cols=60 Identities=17% Similarity=0.129 Sum_probs=39.4
Q ss_pred CCCEEEEEEEEEECC-------eeeEEEEEECCCCcEEeecee-c--ccceEEE--Eeec---ceeEEEEEEecC
Q psy3528 49 RNTSVILEFQVVTGG-------KYDVDVSIEAPTKEIIYRQVK-S--QFDTHQF--VTLV---EGTYVVCFSNEF 108 (218)
Q Consensus 49 ~~~~i~~~y~v~~~~-------~~~i~v~v~~p~g~~v~~~~~-~--~~g~f~f--~~~~---~G~y~~Cf~n~~ 108 (218)
.|+.|.+.-.+...+ ...+.+.|.||+|+.+..... . ..|.+++ ..+. .|.|++=+....
T Consensus 13 PGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~~ 87 (99)
T PF01835_consen 13 PGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTDD 87 (99)
T ss_dssp TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEETT
T ss_pred CCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEcc
Confidence 477777666654322 247999999999999987665 2 3455444 3333 699999888753
No 9
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=92.42 E-value=0.59 Score=30.33 Aligned_cols=61 Identities=13% Similarity=0.219 Sum_probs=40.0
Q ss_pred ceEEEEccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceec-----ccceEEEEeecceeEEEEE
Q psy3528 41 ECFFEDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKS-----QFDTHQFVTLVEGTYVVCF 104 (218)
Q Consensus 41 ~Cf~e~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~-----~~g~f~f~~~~~G~y~~Cf 104 (218)
..|..++++|..+.+. +.++. .+.++.+++++|..+...... ......|+++.+|+|.+=+
T Consensus 3 D~y~f~v~ag~~l~i~--l~~~~-~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V 68 (70)
T PF04151_consen 3 DYYSFTVPAGGTLTID--LSGGS-GDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRV 68 (70)
T ss_dssp EEEEEEESTTEEEEEE--ECETT-SSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEE
T ss_pred EEEEEEEcCCCEEEEE--EcCCC-CCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEE
Confidence 3566778888876544 44433 267799999998766552211 1234678889999998754
No 10
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=89.71 E-value=3.1 Score=27.86 Aligned_cols=32 Identities=16% Similarity=0.235 Sum_probs=24.1
Q ss_pred CCEEEEEEEEEECCeeeEEEEEECCCCcEEeec
Q psy3528 50 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQ 82 (218)
Q Consensus 50 ~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~ 82 (218)
+..+.+.|.+.++. ..+.+.|+|.+|++|++.
T Consensus 11 ~~~~~~~~~l~~~a-~~v~v~I~d~~G~~V~t~ 42 (81)
T PF13860_consen 11 GTKGSIEYTLPEDA-DNVTVTIYDSNGQVVRTI 42 (81)
T ss_dssp TCEEEEEEEECSSC-EEEEEEEEETTS-EEEEE
T ss_pred CEEEEEEEeCCCcc-cEEEEEEEcCCCCEEEEE
Confidence 34777888787643 468999999999999764
No 11
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=89.43 E-value=2.4 Score=29.18 Aligned_cols=45 Identities=16% Similarity=0.244 Sum_probs=31.3
Q ss_pred CeeeEEEEEECCCCcEEeec-eecccce--EEEEeecceeEEEEEEec
Q psy3528 63 GKYDVDVSIEAPTKEIIYRQ-VKSQFDT--HQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 63 ~~~~i~v~v~~p~g~~v~~~-~~~~~g~--f~f~~~~~G~y~~Cf~n~ 107 (218)
+...+.+.|.+|+|+.+.-+ .+...|. .+|++...|.|.+.+.-.
T Consensus 30 G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~ 77 (93)
T smart00557 30 GGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFG 77 (93)
T ss_pred CCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEEC
Confidence 34579999999998655322 2233454 468899999999998644
No 12
>PRK02710 plastocyanin; Provisional
Probab=87.61 E-value=8.7 Score=27.75 Aligned_cols=74 Identities=12% Similarity=0.083 Sum_probs=34.9
Q ss_pred hcceEEEEEEeCCC-cceEE---EEccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeec-eecccceEEEEeecceeE
Q psy3528 26 VQAVELTFELADNA-RECFF---EDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQ-VKSQFDTHQFVTLVEGTY 100 (218)
Q Consensus 26 ~~~~~l~~~l~~g~-~~Cf~---e~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~-~~~~~g~f~f~~~~~G~y 100 (218)
..+...++.+.... .-.|. .+++.|+.|. +...+...+++.+ .+..+ ....+ ...+...++++....|.|
T Consensus 26 a~a~~~~V~~~~~~~~~~F~P~~i~v~~Gd~V~--~~N~~~~~H~v~~--~~~~~-~~~~~~~~~pg~t~~~tF~~~G~y 100 (119)
T PRK02710 26 ASAETVEVKMGSDAGMLAFEPSTLTIKAGDTVK--WVNNKLAPHNAVF--DGAKE-LSHKDLAFAPGESWEETFSEAGTY 100 (119)
T ss_pred cccceEEEEEccCCCeeEEeCCEEEEcCCCEEE--EEECCCCCceEEe--cCCcc-ccccccccCCCCEEEEEecCCEEE
Confidence 44556666665432 34554 2556788754 4333333334432 22211 11111 112333566666668988
Q ss_pred EEEE
Q psy3528 101 VVCF 104 (218)
Q Consensus 101 ~~Cf 104 (218)
.+=-
T Consensus 101 ~y~C 104 (119)
T PRK02710 101 TYYC 104 (119)
T ss_pred EEEc
Confidence 7533
No 13
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=86.39 E-value=2.3 Score=29.77 Aligned_cols=54 Identities=19% Similarity=0.199 Sum_probs=25.1
Q ss_pred EccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEE
Q psy3528 46 DIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVC 103 (218)
Q Consensus 46 ~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~C 103 (218)
.+++|+.+.+.+.-.+... -++.+.+-+-+... .......+.|++.+.|+|.+=
T Consensus 38 ~v~~G~~v~l~~~N~~~~~--h~~~i~~~~~~~~l--~~g~~~~~~f~~~~~G~y~~~ 91 (104)
T PF13473_consen 38 TVKAGQPVTLTFTNNDSRP--HEFVIPDLGISKVL--PPGETATVTFTPLKPGEYEFY 91 (104)
T ss_dssp EEETTCEEEEEEEE-SSS---EEEEEGGGTEEEEE---TT-EEEEEEEE-S-EEEEEB
T ss_pred EEcCCCeEEEEEEECCCCc--EEEEECCCceEEEE--CCCCEEEEEEcCCCCEEEEEE
Confidence 4566777766665443222 33444331111111 112234677888999999873
No 14
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=83.74 E-value=9.8 Score=26.04 Aligned_cols=41 Identities=15% Similarity=0.264 Sum_probs=28.2
Q ss_pred eEEEEEECCCCc----EEe-eceecccce--EEEEeecceeEEEEEEe
Q psy3528 66 DVDVSIEAPTKE----IIY-RQVKSQFDT--HQFVTLVEGTYVVCFSN 106 (218)
Q Consensus 66 ~i~v~v~~p~g~----~v~-~~~~~~~g~--f~f~~~~~G~y~~Cf~n 106 (218)
.+.+.|.+|++. .+. .-.....|. .+|++...|.|++++.-
T Consensus 43 ~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~G~y~i~V~~ 90 (101)
T PF00630_consen 43 EFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEPGKYKISVKI 90 (101)
T ss_dssp EEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSSEEEEEEEEE
T ss_pred eeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCccEeEEEEEEE
Confidence 578899999886 332 222334554 45788899999998864
No 15
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=82.47 E-value=11 Score=26.62 Aligned_cols=58 Identities=21% Similarity=0.319 Sum_probs=37.9
Q ss_pred CCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecc--cceEEEE--eecceeEEEEEEecCC
Q psy3528 49 RNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQ--FDTHQFV--TLVEGTYVVCFSNEFS 109 (218)
Q Consensus 49 ~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~--~g~f~f~--~~~~G~y~~Cf~n~~~ 109 (218)
.|..+.+.|.. ...++.++|+|.+|+++|+..-.. .....+. ....|.|.+=+.+...
T Consensus 35 ~~~~l~I~F~~---~~~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g 96 (106)
T PF11589_consen 35 DGNNLSIEFES---PIGDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNG 96 (106)
T ss_dssp ETTEEEEEESS-----SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC
T ss_pred eCCEEEEEEcC---CCCCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCC
Confidence 35667777742 233689999999999999875433 2235555 4468999999998864
No 16
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=80.05 E-value=5.5 Score=25.42 Aligned_cols=43 Identities=12% Similarity=0.106 Sum_probs=34.0
Q ss_pred eEEEEEECCCCcEEee--ceecccceEEEEeecceeEEEEEEecC
Q psy3528 66 DVDVSIEAPTKEIIYR--QVKSQFDTHQFVTLVEGTYVVCFSNEF 108 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~--~~~~~~g~f~f~~~~~G~y~~Cf~n~~ 108 (218)
++.|.+++.++..... ..-...|.+.|.--..|.|.+=....-
T Consensus 3 Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y~l~E~~aP 47 (70)
T PF05738_consen 3 GATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTYTLKETKAP 47 (70)
T ss_dssp TEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEEEEEEEETT
T ss_pred CeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEEEEEEEECC
Confidence 4678889888887765 444567999999999999999988743
No 17
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=77.81 E-value=9.1 Score=31.12 Aligned_cols=57 Identities=14% Similarity=0.218 Sum_probs=39.3
Q ss_pred CCEEEEEEEEEECCeeeEEEEEECCCCcEEeecee--cccceEEEE---------eecceeEEEEEEec
Q psy3528 50 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVK--SQFDTHQFV---------TLVEGTYVVCFSNE 107 (218)
Q Consensus 50 ~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~--~~~g~f~f~---------~~~~G~y~~Cf~n~ 107 (218)
+..+.+.|...++ ...+.+.|+|.+|++|+.-.. ...|.+.|+ ...+|.|.+=+...
T Consensus 112 ~~~~~~~~~l~~~-a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A~ 179 (225)
T PRK06655 112 GGTTPFGVELPSA-ADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASAS 179 (225)
T ss_pred CCceEEEEEcCCC-CcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEEE
Confidence 3466777776654 346999999999999976432 345766663 23478898888644
No 18
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=75.18 E-value=13 Score=30.13 Aligned_cols=56 Identities=11% Similarity=0.180 Sum_probs=38.9
Q ss_pred CCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecccc--eEEEEe-------ecceeEEEEEEec
Q psy3528 50 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFD--THQFVT-------LVEGTYVVCFSNE 107 (218)
Q Consensus 50 ~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g--~f~f~~-------~~~G~y~~Cf~n~ 107 (218)
+..+.+.|...++ ...+.+.|+|.+|++|+.... ..| .|.+.- ..+|.|.|=+.-.
T Consensus 110 g~~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~~~-~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~ 174 (223)
T PRK12813 110 GTPVTISPNPAAD-ADKAELVVRDAAGAEVARETV-PVGAGPVEWAGEDADGNPLPNGAYSFVVESY 174 (223)
T ss_pred CceeEEEEeccCC-CceEEEEEEcCCCCEEEEEee-CCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence 5577788877764 346999999999999976543 334 444442 2368999988654
No 19
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=75.08 E-value=26 Score=25.22 Aligned_cols=82 Identities=9% Similarity=0.059 Sum_probs=36.2
Q ss_pred CcceEEEEccCCCEEEEEEEEEEC-CeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEecCCCcceEEEE
Q psy3528 39 ARECFFEDIKRNTSVILEFQVVTG-GKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNEFSTYSHKVVY 117 (218)
Q Consensus 39 ~~~Cf~e~v~~~~~i~~~y~v~~~-~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~~~~~~~k~V~ 117 (218)
.+.|=.....+.-.+.+.|++... ....+.+.|++.+|..++.......+ ..+.....|.|++++.-... ..+-...
T Consensus 25 g~~~~~~~~ge~~~i~i~~~~~~~i~~~~~~~~i~~~~g~~v~~~~t~~~~-~~~~~~~~g~~~~~~~i~~~-L~~G~Y~ 102 (142)
T PF14524_consen 25 GEPTSSFESGEPIRIRIDYEVNEDIDDPVFGFAIRDSDGQRVFGTNTYDSG-FPIPLSEGGTYEVTFTIPKP-LNPGEYS 102 (142)
T ss_dssp EES-SSEETTSEEEEEEEEEESS-EEEEEEEEEEEETT--EEEEEEHHHHT---EEE-TT-EEEEEEEEE---B-SEEEE
T ss_pred CCEeeEEeCCCEEEEEEEEEECCCCCccEEEEEEEcCCCCEEEEECccccC-ccccccCCCEEEEEEEEcCc-cCCCeEE
Confidence 344433333333344445554321 23468999999999988763222222 33333336666666654433 3334444
Q ss_pred EEEEe
Q psy3528 118 MDFQV 122 (218)
Q Consensus 118 f~~~~ 122 (218)
+++.+
T Consensus 103 i~v~l 107 (142)
T PF14524_consen 103 ISVGL 107 (142)
T ss_dssp EEEEE
T ss_pred EEEEE
Confidence 44444
No 20
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=74.64 E-value=4.9 Score=26.41 Aligned_cols=43 Identities=19% Similarity=0.237 Sum_probs=28.8
Q ss_pred eEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEecC
Q psy3528 66 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNEF 108 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~~ 108 (218)
++.+.+.++++.......-...|.|.|..-..|.|.+=+....
T Consensus 16 ~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g~Y~l~v~~~g 58 (82)
T PF13620_consen 16 GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPGTYTLRVSAPG 58 (82)
T ss_dssp T-EEEET--TTTECCEEE--TTSEEEEEEE-SEEEEEEEEBTT
T ss_pred CEEEEEEEeeCCCEEEEEECCCceEEEEccCCEeEEEEEEECC
Confidence 5778888877777666666678999999666699999886543
No 21
>PF10528 PA14_2: GLEYA domain; InterPro: IPR018871 This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=74.33 E-value=12 Score=26.92 Aligned_cols=45 Identities=18% Similarity=0.222 Sum_probs=25.4
Q ss_pred CCCcceEEEEccCCCE--EEEEEEEEECCeeeEEEEEECCCCcEEeec
Q psy3528 37 DNARECFFEDIKRNTS--VILEFQVVTGGKYDVDVSIEAPTKEIIYRQ 82 (218)
Q Consensus 37 ~g~~~Cf~e~v~~~~~--i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~ 82 (218)
.+...++..++.+|.. |++-| +..++...++++|++|+|..+.+.
T Consensus 56 ~~~~~~~tv~L~aG~yyPiRi~~-~N~~g~~~~~~~i~~P~G~~~~~~ 102 (113)
T PF10528_consen 56 TGASKSVTVYLTAGTYYPIRIVY-ANGGGPGSFDFSITDPDGTVHTDD 102 (113)
T ss_dssp S-SEEEEEEEE-TT-BEEEEEEE-EE-SS-EEEEEEEEETT-S--B--
T ss_pred CCCceEEEEEEECCcEEEEEEEE-EcCCCceEEEEEEECCCCcEEecC
Confidence 4556788888888874 44444 334455679999999999998764
No 22
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=74.22 E-value=40 Score=26.35 Aligned_cols=67 Identities=18% Similarity=0.232 Sum_probs=39.7
Q ss_pred CCCEEEEEEEEEECCeeeEEEEEECCCCcEEee-ceecccceEEEE---eecceeEEEEEEecCCCcceEEEEEEEE
Q psy3528 49 RNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYR-QVKSQFDTHQFV---TLVEGTYVVCFSNEFSTYSHKVVYMDFQ 121 (218)
Q Consensus 49 ~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~-~~~~~~g~f~f~---~~~~G~y~~Cf~n~~~~~~~k~V~f~~~ 121 (218)
.++.+.+.|... ....+.+.+|+|+.+.. ..+.......+. ..+.|.+.+.+.|..+. +..+.+.+.
T Consensus 29 ~~T~f~v~w~~~----~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~--~q~v~vtVt 99 (179)
T PF09315_consen 29 NNTVFTVTWQNS----SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS--SQTVTVTVT 99 (179)
T ss_pred CCeEEEEEECCC----CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCC--cceEEEEEE
Confidence 345555555333 24678899999998865 112222333343 35689999999877653 344544444
No 23
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=72.37 E-value=77 Score=30.12 Aligned_cols=71 Identities=13% Similarity=0.191 Sum_probs=44.6
Q ss_pred CCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec--CCC---cceEEEEEEEE
Q psy3528 50 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE--FST---YSHKVVYMDFQ 121 (218)
Q Consensus 50 ~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~--~~~---~~~k~V~f~~~ 121 (218)
|+.+.+.-.+.+|.+....+.|+. +|..+....=.....+.|++..+|.|++=+... .+. ...+.|.+.+.
T Consensus 401 G~~i~i~v~a~gg~~~lY~f~ik~-ng~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~ 476 (667)
T PRK14081 401 GEEIKIRVIAEGGTNLRYSFIIKK-DGKEEEKIDYGKNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVH 476 (667)
T ss_pred CCeEEEEEEecCCCeEEEEEEEEE-CCEEEEEeecccccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEe
Confidence 555555555555555555666655 666666554455678999999999998776543 331 12455666654
No 24
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=71.38 E-value=17 Score=30.87 Aligned_cols=58 Identities=14% Similarity=0.086 Sum_probs=38.7
Q ss_pred CCEEEEEEEEEE---CCeeeEEEEEECCCCcEEeeceec----ccceEEEEe---------ecceeEEEEEEec
Q psy3528 50 NTSVILEFQVVT---GGKYDVDVSIEAPTKEIIYRQVKS----QFDTHQFVT---------LVEGTYVVCFSNE 107 (218)
Q Consensus 50 ~~~i~~~y~v~~---~~~~~i~v~v~~p~g~~v~~~~~~----~~g~f~f~~---------~~~G~y~~Cf~n~ 107 (218)
+..+.+.|.+.. .+...+.+.|+|.+|++|+.-... ..|.+.|+- -..|.|+|=+...
T Consensus 148 g~~~~~~~~l~~~~~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a~ 221 (295)
T PRK05842 148 NNKLSFSLFFDEKIDASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEYN 221 (295)
T ss_pred CCceEEEEeccccccccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEEE
Confidence 335666766542 233479999999999999765432 347766662 2368999998644
No 25
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=70.88 E-value=74 Score=28.00 Aligned_cols=45 Identities=13% Similarity=0.053 Sum_probs=27.9
Q ss_pred CCeeeEEEEEECCCCcEEeeceecccceEEEE---eecceeEEEEEEe
Q psy3528 62 GGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFV---TLVEGTYVVCFSN 106 (218)
Q Consensus 62 ~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~---~~~~G~y~~Cf~n 106 (218)
.++.-+++.+++|+|..........++...+. ..+.|.|++-..-
T Consensus 239 ~~s~~~~~~~~~P~g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~ 286 (374)
T TIGR03503 239 PGSLVIHGELVFPNGQIQQFSIELEEPETRVDLPANYEFGKYRVKGTV 286 (374)
T ss_pred cccEEEEEEEECCCCceEEecccCccCceEEeccCcCCCeEEEEEEEE
Confidence 34557888899999984433333344444443 3467999887654
No 26
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=69.82 E-value=38 Score=24.33 Aligned_cols=78 Identities=29% Similarity=0.404 Sum_probs=45.8
Q ss_pred CCcceEEEEccC-CCEEEEEEEEEECCeeeEEEEEECCC-CcEEeecee---cccceEEEE---eecceeEEEEEEecCC
Q psy3528 38 NARECFFEDIKR-NTSVILEFQVVTGGKYDVDVSIEAPT-KEIIYRQVK---SQFDTHQFV---TLVEGTYVVCFSNEFS 109 (218)
Q Consensus 38 g~~~Cf~e~v~~-~~~i~~~y~v~~~~~~~i~v~v~~p~-g~~v~~~~~---~~~g~f~f~---~~~~G~y~~Cf~n~~~ 109 (218)
|...|..+++.. +..+ +|+.. |+. --|.|+|.+ ++++|+... .....|+.. .....+|-+||.-..-
T Consensus 37 ~rLFcVs~Die~L~aEv--~f~mD-Ge~--~iVEiKd~~~devLWsn~~~~~V~~dt~tisL~nlqk~kEY~V~ftGtkI 111 (132)
T PF15417_consen 37 GRLFCVSEDIEALDAEV--YFQMD-GES--GIVEIKDRKTDEVLWSNTWNGKVSGDTFTISLNNLQKEKEYVVCFTGTKI 111 (132)
T ss_pred ceEEEEecchheeeeEE--EEEEc-Ccc--ceEEeccCCccceeeccccccccccceEEEEhhhcccCceEEEEEeccEe
Confidence 577899888875 5544 44443 333 247888764 567776432 233445554 3346799999985532
Q ss_pred CcceEEEEEEEEe
Q psy3528 110 TYSHKVVYMDFQV 122 (218)
Q Consensus 110 ~~~~k~V~f~~~~ 122 (218)
.+-.|.+.|..
T Consensus 112 --nhAvv~vtFeS 122 (132)
T PF15417_consen 112 --NHAVVKVTFES 122 (132)
T ss_pred --eeEEEEEEecc
Confidence 23445555543
No 27
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=67.83 E-value=52 Score=27.37 Aligned_cols=56 Identities=7% Similarity=0.010 Sum_probs=39.1
Q ss_pred CEEEEEEEEEECCeeeEEEEEECCCCcEEeecee--cccceEEEEee---------cceeEEEEEEec
Q psy3528 51 TSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVK--SQFDTHQFVTL---------VEGTYVVCFSNE 107 (218)
Q Consensus 51 ~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~--~~~g~f~f~~~---------~~G~y~~Cf~n~ 107 (218)
..+.+.|.+.+. ...+.+.|+|.+|++|+.-.. ...|.+.|.-. .+|.|+|=+...
T Consensus 128 ~~~~~~~~l~~~-a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A~ 194 (259)
T PRK12812 128 ELIALKLYFPED-SDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVYN 194 (259)
T ss_pred ceeEEEEecCCc-CceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence 466777777553 346999999999999976542 34566666532 368999999743
No 28
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=63.68 E-value=55 Score=26.67 Aligned_cols=55 Identities=15% Similarity=0.236 Sum_probs=37.9
Q ss_pred EEEEEEEEEECCeeeEEEEEECCCCcEEeecee--cccceEEEEe---------ecceeEEEEEEec
Q psy3528 52 SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVK--SQFDTHQFVT---------LVEGTYVVCFSNE 107 (218)
Q Consensus 52 ~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~--~~~g~f~f~~---------~~~G~y~~Cf~n~ 107 (218)
...+.|.+.++ ...+.+.|+|.+|++|+.-+. ...|.+.|+- -.+|.|+|=+.-.
T Consensus 117 ~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a~ 182 (230)
T PRK12633 117 ATPFGIDLQGD-ATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSAS 182 (230)
T ss_pred ceeEEEecCCc-CcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEEE
Confidence 45566666543 446999999999999986532 3456666652 2368999999653
No 29
>KOG3202|consensus
Probab=63.50 E-value=48 Score=27.14 Aligned_cols=69 Identities=13% Similarity=0.159 Sum_probs=38.1
Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhh-----------------hHHhHhHHHHHHHhhh
Q psy3528 133 EHVTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNER-----------------VMWWSSIEFVVVLLAS 195 (218)
Q Consensus 133 ~~~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~r-----------------v~~~sii~i~vli~~~ 195 (218)
++++.++.++.++.++...-..+-+|..-+-.--+.+-+-.+.|.+| --.|+++.+++.++.-
T Consensus 149 eQDe~Ld~ls~ti~rlk~~a~~~g~EL~~Q~~llDdl~~e~d~t~srl~~~~~~l~~v~~~~s~~~~~~~il~l~~~~~l 228 (235)
T KOG3202|consen 149 EQDEGLDGLSATVQRLKGMALAMGEELEEQGRLLDDLDNEMDRTESRLDRVMKRLAKVNRMASQCSQWCAILLLVGLLLL 228 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhHHHHHHHHHHH
Confidence 45567778888888887777666655443322222222222222222 2236677666666666
Q ss_pred hhHHHH
Q psy3528 196 TLQVYI 201 (218)
Q Consensus 196 ~~Qv~~ 201 (218)
++++|.
T Consensus 229 vv~i~~ 234 (235)
T KOG3202|consen 229 VVIIFI 234 (235)
T ss_pred HHHHhc
Confidence 666664
No 30
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=63.36 E-value=39 Score=22.98 Aligned_cols=58 Identities=16% Similarity=0.151 Sum_probs=33.5
Q ss_pred cCCCEEEEEEEEEECC-eeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 48 KRNTSVILEFQVVTGG-KYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 48 ~~~~~i~~~y~v~~~~-~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
.+.+.|.+... .+|+ -..--+.+.|++|+..-+--...+|+|.|-+. +|.+.+=.-.+
T Consensus 5 ~ke~VItG~V~-~~G~Pv~gAyVRLLD~sgEFtaEvvts~~G~FRFfaa-pG~WtvRal~~ 63 (85)
T PF07210_consen 5 EKETVITGRVT-RDGEPVGGAYVRLLDSSGEFTAEVVTSATGDFRFFAA-PGSWTVRALSR 63 (85)
T ss_pred cceEEEEEEEe-cCCcCCCCeEEEEEcCCCCeEEEEEecCCccEEEEeC-CCceEEEEEcc
Confidence 34456666665 3322 12456888899998654433445678887654 45555544433
No 31
>PRK10378 inactive ferrous ion transporter periplasmic protein EfeO; Provisional
Probab=62.05 E-value=67 Score=28.28 Aligned_cols=67 Identities=10% Similarity=0.175 Sum_probs=36.2
Q ss_pred EEEEEEeCCCcceEEE--EccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecc-c--ceEEEEeecceeEEE-E
Q psy3528 30 ELTFELADNARECFFE--DIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQ-F--DTHQFVTLVEGTYVV-C 103 (218)
Q Consensus 30 ~l~~~l~~g~~~Cf~e--~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~-~--g~f~f~~~~~G~y~~-C 103 (218)
.+.+.+.++ .|--. .++.|. ..|.|.+.+.....+.+.+. +.++-+.+... . +.+.++. +.|+|.+ |
T Consensus 31 ~v~Vti~d~--~c~p~~~tVpAG~---~~f~V~N~~~~~~Efe~~~~-~~vv~e~EnIaPG~s~~l~~~L-~pGtY~~~C 103 (375)
T PRK10378 31 QVKVTVNDK--QCEPMTLTVNAGK---TQFIIQNHSQKALEWEILKG-VMVVEERENIAPGFSQKMTANL-QPGEYDMTC 103 (375)
T ss_pred ceEEEEECC--ccccCceeeCCCC---EEEEEEeCCCCcceEEeecc-ccccccccccCCCCceEEEEec-CCceEEeec
Confidence 455555544 66655 446775 45666655555567777642 22222223322 2 3444333 6999998 8
No 32
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=61.08 E-value=70 Score=25.91 Aligned_cols=45 Identities=16% Similarity=0.085 Sum_probs=32.4
Q ss_pred CeeeEEEEEECCCCcEEeecee--cccceEEEEee---------cceeEEEEEEec
Q psy3528 63 GKYDVDVSIEAPTKEIIYRQVK--SQFDTHQFVTL---------VEGTYVVCFSNE 107 (218)
Q Consensus 63 ~~~~i~v~v~~p~g~~v~~~~~--~~~g~f~f~~~---------~~G~y~~Cf~n~ 107 (218)
....+.+.|+|.+|++|+.-.. ...|.+.|.-. .+|.|.|-+.-.
T Consensus 120 ~a~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a~ 175 (221)
T PRK12634 120 SAGFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQT 175 (221)
T ss_pred cCCeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence 3457999999999999977532 34577666532 369999999643
No 33
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=59.67 E-value=46 Score=21.62 Aligned_cols=11 Identities=9% Similarity=0.208 Sum_probs=4.6
Q ss_pred hHhHHHHHHHh
Q psy3528 183 WSSIEFVVVLL 193 (218)
Q Consensus 183 ~sii~i~vli~ 193 (218)
..++..++..+
T Consensus 55 r~iiGaiI~~i 65 (71)
T PF10779_consen 55 RTIIGAIITAI 65 (71)
T ss_pred HHHHHHHHHHH
Confidence 44444444433
No 34
>KOG0518|consensus
Probab=59.00 E-value=24 Score=34.91 Aligned_cols=58 Identities=9% Similarity=0.162 Sum_probs=39.0
Q ss_pred EEEEEEEEEECCeeeEEEEEECCCCcEEe---eceecccceEEEEeecceeEEEEEEecCC
Q psy3528 52 SVILEFQVVTGGKYDVDVSIEAPTKEIIY---RQVKSQFDTHQFVTLVEGTYVVCFSNEFS 109 (218)
Q Consensus 52 ~i~~~y~v~~~~~~~i~v~v~~p~g~~v~---~~~~~~~g~f~f~~~~~G~y~~Cf~n~~~ 109 (218)
...+.-...+.+..++.+.+.+|+|...- .+-+...-+..|+..+.|.|++|+.+..-
T Consensus 870 ~~~l~lk~~~~~~~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~ 930 (1113)
T KOG0518|consen 870 TCSLDLKATEASSQDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQ 930 (1113)
T ss_pred eeeeeeecCCCCccceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecCc
Confidence 33444444555667899999999986432 22222224567999999999999988743
No 35
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=57.98 E-value=89 Score=24.44 Aligned_cols=30 Identities=13% Similarity=0.266 Sum_probs=22.7
Q ss_pred EEEccCCCEEEEEEEEEE-CCeeeEEEEEEC
Q psy3528 44 FEDIKRNTSVILEFQVVT-GGKYDVDVSIEA 73 (218)
Q Consensus 44 ~e~v~~~~~i~~~y~v~~-~~~~~i~v~v~~ 73 (218)
-..+..|..+.+.|.+.. |++.-.+|++.|
T Consensus 31 ~~~~v~g~~v~V~~~iyN~G~~~A~dV~l~D 61 (181)
T PF05753_consen 31 NKYLVEGEDVTVTYTIYNVGSSAAYDVKLTD 61 (181)
T ss_pred cccccCCcEEEEEEEEEECCCCeEEEEEEEC
Confidence 345567999999999976 444567889988
No 36
>PF08842 Mfa2: Fimbrillin-A associated anchor proteins Mfa1 and Mfa2; InterPro: IPR014941 This family of proteins may be lipoproteins principally from bacilli. They are between 300 and 400 residues. Many Bacteroides-like bacterial species, including Porphyromonas gingivalis, the causal agent of periodontal infection, carry at least two types of fimbriae, namely FimA and Mfa1 fimbriae, following the names of their major subunit proteins []. Normally, FimA fimbriae are long filaments that are easily detached from cells, whereas Mfa1 fimbriae are short filaments that are tightly bound to cells; however, in the absence of Mfa2 protein, the Mfa1 fimbriae are also very long and are not attached. Mfa2 and Mfa1 are associated with each other in whole P. gingivalis cells to the extent that Mfa2 is located on the cell surface and probably associated with Mfa1 fimbriae in such a way that it anchors the Mfa1 fimbriae to the cell surface and regulates Mfa1 filament length [].; PDB: 3PAY_C 3GF8_A.
Probab=57.93 E-value=18 Score=29.63 Aligned_cols=62 Identities=13% Similarity=0.174 Sum_probs=31.2
Q ss_pred EccCCCEEEEEEEEEECC-------eeeEEEEEECCCCcEEeeceecc---c-ceEEE--EeecceeEEEEEEec
Q psy3528 46 DIKRNTSVILEFQVVTGG-------KYDVDVSIEAPTKEIIYRQVKSQ---F-DTHQF--VTLVEGTYVVCFSNE 107 (218)
Q Consensus 46 ~v~~~~~i~~~y~v~~~~-------~~~i~v~v~~p~g~~v~~~~~~~---~-g~f~f--~~~~~G~y~~Cf~n~ 107 (218)
+++.+..+.+.|.-..++ -..+++.|+|.+|+.+....... . +.+.. .....|.|+++....
T Consensus 3 dC~~~~~l~f~y~~~~~~~~~~~~~v~~v~lyvFd~~g~~v~~~~~~~~~~~~~~y~~~~~~l~~G~Y~~va~~n 77 (283)
T PF08842_consen 3 DCPGGLTLKFSYDYNMGNADAFEDEVKRVDLYVFDEDGKLVKQRTIDSEELEGGGYTMFLLDLPPGTYTFVAWGN 77 (283)
T ss_dssp -----EEEEEE---STT-S--HHHH--EEEEEEE-TTSBEEEEEEEECGGCCTTTEEE-CCT--SEEEEEEEEES
T ss_pred CCCceEEEEEEEeCCccccccccceEeEEEEEEEeCCCeEEEEEEcccccccCCceEEeeccCCCCcEEEEEEEC
Confidence 344566666666554111 23799999999999665433221 2 44544 344578999988653
No 37
>PF08234 Spindle_Spc25: Chromosome segregation protein Spc25; InterPro: IPR013255 This is a family of chromosome segregation proteins. It contains Spc25, which is a conserved eukaryotic kinetochore protein involved in cell division. In fungi the Spc25 protein is a subunit of the Nuf2-Ndc80 complex [], and in vertebrates it forms part of the Ndc80 complex []. ; PDB: 2VE7_B.
Probab=55.25 E-value=32 Score=22.53 Aligned_cols=31 Identities=6% Similarity=0.102 Sum_probs=17.0
Q ss_pred eecceeEEEEEEecCCCcceEEEEEEEEecc
Q psy3528 94 TLVEGTYVVCFSNEFSTYSHKVVYMDFQVGD 124 (218)
Q Consensus 94 ~~~~G~y~~Cf~n~~~~~~~k~V~f~~~~g~ 124 (218)
+.+.+..+|.|+|-......+..+|.+.++.
T Consensus 3 ~~~~d~lkf~F~~id~~d~~re~s~~l~i~~ 33 (74)
T PF08234_consen 3 AIGGDQLKFVFTNIDPNDPDREFSFTLDISS 33 (74)
T ss_dssp --STT-EEEEE-S-BTTBSSS-EEEEEE-SS
T ss_pred ccCCceEEEEEeEcCCCCCCceEEEEEEECC
Confidence 3455668888988765444567888888765
No 38
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=54.89 E-value=49 Score=20.49 Aligned_cols=39 Identities=15% Similarity=0.195 Sum_probs=23.1
Q ss_pred EEEEEECCCCcEEeeceecccc-eEEEEeecceeEEEEEEecC
Q psy3528 67 VDVSIEAPTKEIIYRQVKSQFD-THQFVTLVEGTYVVCFSNEF 108 (218)
Q Consensus 67 i~v~v~~p~g~~v~~~~~~~~g-~f~f~~~~~G~y~~Cf~n~~ 108 (218)
....+.+.+++-+.... .. .+.|+....|+|+|-+....
T Consensus 10 Y~Y~l~g~d~~W~~~~~---~~~~~~~~~L~~G~Y~l~V~a~~ 49 (66)
T PF07495_consen 10 YRYRLEGFDDEWITLGS---YSNSISYTNLPPGKYTLEVRAKD 49 (66)
T ss_dssp EEEEEETTESSEEEESS---TS-EEEEES--SEEEEEEEEEEE
T ss_pred EEEEEECCCCeEEECCC---CcEEEEEEeCCCEEEEEEEEEEC
Confidence 34445555555443221 22 78999999999999996543
No 39
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=53.67 E-value=54 Score=26.49 Aligned_cols=57 Identities=12% Similarity=0.073 Sum_probs=38.5
Q ss_pred CCEEEEEEEEEECCeeeEEEEEECCCCcEEeece--ecccceEEEEe------ecceeEEEEEEecC
Q psy3528 50 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVT------LVEGTYVVCFSNEF 108 (218)
Q Consensus 50 ~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~--~~~~g~f~f~~------~~~G~y~~Cf~n~~ 108 (218)
+....+.|.+.+. ...+.+.|+|.+|++ +... ....|.+.|+- -.+|.|++-+....
T Consensus 109 ~~~~~~~~~L~~~-a~~v~v~I~D~~G~v-~t~~l~~~~aG~~~f~WDG~~~~lp~G~Y~~~V~a~~ 173 (218)
T PRK09619 109 EDPVAGRLTLKHP-APTLTLHITDILGQE-KKIDLGKQPAGPVNFTLDPAALGLQPGQYQLSVVSGS 173 (218)
T ss_pred CCeeEEEEecCCc-CcEEEEEEEeCCCCE-EEEecCCcCCCceeEEECCCCCCCCCceeEEEEEEeC
Confidence 3556777876654 346999999999997 4332 23457766663 24789999997553
No 40
>PF10572 UPF0556: Uncharacterised protein family UPF0556; InterPro: IPR018887 This family of proteins has no known function.
Probab=50.24 E-value=1.1e+02 Score=23.32 Aligned_cols=44 Identities=7% Similarity=0.120 Sum_probs=28.1
Q ss_pred hhhcceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEECCeeeE
Q psy3528 24 QIVQAVELTFELADNARECFFEDIKRNTSVILEFQVVTGGKYDV 67 (218)
Q Consensus 24 ~~~~~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~~~i 67 (218)
+.......-|++.||+.-=-++.....-...++|.+.+|.+..=
T Consensus 19 s~~e~~t~eFdvkP~G~~~t~~~~~~~~~C~FTYAaqGGTNEqW 62 (158)
T PF10572_consen 19 SVSEPTTKEFDVKPGGVVHTFSESLGKYKCTFTYAAQGGTNEQW 62 (158)
T ss_pred ccccccceeEEecCCCEEEEeEEecCceEEEEEEEecCCccceE
Confidence 34445677899999876333333334566788888887654433
No 41
>CHL00177 ccs1 c-type cytochrome biogenensis protein; Validated
Probab=50.00 E-value=1.2e+02 Score=27.13 Aligned_cols=77 Identities=9% Similarity=0.027 Sum_probs=44.4
Q ss_pred chhhhhhhHHHHHHHHHHHhhhhcceEEEEEEeCCCcceEEEEccCC------------CEEEEEEEEEE-CCeeeEEEE
Q psy3528 4 RISMLLKKQWIGFFFLLILHQIVQAVELTFELADNARECFFEDIKRN------------TSVILEFQVVT-GGKYDVDVS 70 (218)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~l~~~~~~~~l~~~l~~g~~~Cf~e~v~~~------------~~i~~~y~v~~-~~~~~i~v~ 70 (218)
|+-+.+.+..++++++.+++.......-...+++|+..+...-...| +.+.+.|...+ ...+.-+++
T Consensus 158 r~G~~lvHlslIvIL~Gal~g~l~G~~g~~~i~eGe~~~i~~~~~~g~~~~l~~~svrl~~F~i~y~~~g~p~qF~Sdls 237 (426)
T CHL00177 158 RIAPIIVHFSMILILIGSILGALSGFTAQEIIPKTEIFHIQNIISSGQLSKIPQESARVNDFWITYTPEKSIKQFYSDLS 237 (426)
T ss_pred hhhHHHHHHHHHHHHHHHHHhcccceEEEEEEcCCCceecccccCCCCccccCCcEEEEeeEEEEEcCCCCcceEEEEEE
Confidence 45566677777777777777655555555668888877655422122 12334444322 123456777
Q ss_pred EECCCCcEEe
Q psy3528 71 IEAPTKEIIY 80 (218)
Q Consensus 71 v~~p~g~~v~ 80 (218)
+.|++|+.+.
T Consensus 238 i~d~~g~e~~ 247 (426)
T CHL00177 238 ILNSEGKEIK 247 (426)
T ss_pred EEcCCCCeee
Confidence 7777665443
No 42
>PF05739 SNARE: SNARE domain; InterPro: IPR000727 The process of vesicular fusion with target membranes depends on a set of SNAREs (SNAP-Receptors), which are associated with the fusing membranes [, ]. Target SNAREs (t-SNAREs) are localised on the target membrane and belong to two different families, the syntaxin-like family and the SNAP-25 like family. One member of each family, together with a v-SNARE localised on the vesicular membrane, are required for fusion. The Syntaxins are type-I transmembrane proteins that contain several regions with coiled-coil propensity in their cytosolic part, the SNARE motif. SNAP-25 (IPR000928 from INTERPRO) is a protein consisting of two coiled-coil regions, which is associated with the membrane by lipid anchors. SNARE motifs assemble into parallel four helix bundles stabilised by the burial of these hydrophobic helix faces in the bundle core. Monomeric SNARE motifs are disordered so this assembly reaction is accompanied by a dramatic increase in alpha-helical secondary structure []. The parallel arrangement of SNARE motifs within complexes bring the transmembrane anchors, and the two membranes, into close proximity. Recently, it was shown that the two coiled-coil regions of SNAP-25 and one of the coiled-coil regions of the syntaxins are related []. This domain is found in both Syntaxin and SNAP-25 families as well as in other proteins.; GO: 0005515 protein binding; PDB: 1URQ_B 3RL0_R 1HVV_B 1SFC_B 1N7S_B 3IPD_B 3C98_B 3HD7_F 3RK2_B 1KIL_B ....
Probab=48.21 E-value=64 Score=19.88 Aligned_cols=45 Identities=16% Similarity=0.183 Sum_probs=28.4
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhh
Q psy3528 136 TVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERV 180 (218)
Q Consensus 136 ~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv 180 (218)
+.++.|+..+..|+.....|..+-.-+...-.+....++.+..++
T Consensus 4 ~~l~~l~~~i~~l~~~~~~i~~ev~~Q~~~ld~i~~~vd~~~~~l 48 (63)
T PF05739_consen 4 EELDELEQSIQELKQMFQDIGEEVEEQNEMLDRIEDNVDRANENL 48 (63)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCHhhHHHHHHHHHHHHHHH
Confidence 456778888888888887777775555544444444444444443
No 43
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=46.98 E-value=2.3e+02 Score=25.90 Aligned_cols=69 Identities=12% Similarity=0.187 Sum_probs=43.1
Q ss_pred EEEEEEEEEECCeeeEEEEEECCCCcEEeecee-cccceEEEE----eecceeEEEEEEecCCCc---ceEEEEEEE
Q psy3528 52 SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVK-SQFDTHQFV----TLVEGTYVVCFSNEFSTY---SHKVVYMDF 120 (218)
Q Consensus 52 ~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~-~~~g~f~f~----~~~~G~y~~Cf~n~~~~~---~~k~V~f~~ 120 (218)
.+.+.|.+..-+...|.++|+|.+|+.+..... ...+..+++ -.+.|.|.+=..-..+.. ....++|.+
T Consensus 328 ~~~i~ftv~a~g~~~vta~V~d~~g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v 404 (478)
T PRK13211 328 AATLDFTVTATGDMNVEATVYNHDGEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFML 404 (478)
T ss_pred cEEEEEEEEeccceEEEEEEEcCCCCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEE
Confidence 455666666556778999999999998865432 223334444 336899999997654321 123455555
No 44
>PF07125 DUF1378: Protein of unknown function (DUF1378); InterPro: IPR009808 This entry is represented by Bacteriophage 933W, Orf25. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of hypothetical bacterial and phage proteins of around 59 residues in length. Bacterial members of this family seem to be specific to Enterobacteria. The function of this family is unknown.
Probab=46.60 E-value=25 Score=21.78 Aligned_cols=30 Identities=10% Similarity=0.523 Sum_probs=22.3
Q ss_pred hhhHHhHhHHHHHHHhhhhhHHHHHHhhhccc
Q psy3528 178 ERVMWWSSIEFVVVLLASTLQVYIVKNFFASR 209 (218)
Q Consensus 178 ~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~K 209 (218)
.-++||+.+-+.+.++.+.|- .+|.||++|
T Consensus 6 ~~lLyFctvVcaLYLvsGGyk--~IRnY~r~K 35 (59)
T PF07125_consen 6 TILLYFCTVVCALYLVSGGYK--VIRNYFRRK 35 (59)
T ss_pred HHHHHHHHHHHHHHHHhccHH--HHHHHHHHH
Confidence 346778888788888888775 478888876
No 45
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=46.29 E-value=75 Score=30.21 Aligned_cols=59 Identities=10% Similarity=0.019 Sum_probs=36.8
Q ss_pred CCCEEEEEEEEE--ECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 49 RNTSVILEFQVV--TGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 49 ~~~~i~~~y~v~--~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
.|+.|.+.-... +|...--.+.+.+++|.....+.-.....|++++..+|.|++=....
T Consensus 205 ~~~eI~f~~~a~~~~g~~~LYKF~~i~~~G~~~~~qdYst~n~~~y~~~~~G~Y~i~~~VK 265 (667)
T PRK14081 205 CDEELVFEVESVYEEDRTILYKFVKIDSDGKQTCIQDYSTKNIVSYKEKKSGDYKLLCLVK 265 (667)
T ss_pred cCcEEEEEEEEEeCCCceEEEEEEEECCCCCEEEecCccccceEEEEeCCCccEEEEEEEe
Confidence 355555443333 22233344556677777665555556788999999999999866544
No 46
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.89 E-value=1.4e+02 Score=26.40 Aligned_cols=53 Identities=17% Similarity=0.268 Sum_probs=32.6
Q ss_pred CCEEEEEEEE----EECCeeeEEEEEECCCCcEEeeceecccceEEEEe----ecceeEEEEEE
Q psy3528 50 NTSVILEFQV----VTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVT----LVEGTYVVCFS 105 (218)
Q Consensus 50 ~~~i~~~y~v----~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~----~~~G~y~~Cf~ 105 (218)
+..+.+.|.- .+|..+.+.+++.+|++..++...... |...| ...|.|+.=+.
T Consensus 51 dvpvdvlYD~~~y~isg~~etV~Vtl~G~ns~~~~~~~~~d---FkV~ADLt~a~~Gt~evkl~ 111 (403)
T COG4856 51 DVPVDVLYDSDKYFISGQPETVTVTLKGPNSIVLKSEKPED---FKVVADLTHAGVGTHEVKLQ 111 (403)
T ss_pred eceeEEEEccccccccCCceEEEEEEeCCcceeeeeecCcC---eEEEEEhhhcCCCceEeeeE
Confidence 3456666642 445667899999999998887643222 44332 34566655553
No 47
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=42.10 E-value=30 Score=26.90 Aligned_cols=33 Identities=9% Similarity=0.062 Sum_probs=28.0
Q ss_pred hhhhhHHhHhHHHHHHHhhhhhHHHHHHhhhcc
Q psy3528 176 LNERVMWWSSIEFVVVLLASTLQVYIVKNFFAS 208 (218)
Q Consensus 176 t~~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~ 208 (218)
.+-|+.+++.+.++++++++++|++.+-+--.+
T Consensus 27 I~P~~~~~~~~a~i~l~ilai~q~~~~~~~~~~ 59 (182)
T PF09323_consen 27 IHPRYIPLLYFAAILLLILAIVQLWRWFRPKRR 59 (182)
T ss_pred hCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 567899999999999999999999986665554
No 48
>COG1723 Uncharacterized conserved protein [Function unknown]
Probab=41.87 E-value=52 Score=28.21 Aligned_cols=56 Identities=14% Similarity=0.221 Sum_probs=42.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHhHhHHHHHHHhhhhhHHHH
Q psy3528 140 QMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWWSSIEFVVVLLASTLQVYI 201 (218)
Q Consensus 140 ~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~sii~i~vli~~~~~Qv~~ 201 (218)
++..+++-|+..+..|.+..+...+ .++.+...++-|+-|+-|++-+++++++++.
T Consensus 270 dI~~RvnvLN~Rl~vi~d~l~il~e------~ln~~~s~~lEWivIiLI~~eVllsl~~i~~ 325 (331)
T COG1723 270 DINPRVNVLNRRLEVISDLLDILNE------QLNHSHSTRLEWIVIILIGLEVLLSLYNIIV 325 (331)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHH------HhhhcccceeEEEehhHHHHHHHHHHHHHHH
Confidence 5677888888888888776655543 3455777889999999999999888877753
No 49
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=40.91 E-value=95 Score=20.79 Aligned_cols=20 Identities=10% Similarity=0.144 Sum_probs=12.5
Q ss_pred eeeEEEEEECCCCcEEeece
Q psy3528 64 KYDVDVSIEAPTKEIIYRQV 83 (218)
Q Consensus 64 ~~~i~v~v~~p~g~~v~~~~ 83 (218)
++..++.|+|++|+.++.+.
T Consensus 23 gq~~D~~v~d~~g~~vwrwS 42 (82)
T PF12690_consen 23 GQRYDFVVKDKEGKEVWRWS 42 (82)
T ss_dssp S--EEEEEE-TT--EEEETT
T ss_pred CCEEEEEEECCCCCEEEEec
Confidence 35788999999999998764
No 50
>KOG3878|consensus
Probab=40.65 E-value=1.3e+02 Score=26.10 Aligned_cols=27 Identities=26% Similarity=0.518 Sum_probs=22.3
Q ss_pred ecceeEEEEEEecCCCcceEEEEEEEE
Q psy3528 95 LVEGTYVVCFSNEFSTYSHKVVYMDFQ 121 (218)
Q Consensus 95 ~~~G~y~~Cf~n~~~~~~~k~V~f~~~ 121 (218)
+..|-|-+=|+|+.+-|..|++...+.
T Consensus 440 PGrGvYLLKFDNSYSlWRsKtlYYRVY 466 (469)
T KOG3878|consen 440 PGRGVYLLKFDNSYSLWRSKTLYYRVY 466 (469)
T ss_pred CCCceEEEEecchhhhhcccceEEEEE
Confidence 678999999999998777888776653
No 51
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=40.40 E-value=1.2e+02 Score=23.60 Aligned_cols=21 Identities=14% Similarity=0.148 Sum_probs=15.8
Q ss_pred cccceEEEEeecceeEEEEEE
Q psy3528 85 SQFDTHQFVTLVEGTYVVCFS 105 (218)
Q Consensus 85 ~~~g~f~f~~~~~G~y~~Cf~ 105 (218)
..+|.++|+....|.|-+=..
T Consensus 190 D~~G~~~~~~~~~G~wli~a~ 210 (215)
T PF10670_consen 190 DANGRATFTLPRPGLWLIRAS 210 (215)
T ss_pred CCCCEEEEecCCCEEEEEEEE
Confidence 357888888888888866543
No 52
>KOG1693|consensus
Probab=39.95 E-value=1.9e+02 Score=23.07 Aligned_cols=154 Identities=6% Similarity=0.062 Sum_probs=77.0
Q ss_pred ceEEEEEEeCCCcceEEE-EccCCCEEEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeecceeEE-EE--
Q psy3528 28 AVELTFELADNARECFFE-DIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYV-VC-- 103 (218)
Q Consensus 28 ~~~l~~~l~~g~~~Cf~e-~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~-~C-- 103 (218)
-..++-+++.+...|-++ .|..|....+.|.+..+++..|--.=.-..+...+..+. .|.|.|=- +++|. +|
T Consensus 33 KqC~Y~d~~~~~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~--~G~Y~fCF--sN~fstf~~K 108 (209)
T KOG1693|consen 33 KQCFYEDLKKDDDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRYDSFLFKAEG--KGEYTFCF--SNEFSTFSHK 108 (209)
T ss_pred hhheeeecccCCceEEEEEEEEeCCceeeEEEEECCCCCEEeeccccccccEEEEEec--ceEEEEEe--cCccccccce
Confidence 467899999887766666 677898888888888776543322222234455555443 35444421 12221 11
Q ss_pred ---EEecCCCcceEEEEEEEEeccCCCCCCCcccccchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhhHHHHHhhhh
Q psy3528 104 ---FSNEFSTYSHKVVYMDFQVGDEPQLPGVGEHVTVMTQMESSTQEIHKSLNTIID--YQTHHRLREAQGRKRAEDLNE 178 (218)
Q Consensus 104 ---f~n~~~~~~~k~V~f~~~~g~~~~~~~~~~~~~~l~~l~~~l~~l~~~l~~i~~--eq~~~~~re~~~~~~~est~~ 178 (218)
+++..... +. ..-+.++.+.....-......+...|+.+.+.....+- -+.+.+....++|-+.-|.-.
T Consensus 109 iv~~~~q~~~~-~~-----~~~~~~~~~~~~~~mena~~~I~~~L~~I~~~q~y~R~RE~rn~~tv~st~~Rv~~~Sl~e 182 (209)
T KOG1693|consen 109 IVYMDFQVGEE-PP-----LHPAVSNRDTALTQMENAIVEIHRALNKIDDTQTYYRLREARNRSTVESTNSRVTWWSLLE 182 (209)
T ss_pred Eeeehhhhccc-cc-----cCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccchhcccchhhHHHHHH
Confidence 12211100 00 00010112211001122345566666666666555433 355666666677777666655
Q ss_pred h--hHHhHhHHHHHH
Q psy3528 179 R--VMWWSSIEFVVV 191 (218)
Q Consensus 179 r--v~~~sii~i~vl 191 (218)
. ++..|+.|++++
T Consensus 183 ~~~vv~iSi~Qv~il 197 (209)
T KOG1693|consen 183 IIAVVVISIAQVFIL 197 (209)
T ss_pred HHHHHHHHHHHHHHH
Confidence 4 444556665544
No 53
>KOG2861|consensus
Probab=39.25 E-value=74 Score=28.22 Aligned_cols=55 Identities=16% Similarity=0.193 Sum_probs=43.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHhHhHHHHHHHhhhhhHHHH
Q psy3528 141 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWWSSIEFVVVLLASTLQVYI 201 (218)
Q Consensus 141 l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~sii~i~vli~~~~~Qv~~ 201 (218)
+..++.-|+.+++.+.+..+.++. .+++....++-||-|+-|++-++++++|++.
T Consensus 338 I~qRv~vLN~kl~~i~~~~~~l~e------~ln~r~~~~LEWiIIiLI~~eV~i~i~~i~~ 392 (399)
T KOG2861|consen 338 IGQRVNVLNYKLKVIEDLLDILQE------NLNERHSERLEWIIIILIAFEVAIEIYQIVV 392 (399)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHH------HhhhccccceehhhHHHHHHHHHHHHHHHHH
Confidence 556778888888888887777664 2445666889999999999999999988763
No 54
>PF15432 Sec-ASP3: Accessory Sec secretory system ASP3
Probab=38.91 E-value=1.6e+02 Score=21.73 Aligned_cols=56 Identities=16% Similarity=0.161 Sum_probs=35.5
Q ss_pred EEEEEEEEECCeeeEEEEEECCCCcEEeeceecccceEEEEeec-ceeEEEEEEecCC
Q psy3528 53 VILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLV-EGTYVVCFSNEFS 109 (218)
Q Consensus 53 i~~~y~v~~~~~~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~-~G~y~~Cf~n~~~ 109 (218)
+...+++...++.-+.+..+|-.|+.+-... ..+++..|+.++ .=.|++++-|...
T Consensus 58 l~~~~~~~P~~svylki~F~dr~~e~i~~~i-~k~~~~~F~yP~~aysY~I~LinaG~ 114 (128)
T PF15432_consen 58 LKFNIDVVPENSVYLKIIFFDRQGEEIEEQI-IKNDSFEFTYPEEAYSYTISLINAGC 114 (128)
T ss_pred EEEEEEEccCCeEEEEEEEEccCCCEeeEEE-EecCceEEeCCCCceEEEEEEeeCCC
Confidence 3344444444455567777788777775432 234557887665 5699999988753
No 55
>COG4549 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.85 E-value=1.8e+02 Score=22.46 Aligned_cols=86 Identities=15% Similarity=0.140 Sum_probs=48.0
Q ss_pred EEEEEEeCCCcceEEEEccCCCEEEEEEEEEECCe--eeEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 30 ELTFELADNARECFFEDIKRNTSVILEFQVVTGGK--YDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 30 ~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~--~~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
.-+..+.+++ .+.|.......+|..|-+ ..-.|.|.=|.|-+..+.+.++.-..+- ..|+|+.-..|.
T Consensus 22 sAH~s~~~~e-------a~~gs~~~atlrVPhgcdgkaTtkV~vklPeGvi~~kp~PkpGW~le~---~Kg~y~~ty~~h 91 (178)
T COG4549 22 SAHVSLETGE-------AAAGSTYKATLRVPHGCDGKATTKVRVKLPEGVIFAKPQPKPGWTLET---IKGDYEKTYQNH 91 (178)
T ss_pred eEEEEecccc-------ccCCceEEEEEecCCCCCCCcceEEEEeCCCceeeecccCCCCcEEEE---eecceeeeeecc
Confidence 4456666553 456888888888876522 2456677778887665544443322221 237777777766
Q ss_pred CCCcceEEEEEEEEeccC
Q psy3528 108 FSTYSHKVVYMDFQVGDE 125 (218)
Q Consensus 108 ~~~~~~k~V~f~~~~g~~ 125 (218)
.+..+.-.....+..|+.
T Consensus 92 G~~i~~G~~ev~w~~gnl 109 (178)
T COG4549 92 GSGITSGVKEVFWKGGNL 109 (178)
T ss_pred CCccccceeEEEeccCCC
Confidence 553333333444444443
No 56
>PF10754 DUF2569: Protein of unknown function (DUF2569); InterPro: IPR019690 This entry represents a protein that is conserved in bacteria. The function is not known, but several members are annotated as being YdgK or a homologue thereof and associated to the inner membrane. This signature also matches proteins that are described as transglutaminase-like enzymes, although this could not be confirmed.
Probab=37.79 E-value=85 Score=23.55 Aligned_cols=33 Identities=9% Similarity=0.115 Sum_probs=27.1
Q ss_pred HhHhHHHHHHHhhhhhHHHHHHhhhcccccccc
Q psy3528 182 WWSSIEFVVVLLASTLQVYIVKNFFASRNERQY 214 (218)
Q Consensus 182 ~~sii~i~vli~~~~~Qv~~lk~fF~~Kk~~~~ 214 (218)
..-.++++.-+++.++.++..-.||++|+..|+
T Consensus 54 ~~~~~~~~~~~~~~~~~l~~~~lffkr~~~~P~ 86 (149)
T PF10754_consen 54 ALWYFEVAINIAMWLFTLWLLYLFFKRKRRFPK 86 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHccchhHH
Confidence 444577888888889999999999999987764
No 57
>PF14830 Haemocyan_bet_s: Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=34.94 E-value=1.7e+02 Score=20.80 Aligned_cols=51 Identities=22% Similarity=0.364 Sum_probs=29.6
Q ss_pred HhhhhcceEEEEEEeCCCcceEEEEccCCCEEEEEEEEEEC----------------------------CeeeEEEEEEC
Q psy3528 22 LHQIVQAVELTFELADNARECFFEDIKRNTSVILEFQVVTG----------------------------GKYDVDVSIEA 73 (218)
Q Consensus 22 l~~~~~~~~l~~~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~----------------------------~~~~i~v~v~~ 73 (218)
|-....|..+.|.|-.....|.. .|.|.+.+| +.+.+.++|.+
T Consensus 13 L~Gi~~SA~V~~~ic~~~~~c~~---------AG~F~vLGG~~EMpW~FdRlykydIT~~l~~l~l~~~~~f~i~~~i~~ 83 (103)
T PF14830_consen 13 LHGIGTSADVKFYICKPDDDCHK---------AGTFFVLGGEKEMPWAFDRLYKYDITDALKKLGLHYDDPFHIKVEITA 83 (103)
T ss_dssp -B--SS-EEEEEEEEESSSSEEE---------EEEEEE---TTS---B-SS-EEEE-HHHHHHCT--TTSSEEEEEEEEE
T ss_pred ccCCCceEEEEEEEECCCCCceE---------ccEEEEcCCCccCccccCccchhhHHHHHHHcCCCCCCCEEEEEEEEe
Confidence 44556677888888877778832 334444333 24578888888
Q ss_pred CCCcEEee
Q psy3528 74 PTKEIIYR 81 (218)
Q Consensus 74 p~g~~v~~ 81 (218)
.||+.+-+
T Consensus 84 ~nGt~l~~ 91 (103)
T PF14830_consen 84 VNGTELPS 91 (103)
T ss_dssp SSSSBE-C
T ss_pred cCCCCCcc
Confidence 88887743
No 58
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=34.59 E-value=2.1e+02 Score=22.77 Aligned_cols=26 Identities=12% Similarity=0.338 Sum_probs=6.3
Q ss_pred hhhhHHhHhHHHHHHHhhhhhHHHHH
Q psy3528 177 NERVMWWSSIEFVVVLLASTLQVYIV 202 (218)
Q Consensus 177 ~~rv~~~sii~i~vli~~~~~Qv~~l 202 (218)
.+-..|+..+.++++|+++++=+..+
T Consensus 148 fdit~W~~~Iki~i~iv~~iI~lri~ 173 (204)
T PF00517_consen 148 FDITKWLWYIKIFIMIVIGIIGLRIL 173 (204)
T ss_dssp HCHHHHHHHHHH--------------
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555666666655555433333
No 59
>PF07116 DUF1372: Protein of unknown function (DUF1372); InterPro: IPR010779 This entry is represented by Streptococcus phage Sfi11, Gp93. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several Streptococcus bacteriophage sequences and related proteins from Streptococcus species. Members of this family are typically around 100 residues in length and their function is unknown.
Probab=34.04 E-value=69 Score=22.58 Aligned_cols=31 Identities=23% Similarity=0.339 Sum_probs=22.7
Q ss_pred HhhhhhhHHhHhHHHHHHHhhhhhHHHHHHh
Q psy3528 174 EDLNERVMWWSSIEFVVVLLASTLQVYIVKN 204 (218)
Q Consensus 174 est~~rv~~~sii~i~vli~~~~~Qv~~lk~ 204 (218)
+....+...++.+-+++.++.+++|+++.-.
T Consensus 6 ~~~~~~~~~~~~~~il~s~~~~~~~v~~~~~ 36 (104)
T PF07116_consen 6 KEKRDNLKRFATIIILISLLFNIWQVIYVIN 36 (104)
T ss_pred HhhhhhHHHHHHHHHHHHHHHhHHHHhhhhh
Confidence 3344566777778888888899999977554
No 60
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=32.86 E-value=1.3e+02 Score=18.84 Aligned_cols=26 Identities=15% Similarity=0.276 Sum_probs=13.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3528 138 MTQMESSTQEIHKSLNTIIDYQTHHR 163 (218)
Q Consensus 138 l~~l~~~l~~l~~~l~~i~~eq~~~~ 163 (218)
+++|...+..|+..+..+..+..-.+
T Consensus 5 id~Ls~dVq~L~~kvdqLs~dv~~lr 30 (56)
T PF04728_consen 5 IDQLSSDVQTLNSKVDQLSSDVNALR 30 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555555555544443
No 61
>KOG0518|consensus
Probab=32.06 E-value=1.4e+02 Score=29.88 Aligned_cols=55 Identities=16% Similarity=0.205 Sum_probs=35.6
Q ss_pred EEEEEEEEEECCeeeEEEEEECCCCcEEe---eceecc-cceEEEEeecceeEEEEEEe
Q psy3528 52 SVILEFQVVTGGKYDVDVSIEAPTKEIIY---RQVKSQ-FDTHQFVTLVEGTYVVCFSN 106 (218)
Q Consensus 52 ~i~~~y~v~~~~~~~i~v~v~~p~g~~v~---~~~~~~-~g~f~f~~~~~G~y~~Cf~n 106 (218)
.+.+.|.+.+.+.-.+.+.|.||+|+... .+.... ...++|.....|.|.+=+.-
T Consensus 377 ~~d~~fD~~Dageg~levqV~gp~Gk~~~~~V~d~~~~~~h~vsY~pd~~G~y~i~v~~ 435 (1113)
T KOG0518|consen 377 EVDFTFDEGDAGEGLLEVQVVGPEGKEKEVVVRDNGRGGIHIVTYVPDCPGRYLIVVFY 435 (1113)
T ss_pred cceeeEEccccccceEEEEEECCCCCceeeEEEecCCCceEEEEEcCCCCCceEEEEEE
Confidence 34455555544444599999999997432 111112 45677888899999888754
No 62
>PF07835 COX4_pro_2: Bacterial aa3 type cytochrome c oxidase subunit IV; InterPro: IPR012422 Bacterial cytochrome c oxidase is found bound to the to the cell membrane, where it is involved in the generation of the transmembrane proton electrochemical gradient. It is composed of four subunits. Subunit IV consists of one transmembrane helix that does not interact directly with the other subunits, but maintains its position by indirect contacts via phospholipid molecules found in the structure. The function of subunit IV is as yet unknown []. ; PDB: 1QLE_D 1M57_J 1M56_J.
Probab=30.84 E-value=1e+02 Score=18.11 Aligned_cols=20 Identities=5% Similarity=0.215 Sum_probs=11.5
Q ss_pred hhhHHhHhHHHHHHHhhhhh
Q psy3528 178 ERVMWWSSIEFVVVLLASTL 197 (218)
Q Consensus 178 ~rv~~~sii~i~vli~~~~~ 197 (218)
+-..|.+++-+++++.++++
T Consensus 23 ~~~k~~~~~~~~~li~lai~ 42 (44)
T PF07835_consen 23 KLTKWGTIAIAAILIFLAIF 42 (44)
T ss_dssp HHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34555566666666666543
No 63
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=29.95 E-value=1.4e+02 Score=19.34 Aligned_cols=40 Identities=13% Similarity=0.202 Sum_probs=25.7
Q ss_pred eEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 66 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
+.=+.|+|.+|+.+++....+...++| .....+++=+-|.
T Consensus 8 ~sWv~V~d~dG~~~~~~~l~~G~~~~~--~~~~~~~i~iGna 47 (77)
T PF13464_consen 8 DSWVEVTDADGKVLFSGTLKAGETKTF--EGKEPFRIRIGNA 47 (77)
T ss_pred CeEEEEEeCCCcEeeeeeeCCCcEEEE--eCCCCEEEEEeCC
Confidence 456888888999888765555555566 3344555555544
No 64
>TIGR03711 acc_sec_asp3 accessory Sec system protein Asp3. This protein is designated Asp3 because, along with SecY2, SecA2, and other proteins it is part of the accessory Sec system. The system is involved in the export of serine-rich glycoproteins important for virulence in a number of Gram-positive species, including Streptococcus gordonii and Staphylococcus aureus. This protein family is assigned to transport rather than glycosylation function, but the specific molecular role is unknown.
Probab=29.90 E-value=2.4e+02 Score=21.06 Aligned_cols=43 Identities=14% Similarity=0.071 Sum_probs=23.8
Q ss_pred eEEEEEECCCCcEEeeceecccceEEEEee-cceeEEEEEEecCC
Q psy3528 66 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTL-VEGTYVVCFSNEFS 109 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~-~~G~y~~Cf~n~~~ 109 (218)
-+.+...|..|+.+-... ..+.+..|..+ +.=.|++++-|...
T Consensus 82 ~~ki~F~dr~~~ei~~~i-~~~~~~~F~yP~~aysY~i~LinaG~ 125 (135)
T TIGR03711 82 YLKITFFDRQGEEIGTEI-EKDDSIIFIYPDEAYFYKISLISAGS 125 (135)
T ss_pred EEEEEEeccCCceeceEE-EecCceEEECCCcceEEEEEeeeCCC
Confidence 344445555555443321 12244566644 45689999988753
No 65
>PF12669 P12: Virus attachment protein p12 family
Probab=28.70 E-value=31 Score=21.62 Aligned_cols=11 Identities=9% Similarity=0.558 Sum_probs=8.1
Q ss_pred HHhhhcccccc
Q psy3528 202 VKNFFASRNER 212 (218)
Q Consensus 202 lk~fF~~Kk~~ 212 (218)
++++++++|-.
T Consensus 17 ~r~~~k~~K~G 27 (58)
T PF12669_consen 17 IRKFIKDKKKG 27 (58)
T ss_pred HHHHHHHhhcC
Confidence 48888887654
No 66
>PF09753 Use1: Membrane fusion protein Use1; InterPro: IPR019150 This entry represents a family of proteins, approximately 300 residues in length, involved in vesicle transport. They have a single C-terminal transmembrane domain and a SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] domain of approximately 60 residues. The SNARE domains are essential for membrane fusion and are conserved from yeasts to humans. Use1 is one of the three protein subunits that make up the SNARE complex and it is specifically required for Golgi-endoplasmic reticulum retrograde transport [].
Probab=28.04 E-value=3.3e+02 Score=22.20 Aligned_cols=23 Identities=9% Similarity=0.316 Sum_probs=10.2
Q ss_pred HhHhHHHHHHHhhhhhHHHHHHhh
Q psy3528 182 WWSSIEFVVVLLASTLQVYIVKNF 205 (218)
Q Consensus 182 ~~sii~i~vli~~~~~Qv~~lk~f 205 (218)
||..+-+++ +++.++-++.+=|+
T Consensus 228 ~~~~~~i~~-v~~~Fi~mvl~iri 250 (251)
T PF09753_consen 228 CWTWLMIFV-VIIVFIMMVLFIRI 250 (251)
T ss_pred HHHHHHHHH-HHHHHHHHHHHhee
Confidence 555553333 33344444444333
No 67
>PF07680 DoxA: TQO small subunit DoxA; InterPro: IPR011636 Thiosulphate:quinone oxidoreductase (TQO) catalyses one of the early steps in elemental sulphur oxidation. A novel TQO enzyme was purified from the thermo-acidophilic archaeon Acidianus ambivalens and shown to consist of a large subunit (DoxD) and a smaller subunit (DoxA). The DoxD- and DoxA-like two subunits are fused together in a single polypeptide in Q8AAF0 from SWISSPROT.
Probab=27.72 E-value=2.6e+02 Score=20.79 Aligned_cols=63 Identities=10% Similarity=0.184 Sum_probs=35.7
Q ss_pred EEEEccCCCEEEEEEEEEEC----CeeeEEEEEECCCCcEEeeceecc---------cceEEEEeecceeEEEEEEe
Q psy3528 43 FFEDIKRNTSVILEFQVVTG----GKYDVDVSIEAPTKEIIYRQVKSQ---------FDTHQFVTLVEGTYVVCFSN 106 (218)
Q Consensus 43 f~e~v~~~~~i~~~y~v~~~----~~~~i~v~v~~p~g~~v~~~~~~~---------~g~f~f~~~~~G~y~~Cf~n 106 (218)
+..-.-++..+.++....+| ++.-+.+.+.|++|++++++.... ..+|. .--..|.|.++.--
T Consensus 21 is~~~~~~~~L~f~vyr~~G~D~Ygsfl~~i~l~d~~g~vv~~~~~~~L~~lP~~~i~N~Yv-~~~~~g~~gl~vpL 96 (133)
T PF07680_consen 21 ISDALIENGTLSFHVYRVEGPDVYGSFLIGIQLKDSTGHVVLNWDQEKLSSLPKSNIKNDYV-AKVKPGKHGLVVPL 96 (133)
T ss_pred EeeeEEeCCeEEEEEEEcCCCccCCceeeEEEEECCCCCEEEEeCHHHhhhCChhHcCccEE-ccccCCceeEEEEc
Confidence 34433344444444433343 245689999999999998764322 13332 22236778777753
No 68
>PHA02650 hypothetical protein; Provisional
Probab=27.29 E-value=79 Score=21.19 Aligned_cols=30 Identities=3% Similarity=0.041 Sum_probs=20.0
Q ss_pred hhhhhHHhHhHHHHHHHhhhhhHHHHHHhh
Q psy3528 176 LNERVMWWSSIEFVVVLLASTLQVYIVKNF 205 (218)
Q Consensus 176 t~~rv~~~sii~i~vli~~~~~Qv~~lk~f 205 (218)
....-.++-++-+++++++.++-..|||-.
T Consensus 44 ~~~~~~~~~ii~i~~v~i~~l~~flYLK~~ 73 (81)
T PHA02650 44 VSWFNGQNFIFLIFSLIIVALFSFFVFKGY 73 (81)
T ss_pred cCCchHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334445555666777777888888888754
No 69
>PF02408 CUB_2: CUB-like domain; InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=27.19 E-value=2.3e+02 Score=20.13 Aligned_cols=42 Identities=5% Similarity=0.256 Sum_probs=30.9
Q ss_pred EEeCCCcceEEE-EccCCCEEEEEEEEEECCeeeEEEEEECCCCcE
Q psy3528 34 ELADNARECFFE-DIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEI 78 (218)
Q Consensus 34 ~l~~g~~~Cf~e-~v~~~~~i~~~y~v~~~~~~~i~v~v~~p~g~~ 78 (218)
.+| ....|.++ .+|+|-.+.+.....-.+ .-.+.|.|+.|..
T Consensus 37 ~~p-~n~~C~y~i~iP~G~~a~v~~~~~~~~--~d~i~v~D~~g~~ 79 (120)
T PF02408_consen 37 QFP-ANQNCTYQINIPKGYYAKVTLSANLND--NDSITVTDSNGKS 79 (120)
T ss_pred ccC-CCCceEEEEEcCCceEEEEEEEEecCC--CCEEEEEecCCCE
Confidence 344 45688888 889998877777665444 4578999998876
No 70
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=27.17 E-value=3.4e+02 Score=21.97 Aligned_cols=31 Identities=13% Similarity=0.215 Sum_probs=19.1
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3528 136 TVMTQMESSTQEIHKSLNTIIDYQTHHRLRE 166 (218)
Q Consensus 136 ~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re 166 (218)
.+|++++..+.+++..+..++...+-.+.|.
T Consensus 15 ~~L~rle~qi~q~~~~~~~~qs~l~~~~~r~ 45 (251)
T COG5415 15 ADLSRLESQIHQLDVALKKSQSILSQWQSRL 45 (251)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567777777777777666655544444443
No 71
>COG4467 Regulator of replication initiation timing [Replication, recombination, and repair]
Probab=26.87 E-value=2.4e+02 Score=20.16 Aligned_cols=45 Identities=16% Similarity=0.106 Sum_probs=27.4
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhh
Q psy3528 135 VTVMTQMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNER 179 (218)
Q Consensus 135 ~~~l~~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~r 179 (218)
.+.+.+++..+..+...+..++.....+-+-....+--++....|
T Consensus 7 Fd~v~~le~~l~~l~~el~~lK~~l~~lvEEN~~L~lENe~LR~R 51 (114)
T COG4467 7 FDQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRER 51 (114)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHhhHHHHHHH
Confidence 356677888888888888887776665554333334334444444
No 72
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=25.94 E-value=2.1e+02 Score=19.12 Aligned_cols=9 Identities=11% Similarity=0.099 Sum_probs=4.3
Q ss_pred eeEEEEEEC
Q psy3528 65 YDVDVSIEA 73 (218)
Q Consensus 65 ~~i~v~v~~ 73 (218)
..+.++|++
T Consensus 32 ~~~~~~v~~ 40 (89)
T PF05506_consen 32 AAVTFTVYD 40 (89)
T ss_pred CcEEEEEEe
Confidence 344555554
No 73
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=25.80 E-value=2.5e+02 Score=22.71 Aligned_cols=42 Identities=5% Similarity=0.105 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhHHhHhHHH
Q psy3528 140 QMESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERVMWWSSIEF 188 (218)
Q Consensus 140 ~l~~~l~~l~~~l~~i~~eq~~~~~re~~~~~~~est~~rv~~~sii~i 188 (218)
+....+.+|..++... ..+-.+.+.+..+...|+..+|+.-.
T Consensus 12 ~~~~~L~rle~qi~q~-------~~~~~~~qs~l~~~~~r~tv~slAl~ 53 (251)
T COG5415 12 KYTADLSRLESQIHQL-------DVALKKSQSILSQWQSRLTVYSLALT 53 (251)
T ss_pred cchhhHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 4555666666655433 12233444455555555555554433
No 74
>cd05864 Ig2_VEGFR-2 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF-A also interacts with VEGFR-1, which it binds more strongly than VEGFR-2. VEGFR-2 and -1 may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=25.37 E-value=1.6e+02 Score=18.72 Aligned_cols=27 Identities=15% Similarity=0.271 Sum_probs=18.6
Q ss_pred eecceeEEEEEEecCCCcceEEEEEEEE
Q psy3528 94 TLVEGTYVVCFSNEFSTYSHKVVYMDFQ 121 (218)
Q Consensus 94 ~~~~G~y~~Cf~n~~~~~~~k~V~f~~~ 121 (218)
..+.|.|+-+..|.... .....+|++.
T Consensus 43 ~~D~G~YtC~a~N~~G~-~~~~~t~~l~ 69 (70)
T cd05864 43 EKDAGNYTVVLTNPITK-EEQRHTFQLV 69 (70)
T ss_pred HHHCEEEEEEEEECCCc-eeeEEEEEEE
Confidence 34579999999999753 2345666654
No 75
>PHA02665 hypothetical protein; Provisional
Probab=25.17 E-value=51 Score=26.62 Aligned_cols=29 Identities=24% Similarity=0.641 Sum_probs=25.8
Q ss_pred EeCCCcceEEEEccCCCEEEEEEEEEECC
Q psy3528 35 LADNARECFFEDIKRNTSVILEFQVVTGG 63 (218)
Q Consensus 35 l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~ 63 (218)
.+||+++|.+-.+..+..+...|.|.+|+
T Consensus 25 ~ep~kkkc~ftkirt~~s~a~ry~vsdg~ 53 (322)
T PHA02665 25 FEPGKKKCVFTKIRTSSSLACRYAVSDGG 53 (322)
T ss_pred cccccceeEEEEEecchhhhheeeeccCc
Confidence 57899999999999999899999998875
No 76
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=24.69 E-value=1.6e+02 Score=18.51 Aligned_cols=14 Identities=0% Similarity=0.183 Sum_probs=7.0
Q ss_pred hhhhhHHhHhHHHH
Q psy3528 176 LNERVMWWSSIEFV 189 (218)
Q Consensus 176 t~~rv~~~sii~i~ 189 (218)
.+.+.++|+++-++
T Consensus 36 ~~~~~i~~~~~i~~ 49 (59)
T PF09889_consen 36 RKTQYIFFGIFILF 49 (59)
T ss_pred HHHHHHHHHHHHHH
Confidence 34555555555443
No 77
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=24.31 E-value=1.1e+02 Score=24.72 Aligned_cols=32 Identities=28% Similarity=0.208 Sum_probs=16.7
Q ss_pred hhhhhHHhHhHHHHHHHhhhhh----------HHHHHHhhhc
Q psy3528 176 LNERVMWWSSIEFVVVLLASTL----------QVYIVKNFFA 207 (218)
Q Consensus 176 t~~rv~~~sii~i~vli~~~~~----------Qv~~lk~fF~ 207 (218)
+++..+.-.+|-|+|++++|.+ +|-||||--.
T Consensus 125 ~~K~amLIClIIIAVLfLICT~LfLSTVVLANKVS~LKrskQ 166 (227)
T PF05399_consen 125 NNKMAMLICLIIIAVLFLICTLLFLSTVVLANKVSSLKRSKQ 166 (227)
T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555544432 5666666544
No 78
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=24.11 E-value=2.7e+02 Score=19.85 Aligned_cols=70 Identities=7% Similarity=0.085 Sum_probs=36.2
Q ss_pred EEEEEEEEEECCeeeEEEEEECC-----CCcEEeeceec----------ccceEEEEeecceeEEEEEEecCCCcceEEE
Q psy3528 52 SVILEFQVVTGGKYDVDVSIEAP-----TKEIIYRQVKS----------QFDTHQFVTLVEGTYVVCFSNEFSTYSHKVV 116 (218)
Q Consensus 52 ~i~~~y~v~~~~~~~i~v~v~~p-----~g~~v~~~~~~----------~~g~f~f~~~~~G~y~~Cf~n~~~~~~~k~V 116 (218)
.+.+.+.+...+.+.+.+...|. +|+.+...... ......+.....|.|.|.+.-.........+
T Consensus 49 ~~~G~~~~~~~G~y~f~~~~~d~~~l~idg~~vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y~~~~~~~~~~ 128 (145)
T PF07691_consen 49 RWTGYFKPPETGTYTFSLTSDDGARLWIDGKLVIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEYFNRGGDASLI 128 (145)
T ss_dssp EEEEEEEESSSEEEEEEEEESSEEEEEETTEEEEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEEEECSCSSCCE
T ss_pred EEEEEEecccCceEEEEEEecccEEEEECCEEEEcCCccccccccccccceEEEEEEeeCCeeEEEEEEEEECCCCCEEE
Confidence 45566666555555444443332 56666554332 2334556677788999999754432222223
Q ss_pred EEEEE
Q psy3528 117 YMDFQ 121 (218)
Q Consensus 117 ~f~~~ 121 (218)
.+.+.
T Consensus 129 ~l~w~ 133 (145)
T PF07691_consen 129 QLSWT 133 (145)
T ss_dssp EEEEE
T ss_pred EEEEE
Confidence 55554
No 79
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=24.04 E-value=3.2e+02 Score=23.19 Aligned_cols=32 Identities=9% Similarity=-0.039 Sum_probs=17.6
Q ss_pred HHhhHHHHHhhhhhhHHhHhHHHHHHHhhhhh
Q psy3528 166 EAQGRKRAEDLNERVMWWSSIEFVVVLLASTL 197 (218)
Q Consensus 166 e~~~~~~~est~~rv~~~sii~i~vli~~~~~ 197 (218)
+......+...|.-+..+|++.++++...-+.
T Consensus 248 d~~~s~is~~~N~imk~LTi~s~iflPpTlIa 279 (322)
T COG0598 248 DAYLSLINNNQNEIMKILTIVSTIFLPPTLIT 279 (322)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHH
Confidence 33444445556666666666666655544333
No 80
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=23.64 E-value=2.2e+02 Score=18.57 Aligned_cols=37 Identities=19% Similarity=0.210 Sum_probs=23.6
Q ss_pred eEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEec
Q psy3528 66 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 107 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~ 107 (218)
.+.+.+.+.+... . -...|.|.+. ...|.|.+-|+..
T Consensus 17 ~a~V~~~~~~~~~-~---Td~~G~F~i~-~~~g~~~l~is~~ 53 (88)
T PF13715_consen 17 GATVYLKNTKKGT-V---TDENGRFSIK-LPEGDYTLKISYI 53 (88)
T ss_pred CeEEEEeCCcceE-E---ECCCeEEEEE-EcCCCeEEEEEEe
Confidence 4555555544111 1 2357999999 4478999999754
No 81
>PHA03054 IMV membrane protein; Provisional
Probab=23.49 E-value=1.1e+02 Score=20.03 Aligned_cols=26 Identities=27% Similarity=0.359 Sum_probs=17.2
Q ss_pred hhhHHhHhHHHHHHHhhhhhHHHHHH
Q psy3528 178 ERVMWWSSIEFVVVLLASTLQVYIVK 203 (218)
Q Consensus 178 ~rv~~~sii~i~vli~~~~~Qv~~lk 203 (218)
..-.++-++-++.++++.++-..|||
T Consensus 45 ~~~~~~~ii~l~~v~~~~l~~flYLK 70 (72)
T PHA03054 45 CWGWYWLIIIFFIVLILLLLIYLYLK 70 (72)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44445555556677777777788877
No 82
>KOG3385|consensus
Probab=23.17 E-value=2.3e+02 Score=20.53 Aligned_cols=26 Identities=8% Similarity=0.022 Sum_probs=14.7
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHH
Q psy3528 136 TVMTQMESSTQEIHKSLNTIIDYQTH 161 (218)
Q Consensus 136 ~~l~~l~~~l~~l~~~l~~i~~eq~~ 161 (218)
+.++.+++.+..|...--+|-+|-++
T Consensus 36 e~~e~L~~kV~aLKsLs~dIg~Ev~~ 61 (118)
T KOG3385|consen 36 EAAESLQQKVKALKSLSLDIGDEVRT 61 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccch
Confidence 34556666666666655555555333
No 83
>PHA02819 hypothetical protein; Provisional
Probab=22.95 E-value=1.2e+02 Score=19.88 Aligned_cols=27 Identities=11% Similarity=0.310 Sum_probs=17.6
Q ss_pred hhhHHhHhHHHHHHHhhhhhHHHHHHh
Q psy3528 178 ERVMWWSSIEFVVVLLASTLQVYIVKN 204 (218)
Q Consensus 178 ~rv~~~sii~i~vli~~~~~Qv~~lk~ 204 (218)
..-.++-++-++.++++.++-..|||-
T Consensus 43 ~~~~~~~ii~l~~~~~~~~~~flYLK~ 69 (71)
T PHA02819 43 SFLRYYLIIGLVTIVFVIIFIIFYLKV 69 (71)
T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444455555566677777777888873
No 84
>cd05860 Ig4_SCFR Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). Ig4_SCFR: The fourth Immunoglobulin (Ig)-like domain in stem cell factor receptor (SCFR). SCFR is organized as an extracellular component having five IG-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR. This fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth domain abolishes the ligand-induced dimerization of SCFR and completely inhibits signal transduction.
Probab=22.46 E-value=2.8e+02 Score=19.44 Aligned_cols=26 Identities=27% Similarity=0.243 Sum_probs=21.2
Q ss_pred eecceeEEEEEEecCCCcceEEEEEEEEe
Q psy3528 94 TLVEGTYVVCFSNEFSTYSHKVVYMDFQV 122 (218)
Q Consensus 94 ~~~~G~y~~Cf~n~~~~~~~k~V~f~~~~ 122 (218)
..+.|.|.+=..|... ...+.|++.+
T Consensus 74 ~~E~G~YTf~a~N~~~---~~s~tF~l~v 99 (101)
T cd05860 74 GTEGGTYTFLVSNSDA---SASVTFNVYV 99 (101)
T ss_pred hhhCcEEEEEEECCCC---eEEEEEEEEE
Confidence 4579999999999975 4678888875
No 85
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=22.39 E-value=2.9e+02 Score=19.58 Aligned_cols=22 Identities=14% Similarity=0.255 Sum_probs=10.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH
Q psy3528 138 MTQMESSTQEIHKSLNTIIDYQ 159 (218)
Q Consensus 138 l~~l~~~l~~l~~~l~~i~~eq 159 (218)
+..++..+..+...+..++...
T Consensus 10 l~~le~~l~~l~~~~~~LK~~~ 31 (107)
T PF06156_consen 10 LDQLEQQLGQLLEELEELKKQL 31 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555544444444443
No 86
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=22.22 E-value=2.7e+02 Score=19.15 Aligned_cols=31 Identities=16% Similarity=0.126 Sum_probs=18.6
Q ss_pred CcEEeeceecccceEEEEeecceeEEEEEEe
Q psy3528 76 KEIIYRQVKSQFDTHQFVTLVEGTYVVCFSN 106 (218)
Q Consensus 76 g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n 106 (218)
....|..+...+|.|++.--.+|.|.+-...
T Consensus 38 ~~yqYwt~td~~G~Fti~~V~pGtY~L~ay~ 68 (95)
T PF14686_consen 38 KGYQYWTRTDSDGNFTIPNVRPGTYRLYAYA 68 (95)
T ss_dssp SS-EEEEE--TTSEEE---B-SEEEEEEEEE
T ss_pred CCCcEEEEeCCCCcEEeCCeeCcEeEEEEEE
Confidence 3344444555789999999999999999876
No 87
>PHA02955 hypothetical protein; Provisional
Probab=22.19 E-value=80 Score=25.40 Aligned_cols=25 Identities=20% Similarity=0.223 Sum_probs=17.9
Q ss_pred HhHhHHHHHHHhhhhhHHHHHHhhhc
Q psy3528 182 WWSSIEFVVVLLASTLQVYIVKNFFA 207 (218)
Q Consensus 182 ~~sii~i~vli~~~~~Qv~~lk~fF~ 207 (218)
-|.++-+++++++.++ ++|+||=..
T Consensus 180 ~w~ii~~v~ii~~~v~-l~yikR~i~ 204 (213)
T PHA02955 180 KWFIIYIVLCLLILII-LGYIYRTVR 204 (213)
T ss_pred cchhHHHHHHHHHHHH-HHHHHHHhe
Confidence 4666666666777777 999998654
No 88
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=22.12 E-value=2.1e+02 Score=17.80 Aligned_cols=25 Identities=20% Similarity=0.286 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3528 139 TQMESSTQEIHKSLNTIIDYQTHHR 163 (218)
Q Consensus 139 ~~l~~~l~~l~~~l~~i~~eq~~~~ 163 (218)
+.+|..+.++...+..++.+..-.+
T Consensus 3 ~elEn~~~~~~~~i~tvk~en~~i~ 27 (55)
T PF05377_consen 3 DELENELPRIESSINTVKKENEEIS 27 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566666666666666655554444
No 89
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=21.69 E-value=2.1e+02 Score=19.51 Aligned_cols=10 Identities=40% Similarity=0.703 Sum_probs=5.4
Q ss_pred ceeEEEEEEe
Q psy3528 97 EGTYVVCFSN 106 (218)
Q Consensus 97 ~G~y~~Cf~n 106 (218)
.|.|.+=+..
T Consensus 71 ~G~YtV~wrv 80 (97)
T PF04234_consen 71 PGTYTVSWRV 80 (97)
T ss_dssp SEEEEEEEEE
T ss_pred CceEEEEEEE
Confidence 4555555544
No 90
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=21.67 E-value=4.8e+02 Score=21.79 Aligned_cols=17 Identities=12% Similarity=0.223 Sum_probs=7.3
Q ss_pred HHhhhhhhHHhHhHHHH
Q psy3528 173 AEDLNERVMWWSSIEFV 189 (218)
Q Consensus 173 ~est~~rv~~~sii~i~ 189 (218)
+..+|..+...+++.++
T Consensus 251 s~~~N~~mk~LTvvt~I 267 (318)
T TIGR00383 251 NNKMNEIMKILTVVSTI 267 (318)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444433333
No 91
>PF14109 GldH_lipo: GldH lipoprotein
Probab=21.61 E-value=3.3e+02 Score=19.90 Aligned_cols=44 Identities=7% Similarity=0.094 Sum_probs=27.7
Q ss_pred eeEEEEEECCCCcEEeecee-cccceEE----EEeecceeEEEEEEecC
Q psy3528 65 YDVDVSIEAPTKEIIYRQVK-SQFDTHQ----FVTLVEGTYVVCFSNEF 108 (218)
Q Consensus 65 ~~i~v~v~~p~g~~v~~~~~-~~~g~f~----f~~~~~G~y~~Cf~n~~ 108 (218)
..+.+.+.||+|+.+-+.-+ ..+.++. +..+++|.|.|.+.-.+
T Consensus 68 dtl~~~Lad~~G~w~G~G~~~~~e~~~~~~~~~~f~~~G~Y~~~i~q~M 116 (131)
T PF14109_consen 68 DTLECELADPDGKWLGKGIGDLYEYKLPYKENVRFPRKGSYTFTIEQAM 116 (131)
T ss_pred eeEEEEEECCCCcEeeeeEeEeEEEEEEeecceecCCCCcEEEEEEecc
Confidence 36888888888887644322 2223332 34567899999987543
No 92
>KOG2678|consensus
Probab=21.08 E-value=4.6e+02 Score=21.39 Aligned_cols=31 Identities=13% Similarity=0.508 Sum_probs=24.3
Q ss_pred hhhHHhHhHHHHHHHhhhhhHHHHHHhhhcc
Q psy3528 178 ERVMWWSSIEFVVVLLASTLQVYIVKNFFAS 208 (218)
Q Consensus 178 ~rv~~~sii~i~vli~~~~~Qv~~lk~fF~~ 208 (218)
+...+|--+.++++++++++-.+.+=++|++
T Consensus 212 sk~s~wf~~~miI~v~~sFVsMiliiqifkk 242 (244)
T KOG2678|consen 212 SKLSYWFYITMIIFVILSFVSMILIIQIFKK 242 (244)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3446777777888888888888888888875
No 93
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=21.06 E-value=3.2e+02 Score=19.54 Aligned_cols=22 Identities=14% Similarity=0.217 Sum_probs=10.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH
Q psy3528 138 MTQMESSTQEIHKSLNTIIDYQ 159 (218)
Q Consensus 138 l~~l~~~l~~l~~~l~~i~~eq 159 (218)
+..++..+..+...+..++...
T Consensus 10 l~~le~~l~~l~~el~~LK~~~ 31 (110)
T PRK13169 10 LDDLEQNLGVLLKELGALKKQL 31 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555555555444444443
No 94
>cd04976 Ig2_VEGFR Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-2 is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A. VEGFR-1 may play an inhibitory part in these processes by binding VEGF and interfering with its interaction with VEGFR-2. VEGFR-1 has a signa
Probab=20.90 E-value=1.6e+02 Score=18.53 Aligned_cols=27 Identities=19% Similarity=0.262 Sum_probs=18.7
Q ss_pred eecceeEEEEEEecCCCcceEEEEEEEE
Q psy3528 94 TLVEGTYVVCFSNEFSTYSHKVVYMDFQ 121 (218)
Q Consensus 94 ~~~~G~y~~Cf~n~~~~~~~k~V~f~~~ 121 (218)
..+.|.|+-+..|.... ..+.+++.+.
T Consensus 44 ~~D~G~YtC~a~N~~g~-~~~~~~~~~~ 70 (71)
T cd04976 44 EEDAGNYTVVLTNKQAK-LEKRLTFTLV 70 (71)
T ss_pred HHHCEEEEEEEEcCCcc-EEEEEEEEEE
Confidence 45689999999998753 2455666553
No 95
>PF03554 Herpes_UL73: UL73 viral envelope glycoprotein ; InterPro: IPR005211 This entry represents a conserved region found in a number of viral proteins: BLRF1, U46, 53, and UL73, collectively known as glycoprotein N. These UL73-like envelope glycoproteins, which associate in a high molecular mass complex with their counterpart protein gM, induce neutralizing antibody responses in the host. These glycoproteins are highly polymorphic, particularly in the N-terminal region [].; GO: 0019031 viral envelope
Probab=20.87 E-value=1.3e+02 Score=20.33 Aligned_cols=27 Identities=22% Similarity=0.322 Sum_probs=20.4
Q ss_pred hhhhhhHHhHhHHHHHHHhhhhhHHHH
Q psy3528 175 DLNERVMWWSSIEFVVVLLASTLQVYI 201 (218)
Q Consensus 175 st~~rv~~~sii~i~vli~~~~~Qv~~ 201 (218)
+..+-..+|.+++.+++++.+++=+.|
T Consensus 44 sl~SFsSIW~iiN~~il~~A~~vyLry 70 (82)
T PF03554_consen 44 SLSSFSSIWAIINVVILLCAFCVYLRY 70 (82)
T ss_pred eehHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445667789999999998887765544
No 96
>cd03455 SAV4209 SAV4209 is a Streptomyces avermitilis protein with a hot dog fold that is similar to those of (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. The alpha- and gamma-proteobacterial members of this CD have, in addition to a hot dog fold, an N-terminal extension.
Probab=20.54 E-value=3.1e+02 Score=19.21 Aligned_cols=37 Identities=5% Similarity=0.042 Sum_probs=26.4
Q ss_pred EccCCCEEEEEEEEEECCe---eeEEEEEECCCCcEEeec
Q psy3528 46 DIKRNTSVILEFQVVTGGK---YDVDVSIEAPTKEIIYRQ 82 (218)
Q Consensus 46 ~v~~~~~i~~~y~v~~~~~---~~i~v~v~~p~g~~v~~~ 82 (218)
.+..|+.+++.-.+.+-.. ..+.+.+.+++|+.+...
T Consensus 79 pv~~Gdtl~~~~~v~~~~~~~~v~~~~~~~nq~G~~v~~g 118 (123)
T cd03455 79 PLYAGDTLRFGGRVTAKRDDEVVTVELWARNSEGDHVMAG 118 (123)
T ss_pred cccCCCEEEEEEEEEeeccCcEEEEEEEEEcCCCCEEEeE
Confidence 3457999999988875322 257778889999877543
No 97
>PF04109 APG9: Autophagy protein Apg9 ; InterPro: IPR007241 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targeting and autophagic vesicles, possibly serving as a marker for a specialised compartment essential for these vesicle-mediated alternative targeting pathways [].
Probab=20.51 E-value=1.8e+02 Score=25.65 Aligned_cols=35 Identities=14% Similarity=0.235 Sum_probs=29.5
Q ss_pred HhhHHHHHhhhhhhHHhHhHHHHHHHhhhhhHHHH
Q psy3528 167 AQGRKRAEDLNERVMWWSSIEFVVVLLASTLQVYI 201 (218)
Q Consensus 167 ~~~~~~~est~~rv~~~sii~i~vli~~~~~Qv~~ 201 (218)
.+..+..+...+|+.+.+++.+++.-.+.++|+.|
T Consensus 108 ~~r~~l~~~Lr~Rf~~~gi~nlll~Pfi~i~~il~ 142 (370)
T PF04109_consen 108 SRRKELAEELRKRFRLAGILNLLLSPFILIYQILY 142 (370)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 44567788999999999999999998888888753
No 98
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=20.47 E-value=2.3e+02 Score=17.68 Aligned_cols=41 Identities=15% Similarity=0.034 Sum_probs=22.5
Q ss_pred eEEEEEECCCCcEEeeceecccceEEEEeecceeEEEEEEecC
Q psy3528 66 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNEF 108 (218)
Q Consensus 66 ~i~v~v~~p~g~~v~~~~~~~~g~f~f~~~~~G~y~~Cf~n~~ 108 (218)
+..+.+.+.+|..+-...-.-+| .+.....|.|.+-+.-..
T Consensus 18 ~~~v~at~~dG~~~~~~~~~vs~--~~d~~~~G~y~Vt~~y~~ 58 (67)
T PF07523_consen 18 GLFVTATYSDGTSLPLSDVTVSG--TVDTSKAGTYTVTYTYKG 58 (67)
T ss_dssp CHEEEEEETTS-ES-GCCSEEES-----TTS-CCEEEEEEECT
T ss_pred CCEEEEEEcCCCEeceeeeEEEe--eeecCCCceEEEEEEECC
Confidence 35577777788774333222234 677888899998887553
No 99
>PLN00115 pollen allergen group 3; Provisional
Probab=20.34 E-value=3.5e+02 Score=19.63 Aligned_cols=26 Identities=23% Similarity=0.289 Sum_probs=17.1
Q ss_pred HHHHHHHHHhhhhc-ceEEEEEEeCCC
Q psy3528 14 IGFFFLLILHQIVQ-AVELTFELADNA 39 (218)
Q Consensus 14 ~~~~~~~~l~~~~~-~~~l~~~l~~g~ 39 (218)
|..+.+..|+.... ...++|+|+.|+
T Consensus 8 ~~~~~~a~l~~~~~~g~~v~F~V~~gS 34 (118)
T PLN00115 8 LLAVALAALFAVGSCATEVTFKVGKGS 34 (118)
T ss_pred HHHHHHHHHhhhhhcCCceEEEECCCC
Confidence 44455555654433 448999999887
No 100
>PF11166 DUF2951: Protein of unknown function (DUF2951); InterPro: IPR021337 This family of proteins has no known function. It has a highly conserved sequence.
Probab=20.14 E-value=3.1e+02 Score=19.07 Aligned_cols=33 Identities=6% Similarity=0.166 Sum_probs=16.7
Q ss_pred HHHhhhhhhHHhHhHHHHHHHhhhhhHHHHHHhhh
Q psy3528 172 RAEDLNERVMWWSSIEFVVVLLASTLQVYIVKNFF 206 (218)
Q Consensus 172 ~~est~~rv~~~sii~i~vli~~~~~Qv~~lk~fF 206 (218)
.++.+-.-+..|-+.-++.+++.-+ +-.||.+|
T Consensus 64 ~n~Knir~~KmwilGlvgTi~gsli--ia~lr~~f 96 (98)
T PF11166_consen 64 KNDKNIRDIKMWILGLVGTIFGSLI--IALLRTIF 96 (98)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHH--HHHHHHHh
Confidence 3334344466677666666554332 33455544
Done!