Query psy15726
Match_columns 171
No_of_seqs 169 out of 663
Neff 7.6
Searched_HMMs 46136
Date Fri Aug 16 19:50:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15726.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15726hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0518|consensus 100.0 2.3E-28 5E-33 221.8 16.6 154 15-170 722-880 (1113)
2 smart00557 IG_FLMN Filamin-typ 100.0 1.5E-27 3.3E-32 166.0 13.8 92 50-141 1-92 (93)
3 KOG0518|consensus 100.0 2E-27 4.3E-32 215.8 17.6 149 15-171 635-787 (1113)
4 PF00630 Filamin: Filamin/ABP2 99.9 1.1E-23 2.5E-28 147.4 11.7 89 48-136 2-101 (101)
5 smart00557 IG_FLMN Filamin-typ 98.6 4.2E-08 9.2E-13 67.8 4.3 31 16-46 62-92 (93)
6 PF00630 Filamin: Filamin/ABP2 98.2 1E-06 2.3E-11 61.0 2.7 26 16-41 76-101 (101)
7 PRK14081 triple tyrosine motif 97.2 0.015 3.3E-07 53.2 14.1 147 16-169 344-508 (667)
8 KOG1428|consensus 96.9 0.009 1.9E-07 58.5 10.5 46 98-144 2099-2144(3738)
9 PRK14081 triple tyrosine motif 96.9 0.06 1.3E-06 49.4 15.3 148 16-169 442-603 (667)
10 PF01835 A2M_N: MG2 domain; I 96.7 0.01 2.3E-07 40.8 7.1 64 63-126 11-86 (99)
11 TIGR03503 conserved hypothetic 95.8 0.21 4.7E-06 42.9 11.8 110 16-141 178-302 (374)
12 PF06312 Neurexophilin: Neurex 95.8 0.041 8.9E-07 43.9 6.9 45 81-125 70-115 (219)
13 PF13115 YtkA: YtkA-like 93.7 0.64 1.4E-05 30.9 7.6 43 81-123 38-85 (86)
14 COG2373 Large extracellular al 93.4 0.67 1.5E-05 46.8 10.2 106 63-168 405-522 (1621)
15 PF02369 Big_1: Bacterial Ig-l 93.3 1.1 2.4E-05 31.1 8.5 63 67-129 23-92 (100)
16 PF13860 FlgD_ig: FlgD Ig-like 92.5 1.6 3.5E-05 28.9 8.1 57 68-125 13-77 (81)
17 TIGR00864 PCC polycystin catio 91.4 12 0.00026 40.0 16.1 144 10-164 2001-2153(2740)
18 PF04234 CopC: CopC domain; I 89.7 0.27 5.8E-06 34.0 2.2 53 84-138 36-95 (97)
19 PF13473 Cupredoxin_1: Cupredo 89.4 2.8 6.1E-05 28.9 7.3 59 62-127 38-96 (104)
20 PF11896 DUF3416: Domain of un 89.1 4.1 8.8E-05 31.8 8.6 63 62-124 22-87 (187)
21 TIGR00868 hCaCC calcium-activa 88.9 12 0.00027 35.8 13.1 108 18-129 559-684 (863)
22 PF07705 CARDB: CARDB; InterP 87.8 3.7 8.1E-05 27.3 6.9 66 62-127 14-85 (101)
23 smart00736 CADG Dystroglycan-t 87.7 3.9 8.5E-05 27.9 7.0 81 62-143 8-95 (97)
24 PRK06655 flgD flagellar basal 87.0 5.3 0.00012 32.0 8.3 59 66-125 112-178 (225)
25 PRK10301 hypothetical protein; 86.6 2.3 4.9E-05 30.9 5.5 25 114-139 96-123 (124)
26 cd02858 Esterase_N_term Estera 85.4 8.7 0.00019 25.6 8.5 60 67-130 6-65 (85)
27 COG2372 CopC Uncharacterized p 84.6 7.1 0.00015 28.6 7.2 72 67-139 45-124 (127)
28 PF05404 TRAP-delta: Transloco 83.5 5.8 0.00013 30.4 6.6 54 67-124 43-101 (167)
29 KOG1692|consensus 83.3 8.5 0.00018 30.2 7.5 33 63-95 37-69 (201)
30 PF01483 P_proprotein: Proprot 83.0 6.5 0.00014 26.1 6.1 54 70-125 5-70 (87)
31 COG1470 Predicted membrane pro 82.7 12 0.00026 33.3 9.0 120 19-139 347-482 (513)
32 PRK09619 flgD flagellar basal 82.5 7.6 0.00017 30.9 7.3 70 52-124 97-170 (218)
33 PRK12812 flgD flagellar basal 82.2 10 0.00022 31.1 8.0 58 67-125 128-193 (259)
34 PF04151 PPC: Bacterial pre-pe 82.2 10 0.00022 24.0 7.4 52 71-124 15-69 (70)
35 TIGR03096 nitroso_cyanin nitro 81.8 7.3 0.00016 28.8 6.4 65 62-133 64-130 (135)
36 PF13620 CarboxypepD_reg: Carb 81.7 9 0.00019 24.6 6.4 46 81-129 15-60 (82)
37 PF07523 Big_3: Bacterial Ig-l 80.2 4.6 9.9E-05 25.6 4.4 44 82-130 18-61 (67)
38 PF13584 BatD: Oxygen toleranc 79.9 11 0.00024 33.1 8.2 62 106-167 74-151 (484)
39 PRK12633 flgD flagellar basal 78.2 12 0.00027 29.9 7.3 66 68-134 117-193 (230)
40 PF07495 Y_Y_Y: Y_Y_Y domain; 77.9 3.7 8E-05 25.5 3.4 19 108-126 30-48 (66)
41 KOG1428|consensus 77.8 8.3 0.00018 39.3 6.9 36 11-47 2107-2142(3738)
42 PF00017 SH2: SH2 domain; Int 77.1 6.7 0.00015 25.2 4.6 27 102-128 19-45 (77)
43 PF14545 DBB: Dof, BCAP, and B 75.3 32 0.00068 25.7 8.7 75 62-139 7-88 (142)
44 PF06016 Reovirus_L2: Reovirus 74.4 4 8.7E-05 40.2 4.0 103 63-167 1035-1149(1289)
45 PF13584 BatD: Oxygen toleranc 73.5 63 0.0014 28.3 15.0 61 16-76 79-152 (484)
46 PF14524 Wzt_C: Wzt C-terminal 72.5 31 0.00066 24.3 7.6 64 62-125 30-107 (142)
47 TIGR00864 PCC polycystin catio 72.2 1.3E+02 0.0029 32.7 14.1 139 17-167 1749-1891(2740)
48 PRK12634 flgD flagellar basal 72.1 37 0.0008 27.1 8.5 55 81-135 122-187 (221)
49 smart00252 SH2 Src homology 2 71.9 6.1 0.00013 25.8 3.4 28 100-127 18-45 (84)
50 PF10670 DUF4198: Domain of un 71.5 31 0.00068 26.3 7.9 62 64-127 148-213 (215)
51 PF11589 DUF3244: Domain of un 71.3 10 0.00022 26.4 4.6 53 81-133 48-102 (106)
52 smart00634 BID_1 Bacterial Ig- 70.1 30 0.00065 23.2 8.5 28 100-127 57-84 (92)
53 KOG4088|consensus 69.4 22 0.00047 26.6 6.0 53 68-124 44-101 (167)
54 PF09134 Invasin_D3: Invasin, 68.2 38 0.00083 23.7 6.9 70 67-138 21-95 (99)
55 PRK13211 N-acetylglucosamine-b 67.9 92 0.002 27.9 11.2 86 52-139 312-404 (478)
56 PF01105 EMP24_GP25L: emp24/gp 67.5 1.8 3.8E-05 32.2 0.0 61 64-126 19-82 (183)
57 PF09315 DUF1973: Domain of un 67.4 37 0.00079 26.2 7.3 59 64-124 26-85 (179)
58 KOG2996|consensus 67.2 5.1 0.00011 36.6 2.8 30 99-128 701-730 (865)
59 PRK15095 FKBP-type peptidyl-pr 67.1 37 0.00081 25.4 7.2 79 37-118 32-132 (156)
60 PRK12813 flgD flagellar basal 66.7 65 0.0014 25.8 9.0 57 66-125 110-173 (223)
61 PF05751 FixH: FixH; InterPro 65.9 48 0.001 23.9 10.1 61 65-125 67-132 (146)
62 TIGR03769 P_ac_wall_RPT actino 64.8 7.5 0.00016 22.6 2.3 24 15-38 6-29 (41)
63 COG4856 Uncharacterized protei 63.7 71 0.0015 27.8 8.9 74 3-85 16-108 (403)
64 KOG0544|consensus 63.7 40 0.00088 23.6 6.1 57 105-166 21-78 (108)
65 cd00173 SH2 Src homology 2 dom 62.5 17 0.00037 23.9 4.2 28 100-127 17-44 (94)
66 cd05771 IgC_Tapasin_R Tapasin- 61.9 56 0.0012 23.3 11.7 74 12-90 4-77 (139)
67 PF14874 PapD-like: Flagellar- 59.9 50 0.0011 22.1 9.8 74 63-139 16-97 (102)
68 PF12988 DUF3872: Domain of un 58.4 64 0.0014 23.9 6.8 82 33-126 25-120 (137)
69 PF10648 Gmad2: Immunoglobulin 57.5 50 0.0011 22.3 5.8 46 81-126 29-81 (88)
70 PF00801 PKD: PKD domain; Int 57.4 27 0.00058 21.8 4.3 55 62-126 7-61 (69)
71 PF03896 TRAP_alpha: Transloco 54.7 1.1E+02 0.0024 25.4 8.5 81 61-141 93-191 (285)
72 PF13754 Big_3_4: Bacterial Ig 52.5 32 0.0007 20.8 3.8 26 100-125 7-32 (54)
73 PF06832 BiPBP_C: Penicillin-B 52.0 67 0.0015 21.2 6.8 20 105-124 59-78 (89)
74 PRK05842 flgD flagellar basal 51.9 1E+02 0.0023 25.8 7.9 73 52-125 135-220 (295)
75 PF12975 DUF3859: Domain of un 51.2 91 0.002 22.5 6.9 70 62-131 37-119 (125)
76 KOG1693|consensus 50.8 77 0.0017 25.1 6.5 51 69-123 47-98 (209)
77 KOG4637|consensus 50.1 15 0.00033 31.8 2.7 34 95-128 35-68 (464)
78 TIGR03503 conserved hypothetic 49.4 1.8E+02 0.0038 25.3 13.6 77 63-139 102-216 (374)
79 PF07703 A2M_N_2: Alpha-2-macr 48.9 92 0.002 21.9 10.2 102 62-167 9-118 (136)
80 PF08308 PEGA: PEGA domain; I 46.7 71 0.0015 20.0 5.0 14 114-127 32-45 (71)
81 PF10633 NPCBM_assoc: NPCBM-as 46.4 78 0.0017 20.3 7.8 62 64-125 2-75 (78)
82 PF09912 DUF2141: Uncharacteri 46.0 1E+02 0.0022 21.6 6.6 28 99-126 35-62 (112)
83 PF04571 Lipin_N: lipin, N-ter 45.9 91 0.002 22.2 5.7 44 95-141 28-79 (110)
84 PF09430 DUF2012: Protein of u 45.9 1.1E+02 0.0023 21.8 6.4 37 100-140 25-61 (123)
85 PF13954 PapC_N: PapC N-termin 45.0 25 0.00055 25.8 3.0 20 17-36 25-44 (146)
86 PF00116 COX2: Cytochrome C ox 44.5 1.1E+02 0.0025 21.7 8.0 53 62-122 49-101 (120)
87 PF12863 DUF3821: Domain of un 43.8 1.6E+02 0.0035 23.3 7.7 32 62-93 101-142 (209)
88 PF05738 Cna_B: Cna protein B- 43.1 79 0.0017 19.5 7.8 61 85-147 6-69 (70)
89 cd03866 M14_CPM Peptidase M14 41.0 1.1E+02 0.0023 26.5 6.5 60 65-127 254-350 (376)
90 KOG1690|consensus 39.2 89 0.0019 24.8 5.2 46 81-127 65-110 (215)
91 COG5489 Uncharacterized conser 39.0 66 0.0014 22.7 4.0 40 76-115 63-104 (107)
92 PF09315 DUF1973: Domain of un 37.2 1.9E+02 0.0041 22.2 7.1 85 18-104 74-166 (179)
93 KOG3287|consensus 37.1 1.8E+02 0.0038 23.5 6.7 58 63-123 52-110 (236)
94 PF09394 Inhibitor_I42: Chagas 36.3 68 0.0015 21.1 3.8 22 105-126 53-74 (92)
95 KOG1366|consensus 34.3 1.9E+02 0.0041 29.7 7.8 71 64-134 127-203 (1436)
96 PF13753 SWM_repeat: Putative 33.8 2.7E+02 0.0058 22.9 7.9 27 100-126 193-224 (317)
97 PF03345 DDOST_48kD: Oligosacc 33.7 1.8E+02 0.0038 25.7 6.8 61 63-126 279-354 (423)
98 PF13750 Big_3_3: Bacterial Ig 33.7 2E+02 0.0044 21.5 10.5 109 17-127 10-133 (158)
99 PF13860 FlgD_ig: FlgD Ig-like 33.4 1.4E+02 0.003 19.4 6.8 50 82-133 13-64 (81)
100 PRK10737 FKBP-type peptidyl-pr 33.4 1.3E+02 0.0029 23.5 5.5 56 62-118 51-128 (196)
101 KOG0393|consensus 33.3 45 0.00097 26.3 2.8 32 5-36 16-54 (198)
102 PF02922 CBM_48: Carbohydrate- 32.8 1.3E+02 0.0029 19.1 8.9 59 67-127 11-74 (85)
103 PF03422 CBM_6: Carbohydrate b 31.7 1.7E+02 0.0037 20.1 7.5 46 81-127 60-110 (125)
104 PF07563 DUF1541: Protein of u 31.6 1E+02 0.0022 19.0 3.6 16 105-120 30-45 (53)
105 COG1622 CyoA Heme/copper-type 31.4 2.6E+02 0.0056 22.7 7.0 69 63-139 141-210 (247)
106 PF13715 DUF4480: Domain of un 29.3 1.6E+02 0.0035 18.9 7.2 26 100-128 30-55 (88)
107 PF11763 DIPSY: Cell-wall adhe 29.3 2.2E+02 0.0048 20.6 6.8 44 83-128 50-94 (123)
108 KOG1691|consensus 29.2 2.3E+02 0.005 22.6 6.1 42 81-125 60-101 (210)
109 PF04465 DUF499: Protein of un 29.0 2E+02 0.0044 28.4 6.9 65 63-128 836-903 (1035)
110 PF00254 FKBP_C: FKBP-type pep 28.3 12 0.00027 24.8 -1.0 39 131-169 31-69 (94)
111 MTH00047 COX2 cytochrome c oxi 26.7 3.1E+02 0.0067 21.3 8.5 71 63-142 120-192 (194)
112 PF09264 Sial-lect-inser: Vibr 25.6 2E+02 0.0043 22.6 5.1 105 56-166 44-153 (198)
113 TIGR02148 Fibro_Slime fibro-sl 25.3 96 0.0021 21.3 3.0 23 117-139 64-86 (90)
114 PLN00115 pollen allergen group 25.3 2.6E+02 0.0057 20.1 5.7 64 64-139 21-88 (118)
115 cd07696 IgC_CH3 CH3 domain (th 24.8 2.2E+02 0.0048 19.2 4.9 72 59-139 7-82 (96)
116 PF07691 PA14: PA14 domain; I 24.5 1.5E+02 0.0032 20.8 4.2 21 107-127 50-70 (145)
117 PF12690 BsuPI: Intracellular 24.4 1.7E+02 0.0038 19.3 4.1 17 106-122 63-81 (82)
118 PF04225 OapA: Opacity-associa 24.1 1.8E+02 0.0039 19.3 4.2 46 58-109 38-83 (85)
119 PF14347 DUF4399: Domain of un 24.1 1.4E+02 0.003 20.2 3.6 22 19-40 59-81 (87)
120 cd03858 M14_CP_N-E_like Carbox 23.8 3.1E+02 0.0068 23.3 6.6 43 82-127 299-351 (374)
121 PF08400 phage_tail_N: Prophag 23.2 1.7E+02 0.0036 21.6 4.1 25 102-129 44-68 (134)
122 PF13895 Ig_2: Immunoglobulin 22.8 81 0.0017 19.2 2.2 14 26-39 31-44 (80)
123 PF10980 DUF2787: Protein of u 22.6 1.5E+02 0.0032 21.7 3.7 37 5-43 22-58 (128)
124 PF08770 SoxZ: Sulphur oxidati 22.4 2.7E+02 0.0058 19.2 7.8 25 68-94 69-93 (100)
125 COG3354 FlaG Putative archaeal 22.2 3.3E+02 0.0072 20.5 5.5 44 103-146 66-116 (154)
126 cd03868 M14_CPD_I The first ca 22.1 3.2E+02 0.0069 23.3 6.3 61 64-127 254-349 (372)
127 PF05949 DUF881: Bacterial pro 22.0 58 0.0013 24.2 1.6 21 123-144 81-102 (149)
128 PRK06655 flgD flagellar basal 21.7 4.2E+02 0.009 21.1 6.8 74 64-139 95-176 (225)
129 PF10651 DUF2479: Domain of un 21.7 3.5E+02 0.0076 20.2 6.3 57 84-140 58-124 (170)
130 cd05767 IgC_MHC_II_alpha Class 21.6 2.6E+02 0.0057 18.8 8.2 43 66-109 15-62 (94)
131 PRK09937 stationary phase/star 21.5 2.3E+02 0.0051 18.4 4.2 37 55-91 34-70 (74)
132 cd06245 M14_CPD_III The third 21.0 3.6E+02 0.0077 23.1 6.4 43 83-128 289-340 (363)
133 KOG4278|consensus 21.0 2.1E+02 0.0045 27.3 5.1 42 103-147 172-213 (1157)
134 KOG3342|consensus 20.9 1.2E+02 0.0027 23.1 3.1 28 8-36 67-94 (180)
135 PHA03376 BARF1; Provisional 20.8 1.7E+02 0.0037 23.3 3.9 24 99-122 94-117 (221)
136 smart00720 calpain_III calpain 20.6 1.6E+02 0.0035 21.1 3.7 26 101-126 109-137 (143)
137 KOG3392|consensus 20.6 1.2E+02 0.0026 22.0 2.8 21 16-36 79-99 (147)
138 PHA03290 envelope glycoprotein 20.5 1.9E+02 0.004 24.8 4.4 35 8-43 128-162 (357)
139 PF10342 GPI-anchored: Ser-Thr 20.5 2.5E+02 0.0054 18.1 8.1 65 63-127 9-80 (93)
140 PF08842 Mfa2: Fimbrillin-A as 20.5 3.8E+02 0.0083 20.9 6.2 65 62-126 3-77 (283)
141 PF07965 Integrin_B_tail: Inte 20.4 2.6E+02 0.0057 18.6 4.5 30 95-124 47-77 (87)
142 PF01067 Calpain_III: Calpain 20.3 1.4E+02 0.0029 21.3 3.3 26 101-126 113-141 (147)
143 PF02792 Mago_nashi: Mago nash 20.3 1.3E+02 0.0027 22.5 2.9 20 16-35 75-94 (143)
144 PF00927 Transglut_C: Transglu 20.1 1.9E+02 0.0041 19.6 3.8 20 120-139 2-21 (107)
No 1
>KOG0518|consensus
Probab=99.96 E-value=2.3e-28 Score=221.82 Aligned_cols=154 Identities=19% Similarity=0.246 Sum_probs=141.0
Q ss_pred ccCCCCceeEEEEEEECCEEcCCCCeEEEecCC-CCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCC
Q psy15726 15 APSWISYGLVQARTLREETSFRSCPMEVPVVDP-AVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGA 93 (171)
Q Consensus 15 ~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~-~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~ 93 (171)
+|.|+|+|+|.|+|+.+|+||++|||.+.|... .|+++++++|.|+.....-++++|.||++++|.+-|++.+++|+
T Consensus 722 tF~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~agyGgLsi~~~Gps-- 799 (1113)
T KOG0518|consen 722 TFTPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKAGYGGLSISVQGPS-- 799 (1113)
T ss_pred EECCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccCCCCceEEEEeCCc--
Confidence 489999999999999999999999999999876 57999999999998876669999999999999999999999999
Q ss_pred ccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCC----ceEEecCCcccEEcCCCEEEEEEccC
Q psy15726 94 AVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDAS----KVKIEHKGLSNIVVNDPISFKCKSTE 169 (171)
Q Consensus 94 ~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~s----kv~~~G~gl~~~~~g~~~~F~Vd~~~ 169 (171)
.++..+.|+.||+++|+|+|+++|.|.|+|+|+++||+||||.+++.+++ -+..+..++.-+.+|..++|.+++.+
T Consensus 800 kvd~~~~d~~dGt~kV~ytPtepG~Y~I~i~Fad~~I~gSPftVkv~~~~~vvesi~~~~~~~~va~~g~~~~l~lk~~~ 879 (1113)
T KOG0518|consen 800 KVDLNVEDREDGTCKVSYTPTEPGTYIINIKFADEHIKGSPFTVKVTGESRVVESITRDREAPSVARVGHTCSLDLKATE 879 (1113)
T ss_pred ccccceeecCCCeEEEEEeCCCCceEEEEEEEcCccCCCCceEEEecCCeeEeeeeeecccccceecccceeeeeeecCC
Confidence 46788999999999999999999999999999999999999999999988 34456666667999999999998875
Q ss_pred C
Q psy15726 170 E 170 (171)
Q Consensus 170 a 170 (171)
+
T Consensus 880 ~ 880 (1113)
T KOG0518|consen 880 A 880 (1113)
T ss_pred C
Confidence 4
No 2
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.95 E-value=1.5e-27 Score=165.98 Aligned_cols=92 Identities=32% Similarity=0.496 Sum_probs=89.0
Q ss_pred CCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726 50 GREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARP 129 (171)
Q Consensus 50 ~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~ 129 (171)
+++|+++|+||+.+.+|++++|+|+++++|+++|+|.|.+|+|+.+++++.|++||+|.|+|+|.++|.|+|+|+|+|+|
T Consensus 1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~g~~ 80 (93)
T smart00557 1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFGGEH 80 (93)
T ss_pred CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEECCEE
Confidence 47899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCeEEEecC
Q psy15726 130 VRGSPYLCQVYD 141 (171)
Q Consensus 130 I~gSPF~v~V~d 141 (171)
|+||||.++|.|
T Consensus 81 I~gSPF~v~V~~ 92 (93)
T smart00557 81 IPGSPFTVKVGP 92 (93)
T ss_pred CCCCCEEEEEeC
Confidence 999999999865
No 3
>KOG0518|consensus
Probab=99.95 E-value=2e-27 Score=215.76 Aligned_cols=149 Identities=21% Similarity=0.242 Sum_probs=135.6
Q ss_pred ccCCCCceeEEEEEEECCEEcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726 15 APSWISYGLVQARTLREETSFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAA 94 (171)
Q Consensus 15 ~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~ 94 (171)
+|.|..||.|+|.|+|+|+||++|||...|... + ++.+ ....||...++.|.-+.++...++++|.+|+|++
T Consensus 635 sy~P~vpG~y~i~V~~~~k~vpgSPF~~~Vt~~-~------s~r~-s~~~Vgs~~ev~in~~~~~~s~ltaeI~~PsGn~ 706 (1113)
T KOG0518|consen 635 SYTPTVPGVYKIEVKFGDKHVPGSPFVALVTGD-E------SIRT-SHLSVGSIHEVTINITEADLSVLTAEIVDPSGNP 706 (1113)
T ss_pred ccCCCCCceEEEEEEECCeeCCCCCceEEEecc-c------cccc-ccceecceeeEEeeccccCCceeEEEEECCCCCc
Confidence 489999999999999999999999999998752 1 1111 2478999999999988888899999999999999
Q ss_pred cceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEe----cCCCceEEecCCcccEEcCCCEEEEEEccCC
Q psy15726 95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQV----YDASKVKIEHKGLSNIVVNDPISFKCKSTEE 170 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V----~d~skv~~~G~gl~~~~~g~~~~F~Vd~~~a 170 (171)
.+|.++...+|.|.++|+|+++|+|+|+|+++|+||+||||.|.| .|++|++++|.||..+..-||++|.||+|+|
T Consensus 707 ~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~a 786 (1113)
T KOG0518|consen 707 EPCLVRRLPNGHDGITFTPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKA 786 (1113)
T ss_pred cceeeEecCCCceeEEECCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccC
Confidence 999999999999999999999999999999999999999999999 5799999999999998777999999999998
Q ss_pred C
Q psy15726 171 P 171 (171)
Q Consensus 171 ~ 171 (171)
+
T Consensus 787 g 787 (1113)
T KOG0518|consen 787 G 787 (1113)
T ss_pred C
Confidence 5
No 4
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.91 E-value=1.1e-23 Score=147.42 Aligned_cols=89 Identities=35% Similarity=0.524 Sum_probs=82.3
Q ss_pred CCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCC-------ceEEEEECCCCC----ccceEEEEcCCCEEEEEEEcCCC
Q psy15726 48 AVGREPSGSGLGLYQARSGIVNSFTLETCGVASS-------EFDVIVTSPQGA----AVPVRCYQQKFANLLAEFTPTTT 116 (171)
Q Consensus 48 ~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~-------~l~v~i~~p~g~----~~~~~v~~~~dg~y~v~y~P~~~ 116 (171)
.||++|.++|+||+.+.+|++++|+|+++++++. .++|.|.+|++. .+++++.+++||+|.++|+|.++
T Consensus 2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~ 81 (101)
T PF00630_consen 2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEP 81 (101)
T ss_dssp CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSS
T ss_pred CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCcc
Confidence 4789999999999999999999999999998543 678999999997 68899999999999999999999
Q ss_pred eeEEEEEEECCEEcCCCCeE
Q psy15726 117 GVYKIDVLQGARPVRGSPYL 136 (171)
Q Consensus 117 G~~~i~V~~~g~~I~gSPF~ 136 (171)
|.|+|+|+|+|++|+||||+
T Consensus 82 G~y~i~V~~~g~~I~gSPf~ 101 (101)
T PF00630_consen 82 GKYKISVKINGQPIPGSPFT 101 (101)
T ss_dssp EEEEEEEEESSEB-TTSSEE
T ss_pred EeEEEEEEECCEECcCCCcC
Confidence 99999999999999999995
No 5
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=98.62 E-value=4.2e-08 Score=67.85 Aligned_cols=31 Identities=13% Similarity=-0.040 Sum_probs=29.6
Q ss_pred cCCCCceeEEEEEEECCEEcCCCCeEEEecC
Q psy15726 16 PSWISYGLVQARTLREETSFRSCPMEVPVVD 46 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~ 46 (171)
|.|.++|.|.|+|+|+|++|++|||.+.|.+
T Consensus 62 y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~~ 92 (93)
T smart00557 62 YTPTEPGDYTVTVKFGGEHIPGSPFTVKVGP 92 (93)
T ss_pred EEeCCCEeEEEEEEECCEECCCCCEEEEEeC
Confidence 7899999999999999999999999999875
No 6
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=98.19 E-value=1e-06 Score=61.05 Aligned_cols=26 Identities=8% Similarity=-0.004 Sum_probs=24.1
Q ss_pred cCCCCceeEEEEEEECCEEcCCCCeE
Q psy15726 16 PSWISYGLVQARTLREETSFRSCPME 41 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I~gSPf~ 41 (171)
|.|.++|.|+|+|+|+|++|++|||+
T Consensus 76 y~p~~~G~y~i~V~~~g~~I~gSPf~ 101 (101)
T PF00630_consen 76 YTPTEPGKYKISVKINGQPIPGSPFT 101 (101)
T ss_dssp EEESSSEEEEEEEEESSEB-TTSSEE
T ss_pred EEeCccEeEEEEEEECCEECcCCCcC
Confidence 78999999999999999999999995
No 7
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=97.17 E-value=0.015 Score=53.15 Aligned_cols=147 Identities=12% Similarity=0.076 Sum_probs=86.5
Q ss_pred cCCCCceeEEEEEEECCEEcCCC-----CeEEEecCC-CCC---CCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEE
Q psy15726 16 PSWISYGLVQARTLREETSFRSC-----PMEVPVVDP-AVG---REPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVI 86 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I~gS-----Pf~v~V~~~-~d~---~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~ 86 (171)
|+|.++|.|+|.|..-+..-.+. -....+... ..| ..+.+. .-+...+|++..+.+.+.+..+-..+..
T Consensus 344 w~P~~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d--~~~~~lvG~~i~i~v~a~gg~~~lY~f~ 421 (667)
T PRK14081 344 WKPKIAGKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILD--KGKHILKGEEIKIRVIAEGGTNLRYSFI 421 (667)
T ss_pred EeeCCCceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEEC--CCCceEeCCeEEEEEEecCCCeEEEEEE
Confidence 79999999999998766532211 112223221 112 223332 2246889999999999887643333333
Q ss_pred EECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCEEcC-----CCCeEEEecCCCceEEec---CCcccEEc
Q psy15726 87 VTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGARPVR-----GSPYLCQVYDASKVKIEH---KGLSNIVV 157 (171)
Q Consensus 87 i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~~I~-----gSPF~v~V~d~skv~~~G---~gl~~~~~ 157 (171)
|.. +|.. +.+.+= -.-.+.|+|.++|.|+|.|...+.-=. -.=+.+.|.++..+...- +-.....+
T Consensus 422 ik~-ng~~----ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~P~~i~~il~~~~~~~~v 496 (667)
T PRK14081 422 IKK-DGKE----EEKIDYGKNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYIPAEIDYILLPVKEYYLV 496 (667)
T ss_pred EEE-CCEE----EEEeecccccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEeccCceeeeeEEecccccEEe
Confidence 333 4542 222211 245899999999999999988653211 111445555544332221 22334799
Q ss_pred CCCEEEEEEccC
Q psy15726 158 NDPISFKCKSTE 169 (171)
Q Consensus 158 g~~~~F~Vd~~~ 169 (171)
|++++|++.+.+
T Consensus 497 g~~i~~~~~~~~ 508 (667)
T PRK14081 497 GDDIEIEVIIQN 508 (667)
T ss_pred CCEEEEEEEEeC
Confidence 999999998765
No 8
>KOG1428|consensus
Probab=96.92 E-value=0.009 Score=58.54 Aligned_cols=46 Identities=20% Similarity=0.244 Sum_probs=38.9
Q ss_pred EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCCc
Q psy15726 98 RCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDASK 144 (171)
Q Consensus 98 ~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~sk 144 (171)
-+.-++||+|-.+|||...|.|+|+++++|..|.- -+.++|.|+-|
T Consensus 2099 ~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd~Pk 2144 (3738)
T KOG1428|consen 2099 LIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKDPPK 2144 (3738)
T ss_pred eEEecCCCceecccCCccceeEEEEEEecceeecc-ceEEEeccCCC
Confidence 35567889999999999999999999999999974 38888866543
No 9
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=96.89 E-value=0.06 Score=49.38 Aligned_cols=148 Identities=14% Similarity=-0.007 Sum_probs=83.4
Q ss_pred cCCCCceeEEEEEEECCEEc---CCC--CeEEEecCCCCCCCc-EEEcCCCcceEeCCeEEEEEEeCCCcCCceEEE-EE
Q psy15726 16 PSWISYGLVQARTLREETSF---RSC--PMEVPVVDPAVGREP-SGSGLGLYQARSGIVNSFTLETCGVASSEFDVI-VT 88 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I---~gS--Pf~v~V~~~~d~~~v-~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~-i~ 88 (171)
|.|..+|.|+|.|..-+.-= -++ =+.+.|.... |... .+-=+-.+...+|.+.+|++.+.+.. ++..+ ..
T Consensus 442 f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~-P~~i~~il~~~~~~~~vg~~i~~~~~~~~~k--~v~y~y~~ 518 (667)
T PRK14081 442 FIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYI-PAEIDYILLPVKEYYLVGDDIEIEVIIQNTK--DVLIKYIL 518 (667)
T ss_pred EEECCCeeEEEEEEEecccCchhcccceEEEEEEeccC-ceeeeeEEecccccEEeCCEEEEEEEEeCCC--eEEEEEEE
Confidence 78999999999998654311 111 1233443321 1111 11112234478999999999988743 22222 33
Q ss_pred CCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc---C--CCCeEEEecCCCceEEecCCcc--cEEcCCCE
Q psy15726 89 SPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV---R--GSPYLCQVYDASKVKIEHKGLS--NIVVNDPI 161 (171)
Q Consensus 89 ~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I---~--gSPF~v~V~d~skv~~~G~gl~--~~~~g~~~ 161 (171)
.-+|..+ .+++=. -.-..+|+|..+|.|+|.|+..+.-= - -.=+.+.|.+|..++-.-.-.. ...+|++.
T Consensus 519 ~~NG~~v--~~t~Ys-~~~~ysf~P~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~e~~pi~nt~~~~~~~~~~~n~~~ 595 (667)
T PRK14081 519 KINGHKV--EETDYI-KNKKYKFIPKCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVREALPITNTKIKTSKKKFKCNEEV 595 (667)
T ss_pred EECCEEE--EEeecc-ccceEEEeecCCceEEEEEEEcccccccccccceEEEEEEcCCCCceeeEEEeecceEEcCCeE
Confidence 4455432 122211 13367889999999999999765320 0 1114445556544332221111 37889999
Q ss_pred EEEEEccC
Q psy15726 162 SFKCKSTE 169 (171)
Q Consensus 162 ~F~Vd~~~ 169 (171)
+|++++++
T Consensus 596 t~~~~~~g 603 (667)
T PRK14081 596 TFSVKSEG 603 (667)
T ss_pred EEEEEccC
Confidence 99999875
No 10
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=96.68 E-value=0.01 Score=40.80 Aligned_cols=64 Identities=22% Similarity=0.288 Sum_probs=47.7
Q ss_pred eEeCCeEEEEEEeCCCc-------CCceEEEEECCCCCccceEEE--EcCCCEEEEEEE-cCC--CeeEEEEEEEC
Q psy15726 63 ARSGIVNSFTLETCGVA-------SSEFDVIVTSPQGAAVPVRCY--QQKFANLLAEFT-PTT--TGVYKIDVLQG 126 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag-------~~~l~v~i~~p~g~~~~~~v~--~~~dg~y~v~y~-P~~--~G~~~i~V~~~ 126 (171)
.+.|+.+.|.+-.++.. ...+.+.|.+|+|+.+.-... .+.+|.+..+|. |.. .|.|+|.+..+
T Consensus 11 YrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 11 YRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp E-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred cCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 56788888888865443 257889999999987654433 357799999997 665 49999999994
No 11
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=95.82 E-value=0.21 Score=42.87 Aligned_cols=110 Identities=8% Similarity=0.043 Sum_probs=67.3
Q ss_pred cCCCCceeEEEEEEECCE----------EcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCC----CcCC
Q psy15726 16 PSWISYGLVQARTLREET----------SFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCG----VASS 81 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~----------~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~----ag~~ 81 (171)
..-..+|.|.+.+.-++. .|...|+.+.+..+. . .+.+..+.|.... .+.-
T Consensus 178 ~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~-------------~--~~~~h~l~v~~d~~~i~~~s~ 242 (374)
T TIGR03503 178 NLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE-------------D--ESGAHQLMVDADAGHIDPGSL 242 (374)
T ss_pred eccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC-------------C--CCCceEEEEEcccceeccccE
Confidence 346789999999987763 223344444333221 1 1334455555432 2445
Q ss_pred ceEEEEECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecC
Q psy15726 82 EFDVIVTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYD 141 (171)
Q Consensus 82 ~l~v~i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d 141 (171)
.+++++.+|+|. ......+... .+...-=...+.|.|+++.+.-+..+.|.||.+...+
T Consensus 243 ~~~~~~~~P~g~-~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~lpe 302 (374)
T TIGR03503 243 VIHGELVFPNGQ-IQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTLPE 302 (374)
T ss_pred EEEEEEECCCCc-eEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEcCC
Confidence 667788999996 4433333222 2222222344899999999999999999999998843
No 12
>PF06312 Neurexophilin: Neurexophilin
Probab=95.80 E-value=0.041 Score=43.91 Aligned_cols=45 Identities=16% Similarity=0.161 Sum_probs=38.7
Q ss_pred CceEEEEECCCC-CccceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726 81 SEFDVIVTSPQG-AAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQ 125 (171)
Q Consensus 81 ~~l~v~i~~p~g-~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~ 125 (171)
|++.++|.+++- .-+..+|+|.+||||+|.|+-.-+|.-.|.|.+
T Consensus 70 GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L 115 (219)
T PF06312_consen 70 GDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL 115 (219)
T ss_pred CCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence 777999998874 336789999999999999999999998888875
No 13
>PF13115 YtkA: YtkA-like
Probab=93.67 E-value=0.64 Score=30.92 Aligned_cols=43 Identities=21% Similarity=0.247 Sum_probs=31.0
Q ss_pred CceEEEEECCC--CC---ccceEEEEcCCCEEEEEEEcCCCeeEEEEE
Q psy15726 81 SEFDVIVTSPQ--GA---AVPVRCYQQKFANLLAEFTPTTTGVYKIDV 123 (171)
Q Consensus 81 ~~l~v~i~~p~--g~---~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V 123 (171)
+.+++.+..|. |. ...+.....++|.|.++.....+|.|.|.|
T Consensus 38 a~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m~G~W~i~v 85 (86)
T PF13115_consen 38 ADVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSMAGTWQITV 85 (86)
T ss_pred CEEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCCCeeEEEEE
Confidence 45666666664 21 233445556789999999999999999987
No 14
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=93.43 E-value=0.67 Score=46.83 Aligned_cols=106 Identities=14% Similarity=0.165 Sum_probs=75.1
Q ss_pred eEeCCeEEEEEEeCCCc------CCceEEEEECCCCCccceE-EEEcCCCEEEEEEEcCC---CeeEEEEEEECCE-EcC
Q psy15726 63 ARSGIVNSFTLETCGVA------SSEFDVIVTSPQGAAVPVR-CYQQKFANLLAEFTPTT---TGVYKIDVLQGAR-PVR 131 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag------~~~l~v~i~~p~g~~~~~~-v~~~~dg~y~v~y~P~~---~G~~~i~V~~~g~-~I~ 131 (171)
.+.|+..++.+-+|+.. ...+++.+.+|+|..+.-. +.....|.|+++|.-.+ .|.|+|.+.+.|. .+-
T Consensus 405 YRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~l~~~~~~~~~~~ 484 (1621)
T COG2373 405 YRPGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYTLELYTGGKSAVI 484 (1621)
T ss_pred CCCCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEEEEEEeCCcccee
Confidence 56799999999888643 3578999999999765543 44457799999997443 5999999999986 666
Q ss_pred CCCeEEEecCCCceEEecCCccc-EEcCCCEEEEEEcc
Q psy15726 132 GSPYLCQVYDASKVKIEHKGLSN-IVVNDPISFKCKST 168 (171)
Q Consensus 132 gSPF~v~V~d~skv~~~G~gl~~-~~~g~~~~F~Vd~~ 168 (171)
...|.|..+-+.+.++.=..-+. ...|++..++|+.+
T Consensus 485 s~~f~V~df~p~r~~i~l~~~k~~~~~g~~v~~~v~~~ 522 (1621)
T COG2373 485 SMSFRVEDFIPDRFKINLTLDKTEWVPGKDVKIKVDLR 522 (1621)
T ss_pred eeeEEhhHhCCceEEEecccccccccCCCcEEEEEEEE
Confidence 67788876555544432222112 47777777777654
No 15
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=93.30 E-value=1.1 Score=31.07 Aligned_cols=63 Identities=16% Similarity=0.181 Sum_probs=37.4
Q ss_pred CeEEEEEEeCCCcC---CceEEEE--ECCCCCccce--EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726 67 IVNSFTLETCGVAS---SEFDVIV--TSPQGAAVPV--RCYQQKFANLLAEFTPTTTGVYKIDVLQGARP 129 (171)
Q Consensus 67 ~~~~F~V~~~~ag~---~~l~v~i--~~p~g~~~~~--~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~ 129 (171)
....+++..+|+.+ ....|.+ ....+...+. ...-+.+|.+.++|+...+|.|.|...+++..
T Consensus 23 ~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~ 92 (100)
T PF02369_consen 23 DTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGS 92 (100)
T ss_dssp S-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred CcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence 44555555556543 3344555 3333333233 35566789999999999999999999998753
No 16
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=92.50 E-value=1.6 Score=28.93 Aligned_cols=57 Identities=16% Similarity=0.173 Sum_probs=33.6
Q ss_pred eEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEE-------cCCCeeEEEEEEE
Q psy15726 68 VNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFT-------PTTTGVYKIDVLQ 125 (171)
Q Consensus 68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~-------P~~~G~~~i~V~~ 125 (171)
+..|.+..... ...+.+.|.+.+|+.+.. .+.....|.+.+.|- |...|.|++.|..
T Consensus 13 ~~~~~~~l~~~-a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a 77 (81)
T PF13860_consen 13 KGSIEYTLPED-ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTA 77 (81)
T ss_dssp EEEEEEEECSS-CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEE
T ss_pred EEEEEEeCCCc-ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEE
Confidence 45555555432 246788888888876543 333445577777777 6667777777653
No 17
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=91.44 E-value=12 Score=39.95 Aligned_cols=144 Identities=13% Similarity=0.051 Sum_probs=85.0
Q ss_pred Eeeccc--cCCCCceeEEEEEE-ECCEEcCCCCeEEEecCCCCCCCcEEE-cCCCcceEeCCeEEEEEEeCCCcCCceEE
Q psy15726 10 LLSEKA--PSWISYGLVQARTL-REETSFRSCPMEVPVVDPAVGREPSGS-GLGLYQARSGIVNSFTLETCGVASSEFDV 85 (171)
Q Consensus 10 ~~~~~~--~~P~~~G~~~i~V~-~~g~~I~gSPf~v~V~~~~d~~~v~v~-G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v 85 (171)
.+++++ |+|.++|.|+|.|+ +|...-..-+-.+.|..+.. ...++ ++ ..+.+|+...|.+...- ++.++..
T Consensus 2001 ~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~~i~VQ~~i~--~~~L~~~p--~~~~~n~~v~f~~~~~~-~~~~v~~ 2075 (2740)
T TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALM--DAALQAGP--QDCFTNKMAQFEAATSP-KPNFMAC 2075 (2740)
T ss_pred eecCCceEEccCCCcEEEEEEEEEcccCcccccEEEEEEehee--eEEEeecC--cceecCceEEEEEEEcc-CCCcEEE
Confidence 355665 79999999999997 77654445566777776642 22333 34 35778999999998743 3345666
Q ss_pred EEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC-CEEcCCCCeEEEe----cCCCceEEecCCcccEEcCCC
Q psy15726 86 IVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG-ARPVRGSPYLCQV----YDASKVKIEHKGLSNIVVNDP 160 (171)
Q Consensus 86 ~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~-g~~I~gSPF~v~V----~d~skv~~~G~gl~~~~~g~~ 160 (171)
...=-||.... .+ ...+++++-..+|+|.+.|... .....--=-.+.| +..-.|.+-|+.-....-.+|
T Consensus 2076 ~WDFGDGt~~~--~T----~~~~~sHtY~r~G~Y~V~V~a~N~VS~~~a~~~i~V~vl~C~~P~V~i~~~~~~~i~RSq~ 2149 (2740)
T TIGR00864 2076 HWDFGDGSAGQ--DT----DEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQP 2149 (2740)
T ss_pred EEEecCCCCcc--cc----CCceeEEEecCCccEEEEEEEEcceeeeeeEEEEEEEEecCCCCCcEeccCceeEEEecCc
Confidence 65544554221 11 2467888888889999998843 3221111112222 344445555554333444555
Q ss_pred EEEE
Q psy15726 161 ISFK 164 (171)
Q Consensus 161 ~~F~ 164 (171)
+.|.
T Consensus 2150 ~~lE 2153 (2740)
T TIGR00864 2150 NLLE 2153 (2740)
T ss_pred ceEE
Confidence 5554
No 18
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=89.71 E-value=0.27 Score=33.97 Aligned_cols=53 Identities=26% Similarity=0.416 Sum_probs=25.5
Q ss_pred EEEEECCCCCccceE---EEEcCCCEEEEEEEc-CCCeeEEEEE---EECCEEcCCCCeEEE
Q psy15726 84 DVIVTSPQGAAVPVR---CYQQKFANLLAEFTP-TTTGVYKIDV---LQGARPVRGSPYLCQ 138 (171)
Q Consensus 84 ~v~i~~p~g~~~~~~---v~~~~dg~y~v~y~P-~~~G~~~i~V---~~~g~~I~gSPF~v~ 138 (171)
.+.|.+++|+.++.. +.+ +...+.+.-.+ ..+|.|.|.- .-+|.++.|+ |.+.
T Consensus 36 ~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~ 95 (97)
T PF04234_consen 36 SVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFT 95 (97)
T ss_dssp EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEE
T ss_pred EEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEE
Confidence 344555555443322 222 22455555555 5666666652 3466777666 4443
No 19
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=89.45 E-value=2.8 Score=28.89 Aligned_cols=59 Identities=12% Similarity=0.192 Sum_probs=33.3
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+..|++..+.+.-.|... -++.+.+ ...+..-..+.+.+++|+|.++|+|.+.=.+..
T Consensus 38 ~v~~G~~v~l~~~N~~~~~--h~~~i~~-----~~~~~~l~~g~~~~~~f~~~~~G~y~~~C~~~~ 96 (104)
T PF13473_consen 38 TVKAGQPVTLTFTNNDSRP--HEFVIPD-----LGISKVLPPGETATVTFTPLKPGEYEFYCTMHP 96 (104)
T ss_dssp EEETTCEEEEEEEE-SSS---EEEEEGG-----GTEEEEE-TT-EEEEEEEE-S-EEEEEB-SSS-
T ss_pred EEcCCCeEEEEEEECCCCc--EEEEECC-----CceEEEECCCCEEEEEEcCCCCEEEEEEcCCCC
Confidence 4778999999998776654 3333433 122222223468899999999999987654433
No 20
>PF11896 DUF3416: Domain of unknown function (DUF3416); InterPro: IPR021828 This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=89.08 E-value=4.1 Score=31.76 Aligned_cols=63 Identities=16% Similarity=0.112 Sum_probs=38.1
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEE--EECCCCCc-cceEEEEcCCCEEEEEEEcCCCeeEEEEEE
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVI--VTSPQGAA-VPVRCYQQKFANLLAEFTPTTTGVYKIDVL 124 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~--i~~p~g~~-~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~ 124 (171)
+..+|+...+.-+.=..|-+.+.+. ...|++.. -.+.....+|..|..+|+|...|.|...|.
T Consensus 22 KrvvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~Ve 87 (187)
T PF11896_consen 22 KRVVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVE 87 (187)
T ss_dssp EEETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEE
T ss_pred eeecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEE
Confidence 3678998888877755566666655 55777644 234455668889999999999999988874
No 21
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=88.94 E-value=12 Score=35.76 Aligned_cols=108 Identities=10% Similarity=0.015 Sum_probs=66.5
Q ss_pred CCCceeEEEEEEECCEEcCCCCeEEEecCC---CCCCCcEEEcCCCc--ceEeCCeEEEEEEeCCCc----CCceEEEEE
Q psy15726 18 WISYGLVQARTLREETSFRSCPMEVPVVDP---AVGREPSGSGLGLY--QARSGIVNSFTLETCGVA----SSEFDVIVT 88 (171)
Q Consensus 18 P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~---~d~~~v~v~G~Gl~--~~~vg~~~~F~V~~~~ag----~~~l~v~i~ 88 (171)
.-+.|.++.+|.-.+ ...+..+.|... .+..-+++.. -+. ...-..+..+..+.++.. +..+.+.|+
T Consensus 559 ~a~~G~W~y~l~~~~---~~q~i~vtVtsra~s~~~~pi~~~a-~~~~~~~~~psp~ivya~V~kG~~pVlgA~Vta~i~ 634 (863)
T TIGR00868 559 TAKVGTWTYSLQASA---NPQTLTLTVTSRARSPTLPPVTVTA-KMNKDTAKFPSPMIVYAKISQGFLPVLGANVTALIE 634 (863)
T ss_pred CCccceeEEEEeccC---CCceEEEEEEeccCCCCCCCEEEEE-EEccccccCCCceEEEEEEeeCCceeeccEEEEEEE
Confidence 457888888874331 123345555522 1122233321 111 122344444444444321 367888999
Q ss_pred CCCCCccceEEEEcC--------CCEEEEEEEc-CCCeeEEEEEEECCEE
Q psy15726 89 SPQGAAVPVRCYQQK--------FANLLAEFTP-TTTGVYKIDVLQGARP 129 (171)
Q Consensus 89 ~p~g~~~~~~v~~~~--------dg~y~v~y~P-~~~G~~~i~V~~~g~~ 129 (171)
.|+|+.+..++.|++ ||.|.--|+. ...|.|.|.|..+|..
T Consensus 635 ~~~g~~~~l~l~DnGaGaD~~knDGIYS~yft~f~~nGrYsl~V~v~~~~ 684 (863)
T TIGR00868 635 SENGHTVTLELLDNGAGADTVKNDGIYSRYFTAYDGNGRYSLKVRALGGV 684 (863)
T ss_pred CCCCCEEEEEEeecCCCCCccccCceeeeeeecccCCceEEEEEEEeCCC
Confidence 999998888888874 6999999875 4689999999988754
No 22
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=87.85 E-value=3.7 Score=27.32 Aligned_cols=66 Identities=15% Similarity=0.156 Sum_probs=37.6
Q ss_pred ceEeCCeEEEEEEeCCCcC---CceEEEEECCCCCccceEEEEcCC---CEEEEEEEcCCCeeEEEEEEECC
Q psy15726 62 QARSGIVNSFTLETCGVAS---SEFDVIVTSPQGAAVPVRCYQQKF---ANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g~~~~~~v~~~~d---g~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+..|++..|++.-+..|. ..+.+.+...+.......+..... .++.+.|++..+|.|.|.+.++.
T Consensus 14 ~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~ 85 (101)
T PF07705_consen 14 NVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP 85 (101)
T ss_dssp EEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred cccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence 4667888888877765544 345566554332211222322222 35567778889999999998864
No 23
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=87.66 E-value=3.9 Score=27.88 Aligned_cols=81 Identities=17% Similarity=0.199 Sum_probs=53.3
Q ss_pred ceEeCCeEEEEEEe---CCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCC--CeeEEEEEEECCE--EcCCCC
Q psy15726 62 QARSGIVNSFTLET---CGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTT--TGVYKIDVLQGAR--PVRGSP 134 (171)
Q Consensus 62 ~~~vg~~~~F~V~~---~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~--~G~~~i~V~~~g~--~I~gSP 134 (171)
.+.+|+...|+|.. .+..+..+...+...+|..+|..+.=.. .+..+.-+|.. .|.|.|.|.-.+. .-..-.
T Consensus 8 ~a~~g~~f~~~i~~~tF~d~d~~~lty~~~~~~~~~lP~Wl~fd~-~~~~~~GtP~~~~~g~~~i~v~a~D~~g~~~~~~ 86 (97)
T smart00736 8 TATEGESFSYTIPSSTFTDADGDTLTYSATLSDGSALPSWLSFDS-DTGTLSGTPTNSDVGSLSLKVTATDSSGASASDT 86 (97)
T ss_pred EEeCCCeEEEEcChhceECCCCCeEEEEEEeCCCCCCCCeEEEeC-CCCEEEEECCCCCCcEEEEEEEEEECCCCEEEEE
Confidence 47788888888764 2344567777777667767787765332 24556668887 8999988875432 223346
Q ss_pred eEEEecCCC
Q psy15726 135 YLCQVYDAS 143 (171)
Q Consensus 135 F~v~V~d~s 143 (171)
|.+.|.+++
T Consensus 87 f~i~V~~~~ 95 (97)
T smart00736 87 FTITVVNTN 95 (97)
T ss_pred EEEEEeCCC
Confidence 888776543
No 24
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=87.05 E-value=5.3 Score=31.99 Aligned_cols=59 Identities=19% Similarity=0.222 Sum_probs=39.6
Q ss_pred CCeEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEE-------EcCCCeeEEEEEEE
Q psy15726 66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEF-------TPTTTGVYKIDVLQ 125 (171)
Q Consensus 66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y-------~P~~~G~~~i~V~~ 125 (171)
|....|.++..+. ...+.+.|.+.+|+.+.. .+-....|.+.++| .+..+|.|++.|..
T Consensus 112 ~~~~~~~~~l~~~-a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A 178 (225)
T PRK06655 112 GGTTPFGVELPSA-ADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASA 178 (225)
T ss_pred CCceEEEEEcCCC-CcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEE
Confidence 3466777775543 246889999999987653 33345567887777 24557899888853
No 25
>PRK10301 hypothetical protein; Provisional
Probab=86.62 E-value=2.3 Score=30.87 Aligned_cols=25 Identities=28% Similarity=0.534 Sum_probs=15.8
Q ss_pred CCCeeEEEE---EEECCEEcCCCCeEEEe
Q psy15726 114 TTTGVYKID---VLQGARPVRGSPYLCQV 139 (171)
Q Consensus 114 ~~~G~~~i~---V~~~g~~I~gSPF~v~V 139 (171)
..+|.|+|. +.-+|.++.|+ |.+.|
T Consensus 96 L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V 123 (124)
T PRK10301 96 LKPGTYTVDWHVVSVDGHKTKGH-YTFSV 123 (124)
T ss_pred CCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence 456666665 34578888887 55544
No 26
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=85.37 E-value=8.7 Score=25.59 Aligned_cols=60 Identities=15% Similarity=0.072 Sum_probs=40.5
Q ss_pred CeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc
Q psy15726 67 IVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV 130 (171)
Q Consensus 67 ~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I 130 (171)
..++|.+-+..| ..+.+.+...+. ....+...++|.|+++-.+..+|.|.-.+..+|..+
T Consensus 6 ~~v~F~vwAP~A--~~V~L~~~~~~~--~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~ 65 (85)
T cd02858 6 RTVTFRLFAPKA--NEVQVRGSWGGA--GSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRV 65 (85)
T ss_pred CcEEEEEECCCC--CEEEEEeecCCC--ccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEe
Confidence 457888888776 355555544221 223455567899999885577888999999998443
No 27
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=84.63 E-value=7.1 Score=28.55 Aligned_cols=72 Identities=26% Similarity=0.332 Sum_probs=40.1
Q ss_pred CeEEEEEEeCCCcC-CceEEEEECCCCCccceE---EEEcCCCEEEEEE-EcCCCeeEEEE---EEECCEEcCCCCeEEE
Q psy15726 67 IVNSFTLETCGVAS-SEFDVIVTSPQGAAVPVR---CYQQKFANLLAEF-TPTTTGVYKID---VLQGARPVRGSPYLCQ 138 (171)
Q Consensus 67 ~~~~F~V~~~~ag~-~~l~v~i~~p~g~~~~~~---v~~~~dg~y~v~y-~P~~~G~~~i~---V~~~g~~I~gSPF~v~ 138 (171)
-|..|++.-.+.=. +-..+.+.+|+|..+... ..+.++-.++|.- .|.++|.|.+. |.-+|..+.|| +.+.
T Consensus 45 aP~~i~L~Fse~ve~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~-~sFs 123 (127)
T COG2372 45 APAAITLEFSEGVEPGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGS-ISFS 123 (127)
T ss_pred CceeEEEecCCccCCCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccE-EEEE
Confidence 35556665554322 224567788887654332 2222222255544 35667788776 55688888887 4444
Q ss_pred e
Q psy15726 139 V 139 (171)
Q Consensus 139 V 139 (171)
|
T Consensus 124 V 124 (127)
T COG2372 124 V 124 (127)
T ss_pred e
Confidence 4
No 28
>PF05404 TRAP-delta: Translocon-associated protein, delta subunit precursor (TRAP-delta); InterPro: IPR008855 This family consists of several eukaryotic translocon-associated protein, delta subunit precursors (TRAP-delta or SSR-delta). The exact function of this protein is unknown [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=83.49 E-value=5.8 Score=30.38 Aligned_cols=54 Identities=17% Similarity=0.356 Sum_probs=36.5
Q ss_pred CeEEEEEEeCC-CcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcC----CCeeEEEEEE
Q psy15726 67 IVNSFTLETCG-VASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPT----TTGVYKIDVL 124 (171)
Q Consensus 67 ~~~~F~V~~~~-ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~----~~G~~~i~V~ 124 (171)
=-++|++.|++ +..-.|.+.+. |+..|+..... .+.|+|+|+-. ..|.|.|.+.
T Consensus 43 fi~EFsl~C~n~~~~~~LyAeV~---Gk~~PVar~~~-~nkYQVSW~~e~k~a~sG~y~V~~f 101 (167)
T PF05404_consen 43 FIAEFSLKCSNGAKNISLYAEVN---GKILPVARSGD-TNKYQVSWTEEHKKASSGTYEVKFF 101 (167)
T ss_pred EEEEEEEEeCCCCcCccEEEEEC---CEEEEEEEcCC-CCceEEEEEechhhccCCceEEEEe
Confidence 34689999985 33456777775 67777654332 27999999843 2577877764
No 29
>KOG1692|consensus
Probab=83.26 E-value=8.5 Score=30.19 Aligned_cols=33 Identities=24% Similarity=0.263 Sum_probs=24.5
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCcc
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAV 95 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~ 95 (171)
+..|.+.+......+.|...+.+.|.+|+|+.+
T Consensus 37 ~~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i 69 (201)
T KOG1692|consen 37 LEEGDKLSVSFEVIDGGFLGVDVEITGPDGKII 69 (201)
T ss_pred hccCCEEEEEEEEecCCccceeEEEECCCCchh
Confidence 456666666666666666789999999999654
No 30
>PF01483 P_proprotein: Proprotein convertase P-domain; InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=82.95 E-value=6.5 Score=26.14 Aligned_cols=54 Identities=15% Similarity=0.274 Sum_probs=28.4
Q ss_pred EEEEEeCCCcCCceEEEEECCCCCccceEEEEcC------CCEEEEEEEcC------CCeeEEEEEEE
Q psy15726 70 SFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQK------FANLLAEFTPT------TTGVYKIDVLQ 125 (171)
Q Consensus 70 ~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~------dg~y~v~y~P~------~~G~~~i~V~~ 125 (171)
.+.++..-...++|.+.+.+|+|.. +.+.+.. ++.+..+|... ..|.++|.|.=
T Consensus 5 ~v~v~i~H~~~gdL~i~L~SP~Gt~--~~L~~~~~~d~~~~~~~~~~f~~~~f~Ge~~~G~W~L~v~D 70 (87)
T PF01483_consen 5 QVSVDITHPYRGDLRITLISPSGTR--STLKDRRGSDDSGSGFLNWTFTSVAFWGESANGTWTLRVTD 70 (87)
T ss_dssp EEEEEEEESSGGGEEEEEE-TT--E--EEEE-SSTTHHHSTSEEEEEEEESTTTT-B--EEEEEEEEE
T ss_pred EEEEEEEcCCcCCEEEEEECCCCCE--EEEECCcCCCcccccccccEEEEEeecCccCCCEEEEEEEE
Confidence 4445444445579999999999974 3344332 34566666632 14555555543
No 31
>COG1470 Predicted membrane protein [Function unknown]
Probab=82.69 E-value=12 Score=33.29 Aligned_cols=120 Identities=11% Similarity=0.119 Sum_probs=61.3
Q ss_pred CCceeEEEEEEECCE-EcCCCC-eEEEecCCCCCCCcEEEcCCCc-ceEeCCeEEEEEEeCCCcC---CceEEEEECCCC
Q psy15726 19 ISYGLVQARTLREET-SFRSCP-MEVPVVDPAVGREPSGSGLGLY-QARSGIVNSFTLETCGVAS---SEFDVIVTSPQG 92 (171)
Q Consensus 19 ~~~G~~~i~V~~~g~-~I~gSP-f~v~V~~~~d~~~v~v~G~Gl~-~~~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g 92 (171)
-+||.|.+.|.-.+- .+..-+ ..+.+... ....+.+.+--++ .+..|+.....+.-.+.|. .++.+.|.+|+|
T Consensus 347 a~pG~Ynv~I~A~s~s~v~~e~~lki~~~g~-~~~~v~l~~g~~~lt~taGee~~i~i~I~NsGna~LtdIkl~v~~Pqg 425 (513)
T COG1470 347 ATPGTYNVTITASSSSGVTRELPLKIKNTGS-YNELVKLDNGPYRLTITAGEEKTIRISIENSGNAPLTDIKLTVNGPQG 425 (513)
T ss_pred CCCCceeEEEEEeccccceeeeeEEEEeccc-cceeEEccCCcEEEEecCCccceEEEEEEecCCCccceeeEEecCCcc
Confidence 467888887776553 222222 22222221 2223333221121 2667777776666655554 567799999998
Q ss_pred Cccce---EE---EEcCCCEEEEEEE-cC--CCeeEEEEEEECCEEc-CCCCeEEEe
Q psy15726 93 AAVPV---RC---YQQKFANLLAEFT-PT--TTGVYKIDVLQGARPV-RGSPYLCQV 139 (171)
Q Consensus 93 ~~~~~---~v---~~~~dg~y~v~y~-P~--~~G~~~i~V~~~g~~I-~gSPF~v~V 139 (171)
=.+.+ .| ..-+.++-.++-+ |. .+|+|.|.|+--.-.. ...=+++.|
T Consensus 426 Wei~Vd~~~I~sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ksDq~s~e~tlrV~V 482 (513)
T COG1470 426 WEIEVDESTIPSLEPGESKTVSLTITVPEDAGAGDYRITITAKSDQASSEDTLRVVV 482 (513)
T ss_pred ceEEECcccccccCCCCcceEEEEEEcCCCCCCCcEEEEEEEeeccccccceEEEEE
Confidence 32221 11 1223345555555 33 3788999887543222 223355555
No 32
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=82.50 E-value=7.6 Score=30.94 Aligned_cols=70 Identities=11% Similarity=0.057 Sum_probs=42.7
Q ss_pred CcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEc----CCCeeEEEEEE
Q psy15726 52 EPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTP----TTTGVYKIDVL 124 (171)
Q Consensus 52 ~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P----~~~G~~~i~V~ 124 (171)
.|.+.|..+.- .+....|.+...++ ...+.+.|.+.+|+......-....|.+.+.|-+ ..+|.|++.|.
T Consensus 97 ~V~~~g~~~~l--~~~~~~~~~~L~~~-a~~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG~~~~lp~G~Y~~~V~ 170 (218)
T PRK09619 97 QVMVESHEITL--GEDPVAGRLTLKHP-APTLTLHITDILGQEKKIDLGKQPAGPVNFTLDPAALGLQPGQYQLSVV 170 (218)
T ss_pred EEEEecceEEe--cCCeeEEEEecCCc-CcEEEEEEEeCCCCEEEEecCCcCCCceeEEECCCCCCCCCceeEEEEE
Confidence 34454444321 23455666664332 3568899999999742222223455888888876 45788999986
No 33
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=82.21 E-value=10 Score=31.11 Aligned_cols=58 Identities=14% Similarity=0.053 Sum_probs=41.0
Q ss_pred CeEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEEc-------CCCeeEEEEEEE
Q psy15726 67 IVNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTP-------TTTGVYKIDVLQ 125 (171)
Q Consensus 67 ~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P-------~~~G~~~i~V~~ 125 (171)
....|.++.... ...+.+.|.+.+|+.++. .+.+...|.+.+.|-- ...|.|++.|..
T Consensus 128 ~~~~~~~~l~~~-a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A 193 (259)
T PRK12812 128 ELIALKLYFPED-SDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVY 193 (259)
T ss_pred ceeEEEEecCCc-CceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEE
Confidence 467777776433 357899999999986643 3335556888888865 347999999974
No 34
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=82.20 E-value=10 Score=24.00 Aligned_cols=52 Identities=19% Similarity=0.313 Sum_probs=32.5
Q ss_pred EEEEeCCCcCCceEEEEECCCCCccceEEEEcC--CC-EEEEEEEcCCCeeEEEEEE
Q psy15726 71 FTLETCGVASSEFDVIVTSPQGAAVPVRCYQQK--FA-NLLAEFTPTTTGVYKIDVL 124 (171)
Q Consensus 71 F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~--dg-~y~v~y~P~~~G~~~i~V~ 124 (171)
++|++.+.. +++.+.+.+++|..+... .+.. .+ .=.++|+..++|.|.|.|.
T Consensus 15 l~i~l~~~~-~d~dl~l~~~~g~~~~~~-d~~~~~~~~~~~i~~~~~~~GtYyi~V~ 69 (70)
T PF04151_consen 15 LTIDLSGGS-GDADLYLYDSNGNSLASY-DDSSQSGGNDESITFTAPAAGTYYIRVY 69 (70)
T ss_dssp EEEEECETT-SSEEEEEEETTSSSCEEC-CCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred EEEEEcCCC-CCeEEEEEcCCCCchhhh-eecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence 445554332 367788888887532211 1111 12 4678899999999999985
No 35
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=81.76 E-value=7.3 Score=28.82 Aligned_cols=65 Identities=17% Similarity=0.185 Sum_probs=39.0
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEE--ECCEEcCCC
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVL--QGARPVRGS 133 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~--~~g~~I~gS 133 (171)
.+..|.++++++.-.+.-...+ .+.+ -| +...+ ..+.+.+++|++.++|.|...=. -...|++||
T Consensus 64 ~VkaGD~Vtl~vtN~d~~~H~f--~i~~-~g--is~~I--~pGet~TitF~adKpG~Y~y~C~~HP~~~H~~~~ 130 (135)
T TIGR03096 64 VVKKGTPVKVTVENKSPISEGF--SIDA-YG--ISEVI--KAGETKTISFKADKAGAFTIWCQLHPKNIHLPGS 130 (135)
T ss_pred EECCCCEEEEEEEeCCCCccce--EECC-CC--cceEE--CCCCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence 3667888888776333322222 2222 11 12222 22468999999999999986533 336788887
No 36
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=81.66 E-value=9 Score=24.65 Aligned_cols=46 Identities=17% Similarity=0.165 Sum_probs=28.6
Q ss_pred CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726 81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARP 129 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~ 129 (171)
....|.+.+.++.. .....-+.+|.|.+.-.|. |.|.|.+...|..
T Consensus 15 ~~a~V~l~~~~~~~-~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~ 60 (82)
T PF13620_consen 15 PGATVTLTDQDGGT-VYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ 60 (82)
T ss_dssp TT-EEEET--TTTE-CCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred CCEEEEEEEeeCCC-EEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence 45566666655543 3445555779999997776 9999999887743
No 37
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=80.24 E-value=4.6 Score=25.60 Aligned_cols=44 Identities=11% Similarity=0.150 Sum_probs=25.4
Q ss_pred ceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEc
Q psy15726 82 EFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGARPV 130 (171)
Q Consensus 82 ~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I 130 (171)
.+.+.....+|..+ +..+-+..-.|....+|.|.|.++|.+...
T Consensus 18 ~~~v~at~~dG~~~-----~~~~~~vs~~~d~~~~G~y~Vt~~y~~~t~ 61 (67)
T PF07523_consen 18 GLFVTATYSDGTSL-----PLSDVTVSGTVDTSKAGTYTVTYTYKGVTA 61 (67)
T ss_dssp CHEEEEEETTS-ES------GCCSEEES---TTS-CCEEEEEEECTEEE
T ss_pred CCEEEEEEcCCCEe-----ceeeeEEEeeeecCCCceEEEEEEECCEEE
Confidence 45566777777652 222333333678899999999999999443
No 38
>PF13584 BatD: Oxygen tolerance
Probab=79.90 E-value=11 Score=33.08 Aligned_cols=62 Identities=21% Similarity=0.367 Sum_probs=45.0
Q ss_pred EEEEEEEcCCCeeEEEE---EEECCEEcCCCCeEEEecCCCc------------eEEecC-CcccEEcCCCEEEEEEc
Q psy15726 106 NLLAEFTPTTTGVYKID---VLQGARPVRGSPYLCQVYDASK------------VKIEHK-GLSNIVVNDPISFKCKS 167 (171)
Q Consensus 106 ~y~v~y~P~~~G~~~i~---V~~~g~~I~gSPF~v~V~d~sk------------v~~~G~-gl~~~~~g~~~~F~Vd~ 167 (171)
+|.+.+.|...|.|+|- |.++|+...--|..++|.+++. +.+.-. .-+.+.+||++.+++..
T Consensus 74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~l 151 (484)
T PF13584_consen 74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRL 151 (484)
T ss_pred EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEE
Confidence 67888899999999875 6779999999999999944332 111111 12347889998888753
No 39
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=78.17 E-value=12 Score=29.95 Aligned_cols=66 Identities=24% Similarity=0.361 Sum_probs=42.8
Q ss_pred eEEEEEEeCCCcCCceEEEEECCCCCccce-EEEEcCCCEEEEEEEcC-------CCeeEEEEEEE---CCEEcCCCC
Q psy15726 68 VNSFTLETCGVASSEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTPT-------TTGVYKIDVLQ---GARPVRGSP 134 (171)
Q Consensus 68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~-------~~G~~~i~V~~---~g~~I~gSP 134 (171)
...|.++..++ ...+.+.|.+.+|+.++. .+-....|.+.+.|-.. ..|.|++.|.. +|+.+.-.+
T Consensus 117 ~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a~~~~G~~~~~~~ 193 (230)
T PRK12633 117 ATPFGIDLQGD-ATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSASDADAKPVKAEA 193 (230)
T ss_pred ceeEEEecCCc-CcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEEEeCCCcEEeeee
Confidence 45566654332 357889999999987653 33345567888888654 46889988865 345554333
No 40
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=77.89 E-value=3.7 Score=25.50 Aligned_cols=19 Identities=21% Similarity=0.371 Sum_probs=15.7
Q ss_pred EEEEEcCCCeeEEEEEEEC
Q psy15726 108 LAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 108 ~v~y~P~~~G~~~i~V~~~ 126 (171)
.++|++..+|.|+|.|+..
T Consensus 30 ~~~~~~L~~G~Y~l~V~a~ 48 (66)
T PF07495_consen 30 SISYTNLPPGKYTLEVRAK 48 (66)
T ss_dssp EEEEES--SEEEEEEEEEE
T ss_pred EEEEEeCCCEEEEEEEEEE
Confidence 9999999999999999864
No 41
>KOG1428|consensus
Probab=77.75 E-value=8.3 Score=39.26 Aligned_cols=36 Identities=19% Similarity=0.100 Sum_probs=30.9
Q ss_pred eeccccCCCCceeEEEEEEECCEEcCCCCeEEEecCC
Q psy15726 11 LSEKAPSWISYGLVQARTLREETSFRSCPMEVPVVDP 47 (171)
Q Consensus 11 ~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~ 47 (171)
+++-+|+|...|.|+++++.+|..|.. -+.|.|.++
T Consensus 2107 ~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd~ 2142 (3738)
T KOG1428|consen 2107 TYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKDP 2142 (3738)
T ss_pred ceecccCCccceeEEEEEEecceeecc-ceEEEeccC
Confidence 456789999999999999999999954 478888876
No 42
>PF00017 SH2: SH2 domain; InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates. The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=77.13 E-value=6.7 Score=25.16 Aligned_cols=27 Identities=15% Similarity=0.247 Sum_probs=24.7
Q ss_pred cCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726 102 QKFANLLAEFTPTTTGVYKIDVLQGAR 128 (171)
Q Consensus 102 ~~dg~y~v~y~P~~~G~~~i~V~~~g~ 128 (171)
..+|+|.|......+|.|.|.|+++++
T Consensus 19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~~ 45 (77)
T PF00017_consen 19 KPDGTFLVRPSSSKPGKYVLSVRFDGK 45 (77)
T ss_dssp SSTTEEEEEEESSSTTSEEEEEEETTE
T ss_pred CCCCeEEEEeccccccccccccccccc
Confidence 568999999999999999999999993
No 43
>PF14545 DBB: Dof, BCAP, and BANK (DBB) motif,
Probab=75.27 E-value=32 Score=25.69 Aligned_cols=75 Identities=12% Similarity=0.087 Sum_probs=50.4
Q ss_pred ceEeCCeEEEEEEeCC-C-cCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCC----CeeEEEEEEECCEEcCCCC
Q psy15726 62 QARSGIVNSFTLETCG-V-ASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTT----TGVYKIDVLQGARPVRGSP 134 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~-a-g~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~----~G~~~i~V~~~g~~I~gSP 134 (171)
+++.|++ ++.|-.++ - +++.++|.+...++........-. .-|++++. |.. +|...++|..||..+.-.|
T Consensus 7 rI~cg~~-~vfIiL~~~l~~~~~~eVef~~~n~~~~~~~~~~~--N~yt~~~~aPd~~~~pag~V~v~v~~~g~~~~~~~ 83 (142)
T PF14545_consen 7 RIRCGQP-EVFIILRDPLDEEDTVEVEFESNNKPIRRVPAKWE--NPYTLQFKAPDFCLEPAGSVNVRVYCDGVSLGTRQ 83 (142)
T ss_pred eeecCCC-EEEEEEeCCCCCCCeEEEEEEeCCCeeEeccceEE--CCEEEEEECchhcCCCCceEEEEEEECCEEEEEEe
Confidence 4677888 55555544 3 246899999887764322222221 26999997 443 6999999999998887777
Q ss_pred eEEEe
Q psy15726 135 YLCQV 139 (171)
Q Consensus 135 F~v~V 139 (171)
.++.-
T Consensus 84 ikyys 88 (142)
T PF14545_consen 84 IKYYS 88 (142)
T ss_pred EEEEe
Confidence 66654
No 44
>PF06016 Reovirus_L2: Reovirus core-spike protein lambda-2 (L2); InterPro: IPR010311 This family consists of several Reovirus core-spike protein lambda-2 (L2) sequences. The reovirus L2 genome segment encodes the core spike protein lambda-2, which mediates enzymatic reactions in 5' capping of the viral plus-strand transcripts [].; GO: 0004482 mRNA (guanine-N7-)-methyltransferase activity, 0004484 mRNA guanylyltransferase activity, 0005524 ATP binding, 0006370 mRNA capping, 0019028 viral capsid; PDB: 1EJ6_A 3IYL_W 3K1Q_A.
Probab=74.42 E-value=4 Score=40.24 Aligned_cols=103 Identities=21% Similarity=0.284 Sum_probs=54.6
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEE------EECCCCC----ccceEE-EEcCCCEEEEEEEcCCCeeEEEEEEECCEEcC
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVI------VTSPQGA----AVPVRC-YQQKFANLLAEFTPTTTGVYKIDVLQGARPVR 131 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~------i~~p~g~----~~~~~v-~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~ 131 (171)
.+||++++++|..-+.. ..+.+. ++...|+ .-++.. .+..+.-+.++|+|..+|.|.+.+..++.+|+
T Consensus 1035 ~~VGq~~~i~I~~~~~~-~s~~~~~~gV~v~~~~~~~~~s~~~~v~~q~~~~~~~~~l~~tp~~aGi~~vv~~~~~~~v~ 1113 (1289)
T PF06016_consen 1035 LVVGQQCSITIPGPNPQ-ASVSVTHDGVEVFTGTAGNLVSMVGPVALQYDPANNEWTLTWTPSTAGILDVVVDDGPVPVP 1113 (1289)
T ss_dssp SCTTT-EEEEEESS-TT-EEEEEECTTCEEEECCCTTTCESSSEEEEEEETTTTEEEEEEE-SS-CEEEEEECTTTCCEE
T ss_pred ccccceEEEEEccCCCC-ceEEEEECCEEEEEecCCcccccccceeEEEeccCceEEEEEccCCCceEEEEeccCCcccc
Confidence 56777777777765432 111111 1112222 112332 35666799999999999999997777777775
Q ss_pred -CCCeEEEecCCCceEEecCCcccEEcCCCEEEEEEc
Q psy15726 132 -GSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCKS 167 (171)
Q Consensus 132 -gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd~ 167 (171)
|| |.+...|+.-...++.-++--.+|-.+..++|.
T Consensus 1114 ~GS-~~i~~Pd~~i~v~~Pa~~D~T~aGtd~~I~~d~ 1149 (1289)
T PF06016_consen 1114 LGS-FTIEPPDPTITVTWPANLDYTDAGTDVNITCDP 1149 (1289)
T ss_dssp EEE-EEEB-----EEEE--TT--SSTT-EEEEEEE-T
T ss_pred cce-eEeeCCCCceEEecccceeeeccCceeeEEecc
Confidence 55 777778887777777666666677666666653
No 45
>PF13584 BatD: Oxygen tolerance
Probab=73.53 E-value=63 Score=28.35 Aligned_cols=61 Identities=11% Similarity=0.034 Sum_probs=42.6
Q ss_pred cCCCCceeEEE---EEEECCEEcCCCCeEEEecCCCCC---------CCcEEEcC-CCcceEeCCeEEEEEEeC
Q psy15726 16 PSWISYGLVQA---RTLREETSFRSCPMEVPVVDPAVG---------REPSGSGL-GLYQARSGIVNSFTLETC 76 (171)
Q Consensus 16 ~~P~~~G~~~i---~V~~~g~~I~gSPf~v~V~~~~d~---------~~v~v~G~-Gl~~~~vg~~~~F~V~~~ 76 (171)
+.|.+.|.|.| .|++||+...-.|..+.|.+.... ..+.+.-. .-..+-+|++..+++..-
T Consensus 79 l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~ly 152 (484)
T PF13584_consen 79 LQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRLY 152 (484)
T ss_pred EEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEEE
Confidence 57999999985 467799998899999999876321 12222211 223478898888777763
No 46
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=72.47 E-value=31 Score=24.25 Aligned_cols=64 Identities=19% Similarity=0.167 Sum_probs=38.0
Q ss_pred ceEeCCeEEEEEEeCCC---cCCceEEEEECCCCCcc--------ceEEEEcCCCEEEEEEE---cCCCeeEEEEEEE
Q psy15726 62 QARSGIVNSFTLETCGV---ASSEFDVIVTSPQGAAV--------PVRCYQQKFANLLAEFT---PTTTGVYKIDVLQ 125 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~a---g~~~l~v~i~~p~g~~~--------~~~v~~~~dg~y~v~y~---P~~~G~~~i~V~~ 125 (171)
....|++..|.++.+-. ..-.+.+.|.+.+|..+ ...+.....|+|+++|+ +..+|.|.|++.+
T Consensus 30 ~~~~ge~~~i~i~~~~~~~i~~~~~~~~i~~~~g~~v~~~~t~~~~~~~~~~~~g~~~~~~~i~~~L~~G~Y~i~v~l 107 (142)
T PF14524_consen 30 SFESGEPIRIRIDYEVNEDIDDPVFGFAIRDSDGQRVFGTNTYDSGFPIPLSEGGTYEVTFTIPKPLNPGEYSISVGL 107 (142)
T ss_dssp SEETTSEEEEEEEEEESS-EEEEEEEEEEEETT--EEEEEEHHHHT--EEE-TT-EEEEEEEEE--B-SEEEEEEEEE
T ss_pred EEeCCCEEEEEEEEEECCCCCccEEEEEEEcCCCCEEEEECccccCccccccCCCEEEEEEEEcCccCCCeEEEEEEE
Confidence 36678888777776522 22346677778777432 12333333788888876 3569999999999
No 47
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=72.22 E-value=1.3e+02 Score=32.70 Aligned_cols=139 Identities=14% Similarity=0.094 Sum_probs=71.3
Q ss_pred CCCCceeEEEEEEECCE-EcCCCCeEEEecCCCCCCCcEEEcCCCc-ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726 17 SWISYGLVQARTLREET-SFRSCPMEVPVVDPAVGREPSGSGLGLY-QARSGIVNSFTLETCGVASSEFDVIVTSPQGAA 94 (171)
Q Consensus 17 ~P~~~G~~~i~V~~~g~-~I~gSPf~v~V~~~~d~~~v~v~G~Gl~-~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~ 94 (171)
+-..+|.|.|.|+..++ --......+.|..+...-.++...++-. -..+|....|+=... .+..+.-...=.+|..
T Consensus 1749 tf~~~G~y~VtVta~N~vss~~~s~~V~VqepV~GL~i~~~~~~~~~~~~ag~~v~F~a~vs--tGsnVsw~W~f~~g~s 1826 (2740)
T TIGR00864 1749 SFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQNFFAADSSVCFQGELA--TGTNVSWCWAIDGGSS 1826 (2740)
T ss_pred EccCCceEEEEEEEEccccccceeEEEEEEeccccceEecCCCCcccceecCcEEEEEEEcc--CCCeeEEEEEeCCCCc
Confidence 45679999999985443 2223446666766542222222222111 244565555544322 1233333332222210
Q ss_pred cceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC-CEEcCCCCeEEEecCCC-ceEEecCCcccEEcCCCEEEEEEc
Q psy15726 95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG-ARPVRGSPYLCQVYDAS-KVKIEHKGLSNIVVNDPISFKCKS 167 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~-g~~I~gSPF~v~V~d~s-kv~~~G~gl~~~~~g~~~~F~Vd~ 167 (171)
. . ...++|++.++|.|.|++..- +..=..-...+.|-|+= ...+. .+...+.+|+...|+++.
T Consensus 1827 ~--------~-gk~v~~Tf~~aG~ytV~L~AsN~vs~~~~s~~~~VQe~I~~L~L~-as~~~~~~n~~v~fsa~l 1891 (2740)
T TIGR00864 1827 K--------M-GKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELK-ASKKIAAIGEKVEFQILL 1891 (2740)
T ss_pred c--------c-cceeEEecCCCeEEEEEEEEEcccCcceeeeeEEEEEecceEEEe-cccccccCCCEEEEEEEe
Confidence 0 1 367889999999999999843 22211123445554432 22222 223346778888887754
No 48
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=72.12 E-value=37 Score=27.11 Aligned_cols=55 Identities=18% Similarity=0.205 Sum_probs=37.6
Q ss_pred CceEEEEECCCCCccce-EEEEcCCCEEEEEEEcC-------CCeeEEEEEEE---CCEEcCCCCe
Q psy15726 81 SEFDVIVTSPQGAAVPV-RCYQQKFANLLAEFTPT-------TTGVYKIDVLQ---GARPVRGSPY 135 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~-------~~G~~~i~V~~---~g~~I~gSPF 135 (171)
..+.+.|.+.+|+.+.. .+.....|.+.++|-.. ..|.|++.|.. +|+.+.-.+.
T Consensus 122 ~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a~~~~G~~~~~~~~ 187 (221)
T PRK12634 122 GFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQTDTAGSKSKLSTY 187 (221)
T ss_pred CeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEEEeCCCcEEeeeee
Confidence 57889999999986643 23345568888888663 35899999864 4555554443
No 49
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=71.89 E-value=6.1 Score=25.78 Aligned_cols=28 Identities=14% Similarity=0.170 Sum_probs=24.1
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+..+|+|.+......+|.|.|.|++++
T Consensus 18 ~~~~~G~FLvR~s~~~~~~~~Lsv~~~~ 45 (84)
T smart00252 18 KNEGDGDFLVRDSESEPGDYVLSVRVKG 45 (84)
T ss_pred hcCCCcEEEEEcCCCCCCCEEEEEEECC
Confidence 3445899999999998999999999974
No 50
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=71.45 E-value=31 Score=26.30 Aligned_cols=62 Identities=19% Similarity=0.130 Sum_probs=42.4
Q ss_pred EeCCeEEEEEEeCCCcCCceEEEEECCCCCcc----ceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAV----PVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~----~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+|++..|.|.-.+..-....|.+..+++... ...++-+.+| .++|++..+|.|.|.+.+.+
T Consensus 148 ~~g~~~~~~vl~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G--~~~~~~~~~G~wli~a~~~~ 213 (215)
T PF10670_consen 148 KAGDPLPFQVLFDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANG--RATFTLPRPGLWLIRASHKD 213 (215)
T ss_pred cCCCEEEEEEEECCeEcccEEEEEEECCCccccccceEEEEECCCC--EEEEecCCCEEEEEEEEEec
Confidence 47888888888777655567777776664321 2333333445 57788999999999988754
No 51
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=71.33 E-value=10 Score=26.36 Aligned_cols=53 Identities=11% Similarity=0.303 Sum_probs=32.4
Q ss_pred CceEEEEECCCCCccceEEEEcCC-CEEEEEEEcCCCeeEEEEEEECCE-EcCCC
Q psy15726 81 SEFDVIVTSPQGAAVPVRCYQQKF-ANLLAEFTPTTTGVYKIDVLQGAR-PVRGS 133 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~~~~v~~~~d-g~y~v~y~P~~~G~~~i~V~~~g~-~I~gS 133 (171)
+++.++|.+..|..+--+...+.. ..+.+.-.=...|.|.|.+..+.- .+-|.
T Consensus 48 ~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G~ 102 (106)
T PF11589_consen 48 GDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYGE 102 (106)
T ss_dssp SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEEE
T ss_pred CCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEEE
Confidence 567888888777665444433322 356666655678999999988765 65554
No 52
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=70.09 E-value=30 Score=23.16 Aligned_cols=28 Identities=21% Similarity=0.210 Sum_probs=24.0
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
..+.+|...+.++...+|.+.|.+..++
T Consensus 57 ~Td~~G~a~~~l~~~~~G~~~vta~~~~ 84 (92)
T smart00634 57 TTDANGIATVTLTSTTAGVYTVTASLEN 84 (92)
T ss_pred eeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence 4455899999999999999999998876
No 53
>KOG4088|consensus
Probab=69.41 E-value=22 Score=26.62 Aligned_cols=53 Identities=19% Similarity=0.398 Sum_probs=33.7
Q ss_pred eEEEEEEeCC-CcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcC--C--CeeEEEEEE
Q psy15726 68 VNSFTLETCG-VASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPT--T--TGVYKIDVL 124 (171)
Q Consensus 68 ~~~F~V~~~~-ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~--~--~G~~~i~V~ 124 (171)
-.+|++++++ .....+.+.+. |+.+|+.+.+. -+.|.|+|+-. . +|.|.+.+.
T Consensus 44 i~EftLqCsn~~~n~~l~Aev~---Gk~~PVs~~~d-~~kyQVSW~ldhK~a~agt~~vr~F 101 (167)
T KOG4088|consen 44 ITEFTLQCSNNPKNIQLTAEVN---GKLIPVSISDD-TAKYQVSWTLDHKDAGAGTFNVRIF 101 (167)
T ss_pred EEEEEEEeCCCCcceEEEEecC---CEEEeEEecCc-cceEEEEEEEeecccCCceEEEEEe
Confidence 3478888876 33345555543 78888776543 27999999854 2 455555543
No 54
>PF09134 Invasin_D3: Invasin, domain 3; InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=68.18 E-value=38 Score=23.67 Aligned_cols=70 Identities=11% Similarity=0.115 Sum_probs=36.0
Q ss_pred CeEEEEEEeCCCcCCc---eE-EEEECCCCCccce-EEEEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEE
Q psy15726 67 IVNSFTLETCGVASSE---FD-VIVTSPQGAAVPV-RCYQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQ 138 (171)
Q Consensus 67 ~~~~F~V~~~~ag~~~---l~-v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~ 138 (171)
..+.+++.++|+.+.. +. +... .+|..+.. .+.+ ..|.|+++.+=..+|.+.|.=.++|+++..-|-++.
T Consensus 21 dta~LTfta~D~~g~pvsGl~~l~f~-~~g~~vtls~itE-~~GvYtATl~Gt~~G~vtI~P~v~G~~l~~~~~k~t 95 (99)
T PF09134_consen 21 DTATLTFTARDANGNPVSGLSVLAFV-QDGVPVTLSPITE-NPGVYTATLKGTSAGEVTITPQVNGQPLATLQKKLT 95 (99)
T ss_dssp --EEEEEE-B-TTS-B--S--SEEEE-EEES--EE---EE--SS-EEEEEE-SS-EEEEEEEEETTEE-GGG-EEEE
T ss_pred ceEEEEEEEEcCCCCcccCceeEEEE-ccCCCcEEccccc-cCCEEEEEEEcccCceEEEEEeeCCEEccccceeEE
Confidence 3455677777764421 11 2221 23332221 2334 458999999999999999999999999987665543
No 55
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=67.89 E-value=92 Score=27.92 Aligned_cols=86 Identities=13% Similarity=0.101 Sum_probs=52.4
Q ss_pred CcEEEcCCCcceEeCCeE--EEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC--EEEEEEEcCCCeeEEEEEEE--
Q psy15726 52 EPSGSGLGLYQARSGIVN--SFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA--NLLAEFTPTTTGVYKIDVLQ-- 125 (171)
Q Consensus 52 ~v~v~G~Gl~~~~vg~~~--~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg--~y~v~y~P~~~G~~~i~V~~-- 125 (171)
...++|..-+-...+..+ .|+|.+. +...+++.|.+.+|+.+--.-...+|+ .+++...-.++|.|.|.|+.
T Consensus 312 ~lsVsGL~~eY~I~dG~~~i~ftv~a~--g~~~vta~V~d~~g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~ 389 (478)
T PRK13211 312 ELEVSGLAKEYKIGDGAATLDFTVTAT--GDMNVEATVYNHDGEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKP 389 (478)
T ss_pred ceEEecCcceeEEcCCcEEEEEEEEec--cceEEEEEEEcCCCCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEe
Confidence 345555433223333333 4555554 456888999999987654332333333 56666777789999999885
Q ss_pred -CCEEcCCCCeEEEe
Q psy15726 126 -GARPVRGSPYLCQV 139 (171)
Q Consensus 126 -~g~~I~gSPF~v~V 139 (171)
+|+.+.-.-+.+.|
T Consensus 390 ~dG~~~~q~~~~~~v 404 (478)
T PRK13211 390 KDGELIKQQTLDFML 404 (478)
T ss_pred CCCceeeeeeEEEEE
Confidence 45666445566666
No 56
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=67.50 E-value=1.8 Score=32.19 Aligned_cols=61 Identities=15% Similarity=0.146 Sum_probs=0.0
Q ss_pred EeCCeEEEEEEeCCCc-CCceEEEEECCCC--CccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726 64 RSGIVNSFTLETCGVA-SSEFDVIVTSPQG--AAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag-~~~l~v~i~~p~g--~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~ 126 (171)
..|....+.+...+.+ ...+.+.|.+|++ ..+-- .......-..+|++.+.|.|++.+.-.
T Consensus 19 ~~~~~i~~~y~v~~~~~~~~v~~~i~~~~~~~~~i~~--~~~~~~~~~f~f~~~~~G~y~iCf~n~ 82 (183)
T PF01105_consen 19 PKGTTIRGSYRVTDGGGAYDVDFTIRDPDPNGEVIYS--KSDKESEGSFSFTAKESGEYQICFDNS 82 (183)
T ss_dssp ------------------------------------------------------------------
T ss_pred CCCcEEEEEEEEeeccccceEEEEEEecccCCceeee--ecccccCCcEEEEeccCCCEEEEEEcC
Confidence 3366666666666555 5788899995554 42221 111111235667778899988877643
No 57
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=67.42 E-value=37 Score=26.18 Aligned_cols=59 Identities=14% Similarity=0.169 Sum_probs=35.7
Q ss_pred EeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCCCeeEEEEEE
Q psy15726 64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTTTGVYKIDVL 124 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~~G~~~i~V~ 124 (171)
.+|....|.|.-... ..+.+.+.+|+|+....-..|.......+.-- ..++|.+++.|.
T Consensus 26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~ 85 (179)
T PF09315_consen 26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSIT 85 (179)
T ss_pred CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEe
Confidence 478888888876654 46788899999976544233433233333321 145666666663
No 58
>KOG2996|consensus
Probab=67.19 E-value=5.1 Score=36.64 Aligned_cols=30 Identities=7% Similarity=0.135 Sum_probs=26.6
Q ss_pred EEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726 99 CYQQKFANLLAEFTPTTTGVYKIDVLQGAR 128 (171)
Q Consensus 99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~ 128 (171)
+....||||.|.|.+.+++++-|+|+|+.+
T Consensus 701 Lk~~~ngT~LVR~r~kea~e~AISikynne 730 (865)
T KOG2996|consen 701 LKNRPNGTYLVRYRTKEAKEFAISIKYNNE 730 (865)
T ss_pred hhcCCCceEEEEecccchhheeEEEEeccc
Confidence 445678999999999999999999999973
No 59
>PRK15095 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=67.10 E-value=37 Score=25.42 Aligned_cols=79 Identities=18% Similarity=0.183 Sum_probs=45.6
Q ss_pred CCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcC------------------Cce----EEEEECCCCCc
Q psy15726 37 SCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVAS------------------SEF----DVIVTSPQGAA 94 (171)
Q Consensus 37 gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~------------------~~l----~v~i~~p~g~~ 94 (171)
+.|+.+.+... .-+.-.-.+|....+|....|+|....+-+ +.+ .+.+.+++|+.
T Consensus 32 ~~P~~f~~G~g---~vi~gle~aL~gm~~Ge~~~v~ipp~~ayG~~d~~~v~~vp~~~f~~~~~~~~G~~~~~~~~~G~~ 108 (156)
T PRK15095 32 GKPALFRLGDG---SLSEGLEQQLLGLKVGDKKTFSLEPEAAFGVPSPDLIQYFSRRDFMDAGEPEIGAIMLFTAMDGSE 108 (156)
T ss_pred CCCEEEEeCCC---CccHHHHHHHcCCCCCCEEEEEEChHHhcCCCChHHEEEecHHHCCcccCCCCCCEEEEECCCCCE
Confidence 46777666532 111112234555788999999998765411 011 13345666666
Q ss_pred cceEEEEcCCCEEEEEEEcCCCee
Q psy15726 95 VPVRCYQQKFANLLAEFTPTTTGV 118 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v~y~P~~~G~ 118 (171)
.++.|.+..+.+.++.|.-.-+|.
T Consensus 109 ~~~~V~~i~~~~v~vD~NHPLAGk 132 (156)
T PRK15095 109 MPGVIREINGDSITVDFNHPLAGQ 132 (156)
T ss_pred EEEEEEEEcCCEEEEECCCcCCCC
Confidence 666777777777777776555554
No 60
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=66.71 E-value=65 Score=25.78 Aligned_cols=57 Identities=16% Similarity=0.215 Sum_probs=39.2
Q ss_pred CCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEc-------CCCeeEEEEEEE
Q psy15726 66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTP-------TTTGVYKIDVLQ 125 (171)
Q Consensus 66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P-------~~~G~~~i~V~~ 125 (171)
|....|.+....+ ...+.+.|.+.+|+.++.. +...|.+.++|-. ..+|.|++.|..
T Consensus 110 g~~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~--~~~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A 173 (223)
T PRK12813 110 GTPVTISPNPAAD-ADKAELVVRDAAGAEVARE--TVPVGAGPVEWAGEDADGNPLPNGAYSFVVES 173 (223)
T ss_pred CceeEEEEeccCC-CceEEEEEEcCCCCEEEEE--eeCCCceeEEeCCcCCCCCcCCCccEEEEEEE
Confidence 5566777765443 3578899999999876543 2345677777753 446899999875
No 61
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=65.92 E-value=48 Score=23.93 Aligned_cols=61 Identities=10% Similarity=0.072 Sum_probs=42.6
Q ss_pred eCCeEEEEEEeCC---CcCCceEEEEECCCCCcc--ceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726 65 SGIVNSFTLETCG---VASSEFDVIVTSPQGAAV--PVRCYQQKFANLLAEFTPTTTGVYKIDVLQ 125 (171)
Q Consensus 65 vg~~~~F~V~~~~---ag~~~l~v~i~~p~g~~~--~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~ 125 (171)
.+....+.+.-.+ .....+++.+..|+.... ...+...++|.|.........|.+.|.|.+
T Consensus 67 ~~~~~~i~~~d~~g~~~~~~~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~~ 132 (146)
T PF05751_consen 67 DDNSLTIRLTDPNGAPVSGAKLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLLKKGRWYLRLDW 132 (146)
T ss_pred cCCeEEEEEEcCCCCcCcCceEEEEEECCCCccCCeeEEeeECCCceEEEEcCCCCCccEEEEEEE
Confidence 3455555553222 123678888888886543 445667788999999988899999999955
No 62
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=64.78 E-value=7.5 Score=22.60 Aligned_cols=24 Identities=8% Similarity=-0.140 Sum_probs=18.1
Q ss_pred ccCCCCceeEEEEEEECCEEcCCC
Q psy15726 15 APSWISYGLVQARTLREETSFRSC 38 (171)
Q Consensus 15 ~~~P~~~G~~~i~V~~~g~~I~gS 38 (171)
+|...+||.|.+.++..+....|.
T Consensus 6 nW~FT~PG~Y~l~~~a~~~~~~G~ 29 (41)
T TIGR03769 6 NWVFTKPGTYTLTVQATATLTDGK 29 (41)
T ss_pred ceeeCCCeEEEEEEEEEEEeCCCc
Confidence 577889999999998865544443
No 63
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=63.74 E-value=71 Score=27.77 Aligned_cols=74 Identities=12% Similarity=0.070 Sum_probs=43.5
Q ss_pred eeeeEEEEeecccc----------CCCCceeE-----EEEEEECCE--EcCCCCeEEEecCCCCCCCcEEEcC-CCc-ce
Q psy15726 3 FFTKSCILLSEKAP----------SWISYGLV-----QARTLREET--SFRSCPMEVPVVDPAVGREPSGSGL-GLY-QA 63 (171)
Q Consensus 3 ~~~~~~~~~~~~~~----------~P~~~G~~-----~i~V~~~g~--~I~gSPf~v~V~~~~d~~~v~v~G~-Gl~-~~ 63 (171)
||.-+||+|+=++. .+.+.+.+ .++|+||.+ +|.|-|+++.|. ++|+ ++. ..
T Consensus 16 ff~A~~Lfl~vn~~n~~N~~~t~~~~~~t~~~tltdvpvdvlYD~~~y~isg~~etV~Vt---------l~G~ns~~~~~ 86 (403)
T COG4856 16 FFFAILLFLYVNNNNFNNPISTIKPTSETYTLTLTDVPVDVLYDSDKYFISGQPETVTVT---------LKGPNSIVLKS 86 (403)
T ss_pred HHHHHHhheeecccccCCcccccccccccccceeeeceeEEEEccccccccCCceEEEEE---------EeCCcceeeee
Confidence 55667887775553 34455444 467889874 899999999874 4443 111 13
Q ss_pred EeCCeEEEEEEeCCCcCCceEE
Q psy15726 64 RSGIVNSFTLETCGVASSEFDV 85 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag~~~l~v 85 (171)
+..+......|...++.|..++
T Consensus 87 ~~~~dFkV~ADLt~a~~Gt~ev 108 (403)
T COG4856 87 EKPEDFKVVADLTHAGVGTHEV 108 (403)
T ss_pred ecCcCeEEEEEhhhcCCCceEe
Confidence 3444444455556665554443
No 64
>KOG0544|consensus
Probab=63.66 E-value=40 Score=23.58 Aligned_cols=57 Identities=16% Similarity=0.268 Sum_probs=37.6
Q ss_pred CEEEEEEEcC-CCeeEEEEEEECCEEcCCCCeEEEecCCCceEEecCCcccEEcCCCEEEEEE
Q psy15726 105 ANLLAEFTPT-TTGVYKIDVLQGARPVRGSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCK 166 (171)
Q Consensus 105 g~y~v~y~P~-~~G~~~i~V~~~g~~I~gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd 166 (171)
.+.++.||=. ..| =+++..-=+++||++++---.-++-+-.|+..-.+|+.+.++|.
T Consensus 21 qtvt~hYtg~L~dG-----~kfDSs~dr~kPfkf~IGkgeVIkGwdegv~qmsvGekakLti~ 78 (108)
T KOG0544|consen 21 QTVTVHYTGTLQDG-----KKFDSSRDRGKPFKFKIGKGEVIKGWDEGVAQMSVGEKAKLTIS 78 (108)
T ss_pred CEEEEEEEeEecCC-----cEeecccccCCCeeEEecCcceeechhhcchhccccccceeeec
Confidence 3667777632 333 24455555789999999655556666667666677777777764
No 65
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=62.49 E-value=17 Score=23.93 Aligned_cols=28 Identities=14% Similarity=0.182 Sum_probs=24.8
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+..+|+|.+......+|.|.|.|++++
T Consensus 17 ~~~~~G~FLiR~s~~~~~~~~Lsv~~~~ 44 (94)
T cd00173 17 KKKPDGTFLVRDSESSPGDYVLSVRVKG 44 (94)
T ss_pred hcCCCceEEEEecCCCCCCEEEEEEECC
Confidence 3457899999999999999999999988
No 66
>cd05771 IgC_Tapasin_R Tapasin-R immunoglobulin-like domain. IgC_Tapasin_R: Immunoglobulin-like domain on Tapasin-R. Tapasin is a V-C1 (variable-constant) immunoglobulin superfamily molecule present in the endoplasmic reticulum (ER), where it links MHC class I molecules to the transporter associated with antigen processing (TAP). Tapasin-R is a tapasin-related protein that contains similar structural motifs to Tapasin, with some marked differences, especially in the V domain, transmembrane and cytoplasmic regions. The majority of Tapasin-R is located within the ER; however, there may be some expression of Tapasin-R at the cell surface. Tapasin-R lacks an obvious ER retention signal.
Probab=61.86 E-value=56 Score=23.31 Aligned_cols=74 Identities=11% Similarity=-0.036 Sum_probs=42.1
Q ss_pred eccccCCCCceeEEEEEEECCEEcCCCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECC
Q psy15726 12 SEKAPSWISYGLVQARTLREETSFRSCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSP 90 (171)
Q Consensus 12 ~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p 90 (171)
+=.+.++...|.|...+.. +..-...-.++.|..+ .++.+.-.... ...|....++-.+.+.-..++.+.....
T Consensus 4 ~i~~v~~~D~G~Y~C~~~~-~~~~~~~~~~l~v~~~---P~v~~~~~~~~-~~~~~~~~L~C~a~g~~P~~~~i~W~~~ 77 (139)
T cd05771 4 TLPGLTVHDEGTYICSVST-PPHQAQQIIQLSVSEP---PRVRLSLEKLV-SMIEEPQTLICHIAGYYPLDVQVEWTRE 77 (139)
T ss_pred EECCCCHHHCEEEEEEEEc-cCcceeeEEEEEEecC---CceEEeCCCce-eecCCcEEEEEEECCCCCCCeEEEEEEC
Confidence 3356788889999999863 3222334445666543 35555544332 2246666666666554344567776553
No 67
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=59.90 E-value=50 Score=22.15 Aligned_cols=74 Identities=16% Similarity=0.142 Sum_probs=39.8
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceE-------EEEcCCCEEEEEEE-cCCCeeEEEEEEECCEEcCCCC
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVR-------CYQQKFANLLAEFT-PTTTGVYKIDVLQGARPVRGSP 134 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~-------v~~~~dg~y~v~y~-P~~~G~~~i~V~~~g~~I~gSP 134 (171)
+.+|+....+|..++.+.-..+.++..|+....... +.....-...|+|. +...|.|.-.+.+ ...++.
T Consensus 16 v~~g~~~~~~v~l~N~s~~p~~f~v~~~~~~~~~~~v~~~~g~l~PG~~~~~~V~~~~~~~~g~~~~~l~i---~~e~~~ 92 (102)
T PF14874_consen 16 VFVGQTYSRTVTLTNTSSIPARFRVRQPESLSSFFSVEPPSGFLAPGESVELEVTFSPTKPLGDYEGSLVI---TTEGGS 92 (102)
T ss_pred EccCCEEEEEEEEEECCCCCEEEEEEeCCcCCCCEEEECCCCEECCCCEEEEEEEEEeCCCCceEEEEEEE---EECCeE
Confidence 556666666666665554444444444431111111 11122246788999 6677988766654 344566
Q ss_pred eEEEe
Q psy15726 135 YLCQV 139 (171)
Q Consensus 135 F~v~V 139 (171)
|.+.+
T Consensus 93 ~~i~v 97 (102)
T PF14874_consen 93 FEIPV 97 (102)
T ss_pred EEEEE
Confidence 66655
No 68
>PF12988 DUF3872: Domain of unknown function, B. Theta Gene description (DUF3872); InterPro: IPR024355 This entry represents proteins of unknown function found primarily in Bacteroides species. The Bacteroides thetaiotaomicron gene coding for this protein is located in a conjugate transposon and appears to be upregulated in the presence of host or other bacterial species compared to growth in pure culture [, ].; PDB: 2L3B_A 2L7Q_A.
Probab=58.39 E-value=64 Score=23.90 Aligned_cols=82 Identities=15% Similarity=0.216 Sum_probs=40.1
Q ss_pred EEcC-CCCeEEEecCCCCCCCcEEEcCCCcceEeCCeEEEEEEeCCCc---CCceEEEEECCCC---------Cc-cceE
Q psy15726 33 TSFR-SCPMEVPVVDPAVGREPSGSGLGLYQARSGIVNSFTLETCGVA---SSEFDVIVTSPQG---------AA-VPVR 98 (171)
Q Consensus 33 ~~I~-gSPf~v~V~~~~d~~~v~v~G~Gl~~~~vg~~~~F~V~~~~ag---~~~l~v~i~~p~g---------~~-~~~~ 98 (171)
..|. .=||++.+-|-. ..+..|+.+++.......| +....++-.-|+| .. +|=+
T Consensus 25 LDIQQ~YpF~v~tmPVp------------k~I~~GeTvEIR~~l~reG~y~~t~Y~iRYFQ~dGkG~L~~~~g~~~~pND 92 (137)
T PF12988_consen 25 LDIQQAYPFTVETMPVP------------KKIKKGETVEIRCELKREGNYADTRYTIRYFQPDGKGTLRMDDGTVLLPND 92 (137)
T ss_dssp S------SEEEEE----------------SS--TTEEEEEEEEEEESS--SS---EEEEE-SSS-EEEEETTS-EE-TTS
T ss_pred cceeecCCcEEEEeccc------------cccCCCCEEEEEEEEecCceecccEEEEEEEeecCCEEEEecCCcEecccc
Confidence 3554 578988876641 2356677666655554322 1334444444444 22 3444
Q ss_pred EEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726 99 CYQQKFANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~ 126 (171)
..+...++|+.-||......+.|+|.+.
T Consensus 93 ~Y~L~~~~FRLYYTS~s~~~q~idv~ve 120 (137)
T PF12988_consen 93 RYPLEKEVFRLYYTSRSDDQQTIDVYVE 120 (137)
T ss_dssp EEE-S-SEEEEEEEE-SSS-EEEEEEEE
T ss_pred ceecCcCEEEEEEecCCCCCceeEEEEE
Confidence 5666678999999999999999998875
No 69
>PF10648 Gmad2: Immunoglobulin-like domain of bacterial spore germination; InterPro: IPR018911 This domain is found linked to IPR019606 from INTERPRO in some bacterial proteins. It is predicted to contain an immunoglobulin-like all-beta fold.
Probab=57.55 E-value=50 Score=22.33 Aligned_cols=46 Identities=13% Similarity=0.226 Sum_probs=28.7
Q ss_pred CceEEEEECCCCCcc-ceEEEE-cCC---CEE--EEEEEcCCCeeEEEEEEEC
Q psy15726 81 SEFDVIVTSPQGAAV-PVRCYQ-QKF---ANL--LAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~-~~~v~~-~~d---g~y--~v~y~P~~~G~~~i~V~~~ 126 (171)
+.+.++|.|.+|..+ .-.+.- .+. |.| ++.|.+..++.+.|.|...
T Consensus 29 gtv~~rv~D~~g~vl~e~~~~a~~g~~~~g~F~~tv~~~~~~~~~g~l~v~~~ 81 (88)
T PF10648_consen 29 GTVNIRVRDGHGEVLAEGFVTATGGAPSWGPFEGTVSFPPPPPGKGTLEVFED 81 (88)
T ss_pred eEEEEEEEcCCCcEEEEeeEEeccCCCcccceEEEEEeCCCCCCceEEEEEEe
Confidence 567777777777654 222221 111 455 5678877788999988754
No 70
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=57.36 E-value=27 Score=21.81 Aligned_cols=55 Identities=13% Similarity=0.088 Sum_probs=33.0
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~ 126 (171)
.+.+|+++.|..... .+..+.....=.+ .... .. .-.++|+...+|.|.|.|...
T Consensus 7 ~~~~g~~v~f~~~~~--~g~~~~y~W~fgd----~~~~--~~--~~~~t~ty~~~G~y~V~ltv~ 61 (69)
T PF00801_consen 7 YVPTGQPVTFTASSS--DGSPVTYSWDFGD----NGTV--ST--GSSVTHTYSSPGTYTVTLTVT 61 (69)
T ss_dssp CEBTTSEEEEEETTT--TSSECEEEEE-SS----ESEE--EC--SSEEEEEESSSEEEEEEEEEE
T ss_pred EEcCCCEEEEEEEcc--CCCCeEEEEEECC----CCcc--cc--CCCEEEEcCCCeEEEEEEEEE
Confidence 366788888888762 2344554444322 1111 11 125677888899999998764
No 71
>PF03896 TRAP_alpha: Translocon-associated protein (TRAP), alpha subunit; InterPro: IPR005595 The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across the membrane [].; GO: 0005783 endoplasmic reticulum
Probab=54.73 E-value=1.1e+02 Score=25.41 Aligned_cols=81 Identities=17% Similarity=0.155 Sum_probs=53.7
Q ss_pred cceEeCCeEEEEEEeCCCcCCceEEEEE-----CCCCC----------ccceEEEEcCCCEEEEEEEc---CCCeeEEEE
Q psy15726 61 YQARSGIVNSFTLETCGVASSEFDVIVT-----SPQGA----------AVPVRCYQQKFANLLAEFTP---TTTGVYKID 122 (171)
Q Consensus 61 ~~~~vg~~~~F~V~~~~ag~~~l~v~i~-----~p~g~----------~~~~~v~~~~dg~y~v~y~P---~~~G~~~i~ 122 (171)
.....|++++|.|.-++.|..++.|... -|..- .....|.....-++..+|+| ..++.|.|.
T Consensus 93 ~~l~aG~~~~~LvgftN~g~~~~~V~~i~aSl~~p~d~~~~iqNfTa~~y~~~V~pg~~aT~~YsF~~~~~l~pr~f~L~ 172 (285)
T PF03896_consen 93 KKLPAGEPVKFLVGFTNKGSEPFTVESIEASLRYPQDYSYYIQNFTAVRYNREVPPGEEATFPYSFTPSEELAPRPFGLV 172 (285)
T ss_pred ccccCCCeEEEEEEEEeCCCCCEEEEEEeeeecCccccceEEEeecccccCcccCCCCeEEEEEEEecchhcCCcceEEE
Confidence 4578899999999998877766665533 23211 01111222233577888888 457888888
Q ss_pred EEECCEEcCCCCeEEEecC
Q psy15726 123 VLQGARPVRGSPYLCQVYD 141 (171)
Q Consensus 123 V~~~g~~I~gSPF~v~V~d 141 (171)
|...=+...|.+|...+++
T Consensus 173 i~l~y~d~~g~~y~~~~fN 191 (285)
T PF03896_consen 173 INLIYEDSDGNQYQVTVFN 191 (285)
T ss_pred EEEEEEeCCCCEEEEEEec
Confidence 8776566788899988854
No 72
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=52.47 E-value=32 Score=20.83 Aligned_cols=26 Identities=19% Similarity=0.161 Sum_probs=18.6
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQ 125 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~ 125 (171)
....+|.|.+++.-...|.|.|.|..
T Consensus 7 t~~~~G~Ws~t~~~~~dG~y~itv~a 32 (54)
T PF13754_consen 7 TVDSDGNWSFTVPALADGTYTITVTA 32 (54)
T ss_pred EECCCCcEEEeCCCCCCccEEEEEEE
Confidence 33566888888765578888877765
No 73
>PF06832 BiPBP_C: Penicillin-Binding Protein C-terminus Family; InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=52.03 E-value=67 Score=21.21 Aligned_cols=20 Identities=15% Similarity=0.232 Sum_probs=16.9
Q ss_pred CEEEEEEEcCCCeeEEEEEE
Q psy15726 105 ANLLAEFTPTTTGVYKIDVL 124 (171)
Q Consensus 105 g~y~v~y~P~~~G~~~i~V~ 124 (171)
..+.+.|.|..+|.|+|.|.
T Consensus 59 ~~~~~~~~~~~~G~h~l~vv 78 (89)
T PF06832_consen 59 PGHQLFWQPDRPGEHTLTVV 78 (89)
T ss_pred CCCeEEeCCCCCeeEEEEEE
Confidence 35688899999999999983
No 74
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=51.93 E-value=1e+02 Score=25.77 Aligned_cols=73 Identities=14% Similarity=0.017 Sum_probs=41.1
Q ss_pred CcEEEcCCCcceEeCCeEEEEEEeC---CCcCCceEEEEECCCCCccceE-EEEc--CCCEEEEEEEc-------CCCee
Q psy15726 52 EPSGSGLGLYQARSGIVNSFTLETC---GVASSEFDVIVTSPQGAAVPVR-CYQQ--KFANLLAEFTP-------TTTGV 118 (171)
Q Consensus 52 ~v~v~G~Gl~~~~vg~~~~F~V~~~---~ag~~~l~v~i~~p~g~~~~~~-v~~~--~dg~y~v~y~P-------~~~G~ 118 (171)
.|.+.|..+. ..-|....|.+... .+....+.+.|.+.+|+.+... +... ..|.+.++|-- ...|.
T Consensus 135 ~V~v~g~~~~-l~~g~~~~~~~~l~~~~~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~ 213 (295)
T PRK05842 135 IAETDVSGAN-FDGNNKLSFSLFFDEKIDASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGN 213 (295)
T ss_pred EEEEecceEE-ecCCCceEEEEeccccccccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcc
Confidence 4455444431 11233556665443 2233578899999999866432 2222 23677777743 34688
Q ss_pred EEEEEEE
Q psy15726 119 YKIDVLQ 125 (171)
Q Consensus 119 ~~i~V~~ 125 (171)
|++.|..
T Consensus 214 Yt~~V~a 220 (295)
T PRK05842 214 YKIKAEY 220 (295)
T ss_pred eEEEEEE
Confidence 8888754
No 75
>PF12975 DUF3859: Domain of unknown function (DUF3859); InterPro: IPR024331 This entry represents a functionally uncharacterised domain, which is found in bacterial proteins.; PDB: 3N6Y_B.
Probab=51.24 E-value=91 Score=22.50 Aligned_cols=70 Identities=17% Similarity=0.193 Sum_probs=36.0
Q ss_pred ceEeCCeEEEEEEeCCCcCCc-eEEEEECC---CCCc---cceEEEEc-CCCEEEEEEE-----cCCCeeEEEEEEECCE
Q psy15726 62 QARSGIVNSFTLETCGVASSE-FDVIVTSP---QGAA---VPVRCYQQ-KFANLLAEFT-----PTTTGVYKIDVLQGAR 128 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~-l~v~i~~p---~g~~---~~~~v~~~-~dg~y~v~y~-----P~~~G~~~i~V~~~g~ 128 (171)
.+.+|..--|.+....+.+.. |++.+..| .+.. .+...... .+......|+ ...+|.+++.|..+|+
T Consensus 37 pa~lg~~FG~r~~~~~~~~~~~lt~~i~HP~ip~~~~g~~~~~~~~~v~~~~~~~~~~~f~~~~e~v~G~Wr~~i~~~g~ 116 (125)
T PF12975_consen 37 PAELGIEFGFRYNIKKAKGEKPLTVCITHPGIPDGDKGDTVEPWQGEVYVGSNDWDFYTFDTPWEPVPGEWRFTIELDGK 116 (125)
T ss_dssp E--TT-EEEEEEEES--TTS--EEEEEEEEEETTS-EEEEEEEEE---TT-S-EEEEEE--SGGG---EEEEEEEEETTE
T ss_pred chhcCCEEEEEEEeecCCCCccEEEEEeCCCCCCCCCCCEECcEEEEecCCCcceEEEEcCcccccCCceEEEEEEECCE
Confidence 366777777888777665555 99999996 4422 11111111 1123333332 3347999999999998
Q ss_pred EcC
Q psy15726 129 PVR 131 (171)
Q Consensus 129 ~I~ 131 (171)
.|.
T Consensus 117 ~i~ 119 (125)
T PF12975_consen 117 VIA 119 (125)
T ss_dssp EEE
T ss_pred EEE
Confidence 774
No 76
>KOG1693|consensus
Probab=50.77 E-value=77 Score=25.08 Aligned_cols=51 Identities=18% Similarity=0.247 Sum_probs=30.5
Q ss_pred EEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEE-EEEEEcCCCeeEEEEE
Q psy15726 69 NSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANL-LAEFTPTTTGVYKIDV 123 (171)
Q Consensus 69 ~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y-~v~y~P~~~G~~~i~V 123 (171)
+.|.......|.=+++..|++|+|+.+-- +.+. .| .+.|+....|.|+..-
T Consensus 47 ~~~~fqV~tGG~fDVD~~I~aPdgkvI~~---~~kk-~~~~~~f~ae~~G~Y~fCF 98 (209)
T KOG1693|consen 47 TSFEFQVQTGGHFDVDYDIEAPDGKVIYS---EKKK-RYDSFLFKAEGKGEYTFCF 98 (209)
T ss_pred EEEEEEEEeCCceeeEEEEECCCCCEEee---cccc-ccccEEEEEecceEEEEEe
Confidence 44444443335567778899999975432 2222 23 5677777777777653
No 77
>KOG4637|consensus
Probab=50.14 E-value=15 Score=31.77 Aligned_cols=34 Identities=21% Similarity=0.275 Sum_probs=29.2
Q ss_pred cceEEEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726 95 VPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGAR 128 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~ 128 (171)
+.-.+.|..||+|.|.=.-+..|+|++.++.+|.
T Consensus 35 v~~~L~d~PDGsFlVRdAstm~GdYTLtl~k~g~ 68 (464)
T KOG4637|consen 35 VNKKLRDQPDGSFLVRDASTMQGDYTLTLRKGGN 68 (464)
T ss_pred HHHHhcCCCCCcEEeeccccCCCceEEEEecCCc
Confidence 3445788999999999888899999999999985
No 78
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=49.39 E-value=1.8e+02 Score=25.30 Aligned_cols=77 Identities=18% Similarity=0.107 Sum_probs=48.7
Q ss_pred eEeCCeEEEEEEeCCCcC----------CceEEEEECCCCC------ccceEE---EE--------cCCCEEEEEEE-cC
Q psy15726 63 ARSGIVNSFTLETCGVAS----------SEFDVIVTSPQGA------AVPVRC---YQ--------QKFANLLAEFT-PT 114 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~----------~~l~v~i~~p~g~------~~~~~v---~~--------~~dg~y~v~y~-P~ 114 (171)
...|++..|+....+.+. -+++|++.+.+.. ..+..+ .| -+||.|+..+. -.
T Consensus 102 l~~gE~lk~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l~l~~ 181 (374)
T TIGR03503 102 LFQGETLKVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEFNLDV 181 (374)
T ss_pred ccCCCeEEEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEeeccC
Confidence 556888888887765432 1344555543221 112212 12 26899999996 44
Q ss_pred CCeeEEEEEEECC----------EEcCCCCeEEEe
Q psy15726 115 TTGVYKIDVLQGA----------RPVRGSPYLCQV 139 (171)
Q Consensus 115 ~~G~~~i~V~~~g----------~~I~gSPF~v~V 139 (171)
.+|.|.+.|..++ ..|.-.|+.+.+
T Consensus 182 ~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~ 216 (374)
T TIGR03503 182 APGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTV 216 (374)
T ss_pred CCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEE
Confidence 6999999999876 356677888777
No 79
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=48.92 E-value=92 Score=21.88 Aligned_cols=102 Identities=15% Similarity=0.128 Sum_probs=63.3
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC--EEEEEEEcCCCeeEEEEEEE----CCEEcCCCCe
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA--NLLAEFTPTTTGVYKIDVLQ----GARPVRGSPY 135 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg--~y~v~y~P~~~G~~~i~V~~----~g~~I~gSPF 135 (171)
...+|+.+.+.|.+... .+.+...|.+.+ +.+.....+..++ ++.+.-++.......|.+.+ +|+.+.+| -
T Consensus 9 ~~~~Ge~~~v~v~~~~~-~~~~~~~v~s~g-~I~~~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v~~~~g~~~~~s-~ 85 (136)
T PF07703_consen 9 SYKPGETAKVTVQSPFP-NGTFLYLVESRG-KIVSTGSVELKNGSTTFEFPVTPDMAPNFYVLAYYVRPADGEVVADS-V 85 (136)
T ss_dssp SB-TTSEEEEEEEEESC-ESEEEEEEEETT-EEEEEEEEECTTTSSEEEEEE-GGGTSEEEEEEEEETTCTCEEEEEE-E
T ss_pred CcCCCCEEEEEEEcCCC-ccEEEEEEEECC-eEEEEEEEEecCCcEEEEEecchhcCCcEEEEEEEEcCCCCeEEEEE-E
Confidence 47899999999998876 356777777644 4445444444443 56777777766666666654 55666655 3
Q ss_pred EEEecC--CCceEEecCCcccEEcCCCEEEEEEc
Q psy15726 136 LCQVYD--ASKVKIEHKGLSNIVVNDPISFKCKS 167 (171)
Q Consensus 136 ~v~V~d--~skv~~~G~gl~~~~~g~~~~F~Vd~ 167 (171)
.+.|.. ..++.+.-.- +...-|++++++|.+
T Consensus 86 ~i~V~~~~~~~v~l~~~~-~~~~Pg~~~~~~i~~ 118 (136)
T PF07703_consen 86 WIEVEPCFELKVELTASP-DEYKPGEEVTLRIKA 118 (136)
T ss_dssp EEEBGCSGSSSEEEEESS-SSBTTTSEEEEEEEE
T ss_pred EEEecccccceEEEEEec-ceeCCCCEEEEEEEe
Confidence 444422 3456655421 356778888888876
No 80
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=46.73 E-value=71 Score=19.96 Aligned_cols=14 Identities=14% Similarity=0.135 Sum_probs=8.2
Q ss_pred CCCeeEEEEEEECC
Q psy15726 114 TTTGVYKIDVLQGA 127 (171)
Q Consensus 114 ~~~G~~~i~V~~~g 127 (171)
..+|.|.|.|+..|
T Consensus 32 l~~G~~~v~v~~~G 45 (71)
T PF08308_consen 32 LPPGEHTVTVEKPG 45 (71)
T ss_pred cCCccEEEEEEECC
Confidence 34566666666555
No 81
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=46.37 E-value=78 Score=20.31 Aligned_cols=62 Identities=16% Similarity=0.268 Sum_probs=31.2
Q ss_pred EeCCeEEEEEEeCCCcC---CceEEEEECCCCCc---cceEEEEcCCC-EEEE--EEEcCC---CeeEEEEEEE
Q psy15726 64 RSGIVNSFTLETCGVAS---SEFDVIVTSPQGAA---VPVRCYQQKFA-NLLA--EFTPTT---TGVYKIDVLQ 125 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag~---~~l~v~i~~p~g~~---~~~~v~~~~dg-~y~v--~y~P~~---~G~~~i~V~~ 125 (171)
..|+...|++...+.|. .++.+++..|.|=. -+..+.....| +-.+ .-+|.+ +|.|.|.+..
T Consensus 2 ~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~~~~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a 75 (78)
T PF10633_consen 2 TPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVSASPASVPSLPPGESVTVTFTVTVPADAAPGTYTVTVTA 75 (78)
T ss_dssp -TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE---EEEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred CCCCEEEEEEEEEECCCCceeeEEEEEeCCCCccccCCccccccCCCCCEEEEEEEEECCCCCCCceEEEEEEE
Confidence 35777778777765543 46778888898733 23333333332 3344 444432 6999888764
No 82
>PF09912 DUF2141: Uncharacterized protein conserved in bacteria (DUF2141); InterPro: IPR018673 This family of conserved hypothetical proteins has no known function.
Probab=45.98 E-value=1e+02 Score=21.63 Aligned_cols=28 Identities=21% Similarity=0.285 Sum_probs=21.9
Q ss_pred EEEcCCCEEEEEEEcCCCeeEEEEEEEC
Q psy15726 99 CYQQKFANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 99 v~~~~dg~y~v~y~P~~~G~~~i~V~~~ 126 (171)
.....+|..+++|.-..+|.|.|.|.++
T Consensus 35 ~~~~~~~~~~~~f~~lp~G~YAi~v~hD 62 (112)
T PF09912_consen 35 KVPAKGGTVTITFEDLPPGTYAIAVFHD 62 (112)
T ss_pred EEEcCCCcEEEEECCCCCccEEEEEEEe
Confidence 3444557888888877899999999886
No 83
>PF04571 Lipin_N: lipin, N-terminal conserved region; InterPro: IPR007651 Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. This entry represents a conserved domain found at the N terminus of the member proteins [, ].
Probab=45.89 E-value=91 Score=22.24 Aligned_cols=44 Identities=16% Similarity=0.208 Sum_probs=29.1
Q ss_pred cceEEEEcCCCEEEE-----EE---EcCCCeeEEEEEEECCEEcCCCCeEEEecC
Q psy15726 95 VPVRCYQQKFANLLA-----EF---TPTTTGVYKIDVLQGARPVRGSPYLCQVYD 141 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v-----~y---~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d 141 (171)
+++-+.++.||+|.. +| .-..+.+-.|+|..||+.+ +|.+++.+
T Consensus 28 iDVIVV~q~DGs~~sSPFhVRFGk~~vl~~~ek~V~I~VNG~~~---~~~MkLg~ 79 (110)
T PF04571_consen 28 IDVIVVEQPDGSLKSSPFHVRFGKLGVLRPREKVVDIEVNGKPV---DFHMKLGE 79 (110)
T ss_pred eeEEEEecCCCCEecCccEEEEcceeeecccCcEEEEEECCEEc---ceEEEECC
Confidence 455566666776543 33 2223556789999999997 58888744
No 84
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=45.85 E-value=1.1e+02 Score=21.81 Aligned_cols=37 Identities=19% Similarity=0.341 Sum_probs=25.2
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEec
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVY 140 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~ 140 (171)
.-+.||+|.+.=.| +|.|.|+|..-+-.- +|++|.|.
T Consensus 25 ~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F--~~~RVdV~ 61 (123)
T PF09430_consen 25 FVRSDGSFVFHNVP--PGSYLLEVHSPDYVF--PPYRVDVS 61 (123)
T ss_pred EecCCCEEEeCCCC--CceEEEEEECCCccc--cCEEEEEe
Confidence 34556777766655 589999988765443 45888875
No 85
>PF13954 PapC_N: PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=45.02 E-value=25 Score=25.77 Aligned_cols=20 Identities=10% Similarity=-0.052 Sum_probs=16.7
Q ss_pred CCCCceeEEEEEEECCEEcC
Q psy15726 17 SWISYGLVQARTLREETSFR 36 (171)
Q Consensus 17 ~P~~~G~~~i~V~~~g~~I~ 36 (171)
+...||.|.++|.+||+.+.
T Consensus 25 ~~~~pG~Y~vdv~vN~~~~~ 44 (146)
T PF13954_consen 25 NAIPPGEYSVDVYVNGKFIG 44 (146)
T ss_dssp -SS-SEEEEEEEEETTEEEE
T ss_pred CCCCCeEEEEEEEECCeeee
Confidence 57889999999999999885
No 86
>PF00116 COX2: Cytochrome C oxidase subunit II, periplasmic domain This family corresponds to chains b and o.; InterPro: IPR002429 Cytochrome c oxidase (1.9.3.1 from EC) [, ] is an oligomeric enzymatic complex which is a component of the respiratory chain and is involved in the transfer of electrons from cytochrome c to oxygen. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. The number of polypeptides in the complex ranges from 3-4 (prokaryotes), up to 13(mammals). Subunit 2 (CO II) transfers the electrons from cytochrome c to the catalytic subunit 1. It contains two adjacent transmembrane regions in its N terminus and the major part of the protein is exposed to the periplasmic or to the mitochondrial intermembrane space, respectively. CO II provides the substrate-binding site and contains a copper centre called Cu(A), probably the primary acceptor in cytochrome c oxidase. An exception is the corresponding subunit of the cbb3-type oxidase which lacks the copper A redox-centre. Several bacterial CO II have a C-terminal extension that contains a covalently bound haem c.; GO: 0004129 cytochrome-c oxidase activity, 0005507 copper ion binding, 0016020 membrane; PDB: 3OMN_D 3OMA_B 3OMI_D 3OM3_B 3EHB_B 1AR1_B 1QLE_B 3HB3_B 2IWK_B 2IWF_A ....
Probab=44.53 E-value=1.1e+02 Score=21.68 Aligned_cols=53 Identities=21% Similarity=0.248 Sum_probs=34.0
Q ss_pred ceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEE
Q psy15726 62 QARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKID 122 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~ 122 (171)
.++.|++..|.+.+.|.- ..+.+ |+ ...++.-....+-.+.|+|.++|.|.+.
T Consensus 49 ~lp~g~~v~~~ltS~DVi-Hsf~i----p~---~~~k~d~~PG~~~~~~~~~~~~G~y~~~ 101 (120)
T PF00116_consen 49 VLPAGQPVRFHLTSEDVI-HSFWI----PE---LGIKMDAIPGRTNSVTFTPDKPGTYYGQ 101 (120)
T ss_dssp EEETTSEEEEEEEESSS--EEEEE----TT---CTEEEEEBTTCEEEEEEEESSSEEEEEE
T ss_pred cccccceEeEEEEcCCcc-ccccc----cc---cCcccccccccceeeeeeeccCCcEEEc
Confidence 378899999999987752 22222 22 1233333334566788999999998764
No 87
>PF12863 DUF3821: Domain of unknown function (DUF3821); InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=43.79 E-value=1.6e+02 Score=23.32 Aligned_cols=32 Identities=28% Similarity=0.384 Sum_probs=23.7
Q ss_pred ceEeCCeEEEEEEeCC--------CcCC--ceEEEEECCCCC
Q psy15726 62 QARSGIVNSFTLETCG--------VASS--EFDVIVTSPQGA 93 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~--------ag~~--~l~v~i~~p~g~ 93 (171)
.+..|....|.|++.= .... -+.+.|++|+|.
T Consensus 101 ~V~~G~~v~FrI~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~ 142 (209)
T PF12863_consen 101 TVPRGDNVNFRIDTNLYSIFQRGGYTPGDGPVDIKVTTPSGA 142 (209)
T ss_pred eeccCCeEEEEEcccHHHHhhcCCCCCCcceEEEEEeCCCCc
Confidence 3778999999998851 1122 388999999985
No 88
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=43.14 E-value=79 Score=19.50 Aligned_cols=61 Identities=11% Similarity=0.197 Sum_probs=34.6
Q ss_pred EEEECCCCCccce-EEEEcCCCEEEEEEEcCCCeeEEEEEEE--CCEEcCCCCeEEEecCCCceEE
Q psy15726 85 VIVTSPQGAAVPV-RCYQQKFANLLAEFTPTTTGVYKIDVLQ--GARPVRGSPYLCQVYDASKVKI 147 (171)
Q Consensus 85 v~i~~p~g~~~~~-~v~~~~dg~y~v~y~P~~~G~~~i~V~~--~g~~I~gSPF~v~V~d~skv~~ 147 (171)
+.+.+.++..... ...-..+|.|.+.-. .+|.|.|.-.- .|-.+...++.+.+.+-+++++
T Consensus 6 f~L~~~~~~~~~~~~~~Td~~G~~~f~~L--~~G~Y~l~E~~aP~GY~~~~~~~~~~i~~~~~~~v 69 (70)
T PF05738_consen 6 FELYDEDGNEVIEVTVTTDENGKYTFKNL--PPGTYTLKETKAPDGYQLDDTPYEFTITEDGDVTV 69 (70)
T ss_dssp EEEEETTSEEEEEEEEEGGTTSEEEEEEE--ESEEEEEEEEETTTTEEEEECEEEEEECTTSCEEE
T ss_pred EEEEECCCCEEEEEEEEECCCCEEEEeec--CCeEEEEEEEECCCCCEECCCceEEEEecCCEEEE
Confidence 3444444432221 344455676655543 57888887655 5666777777777655555443
No 89
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=40.97 E-value=1.1e+02 Score=26.47 Aligned_cols=60 Identities=17% Similarity=0.099 Sum_probs=37.0
Q ss_pred eCCeEEEEEEeCCCc-------------------------CCceEEEEECCCCCccce-E--E---------EEcCCCEE
Q psy15726 65 SGIVNSFTLETCGVA-------------------------SSEFDVIVTSPQGAAVPV-R--C---------YQQKFANL 107 (171)
Q Consensus 65 vg~~~~F~V~~~~ag-------------------------~~~l~v~i~~p~g~~~~~-~--v---------~~~~dg~y 107 (171)
.+.+.+|+++.+... ...+.-.|.|.+|..++. . + ....+|.|
T Consensus 254 ~~~~~~~T~El~~~k~p~~~~l~~~w~~n~~~ll~~i~q~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~~~T~~~G~y 333 (376)
T cd03866 254 WAQCFEITLELSCCKYPPEEQLPAFWEDNKAALIEYIKQVHLGVKGQVFDSNGNPIPNAIVEVKGRKHICPYRTNVNGEY 333 (376)
T ss_pred hCceEEEEEEecCCCCCCHHHHHHHHHHhHHHHHHHHHHhcCceEEEEECCCCCccCCeEEEEEcCCceeEEEECCCceE
Confidence 467778888887531 134666788888876652 2 2 23345666
Q ss_pred EEEEEcCCCeeEEEEEEECC
Q psy15726 108 LAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 108 ~v~y~P~~~G~~~i~V~~~g 127 (171)
.. ...+|.|.|.|...|
T Consensus 334 ~~---~l~pG~Y~v~vsa~G 350 (376)
T cd03866 334 FL---LLLPGKYMINVTAPG 350 (376)
T ss_pred EE---ecCCeeEEEEEEeCC
Confidence 22 255777777777666
No 90
>KOG1690|consensus
Probab=39.25 E-value=89 Score=24.79 Aligned_cols=46 Identities=22% Similarity=0.066 Sum_probs=28.9
Q ss_pred CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
-.+.|.|.+|-.+.- .-+.+.....=.++|+...+|+|+|.+.-+-
T Consensus 65 ~gm~VeV~e~fdnnh-~Vl~q~~ss~G~ftFta~~~GeH~IC~~s~s 110 (215)
T KOG1690|consen 65 IGMHVEVKETFDNNH-VVLSQQYSSEGDFTFTALTPGEHRICIQSNS 110 (215)
T ss_pred ceEEEEeecCCCCce-EEEeecCCCCCceEEEccCCCceEEEEeccc
Confidence 346777877765421 1122222233468899999999999987543
No 91
>COG5489 Uncharacterized conserved protein [Function unknown]
Probab=39.00 E-value=66 Score=22.74 Aligned_cols=40 Identities=8% Similarity=0.060 Sum_probs=28.5
Q ss_pred CCCcCCceEEEEECCCC-CccceEEEEc-CCCEEEEEEEcCC
Q psy15726 76 CGVASSEFDVIVTSPQG-AAVPVRCYQQ-KFANLLAEFTPTT 115 (171)
Q Consensus 76 ~~ag~~~l~v~i~~p~g-~~~~~~v~~~-~dg~y~v~y~P~~ 115 (171)
++.|..-|+|++.+|+- ..+.+.+..- ++++|.+-|.+..
T Consensus 63 s~~G~dYlsvkLddP~f~~~i~A~L~~~e~~~~~~liW~rp~ 104 (107)
T COG5489 63 SNSGRDYLSVKLDDPSFGAPIYANLFPAEGEGTYALIWNRPK 104 (107)
T ss_pred cccCcceEEEEecCCcCCCeeEeeeeecCCCCcEEEEecCCC
Confidence 45566789999999994 4555665443 4788998887543
No 92
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=37.16 E-value=1.9e+02 Score=22.19 Aligned_cols=85 Identities=14% Similarity=0.026 Sum_probs=44.7
Q ss_pred CCCceeEEEEEEECCEEcCCCCeEEEecCC---CCCCCcEEEc-CCCcceEeCCeEEEEEEeCCC----cCCceEEEEEC
Q psy15726 18 WISYGLVQARTLREETSFRSCPMEVPVVDP---AVGREPSGSG-LGLYQARSGIVNSFTLETCGV----ASSEFDVIVTS 89 (171)
Q Consensus 18 P~~~G~~~i~V~~~g~~I~gSPf~v~V~~~---~d~~~v~v~G-~Gl~~~~vg~~~~F~V~~~~a----g~~~l~v~i~~ 89 (171)
.-|+|.+.+.|...... -.+..+.|... .+..-+++.- .+-....-.++.-+..+-++. -++.+.+.|.+
T Consensus 74 ~ae~G~W~y~i~~~~~~--~q~v~vtVtS~~~~~~~~Pitv~~~~~~~~~~~~~p~~IyA~V~qg~~PVlgA~V~A~V~~ 151 (179)
T PF09315_consen 74 TAEVGTWTYSITNTSSS--SQTVTVTVTSRASSPTVPPITVTAWLSQSSVNFPSPVIIYAEVQQGYLPVLGANVTATVES 151 (179)
T ss_pred CcccccEEEEEecCCCC--cceEEEEEEeeECCCCCCCEEEEEEECCCCcCCCCcEEEEEEEEECCeeeeCCEEEEEEcC
Confidence 37899999988533321 23456666532 1111222221 110111112343333333332 23788899999
Q ss_pred CCCCccceEEEEcCC
Q psy15726 90 PQGAAVPVRCYQQKF 104 (171)
Q Consensus 90 p~g~~~~~~v~~~~d 104 (171)
|+|.....++.|++.
T Consensus 152 ~~g~~~~l~L~DnG~ 166 (179)
T PF09315_consen 152 PSGNTVTLELLDNGA 166 (179)
T ss_pred CCCCEEEEEEEcCCC
Confidence 999988888888754
No 93
>KOG3287|consensus
Probab=37.12 E-value=1.8e+02 Score=23.46 Aligned_cols=58 Identities=17% Similarity=0.236 Sum_probs=34.0
Q ss_pred eEeCCeEEEEEEeC-CCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEE
Q psy15726 63 ARSGIVNSFTLETC-GVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDV 123 (171)
Q Consensus 63 ~~vg~~~~F~V~~~-~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V 123 (171)
+..|...++..+-. ++|..++..++.+|-|..+--+ ....+|.|+++ ++++|.|++..
T Consensus 52 v~~~~tle~eyQVi~G~GDl~i~Ftl~~P~G~~lv~~-q~k~dg~ht~e--~~e~GdY~~Cf 110 (236)
T KOG3287|consen 52 VPQGATLEVEYQVIDGAGDLDIDFTLLNPAGEVLVSD-QRKVDGVHTVE--VTETGDYQVCF 110 (236)
T ss_pred ccCCeEEEEEEEEEecCCccceeeEEeCCCccEEeec-ccccCceeEee--ccCCcceEEEE
Confidence 34443334444333 4566788888999998432211 12345666654 67789998765
No 94
>PF09394 Inhibitor_I42: Chagasin family peptidase inhibitor I42; InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=36.34 E-value=68 Score=21.08 Aligned_cols=22 Identities=14% Similarity=0.083 Sum_probs=19.0
Q ss_pred CEEEEEEEcCCCeeEEEEEEEC
Q psy15726 105 ANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 105 g~y~v~y~P~~~G~~~i~V~~~ 126 (171)
|+....|.+..+|...|...|.
T Consensus 53 g~~~f~f~a~~~G~~~i~~~y~ 74 (92)
T PF09394_consen 53 GTRTFTFKALKPGTTTIKFEYR 74 (92)
T ss_dssp EEEEEEEEESSSEEEEEEEEEE
T ss_pred cEEEEEEEEecCeeEEEEEEEE
Confidence 6788899999999999998774
No 95
>KOG1366|consensus
Probab=34.31 E-value=1.9e+02 Score=29.72 Aligned_cols=71 Identities=17% Similarity=0.180 Sum_probs=50.0
Q ss_pred EeCCeEEEEEEeCCC--cCCceEEEEECCCCCccceEE-EEcCCCEEEEEEE---cCCCeeEEEEEEECCEEcCCCC
Q psy15726 64 RSGIVNSFTLETCGV--ASSEFDVIVTSPQGAAVPVRC-YQQKFANLLAEFT---PTTTGVYKIDVLQGARPVRGSP 134 (171)
Q Consensus 64 ~vg~~~~F~V~~~~a--g~~~l~v~i~~p~g~~~~~~v-~~~~dg~y~v~y~---P~~~G~~~i~V~~~g~~I~gSP 134 (171)
..|+...|.|-+.+. ...++.|.+.+|+|+.+.-.. .++..|.|.-+|. ....|.++|.|.+.+++=..++
T Consensus 127 ~pgqtV~~Rvv~~~~~~~~~~~~v~~~dp~~~~ikq~~~~~~~~gv~~~~f~ls~~~~~G~w~I~v~~~~~~~~~~~ 203 (1436)
T KOG1366|consen 127 KPGQTVRYRVVTLNLKLKPFKVTVYIEDPEGNRIKQWLSVQSDGGVYSLSFQLSSEPILGDWKIRVQYKNGLQTKSS 203 (1436)
T ss_pred CCCCEEEEEEEEeccccccccceEEEECCCCCchhhhhhccccCceEEeeecccCCCcCCceEEEEEEcCCCccccc
Confidence 346666777766432 123388999999998765433 3455688988886 3346999999999988777776
No 96
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=33.83 E-value=2.7e+02 Score=22.89 Aligned_cols=27 Identities=19% Similarity=0.203 Sum_probs=20.1
Q ss_pred EEcCCCEEEEEEEc-----CCCeeEEEEEEEC
Q psy15726 100 YQQKFANLLAEFTP-----TTTGVYKIDVLQG 126 (171)
Q Consensus 100 ~~~~dg~y~v~y~P-----~~~G~~~i~V~~~ 126 (171)
....+|.|.+.|+| ...|.|.+.+..-
T Consensus 193 ~~~~~g~~t~~~~~~~~~~~~d~~~~v~v~~t 224 (317)
T PF13753_consen 193 TVGADGTWTVTVTPSDLAGLADGTYTVTVTVT 224 (317)
T ss_pred eecCCCcccccccccccccccCceEEEEEEee
Confidence 33467899999988 4568888887754
No 97
>PF03345 DDOST_48kD: Oligosaccharyltransferase 48 kDa subunit beta; InterPro: IPR005013 During N-linked glycosylation of proteins, oligosaccharide chains are assembled on the carrier molecule dolichyl pyrophosphate in the following order: 2 molecules of N-acetylglucosamine (GlcNAc), 9 molecules of mannose, and 3 molecules of glucose. These 14-residue oligosaccharide cores are then transferred to asparagine residues on nascent polypeptide chains in the endoplasmic reticulum (ER). As proteins progress through the Golgi apparatus, the oligosaccharide cores are modified by trimming and extension to generate a diverse array of glycosylated proteins [, ]. The oligosaccharyl transferase complex (OST complex) 2.4.1.119 from EC transfers 14-sugar branched oligosaccharides from dolichyl pyrophosphate to asparagine residues []. The complex contains nine protein subunits: Ost1p, Ost2p, Ost3p, Ost4p, Ost5p, Ost6p, Stt3p, Swp1p, and Wbp1p, all of which are integral membrane proteins of the ER. The OST complex interacts with the Sec61p pore complex [] involved in protein import into the ER. This entry represents subunits OST3 and OST6. OST3 is homologous to OST6 [], and several lines of evidence indicate that they are alternative members of the OST complex. Disruption of both OST3 and OST6 causes severe underglycosylation of soluble and membrane-bound glycoproteins and a defect in the assembly of the complex. Hence, the function of these genes seems to be essential for recruiting a fully active complex necessary for efficient N-glycosylation []. This entry also includes the magnesium transporter protein 1, also known as OST3 homologue B, which might be involved in N-glycosylation through its association with the oligosaccharyl transferase (OST) complex. Wbp1p is the beta subunit of the OST complex, one of the original six subunits purified []. Wbp1 is essential [, ], but conditional mutants have decreased transferase activity [, ]. Wbp1p is homologous to mammalian OST48 [].; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0018279 protein N-linked glycosylation via asparagine, 0005789 endoplasmic reticulum membrane
Probab=33.73 E-value=1.8e+02 Score=25.71 Aligned_cols=61 Identities=15% Similarity=0.229 Sum_probs=38.7
Q ss_pred eEeCCeEEEEEEeCCCc--------CCceE--EEEECCCCCccceEEEE----cCCCEEEEEEE-cCCCeeEEEEEEEC
Q psy15726 63 ARSGIVNSFTLETCGVA--------SSEFD--VIVTSPQGAAVPVRCYQ----QKFANLLAEFT-PTTTGVYKIDVLQG 126 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag--------~~~l~--v~i~~p~g~~~~~~v~~----~~dg~y~v~y~-P~~~G~~~i~V~~~ 126 (171)
.+++..+.|.|+...-. ..+++ ..+.||= ++..+.. .+.|+|..+|+ |..=|.|+..|.|.
T Consensus 279 Y~I~D~v~ysi~i~e~~~~~WvPf~~dDiQlEf~mLDPy---~R~~L~~~~~~~~~~~Y~~~FklPD~hGVF~F~vdY~ 354 (423)
T PF03345_consen 279 YRIKDDVEYSIEISEWNNGKWVPFKADDIQLEFVMLDPY---VRLTLKPSYSTDDNGTYSTTFKLPDVHGVFTFKVDYK 354 (423)
T ss_pred eEECCcEEEEEEEEEEeCCeEecCCCCcEEEEEEEcCcE---EEcccccccccCCCCEEEEEEECCCccceEEEEEEEe
Confidence 55666666666654311 13444 4455553 2222322 46699999997 88899999999986
No 98
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=33.71 E-value=2e+02 Score=21.47 Aligned_cols=109 Identities=13% Similarity=0.116 Sum_probs=55.9
Q ss_pred CCCCceeEEEEE-EE----CCEEcCCCCeEEEecCCCCCCCcEE-EcCCCcc-eEeCCeEEEEEEeCCCcCC-ce-EEEE
Q psy15726 17 SWISYGLVQART-LR----EETSFRSCPMEVPVVDPAVGREPSG-SGLGLYQ-ARSGIVNSFTLETCGVASS-EF-DVIV 87 (171)
Q Consensus 17 ~P~~~G~~~i~V-~~----~g~~I~gSPf~v~V~~~~d~~~v~v-~G~Gl~~-~~vg~~~~F~V~~~~ag~~-~l-~v~i 87 (171)
+-...|.|.+.+ +- ++..-....+.+.+ +.. +..+.+ .|..... ..+.......|...|.... .+ .+.+
T Consensus 10 ~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~i-D~T-~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~~i~sv~l 87 (158)
T PF13750_consen 10 STLPDGSYTLTVVTATDAAGNTSTSTVSETFTI-DNT-PPTISISDGASVANGSTVYGLVNISINVTDNSDDSKITSVSL 87 (158)
T ss_pred CcCCCccEEEEEEEEEecCCCEEEEEEeeEEEE-cCC-CCEEEEecCCccCCCccccceeeeEEEEEeCCCCceEEEEEE
Confidence 445679999988 32 22221112223333 221 223444 2223322 3444455666766665332 23 3445
Q ss_pred EC-CCCCccceEEEEcCCCEEEEEEEcC-----CCeeEEEEEEECC
Q psy15726 88 TS-PQGAAVPVRCYQQKFANLLAEFTPT-----TTGVYKIDVLQGA 127 (171)
Q Consensus 88 ~~-p~g~~~~~~v~~~~dg~y~v~y~P~-----~~G~~~i~V~~~g 127 (171)
.+ |....+.......++|.|...|... .-+.|.|.|.-.+
T Consensus 88 ~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D 133 (158)
T PF13750_consen 88 TGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATD 133 (158)
T ss_pred ECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEe
Confidence 43 3333344445577889999988632 3457999988644
No 99
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=33.42 E-value=1.4e+02 Score=19.42 Aligned_cols=50 Identities=14% Similarity=0.119 Sum_probs=29.4
Q ss_pred ceEEEEECCCC-CccceEEEEcCCC-EEEEEEEcCCCeeEEEEEEECCEEcCCC
Q psy15726 82 EFDVIVTSPQG-AAVPVRCYQQKFA-NLLAEFTPTTTGVYKIDVLQGARPVRGS 133 (171)
Q Consensus 82 ~l~v~i~~p~g-~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~V~~~g~~I~gS 133 (171)
...+.+.=|.. ..+.+.|.|...- .+++...+..+|.|. +.|+|+.-.|.
T Consensus 13 ~~~~~~~l~~~a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~--~~WdG~d~~G~ 64 (81)
T PF13860_consen 13 KGSIEYTLPEDADNVTVTIYDSNGQVVRTISLGSQSAGEHS--FTWDGKDDDGN 64 (81)
T ss_dssp EEEEEEEECSSCEEEEEEEEETTS-EEEEEEEEECSSEEEE--EEE-SB-TTS-
T ss_pred EEEEEEeCCCcccEEEEEEEcCCCCEEEEEEcCCcCCceEE--EEECCCCCCcC
Confidence 45555333333 3467888887443 367888888999865 67886554444
No 100
>PRK10737 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=33.37 E-value=1.3e+02 Score=23.49 Aligned_cols=56 Identities=14% Similarity=0.156 Sum_probs=33.6
Q ss_pred ceEeCCeEEEEEEeCCCcC------------------Cc----eEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCee
Q psy15726 62 QARSGIVNSFTLETCGVAS------------------SE----FDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGV 118 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag~------------------~~----l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~ 118 (171)
...+|...+|+|...+|-+ .. ..+....++| .+++.|++..+...++-|.-.-+|.
T Consensus 51 G~~~Gd~~~v~l~peeAyGe~d~~lV~~vpr~~F~~~~~l~~G~~~~~~~~~G-~~~~~V~ev~~d~V~vD~NHPLAG~ 128 (196)
T PRK10737 51 GHEVGDKFDVAVGANDAYGQYDENLVQRVPKDVFMGVDELQVGMRFLAETDQG-PVPVEITAVEDDHVVVDGNHMLAGQ 128 (196)
T ss_pred CCCCCCEEEEEEChHHhcCCCChHHEEEecHHHCCCccCCCCCCEEEEeCCCC-cEEEEEEEEcCCEEEEECCCcCCCC
Confidence 4678899999998766521 00 1233344555 3566666666666677776555554
No 101
>KOG0393|consensus
Probab=33.35 E-value=45 Score=26.28 Aligned_cols=32 Identities=16% Similarity=0.181 Sum_probs=25.7
Q ss_pred eeEEEEeecc------ccCCCCceeEEEEEEEC-CEEcC
Q psy15726 5 TKSCILLSEK------APSWISYGLVQARTLRE-ETSFR 36 (171)
Q Consensus 5 ~~~~~~~~~~------~~~P~~~G~~~i~V~~~-g~~I~ 36 (171)
-|+||+++.. +|.|+.-..|..+|..+ |.+|.
T Consensus 16 GKT~ll~~~t~~~fp~~yvPTVFdnys~~v~V~dg~~v~ 54 (198)
T KOG0393|consen 16 GKTCLLISYTTNAFPEEYVPTVFDNYSANVTVDDGKPVE 54 (198)
T ss_pred CceEEEEEeccCcCcccccCeEEccceEEEEecCCCEEE
Confidence 3788887766 58999999999999985 77664
No 102
>PF02922 CBM_48: Carbohydrate-binding module 48 (Isoamylase N-terminal domain); InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=32.76 E-value=1.3e+02 Score=19.13 Aligned_cols=59 Identities=8% Similarity=-0.037 Sum_probs=40.3
Q ss_pred CeEEEEEEeCCCcCCceEEEEECCC-CCccceEEE-EcCCCEEEEEEE-cCCCe--eEEEEEEECC
Q psy15726 67 IVNSFTLETCGVASSEFDVIVTSPQ-GAAVPVRCY-QQKFANLLAEFT-PTTTG--VYKIDVLQGA 127 (171)
Q Consensus 67 ~~~~F~V~~~~ag~~~l~v~i~~p~-g~~~~~~v~-~~~dg~y~v~y~-P~~~G--~~~i~V~~~g 127 (171)
..+.|.+=+.+|. .+.+.+...+ ......... ...+|.|.++-. ....| .|...|...+
T Consensus 11 ~~~~F~vwaP~A~--~V~l~~~~~~~~~~~~~~m~~~~~~G~w~~~~~~~~~~g~~~Y~y~i~~~~ 74 (85)
T PF02922_consen 11 GGVTFRVWAPNAK--SVELVLYFNGSWPAEEYPMTRKDDDGVWEVTVPGDLPPGGYYYKYRIDGDD 74 (85)
T ss_dssp TEEEEEEE-TTES--EEEEEEETTTSSEEEEEEEEEECTTTEEEEEEEGCGTTTT-EEEEEEEETT
T ss_pred CEEEEEEECCCCC--EEEEEEEeeecCCCceEEeeecCCCCEEEEEEcCCcCCCCEEEEEEEEeCC
Confidence 5788999988774 6777777766 222334455 467899999987 34444 7888888776
No 103
>PF03422 CBM_6: Carbohydrate binding module (family 6); InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=31.74 E-value=1.7e+02 Score=20.06 Aligned_cols=46 Identities=11% Similarity=0.141 Sum_probs=25.5
Q ss_pred CceEEEEECCCCCcc-ceEEEEcCC-CEE---EEEEEcCCCeeEEEEEEECC
Q psy15726 81 SEFDVIVTSPQGAAV-PVRCYQQKF-ANL---LAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~-~~~v~~~~d-g~y---~v~y~P~~~G~~~i~V~~~g 127 (171)
+.+++.+.+++|..+ .+.+...++ ..| .+.. ....|.|.|.+.+.+
T Consensus 60 ~~~~l~id~~~g~~~~~~~~~~tg~w~~~~~~~~~v-~l~~G~h~i~l~~~~ 110 (125)
T PF03422_consen 60 GTIELRIDGPDGTLIGTVSLPPTGGWDTWQTVSVSV-KLPAGKHTIYLVFNG 110 (125)
T ss_dssp EEEEEEETTTTSEEEEEEEEE-ESSTTEEEEEEEEE-EEESEEEEEEEEESS
T ss_pred cEEEEEECCCCCcEEEEEEEcCCCCccccEEEEEEE-eeCCCeeEEEEEEEC
Confidence 567777777666432 233322222 233 2222 334499999998876
No 104
>PF07563 DUF1541: Protein of unknown function (DUF1541); InterPro: IPR011438 This domain is found in several hypothetical bacterial proteins as a tandem repeat.; PDB: 2KY9_A 4FIB_B.
Probab=31.65 E-value=1e+02 Score=19.00 Aligned_cols=16 Identities=31% Similarity=0.501 Sum_probs=11.2
Q ss_pred CEEEEEEEcCCCeeEE
Q psy15726 105 ANLLAEFTPTTTGVYK 120 (171)
Q Consensus 105 g~y~v~y~P~~~G~~~ 120 (171)
-.|.|.|+|+.-|+-.
T Consensus 30 tvY~V~ytPt~gg~~v 45 (53)
T PF07563_consen 30 TVYMVDYTPTTGGEKV 45 (53)
T ss_dssp EEEEEEEE-TTT-SEE
T ss_pred eEEEEEEEECCCCcEe
Confidence 3699999999988743
No 105
>COG1622 CyoA Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]
Probab=31.42 E-value=2.6e+02 Score=22.71 Aligned_cols=69 Identities=16% Similarity=0.004 Sum_probs=45.1
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEE-EEECCEEcCCCCeEEEe
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKID-VLQGARPVRGSPYLCQV 139 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~-V~~~g~~I~gSPF~v~V 139 (171)
+.+|++..|.+.+.|.- ..|.+ |+- ..++.-...-.+...++|.++|.|... =.|.|..-..=+|++.|
T Consensus 141 lPv~~~V~f~ltS~DVi-HsF~I----P~l---~~k~d~iPG~~~~~~~~~~~~G~Y~g~Cae~CG~gH~~M~~~v~v 210 (247)
T COG1622 141 LPVGRPVRFKLTSADVI-HSFWI----PQL---GGKIDAIPGMTTELWLTANKPGTYRGICAEYCGPGHSFMRFKVIV 210 (247)
T ss_pred EeCCCeEEEEEEechhc-eeEEe----cCC---CceeeecCCceEEEEEecCCCeEEEEEcHhhcCCCcccceEEEEE
Confidence 78899999999988753 22221 211 112222222467788899999999776 34667666667788888
No 106
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=29.30 E-value=1.6e+02 Score=18.91 Aligned_cols=26 Identities=12% Similarity=0.176 Sum_probs=19.2
Q ss_pred EEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726 100 YQQKFANLLAEFTPTTTGVYKIDVLQGAR 128 (171)
Q Consensus 100 ~~~~dg~y~v~y~P~~~G~~~i~V~~~g~ 128 (171)
.-+.+|.|.+. .| .|.|.|.+.+-|-
T Consensus 30 ~Td~~G~F~i~-~~--~g~~~l~is~~Gy 55 (88)
T PF13715_consen 30 VTDENGRFSIK-LP--EGDYTLKISYIGY 55 (88)
T ss_pred EECCCeEEEEE-Ec--CCCeEEEEEEeCE
Confidence 34457889888 44 6889999888774
No 107
>PF11763 DIPSY: Cell-wall adhesin ligand-binding C-terminal; InterPro: IPR021746 The DIPSY domain is characterised by the distinctive D*I*PSY motif at the very C terminus of yeast cell-wall glycoproteins. It appears not to be conserved in any other species, however. In fungi, cell adhesion is required for flocculation, mating and virulence, and is mediated by covalently bound cell wall proteins termed adhesins. Map4, an adhesin required for mating in Schizosaccharomyces pombe, is N-glycosylated and O-glycosylated, and is an endogenous substrate for the mannosyl transferase Oma4p. Map4 has a modular structure with an N-terminal signal peptide, a serine and threonine (S/T)-rich domain that includes nine repeats of 36 amino acids (rich in serine and threonine residues, but lacking glutamines), and a C-terminal DIPSY domain with no glycosyl-phosphatidyl inositol (GPI)-anchor signal. The N-terminal S/T-rich regions, are required for cell wall attachment, but the C-terminal DIPSY domain is required for agglutination and mating in liquid and solid media [].
Probab=29.25 E-value=2.2e+02 Score=20.56 Aligned_cols=44 Identities=11% Similarity=0.189 Sum_probs=30.6
Q ss_pred eEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeE-EEEEEECCE
Q psy15726 83 FDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVY-KIDVLQGAR 128 (171)
Q Consensus 83 l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~-~i~V~~~g~ 128 (171)
-++...+|.+. .-.|.++.||+|+.+|+-...+++ .++++-+|+
T Consensus 50 Y~v~~~D~~~~--sf~I~kn~dG~~~Ft~~e~~~~ep~~l~~l~dgr 94 (123)
T PF11763_consen 50 YRVYHDDPNKS--SFNIIKNNDGTYQFTFVESSFSEPLDLHTLSDGR 94 (123)
T ss_pred EEEeecCCCcc--eEEEEecCCCcEEEEEcccCCCCcEEEEEecCCc
Confidence 34555666543 456888999999999988777763 566666653
No 108
>KOG1691|consensus
Probab=29.18 E-value=2.3e+02 Score=22.58 Aligned_cols=42 Identities=24% Similarity=0.282 Sum_probs=26.3
Q ss_pred CceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEE
Q psy15726 81 SEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQ 125 (171)
Q Consensus 81 ~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~ 125 (171)
..+.+.|++|.|+... +..+...| .+.|+..+.|.|.+...-
T Consensus 60 ~~~~~~Vts~~G~~~~-~~env~~g--qFaFta~e~~~y~~Cf~~ 101 (210)
T KOG1691|consen 60 HKLSVKVTSPYGNNLH-SKENVTKG--QFAFTAEESGMYEACFTA 101 (210)
T ss_pred ceEEEEEEcCCCceee-hhhccccc--eEEEEeccCCcEEEEEec
Confidence 4799999999996432 11122222 456777777777666554
No 109
>PF04465 DUF499: Protein of unknown function (DUF499); InterPro: IPR007555 This is a family of uncharacterised hypothetical prokaryotic proteins.
Probab=28.96 E-value=2e+02 Score=28.40 Aligned_cols=65 Identities=12% Similarity=0.201 Sum_probs=47.2
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE---cCCCeeEEEEEEECCE
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT---PTTTGVYKIDVLQGAR 128 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~---P~~~G~~~i~V~~~g~ 128 (171)
+..|+..+|.|...-.|...+.|.+....|....-++.- +.+.+.++|+ |.+.+.|.|.++.+|.
T Consensus 836 ~~pge~~~~~v~~~~~g~~~~~v~l~~~~g~l~~~~~~~-~~~~~~~~w~~~~~~~~~~~~i~~~~~~~ 903 (1035)
T PF04465_consen 836 GKPGEVVEVKVVIKPLGNYRFEVELEVSYGELDSDEGIL-EGEPFKTTWRITIPKEKRIYRIEARSDGK 903 (1035)
T ss_pred CCCCCceEEEEEEEEcCCCceeEEEEeccceeeecceee-cCCCceEEEEEeccCCCceEEEEEeccCc
Confidence 457888889998888887888888887766432222322 3456788884 7888999999999773
No 110
>PF00254 FKBP_C: FKBP-type peptidyl-prolyl cis-trans isomerase; InterPro: IPR001179 Synonym(s): Peptidylprolyl cis-trans isomerase FKBP-type peptidylprolyl isomerases (5.2.1.8 from EC) in vertebrates, are receptors for the two immunosuppressants, FK506 and rapamycin. The drugs inhibit T cell proliferation by arresting two distinct cytoplasmic signal transmission pathways. Peptidylprolyl isomerases accelerate protein folding by catalysing the cis-trans isomerisation of proline imidic peptide bonds in oligopeptides. These proteins are found in a variety of organisms.; GO: 0006457 protein folding; PDB: 1IX5_A 3JXV_A 3JYM_A 1T11_A 1PBK_A 1FD9_A 2VCD_A 3B7X_A 1Q6H_B 1Q6I_B ....
Probab=28.33 E-value=12 Score=24.78 Aligned_cols=39 Identities=13% Similarity=0.184 Sum_probs=24.1
Q ss_pred CCCCeEEEecCCCceEEecCCcccEEcCCCEEEEEEccC
Q psy15726 131 RGSPYLCQVYDASKVKIEHKGLSNIVVNDPISFKCKSTE 169 (171)
Q Consensus 131 ~gSPF~v~V~d~skv~~~G~gl~~~~~g~~~~F~Vd~~~ 169 (171)
.+.||.+.+.+...+...-.+|..-.+|+...|.|....
T Consensus 31 ~~~~~~~~~g~~~~i~g~e~al~~m~~Ge~~~~~vp~~~ 69 (94)
T PF00254_consen 31 EGEPFEFRLGSGQVIPGLEEALIGMKVGEKREFYVPPEL 69 (94)
T ss_dssp TTSEEEEETTSSSSSHHHHHHHTTSBTTEEEEEEEEGGG
T ss_pred cCcceeeeeccCccccchhhhcccccCCCEeeeEeCChh
Confidence 445898888655544444444555667777777776544
No 111
>MTH00047 COX2 cytochrome c oxidase subunit II; Provisional
Probab=26.73 E-value=3.1e+02 Score=21.35 Aligned_cols=71 Identities=17% Similarity=0.154 Sum_probs=44.5
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCC-EEEEEEEcCCCeeEEEE-EEECCEEcCCCCeEEEec
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFA-NLLAEFTPTTTGVYKID-VLQGARPVRGSPYLCQVY 140 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~-V~~~g~~I~gSPF~v~V~ 140 (171)
.++|++..|.+.+.|.= ..+.+ |+ ...++ |.-.| .-.+.++|.++|.|... -.+-|.--..=||.+.|.
T Consensus 120 lp~g~~v~~~ltS~DVi-Hsf~v----p~---l~~k~-d~~PG~~~~~~~~~~~~G~y~g~C~e~CG~~H~~M~~~v~v~ 190 (194)
T MTH00047 120 LVYGVPYHLLVTSSDVI-HSFSV----PD---LNLKM-DAIPGRINHLFFCPDRHGVFVGYCSELCGVGHSYMPIVIEVV 190 (194)
T ss_pred EeCCCEEEeeeecCccc-cceec----cc---cCcee-ecCCCceEEEEEEcCCCEEEEEEeehhhCcCcccCcEEEEEE
Confidence 67899999999988862 22222 11 01111 11223 34678889999998765 456677666677888775
Q ss_pred CC
Q psy15726 141 DA 142 (171)
Q Consensus 141 d~ 142 (171)
++
T Consensus 191 ~~ 192 (194)
T MTH00047 191 DV 192 (194)
T ss_pred cC
Confidence 43
No 112
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=25.55 E-value=2e+02 Score=22.64 Aligned_cols=105 Identities=10% Similarity=0.029 Sum_probs=54.9
Q ss_pred EcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCc-cceEEEEc-CCCEEEEEEEcCCCeeEEEEEEECCEEcCCC
Q psy15726 56 SGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAA-VPVRCYQQ-KFANLLAEFTPTTTGVYKIDVLQGARPVRGS 133 (171)
Q Consensus 56 ~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~-~~~~v~~~-~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gS 133 (171)
.|--......+-..+|.+...=-..+.|.+.+.+.++.. ++....|. +=-.|.+.|.|.. +.-...++|..|. +
T Consensus 44 ~G~~n~~yyAnG~~r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~~~di~gyH~Y~i~~~p~~---~tASfy~DG~lI~-t 119 (198)
T PF09264_consen 44 SGGCNTNYYANGSKRYLPILSVNESGSLVAELEGQSSNTLLATTGADIHGYHKYEIVFSPLT---NTASFYFDGTLIA-T 119 (198)
T ss_dssp EES-EEEEEEESSEEEEEEEEE-TTS-EEEEETTS-S-EEEE-CHHHHCSEEEEEEEEETTT---TEEEEEETTEEEE-E
T ss_pred cCcceeEEEcCCceEEEEEEEEcCCCCEEEEEecCCCcEEEecccccccceeEEEEEecCCC---CceEEEECCEEEe-e
Confidence 343333333333444444443223468888888877642 11111111 1146899999953 3788899999988 3
Q ss_pred CeEEEecCCCceEEecCCccc---EEcCCCEEEEEE
Q psy15726 134 PYLCQVYDASKVKIEHKGLSN---IVVNDPISFKCK 166 (171)
Q Consensus 134 PF~v~V~d~skv~~~G~gl~~---~~~g~~~~F~Vd 166 (171)
+. --.-.+|..++|.|-+. ..+=+.+.|+|+
T Consensus 120 -w~-G~~ssqn~I~wGNg~SstdG~~~yr~I~f~i~ 153 (198)
T PF09264_consen 120 -WS-GEPSSQNMIVWGNGSSSTDGRAAYRDINFEIQ 153 (198)
T ss_dssp -E---EEECEEEEEEEE--SSS-EEEEEEEEEEEET
T ss_pred -cc-CccccCceEEEcCCccccccceeeeeEEEEEc
Confidence 21 11445678889988543 333466677663
No 113
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=25.31 E-value=96 Score=21.29 Aligned_cols=23 Identities=22% Similarity=0.404 Sum_probs=20.5
Q ss_pred eeEEEEEEECCEEcCCCCeEEEe
Q psy15726 117 GVYKIDVLQGARPVRGSPYLCQV 139 (171)
Q Consensus 117 G~~~i~V~~~g~~I~gSPF~v~V 139 (171)
..|.+++.+..+|-.+|.|+++.
T Consensus 64 ~~Y~~d~F~~ERht~~S~~~i~T 86 (90)
T TIGR02148 64 KTYPFDIFYCERHTTGSNFRIRT 86 (90)
T ss_pred cEeeEEEEEEeecCCCCCEEEEE
Confidence 45999999999999999999864
No 114
>PLN00115 pollen allergen group 3; Provisional
Probab=25.29 E-value=2.6e+02 Score=20.06 Aligned_cols=64 Identities=11% Similarity=0.295 Sum_probs=37.2
Q ss_pred EeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEEEEE-cCC--Cee-EEEEEEECCEEcCCCCeEEEe
Q psy15726 64 RSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFT-PTT--TGV-YKIDVLQGARPVRGSPYLCQV 139 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~-P~~--~G~-~~i~V~~~g~~I~gSPF~v~V 139 (171)
..|....|+|+... ....|.+.+ ++....++|.+.+.. .|. |.. -|. +.++ .+.+.+| ||.+++
T Consensus 21 ~~g~~v~F~V~~gS-np~yL~ll~---~~dI~~V~Ik~~g~~----~W~~~M~rswGavW~~~---s~~pl~G-PlS~R~ 88 (118)
T PLN00115 21 SCATEVTFKVGKGS-SSTSLELVT---NVAISEVEIKEKGAK----DWVDDLKESSTNTWTLK---SKAPLKG-PFSVRF 88 (118)
T ss_pred hcCCceEEEECCCC-CcceEEEEE---eCCEEEEEEeecCCC----cccCccccCccceeEec---CCCCCCC-ceEEEE
Confidence 46788899998532 334454333 555667777775432 355 443 243 2221 2567788 999998
No 115
>cd07696 IgC_CH3 CH3 domain (third constant Ig domain of the heavy chain) in immunoglobulin. IgC_CH3: The third immunoglobulin constant domain (IgC) of immunoglobulin (Ig) heavy chains. This domain is found on the Fc fragment. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. There are two types of light chains: kappa and lambda; each composed of a constant domain and a variable domain. There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are modular proteins, in which the variable and constant domains have clear, conserved sequence patterns.
Probab=24.82 E-value=2.2e+02 Score=19.18 Aligned_cols=72 Identities=13% Similarity=0.107 Sum_probs=37.2
Q ss_pred CCcceEeCCeEEEEEEeCCC-cCCceEEEEECCCCCccceEEEEcCCCEEEEEEEcCCCeeEEEEEEEC---CEEcCCCC
Q psy15726 59 GLYQARSGIVNSFTLETCGV-ASSEFDVIVTSPQGAAVPVRCYQQKFANLLAEFTPTTTGVYKIDVLQG---ARPVRGSP 134 (171)
Q Consensus 59 Gl~~~~vg~~~~F~V~~~~a-g~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v~y~P~~~G~~~i~V~~~---g~~I~gSP 134 (171)
-.+....++.+.++-.+.+- -..++.|+....++....... -...|...|.|.+..... +.--.|.-
T Consensus 7 s~~~l~~~~~~tLtC~v~gf~yP~~i~v~W~r~~~~~~~~~~---------~~~~~~~dgty~~~s~l~v~~~~w~~g~~ 77 (96)
T cd07696 7 SPKQLLLTKSVKLTCLVTNLTSPEGVNVTWTRGSGEPVKASI---------QPPESHNNATLSVTSTLNVSTDDWISGDT 77 (96)
T ss_pred CHHHhccCCCeEEEEEEeCCCCCCCcEEEEEECCeEeeeEEe---------cccccCCCCCEEEEEEEEeCHHHccCCCc
Confidence 33445567777777777776 566777776654432111110 011344445555544432 11235667
Q ss_pred eEEEe
Q psy15726 135 YLCQV 139 (171)
Q Consensus 135 F~v~V 139 (171)
|.|.|
T Consensus 78 ysC~V 82 (96)
T cd07696 78 FQCKV 82 (96)
T ss_pred EEEEE
Confidence 77777
No 116
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=24.47 E-value=1.5e+02 Score=20.78 Aligned_cols=21 Identities=24% Similarity=0.469 Sum_probs=12.4
Q ss_pred EEEEEEcCCCeeEEEEEEECC
Q psy15726 107 LLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 107 y~v~y~P~~~G~~~i~V~~~g 127 (171)
++-.++|.+.|.|++.+.-+|
T Consensus 50 ~~G~~~~~~~G~y~f~~~~~d 70 (145)
T PF07691_consen 50 WTGYFKPPETGTYTFSLTSDD 70 (145)
T ss_dssp EEEEEEESSSEEEEEEEEESS
T ss_pred EEEEEecccCceEEEEEEecc
Confidence 344556666677666665444
No 117
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=24.40 E-value=1.7e+02 Score=19.31 Aligned_cols=17 Identities=12% Similarity=0.339 Sum_probs=9.9
Q ss_pred EEEEEEEcCC--CeeEEEE
Q psy15726 106 NLLAEFTPTT--TGVYKID 122 (171)
Q Consensus 106 ~y~v~y~P~~--~G~~~i~ 122 (171)
+|..++-... +|.|++.
T Consensus 63 ~~~~~~~~~~~~~G~Y~~~ 81 (82)
T PF12690_consen 63 TYEETWDLKDLSPGEYTLE 81 (82)
T ss_dssp EEEEEESS----SEEEEEE
T ss_pred EEEEEECCCCCCCceEEEe
Confidence 4566665554 7888775
No 118
>PF04225 OapA: Opacity-associated protein A LysM-like domain; InterPro: IPR007340 This entry includes the Haemophilus influenzae opacity-associated protein. This protein is required for efficient nasopharyngeal mucosal colonization, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [].; PDB: 2GU1_A.
Probab=24.12 E-value=1.8e+02 Score=19.35 Aligned_cols=46 Identities=13% Similarity=0.088 Sum_probs=20.8
Q ss_pred CCCcceEeCCeEEEEEEeCCCcCCceEEEEECCCCCccceEEEEcCCCEEEE
Q psy15726 58 LGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQGAAVPVRCYQQKFANLLA 109 (171)
Q Consensus 58 ~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~~~v~~~~dg~y~v 109 (171)
.-|.+..+|+..+|.++..+. -..|. +.... ..+......||.|.-
T Consensus 38 k~L~~L~pGq~l~f~~d~~g~-L~~L~--~~~~~---~~~~~~R~~DG~f~~ 83 (85)
T PF04225_consen 38 KPLTRLKPGQTLEFQLDEDGQ-LTALR--YERSP---KTTLYTRQSDGSFSY 83 (85)
T ss_dssp --GGG--TT-EEEEEE-TTS--EEEEE--EEEET---TEEEEEE-TTS-EEE
T ss_pred chHhhCCCCCEEEEEECCCCC-EEEEE--EEcCC---cEEEEEEeCCCCEEe
Confidence 346678999999999985421 12232 32211 144555667788763
No 119
>PF14347 DUF4399: Domain of unknown function (DUF4399)
Probab=24.06 E-value=1.4e+02 Score=20.18 Aligned_cols=22 Identities=9% Similarity=-0.016 Sum_probs=17.2
Q ss_pred CCceeEEEEEEECC-EEcCCCCe
Q psy15726 19 ISYGLVQARTLREE-TSFRSCPM 40 (171)
Q Consensus 19 ~~~G~~~i~V~~~g-~~I~gSPf 40 (171)
-+||.|.+.+.+++ .|++-+|-
T Consensus 59 L~PG~htLtl~~~d~~h~~~~~~ 81 (87)
T PF14347_consen 59 LPPGKHTLTLQLGDGDHVPHDPP 81 (87)
T ss_pred eCCCCEEEEEEeCCCCcccCCCc
Confidence 48999999988765 68777664
No 120
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=23.77 E-value=3.1e+02 Score=23.27 Aligned_cols=43 Identities=9% Similarity=0.147 Sum_probs=27.2
Q ss_pred ceEEEEECCCCCccc-eE---------EEEcCCCEEEEEEEcCCCeeEEEEEEECC
Q psy15726 82 EFDVIVTSPQGAAVP-VR---------CYQQKFANLLAEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 82 ~l~v~i~~p~g~~~~-~~---------v~~~~dg~y~v~y~P~~~G~~~i~V~~~g 127 (171)
.+.-.|.+.+|..++ +. ..-+.+|.|... ..+|.|+|.|.+.|
T Consensus 299 ~i~G~V~d~~g~pl~~A~V~i~~~~~~~~Td~~G~f~~~---l~~G~y~l~vs~~G 351 (374)
T cd03858 299 GIKGFVRDANGNPIANATISVEGINHDVTTAEDGDYWRL---LLPGTYNVTASAPG 351 (374)
T ss_pred ceEEEEECCCCCccCCeEEEEecceeeeEECCCceEEEe---cCCEeEEEEEEEcC
Confidence 455556666654432 11 223456888754 34999999999988
No 121
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=23.16 E-value=1.7e+02 Score=21.63 Aligned_cols=25 Identities=20% Similarity=0.421 Sum_probs=19.2
Q ss_pred cCCCEEEEEEEcCCCeeEEEEEEECCEE
Q psy15726 102 QKFANLLAEFTPTTTGVYKIDVLQGARP 129 (171)
Q Consensus 102 ~~dg~y~v~y~P~~~G~~~i~V~~~g~~ 129 (171)
..+|.|...- ++|.|.|.++.+|++
T Consensus 44 ~~~G~Ys~~~---epG~Y~V~l~~~g~~ 68 (134)
T PF08400_consen 44 GEAGEYSFDV---EPGVYRVTLKVEGRP 68 (134)
T ss_pred CCCceEEEEe---cCCeEEEEEEECCCC
Confidence 3568887764 569999999998863
No 122
>PF13895 Ig_2: Immunoglobulin domain; PDB: 2V5R_B 2V5M_A 2V5S_B 2GI7_A 3LAF_A 4DEP_C 3O4O_B 2EC8_A 2E9W_A 1J87_A ....
Probab=22.82 E-value=81 Score=19.19 Aligned_cols=14 Identities=7% Similarity=0.150 Sum_probs=7.3
Q ss_pred EEEEECCEEcCCCC
Q psy15726 26 ARTLREETSFRSCP 39 (171)
Q Consensus 26 i~V~~~g~~I~gSP 39 (171)
+.-..++..+..++
T Consensus 31 ~~w~~~~~~~~~~~ 44 (80)
T PF13895_consen 31 VQWYKNGSPINSSQ 44 (80)
T ss_dssp EEEEETTEEEEEES
T ss_pred eeeeeeeeeeeeee
Confidence 44445666665443
No 123
>PF10980 DUF2787: Protein of unknown function (DUF2787); InterPro: IPR021248 This bacterial family of proteins has no known function. ; PDB: 2W56_A 2V1L_A.
Probab=22.56 E-value=1.5e+02 Score=21.68 Aligned_cols=37 Identities=11% Similarity=-0.018 Sum_probs=24.4
Q ss_pred eeEEEEeeccccCCCCceeEEEEEEECCEEcCCCCeEEE
Q psy15726 5 TKSCILLSEKAPSWISYGLVQARTLREETSFRSCPMEVP 43 (171)
Q Consensus 5 ~~~~~~~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~ 43 (171)
.+..|+.-+.+|++++-|-|-|+|++.- -.++++.+.
T Consensus 22 ~~ltlNFRD~~Ysae~GGfHPVEIrl~r--~~~~~W~i~ 58 (128)
T PF10980_consen 22 NRLTLNFRDPSYSAERGGFHPVEIRLER--SSSDQWQIV 58 (128)
T ss_dssp -EEEEEEE-TT-BTTTB-S--EEEEEEE---TTS-EEEE
T ss_pred ceEEEEecCCCcccccCCcCCEEEEEEE--cCCCCEEEE
Confidence 3566888889999999999999999887 567888766
No 124
>PF08770 SoxZ: Sulphur oxidation protein SoxZ; InterPro: IPR014880 SoxZ forms an anti parallel beta structure and forms a complex with SoxY. Sulphur oxidation occurs at the thiol of a conserved cysteine residue of the SoxY subunit []. ; PDB: 1V8H_B 2OX5_E 2OXG_E 2OXH_C.
Probab=22.44 E-value=2.7e+02 Score=19.18 Aligned_cols=25 Identities=16% Similarity=0.202 Sum_probs=14.7
Q ss_pred eEEEEEEeCCCcCCceEEEEECCCCCc
Q psy15726 68 VNSFTLETCGVASSEFDVIVTSPQGAA 94 (171)
Q Consensus 68 ~~~F~V~~~~ag~~~l~v~i~~p~g~~ 94 (171)
-..|.+...++| .|++.++|-+|+.
T Consensus 69 ~l~F~~~~~~~g--~l~v~~~Dn~G~~ 93 (100)
T PF08770_consen 69 YLRFSFKGKKSG--TLTVTWTDNKGNS 93 (100)
T ss_dssp EEEEEEEESSSE--EEEEEEEETTS-E
T ss_pred cEEEEEecCCCc--EEEEEEEECCCCE
Confidence 345666655443 7777777766653
No 125
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=22.21 E-value=3.3e+02 Score=20.47 Aligned_cols=44 Identities=18% Similarity=0.107 Sum_probs=32.0
Q ss_pred CCCEEEEEEEcCCCee-------EEEEEEECCEEcCCCCeEEEecCCCceE
Q psy15726 103 KFANLLAEFTPTTTGV-------YKIDVLQGARPVRGSPYLCQVYDASKVK 146 (171)
Q Consensus 103 ~dg~y~v~y~P~~~G~-------~~i~V~~~g~~I~gSPF~v~V~d~skv~ 146 (171)
.+|.|+.+|--+-.|. -.|.|..+|.-+.+.=-.+...+.+.++
T Consensus 66 ~~g~~t~t~yiKNtG~~~~~fd~~sitVliDG~iv~~a~~~~~~~~gs~i~ 116 (154)
T COG3354 66 TDGPYTYTFYIKNTGSDSIAFDNTSITVLIDGNIVTPAYVTFTSVNGSSIR 116 (154)
T ss_pred CCCceEEEEEEecCCCcccccCCCeEEEEEcCcEeccceEEEEecCCCeeE
Confidence 3477777776666554 4788999998888877777667777666
No 126
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=22.10 E-value=3.2e+02 Score=23.31 Aligned_cols=61 Identities=16% Similarity=0.184 Sum_probs=40.4
Q ss_pred EeCCeEEEEEEeCCCc-------------------------CCceEEEEECCCCCccc-eE--E-------EEcCCCEEE
Q psy15726 64 RSGIVNSFTLETCGVA-------------------------SSEFDVIVTSPQGAAVP-VR--C-------YQQKFANLL 108 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag-------------------------~~~l~v~i~~p~g~~~~-~~--v-------~~~~dg~y~ 108 (171)
..+.+.+|+++.++.- ...+.-.|.+++|..++ +. + ..+.+|.|.
T Consensus 254 ~~~~~~~~T~El~~~~~p~~~~l~~~w~~n~~al~~~~~~~~~~i~G~V~d~~g~pv~~A~V~v~~~~~~~~td~~G~y~ 333 (372)
T cd03868 254 LHSNCFEITLELSCCKYPPASELPEEWNNNRESLLAYLEQVHIGVKGFVRDASGNPIEDATIMVAGIDHNVTTAKFGDYW 333 (372)
T ss_pred hccCeeEEEEEecCCCCCCHHHHHHHHHHhHHHHHHHHHHhCCceEEEEEcCCCCcCCCcEEEEEecccceEeCCCceEE
Confidence 3567788888886520 13467778888887655 22 2 233557785
Q ss_pred EEEEcCCCeeEEEEEEECC
Q psy15726 109 AEFTPTTTGVYKIDVLQGA 127 (171)
Q Consensus 109 v~y~P~~~G~~~i~V~~~g 127 (171)
. ...+|.|+|.|...|
T Consensus 334 -~--~l~~G~Y~l~vs~~G 349 (372)
T cd03868 334 -R--LLLPGTYTITAVAPG 349 (372)
T ss_pred -e--cCCCEEEEEEEEecC
Confidence 2 356899999999877
No 127
>PF05949 DUF881: Bacterial protein of unknown function (DUF881); InterPro: IPR010273 This family consists of a series of hypothetical bacterial proteins. One of the family members Q45543 from SWISSPROT from Bacillus subtilis is thought to be involved in cell division and sporulation [].; PDB: 3GMG_B.
Probab=22.04 E-value=58 Score=24.20 Aligned_cols=21 Identities=29% Similarity=0.653 Sum_probs=15.8
Q ss_pred EEECCEEcCCCCeEEEe-cCCCc
Q psy15726 123 VLQGARPVRGSPYLCQV-YDASK 144 (171)
Q Consensus 123 V~~~g~~I~gSPF~v~V-~d~sk 144 (171)
++.||+++. +||.+++ .||.+
T Consensus 81 i~Vng~~i~-~Py~I~AIGdp~~ 102 (149)
T PF05949_consen 81 ILVNGRPIS-PPYVIKAIGDPET 102 (149)
T ss_dssp EEETTEEE--SSEEEEEES-HHH
T ss_pred EEECCEEcc-CCEEEEEEeCHHH
Confidence 889999997 8999998 45443
No 128
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=21.74 E-value=4.2e+02 Score=21.11 Aligned_cols=74 Identities=14% Similarity=0.038 Sum_probs=42.1
Q ss_pred EeCCeEEEEEEeCCCc-CCceEEEEECCCC-CccceEEEEcCCC-EEEEEEEcCCCeeEEEEEEECCE-----EcCCCCe
Q psy15726 64 RSGIVNSFTLETCGVA-SSEFDVIVTSPQG-AAVPVRCYQQKFA-NLLAEFTPTTTGVYKIDVLQGAR-----PVRGSPY 135 (171)
Q Consensus 64 ~vg~~~~F~V~~~~ag-~~~l~v~i~~p~g-~~~~~~v~~~~dg-~y~v~y~P~~~G~~~i~V~~~g~-----~I~gSPF 135 (171)
-+|+.+.+.=+.-... ++.....+.=++. ..+.++|.|.... .+++..-+..+|.|. +.|+|. +++..-|
T Consensus 95 lIGk~V~~~~~~~~~~~~~~~~~~~~l~~~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~--f~WDG~d~~G~~lp~G~Y 172 (225)
T PRK06655 95 LVGRGVLVPGDTVLVGTGGTTPFGVELPSAADNVTVTITDSAGQVVRTIDLGAQSAGVVS--FTWDGTDTDGNALPDGNY 172 (225)
T ss_pred hcCCeEEEecceEEecCCCceEEEEEcCCCCcEEEEEEEcCCCCEEEEEecCCcCCCcee--EEECCCCCCCCcCCCeeE
Confidence 3666665544332111 2344555553443 3567788876433 345666678899875 557764 5555558
Q ss_pred EEEe
Q psy15726 136 LCQV 139 (171)
Q Consensus 136 ~v~V 139 (171)
.++|
T Consensus 173 t~~V 176 (225)
T PRK06655 173 TIKA 176 (225)
T ss_pred EEEE
Confidence 8887
No 129
>PF10651 DUF2479: Domain of unknown function (DUF2479); InterPro: IPR018913 This domain is found in phage from a number of different bacteria including (Listeria phage A118 (Bacteriophage A118)). It is thought to be a putative long tail fibre protein.; PDB: 4DIV_L 3UH8_A.
Probab=21.70 E-value=3.5e+02 Score=20.19 Aligned_cols=57 Identities=7% Similarity=0.125 Sum_probs=36.5
Q ss_pred EEEEECCCC---CccceEEEEcCCCEEEEEEEcC---CCeeE--EEEEEE--CCEEcCCCCeEEEec
Q psy15726 84 DVIVTSPQG---AAVPVRCYQQKFANLLAEFTPT---TTGVY--KIDVLQ--GARPVRGSPYLCQVY 140 (171)
Q Consensus 84 ~v~i~~p~g---~~~~~~v~~~~dg~y~v~y~P~---~~G~~--~i~V~~--~g~~I~gSPF~v~V~ 140 (171)
.+.+..++| ...++++.|..+|.++..+.-. ..|.+ .+.+.. +++.+.-.-|.+.|.
T Consensus 58 ~l~l~~~dg~~~~~~~~~i~d~~~G~i~y~lp~~~l~~~G~v~a~~~i~~~~~~~~~~t~~F~F~V~ 124 (170)
T PF10651_consen 58 KLVLKMSDGSIFIQDDVEIVDPTNGIIQYTLPDEFLKHVGKVKAQLFIYYNGDGQSISTANFTFEVE 124 (170)
T ss_dssp EEE-TTTTSSSSEEEE-SEEETTTTEEEEE--TTTTHSSEEEEEEEEEEEESSSEEEE---EEEEEE
T ss_pred EEEEEcCCCcEEEEecceEEcCcCcEEEEEECHHHhccCcEEEEEEEEEECCCCCEEEEeeEEEEEe
Confidence 334446666 2346778898899998888654 47887 567778 788888778999984
No 130
>cd05767 IgC_MHC_II_alpha Class II major histocompatibility complex (MHC) alpha chain immunoglobulin domain. IgC_MHC_II_alpha: Immunoglobulin (Ig) domain of major histocompatibility complex (MHC) class II alpha chain. MHC class II molecules play a key role in the initiation of the antigen-specific immune reponse. In both humans and in mice these molecules have been shown to be expressed constitutively on the cell surface of professional antigen-presenting cells (APCs), for example on B-lymphocytes, monocytes, and macrophages. The expression of these molecules has been shown to be induced in nonprofessional APCs such as keratinocyctes, and they are expressed on the surface of activated human T cells and on T cells from other species. The MHC II molecules present antigenic peptides to CD4(+) T-lymphocytes. These peptides derive mostly from protelytic processing via the endocytic pathway, of antigens internalized by the APC. These peptides bind to the MHC class II molecules in the endosom
Probab=21.59 E-value=2.6e+02 Score=18.76 Aligned_cols=43 Identities=16% Similarity=0.170 Sum_probs=20.2
Q ss_pred CCeEEEEEEeCCCcCCceEEEEECCCCCccc-----eEEEEcCCCEEEE
Q psy15726 66 GIVNSFTLETCGVASSEFDVIVTSPQGAAVP-----VRCYQQKFANLLA 109 (171)
Q Consensus 66 g~~~~F~V~~~~ag~~~l~v~i~~p~g~~~~-----~~v~~~~dg~y~v 109 (171)
++...++-.+.+=-..++.++... +|..+. ..+..++||+|..
T Consensus 15 ~~~~tL~C~a~gFyP~~I~v~W~~-~g~~~~~~~~~~~~~~~~Dgty~~ 62 (94)
T cd05767 15 GEPNTLICFVDNFFPPVLNVTWLK-NGVPVTDGVSETRYYPRQDLSFQK 62 (94)
T ss_pred CCCeEEEEEEEEEECCcCEEEEEE-CCeEccCceEEeeeEeCCCCcEEE
Confidence 444445554444334567776652 333221 1234455666543
No 131
>PRK09937 stationary phase/starvation inducible regulatory protein CspD; Provisional
Probab=21.48 E-value=2.3e+02 Score=18.37 Aligned_cols=37 Identities=8% Similarity=0.013 Sum_probs=25.6
Q ss_pred EEcCCCcceEeCCeEEEEEEeCCCcCCceEEEEECCC
Q psy15726 55 GSGLGLYQARSGIVNSFTLETCGVASSEFDVIVTSPQ 91 (171)
Q Consensus 55 v~G~Gl~~~~vg~~~~F~V~~~~ag~~~l~v~i~~p~ 91 (171)
+.+.|+.....|+.++|.+.....|.....|...++.
T Consensus 34 i~~~g~~~l~~G~~V~f~~~~~~~G~~A~~V~~~~~~ 70 (74)
T PRK09937 34 IQMDGYRTLKAGQSVQFDVHQGPKGNHASVIVPVEVE 70 (74)
T ss_pred ccccCCCCCCCCCEEEEEEEECCCCceeeEEEECCcc
Confidence 3456667788999999999877666555555555444
No 132
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=21.01 E-value=3.6e+02 Score=23.12 Aligned_cols=43 Identities=16% Similarity=0.240 Sum_probs=25.9
Q ss_pred eEEEEECCCCCccc-eE--------EEEcCCCEEEEEEEcCCCeeEEEEEEECCE
Q psy15726 83 FDVIVTSPQGAAVP-VR--------CYQQKFANLLAEFTPTTTGVYKIDVLQGAR 128 (171)
Q Consensus 83 l~v~i~~p~g~~~~-~~--------v~~~~dg~y~v~y~P~~~G~~~i~V~~~g~ 128 (171)
+.-.|.+.+|..++ +. +.-+.+|.|... ..+|.|+|.|.+.|-
T Consensus 289 I~G~V~d~~g~pi~~A~V~v~g~~~~~T~~~G~y~~~---L~pG~y~v~vs~~Gy 340 (363)
T cd06245 289 VHGVVTDKAGKPISGATIVLNGGHRVYTKEGGYFHVL---LAPGQHNINVIAEGY 340 (363)
T ss_pred EEEEEEcCCCCCccceEEEEeCCCceEeCCCcEEEEe---cCCceEEEEEEEeCc
Confidence 44456666665443 21 223345777764 447889999888773
No 133
>KOG4278|consensus
Probab=20.96 E-value=2.1e+02 Score=27.30 Aligned_cols=42 Identities=17% Similarity=0.247 Sum_probs=32.3
Q ss_pred CCCEEEEEEEcCCCeeEEEEEEECCEEcCCCCeEEEecCCCceEE
Q psy15726 103 KFANLLAEFTPTTTGVYKIDVLQGARPVRGSPYLCQVYDASKVKI 147 (171)
Q Consensus 103 ~dg~y~v~y~P~~~G~~~i~V~~~g~~I~gSPF~v~V~d~skv~~ 147 (171)
-+|.|.|.=.-..+|.|.|.+.|.|+--. |+|++..-.|+.+
T Consensus 172 InGSFLVRESEsSpgQ~sISlRyeGRVyH---YRINt~~dgK~yv 213 (1157)
T KOG4278|consen 172 INGSFLVRESESSPGQYSISLRYEGRVYH---YRINTDNDGKMYV 213 (1157)
T ss_pred cccceEEeeccCCCcceeEEEEecceEEE---EEeeccCCccEEE
Confidence 46889998888899999999999998655 7777644444444
No 134
>KOG3342|consensus
Probab=20.91 E-value=1.2e+02 Score=23.09 Aligned_cols=28 Identities=14% Similarity=0.118 Sum_probs=20.1
Q ss_pred EEEeeccccCCCCceeEEEEEEECCEEcC
Q psy15726 8 CILLSEKAPSWISYGLVQARTLREETSFR 36 (171)
Q Consensus 8 ~~~~~~~~~~P~~~G~~~i~V~~~g~~I~ 36 (171)
.|.|++.+..|.+.|+-.+- ++.|.+||
T Consensus 67 lLfL~N~~~~p~~vGdivVf-~vegR~IP 94 (180)
T KOG3342|consen 67 LLFLTNRNEDPIRVGDIVVF-KVEGREIP 94 (180)
T ss_pred EEEEecCCCCcceeccEEEE-EECCccCc
Confidence 46677777888888876653 67777764
No 135
>PHA03376 BARF1; Provisional
Probab=20.76 E-value=1.7e+02 Score=23.29 Aligned_cols=24 Identities=21% Similarity=0.115 Sum_probs=19.1
Q ss_pred EEEcCCCEEEEEEEcCCCeeEEEE
Q psy15726 99 CYQQKFANLLAEFTPTTTGVYKID 122 (171)
Q Consensus 99 v~~~~dg~y~v~y~P~~~G~~~i~ 122 (171)
++-.+||+|++.|.-.+-|.|+..
T Consensus 94 l~lSDdGtY~C~fQkge~~~~k~e 117 (221)
T PHA03376 94 ANISHDGNYLCRMKLGETEVTKQE 117 (221)
T ss_pred eeecCCceEEEEEEcCCCceeeee
Confidence 445677999999998888888765
No 136
>smart00720 calpain_III calpain_III.
Probab=20.59 E-value=1.6e+02 Score=21.07 Aligned_cols=26 Identities=15% Similarity=0.344 Sum_probs=19.2
Q ss_pred EcCCCEEEE---EEEcCCCeeEEEEEEEC
Q psy15726 101 QQKFANLLA---EFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 101 ~~~dg~y~v---~y~P~~~G~~~i~V~~~ 126 (171)
....|.|.+ +|.|.+.|.|.|.|.-.
T Consensus 109 ~L~~G~Y~iVPsT~~p~~~g~F~LrV~s~ 137 (143)
T smart00720 109 RLPPGEYVIVPSTFEPNQEGDFLLRVFSE 137 (143)
T ss_pred EcCCCCEEEEEeecCCCCccCEEEEEEec
Confidence 345577855 68888889999888654
No 137
>KOG3392|consensus
Probab=20.58 E-value=1.2e+02 Score=21.97 Aligned_cols=21 Identities=10% Similarity=0.048 Sum_probs=19.0
Q ss_pred cCCCCceeEEEEEEECCEEcC
Q psy15726 16 PSWISYGLVQARTLREETSFR 36 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I~ 36 (171)
|-|...|.-++.|.++++||.
T Consensus 79 p~pd~~gkqelei~~~~ehis 99 (147)
T KOG3392|consen 79 PPPDRVGRQELEIVIGDEHIS 99 (147)
T ss_pred cCCCcccceeEEEEECCeEEE
Confidence 478899999999999999993
No 138
>PHA03290 envelope glycoprotein I; Provisional
Probab=20.54 E-value=1.9e+02 Score=24.83 Aligned_cols=35 Identities=9% Similarity=-0.022 Sum_probs=27.5
Q ss_pred EEEeeccccCCCCceeEEEEEEECCEEcCCCCeEEE
Q psy15726 8 CILLSEKAPSWISYGLVQARTLREETSFRSCPMEVP 43 (171)
Q Consensus 8 ~~~~~~~~~~P~~~G~~~i~V~~~g~~I~gSPf~v~ 43 (171)
.++|.=.++.|...|.|.+.|+++|.. .-+=|.+.
T Consensus 128 ~vLL~I~~P~~~DSGiY~LRV~Ldga~-~sDvF~ls 162 (357)
T PHA03290 128 EIIFKINKPGIEDAGIYLLLVQLDHSR-LFDGFFLG 162 (357)
T ss_pred ceEEEeCCCCcccCeeEEEEEEeCCCc-ccceEEEE
Confidence 456777889999999999999999986 34446544
No 139
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=20.48 E-value=2.5e+02 Score=18.07 Aligned_cols=65 Identities=9% Similarity=0.108 Sum_probs=34.3
Q ss_pred eEeCCeEEEEEEeCCCcCCceEEEEECCCCCc--cceEEE---EcCCCEEEEEEEcC--CCeeEEEEEEECC
Q psy15726 63 ARSGIVNSFTLETCGVASSEFDVIVTSPQGAA--VPVRCY---QQKFANLLAEFTPT--TTGVYKIDVLQGA 127 (171)
Q Consensus 63 ~~vg~~~~F~V~~~~ag~~~l~v~i~~p~g~~--~~~~v~---~~~dg~y~v~y~P~--~~G~~~i~V~~~g 127 (171)
..+|++..++.+..+...+.+.+.+...+... ....|. +..+|.|.+.--+. ..+.|.|.+.-.+
T Consensus 9 ~~~g~~~~I~W~~~~~~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~~~ 80 (93)
T PF10342_consen 9 WTAGQPITITWTSDGTDPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVNSS 80 (93)
T ss_pred EECCCcEEEEEeCCCCCCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEECC
Confidence 45677777777776555567777755443311 112232 23335665444221 1356888877443
No 140
>PF08842 Mfa2: Fimbrillin-A associated anchor proteins Mfa1 and Mfa2; InterPro: IPR014941 This family of proteins may be lipoproteins principally from bacilli. They are between 300 and 400 residues. Many Bacteroides-like bacterial species, including Porphyromonas gingivalis, the causal agent of periodontal infection, carry at least two types of fimbriae, namely FimA and Mfa1 fimbriae, following the names of their major subunit proteins []. Normally, FimA fimbriae are long filaments that are easily detached from cells, whereas Mfa1 fimbriae are short filaments that are tightly bound to cells; however, in the absence of Mfa2 protein, the Mfa1 fimbriae are also very long and are not attached. Mfa2 and Mfa1 are associated with each other in whole P. gingivalis cells to the extent that Mfa2 is located on the cell surface and probably associated with Mfa1 fimbriae in such a way that it anchors the Mfa1 fimbriae to the cell surface and regulates Mfa1 filament length [].; PDB: 3PAY_C 3GF8_A.
Probab=20.45 E-value=3.8e+02 Score=20.94 Aligned_cols=65 Identities=14% Similarity=0.143 Sum_probs=32.6
Q ss_pred ceEeCCeEEEEEEeCCCc-------CCceEEEEECCCCCccceEEEEcC---CCEEEEEEEcCCCeeEEEEEEEC
Q psy15726 62 QARSGIVNSFTLETCGVA-------SSEFDVIVTSPQGAAVPVRCYQQK---FANLLAEFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 62 ~~~vg~~~~F~V~~~~ag-------~~~l~v~i~~p~g~~~~~~v~~~~---dg~y~v~y~P~~~G~~~i~V~~~ 126 (171)
.|..+-...|..+..... -..+++-|.+.+|+.+.....+.. .+.|.+......+|.|++-+.-|
T Consensus 3 dC~~~~~l~f~y~~~~~~~~~~~~~v~~v~lyvFd~~g~~v~~~~~~~~~~~~~~y~~~~~~l~~G~Y~~va~~n 77 (283)
T PF08842_consen 3 DCPGGLTLKFSYDYNMGNADAFEDEVKRVDLYVFDEDGKLVKQRTIDSEELEGGGYTMFLLDLPPGTYTFVAWGN 77 (283)
T ss_dssp -----EEEEEE---STT-S--HHHH--EEEEEEE-TTSBEEEEEEEECGGCCTTTEEE-CCT--SEEEEEEEEES
T ss_pred CCCceEEEEEEEeCCccccccccceEeEEEEEEEeCCCeEEEEEEcccccccCCceEEeeccCCCCcEEEEEEEC
Confidence 344455556665554221 157888899988875444433332 24588766677889999877665
No 141
>PF07965 Integrin_B_tail: Integrin beta tail domain; InterPro: IPR012896 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. This entry represents the tail domain of the integrin beta subunit. It forms a four-stranded beta-sheet that contains parallel and anttparallel strands and faces an alpha helix found at the N terminus of this domain []. Interactions between the alpha-helix and the beta-sheet are mostly hydrophobic and involve a disulphide bond. The rear of the beta sheet is covered with a long A-B loop.; GO: 0004872 receptor activity, 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion, 0007229 integrin-mediated signaling pathway, 0008305 integrin complex; PDB: 3K71_H 3K6S_H 3K72_D 3IJE_B 1M1X_B 1U8C_B 2RN0_A 3FCS_B 1JV2_B 1L5G_B ....
Probab=20.44 E-value=2.6e+02 Score=18.61 Aligned_cols=30 Identities=13% Similarity=0.090 Sum_probs=24.2
Q ss_pred cceEEEEcCCCEEEEEEE-cCCCeeEEEEEE
Q psy15726 95 VPVRCYQQKFANLLAEFT-PTTTGVYKIDVL 124 (171)
Q Consensus 95 ~~~~v~~~~dg~y~v~y~-P~~~G~~~i~V~ 124 (171)
+.+...|.+|-.|..+|. -...|.|.|.|.
T Consensus 47 ~~C~~~De~dC~~~F~y~~~~~~~~~~v~V~ 77 (87)
T PF07965_consen 47 VRCSFRDEDDCWFTFTYEQYDGNGKYVVYVQ 77 (87)
T ss_dssp EEEEEEETTSEEEEEEEE-ECTTTEEEEEEE
T ss_pred CccccCCCCCcEEEEEEEEeCCCCeEEEEEc
Confidence 368888888888888888 677888988875
No 142
>PF01067 Calpain_III: Calpain large subunit, domain III; InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=20.30 E-value=1.4e+02 Score=21.29 Aligned_cols=26 Identities=12% Similarity=0.327 Sum_probs=17.3
Q ss_pred EcCCCEEEE---EEEcCCCeeEEEEEEEC
Q psy15726 101 QQKFANLLA---EFTPTTTGVYKIDVLQG 126 (171)
Q Consensus 101 ~~~dg~y~v---~y~P~~~G~~~i~V~~~ 126 (171)
....|.|.+ +|.|...|.|.|.|..+
T Consensus 113 ~L~~G~YvIVPsT~~~~~~g~F~L~v~s~ 141 (147)
T PF01067_consen 113 TLPPGTYVIVPSTYEPGQEGEFTLRVFSD 141 (147)
T ss_dssp EE-SEEEEEEEEESSTT--EEEEEEEEES
T ss_pred EcCCCCEEEEEecCCCCCeeeEEEEEEEC
Confidence 445678876 56688899999998864
No 143
>PF02792 Mago_nashi: Mago nashi protein; InterPro: IPR004023 This family was originally identified in drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene []. The human homologue has been shown to interact with an RNA binding protein, ribonucleoprotein rbm8 (Q9Y5S9 from SWISSPROT) []. An RNAi knockout of the Caenorhabditis elegans homologue causes masculinization of the germ line (Mog phenotype) hermaphrodites, suggesting it is involved in hermaphrodite germ-line sex determination [] but the protein is also found in hermaphrodites and other organisms without a sexual differentiation.; GO: 0005634 nucleus; PDB: 2XB2_Y 2J0S_C 3EX7_A 2J0Q_F 1P27_C 2HYI_A 2X1G_B 1HL6_B 1RK8_B 1OO0_A ....
Probab=20.27 E-value=1.3e+02 Score=22.50 Aligned_cols=20 Identities=10% Similarity=0.024 Sum_probs=17.6
Q ss_pred cCCCCceeEEEEEEECCEEc
Q psy15726 16 PSWISYGLVQARTLREETSF 35 (171)
Q Consensus 16 ~~P~~~G~~~i~V~~~g~~I 35 (171)
|.|...|.-++.|.++++||
T Consensus 75 P~pd~~GrQELEI~lg~~Hi 94 (143)
T PF02792_consen 75 PEPDRVGRQELEIVLGNEHI 94 (143)
T ss_dssp --SBTTBEEEEEEEETTEEE
T ss_pred CCCCCccceeEEEEECCEEE
Confidence 68999999999999999998
No 144
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=20.08 E-value=1.9e+02 Score=19.58 Aligned_cols=20 Identities=25% Similarity=0.438 Sum_probs=17.2
Q ss_pred EEEEEECCEEcCCCCeEEEe
Q psy15726 120 KIDVLQGARPVRGSPYLCQV 139 (171)
Q Consensus 120 ~i~V~~~g~~I~gSPF~v~V 139 (171)
.+.|+..+..+-|++|.+.+
T Consensus 2 ~~~i~~~~~~~vG~d~~v~v 21 (107)
T PF00927_consen 2 EIKIKLPGDPVVGQDFTVSV 21 (107)
T ss_dssp EEEEEEESEEBTTSEEEEEE
T ss_pred eEEEEECCCccCCCCEEEEE
Confidence 36788889999999999988
Done!