Query psy192
Match_columns 898
No_of_seqs 283 out of 847
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 19:39:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy192.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/192hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0518|consensus 100.0 6E-106 1E-110 925.2 77.4 851 2-898 116-1075(1113)
2 KOG0518|consensus 100.0 1.2E-98 3E-103 865.1 73.7 809 3-896 17-939 (1113)
3 smart00557 IG_FLMN Filamin-typ 99.8 1.2E-19 2.7E-24 163.4 12.4 90 804-898 1-90 (93)
4 smart00557 IG_FLMN Filamin-typ 99.8 2.2E-19 4.9E-24 161.7 13.2 90 208-297 1-92 (93)
5 PF00630 Filamin: Filamin/ABP2 99.7 5.5E-17 1.2E-21 148.7 12.7 93 802-895 2-101 (101)
6 PF00630 Filamin: Filamin/ABP2 99.7 3.1E-16 6.7E-21 143.7 10.9 87 206-292 2-101 (101)
7 PRK14081 triple tyrosine motif 99.4 7.8E-07 1.7E-11 104.4 57.1 424 20-472 55-555 (667)
8 PRK14081 triple tyrosine motif 99.3 3.3E-06 7.2E-11 99.2 59.7 374 71-472 55-459 (667)
9 TIGR00864 PCC polycystin catio 95.3 21 0.00045 49.7 32.7 293 70-407 1830-2141(2740)
10 PF13584 BatD: Oxygen toleranc 94.9 3.6 7.8E-05 48.6 22.7 203 69-277 74-354 (484)
11 KOG1948|consensus 94.8 13 0.00029 45.3 37.9 124 158-299 256-385 (1165)
12 TIGR03503 conserved hypothetic 94.4 0.54 1.2E-05 52.6 13.0 118 66-201 169-301 (374)
13 PF13584 BatD: Oxygen toleranc 92.6 8.6 0.00019 45.5 20.3 205 167-373 74-355 (484)
14 TIGR00864 PCC polycystin catio 92.4 65 0.0014 45.2 43.1 263 168-470 1830-2110(2740)
15 TIGR03503 conserved hypothetic 91.6 2.4 5.2E-05 47.6 12.9 116 258-390 168-300 (374)
16 KOG1428|consensus 90.5 1 2.2E-05 56.6 9.1 43 349-392 2099-2141(3738)
17 TIGR00868 hCaCC calcium-activa 90.2 10 0.00022 47.5 17.6 151 222-380 512-684 (863)
18 PF13115 YtkA: YtkA-like 88.2 4.1 8.9E-05 35.7 9.3 71 15-86 3-85 (86)
19 PF09134 Invasin_D3: Invasin, 87.6 5.9 0.00013 35.7 9.7 87 7-103 11-97 (99)
20 PF01835 A2M_N: MG2 domain; I 87.1 2 4.4E-05 38.7 6.8 64 314-377 10-86 (99)
21 PF10670 DUF4198: Domain of un 87.1 35 0.00076 35.1 17.2 60 222-283 148-213 (215)
22 PF01835 A2M_N: MG2 domain; I 86.9 3.4 7.4E-05 37.2 8.3 64 124-187 11-86 (99)
23 COG2373 Large extracellular al 86.6 1.4E+02 0.003 40.6 30.3 85 47-136 430-521 (1621)
24 PF09315 DUF1973: Domain of un 85.8 20 0.00043 36.4 13.8 129 222-354 26-165 (179)
25 COG1470 Predicted membrane pro 83.1 96 0.0021 35.8 37.9 75 263-344 49-123 (513)
26 KOG1428|consensus 82.3 3.5 7.6E-05 52.2 7.5 48 59-107 2097-2144(3738)
27 PF10670 DUF4198: Domain of un 79.5 13 0.00028 38.4 10.2 67 19-90 134-213 (215)
28 PF13753 SWM_repeat: Putative 79.2 1.1E+02 0.0024 33.9 24.9 142 227-376 63-223 (317)
29 PF06312 Neurexophilin: Neurex 73.5 9.7 0.00021 39.7 6.9 46 331-376 68-115 (219)
30 cd02858 Esterase_N_term Estera 73.2 20 0.00043 31.4 8.0 59 226-286 7-65 (85)
31 PF02369 Big_1: Bacterial Ig-l 71.8 21 0.00045 32.4 8.0 62 319-380 23-92 (100)
32 PF06312 Neurexophilin: Neurex 71.3 11 0.00025 39.2 6.8 45 142-186 70-115 (219)
33 smart00634 BID_1 Bacterial Ig- 70.1 22 0.00048 31.6 7.7 52 38-90 30-84 (92)
34 PF02369 Big_1: Bacterial Ig-l 69.3 26 0.00056 31.8 8.0 32 159-190 61-92 (100)
35 TIGR00868 hCaCC calcium-activa 69.1 3.2E+02 0.007 34.7 24.6 47 239-285 627-684 (863)
36 PF13860 FlgD_ig: FlgD Ig-like 68.2 33 0.00072 29.7 8.2 56 320-375 13-76 (81)
37 PF07495 Y_Y_Y: Y_Y_Y domain; 64.6 13 0.00028 30.5 4.7 37 614-653 13-49 (66)
38 PF11896 DUF3416: Domain of un 64.1 27 0.00058 35.7 7.8 62 124-185 23-87 (187)
39 PF13473 Cupredoxin_1: Cupredo 63.4 13 0.00029 33.8 5.0 57 222-283 40-96 (104)
40 PF13115 YtkA: YtkA-like 63.0 71 0.0015 27.7 9.4 27 348-374 59-85 (86)
41 COG1470 Predicted membrane pro 62.7 3E+02 0.0065 32.0 36.2 73 13-104 7-79 (513)
42 PF13473 Cupredoxin_1: Cupredo 60.7 25 0.00054 31.9 6.3 62 817-891 39-100 (104)
43 PF13860 FlgD_ig: FlgD Ig-like 60.3 62 0.0013 28.0 8.4 44 129-173 13-57 (81)
44 cd02858 Esterase_N_term Estera 57.2 93 0.002 27.2 9.1 60 414-475 6-66 (85)
45 KOG2996|consensus 55.7 7.2 0.00016 45.3 2.1 35 620-654 696-730 (865)
46 PF13753 SWM_repeat: Putative 55.3 1.3E+02 0.0028 33.4 12.1 142 131-281 64-223 (317)
47 TIGR03096 nitroso_cyanin nitro 53.7 97 0.0021 29.9 9.0 32 258-289 97-130 (135)
48 PF13620 CarboxypepD_reg: Carb 51.9 64 0.0014 27.4 7.2 45 143-190 16-60 (82)
49 smart00634 BID_1 Bacterial Ig- 51.9 67 0.0014 28.5 7.4 30 159-188 55-84 (92)
50 PF07705 CARDB: CARDB; InterP 51.0 69 0.0015 28.2 7.5 65 124-188 15-85 (101)
51 PRK13211 N-acetylglucosamine-b 49.5 2.4E+02 0.0052 33.2 13.3 114 73-201 287-405 (478)
52 PF09315 DUF1973: Domain of un 47.9 3.1E+02 0.0068 27.8 14.7 72 125-201 26-99 (179)
53 KOG2996|consensus 46.1 15 0.00032 42.9 2.6 32 858-889 701-734 (865)
54 PF04151 PPC: Bacterial pre-pe 44.9 78 0.0017 26.3 6.4 40 429-468 25-69 (70)
55 cd08548 Type_I_cohesin_like Ty 44.7 2.9E+02 0.0063 26.5 11.1 91 370-469 16-114 (135)
56 PF09134 Invasin_D3: Invasin, 43.5 72 0.0016 28.9 6.1 69 320-390 22-96 (99)
57 PF00017 SH2: SH2 domain; Int 43.0 39 0.00085 28.6 4.3 26 628-653 19-44 (77)
58 PF13620 CarboxypepD_reg: Carb 42.2 64 0.0014 27.4 5.7 50 38-92 10-60 (82)
59 COG2372 CopC Uncharacterized p 41.4 72 0.0016 30.3 6.0 70 319-388 45-124 (127)
60 PF04151 PPC: Bacterial pre-pe 40.8 1.6E+02 0.0034 24.4 7.6 53 323-375 15-69 (70)
61 PF07495 Y_Y_Y: Y_Y_Y domain; 40.4 32 0.00069 28.1 3.2 19 452-470 30-48 (66)
62 KOG1693|consensus 40.2 1.3E+02 0.0028 30.8 7.9 78 95-184 21-98 (209)
63 PF00017 SH2: SH2 domain; Int 40.1 39 0.00084 28.6 3.8 27 861-887 19-45 (77)
64 PF07705 CARDB: CARDB; InterP 38.6 75 0.0016 28.0 5.7 67 579-653 16-85 (101)
65 PF07691 PA14: PA14 domain; I 38.4 3.5E+02 0.0075 25.5 12.1 80 18-106 49-136 (145)
66 PF05751 FixH: FixH; InterPro 37.4 84 0.0018 30.3 6.3 50 834-886 85-134 (146)
67 TIGR03096 nitroso_cyanin nitro 37.4 34 0.00074 32.9 3.3 30 863-892 99-130 (135)
68 smart00252 SH2 Src homology 2 36.5 40 0.00086 29.1 3.4 29 625-653 17-45 (84)
69 smart00252 SH2 Src homology 2 36.0 33 0.00072 29.6 2.8 28 860-887 19-46 (84)
70 PF04234 CopC: CopC domain; I 35.9 32 0.00069 31.0 2.8 55 144-200 35-96 (97)
71 PRK06655 flgD flagellar basal 34.4 1.6E+02 0.0035 31.1 8.0 27 320-346 114-140 (225)
72 PF09430 DUF2012: Protein of u 34.0 1.6E+02 0.0035 27.7 7.3 55 48-106 10-64 (123)
73 PF03422 CBM_6: Carbohydrate b 33.4 2E+02 0.0044 26.5 8.0 64 25-91 41-111 (125)
74 PF01483 P_proprotein: Proprot 31.9 3E+02 0.0064 23.9 8.3 35 227-261 4-38 (87)
75 PF01483 P_proprotein: Proprot 31.6 2E+02 0.0043 25.0 7.1 61 130-202 4-70 (87)
76 PRK06655 flgD flagellar basal 31.5 1.9E+02 0.0041 30.5 8.1 14 143-156 127-140 (225)
77 COG2373 Large extracellular al 31.5 1.6E+03 0.034 31.1 41.6 117 210-331 395-525 (1621)
78 KOG1948|consensus 31.3 1.2E+03 0.026 29.6 41.5 122 348-486 256-384 (1165)
79 PRK10301 hypothetical protein; 30.1 1.5E+02 0.0032 28.1 6.3 68 30-99 45-122 (124)
80 PRK12633 flgD flagellar basal 30.0 1.8E+02 0.0039 30.8 7.6 45 320-364 117-162 (230)
81 PF09430 DUF2012: Protein of u 29.9 2.1E+02 0.0045 27.0 7.3 53 241-297 10-62 (123)
82 PF05751 FixH: FixH; InterPro 29.8 2E+02 0.0042 27.7 7.4 56 31-87 69-131 (146)
83 PF07703 A2M_N_2: Alpha-2-macr 29.6 4.7E+02 0.01 24.4 10.2 105 313-422 8-118 (136)
84 KOG1692|consensus 29.5 1.9E+02 0.0042 29.4 7.1 56 125-183 38-93 (201)
85 PRK10301 hypothetical protein; 28.8 1.4E+02 0.0031 28.2 6.0 55 145-200 62-123 (124)
86 PF04234 CopC: CopC domain; I 28.5 46 0.001 29.9 2.5 32 720-752 60-95 (97)
87 PF11896 DUF3416: Domain of un 28.3 2.3E+02 0.0049 29.0 7.7 60 222-281 24-88 (187)
88 cd00173 SH2 Src homology 2 dom 27.0 89 0.0019 27.3 4.1 29 625-653 16-44 (94)
89 cd00173 SH2 Src homology 2 dom 27.0 73 0.0016 27.8 3.6 27 861-887 19-45 (94)
90 KOG1690|consensus 26.1 1.8E+02 0.0038 29.9 6.2 79 4-89 27-109 (215)
91 PRK09619 flgD flagellar basal 25.5 2E+02 0.0044 30.1 7.0 45 320-364 111-155 (218)
92 PRK12812 flgD flagellar basal 25.4 2.2E+02 0.0047 30.7 7.3 45 320-364 129-174 (259)
93 KOG1692|consensus 25.2 1.6E+02 0.0034 30.0 5.6 55 412-466 39-93 (201)
94 PF07523 Big_3: Bacterial Ig-l 23.8 83 0.0018 26.0 3.1 42 142-188 17-58 (67)
95 PRK09619 flgD flagellar basal 23.8 2.3E+02 0.005 29.7 7.0 32 143-174 124-155 (218)
96 PF11589 DUF3244: Domain of un 23.8 2.3E+02 0.005 25.8 6.3 60 324-383 40-101 (106)
97 PRK12633 flgD flagellar basal 23.4 2.7E+02 0.0058 29.5 7.4 14 143-156 130-143 (230)
98 PF06016 Reovirus_L2: Reovirus 23.4 1.3E+02 0.0028 39.2 5.7 105 314-422 1034-1149(1289)
99 PF05404 TRAP-delta: Transloco 23.2 3.2E+02 0.0068 27.4 7.3 52 130-185 45-101 (167)
100 PF07523 Big_3: Bacterial Ig-l 22.5 1.4E+02 0.003 24.7 4.1 41 334-379 19-59 (67)
101 PF00801 PKD: PKD domain; Int 22.4 2.9E+02 0.0062 22.7 6.1 53 410-469 8-60 (69)
102 PF11589 DUF3244: Domain of un 22.2 1.5E+02 0.0033 27.0 4.7 52 142-193 48-101 (106)
103 COG4856 Uncharacterized protei 20.8 5.6E+02 0.012 29.1 9.3 25 646-670 53-79 (403)
104 PF10633 NPCBM_assoc: NPCBM-as 20.5 5.2E+02 0.011 21.8 7.5 67 817-884 1-75 (78)
No 1
>KOG0518|consensus
Probab=100.00 E-value=6.4e-106 Score=925.23 Aligned_cols=851 Identities=30% Similarity=0.498 Sum_probs=675.2
Q ss_pred CCCCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEec--------------------------------------
Q psy192 2 PSGNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQ-------------------------------------- 43 (898)
Q Consensus 2 p~g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~-------------------------------------- 43 (898)
|+|....|.++++.||+|+++++|.+.|+|.+.++|++.+++
T Consensus 116 ~~g~~~~~~v~~~~dg~y~~k~~p~e~~~h~~e~~~~e~~~~~sP~~~~v~~~td~n~~~Alg~~le~~~vg~p~~f~v~ 195 (1113)
T KOG0518|consen 116 LIGSLTWTLVQDYGDGIYKTKRTPKEKGEHEVEVLYDEKPVPASPFVVKVNEGTDWNDVQALGPGLESARVGKPNVFEVE 195 (1113)
T ss_pred ccccceeEeeeccCCcceeeecCchhccchhhhhhhcccccccCCceeccccccCcccceEeccchhhcccCCCceeEEe
Confidence 678999999999999999999999999999999999999874
Q ss_pred ----CCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeE---EEEecCCcccceeeeee
Q psy192 44 ----GYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFT---AKIVGEGSNRQREKIQR 116 (898)
Q Consensus 44 ----G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~---~~v~~~~~~~~~~~i~~ 116 (898)
|.++|.+.++||+++...|.|+++|+|.++|+|.++|+|.+++.|.+.||+||||. ++..|+.+ . .+
T Consensus 196 ~~Ga~~gnl~~a~~gpse~~~~~~di~~g~c~veyip~~~gd~sv~~~~gg~~v~GsPf~~~t~~~~Dp~k--i--ki-- 269 (1113)
T KOG0518|consen 196 TPGAGQGNLEVAVEGPSEALTVIPDIKPGSCSVEYIPPEPGDYSVNTYYGGVPVPGSPFRVATCDKVDPSK--I--KI-- 269 (1113)
T ss_pred cCCccccceeeeecChhhhcccccccCCCcceeeecCCCCCceEEeeeecccccCCCCeeeeeecccChhh--e--ee--
Confidence 33689999999999999999999999999999999999999999999999999995 44556552 2 22
Q ss_pred eecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCce
Q psy192 117 QREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPF 196 (898)
Q Consensus 117 ~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf 196 (898)
.+++..+..++..|++|++.++-..+.+.|-++.+-..+..+.++++++|+|.|+|.+.|.|.|+|.++++.++.|||
T Consensus 270 --~gl~~gr~~~p~~F~vD~S~aG~~pl~v~V~g~r~p~e~v~~v~~~d~t~~V~fvPs~~g~~~isV~~ade~vp~spf 347 (1113)
T KOG0518|consen 270 --SGLPKGRLNKPQYFTVDASKAGLLPLEVAVCGGRVPVEPVMAVALGDGTYNVSFVPSEDGPYTISVRIADEDVPRSPF 347 (1113)
T ss_pred --cCCCcccccCCeeEEEechhcCccceEEEecCCCCcccceEEecCCCceeEEEEeEcccCCeeEEEEecCCcCCCCcc
Confidence 134444778899999999999999999999887777566788899999999999999999999999999999999999
Q ss_pred EEEEccCCCCCcceEEEeCCCCCcc--eeeeeeEEEEEeccCCCCceEEEEEcCCcc--eeeEEECCCC-cEEEEEEeCC
Q psy192 197 QFTVGPLRDGGAHRVHAGGPGLERG--EQNQPCEFNVWTREAGAGSLAISVEGPSKA--EIDFKDRKDG-SCYVSYVVAE 271 (898)
Q Consensus 197 ~v~V~~~~~~d~~~v~a~G~GL~~~--~~g~~~~F~V~~~daG~g~l~v~v~gps~~--~i~~~d~~dG-ty~v~y~P~~ 271 (898)
.+.|.|. .+++++++.|+||... .+-.+..|++|.+|||.+.|++.|.||.+. +..+.++++| .|.++|.|.+
T Consensus 348 ~~kvlp~--~~as~~~a~g~gLt~~~v~as~~~d~~fD~~Dageg~levqV~gp~Gk~~~~~V~d~~~~~~h~vsY~pd~ 425 (1113)
T KOG0518|consen 348 LVKVLPT--VDASKVTASGAGLTDIGVNASVEVDFTFDEGDAGEGLLEVQVVGPEGKEKEVVVRDNGRGGIHIVTYVPDC 425 (1113)
T ss_pred eeeeccc--cchhhhccccCccccccccccccceeeEEccccccceEEEEEECCCCCceeeEEEecCCCceEEEEEcCCC
Confidence 9999974 5689999999999987 456778999999999999999999999764 4567888887 9999999999
Q ss_pred CceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCC-ccceEEEEECCCCCeeeeEE
Q psy192 272 PGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGA-VGALDAKVISPSGTEDDCFI 350 (898)
Q Consensus 272 ~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a-~G~l~v~v~~p~g~~~~~~v 350 (898)
+|.|.|.|+|+|++|++|||++...+ |++++.+..-. .+++|.+..|.||++.+ .|.+.+.+.+|+++..+..+
T Consensus 426 ~G~y~i~v~~~gd~i~gSPf~~ra~~---dask~~~~~~i--~~~vg~~~~~~vdak~ag~g~~~~~~~~pdn~n~~~~v 500 (1113)
T KOG0518|consen 426 PGRYLIVVFYGGDPIPGSPFTARAYP---DASKEVLVPPI--NAQVGKEANFVVDAKAAGAGPVTCTVVTPDNRNVPADV 500 (1113)
T ss_pred CCceEEEEEECCcccCCCceEEEecc---CccceEecCCC--cccccceeeeEeehhhccCCCceeEEeccccCCCCcee
Confidence 99999999999999999999999876 56776654322 26899999999999877 47789999999999999999
Q ss_pred EecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCC------------CCCceEEecC-----------CCc
Q psy192 351 QPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGE------------ADPAAVHATG-----------NGL 407 (898)
Q Consensus 351 ~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~------------~d~s~v~v~G-----------~Gl 407 (898)
+.+++|+|.+.|++.++|+|.|+|+|+|+|+++|||.+.+..+. .+-....... -+|
T Consensus 501 ~~~~dg~~~iv~ta~~~~~~~i~v~f~ge~vp~sp~~~~a~~~~~~~~~~r~v~e~h~v~v~~~~~~~~vi~~~v~k~~l 580 (1113)
T KOG0518|consen 501 QKNGDGTFDIVFTAQDPGTYTIDVRFGGEHVPDSPFIVSASAGIGTAVAARWVTEEHSVEVIEQQSKTLVIPFTVQKKEL 580 (1113)
T ss_pred eeccCcceeeEeccCCCCcceEEEEECCeeCCCCceeEEEecccccccccccccccccccccccccccccccccccccch
Confidence 99999999999999999999999999999999999999886430 0000000000 011
Q ss_pred ce----eeeCCeEEEEEEcCCCC---CCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCe
Q psy192 408 AE----IKSGVKTDFIVDTCNAG---AGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPF 479 (898)
Q Consensus 408 ~~----~~~g~~~~F~Vd~~~aG---~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf 479 (898)
.. ...++..-++||..+.- .-.+.+.|.||++..+.|.+|+|| |.++|+|+.||.|+|.|+|+++||+||||
T Consensus 581 ~~ev~~~t~k~~~~~iiDn~d~t~~v~~~~t~aiegPskaei~c~dn~Dgt~~vsy~P~vpG~y~i~V~~~~k~vpgSPF 660 (1113)
T KOG0518|consen 581 AGEVNMPTGKQAKPEIIDNKDGTITVRYAPTLAIEGPSKAEIRCYDNKDGTCSVSYTPTVPGVYKIEVKFGDKHVPGSPF 660 (1113)
T ss_pred hccccCccccccCceeeccCCCceeEEeccccccCCCCccceeEEeCCCCcEEeccCCCCCceEEEEEEECCeeCCCCCc
Confidence 10 11123334556554431 112345578999999999999999 99999999999999999999999999999
Q ss_pred EEEEec-ccCCCCC--cceeeeEEEeeEEeeeccCCC-CCccccccCCCceEEEe--cCc--eeeeeccCccEEEEEeec
Q psy192 480 KVKCTG-KDLGERG--GQETSSVTVETVQKVAKNKTQ-GPVIPIFKSDASKVTCK--GMG--LKKAYAQKQNMFTIHCQD 551 (898)
Q Consensus 480 ~v~v~~-~~~~~~~--~~~~s~v~v~~~~~~~~~~~~-~~~~p~~~~~~~~v~~~--g~g--l~~~~~~~~~~f~v~~~~ 551 (898)
+..|.+ .++...- ...+..+.+.. .+..-.... ....|.. ....+.++ -.| -..+.+++.+.+.|+.+.
T Consensus 661 ~~~Vt~~~s~r~s~~~Vgs~~ev~in~-~~~~~s~ltaeI~~PsG--n~~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~ 737 (1113)
T KOG0518|consen 661 VALVTGDESIRTSHLSVGSIHEVTINI-TEADLSVLTAEIVDPSG--NPEPCLVRRLPNGHDGITFTPREVGEHKINVKV 737 (1113)
T ss_pred eEEEecccccccccceecceeeEEeec-cccCCceeEEEEECCCC--CccceeeEecCCCceeEEECCCcCcceEEEEEE
Confidence 999986 2111000 00111222210 000000000 0000111 11122221 112 224668888999998765
Q ss_pred C-----CCCeEEEEec--cCCcceeEeCCCccceecCCeeEEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccceEE
Q psy192 552 A-----GSPFKLYVDS--IPSGYVTAYGPGLISGVSGEPCLFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKAEIT 624 (898)
Q Consensus 552 ~-----G~~~~l~v~~--~~~~~v~~~Gpgl~~g~~g~p~~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~~i~ 624 (898)
+ |+||.+.|.. .+.+++.++|++|..|+.-+|++|+|||+.||. |+|+++++||+|++++
T Consensus 738 ~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~agy-------------GgLsi~~~Gpskvd~~ 804 (1113)
T KOG0518|consen 738 AGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKAGY-------------GGLSISVQGPSKVDLN 804 (1113)
T ss_pred cceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccCCC-------------CceEEEEeCCcccccc
Confidence 4 5788777753 356789999999999988899999999999997 8999999999999999
Q ss_pred EEeCCCCeEEEEEEcCCCeeEEEEEEECCeecCCCceEEEEccCcccccee----------eecCceeecccceeeccCc
Q psy192 625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGEKHIKGSPYLAKITGEGRKRNQI----------SVGSCSEVSFPGKVSDSDI 694 (898)
Q Consensus 625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~~I~GSPF~v~V~~~~~~~~~~----------~~g~~~~~~~~~~i~~~~~ 694 (898)
|.|++||+|+|+|+|++||.|.|+|+|+++||+||||+|+++++++.-+.+ .+|+. .+|..++++...
T Consensus 805 ~~d~~dGt~kV~ytPtepG~Y~I~i~Fad~~I~gSPftVkv~~~~~vvesi~~~~~~~~va~~g~~--~~l~lk~~~~~~ 882 (1113)
T KOG0518|consen 805 VEDREDGTCKVSYTPTEPGTYIINIKFADEHIKGSPFTVKVTGESRVVESITRDREAPSVARVGHT--CSLDLKATEASS 882 (1113)
T ss_pred eeecCCCeEEEEEeCCCCceEEEEEEEcCccCCCCceEEEecCCeeEeeeeeecccccceecccce--eeeeeecCCCCc
Confidence 999999999999999999999999999999999999999999987422222 22322 234445554444
Q ss_pred ceeEEEEECCCCCeeceEEEeeCCCcEEEEEEEcccccEEEEEEECCEEcCCCCeEEEeccccccCcceeEEeCcccccc
Q psy192 695 RSLNASIQAPSGLEEPCFLKKIPNGNLGISFTPREVGSHLVSVKKMGVHIKNSPFKINVGEREVGDAKKVKVFGQSLTEG 774 (898)
Q Consensus 695 ~~l~a~v~~psg~~~~~~v~~~~~g~y~v~ftP~e~G~h~i~V~~~G~~i~gSPf~i~V~~~e~~da~kv~v~G~gl~~g 774 (898)
...+|.+.+|||...+..+.++.+++|+|+|||+|+|.|+|+|+|.++|+.+|||++- +-+-+-++|+|+-|+||..+
T Consensus 883 ~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~vegspftvg--s~~e~~ahkvra~g~Gl~R~ 960 (1113)
T KOG0518|consen 883 QDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQHVEGSPFTVG--SLGEGGAHKVRAAGPGLERG 960 (1113)
T ss_pred cceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecCccCCCCceeee--cccccccceeeccCCceeec
Confidence 5678889999999999999999999999999999999999999999999999997763 33457899999999999999
Q ss_pred ccCCCCceEEecCCCCCc-eEEecCCCCCCCceeEEecCCccce-ecCceeEEEEEcCCCCCceeEEEEeCCCccccccc
Q psy192 775 KTHEENPFTVDTRDAGSP-LRIKVGKGEADPAAVHATGNGLAEI-KSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKY 852 (898)
Q Consensus 775 ~~~~~~~f~vd~~~aG~p-~~v~v~~~~~~~~~v~a~G~GL~~~-~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~ 852 (898)
.++++ |+|++|+||-. +-+.+...+ =....++.++||... ..|.+..|.|+|+. +.|.+.+.+.+|+....
T Consensus 961 ~~g~~--F~I~~rea~~~g~~va~egps-c~v~y~v~~pGv~~l~~~~~~~~~~v~tn~-a~g~ida~v~~p~~~~~--- 1033 (1113)
T KOG0518|consen 961 EAGEP--FSIWTREAGAGGLAVAVEGPS-CGVSYTVQTPGVGSLQEQGVPASFAVTTNR-AKGDIDAKVHTPSNAVE--- 1033 (1113)
T ss_pred cccCc--eeEeehhcCccceEEEeeCCC-ceeEEEecCCCcchhhhcCcceEEEEEecc-ccCceeeEEeCCCCCcc---
Confidence 99998 99999999843 222232111 122345567776532 12247889999865 56788899999986432
Q ss_pred cceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCCCCceeeC
Q psy192 853 KDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPGSPFKVEV 898 (898)
Q Consensus 853 ~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~GSPF~v~v 898 (898)
++++ ...++|.++|+|.|+|.|+|.|||+|.|||||||+|+|
T Consensus 1034 --~c~~--~~~~~ya~~f~P~e~G~h~i~Vkfd~~hv~gSPF~i~v 1075 (1113)
T KOG0518|consen 1034 --ECYV--SEIEKYAMRFIPKENGVHTIDVKFDGEHVPGSPFKIRV 1075 (1113)
T ss_pred --EEEe--eccceEEEEEeecCCCcEEEEEEeCCccCCCCCeEEEE
Confidence 2332 23449999999999999999999999999999999986
No 2
>KOG0518|consensus
Probab=100.00 E-value=1.2e-98 Score=865.06 Aligned_cols=809 Identities=32% Similarity=0.519 Sum_probs=649.5
Q ss_pred CCCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEec----------------------CCCC-------------
Q psy192 3 SGNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQ----------------------GYGG------------- 47 (898)
Q Consensus 3 ~g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~----------------------G~g~------------- 47 (898)
+|..-+..|++|.|+||.+.|.|+.+|.+.++++|++.... |.-+
T Consensus 17 ~g~~ae~~i~~~~~~T~~~t~~~~~aG~~~~slk~~~~dg~~~p~~v~vl~~~~~skv~~~~p~~q~~~~v~~a~~~ft~ 96 (1113)
T KOG0518|consen 17 SGIVAEVKIQDNLDGTVTVTYDPQRAGFYILSLKYDGSDGVNLPSLVQVLSAVDTSKVKKKGPGIQGLHNVREALNKFTV 96 (1113)
T ss_pred cCcceeeEEEecCCceEEEEEccccCcceeEEEEecCccccccceeeEEeeccccceeEEecCCccCcchhhhhhhhhhh
Confidence 45666777999999999999999999999999999998652 1111
Q ss_pred ---eeEEEec---------Cccc--eEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCcccceee
Q psy192 48 ---LSLSIEG---------PSKA--EIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGSNRQREK 113 (898)
Q Consensus 48 ---l~v~v~G---------P~~~--~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~~~~~~~ 113 (898)
+.+.+-| |... ..-++++.||+|+..|+|.+.|+|.+.++|++.+++-|||.+.+.--. +..+
T Consensus 97 d~r~~~nigs~hivd~v~~~~g~~~~~~v~~~~dg~y~~k~~p~e~~~h~~e~~~~e~~~~~sP~~~~v~~~t-d~n~-- 173 (1113)
T KOG0518|consen 97 DNRKETNIGSAHIVDHVVKLIGSLTWTLVQDYGDGIYKTKRTPKEKGEHEVEVLYDEKPVPASPFVVKVNEGT-DWND-- 173 (1113)
T ss_pred ccceeeccCCcccccccccccccceeEeeeccCCcceeeecCchhccchhhhhhhcccccccCCceecccccc-Cccc--
Confidence 1111112 1111 223467889999999999999999999999999999999998764211 1111
Q ss_pred eeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC
Q psy192 114 IQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG 193 (898)
Q Consensus 114 i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g 193 (898)
....+..+....+|++..|++..++++.+.+...+.+|+-- ...+.++++|..+++|+|.++|+|++.++|++.||+|
T Consensus 174 ~~Alg~~le~~~vg~p~~f~v~~~Ga~~gnl~~a~~gpse~--~~~~~di~~g~c~veyip~~~gd~sv~~~~gg~~v~G 251 (1113)
T KOG0518|consen 174 VQALGPGLESARVGKPNVFEVETPGAGQGNLEVAVEGPSEA--LTVIPDIKPGSCSVEYIPPEPGDYSVNTYYGGVPVPG 251 (1113)
T ss_pred ceEeccchhhcccCCCceeEEecCCccccceeeeecChhhh--cccccccCCCcceeeecCCCCCceEEeeeecccccCC
Confidence 11122344556889999999999999999999999988753 3467889999999999999999999999999999999
Q ss_pred CceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEEEEcCCc--ceeeEEECCCCcEEEEEEeCC
Q psy192 194 SPFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAISVEGPSK--AEIDFKDRKDGSCYVSYVVAE 271 (898)
Q Consensus 194 SPf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~v~gps~--~~i~~~d~~dGty~v~y~P~~ 271 (898)
|||++.+.. ..|++++++. ||..+++++|..|+||+.+||-..|.|.+-++.. ..+...+++|++|+|.|+|..
T Consensus 252 sPf~~~t~~--~~Dp~kiki~--gl~~gr~~~p~~F~vD~S~aG~~pl~v~V~g~r~p~e~v~~v~~~d~t~~V~fvPs~ 327 (1113)
T KOG0518|consen 252 SPFRVATCD--KVDPSKIKIS--GLPKGRLNKPQYFTVDASKAGLLPLEVAVCGGRVPVEPVMAVALGDGTYNVSFVPSE 327 (1113)
T ss_pred CCeeeeeec--ccChhheeec--CCCcccccCCeeEEEechhcCccceEEEecCCCCcccceEEecCCCceeEEEEeEcc
Confidence 999877653 3578898888 6888899999999999999999999999988754 246778899999999999999
Q ss_pred CceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCC--ceecCCCeEEEEEecCC-ccceEEEEECCCCCeeee
Q psy192 272 PGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQG--VVMADKPTQFLVRKNGA-VGALDAKVISPSGTEDDC 348 (898)
Q Consensus 272 ~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~--~~~~g~~~~f~V~~~~a-~G~l~v~v~~p~g~~~~~ 348 (898)
.|.|.|+|.++++.++.|||.+.+.+..+ +++....++... ++.+-.+..|.+|..|+ .|.|+++|++|.|+..++
T Consensus 328 ~g~~~isV~~ade~vp~spf~~kvlp~~~-as~~~a~g~gLt~~~v~as~~~d~~fD~~Dageg~levqV~gp~Gk~~~~ 406 (1113)
T KOG0518|consen 328 DGPYTISVRIADEDVPRSPFLVKVLPTVD-ASKVTASGAGLTDIGVNASVEVDFTFDEGDAGEGLLEVQVVGPEGKEKEV 406 (1113)
T ss_pred cCCeeEEEEecCCcCCCCcceeeeccccc-hhhhccccCccccccccccccceeeEEccccccceEEEEEECCCCCceee
Confidence 99999999999999999999999998753 466666554433 23455678899999988 678999999999999999
Q ss_pred EEEecCCC-eEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCC
Q psy192 349 FIQPIDGD-NYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGA 427 (898)
Q Consensus 349 ~v~~~~~g-~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~ 427 (898)
.++++.++ +|.++|.|..+|.|.|.|+|+|++|++|||+++.. .|++|+.+..+.- .++|++.+|.||++.||.
T Consensus 407 ~V~d~~~~~~h~vsY~pd~~G~y~i~v~~~gd~i~gSPf~~ra~---~dask~~~~~~i~--~~vg~~~~~~vdak~ag~ 481 (1113)
T KOG0518|consen 407 VVRDNGRGGIHIVTYVPDCPGRYLIVVFYGGDPIPGSPFTARAY---PDASKEVLVPPIN--AQVGKEANFVVDAKAAGA 481 (1113)
T ss_pred EEEecCCCceEEEEEcCCCCCceEEEEEECCcccCCCceEEEec---cCccceEecCCCc--ccccceeeeEeehhhccC
Confidence 99997766 99999999999999999999999999999999985 3899988754432 688999999999999999
Q ss_pred CceEEEEECCCCcee--eeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCeEEEEecccCCCCCcceeeeEEEeeE
Q psy192 428 GTLAVTIDGPSKVSM--DCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPFKVKCTGKDLGERGGQETSSVTVETV 504 (898)
Q Consensus 428 g~l~v~v~gp~~~~~--~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf~v~v~~~~~~~~~~~~~s~v~v~~~ 504 (898)
+.+.+.+..|++..+ ++..+.+| |++.||+..+|+|.|.|+|+|+|+++|||.+.....-.. . .... .+...
T Consensus 482 g~~~~~~~~pdn~n~~~~v~~~~dg~~~iv~ta~~~~~~~i~v~f~ge~vp~sp~~~~a~~~~~~-~---~~~r-~v~e~ 556 (1113)
T KOG0518|consen 482 GPVTCTVVTPDNRNVPADVQKNGDGTFDIVFTAQDPGTYTIDVRFGGEHVPDSPFIVSASAGIGT-A---VAAR-WVTEE 556 (1113)
T ss_pred CCceeEEeccccCCCCceeeeccCcceeeEeccCCCCcceEEEEECCeeCCCCceeEEEeccccc-c---cccc-ccccc
Confidence 999999999987654 55677888 999999999999999999999999999999987643000 0 0000 00000
Q ss_pred EeeeccCCCCCccccccCCCceEEEecCceeeeeccCccEEEEEeecCCCCeEEEEeccCCcceeEeCCCccceecCCee
Q psy192 505 QKVAKNKTQGPVIPIFKSDASKVTCKGMGLKKAYAQKQNMFTIHCQDAGSPFKLYVDSIPSGYVTAYGPGLISGVSGEPC 584 (898)
Q Consensus 505 ~~~~~~~~~~~~~p~~~~~~~~v~~~g~gl~~~~~~~~~~f~v~~~~~G~~~~l~v~~~~~~~v~~~Gpgl~~g~~g~p~ 584 (898)
-.+ ..++ .+. + .+ .-.|++.-.+ + +..-++-++....|
T Consensus 557 h~v-------~v~~----------~~~----~-~~--vi~~~v~k~~------l-----------~~ev~~~t~k~~~~- 594 (1113)
T KOG0518|consen 557 HSV-------EVIE----------QQS----K-TL--VIPFTVQKKE------L-----------AGEVNMPTGKQAKP- 594 (1113)
T ss_pred ccc-------cccc----------ccc----c-cc--cccccccccc------h-----------hccccCccccccCc-
Confidence 000 0000 000 0 00 0011111000 0 00001111222233
Q ss_pred EEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECCeecCCCceEEE
Q psy192 585 LFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGEKHIKGSPYLAK 664 (898)
Q Consensus 585 ~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~~I~GSPF~v~ 664 (898)
++|+.++.-. .++ =.+.++||||++++|+|+||+||+|.|+|+|+.||+|+|.|+|+|+|||||||+..
T Consensus 595 -~iiDn~d~t~------~v~----~~~t~aiegPskaei~c~dn~Dgt~~vsy~P~vpG~y~i~V~~~~k~vpgSPF~~~ 663 (1113)
T KOG0518|consen 595 -EIIDNKDGTI------TVR----YAPTLAIEGPSKAEIRCYDNKDGTCSVSYTPTVPGVYKIEVKFGDKHVPGSPFVAL 663 (1113)
T ss_pred -eeeccCCCce------eEE----eccccccCCCCccceeEEeCCCCcEEeccCCCCCceEEEEEEECCeeCCCCCceEE
Confidence 3455544321 111 12357899999999999999999999999999999999999999999999999999
Q ss_pred Ecc-CccccceeeecCceeecccceeeccCcceeEEEEECCCCCeeceEEEeeCCCcEEEEEEEcccccEEEEEEECCEE
Q psy192 665 ITG-EGRKRNQISVGSCSEVSFPGKVSDSDIRSLNASIQAPSGLEEPCFLKKIPNGNLGISFTPREVGSHLVSVKKMGVH 743 (898)
Q Consensus 665 V~~-~~~~~~~~~~g~~~~~~~~~~i~~~~~~~l~a~v~~psg~~~~~~v~~~~~g~y~v~ftP~e~G~h~i~V~~~G~~ 743 (898)
|++ .+.+.++.++|+..+..+. +++.+.+.|+|+|.+|+|..+||.++..+||.|.++|+|+|+|+|+|+|+++|+|
T Consensus 664 Vt~~~s~r~s~~~Vgs~~ev~in--~~~~~~s~ltaeI~~PsGn~~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~~G~h 741 (1113)
T KOG0518|consen 664 VTGDESIRTSHLSVGSIHEVTIN--ITEADLSVLTAEIVDPSGNPEPCLVRRLPNGHDGITFTPREVGEHKINVKVAGKH 741 (1113)
T ss_pred EecccccccccceecceeeEEee--ccccCCceeEEEEECCCCCccceeeEecCCCceeEEECCCcCcceEEEEEEcceE
Confidence 996 6677788999999888775 7788889999999999999999999999999999999999999999999999999
Q ss_pred cCCCCeEEEeccccccCcceeEEeCccccccccCCCCceEEecCCCC---------------------------------
Q psy192 744 IKNSPFKINVGEREVGDAKKVKVFGQSLTEGKTHEENPFTVDTRDAG--------------------------------- 790 (898)
Q Consensus 744 i~gSPf~i~V~~~e~~da~kv~v~G~gl~~g~~~~~~~f~vd~~~aG--------------------------------- 790 (898)
|++|||.|.|.+.|. ||+|++++|+||.+|.+.|+++|+||||+||
T Consensus 742 VpgsPf~i~V~~~e~-dAsk~~v~g~g~~~G~t~ep~~fivDtr~agyGgLsi~~~Gpskvd~~~~d~~dGt~kV~ytPt 820 (1113)
T KOG0518|consen 742 VPGSPFSIKVSESEI-DASKVRVSGQGLKEGHTFEPAEFIVDTRKAGYGGLSISVQGPSKVDLNVEDREDGTCKVSYTPT 820 (1113)
T ss_pred CCCCCeEEEeccccc-cceeEEEecccccccccccchheEeccccCCCCceEEEEeCCcccccceeecCCCeEEEEEeCC
Confidence 999999999999999 9999999999999999999999999999998
Q ss_pred ------------------CceEEecCCCCCCCc----eeEEecCCccceecCceeEEEEEcCCCCCceeEEEEeCCCccc
Q psy192 791 ------------------SPLRIKVGKGEADPA----AVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVS 848 (898)
Q Consensus 791 ------------------~p~~v~v~~~~~~~~----~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~ 848 (898)
|||.|++. +++ .+.-..+++.-+..|+.++|.++.-+++.-..+++++.|++--
T Consensus 821 epG~Y~I~i~Fad~~I~gSPftVkv~----~~~~vvesi~~~~~~~~va~~g~~~~l~lk~~~~~~~d~ta~vt~PSG~~ 896 (1113)
T KOG0518|consen 821 EPGTYIINIKFADEHIKGSPFTVKVT----GESRVVESITRDREAPSVARVGHTCSLDLKATEASSQDITARVTDPSGRV 896 (1113)
T ss_pred CCceEEEEEEEcCccCCCCceEEEec----CCeeEeeeeeecccccceecccceeeeeeecCCCCccceEEEeeCCCCCc
Confidence 34444442 222 2333556677789999999999999999999999999999743
Q ss_pred cccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCCCCcee
Q psy192 849 VKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPGSPFKV 896 (898)
Q Consensus 849 ~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~GSPF~v 896 (898)
.+..+.+.++++|.|+|+|+|+|.|+|+|+|.|+||+||||++
T Consensus 897 -----~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~vegspftv 939 (1113)
T KOG0518|consen 897 -----FEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQHVEGSPFTV 939 (1113)
T ss_pred -----cccEEEECCCceEEEEecCCCCCceEEEEEecCccCCCCceee
Confidence 2566778999999999999999999999999999999999987
No 3
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.81 E-value=1.2e-19 Score=163.38 Aligned_cols=90 Identities=39% Similarity=0.725 Sum_probs=84.0
Q ss_pred CceeEEecCCccceecCceeEEEEEcCCCCCceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEE
Q psy192 804 PAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVK 883 (898)
Q Consensus 804 ~~~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk 883 (898)
|++|+|+|+||..+.+|++++|+|+|+++|.+.|+|.|.+|++..+ ++.+.+++||+|.|+|+|+++|.|.|+|+
T Consensus 1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~-----~~~v~d~~dGty~v~y~P~~~G~~~i~V~ 75 (93)
T smart00557 1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKV-----PVEVKDNGDGTYTVSYTPTEPGDYTVTVK 75 (93)
T ss_pred CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCee-----EeEEEeCCCCEEEEEEEeCCCEeEEEEEE
Confidence 5789999999999999999999999999999999999999997332 56788999999999999999999999999
Q ss_pred ECCccCCCCCceeeC
Q psy192 884 WGDDHIPGSPFKVEV 898 (898)
Q Consensus 884 ~~~~~i~GSPF~v~v 898 (898)
|+|+||+||||++.|
T Consensus 76 ~~g~~I~gSPF~v~V 90 (93)
T smart00557 76 FGGEHIPGSPFTVKV 90 (93)
T ss_pred ECCEECCCCCEEEEE
Confidence 999999999999975
No 4
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.81 E-value=2.2e-19 Score=161.67 Aligned_cols=90 Identities=48% Similarity=0.879 Sum_probs=85.3
Q ss_pred cceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEEEEcCCc--ceeeEEECCCCcEEEEEEeCCCceEEEEEEECCee
Q psy192 208 AHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAISVEGPSK--AEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQH 285 (898)
Q Consensus 208 ~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~v~gps~--~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~ 285 (898)
+++|+|+|+||+.+.+|++++|+|+++|+|.+.|+|.|.+|++ .++++.|++||+|.|+|+|..+|.|.|+|+|+|++
T Consensus 1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~g~~ 80 (93)
T smart00557 1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFGGEH 80 (93)
T ss_pred CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEECCEE
Confidence 4689999999999999999999999999999999999999986 46788999999999999999999999999999999
Q ss_pred CCCCCceeEecC
Q psy192 286 IPDSPYKLFVSP 297 (898)
Q Consensus 286 I~gSPf~v~V~~ 297 (898)
|+||||.+.|.+
T Consensus 81 I~gSPF~v~V~~ 92 (93)
T smart00557 81 IPGSPFTVKVGP 92 (93)
T ss_pred CCCCCEEEEEeC
Confidence 999999999865
No 5
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.72 E-value=5.5e-17 Score=148.73 Aligned_cols=93 Identities=31% Similarity=0.512 Sum_probs=79.8
Q ss_pred CCCceeEEecCCccceecCceeEEEEEcCCCCCc-------eeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCC
Q psy192 802 ADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAG-------TLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRD 874 (898)
Q Consensus 802 ~~~~~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g-------~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~ 874 (898)
+||++|+|+|+||..+.+|++++|+|++++++.+ .|.|+|.+|+...... ..++.+.+++||+|.++|+|.+
T Consensus 2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~-~~~~~v~~~~~G~y~v~y~p~~ 80 (101)
T PF00630_consen 2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPV-PVPVEVIDNGDGTYTVSYTPTE 80 (101)
T ss_dssp CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS---EEEEEEEEESSSEEEEEEEESS
T ss_pred CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCcccc-ccceEEEECCCCEEEEEEEeCc
Confidence 6899999999999999999999999999998555 5779999998751100 1246677889999999999999
Q ss_pred CeeEEEEEEECCccCCCCCce
Q psy192 875 RGEYLLIVKWGDDHIPGSPFK 895 (898)
Q Consensus 875 ~G~y~l~vk~~~~~i~GSPF~ 895 (898)
+|.|.|+|+|+|+||+||||+
T Consensus 81 ~G~y~i~V~~~g~~I~gSPf~ 101 (101)
T PF00630_consen 81 PGKYKISVKINGQPIPGSPFT 101 (101)
T ss_dssp SEEEEEEEEESSEB-TTSSEE
T ss_pred cEeEEEEEEECCEECcCCCcC
Confidence 999999999999999999996
No 6
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.67 E-value=3.1e-16 Score=143.75 Aligned_cols=87 Identities=34% Similarity=0.685 Sum_probs=78.1
Q ss_pred CCcceEEEeCCCCCcceeeeeeEEEEEeccCCCC-------ceEEEEEcCCcc------eeeEEECCCCcEEEEEEeCCC
Q psy192 206 GGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAG-------SLAISVEGPSKA------EIDFKDRKDGSCYVSYVVAEP 272 (898)
Q Consensus 206 ~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g-------~l~v~v~gps~~------~i~~~d~~dGty~v~y~P~~~ 272 (898)
.||++|+|+|+||+.+.+|++++|+|+++|++++ .|.|.|.+|+.. .+++.+++||+|.++|+|+++
T Consensus 2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~ 81 (101)
T PF00630_consen 2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEP 81 (101)
T ss_dssp CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSS
T ss_pred CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCcc
Confidence 4789999999999999999999999999999655 567999999664 356789999999999999999
Q ss_pred ceEEEEEEECCeeCCCCCce
Q psy192 273 GEYRVGIKFNDQHIPDSPYK 292 (898)
Q Consensus 273 G~y~i~V~~~g~~I~gSPf~ 292 (898)
|.|+|+|+|+|++|+||||+
T Consensus 82 G~y~i~V~~~g~~I~gSPf~ 101 (101)
T PF00630_consen 82 GKYKISVKINGQPIPGSPFT 101 (101)
T ss_dssp EEEEEEEEESSEB-TTSSEE
T ss_pred EeEEEEEEECCEECcCCCcC
Confidence 99999999999999999996
No 7
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=99.36 E-value=7.8e-07 Score=104.44 Aligned_cols=424 Identities=13% Similarity=0.154 Sum_probs=223.1
Q ss_pred EEEEEcCCceeEEEEEEECCEEec---------------------------------CCCCeeEEEecCccceEE--EEe
Q psy192 20 SLHYDPREEGLHELALKFNGDHVQ---------------------------------GYGGLSLSIEGPSKAEIQ--CKD 64 (898)
Q Consensus 20 ~v~f~P~~~G~h~v~v~~~G~~v~---------------------------------G~g~l~v~v~GP~~~~i~--~~d 64 (898)
++.++|+++|.+.|.|..-..--. |. .|.+.|++-++. +- ..-
T Consensus 55 ~~~w~p~~~GkY~imVq~K~~~S~~~fD~~~~~~~~v~~~~~~~I~~~~~~~~~l~vGe-~l~~~V~~~~e~-~LYKF~I 132 (667)
T PRK14081 55 EINWIPKEEGEYTIMVQAKKEDSNKPFDYVSKEDYVIGKAEEKLIKNIYLDKDTLNVGE-KIEIKVDSNKEP-LMYRYWI 132 (667)
T ss_pred eEEEeeCCCccEEEEEEEecCCCCCCcceeEEEEEEEcccchhhheeeEecCccccCCC-EEEEEEEeccCc-EEEEEEE
Confidence 689999999999999988432110 11 244555543322 21 112
Q ss_pred CCCCeEE----------EEEeeCCCeEEEEEEEECCeecCC-------CCeEEEEecCCcccceeeeeeeecccceeeeC
Q psy192 65 NADGSLN----------ISYRPTEPGYYIINLKFADHHVEG-------SPFTAKIVGEGSNRQREKIQRQREAVPVTEVG 127 (898)
Q Consensus 65 ~~dG~~~----------v~ytP~~~G~~~I~V~~~g~~i~g-------SPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg 127 (898)
++||.+. ++|+|+.+|.|.|-+..-+.-=.. .=|. |.... . ..|............|
T Consensus 133 ~~~~~w~~iqDYst~n~lsyt~~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~--Vk~~~--~--v~I~~F~~lns~~i~~ 206 (667)
T PRK14081 133 KEDNNWKLIKDYSTENSLSYTANKPGKYELLVECKRIDSTKDFDDFKKVKFK--VKEID--K--VEITDFKCLNKELICD 206 (667)
T ss_pred cCCCcEEEEEecCCcceEEEEecCCCcEEEEEEEecCCCccccCcceEEEEE--cccCc--c--eEEEeccccCcceecC
Confidence 3455444 699999999999988765422110 0111 11111 0 0122111122335667
Q ss_pred cEEEEEEEcCCCcccceE--EEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC-----CceEEEE
Q psy192 128 STCKLTFKMPGITAFDLS--ATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG-----SPFQFTV 200 (898)
Q Consensus 128 ~~~~f~v~~~~~~~~~~~--~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g-----SPf~v~V 200 (898)
....|.+.+..-++..+- -.+.+++|... .+++=.. .=.++|+|..+|.|+|.+.+.+..=.+ .=..++|
T Consensus 207 ~eI~f~~~a~~~~g~~~LYKF~~i~~~G~~~--~~qdYst-~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~V 283 (667)
T PRK14081 207 EELVFEVESVYEEDRTILYKFVKIDSDGKQT--CIQDYST-KNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRAVLVYTV 283 (667)
T ss_pred cEEEEEEEEEeCCCceEEEEEEEECCCCCEE--EecCccc-cceEEEEeCCCccEEEEEEEeccCcccccccceEEEEEE
Confidence 888888777654443332 22445666532 2222111 235789999999999999997643332 1123334
Q ss_pred ccCCCCCcceEEEeCCCCCc-ceeeeeeEEEEEeccCCCCce--EEEEEcCCcceeeEEECCCCc-EEEEEEeCCCceEE
Q psy192 201 GPLRDGGAHRVHAGGPGLER-GEQNQPCEFNVWTREAGAGSL--AISVEGPSKAEIDFKDRKDGS-CYVSYVVAEPGEYR 276 (898)
Q Consensus 201 ~~~~~~d~~~v~a~G~GL~~-~~~g~~~~F~V~~~daG~g~l--~v~v~gps~~~i~~~d~~dGt-y~v~y~P~~~G~y~ 276 (898)
.+-. +-+++-.=.-+.. -.+|++..|.+.+.+ +.+| ...|.|+.. .+.+=++ -.+.|+|..+|.|+
T Consensus 284 k~~~---~vkI~~~~~d~~s~ql~g~~I~ika~a~G--G~~llYrf~I~G~~~-----e~~~Y~~~n~~~w~P~~~G~Y~ 353 (667)
T PRK14081 284 KPYK---DIKIRNFTTDLSSPQLTDTDIELKAVAEG--GKELLYRFIIKGKES-----EDSGYIRNNIYTWKPKIAGKYS 353 (667)
T ss_pred ecCC---CcEEEEEEEcCCCCeecCceEEEEEEecC--CCceEEEEEECCcEE-----eeccccccceEEEeeCCCceEE
Confidence 3211 1222211111222 358999999998874 4444 466776542 2222233 46899999999999
Q ss_pred EEEEECCeeCCCC-----CceeEecCCC-CC--ceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeee
Q psy192 277 VGIKFNDQHIPDS-----PYKLFVSPAM-GD--AHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDC 348 (898)
Q Consensus 277 i~V~~~g~~I~gS-----Pf~v~V~~~~-~~--~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~ 348 (898)
|.|..-|..-.+. -..+.|.... .+ ...+.+. ....+.+|++..|.+.+.+..+-+=--++--+|..+.
T Consensus 354 I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d--~~~~~lvG~~i~i~v~a~gg~~~lY~f~ik~ng~~ve- 430 (667)
T PRK14081 354 ITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILD--KGKHILKGEEIKIRVIAEGGTNLRYSFIIKKDGKEEE- 430 (667)
T ss_pred EEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEEC--CCCceEeCCeEEEEEEecCCCeEEEEEEEEECCEEEE-
Confidence 9999988654311 1112222210 01 1122221 1234678999999988776543221112222454321
Q ss_pred EEEecCCCeEEEEEEecCCeeEEEEEEECCeecC----C-CCeEEEeccCCCCCceE-EecCCCcceeeeCCeEEEEEEc
Q psy192 349 FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIP----G-SPLRIKVGKGEADPAAV-HATGNGLAEIKSGVKTDFIVDT 422 (898)
Q Consensus 349 ~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~----g-SPf~v~V~~~~~d~s~v-~v~G~Gl~~~~~g~~~~F~Vd~ 422 (898)
-.+-. -.-.+.|+|..+|.|+|.|..-+..=. . .=+.+.|.. .-|.+. .+-=+-.....+|+..+|.+.+
T Consensus 431 -~~~Y~-~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e--~~P~~i~~il~~~~~~~~vg~~i~~~~~~ 506 (667)
T PRK14081 431 -KIDYG-KNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHE--YIPAEIDYILLPVKEYYLVGDDIEIEVII 506 (667)
T ss_pred -Eeecc-cccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEec--cCceeeeeEEecccccEEeCCEEEEEEEE
Confidence 11111 123789999999999999987653111 0 112334431 112110 0001122345789999999988
Q ss_pred CCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEECCE
Q psy192 423 CNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKYNGY 472 (898)
Q Consensus 423 ~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~~g~ 472 (898)
.+...-.+...+. -++.-+..++-...-..+|+|..+|.|+|.|..-+.
T Consensus 507 ~~~k~v~y~y~~~-~NG~~v~~t~Ys~~~~ysf~P~~~GkY~I~V~aKn~ 555 (667)
T PRK14081 507 QNTKDVLIKYILK-INGHKVEETDYIKNKKYKFIPKCSGKYTIEVLAKNI 555 (667)
T ss_pred eCCCeEEEEEEEE-ECCEEEEEeeccccceEEEeecCCceEEEEEEEccc
Confidence 7732211121111 112122222222224578899999999998876553
No 8
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=99.33 E-value=3.3e-06 Score=99.24 Aligned_cols=374 Identities=14% Similarity=0.138 Sum_probs=198.1
Q ss_pred EEEEeeCCCeEEEEEEEECCeecCCCCeEE------EEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccce
Q psy192 71 NISYRPTEPGYYIINLKFADHHVEGSPFTA------KIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDL 144 (898)
Q Consensus 71 ~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~------~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~ 144 (898)
.++|+|+++|.|.|-|..-...-.. ||-- .|...... -|..-........+|+.+.|.++...- ++
T Consensus 55 ~~~w~p~~~GkY~imVq~K~~~S~~-~fD~~~~~~~~v~~~~~~----~I~~~~~~~~~l~vGe~l~~~V~~~~e---~~ 126 (667)
T PRK14081 55 EINWIPKEEGEYTIMVQAKKEDSNK-PFDYVSKEDYVIGKAEEK----LIKNIYLDKDTLNVGEKIEIKVDSNKE---PL 126 (667)
T ss_pred eEEEeeCCCccEEEEEEEecCCCCC-CcceeEEEEEEEcccchh----hheeeEecCccccCCCEEEEEEEeccC---cE
Confidence 3799999999999999987655433 4431 12221110 111111112334678888888775432 12
Q ss_pred EEE-EeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC-----CceEEEEccCCCCCcceEEEeCC-C
Q psy192 145 SAT-VTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG-----SPFQFTVGPLRDGGAHRVHAGGP-G 217 (898)
Q Consensus 145 ~~~-V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g-----SPf~v~V~~~~~~d~~~v~a~G~-G 217 (898)
-=+ ....+|+.. .++|=.. .=.++|+|..+|.|+|.+..-+.-=.. .=..++|.+. .++.+.+- -
T Consensus 127 LYKF~I~~~~~w~--~iqDYst-~n~lsyt~~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~-----~~v~I~~F~~ 198 (667)
T PRK14081 127 MYRYWIKEDNNWK--LIKDYST-ENSLSYTANKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEI-----DKVEITDFKC 198 (667)
T ss_pred EEEEEEcCCCcEE--EEEecCC-cceEEEEecCCCcEEEEEEEecCCCccccCcceEEEEEcccC-----cceEEEeccc
Confidence 111 133344422 2333111 236789999999999988775321110 0034555542 23444432 2
Q ss_pred CCcc-eeeeeeEEEEEeccCCCCceE--EEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCC-----
Q psy192 218 LERG-EQNQPCEFNVWTREAGAGSLA--ISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDS----- 289 (898)
Q Consensus 218 L~~~-~~g~~~~F~V~~~daG~g~l~--v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gS----- 289 (898)
|... .+|+...|.+++...++..+- ..+.+..+ .+.|...=.-+-.++|+|..+|.|.|.+...|..-...
T Consensus 199 lns~~i~~~eI~f~~~a~~~~g~~~LYKF~~i~~~G-~~~~~qdYst~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~ 277 (667)
T PRK14081 199 LNKELICDEELVFEVESVYEEDRTILYKFVKIDSDG-KQTCIQDYSTKNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRA 277 (667)
T ss_pred cCcceecCcEEEEEEEEEeCCCceEEEEEEEECCCC-CEEEecCccccceEEEEeCCCccEEEEEEEeccCcccccccce
Confidence 4443 589999999998866555543 33433333 22232111122368999999999999998877532111
Q ss_pred --CceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEecCC
Q psy192 290 --PYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMPREN 367 (898)
Q Consensus 290 --Pf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~ 367 (898)
=|+|....+. ....+.+ ...+....|++..|...+.+..+-|=--++ .|... ...+- .-.=.+.|+|.++
T Consensus 278 ~l~y~Vk~~~~v-kI~~~~~--d~~s~ql~g~~I~ika~a~GG~~llYrf~I--~G~~~--e~~~Y-~~~n~~~w~P~~~ 349 (667)
T PRK14081 278 VLVYTVKPYKDI-KIRNFTT--DLSSPQLTDTDIELKAVAEGGKELLYRFII--KGKES--EDSGY-IRNNIYTWKPKIA 349 (667)
T ss_pred EEEEEEecCCCc-EEEEEEE--cCCCCeecCceEEEEEEecCCCceEEEEEE--CCcEE--eeccc-cccceEEEeeCCC
Confidence 1443211110 0111222 122345688988888887764332211122 24332 11111 1123789999999
Q ss_pred eeEEEEEEECCeecCC-----CCeEEEeccCCCCC---ceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEECCCC
Q psy192 368 GIHNIHIKFNGVHIPG-----SPLRIKVGKGEADP---AAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSK 439 (898)
Q Consensus 368 G~h~i~V~~~g~~I~g-----SPf~v~V~~~~~d~---s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~ 439 (898)
|.|.|.|.+-+..=.+ .=..+++......| ..+.+.. -....+|++..|.+.+.+...=.....|.. ++
T Consensus 350 G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d~--~~~~lvG~~i~i~v~a~gg~~~lY~f~ik~-ng 426 (667)
T PRK14081 350 GKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILDK--GKHILKGEEIKIRVIAEGGTNLRYSFIIKK-DG 426 (667)
T ss_pred ceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEECC--CCceEeCCeEEEEEEecCCCeEEEEEEEEE-CC
Confidence 9999999987653321 11123332211122 2233332 234578999999988765532112222221 23
Q ss_pred ceeeeEEeCCeEEEEEecCCCeeEEEEEEECCE
Q psy192 440 VSMDCTEVEEGYKVRYTPLVPGDYYVSLKYNGY 472 (898)
Q Consensus 440 ~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~~g~ 472 (898)
..++-++-...-.++|+|..+|.|+|+|..-+.
T Consensus 427 ~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk 459 (667)
T PRK14081 427 KEEEKIDYGKNNWVNFIPEEKGNYELEVRVKDK 459 (667)
T ss_pred EEEEEeecccccEEEEEECCCeeEEEEEEEecc
Confidence 233322222236899999999999998876543
No 9
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=95.26 E-value=21 Score=49.70 Aligned_cols=293 Identities=12% Similarity=0.093 Sum_probs=145.0
Q ss_pred EEEEEeeCCCeEEEEEEEECCeecCCC--CeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEE
Q psy192 70 LNISYRPTEPGYYIINLKFADHHVEGS--PFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSAT 147 (898)
Q Consensus 70 ~~v~ytP~~~G~~~I~V~~~g~~i~gS--Pf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~ 147 (898)
..+.||+.++|.|+|++..-+.- ... ...+.|.++-.... + ........+|+..+|.+.+..-. +++..
T Consensus 1830 k~v~~Tf~~aG~ytV~L~AsN~v-s~~~~s~~~~VQe~I~~L~---L---~as~~~~~~n~~v~fsa~l~~GS--~Vtf~ 1900 (2740)
T TIGR00864 1830 KHACMTFPDAGTFAIRLNASNAV-SGKSASREFFAEEPIFGLE---L---KASKKIAAIGEKVEFQILLAAGS--AVNFR 1900 (2740)
T ss_pred ceeEEecCCCeEEEEEEEEEccc-CcceeeeeEEEEEecceEE---E---ecccccccCCCEEEEEEEecCCC--ceEEE
Confidence 56889999999999999843321 222 23334444321111 0 01223456788888887665322 23322
Q ss_pred EeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEE--EccCCCCCcceEEEeCCCCCc-----
Q psy192 148 VTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFT--VGPLRDGGAHRVHAGGPGLER----- 220 (898)
Q Consensus 148 V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~--V~~~~~~d~~~v~a~G~GL~~----- 220 (898)
..=-+|... +.+ ...+|. +.=.++|.|.|+|...+. |......++ |... +.|.-|..
T Consensus 1901 w~fGdgs~~---~~t-~~~t~~--HsY~~~G~Y~VtV~A~N~-Vs~~~a~~~V~Vle~---------V~gL~I~~~c~~~ 1964 (2740)
T TIGR00864 1901 LQIGGAAPE---VLQ-PGPRFS--HSFPRVDDHMVNLRAKNE-VSCAQANLHIEVLEA---------VRGLQIPDCCAAG 1964 (2740)
T ss_pred EEeCCCCce---eec-CCCeEE--eecCCCCcEEEEEEEEec-cceeeeeEEEEEEEe---------ccceEecCCcccc
Confidence 221223321 111 122333 444789999999998764 444444443 3321 12211111
Q ss_pred ceeeeeeEEEEEeccCCCCceEEEEEcCCcceeeEEE---CCCCc-----EEEEEEeCCCceEEEEEEECCe-eCCCCCc
Q psy192 221 GEQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDFKD---RKDGS-----CYVSYVVAEPGEYRVGIKFNDQ-HIPDSPY 291 (898)
Q Consensus 221 ~~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d---~~dGt-----y~v~y~P~~~G~y~i~V~~~g~-~I~gSPf 291 (898)
..+|+...|+..... | ..+...= .+.. +++.. =++.|+|..+|.|+|+|+-.+. .-..-+-
T Consensus 1965 ~~~g~~~tF~A~v~~-g-~~V~f~W--------~f~~~~~~g~~~~~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~ 2034 (2740)
T TIGR00864 1965 IATGEEKNFTANVQR-G-KPVAFAW--------TFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITL 2034 (2740)
T ss_pred eecCceEEEEEEEec-C-CceEEEE--------EeeecccCCCcceeecCCceEEccCCCcEEEEEEEEEcccCcccccE
Confidence 245677777665521 1 1111100 0000 11111 1489999999999999987542 2222334
Q ss_pred eeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEecCCeeEE
Q psy192 292 KLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMPRENGIHN 371 (898)
Q Consensus 292 ~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~G~h~ 371 (898)
.+.|..... ...+... .....+|+...|.+...-..-++.....=-+|... ......+++++=.++|.|.
T Consensus 2035 ~i~VQ~~i~---~~~L~~~-p~~~~~n~~v~f~~~~~~~~~~v~~~WDFGDGt~~------~~T~~~~~sHtY~r~G~Y~ 2104 (2740)
T TIGR00864 2035 QLEAQDALM---DAALQAG-PQDCFTNKMAQFEAATSPKPNFMACHWDFGDGSAG------QDTDEPRAEHEYLHPGDYR 2104 (2740)
T ss_pred EEEEEehee---eEEEeec-CcceecCceEEEEEEEccCCCcEEEEEEecCCCCc------cccCCceeEEEecCCccEE
Confidence 445533221 1122111 12245577777776653221123333221123221 1112457778778889999
Q ss_pred EEEE-ECCeecCCCCeEEEeccCCCCCceEEecCCCc
Q psy192 372 IHIK-FNGVHIPGSPLRIKVGKGEADPAAVHATGNGL 407 (898)
Q Consensus 372 i~V~-~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl 407 (898)
+.|. +|.....-.=.++.|...++.+-.|.+.|+.-
T Consensus 2105 V~V~a~N~VS~~~a~~~i~V~vl~C~~P~V~i~~~~~ 2141 (2740)
T TIGR00864 2105 VQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQ 2141 (2740)
T ss_pred EEEEEEcceeeeeeEEEEEEEEecCCCCCcEeccCce
Confidence 9987 45433322233444444457777777776643
No 10
>PF13584 BatD: Oxygen tolerance
Probab=94.87 E-value=3.6 Score=48.64 Aligned_cols=203 Identities=16% Similarity=0.269 Sum_probs=108.5
Q ss_pred eEEEEEeeCCCeEEEE---EEEECCeecCCCCeEEEEecCCcccc------eeeee-eeecccceeeeCcEEEEEEEcCC
Q psy192 69 SLNISYRPTEPGYYII---NLKFADHHVEGSPFTAKIVGEGSNRQ------REKIQ-RQREAVPVTEVGSTCKLTFKMPG 138 (898)
Q Consensus 69 ~~~v~ytP~~~G~~~I---~V~~~g~~i~gSPf~~~v~~~~~~~~------~~~i~-~~~~~~~~~~vg~~~~f~v~~~~ 138 (898)
+|.+.++|.++|.|+| .|.+||+...=-|..++|.+...... ...+. ..........+|++..+++++.-
T Consensus 74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~ly~ 153 (484)
T PF13584_consen 74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRLYT 153 (484)
T ss_pred EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEEEE
Confidence 6778889999999995 67779988888899999987653211 11111 11112334678888877766431
Q ss_pred Ccc--------------cceEEEEeCCCCCeeeeEEE-EcCCCE-------EEEEEEcCcceeEEE-----EEEECC---
Q psy192 139 ITA--------------FDLSATVTSPGGVTEDAEIN-EVEDGL-------YAVHFVPKELGVHTV-----SVRYKD--- 188 (898)
Q Consensus 139 ~~~--------------~~~~~~V~~p~G~~~~~~v~-~~~~g~-------y~v~f~P~~~G~h~I-----~V~~~g--- 188 (898)
... .++.++-. +... ... ..-+|. ++....|.++|.+.| .+.+..
T Consensus 154 ~~~~~~~~~~~~~~p~~~~~~~~~~---~~~~--~~~~~~i~G~~y~~~~~~~~~l~P~ksG~l~I~~~~~~~~~~~~~~ 228 (484)
T PF13584_consen 154 RNNFRQLGIEELPPPDFEGFWVEQL---GDDR--QYEEERINGRRYRVIELRRYALFPQKSGTLTIPPATFEVTVSDPSG 228 (484)
T ss_pred ecCchhccccccCCCCCCCcEEEEC---CCCC--ceeEEEECCEEEEEEEEEEEEEEeCCceeEEecCEEEEEEEecccC
Confidence 111 11222212 1111 111 112331 345567999999988 333311
Q ss_pred --------------eecCCCceEEEEccCCC-C------Cc-ceEE--EeCCCCCcceeeeeeEEEEEeccCCCCc-eE-
Q psy192 189 --------------IHIPGSPFQFTVGPLRD-G------GA-HRVH--AGGPGLERGEQNQPCEFNVWTREAGAGS-LA- 242 (898)
Q Consensus 189 --------------~~V~gSPf~v~V~~~~~-~------d~-~~v~--a~G~GL~~~~~g~~~~F~V~~~daG~g~-l~- 242 (898)
..+...|.+++|.+... . .+ .+.+ ..-+ -....+|++.+.+|...+.|.-. +.
T Consensus 229 ~~~~fg~~~~~~~~~~~~s~~~~i~V~plP~~~~p~~f~~aVg~f~l~~~~~-~~~~~~Ge~vt~ti~i~g~Gn~~~~~l 307 (484)
T PF13584_consen 229 RRDFFGGNFGRSRPVSISSEPLTITVKPLPAEGAPADFSGAVGNFSLSQSWD-PTEVKVGEPVTRTITISGEGNLPSIQL 307 (484)
T ss_pred ccCccccccccceeEEecCCCeEEEeccCCcccCCCCcccceeEEEEEEEcC-cccccCCCeEEEEEEEEEEcchhcccC
Confidence 23566788888876433 1 11 1111 1111 12346888888888877654321 10
Q ss_pred EEEEcCCc-------ceeeEEECCCC-----cEEEEEEeCCCceEEE
Q psy192 243 ISVEGPSK-------AEIDFKDRKDG-----SCYVSYVVAEPGEYRV 277 (898)
Q Consensus 243 v~v~gps~-------~~i~~~d~~dG-----ty~v~y~P~~~G~y~i 277 (898)
-.+.-|.+ .........+| +++..|+|..+|.|.|
T Consensus 308 P~l~~~~~~~vy~~~~~~~~~~~~~g~~g~~~~~~~~ip~~~G~~~l 354 (484)
T PF13584_consen 308 PPLNLPKGFRVYPPKPQEQDKPSGGGLTGSRTFKYTLIPKKPGDFTL 354 (484)
T ss_pred CCCCCCcccEEcCCCccccccccCCcceEEEEEEEEEEeCCCCeEEc
Confidence 00111111 11122223333 3567888999999986
No 11
>KOG1948|consensus
Probab=94.82 E-value=13 Score=45.30 Aligned_cols=124 Identities=21% Similarity=0.258 Sum_probs=74.1
Q ss_pred eEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEec---
Q psy192 158 AEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTR--- 234 (898)
Q Consensus 158 ~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~--- 234 (898)
|.-....+|.|.+.-+| .|.|.+--.+-| +|..++|.|. +-++.+.=+-|+-...-+-+-|+|..|
T Consensus 256 c~~vsd~~G~fsfksvP--sGkY~l~a~y~g-----e~~~fdvSP~----~l~v~Vehd~lqi~~ef~vtgfSvtGRVl~ 324 (1165)
T KOG1948|consen 256 CIGVSDPRGRFSFKSVP--SGKYYLAASYVG-----EPKSFDVSPN----PLKVVVEHDHLQIASEFRVTGFSVTGRVLV 324 (1165)
T ss_pred EEEEEcCCceEEEEEcC--CCCEEEEEEecC-----CceEEEeCCC----ceeEEEeccceeccceeEEEEEEeeeeEEe
Confidence 33345678888766666 699999888765 4555666542 334444444444333333344555443
Q ss_pred ---cCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEecCCC
Q psy192 235 ---EAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSPAM 299 (898)
Q Consensus 235 ---daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~~ 299 (898)
+.+--...|.|.|-.+. .-..+|.|+++=. ...|.|+|++. -+|..-|+.++.|.+..
T Consensus 325 g~~g~~l~gvvvlvngk~~~----kTdaqGyykLen~-~t~gtytI~a~--kehlqFstv~~kv~pnt 385 (1165)
T KOG1948|consen 325 GSKGLPLSGVVVLVNGKSGG----KTDAQGYYKLENL-KTDGTYTITAK--KEHLQFSTVHAKVKPNT 385 (1165)
T ss_pred CCCCCCccceEEEEcCcccc----eEcccceEEeeee-eccCcEEEEEe--ccceeeeeEEEEecCCc
Confidence 11222334555554332 2347899988763 46899999885 46777788888887654
No 12
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=94.40 E-value=0.54 Score=52.62 Aligned_cols=118 Identities=17% Similarity=0.185 Sum_probs=74.0
Q ss_pred CCCeEEEEE-eeCCCeEEEEEEEECCee----------cCCCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEE
Q psy192 66 ADGSLNISY-RPTEPGYYIINLKFADHH----------VEGSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTF 134 (898)
Q Consensus 66 ~dG~~~v~y-tP~~~G~~~I~V~~~g~~----------i~gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v 134 (898)
+||.|+..+ .-..+|.|.+.+..++.- |.-.|+++++..+. ..+.+..+.|
T Consensus 169 ~DGvFT~~l~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~------------------~~~~~h~l~v 230 (374)
T TIGR03503 169 GDGIFTGEFNLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE------------------DESGAHQLMV 230 (374)
T ss_pred CCceEEEEeeccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC------------------CCCCceEEEE
Confidence 799999999 456799999999987622 23334443333221 0123444444
Q ss_pred EcCC--C--cccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEc
Q psy192 135 KMPG--I--TAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVG 201 (898)
Q Consensus 135 ~~~~--~--~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~ 201 (898)
++.. . +.-.+.+++..|+|......+...+..+....=.+.+.|.|.++.+.-+..+.|-||.+.+.
T Consensus 231 ~~d~~~i~~~s~~~~~~~~~P~g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~lp 301 (374)
T TIGR03503 231 DADAGHIDPGSLVIHGELVFPNGQIQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTLP 301 (374)
T ss_pred EcccceeccccEEEEEEEECCCCceEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEcC
Confidence 4331 2 23345678899999833222222333333332234489999999999999999999988874
No 13
>PF13584 BatD: Oxygen tolerance
Probab=92.63 E-value=8.6 Score=45.47 Aligned_cols=205 Identities=15% Similarity=0.267 Sum_probs=110.8
Q ss_pred EEEEEEEcCcceeEEE---EEEECCeecCCCceEEEEccCCCCC-------cce--EEEeCCCCCcceeeeeeEEEEEec
Q psy192 167 LYAVHFVPKELGVHTV---SVRYKDIHIPGSPFQFTVGPLRDGG-------AHR--VHAGGPGLERGEQNQPCEFNVWTR 234 (898)
Q Consensus 167 ~y~v~f~P~~~G~h~I---~V~~~g~~V~gSPf~v~V~~~~~~d-------~~~--v~a~G~GL~~~~~g~~~~F~V~~~ 234 (898)
+|.+.+.|.+.|.+.| .|.++|....-.|+.++|.+..... ... +++.= .-....+|++..+++...
T Consensus 74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v-~~~~~Yvge~v~lt~~ly 152 (484)
T PF13584_consen 74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEV-SKKSVYVGEPVILTLRLY 152 (484)
T ss_pred EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEe-CCCceecCCcEEEEEEEE
Confidence 6889999999999975 6788999999999999998653211 111 22111 112346888887777654
Q ss_pred cCCCCc-eE-EEEEcCC--cc-------eeeEE-ECCCCc-EE------EEEEeCCCceEEE-----EEEECC-------
Q psy192 235 EAGAGS-LA-ISVEGPS--KA-------EIDFK-DRKDGS-CY------VSYVVAEPGEYRV-----GIKFND------- 283 (898)
Q Consensus 235 daG~g~-l~-v~v~gps--~~-------~i~~~-d~~dGt-y~------v~y~P~~~G~y~i-----~V~~~g------- 283 (898)
-...-. +. ..+.-|. +. ..... ..-+|. |. ....|..+|++.| .+.+..
T Consensus 153 ~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~i~G~~y~~~~~~~~~l~P~ksG~l~I~~~~~~~~~~~~~~~~~~ 232 (484)
T PF13584_consen 153 TRNNFRQLGIEELPPPDFEGFWVEQLGDDRQYEEERINGRRYRVIELRRYALFPQKSGTLTIPPATFEVTVSDPSGRRDF 232 (484)
T ss_pred EecCchhccccccCCCCCCCcEEEECCCCCceeEEEECCEEEEEEEEEEEEEEeCCceeEEecCEEEEEEEecccCccCc
Confidence 332211 11 0111111 00 11111 123453 32 2347999999998 333321
Q ss_pred ----------eeCCCCCceeEecCCCCC---------ceeEEEe-eeCCCceecCCCeEEEEEecCCccceE-E---EEE
Q psy192 284 ----------QHIPDSPYKLFVSPAMGD---------AHKLEIA-QFPQGVVMADKPTQFLVRKNGAVGALD-A---KVI 339 (898)
Q Consensus 284 ----------~~I~gSPf~v~V~~~~~~---------~~~~~~~-~~~~~~~~~g~~~~f~V~~~~a~G~l~-v---~v~ 339 (898)
..+...|.++.|.+-..+ ..++.+. ......+.+|++.+++|...+. |++. . .+.
T Consensus 233 fg~~~~~~~~~~~~s~~~~i~V~plP~~~~p~~f~~aVg~f~l~~~~~~~~~~~Ge~vt~ti~i~g~-Gn~~~~~lP~l~ 311 (484)
T PF13584_consen 233 FGGNFGRSRPVSISSEPLTITVKPLPAEGAPADFSGAVGNFSLSQSWDPTEVKVGEPVTRTITISGE-GNLPSIQLPPLN 311 (484)
T ss_pred cccccccceeEEecCCCeEEEeccCCcccCCCCcccceeEEEEEEEcCcccccCCCeEEEEEEEEEE-cchhcccCCCCC
Confidence 134556777777653221 1333332 2334567899999888876432 4433 1 112
Q ss_pred CCCCCee-e--e--EEEecCCC-----eEEEEEEecCCeeEEEE
Q psy192 340 SPSGTED-D--C--FIQPIDGD-----NYSIRFMPRENGIHNIH 373 (898)
Q Consensus 340 ~p~g~~~-~--~--~v~~~~~g-----~y~v~f~P~~~G~h~i~ 373 (898)
.|+|-.+ + . ......+| ++++.|+|.++|.|.|-
T Consensus 312 ~~~~~~vy~~~~~~~~~~~~~g~~g~~~~~~~~ip~~~G~~~lP 355 (484)
T PF13584_consen 312 LPKGFRVYPPKPQEQDKPSGGGLTGSRTFKYTLIPKKPGDFTLP 355 (484)
T ss_pred CCcccEEcCCCccccccccCCcceEEEEEEEEEEeCCCCeEEcC
Confidence 2233211 1 1 11112222 57788889999998753
No 14
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=92.39 E-value=65 Score=45.23 Aligned_cols=263 Identities=13% Similarity=0.211 Sum_probs=134.3
Q ss_pred EEEEEEcCcceeEEEEEEECCeecCCC--ceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEE-
Q psy192 168 YAVHFVPKELGVHTVSVRYKDIHIPGS--PFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAIS- 244 (898)
Q Consensus 168 y~v~f~P~~~G~h~I~V~~~g~~V~gS--Pf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~- 244 (898)
..+.|++.++|.|.|++..-+. +... ...+.|... .....+. .+......|+.++|.+...- |. ++...
T Consensus 1830 k~v~~Tf~~aG~ytV~L~AsN~-vs~~~~s~~~~VQe~----I~~L~L~-as~~~~~~n~~v~fsa~l~~-GS-~Vtf~w 1901 (2740)
T TIGR00864 1830 KHACMTFPDAGTFAIRLNASNA-VSGKSASREFFAEEP----IFGLELK-ASKKIAAIGEKVEFQILLAA-GS-AVNFRL 1901 (2740)
T ss_pred ceeEEecCCCeEEEEEEEEEcc-cCcceeeeeEEEEEe----cceEEEe-cccccccCCCEEEEEEEecC-CC-ceEEEE
Confidence 5788999999999999998653 2332 334444321 1112222 23345668999999997753 33 34333
Q ss_pred -EEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEe--cCCCCCceeEEEeeeCCCceecCCCe
Q psy192 245 -VEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFV--SPAMGDAHKLEIAQFPQGVVMADKPT 321 (898)
Q Consensus 245 -v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V--~~~~~~~~~~~~~~~~~~~~~~g~~~ 321 (898)
+.+-+ ..+ .+ ..- ++++.=..+|.|.|.|..-+. +....+.+.| ..+. +.+.+..-...++.+|+..
T Consensus 1902 ~fGdgs-~~~--~t-~~~--t~~HsY~~~G~Y~VtV~A~N~-Vs~~~a~~~V~Vle~V---~gL~I~~~c~~~~~~g~~~ 1971 (2740)
T TIGR00864 1902 QIGGAA-PEV--LQ-PGP--RFSHSFPRVDDHMVNLRAKNE-VSCAQANLHIEVLEAV---RGLQIPDCCAAGIATGEEK 1971 (2740)
T ss_pred EeCCCC-cee--ec-CCC--eEEeecCCCCcEEEEEEEEec-cceeeeeEEEEEEEec---cceEecCCcccceecCceE
Confidence 32211 111 11 112 344555789999999987643 2223333333 2211 1111111001123444444
Q ss_pred EEEEEecCCccceEEEEECCCCCeee--eEE--Eec-CCC-----eEEEEEEecCCeeEEEEEE-ECCeecCCCCeEEEe
Q psy192 322 QFLVRKNGAVGALDAKVISPSGTEDD--CFI--QPI-DGD-----NYSIRFMPRENGIHNIHIK-FNGVHIPGSPLRIKV 390 (898)
Q Consensus 322 ~f~V~~~~a~G~l~v~v~~p~g~~~~--~~v--~~~-~~g-----~y~v~f~P~~~G~h~i~V~-~~g~~I~gSPf~v~V 390 (898)
.|. |++. .|..+. ..+ ... ++. -=.++|+|.++|.|+|+|+ +|...-..=+..++|
T Consensus 1972 tF~-----------A~v~--~g~~V~f~W~f~~~~~~g~~~~~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~~i~V 2038 (2740)
T TIGR00864 1972 NFT-----------ANVQ--RGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEA 2038 (2740)
T ss_pred EEE-----------EEEe--cCCceEEEEEeeecccCCCcceeecCCceEEccCCCcEEEEEEEEEcccCcccccEEEEE
Confidence 444 4442 222211 000 000 010 0158999999999999998 554333334556666
Q ss_pred ccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEE-CCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEE
Q psy192 391 GKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTID-GPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKY 469 (898)
Q Consensus 391 ~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~-gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~ 469 (898)
-.... ...+. .+.....+|+.+.|.+...- +..++....+ |... ... .-+..+++++=.++|+|.+.|..
T Consensus 2039 Q~~i~---~~~L~-~~p~~~~~n~~v~f~~~~~~-~~~~v~~~WDFGDGt-~~~---~T~~~~~sHtY~r~G~Y~V~V~a 2109 (2740)
T TIGR00864 2039 QDALM---DAALQ-AGPQDCFTNKMAQFEAATSP-KPNFMACHWDFGDGS-AGQ---DTDEPRAEHEYLHPGDYRVQVNA 2109 (2740)
T ss_pred Eehee---eEEEe-ecCcceecCceEEEEEEEcc-CCCcEEEEEEecCCC-Ccc---ccCCceeEEEecCCccEEEEEEE
Confidence 43211 11111 11234567888888887632 2333444443 2111 111 11236788888899999999975
Q ss_pred C
Q psy192 470 N 470 (898)
Q Consensus 470 ~ 470 (898)
.
T Consensus 2110 ~ 2110 (2740)
T TIGR00864 2110 S 2110 (2740)
T ss_pred E
Confidence 3
No 15
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=91.63 E-value=2.4 Score=47.57 Aligned_cols=116 Identities=18% Similarity=0.251 Sum_probs=72.5
Q ss_pred CCCCcEEEEEEe-CCCceEEEEEEECCee----------CCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEE
Q psy192 258 RKDGSCYVSYVV-AEPGEYRVGIKFNDQH----------IPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVR 326 (898)
Q Consensus 258 ~~dGty~v~y~P-~~~G~y~i~V~~~g~~----------I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~ 326 (898)
-.||.|+.++.- ..+|.|.+.|..++.- +.-.|+.+.+..+. ..+.+..+.|.
T Consensus 168 p~DGvFT~~l~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~----------------~~~~~h~l~v~ 231 (374)
T TIGR03503 168 PGDGIFTGEFNLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE----------------DESGAHQLMVD 231 (374)
T ss_pred CCCceEEEEeeccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC----------------CCCCceEEEEE
Confidence 368999998864 4589999999998742 22334443332211 11334555555
Q ss_pred ecCC---ccc--eEEEEECCCCCeeeeEEEecC-CCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEe
Q psy192 327 KNGA---VGA--LDAKVISPSGTEDDCFIQPID-GDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKV 390 (898)
Q Consensus 327 ~~~a---~G~--l~v~v~~p~g~~~~~~v~~~~-~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V 390 (898)
.... .+. +.+++..|+|. ........+ ..+...-=...+.|.|+++.+.-+..+.|.||.+..
T Consensus 232 ~d~~~i~~~s~~~~~~~~~P~g~-~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~l 300 (374)
T TIGR03503 232 ADAGHIDPGSLVIHGELVFPNGQ-IQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTL 300 (374)
T ss_pred cccceeccccEEEEEEEECCCCc-eEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEc
Confidence 4332 243 56778899997 333333322 223222224559999999999999999999998876
No 16
>KOG1428|consensus
Probab=90.52 E-value=1 Score=56.58 Aligned_cols=43 Identities=23% Similarity=0.533 Sum_probs=37.6
Q ss_pred EEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEecc
Q psy192 349 FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGK 392 (898)
Q Consensus 349 ~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~ 392 (898)
-++.+++|+|...|+|...|.|+|+++++|..|.- -++++|..
T Consensus 2099 ~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd 2141 (3738)
T KOG1428|consen 2099 LIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKD 2141 (3738)
T ss_pred eEEecCCCceecccCCccceeEEEEEEecceeecc-ceEEEecc
Confidence 45567899999999999999999999999999975 68888863
No 17
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=90.16 E-value=10 Score=47.51 Aligned_cols=151 Identities=15% Similarity=0.148 Sum_probs=87.0
Q ss_pred eeeeeeEEEEEeccCCCCceEEEEEcCCccee-eE-EECCCCcEEEEE-Ee--CCCceEEEEEEECCeeCCCCCceeEec
Q psy192 222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEI-DF-KDRKDGSCYVSY-VV--AEPGEYRVGIKFNDQHIPDSPYKLFVS 296 (898)
Q Consensus 222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i-~~-~d~~dGty~v~y-~P--~~~G~y~i~V~~~g~~I~gSPf~v~V~ 296 (898)
.+|..+.|.| +... ...++.+.+|++... .+ .|. .+..+.. +| .++|.|+.+|.-.. ..-+-+++|+
T Consensus 512 tvG~~T~F~v-tw~~--~~p~i~l~dP~G~~~~~~~~d~--~~~~a~l~ipg~a~~G~W~y~l~~~~---~~q~i~vtVt 583 (863)
T TIGR00868 512 TVGKDTFFLI-TWEF--LKPEIFLQDPSGKSTSDFLVDK--LNKMAYLQIPGTAKVGTWTYSLQASA---NPQTLTLTVT 583 (863)
T ss_pred CCCCCcEEEE-EcCC--CCceEEEECCCCCccccceEee--cceEEEEEcCCCCccceeEEEEeccC---CCceEEEEEE
Confidence 3577777777 4433 257889999987531 11 111 1112221 23 36899998885441 1122345554
Q ss_pred CC--CCCceeEEEe-eeCCCceecCCCeEEEEEecCC-----ccceEEEEECCCCCeeeeEEEec--------CCCeEEE
Q psy192 297 PA--MGDAHKLEIA-QFPQGVVMADKPTQFLVRKNGA-----VGALDAKVISPSGTEDDCFIQPI--------DGDNYSI 360 (898)
Q Consensus 297 ~~--~~~~~~~~~~-~~~~~~~~~g~~~~f~V~~~~a-----~G~l~v~v~~p~g~~~~~~v~~~--------~~g~y~v 360 (898)
.- ..+..-+++. .+........+|+.++..-+.. ..++.|.|..|+|..+...+.++ +||+|.-
T Consensus 584 sra~s~~~~pi~~~a~~~~~~~~~psp~ivya~V~kG~~pVlgA~Vta~i~~~~g~~~~l~l~DnGaGaD~~knDGIYS~ 663 (863)
T TIGR00868 584 SRARSPTLPPVTVTAKMNKDTAKFPSPMIVYAKISQGFLPVLGANVTALIESENGHTVTLELLDNGAGADTVKNDGIYSR 663 (863)
T ss_pred eccCCCCCCCEEEEEEEccccccCCCceEEEEEEeeCCceeeccEEEEEEECCCCCEEEEEEeecCCCCCccccCceeee
Confidence 21 1112223332 2222223345666555554433 23578888999998776677664 3899999
Q ss_pred EEEe-cCCeeEEEEEEECCee
Q psy192 361 RFMP-RENGIHNIHIKFNGVH 380 (898)
Q Consensus 361 ~f~P-~~~G~h~i~V~~~g~~ 380 (898)
.|+. ...|.|.|.|..+|..
T Consensus 664 yft~f~~nGrYsl~V~v~~~~ 684 (863)
T TIGR00868 664 YFTAYDGNGRYSLKVRALGGV 684 (863)
T ss_pred eeecccCCceEEEEEEEeCCC
Confidence 9975 5779999999988754
No 18
>PF13115 YtkA: YtkA-like
Probab=88.23 E-value=4.1 Score=35.66 Aligned_cols=71 Identities=14% Similarity=0.282 Sum_probs=50.6
Q ss_pred CCCeEEEEEE---cCCceeEEEEEEE--CCEEecCCCCeeEEEecCc-------cceEEEEeCCCCeEEEEEeeCCCeEE
Q psy192 15 HDGTVSLHYD---PREEGLHELALKF--NGDHVQGYGGLSLSIEGPS-------KAEIQCKDNADGSLNISYRPTEPGYY 82 (898)
Q Consensus 15 ~d~~~~v~f~---P~~~G~h~v~v~~--~G~~v~G~g~l~v~v~GP~-------~~~i~~~d~~dG~~~v~ytP~~~G~~ 82 (898)
..+.|+|++. +-..|...+.+.. +|+||.+.- +.+.+.=|. ...+.....++|.|.+..+-+.+|.|
T Consensus 3 ~~~~~~v~l~~~~~~~~g~~~i~v~~~~~g~pv~~a~-V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m~G~W 81 (86)
T PF13115_consen 3 QVGGYTVELVSPEPPKVGENTITVTVDQGGKPVTDAD-VQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSMAGTW 81 (86)
T ss_pred ccccEEEEEecCCCCcCCceEEEEEECCCCCCCCCCE-EEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCCCeeE
Confidence 3467888887 3466777766664 588887663 666654442 23345555689999999999999999
Q ss_pred EEEE
Q psy192 83 IINL 86 (898)
Q Consensus 83 ~I~V 86 (898)
.|.|
T Consensus 82 ~i~v 85 (86)
T PF13115_consen 82 QITV 85 (86)
T ss_pred EEEE
Confidence 9987
No 19
>PF09134 Invasin_D3: Invasin, domain 3; InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=87.60 E-value=5.9 Score=35.74 Aligned_cols=87 Identities=18% Similarity=0.301 Sum_probs=50.1
Q ss_pred ccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEE
Q psy192 7 DKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINL 86 (898)
Q Consensus 7 ~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V 86 (898)
-|..|.-++..+..+.|+++.. +|+||.|-..|.+.-.|- ...+.-..+..|.|+.+..-+.+|+++|.=
T Consensus 11 ~k~~I~ADG~dta~LTfta~D~---------~g~pvsGl~~l~f~~~g~-~vtls~itE~~GvYtATl~Gt~~G~vtI~P 80 (99)
T PF09134_consen 11 SKPSIIADGQDTATLTFTARDA---------NGNPVSGLSVLAFVQDGV-PVTLSPITENPGVYTATLKGTSAGEVTITP 80 (99)
T ss_dssp S-SEEETTS---EEEEEE-B-T---------TS-B--S--SEEEEEEES---EE---EE-SS-EEEEEE-SS-EEEEEEE
T ss_pred CccccccCCcceEEEEEEEEcC---------CCCcccCceeEEEEccCC-CcEEccccccCCEEEEEEEcccCceEEEEE
Confidence 3667777777888999998876 588887775566654432 122221223569999999999999999999
Q ss_pred EECCeecCCCCeEEEEe
Q psy192 87 KFADHHVEGSPFTAKIV 103 (898)
Q Consensus 87 ~~~g~~i~gSPf~~~v~ 103 (898)
.++|+++..=|-++..+
T Consensus 81 ~v~G~~l~~~~~k~tl~ 97 (99)
T PF09134_consen 81 QVNGQPLATLQKKLTLI 97 (99)
T ss_dssp EETTEE-GGG-EEEEEE
T ss_pred eeCCEEccccceeEEEe
Confidence 99999998777776654
No 20
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=87.10 E-value=2 Score=38.70 Aligned_cols=64 Identities=17% Similarity=0.246 Sum_probs=44.5
Q ss_pred ceecCCCeEEEEEecCCc--------cceEEEEECCCCCeeee-EE-EecCCCeEEEEEE-ecCC--eeEEEEEEEC
Q psy192 314 VVMADKPTQFLVRKNGAV--------GALDAKVISPSGTEDDC-FI-QPIDGDNYSIRFM-PREN--GIHNIHIKFN 377 (898)
Q Consensus 314 ~~~~g~~~~f~V~~~~a~--------G~l~v~v~~p~g~~~~~-~v-~~~~~g~y~v~f~-P~~~--G~h~i~V~~~ 377 (898)
-.+.|+.+.|.+-.++.. ..+.+.|.+|+|..+.- .. .....|.|..+|. |..+ |.|+|.+..+
T Consensus 10 iYrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 10 IYRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp EE-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred CcCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 356777777766533222 35889999999987633 33 3468999999995 6664 9999999994
No 21
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=87.08 E-value=35 Score=35.14 Aligned_cols=60 Identities=15% Similarity=0.298 Sum_probs=43.2
Q ss_pred eeeeeeEEEEEeccCCCCceEEEEEcCCcc------eeeEEECCCCcEEEEEEeCCCceEEEEEEECC
Q psy192 222 EQNQPCEFNVWTREAGAGSLAISVEGPSKA------EIDFKDRKDGSCYVSYVVAEPGEYRVGIKFND 283 (898)
Q Consensus 222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~------~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g 283 (898)
.+|++..|.|--.+..-.+..|.+..+... .....-+.+| .++|++..+|.|.|.+.+.+
T Consensus 148 ~~g~~~~~~vl~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G--~~~~~~~~~G~wli~a~~~~ 213 (215)
T PF10670_consen 148 KAGDPLPFQVLFDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANG--RATFTLPRPGLWLIRASHKD 213 (215)
T ss_pred cCCCEEEEEEEECCeEcccEEEEEEECCCccccccceEEEEECCCC--EEEEecCCCEEEEEEEEEec
Confidence 467888888887776656666776655432 4445556677 67889999999999998754
No 22
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=86.94 E-value=3.4 Score=37.20 Aligned_cols=64 Identities=19% Similarity=0.329 Sum_probs=43.5
Q ss_pred eeeCcEEEEEEEcCC-------CcccceEEEEeCCCCCeeee-EE-EEcCCCEEEEEEE-cCcc--eeEEEEEEEC
Q psy192 124 TEVGSTCKLTFKMPG-------ITAFDLSATVTSPGGVTEDA-EI-NEVEDGLYAVHFV-PKEL--GVHTVSVRYK 187 (898)
Q Consensus 124 ~~vg~~~~f~v~~~~-------~~~~~~~~~V~~p~G~~~~~-~v-~~~~~g~y~v~f~-P~~~--G~h~I~V~~~ 187 (898)
-..|+.+.|.+-+.+ .....+++.|.+|+|+.+.- .. ..+..|.|..+|. |..+ |.|.|.+..+
T Consensus 11 YrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 11 YRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp E-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred cCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 345666666543221 12356889999999997644 33 3478999999998 6664 9999999994
No 23
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=86.57 E-value=1.4e+02 Score=40.60 Aligned_cols=85 Identities=16% Similarity=0.230 Sum_probs=55.5
Q ss_pred CeeEEEecCccceEE---EEeCCCCeEEEEE-eeC--CCeEEEEEEEECCe-ecCCCCeEEEEecCCcccceeeeeeeec
Q psy192 47 GLSLSIEGPSKAEIQ---CKDNADGSLNISY-RPT--EPGYYIINLKFADH-HVEGSPFTAKIVGEGSNRQREKIQRQRE 119 (898)
Q Consensus 47 ~l~v~v~GP~~~~i~---~~d~~dG~~~v~y-tP~--~~G~~~I~V~~~g~-~i~gSPf~~~v~~~~~~~~~~~i~~~~~ 119 (898)
.+.+.+..|.+..+. ++-+.+|.+.++| +|. ..|.|++++.+.+. .+-...|.|.-+-+. +.+-.++..
T Consensus 430 p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~l~~~~~~~~~~~s~~f~V~df~p~--r~~i~l~~~-- 505 (1621)
T COG2373 430 PLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYTLELYTGGKSAVISMSFRVEDFIPD--RFKINLTLD-- 505 (1621)
T ss_pred CeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEEEEEEeCCccceeeeeEEhhHhCCc--eEEEecccc--
Confidence 477888899986654 4457899999999 554 47999999999987 666678877544333 222222211
Q ss_pred ccceeeeCcEEEEEEEc
Q psy192 120 AVPVTEVGSTCKLTFKM 136 (898)
Q Consensus 120 ~~~~~~vg~~~~f~v~~ 136 (898)
......|++.+++|+.
T Consensus 506 -k~~~~~g~~v~~~v~~ 521 (1621)
T COG2373 506 -KTEWVPGKDVKIKVDL 521 (1621)
T ss_pred -cccccCCCcEEEEEEE
Confidence 1113456677777664
No 24
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=85.82 E-value=20 Score=36.39 Aligned_cols=129 Identities=16% Similarity=0.162 Sum_probs=69.7
Q ss_pred eeeeeeEEEEEeccCCCCceEEEEEcCCcceeeE--EECCCCcEEEEEE-eCCCceEEEEEEECCeeCCCCCceeEecCC
Q psy192 222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDF--KDRKDGSCYVSYV-VAEPGEYRVGIKFNDQHIPDSPYKLFVSPA 298 (898)
Q Consensus 222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~--~d~~dGty~v~y~-P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~ 298 (898)
.+|+.+.|.|.-... ....+.+.+|++....- .|.......+... ..++|.|.+.|...... --+..+.|...
T Consensus 26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~--~q~v~vtVtS~ 101 (179)
T PF09315_consen 26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS--SQTVTVTVTSR 101 (179)
T ss_pred CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCC--cceEEEEEEee
Confidence 357788898886654 56788999998865532 3333333334331 26789999999543322 13355555432
Q ss_pred C--CCceeEEEee-eCCCceecCCCeEEEEEecCC-----ccceEEEEECCCCCeeeeEEEecC
Q psy192 299 M--GDAHKLEIAQ-FPQGVVMADKPTQFLVRKNGA-----VGALDAKVISPSGTEDDCFIQPID 354 (898)
Q Consensus 299 ~--~~~~~~~~~~-~~~~~~~~g~~~~f~V~~~~a-----~G~l~v~v~~p~g~~~~~~v~~~~ 354 (898)
. ....-+.+.. +........+|+.+..+-+.. ..++.|.|..|+|......+.|++
T Consensus 102 ~~~~~~~Pitv~~~~~~~~~~~~~p~~IyA~V~qg~~PVlgA~V~A~V~~~~g~~~~l~L~DnG 165 (179)
T PF09315_consen 102 ASSPTVPPITVTAWLSQSSVNFPSPVIIYAEVQQGYLPVLGANVTATVESPSGNTVTLELLDNG 165 (179)
T ss_pred ECCCCCCCEEEEEEECCCCcCCCCcEEEEEEEEECCeeeeCCEEEEEEcCCCCCEEEEEEEcCC
Confidence 1 1111222221 111111223454444433322 235788888999987666666543
No 25
>COG1470 Predicted membrane protein [Function unknown]
Probab=83.12 E-value=96 Score=35.84 Aligned_cols=75 Identities=9% Similarity=0.108 Sum_probs=39.6
Q ss_pred EEEEEEeCCCceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCC
Q psy192 263 CYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPS 342 (898)
Q Consensus 263 y~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~ 342 (898)
+...++|..+|.|.+....+|.+- |++.|.. ..+.++.+.-.. ....+|....|.+...+..+.+...++.|.
T Consensus 49 l~~~v~~~~~g~~~v~f~i~~~~~----~~v~v~~--~~~l~it~p~~~-~~~e~g~~v~~~~~l~~~~~~i~i~lk~pk 121 (513)
T COG1470 49 LQFKVLPGKAGSYSVKFSIEGVPY----WTVNVYT--SEPLQITLPIEA-QEAEAGLKVRVPELLEAGDSTISIDLKCPK 121 (513)
T ss_pred EEEEEecCCCCcEEEEEEECCcee----EEEEEEe--cccEEEeccchh-hhhccccEEEEeehhhCCcceEEEeeecCc
Confidence 566777888999988875554332 5555543 223444432111 112345555555544443444555555566
Q ss_pred CC
Q psy192 343 GT 344 (898)
Q Consensus 343 g~ 344 (898)
|.
T Consensus 122 ~w 123 (513)
T COG1470 122 GW 123 (513)
T ss_pred ch
Confidence 54
No 26
>KOG1428|consensus
Probab=82.26 E-value=3.5 Score=52.20 Aligned_cols=48 Identities=15% Similarity=0.360 Sum_probs=42.5
Q ss_pred eEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCc
Q psy192 59 EIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGS 107 (898)
Q Consensus 59 ~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~ 107 (898)
.+.++.|.||+|...|||..+|.|+|+++.+|.+|. --+.++|.++..
T Consensus 2097 ~m~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId-~~~eVkVkd~Pk 2144 (3738)
T KOG1428|consen 2097 NMLIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEID-AGLEVKVKDPPK 2144 (3738)
T ss_pred cceEEecCCCceecccCCccceeEEEEEEecceeec-cceEEEeccCCC
Confidence 456778999999999999999999999999999997 468899988763
No 27
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=79.54 E-value=13 Score=38.37 Aligned_cols=67 Identities=19% Similarity=0.364 Sum_probs=49.2
Q ss_pred EEEEEEcC-------CceeEEEEEEECCEEecCCCCeeEEEecCccc------eEEEEeCCCCeEEEEEeeCCCeEEEEE
Q psy192 19 VSLHYDPR-------EEGLHELALKFNGDHVQGYGGLSLSIEGPSKA------EIQCKDNADGSLNISYRPTEPGYYIIN 85 (898)
Q Consensus 19 ~~v~f~P~-------~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~------~i~~~d~~dG~~~v~ytP~~~G~~~I~ 85 (898)
+.++++|. .-.+-.+.|.|+|+|++| ..|.+..+... ....+-+.+|. ++|++..+|.|.|.
T Consensus 134 ~~lEIvPl~~P~~l~~g~~~~~~vl~~GkPl~~---a~V~~~~~~~~~~~~~~~~~~~TD~~G~--~~~~~~~~G~wli~ 208 (215)
T PF10670_consen 134 LPLEIVPLTNPYKLKAGDPLPFQVLFDGKPLAG---AEVEAFSPGGWYDVEHEAKTLKTDANGR--ATFTLPRPGLWLIR 208 (215)
T ss_pred CcEEEEECcCcccccCCCEEEEEEEECCeEccc---EEEEEEECCCccccccceEEEEECCCCE--EEEecCCCEEEEEE
Confidence 45778884 345678999999999985 56666555433 55666667885 56788899999999
Q ss_pred EEECC
Q psy192 86 LKFAD 90 (898)
Q Consensus 86 V~~~g 90 (898)
+.+.+
T Consensus 209 a~~~~ 213 (215)
T PF10670_consen 209 ASHKD 213 (215)
T ss_pred EEEec
Confidence 98754
No 28
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=79.21 E-value=1.1e+02 Score=33.92 Aligned_cols=142 Identities=13% Similarity=0.246 Sum_probs=71.9
Q ss_pred eEEEEEeccCCCCc-eEEEEEcCCcceeeEEECCCCcEEEEEEeC---CCceEEEEEE-EC---CeeCCCCCceeEecCC
Q psy192 227 CEFNVWTREAGAGS-LAISVEGPSKAEIDFKDRKDGSCYVSYVVA---EPGEYRVGIK-FN---DQHIPDSPYKLFVSPA 298 (898)
Q Consensus 227 ~~F~V~~~daG~g~-l~v~v~gps~~~i~~~d~~dGty~v~y~P~---~~G~y~i~V~-~~---g~~I~gSPf~v~V~~~ 298 (898)
..|.....++..+. +.+.+. +....+....+|.|.+.+.|. ..|.|.|.+. .- |..-......+.|...
T Consensus 63 ~t~s~tvs~~~~g~~v~v~~~---g~~~t~~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t~~s~~~~vDt~ 139 (317)
T PF13753_consen 63 VTFSGTVSGAEPGSTVTVTIN---GTTGTLTADADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTSTAASQTFTVDTT 139 (317)
T ss_pred eEEEEEecCCCCCCEEEEEEC---CEEEEEEEecCCcEEEeeccccccccCcceeEEEEEEccCCccccccccccccccc
Confidence 35655555555554 555552 223344445789999999884 7899999998 42 2221112333333221
Q ss_pred CCCceeEEEeeeCCCceecCCCeEEEEEecCCccc------eEEEEECCCCCeeeeEEEecCCCeEEEEEEe-----cCC
Q psy192 299 MGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGA------LDAKVISPSGTEDDCFIQPIDGDNYSIRFMP-----REN 367 (898)
Q Consensus 299 ~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~------l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P-----~~~ 367 (898)
...+..+.+..+....+..+....+.+..++.-+. +.+.+ +|... ......+|.|.+.|.| ..-
T Consensus 140 ~~~~p~vti~~~~~~~~~~~~~~~~t~t~sg~v~~~~~~d~v~vt~---~G~~~--~~~~~~~g~~t~~~~~~~~~~~~d 214 (317)
T PF13753_consen 140 APTAPTVTITGISDDNIINGAESTVTVTFSGTVTGFDAGDTVTVTI---NGTTY--TTTVGADGTWTVTVTPSDLAGLAD 214 (317)
T ss_pred cccccccceecccCCceeeccceeecccccccceeeeeceeEEEee---ccccc--ceeecCCCcccccccccccccccC
Confidence 11123344443333333333333333333322111 23323 34322 2233467899999988 345
Q ss_pred eeEEEEEEE
Q psy192 368 GIHNIHIKF 376 (898)
Q Consensus 368 G~h~i~V~~ 376 (898)
|.|.+.+..
T Consensus 215 ~~~~v~v~~ 223 (317)
T PF13753_consen 215 GTYTVTVTV 223 (317)
T ss_pred ceEEEEEEe
Confidence 677777654
No 29
>PF06312 Neurexophilin: Neurexophilin
Probab=73.51 E-value=9.7 Score=39.68 Aligned_cols=46 Identities=15% Similarity=0.240 Sum_probs=39.3
Q ss_pred c-cceEEEEECCCCC-eeeeEEEecCCCeEEEEEEecCCeeEEEEEEE
Q psy192 331 V-GALDAKVISPSGT-EDDCFIQPIDGDNYSIRFMPRENGIHNIHIKF 376 (898)
Q Consensus 331 ~-G~l~v~v~~p~g~-~~~~~v~~~~~g~y~v~f~P~~~G~h~i~V~~ 376 (898)
+ |++.|+|.+++-+ ...+.|.|..||+|.++|.-.=+|.-+|+|.+
T Consensus 68 G~GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L 115 (219)
T PF06312_consen 68 GWGDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL 115 (219)
T ss_pred CCCCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence 5 7889999887643 35689999999999999999999999998875
No 30
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=73.23 E-value=20 Score=31.42 Aligned_cols=59 Identities=12% Similarity=0.196 Sum_probs=41.5
Q ss_pred eeEEEEEeccCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeC
Q psy192 226 PCEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHI 286 (898)
Q Consensus 226 ~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I 286 (898)
.++|.+.+-.| ..+.|.+.........+...++|.++++..+..+|.|.-.+..+|..+
T Consensus 7 ~v~F~vwAP~A--~~V~L~~~~~~~~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~ 65 (85)
T cd02858 7 TVTFRLFAPKA--NEVQVRGSWGGAGSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRV 65 (85)
T ss_pred cEEEEEECCCC--CEEEEEeecCCCccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEe
Confidence 46888888766 445555544323345566678999999875577888999999998555
No 31
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=71.79 E-value=21 Score=32.40 Aligned_cols=62 Identities=11% Similarity=0.106 Sum_probs=35.8
Q ss_pred CCeEEEEEecCCccc----eEEEE--ECCCCCeeee--EEEecCCCeEEEEEEecCCeeEEEEEEECCee
Q psy192 319 KPTQFLVRKNGAVGA----LDAKV--ISPSGTEDDC--FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVH 380 (898)
Q Consensus 319 ~~~~f~V~~~~a~G~----l~v~v--~~p~g~~~~~--~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~ 380 (898)
+...+++...|+.|+ ..+.+ ....+...+. ...-..+|.+.++|+...+|.|.|...+++..
T Consensus 23 ~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~ 92 (100)
T PF02369_consen 23 DTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGS 92 (100)
T ss_dssp S-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred CcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence 344455554555443 34444 2223332222 34557899999999999999999999998753
No 32
>PF06312 Neurexophilin: Neurexophilin
Probab=71.34 E-value=11 Score=39.16 Aligned_cols=45 Identities=27% Similarity=0.292 Sum_probs=37.8
Q ss_pred cceEEEEeCCCCCe-eeeEEEEcCCCEEEEEEEcCcceeEEEEEEE
Q psy192 142 FDLSATVTSPGGVT-EDAEINEVEDGLYAVHFVPKELGVHTVSVRY 186 (898)
Q Consensus 142 ~~~~~~V~~p~G~~-~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~ 186 (898)
||+.|++.++.-+. ...+|+|.+||+|++.|+-.=+|.-.|+|.+
T Consensus 70 GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L 115 (219)
T PF06312_consen 70 GDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL 115 (219)
T ss_pred CCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence 77788888776432 4679999999999999999999998888876
No 33
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=70.07 E-value=22 Score=31.59 Aligned_cols=52 Identities=17% Similarity=0.360 Sum_probs=39.5
Q ss_pred CCEEecCCCCeeEEEecCccce---EEEEeCCCCeEEEEEeeCCCeEEEEEEEECC
Q psy192 38 NGDHVQGYGGLSLSIEGPSKAE---IQCKDNADGSLNISYRPTEPGYYIINLKFAD 90 (898)
Q Consensus 38 ~G~~v~G~g~l~v~v~GP~~~~---i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g 90 (898)
+|.|+++. .+.++++|+.... -....+.+|.+.+.++.+.+|.++|.+..++
T Consensus 30 ~Gnpv~~~-~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~~~~~G~~~vta~~~~ 84 (92)
T smart00634 30 NGNPVAGQ-EVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLEN 84 (92)
T ss_pred CCCCcCCC-EEEEEECCCceeeccCCeeeeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence 58889875 4777777764211 2234567999999999999999999998776
No 34
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=69.32 E-value=26 Score=31.80 Aligned_cols=32 Identities=16% Similarity=0.202 Sum_probs=25.5
Q ss_pred EEEEcCCCEEEEEEEcCcceeEEEEEEECCee
Q psy192 159 EINEVEDGLYAVHFVPKELGVHTVSVRYKDIH 190 (898)
Q Consensus 159 ~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~ 190 (898)
....+.+|.+.++|+...+|.+.|.+.+++..
T Consensus 61 ~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~ 92 (100)
T PF02369_consen 61 SATTDSNGIATVTLTSTKAGTYTVTATVDGGS 92 (100)
T ss_dssp -EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred ccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence 46778999999999999999999999999754
No 35
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=69.13 E-value=3.2e+02 Score=34.70 Aligned_cols=47 Identities=13% Similarity=0.263 Sum_probs=32.9
Q ss_pred CceEEEEEcCCcc--eeeEEEC--------CCCcEEEEEEe-CCCceEEEEEEECCee
Q psy192 239 GSLAISVEGPSKA--EIDFKDR--------KDGSCYVSYVV-AEPGEYRVGIKFNDQH 285 (898)
Q Consensus 239 g~l~v~v~gps~~--~i~~~d~--------~dGty~v~y~P-~~~G~y~i~V~~~g~~ 285 (898)
..+.+.|+.|++. .+...|| .||.|+-.|+. ...|.|+|.|+.+|..
T Consensus 627 A~Vta~i~~~~g~~~~l~l~DnGaGaD~~knDGIYS~yft~f~~nGrYsl~V~v~~~~ 684 (863)
T TIGR00868 627 ANVTALIESENGHTVTLELLDNGAGADTVKNDGIYSRYFTAYDGNGRYSLKVRALGGV 684 (863)
T ss_pred cEEEEEEECCCCCEEEEEEeecCCCCCccccCceeeeeeecccCCceEEEEEEEeCCC
Confidence 3455666666543 3455564 48999988875 4569999999998765
No 36
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=68.22 E-value=33 Score=29.73 Aligned_cols=56 Identities=7% Similarity=0.092 Sum_probs=30.2
Q ss_pred CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEE-------ecCCeeEEEEEE
Q psy192 320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFM-------PRENGIHNIHIK 375 (898)
Q Consensus 320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~-------P~~~G~h~i~V~ 375 (898)
+..|.+.......++.+.|.+.+|+.+. ..+.....|.+.+.|- +-..|.|++.|.
T Consensus 13 ~~~~~~~l~~~a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~ 76 (81)
T PF13860_consen 13 KGSIEYTLPEDADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVT 76 (81)
T ss_dssp EEEEEEEECSSCEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEE
T ss_pred EEEEEEeCCCcccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEE
Confidence 4455555544445678888888887652 2333344556666665 445555555554
No 37
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=64.59 E-value=13 Score=30.49 Aligned_cols=37 Identities=27% Similarity=0.481 Sum_probs=23.1
Q ss_pred EEeCCccceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192 614 AVEGPSKAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGE 653 (898)
Q Consensus 614 ~v~GPs~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~ 653 (898)
.++|.++.-+...+... .++|++..||.|.+.|+.-+
T Consensus 13 ~l~g~d~~W~~~~~~~~---~~~~~~L~~G~Y~l~V~a~~ 49 (66)
T PF07495_consen 13 RLEGFDDEWITLGSYSN---SISYTNLPPGKYTLEVRAKD 49 (66)
T ss_dssp EEETTESSEEEESSTS----EEEEES--SEEEEEEEEEEE
T ss_pred EEECCCCeEEECCCCcE---EEEEEeCCCEEEEEEEEEEC
Confidence 34555543333333333 99999999999999998744
No 38
>PF11896 DUF3416: Domain of unknown function (DUF3416); InterPro: IPR021828 This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=64.15 E-value=27 Score=35.66 Aligned_cols=62 Identities=16% Similarity=0.255 Sum_probs=36.6
Q ss_pred eeeCcEEEEEEEcCCCcccceEEEE--eCCCCCe-eeeEEEEcCCCEEEEEEEcCcceeEEEEEE
Q psy192 124 TEVGSTCKLTFKMPGITAFDLSATV--TSPGGVT-EDAEINEVEDGLYAVHFVPKELGVHTVSVR 185 (898)
Q Consensus 124 ~~vg~~~~f~v~~~~~~~~~~~~~V--~~p~G~~-~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~ 185 (898)
..+|+...++-++=.-|...+.|.+ ..++++. ....+...+++.|..+|+|.+.|.|+..|.
T Consensus 23 rvvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~Ve 87 (187)
T PF11896_consen 23 RVVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVE 87 (187)
T ss_dssp EETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEE
T ss_pred eecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEE
Confidence 3568888877766544555555544 6777765 455677889999999999999999877664
No 39
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=63.36 E-value=13 Score=33.81 Aligned_cols=57 Identities=14% Similarity=0.219 Sum_probs=32.0
Q ss_pred eeeeeeEEEEEeccCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECC
Q psy192 222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFND 283 (898)
Q Consensus 222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g 283 (898)
..|++.++++.-.|... -++.+.+- .....-....+.+++|+|.++|+|.+...++.
T Consensus 40 ~~G~~v~l~~~N~~~~~--h~~~i~~~---~~~~~l~~g~~~~~~f~~~~~G~y~~~C~~~~ 96 (104)
T PF13473_consen 40 KAGQPVTLTFTNNDSRP--HEFVIPDL---GISKVLPPGETATVTFTPLKPGEYEFYCTMHP 96 (104)
T ss_dssp ETTCEEEEEEEE-SSS---EEEEEGGG---TEEEEE-TT-EEEEEEEE-S-EEEEEB-SSS-
T ss_pred cCCCeEEEEEEECCCCc--EEEEECCC---ceEEEECCCCEEEEEEcCCCCEEEEEEcCCCC
Confidence 45777777776665443 44455441 22222345677899999999999999876554
No 40
>PF13115 YtkA: YtkA-like
Probab=62.96 E-value=71 Score=27.71 Aligned_cols=27 Identities=11% Similarity=0.118 Sum_probs=22.9
Q ss_pred eEEEecCCCeEEEEEEecCCeeEEEEE
Q psy192 348 CFIQPIDGDNYSIRFMPRENGIHNIHI 374 (898)
Q Consensus 348 ~~v~~~~~g~y~v~f~P~~~G~h~i~V 374 (898)
..+...++|.|.+......+|.|.|.|
T Consensus 59 ~~~~~~~~G~Y~~~~~f~m~G~W~i~v 85 (86)
T PF13115_consen 59 VELEETGPGVYEAEVTFSMAGTWQITV 85 (86)
T ss_pred eeeecCCCCeEEEEeecCCCeeEEEEE
Confidence 444556899999999999999999987
No 41
>COG1470 Predicted membrane protein [Function unknown]
Probab=62.65 E-value=3e+02 Score=32.02 Aligned_cols=73 Identities=12% Similarity=0.219 Sum_probs=42.2
Q ss_pred eCCCCeEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCee
Q psy192 13 DNHDGTVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHH 92 (898)
Q Consensus 13 ~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~ 92 (898)
.|....-.+.+--.-.+...+...|+|..|.|- .......-.+.+.++|..+|.|.+....+|.+
T Consensus 7 n~~~~~~~~~ls~~~~~~~d~~f~y~g~~i~~~---------------~lr~~e~~~l~~~v~~~~~g~~~v~f~i~~~~ 71 (513)
T COG1470 7 NNGTILVTVLLSLMSLKGADVKFLYQGIEIKGL---------------KLRPKESVELQFKVLPGKAGSYSVKFSIEGVP 71 (513)
T ss_pred cCCcEEEEEEEEehhcCCccEEEEEeeeEeeee---------------EcCCCcceEEEEEEecCCCCcEEEEEEECCce
Confidence 333333344444444477777778888777421 11111233567777888999998887765533
Q ss_pred cCCCCeEEEEec
Q psy192 93 VEGSPFTAKIVG 104 (898)
Q Consensus 93 i~gSPf~~~v~~ 104 (898)
- |++.|+.
T Consensus 72 ~----~~v~v~~ 79 (513)
T COG1470 72 Y----WTVNVYT 79 (513)
T ss_pred e----EEEEEEe
Confidence 2 5666665
No 42
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=60.66 E-value=25 Score=31.95 Aligned_cols=62 Identities=15% Similarity=0.193 Sum_probs=34.8
Q ss_pred eecCceeEEEEEcCCCCCceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCC
Q psy192 817 IKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPG 891 (898)
Q Consensus 817 ~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~G 891 (898)
+..|++..+++. |.|...-.+.+.+-.. + ..-....+.+++++|.++|+|.+.+.++.. ..|
T Consensus 39 v~~G~~v~l~~~--N~~~~~h~~~i~~~~~--------~--~~l~~g~~~~~~f~~~~~G~y~~~C~~~~~-m~G 100 (104)
T PF13473_consen 39 VKAGQPVTLTFT--NNDSRPHEFVIPDLGI--------S--KVLPPGETATVTFTPLKPGEYEFYCTMHPN-MKG 100 (104)
T ss_dssp EETTCEEEEEEE--E-SSS-EEEEEGGGTE--------E--EEE-TT-EEEEEEEE-S-EEEEEB-SSS-T-TB-
T ss_pred EcCCCeEEEEEE--ECCCCcEEEEECCCce--------E--EEECCCCEEEEEEcCCCCEEEEEEcCCCCc-cee
Confidence 455777777765 3444556666665321 1 112234588999999999999999997775 444
No 43
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=60.27 E-value=62 Score=28.00 Aligned_cols=44 Identities=11% Similarity=0.222 Sum_probs=22.8
Q ss_pred EEEEEEEcCCCcccceEEEEeCCCCCeeee-EEEEcCCCEEEEEEE
Q psy192 129 TCKLTFKMPGITAFDLSATVTSPGGVTEDA-EINEVEDGLYAVHFV 173 (898)
Q Consensus 129 ~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~-~v~~~~~g~y~v~f~ 173 (898)
...|.+.++.. ..++++.|.+.+|+.+.. .+.....|.+.+.|-
T Consensus 13 ~~~~~~~l~~~-a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~Wd 57 (81)
T PF13860_consen 13 KGSIEYTLPED-ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWD 57 (81)
T ss_dssp EEEEEEEECSS-CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-
T ss_pred EEEEEEeCCCc-ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEEC
Confidence 44555555532 235778888888887532 233334445555544
No 44
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=57.21 E-value=93 Score=27.16 Aligned_cols=60 Identities=25% Similarity=0.279 Sum_probs=40.3
Q ss_pred CeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecC
Q psy192 414 VKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIV 475 (898)
Q Consensus 414 ~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~ 475 (898)
...+|.+.+-.| ..+.+.+.............++| +.++-.+..+|.|.-.+..+|..++
T Consensus 6 ~~v~F~vwAP~A--~~V~L~~~~~~~~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~~ 66 (85)
T cd02858 6 RTVTFRLFAPKA--NEVQVRGSWGGAGSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRVI 66 (85)
T ss_pred CcEEEEEECCCC--CEEEEEeecCCCccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEec
Confidence 356888888766 45555555432223444556788 7888766788989988888886553
No 45
>KOG2996|consensus
Probab=55.74 E-value=7.2 Score=45.28 Aligned_cols=35 Identities=29% Similarity=0.526 Sum_probs=30.8
Q ss_pred cceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECCe
Q psy192 620 KAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGEK 654 (898)
Q Consensus 620 ~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~ 654 (898)
+|+-.++...+|+|.|.|.+.+.++|.|+||||++
T Consensus 696 qaes~Lk~~~ngT~LVR~r~kea~e~AISikynne 730 (865)
T KOG2996|consen 696 QAESTLKNRPNGTYLVRYRTKEAKEFAISIKYNNE 730 (865)
T ss_pred hhhhHhhcCCCceEEEEecccchhheeEEEEeccc
Confidence 34566777889999999999999999999999985
No 46
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=55.33 E-value=1.3e+02 Score=33.37 Aligned_cols=142 Identities=18% Similarity=0.209 Sum_probs=70.1
Q ss_pred EEEEEcCCCcccc-eEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcC---cceeEEEEEE-EC---CeecCCCceEEEEcc
Q psy192 131 KLTFKMPGITAFD-LSATVTSPGGVTEDAEINEVEDGLYAVHFVPK---ELGVHTVSVR-YK---DIHIPGSPFQFTVGP 202 (898)
Q Consensus 131 ~f~v~~~~~~~~~-~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~---~~G~h~I~V~-~~---g~~V~gSPf~v~V~~ 202 (898)
.|.....++..++ +.+.+ +|..... ....+|.|.+.+.|. ..|.|.|.+. .- |..-......+.|..
T Consensus 64 t~s~tvs~~~~g~~v~v~~---~g~~~t~--~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t~~s~~~~vDt 138 (317)
T PF13753_consen 64 TFSGTVSGAEPGSTVTVTI---NGTTGTL--TADADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTSTAASQTFTVDT 138 (317)
T ss_pred EEEEEecCCCCCCEEEEEE---CCEEEEE--EEecCCcEEEeeccccccccCcceeEEEEEEccCCcccccccccccccc
Confidence 4555555554444 44444 4544333 344788999999884 7899999998 42 222111234444432
Q ss_pred CCCCCcceEEEeCCCCCcceeeeeeEEEEEecc----CCC-CceEEEEEcCCcceeeEEECCCCcEEEEEEe-----CCC
Q psy192 203 LRDGGAHRVHAGGPGLERGEQNQPCEFNVWTRE----AGA-GSLAISVEGPSKAEIDFKDRKDGSCYVSYVV-----AEP 272 (898)
Q Consensus 203 ~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~d----aG~-g~l~v~v~gps~~~i~~~d~~dGty~v~y~P-----~~~ 272 (898)
.. ..+..+.+.+..-.....++...+++...+ ... ..+.+.+.|... ......||.|++.|.| ..-
T Consensus 139 ~~-~~~p~vti~~~~~~~~~~~~~~~~t~t~sg~v~~~~~~d~v~vt~~G~~~---~~~~~~~g~~t~~~~~~~~~~~~d 214 (317)
T PF13753_consen 139 TA-PTAPTVTITGISDDNIINGAESTVTVTFSGTVTGFDAGDTVTVTINGTTY---TTTVGADGTWTVTVTPSDLAGLAD 214 (317)
T ss_pred cc-ccccccceecccCCceeeccceeecccccccceeeeeceeEEEeeccccc---ceeecCCCcccccccccccccccC
Confidence 10 012334444211001111222223332222 122 225566655432 2334578899999988 344
Q ss_pred ceEEEEEEE
Q psy192 273 GEYRVGIKF 281 (898)
Q Consensus 273 G~y~i~V~~ 281 (898)
|.|.|.+..
T Consensus 215 ~~~~v~v~~ 223 (317)
T PF13753_consen 215 GTYTVTVTV 223 (317)
T ss_pred ceEEEEEEe
Confidence 666666654
No 47
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=53.66 E-value=97 Score=29.88 Aligned_cols=32 Identities=16% Similarity=0.434 Sum_probs=26.2
Q ss_pred CCCCcEEEEEEeCCCceEEEEEEEC--CeeCCCC
Q psy192 258 RKDGSCYVSYVVAEPGEYRVGIKFN--DQHIPDS 289 (898)
Q Consensus 258 ~~dGty~v~y~P~~~G~y~i~V~~~--g~~I~gS 289 (898)
..+.+.+++|++.+||.|..+..+. ..|+|||
T Consensus 97 ~pGet~TitF~adKpG~Y~y~C~~HP~~~H~~~~ 130 (135)
T TIGR03096 97 KAGETKTISFKADKAGAFTIWCQLHPKNIHLPGS 130 (135)
T ss_pred CCCCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence 4466789999999999999888775 5677876
No 48
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=51.93 E-value=64 Score=27.44 Aligned_cols=45 Identities=22% Similarity=0.256 Sum_probs=29.6
Q ss_pred ceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCee
Q psy192 143 DLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIH 190 (898)
Q Consensus 143 ~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~ 190 (898)
..++.+.+.++.. ........+|.|.+.-.|. |.|.|.+...|..
T Consensus 16 ~a~V~l~~~~~~~-~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~ 60 (82)
T PF13620_consen 16 GATVTLTDQDGGT-VYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ 60 (82)
T ss_dssp T-EEEET--TTTE-CCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred CEEEEEEEeeCCC-EEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence 3456666555553 4567778999999997777 9999999888854
No 49
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=51.88 E-value=67 Score=28.47 Aligned_cols=30 Identities=20% Similarity=0.328 Sum_probs=25.9
Q ss_pred EEEEcCCCEEEEEEEcCcceeEEEEEEECC
Q psy192 159 EINEVEDGLYAVHFVPKELGVHTVSVRYKD 188 (898)
Q Consensus 159 ~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g 188 (898)
....+.+|...+.++...+|.+.|.++.+|
T Consensus 55 ~~~Td~~G~a~~~l~~~~~G~~~vta~~~~ 84 (92)
T smart00634 55 TATTDANGIATVTLTSTTAGVYTVTASLEN 84 (92)
T ss_pred eeeeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence 345667999999999999999999999876
No 50
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=50.97 E-value=69 Score=28.18 Aligned_cols=65 Identities=12% Similarity=0.168 Sum_probs=34.3
Q ss_pred eeeCcEEEEEEEcCCCcc---cceEEEEeCCCCCeeeeEEEEcC---CCEEEEEEEcCcceeEEEEEEECC
Q psy192 124 TEVGSTCKLTFKMPGITA---FDLSATVTSPGGVTEDAEINEVE---DGLYAVHFVPKELGVHTVSVRYKD 188 (898)
Q Consensus 124 ~~vg~~~~f~v~~~~~~~---~~~~~~V~~p~G~~~~~~v~~~~---~g~y~v~f~P~~~G~h~I~V~~~g 188 (898)
...|+..+|.+.+...|. ..+.+.+..-+.......|.... ..++.+.|.+..+|.|.|.+.++.
T Consensus 15 ~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~ 85 (101)
T PF07705_consen 15 VVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP 85 (101)
T ss_dssp EETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred ccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence 445666666666654433 23444443222111122332333 336788888889999999999874
No 51
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=49.53 E-value=2.4e+02 Score=33.23 Aligned_cols=114 Identities=14% Similarity=0.228 Sum_probs=65.2
Q ss_pred EEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCC
Q psy192 73 SYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPG 152 (898)
Q Consensus 73 ~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~ 152 (898)
-|+=...|..+|+|.|+-+.. +--+.+.|++-. .. -...+..+++.+++...+..+++++|.+.+
T Consensus 287 iy~~~~s~~~~vei~~~~~~p-~p~~~lsVsGL~--~e------------Y~I~dG~~~i~ftv~a~g~~~vta~V~d~~ 351 (478)
T PRK13211 287 IYLKAGSGLERVEIGYDIEAP-APDYELEVSGLA--KE------------YKIGDGAATLDFTVTATGDMNVEATVYNHD 351 (478)
T ss_pred eEEeCCCCceEEEEEEEecCC-CCCCceEEecCc--ce------------eEEcCCcEEEEEEEEeccceEEEEEEEcCC
Confidence 345566666666666654332 113444444322 11 011123444444544455678999999999
Q ss_pred CCeeeeEEEEcCC--CEEEEEEEcCcceeEEEEEEE---CCeecCCCceEEEEc
Q psy192 153 GVTEDAEINEVED--GLYAVHFVPKELGVHTVSVRY---KDIHIPGSPFQFTVG 201 (898)
Q Consensus 153 G~~~~~~v~~~~~--g~y~v~f~P~~~G~h~I~V~~---~g~~V~gSPf~v~V~ 201 (898)
|+.+-..-....+ ..+++...-.++|.|.|.|+. +|+.+.-.-+.|.|.
T Consensus 352 g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~ 405 (478)
T PRK13211 352 GEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLE 405 (478)
T ss_pred CCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEE
Confidence 9865332222333 356666667789999999986 456664555777773
No 52
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=47.91 E-value=3.1e+02 Score=27.76 Aligned_cols=72 Identities=21% Similarity=0.253 Sum_probs=44.2
Q ss_pred eeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc--CcceeEEEEEEECCeecCCCceEEEEc
Q psy192 125 EVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVP--KELGVHTVSVRYKDIHIPGSPFQFTVG 201 (898)
Q Consensus 125 ~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P--~~~G~h~I~V~~~g~~V~gSPf~v~V~ 201 (898)
.+|....|.|..... ....+.+++|+|+....-..|.......+.. | -+.|..+++|..... .-.+..++|.
T Consensus 26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~i-pg~ae~G~W~y~i~~~~~--~~q~v~vtVt 99 (179)
T PF09315_consen 26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQI-PGTAEVGTWTYSITNTSS--SSQTVTVTVT 99 (179)
T ss_pred CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEEC-CCCcccccEEEEEecCCC--CcceEEEEEE
Confidence 457788888877665 3467789999999765534444444444443 4 467888888753321 1233555554
No 53
>KOG2996|consensus
Probab=46.14 E-value=15 Score=42.91 Aligned_cols=32 Identities=19% Similarity=0.560 Sum_probs=28.0
Q ss_pred EEeeCCceEEEEEEeCCCeeEEEEEEECCc--cC
Q psy192 858 TRHTGRNNFEVKYIVRDRGEYLLIVKWGDD--HI 889 (898)
Q Consensus 858 ~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~--~i 889 (898)
.+...+|||-|+|.+++.++|.|+|||+++ ||
T Consensus 701 Lk~~~ngT~LVR~r~kea~e~AISikynnevKHi 734 (865)
T KOG2996|consen 701 LKNRPNGTYLVRYRTKEAKEFAISIKYNNEVKHI 734 (865)
T ss_pred hhcCCCceEEEEecccchhheeEEEEeccccceE
Confidence 445578999999999999999999999985 55
No 54
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=44.95 E-value=78 Score=26.34 Aligned_cols=40 Identities=15% Similarity=0.336 Sum_probs=26.9
Q ss_pred ceEEEEECCCCceeee-EEeC--Ce--EEEEEecCCCeeEEEEEE
Q psy192 429 TLAVTIDGPSKVSMDC-TEVE--EG--YKVRYTPLVPGDYYVSLK 468 (898)
Q Consensus 429 ~l~v~v~gp~~~~~~~-~~~~--~g--y~v~ytp~~~G~y~i~V~ 468 (898)
++.+.+..+++..+.- .+.. .+ -.++|++..+|.|.|.|.
T Consensus 25 d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V~ 69 (70)
T PF04151_consen 25 DADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRVY 69 (70)
T ss_dssp SEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred CeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence 6677777776433322 1222 23 578999999999999885
No 55
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=44.73 E-value=2.9e+02 Score=26.48 Aligned_cols=91 Identities=13% Similarity=0.271 Sum_probs=49.8
Q ss_pred EEEEEEECCee---cCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEE
Q psy192 370 HNIHIKFNGVH---IPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTE 446 (898)
Q Consensus 370 h~i~V~~~g~~---I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~ 446 (898)
.++.|++.+.+ |.+..|.+. .|++.+.+..-.-.....+-...|.....+ .|.+.+.+..+..- .-.-
T Consensus 16 v~VpV~~~~v~~~~i~~~~f~l~-----yD~s~Lev~~v~~G~i~~~~~~~~~~n~~~--~g~i~~~~~~~~~~--~~~i 86 (135)
T cd08548 16 VTVPVTLSNVPSKGIGACDFVLS-----YDPSVLEVVSVEAGDIVKNPSANFSSNVSS--EGTIKLLFSDSTGG--SYLI 86 (135)
T ss_pred EEEEEEEecCCccCEEEEEEEEE-----eCCceeEEEecccCceecCCccceEeeccC--CCeEEEEEECCCCC--ceEE
Confidence 44555555443 555555544 366665554221111111124566665544 57788777766421 1122
Q ss_pred eCCe--EEEEEecCC---CeeEEEEEEE
Q psy192 447 VEEG--YKVRYTPLV---PGDYYVSLKY 469 (898)
Q Consensus 447 ~~~g--y~v~ytp~~---~G~y~i~V~~ 469 (898)
.++| ++++|+... .|.|.|.+.-
T Consensus 87 ~~~G~~atitfkv~~~a~~~~~~i~~~~ 114 (135)
T cd08548 87 TTDGVFATITFKVKSNAAGGTYAITLSS 114 (135)
T ss_pred ecCCEEEEEEEEEcCCCCCCcEEEEEec
Confidence 3677 689999887 7889887764
No 56
>PF09134 Invasin_D3: Invasin, domain 3; InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=43.46 E-value=72 Score=28.94 Aligned_cols=69 Identities=14% Similarity=0.198 Sum_probs=36.5
Q ss_pred CeEEEEEecCCccc----eE-EEEECCCCCeee-eEEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEe
Q psy192 320 PTQFLVRKNGAVGA----LD-AKVISPSGTEDD-CFIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKV 390 (898)
Q Consensus 320 ~~~f~V~~~~a~G~----l~-v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V 390 (898)
.+.+++.++|+.|+ |. ++.. .+|.... ..+.+ ..|.|+.+..=..+|..+|.=.++|+++..-|-+++.
T Consensus 22 ta~LTfta~D~~g~pvsGl~~l~f~-~~g~~vtls~itE-~~GvYtATl~Gt~~G~vtI~P~v~G~~l~~~~~k~tl 96 (99)
T PF09134_consen 22 TATLTFTARDANGNPVSGLSVLAFV-QDGVPVTLSPITE-NPGVYTATLKGTSAGEVTITPQVNGQPLATLQKKLTL 96 (99)
T ss_dssp -EEEEEE-B-TTS-B--S--SEEEE-EEES--EE---EE--SS-EEEEEE-SS-EEEEEEEEETTEE-GGG-EEEEE
T ss_pred eEEEEEEEEcCCCCcccCceeEEEE-ccCCCcEEccccc-cCCEEEEEEEcccCceEEEEEeeCCEEccccceeEEE
Confidence 45566666776553 22 2221 1232211 12333 4689999999999999999999999999876666654
No 57
>PF00017 SH2: SH2 domain; InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates. The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=42.96 E-value=39 Score=28.56 Aligned_cols=26 Identities=35% Similarity=0.644 Sum_probs=24.9
Q ss_pred CCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192 628 NKDGTVAVSYLPTAPGEYKIAVKFGE 653 (898)
Q Consensus 628 ~~dg~~~v~Y~P~~pG~Y~i~V~~~~ 653 (898)
..+|+|.|....+.+|.|.|+|++++
T Consensus 19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~ 44 (77)
T PF00017_consen 19 KPDGTFLVRPSSSKPGKYVLSVRFDG 44 (77)
T ss_dssp SSTTEEEEEEESSSTTSEEEEEEETT
T ss_pred CCCCeEEEEecccccccccccccccc
Confidence 56999999999999999999999999
No 58
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=42.23 E-value=64 Score=27.41 Aligned_cols=50 Identities=16% Similarity=0.361 Sum_probs=31.5
Q ss_pred CCEEecCCCCeeEEEecCcc-ceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCee
Q psy192 38 NGDHVQGYGGLSLSIEGPSK-AEIQCKDNADGSLNISYRPTEPGYYIINLKFADHH 92 (898)
Q Consensus 38 ~G~~v~G~g~l~v~v~GP~~-~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~ 92 (898)
+|.|++ +..|.+..... ......-+.+|.|.+.-+|. |.|.|.+...|..
T Consensus 10 ~g~pv~---~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~ 60 (82)
T PF13620_consen 10 TGQPVP---GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ 60 (82)
T ss_dssp TSCBHT---T-EEEET--TTTECCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred CCCCcC---CEEEEEEEeeCCCEEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence 577775 44666664332 23445667999999998876 9999999876643
No 59
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=41.39 E-value=72 Score=30.26 Aligned_cols=70 Identities=17% Similarity=0.305 Sum_probs=40.6
Q ss_pred CCeEEEEEecCC--ccceEEEEECCCCCeeee-EEEec-CCCe-EEEEE-EecCCeeEEEE---EEECCeecCCC-CeEE
Q psy192 319 KPTQFLVRKNGA--VGALDAKVISPSGTEDDC-FIQPI-DGDN-YSIRF-MPRENGIHNIH---IKFNGVHIPGS-PLRI 388 (898)
Q Consensus 319 ~~~~f~V~~~~a--~G~l~v~v~~p~g~~~~~-~v~~~-~~g~-y~v~f-~P~~~G~h~i~---V~~~g~~I~gS-Pf~v 388 (898)
-|..|.+.-++. .+-..+++.+|+|..+.. ..... .+++ +.|.- .|-++|.|.+. |.-+|..+.|| +|.+
T Consensus 45 aP~~i~L~Fse~ve~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV 124 (127)
T COG2372 45 APAAITLEFSEGVEPGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSV 124 (127)
T ss_pred CceeEEEecCCccCCCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEe
Confidence 466677766655 233568888888865422 11111 1222 55544 45677888777 45678888886 4444
No 60
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=40.75 E-value=1.6e+02 Score=24.44 Aligned_cols=53 Identities=19% Similarity=0.308 Sum_probs=32.4
Q ss_pred EEEEecCCccceEEEEECCCCCeeeeEE-Eec-CCCeEEEEEEecCCeeEEEEEE
Q psy192 323 FLVRKNGAVGALDAKVISPSGTEDDCFI-QPI-DGDNYSIRFMPRENGIHNIHIK 375 (898)
Q Consensus 323 f~V~~~~a~G~l~v~v~~p~g~~~~~~v-~~~-~~g~y~v~f~P~~~G~h~i~V~ 375 (898)
+.|+.....+++...+.+++|....... ... .+..=.++|+..++|+|.|.|.
T Consensus 15 l~i~l~~~~~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V~ 69 (70)
T PF04151_consen 15 LTIDLSGGSGDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRVY 69 (70)
T ss_dssp EEEEECETTSSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred EEEEEcCCCCCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence 4444444445677778888864321100 111 1234678999999999999885
No 61
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=40.43 E-value=32 Score=28.12 Aligned_cols=19 Identities=32% Similarity=0.707 Sum_probs=15.4
Q ss_pred EEEEecCCCeeEEEEEEEC
Q psy192 452 KVRYTPLVPGDYYVSLKYN 470 (898)
Q Consensus 452 ~v~ytp~~~G~y~i~V~~~ 470 (898)
.++|++..||+|+|.|+..
T Consensus 30 ~~~~~~L~~G~Y~l~V~a~ 48 (66)
T PF07495_consen 30 SISYTNLPPGKYTLEVRAK 48 (66)
T ss_dssp EEEEES--SEEEEEEEEEE
T ss_pred EEEEEeCCCEEEEEEEEEE
Confidence 9999999999999988863
No 62
>KOG1693|consensus
Probab=40.21 E-value=1.3e+02 Score=30.81 Aligned_cols=78 Identities=17% Similarity=0.187 Sum_probs=47.6
Q ss_pred CCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc
Q psy192 95 GSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVP 174 (898)
Q Consensus 95 gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P 174 (898)
=||+++..-+...+...+.+ ..-++.+.|.+.....|..|+...+.+|+|+++..+-...- =++.|+-
T Consensus 21 a~elTfeLp~~aKqC~Y~d~---------~~~~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~---~~~~f~a 88 (209)
T KOG1693|consen 21 ASELTFELPDNAKQCFYEDL---------KKDDDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRY---DSFLFKA 88 (209)
T ss_pred cccEEEEcCCcchhheeeec---------ccCCceEEEEEEEEeCCceeeEEEEECCCCCEEeecccccc---ccEEEEE
Confidence 35777665554433222222 12234466666655557788999999999997543333222 2467888
Q ss_pred CcceeEEEEE
Q psy192 175 KELGVHTVSV 184 (898)
Q Consensus 175 ~~~G~h~I~V 184 (898)
...|+|++..
T Consensus 89 e~~G~Y~fCF 98 (209)
T KOG1693|consen 89 EGKGEYTFCF 98 (209)
T ss_pred ecceEEEEEe
Confidence 8899988764
No 63
>PF00017 SH2: SH2 domain; InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates. The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=40.09 E-value=39 Score=28.62 Aligned_cols=27 Identities=22% Similarity=0.407 Sum_probs=24.7
Q ss_pred eCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192 861 TGRNNFEVKYIVRDRGEYLLIVKWGDD 887 (898)
Q Consensus 861 ~g~~~y~v~y~~~~~G~y~l~vk~~~~ 887 (898)
..+|+|-|+.....+|.|.|+|++.++
T Consensus 19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~~ 45 (77)
T PF00017_consen 19 KPDGTFLVRPSSSKPGKYVLSVRFDGK 45 (77)
T ss_dssp SSTTEEEEEEESSSTTSEEEEEEETTE
T ss_pred CCCCeEEEEeccccccccccccccccc
Confidence 458999999999999999999999983
No 64
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=38.60 E-value=75 Score=27.97 Aligned_cols=67 Identities=30% Similarity=0.539 Sum_probs=35.7
Q ss_pred ecCCeeEEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccc--eE-EEEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192 579 VSGEPCLFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKA--EI-TYHDNKDGTVAVSYLPTAPGEYKIAVKFGE 653 (898)
Q Consensus 579 ~~g~p~~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~--~i-~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~ 653 (898)
..|++..|++.-++.|....-.+.+ .+.+.|.... .+ .+...+..++.+.|.+..+|.|.|.+..+.
T Consensus 16 ~~g~~~~i~~~V~N~G~~~~~~~~v--------~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~ 85 (101)
T PF07705_consen 16 VPGEPVTITVTVKNNGTADAENVTV--------RLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP 85 (101)
T ss_dssp ETTSEEEEEEEEEE-SSS-BEEEEE--------EEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred cCCCEEEEEEEEEECCCCCCCCEEE--------EEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence 4557778888777777643211111 1222222211 22 233344566778888889999999988754
No 65
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=38.44 E-value=3.5e+02 Score=25.53 Aligned_cols=80 Identities=19% Similarity=0.200 Sum_probs=51.3
Q ss_pred eEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeC--------CCCeEEEEEeeCCCeEEEEEEEEC
Q psy192 18 TVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDN--------ADGSLNISYRPTEPGYYIINLKFA 89 (898)
Q Consensus 18 ~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~--------~dG~~~v~ytP~~~G~~~I~V~~~ 89 (898)
.|+-.|+|.+.|.|++.+.-+|. ..+.|.|-. -++...+ ..........-..-+.|.|.|.|-
T Consensus 49 ~~~G~~~~~~~G~y~f~~~~~d~-------~~l~idg~~--vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y~ 119 (145)
T PF07691_consen 49 RWTGYFKPPETGTYTFSLTSDDG-------ARLWIDGKL--VIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEYF 119 (145)
T ss_dssp EEEEEEEESSSEEEEEEEEESSE-------EEEEETTEE--EEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEEE
T ss_pred EEEEEEecccCceEEEEEEeccc-------EEEEECCEE--EEcCCccccccccccccceEEEEEEeeCCeeEEEEEEEE
Confidence 46778999999999999997664 445555411 1111111 123455556666778999999998
Q ss_pred CeecCCCCeEEEEecCC
Q psy192 90 DHHVEGSPFTAKIVGEG 106 (898)
Q Consensus 90 g~~i~gSPf~~~v~~~~ 106 (898)
...-....|.+....+.
T Consensus 120 ~~~~~~~~~~l~w~~p~ 136 (145)
T PF07691_consen 120 NRGGDASLIQLSWTPPG 136 (145)
T ss_dssp ECSCSSCCEEEEEE-TT
T ss_pred ECCCCCEEEEEEEECCC
Confidence 87777777777777665
No 66
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=37.45 E-value=84 Score=30.29 Aligned_cols=50 Identities=10% Similarity=0.309 Sum_probs=39.2
Q ss_pred CceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECC
Q psy192 834 AGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGD 886 (898)
Q Consensus 834 ~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~ 886 (898)
...+++.+.-|+....+ ..+.....++|.|........+|.|.|.|+|..
T Consensus 85 ~~~~~l~l~rp~~~~~D---~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~~~~ 134 (146)
T PF05751_consen 85 GAKLTLSLYRPTDAKLD---FTLTLTESAPGVYRAPVPLLKKGRWYLRLDWEP 134 (146)
T ss_pred CceEEEEEECCCCccCC---eeEEeeECCCceEEEEcCCCCCccEEEEEEEec
Confidence 46788888888764332 246667788999999999889999999997743
No 67
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=37.44 E-value=34 Score=32.92 Aligned_cols=30 Identities=17% Similarity=0.383 Sum_probs=26.6
Q ss_pred CceEEEEEEeCCCeeEEEEEEECC--ccCCCC
Q psy192 863 RNNFEVKYIVRDRGEYLLIVKWGD--DHIPGS 892 (898)
Q Consensus 863 ~~~y~v~y~~~~~G~y~l~vk~~~--~~i~GS 892 (898)
+.+.++++++.+||+|...|.|+- .|+|||
T Consensus 99 Get~TitF~adKpG~Y~y~C~~HP~~~H~~~~ 130 (135)
T TIGR03096 99 GETKTISFKADKAGAFTIWCQLHPKNIHLPGS 130 (135)
T ss_pred CCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence 458899999999999999999965 599997
No 68
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=36.50 E-value=40 Score=29.08 Aligned_cols=29 Identities=24% Similarity=0.499 Sum_probs=25.1
Q ss_pred EEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192 625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGE 653 (898)
Q Consensus 625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~ 653 (898)
+...++|+|.|....+.+|.|.|++++++
T Consensus 17 L~~~~~G~FLvR~s~~~~~~~~Lsv~~~~ 45 (84)
T smart00252 17 LKNEGDGDFLVRDSESEPGDYVLSVRVKG 45 (84)
T ss_pred HhcCCCcEEEEEcCCCCCCCEEEEEEECC
Confidence 34456999999999999999999999973
No 69
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=36.00 E-value=33 Score=29.58 Aligned_cols=28 Identities=25% Similarity=0.471 Sum_probs=24.0
Q ss_pred eeCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192 860 HTGRNNFEVKYIVRDRGEYLLIVKWGDD 887 (898)
Q Consensus 860 ~~g~~~y~v~y~~~~~G~y~l~vk~~~~ 887 (898)
..++|+|-|+...+.+|.|.|++++.++
T Consensus 19 ~~~~G~FLvR~s~~~~~~~~Lsv~~~~~ 46 (84)
T smart00252 19 NEGDGDFLVRDSESEPGDYVLSVRVKGK 46 (84)
T ss_pred cCCCcEEEEEcCCCCCCCEEEEEEECCE
Confidence 3458999999999999999999999853
No 70
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=35.88 E-value=32 Score=30.97 Aligned_cols=55 Identities=36% Similarity=0.467 Sum_probs=30.8
Q ss_pred eEEEEeCCCCCeeee---EEEEcCCCEEEEEEEc-CcceeEEEEEEE---CCeecCCCceEEEE
Q psy192 144 LSATVTSPGGVTEDA---EINEVEDGLYAVHFVP-KELGVHTVSVRY---KDIHIPGSPFQFTV 200 (898)
Q Consensus 144 ~~~~V~~p~G~~~~~---~v~~~~~g~y~v~f~P-~~~G~h~I~V~~---~g~~V~gSPf~v~V 200 (898)
-++.|++++|+.+.. .+.. ....+.+..-+ ..+|.|.|.-+. +|.++.|+ |.|.|
T Consensus 35 s~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~V 96 (97)
T PF04234_consen 35 SSVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFTV 96 (97)
T ss_dssp -EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEEE
T ss_pred cEEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEEE
Confidence 346677777765432 2222 34566777666 567777776555 67777776 44443
No 71
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=34.43 E-value=1.6e+02 Score=31.06 Aligned_cols=27 Identities=19% Similarity=0.209 Sum_probs=15.7
Q ss_pred CeEEEEEecCCccceEEEEECCCCCee
Q psy192 320 PTQFLVRKNGAVGALDAKVISPSGTED 346 (898)
Q Consensus 320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~ 346 (898)
...|.++..++...+.+.|.+.+|+.+
T Consensus 114 ~~~~~~~l~~~a~~vti~I~D~~G~~V 140 (225)
T PRK06655 114 TTPFGVELPSAADNVTVTITDSAGQVV 140 (225)
T ss_pred ceEEEEEcCCCCcEEEEEEEcCCCCEE
Confidence 455555544443456677777777654
No 72
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=33.98 E-value=1.6e+02 Score=27.73 Aligned_cols=55 Identities=22% Similarity=0.454 Sum_probs=41.2
Q ss_pred eeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCC
Q psy192 48 LSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEG 106 (898)
Q Consensus 48 l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~ 106 (898)
..|++.|-....+...-+.||+|.+.=+| +|.|.++|..-+..-+ |+++.|.+.+
T Consensus 10 t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F~--~~RVdV~~~~ 64 (123)
T PF09430_consen 10 TRVTLNGGQYRPISAFVRSDGSFVFHNVP--PGSYLLEVHSPDYVFP--PYRVDVSSSG 64 (123)
T ss_pred EEEEEeCCCccceEEEecCCCEEEeCCCC--CceEEEEEECCCcccc--CEEEEEecCC
Confidence 44566554444556666789999988775 5999999998877777 7999998555
No 73
>PF03422 CBM_6: Carbohydrate binding module (family 6); InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=33.43 E-value=2e+02 Score=26.55 Aligned_cols=64 Identities=22% Similarity=0.357 Sum_probs=39.3
Q ss_pred cCCceeEEEEEEECCEEecCCCCeeEEEecCcc---ceEEEEeCC-CCe---EEEEEeeCCCeEEEEEEEECCe
Q psy192 25 PREEGLHELALKFNGDHVQGYGGLSLSIEGPSK---AEIQCKDNA-DGS---LNISYRPTEPGYYIINLKFADH 91 (898)
Q Consensus 25 P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~---~~i~~~d~~-dG~---~~v~ytP~~~G~~~I~V~~~g~ 91 (898)
=.+.|.|.+.+++.... +.+.++|.+.|+.. ..+.+...+ -.+ ..+.. ....|.|+|.+.+.+.
T Consensus 41 ~~~~g~y~~~~~~a~~~--~~~~~~l~id~~~g~~~~~~~~~~tg~w~~~~~~~~~v-~l~~G~h~i~l~~~~~ 111 (125)
T PF03422_consen 41 VPEAGTYTLTIRYANGG--GGGTIELRIDGPDGTLIGTVSLPPTGGWDTWQTVSVSV-KLPAGKHTIYLVFNGG 111 (125)
T ss_dssp ESSSEEEEEEEEEEESS--SSEEEEEEETTTTSEEEEEEEEE-ESSTTEEEEEEEEE-EEESEEEEEEEEESSS
T ss_pred eCCCceEEEEEEEECCC--CCcEEEEEECCCCCcEEEEEEEcCCCCccccEEEEEEE-eeCCCeeEEEEEEECC
Confidence 45889999998884331 11568899988443 445553222 122 22333 3444999999999764
No 74
>PF01483 P_proprotein: Proprotein convertase P-domain; InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=31.90 E-value=3e+02 Score=23.91 Aligned_cols=35 Identities=26% Similarity=0.358 Sum_probs=22.6
Q ss_pred eEEEEEeccCCCCceEEEEEcCCcceeeEEECCCC
Q psy192 227 CEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDG 261 (898)
Q Consensus 227 ~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dG 261 (898)
+.+.++..-.-.++|.+.+.+|++..+.+.++..+
T Consensus 4 v~v~v~i~H~~~gdL~i~L~SP~Gt~~~L~~~~~~ 38 (87)
T PF01483_consen 4 VQVSVDITHPYRGDLRITLISPSGTRSTLKDRRGS 38 (87)
T ss_dssp EEEEEEEEESSGGGEEEEEE-TT--EEEEE-SSTT
T ss_pred EEEEEEEEcCCcCCEEEEEECCCCCEEEEECCcCC
Confidence 45556665667799999999999988877765433
No 75
>PF01483 P_proprotein: Proprotein convertase P-domain; InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=31.56 E-value=2e+02 Score=25.04 Aligned_cols=61 Identities=20% Similarity=0.325 Sum_probs=32.4
Q ss_pred EEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcC------CCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEcc
Q psy192 130 CKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVE------DGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVGP 202 (898)
Q Consensus 130 ~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~------~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~~ 202 (898)
+.+.+++.....++|.+.+++|+|... .+.+.. ++.+...|+ ...|.|+...|. +++.|..
T Consensus 4 v~v~v~i~H~~~gdL~i~L~SP~Gt~~--~L~~~~~~d~~~~~~~~~~f~---------~~~f~Ge~~~G~-W~L~v~D 70 (87)
T PF01483_consen 4 VQVSVDITHPYRGDLRITLISPSGTRS--TLKDRRGSDDSGSGFLNWTFT---------SVAFWGESANGT-WTLRVTD 70 (87)
T ss_dssp EEEEEEEEESSGGGEEEEEE-TT--EE--EEE-SSTTHHHSTSEEEEEEE---------ESTTTT-B--EE-EEEEEEE
T ss_pred EEEEEEEEcCCcCCEEEEEECCCCCEE--EEECCcCCCcccccccccEEE---------EEeecCccCCCE-EEEEEEE
Confidence 345555555567999999999999864 344322 345666665 233456665554 5555543
No 76
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=31.55 E-value=1.9e+02 Score=30.48 Aligned_cols=14 Identities=21% Similarity=0.437 Sum_probs=8.3
Q ss_pred ceEEEEeCCCCCee
Q psy192 143 DLSATVTSPGGVTE 156 (898)
Q Consensus 143 ~~~~~V~~p~G~~~ 156 (898)
.+++.|.+.+|+.+
T Consensus 127 ~vti~I~D~~G~~V 140 (225)
T PRK06655 127 NVTVTITDSAGQVV 140 (225)
T ss_pred EEEEEEEcCCCCEE
Confidence 35666666666654
No 77
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=31.48 E-value=1.6e+03 Score=31.09 Aligned_cols=117 Identities=15% Similarity=0.162 Sum_probs=77.3
Q ss_pred eEEEeC-CCCCcceeeeeeEEEEEeccCCC------CceEEEEEcCCccee---eEEECCCCcEEEEEEeC---CCceEE
Q psy192 210 RVHAGG-PGLERGEQNQPCEFNVWTREAGA------GSLAISVEGPSKAEI---DFKDRKDGSCYVSYVVA---EPGEYR 276 (898)
Q Consensus 210 ~v~a~G-~GL~~~~~g~~~~F~V~~~daG~------g~l~v~v~gps~~~i---~~~d~~dGty~v~y~P~---~~G~y~ 276 (898)
++.++. -||-+ .|+..++.+-+|+... ..+.+.+..|++..+ .+.-+..|.+.++|.-. ..|.|+
T Consensus 395 k~y~ftDRglYR--pGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~ 472 (1621)
T COG2373 395 KVYLFTDRGLYR--PGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYT 472 (1621)
T ss_pred EEEEecCcccCC--CCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEE
Confidence 444443 24543 5899999998887533 457899999998654 45557889999999643 359999
Q ss_pred EEEEECCe-eCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCc
Q psy192 277 VGIKFNDQ-HIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAV 331 (898)
Q Consensus 277 i~V~~~g~-~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~ 331 (898)
|.+.+.|+ .+....|.|.-.- .+.=++.+... ...+..|++..+.|+.+...
T Consensus 473 l~~~~~~~~~~~s~~f~V~df~--p~r~~i~l~~~-k~~~~~g~~v~~~v~~~yL~ 525 (1621)
T COG2373 473 LELYTGGKSAVISMSFRVEDFI--PDRFKINLTLD-KTEWVPGKDVKIKVDLRYLY 525 (1621)
T ss_pred EEEEeCCccceeeeeEEhhHhC--CceEEEecccc-cccccCCCcEEEEEEEEecC
Confidence 99999987 5666677663221 12122333222 22357888999999876443
No 78
>KOG1948|consensus
Probab=31.30 E-value=1.2e+03 Score=29.59 Aligned_cols=122 Identities=20% Similarity=0.254 Sum_probs=64.5
Q ss_pred eEEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcC----
Q psy192 348 CFIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTC---- 423 (898)
Q Consensus 348 ~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~---- 423 (898)
|.-....+|.|.+.-.| .|+|.+.-.|-|+ |..++|.+ .+-++.+.-+-|+-...-+-+-|+|..|
T Consensus 256 c~~vsd~~G~fsfksvP--sGkY~l~a~y~ge-----~~~fdvSP---~~l~v~Vehd~lqi~~ef~vtgfSvtGRVl~g 325 (1165)
T KOG1948|consen 256 CIGVSDPRGRFSFKSVP--SGKYYLAASYVGE-----PKSFDVSP---NPLKVVVEHDHLQIASEFRVTGFSVTGRVLVG 325 (1165)
T ss_pred EEEEEcCCceEEEEEcC--CCCEEEEEEecCC-----ceEEEeCC---CceeEEEeccceeccceeEEEEEEeeeeEEeC
Confidence 33344568887777666 7999998887654 44555532 2333333322232222222233444332
Q ss_pred --CCCCCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCeEEEEecc
Q psy192 424 --NAGAGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPFKVKCTGK 486 (898)
Q Consensus 424 --~aG~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf~v~v~~~ 486 (898)
+-|-....+.+.|..+. ....+| |+++= -.+.|.|+|+++ -+|..=|+.+++|.+.
T Consensus 326 ~~g~~l~gvvvlvngk~~~----kTdaqGyykLen-~~t~gtytI~a~--kehlqFstv~~kv~pn 384 (1165)
T KOG1948|consen 326 SKGLPLSGVVVLVNGKSGG----KTDAQGYYKLEN-LKTDGTYTITAK--KEHLQFSTVHAKVKPN 384 (1165)
T ss_pred CCCCCccceEEEEcCcccc----eEcccceEEeee-eeccCcEEEEEe--ccceeeeeEEEEecCC
Confidence 11222333444443322 122456 66654 346789988776 4566667888877654
No 79
>PRK10301 hypothetical protein; Provisional
Probab=30.12 E-value=1.5e+02 Score=28.13 Aligned_cols=68 Identities=24% Similarity=0.509 Sum_probs=38.1
Q ss_pred eEEEEEEECCEEecCCCCeeEEEecCccceEEEE----eCCCC-eEEEEEe-eCCCeEEEEEEE---ECCeecCCC-CeE
Q psy192 30 LHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCK----DNADG-SLNISYR-PTEPGYYIINLK---FADHHVEGS-PFT 99 (898)
Q Consensus 30 ~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~----d~~dG-~~~v~yt-P~~~G~~~I~V~---~~g~~i~gS-Pf~ 99 (898)
+-+|++.||...-++. ..+++.++.+..+... +..++ ++.+..- +-.+|.|+|+-+ -+|+++.|+ .|+
T Consensus 45 P~~V~L~F~e~v~~~~--s~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~~~F~ 122 (124)
T PRK10301 45 PQALTLNFSEGIEPGF--SGATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGHYTFS 122 (124)
T ss_pred CCEEEEEcCCCccccc--cEEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCeEEEE
Confidence 4578888875543322 3466667766555422 12222 3444442 357888887644 567777776 443
No 80
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=29.99 E-value=1.8e+02 Score=30.80 Aligned_cols=45 Identities=18% Similarity=0.255 Sum_probs=24.2
Q ss_pred CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEEe
Q psy192 320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFMP 364 (898)
Q Consensus 320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P 364 (898)
.+.|.++..++...+.+.|++.+|+.+. ..+-....|.+.+.|-.
T Consensus 117 ~~~~~~~l~~~a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG 162 (230)
T PRK12633 117 ATPFGIDLQGDATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDG 162 (230)
T ss_pred ceeEEEecCCcCcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECC
Confidence 3445555444444577778887777643 22323345555555544
No 81
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=29.87 E-value=2.1e+02 Score=27.01 Aligned_cols=53 Identities=28% Similarity=0.408 Sum_probs=37.2
Q ss_pred eEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEecC
Q psy192 241 LAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSP 297 (898)
Q Consensus 241 l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~ 297 (898)
..|.+.+.....+...-+.||+|.+.=+| +|.|.|+|..-+-.. +|++|.|.+
T Consensus 10 t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F--~~~RVdV~~ 62 (123)
T PF09430_consen 10 TRVTLNGGQYRPISAFVRSDGSFVFHNVP--PGSYLLEVHSPDYVF--PPYRVDVSS 62 (123)
T ss_pred EEEEEeCCCccceEEEecCCCEEEeCCCC--CceEEEEEECCCccc--cCEEEEEec
Confidence 34555554443556666889988776665 699999998776665 578888873
No 82
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=29.77 E-value=2e+02 Score=27.69 Aligned_cols=56 Identities=25% Similarity=0.448 Sum_probs=41.2
Q ss_pred EEEEEEE---CCEEecCCCCeeEEEecCccc----eEEEEeCCCCeEEEEEeeCCCeEEEEEEE
Q psy192 31 HELALKF---NGDHVQGYGGLSLSIEGPSKA----EIQCKDNADGSLNISYRPTEPGYYIINLK 87 (898)
Q Consensus 31 h~v~v~~---~G~~v~G~g~l~v~v~GP~~~----~i~~~d~~dG~~~v~ytP~~~G~~~I~V~ 87 (898)
-.+.+.+ +|.|+.+. .+.+.+.-|+.. .+..+..++|.|........+|.|.|.|.
T Consensus 69 ~~~~i~~~d~~g~~~~~~-~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~ 131 (146)
T PF05751_consen 69 NSLTIRLTDPNGAPVSGA-KLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLLKKGRWYLRLD 131 (146)
T ss_pred CeEEEEEEcCCCCcCcCc-eEEEEEECCCCccCCeeEEeeECCCceEEEEcCCCCCccEEEEEE
Confidence 3445555 56665433 577888888754 45666789999999998889999999994
No 83
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=29.59 E-value=4.7e+02 Score=24.44 Aligned_cols=105 Identities=14% Similarity=0.117 Sum_probs=57.8
Q ss_pred CceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCC--eEEEEEEecCCeeEEEEEEE----CCeecCCCCe
Q psy192 313 GVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGD--NYSIRFMPRENGIHNIHIKF----NGVHIPGSPL 386 (898)
Q Consensus 313 ~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g--~y~v~f~P~~~G~h~i~V~~----~g~~I~gSPf 386 (898)
..+.+|+.+.+.|.+....+.+-..+.+ .|+.......+..++ .+.+..++.......+.+.+ +|+.+.+| .
T Consensus 8 ~~~~~Ge~~~v~v~~~~~~~~~~~~v~s-~g~I~~~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v~~~~g~~~~~s-~ 85 (136)
T PF07703_consen 8 DSYKPGETAKVTVQSPFPNGTFLYLVES-RGKIVSTGSVELKNGSTTFEFPVTPDMAPNFYVLAYYVRPADGEVVADS-V 85 (136)
T ss_dssp SSB-TTSEEEEEEEEESCESEEEEEEEE-TTEEEEEEEEECTTTSSEEEEEE-GGGTSEEEEEEEEETTCTCEEEEEE-E
T ss_pred CCcCCCCEEEEEEEcCCCccEEEEEEEE-CCeEEEEEEEEecCCcEEEEEecchhcCCcEEEEEEEEcCCCCeEEEEE-E
Confidence 3578899999999987766767777765 344443433333333 57777778888777777665 45555443 2
Q ss_pred EEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEc
Q psy192 387 RIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDT 422 (898)
Q Consensus 387 ~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~ 422 (898)
.+.|.. .-..++.+.-.- .....|+.+++.|.+
T Consensus 86 ~i~V~~--~~~~~v~l~~~~-~~~~Pg~~~~~~i~~ 118 (136)
T PF07703_consen 86 WIEVEP--CFELKVELTASP-DEYKPGEEVTLRIKA 118 (136)
T ss_dssp EEEBGC--SGSSSEEEEESS-SSBTTTSEEEEEEEE
T ss_pred EEEecc--cccceEEEEEec-ceeCCCCEEEEEEEe
Confidence 223321 122233332210 233456666666665
No 84
>KOG1692|consensus
Probab=29.49 E-value=1.9e+02 Score=29.44 Aligned_cols=56 Identities=21% Similarity=0.338 Sum_probs=37.8
Q ss_pred eeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEE
Q psy192 125 EVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVS 183 (898)
Q Consensus 125 ~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~ 183 (898)
..|....+++..-+.+-.++..+|.+|+|+...-...+ ..|+| +|+....|.|+..
T Consensus 38 ~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~-ssgk~--tF~a~~~G~Y~fC 93 (201)
T KOG1692|consen 38 EEGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRE-SSGKY--TFTAPKKGTYTFC 93 (201)
T ss_pred ccCCEEEEEEEEecCCccceeEEEECCCCchhhhcccc-cCceE--EEEecCCceEEEE
Confidence 35677777776665566778899999999865333333 33555 5667777877765
No 85
>PRK10301 hypothetical protein; Provisional
Probab=28.76 E-value=1.4e+02 Score=28.23 Aligned_cols=55 Identities=24% Similarity=0.275 Sum_probs=30.9
Q ss_pred EEEEeCCCCCeeeeE---EEEcCCCEEEEEEE-cCcceeEEEEEEE---CCeecCCCceEEEE
Q psy192 145 SATVTSPGGVTEDAE---INEVEDGLYAVHFV-PKELGVHTVSVRY---KDIHIPGSPFQFTV 200 (898)
Q Consensus 145 ~~~V~~p~G~~~~~~---v~~~~~g~y~v~f~-P~~~G~h~I~V~~---~g~~V~gSPf~v~V 200 (898)
.+++.+++|+.+... +...+..++.+..- +-.+|.|+|+-+. +|.++.|+ |.|+|
T Consensus 62 ~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V 123 (124)
T PRK10301 62 GATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGH-YTFSV 123 (124)
T ss_pred EEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence 467777877654321 11122223444432 3467888776554 77888887 55554
No 86
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=28.46 E-value=46 Score=29.94 Aligned_cols=32 Identities=25% Similarity=0.359 Sum_probs=13.9
Q ss_pred cEEEEEEE-cccccEEEEE---EECCEEcCCCCeEEE
Q psy192 720 NLGISFTP-REVGSHLVSV---KKMGVHIKNSPFKIN 752 (898)
Q Consensus 720 ~y~v~ftP-~e~G~h~i~V---~~~G~~i~gSPf~i~ 752 (898)
.+.+...+ ...|.|+|.- .-||.++.|+ |.|.
T Consensus 60 ~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~ 95 (97)
T PF04234_consen 60 TLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFT 95 (97)
T ss_dssp EEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEE
T ss_pred EEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEE
Confidence 34444433 4444444433 2366666555 4443
No 87
>PF11896 DUF3416: Domain of unknown function (DUF3416); InterPro: IPR021828 This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=28.28 E-value=2.3e+02 Score=29.04 Aligned_cols=60 Identities=17% Similarity=0.292 Sum_probs=36.3
Q ss_pred eeeeeeEEEEEeccCCCCceEEEE--EcCCc---ceeeEEECCCCcEEEEEEeCCCceEEEEEEE
Q psy192 222 EQNQPCEFNVWTREAGAGSLAISV--EGPSK---AEIDFKDRKDGSCYVSYVVAEPGEYRVGIKF 281 (898)
Q Consensus 222 ~~g~~~~F~V~~~daG~g~l~v~v--~gps~---~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~ 281 (898)
.+|++..++-+.--.|-..|.+.+ ..+++ .++.+..-++..|..+|+|..+|.|+..|.-
T Consensus 24 vvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~VeA 88 (187)
T PF11896_consen 24 VVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVEA 88 (187)
T ss_dssp ETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEEE
T ss_pred ecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEEE
Confidence 567777777776666777776555 45544 3466667788899999999999999888753
No 88
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=27.05 E-value=89 Score=27.29 Aligned_cols=29 Identities=28% Similarity=0.525 Sum_probs=26.0
Q ss_pred EEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192 625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGE 653 (898)
Q Consensus 625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~ 653 (898)
+.+..+|+|.|......+|.|.|+|++++
T Consensus 16 L~~~~~G~FLiR~s~~~~~~~~Lsv~~~~ 44 (94)
T cd00173 16 LKKKPDGTFLVRDSESSPGDYVLSVRVKG 44 (94)
T ss_pred HhcCCCceEEEEecCCCCCCEEEEEEECC
Confidence 34467999999999999999999999988
No 89
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=27.00 E-value=73 Score=27.85 Aligned_cols=27 Identities=22% Similarity=0.327 Sum_probs=24.4
Q ss_pred eCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192 861 TGRNNFEVKYIVRDRGEYLLIVKWGDD 887 (898)
Q Consensus 861 ~g~~~y~v~y~~~~~G~y~l~vk~~~~ 887 (898)
..+|+|-++.....+|.|.|+|++.++
T Consensus 19 ~~~G~FLiR~s~~~~~~~~Lsv~~~~~ 45 (94)
T cd00173 19 KPDGTFLVRDSESSPGDYVLSVRVKGK 45 (94)
T ss_pred CCCceEEEEecCCCCCCEEEEEEECCE
Confidence 468999999999999999999999873
No 90
>KOG1690|consensus
Probab=26.09 E-value=1.8e+02 Score=29.85 Aligned_cols=79 Identities=15% Similarity=0.183 Sum_probs=45.6
Q ss_pred CCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEe-cCCCCee--EEEecCccc-eEEEEeCCCCeEEEEEeeCCC
Q psy192 4 GNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHV-QGYGGLS--LSIEGPSKA-EIQCKDNADGSLNISYRPTEP 79 (898)
Q Consensus 4 g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v-~G~g~l~--v~v~GP~~~-~i~~~d~~dG~~~v~ytP~~~ 79 (898)
|-..+|-+++..+|+.- -|.|.+.+.-+-... .++++++ |.|+-|-.. .+-.......+=+++||...+
T Consensus 27 ~~e~KCF~eelpk~tmv-------~G~yk~qlyd~~~~~y~~~p~~gm~VeV~e~fdnnh~Vl~q~~ss~G~ftFta~~~ 99 (215)
T KOG1690|consen 27 GTEKKCFIEELPKGTMV-------TGNYKAQLYDDQLKGYGSYPNIGMHVEVKETFDNNHVVLSQQYSSEGDFTFTALTP 99 (215)
T ss_pred CCcccchhhhCCCCcEE-------EeeeeeeeecchhcccccCCCceEEEEeecCCCCceEEEeecCCCCCceEEEccCC
Confidence 55677888888886632 244444443221111 1334444 555555543 233333344455689999999
Q ss_pred eEEEEEEEEC
Q psy192 80 GYYIINLKFA 89 (898)
Q Consensus 80 G~~~I~V~~~ 89 (898)
|+|.|.+.-+
T Consensus 100 GeH~IC~~s~ 109 (215)
T KOG1690|consen 100 GEHRICIQSN 109 (215)
T ss_pred CceEEEEecc
Confidence 9999999854
No 91
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.54 E-value=2e+02 Score=30.13 Aligned_cols=45 Identities=11% Similarity=0.084 Sum_probs=21.9
Q ss_pred CeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEe
Q psy192 320 PTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMP 364 (898)
Q Consensus 320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P 364 (898)
...|.++..++...+.+.|.+.+|+.....+-....|.+.+.|-+
T Consensus 111 ~~~~~~~L~~~a~~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG 155 (218)
T PRK09619 111 PVAGRLTLKHPAPTLTLHITDILGQEKKIDLGKQPAGPVNFTLDP 155 (218)
T ss_pred eeEEEEecCCcCcEEEEEEEeCCCCEEEEecCCcCCCceeEEECC
Confidence 334444433333346777777777642222222344555555544
No 92
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.38 E-value=2.2e+02 Score=30.72 Aligned_cols=45 Identities=2% Similarity=-0.046 Sum_probs=23.5
Q ss_pred CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEEe
Q psy192 320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFMP 364 (898)
Q Consensus 320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P 364 (898)
++.|.++......++.+.|.+.+|+.+. ..+.....|.+.+.|-.
T Consensus 129 ~~~~~~~l~~~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG 174 (259)
T PRK12812 129 LIALKLYFPEDSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDG 174 (259)
T ss_pred eeEEEEecCCcCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECC
Confidence 4555555444444577777777776542 22223344555444433
No 93
>KOG1692|consensus
Probab=25.17 E-value=1.6e+02 Score=30.04 Aligned_cols=55 Identities=22% Similarity=0.262 Sum_probs=35.8
Q ss_pred eCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEE
Q psy192 412 SGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVS 466 (898)
Q Consensus 412 ~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~ 466 (898)
.|...+.+-..-+.|.-.+++.|.||.+-.+.-.+.+.+=..+|++...|.|+..
T Consensus 39 ~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~ssgk~tF~a~~~G~Y~fC 93 (201)
T KOG1692|consen 39 EGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRESSGKYTFTAPKKGTYTFC 93 (201)
T ss_pred cCCEEEEEEEEecCCccceeEEEECCCCchhhhcccccCceEEEEecCCceEEEE
Confidence 3444444444444567788999999987554433333335678899999999873
No 94
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=23.77 E-value=83 Score=26.02 Aligned_cols=42 Identities=24% Similarity=0.202 Sum_probs=25.8
Q ss_pred cceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECC
Q psy192 142 FDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKD 188 (898)
Q Consensus 142 ~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g 188 (898)
..+.+.++..+|+.+ +..+-+.+..|-...+|.|.|.+++.+
T Consensus 17 ~~~~v~at~~dG~~~-----~~~~~~vs~~~d~~~~G~y~Vt~~y~~ 58 (67)
T PF07523_consen 17 TGLFVTATYSDGTSL-----PLSDVTVSGTVDTSKAGTYTVTYTYKG 58 (67)
T ss_dssp HCHEEEEEETTS-ES------GCCSEEES---TTS-CCEEEEEEECT
T ss_pred cCCEEEEEEcCCCEe-----ceeeeEEEeeeecCCCceEEEEEEECC
Confidence 345677777788762 233344444678899999999999999
No 95
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.77 E-value=2.3e+02 Score=29.70 Aligned_cols=32 Identities=16% Similarity=0.243 Sum_probs=14.9
Q ss_pred ceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc
Q psy192 143 DLSATVTSPGGVTEDAEINEVEDGLYAVHFVP 174 (898)
Q Consensus 143 ~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P 174 (898)
.+++.|.+.+|+......-....|.+.+.|-+
T Consensus 124 ~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG 155 (218)
T PRK09619 124 TLTLHITDILGQEKKIDLGKQPAGPVNFTLDP 155 (218)
T ss_pred EEEEEEEeCCCCEEEEecCCcCCCceeEEECC
Confidence 35666666666642222222334444444443
No 96
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=23.75 E-value=2.3e+02 Score=25.84 Aligned_cols=60 Identities=12% Similarity=0.129 Sum_probs=34.7
Q ss_pred EEEecCCccceEEEEECCCCCeeeeEEEe-cCCCeEEEEEEecCCeeEEEEEEECCe-ecCC
Q psy192 324 LVRKNGAVGALDAKVISPSGTEDDCFIQP-IDGDNYSIRFMPRENGIHNIHIKFNGV-HIPG 383 (898)
Q Consensus 324 ~V~~~~a~G~l~v~v~~p~g~~~~~~v~~-~~~g~y~v~f~P~~~G~h~i~V~~~g~-~I~g 383 (898)
.|.-....+++.++|++..|..+--.... .....+.+...-...|.|.|.+..+.- .+-|
T Consensus 40 ~I~F~~~~~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G 101 (106)
T PF11589_consen 40 SIEFESPIGDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYG 101 (106)
T ss_dssp EEEESS--SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEE
T ss_pred EEEEcCCCCCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEE
Confidence 44444455778888888778765333332 234467777766678999999887754 5443
No 97
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=23.39 E-value=2.7e+02 Score=29.50 Aligned_cols=14 Identities=21% Similarity=0.463 Sum_probs=8.5
Q ss_pred ceEEEEeCCCCCee
Q psy192 143 DLSATVTSPGGVTE 156 (898)
Q Consensus 143 ~~~~~V~~p~G~~~ 156 (898)
++++.|.+.+|+.+
T Consensus 130 ~v~v~I~D~~G~vV 143 (230)
T PRK12633 130 KVTVKVLDPSGAVV 143 (230)
T ss_pred EEEEEEEeCCCCEE
Confidence 35666666666654
No 98
>PF06016 Reovirus_L2: Reovirus core-spike protein lambda-2 (L2); InterPro: IPR010311 This family consists of several Reovirus core-spike protein lambda-2 (L2) sequences. The reovirus L2 genome segment encodes the core spike protein lambda-2, which mediates enzymatic reactions in 5' capping of the viral plus-strand transcripts [].; GO: 0004482 mRNA (guanine-N7-)-methyltransferase activity, 0004484 mRNA guanylyltransferase activity, 0005524 ATP binding, 0006370 mRNA capping, 0019028 viral capsid; PDB: 1EJ6_A 3IYL_W 3K1Q_A.
Probab=23.35 E-value=1.3e+02 Score=39.24 Aligned_cols=105 Identities=11% Similarity=0.231 Sum_probs=53.2
Q ss_pred ceecCCCeEEEEEecCCccceEEE-----EE-CCCCCe----eeeEE-EecCCCeEEEEEEecCCeeEEEEEEECCeecC
Q psy192 314 VVMADKPTQFLVRKNGAVGALDAK-----VI-SPSGTE----DDCFI-QPIDGDNYSIRFMPRENGIHNIHIKFNGVHIP 382 (898)
Q Consensus 314 ~~~~g~~~~f~V~~~~a~G~l~v~-----v~-~p~g~~----~~~~v-~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~ 382 (898)
...+|+.+++.|..-+....+.+. +. ...|.. -++.. -+..+..+.++|+|..+|.|.+.+..++..|+
T Consensus 1034 ~~~VGq~~~i~I~~~~~~~s~~~~~~gV~v~~~~~~~~~s~~~~v~~q~~~~~~~~~l~~tp~~aGi~~vv~~~~~~~v~ 1113 (1289)
T PF06016_consen 1034 QLVVGQQCSITIPGPNPQASVSVTHDGVEVFTGTAGNLVSMVGPVALQYDPANNEWTLTWTPSTAGILDVVVDDGPVPVP 1113 (1289)
T ss_dssp SSCTTT-EEEEEESS-TTEEEEEECTTCEEEECCCTTTCESSSEEEEEEETTTTEEEEEEE-SS-CEEEEEECTTTCCEE
T ss_pred CccccceEEEEEccCCCCceEEEEECCEEEEEecCCcccccccceeEEEeccCceEEEEEccCCCceEEEEeccCCcccc
Confidence 357899999999865443222222 11 122321 11222 24567899999999999999997777776665
Q ss_pred CCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEc
Q psy192 383 GSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDT 422 (898)
Q Consensus 383 gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~ 422 (898)
- ..+.|.. .|+.-+.....-+..-.+|..++.++|.
T Consensus 1114 ~--GS~~i~~--Pd~~i~v~~Pa~~D~T~aGtd~~I~~d~ 1149 (1289)
T PF06016_consen 1114 L--GSFTIEP--PDPTITVTWPANLDYTDAGTDVNITCDP 1149 (1289)
T ss_dssp E--EEEEEB-------EEEE--TT--SSTT-EEEEEEE-T
T ss_pred c--ceeEeeC--CCCceEEecccceeeeccCceeeEEecc
Confidence 3 2344432 5776666655444444556555555554
No 99
>PF05404 TRAP-delta: Translocon-associated protein, delta subunit precursor (TRAP-delta); InterPro: IPR008855 This family consists of several eukaryotic translocon-associated protein, delta subunit precursors (TRAP-delta or SSR-delta). The exact function of this protein is unknown [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=23.24 E-value=3.2e+02 Score=27.39 Aligned_cols=52 Identities=19% Similarity=0.267 Sum_probs=0.0
Q ss_pred EEEEEEcC-CCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcC----cceeEEEEEE
Q psy192 130 CKLTFKMP-GITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPK----ELGVHTVSVR 185 (898)
Q Consensus 130 ~~f~v~~~-~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~----~~G~h~I~V~ 185 (898)
++|+++.. +.....|-|+| +|+..|+ .+..+.++|.|+|+-. ..|+|.|.+.
T Consensus 45 ~EFsl~C~n~~~~~~LyAeV---~Gk~~PV-ar~~~~nkYQVSW~~e~k~a~sG~y~V~~f 101 (167)
T PF05404_consen 45 AEFSLKCSNGAKNISLYAEV---NGKILPV-ARSGDTNKYQVSWTEEHKKASSGTYEVKFF 101 (167)
T ss_pred EEEEEEeCCCCcCccEEEEE---CCEEEEE-EEcCCCCceEEEEEechhhccCCceEEEEe
No 100
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=22.46 E-value=1.4e+02 Score=24.68 Aligned_cols=41 Identities=20% Similarity=0.255 Sum_probs=23.4
Q ss_pred eEEEEECCCCCeeeeEEEecCCCeEEEEEEecCCeeEEEEEEECCe
Q psy192 334 LDAKVISPSGTEDDCFIQPIDGDNYSIRFMPRENGIHNIHIKFNGV 379 (898)
Q Consensus 334 l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~ 379 (898)
+.+.....+|.. .+.++-+....|....+|.|.|.++|++.
T Consensus 19 ~~v~at~~dG~~-----~~~~~~~vs~~~d~~~~G~y~Vt~~y~~~ 59 (67)
T PF07523_consen 19 LFVTATYSDGTS-----LPLSDVTVSGTVDTSKAGTYTVTYTYKGV 59 (67)
T ss_dssp HEEEEEETTS-E-----S-GCCSEEES---TTS-CCEEEEEEECTE
T ss_pred CEEEEEEcCCCE-----eceeeeEEEeeeecCCCceEEEEEEECCE
Confidence 445556666755 22233334446789999999999999983
No 101
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=22.40 E-value=2.9e+02 Score=22.66 Aligned_cols=53 Identities=23% Similarity=0.311 Sum_probs=31.1
Q ss_pred eeeCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEE
Q psy192 410 IKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKY 469 (898)
Q Consensus 410 ~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~ 469 (898)
...|+++.|.... .....+....+=.+ ......+-.++|+-..+|.|.|.|+.
T Consensus 8 ~~~g~~v~f~~~~--~~g~~~~y~W~fgd-----~~~~~~~~~~t~ty~~~G~y~V~ltv 60 (69)
T PF00801_consen 8 VPTGQPVTFTASS--SDGSPVTYSWDFGD-----NGTVSTGSSVTHTYSSPGTYTVTLTV 60 (69)
T ss_dssp EBTTSEEEEEETT--TTSSECEEEEE-SS-----ESEEECSSEEEEEESSSEEEEEEEEE
T ss_pred EcCCCEEEEEEEc--cCCCCeEEEEEECC-----CCccccCCCEEEEcCCCeEEEEEEEE
Confidence 4567888888777 22233444333111 00112234778888999999998875
No 102
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=22.24 E-value=1.5e+02 Score=27.04 Aligned_cols=52 Identities=13% Similarity=0.069 Sum_probs=35.7
Q ss_pred cceEEEEeCCCCCeeeeEEEEcC-CCEEEEEEEcCcceeEEEEEEECCe-ecCC
Q psy192 142 FDLSATVTSPGGVTEDAEINEVE-DGLYAVHFVPKELGVHTVSVRYKDI-HIPG 193 (898)
Q Consensus 142 ~~~~~~V~~p~G~~~~~~v~~~~-~g~y~v~f~P~~~G~h~I~V~~~g~-~V~g 193 (898)
++++++|++-.|..+..+...+. ...+.+.--=...|.|.|.+..+.- .+-|
T Consensus 48 ~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G 101 (106)
T PF11589_consen 48 GDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYG 101 (106)
T ss_dssp SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEE
T ss_pred CCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEE
Confidence 56888999988988776666544 3467777655679999999998865 6655
No 103
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.78 E-value=5.6e+02 Score=29.05 Aligned_cols=25 Identities=24% Similarity=0.429 Sum_probs=19.1
Q ss_pred EEEEEECCe--ecCCCceEEEEccCcc
Q psy192 646 KIAVKFGEK--HIKGSPYLAKITGEGR 670 (898)
Q Consensus 646 ~i~V~~~~~--~I~GSPF~v~V~~~~~ 670 (898)
-+.|+|+.. +|+|-||+++|+=.+.
T Consensus 53 pvdvlYD~~~y~isg~~etV~Vtl~G~ 79 (403)
T COG4856 53 PVDVLYDSDKYFISGQPETVTVTLKGP 79 (403)
T ss_pred eeEEEEccccccccCCceEEEEEEeCC
Confidence 366888764 8999999998875543
No 104
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=20.46 E-value=5.2e+02 Score=21.84 Aligned_cols=67 Identities=15% Similarity=0.266 Sum_probs=0.0
Q ss_pred eecCceeEEEEEcCCCCCce---eEEEEeCCCccccccccceEEEEeeCCceEEEEEEeC-----CCeeEEEEEEE
Q psy192 817 IKSGVKTDFIVDTCNAGAGT---LAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVR-----DRGEYLLIVKW 884 (898)
Q Consensus 817 ~~~g~~~~f~vdt~~aG~g~---l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~-----~~G~y~l~vk~ 884 (898)
+..|+...|++.-+|.|... +.+.++.|++-.+..--.++. .-....+-.+.+... .+|.|.|.++.
T Consensus 1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~~~~~~~~-~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a 75 (78)
T PF10633_consen 1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVSASPASVP-SLPPGESVTVTFTVTVPADAAPGTYTVTVTA 75 (78)
T ss_dssp --TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE---EEEEE---B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred CCCCCEEEEEEEEEECCCCceeeEEEEEeCCCCccccCCccccc-cCCCCCEEEEEEEEECCCCCCCceEEEEEEE
Done!