Query psy4599
Match_columns 201
No_of_seqs 141 out of 172
Neff 5.0
Searched_HMMs 46136
Date Fri Aug 16 21:01:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4599.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4599hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3317|consensus 100.0 5.2E-44 1.1E-48 295.5 10.6 141 1-191 42-182 (188)
2 PF05753 TRAP_beta: Translocon 100.0 9.9E-42 2.1E-46 285.2 12.1 142 1-191 38-179 (181)
3 KOG3317|consensus 100.0 1.7E-28 3.6E-33 203.4 8.4 79 80-158 44-122 (188)
4 PF05753 TRAP_beta: Translocon 99.9 5.6E-26 1.2E-30 190.2 11.0 81 82-162 42-122 (181)
5 PF07705 CARDB: CARDB; InterP 96.7 0.019 4.1E-07 41.3 9.0 60 2-70 20-81 (101)
6 COG1470 Predicted membrane pro 96.3 0.22 4.8E-06 48.0 15.8 138 4-148 287-466 (513)
7 PF10633 NPCBM_assoc: NPCBM-as 96.3 0.017 3.6E-07 41.4 6.2 65 2-70 6-73 (78)
8 TIGR01451 B_ant_repeat conserv 96.1 0.02 4.4E-07 38.9 5.5 38 2-41 13-50 (53)
9 COG1361 S-layer domain [Cell e 96.1 0.28 6E-06 46.3 15.0 149 2-154 48-245 (500)
10 PF01345 DUF11: Domain of unkn 95.7 0.025 5.5E-07 40.1 5.0 35 2-38 42-76 (76)
11 PF10633 NPCBM_assoc: NPCBM-as 95.5 0.037 7.9E-07 39.6 5.3 63 82-148 9-74 (78)
12 COG1361 S-layer domain [Cell e 94.8 1.2 2.7E-05 41.9 14.8 73 3-77 169-245 (500)
13 PF07705 CARDB: CARDB; InterP 94.2 0.32 7E-06 34.8 7.4 58 82-148 23-82 (101)
14 PF13473 Cupredoxin_1: Cupredo 92.8 0.55 1.2E-05 35.1 6.8 63 65-146 29-91 (104)
15 PF13584 BatD: Oxygen toleranc 92.4 9.6 0.00021 35.7 16.7 66 11-76 25-97 (484)
16 PF13473 Cupredoxin_1: Cupredo 91.7 0.83 1.8E-05 34.1 6.7 24 44-69 68-91 (104)
17 TIGR01451 B_ant_repeat conserv 90.8 0.61 1.3E-05 31.5 4.6 33 83-117 17-49 (53)
18 PF01345 DUF11: Domain of unkn 87.9 0.95 2.1E-05 31.9 4.1 31 83-115 46-76 (76)
19 PF12690 BsuPI: Intracellular 86.6 3.9 8.4E-05 30.1 6.9 67 3-70 2-82 (82)
20 PF00345 PapD_N: Pili and flag 85.3 14 0.00031 28.0 9.9 84 66-154 2-95 (122)
21 PF07919 Gryzun: Gryzun, putat 84.4 4.5 9.7E-05 38.2 7.8 67 3-74 487-553 (554)
22 PF13584 BatD: Oxygen toleranc 83.8 26 0.00057 32.8 12.6 134 3-138 145-299 (484)
23 TIGR02656 cyanin_plasto plasto 82.9 4 8.7E-05 30.4 5.6 81 55-145 2-82 (99)
24 PF09478 CBM49: Carbohydrate b 79.4 10 0.00022 27.4 6.5 55 4-58 20-79 (80)
25 PF13860 FlgD_ig: FlgD Ig-like 78.7 14 0.0003 26.5 7.1 51 69-128 12-64 (81)
26 PF12690 BsuPI: Intracellular 78.3 13 0.00028 27.3 6.9 27 120-146 53-81 (82)
27 TIGR03096 nitroso_cyanin nitro 75.4 36 0.00078 27.7 9.2 92 31-146 26-117 (135)
28 cd04036 C2_cPLA2 C2 domain pre 71.3 11 0.00023 28.3 5.0 50 95-148 66-116 (119)
29 PF09478 CBM49: Carbohydrate b 70.3 21 0.00045 25.7 6.2 53 82-134 21-78 (80)
30 PF00927 Transglut_C: Transglu 69.9 6.2 0.00014 29.4 3.5 68 1-68 15-84 (107)
31 PF07919 Gryzun: Gryzun, putat 68.4 60 0.0013 30.6 10.5 106 33-145 433-547 (554)
32 cd04036 C2_cPLA2 C2 domain pre 64.8 21 0.00045 26.7 5.5 62 4-70 53-115 (119)
33 KOG4386|consensus 64.2 14 0.0003 37.0 5.3 67 3-74 710-776 (809)
34 COG1470 Predicted membrane pro 63.5 51 0.0011 32.3 8.9 53 2-59 398-451 (513)
35 PF14796 AP3B1_C: Clathrin-ada 63.4 22 0.00048 29.2 5.7 50 3-56 87-136 (145)
36 PF14874 PapD-like: Flagellar- 60.2 63 0.0014 23.4 10.6 61 82-148 24-85 (102)
37 PF11611 DUF4352: Domain of un 59.2 40 0.00086 25.0 6.1 57 3-59 38-100 (123)
38 PF05506 DUF756: Domain of unk 58.7 67 0.0015 23.2 8.0 67 71-148 11-77 (89)
39 COG1572 Uncharacterized conser 58.6 1.5E+02 0.0032 29.9 11.4 58 82-148 427-486 (606)
40 PF08441 Integrin_alpha2: Inte 57.9 1.3E+02 0.0027 28.0 10.4 70 95-167 311-381 (457)
41 PF12584 TRAPPC10: Trafficking 56.7 1E+02 0.0022 24.6 11.4 44 34-78 72-115 (147)
42 PF07610 DUF1573: Protein of u 53.4 53 0.0011 21.1 5.2 38 83-130 1-40 (45)
43 PF10989 DUF2808: Protein of u 52.6 1.2E+02 0.0026 24.3 9.1 36 113-148 88-126 (146)
44 PF03896 TRAP_alpha: Transloco 51.9 1.5E+02 0.0032 27.0 9.5 35 122-157 145-182 (285)
45 PF05506 DUF756: Domain of unk 48.9 1E+02 0.0022 22.3 7.1 53 4-67 21-73 (89)
46 TIGR02656 cyanin_plasto plasto 48.0 45 0.00097 24.7 4.8 48 8-57 28-75 (99)
47 COG1572 Uncharacterized conser 44.3 2.8E+02 0.0061 27.9 10.9 62 1-71 423-486 (606)
48 KOG4386|consensus 44.2 2E+02 0.0043 29.2 9.6 131 12-150 635-775 (809)
49 PRK09918 putative fimbrial cha 42.6 2.2E+02 0.0048 24.5 9.3 65 64-134 24-91 (230)
50 PRK02710 plastocyanin; Provisi 42.5 1.3E+02 0.0027 23.2 6.7 76 50-145 27-102 (119)
51 PRK15188 fimbrial chaperone pr 40.7 1.9E+02 0.0041 25.2 8.2 67 63-134 26-95 (228)
52 PF13860 FlgD_ig: FlgD Ig-like 39.2 1E+02 0.0022 22.0 5.3 16 36-51 49-64 (81)
53 PF10725 DUF2517: Protein of u 39.2 12 0.00025 26.9 0.3 22 167-188 16-37 (63)
54 PF11797 DUF3324: Protein of u 39.1 93 0.002 24.7 5.6 44 1-49 59-106 (140)
55 PF14796 AP3B1_C: Clathrin-ada 38.6 57 0.0012 26.8 4.4 53 77-133 84-136 (145)
56 PF14975 DUF4512: Domain of un 38.5 6.5 0.00014 29.9 -1.1 16 170-185 2-17 (88)
57 PF02102 Peptidase_M35: Deuter 35.8 12 0.00026 35.0 0.0 58 2-59 39-115 (359)
58 PF14874 PapD-like: Flagellar- 35.3 1.7E+02 0.0037 21.1 9.7 60 3-69 22-83 (102)
59 PRK15254 fimbrial chaperone pr 34.7 3.2E+02 0.0068 24.0 8.8 67 63-134 15-84 (239)
60 PRK11385 putativi pili assembl 33.4 3.3E+02 0.0071 23.8 8.7 67 63-135 25-101 (236)
61 PF01835 A2M_N: MG2 domain; I 31.9 1.9E+02 0.0042 20.7 7.5 72 86-166 10-82 (99)
62 PF04495 GRASP55_65: GRASP55/6 31.6 1.8E+02 0.0038 23.4 6.1 95 8-108 2-121 (138)
63 PRK15299 fimbrial chaperone pr 31.6 3.3E+02 0.0073 23.3 8.7 65 64-134 22-92 (227)
64 TIGR03079 CH4_NH3mon_ox_B meth 31.5 63 0.0014 30.8 3.9 57 1-59 282-353 (399)
65 cd08379 C2D_MCTP_PRT_plant C2 31.1 1.8E+02 0.0039 22.7 6.0 46 14-62 61-113 (126)
66 PRK06655 flgD flagellar basal 30.4 3.6E+02 0.0079 23.4 10.7 64 70-142 114-183 (225)
67 PF04744 Monooxygenase_B: Mono 29.6 1.6E+02 0.0035 28.0 6.3 59 1-59 263-334 (381)
68 PRK15208 long polar fimbrial c 28.5 3.8E+02 0.0083 23.0 8.2 66 64-134 21-89 (228)
69 PF14646 MYCBPAP: MYCBP-associ 28.2 3.5E+02 0.0076 25.3 8.4 31 121-151 295-325 (426)
70 COG0832 UreB Urea amidohydrola 27.9 1.5E+02 0.0032 23.4 4.8 37 62-101 2-38 (106)
71 PRK13205 ureB urease subunit b 27.2 1.3E+02 0.0029 25.2 4.7 36 63-101 3-38 (162)
72 TIGR03102 halo_cynanin halocya 25.4 3.3E+02 0.0071 21.2 6.7 73 52-143 22-96 (115)
73 PF01263 Aldose_epim: Aldose 1 25.2 3.8E+02 0.0082 22.8 7.5 81 51-136 214-300 (300)
74 PF00127 Copper-bind: Copper b 25.2 1.2E+02 0.0027 22.2 3.9 14 44-57 62-75 (99)
75 PRK15195 fimbrial chaperone pr 25.0 4.5E+02 0.0098 22.7 8.7 66 64-134 25-93 (229)
76 PRK12813 flgD flagellar basal 25.0 4.7E+02 0.01 22.8 8.8 62 70-143 112-179 (223)
77 PF01002 Flavi_NS2B: Flaviviru 24.4 70 0.0015 25.7 2.6 48 48-102 32-79 (128)
78 PLN02171 endoglucanase 24.4 1.7E+02 0.0036 29.5 5.7 56 4-59 556-615 (629)
79 PRK10378 inactive ferrous ion 24.2 2E+02 0.0043 27.2 5.8 63 67-147 40-103 (375)
80 cd00407 Urease_beta Urease bet 23.9 1.8E+02 0.0038 22.8 4.6 36 63-101 3-38 (101)
81 COG1721 Uncharacterized conser 23.4 2E+02 0.0043 26.8 5.7 72 2-81 69-140 (416)
82 PRK15295 fimbrial assembly cha 23.2 4.9E+02 0.011 22.4 8.6 66 63-134 18-88 (226)
83 PRK13202 ureB urease subunit b 23.2 2.1E+02 0.0045 22.5 4.9 36 63-101 3-39 (104)
84 PRK15246 fimbrial assembly cha 23.2 5E+02 0.011 22.6 8.5 66 64-135 10-83 (233)
85 PRK10378 inactive ferrous ion 22.9 2E+02 0.0044 27.1 5.7 38 30-70 65-103 (375)
86 cd08678 C2_C21orf25-like C2 do 22.9 3.3E+02 0.0072 20.4 7.8 44 15-61 58-102 (126)
87 cd05902 Ig_Neurocan Immunoglob 22.4 1.2E+02 0.0026 23.1 3.5 23 37-59 22-44 (110)
88 TIGR03396 PC_PLC phospholipase 21.7 8.8E+02 0.019 24.8 15.4 132 4-148 516-662 (690)
89 cd08379 C2D_MCTP_PRT_plant C2 21.7 3.4E+02 0.0073 21.1 5.9 44 94-140 64-114 (126)
90 PRK13204 ureB urease subunit b 21.3 2E+02 0.0044 24.2 4.7 37 62-101 25-61 (159)
91 PF06159 DUF974: Protein of un 21.1 5.6E+02 0.012 22.3 15.3 74 4-78 17-94 (249)
92 cd05901 Ig_Versican Immunoglob 20.6 3.7E+02 0.008 21.1 5.9 55 35-89 24-86 (117)
93 PRK15211 fimbrial chaperone pr 20.2 5.8E+02 0.013 22.2 8.6 67 63-135 21-91 (229)
94 PRK13201 ureB urease subunit b 20.1 2.3E+02 0.0051 23.2 4.8 36 63-101 3-38 (136)
No 1
>KOG3317|consensus
Probab=100.00 E-value=5.2e-44 Score=295.49 Aligned_cols=141 Identities=30% Similarity=0.533 Sum_probs=132.2
Q ss_pred CceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeCCC
Q psy4599 1 MDVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASEN 80 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~ 80 (201)
||++++|+|||+|+++|+||+|+|+|||.+.||||+|+++++|||||||+|++|++||||+..|.||+++|+|||+++|+
T Consensus 42 rd~~leY~IyNvGsspAldVtLsD~Sfpt~~FeIvkG~~~~swerIpags~vsHsivl~prv~g~f~~t~atVty~~~e~ 121 (188)
T KOG3317|consen 42 RDVSLEYDIYNVGSSPALDVTLSDNSFPTKTFEIVKGNLSVSWERIPAGSNVSHSIVLRPRVKGVFNGTPATVTYRIPEK 121 (188)
T ss_pred eeeEEEEeeEEcCCCcceeEEecCCCCCccceeeeccccccceeecCCCCceEEEEEEeecccceeccCceEEEEEcCCC
Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEEEEecCCCCcc
Q psy4599 81 AEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAED 160 (201)
Q Consensus 81 a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~yr~s~~a~~ 160 (201)
.. ++..-.+.|.++.++...++ ++|+++ +.
T Consensus 122 g~--------------~~~~~ts~~~~gyila~re~----------------------------------~rr~~~--~~ 151 (188)
T KOG3317|consen 122 GA--------------LQEAYTSPPGPGYILAQREP----------------------------------DRRFDP--RL 151 (188)
T ss_pred Cc--------------eeEEeecCCCCcceeeecCc----------------------------------ccccCh--hH
Confidence 85 46666778887776666665 999999 99
Q ss_pred Cceeeeeehhcchhhhheeeeeehhhhhhhh
Q psy4599 161 GDWAAFAFMTLPPLLIPFLLWFQSKTKYEKL 191 (201)
Q Consensus 161 ~~~~~f~~~~~P~~~~p~~ly~~S~~~y~~~ 191 (201)
++|++|++|++|.+++|++||+.||+||++.
T Consensus 152 l~flgfgviv~p~t~ip~lL~~~sKrrysn~ 182 (188)
T KOG3317|consen 152 LAFLGFGVIVIPMTVIPILLVATSKRRYSNA 182 (188)
T ss_pred HHHHhhhhhhhhhhheeeeEEEecccccccc
Confidence 9999999999999999999999999999975
No 2
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=100.00 E-value=9.9e-42 Score=285.21 Aligned_cols=142 Identities=49% Similarity=0.795 Sum_probs=128.4
Q ss_pred CceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeCCC
Q psy4599 1 MDVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASEN 80 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~ 80 (201)
||++|+|+|||+|+++|+||+|.||+||+++|++|+|+++++|+||+||+|++|+|+|+|++.|.|++++|+|+|+++++
T Consensus 38 ~~v~V~~~iyN~G~~~A~dV~l~D~~fp~~~F~lvsG~~s~~~~~i~pg~~vsh~~vv~p~~~G~f~~~~a~VtY~~~~~ 117 (181)
T PF05753_consen 38 EDVTVTYTIYNVGSSAAYDVKLTDDSFPPEDFELVSGSLSASWERIPPGENVSHSYVVRPKKSGYFNFTPAVVTYRDSEG 117 (181)
T ss_pred cEEEEEEEEEECCCCeEEEEEEECCCCCccccEeccCceEEEEEEECCCCeEEEEEEEeeeeeEEEEccCEEEEEECCCC
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEEEEecCCCCcc
Q psy4599 81 AEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAED 160 (201)
Q Consensus 81 a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~yr~s~~a~~ 160 (201)
+.+ .++.-.+-|.+...+-...+ ..|++. |.
T Consensus 118 ~~~-------------~~~a~Ss~~~~~~I~~~~~~----------------------------------~k~f~~--~~ 148 (181)
T PF05753_consen 118 AKE-------------LQVAYSSPPGEGDILAERDY----------------------------------DKKFSS--HV 148 (181)
T ss_pred Cce-------------eEEEEecCCCcceEEecccc----------------------------------chhhhh--hH
Confidence 765 45555566666554444444 777777 89
Q ss_pred Cceeeeeehhcchhhhheeeeeehhhhhhhh
Q psy4599 161 GDWAAFAFMTLPPLLIPFLLWFQSKTKYEKL 191 (201)
Q Consensus 161 ~~~~~f~~~~~P~~~~p~~ly~~S~~~y~~~ 191 (201)
++|+.|++|++|++++|++||++||+||++.
T Consensus 149 ~~w~~f~~~~~~~~~~p~ll~~~sKsky~~~ 179 (181)
T PF05753_consen 149 MDWGAFAIMTLPVLLIPYLLWYSSKSKYEKS 179 (181)
T ss_pred HHHHhHHHHHHHHHHHHHHhhhhhhhhcccc
Confidence 9999999999999999999999999999885
No 3
>KOG3317|consensus
Probab=99.95 E-value=1.7e-28 Score=203.39 Aligned_cols=79 Identities=38% Similarity=0.711 Sum_probs=76.3
Q ss_pred CCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEEEEecCCCC
Q psy4599 80 NAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENA 158 (201)
Q Consensus 80 ~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~yr~s~~a 158 (201)
-..+|+|||+|+++|+||+|+|+|||++.||||+|+++++|||||||+||+|++||||++.|.||+++|+|+|+.+++.
T Consensus 44 ~~leY~IyNvGsspAldVtLsD~Sfpt~~FeIvkG~~~~swerIpags~vsHsivl~prv~g~f~~t~atVty~~~e~g 122 (188)
T KOG3317|consen 44 VSLEYDIYNVGSSPALDVTLSDNSFPTKTFEIVKGNLSVSWERIPAGSNVSHSIVLRPRVKGVFNGTPATVTYRIPEKG 122 (188)
T ss_pred eEEEEeeEEcCCCcceeEEecCCCCCccceeeeccccccceeecCCCCceEEEEEEeecccceeccCceEEEEEcCCCC
Confidence 3479999999999999999999999999999999999999999999999999999999999999999999999999864
No 4
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=99.93 E-value=5.6e-26 Score=190.24 Aligned_cols=81 Identities=46% Similarity=0.791 Sum_probs=77.3
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEEEEecCCCCccC
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAEDG 161 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~yr~s~~a~~~ 161 (201)
.+|+|||+|+++|+||+|.|++||++.|++++|+++.+|+|||||+|++|+++|+|++.|.|++++|+|+||.++++...
T Consensus 42 V~~~iyN~G~~~A~dV~l~D~~fp~~~F~lvsG~~s~~~~~i~pg~~vsh~~vv~p~~~G~f~~~~a~VtY~~~~~~~~~ 121 (181)
T PF05753_consen 42 VTYTIYNVGSSAAYDVKLTDDSFPPEDFELVSGSLSASWERIPPGENVSHSYVVRPKKSGYFNFTPAVVTYRDSEGAKEL 121 (181)
T ss_pred EEEEEEECCCCeEEEEEEECCCCCccccEeccCceEEEEEEECCCCeEEEEEEEeeeeeEEEEccCEEEEEECCCCCcee
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999985543
Q ss_pred c
Q psy4599 162 D 162 (201)
Q Consensus 162 ~ 162 (201)
+
T Consensus 122 ~ 122 (181)
T PF05753_consen 122 Q 122 (181)
T ss_pred E
Confidence 3
No 5
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=96.71 E-value=0.019 Score=41.34 Aligned_cols=60 Identities=25% Similarity=0.336 Sum_probs=42.8
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCee--eeeEeeeCCCCceEEEEEEEeceeeeEEeee
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQF--KFVIDRVPPQTNVTHVVIVQPKSYGAFNFSF 70 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~--~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~ 70 (201)
+++|.++|.|.|..+|.++.+. | .+.|.. ......|+||+..+..+.+.+...|.+.+..
T Consensus 20 ~~~i~~~V~N~G~~~~~~~~v~--------~-~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~ 81 (101)
T PF07705_consen 20 PVTITVTVKNNGTADAENVTVR--------L-YLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRV 81 (101)
T ss_dssp EEEEEEEEEE-SSS-BEEEEEE--------E-EETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEE
T ss_pred EEEEEEEEEECCCCCCCCEEEE--------E-EECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEE
Confidence 5789999999999998888776 3 233333 3444799999999999999998889888654
No 6
>COG1470 Predicted membrane protein [Function unknown]
Probab=96.34 E-value=0.22 Score=48.03 Aligned_cols=138 Identities=20% Similarity=0.360 Sum_probs=101.1
Q ss_pred EEEEEEEEcCcc-ceeeEEEecCCCCCC-CceEecCeeeeeEeeeCCCCceEEEEEEEec---eeeeEEeeeEEEE----
Q psy4599 4 VVKYTIYNIGNS-VATNVVVSDSSFPPE-AFEIQGGQFKFVIDRVPPQTNVTHVVIVQPK---SYGAFNFSFAEVS---- 74 (201)
Q Consensus 4 tV~y~IyNvG~s-~A~dV~l~D~~fp~~-~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~---~~G~fn~t~A~Vt---- 74 (201)
.+...|-|-|.. .-|++++. ++|+. .-++..|...++-=.|.||+.-.-++.|+|. ..|.||++=+..+
T Consensus 287 sf~V~IeN~g~~~d~y~Le~~--g~pe~w~~~Fteg~~~vt~vkL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~s~s~v 364 (513)
T COG1470 287 SFTVSIENRGKQDDEYALELS--GLPEGWTAEFTEGELRVTSVKLKPGEEKDVTLEVYPSLNATPGTYNVTITASSSSGV 364 (513)
T ss_pred EEEEEEccCCCCCceeEEEec--cCCCCcceEEeeCceEEEEEEecCCCceEEEEEEecCCCCCCCceeEEEEEeccccc
Confidence 356678898876 34555554 34442 1224599999999999999999999999975 5799998877666
Q ss_pred ------------------------EEeC-----CCCceeEEEecccceeeeEEEecCCCCCCceEE-ecCceeeeeeeeC
Q psy4599 75 ------------------------YKAS-----ENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEI-QGGQFKFVIDRVP 124 (201)
Q Consensus 75 ------------------------Y~~s-----~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~i-v~G~~s~~~~rI~ 124 (201)
|+.+ |....+.|.|.|+.+-+||.|+-++ |++ +++ +.+. +.+.|+
T Consensus 365 ~~e~~lki~~~g~~~~~v~l~~g~~~lt~taGee~~i~i~I~NsGna~LtdIkl~v~~-Pqg-Wei~Vd~~---~I~sL~ 439 (513)
T COG1470 365 TRELPLKIKNTGSYNELVKLDNGPYRLTITAGEEKTIRISIENSGNAPLTDIKLTVNG-PQG-WEIEVDES---TIPSLE 439 (513)
T ss_pred eeeeeEEEEeccccceeEEccCCcEEEEecCCccceEEEEEEecCCCccceeeEEecC-Ccc-ceEEECcc---cccccC
Confidence 4432 3337889999999999999998763 433 443 3333 799999
Q ss_pred CCCceEEEEEEe-ee--ecceeeeeee
Q psy4599 125 PQTNVTHVVIVQ-PK--SYGAFNFSFA 148 (201)
Q Consensus 125 pg~nvsH~~vv~-p~--~~G~fn~t~A 148 (201)
||.+.+-.+.++ |. ..|-|-.+..
T Consensus 440 pge~~tV~ltI~vP~~a~aGdY~i~i~ 466 (513)
T COG1470 440 PGESKTVSLTITVPEDAGAGDYRITIT 466 (513)
T ss_pred CCCcceEEEEEEcCCCCCCCcEEEEEE
Confidence 999999999999 43 3477766554
No 7
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=96.27 E-value=0.017 Score=41.38 Aligned_cols=65 Identities=28% Similarity=0.384 Sum_probs=40.0
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEece---eeeEEeee
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKS---YGAFNFSF 70 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~---~G~fn~t~ 70 (201)
.++++.++-|.|+.++.+++|.=+. ++.|. +... ..+...|+||++++..+.|+|-. .|.|.++.
T Consensus 6 ~~~~~~tv~N~g~~~~~~v~~~l~~--P~GW~-~~~~-~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~ 73 (78)
T PF10633_consen 6 TVTVTLTVTNTGTAPLTNVSLSLSL--PEGWT-VSAS-PASVPSLPPGESVTVTFTVTVPADAAPGTYTVTV 73 (78)
T ss_dssp EEEEEEEEE--SSS-BSS-EEEEE----TTSE----E-EEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEE
T ss_pred EEEEEEEEEECCCCceeeEEEEEeC--CCCcc-ccCC-ccccccCCCCCEEEEEEEEECCCCCCCceEEEEE
Confidence 4789999999999999998887432 35566 2222 23445999999999999999543 58877763
No 8
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=96.08 E-value=0.02 Score=38.88 Aligned_cols=38 Identities=32% Similarity=0.393 Sum_probs=31.3
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeee
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKF 41 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~ 41 (201)
.++.++++-|.|..+|.+|.|.|. .|+. +++++|++..
T Consensus 13 ~v~Yti~v~N~g~~~a~~v~v~D~-lP~g-~~~v~~S~~~ 50 (53)
T TIGR01451 13 TITYTITVTNNGNVPATNVVVTDI-LPSG-TTFVSNSVTV 50 (53)
T ss_pred EEEEEEEEEECCCCceEeEEEEEc-CCCC-CEEEeCcEEE
Confidence 367788889999999999999994 6654 8899998653
No 9
>COG1361 S-layer domain [Cell envelope biogenesis, outer membrane]
Probab=96.06 E-value=0.28 Score=46.34 Aligned_cols=149 Identities=21% Similarity=0.229 Sum_probs=100.1
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeee-EeeeCC--CCceEEEEEE---EeceeeeEEeeeEEEEE
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFV-IDRVPP--QTNVTHVVIV---QPKSYGAFNFSFAEVSY 75 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~-~~ri~p--g~nvsH~~vv---~p~~~G~fn~t~A~VtY 75 (201)
|+.+..+|+|.|...+.|+.+... .... |.++.+.+... -..+.. |.-.++.+.+ ...+.|.+++.-..-.+
T Consensus 48 ~~~l~v~~~n~~~~~~~~v~v~i~-~~~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~V~~~a~~g~y~i~v~~~~~ 125 (500)
T COG1361 48 DVDLTVTIENVGELLAEDVKVEIT-PEYP-FSLVSGETLLLSIGTLNFLGGEPATVKFKLTVDENAKSGDYEIDVYVSII 125 (500)
T ss_pred ceEEEEEeccccccccccEEEEEE-eccc-ceeeEEEeecCCCceeeecCCCcceEEEEEEEcCCCCCCcEEEeEEEEEE
Confidence 678999999999998888777763 2222 77777755222 333333 4444444433 35667777776665312
Q ss_pred EeCCCC---------------------------------------ceeEEEecccceeeeEEEecCCCCCCceEEecCce
Q psy4599 76 KASENA---------------------------------------EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQF 116 (201)
Q Consensus 76 ~~s~~a---------------------------------------~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~ 116 (201)
...+++ ..+.|.|.|+.+|.++.|...+ |...+.-+.+..
T Consensus 126 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~v~~~~~~i~~G~~~~l~~~I~N~G~~~~~~v~l~~~~-~~~~~~~i~~~~ 204 (500)
T COG1361 126 GSSDESEETSFTVEVTIYVEVDVPVIESFEVVSSPEAIIPGETNTLTLTIKNPGEGPAKNVSLSLES-PTSYLGPIYSAN 204 (500)
T ss_pred eecccceeEEEEeeeeEEEEEeecccceeEEecCccccCCCCccEEEEEEEeCCcccccceEEEEeC-Ccceeccccccc
Confidence 211111 6789999999999999999865 444444445555
Q ss_pred -eeeeeeeCCCCceEEEEEEeee---ecceeeeeeeEEEEec
Q psy4599 117 -KFVIDRVPPQTNVTHVVIVQPK---SYGAFNFSFAEVSYKA 154 (201)
Q Consensus 117 -s~~~~rI~pg~nvsH~~vv~p~---~~G~fn~t~A~v~yr~ 154 (201)
..-..-+.||.++.-++-+... +.|.|..... ++|+-
T Consensus 205 ~~~~i~~l~p~es~~v~f~v~~~~~a~~g~y~i~i~-i~~~~ 245 (500)
T COG1361 205 DTPYIGALGPGESVNVTFSVYAGSNAEPGTYTINLE-ITYKD 245 (500)
T ss_pred cceeeeeeCCCceEEEEEEEEeecCCCCccEEEEEE-EEEec
Confidence 4679999999999999999888 5788887655 55655
No 10
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=95.72 E-value=0.025 Score=40.11 Aligned_cols=35 Identities=37% Similarity=0.552 Sum_probs=29.3
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCe
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQ 38 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~ 38 (201)
.++.++++.|.|+.+|.+|.|.|. +|.. +++++|+
T Consensus 42 ~v~ytitvtN~G~~~a~nv~v~D~-lp~g-~~~v~~S 76 (76)
T PF01345_consen 42 TVTYTITVTNTGPAPATNVVVTDT-LPAG-LTFVSGS 76 (76)
T ss_pred EEEEEEEEEECCCCeeEeEEEEEc-CCCC-CEEeCCC
Confidence 367888999999999999999994 6544 8888874
No 11
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=95.51 E-value=0.037 Score=39.61 Aligned_cols=63 Identities=25% Similarity=0.377 Sum_probs=38.2
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeee---cceeeeeee
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKS---YGAFNFSFA 148 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~---~G~fn~t~A 148 (201)
...++-|.|+.++.++.++-+. |+++.+-.+ ..+...|+||++++..+.|+|-. .|.|.++..
T Consensus 9 ~~~tv~N~g~~~~~~v~~~l~~--P~GW~~~~~--~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~ 74 (78)
T PF10633_consen 9 VTLTVTNTGTAPLTNVSLSLSL--PEGWTVSAS--PASVPSLPPGESVTVTFTVTVPADAAPGTYTVTVT 74 (78)
T ss_dssp EEEEEE--SSS-BSS-EEEEE----TTSE---E--EEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEEE
T ss_pred EEEEEEECCCCceeeEEEEEeC--CCCccccCC--ccccccCCCCCEEEEEEEEECCCCCCCceEEEEEE
Confidence 3578899999998888887543 466662222 24555999999999999999754 488887653
No 12
>COG1361 S-layer domain [Cell envelope biogenesis, outer membrane]
Probab=94.83 E-value=1.2 Score=41.95 Aligned_cols=73 Identities=25% Similarity=0.252 Sum_probs=60.5
Q ss_pred eEEEEEEEEcCccceeeEEEecCCCCCCCceEecCee-eeeEeeeCCCCceEEEEEEEec---eeeeEEeeeEEEEEEe
Q psy4599 3 VVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQF-KFVIDRVPPQTNVTHVVIVQPK---SYGAFNFSFAEVSYKA 77 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~-~~~~~ri~pg~nvsH~~vv~p~---~~G~fn~t~A~VtY~~ 77 (201)
-+++++|.|.|+..|.++.|...+ |...+.-+.+.. ..-..-|.||+++.-++-+.+. ..|.|... ..++|++
T Consensus 169 ~~l~~~I~N~G~~~~~~v~l~~~~-~~~~~~~i~~~~~~~~i~~l~p~es~~v~f~v~~~~~a~~g~y~i~-i~i~~~~ 245 (500)
T COG1361 169 NTLTLTIKNPGEGPAKNVSLSLES-PTSYLGPIYSANDTPYIGALGPGESVNVTFSVYAGSNAEPGTYTIN-LEITYKD 245 (500)
T ss_pred cEEEEEEEeCCcccccceEEEEeC-CcceeccccccccceeeeeeCCCceEEEEEEEEeecCCCCccEEEE-EEEEEec
Confidence 378999999999999999999854 666677777777 5889999999999999999877 57777664 5677777
No 13
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=94.18 E-value=0.32 Score=34.81 Aligned_cols=58 Identities=24% Similarity=0.264 Sum_probs=38.4
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCc--eeeeeeeeCCCCceEEEEEEeeeecceeeeeee
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQ--FKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFA 148 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~--~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A 148 (201)
..++|.|.|..+|.++.+.- .+.|. -+.....|+||+..+.++.+.+...|.|.+...
T Consensus 23 i~~~V~N~G~~~~~~~~v~~---------~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~ 82 (101)
T PF07705_consen 23 ITVTVKNNGTADAENVTVRL---------YLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVV 82 (101)
T ss_dssp EEEEEEE-SSS-BEEEEEEE---------EETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEE
T ss_pred EEEEEEECCCCCCCCEEEEE---------EECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEE
Confidence 46688999999888777662 11222 233447899999999999999999999886543
No 14
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=92.80 E-value=0.55 Score=35.13 Aligned_cols=63 Identities=24% Similarity=0.385 Sum_probs=38.1
Q ss_pred eEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceee
Q psy4599 65 AFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFN 144 (201)
Q Consensus 65 ~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn 144 (201)
.+.|+|++++=+..+. ..+++-|.|+.+ -++.+.+ +..+ ..|+||.+.+.+| .|.+.|.|.
T Consensus 29 ~~~f~P~~i~v~~G~~-v~l~~~N~~~~~-h~~~i~~--------------~~~~-~~l~~g~~~~~~f--~~~~~G~y~ 89 (104)
T PF13473_consen 29 DFGFSPSTITVKAGQP-VTLTFTNNDSRP-HEFVIPD--------------LGIS-KVLPPGETATVTF--TPLKPGEYE 89 (104)
T ss_dssp -EEEES-EEEEETTCE-EEEEEEE-SSS--EEEEEGG--------------GTEE-EEE-TT-EEEEEE--EE-S-EEEE
T ss_pred CCeEecCEEEEcCCCe-EEEEEEECCCCc-EEEEECC--------------CceE-EEECCCCEEEEEE--cCCCCEEEE
Confidence 3478899888777543 468888888775 4444443 2222 7899999886665 799999887
Q ss_pred ee
Q psy4599 145 FS 146 (201)
Q Consensus 145 ~t 146 (201)
|.
T Consensus 90 ~~ 91 (104)
T PF13473_consen 90 FY 91 (104)
T ss_dssp EB
T ss_pred EE
Confidence 64
No 15
>PF13584 BatD: Oxygen tolerance
Probab=92.40 E-value=9.6 Score=35.73 Aligned_cols=66 Identities=18% Similarity=0.353 Sum_probs=41.6
Q ss_pred EcCccceeeEEEec----CCCCC-CCceEecCeeeeeEeeeCCC--CceEEEEEEEeceeeeEEeeeEEEEEE
Q psy4599 11 NIGNSVATNVVVSD----SSFPP-EAFEIQGGQFKFVIDRVPPQ--TNVTHVVIVQPKSYGAFNFSFAEVSYK 76 (201)
Q Consensus 11 NvG~s~A~dV~l~D----~~fp~-~~F~iv~G~~~~~~~ri~pg--~nvsH~~vv~p~~~G~fn~t~A~VtY~ 76 (201)
..|+.--+.|++.+ ..+|. +.|++.+.+.+.+-.-+.-. ...+..+.|.|++.|.|.++++.|++.
T Consensus 25 ~~ge~~~l~i~~~~~~~~~~~p~l~~f~v~~~~~s~~~~~inG~~~~~~~~~~~l~p~~~G~~~IP~~~v~v~ 97 (484)
T PF13584_consen 25 GLGETFQLTITINGDGDDPDLPELDGFEVLGPSQSSSTSIINGKVSSSTTYTYTLQPKKTGTFTIPPFTVEVD 97 (484)
T ss_pred cCCCEEEEEEEEecCcccCCCCCCCCeEEcceEEEEEEEEecCceEEEEEEEEEEEecccceEEEceEEEEEC
Confidence 34555455554443 23444 68998444555544333222 235677889999999999999999553
No 16
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=91.72 E-value=0.83 Score=34.13 Aligned_cols=24 Identities=25% Similarity=0.498 Sum_probs=12.4
Q ss_pred eeeCCCCceEEEEEEEeceeeeEEee
Q psy4599 44 DRVPPQTNVTHVVIVQPKSYGAFNFS 69 (201)
Q Consensus 44 ~ri~pg~nvsH~~vv~p~~~G~fn~t 69 (201)
..|+||++.+.. +.|.+.|.|.|.
T Consensus 68 ~~l~~g~~~~~~--f~~~~~G~y~~~ 91 (104)
T PF13473_consen 68 KVLPPGETATVT--FTPLKPGEYEFY 91 (104)
T ss_dssp EEE-TT-EEEEE--EEE-S-EEEEEB
T ss_pred EEECCCCEEEEE--EcCCCCEEEEEE
Confidence 567777665444 367777777664
No 17
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=90.75 E-value=0.61 Score=31.54 Aligned_cols=33 Identities=36% Similarity=0.441 Sum_probs=26.6
Q ss_pred eeEEEecccceeeeEEEecCCCCCCceEEecCcee
Q psy4599 83 DYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFK 117 (201)
Q Consensus 83 ~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s 117 (201)
.+++-|.|..+|.+|.|.|. .| ++++.+.|++.
T Consensus 17 ti~v~N~g~~~a~~v~v~D~-lP-~g~~~v~~S~~ 49 (53)
T TIGR01451 17 TITVTNNGNVPATNVVVTDI-LP-SGTTFVSNSVT 49 (53)
T ss_pred EEEEEECCCCceEeEEEEEc-CC-CCCEEEeCcEE
Confidence 35678999999999999985 44 67778888864
No 18
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=87.87 E-value=0.95 Score=31.90 Aligned_cols=31 Identities=39% Similarity=0.608 Sum_probs=24.9
Q ss_pred eeEEEecccceeeeEEEecCCCCCCceEEecCc
Q psy4599 83 DYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQ 115 (201)
Q Consensus 83 ~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~ 115 (201)
.+++.|.|..+|.+|.|.|. + +.++++++|+
T Consensus 46 titvtN~G~~~a~nv~v~D~-l-p~g~~~v~~S 76 (76)
T PF01345_consen 46 TITVTNTGPAPATNVVVTDT-L-PAGLTFVSGS 76 (76)
T ss_pred EEEEEECCCCeeEeEEEEEc-C-CCCCEEeCCC
Confidence 45678999999999999986 4 5677777764
No 19
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=86.65 E-value=3.9 Score=30.11 Aligned_cols=67 Identities=18% Similarity=0.219 Sum_probs=36.7
Q ss_pred eEEEEEEEEcCccc---------eeeEEEecCCCCCCCceEecCee---eeeEeeeCCCCceEEEEEEEece--eeeEEe
Q psy4599 3 VVVKYTIYNIGNSV---------ATNVVVSDSSFPPEAFEIQGGQF---KFVIDRVPPQTNVTHVVIVQPKS--YGAFNF 68 (201)
Q Consensus 3 ~tV~y~IyNvG~s~---------A~dV~l~D~~fp~~~F~iv~G~~---~~~~~ri~pg~nvsH~~vv~p~~--~G~fn~ 68 (201)
+.+.++|-|.|+.+ -+|+.|.|. =..+.|..-.|.+ -..=..|+||+..+....+.... .|.|.+
T Consensus 2 v~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~-~g~~vwrwS~~~~FtQal~~~~l~pGe~~~~~~~~~~~~~~~G~Y~~ 80 (82)
T PF12690_consen 2 VEFTLTVTNNSDEPVTLQFPSGQRYDFVVKDK-EGKEVWRWSDGKMFTQALQEETLEPGESLTYEETWDLKDLSPGEYTL 80 (82)
T ss_dssp EEEEEEEEE-SSS-EEEEESSS--EEEEEE-T-T--EEEETTTT-------EEEEE-TT-EEEEEEEESS----SEEEEE
T ss_pred EEEEEEEEeCCCCeEEEEeCCCCEEEEEEECC-CCCEEEEecCCchhhheeeEEEECCCCEEEEEEEECCCCCCCceEEE
Confidence 67899999999854 345555542 1222232223322 33457899999999999998766 788876
Q ss_pred ee
Q psy4599 69 SF 70 (201)
Q Consensus 69 t~ 70 (201)
.+
T Consensus 81 ~a 82 (82)
T PF12690_consen 81 EA 82 (82)
T ss_dssp EE
T ss_pred eC
Confidence 53
No 20
>PF00345 PapD_N: Pili and flagellar-assembly chaperone, PapD N-terminal domain; InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=85.32 E-value=14 Score=28.01 Aligned_cols=84 Identities=21% Similarity=0.262 Sum_probs=55.4
Q ss_pred EEeeeEEEEEEeCCCCceeEEEecccce-eeeEEEecC-----CCCCCceEEecCceeeeeeeeCCCCceEEEEEEeee-
Q psy4599 66 FNFSFAEVSYKASENAEDYTIYNIGNSV-ATNVVVSDS-----SFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPK- 138 (201)
Q Consensus 66 fn~t~A~VtY~~s~~a~~y~IyNvG~s~-A~dV~l~D~-----sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~- 138 (201)
+.+++..|.|..++....++|+|.|+.+ ...+.+.+. .-+.+.|.+ +=...+|+||+.-+--++..+.
T Consensus 2 i~i~~trii~~~~~~~~~i~v~N~~~~~~~vq~~v~~~~~~~~~~~~~~~~v-----sPp~~~L~pg~~q~vRv~~~~~~ 76 (122)
T PF00345_consen 2 IQISPTRIIFNESQRSASITVTNNSDQPYLVQVWVYDQDDEDEDEPTDPFIV-----SPPIFRLEPGESQTVRVYRGSKL 76 (122)
T ss_dssp EEESSSEEEEETTSSEEEEEEEESSSSEEEEEEEEEETTSTTSSSSSSSEEE-----ESSEEEEETTEEEEEEEEECSGS
T ss_pred EEEccEEEEEeCCCCEEEEEEEcCCCCcEEEEEEEEcCCCcccccccccEEE-----eCCceEeCCCCcEEEEEEecCCC
Confidence 4678889999999999999999999985 346667651 112225532 2345699999998888844322
Q ss_pred ---ecceeeeeeeEEEEec
Q psy4599 139 ---SYGAFNFSFAEVSYKA 154 (201)
Q Consensus 139 ---~~G~fn~t~A~v~yr~ 154 (201)
+--.|-+...++-...
T Consensus 77 ~~~~E~~yrl~~~~iP~~~ 95 (122)
T PF00345_consen 77 PIDRESLYRLSFREIPPSE 95 (122)
T ss_dssp -SSS-EEEEEEEEEEESCC
T ss_pred CCCceEEEEEEEEEEeccc
Confidence 1245555555555544
No 21
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long.
Probab=84.36 E-value=4.5 Score=38.22 Aligned_cols=67 Identities=19% Similarity=0.206 Sum_probs=53.7
Q ss_pred eEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEE
Q psy4599 3 VVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVS 74 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~Vt 74 (201)
++++|+|.|- +.-..++++.=+ ++++| +.+|.-+.++- |.|++.-+-.|.+.|...|...++.-.|.
T Consensus 487 ~~l~~~I~N~-T~~~~~~~~~me--~s~~F-~fsG~k~~~~~-llP~s~~~~~y~l~pl~~G~~~lP~l~v~ 553 (554)
T PF07919_consen 487 FTLSYTIENP-TNHFQTFELSME--PSDDF-MFSGPKQTTFS-LLPFSRHTVRYNLLPLVAGWWILPRLKVR 553 (554)
T ss_pred EEEEEEEECC-CCccEEEEEEEc--cCCCE-EEECCCcCceE-ECCCCcEEEEEEEEEccCCcEECCcEEEe
Confidence 6899999994 445566655533 34568 99999988887 99999999999999999999988766654
No 22
>PF13584 BatD: Oxygen tolerance
Probab=83.84 E-value=26 Score=32.84 Aligned_cols=134 Identities=15% Similarity=0.164 Sum_probs=73.3
Q ss_pred eEEEEEEEEcCccceee-EEEecCCCCCCCceEecCeeeeeEeee-CCCC---ceE-EEEEEEeceeeeEEeeeEEEEEE
Q psy4599 3 VVVKYTIYNIGNSVATN-VVVSDSSFPPEAFEIQGGQFKFVIDRV-PPQT---NVT-HVVIVQPKSYGAFNFSFAEVSYK 76 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~d-V~l~D~~fp~~~F~iv~G~~~~~~~ri-~pg~---nvs-H~~vv~p~~~G~fn~t~A~VtY~ 76 (201)
+.++|.+|=...-...+ ..+....+ +.|.+-.=.....+.+- --|. .+. +.+.|.|.+.|.+.+.++.+...
T Consensus 145 v~lt~~ly~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~~~~~~~~i~G~~y~~~~~~~~~l~P~ksG~l~I~~~~~~~~ 222 (484)
T PF13584_consen 145 VILTLRLYTRNNFRQLGIEELPPPDF--EGFWVEQLGDDRQYEEERINGRRYRVIELRRYALFPQKSGTLTIPPATFEVT 222 (484)
T ss_pred EEEEEEEEEecCchhccccccCCCCC--CCcEEEECCCCCceeEEEECCEEEEEEEEEEEEEEeCCceeEEecCEEEEEE
Confidence 56677777655544222 12222222 23332222223344432 2222 133 45789999999999999999887
Q ss_pred eCCCCceeEEEecc---------cceeeeEEEec--CCCCCCceEEecCcee--eeee--eeCCCCceEEEEEEeee
Q psy4599 77 ASENAEDYTIYNIG---------NSVATNVVVSD--SSFPPEAFEIQGGQFK--FVID--RVPPQTNVTHVVIVQPK 138 (201)
Q Consensus 77 ~s~~a~~y~IyNvG---------~s~A~dV~l~D--~sfp~~~F~iv~G~~s--~~~~--rI~pg~nvsH~~vv~p~ 138 (201)
.......-+.+..+ .+.+..|++.. ....++.|.-.-|+++ .+|+ .+..|..++-++.|+..
T Consensus 223 ~~~~~~~~~~fg~~~~~~~~~~~~s~~~~i~V~plP~~~~p~~f~~aVg~f~l~~~~~~~~~~~Ge~vt~ti~i~g~ 299 (484)
T PF13584_consen 223 VSDPSGRRDFFGGNFGRSRPVSISSEPLTITVKPLPAEGAPADFSGAVGNFSLSQSWDPTEVKVGEPVTRTITISGE 299 (484)
T ss_pred EecccCccCccccccccceeEEecCCCeEEEeccCCcccCCCCcccceeEEEEEEEcCcccccCCCeEEEEEEEEEE
Confidence 75432111122221 34555565554 2244456643335444 4555 57789999999999877
No 23
>TIGR02656 cyanin_plasto plastocyanin. Members of this family are plastocyanin, a blue copper protein related to pseudoazurin, halocyanin, amicyanin, etc. This protein, located in the thylakoid luman, performs electron transport to photosystem I in Cyanobacteria and chloroplasts.
Probab=82.95 E-value=4 Score=30.43 Aligned_cols=81 Identities=20% Similarity=0.214 Sum_probs=51.8
Q ss_pred EEEEEeceeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 55 VVIVQPKSYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 55 ~~vv~p~~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
.+.|- ...|.+.|.|+.|+-+..+ .+++-|.|. ..-++.+.+..+|....+. .+.+..+--.+.||...+++|.
T Consensus 2 ~v~~g-~~~g~~~F~P~~i~v~~G~---~V~~~N~~~-~~H~~~~~~~~~~~~~~~~-~~~~~~~~~~~~pG~t~~~tF~ 75 (99)
T TIGR02656 2 TVKMG-ADKGALVFEPAKISIAAGD---TVEWVNNKG-GPHNVVFDEDAVPAGVKEL-AKSLSHKDLLNSPGESYEVTFS 75 (99)
T ss_pred EEEEe-cCCCceeEeCCEEEECCCC---EEEEEECCC-CCceEEECCCCCccchhhh-cccccccccccCCCCEEEEEeC
Confidence 34443 4478899999999988875 467778764 5677888776666554322 1222222245789999888766
Q ss_pred Eeeeecceeee
Q psy4599 135 VQPKSYGAFNF 145 (201)
Q Consensus 135 v~p~~~G~fn~ 145 (201)
- .|.|.+
T Consensus 76 ~----~G~y~y 82 (99)
T TIGR02656 76 T----PGTYTF 82 (99)
T ss_pred C----CEEEEE
Confidence 3 565533
No 24
>PF09478 CBM49: Carbohydrate binding domain CBM49; InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=79.43 E-value=10 Score=27.37 Aligned_cols=55 Identities=15% Similarity=0.191 Sum_probs=41.3
Q ss_pred EEEEEEEEcCccceeeEEEecCCCCCCCceEec---Ceeee-eEe-eeCCCCceEEEEEE
Q psy4599 4 VVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQG---GQFKF-VID-RVPPQTNVTHVVIV 58 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~---G~~~~-~~~-ri~pg~nvsH~~vv 58 (201)
.+..+|.|.|..+-.++.|.=+.+..+-+++.. |.... +|. .|+||+..+--++.
T Consensus 20 qy~v~I~N~~~~~I~~~~i~~~~l~~~iW~l~~~~~~~y~lPs~~~~i~pg~s~~FGYI~ 79 (80)
T PF09478_consen 20 QYDVTITNNGSKPIKSLKISIDNLYGSIWGLDKVSGNTYTLPSYQPTIKPGQSFTFGYIS 79 (80)
T ss_pred EEEEEEEECCCCeEEEEEEEECccchhheeEEeccCCEEECCccccccCCCCEEEEEEEe
Confidence 467889999999999999987777777677766 33333 564 99999988766653
No 25
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=78.72 E-value=14 Score=26.54 Aligned_cols=51 Identities=22% Similarity=0.406 Sum_probs=32.9
Q ss_pred eeEEEEEEeCCCCc--eeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCc
Q psy4599 69 SFAEVSYKASENAE--DYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTN 128 (201)
Q Consensus 69 t~A~VtY~~s~~a~--~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~n 128 (201)
.++.+.|..++.+. .+.|||.-...-..+.+... -.|..++.|+-....++
T Consensus 12 ~~~~~~~~l~~~a~~v~v~I~d~~G~~V~t~~~~~~---------~~G~~~~~WdG~d~~G~ 64 (81)
T PF13860_consen 12 TKGSIEYTLPEDADNVTVTIYDSNGQVVRTISLGSQ---------SAGEHSFTWDGKDDDGN 64 (81)
T ss_dssp CEEEEEEEECSSCEEEEEEEEETTS-EEEEEEEEEC---------SSEEEEEEE-SB-TTS-
T ss_pred EEEEEEEeCCCcccEEEEEEEcCCCCEEEEEEcCCc---------CCceEEEEECCCCCCcC
Confidence 46778888887774 78899986666666666431 24788899996665554
No 26
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=78.28 E-value=13 Score=27.31 Aligned_cols=27 Identities=15% Similarity=0.214 Sum_probs=19.3
Q ss_pred eeeeCCCCceEEEEEEeeee--cceeeee
Q psy4599 120 IDRVPPQTNVTHVVIVQPKS--YGAFNFS 146 (201)
Q Consensus 120 ~~rI~pg~nvsH~~vv~p~~--~G~fn~t 146 (201)
=..|+||...++..++.... +|.|...
T Consensus 53 ~~~l~pGe~~~~~~~~~~~~~~~G~Y~~~ 81 (82)
T PF12690_consen 53 EETLEPGESLTYEETWDLKDLSPGEYTLE 81 (82)
T ss_dssp EEEE-TT-EEEEEEEESS----SEEEEEE
T ss_pred EEEECCCCEEEEEEEECCCCCCCceEEEe
Confidence 36799999999999998887 8988764
No 27
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=75.42 E-value=36 Score=27.70 Aligned_cols=92 Identities=13% Similarity=0.102 Sum_probs=56.6
Q ss_pred CceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceE
Q psy4599 31 AFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFE 110 (201)
Q Consensus 31 ~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~ 110 (201)
.|+++=-.-..+|..|..+....|++ +-|.|.|+|++|+=+..+ ..+..+-|.... +..|.
T Consensus 26 ~f~~~i~~~~~~~~~~~~~~~~~~~i-----~a~n~~~~P~~I~VkaGD-~Vtl~vtN~d~~-------------~H~f~ 86 (135)
T TIGR03096 26 SFTVVINAYDTTIPELNVEGVTVKNI-----RAFNVLNEPEALVVKKGT-PVKVTVENKSPI-------------SEGFS 86 (135)
T ss_pred eeEEEEeccccEeeEEEeCCEEEEEE-----EeeeeEEcCCEEEECCCC-EEEEEEEeCCCC-------------ccceE
Confidence 57777766677787775555444444 478999999999955443 344343443332 34444
Q ss_pred EecCceeeeeeeeCCCCceEEEEEEeeeecceeeee
Q psy4599 111 IQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFS 146 (201)
Q Consensus 111 iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t 146 (201)
+..-. . =..|+||...+ +.+.|.+.|.|.|.
T Consensus 87 i~~~g--i-s~~I~pGet~T--itF~adKpG~Y~y~ 117 (135)
T TIGR03096 87 IDAYG--I-SEVIKAGETKT--ISFKADKAGAFTIW 117 (135)
T ss_pred ECCCC--c-ceEECCCCeEE--EEEECCCCEEEEEe
Confidence 43322 2 25788876654 55668999988653
No 28
>cd04036 C2_cPLA2 C2 domain present in cytosolic PhosphoLipase A2 (cPLA2). A single copy of the C2 domain is present in cPLA2 which releases arachidonic acid from membranes initiating the biosynthesis of potent inflammatory mediators such as prostaglandins, leukotrienes, and platelet-activating factor. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a few exceptions to this including RIM isoforms and some splice variants o
Probab=71.31 E-value=11 Score=28.33 Aligned_cols=50 Identities=14% Similarity=0.225 Sum_probs=40.9
Q ss_pred eeEEEec-CCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeee
Q psy4599 95 TNVVVSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFA 148 (201)
Q Consensus 95 ~dV~l~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A 148 (201)
..|++-| +.+ .+.+ -|........|.+|......+.|.++..|..++..+
T Consensus 66 l~v~v~d~d~~-~~~~---iG~~~~~l~~l~~g~~~~~~~~L~~~~~g~l~~~~~ 116 (119)
T cd04036 66 LELTVMDEDYV-MDDH---LGTVLFDVSKLKLGEKVRVTFSLNPQGKEELEVEFL 116 (119)
T ss_pred EEEEEEECCCC-CCcc---cEEEEEEHHHCCCCCcEEEEEECCCCCCceEEEEEE
Confidence 4577777 444 5665 499999999999999999999999999998887654
No 29
>PF09478 CBM49: Carbohydrate binding domain CBM49; InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=70.32 E-value=21 Score=25.71 Aligned_cols=53 Identities=17% Similarity=0.230 Sum_probs=39.3
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEec---Cceee-eee-eeCCCCceEEEEE
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQG---GQFKF-VID-RVPPQTNVTHVVI 134 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~---G~~s~-~~~-rI~pg~nvsH~~v 134 (201)
...+|.|.|..+-.++.|.-+.+..+-..+.. +.+.. +|- .|+||+..+--++
T Consensus 21 y~v~I~N~~~~~I~~~~i~~~~l~~~iW~l~~~~~~~y~lPs~~~~i~pg~s~~FGYI 78 (80)
T PF09478_consen 21 YDVTITNNGSKPIKSLKISIDNLYGSIWGLDKVSGNTYTLPSYQPTIKPGQSFTFGYI 78 (80)
T ss_pred EEEEEEECCCCeEEEEEEEECccchhheeEEeccCCEEECCccccccCCCCEEEEEEE
Confidence 36789999999999999988777666555554 44554 665 8899888776655
No 30
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=69.90 E-value=6.2 Score=29.43 Aligned_cols=68 Identities=18% Similarity=0.174 Sum_probs=42.0
Q ss_pred CceEEEEEEEEcCccceeeEEEecCCCCCCCceEe--cCeeeeeEeeeCCCCceEEEEEEEeceeeeEEe
Q psy4599 1 MDVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQ--GGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNF 68 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv--~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~ 68 (201)
+|+++..++.|-.+.+-++|++.=-.+.-+.=-+. .......--.|+||+..++.+.+.|..+|.-..
T Consensus 15 ~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~~~~~~~~~~~~l~p~~~~~~~~~i~p~~yG~~~~ 84 (107)
T PF00927_consen 15 QDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTRDQFKKEKFEVTLKPGETKSVEVTITPSQYGPKQL 84 (107)
T ss_dssp SEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEEEEEEEEEEEEEE-TTEEEEEEEEE-HHSHEEECC
T ss_pred CCEEEEEEEEeCCcCccccceeEEEEEEEEECCcccccEeEEEcceeeCCCCEEEEEEEEEceeEecchh
Confidence 47889999999999887887665311111100011 112344556799999999999999999998444
No 31
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long.
Probab=68.36 E-value=60 Score=30.65 Aligned_cols=106 Identities=17% Similarity=0.138 Sum_probs=62.6
Q ss_pred eEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEE--EEeCCCC-------ceeEEEecccceeeeEEEecCC
Q psy4599 33 EIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVS--YKASENA-------EDYTIYNIGNSVATNVVVSDSS 103 (201)
Q Consensus 33 ~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~Vt--Y~~s~~a-------~~y~IyNvG~s~A~dV~l~D~s 103 (201)
...-|.+..+|.|..+++.- ..+.-+..-..++.....+. =..+..+ -.|+|.|-- .-..++++.=
T Consensus 433 ~~~~g~~~I~WrR~~~~s~~--~~~~t~l~lP~~~v~~~~~~v~~~~p~~~~~~~~~~l~~~I~N~T-~~~~~~~~~m-- 507 (554)
T PF07919_consen 433 VTLLGSLVIKWRRNSSNSSD--PVVTTPLPLPRVNVPSSPLRVLASVPPSAIVGEPFTLSYTIENPT-NHFQTFELSM-- 507 (554)
T ss_pred ceeEEEEEEEEEECCCCCCC--ceEEEEeecCceEccCCCcEEEEecCCccccCcEEEEEEEEECCC-CccEEEEEEE--
Confidence 34578999999994443311 22222333333322222211 1112111 578888833 3333444432
Q ss_pred CCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeee
Q psy4599 104 FPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNF 145 (201)
Q Consensus 104 fp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~ 145 (201)
-+.++| +.+|.-+.++- |.|++.-+-.+.|.|...|...+
T Consensus 508 e~s~~F-~fsG~k~~~~~-llP~s~~~~~y~l~pl~~G~~~l 547 (554)
T PF07919_consen 508 EPSDDF-MFSGPKQTTFS-LLPFSRHTVRYNLLPLVAGWWIL 547 (554)
T ss_pred ccCCCE-EEECCCcCceE-ECCCCcEEEEEEEEEccCCcEEC
Confidence 345667 67888877776 99999999999999999997643
No 32
>cd04036 C2_cPLA2 C2 domain present in cytosolic PhosphoLipase A2 (cPLA2). A single copy of the C2 domain is present in cPLA2 which releases arachidonic acid from membranes initiating the biosynthesis of potent inflammatory mediators such as prostaglandins, leukotrienes, and platelet-activating factor. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a few exceptions to this including RIM isoforms and some splice variants o
Probab=64.81 E-value=21 Score=26.69 Aligned_cols=62 Identities=13% Similarity=0.206 Sum_probs=46.3
Q ss_pred EEEEEEEEcCccceeeEEEec-CCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeee
Q psy4599 4 VVKYTIYNIGNSVATNVVVSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSF 70 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D-~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~ 70 (201)
++.+.+... ......|++-| |.+ .++| =|........|.+|......+.+.++..|..++.-
T Consensus 53 ~f~f~i~~~-~~~~l~v~v~d~d~~-~~~~---iG~~~~~l~~l~~g~~~~~~~~L~~~~~g~l~~~~ 115 (119)
T cd04036 53 TFEFRIQSQ-VKNVLELTVMDEDYV-MDDH---LGTVLFDVSKLKLGEKVRVTFSLNPQGKEELEVEF 115 (119)
T ss_pred EEEEEeCcc-cCCEEEEEEEECCCC-CCcc---cEEEEEEHHHCCCCCcEEEEEECCCCCCceEEEEE
Confidence 345555543 23567899988 444 4443 57888889999999999999999999889887753
No 33
>KOG4386|consensus
Probab=64.16 E-value=14 Score=36.97 Aligned_cols=67 Identities=22% Similarity=0.341 Sum_probs=55.8
Q ss_pred eEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEE
Q psy4599 3 VVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVS 74 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~Vt 74 (201)
+.|.|.+-|--+ -+.||++.=+ |+|+| +-+|.-..+. ||-||+.-.-.|++-|...|+.||++--+.
T Consensus 710 lpvkyhLqnktd-lvqdveisve--psDaF-MFSGlkqirl-riLPGteqemlynfypLmAGyqqlPslnin 776 (809)
T KOG4386|consen 710 LPVKYHLQNKTD-LVQDVEISVE--PSDAF-MFSGLKQIRL-RILPGTEQEMLYNFYPLMAGYQQLPSLNIN 776 (809)
T ss_pred ccEEEEeccccc-eeeeEEeecc--cchhh-eecccceEEE-EEcCCCceEEEEEEehhhchhhhCCccccc
Confidence 678999999765 7889998754 77889 7889776665 788999999999999999999999876553
No 34
>COG1470 Predicted membrane protein [Function unknown]
Probab=63.46 E-value=51 Score=32.29 Aligned_cols=53 Identities=21% Similarity=0.324 Sum_probs=43.7
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceE-ecCeeeeeEeeeCCCCceEEEEEEE
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEI-QGGQFKFVIDRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~i-v~G~~~~~~~ri~pg~nvsH~~vv~ 59 (201)
+.++...|.|.|+.+-.||.|.=+ =|++ |++ |.+. +.+.|+||++.+-.+.|+
T Consensus 398 e~~i~i~I~NsGna~LtdIkl~v~-~Pqg-Wei~Vd~~---~I~sL~pge~~tV~ltI~ 451 (513)
T COG1470 398 EKTIRISIENSGNAPLTDIKLTVN-GPQG-WEIEVDES---TIPSLEPGESKTVSLTIT 451 (513)
T ss_pred cceEEEEEEecCCCccceeeEEec-CCcc-ceEEECcc---cccccCCCCcceEEEEEE
Confidence 678999999999999999999876 3444 654 3443 899999999999999998
No 35
>PF14796 AP3B1_C: Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=63.36 E-value=22 Score=29.20 Aligned_cols=50 Identities=18% Similarity=0.233 Sum_probs=37.7
Q ss_pred eEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEE
Q psy4599 3 VVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVV 56 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~ 56 (201)
+.|+.++-|.++.+..+|.+.+...+... -+++ -...++|+||++++-.+
T Consensus 87 vsIql~ftN~s~~~i~~I~i~~k~l~~g~--~i~~--F~~I~~L~pg~s~t~~l 136 (145)
T PF14796_consen 87 VSIQLTFTNNSDEPIKNIHIGEKKLPAGM--RIHE--FPEIESLEPGASVTVSL 136 (145)
T ss_pred EEEEEEEEecCCCeecceEECCCCCCCCc--Eeec--cCcccccCCCCeEEEEE
Confidence 67999999999999999999996544431 1111 34678999999988554
No 36
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=60.20 E-value=63 Score=23.40 Aligned_cols=61 Identities=18% Similarity=0.193 Sum_probs=37.7
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEe-eeecceeeeeee
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQ-PKSYGAFNFSFA 148 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~-p~~~G~fn~t~A 148 (201)
..++|.|.|..++. ..+....-....|.+-... ..|+||..++-.+.+. +...|.|.....
T Consensus 24 ~~v~l~N~s~~p~~-f~v~~~~~~~~~~~v~~~~-----g~l~PG~~~~~~V~~~~~~~~g~~~~~l~ 85 (102)
T PF14874_consen 24 RTVTLTNTSSIPAR-FRVRQPESLSSFFSVEPPS-----GFLAPGESVELEVTFSPTKPLGDYEGSLV 85 (102)
T ss_pred EEEEEEECCCCCEE-EEEEeCCcCCCCEEEECCC-----CEECCCCEEEEEEEEEeCCCCceEEEEEE
Confidence 46677777777542 2232222223445443211 3599999999999999 666798876554
No 37
>PF11611 DUF4352: Domain of unknown function (DUF4352); InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=59.19 E-value=40 Score=24.98 Aligned_cols=57 Identities=21% Similarity=0.311 Sum_probs=28.7
Q ss_pred eEEEEEEEEcCccce----eeEEEecC-CCCCC-CceEecCeeeeeEeeeCCCCceEEEEEEE
Q psy4599 3 VVVKYTIYNIGNSVA----TNVVVSDS-SFPPE-AFEIQGGQFKFVIDRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A----~dV~l~D~-~fp~~-~F~iv~G~~~~~~~ri~pg~nvsH~~vv~ 59 (201)
+.|+++|-|.|+.+- .+..|.|+ +-.-+ .+....-........|+||+.++=.++-.
T Consensus 38 v~v~v~v~N~~~~~~~~~~~~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F~ 100 (123)
T PF11611_consen 38 VVVDVTVKNNGDEPLDFSPSDFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVFE 100 (123)
T ss_dssp EEEEEEEEE-SSS-EEEEGGGEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEEE
T ss_pred EEEEEEEEECCCCcEEecccceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEEE
Confidence 689999999999753 35667652 11111 11111111114678999999988777766
No 38
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=58.69 E-value=67 Score=23.25 Aligned_cols=67 Identities=15% Similarity=0.153 Sum_probs=43.9
Q ss_pred EEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeee
Q psy4599 71 AEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFA 148 (201)
Q Consensus 71 A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A 148 (201)
+.+.|........++|-|.|..++ .+++.|+..... . ..=-.|+||+.+++.+-+ ....|-|-++..
T Consensus 11 v~~~~~~~~g~l~l~l~N~g~~~~-~~~v~~~~y~~~--------~-~~~~~v~ag~~~~~~w~l-~~s~gwYDl~v~ 77 (89)
T PF05506_consen 11 VTARYDPATGNLRLTLSNPGSAAV-TFTVYDNAYGGG--------G-PWTYTVAAGQTVSLTWPL-AASGGWYDLTVT 77 (89)
T ss_pred EEEEEECCCCEEEEEEEeCCCCcE-EEEEEeCCcCCC--------C-CEEEEECCCCEEEEEEee-cCCCCcEEEEEE
Confidence 456777777788999999987665 577777433211 0 011257788888888877 566677766543
No 39
>COG1572 Uncharacterized conserved protein [Function unknown]
Probab=58.58 E-value=1.5e+02 Score=29.87 Aligned_cols=58 Identities=21% Similarity=0.158 Sum_probs=42.0
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCc--eeeeeeeeCCCCceEEEEEEeeeecceeeeeee
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQ--FKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFA 148 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~--~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A 148 (201)
..++|-|.|.+.|.++++.. ++.|. .+.+...++||...+-.|.-.+...|.++++.-
T Consensus 427 i~l~i~N~G~~~a~~~~v~l---------~lnG~~~~~~~i~~l~~~~s~e~~v~~~~~s~G~~~Ls~~ 486 (606)
T COG1572 427 ITLNIKNLGEAYASGFQVDL---------VLNGTIVTVDSIPGLESGESREVVVNEVSTSGGSHTLSVV 486 (606)
T ss_pred EEEEEEeccccccCCceEEE---------EEcCceeeeEecccCCCCCceEEEEEEEecCCCceEEEEE
Confidence 67889999999988887765 34453 233555566777777777767888998888665
No 40
>PF08441 Integrin_alpha2: Integrin alpha; InterPro: IPR013649 This domain is found in integrin alpha and integrin alpha precursors to the C terminus of a number of IPR013517 from INTERPRO repeats and to the N terminus of the IPR013513 from INTERPRO cytoplasmic region. ; PDB: 1M1X_A 1U8C_A 1L5G_A 3IJE_A 1JV2_A 2VDN_A 2VC2_A 3NIF_A 3NIG_C 2VDM_A ....
Probab=57.86 E-value=1.3e+02 Score=27.96 Aligned_cols=70 Identities=19% Similarity=0.268 Sum_probs=36.1
Q ss_pred eeEEEecCCCCCCce-EEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEEEEecCCCCccCceeeee
Q psy4599 95 TNVVVSDSSFPPEAF-EIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAEDGDWAAFA 167 (201)
Q Consensus 95 ~dV~l~D~sfp~~~F-~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~yr~s~~a~~~~~~~f~ 167 (201)
.++.+...+.|+..+ ..+.|....+ ..=..|..|.|.+.|+= .|...+..+++....+...+...|++|.
T Consensus 311 ~~~~i~g~s~p~~v~~~~~~~~~~~~-~~~~ig~~v~h~y~V~N--~Gps~i~~~~l~i~~P~~~~~~~~Llyl 381 (457)
T PF08441_consen 311 ADLSISGVSSPEQVFGGSVKGESAMN-SEEDIGPEVTHTYEVRN--NGPSTIPSASLNIMWPYQTSNGKWLLYL 381 (457)
T ss_dssp E-EEEEEEECSSEEECT-SS---SSS-CCHHH--EEEEEEEEEE---SSS-EEEEEEEEEEECESTTC-CSCEE
T ss_pred EEeEEEeeeccceeeccccccccccc-ccccCCCcEEEEEEeee--cCCCccccEEEEEeeChhhCCCCeEEEE
Confidence 345565555565544 1122222222 22333559999999975 4444777788887777765555575554
No 41
>PF12584 TRAPPC10: Trafficking protein particle complex subunit 10, TRAPPC10; InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway. To date, two kinds of TRAPP complexes have been studied, TRAPPI and TRAPP II. These complexes differ in subunit composition []. TRAPP I binds vesicles derived from the endoplasmic reticulum bringing them closer to the acceptor membrane. This entry represents a domain which forms part of the TRAPP complex for mediating vesicle docking and fusion in the Golgi apparatus. The fungal version is referred to as Trs130, and an alternative vertebrate alias is TMEM1 [, ].
Probab=56.72 E-value=1e+02 Score=24.63 Aligned_cols=44 Identities=11% Similarity=0.094 Sum_probs=37.5
Q ss_pred EecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeC
Q psy4599 34 IQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKAS 78 (201)
Q Consensus 34 iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s 78 (201)
+|+|--...+.- ..|+..+-.++|.|++.|+..++...|+=...
T Consensus 72 lV~Grrrg~f~~-~~~~~~~~~l~LIPL~~G~L~lP~V~i~~~~~ 115 (147)
T PF12584_consen 72 LVSGRRRGVFSL-SDGSEHEIPLTLIPLRAGYLPLPKVEIRPYDP 115 (147)
T ss_pred EEeccCcceEEe-cCCCeEEEEEEEEecccceecCCEEEEEeccC
Confidence 888887777765 78888888999999999999999999876663
No 42
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=53.39 E-value=53 Score=21.14 Aligned_cols=38 Identities=18% Similarity=0.297 Sum_probs=24.5
Q ss_pred eeEEEecccceeeeEEEecCCCCCCceEEecCceeeeee--eeCCCCceE
Q psy4599 83 DYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVID--RVPPQTNVT 130 (201)
Q Consensus 83 ~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~--rI~pg~nvs 130 (201)
.+.|.|.|+++ +.+.+- .--.|-+.++|+ .|+||+...
T Consensus 1 ~F~~~N~g~~~---L~I~~v-------~tsCgCt~~~~~~~~i~PGes~~ 40 (45)
T PF07610_consen 1 TFEFTNTGDSP---LVITDV-------QTSCGCTTAEYSKKPIAPGESGK 40 (45)
T ss_pred CEEEEECCCCc---EEEEEe-------eEccCCEEeeCCcceECCCCEEE
Confidence 37899999998 455542 122566666554 578887654
No 43
>PF10989 DUF2808: Protein of unknown function (DUF2808); InterPro: IPR021256 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=52.63 E-value=1.2e+02 Score=24.26 Aligned_cols=36 Identities=39% Similarity=0.684 Sum_probs=27.2
Q ss_pred cCceeeeee-eeCCCCceEEEE-EE-eeeecceeeeeee
Q psy4599 113 GGQFKFVID-RVPPQTNVTHVV-IV-QPKSYGAFNFSFA 148 (201)
Q Consensus 113 ~G~~s~~~~-rI~pg~nvsH~~-vv-~p~~~G~fn~t~A 148 (201)
.+.+++.++ -|+||+.++=.+ -+ -|...|.|.|...
T Consensus 88 ~~~i~I~f~~PV~pG~tv~V~l~~v~NP~~~G~Y~f~v~ 126 (146)
T PF10989_consen 88 GRTITITFDEPVPPGTTVTVVLSPVRNPRSGGTYQFNVT 126 (146)
T ss_pred CCEEEEEeCCCCCCCCEEEEEEEeeeCCCCCCeEEEEEE
Confidence 467777776 499998887766 44 3888899999766
No 44
>PF03896 TRAP_alpha: Translocon-associated protein (TRAP), alpha subunit; InterPro: IPR005595 The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across the membrane [].; GO: 0005783 endoplasmic reticulum
Probab=51.92 E-value=1.5e+02 Score=26.96 Aligned_cols=35 Identities=23% Similarity=0.407 Sum_probs=20.5
Q ss_pred eeCCCCceEEEEEEee---eecceeeeeeeEEEEecCCC
Q psy4599 122 RVPPQTNVTHVVIVQP---KSYGAFNFSFAEVSYKASEN 157 (201)
Q Consensus 122 rI~pg~nvsH~~vv~p---~~~G~fn~t~A~v~yr~s~~ 157 (201)
.|+||...|-.+...| ...+.|.+... +.|+-+++
T Consensus 145 ~V~pg~~aT~~YsF~~~~~l~pr~f~L~i~-l~y~d~~g 182 (285)
T PF03896_consen 145 EVPPGEEATFPYSFTPSEELAPRPFGLVIN-LIYEDSDG 182 (285)
T ss_pred ccCCCCeEEEEEEEecchhcCCcceEEEEE-EEEEeCCC
Confidence 5788877776666665 33455666554 33554443
No 45
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=48.91 E-value=1e+02 Score=22.32 Aligned_cols=53 Identities=19% Similarity=0.173 Sum_probs=31.8
Q ss_pred EEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEE
Q psy4599 4 VVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFN 67 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn 67 (201)
.+..+|-|.|. .+..+.+.|+..... ...=-.|+||++++..+-+ ....|-|-
T Consensus 21 ~l~l~l~N~g~-~~~~~~v~~~~y~~~---------~~~~~~v~ag~~~~~~w~l-~~s~gwYD 73 (89)
T PF05506_consen 21 NLRLTLSNPGS-AAVTFTVYDNAYGGG---------GPWTYTVAAGQTVSLTWPL-AASGGWYD 73 (89)
T ss_pred EEEEEEEeCCC-CcEEEEEEeCCcCCC---------CCEEEEECCCCEEEEEEee-cCCCCcEE
Confidence 57889999987 455788888544311 0011246777777766655 33444443
No 46
>TIGR02656 cyanin_plasto plastocyanin. Members of this family are plastocyanin, a blue copper protein related to pseudoazurin, halocyanin, amicyanin, etc. This protein, located in the thylakoid luman, performs electron transport to photosystem I in Cyanobacteria and chloroplasts.
Probab=48.01 E-value=45 Score=24.71 Aligned_cols=48 Identities=15% Similarity=0.075 Sum_probs=26.1
Q ss_pred EEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEE
Q psy4599 8 TIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 57 (201)
Q Consensus 8 ~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~v 57 (201)
++-|.+. ...++.+.+..+|....+. .+.+...--.+.||++.+++|.
T Consensus 28 ~~~N~~~-~~H~~~~~~~~~~~~~~~~-~~~~~~~~~~~~pG~t~~~tF~ 75 (99)
T TIGR02656 28 EWVNNKG-GPHNVVFDEDAVPAGVKEL-AKSLSHKDLLNSPGESYEVTFS 75 (99)
T ss_pred EEEECCC-CCceEEECCCCCccchhhh-cccccccccccCCCCEEEEEeC
Confidence 3447653 5677777766566554322 1222112235678888777554
No 47
>COG1572 Uncharacterized conserved protein [Function unknown]
Probab=44.31 E-value=2.8e+02 Score=27.93 Aligned_cols=62 Identities=19% Similarity=0.180 Sum_probs=47.1
Q ss_pred CceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCe--eeeeEeeeCCCCceEEEEEEEeceeeeEEeeeE
Q psy4599 1 MDVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQ--FKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFA 71 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~--~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A 71 (201)
+.+++.++|-|.|.+.|.+++..+ ++.|. ++.+..-|+||+..+-.|.-.+...|.++++.-
T Consensus 423 k~~~i~l~i~N~G~~~a~~~~v~l---------~lnG~~~~~~~i~~l~~~~s~e~~v~~~~~s~G~~~Ls~~ 486 (606)
T COG1572 423 KALTITLNIKNLGEAYASGFQVDL---------VLNGTIVTVDSIPGLESGESREVVVNEVSTSGGSHTLSVV 486 (606)
T ss_pred ceEEEEEEEEeccccccCCceEEE---------EEcCceeeeEecccCCCCCceEEEEEEEecCCCceEEEEE
Confidence 457899999999999998888775 45553 355666677888777777766888888887653
No 48
>KOG4386|consensus
Probab=44.18 E-value=2e+02 Score=29.16 Aligned_cols=131 Identities=20% Similarity=0.253 Sum_probs=81.8
Q ss_pred cCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCce---------EEEEEEE-eceeeeEEeeeEEEEEEeCCCC
Q psy4599 12 IGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNV---------THVVIVQ-PKSYGAFNFSFAEVSYKASENA 81 (201)
Q Consensus 12 vG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nv---------sH~~vv~-p~~~G~fn~t~A~VtY~~s~~a 81 (201)
.|+++--=--|.-.+..+..=++--|....+|.|-.|=+|. -|+.|.- |.....--=+++.|. ..-.
T Consensus 635 tgEsasecfclqcpslgniEggvatGhyiisWkRtsameNipiittVitLphviVe~iPlhvnadlpsfgrVR---eslp 711 (809)
T KOG4386|consen 635 TGESASECFCLQCPSLGNIEGGVATGHYIISWKRTSAMENIPIITTVITLPHVIVEAIPLHVNADLPSFGRVR---ESLP 711 (809)
T ss_pred cccceeeeeeEeccccccccCCCccceEEEEEeecccccCCCceeeecccccceeeeccceeecCCCCcceec---cccc
Confidence 34444444445556666666677889999999998886553 3444433 443333222333332 1112
Q ss_pred ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEEeeeecceeeeeeeEE
Q psy4599 82 EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEV 150 (201)
Q Consensus 82 ~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v 150 (201)
..|.+-|--+ -+.||+++- -|+|.| +-+|.-.++. ||-||+.-.-.+.+-|.-.|+.++-+-.+
T Consensus 712 vkyhLqnktd-lvqdveisv--epsDaF-MFSGlkqirl-riLPGteqemlynfypLmAGyqqlPslni 775 (809)
T KOG4386|consen 712 VKYHLQNKTD-LVQDVEISV--EPSDAF-MFSGLKQIRL-RILPGTEQEMLYNFYPLMAGYQQLPSLNI 775 (809)
T ss_pred EEEEeccccc-eeeeEEeec--ccchhh-eecccceEEE-EEcCCCceEEEEEEehhhchhhhCCcccc
Confidence 3445544332 355666653 477777 5678766665 89999999999999999999999866533
No 49
>PRK09918 putative fimbrial chaperone protein; Provisional
Probab=42.57 E-value=2.2e+02 Score=24.49 Aligned_cols=65 Identities=12% Similarity=0.098 Sum_probs=44.9
Q ss_pred eeEEeeeEEEEEEeCCCCceeEEEecccceeeeEE--Eec-CCCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 64 GAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVV--VSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 64 G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~--l~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
+.+...+..|.|-.++.+..++|.|.|+.+ +-|+ +.| +.-+...|.+.-- ..||+||+.-+=-++
T Consensus 24 a~v~l~~tRvi~~~~~~~~si~v~N~~~~p-~lvQ~wv~~~~~~~~~~fivtPP-----l~rl~pg~~q~vRii 91 (230)
T PRK09918 24 AGMVPETSVVIVEESDGEGSINVKNTDSNP-ILLYTTLVDLPEDKSKLLLVTPP-----VARVEPGQSQQVRFI 91 (230)
T ss_pred eeEEEccEEEEEECCCCeEEEEEEcCCCCc-EEEEEEEecCCCCCCCCEEEcCC-----eEEECCCCceEEEEE
Confidence 567888999999999999999999999875 3333 333 2223445754443 358888887744444
No 50
>PRK02710 plastocyanin; Provisional
Probab=42.51 E-value=1.3e+02 Score=23.17 Aligned_cols=76 Identities=14% Similarity=0.177 Sum_probs=46.6
Q ss_pred CceEEEEEEEeceeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCce
Q psy4599 50 TNVTHVVIVQPKSYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNV 129 (201)
Q Consensus 50 ~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nv 129 (201)
..-+|++.+.. ..|.+.|.|+.|+=...+ .+++-|.++. .-++.+.+. +.+. ..+ -.++||...
T Consensus 27 ~a~~~~V~~~~-~~~~~~F~P~~i~v~~Gd---~V~~~N~~~~-~H~v~~~~~----~~~~--~~~-----~~~~pg~t~ 90 (119)
T PRK02710 27 SAETVEVKMGS-DAGMLAFEPSTLTIKAGD---TVKWVNNKLA-PHNAVFDGA----KELS--HKD-----LAFAPGESW 90 (119)
T ss_pred ccceEEEEEcc-CCCeeEEeCCEEEEcCCC---EEEEEECCCC-CceEEecCC----cccc--ccc-----cccCCCCEE
Confidence 45577777754 357899999999887764 4777787654 345554321 0010 000 147899988
Q ss_pred EEEEEEeeeecceeee
Q psy4599 130 THVVIVQPKSYGAFNF 145 (201)
Q Consensus 130 sH~~vv~p~~~G~fn~ 145 (201)
+++|.- .|.|.|
T Consensus 91 ~~tF~~----~G~y~y 102 (119)
T PRK02710 91 EETFSE----AGTYTY 102 (119)
T ss_pred EEEecC----CEEEEE
Confidence 877773 566544
No 51
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=40.74 E-value=1.9e+02 Score=25.21 Aligned_cols=67 Identities=13% Similarity=0.160 Sum_probs=45.3
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE--ec-CCCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV--SD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l--~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
.+-+.+....|-|..++....++|.|.|+...+=|+- .| ++-....|.+.-- ..||+||+.-+=.++
T Consensus 26 ~Agi~l~~TRvIy~~~~~~~sv~i~N~~~~~p~LvQsWv~~~~~~~~~pFivtPP-----lfrl~~~~~~~lRI~ 95 (228)
T PRK15188 26 AGGIALGATRVIYPQGSKQTSLPIINSSASNVFLIQSWVANADGSRSTDFIITPP-----LFVIQPKKENILRIM 95 (228)
T ss_pred cceEEECcEEEEEcCCCceEEEEEEeCCCCccEEEEEEEecCCCCccCCEEEcCC-----eEEECCCCceEEEEE
Confidence 3668889999999999999999999999765444543 22 2222234654443 358888877665555
No 52
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=39.18 E-value=1e+02 Score=22.01 Aligned_cols=16 Identities=25% Similarity=0.381 Sum_probs=7.9
Q ss_pred cCeeeeeEeeeCCCCc
Q psy4599 36 GGQFKFVIDRVPPQTN 51 (201)
Q Consensus 36 ~G~~~~~~~ri~pg~n 51 (201)
.|.-+..|+-.....+
T Consensus 49 ~G~~~~~WdG~d~~G~ 64 (81)
T PF13860_consen 49 AGEHSFTWDGKDDDGN 64 (81)
T ss_dssp SEEEEEEE-SB-TTS-
T ss_pred CceEEEEECCCCCCcC
Confidence 3666778885555443
No 53
>PF10725 DUF2517: Protein of unknown function (DUF2517); InterPro: IPR019663 This entry represents proteins conserved in Proteobacteria and includes the predicted protein YbfA. The function is not known.
Probab=39.17 E-value=12 Score=26.86 Aligned_cols=22 Identities=14% Similarity=0.392 Sum_probs=18.6
Q ss_pred eehhcchhhhheeeeeehhhhh
Q psy4599 167 AFMTLPPLLIPFLLWFQSKTKY 188 (201)
Q Consensus 167 ~~~~~P~~~~p~~ly~~S~~~y 188 (201)
++..+.++++|.||+++-++||
T Consensus 16 ~vv~~GvlAlPvMLf~~dRArf 37 (63)
T PF10725_consen 16 FVVLAGVLALPVMLFRSDRARF 37 (63)
T ss_pred HHHHHHHhhhheeeeecchhHH
Confidence 4567788999999999999886
No 54
>PF11797 DUF3324: Protein of unknown function C-terminal (DUF3324); InterPro: IPR021759 This family consists of several hypothetical bacterial proteins of unknown function.
Probab=39.12 E-value=93 Score=24.69 Aligned_cols=44 Identities=14% Similarity=0.292 Sum_probs=27.5
Q ss_pred CceEEEEEEEEcCcc-ceeeEEEecCCC-CCCCceEecCeeeeeE--eeeCCC
Q psy4599 1 MDVVVKYTIYNIGNS-VATNVVVSDSSF-PPEAFEIQGGQFKFVI--DRVPPQ 49 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s-~A~dV~l~D~~f-p~~~F~iv~G~~~~~~--~ri~pg 49 (201)
.+++|+..||+.|+. .-+.-+..+-.+ |...|.+ ...| ++|+||
T Consensus 59 ~~~~v~a~V~~~~~~k~~~~~~~~~~~mAPNS~f~~-----~i~~~~~~lk~G 106 (140)
T PF11797_consen 59 KKLTVDAKVTKKGSKKVLYTFKKENMQMAPNSNFNF-----PIPLGGKKLKPG 106 (140)
T ss_pred cCcEEEEEEEECCCCeEEEEeeccCCEECCCCeEEe-----EecCCCcCccCC
Confidence 368899999999974 555555555333 3345644 4445 477777
No 55
>PF14796 AP3B1_C: Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=38.63 E-value=57 Score=26.78 Aligned_cols=53 Identities=19% Similarity=0.216 Sum_probs=37.3
Q ss_pred eCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEE
Q psy4599 77 ASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVV 133 (201)
Q Consensus 77 ~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~ 133 (201)
..--+.++.+-|.++.+..+|++.+...+... .+ ++ -...++|+||++++-.+
T Consensus 84 ~~mvsIql~ftN~s~~~i~~I~i~~k~l~~g~-~i-~~--F~~I~~L~pg~s~t~~l 136 (145)
T PF14796_consen 84 PSMVSIQLTFTNNSDEPIKNIHIGEKKLPAGM-RI-HE--FPEIESLEPGASVTVSL 136 (145)
T ss_pred CCcEEEEEEEEecCCCeecceEECCCCCCCCc-Ee-ec--cCcccccCCCCeEEEEE
Confidence 33345899999999999999999997655322 11 11 12578899999988443
No 56
>PF14975 DUF4512: Domain of unknown function (DUF4512)
Probab=38.46 E-value=6.5 Score=29.91 Aligned_cols=16 Identities=44% Similarity=0.997 Sum_probs=13.6
Q ss_pred hcchhhhheeeeeehh
Q psy4599 170 TLPPLLIPFLLWFQSK 185 (201)
Q Consensus 170 ~~P~~~~p~~ly~~S~ 185 (201)
++|+++||.|||...|
T Consensus 2 CIPCivIPvLLwIykk 17 (88)
T PF14975_consen 2 CIPCIVIPVLLWIYKK 17 (88)
T ss_pred ccchhHHHHHHHHHHH
Confidence 5899999999998754
No 57
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=35.77 E-value=12 Score=34.97 Aligned_cols=58 Identities=16% Similarity=0.094 Sum_probs=0.0
Q ss_pred ceEEEEEEEEcCccceeeEEE--ecCCCCCCCceEecC-----------------eeeeeEeeeCCCCceEEEEEEE
Q psy4599 2 DVVVKYTIYNIGNSVATNVVV--SDSSFPPEAFEIQGG-----------------QFKFVIDRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l--~D~~fp~~~F~iv~G-----------------~~~~~~~ri~pg~nvsH~~vv~ 59 (201)
|..|+-+|.|.|+.+-.=+++ ..|+.|-+.|.|-++ ...--|..|+||+.++|+|=+-
T Consensus 39 nt~VkA~VTNtG~e~l~llK~ntilD~~Pv~kv~V~~~g~~V~F~Gi~~~~~~~~L~~d~F~~L~pG~sve~~fDiA 115 (359)
T PF02102_consen 39 NTRVKATVTNTGSEDLKLLKYNTILDSAPVKKVSVYKDGKEVPFTGIRLRYDTSGLTEDAFQTLAPGESVEVEFDIA 115 (359)
T ss_dssp -----------------------------------------------------------------------------
T ss_pred CcEEEEEEEeCCCcceEEEeeceecCCCceeEEEEEcCCcccccccEEEEEecCCCCHHHceecCCCCeEEEEEcch
Confidence 456888999999976322221 224678888877655 3455678899999999887653
No 58
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=35.28 E-value=1.7e+02 Score=21.06 Aligned_cols=60 Identities=18% Similarity=0.151 Sum_probs=39.6
Q ss_pred eEEEEEEEEcCcccee-eEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEE-eceeeeEEee
Q psy4599 3 VVVKYTIYNIGNSVAT-NVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQ-PKSYGAFNFS 69 (201)
Q Consensus 3 ~tV~y~IyNvG~s~A~-dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~-p~~~G~fn~t 69 (201)
.+...+|-|.|..++. .|.... -....|.+--. =..|+||+.++-.+.+. +...|.+...
T Consensus 22 ~~~~v~l~N~s~~p~~f~v~~~~--~~~~~~~v~~~-----~g~l~PG~~~~~~V~~~~~~~~g~~~~~ 83 (102)
T PF14874_consen 22 YSRTVTLTNTSSIPARFRVRQPE--SLSSFFSVEPP-----SGFLAPGESVELEVTFSPTKPLGDYEGS 83 (102)
T ss_pred EEEEEEEEECCCCCEEEEEEeCC--cCCCCEEEECC-----CCEECCCCEEEEEEEEEeCCCCceEEEE
Confidence 4567889999988653 454332 22344654321 23599999999999999 6667877643
No 59
>PRK15254 fimbrial chaperone protein StdC; Provisional
Probab=34.70 E-value=3.2e+02 Score=23.98 Aligned_cols=67 Identities=13% Similarity=0.124 Sum_probs=44.6
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEE--Eec-CCCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVV--VSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~--l~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
.+-+.+....|-|...+....++|.|.++...+-|+ +.| ++-+...|.+.-- ..||+||+.-+=.++
T Consensus 15 ~A~v~l~~TRvIy~~~~~~~sv~v~N~~~~~p~LvQsWv~d~~~~~~~pFivtPP-----lfrl~p~~~~~lRI~ 84 (239)
T PRK15254 15 PAAVNVDRTRIIMDAPQKTVAITLNNDDKTTPFLAQSWVTDADGVRTDALMALPP-----LQRIDAGQKSQVRIT 84 (239)
T ss_pred eEeEEECceEEEEeCCCceEEEEEEeCCCCCcEEEEEEEecCCCCCcCCEEEcCC-----eEEECCCCceEEEEE
Confidence 567788899999999999999999999975333343 333 2222234644333 457888776655554
No 60
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=33.37 E-value=3.3e+02 Score=23.79 Aligned_cols=67 Identities=9% Similarity=-0.022 Sum_probs=44.3
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCC----------CCCceEEecCceeeeeeeeCCCCceEEE
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSF----------PPEAFEIQGGQFKFVIDRVPPQTNVTHV 132 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sf----------p~~~F~iv~G~~s~~~~rI~pg~nvsH~ 132 (201)
.+-+.+....|-|..++....++|.|.|+.+ +-|+..-+.+ +...|.+.-- ..||+||+.-+=.
T Consensus 25 ~A~v~l~~TRvIy~~~~~~~sv~l~N~~~~p-~LvQswv~~~~~~~~~~~~~~~~pFivtPP-----lfrl~p~~~q~lR 98 (236)
T PRK11385 25 QAGVVVGGTRFIFPADRESISILLTNTSQES-WLINSKINRPTRWAGGEASTVPAPLLAAPP-----LILLKPGTTGTLR 98 (236)
T ss_pred eeeEEeCceEEEEcCCCceEEEEEEeCCCCc-EEEEEEcccCccccCcccccccCCEEEcCC-----eEEECCCCceEEE
Confidence 4677889999999999999999999999986 4444421111 1123543322 4577777766555
Q ss_pred EEE
Q psy4599 133 VIV 135 (201)
Q Consensus 133 ~vv 135 (201)
++.
T Consensus 99 Ii~ 101 (236)
T PRK11385 99 LLR 101 (236)
T ss_pred EEE
Confidence 553
No 61
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=31.89 E-value=1.9e+02 Score=20.68 Aligned_cols=72 Identities=21% Similarity=0.297 Sum_probs=44.1
Q ss_pred EEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCceEEEEEE-eeeecceeeeeeeEEEEecCCCCccCcee
Q psy4599 86 IYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIV-QPKSYGAFNFSFAEVSYKASENAEDGDWA 164 (201)
Q Consensus 86 IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv-~p~~~G~fn~t~A~v~yr~s~~a~~~~~~ 164 (201)
||--|++--..+-+.+.. .+ +.... ...+.+.-..|.++.-+...+ .....|.|.+ ++..++++....|.
T Consensus 10 iYrPGetV~~~~~~~~~~--~~-~~~~~-~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~-----~~~lp~~~~~G~y~ 80 (99)
T PF01835_consen 10 IYRPGETVHFRAIVRDLD--ND-FKPPA-NSPVTVTIKDPSGNEVFRWSVNTTNENGIFSG-----SFQLPDDAPLGTYT 80 (99)
T ss_dssp EE-TTSEEEEEEEEEEEC--TT-CSCES-SEEEEEEEEETTSEEEEEEEEEETTCTTEEEE-----EEE--SS---EEEE
T ss_pred CcCCCCEEEEEEEEeccc--cc-ccccc-CCceEEEEECCCCCEEEEEEeeeeCCCCEEEE-----EEECCCCCCCEeEE
Confidence 677777777677666633 01 11112 255667888888888888888 6778888887 57777777777775
Q ss_pred ee
Q psy4599 165 AF 166 (201)
Q Consensus 165 ~f 166 (201)
+-
T Consensus 81 i~ 82 (99)
T PF01835_consen 81 IR 82 (99)
T ss_dssp EE
T ss_pred EE
Confidence 53
No 62
>PF04495 GRASP55_65: GRASP55/65 PDZ-like domain ; InterPro: IPR007583 GRASP55 (Golgi reassembly stacking protein of 55 kDa) and GRASP65 (a 65 kDa) protein are highly homologous. GRASP55 is a component of the Golgi stacking machinery. GRASP65, an N-ethylmaleimide-sensitive membrane protein required for the stacking of Golgi cisternae in a cell-free system [].; PDB: 3RLE_A 4EDJ_A.
Probab=31.62 E-value=1.8e+02 Score=23.41 Aligned_cols=95 Identities=16% Similarity=0.242 Sum_probs=59.5
Q ss_pred EEEEcCccceeeEEEec-CCCCCCCceEecCee--eeeEeeeC-CCCceEEEEEEEecee----eeEEeeeEEEEEEe--
Q psy4599 8 TIYNIGNSVATNVVVSD-SSFPPEAFEIQGGQF--KFVIDRVP-PQTNVTHVVIVQPKSY----GAFNFSFAEVSYKA-- 77 (201)
Q Consensus 8 ~IyNvG~s~A~dV~l~D-~~fp~~~F~iv~G~~--~~~~~ri~-pg~nvsH~~vv~p~~~----G~fn~t~A~VtY~~-- 77 (201)
++||.=...-++|.+.. +.|... |.+ +++|++.. ..+..-|++-|.|-.. |.-.|..=.|..-.
T Consensus 2 ~v~~~k~~~~R~v~i~ps~~w~~~------g~LG~sv~~~~~~~~~~~~~~Vl~V~p~SPA~~AGL~p~~DyIig~~~~~ 75 (138)
T PF04495_consen 2 NVYNAKGQTTREVSIVPSKKWGGQ------GLLGISVRFESFEGAEEEGWHVLRVAPNSPAAKAGLEPFFDYIIGIDGGL 75 (138)
T ss_dssp EEEETTTSSEEEEEE---SSSSSS------SSS-EEEEEEE-TTGCCCEEEEEEE-TTSHHHHTT--TTTEEEEEETTCE
T ss_pred ceEECCCCeEEEEEEccCcccCCC------CCCcEEEEEecccccccceEEEeEecCCCHHHHCCccccccEEEEcccee
Confidence 68999999999999976 445544 544 99999999 7888999998886543 33323222222210
Q ss_pred --CCCC------------ceeEEEecccceeeeEEEec-CCCCCCc
Q psy4599 78 --SENA------------EDYTIYNIGNSVATNVVVSD-SSFPPEA 108 (201)
Q Consensus 78 --s~~a------------~~y~IyNvG~s~A~dV~l~D-~sfp~~~ 108 (201)
.++. ....+||.....-.+|.+.- ..|..++
T Consensus 76 l~~~~~l~~~v~~~~~~~l~L~Vyns~~~~vR~V~i~P~~~WgG~G 121 (138)
T PF04495_consen 76 LDDEDDLFELVEANENKPLQLYVYNSKTDSVREVTITPSRNWGGRG 121 (138)
T ss_dssp --STCHHHHHHHHTTTS-EEEEEEETTTTCEEEEEE---TTSSSST
T ss_pred cCCHHHHHHHHHHcCCCcEEEEEEECCCCeEEEEEEEcCCCCCCCe
Confidence 1111 56678888888888888866 4677663
No 63
>PRK15299 fimbrial chaperone protein StiB; Provisional
Probab=31.61 E-value=3.3e+02 Score=23.34 Aligned_cols=65 Identities=14% Similarity=0.111 Sum_probs=43.2
Q ss_pred eeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE--ecCC----CCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 64 GAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV--SDSS----FPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 64 G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l--~D~s----fp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
+.+.+.+..|-|..++.+..++|.|.|+.+ +-|+- .|+. -+...|.+.-- ..||+||+.-.=.++
T Consensus 22 a~i~l~~TRvi~~~~~~~~sl~l~N~~~~p-~lvQsWv~~~~~~~~~~~~pfivtPP-----l~rl~p~~~q~lRI~ 92 (227)
T PRK15299 22 AGINIGTTRVIFHGDAKDASISISNSDNVP-YLIQSWAQSISETGASGDAPFMVTPP-----LFRLNGGQKNVLRII 92 (227)
T ss_pred eeEEECceEEEEeCCCcEEEEEEEeCCCCc-EEEEEEeecCCCCCCcCCCCEEEcCC-----eEEECCCCccEEEEE
Confidence 667888999999999999999999999874 44442 2211 12234643333 457888777554444
No 64
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=31.45 E-value=63 Score=30.75 Aligned_cols=57 Identities=18% Similarity=0.215 Sum_probs=36.4
Q ss_pred CceEEEEEEEEcCcccee-------eEEEec--------CCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEE
Q psy4599 1 MDVVVKYTIYNIGNSVAT-------NVVVSD--------SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~-------dV~l~D--------~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~ 59 (201)
|.|+++++|-|.|+++-+ +|.+.+ +.+|.+.-. .|..-..=+-|.||++.+-.+.++
T Consensus 282 R~l~~~~~VTN~g~~~vrlgEF~TA~vRFlN~~~v~~~~~~yP~~lla--~GL~v~d~~pI~PGETr~v~v~aq 353 (399)
T TIGR03079 282 RALRVTMEITNNGDQVISIGEFTTAGIRFMNANGVRVLDPDYPRELLA--EGLEVDDQSAIAPGETVEVKMEAK 353 (399)
T ss_pred cEEEEEEEEEcCCCCceEEEeEeecceEeeCcccccccCCCChHHHhh--ccceeCCCCCcCCCcceEEEEEEe
Confidence 568999999999998754 344443 345554322 143333334699999988776554
No 65
>cd08379 C2D_MCTP_PRT_plant C2 domain fourth repeat found in Multiple C2 domain and Transmembrane region Proteins (MCTP); plant subset. MCTPs are involved in Ca2+ signaling at the membrane. Plant-MCTPs are composed of a variable N-terminal sequence, four C2 domains, two transmembrane regions (TMRs), and a short C-terminal sequence. It is one of four protein classes that are anchored to membranes via a transmembrane region; the others being synaptotagmins, extended synaptotagmins, and ferlins. MCTPs are the only membrane-bound C2 domain proteins that contain two functional TMRs. MCTPs are unique in that they bind Ca2+ but not phospholipids. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphate
Probab=31.08 E-value=1.8e+02 Score=22.72 Aligned_cols=46 Identities=15% Similarity=0.191 Sum_probs=34.6
Q ss_pred ccceeeEEEec-CCC------CCCCceEecCeeeeeEeeeCCCCceEEEEEEEece
Q psy4599 14 NSVATNVVVSD-SSF------PPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKS 62 (201)
Q Consensus 14 ~s~A~dV~l~D-~~f------p~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~ 62 (201)
......|++-| +.+ ..++| + |........+.+|....|.+-|++..
T Consensus 61 ~~~~l~v~V~d~d~~~~~~~~~~dd~--l-G~~~i~l~~l~~~~~~~~~~~L~~~~ 113 (126)
T cd08379 61 PCTVLTVGVFDNSQSHWKEAVQPDVL--I-GKVRIRLSTLEDDRVYAHSYPLLSLN 113 (126)
T ss_pred CCCEEEEEEEECCCccccccCCCCce--E-EEEEEEHHHccCCCEEeeEEEeEeCC
Confidence 34589999988 333 14544 3 78888888999999999999999654
No 66
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=30.43 E-value=3.6e+02 Score=23.38 Aligned_cols=64 Identities=14% Similarity=0.139 Sum_probs=39.9
Q ss_pred eEEEEEEeCCCC--ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCce----EEEEEEeeeecce
Q psy4599 70 FAEVSYKASENA--EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNV----THVVIVQPKSYGA 142 (201)
Q Consensus 70 ~A~VtY~~s~~a--~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nv----sH~~vv~p~~~G~ 142 (201)
.+...|..++.+ ..++|||.-...-+.+.+.. .-.|...+.||.+...++. .|+|.|.++..|.
T Consensus 114 ~~~~~~~l~~~a~~vti~I~D~~G~~Vrt~~lg~---------~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A~~~g~ 183 (225)
T PRK06655 114 TTPFGVELPSAADNVTVTITDSAGQVVRTIDLGA---------QSAGVVSFTWDGTDTDGNALPDGNYTIKASASVGGK 183 (225)
T ss_pred ceEEEEEcCCCCcEEEEEEEcCCCCEEEEEecCC---------cCCCceeEEECCCCCCCCcCCCeeEEEEEEEEeCCc
Confidence 556667766665 55888886333333344432 1158888999997776653 5677776665554
No 67
>PF04744 Monooxygenase_B: Monooxygenase subunit B protein; InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=29.61 E-value=1.6e+02 Score=27.97 Aligned_cols=59 Identities=20% Similarity=0.244 Sum_probs=32.6
Q ss_pred CceEEEEEEEEcCcccee-------eEEEecCCCCCCCc----e-EecCeeeeeEe-eeCCCCceEEEEEEE
Q psy4599 1 MDVVVKYTIYNIGNSVAT-------NVVVSDSSFPPEAF----E-IQGGQFKFVID-RVPPQTNVTHVVIVQ 59 (201)
Q Consensus 1 ~d~tV~y~IyNvG~s~A~-------dV~l~D~~fp~~~F----~-iv~G~~~~~~~-ri~pg~nvsH~~vv~ 59 (201)
|-|+++.+|-|.|+++.+ +|.+.|...+.++- + +-.+-++++=+ -|+||++.+-++.++
T Consensus 263 R~l~~~l~VtN~g~~pv~LgeF~tA~vrFln~~v~~~~~~~P~~l~A~~gL~vs~~~pI~PGETrtl~V~a~ 334 (381)
T PF04744_consen 263 RTLTMTLTVTNNGDSPVRLGEFNTANVRFLNPDVPTDDPDYPDELLAERGLSVSDNSPIAPGETRTLTVEAQ 334 (381)
T ss_dssp SEEEEEEEEEEESSS-BEEEEEESSS-EEE-TTT-SS-S---TTTEETT-EEES--S-B-TT-EEEEEEEEE
T ss_pred cEEEEEEEEEcCCCCceEeeeEEeccEEEeCcccccCCCCCchhhhccCcceeCCCCCcCCCceEEEEEEee
Confidence 468899999999999876 47777755553211 1 22223444444 899999988877664
No 68
>PRK15208 long polar fimbrial chaperone LpfB; Provisional
Probab=28.49 E-value=3.8e+02 Score=23.03 Aligned_cols=66 Identities=18% Similarity=0.233 Sum_probs=42.0
Q ss_pred eeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE-ecCC--CCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 64 GAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV-SDSS--FPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 64 G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l-~D~s--fp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
+-+.+.+..|-|...+.+..+.|.|.|+...+-|+- .|+. -....|.+.-- ..||+||+.-+=.++
T Consensus 21 agv~l~~TRvI~~~~~~~~si~i~N~~~~~~~LvQsWv~~~~~~~~~pfivtPP-----l~rl~p~~~q~lRIi 89 (228)
T PRK15208 21 GGVALSSTRVIYDGSKKEASLTVNNKSKTEEFLIQSWIDDANGNKKTPFIITPP-----LFKLDPTKNNVLRIV 89 (228)
T ss_pred ccEEeCceEEEEeCCCceEEEEEEeCCCCCcEEEEEEEECCCCCccCCEEECCC-----eEEECCCCccEEEEE
Confidence 557788999999999999999999999765444542 2211 11123543322 357777776544444
No 69
>PF14646 MYCBPAP: MYCBP-associated protein family
Probab=28.17 E-value=3.5e+02 Score=25.32 Aligned_cols=31 Identities=10% Similarity=0.202 Sum_probs=27.0
Q ss_pred eeeCCCCceEEEEEEeeeecceeeeeeeEEE
Q psy4599 121 DRVPPQTNVTHVVIVQPKSYGAFNFSFAEVS 151 (201)
Q Consensus 121 ~rI~pg~nvsH~~vv~p~~~G~fn~t~A~v~ 151 (201)
.+|-||....-.|.-+|++.|.|.-..-..+
T Consensus 295 gvilPGe~~~~~~~F~s~~~Gif~E~W~L~t 325 (426)
T PF14646_consen 295 GVILPGETRNFPFMFKSRKVGIFKERWELRT 325 (426)
T ss_pred CEECCCceEEEEEEEeCCCceEEEEEEEEEE
Confidence 5799999999999999999999988766555
No 70
>COG0832 UreB Urea amidohydrolase (urease) beta subunit [Amino acid transport and metabolism]
Probab=27.91 E-value=1.5e+02 Score=23.38 Aligned_cols=37 Identities=14% Similarity=0.223 Sum_probs=31.2
Q ss_pred eeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec
Q psy4599 62 SYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 62 ~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D 101 (201)
..|.+.+.+..|.--.......+.+-|.|+-+ |++-.
T Consensus 2 iPGe~~~~~g~IelN~gr~~~~i~V~NtGDRP---IQVGS 38 (106)
T COG0832 2 IPGEIILASGDIELNAGRPTVTIEVANTGDRP---IQVGS 38 (106)
T ss_pred CCceeEecCccEEEeCCCcceEEEEeecCCCc---eEeec
Confidence 36888888888988888888899999999988 77763
No 71
>PRK13205 ureB urease subunit beta; Reviewed
Probab=27.17 E-value=1.3e+02 Score=25.21 Aligned_cols=36 Identities=19% Similarity=0.291 Sum_probs=29.6
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D 101 (201)
.|.+.+..-.|..-......++.+-|.|+-| |++..
T Consensus 3 PGei~~~~g~IelN~GR~~i~L~V~NtGDRP---IQVGS 38 (162)
T PRK13205 3 PGEYILSSESLTGNVGREAKTIEIINTGDRP---VQIGS 38 (162)
T ss_pred CceEecCCCCeEeCCCCcEEEEEEEeCCCCc---eEecc
Confidence 5777777777888888888999999999988 78774
No 72
>TIGR03102 halo_cynanin halocyanin domain. Halocyanins are blue (type I) copper redox proteins found in halophilic archaea such as Natronobacterium pharaonis. This model represents a domain duplicated in some halocyanins, while appearing once in others. This domain includes the characteristic copper ligand residues. This family does not include plastocyanins, and does not include certain divergent paralogs of halocyanin.
Probab=25.45 E-value=3.3e+02 Score=21.24 Aligned_cols=73 Identities=16% Similarity=0.139 Sum_probs=46.8
Q ss_pred eEEEEEEEece-eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec-CCCCCCceEEecCceeeeeeeeCCCCce
Q psy4599 52 VTHVVIVQPKS-YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNV 129 (201)
Q Consensus 52 vsH~~vv~p~~-~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nv 129 (201)
-.+.+.|=|.. .|.|.|.|+.|+=... ..++--|.++..+-+|+..+ ..|. +|.+ .++||...
T Consensus 22 ~~~~v~~G~~~~~g~~~F~P~~ltV~~G---dTVtw~~~~d~~~HnV~s~~~~~f~-------s~~~-----~~~~G~t~ 86 (115)
T TIGR03102 22 DEVTVDVGAEANGGGFAFDPPAIRVDPG---TTVVWEWTGEGGGHNVVSDGDGDLD-------ESER-----VSEEGTTY 86 (115)
T ss_pred ceEEEEecccCCCCceeEeCCEEEECCC---CEEEEEECCCCCCEEEEECCCCCcc-------cccc-----ccCCCCEE
Confidence 34556665555 4679999999886654 46777777777777777653 2222 1211 35688888
Q ss_pred EEEEEEeeeeccee
Q psy4599 130 THVVIVQPKSYGAF 143 (201)
Q Consensus 130 sH~~vv~p~~~G~f 143 (201)
+|+|. +.|.|
T Consensus 87 s~Tf~----~~G~Y 96 (115)
T TIGR03102 87 EHTFE----EPGIY 96 (115)
T ss_pred EEEec----CCcEE
Confidence 88885 45655
No 73
>PF01263 Aldose_epim: Aldose 1-epimerase; InterPro: IPR008183 Aldose 1-epimerase (5.1.3.3 from EC) (mutarotase) is the enzyme responsible for the anomeric interconversion of D-glucose and other aldoses between their alpha- and beta-forms. The sequence of mutarotase from two bacteria, Acinetobacter calcoaceticus and Streptococcus thermophilus is available []. It has also been shown that, on the basis of extensive sequence similarities, a mutarotase domain seems to be present in the C-terminal half of the fungal GAL10 protein which encodes, in the N-terminal part, UDP-glucose 4-epimerase.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process; PDB: 1YGA_A 3DCD_A 2CIQ_A 2CIS_A 2CIR_A 2HTB_C 2HTA_B 3Q1N_A 1NSZ_B 1NSR_B ....
Probab=25.23 E-value=3.8e+02 Score=22.78 Aligned_cols=81 Identities=12% Similarity=0.142 Sum_probs=45.6
Q ss_pred ceEEEEEEEe--ceeeeEEeeeEEEEEEeCCCCceeEEEecccce-eee---EEEecCCCCCCceEEecCceeeeeeeeC
Q psy4599 51 NVTHVVIVQP--KSYGAFNFSFAEVSYKASENAEDYTIYNIGNSV-ATN---VVVSDSSFPPEAFEIQGGQFKFVIDRVP 124 (201)
Q Consensus 51 nvsH~~vv~p--~~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~-A~d---V~l~D~sfp~~~F~iv~G~~s~~~~rI~ 124 (201)
-+-|.++..+ ...-...-....++....++...+.|||-|... ... +.+--+.+|.+.+.-..| --.|+
T Consensus 214 ~~D~~f~~~~~~~~~~~~~~~~~~i~l~~~~~~p~~~v~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~L~ 288 (300)
T PF01263_consen 214 GLDHCFVLDPPGRPVIRLRSPGSGISLEFSTSFPWLVVWTPGFDPKYICIEPIAIEPDFVPDEANNPGKG-----LIVLA 288 (300)
T ss_dssp CEEEEEEESSSSEEEEEEETTTEEEEEEEEESSSEEEEEEETTSSTEETTCEEECEEESECECCCECCHE-----GEEEE
T ss_pred ccceEEEcCCCccEEEEeecccceEEEEecCCCCcEEEECCCCcccccCCcccccCCCeeecccccCCCC-----CEEEC
Confidence 5778887776 334444444556666666666678888887765 111 111112223233321112 34789
Q ss_pred CCCceEEEEEEe
Q psy4599 125 PQTNVTHVVIVQ 136 (201)
Q Consensus 125 pg~nvsH~~vv~ 136 (201)
||...++.+.++
T Consensus 289 pGe~~~~~~~~~ 300 (300)
T PF01263_consen 289 PGESYSFSTRYT 300 (300)
T ss_dssp TTSEEEEEEEEE
T ss_pred CCCEEEEEEEEC
Confidence 999999988764
No 74
>PF00127 Copper-bind: Copper binding proteins, plastocyanin/azurin family; InterPro: IPR000923 Blue (type 1) copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm [, ]. The most well known members of this class of proteins are the plant chloroplastic plastocyanins, which exchange electrons with cytochrome c6, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. This family of proteins also includes amicyanin from bacteria such as Methylobacterium extorquens or Paracoccus versutus (Thiobacillus versutus) that can grow on methylamine; auracyanins A and B from Chloroflexus aurantiacus []; blue copper protein from Alcaligenes faecalis; cupredoxin (CPC) from Cucumis sativus (Cucumber) peelings []; cusacyanin (basic blue protein; plantacyanin, CBP) from cucumber; halocyanin from Natronomonas pharaonis (Natronobacterium pharaonis) [], a membrane associated copper-binding protein; pseudoazurin from Pseudomonas; rusticyanin from Thiobacillus ferrooxidans []; stellacyanin from Rhus vernicifera (Japanese lacquer tree); umecyanin from the roots of Armoracia rusticana (Horseradish); and allergen Ra3 from ragweed. This pollen protein is evolutionary related to the above proteins, but seems to have lost the ability to bind copper. Although there is an appreciable amount of divergence in the sequences of all these proteins, the copper ligand sites are conserved.; GO: 0005507 copper ion binding, 0009055 electron carrier activity; PDB: 1UAT_A 1CUO_A 1PLC_A 4PCY_A 3PCY_A 1PND_A 1PNC_A 1JXG_A 6PCY_A 1TKW_A ....
Probab=25.21 E-value=1.2e+02 Score=22.16 Aligned_cols=14 Identities=7% Similarity=0.100 Sum_probs=8.9
Q ss_pred eeeCCCCceEEEEE
Q psy4599 44 DRVPPQTNVTHVVI 57 (201)
Q Consensus 44 ~ri~pg~nvsH~~v 57 (201)
..|.||++.+++|.
T Consensus 62 ~~~~~G~~~~~tF~ 75 (99)
T PF00127_consen 62 PLLAPGETYSVTFT 75 (99)
T ss_dssp EEBSTTEEEEEEEE
T ss_pred eecCCCCEEEEEeC
Confidence 34667777666655
No 75
>PRK15195 fimbrial chaperone protein FimC; Provisional
Probab=25.04 E-value=4.5e+02 Score=22.71 Aligned_cols=66 Identities=18% Similarity=0.201 Sum_probs=44.2
Q ss_pred eeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE--ec-CCCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 64 GAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV--SD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 64 G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l--~D-~sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
+-+.+.+..|-|..++.+..+.|.|.++...+=|+- .| +.-....|.+.-- +.||+||+.-+=.++
T Consensus 25 Agi~i~~TRvIy~~~~~~~si~l~N~~~~~~~LvQsWv~~~~~~~~~pfivtPP-----lfrl~p~~~q~lRIi 93 (229)
T PRK15195 25 GGIALGATRVIYPADAKQTSLAIRNSHTNERYLVNSWIENSSGVKEKSFIVTPP-----LFVSEPKSENTLRII 93 (229)
T ss_pred eeEEECCeEEEEeCCCceEEEEEEeCCCCccEEEEEEecCCCCCccCCEEEcCC-----eEEECCCCceEEEEE
Confidence 567888999999999999999999999765444552 22 1111234644333 458888877665444
No 76
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.00 E-value=4.7e+02 Score=22.83 Aligned_cols=62 Identities=13% Similarity=0.087 Sum_probs=41.0
Q ss_pred eEEEEEEeCCCC--ceeEEEecccceeeeEEEecCCCCCCceEEecCceeeeeeeeCCCCce----EEEEEEeeeeccee
Q psy4599 70 FAEVSYKASENA--EDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNV----THVVIVQPKSYGAF 143 (201)
Q Consensus 70 ~A~VtY~~s~~a--~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~~rI~pg~nv----sH~~vv~p~~~G~f 143 (201)
.+..+|..++.+ ..++|||.-...-+.+++ -.|.-.+.|+-....++. .|+|.|.++..|.-
T Consensus 112 ~~~~~~~l~~~a~~v~v~I~D~~G~vV~t~~~------------~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~~~g~~ 179 (223)
T PRK12813 112 PVTISPNPAADADKAELVVRDAAGAEVARETV------------PVGAGPVEWAGEDADGNPLPNGAYSFVVESYSGGEL 179 (223)
T ss_pred eeEEEEeccCCCceEEEEEEcCCCCEEEEEee------------CCCceeEEeCCcCCCCCcCCCccEEEEEEEEeCCce
Confidence 667778877666 578888876655444444 146777999965544333 58888888766653
No 77
>PF01002 Flavi_NS2B: Flavivirus non-structural protein NS2B; InterPro: IPR000487 Flaviviruses encode a single polyprotein. This is cleaved into three structural and seven non-structural proteins. All, but two, are cleaved by the NS2B-NS3 protease complex [, ].; GO: 0004252 serine-type endopeptidase activity, 0019012 virion; PDB: 2WV9_A 2FOM_A 2VBC_B 3U1I_C 3U1J_A 3LKW_A 3L6P_A 2GGV_A 3E90_C 2IJO_A ....
Probab=24.38 E-value=70 Score=25.74 Aligned_cols=48 Identities=19% Similarity=0.162 Sum_probs=19.2
Q ss_pred CCCceEEEEEEEeceeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecC
Q psy4599 48 PQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDS 102 (201)
Q Consensus 48 pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~ 102 (201)
.|+-+--.|++--++-+..---++.|+..+ ++ -|.|++.-+||++.|+
T Consensus 32 ~~gllll~yvisgks~~L~~E~ag~i~W~~--ea-----~~sG~s~rldV~~d~~ 79 (128)
T PF01002_consen 32 AGGLLLLAYVISGKSTDLWLEWAGDISWEE--EA-----EISGGSVRLDVKLDDD 79 (128)
T ss_dssp ---------------SSEEEEEEE-S---T--TH-----EEHSEEEEEEEEE-TT
T ss_pred HHHHHHHHHHhhcccCceEEEEEeccccCc--cc-----hhcCCceEEEEEECCC
Confidence 344555556666666666666666666543 22 3678888899998885
No 78
>PLN02171 endoglucanase
Probab=24.37 E-value=1.7e+02 Score=29.53 Aligned_cols=56 Identities=11% Similarity=0.068 Sum_probs=42.6
Q ss_pred EEEEEEEEcCccceeeEEEecCCCCCCCceEec---CeeeeeE-eeeCCCCceEEEEEEE
Q psy4599 4 VVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQG---GQFKFVI-DRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~---G~~~~~~-~ri~pg~nvsH~~vv~ 59 (201)
.++.+|+|.+..++.++.+.-+.+..+-++|.. |..--+| ..|++|++.+-.++.+
T Consensus 556 qy~v~I~N~s~~~ik~i~i~~~~~~~~iW~v~~~~ngytlPs~~~sL~aG~s~tFgyI~~ 615 (629)
T PLN02171 556 RYSTTVTNRSAKTLKELHLGISKLYGPLWGLTKAGYGYVLPSWMPSLPAGKSLEFVYVHS 615 (629)
T ss_pred EEEEEEEECCCCceeeeeeeeccccccchheeecCCcccCchhhcccCCCCeeEEEeecC
Confidence 467899999999999999986557677777764 3333344 4899999988777754
No 79
>PRK10378 inactive ferrous ion transporter periplasmic protein EfeO; Provisional
Probab=24.19 E-value=2e+02 Score=27.23 Aligned_cols=63 Identities=13% Similarity=0.177 Sum_probs=37.9
Q ss_pred EeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCce-eeeeeeeCCCCceEEEEEEeeeecceeee
Q psy4599 67 NFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQF-KFVIDRVPPQTNVTHVVIVQPKSYGAFNF 145 (201)
Q Consensus 67 n~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~-s~~~~rI~pg~nvsH~~vv~p~~~G~fn~ 145 (201)
.|.++.++=... ...+.|.|.|..+ ..|+++.|.+ -...+.|.||..-+-+ .+.+.|.|.|
T Consensus 40 ~c~p~~~tVpAG--~~~f~V~N~~~~~-------------~Efe~~~~~~vv~e~EnIaPG~s~~l~---~~L~pGtY~~ 101 (375)
T PRK10378 40 QCEPMTLTVNAG--KTQFIIQNHSQKA-------------LEWEILKGVMVVEERENIAPGFSQKMT---ANLQPGEYDM 101 (375)
T ss_pred ccccCceeeCCC--CEEEEEEeCCCCc-------------ceEEeeccccccccccccCCCCceEEE---EecCCceEEe
Confidence 456666665554 3666777766543 3355444331 1246899999887643 4557898887
Q ss_pred ee
Q psy4599 146 SF 147 (201)
Q Consensus 146 t~ 147 (201)
.-
T Consensus 102 ~C 103 (375)
T PRK10378 102 TC 103 (375)
T ss_pred ec
Confidence 64
No 80
>cd00407 Urease_beta Urease beta-subunit; Urease is a nickel-dependent metalloenzyme that catalyzes the hydrolysis of urea to form ammonia and carbon dioxide. Nickel-dependent ureases are found in bacteria, archaea, fungi and plants. Their primary role is to allow the use of external and internally-generated urea as a nitrogen source. The enzyme consists of three subunits, alpha, beta and gamma, which can exist as separate proteins or can be fused on a single protein chain. The alpha-beta-gamma heterotrimer forms multimers, mainly trimers. The large alpha subunit is the catalytic domain containing an active site with a bi-nickel center complexed by a carbamylated lysine. The beta and gamma subunits play a role in subunit association to form the higher order trimers.
Probab=23.85 E-value=1.8e+02 Score=22.77 Aligned_cols=36 Identities=17% Similarity=0.268 Sum_probs=28.4
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D 101 (201)
.|.+....-.|..-..-+..++.+-|.|+-+ |++..
T Consensus 3 PGei~~~~~~I~lN~gr~~~~l~V~NtGDRp---IQVGS 38 (101)
T cd00407 3 PGEIILKEGDIELNAGREAVTLKVKNTGDRP---IQVGS 38 (101)
T ss_pred CceEEeCCCCeEeCCCCCEEEEEEEeCCCcc---eEEcc
Confidence 4666666777777777788899999999988 77764
No 81
>COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]
Probab=23.41 E-value=2e+02 Score=26.76 Aligned_cols=72 Identities=19% Similarity=0.249 Sum_probs=51.1
Q ss_pred ceEEEEEEEEcCccceeeEEEecCCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeCCCC
Q psy4599 2 DVVVKYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKASENA 81 (201)
Q Consensus 2 d~tV~y~IyNvG~s~A~dV~l~D~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s~~a 81 (201)
++.+++.+-| ...-.+.+.|+ .|++ |..+.|..... -.+.+++. ..+.+.|.+-|.|.+.+..+...+.-.-
T Consensus 69 ~~~v~~~v~~---r~~~~~~~~~~-~~~~-~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~rG~~~~~~v~~~~~~~~gL 140 (416)
T COG1721 69 EVEVTLRVRN---RGRPRLLLVDD-IPPS-FLGVEGTEEVS-LRLGPGER--VAYKVTPLRRGEYRLPPVRVRAEDPFGL 140 (416)
T ss_pred cceeEEEEEe---cCccceEeeec-cCCc-ccccccCccee-eccCCCce--EEEEEecccCCcccccceEEEccCcccc
Confidence 5677888888 33445666664 5655 44665554432 34566666 9999999999999999999999887554
No 82
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=23.17 E-value=4.9e+02 Score=22.42 Aligned_cols=66 Identities=14% Similarity=0.085 Sum_probs=44.9
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE--ecC---CCCCCceEEecCceeeeeeeeCCCCceEEEEE
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV--SDS---SFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVI 134 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l--~D~---sfp~~~F~iv~G~~s~~~~rI~pg~nvsH~~v 134 (201)
.+.+.+.+..|-|-..+.+..+.|.|.|+.+. -|+. .|+ .-+...|.+.-- ..||+||+.-+=.++
T Consensus 18 ~A~i~l~~TRvI~~~~~~~~si~i~N~~~~p~-LvQsWv~~~~~~~~~~~pFivtPP-----l~rl~p~~~q~lRI~ 88 (226)
T PRK15295 18 HASIVVGGTRLVFDGNNDESSINVENKDSKAN-LVQSWLSVVDPQVTNKQAFIITPP-----LFRLDAGQKNSIRVI 88 (226)
T ss_pred cccEEeCceEEEEeCCCceeEEEEEeCCCCcE-EEEEEEeCCCCCCCCCCCEEEcCC-----eEEECCCCceEEEEE
Confidence 36678889999999999999999999999864 4553 221 112334643332 457888877665544
No 83
>PRK13202 ureB urease subunit beta; Reviewed
Probab=23.16 E-value=2.1e+02 Score=22.48 Aligned_cols=36 Identities=19% Similarity=0.298 Sum_probs=27.1
Q ss_pred eeeEEeeeEEEEEEeCC-CCceeEEEecccceeeeEEEec
Q psy4599 63 YGAFNFSFAEVSYKASE-NAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~-~a~~y~IyNvG~s~A~dV~l~D 101 (201)
.|.+...+..|..-... +..++.+-|.|+-+ |++..
T Consensus 3 PGei~~~~~~I~ln~grr~~~~l~V~NtGDRP---IQVGS 39 (104)
T PRK13202 3 PGEIFYGSGDIEMNAAALSRLQMRIINAGDRP---VQVGS 39 (104)
T ss_pred CceEecCCCCEEeCCCCCceEEEEEEeCCCCc---eEEcc
Confidence 46666777777776663 67889999999988 77764
No 84
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=23.15 E-value=5e+02 Score=22.56 Aligned_cols=66 Identities=15% Similarity=0.205 Sum_probs=43.7
Q ss_pred eeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEE-ecC----CCCCC---ceEEecCceeeeeeeeCCCCceEEEEEE
Q psy4599 64 GAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVV-SDS----SFPPE---AFEIQGGQFKFVIDRVPPQTNVTHVVIV 135 (201)
Q Consensus 64 G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l-~D~----sfp~~---~F~iv~G~~s~~~~rI~pg~nvsH~~vv 135 (201)
+-+......|-|..++....++|.|.|+.+. -|+. .|+ +-|++ .|.+.-- ..||+||+.-+=.++.
T Consensus 10 A~v~l~~TRvI~~~~~~~~sv~l~N~~~~p~-LvQsWvd~~~~~~~p~~~~~pFivtPP-----lfrl~~~~~~~lRI~~ 83 (233)
T PRK15246 10 AAVNIDRTRIIFASDDVAQSLTLSNDNTTPM-LLQVWTDAGNIDASPDNSKTPLVALPP-----VFKMQPGELRTLRLLL 83 (233)
T ss_pred EEEEECceEEEEcCCCceEEEEEEeCCCCcE-EEEEEEeCCCCccCcccccCcEEECCc-----ceEECCCCceEEEEEE
Confidence 5577888999999999999999999998853 3443 121 22222 4644433 3477777766655553
No 85
>PRK10378 inactive ferrous ion transporter periplasmic protein EfeO; Provisional
Probab=22.91 E-value=2e+02 Score=27.12 Aligned_cols=38 Identities=13% Similarity=0.191 Sum_probs=21.1
Q ss_pred CCceEecCee-eeeEeeeCCCCceEEEEEEEeceeeeEEeee
Q psy4599 30 EAFEIQGGQF-KFVIDRVPPQTNVTHVVIVQPKSYGAFNFSF 70 (201)
Q Consensus 30 ~~F~iv~G~~-~~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~ 70 (201)
..|+++.|.+ -...|.|.||..-+- +.+.+.|.|.|.=
T Consensus 65 ~Efe~~~~~~vv~e~EnIaPG~s~~l---~~~L~pGtY~~~C 103 (375)
T PRK10378 65 LEWEILKGVMVVEERENIAPGFSQKM---TANLQPGEYDMTC 103 (375)
T ss_pred ceEEeeccccccccccccCCCCceEE---EEecCCceEEeec
Confidence 3455554331 123577888876652 2454677777754
No 86
>cd08678 C2_C21orf25-like C2 domain found in the Human chromosome 21 open reading frame 25 (C21orf25) protein. The members in this cd are named after the Human C21orf25 which contains a single C2 domain. Several other members contain a C1 domain downstream of the C2 domain. No other information on this protein is currently known. The C2 domain was first identified in PKC. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a
Probab=22.87 E-value=3.3e+02 Score=20.37 Aligned_cols=44 Identities=16% Similarity=0.138 Sum_probs=34.4
Q ss_pred cceeeEEEec-CCCCCCCceEecCeeeeeEeeeCCCCceEEEEEEEec
Q psy4599 15 SVATNVVVSD-SSFPPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPK 61 (201)
Q Consensus 15 s~A~dV~l~D-~~fp~~~F~iv~G~~~~~~~ri~pg~nvsH~~vv~p~ 61 (201)
.....|++-| +.+..++| + |........|..+.+..+.+.+.|+
T Consensus 58 ~~~l~~~v~d~~~~~~~~~-l--G~~~i~l~~l~~~~~~~~~~~L~~~ 102 (126)
T cd08678 58 SKELLFEVYDNGKKSDSKF-L--GLAIVPFDELRKNPSGRQIFPLQGR 102 (126)
T ss_pred CCEEEEEEEECCCCCCCce-E--EEEEEeHHHhccCCceeEEEEecCC
Confidence 4568888888 55555555 2 8888899999999988888888876
No 87
>cd05902 Ig_Neurocan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), neurocan. Ig_Neurocan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), neurocan. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, the CSPG aggrecan (not included in this group) forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Unlike aggrecan which is widely distributed in connective tissue and extracellular matrices, neurocan is localized almost exclusively in nervous tissue. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many differen
Probab=22.44 E-value=1.2e+02 Score=23.14 Aligned_cols=23 Identities=4% Similarity=0.102 Sum_probs=17.2
Q ss_pred CeeeeeEeeeCCCCceEEEEEEE
Q psy4599 37 GQFKFVIDRVPPQTNVTHVVIVQ 59 (201)
Q Consensus 37 G~~~~~~~ri~pg~nvsH~~vv~ 59 (201)
..+..+|.++.||.+.....++.
T Consensus 22 ~~~~I~W~k~~~~~~~~~~~v~v 44 (110)
T cd05902 22 EGPRIKWTKLRPASGQKQRPVLV 44 (110)
T ss_pred CCCEEEEEEeCCCCCceEEEEEE
Confidence 46789999999987766655554
No 88
>TIGR03396 PC_PLC phospholipase C, phosphocholine-specific, Pseudomonas-type. Members of this protein family are bacterial, phosphatidylcholine-hydrolyzing phospholipase C enzymes, with a characteristic domain architecture as found in hemolytyic (PlcH) and nonhemolytic (PlcN) secreted enzymes of Pseudomonas aeruginosa. PlcH hydrolyzes phosphatidylcholine to diacylglycerol and phosphocholine, but unlike PlcN can also hydrolyze sphingomyelin to ceramide ((N-acylsphingosine)) and phosphocholine. Members of this family share the twin-arginine signal sequence for Sec-independent transport across the plasma membrane. PlcH is secreted as a heterodimer with a small chaperone, PlcR, encoded immediately downstream.
Probab=21.68 E-value=8.8e+02 Score=24.85 Aligned_cols=132 Identities=20% Similarity=0.268 Sum_probs=79.6
Q ss_pred EEEEEEEEcCccceeeEEEec-CCCC--CCCceEecC-eeeeeEeeeCCCCceEEEEEEEeceee-eEEee---------
Q psy4599 4 VVKYTIYNIGNSVATNVVVSD-SSFP--PEAFEIQGG-QFKFVIDRVPPQTNVTHVVIVQPKSYG-AFNFS--------- 69 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D-~~fp--~~~F~iv~G-~~~~~~~ri~pg~nvsH~~vv~p~~~G-~fn~t--------- 69 (201)
+|+.++-|.|...|. ..+.| .... +-.+.+-.| .++-.|..-.-+...-|--|.=|-..= .|...
T Consensus 516 ~~~l~~~n~g~~ga~-f~vy~~~~~~~~p~~ytv~ag~~l~d~w~~~~~~~g~ydl~v~GPnGf~R~f~G~~~~~~~~~~ 594 (690)
T TIGR03396 516 SLALTFANTGTAGAV-FHVYDRLHLDGIPRRYTVEAGKQLDDSWDAVSDAGGAYDLWVLGPNGFLREFRGNAAAAAGAVP 594 (690)
T ss_pred EEEEEEecCCCcEEE-EEEecCCCCCCCCcceEECCCCEEEEEEeccCCCCCceEEEEECCCceeEEeecccccccCCCC
Confidence 578999999987665 44445 2232 346888777 668889532333334444444465432 22222
Q ss_pred eEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCCCceEEecCceeeee-eeeCCCCceEEEEEEeeeecceeeeeee
Q psy4599 70 FAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPPEAFEIQGGQFKFVI-DRVPPQTNVTHVVIVQPKSYGAFNFSFA 148 (201)
Q Consensus 70 ~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~~~F~iv~G~~s~~~-~rI~pg~nvsH~~vv~p~~~G~fn~t~A 148 (201)
...+.|........++|-|.|.. +..+.+.|+....+ ..| -.|+||..++...-+... .|-|-++-.
T Consensus 595 ~~~~~~d~a~G~L~L~L~N~G~~-a~~ftV~d~~Y~~~----------~pr~ytV~aG~~~~~~w~l~~s-~GWYDLtVt 662 (690)
T TIGR03396 595 EVRVCYDVANGNLYLTLSNAGRS-PVTVTVTDNAYGGA----------GPRTVTVAPGQRVELHWDLSAS-GGWYDFTVT 662 (690)
T ss_pred ceEEEEecCCCEEEEEEEeCCCC-cEEEEEEeCCCCCC----------CCEEEEECCCCEEEEEEeccCC-CCceEEEEE
Confidence 24667777777789999999974 45588887654321 011 356777777776655432 377777654
No 89
>cd08379 C2D_MCTP_PRT_plant C2 domain fourth repeat found in Multiple C2 domain and Transmembrane region Proteins (MCTP); plant subset. MCTPs are involved in Ca2+ signaling at the membrane. Plant-MCTPs are composed of a variable N-terminal sequence, four C2 domains, two transmembrane regions (TMRs), and a short C-terminal sequence. It is one of four protein classes that are anchored to membranes via a transmembrane region; the others being synaptotagmins, extended synaptotagmins, and ferlins. MCTPs are the only membrane-bound C2 domain proteins that contain two functional TMRs. MCTPs are unique in that they bind Ca2+ but not phospholipids. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphate
Probab=21.66 E-value=3.4e+02 Score=21.15 Aligned_cols=44 Identities=14% Similarity=0.166 Sum_probs=32.7
Q ss_pred eeeEEEecC-CC------CCCceEEecCceeeeeeeeCCCCceEEEEEEeeeec
Q psy4599 94 ATNVVVSDS-SF------PPEAFEIQGGQFKFVIDRVPPQTNVTHVVIVQPKSY 140 (201)
Q Consensus 94 A~dV~l~D~-sf------p~~~F~iv~G~~s~~~~rI~pg~nvsH~~vv~p~~~ 140 (201)
.+.|++-|. .+ -.+.|. |...+....+.+|..+.+.+.|++.+.
T Consensus 64 ~l~v~V~d~d~~~~~~~~~~dd~l---G~~~i~l~~l~~~~~~~~~~~L~~~~~ 114 (126)
T cd08379 64 VLTVGVFDNSQSHWKEAVQPDVLI---GKVRIRLSTLEDDRVYAHSYPLLSLNP 114 (126)
T ss_pred EEEEEEEECCCccccccCCCCceE---EEEEEEHHHccCCCEEeeEEEeEeCCC
Confidence 566777663 32 256664 888888889999999999999996653
No 90
>PRK13204 ureB urease subunit beta; Reviewed
Probab=21.30 E-value=2e+02 Score=24.16 Aligned_cols=37 Identities=11% Similarity=0.273 Sum_probs=30.7
Q ss_pred eeeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec
Q psy4599 62 SYGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 62 ~~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D 101 (201)
..|.+.+..-.|..-..-...++.+-|.|+-+ |++..
T Consensus 25 ~pGei~~~~~~I~lN~gr~~~~l~V~NtGDRP---IQVGS 61 (159)
T PRK13204 25 PVGGYVLAKDPIEINQGRPRTTLTVRNTGDRP---IQIGS 61 (159)
T ss_pred CCCeEEeCCCCeEeCCCCcEEEEEEEeCCCCc---eEecc
Confidence 46777777888888888888999999999988 78764
No 91
>PF06159 DUF974: Protein of unknown function (DUF974); InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins.
Probab=21.06 E-value=5.6e+02 Score=22.34 Aligned_cols=74 Identities=16% Similarity=0.247 Sum_probs=50.5
Q ss_pred EEEEEEEEcCccceeeEEEecCCC-CCC--CceEecCeee-eeEeeeCCCCceEEEEEEEeceeeeEEeeeEEEEEEeC
Q psy4599 4 VVKYTIYNIGNSVATNVVVSDSSF-PPE--AFEIQGGQFK-FVIDRVPPQTNVTHVVIVQPKSYGAFNFSFAEVSYKAS 78 (201)
Q Consensus 4 tV~y~IyNvG~s~A~dV~l~D~~f-p~~--~F~iv~G~~~-~~~~ri~pg~nvsH~~vv~p~~~G~fn~t~A~VtY~~s 78 (201)
...+.+-|--+.+..+|.+.=+=- |.. .+.+...... ..-..|+||+++.-.+--.=+..|.+.+ ...|+|...
T Consensus 17 ~~~l~~~N~s~~~v~~v~ikvemqT~s~~~r~~L~~~~~~~~~~~~L~p~~~l~~iv~~~lkE~G~h~L-~c~VsY~~~ 94 (249)
T PF06159_consen 17 SCYLSVNNDSNKPVRNVRIKVEMQTPSQSLRLPLSDNENSDSPVASLAPGESLDFIVSHELKELGNHTL-VCTVSYTDP 94 (249)
T ss_pred EEEEEeecCCCCceEEeEEEEEEeCCCCCccccCCCCccccccccccCCCCeEeEEEEEEeeecCceEE-EEEEEEecC
Confidence 455677787777777775543211 122 4544443332 3466799999999888888899999988 567888887
No 92
>cd05901 Ig_Versican Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), versican. Ig_Versican: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), versican. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, the CSPG aggrecan (not included in this group) forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Like aggrecan, versican has a wide distribution in connective tissue and extracellular matrices. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA
Probab=20.62 E-value=3.7e+02 Score=21.06 Aligned_cols=55 Identities=5% Similarity=0.062 Sum_probs=34.4
Q ss_pred ecCeeeeeEeeeCC---CCceEEEEEEEecee-----eeEEeeeEEEEEEeCCCCceeEEEec
Q psy4599 35 QGGQFKFVIDRVPP---QTNVTHVVIVQPKSY-----GAFNFSFAEVSYKASENAEDYTIYNI 89 (201)
Q Consensus 35 v~G~~~~~~~ri~p---g~nvsH~~vv~p~~~-----G~fn~t~A~VtY~~s~~a~~y~IyNv 89 (201)
-.+.+..+|-++.. |.+.....++..... +.|..=....++......+...|-|+
T Consensus 24 ~~~~~rIkW~~~~~~~~g~~~~e~~Vlv~~~g~~~~~~~y~gRv~l~~~~~~~g~asL~i~~v 86 (117)
T cd05901 24 TSEFLRIKWTKIEVDKNGKDHKETTVLVAQNGIIKIGQEYMGRVSVPSHPEDQGDASLTIVKL 86 (117)
T ss_pred CCCCcEEEEEEecccccccCCCccEEEEEECCEEEECCcccCcEEeeccCCcCCceEEEEccc
Confidence 45678999999954 344455666554443 55555555555666666677777774
No 93
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=20.22 E-value=5.8e+02 Score=22.15 Aligned_cols=67 Identities=10% Similarity=0.055 Sum_probs=44.6
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEecCCCCC----CceEEecCceeeeeeeeCCCCceEEEEEE
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSDSSFPP----EAFEIQGGQFKFVIDRVPPQTNVTHVVIV 135 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D~sfp~----~~F~iv~G~~s~~~~rI~pg~nvsH~~vv 135 (201)
.+-+.+....|-|-..+.+..++|.|.|+.+ +-|+.--+.+.. ..|.+.- -..||+||+.-+=.++.
T Consensus 21 ~A~v~l~~TRvIy~~~~~~~si~i~N~~~~p-~LvQswv~~~~~~~~~~pFivtP-----Plfrl~p~~~q~lRI~~ 91 (229)
T PRK15211 21 MAAFVLNGTRFIYDEGRKNISFEVTNQADQT-YGGQVWIDNTTQGSSTVYMVPAP-----PFFKVRPKEKQIIRIMK 91 (229)
T ss_pred eEEEEECceEEEEcCCCceEEEEEEeCCCCc-EEEEEEEecCCCCCccCCEEEcC-----CeEEECCCCceEEEEEE
Confidence 3558889999999999999999999999986 444432222222 1354332 24688888776655553
No 94
>PRK13201 ureB urease subunit beta; Reviewed
Probab=20.14 E-value=2.3e+02 Score=23.22 Aligned_cols=36 Identities=17% Similarity=0.152 Sum_probs=28.4
Q ss_pred eeeEEeeeEEEEEEeCCCCceeEEEecccceeeeEEEec
Q psy4599 63 YGAFNFSFAEVSYKASENAEDYTIYNIGNSVATNVVVSD 101 (201)
Q Consensus 63 ~G~fn~t~A~VtY~~s~~a~~y~IyNvG~s~A~dV~l~D 101 (201)
.|.+....-.|..-..-...++.+-|.|+-| ||+..
T Consensus 3 PGei~~~~~~I~lN~gr~~~~l~V~NtGDRP---IQVGS 38 (136)
T PRK13201 3 PGEIITKSTEVEINNHHPETVIEVENTGDRP---IQVGS 38 (136)
T ss_pred CceEecCCCCeEeCCCCCEEEEEEEeCCCcc---eEecc
Confidence 4666666677777777788899999999988 78764
Done!