Query psy8027
Match_columns 250
No_of_seqs 99 out of 105
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 16:33:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8027.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8027hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF09066 B2-adapt-app_C: Beta2 99.9 9.6E-27 2.1E-31 185.1 12.1 111 128-247 1-114 (114)
2 KOG1061|consensus 99.1 7.9E-12 1.7E-16 125.8 -2.0 107 18-148 628-734 (734)
3 smart00809 Alpha_adaptinC2 Ada 98.6 1.2E-07 2.6E-12 73.4 7.4 73 14-94 3-75 (104)
4 PF02883 Alpha_adaptinC2: Adap 98.2 1.2E-05 2.5E-10 63.5 9.4 92 9-113 4-100 (115)
5 PF14796 AP3B1_C: Clathrin-ada 97.7 0.00036 7.8E-09 58.7 9.9 89 2-94 52-142 (145)
6 PF14807 AP4E_app_platf: Adapt 96.5 0.059 1.3E-06 43.0 10.9 97 133-246 4-103 (104)
7 PF02296 Alpha_adaptin_C: Alph 95.0 0.34 7.4E-06 39.0 9.9 103 130-240 3-112 (113)
8 KOG1060|consensus 94.8 0.45 9.7E-06 49.9 12.5 177 8-235 780-957 (968)
9 KOG1077|consensus 92.7 2 4.3E-05 44.9 12.5 187 39-242 734-934 (938)
10 PF10633 NPCBM_assoc: NPCBM-as 89.4 0.89 1.9E-05 33.4 4.9 64 35-100 4-68 (78)
11 PF00635 Motile_Sperm: MSP (Ma 88.3 3 6.6E-05 31.7 7.5 88 35-145 17-106 (109)
12 PF14806 Coatomer_b_Cpla: Coat 82.0 16 0.00036 30.2 9.3 94 129-234 18-120 (129)
13 PF04744 Monooxygenase_B: Mono 79.2 6 0.00013 38.3 6.6 57 35-91 262-334 (381)
14 PRK15192 fimbrial chaperone Bc 74.2 20 0.00043 32.4 8.3 89 16-118 22-140 (234)
15 PRK15211 fimbrial chaperone pr 71.3 25 0.00054 31.6 8.2 94 13-120 19-138 (229)
16 KOG1078|consensus 71.2 20 0.00043 38.0 8.3 192 35-242 648-864 (865)
17 PRK15295 fimbrial assembly cha 71.0 25 0.00054 31.4 8.1 95 14-122 17-138 (226)
18 PRK11385 putativi pili assembl 70.5 24 0.00052 31.8 8.0 92 13-118 23-145 (236)
19 PF07705 CARDB: CARDB; InterP 70.2 6.4 0.00014 29.0 3.6 55 35-93 18-72 (101)
20 PF05506 DUF756: Domain of unk 68.5 23 0.0005 26.6 6.4 61 17-93 8-68 (89)
21 PF11614 FixG_C: IG-like fold 66.4 28 0.00061 27.2 6.8 71 35-110 28-101 (118)
22 PRK15218 fimbrial chaperone pr 63.2 38 0.00083 30.3 7.7 91 14-119 16-138 (226)
23 PRK09918 putative fimbrial cha 62.6 35 0.00076 30.4 7.4 93 14-120 22-136 (230)
24 PRK15274 putative periplasmic 58.9 47 0.001 30.5 7.6 93 15-122 25-145 (257)
25 PRK15249 fimbrial chaperone pr 55.0 37 0.00081 30.8 6.3 98 9-120 21-152 (253)
26 PRK15253 putative fimbrial ass 54.6 64 0.0014 29.3 7.7 90 15-119 32-153 (242)
27 PRK15224 pili assembly chapero 53.9 59 0.0013 29.4 7.4 92 13-119 25-148 (237)
28 PRK15254 fimbrial chaperone pr 53.9 70 0.0015 29.0 7.8 97 13-120 13-133 (239)
29 PRK15246 fimbrial assembly cha 52.6 78 0.0017 28.5 7.9 92 15-120 9-133 (233)
30 PF09478 CBM49: Carbohydrate b 51.1 27 0.00058 25.9 4.0 47 39-85 20-74 (80)
31 PF07718 Coatamer_beta_C: Coat 50.9 69 0.0015 27.0 6.7 66 18-92 58-123 (140)
32 PF14874 PapD-like: Flagellar- 50.2 1.1E+02 0.0023 23.0 8.4 51 35-91 19-72 (102)
33 PF05117 DUF695: Family of unk 49.1 1E+02 0.0022 24.8 7.4 52 129-206 48-100 (136)
34 KOG1058|consensus 45.6 2.3E+02 0.005 30.5 10.8 179 35-235 733-931 (948)
35 PRK15188 fimbrial chaperone pr 45.3 1.2E+02 0.0026 27.2 7.9 95 13-118 24-142 (228)
36 PF00927 Transglut_C: Transglu 43.8 21 0.00046 27.4 2.5 60 35-94 14-78 (107)
37 TIGR03079 CH4_NH3mon_ox_B meth 40.8 66 0.0014 31.4 5.7 57 35-91 281-353 (399)
38 PRK15285 putative fimbrial cha 39.3 1.5E+02 0.0034 26.9 7.7 89 15-118 24-140 (250)
39 PRK09926 putative chaperone pr 39.1 1.4E+02 0.0031 26.8 7.5 99 11-118 20-147 (246)
40 COG1470 Predicted membrane pro 38.9 1.1E+02 0.0025 30.8 7.1 73 35-111 396-469 (513)
41 PF11906 DUF3426: Protein of u 37.1 2E+02 0.0043 23.3 7.5 78 14-94 49-138 (149)
42 KOG0037|consensus 32.8 17 0.00038 32.8 0.5 49 129-184 106-155 (221)
43 cd09269 deoxyribose_mutarotase 27.1 92 0.002 28.9 4.3 41 33-75 115-155 (293)
44 PF13200 DUF4015: Putative gly 26.8 1.2E+02 0.0026 28.7 5.0 56 101-190 26-81 (316)
45 PF08563 P53_TAD: P53 transact 26.5 24 0.00052 21.3 0.2 18 131-148 7-24 (25)
46 PF11611 DUF4352: Domain of un 26.2 1.2E+02 0.0025 23.2 4.1 57 34-90 34-99 (123)
47 PF07610 DUF1573: Protein of u 25.2 1.2E+02 0.0025 20.0 3.4 42 42-88 2-43 (45)
48 PF03173 CHB_HEX: Putative car 22.8 1.1E+02 0.0024 26.1 3.7 48 38-88 32-100 (164)
49 PF04597 Ribophorin_I: Ribopho 21.6 2.6E+02 0.0056 27.4 6.4 99 17-130 10-137 (432)
50 PHA03001 putative virion core 21.4 77 0.0017 26.4 2.3 15 105-120 54-68 (132)
51 PF13833 EF-hand_8: EF-hand do 21.3 51 0.0011 21.8 1.0 17 129-145 37-53 (54)
52 smart00054 EFh EF-hand, calciu 21.2 66 0.0014 16.8 1.4 17 129-145 12-28 (29)
53 PRK15233 putative fimbrial cha 20.3 4.4E+02 0.0096 24.0 7.2 90 14-118 38-161 (246)
No 1
>PF09066 B2-adapt-app_C: Beta2-adaptin appendage, C-terminal sub-domain; InterPro: IPR015151 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. This entry represents a subdomain of the appendage (ear) domain of beta-adaptin from AP clathrin adaptor complexes. This domain has a three-layer arrangement, alpha-beta-alpha, with a bifurcated antiparallel beta-sheet []. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15) []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 1E42_B 2G30_A 2IV9_B 2IV8_A 3HS9_A 3H1Z_A.
Probab=99.94 E-value=9.6e-27 Score=185.08 Aligned_cols=111 Identities=39% Similarity=0.740 Sum_probs=90.8
Q ss_pred eccCCcccHHHHHhhhccCCCCc--cCccceeeeeeeEEecccccCHHHHHHHHhhCCeEEEeecccCC-cceEEEEEEe
Q psy8027 128 FMEDGEMERKTFLTTWKEIPTQN--EDGNAIFCCQVQFTLTNIALGADAIVSRMKQNNVFTIAKRNVEG-KDMLYQSLRL 204 (250)
Q Consensus 128 f~e~G~m~r~~Fl~~WksiP~~n--E~~~~~~~~~~~~~i~~~~~~~d~i~~kL~~~Nif~VAkr~~~g-q~~lY~S~k~ 204 (250)
|.|||+|||++|+++|++||+++ | .++.++....++|++.++|+++|||+||+|.+++ ++++|||+|+
T Consensus 1 f~~d~~~~~~~F~~~W~sl~~~~~~e---------~~~~~~~~~~~~~~i~~~L~~~nI~~iA~~~~~~~~~~~y~s~~~ 71 (114)
T PF09066_consen 1 FVEDGSMDPEEFQEMWKSLPDSNQQE---------LSIQLNASVPSPDAIEEKLQANNIFTIASGKVDNGQKFFYFSAKT 71 (114)
T ss_dssp B-TT----HHHHHHHHHHS-GGG--E---------EEEEETT----HHHHHHHHHCTT-EEEEEEECTT-EEEEEEEEEB
T ss_pred CCCCCccCHHHHHHHHHhCCcccceE---------EEEeccccCCcHHHHHHHHHHCCEEEEecCCCCccccEEEEEEEc
Confidence 68999999999999999999999 5 5566653457999999999999999999999885 9999999999
Q ss_pred eCCcEEEEEEeeeCCCCceEEEEeeCCcchhHHHHHHHHHHhc
Q psy8027 205 TNGIWVLNELKIQPGDPSVTLSLKSRSSDVAQDIFHAYDAILH 247 (250)
Q Consensus 205 ~n~i~~L~El~~~~~~~~~~~~lKs~~~~~~pli~~~~~~il~ 247 (250)
++|+++|+|++++++++.+++++||++++++|++.+++++||+
T Consensus 72 ~~~~~fL~El~~~~~~~~~~v~vK~~~~~~~~~f~~~~~~iL~ 114 (114)
T PF09066_consen 72 TNGIWFLVELTIDPGSPSVKVTVKSENPEMAPLFLQLFESILK 114 (114)
T ss_dssp TTS-EEEEEEEE-TT-SSEEEEEEESSCCCHHHHHHHHHHHCC
T ss_pred CCCcEEEEEEEEcCCCccEEEEEecCCHHHHHHHHHHHHHHhC
Confidence 9999999999999999999999999999999999999999985
No 2
>KOG1061|consensus
Probab=99.09 E-value=7.9e-12 Score=125.80 Aligned_cols=107 Identities=46% Similarity=0.740 Sum_probs=98.6
Q ss_pred eEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeeecCCCC
Q psy8027 18 MEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMATTGAVQ 97 (250)
Q Consensus 18 LeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~g~~~ 97 (250)
+|++++|.|+ .+.+++.+.++|++.+...+|++||| | +|. .....|..|+++...++|+.+.|++.
T Consensus 628 ~e~~~~~t~~-------~~~~~~~~~~~~~~~~~~~~~~~~f~----~-a~s--~~~~~P~~~~~~~~~~~~l~~~~~~~ 693 (734)
T KOG1061|consen 628 LEVSSQFTRK-------EGELVIYMKFTNKANSIRIDFEIQFN----G-APS--LANSKPLLPNGKAVDSLPLGTFGLMR 693 (734)
T ss_pred hhhhcceecc-------cccccccccccccccchhhhhHhhcC----C-CCc--ccCCCCCccccchhhccCcchhhhhc
Confidence 6899999999 89999999999999999999999999 6 443 44345899999999999999999999
Q ss_pred CCCCCCceEEEeecCCcEEEEEecccceEEeccCCcccHHHHHhhhccCCC
Q psy8027 98 RMEPLTNLQVAVKNNLDVFYFACIVPINVYFMEDGEMERKTFLTTWKEIPT 148 (250)
Q Consensus 98 ~~~p~~~LQVAIKnN~dVfYF~~~ipl~vlf~e~G~m~r~~Fl~~WksiP~ 148 (250)
+++|..++|+|+|+| .+..+++.+|.++|+| |+++|+.||+
T Consensus 694 ~~~~~~~~q~~~~~~------~~~~~~~~~~v~~~~~----~~~t~~~~~~ 734 (734)
T KOG1061|consen 694 PMEPLSNLQEAVKNN------KALNMLKTLFVEDGSM----FLATWKGIPN 734 (734)
T ss_pred cCCCcchHHHHHhch------HhhccchhHHHHHHHH----HHHhhccCCC
Confidence 999999999999999 7888999999999999 9999999984
No 3
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
Probab=98.63 E-value=1.2e-07 Score=73.40 Aligned_cols=73 Identities=30% Similarity=0.438 Sum_probs=63.4
Q ss_pred CCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeeec
Q psy8027 14 KGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMATT 93 (250)
Q Consensus 14 ~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~ 93 (250)
+..||+|.+.+.|+ ++++.+.+.+.|+++.++++|.+|++++.++....+++.- ..|+||+++...+.+...
T Consensus 3 ~~~~l~I~~~~~~~-------~~~~~i~~~~~N~s~~~it~f~~~~avpk~~~l~l~~~s~-~~l~p~~~i~q~~~i~~~ 74 (104)
T smart00809 3 EKNGLQIGFKFERR-------PGLIRITLTFTNKSPSPITNFSFQAAVPKSLKLQLQPPSS-PTLPPGGQITQVLKVENP 74 (104)
T ss_pred cCCCEEEEEEEEcC-------CCeEEEEEEEEeCCCCeeeeEEEEEEcccceEEEEcCCCC-CccCCCCCEEEEEEEECC
Confidence 46799999999998 8899999999999999999999999999888887776654 379999988888887655
Q ss_pred C
Q psy8027 94 G 94 (250)
Q Consensus 94 g 94 (250)
+
T Consensus 75 ~ 75 (104)
T smart00809 75 G 75 (104)
T ss_pred C
Confidence 4
No 4
>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. GGAs (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) are a family of monomeric clathrin adaptor proteins that are conserved from yeasts to humans. GGAs regulate clathrin-mediated the transport of proteins (such as mannose 6-phosphate receptors) from the TGN to endosomes and lysosomes through interactions with TGN-sorting receptors, sometimes in conjunction with AP-1 [, ]. GGAs bind cargo, membranes, clathrin and accessory factors. GGA1, GGA2 and GGA3 all contain a domain homologous to the ear domain of gamma-adaptin. GGAs are composed of a single polypeptide with four domains: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The VHS domain is responsible for endocytosis and signal transduction, recognising transmembrane cargo through the ACLL sequence in the cytoplasmic domains of sorting receptors []. The GAT domain (also found in Tom1 proteins) interacts with ARF (ADP-ribosylation factor) to regulate membrane trafficking [], and with ubiquitin for receptor sorting []. The hinge region contains a clathrin box for recognition and binding to clathrin, similar to that found in AP adaptins. The GAE domain is similar to the AP gamma-adaptin ear domain, and is responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. This entry represents a beta-sandwich structural motif found in the appendage (ear) domain of alpha-, beta- and gamma-adaptin from AP clathrin adaptor complexes, and the GAE (gamma-adaptin ear) domain of GGA adaptor proteins. These domains have an immunoglobulin-like beta-sandwich fold containing 7 or 8 strands in 2 beta-sheets in a Greek key topology [, ]. Although these domains share a similar fold, there is little sequence identity between the alpha/beta-adaptins and gamma-adaptin/GAE. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 3MNM_B 3ZY7_B 1GYU_A 1GYW_B 2A7B_A 1GYV_A 2E9G_A 1E42_B 2G30_A 2IV9_B ....
Probab=98.21 E-value=1.2e-05 Score=63.52 Aligned_cols=92 Identities=21% Similarity=0.286 Sum_probs=63.2
Q ss_pred eeecCCCcceEEEEEEec--CCCCCCCCCCeEEEEEEEEeCCCCccchhheec---ccCccccccCCCCCCCccCCCCCe
Q psy8027 9 WLPAEKGKGMEIWGTFSR--KPNPATPGIGVIEMELTFTNKSMAPMANFAIQV---NKNSFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 9 ~l~a~~~~GLeI~g~f~r--~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf---NkNsFGl~P~~~l~~p~~l~pgqs 83 (250)
|+-.+.+ |+|.-.+.+ + ++++.+.|.++|++..+|++|.+|+ +...+.+.|.+. ..|+|++.
T Consensus 4 ~~ye~~~--l~I~~~~~~~~~-------~~~~~i~~~f~N~s~~~it~f~~q~avpk~~~l~l~~~s~----~~i~p~~~ 70 (115)
T PF02883_consen 4 VLYEDNG--LQIGFKSEKSPN-------PNQGRIKLTFGNKSSQPITNFSFQAAVPKSFKLQLQPPSS----STIPPGQQ 70 (115)
T ss_dssp EEEEETT--EEEEEEEEECCE-------TTEEEEEEEEEE-SSS-BEEEEEEEEEBTTSEEEEEESS-----SSB-TTTE
T ss_pred EEEeCCC--EEEEEEEEecCC-------CCEEEEEEEEEECCCCCcceEEEEEEeccccEEEEeCCCC----CeeCCCCe
Confidence 4444444 555555544 6 8999999999999999999999999 788888888753 35667898
Q ss_pred EEEEEeeeecCCCCCCCCCCceEEEeecCC
Q psy8027 84 VEASLPMATTGAVQRMEPLTNLQVAVKNNL 113 (250)
Q Consensus 84 ~~~~lpl~~~g~~~~~~p~~~LQVAIKnN~ 113 (250)
+...+.+...+...+..+...+++.|+.+.
T Consensus 71 i~Q~~~v~~~~~~~~~~~~l~~~~~vsy~~ 100 (115)
T PF02883_consen 71 ITQVIKVENSPFSEPTPKPLKPRLRVSYNV 100 (115)
T ss_dssp EEEEEEEEESS-BSTTSSTTEEEEEEEEEE
T ss_pred EEEEEEEEEeecccCCCCCcCeEEEEEEEE
Confidence 888888877222333444456777777543
No 5
>PF14796 AP3B1_C: Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=97.70 E-value=0.00036 Score=58.73 Aligned_cols=89 Identities=31% Similarity=0.470 Sum_probs=70.9
Q ss_pred CCCCce-eeeecCCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccC-ccccccCCCCCCCccCC
Q psy8027 2 YTPPKQ-CWLPAEKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKN-SFGLTPARPLQVMSPLL 79 (250)
Q Consensus 2 ~~~p~~-~~l~a~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkN-sFGl~P~~~l~~p~~l~ 79 (250)
|+++|. .+|....|+||.++=+|.|+ |......-+.++++|+|++..++.+..|- +|| .=|+.......++ .|.
T Consensus 52 ~v~~k~~eLL~~v~G~GL~v~Y~F~Rq--P~~~s~~mvsIql~ftN~s~~~i~~I~i~-~k~l~~g~~i~~F~~I~-~L~ 127 (145)
T PF14796_consen 52 FVPPKKYELLNRVNGKGLSVEYRFSRQ--PSLYSPSMVSIQLTFTNNSDEPIKNIHIG-EKKLPAGMRIHEFPEIE-SLE 127 (145)
T ss_pred ccCcceEEeeeccCCCceeEEEEEccC--CcCCCCCcEEEEEEEEecCCCeecceEEC-CCCCCCCcEeeccCccc-ccC
Confidence 556644 68888999999999999998 55566677789999999999999998876 555 5566555544455 799
Q ss_pred CCCeEEEEEeeeecC
Q psy8027 80 PSLSVEASLPMATTG 94 (250)
Q Consensus 80 pgqs~~~~lpl~~~g 94 (250)
||+++.+.+=+.++.
T Consensus 128 pg~s~t~~lgIDF~D 142 (145)
T PF14796_consen 128 PGASVTVSLGIDFND 142 (145)
T ss_pred CCCeEEEEEEEeccc
Confidence 999999999887764
No 6
>PF14807 AP4E_app_platf: Adaptin AP4 complex epsilon appendage platform
Probab=96.45 E-value=0.059 Score=43.03 Aligned_cols=97 Identities=13% Similarity=0.215 Sum_probs=71.3
Q ss_pred cccHHHHHhhhccCCCCccCccceeeeeeeEEec-ccccCHHHHHHHHh-hCCeEEEeecccCCcceEEEEEEeeCCcE-
Q psy8027 133 EMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLT-NIALGADAIVSRMK-QNNVFTIAKRNVEGKDMLYQSLRLTNGIW- 209 (250)
Q Consensus 133 ~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~-~~~~~~d~i~~kL~-~~Nif~VAkr~~~gq~~lY~S~k~~n~i~- 209 (250)
+|+-++|-++|.+.+++.+ .+++ ....+.+.+.++|. +-++-.|-.= |+| ..+++++.++..
T Consensus 4 ~isTeeFG~~W~s~~~e~k-----------~~l~~~~~~t~~~~l~~l~~~l~lh~VevI---g~E-~I~A~~ll~~~~~ 68 (104)
T PF14807_consen 4 QISTEEFGQLWLSFSNERK-----------QNLPSSSQRTLPEFLQRLQQKLRLHVVEVI---GNE-GIFACQLLNSSPV 68 (104)
T ss_pred cccHHHHHHHHHcCCCeEE-----------EeccccCcCCHHHHHHHHHHhcCceEEEEe---Ccc-ceeeeeccCCCCe
Confidence 4788999999999966655 3443 33467777777876 5777666544 343 456777777776
Q ss_pred EEEEEeeeCCCCceEEEEeeCCcchhHHHHHHHHHHh
Q psy8027 210 VLNELKIQPGDPSVTLSLKSRSSDVAQDIFHAYDAIL 246 (250)
Q Consensus 210 ~L~El~~~~~~~~~~~~lKs~~~~~~pli~~~~~~il 246 (250)
+|+-.++ +.+.+.+.+|++++.+...+......++
T Consensus 69 ~L~H~~~--~~~~l~l~vrs~~~~l~d~ll~~~~~~~ 103 (104)
T PF14807_consen 69 CLLHCRV--NAGTLDLWVRSSDSPLTDCLLYQCQKIL 103 (104)
T ss_pred EEEEEEe--cCCeEEEEEEcCCCCcHHHHHHHHHHHh
Confidence 9999987 3338899999999999888888777765
No 7
>PF02296 Alpha_adaptin_C: Alpha adaptin AP2, C-terminal domain; InterPro: IPR003164 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. AP adaptor alpha-adaptin can be divided into a trunk domain and the C-terminal appendage domain (or ear domain), separated by a linker region. The C-terminal appendage domain regulates translocation of endocytic accessory proteins to the bud site []. This entry represents a subdomain of the appendage (ear) domain of alpha-adaptin from AP clathrin adaptor complexes. This domain has a three-layer arrangement, alpha-beta-alpha, with a bifurcated antiparallel beta-sheet []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 1KYF_A 1B9K_A 1QTS_A 1KYU_A 2VJ0_A 1KY6_A 1W80_A 3HS8_A 1KYD_A 1QTP_A ....
Probab=94.96 E-value=0.34 Score=38.99 Aligned_cols=103 Identities=13% Similarity=0.246 Sum_probs=61.9
Q ss_pred cCCcccHHHHHhhhccCCCC-ccCccceeeeeeeEEec-cc-ccCHHHHHHHHhhCCeEEEeecccCCcceEEEE--EEe
Q psy8027 130 EDGEMERKTFLTTWKEIPTQ-NEDGNAIFCCQVQFTLT-NI-ALGADAIVSRMKQNNVFTIAKRNVEGKDMLYQS--LRL 204 (250)
Q Consensus 130 e~G~m~r~~Fl~~WksiP~~-nE~~~~~~~~~~~~~i~-~~-~~~~d~i~~kL~~~Nif~VAkr~~~gq~~lY~S--~k~ 204 (250)
+--.|+.++|.+-||.|... .| .+ ..+.+. .. .++.+.+.++|...++-.+- .-+.+.+.++-+ +.+
T Consensus 3 ~p~~l~~~~Ff~RWkql~~~~~E-~Q------~vf~~~~~~~~~~~~~~~~~l~g~~~~vl~-~vDpnp~n~v~Agi~~t 74 (113)
T PF02296_consen 3 EPTTLSSEDFFQRWKQLGGPPQE-AQ------EVFKLKDAKRPMDLESIRRKLEGFGFAVLD-GVDPNPNNIVGAGIFHT 74 (113)
T ss_dssp EE----HHHHHHHHTTT-SGGGE-EE------EEEE----SS---HHHHHHHHHHHTSEEET-SSSSSTTSEEEEEEEE-
T ss_pred CCccCCHHHHHHHHHhccCCccc-cE------EEEcccccCCcccHHHHHHHHhcCCeEecC-CCCCCcccEEEEEEEEe
Confidence 44568999999999999653 34 22 334433 22 47888999999988875542 223334444444 223
Q ss_pred e--CCcEEEEEEeeeCCCCceEEEEeeCCcchhHHHHH
Q psy8027 205 T--NGIWVLNELKIQPGDPSVTLSLKSRSSDVAQDIFH 240 (250)
Q Consensus 205 ~--n~i~~L~El~~~~~~~~~~~~lKs~~~~~~pli~~ 240 (250)
. .++-+|+-|.-......+++.+||.+.+++..+.+
T Consensus 75 ~~~g~~gcLlRlE~n~~~~~~RlTvRst~~~vs~~l~~ 112 (113)
T PF02296_consen 75 KSSGNVGCLLRLEPNQDAKMFRLTVRSTDPSVSKALCK 112 (113)
T ss_dssp S-S-EEEEEEEEEEETTTTEEEEEEEESSHHHHHHHHH
T ss_pred cCCCcEEEEEEEeECCcCCeEEEEEEECChhHHHHHhc
Confidence 2 44678888877666778899999999988776654
No 8
>KOG1060|consensus
Probab=94.82 E-value=0.45 Score=49.86 Aligned_cols=177 Identities=20% Similarity=0.214 Sum_probs=113.3
Q ss_pred eeeecCCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEE
Q psy8027 8 CWLPAEKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEAS 87 (250)
Q Consensus 8 ~~l~a~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~ 87 (250)
..|....|.|+-++=.|.|. ++ ..+.+.++|++-..+.+..+ |=+||+.......+ ..++||+++.+.
T Consensus 780 ell~~~~g~gl~~~y~f~r~-------~~-~~i~~~~~n~~~~~~~~~~l---~~p~gm~i~ef~~i-~s~~pg~~~~~~ 847 (968)
T KOG1060|consen 780 ELLNEVEGSGLDLEYSFSRL-------PD-VSISLHFTNKSDLELLGIHL---KLPAGMSIKEFSPI-ESLPPGASASVV 847 (968)
T ss_pred hhhhhcccCCcceeeeccCC-------CC-eeEEEecccCCCccccccee---eccccccccccccc-cccCCCcceeee
Confidence 46778899999999999998 44 88999999999999999888 88999988765544 379999999999
Q ss_pred EeeeecCCCCCCCCCCceEEEeecCCcEEEEEecccceEEeccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEecc
Q psy8027 88 LPMATTGAVQRMEPLTNLQVAVKNNLDVFYFACIVPINVYFMEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTN 167 (250)
Q Consensus 88 lpl~~~g~~~~~~p~~~LQVAIKnN~dVfYF~~~ipl~vlf~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~ 167 (250)
+-+.+....+..+ .|+- +..+ +++..-|.--+..+. .|++.+|..---.+..-|| .+..+.+
T Consensus 848 ~~i~F~dst~~~~----~~l~--~~~g--~~~~~~pvge~~~~v-~~~~~~~~~E~~~L~gln~---------~~~~l~~ 909 (968)
T KOG1060|consen 848 LGIDFCDSTQAAE----WQLL--TDDG--RVRFQPPVGELVQPV-RMSEEDFKKERGKLGGLNE---------HVIQLEN 909 (968)
T ss_pred eeeecccccccee----EEEE--eccC--cEEecCchhhhhccc-cCCHHHHHhhhhhhcccch---------hheeeec
Confidence 9887765433222 3443 3444 444444443333332 4566666554444444455 3333322
Q ss_pred cccCHHHHHHHHhhCCeEEEeecccCCcceEEEEEEeeCCc-EEEEEEeeeCCCCceEEEEeeCCcchh
Q psy8027 168 IALGADAIVSRMKQNNVFTIAKRNVEGKDMLYQSLRLTNGI-WVLNELKIQPGDPSVTLSLKSRSSDVA 235 (250)
Q Consensus 168 ~~~~~d~i~~kL~~~Nif~VAkr~~~gq~~lY~S~k~~n~i-~~L~El~~~~~~~~~~~~lKs~~~~~~ 235 (250)
. +...|+-+++. |.. ..|++||-.+- .+|+-|. .. ++.+-|+.+.+.
T Consensus 910 ~----------~~~an~~~~~~----g~~-~rFa~~tlss~~~~llT~~--~k----~l~ince~~ViG 957 (968)
T KOG1060|consen 910 P----------NPSANVLFVPS----GSS-HRFAGQTLSSKSLVLLTVD--EK----TLEINCEKTVIG 957 (968)
T ss_pred c----------cchhhhhcccC----Ccc-eeeeeeeccCCceEEEEee--hh----eeEecchhhhHH
Confidence 2 33344444443 344 88888876554 4554443 22 666677777553
No 9
>KOG1077|consensus
Probab=92.68 E-value=2 Score=44.86 Aligned_cols=187 Identities=14% Similarity=0.257 Sum_probs=111.4
Q ss_pred EEEEEEEeCCCCccchhheecccC---ccccccCCCCCCCccCCCCCeEEEEEeeeecCCCCCCCCCCceEEEeecC-C-
Q psy8027 39 EMELTFTNKSMAPMANFAIQVNKN---SFGLTPARPLQVMSPLLPSLSVEASLPMATTGAVQRMEPLTNLQVAVKNN-L- 113 (250)
Q Consensus 39 ~l~l~itN~s~~~~~~faiQfNkN---sFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~g~~~~~~p~~~LQVAIKnN-~- 113 (250)
.|.|-+.|++..|+++|.=++=.- .|-|+-.. -.++..|+||..++.++-+.+-..+..+ ..|.+--|.+ .
T Consensus 734 rl~LfygNkts~~lt~~s~~ii~~~~~~~~L~~~~-kpv~~ti~~g~qvQQ~~~v~~i~d~~d~---pil~isfk~g~ti 809 (938)
T KOG1077|consen 734 RLYLFYGNKTSVPLTSLSPTIIPPGNLELHLAVQN-KPVTATIPPGAQVQQSLEVSCIRDFEDP---PILAISFKFGGTI 809 (938)
T ss_pred eEEEEecccccccccccceeeecCCchhhhhhhcC-cccCCCCCccceecceeeeeeecccccC---CeEEEEEEeCCch
Confidence 556678999999999987654331 11111000 0123578899877777766554433322 3478888886 3
Q ss_pred cEEE--EEecccceEEeccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEeccc-ccCHHHHHHHHhhCCeEEEe-e
Q psy8027 114 DVFY--FACIVPINVYFMEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNI-ALGADAIVSRMKQNNVFTIA-K 189 (250)
Q Consensus 114 dVfY--F~~~ipl~vlf~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~-~~~~d~i~~kL~~~Nif~VA-k 189 (250)
..+= +..++-++-+|.+ -.|+-++|..-||.+.+...+.+++ ++.. .++...+.++|...+. + .
T Consensus 810 ~~~ta~l~lp~~iskf~~P-t~l~s~~Ff~rWk~ls~~~~~~q~~--------fk~~rpld~~~l~~~l~gf~~---~l~ 877 (938)
T KOG1077|consen 810 NLKTAILKLPVLISKFFQP-TELTSEDFFSRWKQLSGPQQEAQEV--------FKANRPLDAPSLENKLLGFGQ---TLL 877 (938)
T ss_pred hhhhhceechhhHhhhcCc-ccccHHHHHHHHHhcccchhHHHHH--------HhcCCccccHHHHHHHhhhhH---HHh
Confidence 4555 6677778888888 7899999999999998766333333 3333 3666666677654432 2 2
Q ss_pred ccc-CCcceEEEE----EEeeCCcEEEEEEeeeCCCCceEEEEeeCCcchhHHHHHHH
Q psy8027 190 RNV-EGKDMLYQS----LRLTNGIWVLNELKIQPGDPSVTLSLKSRSSDVAQDIFHAY 242 (250)
Q Consensus 190 r~~-~gq~~lY~S----~k~~n~i~~L~El~~~~~~~~~~~~lKs~~~~~~pli~~~~ 242 (250)
+++ .+-+.+.-+ .++. ++-.|+.|.-+.+..-++|.+|+..+++...+.+.+
T Consensus 878 ~~VDpnp~nlv~agii~t~tq-~vgCL~RiEpn~~~~~~rltvrss~~tlak~l~e~l 934 (938)
T KOG1077|consen 878 DNVDPNPSNLVGAGIIHTKSQ-NVGCLLRIEPNAQAQMYRLTVRTSKPTLAKELVELL 934 (938)
T ss_pred ccCCCCccceEEEEEEeecce-eeeeEEEeccCCcceEEEEEEecCCchHHHHHHHHH
Confidence 233 223333333 3333 445665554333333457888888876655554443
No 10
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=89.36 E-value=0.89 Score=33.39 Aligned_cols=64 Identities=25% Similarity=0.321 Sum_probs=39.3
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccCcccccc-CCCCCCCccCCCCCeEEEEEeeeecCCCCCCC
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTP-ARPLQVMSPLLPSLSVEASLPMATTGAVQRME 100 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P-~~~l~~p~~l~pgqs~~~~lpl~~~g~~~~~~ 100 (250)
|....+.++++|....++.+..+.++. +=|..- ..+-.++ .|.||++.++.+.+..-....+++
T Consensus 4 G~~~~~~~tv~N~g~~~~~~v~~~l~~-P~GW~~~~~~~~~~-~l~pG~s~~~~~~V~vp~~a~~G~ 68 (78)
T PF10633_consen 4 GETVTVTLTVTNTGTAPLTNVSLSLSL-PEGWTVSASPASVP-SLPPGESVTVTFTVTVPADAAPGT 68 (78)
T ss_dssp TEEEEEEEEEE--SSS-BSS-EEEEE---TTSE---EEEEE---B-TTSEEEEEEEEEE-TT--SEE
T ss_pred CCEEEEEEEEEECCCCceeeEEEEEeC-CCCccccCCccccc-cCCCCCEEEEEEEEECCCCCCCce
Confidence 677899999999999999999999887 777661 1223344 799999999999887654444343
No 11
>PF00635 Motile_Sperm: MSP (Major sperm protein) domain; InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=88.29 E-value=3 Score=31.74 Aligned_cols=88 Identities=22% Similarity=0.300 Sum_probs=52.3
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccC-ccccccCCCCCCCccCCCCCeEEEEEeeeecCCCCCCCCCCceEEEeecCC
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKN-SFGLTPARPLQVMSPLLPSLSVEASLPMATTGAVQRMEPLTNLQVAVKNNL 113 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkN-sFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~g~~~~~~p~~~LQVAIKnN~ 113 (250)
+....-.|+|+|.+..++ -|.|.-+.+ .|-+.|.. .-|.||++.++.|-+....... ... . .
T Consensus 17 ~~~~~~~l~l~N~s~~~i-~fKiktt~~~~y~v~P~~-----G~i~p~~~~~i~I~~~~~~~~~-~~~-~---------~ 79 (109)
T PF00635_consen 17 NKQQSCELTLTNPSDKPI-AFKIKTTNPNRYRVKPSY-----GIIEPGESVEITITFQPFDFEP-SNK-K---------K 79 (109)
T ss_dssp SS-EEEEEEEEE-SSSEE-EEEEEES-TTTEEEESSE-----EEE-TTEEEEEEEEE-SSSTTT-TST-S---------S
T ss_pred CceEEEEEEEECCCCCcE-EEEEEcCCCceEEecCCC-----EEECCCCEEEEEEEEEecccCC-CCC-C---------C
Confidence 566788999999999874 567777654 68888753 3588999998888664432111 111 1 1
Q ss_pred cEEEEEecccceEEec-cCCcccHHHHHhhhcc
Q psy8027 114 DVFYFACIVPINVYFM-EDGEMERKTFLTTWKE 145 (250)
Q Consensus 114 dVfYF~~~ipl~vlf~-e~G~m~r~~Fl~~Wks 145 (250)
+.+-+..... ++..-.++.|.+.|++
T Consensus 80 ------dkf~I~~~~~~~~~~~~~~~~~~~~~~ 106 (109)
T PF00635_consen 80 ------DKFLIQSIVVPDNATDPKKDFKQIWKN 106 (109)
T ss_dssp ------EEEEEEEEEE-TT-SSSHHHHHCCHHH
T ss_pred ------CEEEEEEEEcCCCccchhhhHHHHHhc
Confidence 1112233344 3444577999999986
No 12
>PF14806 Coatomer_b_Cpla: Coatomer beta subunit appendage platform
Probab=82.01 E-value=16 Score=30.24 Aligned_cols=94 Identities=17% Similarity=0.315 Sum_probs=58.3
Q ss_pred ccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEecccccCHHHHHHHH-hhCCeEEEeecc-cCCcceEE----EEE
Q psy8027 129 MEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNIALGADAIVSRM-KQNNVFTIAKRN-VEGKDMLY----QSL 202 (250)
Q Consensus 129 ~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~~~~~d~i~~kL-~~~Nif~VAkr~-~~gq~~lY----~S~ 202 (250)
..-..++-.+|.++|..+.=+| +| .++....+.....+++ +.-|.-.+.... .+| +--| +.+
T Consensus 18 I~Pa~~~~~~FR~mW~eFEWEN----Ki-------~V~t~~~dl~~yl~~i~k~tnM~~Ltp~~~l~~-~~~fl~~Nlya 85 (129)
T PF14806_consen 18 IKPATCSDEEFRSMWAEFEWEN----KI-------SVNTNITDLREYLDHIMKSTNMKCLTPESALSG-DCGFLSANLYA 85 (129)
T ss_pred cCcccCCHHHHHHHHHhheeee----eE-------EEecCCCCHHHHHHHHHHhcCcceeccccccCC-CCCEEEEEEEE
Confidence 4446688999999999984333 34 3332223555556655 444766666442 222 3333 447
Q ss_pred EeeCCcEEEEEEeeeC-CCCce--EEEEeeCCcch
Q psy8027 203 RLTNGIWVLNELKIQP-GDPSV--TLSLKSRSSDV 234 (250)
Q Consensus 203 k~~n~i~~L~El~~~~-~~~~~--~~~lKs~~~~~ 234 (250)
|+.=|..+|+-|.+.. .++.+ .+-+||+.+.+
T Consensus 86 ~S~fgedaL~Nlsiek~~~~~i~G~vRIRSk~qgi 120 (129)
T PF14806_consen 86 RSIFGEDALANLSIEKQADGKISGHVRIRSKTQGI 120 (129)
T ss_pred EeccCCeeEEEEEEEecCCCeEEEEEEEeeCCcCh
Confidence 8888888999999987 45556 45666666554
No 13
>PF04744 Monooxygenase_B: Monooxygenase subunit B protein; InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=79.24 E-value=6 Score=38.25 Aligned_cols=57 Identities=23% Similarity=0.238 Sum_probs=31.3
Q ss_pred CCeEEEEEEEEeCCCCccchhh-----eec-ccCccccccC--------CCCC--CCccCCCCCeEEEEEeee
Q psy8027 35 IGVIEMELTFTNKSMAPMANFA-----IQV-NKNSFGLTPA--------RPLQ--VMSPLLPSLSVEASLPMA 91 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~fa-----iQf-NkNsFGl~P~--------~~l~--~p~~l~pgqs~~~~lpl~ 91 (250)
+..+.|.|++||++..|+.-=- ++| |+.-+.-.+. ..+. -++||+||||.++++-+.
T Consensus 262 gR~l~~~l~VtN~g~~pv~LgeF~tA~vrFln~~v~~~~~~~P~~l~A~~gL~vs~~~pI~PGETrtl~V~a~ 334 (381)
T PF04744_consen 262 GRTLTMTLTVTNNGDSPVRLGEFNTANVRFLNPDVPTDDPDYPDELLAERGLSVSDNSPIAPGETRTLTVEAQ 334 (381)
T ss_dssp SSEEEEEEEEEEESSS-BEEEEEESSS-EEE-TTT-SS-S---TTTEETT-EEES--S-B-TT-EEEEEEEEE
T ss_pred CcEEEEEEEEEcCCCCceEeeeEEeccEEEeCcccccCCCCCchhhhccCcceeCCCCCcCCCceEEEEEEee
Confidence 8899999999999988765332 333 5543322111 0122 235899999999999764
No 14
>PRK15192 fimbrial chaperone BcfG; Provisional
Probab=74.16 E-value=20 Score=32.41 Aligned_cols=89 Identities=21% Similarity=0.258 Sum_probs=55.9
Q ss_pred cceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecc----------c-----CccccccCCCCCCCccCCC
Q psy8027 16 KGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVN----------K-----NSFGLTPARPLQVMSPLLP 80 (250)
Q Consensus 16 ~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfN----------k-----NsFGl~P~~~l~~p~~l~p 80 (250)
.|+.|.||=.--+ ++.-...+++.|.+..| +.+|-. + -+|=+.|. + ..|.|
T Consensus 22 Agi~l~~TRvIy~------~~~k~~sv~l~N~~~~p---~LvQswv~~~~~w~~~~~~~~~~PFivtPP--l---frl~p 87 (234)
T PRK15192 22 AGVVIGGTRFIYH------AGAPALSVPVSNHSEAS---WLIDTHILPGGRWPGTKNEGNITPFVVTPP--L---FMLSA 87 (234)
T ss_pred eeEEeCceEEEEc------CCCceEEEEEEeCCCCc---EEEEEEeccCccccccCCccccCCEEEcCC--e---EEECC
Confidence 5899999844431 67788999999998765 666662 1 13433331 1 25779
Q ss_pred CCeEEEEEeeeecC-C-------------CCCCC-CCCceEEEeecCCcEEEE
Q psy8027 81 SLSVEASLPMATTG-A-------------VQRME-PLTNLQVAVKNNLDVFYF 118 (250)
Q Consensus 81 gqs~~~~lpl~~~g-~-------------~~~~~-p~~~LQVAIKnN~dVfYF 118 (250)
|++....|-....+ + +.+.+ ..+.||+|+++.+.+||=
T Consensus 88 ~~~~~lRI~~~~~~LP~DRESlf~lnv~~IPp~~~~~n~l~iair~riKlFYR 140 (234)
T PRK15192 88 RQENSMRVVYTGAPLPADRESLFTLSIAAIPSGKPEANRVQMAFRSALKLLYR 140 (234)
T ss_pred CCceEEEEEECCCCCCCcceEEEEEEEEecCCCCCCCcEEEEEEEeeeeEEEc
Confidence 98777765543211 1 11111 135699999999988884
No 15
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=71.26 E-value=25 Score=31.62 Aligned_cols=94 Identities=14% Similarity=0.096 Sum_probs=56.5
Q ss_pred CCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----cc----CccccccCCCCCCCccCCCCCe
Q psy8027 13 EKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NK----NSFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 13 ~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----Nk----NsFGl~P~~~l~~p~~l~pgqs 83 (250)
....|+.+.|+=.--+ ++.-...+++.|.+..| ..+|- +. -+|=++|. + ..|.||++
T Consensus 19 ~a~A~v~l~~TRvIy~------~~~~~~si~i~N~~~~p---~LvQswv~~~~~~~~~~pFivtPP--l---frl~p~~~ 84 (229)
T PRK15211 19 QAMAAFVLNGTRFIYD------EGRKNISFEVTNQADQT---YGGQVWIDNTTQGSSTVYMVPAPP--F---FKVRPKEK 84 (229)
T ss_pred HheEEEEECceEEEEc------CCCceEEEEEEeCCCCc---EEEEEEEecCCCCCccCCEEEcCC--e---EEECCCCc
Confidence 3456899999844431 67778889999988765 34443 11 13444442 2 25779988
Q ss_pred EEEEEeeeecC-CCC-------------CC---CCCCceEEEeecCCcEEEEEe
Q psy8027 84 VEASLPMATTG-AVQ-------------RM---EPLTNLQVAVKNNLDVFYFAC 120 (250)
Q Consensus 84 ~~~~lpl~~~g-~~~-------------~~---~p~~~LQVAIKnN~dVfYF~~ 120 (250)
..+.|-....+ +-+ +. +..+.||+||++.+.+||==.
T Consensus 85 q~lRI~~~~~~LP~DRESlf~lnv~~IP~~~~~~~~n~l~iair~~iKLfyRP~ 138 (229)
T PRK15211 85 QIIRIMKTDSALPKDRESLFWLNVQEIPPKPKASEGNVLAVALNTQVKLIYRPK 138 (229)
T ss_pred eEEEEEECCCCCCCCceEEEEEEEEEcCCCCCccccceEEEEEEeeeeeEEcch
Confidence 87776543211 111 11 112469999999998888533
No 16
>KOG1078|consensus
Probab=71.16 E-value=20 Score=38.01 Aligned_cols=192 Identities=16% Similarity=0.126 Sum_probs=110.5
Q ss_pred CCeEEEEEEEEeCCC-CccchhheecccCccccccCCC---CC-CC-ccCCCCCeEEEEEeeee--cCC-CCCCCCCCce
Q psy8027 35 IGVIEMELTFTNKSM-APMANFAIQVNKNSFGLTPARP---LQ-VM-SPLLPSLSVEASLPMAT--TGA-VQRMEPLTNL 105 (250)
Q Consensus 35 ~g~i~l~l~itN~s~-~~~~~faiQfNkNsFGl~P~~~---l~-~p-~~l~pgqs~~~~lpl~~--~g~-~~~~~p~~~L 105 (250)
.+++.+....+|+-. +-+.+- +-++.|... +. +| ..|+-||..++-.-... +-+ .-...-...+
T Consensus 648 ~~~~V~qf~~~Ntl~d~~L~~v-------~vv~~~~~~~evl~~i~~~slpy~qp~~~~tl~~~p~~~p~~v~~sf~~tl 720 (865)
T KOG1078|consen 648 KDHVVLQFDCTNTLNDQLLENV-------SVVLTPTGGEEVLEKVPTMSLPYDQPGSAFTLVEFPKDDPWAIAEGFGNTL 720 (865)
T ss_pred ccceEEEEeccCcchHHHHhhh-------eeeecCCCCceeeeeccccCCCCCCCcceEEEEEcCCCCchhhhccceeeE
Confidence 567778888888654 333222 444555433 11 11 13444443333222211 112 1122334567
Q ss_pred EEEeec---CCcEEE---EEecccceEEeccC----CcccHHHHHhhhccCCCCccCccceeeeeeeEEecccccCHHHH
Q psy8027 106 QVAVKN---NLDVFY---FACIVPINVYFMED----GEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNIALGADAI 175 (250)
Q Consensus 106 QVAIKn---N~dVfY---F~~~ipl~vlf~e~----G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~~~~~d~i 175 (250)
++-+|- |.+.=- |.+.+-|+=+..-. .+.-+..|-..|.+++.+.| -.|.++...-=.+++
T Consensus 721 kFtvkdcdp~TgepdedGyeDEY~LEdlevtv~D~iqkv~k~NF~aawde~~~e~e---------etF~Ls~~~tl~eAv 791 (865)
T KOG1078|consen 721 KFTVKDCDPNTGEPDDEGYEDEYVLEDLEVTVGDFVQKVRKSNFPAAWDELGFEAE---------ETFNLSTVKSIQEAV 791 (865)
T ss_pred EEEEEecCCCCCCCCccCcccceeeeceeeehhhhhhHhhcccchhhHHhcCcchh---------eeeeccccchHHHHH
Confidence 777775 444333 56666666666333 34567889999999997766 456555432112344
Q ss_pred HHHHhhCCeEEEeecc----cCCcceEEEEEEeeCCcEEEEEEeeeCCCC--ceEEEEeeCCcchhHHHHHHH
Q psy8027 176 VSRMKQNNVFTIAKRN----VEGKDMLYQSLRLTNGIWVLNELKIQPGDP--SVTLSLKSRSSDVAQDIFHAY 242 (250)
Q Consensus 176 ~~kL~~~Nif~VAkr~----~~gq~~lY~S~k~~n~i~~L~El~~~~~~~--~~~~~lKs~~~~~~pli~~~~ 242 (250)
.+-+.-.|.--+-+.. ..+...+|.+..+-+|--+|+-.++--..+ ++++.+||.+.....+|..++
T Consensus 792 ~~Ii~~LgMqpcE~sd~vPenknsHtl~LsG~frgG~~vlvr~~ma~s~~~vtm~Vtvrs~e~~~vd~Iva~v 864 (865)
T KOG1078|consen 792 KKIIDLLGMQPCERTEKVPENKNSHTLLLSGVFRGGYKVLVRAKMALSSGGITMKVTVRSEDELVVDLIVALV 864 (865)
T ss_pred HHHHHHhCCccccccccCCCCCCceEEEEeeeeeCCceEEEeeeeeecCCCcEEEEEEecCCccHHHHHHHhc
Confidence 4444444544444332 134678999999999998998887765444 558999999998877776553
No 17
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=70.97 E-value=25 Score=31.42 Aligned_cols=95 Identities=14% Similarity=0.119 Sum_probs=56.6
Q ss_pred CCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----c-----cCccccccCCCCCCCccCCCCCe
Q psy8027 14 KGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----N-----KNSFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 14 ~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----N-----kNsFGl~P~~~l~~p~~l~pgqs 83 (250)
..-|+.|+|+=.--+ ++.-...+++.|.+..| +.+|- + +..|=++|. + ..|.||++
T Consensus 17 a~A~i~l~~TRvI~~------~~~~~~si~i~N~~~~p---~LvQsWv~~~~~~~~~~~pFivtPP--l---~rl~p~~~ 82 (226)
T PRK15295 17 AHASIVVGGTRLVFD------GNNDESSINVENKDSKA---NLVQSWLSVVDPQVTNKQAFIITPP--L---FRLDAGQK 82 (226)
T ss_pred hcccEEeCceEEEEe------CCCceeEEEEEeCCCCc---EEEEEEEeCCCCCCCCCCCEEEcCC--e---EEECCCCc
Confidence 346899999854431 67778889999988765 44552 1 112333331 1 25779988
Q ss_pred EEEEEeeeecC-CC-------------CC---CCCCCceEEEeecCCcEEEEEecc
Q psy8027 84 VEASLPMATTG-AV-------------QR---MEPLTNLQVAVKNNLDVFYFACIV 122 (250)
Q Consensus 84 ~~~~lpl~~~g-~~-------------~~---~~p~~~LQVAIKnN~dVfYF~~~i 122 (250)
..+.|-..... +. .+ .+..+.||+|+++.+.+||==..+
T Consensus 83 q~lRI~~~~~~LP~DrEslf~lnv~~IP~~~~~~~~n~l~iair~rIKLFyRP~~L 138 (226)
T PRK15295 83 NSIRVIRSGAPLPADRESMYWLNIKGIPSIDDNASANRVEISINTQIKLIYRPPAL 138 (226)
T ss_pred eEEEEEECCCCCCCCceEEEEEEEEEcCCCCCcCccceEEEEeeeeeeEEEchhhc
Confidence 87776432111 11 11 011246999999999999865544
No 18
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=70.46 E-value=24 Score=31.83 Aligned_cols=92 Identities=20% Similarity=0.209 Sum_probs=57.6
Q ss_pred CCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheeccc---------------CccccccCCCCCCCcc
Q psy8027 13 EKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNK---------------NSFGLTPARPLQVMSP 77 (250)
Q Consensus 13 ~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNk---------------NsFGl~P~~~l~~p~~ 77 (250)
...-|+.+.++=..-+ +++-...+++.|.+..| +.+|-.. .+|=++|. + ..
T Consensus 23 ~a~A~v~l~~TRvIy~------~~~~~~sv~l~N~~~~p---~LvQswv~~~~~~~~~~~~~~~~pFivtPP--l---fr 88 (236)
T PRK11385 23 ELQAGVVVGGTRFIFP------ADRESISILLTNTSQES---WLINSKINRPTRWAGGEASTVPAPLLAAPP--L---IL 88 (236)
T ss_pred hheeeEEeCceEEEEc------CCCceEEEEEEeCCCCc---EEEEEEcccCccccCcccccccCCEEEcCC--e---EE
Confidence 3346899999855432 67788899999998865 5566522 12333331 1 26
Q ss_pred CCCCCeEEEEEeeeecC--CCC-------------CC-CCCCceEEEeecCCcEEEE
Q psy8027 78 LLPSLSVEASLPMATTG--AVQ-------------RM-EPLTNLQVAVKNNLDVFYF 118 (250)
Q Consensus 78 l~pgqs~~~~lpl~~~g--~~~-------------~~-~p~~~LQVAIKnN~dVfYF 118 (250)
|.||+.....|-....+ +-+ +. +..+.||+||++.+.+||=
T Consensus 89 l~p~~~q~lRIi~~~~~~LP~DRESlf~lnv~~IPp~~~~~n~L~iair~riKLFyR 145 (236)
T PRK11385 89 LKPGTTGTLRLLRTESDILPVDRETLFELSIASVPSGKVENQSVKVAMRSVFKLFWR 145 (236)
T ss_pred ECCCCceEEEEEECCCCCCCCCceEEEEEEEEecCCCcCCCceEEEEEEeeEEEEEc
Confidence 78999887776543211 111 11 1124599999999988883
No 19
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=70.22 E-value=6.4 Score=28.96 Aligned_cols=55 Identities=13% Similarity=0.195 Sum_probs=40.5
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeeec
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMATT 93 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~ 93 (250)
++.+.+.++++|.......++.+.|..+.-.. ....++ .|.||++..+.+.+...
T Consensus 18 g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~---~~~~i~-~L~~g~~~~v~~~~~~~ 72 (101)
T PF07705_consen 18 GEPVTITVTVKNNGTADAENVTVRLYLDGNSV---STVTIP-SLAPGESETVTFTWTPP 72 (101)
T ss_dssp TSEEEEEEEEEE-SSS-BEEEEEEEEETTEEE---EEEEES-EB-TTEEEEEEEEEE-S
T ss_pred CCEEEEEEEEEECCCCCCCCEEEEEEECCcee---ccEEEC-CcCCCcEEEEEEEEEeC
Confidence 68889999999999988999999987775554 223344 79999999998887654
No 20
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=68.48 E-value=23 Score=26.55 Aligned_cols=61 Identities=21% Similarity=0.414 Sum_probs=40.8
Q ss_pred ceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeeec
Q psy8027 17 GMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMATT 93 (250)
Q Consensus 17 GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~ 93 (250)
.+|+.+..... .| .|.|+|.|....+. .+++--|.++ -..+.. -.|.||++.+...++..+
T Consensus 8 ~~~v~~~~~~~-------~g--~l~l~l~N~g~~~~---~~~v~~~~y~--~~~~~~--~~v~ag~~~~~~w~l~~s 68 (89)
T PF05506_consen 8 APEVTARYDPA-------TG--NLRLTLSNPGSAAV---TFTVYDNAYG--GGGPWT--YTVAAGQTVSLTWPLAAS 68 (89)
T ss_pred CCEEEEEEECC-------CC--EEEEEEEeCCCCcE---EEEEEeCCcC--CCCCEE--EEECCCCEEEEEEeecCC
Confidence 68899998777 56 77888899876554 3344445553 122222 368899999999998433
No 21
>PF11614 FixG_C: IG-like fold at C-terminal of FixG, putative oxidoreductase; PDB: 2R39_A.
Probab=66.42 E-value=28 Score=27.24 Aligned_cols=71 Identities=21% Similarity=0.273 Sum_probs=39.4
Q ss_pred CCeE--EEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeeecCCC-CCCCCCCceEEEee
Q psy8027 35 IGVI--EMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMATTGAV-QRMEPLTNLQVAVK 110 (250)
Q Consensus 35 ~g~i--~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~~g~~-~~~~p~~~LQVAIK 110 (250)
+|.| .+.++++|++.++.. |.|.+.-.. |+....+-. +..|.||++.+..+-+...... .... ..+.+-|.
T Consensus 28 dg~I~N~Y~lkl~Nkt~~~~~-~~i~~~g~~-~~~l~~~~~-~i~v~~g~~~~~~v~v~~p~~~~~~~~--~~i~f~v~ 101 (118)
T PF11614_consen 28 DGSIRNQYTLKLTNKTNQPRT-YTISVEGLP-GAELQGPEN-TITVPPGETREVPVFVTAPPDALKSGS--TPITFTVT 101 (118)
T ss_dssp --SEEEEEEEEEEE-SSS-EE-EEEEEES-S-S-EE-ES---EEEE-TT-EEEEEEEEEE-GGG-SSSE--EEEEEEEE
T ss_pred CCeEEEEEEEEEEECCCCCEE-EEEEEecCC-CeEEECCCc-ceEECCCCEEEEEEEEEECHHHccCCC--eeEEEEEE
Confidence 6767 599999999999976 888877533 444422111 2368899999888887666542 2222 34666665
No 22
>PRK15218 fimbrial chaperone protein PegB; Provisional
Probab=63.23 E-value=38 Score=30.33 Aligned_cols=91 Identities=14% Similarity=0.239 Sum_probs=55.4
Q ss_pred CCcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC--------ccccccCCCCCCCccCC
Q psy8027 14 KGKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN--------SFGLTPARPLQVMSPLL 79 (250)
Q Consensus 14 ~~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN--------sFGl~P~~~l~~p~~l~ 79 (250)
...|+.+.|| ++-. ++.-...+++.|.+..| +.+|- +++ +|=++|. + .-|.
T Consensus 16 a~Agi~l~~TRvIy~-------~~~~~~si~i~N~~~~p---yLvQsWvd~~~~~~~~~~~~~pFivtPP--l---fRl~ 80 (226)
T PRK15218 16 AWSGIYIYGTRIIYP-------AQKKDITVQLMNDGKRS---SLIQAWIDNGDTSLPPEKLQVPFIMTPP--V---IRVA 80 (226)
T ss_pred heeeEEeCceEEEEc-------CCCcEEEEEEEcCCCCc---EEEEEEEeCCCCCCCcccccCCEEECCC--e---EEEC
Confidence 3468999998 4445 77788889999988765 44443 222 4544441 2 2577
Q ss_pred CCCeEEEEEeeeecC-C-------------CCCC----CCCCceEEEeecCCcEEEEE
Q psy8027 80 PSLSVEASLPMATTG-A-------------VQRM----EPLTNLQVAVKNNLDVFYFA 119 (250)
Q Consensus 80 pgqs~~~~lpl~~~g-~-------------~~~~----~p~~~LQVAIKnN~dVfYF~ 119 (250)
||+.....|-...++ + +.+. +..+.||+|+++.+.+||==
T Consensus 81 p~~~~~lRI~~~~~~LP~DRESlfwlnv~~IPp~~~~~~~~n~L~iairtrIKLfYRP 138 (226)
T PRK15218 81 ANSGQQLKIKKLANNLPGDRESLFYLNVLDIPPNSDENKDKNIIKFALQNRIKLIYRP 138 (226)
T ss_pred CCCceEEEEEECCCCCCcceeEEEEEEEEEcCCCCCCcCcCcEEEEEeeeEEEEEEcc
Confidence 888776666542110 0 1111 11246999999999888843
No 23
>PRK09918 putative fimbrial chaperone protein; Provisional
Probab=62.62 E-value=35 Score=30.42 Aligned_cols=93 Identities=12% Similarity=0.147 Sum_probs=55.4
Q ss_pred CCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC---ccccccCCCCCCCccCCCCCeEE
Q psy8027 14 KGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN---SFGLTPARPLQVMSPLLPSLSVE 85 (250)
Q Consensus 14 ~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN---sFGl~P~~~l~~p~~l~pgqs~~ 85 (250)
..-|+.+.++-..-+ +++-.-.++++|.+..|+- +|- +.+ .|=++|. + ..|.||++..
T Consensus 22 a~a~v~l~~tRvi~~------~~~~~~si~v~N~~~~p~l---vQ~wv~~~~~~~~~~fivtPP--l---~rl~pg~~q~ 87 (230)
T PRK09918 22 HAAGMVPETSVVIVE------ESDGEGSINVKNTDSNPIL---LYTTLVDLPEDKSKLLLVTPP--V---ARVEPGQSQQ 87 (230)
T ss_pred hEeeEEEccEEEEEE------CCCCeEEEEEEcCCCCcEE---EEEEEecCCCCCCCCEEEcCC--e---EEECCCCceE
Confidence 345899999866542 5667788888998876632 232 111 2333331 1 2678999887
Q ss_pred EEEeeeecCCCC-------------C-CCCCCceEEEeecCCcEEEEEe
Q psy8027 86 ASLPMATTGAVQ-------------R-MEPLTNLQVAVKNNLDVFYFAC 120 (250)
Q Consensus 86 ~~lpl~~~g~~~-------------~-~~p~~~LQVAIKnN~dVfYF~~ 120 (250)
+.+-.....+.+ + .+..+.||+|+++.+.+||==.
T Consensus 88 vRii~~~~lp~drEs~f~l~v~~IP~~~~~~~~l~ia~r~~iklfyRP~ 136 (230)
T PRK09918 88 VRFILKSGSPLNTEHLLRVSFEGVPPKPGGKNKVVMPIRQDLPVLIQPA 136 (230)
T ss_pred EEEEECCCCCCCeeEEEEEEEEEcCCCCCCCCEEEEEEEeEEEEEEeCC
Confidence 776543211111 1 1123469999999998888433
No 24
>PRK15274 putative periplasmic fimbrial chaperone protein SteC; Provisional
Probab=58.91 E-value=47 Score=30.47 Aligned_cols=93 Identities=17% Similarity=0.227 Sum_probs=56.6
Q ss_pred CcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCC-ccchhheec-----c--c--CccccccCCCCCCCccCCCCCe
Q psy8027 15 GKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMA-PMANFAIQV-----N--K--NSFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 15 ~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~-~~~~faiQf-----N--k--NsFGl~P~~~l~~p~~l~pgqs 83 (250)
.-|+.+.|| ++-. +|.-...++++|.+.. | +.+|- + + .+|=+.|. + .-|.||+.
T Consensus 25 ~Agi~l~~TRvIy~-------e~~~~~sv~v~N~~~~~p---~LVQsWvdd~~~~~~~~pFivtPP--L---fRlep~~~ 89 (257)
T PRK15274 25 HSAIVPDRTRVIFN-------GNENSITVTLKNGNATLP---YLAQAWLEDDKFAKDTRYFTALPP--L---QRIEPKSD 89 (257)
T ss_pred eeeEEeCceEEEEe-------CCCceEEEEEEeCCCCCc---EEEEEEccCCCCCcccCCEEEcCC--e---EEECCCCc
Confidence 458889888 4445 6777889999998754 3 23332 1 1 24555552 2 25779887
Q ss_pred EEEEEeeeec-CC---------------CCC-CCCCCceEEEeecCCcEEEEEecc
Q psy8027 84 VEASLPMATT-GA---------------VQR-MEPLTNLQVAVKNNLDVFYFACIV 122 (250)
Q Consensus 84 ~~~~lpl~~~-g~---------------~~~-~~p~~~LQVAIKnN~dVfYF~~~i 122 (250)
..+.|-...+ +. +.+ .+..+.||+||++-+.+||==..+
T Consensus 90 q~lRI~~~~~~~~LP~DRESlFwlNv~eIPp~~~~~n~L~iairtrIKLFYRP~~L 145 (257)
T PRK15274 90 GQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAV 145 (257)
T ss_pred eEEEEEECCCCCCCCCceeEEEEEEEEEcCCCCCcCceEEEEeeeeeeeEEccccc
Confidence 7777653321 10 111 111356999999999999954433
No 25
>PRK15249 fimbrial chaperone protein StbB; Provisional
Probab=55.01 E-value=37 Score=30.81 Aligned_cols=98 Identities=15% Similarity=0.236 Sum_probs=56.9
Q ss_pred eeecCCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC---------ccccccCCCCCC
Q psy8027 9 WLPAEKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN---------SFGLTPARPLQV 74 (250)
Q Consensus 9 ~l~a~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN---------sFGl~P~~~l~~ 74 (250)
|.+....-|+.|.++=..-+ +++-...+++.|.+..| +.+|- +.+ +|=..|. +
T Consensus 21 ~~~~~a~A~l~l~~TRviy~------~~~~~~sl~l~N~~~~p---~LvQsWv~~~~~~~~p~~~~~~pFivtPP--l-- 87 (253)
T PRK15249 21 LPATASWASVTILGSRIIYP------STASSVDVQLKNNDAIP---YIVQTWFDDGDMNTSPENSSAMPFIATPP--V-- 87 (253)
T ss_pred hhhHhheeEEEeCceEEEEe------CCCcceeEEEEcCCCCc---EEEEEEEeCCCCCCCccccccCcEEEcCC--e--
Confidence 34444566899999854431 66777888999987654 44453 111 2322221 1
Q ss_pred CccCCCCCeEEEEEeeeecCC--C-------------CCCC-----CCCceEEEeecCCcEEEEEe
Q psy8027 75 MSPLLPSLSVEASLPMATTGA--V-------------QRME-----PLTNLQVAVKNNLDVFYFAC 120 (250)
Q Consensus 75 p~~l~pgqs~~~~lpl~~~g~--~-------------~~~~-----p~~~LQVAIKnN~dVfYF~~ 120 (250)
..|.||+...+.|-...++. . .+.+ ..+.||+|+++.+.+||==.
T Consensus 88 -frl~p~~~q~lRI~~~~~~~lP~DRESlf~lnv~eIP~~~~~~~~~~n~l~ialr~~IKLFyRP~ 152 (253)
T PRK15249 88 -FRIQPKAGQVVRVIYNNTKKLPQDRESVFWFNVLQVPPTNIGSDSGQNKMLVMLRSRIKLFYRPD 152 (253)
T ss_pred -EEecCCCceEEEEEEcCCCCCCCCceEEEEEEeeecCCCCcccccccceEEEEeeeEEEEEEccc
Confidence 25778887777655431111 1 1110 11359999999999998543
No 26
>PRK15253 putative fimbrial assembly chaperone protein StcB; Provisional
Probab=54.59 E-value=64 Score=29.27 Aligned_cols=90 Identities=16% Similarity=0.215 Sum_probs=54.4
Q ss_pred CcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec---c--c--------CccccccCCCCCCCccCCC
Q psy8027 15 GKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV---N--K--------NSFGLTPARPLQVMSPLLP 80 (250)
Q Consensus 15 ~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf---N--k--------NsFGl~P~~~l~~p~~l~p 80 (250)
..|+.+.|| ++.. ++.-...+++.|.+..| +.+|- + . -+|=++|. + ..|.|
T Consensus 32 ~Agv~l~~TRvIy~-------~~~k~~sv~i~N~~~~p---yLvQsWvd~~~~~~~~~~~~~pFivtPP--l---fRl~p 96 (242)
T PRK15253 32 HAGIVIYGTRVIYP-------AEKKEVVVQLVNQGEQA---SLVQSWIDDGNTSLPPEKIQVPFMLTPP--V---ARVAA 96 (242)
T ss_pred eeeEEeCceEEEEe-------CCCceEEEEEEcCCCCc---EEEEEEEECCCCCCCcccccCCEEECCC--e---EEECC
Confidence 348999988 4445 67778888899988755 44554 2 1 13544441 2 25778
Q ss_pred CCeEEEEEeeeec-CC-------------CCC----CCCCCceEEEeecCCcEEEEE
Q psy8027 81 SLSVEASLPMATT-GA-------------VQR----MEPLTNLQVAVKNNLDVFYFA 119 (250)
Q Consensus 81 gqs~~~~lpl~~~-g~-------------~~~----~~p~~~LQVAIKnN~dVfYF~ 119 (250)
|+.....|-...+ -+ +.+ .+..+.||+|+++.+.+||==
T Consensus 97 ~~~~~lRI~~~~~~LP~DRESlfwlnv~~IPp~~~~~~~~n~l~iairtriKLFYRP 153 (242)
T PRK15253 97 ESGQQIKIKKMPNSLPDNKESLFYLNVLDIPPNSQENAGKNVLKFAMQNRIKLIWRP 153 (242)
T ss_pred CCceEEEEEECCCCCCcceeEEEEEEEEEcCCCCCCcCcCcEEEEEeeeEEEEEEcc
Confidence 8877776642211 00 111 011246999999999888853
No 27
>PRK15224 pili assembly chaperone protein SafB; Provisional
Probab=53.90 E-value=59 Score=29.42 Aligned_cols=92 Identities=16% Similarity=0.198 Sum_probs=53.6
Q ss_pred CCCcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC---ccccccCCCCCCCccCCCCCe
Q psy8027 13 EKGKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN---SFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 13 ~~~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN---sFGl~P~~~l~~p~~l~pgqs 83 (250)
...-|+.|.|| ++.. ++.-...+++.|.+..| +.||- +.+ +|=++| |+ ..|.||+.
T Consensus 25 ~a~agv~l~~TRvIy~-------~~~k~~sl~v~N~~~~p---yLvQsWvd~~~~~~~~pFivtP--Pl---fRlep~~~ 89 (237)
T PRK15224 25 TKSFSVKLGATRVIYH-------AGTAGATLSVSNPQNYP---ILVQSSVKAADKSSPAPFLVMP--PL---FRLEANQQ 89 (237)
T ss_pred ceeEEEEeCceEEEEe-------CCCcEEEEEEEcCCCCc---EEEEEEEeCCCCCccCCEEECC--Ce---EEECCCCc
Confidence 44458888888 4444 67777888999998765 55554 222 344433 12 25778887
Q ss_pred EEEEEeeeec-CC-------------CCCCCC-----C----CceEEEeecCCcEEEEE
Q psy8027 84 VEASLPMATT-GA-------------VQRMEP-----L----TNLQVAVKNNLDVFYFA 119 (250)
Q Consensus 84 ~~~~lpl~~~-g~-------------~~~~~p-----~----~~LQVAIKnN~dVfYF~ 119 (250)
....|-.... -| +.+.+. . ..||+|+++-+.+||==
T Consensus 90 ~~lRI~~~~~~LP~DRESlFwlnv~~IPp~~~~~~~~~~~~~~~LqiairtrIKLFYRP 148 (237)
T PRK15224 90 SQLRIVRTGGDMPTDRETLQWVCIKAVPPENEPSDTQAKGATLDLNLSINVCDKLIFRP 148 (237)
T ss_pred eEEEEEECCCCCCCceeEEEEEEEEEcCCCCcccccccccccceEEEEeheeeeEEEch
Confidence 7666543211 00 111000 0 13899999988888843
No 28
>PRK15254 fimbrial chaperone protein StdC; Provisional
Probab=53.90 E-value=70 Score=28.96 Aligned_cols=97 Identities=14% Similarity=0.141 Sum_probs=55.1
Q ss_pred CCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCC-ccc-hhhee-ccc---CccccccCCCCCCCccCCCCCeEEE
Q psy8027 13 EKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMA-PMA-NFAIQ-VNK---NSFGLTPARPLQVMSPLLPSLSVEA 86 (250)
Q Consensus 13 ~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~-~~~-~faiQ-fNk---NsFGl~P~~~l~~p~~l~pgqs~~~ 86 (250)
...-|+.+.++=..-+ ++.-.-.+++.|.+.. |.- .--++ -+. .+|=++|. + ..|.||+...+
T Consensus 13 ~a~A~v~l~~TRvIy~------~~~~~~sv~v~N~~~~~p~LvQsWv~d~~~~~~~pFivtPP--l---frl~p~~~~~l 81 (239)
T PRK15254 13 YSPAAVNVDRTRIIMD------APQKTVAITLNNDDKTTPFLAQSWVTDADGVRTDALMALPP--L---QRIDAGQKSQV 81 (239)
T ss_pred hceEeEEECceEEEEe------CCCceEEEEEEeCCCCCcEEEEEEEecCCCCCcCCEEEcCC--e---EEECCCCceEE
Confidence 3456899999844431 6777888999998753 321 11111 111 24544442 2 25789998877
Q ss_pred EEeeee---cC-C-------------CCC-CCCCCceEEEeecCCcEEEEEe
Q psy8027 87 SLPMAT---TG-A-------------VQR-MEPLTNLQVAVKNNLDVFYFAC 120 (250)
Q Consensus 87 ~lpl~~---~g-~-------------~~~-~~p~~~LQVAIKnN~dVfYF~~ 120 (250)
.|-... ++ + +.+ .+..+.||+|+++.+.+||==.
T Consensus 82 RI~~~~~~~~~lP~DRESlf~lnv~~IP~~~~~~n~L~iair~~iKLFyRP~ 133 (239)
T PRK15254 82 RITQVRGLTDKLPQDRETLFWFNVRGVPPKPEDDNVLQLAMQSQLKLFYRPK 133 (239)
T ss_pred EEEEcccCCCCCCCCceEEEEEEEEEcCCCCCCCceEEEEEEeEEeEEEccc
Confidence 764321 00 0 111 1123569999999999988533
No 29
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=52.60 E-value=78 Score=28.47 Aligned_cols=92 Identities=15% Similarity=0.168 Sum_probs=55.2
Q ss_pred CcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC--------ccccccCCCCCCCccCCCC
Q psy8027 15 GKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN--------SFGLTPARPLQVMSPLLPS 81 (250)
Q Consensus 15 ~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN--------sFGl~P~~~l~~p~~l~pg 81 (250)
.-|+.|.++-..-+ +++-.-.+++.|.+..| +.+|- +.+ +|=++|. + ..|.||
T Consensus 9 ~A~v~l~~TRvI~~------~~~~~~sv~l~N~~~~p---~LvQsWvd~~~~~~~p~~~~~pFivtPP--l---frl~~~ 74 (233)
T PRK15246 9 QAAVNIDRTRIIFA------SDDVAQSLTLSNDNTTP---MLLQVWTDAGNIDASPDNSKTPLVALPP--V---FKMQPG 74 (233)
T ss_pred EEEEEECceEEEEc------CCCceEEEEEEeCCCCc---EEEEEEEeCCCCccCcccccCcEEECCc--c---eEECCC
Confidence 35799999865542 67778888999998765 44554 211 3444441 1 257788
Q ss_pred CeEEEEEeeeecCCC---------------CCCC-----CCCceEEEeecCCcEEEEEe
Q psy8027 82 LSVEASLPMATTGAV---------------QRME-----PLTNLQVAVKNNLDVFYFAC 120 (250)
Q Consensus 82 qs~~~~lpl~~~g~~---------------~~~~-----p~~~LQVAIKnN~dVfYF~~ 120 (250)
+...+.|-....+.. .+.+ ....||+||++.+.+||==.
T Consensus 75 ~~~~lRI~~~~~~~LP~DRESlf~lnv~~IP~~~~~~~~~~~~l~iair~rIKlFyRP~ 133 (233)
T PRK15246 75 ELRTLRLLLSSRQQLATDRESLFWLNIYQIPPVTQDIKNHPRKLVLPLRLRLKILIRPT 133 (233)
T ss_pred CceEEEEEECCCCCCCCCceEEEEEEEEEcCCCCcccccccceEEEEeeeEEEEEECCc
Confidence 877776554311111 1111 01359999999888888544
No 30
>PF09478 CBM49: Carbohydrate binding domain CBM49; InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=51.14 E-value=27 Score=25.92 Aligned_cols=47 Identities=21% Similarity=0.427 Sum_probs=34.3
Q ss_pred EEEEEEEeCCCCccchhheecc---cCccccccC--CCCCCCc---cCCCCCeEE
Q psy8027 39 EMELTFTNKSMAPMANFAIQVN---KNSFGLTPA--RPLQVMS---PLLPSLSVE 85 (250)
Q Consensus 39 ~l~l~itN~s~~~~~~faiQfN---kNsFGl~P~--~~l~~p~---~l~pgqs~~ 85 (250)
.++.+|+|.+..++.++-|... .+..|+.-. ....+|. +|.|||+..
T Consensus 20 qy~v~I~N~~~~~I~~~~i~~~~l~~~iW~l~~~~~~~y~lPs~~~~i~pg~s~~ 74 (80)
T PF09478_consen 20 QYDVTITNNGSKPIKSLKISIDNLYGSIWGLDKVSGNTYTLPSYQPTIKPGQSFT 74 (80)
T ss_pred EEEEEEEECCCCeEEEEEEEECccchhheeEEeccCCEEECCccccccCCCCEEE
Confidence 5899999999999999988877 345566552 1233553 788998765
No 31
>PF07718 Coatamer_beta_C: Coatomer beta C-terminal region; InterPro: IPR011710 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C-terminal domain of the beta subunit from coatomer proteins (Beta-coat proteins). The C-terminal domain probably adapts the function of the N-terminal IPR002553 from INTERPRO domain. Coatomer protein complex I (COPI)-coated vesicles are involved in transport between the endoplasmic reticulum and the Golgi but also participate in transport from early to late endosomes within the endocytic pathway []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat
Probab=50.87 E-value=69 Score=27.00 Aligned_cols=66 Identities=11% Similarity=0.044 Sum_probs=44.8
Q ss_pred eEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeee
Q psy8027 18 MEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMAT 92 (250)
Q Consensus 18 LeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~ 92 (250)
+.++|...-. .=.|.||+.+.|++...+.+..+.|-. .=.|....-++ +..|.|++++..+..+..
T Consensus 58 vYaEA~v~v~-------q~DIvLDvllvNqT~~tLqNl~vElat-~gdLklve~p~-~~tL~P~~~~~i~~~iKV 123 (140)
T PF07718_consen 58 VYAEAYVTVH-------QYDIVLDVLLVNQTNETLQNLTVELAT-LGDLKLVERPQ-PITLAPHGFARIKATIKV 123 (140)
T ss_pred eEEEEEEEEE-------eeeEEEEEEEEeCChhhhhcEEEEEEe-cCCcEEccCCC-ceeeCCCcEEEEEEEEEE
Confidence 4556666554 567999999999999999999998865 22344433222 336778777766666544
No 32
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=50.17 E-value=1.1e+02 Score=22.95 Aligned_cols=51 Identities=27% Similarity=0.368 Sum_probs=36.6
Q ss_pred CCeEEEEEEEEeCCCCccchhheeccc---CccccccCCCCCCCccCCCCCeEEEEEeee
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNK---NSFGLTPARPLQVMSPLLPSLSVEASLPMA 91 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNk---NsFGl~P~~~l~~p~~l~pgqs~~~~lpl~ 91 (250)
+......++++|.+..| ..|.+...+ ..|-+.|. +..|.||++.++.+-+.
T Consensus 19 g~~~~~~v~l~N~s~~p-~~f~v~~~~~~~~~~~v~~~-----~g~l~PG~~~~~~V~~~ 72 (102)
T PF14874_consen 19 GQTYSRTVTLTNTSSIP-ARFRVRQPESLSSFFSVEPP-----SGFLAPGESVELEVTFS 72 (102)
T ss_pred CCEEEEEEEEEECCCCC-EEEEEEeCCcCCCCEEEECC-----CCEECCCCEEEEEEEEE
Confidence 45677999999999988 456666543 34444442 34799999999888876
No 33
>PF05117 DUF695: Family of unknown function (DUF695) ; InterPro: IPR016097 This entry is found at the N terminus of a number of proteobacterial proteins of unknown function.
Probab=49.14 E-value=1e+02 Score=24.75 Aligned_cols=52 Identities=15% Similarity=0.236 Sum_probs=41.6
Q ss_pred ccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEecccccCHHHHHHHHhhCC-eEEEeecccCCcceEEEEEEeeC
Q psy8027 129 MEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNIALGADAIVSRMKQNN-VFTIAKRNVEGKDMLYQSLRLTN 206 (250)
Q Consensus 129 ~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~~~~~d~i~~kL~~~N-if~VAkr~~~gq~~lY~S~k~~n 206 (250)
.++|--+.+++-.++.- | |.+.++|++.+ ..+|++-..+|...+||.++-..
T Consensus 48 ~e~GlP~~ee~~~L~~i-----E---------------------d~i~~~l~~~~~~i~vG~~t~~g~r~~~fY~~d~~ 100 (136)
T PF05117_consen 48 DENGLPSEEEYEELNDI-----E---------------------DAIIEALEADGNAIYVGRITGNGRREFYFYCKDPE 100 (136)
T ss_pred CCCCCCCHHHHHHHHHH-----H---------------------HHHHHHhhcCCcceEEEEEEECCEEEEEEEECChh
Confidence 67787788888666632 3 78889998887 88889888999999999988763
No 34
>KOG1058|consensus
Probab=45.61 E-value=2.3e+02 Score=30.48 Aligned_cols=179 Identities=16% Similarity=0.240 Sum_probs=89.4
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEEeeee----cC----C-CCCC-CCCCc
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASLPMAT----TG----A-VQRM-EPLTN 104 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~lpl~~----~g----~-~~~~-~p~~~ 104 (250)
.=.|.+|+.+.|++...+.+.-++|-. .=||.-..-.+ |-.|.|......+..+.. +| + +-.. .....
T Consensus 733 qyDIVLDvL~VNqT~~tLQNl~lelAT-lgdLKlve~p~-p~~Laph~f~~ikatvKVsStenGvIfGnIvY~~~~~a~~ 810 (948)
T KOG1058|consen 733 QYDIVLDVLLVNQTKETLQNLSLELAT-LGDLKLVERPT-PFSLAPHDFVNIKATVKVSSTENGVIFGNIVYDTSEAAND 810 (948)
T ss_pred eeeEEEEEEEecCChHHHhhheeeeee-ccCceeeecCC-CcccCcccceeEEEEEEEeeccCcEEEEEEEecCcccccc
Confidence 456899999999999998888888754 33333321111 334556554444333322 22 2 1111 00011
Q ss_pred eEEEeecCCcEEEEEecccceEE-eccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEecccccCHHHHH-HHHhhC
Q psy8027 105 LQVAVKNNLDVFYFACIVPINVY-FMEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNIALGADAIV-SRMKQN 182 (250)
Q Consensus 105 LQVAIKnN~dVfYF~~~ipl~vl-f~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~~~~~d~i~-~kL~~~ 182 (250)
--+-+-|.+ +..++ +.+-+.++-+.|.++|+++.=+|. + +++....+-.... .-+++-
T Consensus 811 ~~~VvLndI---------hidImDYI~Pa~~~d~~FRtmW~efEWENK----v-------tv~t~~~~L~e~l~~i~kst 870 (948)
T KOG1058|consen 811 RNVVVLNDI---------HIDIMDYIKPAKCDDDEFRTMWAEFEWENK----V-------TVNTTIKTLREFLGHISKST 870 (948)
T ss_pred ceEEEeccc---------cccHHHhcCCCcCChHHHHHHHHHhhhcce----e-------eeccccccHHHHHHHHHhhc
Confidence 122222211 22211 356789999999999999855544 3 3322212222222 333555
Q ss_pred CeEEEeecccCCcceEEEE----EEeeCCcEEEEEEeeeCCC--Cce--EEEEeeCCcchh
Q psy8027 183 NVFTIAKRNVEGKDMLYQS----LRLTNGIWVLNELKIQPGD--PSV--TLSLKSRSSDVA 235 (250)
Q Consensus 183 Nif~VAkr~~~gq~~lY~S----~k~~n~i~~L~El~~~~~~--~~~--~~~lKs~~~~~~ 235 (250)
|.-........+.+--|.+ +|..=|.-.|+.|.+.... +.+ .+-+|++...++
T Consensus 871 NmkcLtpe~al~g~CgFlaaNlya~SifgedALaNvsieK~~~~~~v~g~iRIRsK~QGia 931 (948)
T KOG1058|consen 871 NMKCLTPEAALDGDCGFLAANLYAKSIFGEDALANVSIEKSSDGGPVSGHIRIRSKTQGIA 931 (948)
T ss_pred CCcccCchhhccCccchhhhhHHHHhhccchhheeeeeeccCCCCCceEEEEEEeccccee
Confidence 5433332222222222333 4555566678888776532 234 455666666553
No 35
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=45.29 E-value=1.2e+02 Score=27.23 Aligned_cols=95 Identities=17% Similarity=0.219 Sum_probs=54.6
Q ss_pred CCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCC-ccc-hhhee-cccC---ccccccCCCCCCCccCCCCCeEEE
Q psy8027 13 EKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMA-PMA-NFAIQ-VNKN---SFGLTPARPLQVMSPLLPSLSVEA 86 (250)
Q Consensus 13 ~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~-~~~-~faiQ-fNkN---sFGl~P~~~l~~p~~l~pgqs~~~ 86 (250)
....|+.|.++=..-+ ++.-.-.++++|.+.. |.- .--++ -+++ +|=++|. + ..|.||+....
T Consensus 24 ~~~Agi~l~~TRvIy~------~~~~~~sv~i~N~~~~~p~LvQsWv~~~~~~~~~pFivtPP--l---frl~~~~~~~l 92 (228)
T PRK15188 24 AQAGGIALGATRVIYP------QGSKQTSLPIINSSASNVFLIQSWVANADGSRSTDFIITPP--L---FVIQPKKENIL 92 (228)
T ss_pred hhcceEEECcEEEEEc------CCCceEEEEEEeCCCCccEEEEEEEecCCCCccCCEEEcCC--e---EEECCCCceEE
Confidence 4667899999855542 6777889999998854 311 11111 1222 3443331 2 25778887777
Q ss_pred EEeeeecCC--------------CCCCC----CCCceEEEeecCCcEEEE
Q psy8027 87 SLPMATTGA--------------VQRME----PLTNLQVAVKNNLDVFYF 118 (250)
Q Consensus 87 ~lpl~~~g~--------------~~~~~----p~~~LQVAIKnN~dVfYF 118 (250)
.|-....+- +.+.+ ..+.||+|+++.+.+||=
T Consensus 93 RI~~~~~~lP~DRESlf~lnv~~IP~~~~~~~~~n~l~ia~r~~IKLFyR 142 (228)
T PRK15188 93 RIMYVGPSLPTDRESVFYLNSKAIPSVDKNKLTGNSLQIATQSVIKLFIR 142 (228)
T ss_pred EEEECCCCCCCCceEEEEEEEEecCCCCccccccceEEEEEeeeEEEEEC
Confidence 655432110 11111 124699999999988884
No 36
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=43.77 E-value=21 Score=27.41 Aligned_cols=60 Identities=18% Similarity=0.269 Sum_probs=36.4
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccCc---cccccC--CCCCCCccCCCCCeEEEEEeeeecC
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKNS---FGLTPA--RPLQVMSPLLPSLSVEASLPMATTG 94 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkNs---FGl~P~--~~l~~p~~l~pgqs~~~~lpl~~~g 94 (250)
|..+.+.++|+|.+..++.+..+.+...+ =|+... ........|.||++.++.+.+.+..
T Consensus 14 G~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~~~~~~~~~~~~l~p~~~~~~~~~i~p~~ 78 (107)
T PF00927_consen 14 GQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTRDQFKKEKFEVTLKPGETKSVEVTITPSQ 78 (107)
T ss_dssp TSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEEEEEEEEEEEEEE-TTEEEEEEEEE-HHS
T ss_pred CCCEEEEEEEEeCCcCccccceeEEEEEEEEECCcccccEeEEEcceeeCCCCEEEEEEEEEcee
Confidence 67789999999999999777433332221 155421 1111224899999999999987653
No 37
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=40.81 E-value=66 Score=31.36 Aligned_cols=57 Identities=25% Similarity=0.362 Sum_probs=35.8
Q ss_pred CCeEEEEEEEEeCCCCccc--hhh---ee-cccC-ccccccCCC-------C--CCCccCCCCCeEEEEEeee
Q psy8027 35 IGVIEMELTFTNKSMAPMA--NFA---IQ-VNKN-SFGLTPARP-------L--QVMSPLLPSLSVEASLPMA 91 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~--~fa---iQ-fNkN-sFGl~P~~~-------l--~~p~~l~pgqs~~~~lpl~ 91 (250)
+-.+.|.|++||++..|+. +|. +- +|+| -+-+.|..| | +-++||.||||.++.+-..
T Consensus 281 GR~l~~~~~VTN~g~~~vrlgEF~TA~vRFlN~~~v~~~~~~yP~~lla~GL~v~d~~pI~PGETr~v~v~aq 353 (399)
T TIGR03079 281 GRALRVTMEITNNGDQVISIGEFTTAGIRFMNANGVRVLDPDYPRELLAEGLEVDDQSAIAPGETVEVKMEAK 353 (399)
T ss_pred CcEEEEEEEEEcCCCCceEEEeEeecceEeeCcccccccCCCChHHHhhccceeCCCCCcCCCcceEEEEEEe
Confidence 5678999999999988764 332 11 3553 333323221 1 1235899999999998653
No 38
>PRK15285 putative fimbrial chaperone protein StfD; Provisional
Probab=39.28 E-value=1.5e+02 Score=26.94 Aligned_cols=89 Identities=15% Similarity=0.191 Sum_probs=52.8
Q ss_pred CcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCC-ccchhheec-----ccC----ccccccCCCCCCCccCCCCCe
Q psy8027 15 GKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMA-PMANFAIQV-----NKN----SFGLTPARPLQVMSPLLPSLS 83 (250)
Q Consensus 15 ~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~-~~~~faiQf-----NkN----sFGl~P~~~l~~p~~l~pgqs 83 (250)
.-|+.+.|| ++-. ++.-...++++|.+.. | +.+|- +.+ +|=++|. + .-|.||+.
T Consensus 24 ~Agv~l~~TRVIy~-------~~~~~~sv~i~N~~~~~p---~LvQsWvd~~~~~~~~~pFiVtPP--l---fRl~p~~~ 88 (250)
T PRK15285 24 QAAIAPDRTRLVFR-------GEDKSISVDLKNANSKLP---YLAQSWVEDEKGVKITSPLIVVPP--V---QRIEPSAI 88 (250)
T ss_pred eEeEEeCccEEEEc-------CCCceEEEEEEeCCCCCc---EEEEEEeeCCCCCcccCCEEEcCC--e---EEECCCCc
Confidence 357888887 4444 6777888999998754 3 33332 211 3544442 2 25779887
Q ss_pred EEEEEeeeec-C--CC-------------CCC-CCCCceEEEeecCCcEEEE
Q psy8027 84 VEASLPMATT-G--AV-------------QRM-EPLTNLQVAVKNNLDVFYF 118 (250)
Q Consensus 84 ~~~~lpl~~~-g--~~-------------~~~-~p~~~LQVAIKnN~dVfYF 118 (250)
....|-.... + +- .+. +..+.||+||++.+.+||=
T Consensus 89 ~~lRI~~~~~~~~LP~DRESlfwlnv~~IPp~~~~~n~L~iairtrIKLfYR 140 (250)
T PRK15285 89 GQVKIQGMPALASLPQDRETLFYYNVREIPPQSDKPNTLQIALQTRIKVFYR 140 (250)
T ss_pred eEEEEEECCCCCCCCCCceEEEEEEEEEcCCCCCCCcEEEEEeeeeeeEEEC
Confidence 7766643210 1 11 111 1135699999999998884
No 39
>PRK09926 putative chaperone protein EcpD; Provisional
Probab=39.06 E-value=1.4e+02 Score=26.84 Aligned_cols=99 Identities=18% Similarity=0.222 Sum_probs=55.4
Q ss_pred ecCCCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec--ccCccccccCC---CCC-CC--ccCCCCC
Q psy8027 11 PAEKGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV--NKNSFGLTPAR---PLQ-VM--SPLLPSL 82 (250)
Q Consensus 11 ~a~~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf--NkNsFGl~P~~---~l~-~p--~~l~pgq 82 (250)
+....-|+.|+++=..-+ +++-.-.+++.|.+..| ..+|- ....=.+.|.+ ++- .| ..|.||+
T Consensus 20 ~~~a~A~i~l~~TRvI~~------~~~~~~sv~l~N~~~~p---~LvQ~Wvd~~~~~~~p~~~~~pfivtPPl~rl~p~~ 90 (246)
T PRK09926 20 SSSSIADIVISGTRIIYK------SDQKDVNVRLENKGNNP---LLVQSWLDTGDDNAEPGSIKVPFTATPPVSRIDPKR 90 (246)
T ss_pred hHhheeeEEeCceEEEEe------CCCceEEEEEEeCCCCc---EEEEEEecCCCCccCccccCCCEEEcCCeEEECCCC
Confidence 444567899999844431 67778888999988765 23332 11111122211 111 12 2577888
Q ss_pred eEEEEEeeeec--CCCCC------------CC-------CCCceEEEeecCCcEEEE
Q psy8027 83 SVEASLPMATT--GAVQR------------ME-------PLTNLQVAVKNNLDVFYF 118 (250)
Q Consensus 83 s~~~~lpl~~~--g~~~~------------~~-------p~~~LQVAIKnN~dVfYF 118 (250)
...+.|-.... -+.++ +. ..+.||+|+++.+.+||=
T Consensus 91 ~q~lRIi~~~~~~lP~DrESlf~lnv~eIP~~~~~~~~~~~n~l~iair~~IKLFyR 147 (246)
T PRK09926 91 GQTIKLMYTASTALPKDRESVFWFNVLEVPPKPDAEKVANQSLLQLAFRTRIKLFYR 147 (246)
T ss_pred ccEEEEEeCCCCCCCCCceEEEEEEeeecCCCCccccccccceEEEeeeeeEEEEEc
Confidence 77777654321 11121 11 113599999999999994
No 40
>COG1470 Predicted membrane protein [Function unknown]
Probab=38.90 E-value=1.1e+02 Score=30.76 Aligned_cols=73 Identities=16% Similarity=0.217 Sum_probs=51.3
Q ss_pred CCeEEEEEEEEeCCCCccchhheecccCccccccC-CCCCCCccCCCCCeEEEEEeeeecCCCCCCCCCCceEEEeec
Q psy8027 35 IGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPA-RPLQVMSPLLPSLSVEASLPMATTGAVQRMEPLTNLQVAVKN 111 (250)
Q Consensus 35 ~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~-~~l~~p~~l~pgqs~~~~lpl~~~g~~~~~~p~~~LQVAIKn 111 (250)
|....+.+.|.|....|+++.-+-+|- +-|..-. ++-.+| .|.||++.++++.+......-.++ ..+++-.|.
T Consensus 396 Gee~~i~i~I~NsGna~LtdIkl~v~~-PqgWei~Vd~~~I~-sL~pge~~tV~ltI~vP~~a~aGd--Y~i~i~~ks 469 (513)
T COG1470 396 GEEKTIRISIENSGNAPLTDIKLTVNG-PQGWEIEVDESTIP-SLEPGESKTVSLTITVPEDAGAGD--YRITITAKS 469 (513)
T ss_pred CccceEEEEEEecCCCccceeeEEecC-CccceEEECccccc-ccCCCCcceEEEEEEcCCCCCCCc--EEEEEEEee
Confidence 456788999999999999999999886 5555442 223456 799999999999986553333333 335555554
No 41
>PF11906 DUF3426: Protein of unknown function (DUF3426); InterPro: IPR021834 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 262 to 463 amino acids in length.
Probab=37.15 E-value=2e+02 Score=23.26 Aligned_cols=78 Identities=26% Similarity=0.230 Sum_probs=41.6
Q ss_pred CCcceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccC-c-------cccccCC----CCCCCccCCCC
Q psy8027 14 KGKGMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKN-S-------FGLTPAR----PLQVMSPLLPS 81 (250)
Q Consensus 14 ~~~GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkN-s-------FGl~P~~----~l~~p~~l~pg 81 (250)
....+.|++.-.|.. + .+++...++.++.|++..+..--.|++--- . --+.|.. .+.....++||
T Consensus 49 ~~~~l~i~~~~~~~~-~--~~~~~l~v~g~i~N~~~~~~~~P~l~l~L~D~~g~~l~~r~~~P~~yl~~~~~~~~~l~pg 125 (149)
T PF11906_consen 49 DIDALKIESSDLRPV-P--DGPGVLVVSGTIRNRADFPQALPALELSLLDAQGQPLARRVFTPADYLPPGLAAQAGLPPG 125 (149)
T ss_pred CcceEEEeeeeEEee-c--CCCCEEEEEEEEEeCCCCcccCceEEEEEECCCCCEEEEEEEChHHhcccccccccccCCC
Confidence 344566665444431 0 235666777777777765544444443211 1 1233411 11123479999
Q ss_pred CeEEEEEeeeecC
Q psy8027 82 LSVEASLPMATTG 94 (250)
Q Consensus 82 qs~~~~lpl~~~g 94 (250)
++...++.+...+
T Consensus 126 ~~~~~~~~~~~p~ 138 (149)
T PF11906_consen 126 ESVPFRLRLEDPP 138 (149)
T ss_pred CeEEEEEEeeCCC
Confidence 9999999875443
No 42
>KOG0037|consensus
Probab=32.83 E-value=17 Score=32.76 Aligned_cols=49 Identities=16% Similarity=0.292 Sum_probs=37.8
Q ss_pred ccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEeccc-ccCHHHHHHHHhhCCe
Q psy8027 129 MEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNI-ALGADAIVSRMKQNNV 184 (250)
Q Consensus 129 ~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~-~~~~d~i~~kL~~~Ni 184 (250)
..+|+|+..+|.++|+.|.+=. +|| .+++.++. .++...+.+.|.+-+.
T Consensus 106 ~~~G~i~f~EF~~Lw~~i~~Wr----~vF---~~~D~D~SG~I~~sEL~~Al~~~Gy 155 (221)
T KOG0037|consen 106 DNSGTIGFKEFKALWKYINQWR----NVF---RTYDRDRSGTIDSSELRQALTQLGY 155 (221)
T ss_pred CCCCccCHHHHHHHHHHHHHHH----HHH---HhcccCCCCcccHHHHHHHHHHcCc
Confidence 6789999999999999996532 345 56777766 4788888888888774
No 43
>cd09269 deoxyribose_mutarotase deoxyribose mutarotase_like. Salmonella enterica serovar Typhi DeoM (earlier named as DeoX) is a mutarotase with high specificity for deoxyribose. It is encoded by one of four genes beonging to the deoK operon. This operon has also been found in Escherichia coli where it is more common in pathogenic than in commensal strains and is associated with pathogenicity. It has been found on a pathogenicity island from a human blood isolate AL863 and confers the ability to use deoxyribose as a carbon source; deoxyribose is not fermented by non-pathogenic E.coli K-12. Proteins in this family are members of the aldose-1-epimerase superfamily. Aldose 1-epimerases, or mutarotases, are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechani
Probab=27.12 E-value=92 Score=28.92 Aligned_cols=41 Identities=17% Similarity=0.313 Sum_probs=29.9
Q ss_pred CCCCeEEEEEEEEeCCCCccchhheecccCccccccCCCCCCC
Q psy8027 33 PGIGVIEMELTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVM 75 (250)
Q Consensus 33 ~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p 75 (250)
.|.+.+.+++++||.+..||- +++-+=.| ||..|.+.+..|
T Consensus 115 ~g~~~l~i~~~VtN~g~~p~p-~~~~~H~n-fg~~~garl~~p 155 (293)
T cd09269 115 AGSALFDIGMDVTNLSAQPMP-LMYMCHMN-YAYVEGARIVQN 155 (293)
T ss_pred CCCCEEEEEEEEEECCCCCCh-hhEecccc-cCCCCCCEEEcc
Confidence 456899999999999998875 44444443 788887766644
No 44
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=26.83 E-value=1.2e+02 Score=28.72 Aligned_cols=56 Identities=21% Similarity=0.365 Sum_probs=38.5
Q ss_pred CCCceEEEeecCCcEEEEEecccceEEeccCCcccHHHHHhhhccCCCCccCccceeeeeeeEEecccccCHHHHHHHHh
Q psy8027 101 PLTNLQVAVKNNLDVFYFACIVPINVYFMEDGEMERKTFLTTWKEIPTQNEDGNAIFCCQVQFTLTNIALGADAIVSRMK 180 (250)
Q Consensus 101 p~~~LQVAIKnN~dVfYF~~~ipl~vlf~e~G~m~r~~Fl~~WksiP~~nE~~~~~~~~~~~~~i~~~~~~~d~i~~kL~ 180 (250)
-++.+.+=||...+..+|....|+.. .+... .....+.+.+.++|+
T Consensus 26 ~lNavVIDvKdd~G~i~y~s~~~~~~------------------~~ga~----------------~~~i~D~~~l~~~l~ 71 (316)
T PF13200_consen 26 ELNAVVIDVKDDDGNITYDSQVPLAR------------------EIGAV----------------KPYIKDLKALVKKLK 71 (316)
T ss_pred CCceEEEEEecCCceEEecCCCchhh------------------hcccc----------------cccccCHHHHHHHHH
Confidence 34668889999888888877666321 01111 011257899999999
Q ss_pred hCCeEEEeec
Q psy8027 181 QNNVFTIAKR 190 (250)
Q Consensus 181 ~~Nif~VAkr 190 (250)
.++||.|||=
T Consensus 72 e~gIY~IARI 81 (316)
T PF13200_consen 72 EHGIYPIARI 81 (316)
T ss_pred HCCCEEEEEE
Confidence 9999999965
No 45
>PF08563 P53_TAD: P53 transactivation motif; InterPro: IPR013872 The binding of this protein by regulatory proteins regulates p53 transcription activation. This entry is comprised of a single amphipathic alpha helix and contains a highly conserved motif [, ]. ; GO: 0005515 protein binding; PDB: 1YCQ_B 2Z5T_R 3DAB_B 3DAC_B 2Z5S_Q 2K8F_B 2L14_B 1YCR_B.
Probab=26.52 E-value=24 Score=21.30 Aligned_cols=18 Identities=22% Similarity=0.595 Sum_probs=12.4
Q ss_pred CCcccHHHHHhhhccCCC
Q psy8027 131 DGEMERKTFLTTWKEIPT 148 (250)
Q Consensus 131 ~G~m~r~~Fl~~WksiP~ 148 (250)
+-+|+++.|-.+|+.+++
T Consensus 7 ~~PLSQeTF~~LW~~l~~ 24 (25)
T PF08563_consen 7 ELPLSQETFSDLWNLLPE 24 (25)
T ss_dssp ----STCCHHHHHHTSST
T ss_pred CCCccHHHHHHHHHhccC
Confidence 346889999999998864
No 46
>PF11611 DUF4352: Domain of unknown function (DUF4352); InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=26.20 E-value=1.2e+02 Score=23.21 Aligned_cols=57 Identities=18% Similarity=0.240 Sum_probs=27.2
Q ss_pred CCCeEEEEEEEEeCCCCccch----hheec-ccCccccccCCC--C--CCCccCCCCCeEEEEEee
Q psy8027 34 GIGVIEMELTFTNKSMAPMAN----FAIQV-NKNSFGLTPARP--L--QVMSPLLPSLSVEASLPM 90 (250)
Q Consensus 34 ~~g~i~l~l~itN~s~~~~~~----faiQf-NkNsFGl~P~~~--l--~~p~~l~pgqs~~~~lpl 90 (250)
|+.-+.++++++|.+..++.- |.+.- +-+.+-...... . .+...|.||++.+..|-.
T Consensus 34 g~~fv~v~v~v~N~~~~~~~~~~~~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F 99 (123)
T PF11611_consen 34 GNKFVVVDVTVKNNGDEPLDFSPSDFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVF 99 (123)
T ss_dssp -SEEEEEEEEEEE-SSS-EEEEGGGEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEE
T ss_pred CCEEEEEEEEEEECCCCcEEecccceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEE
Confidence 345568999999999877653 33321 113333222211 1 134578899988777754
No 47
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=25.22 E-value=1.2e+02 Score=20.05 Aligned_cols=42 Identities=29% Similarity=0.407 Sum_probs=26.1
Q ss_pred EEEEeCCCCccchhheecccCccccccCCCCCCCccCCCCCeEEEEE
Q psy8027 42 LTFTNKSMAPMANFAIQVNKNSFGLTPARPLQVMSPLLPSLSVEASL 88 (250)
Q Consensus 42 l~itN~s~~~~~~faiQfNkNsFGl~P~~~l~~p~~l~pgqs~~~~l 88 (250)
.+|+|....|+.--.|+- +=|=.-+. .. ..+|.||++....+
T Consensus 2 F~~~N~g~~~L~I~~v~t---sCgCt~~~-~~-~~~i~PGes~~i~v 43 (45)
T PF07610_consen 2 FEFTNTGDSPLVITDVQT---SCGCTTAE-YS-KKPIAPGESGKIKV 43 (45)
T ss_pred EEEEECCCCcEEEEEeeE---ccCCEEee-CC-cceECCCCEEEEEE
Confidence 578898888887776663 22211111 11 24799999877665
No 48
>PF03173 CHB_HEX: Putative carbohydrate binding domain; InterPro: IPR004866 This domain represents the N-terminal domain in chitobiases and beta-hexosaminidases 3.2.1.52 from EC. Chitobiases degrade chitin, which forms the exoskeleton in insects and crustaceans, and which is one of the most abundant polysaccharides on earth []. Beta-hexosaminidases are composed of either a HexA/HexB heterodimer or a HexB homodimer, and can hydrolyse diverse substrates, including GM(2)-gangliosides; mutations in this enzyme are associated with Tay-Sachs disease []. HexB is structurally similar to chitobiase, consisting of a beta sandwich structure; this structure is similar to that found in the cellulose-binding domain of cellulase from Cellulomonas fimi (IPR001919 from INTERPRO), suggesting that it may function as a carbohydrate-binding domain.; GO: 0030246 carbohydrate binding; PDB: 1C7T_A 1QBA_A 1QBB_A 1C7S_A.
Probab=22.85 E-value=1.1e+02 Score=26.14 Aligned_cols=48 Identities=33% Similarity=0.467 Sum_probs=23.4
Q ss_pred EEEEEEEEeCCCCccch--hheecc-------------------cCccccccCCCCCCCccCCCCCeEEEEE
Q psy8027 38 IEMELTFTNKSMAPMAN--FAIQVN-------------------KNSFGLTPARPLQVMSPLLPSLSVEASL 88 (250)
Q Consensus 38 i~l~l~itN~s~~~~~~--faiQfN-------------------kNsFGl~P~~~l~~p~~l~pgqs~~~~l 88 (250)
-..+++++|....++.. .+|-|| =.-|-|.|.... ..|.||+|.++.+
T Consensus 32 ~~~~ltl~n~~~~~~~~~dW~IYf~~ir~i~~~~s~~f~i~hinGDl~kl~Pt~~F---~gl~~Ges~~I~~ 100 (164)
T PF03173_consen 32 FRAELTLTNPGDAPLPKSDWAIYFSSIRPILQVDSDQFKITHINGDLHKLTPTAGF---KGLAPGESLEIPF 100 (164)
T ss_dssp EEEEEEEEE-SS-B------EEEEE-SS-EEEESSTTEEEEE-STTEEEEEE-TT------B-TTEEEEEEE
T ss_pred eEEEEEEEcCCCccCCCCCeEEEEecceeeeccCCCCeEEEEEcCeEEEEeECCCC---CccCCCCEEEEEE
Confidence 45677777777666554 666655 344556665432 3688999665554
No 49
>PF04597 Ribophorin_I: Ribophorin I; InterPro: IPR007676 Ribophorin I is an essential subunit of oligosaccharyltransferase (OST), which is also known as dolichyl-diphosphooligosaccharide--protein glycosyltransferase, (2.4.1.119 from EC). OST catalyses the transfer of an oligosaccharide from dolichol pyrophosphate to selected asparagine residues of nascent polypeptides as they are translocated into the lumen of the rough endoplasmic reticulum. Ribophorin I and OST48 are thought to be responsible for OST catalytic activity []. Both yeast and mammalian proteins are glycosylated but the sites are not conserved. Glycosylation may contribute towards general solubility but is unlikely to be involved in a specific biochemical function []. Most family members are predicted to have a transmembrane helix at the C terminus of this region.; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0006486 protein glycosylation, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=21.62 E-value=2.6e+02 Score=27.41 Aligned_cols=99 Identities=13% Similarity=0.278 Sum_probs=57.1
Q ss_pred ceEEEEEEecCCCCCCCCCCeEEEEEEEEeCCCCccchhheecccCccc----------------cc--------cCCC-
Q psy8027 17 GMEIWGTFSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQVNKNSFG----------------LT--------PARP- 71 (250)
Q Consensus 17 GLeI~g~f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQfNkNsFG----------------l~--------P~~~- 71 (250)
-++++++++|. .+++++.|....|.+.+.+.+...-+. .. +.+.
T Consensus 10 ~idl~~~~vk~-----------~~~i~i~N~g~~p~~~y~~~l~~~~~~~ls~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 78 (432)
T PF04597_consen 10 TIDLSKSYVKE-----------TIEITIKNIGDEPVSEYYFALPNDEADHLSYVSAKDKDKKKKLKVSKEITEVNSGSEI 78 (432)
T ss_pred EEEccCcEEEE-----------EEEEEEEECCCCCceEEEEEECchhhccEEEEEEEECCCccccccccccccccCCCCc
Confidence 34555555553 677888888888877776666653321 11 1110
Q ss_pred ----CCCCccCCCCCeEEEEEeeeecCCCCCCCCCCceEEEeecCCcEEEEEecccceEEecc
Q psy8027 72 ----LQVMSPLLPSLSVEASLPMATTGAVQRMEPLTNLQVAVKNNLDVFYFACIVPINVYFME 130 (250)
Q Consensus 72 ----l~~p~~l~pgqs~~~~lpl~~~g~~~~~~p~~~LQVAIKnN~dVfYF~~~ipl~vlf~e 130 (250)
+.+|.||.||++++..+-...++...+.+ -++.....+-+.|-...+++.-.-++
T Consensus 79 ~~~~i~L~~pl~~~~~~~l~v~~~~~~~~~P~P----~~I~q~e~Q~v~~~~~~~~~SpY~t~ 137 (432)
T PF04597_consen 79 KYYEITLPKPLAPGEKVTLTVEYVLTHALKPYP----AEITQGEKQLVLFTGNAYPLSPYPTK 137 (432)
T ss_pred ceEEEECCCCCCCCCEEEEEEEEEecccceEcC----CcccCCCceEEEEEcCEEecCCcccc
Confidence 12577899999888888877776533222 13444444445555555555444433
No 50
>PHA03001 putative virion core protein; Provisional
Probab=21.41 E-value=77 Score=26.44 Aligned_cols=15 Identities=47% Similarity=1.165 Sum_probs=13.3
Q ss_pred eEEEeecCCcEEEEEe
Q psy8027 105 LQVAVKNNLDVFYFAC 120 (250)
Q Consensus 105 LQVAIKnN~dVfYF~~ 120 (250)
.-+|||+. |+|||.|
T Consensus 54 Fylvvrd~-diFyfkc 68 (132)
T PHA03001 54 FYLVVKDK-DIFYFKC 68 (132)
T ss_pred EEEEEecC-cEEEEEe
Confidence 57899976 9999999
No 51
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=21.30 E-value=51 Score=21.76 Aligned_cols=17 Identities=18% Similarity=0.321 Sum_probs=14.2
Q ss_pred ccCCcccHHHHHhhhcc
Q psy8027 129 MEDGEMERKTFLTTWKE 145 (250)
Q Consensus 129 ~e~G~m~r~~Fl~~Wks 145 (250)
..+|.++.++|.+.|++
T Consensus 37 ~~~G~I~~~EF~~~~~~ 53 (54)
T PF13833_consen 37 DGDGYISFDEFISMMQR 53 (54)
T ss_dssp SSSSSEEHHHHHHHHHH
T ss_pred CCCCCCCHHHHHHHHHh
Confidence 57889999999988864
No 52
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=21.23 E-value=66 Score=16.78 Aligned_cols=17 Identities=24% Similarity=0.352 Sum_probs=13.5
Q ss_pred ccCCcccHHHHHhhhcc
Q psy8027 129 MEDGEMERKTFLTTWKE 145 (250)
Q Consensus 129 ~e~G~m~r~~Fl~~Wks 145 (250)
..+|.++.++|...+++
T Consensus 12 ~~~g~i~~~e~~~~~~~ 28 (29)
T smart00054 12 DGDGKIDFEEFKDLLKA 28 (29)
T ss_pred CCCCcEeHHHHHHHHHh
Confidence 45688999999888765
No 53
>PRK15233 putative fimbrial chaperone protein SefB; Provisional
Probab=20.30 E-value=4.4e+02 Score=24.02 Aligned_cols=90 Identities=18% Similarity=0.219 Sum_probs=52.1
Q ss_pred CCcceEEEEE-EecCCCCCCCCCCeEEEEEEEEeCCCCccchhheec-----ccC---ccccccCCCCCCCccCCCCCeE
Q psy8027 14 KGKGMEIWGT-FSRKPNPATPGIGVIEMELTFTNKSMAPMANFAIQV-----NKN---SFGLTPARPLQVMSPLLPSLSV 84 (250)
Q Consensus 14 ~~~GLeI~g~-f~r~~~~~~~~~g~i~l~l~itN~s~~~~~~faiQf-----NkN---sFGl~P~~~l~~p~~l~pgqs~ 84 (250)
...|+.+.|| ++-. ++.-.-.+++.|.+..| +.+|- +.+ +|=++| |+ .-|.||+..
T Consensus 38 a~Agi~l~~TRvIy~-------~~~~~~sl~i~N~~~~p---~LvQsWvd~~~~~~~~pFiVtP--PL---fRLep~~~~ 102 (246)
T PRK15233 38 KYYGLRLGTTRVIYK-------EDAPSTSFWIMNEKEYP---ILVQTQVYNDDKSSKAPFIVTP--PI---LKVESNART 102 (246)
T ss_pred eeeeEEeCceEEEEe-------CCCcEEEEEEEcCCCCc---EEEEEEEecCCCCccCCEEECC--Ce---EEECCCCce
Confidence 3447888888 4444 66677888899987655 44443 211 343333 11 256788877
Q ss_pred EEEEeeeecC-CCC-------------CC---C-C-CC------ceEEEeecCCcEEEE
Q psy8027 85 EASLPMATTG-AVQ-------------RM---E-P-LT------NLQVAVKNNLDVFYF 118 (250)
Q Consensus 85 ~~~lpl~~~g-~~~-------------~~---~-p-~~------~LQVAIKnN~dVfYF 118 (250)
.+.|-...+. +.+ +. + . .+ .||+||++-+.+||=
T Consensus 103 ~lRIi~~~~~LP~DRESlfwlnv~~IPp~~~~~~~~~n~~~~~~~LqiairtrIKLFYR 161 (246)
T PRK15233 103 RLKVIPTSNLFNKNEESLYWLCVKGVPPLNDNESNNKNNITTNLNVNVVTNSCIKLIYR 161 (246)
T ss_pred EEEEEECCCCCCcCceEEEEEEEEEcCCCCcccccccccccccceEEEEeeeeeEEEEc
Confidence 7666543210 011 10 0 0 11 399999999988884
Done!