Query psy5364
Match_columns 78
No_of_seqs 107 out of 383
Neff 6.9
Searched_HMMs 46136
Date Sat Aug 17 00:41:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5364.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5364hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0518|consensus 99.7 6.6E-18 1.4E-22 133.7 8.2 77 1-78 724-801 (1113)
2 KOG0518|consensus 99.7 2.6E-17 5.6E-22 130.5 7.3 64 2-78 1049-1112(1113)
3 smart00557 IG_FLMN Filamin-typ 99.0 7.4E-10 1.6E-14 66.6 5.8 43 34-77 1-43 (93)
4 smart00557 IG_FLMN Filamin-typ 98.9 2.6E-09 5.7E-14 64.2 4.6 29 2-30 64-92 (93)
5 PF00630 Filamin: Filamin/ABP2 98.8 9.7E-09 2.1E-13 61.4 4.5 45 32-77 2-53 (101)
6 COG4856 Uncharacterized protei 84.1 5.5 0.00012 30.1 6.4 53 8-71 52-109 (403)
7 PF00927 Transglut_C: Transglu 81.0 8.8 0.00019 22.8 5.6 38 9-46 2-50 (107)
8 PF10670 DUF4198: Domain of un 72.4 24 0.00052 23.0 6.7 63 7-75 152-214 (215)
9 TIGR03769 P_ac_wall_RPT actino 71.7 5.2 0.00011 20.6 2.4 20 3-22 10-29 (41)
10 PF14347 DUF4399: Domain of un 70.1 4.3 9.4E-05 24.2 2.1 22 3-24 59-81 (87)
11 TIGR02148 Fibro_Slime fibro-sl 69.2 6.6 0.00014 23.8 2.8 26 3-28 60-86 (90)
12 PF08770 SoxZ: Sulphur oxidati 69.1 23 0.0005 21.5 5.8 25 52-77 67-91 (100)
13 PF08400 phage_tail_N: Prophag 68.8 3.1 6.8E-05 26.9 1.4 17 1-17 51-67 (134)
14 PF13954 PapC_N: PapC N-termin 68.6 6.9 0.00015 24.9 3.0 21 2-22 26-46 (146)
15 PRK15275 putative fimbrial pro 67.6 27 0.00058 23.1 5.7 62 8-75 25-88 (166)
16 PF07495 Y_Y_Y: Y_Y_Y domain; 60.3 8.9 0.00019 20.5 2.0 14 2-15 35-48 (66)
17 cd08548 Type_I_cohesin_like Ty 58.0 45 0.00097 21.0 5.9 34 4-41 11-48 (135)
18 PRK10301 hypothetical protein; 57.3 17 0.00038 22.8 3.2 25 3-28 96-123 (124)
19 PF09906 DUF2135: Uncharacteri 53.9 28 0.00061 18.5 3.2 18 4-21 2-19 (50)
20 PF09430 DUF2012: Protein of u 47.7 59 0.0013 20.1 4.5 38 3-42 37-76 (123)
21 PRK15286 putative minor fimbri 46.9 77 0.0017 21.2 5.2 62 7-76 28-93 (170)
22 TIGR03517 GldM_gliding gliding 46.5 65 0.0014 25.3 5.4 24 22-46 340-363 (523)
23 PF09087 Cyc-maltodext_N: Cycl 46.2 58 0.0013 19.5 4.1 41 35-75 28-72 (88)
24 PF10029 DUF2271: Predicted pe 44.0 40 0.00086 21.6 3.4 26 4-29 75-103 (139)
25 PRK15311 putative fimbrial pro 43.8 1.4E+02 0.003 22.5 6.7 67 6-76 277-356 (359)
26 PF12080 GldM_C: GldM C-termin 41.5 1.1E+02 0.0023 20.4 5.9 34 9-46 6-41 (181)
27 PF01483 P_proprotein: Proprot 41.2 46 0.001 18.9 3.1 22 55-78 7-28 (87)
28 PF07523 Big_3: Bacterial Ig-l 39.9 30 0.00065 18.9 2.1 14 3-16 45-58 (67)
29 PF05359 DUF748: Domain of Unk 39.8 90 0.002 19.2 4.6 52 11-71 14-65 (151)
30 PF12863 DUF3821: Domain of un 36.5 69 0.0015 22.1 3.8 30 49-78 103-142 (209)
31 PF10632 He_PIG_assoc: He_PIG 36.0 31 0.00067 16.7 1.4 9 20-28 13-21 (29)
32 PF08141 SspH: Small acid-solu 35.8 36 0.00077 18.9 1.9 12 8-19 14-25 (58)
33 PF08308 PEGA: PEGA domain; I 35.6 36 0.00078 18.5 2.0 15 3-17 32-46 (71)
34 PRK15315 outer membrane protei 35.0 2.2E+02 0.0048 25.9 7.1 63 8-77 229-299 (1865)
35 PF05688 DUF824: Salmonella re 34.5 36 0.00078 18.1 1.7 16 15-30 25-40 (47)
36 TIGR03503 conserved hypothetic 34.3 50 0.0011 24.8 3.0 28 2-29 274-301 (374)
37 PF02019 WIF: WIF domain; Int 32.4 63 0.0014 20.7 2.9 65 3-71 37-108 (132)
38 PF09898 DUF2125: Uncharacteri 32.3 1.4E+02 0.003 21.0 4.9 19 11-29 25-43 (312)
39 PRK03174 sspH acid-soluble spo 32.0 39 0.00085 18.9 1.7 11 9-19 15-25 (59)
40 COG3459 Cellobiose phosphoryla 31.6 2.5E+02 0.0054 24.2 6.7 60 3-76 997-1056(1056)
41 PF07162 B9-C2: Ciliary basal 31.3 55 0.0012 21.3 2.6 22 1-23 101-122 (168)
42 TIGR02861 SASP_H small acid-so 31.1 41 0.00088 18.7 1.6 11 9-19 15-25 (58)
43 PF04571 Lipin_N: lipin, N-ter 30.5 78 0.0017 19.9 3.0 20 10-29 31-50 (110)
44 PF13584 BatD: Oxygen toleranc 30.3 78 0.0017 23.7 3.5 31 1-31 80-113 (484)
45 PRK01625 sspH acid-soluble spo 30.2 43 0.00094 18.7 1.7 11 9-19 15-25 (59)
46 KOG0416|consensus 27.6 66 0.0014 21.9 2.5 24 55-78 19-42 (189)
47 PF10528 PA14_2: GLEYA domain; 27.4 59 0.0013 20.1 2.1 18 61-78 80-97 (113)
48 PRK15273 putative fimbrial out 27.2 68 0.0015 26.5 2.9 18 3-20 41-58 (881)
49 PF05949 DUF881: Bacterial pro 27.1 83 0.0018 20.2 2.8 18 11-29 80-97 (149)
50 TIGR03773 anch_rpt_wall putati 26.6 1.1E+02 0.0024 24.0 3.8 32 3-34 408-445 (513)
51 COG4856 Uncharacterized protei 26.0 1.4E+02 0.0031 22.8 4.2 43 36-78 345-387 (403)
52 PF13600 DUF4140: N-terminal d 25.9 89 0.0019 18.4 2.6 13 31-43 34-46 (104)
53 PRK15298 fimbrial outer membra 25.4 77 0.0017 26.1 2.9 20 3-22 61-80 (848)
54 PRK15255 fimbrial outer membra 25.1 78 0.0017 25.9 2.9 19 3-21 48-66 (829)
55 PRK15314 outer membrane protei 25.0 4.2E+02 0.009 25.0 7.1 60 14-78 767-838 (2435)
56 PF11797 DUF3324: Protein of u 24.6 65 0.0014 20.3 2.0 14 2-15 102-115 (140)
57 PRK15193 outer membrane usher 24.5 81 0.0017 26.1 2.9 22 3-24 63-84 (876)
58 PF13620 CarboxypepD_reg: Carb 24.4 95 0.0021 16.9 2.5 14 3-16 45-58 (82)
59 PF08695 Coa1: Cytochrome oxid 24.2 88 0.0019 18.8 2.4 15 63-77 66-80 (116)
60 PRK15198 outer membrane usher 24.0 83 0.0018 25.9 2.8 24 3-26 54-77 (860)
61 PRK15207 long polar fimbrial o 23.4 88 0.0019 25.7 2.9 24 3-28 62-85 (842)
62 PF14900 DUF4493: Domain of un 23.2 62 0.0013 22.0 1.7 24 2-25 61-86 (235)
63 PRK15248 fimbrial outer membra 22.8 1.2E+02 0.0027 24.9 3.6 24 3-26 53-77 (853)
64 KOG4680|consensus 22.0 79 0.0017 20.9 1.9 23 4-26 69-91 (153)
65 PRK15284 putative fimbrial out 21.8 99 0.0022 25.6 2.9 18 3-20 59-76 (881)
66 PF03422 CBM_6: Carbohydrate b 21.8 67 0.0014 19.1 1.5 15 2-16 96-110 (125)
67 COG1470 Predicted membrane pro 21.7 4.1E+02 0.0089 21.0 6.3 13 4-16 348-360 (513)
68 TIGR02186 alph_Pro_TM conserve 21.7 3.1E+02 0.0068 19.6 5.1 46 22-77 35-80 (261)
69 PRK09828 putative fimbrial out 21.4 1E+02 0.0022 25.4 2.9 20 2-21 59-78 (865)
70 PRK15304 putative fimbrial out 21.4 1.2E+02 0.0026 24.8 3.2 19 3-21 57-75 (801)
71 PRK15294 putative fimbrial out 21.3 1E+02 0.0022 25.3 2.8 19 3-21 57-75 (845)
72 PF00801 PKD: PKD domain; Int 21.3 1E+02 0.0022 16.5 2.1 14 3-16 49-62 (69)
73 PRK15316 RatA-like protein; Pr 21.2 2.5E+02 0.0055 26.4 5.2 62 14-78 753-824 (2683)
74 PF14734 DUF4469: Domain of un 20.7 93 0.002 19.0 2.0 12 3-14 74-85 (102)
75 COG4136 ABC-type uncharacteriz 20.4 68 0.0015 22.0 1.4 19 54-78 21-39 (213)
76 TIGR03292 PhnH_redo phosphonat 20.3 1.1E+02 0.0024 20.5 2.5 15 32-46 114-128 (174)
77 PF02018 CBM_4_9: Carbohydrate 20.2 1.9E+02 0.0041 16.6 3.3 26 3-30 58-84 (131)
No 1
>KOG0518|consensus
Probab=99.74 E-value=6.6e-18 Score=133.75 Aligned_cols=77 Identities=40% Similarity=0.628 Sum_probs=70.9
Q ss_pred CCccceeeEEEEEECCEecCCCCeEEEecC-CCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364 1 MSLRRRRINLRTEGEGVDVPGSPFTVRVSG-APDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 1 ~~~~~G~~~i~V~~~g~~V~gSPf~v~V~~-~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
.|.+.|.|.|+|+++|+||+||||+|+|.. ..||++++++|.||..|. .-+|++|+||+|+||.|.|.+.++||+++
T Consensus 724 ~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~-t~ep~~fivDtr~agyGgLsi~~~Gpskv 801 (1113)
T KOG0518|consen 724 TPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGH-TFEPAEFIVDTRKAGYGGLSISVQGPSKV 801 (1113)
T ss_pred CCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccc-cccchheEeccccCCCCceEEEEeCCccc
Confidence 378999999999999999999999999985 348999999999999997 45899999999999999999999999953
No 2
>KOG0518|consensus
Probab=99.70 E-value=2.6e-17 Score=130.45 Aligned_cols=64 Identities=41% Similarity=0.744 Sum_probs=60.9
Q ss_pred CccceeeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364 2 SLRRRRINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
|.|+|.|+|.|+|||+||+||||++.|. +++||+||++ +++|.|.|++||.|.|+|+|.||+++
T Consensus 1049 P~e~G~h~i~Vkfd~~hv~gSPF~i~v~-------vsayG~gle~------~sefvvnt~~agaG~lsv~~~gPsk~ 1112 (1113)
T KOG0518|consen 1049 PKENGVHTIDVKFDGEHVPGSPFKIRVG-------VSAYGPGLEG------KSEFVVNTSNAGAGALSVTIDGPSKA 1112 (1113)
T ss_pred ecCCCcEEEEEEeCCccCCCCCeEEEEE-------EEeecCCcCc------cceEEeeccCCCcceeeeecCCCccC
Confidence 6799999999999999999999999995 9999999998 57899999999999999999999986
No 3
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.04 E-value=7.4e-10 Score=66.57 Aligned_cols=43 Identities=56% Similarity=0.927 Sum_probs=40.0
Q ss_pred CCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364 34 ASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG 77 (78)
Q Consensus 34 ~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g 77 (78)
|++|+|+|+||+.+. .++++.|+|+++++|.++|+|.|++|+|
T Consensus 1 p~~~~v~G~Gl~~~~-vg~~~~f~v~~~d~G~~~~~v~i~~p~g 43 (93)
T smart00557 1 ASKVKASGPGLEKGV-VGEPAEFTIDTRGAGGGELEVEVTGPSG 43 (93)
T ss_pred CCeEEEECCCcCcee-cCCCEEEEEEcCCCCCCcEEEEEECCCC
Confidence 689999999999985 5689999999999999999999999997
No 4
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=98.90 E-value=2.6e-09 Score=64.15 Aligned_cols=29 Identities=31% Similarity=0.505 Sum_probs=27.0
Q ss_pred CccceeeEEEEEECCEecCCCCeEEEecC
Q psy5364 2 SLRRRRINLRTEGEGVDVPGSPFTVRVSG 30 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~~ 30 (78)
|.++|.|+|+|+|+|+||+||||.++|.+
T Consensus 64 P~~~G~~~i~V~~~g~~I~gSPF~v~V~~ 92 (93)
T smart00557 64 PTEPGDYTVTVKFGGEHIPGSPFTVKVGP 92 (93)
T ss_pred eCCCEeEEEEEEECCEECCCCCEEEEEeC
Confidence 78999999999999999999999999853
No 5
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=98.79 E-value=9.7e-09 Score=61.43 Aligned_cols=45 Identities=40% Similarity=0.674 Sum_probs=39.4
Q ss_pred CCCCeEEEeCCCcccceeceeceEEEEEccCCCcc-------ceEEEEECCCC
Q psy5364 32 PDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAG-------QLTVRVRGPKG 77 (78)
Q Consensus 32 ~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g-------~l~v~v~gP~g 77 (78)
.||++|+++|+||+.+. .++++.|+|++++++.+ .|.|.|.+|++
T Consensus 2 ~dp~~~~v~G~gl~~~~-~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~ 53 (101)
T PF00630_consen 2 IDPSKCKVSGPGLEPAV-VGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDG 53 (101)
T ss_dssp CSGCGEEEESGGGTEEE-TTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSS
T ss_pred CCCCEEEEECCccCCeE-CCCcEEEEEEEccCCCCccccCCceeEEEEeCCCC
Confidence 69999999999998885 66899999999999665 57799999976
No 6
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=84.11 E-value=5.5 Score=30.10 Aligned_cols=53 Identities=26% Similarity=0.333 Sum_probs=32.0
Q ss_pred eEEEEEECC--EecCCCCeEEEecCCCCCCeEEEeCC-CcccceeceeceEEEE--EccCCCccceEEE
Q psy5364 8 INLRTEGEG--VDVPGSPFTVRVSGAPDASKVRVYGP-GIEHGVLATFQSRFIC--DTRGAGAGQLTVR 71 (78)
Q Consensus 8 ~~i~V~~~g--~~V~gSPf~v~V~~~~d~~kv~v~Gp-GL~~g~~~~~~~~F~I--d~~~aG~g~l~v~ 71 (78)
--+.|+|+. .+|+|-|++++| .+.|| ++...... +..|.| |-+.|+.|..+|.
T Consensus 52 vpvdvlYD~~~y~isg~~etV~V---------tl~G~ns~~~~~~~--~~dFkV~ADLt~a~~Gt~evk 109 (403)
T COG4856 52 VPVDVLYDSDKYFISGQPETVTV---------TLKGPNSIVLKSEK--PEDFKVVADLTHAGVGTHEVK 109 (403)
T ss_pred ceeEEEEccccccccCCceEEEE---------EEeCCcceeeeeec--CcCeEEEEEhhhcCCCceEee
Confidence 346788974 789999999988 45552 23222222 333666 5567766655444
No 7
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=81.03 E-value=8.8 Score=22.82 Aligned_cols=38 Identities=26% Similarity=0.355 Sum_probs=27.3
Q ss_pred EEEEEECCEecCCCCeEEEec---CCCCC--------CeEEEeCCCccc
Q psy5364 9 NLRTEGEGVDVPGSPFTVRVS---GAPDA--------SKVRVYGPGIEH 46 (78)
Q Consensus 9 ~i~V~~~g~~V~gSPf~v~V~---~~~d~--------~kv~v~GpGL~~ 46 (78)
.+.|+..+..+-|++|.+.+. +...+ ..-.++|.|+..
T Consensus 2 ~~~i~~~~~~~vG~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~ 50 (107)
T PF00927_consen 2 EIKIKLPGDPVVGQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTR 50 (107)
T ss_dssp EEEEEEESEEBTTSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEE
T ss_pred eEEEEECCCccCCCCEEEEEEEEeCCcCccccceeEEEEEEEEECCccc
Confidence 477888899999999999876 22333 344568999864
No 8
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=72.37 E-value=24 Score=22.98 Aligned_cols=63 Identities=11% Similarity=0.093 Sum_probs=36.6
Q ss_pred eeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECC
Q psy5364 7 RINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGP 75 (78)
Q Consensus 7 ~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP 75 (78)
...+.|+|+|++++|.+..+.-.. +....... ..-......-.|+|....+|.=.|.+..+.|
T Consensus 152 ~~~~~vl~~GkPl~~a~V~~~~~~--~~~~~~~~----~~~~~TD~~G~~~~~~~~~G~wli~a~~~~p 214 (215)
T PF10670_consen 152 PLPFQVLFDGKPLAGAEVEAFSPG--GWYDVEHE----AKTLKTDANGRATFTLPRPGLWLIRASHKDP 214 (215)
T ss_pred EEEEEEEECCeEcccEEEEEEECC--Cccccccc----eEEEEECCCCEEEEecCCCEEEEEEEEEecC
Confidence 488999999999999766543321 11100000 1111122334677777777777777777666
No 9
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=71.71 E-value=5.2 Score=20.62 Aligned_cols=20 Identities=10% Similarity=0.127 Sum_probs=15.7
Q ss_pred ccceeeEEEEEECCEecCCC
Q psy5364 3 LRRRRINLRTEGEGVDVPGS 22 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gS 22 (78)
.+||.|.|.+...+....|.
T Consensus 10 T~PG~Y~l~~~a~~~~~~G~ 29 (41)
T TIGR03769 10 TKPGTYTLTVQATATLTDGK 29 (41)
T ss_pred CCCeEEEEEEEEEEEeCCCc
Confidence 58999999999876555554
No 10
>PF14347 DUF4399: Domain of unknown function (DUF4399)
Probab=70.06 E-value=4.3 Score=24.21 Aligned_cols=22 Identities=23% Similarity=0.194 Sum_probs=17.7
Q ss_pred ccceeeEEEEEECC-EecCCCCe
Q psy5364 3 LRRRRINLRTEGEG-VDVPGSPF 24 (78)
Q Consensus 3 ~~~G~~~i~V~~~g-~~V~gSPf 24 (78)
+.||.|++.+.|.+ .|++-+|-
T Consensus 59 L~PG~htLtl~~~d~~h~~~~~~ 81 (87)
T PF14347_consen 59 LPPGKHTLTLQLGDGDHVPHDPP 81 (87)
T ss_pred eCCCCEEEEEEeCCCCcccCCCc
Confidence 68999999998864 78876664
No 11
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=69.19 E-value=6.6 Score=23.82 Aligned_cols=26 Identities=19% Similarity=0.227 Sum_probs=21.1
Q ss_pred cccee-eEEEEEECCEecCCCCeEEEe
Q psy5364 3 LRRRR-INLRTEGEGVDVPGSPFTVRV 28 (78)
Q Consensus 3 ~~~G~-~~i~V~~~g~~V~gSPf~v~V 28 (78)
+++|. |.+++.|...|-.+|.|+++-
T Consensus 60 l~~g~~Y~~d~F~~ERht~~S~~~i~T 86 (90)
T TIGR02148 60 LKEGKTYPFDIFYCERHTTGSNFRIRT 86 (90)
T ss_pred CccCcEeeEEEEEEeecCCCCCEEEEE
Confidence 34444 999999999999999998754
No 12
>PF08770 SoxZ: Sulphur oxidation protein SoxZ; InterPro: IPR014880 SoxZ forms an anti parallel beta structure and forms a complex with SoxY. Sulphur oxidation occurs at the thiol of a conserved cysteine residue of the SoxY subunit []. ; PDB: 1V8H_B 2OX5_E 2OXG_E 2OXH_C.
Probab=69.11 E-value=23 Score=21.47 Aligned_cols=25 Identities=28% Similarity=0.312 Sum_probs=14.8
Q ss_pred eceEEEEEccCCCccceEEEEECCCC
Q psy5364 52 FQSRFICDTRGAGAGQLTVRVRGPKG 77 (78)
Q Consensus 52 ~~~~F~Id~~~aG~g~l~v~v~gP~g 77 (78)
+|. |+...+..+.|.|++.+++-+|
T Consensus 67 NP~-l~F~~~~~~~g~l~v~~~Dn~G 91 (100)
T PF08770_consen 67 NPY-LRFSFKGKKSGTLTVTWTDNKG 91 (100)
T ss_dssp S-E-EEEEEEESSSEEEEEEEEETTS
T ss_pred CCc-EEEEEecCCCcEEEEEEEECCC
Confidence 443 4444444445599999887665
No 13
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=68.77 E-value=3.1 Score=26.91 Aligned_cols=17 Identities=12% Similarity=0.128 Sum_probs=14.8
Q ss_pred CCccceeeEEEEEECCE
Q psy5364 1 MSLRRRRINLRTEGEGV 17 (78)
Q Consensus 1 ~~~~~G~~~i~V~~~g~ 17 (78)
|.++||.|.|.++.+|.
T Consensus 51 ~~~epG~Y~V~l~~~g~ 67 (134)
T PF08400_consen 51 FDVEPGVYRVTLKVEGR 67 (134)
T ss_pred EEecCCeEEEEEEECCC
Confidence 67899999999999875
No 14
>PF13954 PapC_N: PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=68.60 E-value=6.9 Score=24.86 Aligned_cols=21 Identities=5% Similarity=-0.043 Sum_probs=17.0
Q ss_pred CccceeeEEEEEECCEecCCC
Q psy5364 2 SLRRRRINLRTEGEGVDVPGS 22 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~gS 22 (78)
.+.||.|.|+|.+|++.+...
T Consensus 26 ~~~pG~Y~vdv~vN~~~~~~~ 46 (146)
T PF13954_consen 26 AIPPGEYSVDVYVNGKFIGRY 46 (146)
T ss_dssp SS-SEEEEEEEEETTEEEEEE
T ss_pred CCCCeEEEEEEEECCeeeeeE
Confidence 357999999999999998733
No 15
>PRK15275 putative fimbrial protein SteD; Provisional
Probab=67.62 E-value=27 Score=23.10 Aligned_cols=62 Identities=18% Similarity=0.156 Sum_probs=37.4
Q ss_pred eEEEEEECCEecCCCCeEEEec--CCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECC
Q psy5364 8 INLRTEGEGVDVPGSPFTVRVS--GAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGP 75 (78)
Q Consensus 8 ~~i~V~~~g~~V~gSPf~v~V~--~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP 75 (78)
-..+|.|.|.-+.-.|..|.=. +-.|-..-++.| .. -.++..|+|+|.......+++++.|-
T Consensus 25 ~~~~~~~~g~l~~pppC~In~~~VdFG~I~~~~idg-----~~-~~~~~~y~I~Cd~~~~~~vkltf~Gt 88 (166)
T PRK15275 25 KTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGD-----AS-QTKPVGYSLNCDGRASDYLKLQIQGT 88 (166)
T ss_pred ccceEEEEEEEecCCCCEEcCCEEECCcceeeeEcC-----cc-cEeeecEEEECCCCCCCcEEEEEEcc
Confidence 3467788888887778776421 001212222233 11 12566899999877666888888874
No 16
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=60.26 E-value=8.9 Score=20.50 Aligned_cols=14 Identities=14% Similarity=0.266 Sum_probs=10.1
Q ss_pred CccceeeEEEEEEC
Q psy5364 2 SLRRRRINLRTEGE 15 (78)
Q Consensus 2 ~~~~G~~~i~V~~~ 15 (78)
++.||.|++.|+-.
T Consensus 35 ~L~~G~Y~l~V~a~ 48 (66)
T PF07495_consen 35 NLPPGKYTLEVRAK 48 (66)
T ss_dssp S--SEEEEEEEEEE
T ss_pred eCCCEEEEEEEEEE
Confidence 47899999999874
No 17
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=57.96 E-value=45 Score=20.99 Aligned_cols=34 Identities=15% Similarity=0.228 Sum_probs=23.8
Q ss_pred ccee-eEEEEEECCEe---cCCCCeEEEecCCCCCCeEEEeC
Q psy5364 4 RRRR-INLRTEGEGVD---VPGSPFTVRVSGAPDASKVRVYG 41 (78)
Q Consensus 4 ~~G~-~~i~V~~~g~~---V~gSPf~v~V~~~~d~~kv~v~G 41 (78)
++|. .+|.|.+.+.+ |.+..|.+.- |++..++..
T Consensus 11 ~~G~tv~VpV~~~~v~~~~i~~~~f~l~y----D~s~Lev~~ 48 (135)
T cd08548 11 KPGDTVTVPVTLSNVPSKGIGACDFVLSY----DPSVLEVVS 48 (135)
T ss_pred cCCCEEEEEEEEecCCccCEEEEEEEEEe----CCceeEEEe
Confidence 3444 78888888776 7788887765 666666654
No 18
>PRK10301 hypothetical protein; Provisional
Probab=57.34 E-value=17 Score=22.78 Aligned_cols=25 Identities=20% Similarity=0.322 Sum_probs=15.5
Q ss_pred ccceeeEEEEEE---CCEecCCCCeEEEe
Q psy5364 3 LRRRRINLRTEG---EGVDVPGSPFTVRV 28 (78)
Q Consensus 3 ~~~G~~~i~V~~---~g~~V~gSPf~v~V 28 (78)
+.+|.|+|+-+. +|.++.|+ |.+.|
T Consensus 96 L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V 123 (124)
T PRK10301 96 LKPGTYTVDWHVVSVDGHKTKGH-YTFSV 123 (124)
T ss_pred CCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence 678888877554 56666655 44443
No 19
>PF09906 DUF2135: Uncharacterized protein conserved in bacteria (DUF2135); InterPro: IPR019220 This entry, found in various hypothetical prokaryotic proteins, has no known function.
Probab=53.87 E-value=28 Score=18.51 Aligned_cols=18 Identities=6% Similarity=-0.167 Sum_probs=14.7
Q ss_pred cceeeEEEEEECCEecCC
Q psy5364 4 RRRRINLRTEGEGVDVPG 21 (78)
Q Consensus 4 ~~G~~~i~V~~~g~~V~g 21 (78)
.+|.|.|-|.|.|-+=..
T Consensus 2 ~~G~Y~VyVnYyg~~~~~ 19 (50)
T PF09906_consen 2 PPGTYLVYVNYYGNRSQS 19 (50)
T ss_pred CCceEEEEEEeccCCCcC
Confidence 479999999999866554
No 20
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=47.65 E-value=59 Score=20.08 Aligned_cols=38 Identities=24% Similarity=0.294 Sum_probs=25.8
Q ss_pred ccceeeEEEEEECCEecCCCCeEEEecC--CCCCCeEEEeCC
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPFTVRVSG--APDASKVRVYGP 42 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~--~~d~~kv~v~Gp 42 (78)
+.+|.|.++|.-.+-.- +|++|.|.+ ...+.++.+..+
T Consensus 37 Vp~GsY~L~V~s~~~~F--~~~RVdV~~~~~~~~~~~~~~~~ 76 (123)
T PF09430_consen 37 VPPGSYLLEVHSPDYVF--PPYRVDVSSSGKIRARKVNYWQP 76 (123)
T ss_pred CCCceEEEEEECCCccc--cCEEEEEecCCCCCceEEEeecC
Confidence 57899999998776554 679998863 234445555444
No 21
>PRK15286 putative minor fimbrial subunit StfE; Provisional
Probab=46.87 E-value=77 Score=21.25 Aligned_cols=62 Identities=16% Similarity=0.143 Sum_probs=37.7
Q ss_pred eeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcc----cceeceeceEEEEEccCCCccceEEEEECCC
Q psy5364 7 RINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIE----HGVLATFQSRFICDTRGAGAGQLTVRVRGPK 76 (78)
Q Consensus 7 ~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~----~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~ 76 (78)
.-..++.|.+.-+.--|+.|+=. ..-+|. +- .|.--.++..++++|.+.....|+++++|..
T Consensus 28 ~~~~~~~~~g~l~~pPpC~In~g-------~V~FGd-V~~~~IDG~nY~q~v~Y~l~C~~~~~~~lkL~i~G~~ 93 (170)
T PRK15286 28 SKTVNMTLTIVVNAAPPCTVTGG-------EVEFGN-VLTTKVDGVNYRQAVGYRLSCNGRVSDYLKLQIQGNA 93 (170)
T ss_pred CccceEEEEEEEecCCCcEEcCC-------EEECCc-cccccccCeeeeEecCeEEEcCCCCCCcEEEEEEccc
Confidence 34577777777776566655411 122232 21 1222235678999998877778999998753
No 22
>TIGR03517 GldM_gliding gliding motility-associated protein GldM. This protein family, GldM, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile. The best conserved region, toward the N-terminus, is centered on a highly hydrobobic probable transmembrane helix. Two paralogs are found in Cytophaga hutchinsonii.
Probab=46.50 E-value=65 Score=25.26 Aligned_cols=24 Identities=25% Similarity=0.465 Sum_probs=17.0
Q ss_pred CCeEEEecCCCCCCeEEEeCCCccc
Q psy5364 22 SPFTVRVSGAPDASKVRVYGPGIEH 46 (78)
Q Consensus 22 SPf~v~V~~~~d~~kv~v~GpGL~~ 46 (78)
=|..|.| |...+.+++++++||..
T Consensus 340 NPisIsv-pgv~~~~v~as~~gl~~ 363 (523)
T TIGR03517 340 NPINISV-PGVGANYVTATGAGLKS 363 (523)
T ss_pred CceEEEe-cCCCcceEEEEccccee
Confidence 3555555 34667889999999864
No 23
>PF09087 Cyc-maltodext_N: Cyclomaltodextrinase, N-terminal; InterPro: IPR015171 This domain is found at the N terminus of cyclomaltodextrinase. The domain assumes a beta-sandwich structure composed of the eight antiparallel beta-strands. A ten residue linker is also present at the C-terminal end, which connects the N-terminal domain to a distal domain in the protein. This domain participates in oligomerisation of the protein, wherein the N-terminal domain of one subunit contacts the active centre of the other subunit, and is also required for binding of cyclodextrin to substrate []. ; PDB: 3EDK_B 3EDD_A 3EDJ_B 3EDE_A 1H3G_B 3EDF_B.
Probab=46.22 E-value=58 Score=19.47 Aligned_cols=41 Identities=20% Similarity=0.299 Sum_probs=21.5
Q ss_pred CeEEEeCCCcc--cceeceeceEEE--EEccCCCccceEEEEECC
Q psy5364 35 SKVRVYGPGIE--HGVLATFQSRFI--CDTRGAGAGQLTVRVRGP 75 (78)
Q Consensus 35 ~kv~v~GpGL~--~g~~~~~~~~F~--Id~~~aG~g~l~v~v~gP 75 (78)
.+|.+..+|+. .-.....|+-.. ++.++|=+|.|.+.+..+
T Consensus 28 ~~v~i~~~gV~i~~v~~~~npNYLFv~L~i~~akpg~~~i~~~~~ 72 (88)
T PF09087_consen 28 AEVSISYPGVTIKKVVKTDNPNYLFVYLDISDAKPGTFTINFKKG 72 (88)
T ss_dssp SEEEE-BTTEEEEEEEE-SSTTEEEEEEEE-T--SEEEEEEEEET
T ss_pred CEEEEeCCCeEEEEEEecCCCCEEEEEEecCCCCCcEEEEEEEcC
Confidence 45555666764 223344555443 345688888898888765
No 24
>PF10029 DUF2271: Predicted periplasmic protein (DUF2271); InterPro: IPR014469 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=43.97 E-value=40 Score=21.60 Aligned_cols=26 Identities=27% Similarity=0.466 Sum_probs=15.5
Q ss_pred cceeeEEEEEE---CCEecCCCCeEEEec
Q psy5364 4 RRRRINLRTEG---EGVDVPGSPFTVRVS 29 (78)
Q Consensus 4 ~~G~~~i~V~~---~g~~V~gSPf~v~V~ 29 (78)
.+|.|++.... .+..++..=|++.|+
T Consensus 75 ~~G~~~~~~d~~~~~~~~l~~g~Y~l~vE 103 (139)
T PF10029_consen 75 PPGKYTLSWDGTDDIGNPLPDGGYTLRVE 103 (139)
T ss_pred CCCccEEEEEccccccCccCCCcEEEEEE
Confidence 46777666643 345566666777663
No 25
>PRK15311 putative fimbrial protein TcfD; Provisional
Probab=43.85 E-value=1.4e+02 Score=22.48 Aligned_cols=67 Identities=22% Similarity=0.286 Sum_probs=42.2
Q ss_pred eeeEEEEEECCEecC---CCCeEEEecCCCCCCe-EEEeCCCcccceeceeceE-------EEEEccCCCc--cceEEEE
Q psy5364 6 RRINLRTEGEGVDVP---GSPFTVRVSGAPDASK-VRVYGPGIEHGVLATFQSR-------FICDTRGAGA--GQLTVRV 72 (78)
Q Consensus 6 G~~~i~V~~~g~~V~---gSPf~v~V~~~~d~~k-v~v~GpGL~~g~~~~~~~~-------F~Id~~~aG~--g~l~v~v 72 (78)
=+|.|+..|+|..|. |-=|.++= .|-.. --|.=||+...+.+ -|+. |.|-.+.||. |.|+|++
T Consensus 277 iDY~Vsl~y~G~~i~v~Nge~~~l~~---v~~~~~R~V~LPgIs~pV~C-vPtPLTL~t~~F~v~~k~aG~Y~G~LtVt~ 352 (359)
T PRK15311 277 IDYAVSMNYGGRSIPVTRGVEFSLDN---VDKAATRPVVLPGQRQAVRC-VPVPLTLTTQPFNIREKRSGEYQGTLTVTM 352 (359)
T ss_pred eeEEEEEecCCceeccCCCceEEEec---cchhceeeeecCCCccceEE-eccccEeecCcccccCCCCceeeeEEEEEE
Confidence 358899988876664 44444421 23233 33566888876643 4553 5555588985 9999988
Q ss_pred ECCC
Q psy5364 73 RGPK 76 (78)
Q Consensus 73 ~gP~ 76 (78)
+=++
T Consensus 353 ~~~t 356 (359)
T PRK15311 353 LMGT 356 (359)
T ss_pred ecCC
Confidence 7543
No 26
>PF12080 GldM_C: GldM C-terminal domain; InterPro: IPR022719 This domain is found in bacteria at the C terminus of the GldM protein. This domain is typically between 169 to 182 amino acids in length and has two completely conserved residues (Y and N) that may be functionally important. GldM, is named for the member from Bacteriodetes Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes [].
Probab=41.47 E-value=1.1e+02 Score=20.44 Aligned_cols=34 Identities=21% Similarity=0.393 Sum_probs=23.8
Q ss_pred EEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCC--ccc
Q psy5364 9 NLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPG--IEH 46 (78)
Q Consensus 9 ~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpG--L~~ 46 (78)
+++|+|.|. .=|..+.+. ...+.++.++..| |..
T Consensus 6 ~MNVLY~G~---~NpisIsvp-gv~~~~v~~s~~ggsl~~ 41 (181)
T PF12080_consen 6 KMNVLYRGV---DNPISISVP-GVPSNKVPASATGGSLSK 41 (181)
T ss_pred ccceEecCC---CCcEEEEeC-CCCccccEEEeeCCEEEe
Confidence 467888875 457777664 4567888888877 654
No 27
>PF01483 P_proprotein: Proprotein convertase P-domain; InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=41.17 E-value=46 Score=18.91 Aligned_cols=22 Identities=18% Similarity=0.234 Sum_probs=13.3
Q ss_pred EEEEEccCCCccceEEEEECCCCC
Q psy5364 55 RFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 55 ~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
.+.|.... .|+|.+.+..|+|.
T Consensus 7 ~v~i~H~~--~gdL~i~L~SP~Gt 28 (87)
T PF01483_consen 7 SVDITHPY--RGDLRITLISPSGT 28 (87)
T ss_dssp EEEEEESS--GGGEEEEEE-TT--
T ss_pred EEEEEcCC--cCCEEEEEECCCCC
Confidence 34444443 67899999999983
No 28
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=39.89 E-value=30 Score=18.86 Aligned_cols=14 Identities=7% Similarity=0.050 Sum_probs=11.8
Q ss_pred ccceeeEEEEEECC
Q psy5364 3 LRRRRINLRTEGEG 16 (78)
Q Consensus 3 ~~~G~~~i~V~~~g 16 (78)
.++|.|.|.++|.+
T Consensus 45 ~~~G~y~Vt~~y~~ 58 (67)
T PF07523_consen 45 SKAGTYTVTYTYKG 58 (67)
T ss_dssp TS-CCEEEEEEECT
T ss_pred CCCceEEEEEEECC
Confidence 47899999999998
No 29
>PF05359 DUF748: Domain of Unknown Function (DUF748); InterPro: IPR008023 This is a family of proteins of unknown function.
Probab=39.84 E-value=90 Score=19.19 Aligned_cols=52 Identities=17% Similarity=0.270 Sum_probs=24.9
Q ss_pred EEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEE
Q psy5364 11 RTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVR 71 (78)
Q Consensus 11 ~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~ 71 (78)
+|.|.+..++. ||++.+.+ -...+.. |... .+.++.+.+.+.-.+.|.+.+.
T Consensus 14 ~i~f~D~~~~~-~~~~~l~~----i~~~l~~--l~~~--~~~~~~~~l~~~~~~~g~l~~~ 65 (151)
T PF05359_consen 14 RINFTDQSVPP-PFKLELSD----INLTLDN--LSTD--SNEPAPVSLSASLNGGGTLSAS 65 (151)
T ss_pred EEEEEeCCCCC-CceEEEEE----EEEEEEe--ccCC--CCCcEEEEEEEEeCCCCEEEEE
Confidence 45666655554 88776632 2223332 3211 1234555555554444444443
No 30
>PF12863 DUF3821: Domain of unknown function (DUF3821); InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=36.52 E-value=69 Score=22.12 Aligned_cols=30 Identities=27% Similarity=0.381 Sum_probs=20.4
Q ss_pred eceeceEEEEEcc--------CCCc--cceEEEEECCCCC
Q psy5364 49 LATFQSRFICDTR--------GAGA--GQLTVRVRGPKGF 78 (78)
Q Consensus 49 ~~~~~~~F~Id~~--------~aG~--g~l~v~v~gP~g~ 78 (78)
+.+....|+|++. +.+. +.+.+.|++|+|.
T Consensus 103 ~~G~~v~FrI~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~ 142 (209)
T PF12863_consen 103 PRGDNVNFRIDTNLYSIFQRGGYTPGDGPVDIKVTTPSGA 142 (209)
T ss_pred ccCCeEEEEEcccHHHHhhcCCCCCCcceEEEEEeCCCCc
Confidence 3445678999863 1122 3399999999984
No 31
>PF10632 He_PIG_assoc: He_PIG associated, NEW1 domain of bacterial glycohydrolase; InterPro: IPR019599 This domain has been named NEW1 but its actual function is not known. It is found on proteins which are bacterial galactosidases []. The domain is associated with IPR008009 from INTERPRO, a putative Ig-containing domain.
Probab=36.05 E-value=31 Score=16.66 Aligned_cols=9 Identities=56% Similarity=1.209 Sum_probs=7.7
Q ss_pred CCCCeEEEe
Q psy5364 20 PGSPFTVRV 28 (78)
Q Consensus 20 ~gSPf~v~V 28 (78)
+||||...|
T Consensus 13 Pg~pfl~~I 21 (29)
T PF10632_consen 13 PGSPFLFTI 21 (29)
T ss_pred CCCcEEEEe
Confidence 689998877
No 32
>PF08141 SspH: Small acid-soluble spore protein H family; InterPro: IPR012610 This family consists of the small acid-soluble spore proteins (SASP) of the H type (sspH). SspH are unique to spores of Bacillus subtilis and are expressed only in the forespore compartment during sporulation of this organism. The sspH genes are monocistronic and are recognised by the forespore-specific sigma factor for RNA polymerase - sigma-G. The specific role of this protein is unclear but is thought to play a role in sporulation under conditions different from that of the common laboratory tests of spore properties [].; GO: 0030436 asexual sporulation, 0042601 endospore-forming forespore
Probab=35.80 E-value=36 Score=18.90 Aligned_cols=12 Identities=25% Similarity=0.188 Sum_probs=9.5
Q ss_pred eEEEEEECCEec
Q psy5364 8 INLRTEGEGVDV 19 (78)
Q Consensus 8 ~~i~V~~~g~~V 19 (78)
-.++|+|+|.+|
T Consensus 14 ~~i~V~y~G~pV 25 (58)
T PF08141_consen 14 DMIEVTYNGVPV 25 (58)
T ss_pred ceEEEEECCEEE
Confidence 357899998776
No 33
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=35.62 E-value=36 Score=18.51 Aligned_cols=15 Identities=20% Similarity=0.202 Sum_probs=12.1
Q ss_pred ccceeeEEEEEECCE
Q psy5364 3 LRRRRINLRTEGEGV 17 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~ 17 (78)
+.+|.|.|.++..|-
T Consensus 32 l~~G~~~v~v~~~Gy 46 (71)
T PF08308_consen 32 LPPGEHTVTVEKPGY 46 (71)
T ss_pred cCCccEEEEEEECCC
Confidence 568889988888873
No 34
>PRK15315 outer membrane protein RatA; Provisional
Probab=34.96 E-value=2.2e+02 Score=25.88 Aligned_cols=63 Identities=27% Similarity=0.402 Sum_probs=37.1
Q ss_pred eEEEEEE-CCEecCCCCeEEEecCCC-------CCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364 8 INLRTEG-EGVDVPGSPFTVRVSGAP-------DASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG 77 (78)
Q Consensus 8 ~~i~V~~-~g~~V~gSPf~v~V~~~~-------d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g 77 (78)
.+|.++. +|+++++-||.|+..+.. ++.-|++-|--| ....++|.= ..+| .|.++|.|+.|+|
T Consensus 229 LTVttk~~~g~~v~nv~f~i~~~da~NRQ~~vnn~~pV~vg~tel-----t~~~~~Y~G-~TDa-nG~aTv~vTqpnG 299 (1865)
T PRK15315 229 LTVTTKACDGEPAINAPFVIRREDAINRQGVVNNANPVRVGNTEL-----TTAQTEYHG-VTDG-QGNATVVVTQENG 299 (1865)
T ss_pred EEEEeeccCCcccCceeEEEEeccchhccCCcCCCCceEECCeec-----ccccceEee-eeCC-CCeEEEEEeCCCC
Confidence 4555566 479999999999865422 223455532211 111234332 2333 6789999998887
No 35
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=34.49 E-value=36 Score=18.09 Aligned_cols=16 Identities=38% Similarity=0.715 Sum_probs=13.4
Q ss_pred CCEecCCCCeEEEecC
Q psy5364 15 EGVDVPGSPFTVRVSG 30 (78)
Q Consensus 15 ~g~~V~gSPf~v~V~~ 30 (78)
+|.++++-||.++-..
T Consensus 25 ~G~pv~n~~f~l~r~~ 40 (47)
T PF05688_consen 25 NGNPVPNAPFTLTRGD 40 (47)
T ss_pred CCCCcCCceEEEEecC
Confidence 6899999999987743
No 36
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=34.29 E-value=50 Score=24.84 Aligned_cols=28 Identities=7% Similarity=0.204 Sum_probs=24.9
Q ss_pred CccceeeEEEEEECCEecCCCCeEEEec
Q psy5364 2 SLRRRRINLRTEGEGVDVPGSPFTVRVS 29 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~ 29 (78)
..+.|.|++.....+..+.|.||.+.+.
T Consensus 274 ~~~~G~Y~i~~~~~~Tt~~GRe~~~~lp 301 (374)
T TIGR03503 274 NYEFGKYRVKGTVFGTTITGRELQLTLP 301 (374)
T ss_pred cCCCeEEEEEEEEEEecCCCCEEEEEcC
Confidence 3588999999999999999999998774
No 37
>PF02019 WIF: WIF domain; InterPro: IPR003306 Wnt proteins constitute a large family of secreted molecules that are involved in intercellular signalling during development. The name derives from the first 2 members of the family to be discovered: int-1 (mouse) and wingless (Drosophila) []. It is now recognised that Wnt signalling controls many cell fate decisions in a variety of different organisms, including mammals []. Wnt signalling has been implicated in tumourigenesis, early mesodermal patterning of the embryo, morphogenesis of the brain and kidneys, regulation of mammary gland proliferation and Alzheimer's disease [, ]. Wnt-mediated signalling is believed to proceed initially through binding to cell surface receptors of the frizzled family; the signal is subsequently transduced through several cytoplasmic components to B-catenin, which enters the nucleus and activates the transcription of several genes important in development []. Several non-canonical Wnt signalling pathways have also been elucidated that act independently of B-catenin. Canonical and noncanonical Wnt signaling branches are highly interconnected, and cross-regulate each other []. Members of the Wnt gene family are defined by their sequence similarity to mouse Wnt-1 and Wingless in Drosophila. They encode proteins of ~350-400 residues in length, with orthologues identified in several, mostly vertebrate, species. Very little is known about the structure of Wnts as they are notoriously insoluble, but they share the following features characteristics of secretory proteins: a signal peptide, several potential N-glycosylation sites and 22 conserved cysteines [] that are probably involved in disulphide bonds. The Wnt proteins seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal over only few cell diameters. Fifteen major Wnt gene families have been identified in vertebrates, with multiple subtypes within some classes. This entry represents the WIF domain, and is found in the RYK tyrosine kinase receptors and WIF the Wnt-inhibitory-factor. The domain is extracellular and contains two conserved cysteines that may form a disulphide bridge. This domain is Wnt binding in WIF, and it has been suggested that RYK may also bind to Wnt [].; GO: 0004713 protein tyrosine kinase activity; PDB: 2YGP_A 2YGO_A 2YGN_A 2D3J_A 2YGQ_A.
Probab=32.35 E-value=63 Score=20.71 Aligned_cols=65 Identities=15% Similarity=0.207 Sum_probs=27.4
Q ss_pred ccceeeEEEEEECCEecCCCCeEEEecCCCCC-----CeEEEeCCCcccceeceeceEEEEEc--cCCCccceEEE
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPFTVRVSGAPDA-----SKVRVYGPGIEHGVLATFQSRFICDT--RGAGAGQLTVR 71 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~-----~kv~v~GpGL~~g~~~~~~~~F~Id~--~~aG~g~l~v~ 71 (78)
..+..+.++.+|...-=..-++.+.+....|+ -.++++- .|.+-..+..|.|+- .+.-.|+..++
T Consensus 37 VPs~i~~l~FtW~s~~~~~~~Y~~~~l~s~d~~~l~~P~lnIs~----~G~VP~~~~~F~V~L~CtG~~~g~a~v~ 108 (132)
T PF02019_consen 37 VPSEINYLNFTWQSLGKKPYPYNFDVLKSSDPSILPHPSLNISK----SGEVPHSPQVFSVELPCTGKRSGEATVT 108 (132)
T ss_dssp B-TT-SEEEEEEEE-SSS-EEEEEEEEEES-TTTBB--EESS-S----EEEE-SS-EEEEEE--B-SSS-EEEEEE
T ss_pred CCCCcceEEEEEEcCCCceEEEEEEEEEccCcccccCCccccCC----CeeccCCCEEEEEEEEecCccceEEEEE
Confidence 44556777777865333355666666222222 2222221 233334577898874 34323444443
No 38
>PF09898 DUF2125: Uncharacterized protein conserved in bacteria (DUF2125); InterPro: IPR018666 This family of conserved hypothetical proteins has no known function.
Probab=32.30 E-value=1.4e+02 Score=21.03 Aligned_cols=19 Identities=32% Similarity=0.434 Sum_probs=16.8
Q ss_pred EEEECCEecCCCCeEEEec
Q psy5364 11 RTEGEGVDVPGSPFTVRVS 29 (78)
Q Consensus 11 ~V~~~g~~V~gSPf~v~V~ 29 (78)
.+.|.+..+.|-||++.+.
T Consensus 25 ~~~~~~~~~~GfPfR~~~~ 43 (312)
T PF09898_consen 25 QVECADRSVGGFPFRFDVR 43 (312)
T ss_pred EEEECCceeeccCeEEEEE
Confidence 5788999999999999875
No 39
>PRK03174 sspH acid-soluble spore protein H; Provisional
Probab=32.00 E-value=39 Score=18.87 Aligned_cols=11 Identities=18% Similarity=0.187 Sum_probs=8.9
Q ss_pred EEEEEECCEec
Q psy5364 9 NLRTEGEGVDV 19 (78)
Q Consensus 9 ~i~V~~~g~~V 19 (78)
.++|+|+|.+|
T Consensus 15 ~i~VtY~G~pV 25 (59)
T PRK03174 15 MANVTYNGVPI 25 (59)
T ss_pred ceEEEECCEEE
Confidence 57899998776
No 40
>COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism]
Probab=31.60 E-value=2.5e+02 Score=24.19 Aligned_cols=60 Identities=20% Similarity=0.303 Sum_probs=40.3
Q ss_pred ccceeeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCC
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPK 76 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~ 76 (78)
.+=..|+++++|+ |+|++++|........+.+.|..++.-. .| -..+|...-.|.|.+|.
T Consensus 997 ~ew~G~~~~~~~~-----~~~~~i~V~~~~~~k~I~ldG~~~~~~~---~p------~~~~~~~~~evvv~~~~ 1056 (1056)
T COG3459 997 EEWDGYEATYELR-----NAPYDITVERPQGVKEILLDGQKIEGKI---LP------VGEAGDHKQEVVVKGPA 1056 (1056)
T ss_pred cccCCeEEEEEEC-----CceEEEEEEcCCCcceEEEecccccccc---cc------ccccCCcceEEEEecCC
Confidence 3445577887776 6999999987555577888887665421 11 23467777778887763
No 41
>PF07162 B9-C2: Ciliary basal body-associated, B9 protein; InterPro: IPR010796 Proteins in this entry include the MSK1 protein (Q9NXB0 from SWISSPROT) and other known or predicted flagellar basal body proteome components [] or cilia-containing species. Although the function is unknown, a cilia-specific role has been suggested for the poorly characterised B9 domain [, , ]. Mutations in MSK1 have been shown to cause Meckel syndrome type 1, a severe foetal development disorder that has been reported in most populations.
Probab=31.30 E-value=55 Score=21.28 Aligned_cols=22 Identities=18% Similarity=0.131 Sum_probs=16.5
Q ss_pred CCccceeeEEEEEECCEecCCCC
Q psy5364 1 MSLRRRRINLRTEGEGVDVPGSP 23 (78)
Q Consensus 1 ~~~~~G~~~i~V~~~g~~V~gSP 23 (78)
+|+.||.|+++|... .+..+|-
T Consensus 101 lP~~pG~h~~~v~~w-rP~~~s~ 122 (168)
T PF07162_consen 101 LPTQPGRHEVEVPTW-RPVSGSI 122 (168)
T ss_pred eCCCCceEEEEEEEE-eecCCCH
Confidence 488999999999777 4545543
No 42
>TIGR02861 SASP_H small acid-soluble spore protein, H-type. This model is derived from pfam08141 but has been expanded to include in the seed corresponding proteins from three species of Clostridium. Members of this family should occur only in endospore-forming bacteria, typically with two members per genome, but may be absent from the genomes of some endospore-forming bacteria. SspH (previously designated YfjU) was shown to be expressed specifically in spores of Bacillus subtilis.
Probab=31.11 E-value=41 Score=18.67 Aligned_cols=11 Identities=27% Similarity=0.326 Sum_probs=8.9
Q ss_pred EEEEEECCEec
Q psy5364 9 NLRTEGEGVDV 19 (78)
Q Consensus 9 ~i~V~~~g~~V 19 (78)
.+.|+|+|++|
T Consensus 15 ~i~V~Y~G~pV 25 (58)
T TIGR02861 15 MINVTYKGVPV 25 (58)
T ss_pred ceEEEECCEEE
Confidence 47899998776
No 43
>PF04571 Lipin_N: lipin, N-terminal conserved region; InterPro: IPR007651 Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. This entry represents a conserved domain found at the N terminus of the member proteins [, ].
Probab=30.55 E-value=78 Score=19.86 Aligned_cols=20 Identities=30% Similarity=0.449 Sum_probs=13.3
Q ss_pred EEEEECCEecCCCCeEEEec
Q psy5364 10 LRTEGEGVDVPGSPFTVRVS 29 (78)
Q Consensus 10 i~V~~~g~~V~gSPf~v~V~ 29 (78)
|-|+..+-...-|||-++..
T Consensus 31 IVV~q~DGs~~sSPFhVRFG 50 (110)
T PF04571_consen 31 IVVEQPDGSLKSSPFHVRFG 50 (110)
T ss_pred EEEecCCCCEecCccEEEEc
Confidence 34556666777888877654
No 44
>PF13584 BatD: Oxygen tolerance
Probab=30.28 E-value=78 Score=23.67 Aligned_cols=31 Identities=23% Similarity=0.335 Sum_probs=25.2
Q ss_pred CCccceeeEE---EEEECCEecCCCCeEEEecCC
Q psy5364 1 MSLRRRRINL---RTEGEGVDVPGSPFTVRVSGA 31 (78)
Q Consensus 1 ~~~~~G~~~i---~V~~~g~~V~gSPf~v~V~~~ 31 (78)
+|.++|.|+| .|.++|+...=.|++++|.+.
T Consensus 80 ~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~ 113 (484)
T PF13584_consen 80 QPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKA 113 (484)
T ss_pred EecccceEEEceEEEEECCEEEeecCEEEEEEec
Confidence 4778999886 567799888889999999764
No 45
>PRK01625 sspH acid-soluble spore protein H; Provisional
Probab=30.25 E-value=43 Score=18.68 Aligned_cols=11 Identities=36% Similarity=0.456 Sum_probs=9.0
Q ss_pred EEEEEECCEec
Q psy5364 9 NLRTEGEGVDV 19 (78)
Q Consensus 9 ~i~V~~~g~~V 19 (78)
.+.|+|+|.+|
T Consensus 15 ~i~V~Y~G~pV 25 (59)
T PRK01625 15 RIDVTYEGVPV 25 (59)
T ss_pred ceEEEECCEEE
Confidence 57899998877
No 46
>KOG0416|consensus
Probab=27.60 E-value=66 Score=21.93 Aligned_cols=24 Identities=21% Similarity=0.382 Sum_probs=17.3
Q ss_pred EEEEEccCCCccceEEEEECCCCC
Q psy5364 55 RFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 55 ~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
.+.|+.-+-+-.+|.|+++||.++
T Consensus 19 ~yeV~~ind~m~ef~V~f~GP~ds 42 (189)
T KOG0416|consen 19 DYEVTIINDGMQEFYVKFHGPKDS 42 (189)
T ss_pred CCeEEEecCcccEEEEEeeCCCCC
Confidence 456666666778888888888763
No 47
>PF10528 PA14_2: GLEYA domain; InterPro: IPR018871 This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=27.42 E-value=59 Score=20.05 Aligned_cols=18 Identities=28% Similarity=0.713 Sum_probs=13.0
Q ss_pred cCCCccceEEEEECCCCC
Q psy5364 61 RGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 61 ~~aG~g~l~v~v~gP~g~ 78 (78)
...|.+.|..+|..|+|.
T Consensus 80 N~~g~~~~~~~i~~P~G~ 97 (113)
T PF10528_consen 80 NGGGPGSFDFSITDPDGT 97 (113)
T ss_dssp E-SS-EEEEEEEEETT-S
T ss_pred cCCCceEEEEEEECCCCc
Confidence 446789999999999984
No 48
>PRK15273 putative fimbrial outer membrane usher protein SteB; Provisional
Probab=27.15 E-value=68 Score=26.53 Aligned_cols=18 Identities=17% Similarity=0.187 Sum_probs=16.5
Q ss_pred ccceeeEEEEEECCEecC
Q psy5364 3 LRRRRINLRTEGEGVDVP 20 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~ 20 (78)
+.||.|.|+|.+||+.+.
T Consensus 41 ~~pG~Y~vdv~vN~~~~~ 58 (881)
T PRK15273 41 ILPGVYLLQIKINGQTLP 58 (881)
T ss_pred cCCeEEEEEEEECCEEcc
Confidence 579999999999999996
No 49
>PF05949 DUF881: Bacterial protein of unknown function (DUF881); InterPro: IPR010273 This family consists of a series of hypothetical bacterial proteins. One of the family members Q45543 from SWISSPROT from Bacillus subtilis is thought to be involved in cell division and sporulation [].; PDB: 3GMG_B.
Probab=27.15 E-value=83 Score=20.23 Aligned_cols=18 Identities=11% Similarity=0.346 Sum_probs=14.3
Q ss_pred EEEECCEecCCCCeEEEec
Q psy5364 11 RTEGEGVDVPGSPFTVRVS 29 (78)
Q Consensus 11 ~V~~~g~~V~gSPf~v~V~ 29 (78)
.++.||+++. +||.+++.
T Consensus 80 ~i~Vng~~i~-~Py~I~AI 97 (149)
T PF05949_consen 80 TILVNGRPIS-PPYVIKAI 97 (149)
T ss_dssp EEEETTEEE--SSEEEEEE
T ss_pred EEEECCEEcc-CCEEEEEE
Confidence 4678999996 89999876
No 50
>TIGR03773 anch_rpt_wall putative ABC transporter-associated repeat protein. Members of this protein family occur in genomes that contain a three-gene ABC transporter operon associated with the presence of domain TIGR03769. That domain occurs as a single-copy insert in the substrate-binding protein, and occurs in two or more copies in members of this protein family. Members of this family typically are encoded adjacent to the said transporter operon and may serve as a substrate receptor.
Probab=26.61 E-value=1.1e+02 Score=23.97 Aligned_cols=32 Identities=16% Similarity=0.224 Sum_probs=19.9
Q ss_pred ccceeeEEEEEEC-----CEecC-CCCeEEEecCCCCC
Q psy5364 3 LRRRRINLRTEGE-----GVDVP-GSPFTVRVSGAPDA 34 (78)
Q Consensus 3 ~~~G~~~i~V~~~-----g~~V~-gSPf~v~V~~~~d~ 34 (78)
.+||.|+|.+++. |..+. -.-+++.|....||
T Consensus 408 TkPG~Y~l~~~~sa~~~~G~~vS~~~tltfaVG~~~~~ 445 (513)
T TIGR03773 408 SKPGTYKLQVTQSATTTNGKKASDTTTLTFAVGDNNDP 445 (513)
T ss_pred CCCcEEEEEEEEEEEeCCCceecccEEEEEEeCCCCCc
Confidence 5899999999974 44443 23345566554444
No 51
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.99 E-value=1.4e+02 Score=22.77 Aligned_cols=43 Identities=19% Similarity=0.268 Sum_probs=24.9
Q ss_pred eEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364 36 KVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 36 kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
-+.++|.+-..=..+...+.|.+..-..|.+...+.|+||++.
T Consensus 345 ~~ta~g~~~~~v~~~di~~~~d~~~L~~g~~~v~v~v~gp~~l 387 (403)
T COG4856 345 DVTAKGYSTNNVKDGDIKAYVDVSNLAEGNHSVSVEVTGPSGL 387 (403)
T ss_pred eeeeccCcccceeecceeEEEehhhcccccceeEEEEECCccc
Confidence 3556665542211122233344444457889999999999973
No 52
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=25.90 E-value=89 Score=18.37 Aligned_cols=13 Identities=38% Similarity=0.644 Sum_probs=10.1
Q ss_pred CCCCCeEEEeCCC
Q psy5364 31 APDASKVRVYGPG 43 (78)
Q Consensus 31 ~~d~~kv~v~GpG 43 (78)
..|+.-+++.|+|
T Consensus 34 ~~d~~Sl~V~~~g 46 (104)
T PF13600_consen 34 SLDPDSLRVSGEG 46 (104)
T ss_pred ccCCCcEEEEecC
Confidence 3678888888877
No 53
>PRK15298 fimbrial outer membrane usher protein StiC; Provisional
Probab=25.36 E-value=77 Score=26.06 Aligned_cols=20 Identities=0% Similarity=-0.102 Sum_probs=17.3
Q ss_pred ccceeeEEEEEECCEecCCC
Q psy5364 3 LRRRRINLRTEGEGVDVPGS 22 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gS 22 (78)
+.||.|.|+|.+||+.+...
T Consensus 61 ~~pG~Y~vdi~vN~~~~~~~ 80 (848)
T PRK15298 61 QLPGTYYVTININKTEIGAR 80 (848)
T ss_pred cCCeEEEEEEEECCeEcccE
Confidence 46999999999999998653
No 54
>PRK15255 fimbrial outer membrane usher protein StdB; Provisional
Probab=25.07 E-value=78 Score=25.93 Aligned_cols=19 Identities=11% Similarity=-0.155 Sum_probs=16.8
Q ss_pred ccceeeEEEEEECCEecCC
Q psy5364 3 LRRRRINLRTEGEGVDVPG 21 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~g 21 (78)
+.||.|.|+|.+||+.+..
T Consensus 48 ~~pG~Y~vdi~vN~~~~~~ 66 (829)
T PRK15255 48 VAPGNYILDIWLNDQPVRE 66 (829)
T ss_pred cCCeEEEEEEEECCEEccc
Confidence 4699999999999999954
No 55
>PRK15314 outer membrane protein RatB; Provisional
Probab=24.99 E-value=4.2e+02 Score=24.97 Aligned_cols=60 Identities=28% Similarity=0.413 Sum_probs=34.5
Q ss_pred ECCEecCCCCeEEEecCC----------CCCC--eEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364 14 GEGVDVPGSPFTVRVSGA----------PDAS--KVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 14 ~~g~~V~gSPf~v~V~~~----------~d~~--kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
.+|.+|++-+|+++-... .||+ .+.+- |+--+ .++++...=...+| .|.+.|.|+.|+|.
T Consensus 767 ~ng~pvp~~~ftvt~~~g~nRqg~~~G~~D~~~g~v~i~--G~~y~--t~q~s~~Y~G~Tda-~G~~tv~ITqP~Gv 838 (2435)
T PRK15314 767 LNGLSVPNASFTVTMSHGKNRDNATTGFTDPSDGTLVMG--GTPFG--SSQASMTYQGMTDA-EGNATLVIEQPQGV 838 (2435)
T ss_pred cCCCccCCcceEEEeccccccCCcccCCCCCCcceEEEC--ccccc--cCccceeeeeeecC-CCcEEEEEeCCCCc
Confidence 358999999999976532 1221 11221 11111 13333333344555 78999999999884
No 56
>PF11797 DUF3324: Protein of unknown function C-terminal (DUF3324); InterPro: IPR021759 This family consists of several hypothetical bacterial proteins of unknown function.
Probab=24.57 E-value=65 Score=20.33 Aligned_cols=14 Identities=14% Similarity=0.361 Sum_probs=10.9
Q ss_pred CccceeeEEEEEEC
Q psy5364 2 SLRRRRINLRTEGE 15 (78)
Q Consensus 2 ~~~~G~~~i~V~~~ 15 (78)
.++||.|++.+...
T Consensus 102 ~lk~G~Y~l~~~~~ 115 (140)
T PF11797_consen 102 KLKPGKYTLKITAK 115 (140)
T ss_pred CccCCEEEEEEEEE
Confidence 46899999987654
No 57
>PRK15193 outer membrane usher protein; Provisional
Probab=24.52 E-value=81 Score=26.08 Aligned_cols=22 Identities=5% Similarity=-0.127 Sum_probs=18.0
Q ss_pred ccceeeEEEEEECCEecCCCCe
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPF 24 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf 24 (78)
+.||.|.|+|.+||+.+.-.-.
T Consensus 63 ~~pG~Y~vdi~vN~~~~~~~~i 84 (876)
T PRK15193 63 IAAGVYRVDIVLNNTFVATRDV 84 (876)
T ss_pred cCCeEEEEEEEECCeEcccEEE
Confidence 4699999999999999865433
No 58
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=24.38 E-value=95 Score=16.85 Aligned_cols=14 Identities=29% Similarity=0.389 Sum_probs=7.9
Q ss_pred ccceeeEEEEEECC
Q psy5364 3 LRRRRINLRTEGEG 16 (78)
Q Consensus 3 ~~~G~~~i~V~~~g 16 (78)
+.+|.|.|.+...|
T Consensus 45 l~~g~Y~l~v~~~g 58 (82)
T PF13620_consen 45 LPPGTYTLRVSAPG 58 (82)
T ss_dssp E-SEEEEEEEEBTT
T ss_pred cCCEeEEEEEEECC
Confidence 34566666666555
No 59
>PF08695 Coa1: Cytochrome oxidase complex assembly protein 1; InterPro: IPR014807 Coa1 is an inner mitochondrial membrane protein that associates with Shy1 and is required for cytochrome oxidase complex IV assembly. It contains a conserved hydrophobic segment (amino acids 74-92) with the potential to form a membrane-spanning helix. The N terminus of Coa1 is rich in positively charged amino acids and could form an amphipathic alpha helix, characteristic of a mitochondrial presequence. A cleavage site for the mitochondrial processing peptidase is predicted adjacent to the presequence. Upon in vitro import into mitochondria, Coa1 is processed to a mature form, indicating that it possesses a cleavable presequence []. The eukaryotic cytochrome oxidase complex consists of 12-13 subunits, with three mitochondrial encoded subunits, Cox1-Cox3, forming the core enzyme. Translation of the Cox1 transcript requires the two promoters, Pet309 and Mss51, and the latter has an additional role in translational elongation. Coa1 is necessary for linking the activity of Mss51 to Cox1 insertion into the assembly complex [].
Probab=24.19 E-value=88 Score=18.78 Aligned_cols=15 Identities=47% Similarity=0.928 Sum_probs=8.5
Q ss_pred CCccceEEEEECCCC
Q psy5364 63 AGAGQLTVRVRGPKG 77 (78)
Q Consensus 63 aG~g~l~v~v~gP~g 77 (78)
.|.-+|++.|+||++
T Consensus 66 ~g~a~~~~pV~G~k~ 80 (116)
T PF08695_consen 66 KGRADLSFPVKGPKG 80 (116)
T ss_pred CcEEEEEEEEEcCCC
Confidence 445556666666654
No 60
>PRK15198 outer membrane usher protein FimD; Provisional
Probab=24.03 E-value=83 Score=25.91 Aligned_cols=24 Identities=0% Similarity=-0.216 Sum_probs=18.9
Q ss_pred ccceeeEEEEEECCEecCCCCeEE
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPFTV 26 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf~v 26 (78)
+.||.|.|+|.+||+.+.-.-..+
T Consensus 54 ~~pG~Y~vdi~vN~~~~~~~~i~f 77 (860)
T PRK15198 54 QPPGVYRVDIWRNDEFIGTQDIRF 77 (860)
T ss_pred cCCeEEEEEEEECCeEcccEEEEE
Confidence 469999999999999886543333
No 61
>PRK15207 long polar fimbrial outer membrane usher protein LpfC; Provisional
Probab=23.45 E-value=88 Score=25.71 Aligned_cols=24 Identities=4% Similarity=-0.056 Sum_probs=19.0
Q ss_pred ccceeeEEEEEECCEecCCCCeEEEe
Q psy5364 3 LRRRRINLRTEGEGVDVPGSPFTVRV 28 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gSPf~v~V 28 (78)
..||.|.|+|.+||+.+... .|.+
T Consensus 62 ~~pG~Y~vdi~vN~~~~~~~--~i~f 85 (842)
T PRK15207 62 QLPGKYRVSVFVNEEKMESR--TLEF 85 (842)
T ss_pred cCCeEEEEEEEECCeEcccE--EEEE
Confidence 46999999999999998543 4444
No 62
>PF14900 DUF4493: Domain of unknown function (DUF4493)
Probab=23.16 E-value=62 Score=21.96 Aligned_cols=24 Identities=8% Similarity=0.097 Sum_probs=16.8
Q ss_pred CccceeeEEEEEECCEecC--CCCeE
Q psy5364 2 SLRRRRINLRTEGEGVDVP--GSPFT 25 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~--gSPf~ 25 (78)
.+..|.|+|.+.+++.... ..||-
T Consensus 61 ~L~~G~Ytv~A~~g~~~~~~~d~pyy 86 (235)
T PF14900_consen 61 ELPVGSYTVKASYGDNVAAGFDKPYY 86 (235)
T ss_pred eecCCcEEEEEEcCCCccccccCcee
Confidence 5788999999997654443 25763
No 63
>PRK15248 fimbrial outer membrane usher protein StbC; Provisional
Probab=22.85 E-value=1.2e+02 Score=24.88 Aligned_cols=24 Identities=13% Similarity=-0.071 Sum_probs=19.6
Q ss_pred ccceeeEEEEEECCEecCCC-CeEE
Q psy5364 3 LRRRRINLRTEGEGVDVPGS-PFTV 26 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~gS-Pf~v 26 (78)
+.||.|.|+|.+||+.+... -.++
T Consensus 53 ~~pG~Y~vdi~vN~~~~~~~~~i~f 77 (853)
T PRK15248 53 TPSGDYLADVYVNGTLVTSSTNIRF 77 (853)
T ss_pred cCCeEEEEEEEECCEEcCceeEEEE
Confidence 46999999999999998764 5554
No 64
>KOG4680|consensus
Probab=21.98 E-value=79 Score=20.89 Aligned_cols=23 Identities=4% Similarity=0.167 Sum_probs=19.1
Q ss_pred cceeeEEEEEECCEecCCCCeEE
Q psy5364 4 RRRRINLRTEGEGVDVPGSPFTV 26 (78)
Q Consensus 4 ~~G~~~i~V~~~g~~V~gSPf~v 26 (78)
..|.|-|+|+|.+.+|..-=|.+
T Consensus 69 s~Gk~VIeV~y~gi~ihsethDL 91 (153)
T KOG4680|consen 69 SEGKYVIEVSYGGIRIHSETHDL 91 (153)
T ss_pred eCCeEEEEEEEeeEEEeeccccc
Confidence 57999999999999998766654
No 65
>PRK15284 putative fimbrial outer membrane usher protein StfC; Provisional
Probab=21.82 E-value=99 Score=25.59 Aligned_cols=18 Identities=11% Similarity=0.161 Sum_probs=16.3
Q ss_pred ccceeeEEEEEECCEecC
Q psy5364 3 LRRRRINLRTEGEGVDVP 20 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~ 20 (78)
..||.|.|+|.+|++.+.
T Consensus 59 ~~pG~Y~v~v~vN~~~~~ 76 (881)
T PRK15284 59 VEPGKYNLQVQLNKQPLA 76 (881)
T ss_pred cCCeEEEEEEEECCEEcc
Confidence 469999999999999986
No 66
>PF03422 CBM_6: Carbohydrate binding module (family 6); InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=21.77 E-value=67 Score=19.07 Aligned_cols=15 Identities=13% Similarity=0.062 Sum_probs=12.0
Q ss_pred CccceeeEEEEEECC
Q psy5364 2 SLRRRRINLRTEGEG 16 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g 16 (78)
.+..|.|+|.+.+.+
T Consensus 96 ~l~~G~h~i~l~~~~ 110 (125)
T PF03422_consen 96 KLPAGKHTIYLVFNG 110 (125)
T ss_dssp EEESEEEEEEEEESS
T ss_pred eeCCCeeEEEEEEEC
Confidence 356799999999876
No 67
>COG1470 Predicted membrane protein [Function unknown]
Probab=21.75 E-value=4.1e+02 Score=21.02 Aligned_cols=13 Identities=8% Similarity=0.074 Sum_probs=8.7
Q ss_pred cceeeEEEEEECC
Q psy5364 4 RRRRINLRTEGEG 16 (78)
Q Consensus 4 ~~G~~~i~V~~~g 16 (78)
+||.|.+.|.--+
T Consensus 348 ~pG~Ynv~I~A~s 360 (513)
T COG1470 348 TPGTYNVTITASS 360 (513)
T ss_pred CCCceeEEEEEec
Confidence 6777777776543
No 68
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=21.73 E-value=3.1e+02 Score=19.64 Aligned_cols=46 Identities=15% Similarity=0.359 Sum_probs=26.8
Q ss_pred CCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364 22 SPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG 77 (78)
Q Consensus 22 SPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g 77 (78)
|+-+|.|....+-+++.++|- + .+ +.. ...+.|.-++-|.|+||..
T Consensus 35 S~~~V~Its~F~G~elliFGA-----v-~~-~~~---~~~~~~~~dVVV~v~GP~~ 80 (261)
T TIGR02186 35 STEEIAITSNFRGADLTLFGA-----L-DN-DDP---TLLAQGAYDIVVTLEGPRD 80 (261)
T ss_pred cCceEEEecCccCcEEEEEEe-----e-cC-Ccc---CCCCCCCccEEEEEeCCCC
Confidence 566667766677777777771 1 11 110 0122346678888888864
No 69
>PRK09828 putative fimbrial outer membrane usher protein; Provisional
Probab=21.38 E-value=1e+02 Score=25.35 Aligned_cols=20 Identities=0% Similarity=-0.087 Sum_probs=16.9
Q ss_pred CccceeeEEEEEECCEecCC
Q psy5364 2 SLRRRRINLRTEGEGVDVPG 21 (78)
Q Consensus 2 ~~~~G~~~i~V~~~g~~V~g 21 (78)
+..||.|.++|..||+.+..
T Consensus 59 ~~~pG~y~v~v~vNg~~~~~ 78 (865)
T PRK09828 59 PTLPGTYDVSVYVNDQPISS 78 (865)
T ss_pred ccCCeEEEEEEEECCeEccc
Confidence 35799999999999998754
No 70
>PRK15304 putative fimbrial outer membrane usher protein; Provisional
Probab=21.35 E-value=1.2e+02 Score=24.75 Aligned_cols=19 Identities=5% Similarity=-0.062 Sum_probs=16.5
Q ss_pred ccceeeEEEEEECCEecCC
Q psy5364 3 LRRRRINLRTEGEGVDVPG 21 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~g 21 (78)
+.||.|.++|..||+.+.-
T Consensus 57 ~~pG~Y~v~v~vNg~~~~~ 75 (801)
T PRK15304 57 FLPGSHSVMVKINGKKRGS 75 (801)
T ss_pred cCCeEeEEEEEECCeEccc
Confidence 5799999999999998753
No 71
>PRK15294 putative fimbrial outer membrane usher protein SthC; Provisional
Probab=21.29 E-value=1e+02 Score=25.30 Aligned_cols=19 Identities=11% Similarity=-0.043 Sum_probs=16.2
Q ss_pred ccceeeEEEEEECCEecCC
Q psy5364 3 LRRRRINLRTEGEGVDVPG 21 (78)
Q Consensus 3 ~~~G~~~i~V~~~g~~V~g 21 (78)
+.||.|.|+|.+||+.+..
T Consensus 57 ~~pG~y~vdi~vNg~~~~~ 75 (845)
T PRK15294 57 QAEGTYNVDIYINNTSVET 75 (845)
T ss_pred CCCeEEEEEEEECCeEccc
Confidence 3599999999999998753
No 72
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=21.28 E-value=1e+02 Score=16.50 Aligned_cols=14 Identities=0% Similarity=-0.173 Sum_probs=10.6
Q ss_pred ccceeeEEEEEECC
Q psy5364 3 LRRRRINLRTEGEG 16 (78)
Q Consensus 3 ~~~G~~~i~V~~~g 16 (78)
..+|.|.|.|...+
T Consensus 49 ~~~G~y~V~ltv~n 62 (69)
T PF00801_consen 49 SSPGTYTVTLTVTN 62 (69)
T ss_dssp SSSEEEEEEEEEEE
T ss_pred CCCeEEEEEEEEEE
Confidence 46899999887643
No 73
>PRK15316 RatA-like protein; Provisional
Probab=21.17 E-value=2.5e+02 Score=26.37 Aligned_cols=62 Identities=23% Similarity=0.244 Sum_probs=37.3
Q ss_pred ECCEecCCCCeEEEecC----------CCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364 14 GEGVDVPGSPFTVRVSG----------APDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 14 ~~g~~V~gSPf~v~V~~----------~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
+||.+|++-=|+|+... ..||+...+.=.|..-+ ..|+..-.=+..+| .|.++|.|+.|.|+
T Consensus 753 ln~~pV~~~aftVt~~~g~nR~g~~tgf~dps~g~i~~nG~~y~--~~q~s~~Y~G~TDa-~G~atv~itqP~Gv 824 (2683)
T PRK15316 753 LNNAPVPYTAFTITKDMGKNRQGQTTGFDDPTRGAIEMNGTLYG--TSQPSLVYAGTTDA-QGFATVEIKQSQGV 824 (2683)
T ss_pred cCCCcccceeEEEEeccccccCCcccCCCCCCCceEEECCEecc--cCCCcceeeeeecC-CCeEEEEEeCCCCc
Confidence 46889999999887652 24555544433232222 12333333344454 78899999999985
No 74
>PF14734 DUF4469: Domain of unknown function (DUF4469) with IG-like fold
Probab=20.68 E-value=93 Score=18.98 Aligned_cols=12 Identities=17% Similarity=0.160 Sum_probs=10.1
Q ss_pred ccceeeEEEEEE
Q psy5364 3 LRRRRINLRTEG 14 (78)
Q Consensus 3 ~~~G~~~i~V~~ 14 (78)
+.+|.|+|.|+-
T Consensus 74 L~~G~Y~l~V~T 85 (102)
T PF14734_consen 74 LAAGEYTLEVRT 85 (102)
T ss_pred cCceEEEEEEEE
Confidence 689999999964
No 75
>COG4136 ABC-type uncharacterized transport system, ATPase component [General function prediction only]
Probab=20.39 E-value=68 Score=21.98 Aligned_cols=19 Identities=32% Similarity=0.623 Sum_probs=13.6
Q ss_pred eEEEEEccCCCccceEEEEECCCCC
Q psy5364 54 SRFICDTRGAGAGQLTVRVRGPKGF 78 (78)
Q Consensus 54 ~~F~Id~~~aG~g~l~v~v~gP~g~ 78 (78)
.+|+|+- |+ -|+++|||||
T Consensus 21 ~n~Tia~-----Ge-ivtlMGPSGc 39 (213)
T COG4136 21 VNFTIAK-----GE-IVTLMGPSGC 39 (213)
T ss_pred eeEEecC-----Cc-EEEEECCCCc
Confidence 4677753 34 3889999997
No 76
>TIGR03292 PhnH_redo phosphonate C-P lyase system protein PhnH. PhnH is a component of the C-P lyase system (GenProp0232) for the catabolism of phosphonate compounds. The specific function of this component is unknown. This model is based on Pfam model pfam05845.2, and has been broadened to include sequences missed by that model which are clearly true positive hits based on genome context.
Probab=20.33 E-value=1.1e+02 Score=20.47 Aligned_cols=15 Identities=27% Similarity=0.603 Sum_probs=11.7
Q ss_pred CCCCeEEEeCCCccc
Q psy5364 32 PDASKVRVYGPGIEH 46 (78)
Q Consensus 32 ~d~~kv~v~GpGL~~ 46 (78)
.+...++.+||||+.
T Consensus 114 ~~g~~l~L~GPGI~~ 128 (174)
T TIGR03292 114 ASGPPLTLTGPGIAT 128 (174)
T ss_pred CCCCceEEECCCcCC
Confidence 355679999999974
No 77
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=20.17 E-value=1.9e+02 Score=16.64 Aligned_cols=26 Identities=8% Similarity=0.029 Sum_probs=13.2
Q ss_pred cccee-eEEEEEECCEecCCCCeEEEecC
Q psy5364 3 LRRRR-INLRTEGEGVDVPGSPFTVRVSG 30 (78)
Q Consensus 3 ~~~G~-~~i~V~~~g~~V~gSPf~v~V~~ 30 (78)
+++|. |++++....... .++.+.+..
T Consensus 58 l~~G~~Y~~s~~vk~~~~--~~~~~~~~~ 84 (131)
T PF02018_consen 58 LKPGKTYTVSFWVKADSG--GTVSVSLRD 84 (131)
T ss_dssp E-TTSEEEEEEEEEESSS--EEEEEEEEE
T ss_pred ecCCCEEEEEEEEEeCCC--CEEEEEEEE
Confidence 34444 666666553222 556665554
Done!