Query psy189
Match_columns 171
No_of_seqs 163 out of 913
Neff 6.6
Searched_HMMs 46136
Date Fri Aug 16 18:51:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy189.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/189hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1692|consensus 99.9 3.3E-25 7.1E-30 175.5 9.7 101 69-169 18-118 (201)
2 KOG1693|consensus 99.9 3.7E-24 8E-29 170.4 11.8 97 69-165 21-118 (209)
3 KOG3829|consensus 99.9 1E-25 2.2E-30 195.1 1.8 113 3-120 179-302 (486)
4 KOG3287|consensus 99.8 8E-20 1.7E-24 147.3 11.8 94 68-161 32-126 (236)
5 PF01105 EMP24_GP25L: emp24/gp 99.7 5.7E-19 1.2E-23 136.6 0.0 96 71-166 1-101 (183)
6 KOG1691|consensus 99.7 1.5E-16 3.2E-21 128.1 12.1 107 63-169 14-126 (210)
7 KOG1690|consensus 99.6 1.2E-14 2.6E-19 116.1 12.0 102 63-164 12-128 (215)
8 PF15051 FAM198: FAM198 protei 95.7 0.0014 3E-08 56.2 -1.7 39 24-63 65-103 (326)
9 PF13897 GOLD_2: Golgi-dynamic 94.6 0.051 1.1E-06 41.5 3.9 30 132-161 104-133 (136)
10 PF04151 PPC: Bacterial pre-pe 94.3 0.21 4.6E-06 33.0 6.2 60 82-144 3-67 (70)
11 PF01835 A2M_N: MG2 domain; I 93.0 2.2 4.7E-05 29.7 10.9 62 90-151 13-89 (99)
12 smart00557 IG_FLMN Filamin-typ 92.9 0.76 1.7E-05 32.1 7.3 44 104-147 30-76 (93)
13 PF06702 DUF1193: Protein of u 91.2 0.14 2.9E-06 42.3 2.0 41 74-116 1-42 (221)
14 PF13860 FlgD_ig: FlgD Ig-like 89.0 4.4 9.6E-05 27.6 8.0 44 92-136 12-57 (81)
15 PF11589 DUF3244: Domain of un 88.5 3.9 8.5E-05 29.3 7.7 57 91-150 36-96 (106)
16 PF00630 Filamin: Filamin/ABP2 85.3 5.9 0.00013 27.3 7.0 40 107-146 43-89 (101)
17 PF05738 Cna_B: Cna protein B- 83.2 4.2 9.1E-05 26.3 5.2 44 108-151 4-49 (70)
18 PF09315 DUF1973: Domain of un 82.4 23 0.0005 28.1 10.2 75 81-161 18-98 (179)
19 PRK14081 triple tyrosine motif 82.1 16 0.00036 34.8 10.4 72 91-163 401-477 (667)
20 PF07495 Y_Y_Y: Y_Y_Y domain; 79.7 13 0.00028 23.6 7.0 54 107-162 9-66 (66)
21 PRK12812 flgD flagellar basal 78.4 37 0.0008 28.7 10.4 53 93-146 129-192 (259)
22 PF13620 CarboxypepD_reg: Carb 75.2 3.3 7.1E-05 27.5 2.8 42 107-148 16-57 (82)
23 PRK12634 flgD flagellar basal 74.6 27 0.00058 28.7 8.4 45 102-146 118-173 (221)
24 PRK12813 flgD flagellar basal 73.2 17 0.00036 30.1 6.9 56 91-148 110-174 (223)
25 PF13473 Cupredoxin_1: Cupredo 69.7 25 0.00054 24.7 6.4 56 88-147 39-95 (104)
26 PF08842 Mfa2: Fimbrillin-A as 67.9 8.6 0.00019 31.2 4.2 62 86-147 2-76 (283)
27 PRK06655 flgD flagellar basal 67.7 68 0.0015 26.4 10.4 54 93-147 114-178 (225)
28 PRK05842 flgD flagellar basal 67.3 22 0.00048 30.7 6.6 56 93-148 150-221 (295)
29 PF14524 Wzt_C: Wzt C-terminal 66.8 42 0.0009 24.0 7.3 41 80-122 25-68 (142)
30 PRK13211 N-acetylglucosamine-b 63.9 82 0.0018 29.0 10.0 70 93-162 328-405 (478)
31 KOG0518|consensus 63.1 21 0.00046 35.7 6.3 56 96-151 873-931 (1113)
32 PRK12633 flgD flagellar basal 55.3 1.2E+02 0.0026 25.0 9.2 53 94-147 118-181 (230)
33 PF10528 PA14_2: GLEYA domain; 49.8 53 0.0011 24.0 5.3 46 78-123 56-102 (113)
34 PF15417 DUF4624: Domain of un 49.8 1.1E+02 0.0023 23.0 7.1 77 81-164 39-123 (132)
35 PRK14081 triple tyrosine motif 49.2 61 0.0013 31.1 6.8 54 108-161 225-283 (667)
36 PF10670 DUF4198: Domain of un 48.6 1.1E+02 0.0024 23.6 7.4 50 96-145 153-209 (215)
37 PF13280 WYL: WYL domain 48.3 7 0.00015 28.9 0.4 28 23-50 41-68 (172)
38 KOG3878|consensus 41.0 97 0.0021 27.5 6.3 96 64-161 324-465 (469)
39 PF08329 ChitinaseA_N: Chitina 39.3 1.6E+02 0.0034 22.4 6.6 52 91-145 37-89 (133)
40 PF07210 DUF1416: Protein of u 35.8 1.6E+02 0.0034 20.8 6.1 42 107-149 23-64 (85)
41 cd02848 Chitinase_N_term Chiti 34.2 1.9E+02 0.0041 21.2 8.5 65 91-157 33-102 (106)
42 PF12690 BsuPI: Intracellular 33.7 1.2E+02 0.0027 20.7 4.9 20 105-124 23-42 (82)
43 PRK09619 flgD flagellar basal 30.0 3.1E+02 0.0066 22.4 10.3 56 92-149 110-173 (218)
44 TIGR03096 nitroso_cyanin nitro 29.7 1.5E+02 0.0033 22.6 5.1 20 126-145 99-119 (135)
45 PF10794 DUF2606: Protein of u 28.8 1.9E+02 0.0042 21.8 5.4 27 125-151 84-110 (131)
46 PF02927 CelD_N: N-terminal ig 28.4 2E+02 0.0043 19.8 5.3 41 108-148 36-87 (91)
47 KOG0518|consensus 27.8 1.6E+02 0.0034 29.9 6.0 52 95-146 379-434 (1113)
48 PF10648 Gmad2: Immunoglobulin 27.6 2.1E+02 0.0047 19.8 5.4 34 90-123 10-46 (88)
49 cd05860 Ig4_SCFR Fourth immuno 26.7 1.2E+02 0.0025 22.0 3.8 25 136-163 75-99 (101)
50 PRK02710 plastocyanin; Provisi 26.7 2.5E+02 0.0054 20.3 9.3 73 68-145 27-105 (119)
51 PF13715 DUF4480: Domain of un 26.5 1.8E+02 0.004 19.1 4.7 35 108-147 18-52 (88)
52 PF12904 Collagen_bind_2: Puta 25.7 1.1E+02 0.0023 21.8 3.4 54 83-138 26-80 (93)
53 PF13464 DUF4115: Domain of un 25.3 2.1E+02 0.0045 18.9 4.8 25 109-133 10-34 (77)
54 cd02858 Esterase_N_term Estera 25.3 2.2E+02 0.0048 19.2 6.8 49 96-145 8-58 (85)
55 PHA02665 hypothetical protein; 24.9 51 0.0011 27.5 1.9 31 75-105 24-54 (322)
56 PF07691 PA14: PA14 domain; I 22.8 2.9E+02 0.0063 19.7 7.0 51 108-161 72-132 (145)
57 PF07523 Big_3: Bacterial Ig-l 22.1 2.2E+02 0.0047 18.2 4.2 47 108-161 19-65 (67)
58 PRK15036 hydroxyisourate hydro 21.7 3E+02 0.0066 20.8 5.5 25 137-161 81-113 (137)
59 PF08896 DUF1842: Domain of un 21.0 3.5E+02 0.0076 19.9 8.5 56 93-148 3-65 (114)
60 PF02974 Inh: Protease inhibit 20.9 3.1E+02 0.0067 19.3 5.3 33 110-142 64-96 (99)
61 cd09011 Glo_EDI_BRP_like_23 Th 20.6 86 0.0019 21.7 2.2 15 108-122 102-116 (120)
No 1
>KOG1692|consensus
Probab=99.92 E-value=3.3e-25 Score=175.49 Aligned_cols=101 Identities=30% Similarity=0.541 Sum_probs=96.4
Q ss_pred ceEEEEEEcCCCcceEEEeccCCcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeecC
Q psy189 69 AVELTFELADNARECFFEDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 148 (171)
Q Consensus 69 ~~~ltf~l~~~~~~CFye~~~~~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N~ 148 (171)
+..+.+.+++.+++||||++..|++++++|+|..||..+||+.|++|+|++|++.++.++|+|+|+|+.+|.|++||+|.
T Consensus 18 ~~~~~is~~ah~eeCf~e~~~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~ssgk~tF~a~~~G~Y~fCF~N~ 97 (201)
T KOG1692|consen 18 AAGYGISLDAHEEECFFENLEEGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRESSGKYTFTAPKKGTYTFCFSNK 97 (201)
T ss_pred hhheeEEEccchhhhHhhhhccCCEEEEEEEEecCCccceeEEEECCCCchhhhcccccCceEEEEecCCceEEEEecCC
Confidence 66778888999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCcceEEEEEEEEEEEeec
Q psy189 149 FSTYSSALEVISTHAAVAVES 169 (171)
Q Consensus 149 ~s~~~~K~V~F~i~~~i~~~~ 169 (171)
.|+.++|+|.|+||+|.+.+.
T Consensus 98 ~s~mtpk~V~F~ihvg~~~~~ 118 (201)
T KOG1692|consen 98 MSTMTPKTVMFTIHVGHAPQR 118 (201)
T ss_pred CCCCCceEEEEEEEEeecccc
Confidence 999999999999999998753
No 2
>KOG1693|consensus
Probab=99.91 E-value=3.7e-24 Score=170.40 Aligned_cols=97 Identities=46% Similarity=0.839 Sum_probs=92.5
Q ss_pred ceEEEEEEcCCCcceEEEeccCCc-EEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeec
Q psy189 69 AVELTFELADNARECFFEDIKRNT-SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSN 147 (171)
Q Consensus 69 ~~~ltf~l~~~~~~CFye~~~~~~-~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N 147 (171)
+..+||+||++.++|||+++++++ ..++.|+|+.||+.|||+.|.+|+|++|++..+++.++|.|++...|+|+|||+|
T Consensus 21 a~elTfeLp~~aKqC~Y~d~~~~~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~~G~Y~fCFsN 100 (209)
T KOG1693|consen 21 ASELTFELPDNAKQCFYEDLKKDDDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRYDSFLFKAEGKGEYTFCFSN 100 (209)
T ss_pred cccEEEEcCCcchhheeeecccCCceEEEEEEEEeCCceeeEEEEECCCCCEEeeccccccccEEEEEecceEEEEEecC
Confidence 678999999999999999999865 4999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCcceEEEEEEEEEE
Q psy189 148 EFSTYSSALEVISTHAAV 165 (171)
Q Consensus 148 ~~s~~~~K~V~F~i~~~i 165 (171)
++|++++|+|.++++.+-
T Consensus 101 ~fstf~~Kiv~~~~q~~~ 118 (209)
T KOG1693|consen 101 EFSTFSHKIVYMDFQVGE 118 (209)
T ss_pred ccccccceEeeehhhhcc
Confidence 999999999999998763
No 3
>KOG3829|consensus
Probab=99.91 E-value=1e-25 Score=195.09 Aligned_cols=113 Identities=21% Similarity=0.275 Sum_probs=92.7
Q ss_pred ceEEecCceeeeec-----cCcccCCceeeEeeeccccceeEEeechhhhHhhhhhhHHHHHHHhhhhc----c-cceEE
Q psy189 3 WIGSLEGSAFAMFP-----RDQQTLPNHFYFTDYERHNSEIATFHLDRISMLLKKQWIGFFFLLILHQI----V-QAVEL 72 (171)
Q Consensus 3 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~h~~ei~~~~l~~~~~~~~~~~~~~~~~~~l~~~----~-~~~~l 72 (171)
.++.|++.|||||| ||++++|+ ||+||||||||||||||||||+|||.||++ +|++++... + ....-
T Consensus 179 lll~~~~~~KavfKPmR~~Rd~~~~~~--yfs~~dRHnAEiAAFHLDRiL~FrRappvV-GRvvNlttEI~~~a~~~Llq 255 (486)
T KOG3829|consen 179 LLLRLSHQQKVVFKPMRYPRDEVIDGM--YYSGFDRHNAEVAAFHLDRVLDFRRAPPVV-GRVVNLTTEIYEKAEEELLQ 255 (486)
T ss_pred EEEEecCCceeeeccccCCccccCCCc--ccccccccchhhhhhhhhhhhcccccCccc-ceeeeeehHHHHhhHHHHHh
Confidence 47899999999999 99999999 789999999999999999999999999884 988887542 2 22333
Q ss_pred EEEEcCCCcceEEEeccCCcEEEEEEEEEeCC-eeeEEEEEEcCCCcEE
Q psy189 73 TFELADNARECFFEDIKRNTSVILEFQVVTGG-KYDVDVSIEAPTKEII 120 (171)
Q Consensus 73 tf~l~~~~~~CFye~~~~~~~v~~~y~V~~gg-~~dI~v~V~dP~g~~v 120 (171)
||+++|+++.|||++|++ ++.-++.|++-+ ..+-++.+.-|+-+.+
T Consensus 256 tfFvsp~~N~CF~gKC~Y--yC~t~~avCg~pdmlEGS~~~fLP~~~~~ 302 (486)
T KOG3829|consen 256 TFFVSPAENYCFFGKCDY--YCDTEEAVCGDPDMLEGSLIAFLPDESTL 302 (486)
T ss_pred heeeccCcceEEeecccc--ccCCcccccCCcccccceEEEEcCCcccc
Confidence 899999999999999987 455568888533 3466788888875544
No 4
>KOG3287|consensus
Probab=99.82 E-value=8e-20 Score=147.33 Aligned_cols=94 Identities=27% Similarity=0.570 Sum_probs=90.3
Q ss_pred cceEEEEEEcCCCcceEEEeccCCcEEEEEEEEEeC-CeeeEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEee
Q psy189 68 QAVELTFELADNARECFFEDIKRNTSVILEFQVVTG-GKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFS 146 (171)
Q Consensus 68 ~~~~ltf~l~~~~~~CFye~~~~~~~v~~~y~V~~g-g~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~ 146 (171)
....+|++||+|+++|||..++.+.++.++|+|+.| |+.+|++++.+|.|.+|.+..++..|.+++...+.|.|++||+
T Consensus 32 ~d~dftv~ipAGk~eCf~Q~v~~~~tle~eyQVi~G~GDl~i~Ftl~~P~G~~lv~~q~k~dg~ht~e~~e~GdY~~CfD 111 (236)
T KOG3287|consen 32 DDYDFTVMIPAGKTECFYQPVPQGATLEVEYQVIDGAGDLDIDFTLLNPAGEVLVSDQRKVDGVHTVEVTETGDYQVCFD 111 (236)
T ss_pred cccceEEEecCCCceeeeeeccCCeEEEEEEEEEecCCccceeeEEeCCCccEEeecccccCceeEeeccCCcceEEEEc
Confidence 356799999999999999999999999999999999 7889999999999999999999999999999999999999999
Q ss_pred cCCCCCcceEEEEEE
Q psy189 147 NEFSTYSSALEVIST 161 (171)
Q Consensus 147 N~~s~~~~K~V~F~i 161 (171)
|++|+++.|+|.|++
T Consensus 112 NsFS~fs~K~Vffel 126 (236)
T KOG3287|consen 112 NSFSTFSRKLVFFEL 126 (236)
T ss_pred CccccccceEEEEEE
Confidence 999999999999997
No 5
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=99.72 E-value=5.7e-19 Score=136.58 Aligned_cols=96 Identities=31% Similarity=0.623 Sum_probs=0.0
Q ss_pred EEEEEEcCCCcceEEEeccCCcEEEEEEEEEeCC-eeeEEEEEE--cCCCcEEEEeeec-ceeeEEEEeccceeEEEEee
Q psy189 71 ELTFELADNARECFFEDIKRNTSVILEFQVVTGG-KYDVDVSIE--APTKEIIYRQVKS-QFDTHQFVTLVEGTYVVCFS 146 (171)
Q Consensus 71 ~ltf~l~~~~~~CFye~~~~~~~v~~~y~V~~gg-~~dI~v~V~--dP~g~~v~~~~~~-s~G~f~Fta~~~G~Y~~CF~ 146 (171)
|++|.|+||+++||++++++|+.+.++|.|.+++ ..+|++.|+ +|+|+.++++... ++|+|+|+++++|+|++||+
T Consensus 1 a~~f~l~~g~~~Cf~e~v~~~~~i~~~y~v~~~~~~~~v~~~i~~~~~~~~~i~~~~~~~~~~~f~f~~~~~G~y~iCf~ 80 (183)
T PF01105_consen 1 ALTFELEPGETECFYEEVPKGTTIRGSYRVTDGGGAYDVDFTIRDPDPNGEVIYSKSDKESEGSFSFTAKESGEYQICFD 80 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CEEEEECCCCcEEEEEEcCCCcEEEEEEEEeeccccceEEEEEEecccCCceeeeecccccCCcEEEEeccCCCEEEEEE
Confidence 5899999999999999999999999999999888 789999999 6666999987555 45799999999999999999
Q ss_pred cCCCCCcc-eEEEEEEEEEEE
Q psy189 147 NEFSTYSS-ALEVISTHAAVA 166 (171)
Q Consensus 147 N~~s~~~~-K~V~F~i~~~i~ 166 (171)
|..+..++ +.|.|+++++..
T Consensus 81 n~~~~~~~~~~v~~~~~~~~~ 101 (183)
T PF01105_consen 81 NSSSSFSPSKRVSFDIDVGNE 101 (183)
T ss_dssp ---------------------
T ss_pred cCCCCccccEEEEEEEEEeec
Confidence 99998777 999999998853
No 6
>KOG1691|consensus
Probab=99.71 E-value=1.5e-16 Score=128.11 Aligned_cols=107 Identities=15% Similarity=0.304 Sum_probs=92.3
Q ss_pred hhhcccceEEEEEEcCCCcceEEEeccCCcEEEEEEEEEeCCe---eeEEEEEEcCCCcEEEEeeecceeeEEEEeccce
Q psy189 63 LHQIVQAVELTFELADNARECFFEDIKRNTSVILEFQVVTGGK---YDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEG 139 (171)
Q Consensus 63 l~~~~~~~~ltf~l~~~~~~CFye~~~~~~~v~~~y~V~~gg~---~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G 139 (171)
++......++.|.++++.++|+.|++.+|..+.++|.+.+... +.+++.|+||.|+.+++.++..+|+|+|++.+.|
T Consensus 14 ~~~~~~~~a~~f~v~~~~~kCi~EeI~~n~lv~g~y~i~~~~~~~~~~~~~~Vts~~G~~~~~~env~~gqFaFta~e~~ 93 (210)
T KOG1691|consen 14 LLLLPLVHALRFDVPSKTTKCISEEIHENVLVVGDYEIINPNGDHSHKLSVKVTSPYGNNLHSKENVTKGQFAFTAEESG 93 (210)
T ss_pred HHHhhhhheEEEEecCCCCEeehhhhccCeEEEEEEEEecCCCCccceEEEEEEcCCCceeehhhccccceEEEEeccCC
Confidence 3444667899999999999999999999999999999985322 4699999999999999999999999999999999
Q ss_pred eEEEEeecCCCCC---cceEEEEEEEEEEEeec
Q psy189 140 TYVVCFSNEFSTY---SSALEVISTHAAVAVES 169 (171)
Q Consensus 140 ~Y~~CF~N~~s~~---~~K~V~F~i~~~i~~~~ 169 (171)
.|.+||.|.-... ....|.|||++|+...+
T Consensus 94 ~y~~Cf~~~~~~~~p~~~~~I~ld~k~Gv~akd 126 (210)
T KOG1691|consen 94 MYEACFTADVPGHKPETKRSIDLDWKTGVEAKD 126 (210)
T ss_pred cEEEEEecccCCCCCCcceEEEEEeeccccccc
Confidence 9999999944332 24789999999997643
No 7
>KOG1690|consensus
Probab=99.60 E-value=1.2e-14 Score=116.12 Aligned_cols=102 Identities=17% Similarity=0.317 Sum_probs=81.1
Q ss_pred hhhcccceEEEEEEcCCCcceEEEeccCCcEEEEEEEEE----------eCCeeeEEEEEEcCCCc--EEEEeeecceee
Q psy189 63 LHQIVQAVELTFELADNARECFFEDIKRNTSVILEFQVV----------TGGKYDVDVSIEAPTKE--IIYRQVKSQFDT 130 (171)
Q Consensus 63 l~~~~~~~~ltf~l~~~~~~CFye~~~~~~~v~~~y~V~----------~gg~~dI~v~V~dP~g~--~v~~~~~~s~G~ 130 (171)
+..+..+.++.|.+..++++||++++++|++++++|.++ +..+..+.+.|.+|..+ .|++++..++|+
T Consensus 12 ~~l~~~~~a~yFy~~~~e~KCF~eelpk~tmv~G~yk~qlyd~~~~~y~~~p~~gm~VeV~e~fdnnh~Vl~q~~ss~G~ 91 (215)
T KOG1690|consen 12 LLLATQVQALYFYIAGTEKKCFIEELPKGTMVTGNYKAQLYDDQLKGYGSYPNIGMHVEVKETFDNNHVVLSQQYSSEGD 91 (215)
T ss_pred HHHHhhccEEEEEecCCcccchhhhCCCCcEEEeeeeeeeecchhcccccCCCceEEEEeecCCCCceEEEeecCCCCCc
Confidence 344567889999999999999999999999999999987 11234577888888655 999999999999
Q ss_pred EEEEeccceeEEEEeecCCCC-Ccc--eEEEEEEEEE
Q psy189 131 HQFVTLVEGTYVVCFSNEFST-YSS--ALEVISTHAA 164 (171)
Q Consensus 131 f~Fta~~~G~Y~~CF~N~~s~-~~~--K~V~F~i~~~ 164 (171)
|+|++..+|+|+||+....+. ++. -.|.+++.+|
T Consensus 92 ftFta~~~GeH~IC~~s~s~awf~~aklRvhld~qvG 128 (215)
T KOG1690|consen 92 FTFTALTPGEHRICIQSNSTAWFNGAKLRVHLDIQVG 128 (215)
T ss_pred eEEEccCCCceEEEEecccchhhccceEEEEEEEeeC
Confidence 999999999999999766552 222 2355555444
No 8
>PF15051 FAM198: FAM198 protein
Probab=95.75 E-value=0.0014 Score=56.16 Aligned_cols=39 Identities=21% Similarity=0.059 Sum_probs=29.6
Q ss_pred ceeeEeeeccccceeEEeechhhhHhhhhhhHHHHHHHhh
Q psy189 24 NHFYFTDYERHNSEIATFHLDRISMLLKKQWIGFFFLLIL 63 (171)
Q Consensus 24 ~~~~~~~~~~h~~ei~~~~l~~~~~~~~~~~~~~~~~~~l 63 (171)
..-....-...++||.||||||||++.|.-|.+ .|.+..
T Consensus 65 ~~CGLiKrp~D~~EVfAFHLDRVLGLNRTLPaV-sRkf~~ 103 (326)
T PF15051_consen 65 GLCGLIKRPLDMSEVFAFHLDRVLGLNRTLPAV-SRKFEF 103 (326)
T ss_pred CceeeECCCCcHHHHHHHHHHHHhcccccchHh-Hhhhcc
Confidence 334556778899999999999999999866663 555544
No 9
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=94.56 E-value=0.051 Score=41.51 Aligned_cols=30 Identities=23% Similarity=0.284 Sum_probs=27.3
Q ss_pred EEEeccceeEEEEeecCCCCCcceEEEEEE
Q psy189 132 QFVTLVEGTYVVCFSNEFSTYSSALEVIST 161 (171)
Q Consensus 132 ~Fta~~~G~Y~~CF~N~~s~~~~K~V~F~i 161 (171)
+++.+.+|.|-++|+|.+|.+.+|+|.+.|
T Consensus 104 s~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V 133 (136)
T PF13897_consen 104 SHTCPGPGVYVLKFDNSYSWFRSKKLYYRV 133 (136)
T ss_pred EEECCCCeEEEEEeeCcceeEEeeEEEEEE
Confidence 567889999999999999999999999886
No 10
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=94.32 E-value=0.21 Score=33.02 Aligned_cols=60 Identities=12% Similarity=0.210 Sum_probs=40.9
Q ss_pred ceEEEeccCCcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeee-----cceeeEEEEeccceeEEEE
Q psy189 82 ECFFEDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVK-----SQFDTHQFVTLVEGTYVVC 144 (171)
Q Consensus 82 ~CFye~~~~~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~-----~s~G~f~Fta~~~G~Y~~C 144 (171)
..|.-++++|..+.++ +.+ ...+.++.+.+++|..+..... .......|+++.+|+|.+=
T Consensus 3 D~y~f~v~ag~~l~i~--l~~-~~~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~ 67 (70)
T PF04151_consen 3 DYYSFTVPAGGTLTID--LSG-GSGDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIR 67 (70)
T ss_dssp EEEEEEESTTEEEEEE--ECE-TTSSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEE
T ss_pred EEEEEEEcCCCEEEEE--EcC-CCCCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEE
Confidence 3566677777775544 333 3337789999999887776322 2345678899999999763
No 11
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=93.00 E-value=2.2 Score=29.74 Aligned_cols=62 Identities=16% Similarity=0.127 Sum_probs=40.5
Q ss_pred CCcEEEEEEEEEeCC-------eeeEEEEEEcCCCcEEEEeee---cceeeEEEE--ec---cceeEEEEeecCCCC
Q psy189 90 RNTSVILEFQVVTGG-------KYDVDVSIEAPTKEIIYRQVK---SQFDTHQFV--TL---VEGTYVVCFSNEFST 151 (171)
Q Consensus 90 ~~~~v~~~y~V~~gg-------~~dI~v~V~dP~g~~v~~~~~---~s~G~f~Ft--a~---~~G~Y~~CF~N~~s~ 151 (171)
.|+.|.+.--+.... ...+.+.|.||+|+.+.+... ...|.+.++ .+ ..|.|++=.....+.
T Consensus 13 PGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~~~~ 89 (99)
T PF01835_consen 13 PGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTDDDG 89 (99)
T ss_dssp TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEETTTT
T ss_pred CCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEccCC
Confidence 477777766554322 247999999999999988755 345655544 33 369999998875443
No 12
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=92.91 E-value=0.76 Score=32.13 Aligned_cols=44 Identities=16% Similarity=0.254 Sum_probs=32.2
Q ss_pred CeeeEEEEEEcCCCcEEEEe-eecceeeE--EEEeccceeEEEEeec
Q psy189 104 GKYDVDVSIEAPTKEIIYRQ-VKSQFDTH--QFVTLVEGTYVVCFSN 147 (171)
Q Consensus 104 g~~dI~v~V~dP~g~~v~~~-~~~s~G~f--~Fta~~~G~Y~~CF~N 147 (171)
|..++.+.|.+|+|+.+--+ .+...|.| +|+.+..|.|.+...-
T Consensus 30 G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~ 76 (93)
T smart00557 30 GGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKF 76 (93)
T ss_pred CCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEE
Confidence 44579999999999655443 34445544 7889999999988853
No 13
>PF06702 DUF1193: Protein of unknown function (DUF1193); InterPro: IPR009581 This family is baesd on the C terminus of several hypothetical eukaryotic proteins of unknown function. Proteins in this entry contain two conserved motifs: DRHHYE and QCC, as well as a number of conserved cysteine residues.
Probab=91.20 E-value=0.14 Score=42.29 Aligned_cols=41 Identities=15% Similarity=0.248 Sum_probs=32.4
Q ss_pred EEEcCCCcceEEEeccCCcEEEEEEEEEeC-CeeeEEEEEEcCC
Q psy189 74 FELADNARECFFEDIKRNTSVILEFQVVTG-GKYDVDVSIEAPT 116 (171)
Q Consensus 74 f~l~~~~~~CFye~~~~~~~v~~~y~V~~g-g~~dI~v~V~dP~ 116 (171)
|+++|+++.|||++|.+ ...-++.|.+. +..+.++.++-|+
T Consensus 1 ff~s~~~n~Cf~g~C~y--yC~~~~avCg~~d~lEgs~~~~LP~ 42 (221)
T PF06702_consen 1 FFISPANNTCFYGKCSY--YCKTEHAVCGNPDMLEGSLAAFLPD 42 (221)
T ss_pred CccCCCCCEEEeecCcc--ccCCcccccCCCCceeeeeeeecCC
Confidence 67899999999999976 56667888853 4457788888887
No 14
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=89.03 E-value=4.4 Score=27.59 Aligned_cols=44 Identities=16% Similarity=0.184 Sum_probs=28.7
Q ss_pred cEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEec
Q psy189 92 TSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTL 136 (171)
Q Consensus 92 ~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~ 136 (171)
....+.|.+... ...+.+.|+|.+|++|.+.. ..+.|.+.|+-.
T Consensus 12 ~~~~~~~~l~~~-a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~Wd 57 (81)
T PF13860_consen 12 TKGSIEYTLPED-ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWD 57 (81)
T ss_dssp CEEEEEEEECSS-CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-
T ss_pred EEEEEEEeCCCc-ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEEC
Confidence 467777766643 44689999999999998753 344565555543
No 15
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=88.48 E-value=3.9 Score=29.35 Aligned_cols=57 Identities=19% Similarity=0.292 Sum_probs=38.2
Q ss_pred CcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecc--eeeEEEEec--cceeEEEEeecCCC
Q psy189 91 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQ--FDTHQFVTL--VEGTYVVCFSNEFS 150 (171)
Q Consensus 91 ~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s--~G~f~Fta~--~~G~Y~~CF~N~~s 150 (171)
|..+.+.|. ....++.++|+|.+|++||++.-.. ....++... ..|+|.+=+.+...
T Consensus 36 ~~~l~I~F~---~~~~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g 96 (106)
T PF11589_consen 36 GNNLSIEFE---SPIGDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNG 96 (106)
T ss_dssp TTEEEEEES---S--SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC
T ss_pred CCEEEEEEc---CCCCCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCC
Confidence 456777773 2334689999999999999985433 334677765 69999999998776
No 16
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=85.27 E-value=5.9 Score=27.33 Aligned_cols=40 Identities=15% Similarity=0.292 Sum_probs=28.7
Q ss_pred eEEEEEEcCCCc----EEE-EeeecceeeE--EEEeccceeEEEEee
Q psy189 107 DVDVSIEAPTKE----IIY-RQVKSQFDTH--QFVTLVEGTYVVCFS 146 (171)
Q Consensus 107 dI~v~V~dP~g~----~v~-~~~~~s~G~f--~Fta~~~G~Y~~CF~ 146 (171)
++.+.|.+|++. .+. .-.....|.| +|+.+..|.|++...
T Consensus 43 ~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~G~y~i~V~ 89 (101)
T PF00630_consen 43 EFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEPGKYKISVK 89 (101)
T ss_dssp EEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSSEEEEEEEE
T ss_pred eeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCccEeEEEEEE
Confidence 578899999987 332 2234455555 788999999998874
No 17
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=83.20 E-value=4.2 Score=26.34 Aligned_cols=44 Identities=11% Similarity=0.098 Sum_probs=35.9
Q ss_pred EEEEEEcCCCcEEEE--eeecceeeEEEEeccceeEEEEeecCCCC
Q psy189 108 VDVSIEAPTKEIIYR--QVKSQFDTHQFVTLVEGTYVVCFSNEFST 151 (171)
Q Consensus 108 I~v~V~dP~g~~v~~--~~~~s~G~f~Fta~~~G~Y~~CF~N~~s~ 151 (171)
+.|.+++.++..+.. ....+.|.+.|.-...|.|.+=....-..
T Consensus 4 a~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y~l~E~~aP~G 49 (70)
T PF05738_consen 4 ATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTYTLKETKAPDG 49 (70)
T ss_dssp EEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEEEEEEEETTTT
T ss_pred eEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEEEEEEEECCCC
Confidence 678889999888876 56678999999999999999988774433
No 18
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=82.36 E-value=23 Score=28.12 Aligned_cols=75 Identities=16% Similarity=0.207 Sum_probs=45.8
Q ss_pred cceEEEecc--CCcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEE-eeecceeeEEEEe---ccceeEEEEeecCCCCCcc
Q psy189 81 RECFFEDIK--RNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYR-QVKSQFDTHQFVT---LVEGTYVVCFSNEFSTYSS 154 (171)
Q Consensus 81 ~~CFye~~~--~~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~-~~~~s~G~f~Fta---~~~G~Y~~CF~N~~s~~~~ 154 (171)
+.+|+-+-. .++.+.+.+.-. ....+.+++|+|+.+.. ..+.......+.. .+.|.+++.+.|..+. +
T Consensus 18 ~gtv~ID~tvG~~T~f~v~w~~~----~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~--~ 91 (179)
T PF09315_consen 18 TGTVYIDSTVGNNTVFTVTWQNS----SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS--S 91 (179)
T ss_pred EeEEEECCCCCCCeEEEEEECCC----CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCC--c
Confidence 355555443 344444444322 34678899999999866 2333444444444 4699999999887766 4
Q ss_pred eEEEEEE
Q psy189 155 ALEVIST 161 (171)
Q Consensus 155 K~V~F~i 161 (171)
+.+.+.+
T Consensus 92 q~v~vtV 98 (179)
T PF09315_consen 92 QTVTVTV 98 (179)
T ss_pred ceEEEEE
Confidence 5555554
No 19
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=82.13 E-value=16 Score=34.82 Aligned_cols=72 Identities=11% Similarity=0.147 Sum_probs=48.1
Q ss_pred CcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEE--EeecCCCC---CcceEEEEEEEE
Q psy189 91 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVV--CFSNEFST---YSSALEVISTHA 163 (171)
Q Consensus 91 ~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~--CF~N~~s~---~~~K~V~F~i~~ 163 (171)
|+.+.++-...+|.+.-..+.|+. +|+.+...+-..+..+.|+...+|.|++ ...|..|. -..+.|.|.++-
T Consensus 401 G~~i~i~v~a~gg~~~lY~f~ik~-ng~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e 477 (667)
T PRK14081 401 GEEIKIRVIAEGGTNLRYSFIIKK-DGKEEEKIDYGKNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHE 477 (667)
T ss_pred CCeEEEEEEecCCCeEEEEEEEEE-CCEEEEEeecccccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEec
Confidence 333333333333434445555555 7888877777889999999999999965 45666664 236888888753
No 20
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=79.70 E-value=13 Score=23.58 Aligned_cols=54 Identities=15% Similarity=0.114 Sum_probs=30.3
Q ss_pred eEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEee--cCCCC--CcceEEEEEEE
Q psy189 107 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFS--NEFST--YSSALEVISTH 162 (171)
Q Consensus 107 dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~--N~~s~--~~~K~V~F~i~ 162 (171)
...+.+.+.+++-+....... ..+|+...+|.|++-.. |..+. ...+.+.|.||
T Consensus 9 ~Y~Y~l~g~d~~W~~~~~~~~--~~~~~~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 9 RYRYRLEGFDDEWITLGSYSN--SISYTNLPPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp EEEEEEETTESSEEEESSTS---EEEEES--SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred EEEEEEECCCCeEEECCCCcE--EEEEEeCCCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 344556666666554422211 88999999999988874 33332 22367777764
No 21
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=78.40 E-value=37 Score=28.69 Aligned_cols=53 Identities=8% Similarity=0.042 Sum_probs=39.1
Q ss_pred EEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEecc---------ceeEEEEee
Q psy189 93 SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTLV---------EGTYVVCFS 146 (171)
Q Consensus 93 ~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~~---------~G~Y~~CF~ 146 (171)
.+.+.|.+.. ....+.+.|+|.+|++|+..+ ..+.|.+.|+-.. +|.|+|=..
T Consensus 129 ~~~~~~~l~~-~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~ 192 (259)
T PRK12812 129 LIALKLYFPE-DSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAV 192 (259)
T ss_pred eeEEEEecCC-cCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEE
Confidence 5566666543 344699999999999998753 4457877777654 799999986
No 22
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=75.17 E-value=3.3 Score=27.46 Aligned_cols=42 Identities=19% Similarity=0.283 Sum_probs=28.9
Q ss_pred eEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeecC
Q psy189 107 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNE 148 (171)
Q Consensus 107 dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N~ 148 (171)
+..+.+.+.++.......-.+.|.|.|..-.+|.|.+=++-.
T Consensus 16 ~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g~Y~l~v~~~ 57 (82)
T PF13620_consen 16 GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPGTYTLRVSAP 57 (82)
T ss_dssp T-EEEET--TTTECCEEE--TTSEEEEEEE-SEEEEEEEEBT
T ss_pred CEEEEEEEeeCCCEEEEEECCCceEEEEccCCEeEEEEEEEC
Confidence 477888888888777767789999999966669999888543
No 23
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=74.60 E-value=27 Score=28.72 Aligned_cols=45 Identities=16% Similarity=0.061 Sum_probs=35.0
Q ss_pred eCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEecc---------ceeEEEEee
Q psy189 102 TGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTLV---------EGTYVVCFS 146 (171)
Q Consensus 102 ~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~~---------~G~Y~~CF~ 146 (171)
......+.+.|+|.+|++|++.+ ....|.+.|.-.. +|.|.|-..
T Consensus 118 ~~~a~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~ 173 (221)
T PRK12634 118 ATSAGFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTAT 173 (221)
T ss_pred cccCCeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEE
Confidence 34456789999999999998753 4567888887643 699999995
No 24
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=73.24 E-value=17 Score=30.05 Aligned_cols=56 Identities=9% Similarity=0.098 Sum_probs=37.5
Q ss_pred CcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEec---------cceeEEEEeecC
Q psy189 91 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTL---------VEGTYVVCFSNE 148 (171)
Q Consensus 91 ~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~---------~~G~Y~~CF~N~ 148 (171)
+..+.+.|.+.+. ...+.+.|+|.+|++|++.+- +.|.+.|+-. .+|.|+|=..-.
T Consensus 110 g~~~~~~~~l~~~-a~~v~v~I~D~~G~vV~t~~~-~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~ 174 (223)
T PRK12813 110 GTPVTISPNPAAD-ADKAELVVRDAAGAEVARETV-PVGAGPVEWAGEDADGNPLPNGAYSFVVESY 174 (223)
T ss_pred CceeEEEEeccCC-CceEEEEEEcCCCCEEEEEee-CCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence 4456666666543 346899999999999987543 4454444432 369999888543
No 25
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=69.68 E-value=25 Score=24.70 Aligned_cols=56 Identities=21% Similarity=0.236 Sum_probs=27.0
Q ss_pred ccCCcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEE-Eeec
Q psy189 88 IKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVV-CFSN 147 (171)
Q Consensus 88 ~~~~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~-CF~N 147 (171)
+++|+.+++.+.=.+... -++.+.+ ...-..-...++..++|++.++|+|.+ |=-+
T Consensus 39 v~~G~~v~l~~~N~~~~~--h~~~i~~--~~~~~~l~~g~~~~~~f~~~~~G~y~~~C~~~ 95 (104)
T PF13473_consen 39 VKAGQPVTLTFTNNDSRP--HEFVIPD--LGISKVLPPGETATVTFTPLKPGEYEFYCTMH 95 (104)
T ss_dssp EETTCEEEEEEEE-SSS---EEEEEGG--GTEEEEE-TT-EEEEEEEE-S-EEEEEB-SSS
T ss_pred EcCCCeEEEEEEECCCCc--EEEEECC--CceEEEECCCCEEEEEEcCCCCEEEEEEcCCC
Confidence 456776666654333222 2344433 222122234567788999999999965 5533
No 26
>PF08842 Mfa2: Fimbrillin-A associated anchor proteins Mfa1 and Mfa2; InterPro: IPR014941 This family of proteins may be lipoproteins principally from bacilli. They are between 300 and 400 residues. Many Bacteroides-like bacterial species, including Porphyromonas gingivalis, the causal agent of periodontal infection, carry at least two types of fimbriae, namely FimA and Mfa1 fimbriae, following the names of their major subunit proteins []. Normally, FimA fimbriae are long filaments that are easily detached from cells, whereas Mfa1 fimbriae are short filaments that are tightly bound to cells; however, in the absence of Mfa2 protein, the Mfa1 fimbriae are also very long and are not attached. Mfa2 and Mfa1 are associated with each other in whole P. gingivalis cells to the extent that Mfa2 is located on the cell surface and probably associated with Mfa1 fimbriae in such a way that it anchors the Mfa1 fimbriae to the cell surface and regulates Mfa1 filament length [].; PDB: 3PAY_C 3GF8_A.
Probab=67.85 E-value=8.6 Score=31.16 Aligned_cols=62 Identities=13% Similarity=0.174 Sum_probs=30.1
Q ss_pred EeccCCcEEEEEEEEEeCC-------eeeEEEEEEcCCCcEEEEeeec---ce-eeEEE--EeccceeEEEEeec
Q psy189 86 EDIKRNTSVILEFQVVTGG-------KYDVDVSIEAPTKEIIYRQVKS---QF-DTHQF--VTLVEGTYVVCFSN 147 (171)
Q Consensus 86 e~~~~~~~v~~~y~V~~gg-------~~dI~v~V~dP~g~~v~~~~~~---s~-G~f~F--ta~~~G~Y~~CF~N 147 (171)
++|+.+..+.+.|.--.+. -.++++.|+|.+|+.+...... .. +.|+. .....|+|+++.--
T Consensus 2 ~dC~~~~~l~f~y~~~~~~~~~~~~~v~~v~lyvFd~~g~~v~~~~~~~~~~~~~~y~~~~~~l~~G~Y~~va~~ 76 (283)
T PF08842_consen 2 DDCPGGLTLKFSYDYNMGNADAFEDEVKRVDLYVFDEDGKLVKQRTIDSEELEGGGYTMFLLDLPPGTYTFVAWG 76 (283)
T ss_dssp ------EEEEEE---STT-S--HHHH--EEEEEEE-TTSBEEEEEEEECGGCCTTTEEE-CCT--SEEEEEEEEE
T ss_pred CCCCceEEEEEEEeCCccccccccceEeEEEEEEEeCCCeEEEEEEcccccccCCceEEeeccCCCCcEEEEEEE
Confidence 4677666666665332111 2369999999999966554321 12 44554 34468999888753
No 27
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=67.75 E-value=68 Score=26.37 Aligned_cols=54 Identities=13% Similarity=0.169 Sum_probs=38.3
Q ss_pred EEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEec---------cceeEEEEeec
Q psy189 93 SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTL---------VEGTYVVCFSN 147 (171)
Q Consensus 93 ~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~---------~~G~Y~~CF~N 147 (171)
.+.+.|.... ....+.+.|+|.+|++|++.. ..+.|.+.|+-. .+|.|.+=..-
T Consensus 114 ~~~~~~~l~~-~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A 178 (225)
T PRK06655 114 TTPFGVELPS-AADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASA 178 (225)
T ss_pred ceEEEEEcCC-CCcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEE
Confidence 4555665543 344689999999999998653 356888888432 38999998853
No 28
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=67.34 E-value=22 Score=30.65 Aligned_cols=56 Identities=13% Similarity=0.029 Sum_probs=38.7
Q ss_pred EEEEEEEEEe---CCeeeEEEEEEcCCCcEEEEeeec----ceeeEEEEec---------cceeEEEEeecC
Q psy189 93 SVILEFQVVT---GGKYDVDVSIEAPTKEIIYRQVKS----QFDTHQFVTL---------VEGTYVVCFSNE 148 (171)
Q Consensus 93 ~v~~~y~V~~---gg~~dI~v~V~dP~g~~v~~~~~~----s~G~f~Fta~---------~~G~Y~~CF~N~ 148 (171)
.+.+.|.+.. .....+.+.|+|.+|++|.+-.-. ..|.++|+-. .+|.|+|=....
T Consensus 150 ~~~~~~~l~~~~~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a~ 221 (295)
T PRK05842 150 KLSFSLFFDEKIDASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEYN 221 (295)
T ss_pred ceEEEEeccccccccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEEE
Confidence 4566665542 234579999999999999875422 3588888732 379999988643
No 29
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=66.79 E-value=42 Score=24.02 Aligned_cols=41 Identities=2% Similarity=0.045 Sum_probs=23.4
Q ss_pred CcceEEEeccCCcEEEEEEEEEeCC---eeeEEEEEEcCCCcEEEE
Q psy189 80 ARECFFEDIKRNTSVILEFQVVTGG---KYDVDVSIEAPTKEIIYR 122 (171)
Q Consensus 80 ~~~CFye~~~~~~~v~~~y~V~~gg---~~dI~v~V~dP~g~~v~~ 122 (171)
++.|=. +..|+.+.+.+.+.... ...+.+.|++.+|..|+.
T Consensus 25 g~~~~~--~~~ge~~~i~i~~~~~~~i~~~~~~~~i~~~~g~~v~~ 68 (142)
T PF14524_consen 25 GEPTSS--FESGEPIRIRIDYEVNEDIDDPVFGFAIRDSDGQRVFG 68 (142)
T ss_dssp EES-SS--EETTSEEEEEEEEEESS-EEEEEEEEEEEETT--EEEE
T ss_pred CCEeeE--EeCCCEEEEEEEEEECCCCCccEEEEEEEcCCCCEEEE
Confidence 344433 33466666555555322 345889999999988886
No 30
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=63.86 E-value=82 Score=28.99 Aligned_cols=70 Identities=10% Similarity=0.098 Sum_probs=45.0
Q ss_pred EEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee-ecceeeEEEEec----cceeEEEEeecCCC--C-CcceEEEEEEE
Q psy189 93 SVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV-KSQFDTHQFVTL----VEGTYVVCFSNEFS--T-YSSALEVISTH 162 (171)
Q Consensus 93 ~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~-~~s~G~f~Fta~----~~G~Y~~CF~N~~s--~-~~~K~V~F~i~ 162 (171)
.++++|.+..-++..|.++|+|.+|+.+.... ....+...++.. +.|.|++=..-..+ . ....++.|.++
T Consensus 328 ~~~i~ftv~a~g~~~vta~V~d~~g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~ 405 (478)
T PRK13211 328 AATLDFTVTATGDMNVEATVYNHDGEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLE 405 (478)
T ss_pred cEEEEEEEEeccceEEEEEEEcCCCCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEE
Confidence 35566666666777899999999999987753 223333444433 69999998864333 2 22456666653
No 31
>KOG0518|consensus
Probab=63.11 E-value=21 Score=35.71 Aligned_cols=56 Identities=9% Similarity=0.180 Sum_probs=39.6
Q ss_pred EEEEEEeCCeeeEEEEEEcCCCcEEEEe---eecceeeEEEEeccceeEEEEeecCCCC
Q psy189 96 LEFQVVTGGKYDVDVSIEAPTKEIIYRQ---VKSQFDTHQFVTLVEGTYVVCFSNEFST 151 (171)
Q Consensus 96 ~~y~V~~gg~~dI~v~V~dP~g~~v~~~---~~~s~G~f~Fta~~~G~Y~~CF~N~~s~ 151 (171)
+.-.....+.+|+.+.++||.|...-.. -....=+..|+.++.|.|++|..++.-.
T Consensus 873 l~lk~~~~~~~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~ 931 (1113)
T KOG0518|consen 873 LDLKATEASSQDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQH 931 (1113)
T ss_pred eeeecCCCCccceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecCcc
Confidence 3333344567789999999998743222 2334456789999999999999887654
No 32
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=55.30 E-value=1.2e+02 Score=25.01 Aligned_cols=53 Identities=15% Similarity=0.251 Sum_probs=37.5
Q ss_pred EEEEEEEEeCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEec---------cceeEEEEeec
Q psy189 94 VILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTL---------VEGTYVVCFSN 147 (171)
Q Consensus 94 v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~---------~~G~Y~~CF~N 147 (171)
..+.|.+. .....+.+.|+|.+|++|+..+ ..+.|.+.|+-. .+|.|+|=..-
T Consensus 118 ~~~~~~l~-~~a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a 181 (230)
T PRK12633 118 TPFGIDLQ-GDATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSA 181 (230)
T ss_pred eeEEEecC-CcCcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 34445443 3444699999999999998753 456787887743 36899998863
No 33
>PF10528 PA14_2: GLEYA domain; InterPro: IPR018871 This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=49.83 E-value=53 Score=24.03 Aligned_cols=46 Identities=17% Similarity=0.161 Sum_probs=24.9
Q ss_pred CCCcceEEEeccCCcEEEEEEE-EEeCCeeeEEEEEEcCCCcEEEEe
Q psy189 78 DNARECFFEDIKRNTSVILEFQ-VVTGGKYDVDVSIEAPTKEIIYRQ 123 (171)
Q Consensus 78 ~~~~~CFye~~~~~~~v~~~y~-V~~gg~~dI~v~V~dP~g~~v~~~ 123 (171)
.+...++-..+.+|...=+..- +..++....++.|++|+|+.+.+.
T Consensus 56 ~~~~~~~tv~L~aG~yyPiRi~~~N~~g~~~~~~~i~~P~G~~~~~~ 102 (113)
T PF10528_consen 56 TGASKSVTVYLTAGTYYPIRIVYANGGGPGSFDFSITDPDGTVHTDD 102 (113)
T ss_dssp S-SEEEEEEEE-TT-BEEEEEEEEE-SS-EEEEEEEEETT-S--B--
T ss_pred CCCceEEEEEEECCcEEEEEEEEEcCCCceEEEEEEECCCCcEEecC
Confidence 4455677777777775433332 334566678999999999998764
No 34
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=49.83 E-value=1.1e+02 Score=22.96 Aligned_cols=77 Identities=26% Similarity=0.361 Sum_probs=45.2
Q ss_pred cceEEEeccC-CcEEEEEEEEEeCCeeeEEEEEEcCC-CcEEEEee---ecceeeEEEEe---ccceeEEEEeecCCCCC
Q psy189 81 RECFFEDIKR-NTSVILEFQVVTGGKYDVDVSIEAPT-KEIIYRQV---KSQFDTHQFVT---LVEGTYVVCFSNEFSTY 152 (171)
Q Consensus 81 ~~CFye~~~~-~~~v~~~y~V~~gg~~dI~v~V~dP~-g~~v~~~~---~~s~G~f~Fta---~~~G~Y~~CF~N~~s~~ 152 (171)
--|.-+++.. +. .++|++ .|... -|.|+|-+ .+++++.. ..+...|+... +..-+|-+||.-..-.
T Consensus 39 LFcVs~Die~L~a--Ev~f~m-DGe~~--iVEiKd~~~devLWsn~~~~~V~~dt~tisL~nlqk~kEY~V~ftGtkIn- 112 (132)
T PF15417_consen 39 LFCVSEDIEALDA--EVYFQM-DGESG--IVEIKDRKTDEVLWSNTWNGKVSGDTFTISLNNLQKEKEYVVCFTGTKIN- 112 (132)
T ss_pred EEEEecchheeee--EEEEEE-cCccc--eEEeccCCccceeeccccccccccceEEEEhhhcccCceEEEEEeccEee-
Confidence 3577777764 33 233443 34432 36777764 56777763 45566677664 4577999999654333
Q ss_pred cceEEEEEEEEE
Q psy189 153 SSALEVISTHAA 164 (171)
Q Consensus 153 ~~K~V~F~i~~~ 164 (171)
+-.|.+.+...
T Consensus 113 -hAvv~vtFeS~ 123 (132)
T PF15417_consen 113 -HAVVKVTFESE 123 (132)
T ss_pred -eEEEEEEecch
Confidence 45565555443
No 35
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=49.17 E-value=61 Score=31.10 Aligned_cols=54 Identities=15% Similarity=0.162 Sum_probs=37.6
Q ss_pred EEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEE-EeecC-CCC---CcceEEEEEE
Q psy189 108 VDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVV-CFSNE-FST---YSSALEVIST 161 (171)
Q Consensus 108 I~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~-CF~N~-~s~---~~~K~V~F~i 161 (171)
-.+.+.+++|.....++-.....|++++..+|.|++ |.--. .|. -..+.+.|.|
T Consensus 225 YKF~~i~~~G~~~~~qdYst~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~V 283 (667)
T PRK14081 225 YKFVKIDSDGKQTCIQDYSTKNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRAVLVYTV 283 (667)
T ss_pred EEEEEECCCCCEEEecCccccceEEEEeCCCccEEEEEEEeccCcccccccceEEEEEE
Confidence 345566788888888788899999999999999976 44222 221 1245666665
No 36
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=48.61 E-value=1.1e+02 Score=23.63 Aligned_cols=50 Identities=20% Similarity=0.188 Sum_probs=30.5
Q ss_pred EEEEEEeCCe--eeEEEEEEcCCCcE-----EEEeeecceeeEEEEeccceeEEEEe
Q psy189 96 LEFQVVTGGK--YDVDVSIEAPTKEI-----IYRQVKSQFDTHQFVTLVEGTYVVCF 145 (171)
Q Consensus 96 ~~y~V~~gg~--~dI~v~V~dP~g~~-----v~~~~~~s~G~f~Fta~~~G~Y~~CF 145 (171)
+.++|.--|. -+..|.+..+++.. -..-....+|.++|+.+.+|.|.+=.
T Consensus 153 ~~~~vl~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~G~wli~a 209 (215)
T PF10670_consen 153 LPFQVLFDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRPGLWLIRA 209 (215)
T ss_pred EEEEEEECCeEcccEEEEEEECCCccccccceEEEEECCCCEEEEecCCCEEEEEEE
Confidence 4455553332 24666766665441 11123347999999999999997644
No 37
>PF13280 WYL: WYL domain
Probab=48.34 E-value=7 Score=28.95 Aligned_cols=28 Identities=36% Similarity=0.698 Sum_probs=24.2
Q ss_pred CceeeEeeeccccceeEEeechhhhHhh
Q psy189 23 PNHFYFTDYERHNSEIATFHLDRISMLL 50 (171)
Q Consensus 23 ~~~~~~~~~~~h~~ei~~~~l~~~~~~~ 50 (171)
.+.+|..++++.+.++..|.||||.+.+
T Consensus 41 ~~~~Yl~~~~~~~~~~r~frldrI~~i~ 68 (172)
T PF13280_consen 41 NGRWYLIAYDREREEFRTFRLDRIRSIE 68 (172)
T ss_pred CCEEEEEEEeccccccccccccceEEEE
Confidence 3567899999999999999999998655
No 38
>KOG3878|consensus
Probab=40.97 E-value=97 Score=27.54 Aligned_cols=96 Identities=16% Similarity=0.273 Sum_probs=61.2
Q ss_pred hhcccceEEEEEEcCCCc-ceEEEeccC---CcEEEEEEEEEeCCeeeEEEEEEcCCCcE--------------------
Q psy189 64 HQIVQAVELTFELADNAR-ECFFEDIKR---NTSVILEFQVVTGGKYDVDVSIEAPTKEI-------------------- 119 (171)
Q Consensus 64 ~~~~~~~~ltf~l~~~~~-~CFye~~~~---~~~v~~~y~V~~gg~~dI~v~V~dP~g~~-------------------- 119 (171)
+.+.+....|+.+|.+.+ .|.|-+... +..+-+.|+-.......|.|.|.+.+...
T Consensus 324 ItvGhGetVTVRVPThenGsclFWEFATD~YDIGFGvYFEWt~~~~n~VsVHVSeSddded~~~~~e~E~~e~G~~~~E~ 403 (469)
T KOG3878|consen 324 ITVGHGETVTVRVPTHENGSCLFWEFATDSYDIGFGVYFEWTKPVTNEVSVHVSESDDDEDCVYLSETEDLESGSLSQER 403 (469)
T ss_pred EEecCCceEEEeccccCCCceEEEEeccccccccceEEEEeecCCCceeEEEecccccchhhhhhhhhhhhhcCCchhhh
Confidence 345667888999988765 787766543 44566667666666667777776653221
Q ss_pred -------------------EEEee---ecceeeEEEEeccceeEEEEeecCCCCCcceEEEEEE
Q psy189 120 -------------------IYRQV---KSQFDTHQFVTLVEGTYVVCFSNEFSTYSSALEVIST 161 (171)
Q Consensus 120 -------------------v~~~~---~~s~G~f~Fta~~~G~Y~~CF~N~~s~~~~K~V~F~i 161 (171)
||.++ ..=.|++ .-|..|.|-+=|+|.+|--.+|+++..+
T Consensus 404 gA~~n~~~anKp~~deIvPvYRRdCheEVYaGSH--~YPGrGvYLLKFDNSYSlWRsKtlYYRV 465 (469)
T KOG3878|consen 404 GAVNNPTAANKPPIDEIVPVYRRDCHEEVYAGSH--SYPGRGVYLLKFDNSYSLWRSKTLYYRV 465 (469)
T ss_pred hhhcCCCCCCCCCcccccchhhhhhhHHhhcccc--cCCCCceEEEEecchhhhhcccceEEEE
Confidence 11110 0112332 2357899999999999866689988764
No 39
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=39.33 E-value=1.6e+02 Score=22.44 Aligned_cols=52 Identities=15% Similarity=0.262 Sum_probs=30.6
Q ss_pred CcEEEEEEEEEeCCeee-EEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEe
Q psy189 91 NTSVILEFQVVTGGKYD-VDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCF 145 (171)
Q Consensus 91 ~~~v~~~y~V~~gg~~d-I~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF 145 (171)
...|.+++.+-.|...+ -.+.. ||++|.+....+.++-+|+....|.|++=.
T Consensus 37 ~v~V~VtwN~WsG~~Gd~~kly~---dG~~V~tG~~~~~~~a~~~~~~gG~y~~~V 89 (133)
T PF08329_consen 37 QVDVSVTWNVWSGTNGDTAKLYF---DGVLVWTGPSPQQKSATFTVTKGGRYQMQV 89 (133)
T ss_dssp SEEEEEEEE-SSS---SEEEEEE---TTEEEEEEE--SEEEEEEEE-S-EEEEEEE
T ss_pred ceEEEEEEEEecCCCCCEEEEEE---CCEEEEeCCCccCceEEEEecCCCEEEEEE
Confidence 45677888887775433 33343 788888765445777799999999986533
No 40
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=35.80 E-value=1.6e+02 Score=20.80 Aligned_cols=42 Identities=17% Similarity=0.143 Sum_probs=30.9
Q ss_pred eEEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeecCC
Q psy189 107 DVDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNEF 149 (171)
Q Consensus 107 dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N~~ 149 (171)
.--+.+.|+.|+-.-.-....+|+|.|-+. +|.+++=--...
T Consensus 23 gAyVRLLD~sgEFtaEvvts~~G~FRFfaa-pG~WtvRal~~~ 64 (85)
T PF07210_consen 23 GAYVRLLDSSGEFTAEVVTSATGDFRFFAA-PGSWTVRALSRG 64 (85)
T ss_pred CeEEEEEcCCCCeEEEEEecCCccEEEEeC-CCceEEEEEccC
Confidence 456889999998755545678999999986 788877654433
No 41
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=34.22 E-value=1.9e+02 Score=21.22 Aligned_cols=65 Identities=15% Similarity=0.179 Sum_probs=40.2
Q ss_pred CcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee-ecceeeEEEEeccceeEE----EEeecCCCCCcceEE
Q psy189 91 NTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV-KSQFDTHQFVTLVEGTYV----VCFSNEFSTYSSALE 157 (171)
Q Consensus 91 ~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~-~~s~G~f~Fta~~~G~Y~----~CF~N~~s~~~~K~V 157 (171)
...|.+++.+-+|...+..-... ||+++.+.. ..++++-+|...+.|.|+ +|=.+..|.-.++.|
T Consensus 33 ~v~V~V~wnvWsG~~Gd~a~vl~--dg~~V~~G~~~~~~~~at~~v~kgG~y~m~V~lCn~dGCS~S~~~~I 102 (106)
T cd02848 33 AADVSVKWNAWSGDPGDTYKVLL--DGKEVWSGALTGSSGTATFKVGKGGRYQMQVALCNGDGCSTSAAKEI 102 (106)
T ss_pred ceEEEEEEeeecCCCCcEEEEEE--CCeEEEcccCCCCccEEEEEeCCCCeEEEEEEEECCCCccCcCCEEE
Confidence 45688888998876654322222 566776654 334567888899999985 444445554334544
No 42
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=33.69 E-value=1.2e+02 Score=20.69 Aligned_cols=20 Identities=10% Similarity=0.144 Sum_probs=12.3
Q ss_pred eeeEEEEEEcCCCcEEEEee
Q psy189 105 KYDVDVSIEAPTKEIIYRQV 124 (171)
Q Consensus 105 ~~dI~v~V~dP~g~~v~~~~ 124 (171)
.+..|+.|+|++|+.|+...
T Consensus 23 gq~~D~~v~d~~g~~vwrwS 42 (82)
T PF12690_consen 23 GQRYDFVVKDKEGKEVWRWS 42 (82)
T ss_dssp S--EEEEEE-TT--EEEETT
T ss_pred CCEEEEEEECCCCCEEEEec
Confidence 34578899999999999863
No 43
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=30.04 E-value=3.1e+02 Score=22.41 Aligned_cols=56 Identities=11% Similarity=0.021 Sum_probs=38.6
Q ss_pred cEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEee--ecceeeEEEEec------cceeEEEEeecCC
Q psy189 92 TSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTL------VEGTYVVCFSNEF 149 (171)
Q Consensus 92 ~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~------~~G~Y~~CF~N~~ 149 (171)
....+.|.+.. ....+.+.|+|.+|++ +... ..+.|.+.|+-. .+|.|++=.....
T Consensus 110 ~~~~~~~~L~~-~a~~v~v~I~D~~G~v-~t~~l~~~~aG~~~f~WDG~~~~lp~G~Y~~~V~a~~ 173 (218)
T PRK09619 110 DPVAGRLTLKH-PAPTLTLHITDILGQE-KKIDLGKQPAGPVNFTLDPAALGLQPGQYQLSVVSGS 173 (218)
T ss_pred CeeEEEEecCC-cCcEEEEEEEeCCCCE-EEEecCCcCCCceeEEECCCCCCCCCceeEEEEEEeC
Confidence 34566666543 4446999999999997 4432 346788888853 3799999987543
No 44
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=29.74 E-value=1.5e+02 Score=22.58 Aligned_cols=20 Identities=20% Similarity=0.330 Sum_probs=15.2
Q ss_pred cceeeEEEEeccceeEE-EEe
Q psy189 126 SQFDTHQFVTLVEGTYV-VCF 145 (171)
Q Consensus 126 ~s~G~f~Fta~~~G~Y~-~CF 145 (171)
.+...++|++.++|+|. .|=
T Consensus 99 Get~TitF~adKpG~Y~y~C~ 119 (135)
T TIGR03096 99 GETKTISFKADKAGAFTIWCQ 119 (135)
T ss_pred CCeEEEEEECCCCEEEEEeCC
Confidence 45567899999999995 454
No 45
>PF10794 DUF2606: Protein of unknown function (DUF2606); InterPro: IPR019730 This entry represents bacterial proteins with unknown function.
Probab=28.81 E-value=1.9e+02 Score=21.83 Aligned_cols=27 Identities=15% Similarity=0.186 Sum_probs=20.8
Q ss_pred ecceeeEEEEeccceeEEEEeecCCCC
Q psy189 125 KSQFDTHQFVTLVEGTYVVCFSNEFST 151 (171)
Q Consensus 125 ~~s~G~f~Fta~~~G~Y~~CF~N~~s~ 151 (171)
...+|+........|.|.+-|.|....
T Consensus 84 TD~~Gki~Wk~~~kG~Y~v~l~n~e~~ 110 (131)
T PF10794_consen 84 TDEEGKIIWKNGRKGKYIVFLPNGETQ 110 (131)
T ss_pred cCCCCcEEEecCCcceEEEEEcCCCce
Confidence 346888888888888888888876544
No 46
>PF02927 CelD_N: N-terminal ig-like domain of cellulase; InterPro: IPR004197 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Cellulases (Endoglucanases) 3.2.1.4 from EC catalyse the endohydrolysis of 1,4-beta-D-glucosidic linkages in cellulose. This is the N-terminal ig-like domain of cellulase, enzymes containing this domain belong to family 9 of the glycoside hydrolases (GH9 from CAZY).; GO: 0008810 cellulase activity, 0005975 carbohydrate metabolic process; PDB: 1CLC_A 1WMX_B 2C24_B 1RQ5_A 3K4Z_A 3H7L_B 3RX5_A 3RX8_A 3H2W_A 3RX7_A ....
Probab=28.38 E-value=2e+02 Score=19.76 Aligned_cols=41 Identities=10% Similarity=0.109 Sum_probs=20.2
Q ss_pred EEEEEEcCCCcEEEEeee------cceeeEEEEe-----ccceeEEEEeecC
Q psy189 108 VDVSIEAPTKEIIYRQVK------SQFDTHQFVT-----LVEGTYVVCFSNE 148 (171)
Q Consensus 108 I~v~V~dP~g~~v~~~~~------~s~G~f~Fta-----~~~G~Y~~CF~N~ 148 (171)
..+.|.|.+++.+++..- ..+|...+.+ .++|+|.+...+.
T Consensus 36 ~~f~l~d~~~~~V~~g~~~~~~~~~~s~~~~~~~DFS~~~~~G~Y~i~~~~~ 87 (91)
T PF02927_consen 36 STFELVDASGGKVYTGKLSPAGVDPWSGEYVYRIDFSDLTTPGTYYIRVGGA 87 (91)
T ss_dssp -EEEEEETTSBEEEEEEEEEEEECTTTTEEEEEEE-TT--S-EEEEEEETTE
T ss_pred eEEEEEcCCCCEEEEEEeeCccccCCCCCeEEEEEcCCcCCCEEEEEEECCc
Confidence 456666666655554321 1233333332 3688888887543
No 47
>KOG0518|consensus
Probab=27.84 E-value=1.6e+02 Score=29.89 Aligned_cols=52 Identities=13% Similarity=0.137 Sum_probs=33.5
Q ss_pred EEEEEEEeCCeeeEEEEEEcCCCcEEEEe-eecce---eeEEEEeccceeEEEEee
Q psy189 95 ILEFQVVTGGKYDVDVSIEAPTKEIIYRQ-VKSQF---DTHQFVTLVEGTYVVCFS 146 (171)
Q Consensus 95 ~~~y~V~~gg~~dI~v~V~dP~g~~v~~~-~~~s~---G~f~Fta~~~G~Y~~CF~ 146 (171)
.+.|.+.+.+...+++.|.||+|+....- .+..+ -.++|.....|.|.+=+.
T Consensus 379 d~~fD~~Dageg~levqV~gp~Gk~~~~~V~d~~~~~~h~vsY~pd~~G~y~i~v~ 434 (1113)
T KOG0518|consen 379 DFTFDEGDAGEGLLEVQVVGPEGKEKEVVVRDNGRGGIHIVTYVPDCPGRYLIVVF 434 (1113)
T ss_pred eeeEEccccccceEEEEEECCCCCceeeEEEecCCCceEEEEEcCCCCCceEEEEE
Confidence 33444444455558999999999854221 22333 356777889999988774
No 48
>PF10648 Gmad2: Immunoglobulin-like domain of bacterial spore germination; InterPro: IPR018911 This domain is found linked to IPR019606 from INTERPRO in some bacterial proteins. It is predicted to contain an immunoglobulin-like all-beta fold.
Probab=27.57 E-value=2.1e+02 Score=19.82 Aligned_cols=34 Identities=12% Similarity=0.148 Sum_probs=23.1
Q ss_pred CCcEEEEEEEEEeCC---eeeEEEEEEcCCCcEEEEe
Q psy189 90 RNTSVILEFQVVTGG---KYDVDVSIEAPTKEIIYRQ 123 (171)
Q Consensus 90 ~~~~v~~~y~V~~gg---~~dI~v~V~dP~g~~v~~~ 123 (171)
.|+.|.-.+.|.+.. .-.+.+.|.|.+|+++...
T Consensus 10 pg~~V~sp~~V~G~A~~FEgtv~~rv~D~~g~vl~e~ 46 (88)
T PF10648_consen 10 PGDTVSSPVKVSGKARVFEGTVNIRVRDGHGEVLAEG 46 (88)
T ss_pred CcCCcCCCEEEEEEEEEeeeEEEEEEEcCCCcEEEEe
Confidence 356666666665322 2258899999999999553
No 49
>cd05860 Ig4_SCFR Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). Ig4_SCFR: The fourth Immunoglobulin (Ig)-like domain in stem cell factor receptor (SCFR). SCFR is organized as an extracellular component having five IG-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR. This fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth domain abolishes the ligand-induced dimerization of SCFR and completely inhibits signal transduction.
Probab=26.69 E-value=1.2e+02 Score=21.95 Aligned_cols=25 Identities=24% Similarity=0.190 Sum_probs=18.8
Q ss_pred ccceeEEEEeecCCCCCcceEEEEEEEE
Q psy189 136 LVEGTYVVCFSNEFSTYSSALEVISTHA 163 (171)
Q Consensus 136 ~~~G~Y~~CF~N~~s~~~~K~V~F~i~~ 163 (171)
.+.|.|.|=..|.+.. +.+.|++++
T Consensus 75 ~E~G~YTf~a~N~~~~---~s~tF~l~v 99 (101)
T cd05860 75 TEGGTYTFLVSNSDAS---ASVTFNVYV 99 (101)
T ss_pred hhCcEEEEEEECCCCe---EEEEEEEEE
Confidence 3789999999887766 567777654
No 50
>PRK02710 plastocyanin; Provisional
Probab=26.68 E-value=2.5e+02 Score=20.27 Aligned_cols=73 Identities=14% Similarity=0.100 Sum_probs=32.6
Q ss_pred cceEEEEEEcCCC-cceEE---EeccCCcEEEEEEEEEeCCeeeEEEEEEcCCCcEEEEe-eecceeeEEEEeccceeE-
Q psy189 68 QAVELTFELADNA-RECFF---EDIKRNTSVILEFQVVTGGKYDVDVSIEAPTKEIIYRQ-VKSQFDTHQFVTLVEGTY- 141 (171)
Q Consensus 68 ~~~~ltf~l~~~~-~~CFy---e~~~~~~~v~~~y~V~~gg~~dI~v~V~dP~g~~v~~~-~~~s~G~f~Fta~~~G~Y- 141 (171)
.+...++.+...+ .-.|. -+++.|+.|.+ ...+...+++.+ .+... .-.+. .-.....++++...+|+|
T Consensus 27 ~a~~~~V~~~~~~~~~~F~P~~i~v~~Gd~V~~--~N~~~~~H~v~~--~~~~~-~~~~~~~~~pg~t~~~tF~~~G~y~ 101 (119)
T PRK02710 27 SAETVEVKMGSDAGMLAFEPSTLTIKAGDTVKW--VNNKLAPHNAVF--DGAKE-LSHKDLAFAPGESWEETFSEAGTYT 101 (119)
T ss_pred ccceEEEEEccCCCeeEEeCCEEEEcCCCEEEE--EECCCCCceEEe--cCCcc-ccccccccCCCCEEEEEecCCEEEE
Confidence 4556666665432 33554 24567886543 322333444433 22211 10111 111222344554458999
Q ss_pred EEEe
Q psy189 142 VVCF 145 (171)
Q Consensus 142 ~~CF 145 (171)
-+|-
T Consensus 102 y~C~ 105 (119)
T PRK02710 102 YYCE 105 (119)
T ss_pred EEcC
Confidence 4575
No 51
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=26.54 E-value=1.8e+02 Score=19.11 Aligned_cols=35 Identities=20% Similarity=0.240 Sum_probs=23.3
Q ss_pred EEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeec
Q psy189 108 VDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSN 147 (171)
Q Consensus 108 I~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N 147 (171)
+.+.+.+.+.. ......|.|++.++ .|.|.+-|+-
T Consensus 18 a~V~~~~~~~~----~~Td~~G~F~i~~~-~g~~~l~is~ 52 (88)
T PF13715_consen 18 ATVYLKNTKKG----TVTDENGRFSIKLP-EGDYTLKISY 52 (88)
T ss_pred eEEEEeCCcce----EEECCCeEEEEEEc-CCCeEEEEEE
Confidence 55555555411 12357999999955 8899998854
No 52
>PF12904 Collagen_bind_2: Putative collagen-binding domain of a collagenase ; InterPro: IPR024749 This domain is likely to be the collagen-binding domain of a family of bacterial collagenase enzymes. The structure of one family member, Q8A905 from SWISSPROT, has been characterised. The domain occurs in the C-terminal region of the protein.; PDB: 3KZS_D.
Probab=25.72 E-value=1.1e+02 Score=21.79 Aligned_cols=54 Identities=13% Similarity=0.080 Sum_probs=27.7
Q ss_pred eEEEeccCCcEEEEEEEEEeCCeeeEEEEEEcC-CCcEEEEeeecceeeEEEEeccc
Q psy189 83 CFFEDIKRNTSVILEFQVVTGGKYDVDVSIEAP-TKEIIYRQVKSQFDTHQFVTLVE 138 (171)
Q Consensus 83 CFye~~~~~~~v~~~y~V~~gg~~dI~v~V~dP-~g~~v~~~~~~s~G~f~Fta~~~ 138 (171)
+.+-..+.|..+.+.-.-++| ..+.+.-+|| +|+.-...+-...+...|+.+..
T Consensus 26 y~~VY~~~Gr~~~vdl~~l~g--~~~~a~WfdPR~G~~~~~g~~~~~~~~~F~pP~~ 80 (93)
T PF12904_consen 26 YALVYTPTGRPFTVDLSKLSG--KKVKAWWFDPRTGKYTYIGEFSNKGIQTFTPPSG 80 (93)
T ss_dssp EEEEEESS---EEEEGGGSS---SEEEEEEEETTT-BEEEEEEEES-SEEEE--SS-
T ss_pred EEEEECCCCCEEEEEcccccC--CceeEEEEcCCCCCEEEeeeecCCcceEecCCCC
Confidence 555556666654444322233 2378888999 56666555555568888888866
No 53
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=25.28 E-value=2.1e+02 Score=18.88 Aligned_cols=25 Identities=12% Similarity=0.225 Sum_probs=14.6
Q ss_pred EEEEEcCCCcEEEEeeecceeeEEE
Q psy189 109 DVSIEAPTKEIIYRQVKSQFDTHQF 133 (171)
Q Consensus 109 ~v~V~dP~g~~v~~~~~~s~G~f~F 133 (171)
=+.|+|.+|+.+++..-.....+++
T Consensus 10 Wv~V~d~dG~~~~~~~l~~G~~~~~ 34 (77)
T PF13464_consen 10 WVEVTDADGKVLFSGTLKAGETKTF 34 (77)
T ss_pred EEEEEeCCCcEeeeeeeCCCcEEEE
Confidence 4566666777766654444444555
No 54
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=25.26 E-value=2.2e+02 Score=19.18 Aligned_cols=49 Identities=16% Similarity=0.144 Sum_probs=28.3
Q ss_pred EEEEEEeCCeeeEEEEEEcCCCcEEEEeeecceeeEEEEec--cceeEEEEe
Q psy189 96 LEFQVVTGGKYDVDVSIEAPTKEIIYRQVKSQFDTHQFVTL--VEGTYVVCF 145 (171)
Q Consensus 96 ~~y~V~~gg~~dI~v~V~dP~g~~v~~~~~~s~G~f~Fta~--~~G~Y~~CF 145 (171)
+.|.+-...+..|.+.+.+.+...+ .-++...|-|+.+.. .+|.|.+.|
T Consensus 8 v~F~vwAP~A~~V~L~~~~~~~~~~-~m~~~~~G~W~~~v~~l~~g~Y~Y~~ 58 (85)
T cd02858 8 VTFRLFAPKANEVQVRGSWGGAGSH-PMTKDEAGVWSVTTGPLAPGIYTYSF 58 (85)
T ss_pred EEEEEECCCCCEEEEEeecCCCccE-eCeECCCeEEEEEECCCCCcEEEEEE
Confidence 4566666666667666655322221 113445788888763 567666655
No 55
>PHA02665 hypothetical protein; Provisional
Probab=24.90 E-value=51 Score=27.54 Aligned_cols=31 Identities=23% Similarity=0.557 Sum_probs=27.2
Q ss_pred EEcCCCcceEEEeccCCcEEEEEEEEEeCCe
Q psy189 75 ELADNARECFFEDIKRNTSVILEFQVVTGGK 105 (171)
Q Consensus 75 ~l~~~~~~CFye~~~~~~~v~~~y~V~~gg~ 105 (171)
..+|+.++|.|-++..+......|.|.+|+-
T Consensus 24 v~ep~kkkc~ftkirt~~s~a~ry~vsdg~l 54 (322)
T PHA02665 24 VFEPGKKKCVFTKIRTSSSLACRYAVSDGGL 54 (322)
T ss_pred ccccccceeEEEEEecchhhhheeeeccCcc
Confidence 3479999999999999888899999998874
No 56
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=22.77 E-value=2.9e+02 Score=19.68 Aligned_cols=51 Identities=10% Similarity=0.019 Sum_probs=28.1
Q ss_pred EEEEEEcCCCcEEEEeee----------cceeeEEEEeccceeEEEEeecCCCCCcceEEEEEE
Q psy189 108 VDVSIEAPTKEIIYRQVK----------SQFDTHQFVTLVEGTYVVCFSNEFSTYSSALEVIST 161 (171)
Q Consensus 108 I~v~V~dP~g~~v~~~~~----------~s~G~f~Fta~~~G~Y~~CF~N~~s~~~~K~V~F~i 161 (171)
+.+.| +|+.+..... ...-..++.....|.|.+.+.-.........+.+.|
T Consensus 72 ~~l~i---dg~~vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y~~~~~~~~~~~l~w 132 (145)
T PF07691_consen 72 ARLWI---DGKLVIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEYFNRGGDASLIQLSW 132 (145)
T ss_dssp EEEEE---TTEEEEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEEEECSCSSCCEEEEE
T ss_pred EEEEE---CCEEEEcCCccccccccccccceEEEEEEeeCCeeEEEEEEEEECCCCCEEEEEEE
Confidence 34455 7777776543 235556677778899999986433332222245554
No 57
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=22.09 E-value=2.2e+02 Score=18.20 Aligned_cols=47 Identities=13% Similarity=0.039 Sum_probs=25.7
Q ss_pred EEEEEEcCCCcEEEEeeecceeeEEEEeccceeEEEEeecCCCCCcceEEEEEE
Q psy189 108 VDVSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYVVCFSNEFSTYSSALEVIST 161 (171)
Q Consensus 108 I~v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~~CF~N~~s~~~~K~V~F~i 161 (171)
.-+..++.+|+.+-..+-.-+| .+.....|.|.+=+.-.. +...|.+
T Consensus 19 ~~v~at~~dG~~~~~~~~~vs~--~~d~~~~G~y~Vt~~y~~-----~t~t~~V 65 (67)
T PF07523_consen 19 LFVTATYSDGTSLPLSDVTVSG--TVDTSKAGTYTVTYTYKG-----VTATFTV 65 (67)
T ss_dssp HEEEEEETTS-ES-GCCSEEES-----TTS-CCEEEEEEECT-----EEEEEEE
T ss_pred CEEEEEEcCCCEeceeeeEEEe--eeecCCCceEEEEEEECC-----EEEEEEE
Confidence 4577888888885433333334 678888999988775443 4445544
No 58
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=21.67 E-value=3e+02 Score=20.78 Aligned_cols=25 Identities=16% Similarity=0.174 Sum_probs=14.0
Q ss_pred cceeEEEEeecCCC--------CCcceEEEEEE
Q psy189 137 VEGTYVVCFSNEFS--------TYSSALEVIST 161 (171)
Q Consensus 137 ~~G~Y~~CF~N~~s--------~~~~K~V~F~i 161 (171)
.+|+|++=|+-... -+....|.|.+
T Consensus 81 ~~G~Y~L~F~t~~Yf~~~~~~~F~p~v~v~F~i 113 (137)
T PRK15036 81 TTGDYRVVFKTGDYFKKQNLESFFPEIPVEFHI 113 (137)
T ss_pred CCeeEEEEEEcchhhhccCCCCCcceeEEEEEE
Confidence 46777777753322 13345666665
No 59
>PF08896 DUF1842: Domain of unknown function (DUF1842); InterPro: IPR014992 This domain is found at the N terminus of proteins that are functionally uncharacterised.
Probab=20.96 E-value=3.5e+02 Score=19.95 Aligned_cols=56 Identities=18% Similarity=0.255 Sum_probs=33.7
Q ss_pred EEEEEEEEEeC----CeeeEEEEEEcCCCcEEEEee--ecceeeEEEEeccceeE-EEEeecC
Q psy189 93 SVILEFQVVTG----GKYDVDVSIEAPTKEIIYRQV--KSQFDTHQFVTLVEGTY-VVCFSNE 148 (171)
Q Consensus 93 ~v~~~y~V~~g----g~~dI~v~V~dP~g~~v~~~~--~~s~G~f~Fta~~~G~Y-~~CF~N~ 148 (171)
.+..+|.|-.+ ....+++.|.-|++++.-..+ ....--..|.++-.|.| .+|+.+.
T Consensus 3 LF~v~y~i~~~~~Gap~L~L~L~V~~~~~~VsG~a~ItQat~ppl~~~s~v~G~~~~~~~~~~ 65 (114)
T PF08896_consen 3 LFPVSYRIGTGLPGAPVLTLDLLVNTPDKSVSGRARITQATNPPLNFHSDVWGQYSYMGLMPP 65 (114)
T ss_pred eeEEEEEecCCCCCCcEEEEEEEEeCCCCEEEeEEEEEEecCCCcceEEEeEEeEEEEEecCC
Confidence 34567777432 245788888888777654431 22233456777777877 5677533
No 60
>PF02974 Inh: Protease inhibitor Inh; InterPro: IPR021140 This entry represents the metalloprotease inhibitor I38, as well as the outer membrane lipoprotein Omp19. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of proteins represent monomeric serralysin inhibitors of about 125 residues, which interact with specific metalloprotease which are synthesised by serralysin secretors and characterised by being plant, insect and animal pathogens. It is probable that the serralysin inhibitors protect the host from proteolysis during export of the protease. The members of this family belong to MEROPS proteinase inhibitor family I38, clan IK. X-ray crystallography of a complex between the Serratia marcescens protease, SmaPI, and the inhibitor of Erwinia chrysanthemi, Inh, reveals that Inh is folded into an eight-stranded b-barrel with an N-terminal trunk of 10 residues. Residues 1-5 occupy part of the extended active site of the proteinase, thereby preventing access of the substrate. Residues 6-10 form a linker that connects the N-terminal proteinase-binding peptide to the body of the b-barrel. The backbone carbonyl of Ser-1 interacts with the catalytic zinc; the Ser-2 side chain occupies the S1'-binding site and also forms a hydrogen bond to the carboxyl end of the catalytic Glu, whereas Leu-3 occupies the S2' recognition site. Penetration of the trunk region further than 5 residues into the substrate binding cleft appears to be prevented by the b-barrel, which itself interacts with the proteinase near its Met turn (19). Peptide mimetics of the trunk at concentrations up to about 100 mM do not inhibit the protease, demonstrating that the barrel is essential for inhibitory activity [, ]. Structurally and functionally these inhibitors are closely related to the lipocalins, fatty acid-binding proteins, avidins and the enigmatic triabin. Together these five protein families constitute the calycin superfamily []. The proteins are characterised by their high specificity for small hydrophobic molecules and by their ability to form complexes with soluble macromolecules either through intramolecular disulphides or protein-protein interactions []. ; PDB: 1JIW_I 2RN4_A 1SMP_I.
Probab=20.86 E-value=3.1e+02 Score=19.27 Aligned_cols=33 Identities=6% Similarity=-0.113 Sum_probs=21.7
Q ss_pred EEEEcCCCcEEEEeeecceeeEEEEeccceeEE
Q psy189 110 VSIEAPTKEIIYRQVKSQFDTHQFVTLVEGTYV 142 (171)
Q Consensus 110 v~V~dP~g~~v~~~~~~s~G~f~Fta~~~G~Y~ 142 (171)
+.++|.+|+.|..-.+...|.|.=..+..+.+.
T Consensus 64 l~L~d~~G~~v~~f~~~~~g~~~g~~~~g~~~~ 96 (99)
T PF02974_consen 64 LVLTDADGSVVAFFYRSGDGRFEGQTPDGQPLS 96 (99)
T ss_dssp EEEE-TTS-EEEEEEEECTTEEEEEECCCEEEE
T ss_pred EEEECCCCCEEEEEEccCCeeEEeEcCCCCEEE
Confidence 677888888887766667777776666555543
No 61
>cd09011 Glo_EDI_BRP_like_23 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping.
Probab=20.57 E-value=86 Score=21.69 Aligned_cols=15 Identities=33% Similarity=0.242 Sum_probs=12.3
Q ss_pred EEEEEEcCCCcEEEE
Q psy189 108 VDVSIEAPTKEIIYR 122 (171)
Q Consensus 108 I~v~V~dP~g~~v~~ 122 (171)
-.+.+.||+|+.+.-
T Consensus 102 r~~~~~DPdGn~iei 116 (120)
T cd09011 102 RVVRFYDPDKHIIEV 116 (120)
T ss_pred EEEEEECCCCCEEEE
Confidence 468899999998753
Done!