Query psy3287 Match_columns 122 No_of_seqs 129 out of 788 Neff 6.4 Searched_HMMs 46136 Date Fri Aug 16 21:09:16 2013 Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3287.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3287hhsearch_cdd -cpu 12 -v 0 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PF00930 DPPIV_N: Dipeptidyl p 99.9 1.3E-23 2.9E-28 168.8 5.4 83 28-121 47-133 (353) 2 KOG2281|consensus 99.8 1.7E-20 3.6E-25 161.1 4.8 87 35-121 215-301 (867) 3 KOG2100|consensus 99.4 4.2E-13 9.1E-18 118.3 6.6 83 28-120 152-238 (755) 4 KOG2281|consensus 98.8 2.9E-09 6.2E-14 92.6 2.6 112 2-121 229-349 (867) 5 PRK04043 tolB translocation pr 97.0 0.0047 1E-07 51.4 8.7 69 2-110 265-340 (419) 6 PRK05137 tolB translocation pr 96.9 0.0081 1.8E-07 49.6 9.1 72 2-111 278-356 (435) 7 PRK03629 tolB translocation pr 96.9 0.0087 1.9E-07 49.6 9.1 72 2-111 275-353 (429) 8 PRK01029 tolB translocation pr 96.5 0.016 3.4E-07 48.3 8.1 53 27-107 188-249 (428) 9 PRK04043 tolB translocation pr 96.4 0.022 4.7E-07 47.5 8.2 69 2-108 221-296 (419) 10 PRK03629 tolB translocation pr 96.3 0.035 7.7E-07 46.0 8.9 56 27-110 246-308 (429) 11 PRK04922 tolB translocation pr 96.2 0.045 9.8E-07 45.2 9.2 54 29-110 297-357 (433) 12 PRK04792 tolB translocation pr 96.1 0.061 1.3E-06 44.9 9.5 55 29-111 311-372 (448) 13 PRK04792 tolB translocation pr 96.0 0.075 1.6E-06 44.4 9.9 57 28-112 222-285 (448) 14 PRK00178 tolB translocation pr 96.0 0.072 1.6E-06 43.6 9.3 55 29-111 292-353 (430) 15 PRK00178 tolB translocation pr 95.9 0.067 1.5E-06 43.7 8.8 55 29-111 204-265 (430) 16 PRK05137 tolB translocation pr 95.8 0.069 1.5E-06 44.1 8.5 56 28-111 206-268 (435) 17 PRK01029 tolB translocation pr 95.6 0.086 1.9E-06 43.9 8.6 55 28-110 331-392 (428) 18 PRK02889 tolB translocation pr 95.4 0.14 3E-06 42.4 9.1 55 29-112 333-394 (427) 19 PRK02889 tolB translocation pr 95.4 0.16 3.4E-06 42.0 9.3 62 28-117 200-268 (427) 20 PRK01742 tolB translocation pr 95.2 0.2 4.4E-06 41.3 9.3 62 28-117 252-320 (429) 21 PRK01742 tolB translocation pr 95.2 0.21 4.6E-06 41.2 9.4 61 28-116 208-275 (429) 22 PRK04922 tolB translocation pr 95.0 0.3 6.6E-06 40.3 9.8 53 29-109 209-268 (433) 23 TIGR02800 propeller_TolB tol-p 94.6 0.34 7.3E-06 39.0 9.1 53 30-110 284-343 (417) 24 COG0823 TolB Periplasmic compo 94.4 0.2 4.4E-06 42.1 7.5 69 22-119 236-311 (425) 25 COG0823 TolB Periplasmic compo 92.6 0.66 1.4E-05 39.0 7.7 54 28-109 286-346 (425) 26 PF07676 PD40: WD40-like Beta 92.4 0.26 5.7E-06 27.0 3.6 19 88-110 12-30 (39) 27 TIGR02800 propeller_TolB tol-p 92.1 1.7 3.7E-05 34.9 9.3 52 29-108 195-253 (417) 28 COG4946 Uncharacterized protei 91.7 0.53 1.1E-05 40.8 6.0 49 2-60 67-126 (668) 29 PF08450 SGL: SMP-30/Gluconola 91.1 0.4 8.7E-06 36.1 4.4 66 34-118 90-163 (246) 30 TIGR02171 Fb_sc_TIGR02171 Fibr 88.9 1.7 3.6E-05 40.1 7.1 53 29-109 315-371 (912) 31 PRK13616 lipoprotein LpqB; Pro 88.2 2.2 4.7E-05 37.5 7.2 18 86-107 449-466 (591) 32 PF00930 DPPIV_N: Dipeptidyl p 87.6 0.086 1.9E-06 42.4 -1.7 39 2-47 71-110 (353) 33 PF14583 Pectate_lyase22: Olig 87.2 2.8 6.2E-05 35.1 7.0 56 2-59 18-78 (386) 34 KOG0318|consensus 72.1 15 0.00032 32.3 6.5 65 28-111 189-258 (603) 35 PF08662 eIF2A: Eukaryotic tra 70.6 7.8 0.00017 28.7 4.1 28 87-118 146-178 (194) 36 PF14583 Pectate_lyase22: Olig 67.8 17 0.00037 30.6 5.8 52 1-59 67-120 (386) 37 PF10647 Gmad1: Lipoprotein Lp 66.1 29 0.00064 26.7 6.6 23 86-112 113-135 (253) 38 PRK13616 lipoprotein LpqB; Pro 65.0 33 0.00072 30.2 7.4 31 28-59 354-396 (591) 39 COG4946 Uncharacterized protei 56.6 9.9 0.00021 33.3 2.6 25 35-59 53-77 (668) 40 PF08662 eIF2A: Eukaryotic tra 55.4 57 0.0012 24.0 6.3 27 88-118 104-132 (194) 41 COG3386 Gluconolactonase [Carb 54.8 17 0.00036 29.4 3.5 31 84-118 162-192 (307) 42 TIGR00667 aat leucyl/phenylala 53.2 11 0.00023 28.7 2.0 26 73-106 13-38 (185) 43 COG5169 HSF1 Heat shock transc 50.0 15 0.00032 29.7 2.5 19 85-107 25-43 (282) 44 PF09729 Gti1_Pac2: Gti1/Pac2 48.5 13 0.00029 27.7 1.9 33 77-113 45-84 (168) 45 PF02897 Peptidase_S9_N: Proly 44.7 59 0.0013 26.2 5.3 22 88-113 173-194 (414) 46 TIGR03866 PQQ_ABC_repeats PQQ- 42.8 1.3E+02 0.0028 21.9 6.7 29 87-119 251-279 (300) 47 TIGR03866 PQQ_ABC_repeats PQQ- 42.2 1.3E+02 0.0029 21.9 6.7 25 89-117 35-59 (300) 48 PF12566 DUF3748: Protein of u 41.8 22 0.00048 25.2 2.1 18 88-109 71-88 (122) 49 PLN02919 haloacid dehalogenase 39.6 1.5E+02 0.0033 27.9 7.7 59 35-97 809-871 (1057) 50 KOG0643|consensus 37.2 13 0.00028 30.3 0.3 21 77-97 273-300 (327) 51 PRK10115 protease 2; Provision 36.6 44 0.00095 29.8 3.6 28 86-118 128-155 (686) 52 PF05787 DUF839: Bacterial pro 36.5 1.1E+02 0.0024 26.6 6.0 20 32-51 437-457 (524) 53 TIGR03606 non_repeat_PQQ dehyd 35.8 2E+02 0.0042 24.7 7.2 55 35-95 35-89 (454) 54 PF01436 NHL: NHL repeat; Int 35.3 47 0.001 16.8 2.3 16 35-50 7-22 (28) 55 COG3211 PhoX Predicted phospha 34.9 54 0.0012 29.2 3.8 18 33-50 502-520 (616) 56 KOG2394|consensus 34.9 25 0.00054 31.1 1.7 31 75-109 316-353 (636) 57 PF03088 Str_synth: Strictosid 33.0 87 0.0019 20.7 3.8 32 86-121 58-89 (89) 58 TIGR03606 non_repeat_PQQ dehyd 31.7 2.1E+02 0.0046 24.5 6.8 70 29-111 347-426 (454) 59 PF10584 Proteasome_A_N: Prote 29.7 25 0.00053 17.8 0.5 6 92-97 8-13 (23) 60 PF06045 Rhamnogal_lyase: Rham 29.7 70 0.0015 24.6 3.3 13 85-97 54-69 (203) 61 PF08768 DUF1794: Domain of un 27.8 1.3E+02 0.0029 21.5 4.3 29 84-116 60-89 (154) 62 PF15660 Imm49: Immunity prote 27.6 94 0.002 19.9 3.1 45 24-78 11-59 (84) 63 PF06977 SdiA-regulated: SdiA- 27.5 31 0.00067 27.0 1.0 32 17-48 206-240 (248) 64 smart00415 HSF heat shock fact 27.3 12 0.00027 25.1 -1.1 10 88-97 20-29 (105) 65 KOG2139|consensus 27.2 2.3E+02 0.005 24.2 6.1 22 85-110 239-260 (445) 66 PF02239 Cytochrom_D1: Cytochr 27.1 1.8E+02 0.0038 23.8 5.5 27 87-117 80-106 (369) 67 KOG2314|consensus 26.9 1.2E+02 0.0026 27.2 4.5 97 11-111 247-369 (698) 68 PF12894 Apc4_WD40: Anaphase-p 26.4 1.4E+02 0.003 17.2 3.7 27 87-118 14-40 (47) 69 TIGR02604 Piru_Ver_Nterm putat 25.9 1.4E+02 0.003 24.1 4.6 15 34-48 18-32 (367) 70 KOG4497|consensus 25.8 74 0.0016 26.8 3.0 12 86-97 93-104 (447) 71 PRK00301 aat leucyl/phenylalan 25.8 59 0.0013 25.5 2.3 27 72-106 40-66 (233) 72 PF10656 DUF2483: Hypothetical 25.2 67 0.0015 20.7 2.1 36 38-77 11-46 (72) 73 PF03588 Leu_Phe_trans: Leucyl 25.1 34 0.00073 25.6 0.8 25 73-105 11-35 (173) 74 PF12904 Collagen_bind_2: Puta 24.9 2.2E+02 0.0047 19.0 6.6 30 86-119 48-78 (93) 75 COG2360 Aat Leu/Phe-tRNA-prote 24.7 50 0.0011 25.7 1.7 21 73-97 34-54 (221) 76 PF15492 Nbas_N: Neuroblastoma 24.1 2.1E+02 0.0046 23.2 5.2 20 84-107 43-62 (282) 77 KOG2919|consensus 23.5 59 0.0013 27.3 1.9 21 86-110 51-71 (406) 78 KOG0645|consensus 23.4 68 0.0015 26.2 2.3 22 88-113 65-86 (312) 79 PF10596 U6-snRNA_bdg: U6-snRN 23.2 74 0.0016 23.6 2.3 29 85-113 90-119 (160) 80 PF10282 Lactonase: Lactonase, 23.2 2.8E+02 0.006 22.0 5.8 73 31-118 246-321 (345) 81 PRK11028 6-phosphogluconolacto 22.3 3.5E+02 0.0076 20.9 6.1 26 88-117 83-108 (330) 82 COG1770 PtrB Protease II [Amin 21.1 4E+02 0.0087 24.3 6.7 29 80-112 168-197 (682) 83 KOG2139|consensus 21.0 1.5E+02 0.0032 25.3 3.8 30 86-119 282-311 (445) 84 KOG0293|consensus 20.8 56 0.0012 28.2 1.4 20 88-111 273-292 (519) 85 PF14315 DUF4380: Domain of un 20.7 2.6E+02 0.0056 21.8 5.1 46 69-119 56-105 (274) 86 PTZ00152 cofilin/actin-depolym 20.3 34 0.00073 24.0 -0.0 24 86-109 72-95 (122) No 1 >PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction:Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A .... Probab=99.88 E-value=1.3e-23 Score=168.76 Aligned_cols=83 Identities=25% Similarity=0.524 Sum_probs=68.7 Q ss_pred Ccceece---EEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchhe-ecccceEEecCCCCCCceEE Q psy3287 28 IPSYVTQ---EEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEE-FSRYQGYWWQPKTTDGVYRI 103 (122) Q Consensus 28 v~~f~~~---~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE-~~~~~g~wWSPd~~~~~~~i 103 (122) .+.++|+ ++|++++|||+.+++++.++|||++| + ..++||+|||||||| |++..++||||||+ +| T Consensus 47 ~~~~sP~g~~~~~v~~~nly~~~~~~~~~~~lT~dg--~-----~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~----~l 115 (353) T PF00930_consen 47 DAKWSPDGKYIAFVRDNNLYLRDLATGQETQLTTDG--E-----PGIYNGVPDWVYEEEVFDRRSAVWWSPDSK----YL 115 (353) T ss_dssp EEEE-SSSTEEEEEETTEEEEESSTTSEEEESES----T-----TTEEESB--HHHHHHTSSSSBSEEE-TTSS----EE T ss_pred cceeecCCCeeEEEecCceEEEECCCCCeEEecccc--c-----eeEEcCccceeccccccccccceEECCCCC----EE Confidence 4555555 99999999999999999999999995 3 889999999999999 79999999999999 99 Q ss_pred EEEEEcCCcceeEECCCC Q psy3287 104 LYEEIDESDVKIYNFPSS 121 (122) Q Consensus 104 ay~~~Des~V~~~~~~~~ 121 (122) ||+++|+++|+.+++++. T Consensus 116 a~~~~d~~~v~~~~~~~~ 133 (353) T PF00930_consen 116 AFLRFDEREVPEYPLPDY 133 (353) T ss_dssp EEEEEE-TTS-EEEEEEE T ss_pred EEEEECCcCCceEEeecc Confidence 999999999999998753 No 2 >KOG2281|consensus Probab=99.81 E-value=1.7e-20 Score=161.07 Aligned_cols=87 Identities=44% Similarity=0.867 Sum_probs=83.9 Q ss_pred EEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCceEEEEEEEcCCcce Q psy3287 35 EEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILYEEIDESDVK 114 (122) Q Consensus 35 ~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~~~Des~V~ 114 (122) +||+++.+||+.++.++.++|||+.++|.+++.++.+++|+|.||.||||+|+.||||||-+..+++||||+++|||+|+ T Consensus 215 ia~i~~~dl~V~n~~~~~ekrlt~~h~g~sn~~dd~~saGVasyv~QEEfdR~~GyWW~p~s~~~~~rIlYeeVdesev~ 294 (867) T KOG2281|consen 215 IAYIKVCDLWVLNILTGEEKRLTYIHNGSSNSKDDAISAGVASYVVQEEFDRFEGYWWSPPSGLKILRILYEEVDESEVE 294 (867) T ss_pred eeeeehhhhhhhhhhhchhhceeeeeccccccccchhhcCcchHHHHHHHHhhcceeecCCCCCceEEEEeeecChhheE Confidence 99999999999999999999999999999999999999999999999999999999999987678999999999999999 Q ss_pred eEECCCC Q psy3287 115 IYNFPSS 121 (122) Q Consensus 115 ~~~~~~~ 121 (122) +++.++| T Consensus 295 V~h~~s~ 301 (867) T KOG2281|consen 295 VIHVPSP 301 (867) T ss_pred EEeccCc Confidence 9999886 No 3 >KOG2100|consensus Probab=99.40 E-value=4.2e-13 Score=118.33 Aligned_cols=83 Identities=20% Similarity=0.466 Sum_probs=73.2 Q ss_pred Ccceece---EEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchhee-cccceEEecCCCCCCceEE Q psy3287 28 IPSYVTQ---EEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEF-SRYQGYWWQPKTTDGVYRI 103 (122) Q Consensus 28 v~~f~~~---~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~-~~~~g~wWSPd~~~~~~~i 103 (122) .+.+++. ++|+.++|||++........++|.++.. +.++||.+||+|+||+ .+.+++|||||++ ++ T Consensus 152 ~~~wsp~~~~l~yv~~~niy~~~~~~~~~~~~~~~~~~------~~i~ng~~Dw~yeeEv~~~~~a~wwsp~g~----~l 221 (755) T KOG2100|consen 152 YASWSPLGNDLAYVLHNNIYYQSSEEDEDVRIVSNGGE------DVIFNGKPDWIYEEEVLSSDSAIWWSPDGD----RL 221 (755) T ss_pred EEEEcCCCCEEEEEEecccccccCcCCCceEEEecCCC------ceEEcCCCCceeehhhcccCccceeCCCCc----ee Confidence 3445544 9999999999999888888899999553 5699999999999985 8999999999999 99 Q ss_pred EEEEEcCCcceeEECCC Q psy3287 104 LYEEIDESDVKIYNFPS 120 (122) Q Consensus 104 ay~~~Des~V~~~~~~~ 120 (122) +|+.+|+++|+.+.++. T Consensus 222 a~~~~~dt~V~~~~~~~ 238 (755) T KOG2100|consen 222 AYASFNDTKVETINLPG 238 (755) T ss_pred EEEEecccccceEEecc Confidence 99999999999998874 No 4 >KOG2281|consensus Probab=98.78 E-value=2.9e-09 Score=92.63 Aligned_cols=112 Identities=29% Similarity=0.585 Sum_probs=79.9 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceeceEEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccch Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ 81 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~e 81 (122) .+++++|||++|+|.+|.-++++++|+++|++|++|.|.+++||++.......||.|.+++++.| ....++.-+.+ T Consensus 229 ~~~~ekrlt~~h~g~sn~~dd~~saGVasyv~QEEfdR~~GyWW~p~s~~~~~rIlYeeVdesev----~V~h~~s~~~~ 304 (867) T KOG2281|consen 229 LTGEEKRLTYIHNGSSNSKDDAISAGVASYVVQEEFDRFEGYWWSPPSGLKILRILYEEVDESEV----EVIHVPSPVLG 304 (867) T ss_pred hhchhhceeeeeccccccccchhhcCcchHHHHHHHHhhcceeecCCCCCceEEEEeeecChhhe----EEEeccCcccC Confidence 47899999999999999999999999999999999999999999997767899999999987533 22333333333 Q ss_pred he----e-cccceEEecCCCCCCceEEEEEEEcC----CcceeEECCCC Q psy3287 82 EE----F-SRYQGYWWQPKTTDGVYRILYEEIDE----SDVKIYNFPSS 121 (122) Q Consensus 82 EE----~-~~~~g~wWSPd~~~~~~~iay~~~De----s~V~~~~~~~~ 121 (122) ++ + -...| +++.+ .+++++=...|+ .+|+...++.| T Consensus 305 ~~~~d~~rYPrtG---t~Nak-~~Lkmv~~~~~~~gk~~~v~~kdlv~~ 349 (867) T KOG2281|consen 305 DRRVDSYRYPRTG---TKNAK-STLKMVEFENDAEGKIYDVPLKDLVQP 349 (867) T ss_pred CCccccccCCcCC---CCCce-eeEEEEEEeeccccceeccchhhcccc Confidence 32 2 11234 55544 245555555543 34455555444 No 5 >PRK04043 tolB translocation protein TolB; Provisional Probab=97.01 E-value=0.0047 Score=51.44 Aligned_cols=69 Identities=14% Similarity=0.171 Sum_probs=50.2 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeec Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAG 74 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G 74 (122) .+++.+|||... + ....+.|+++ ++|+.+. +||+.++.+|+.+|||+. +. T Consensus 265 ~~g~~~~LT~~~-~---------~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~--g~----------- 321 (419) T PRK04043 265 NTKTLTQITNYP-G---------IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFH--GK----------- 321 (419) T ss_pred CCCcEEEcccCC-C---------ccCccEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccC--CC----------- Confidence 466777887632 1 1223467776 9999876 899999999999999886 41 Q ss_pred cCCccchheecccceEEecCCCCCCceEEEEEEEcC Q psy3287 75 IPSYVTQEEFSRYQGYWWQPKTTDGVYRILYEEIDE 110 (122) Q Consensus 75 ~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~~~De 110 (122) + + .-|||||+ +|+|..... T Consensus 322 -----~------~--~~~SPDG~----~Ia~~~~~~ 340 (419) T PRK04043 322 -----N------N--SSVSTYKN----YIVYSSRET 340 (419) T ss_pred -----c------C--ceECCCCC----EEEEEEcCC Confidence 0 1 25999999 999998754 No 6 >PRK05137 tolB translocation protein TolB; Provisional Probab=96.89 E-value=0.0081 Score=49.55 Aligned_cols=72 Identities=25% Similarity=0.428 Sum_probs=48.1 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeec Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAG 74 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G 74 (122) .+++.+|||... + ....+.|+++ ++|..+. +||+.++.++..++||.. +. T Consensus 278 ~~~~~~~Lt~~~-~---------~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~--~~----------- 334 (435) T PRK05137 278 RSGTTTRLTDSP-A---------IDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFG--GG----------- 334 (435) T ss_pred CCCceEEccCCC-C---------ccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecC--CC----------- Confidence 456667776532 1 1123566666 7888753 699999888888888875 21 Q ss_pred cCCccchheecccceEEecCCCCCCceEEEEEEEcCC Q psy3287 75 IPSYVTQEEFSRYQGYWWQPKTTDGVYRILYEEIDES 111 (122) Q Consensus 75 ~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~~~Des 111 (122) .+....|||||+ +|+|...+.. T Consensus 335 -----------~~~~~~~SpdG~----~ia~~~~~~~ 356 (435) T PRK05137 335 -----------RYSTPVWSPRGD----LIAFTKQGGG 356 (435) T ss_pred -----------cccCeEECCCCC----EEEEEEcCCC Confidence 123456999999 9999886543 No 7 >PRK03629 tolB translocation protein TolB; Provisional Probab=96.86 E-value=0.0087 Score=49.62 Aligned_cols=72 Identities=14% Similarity=0.179 Sum_probs=48.5 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeec Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAG 74 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G 74 (122) .+++.+|||.... ....+.|+++ ++|+.+. +||..++.++..++||.. +. T Consensus 275 ~tg~~~~lt~~~~----------~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~--~~----------- 331 (429) T PRK03629 275 ASGQIRQVTDGRS----------NNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITWE--GS----------- 331 (429) T ss_pred CCCCEEEccCCCC----------CcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeecC--CC----------- Confidence 3566677765421 1224566665 8888754 699988888888888765 21 Q ss_pred cCCccchheecccceEEecCCCCCCceEEEEEEEcCC Q psy3287 75 IPSYVTQEEFSRYQGYWWQPKTTDGVYRILYEEIDES 111 (122) Q Consensus 75 ~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~~~Des 111 (122) .....+|||||+ +|+|...+.. T Consensus 332 -----------~~~~~~~SpDG~----~Ia~~~~~~g 353 (429) T PRK03629 332 -----------QNQDADVSSDGK----FMVMVSSNGG 353 (429) T ss_pred -----------CccCEEECCCCC----EEEEEEccCC Confidence 123467999999 9999876543 No 8 >PRK01029 tolB translocation protein TolB; Provisional Probab=96.48 E-value=0.016 Score=48.26 Aligned_cols=53 Identities=6% Similarity=0.105 Sum_probs=38.4 Q ss_pred CCcceece-----EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCC Q psy3287 27 GIPSYVTQ-----EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTT 97 (122) Q Consensus 27 Gv~~f~~~-----~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~ 97 (122) ..|.|+|+ ++|+.. .+||+.++.+|+.++||..... ..+.-|||||+ T Consensus 188 ~sP~wSPDG~~~~~~y~S~~~g~~~I~~~~l~~g~~~~lt~~~g~------------------------~~~p~wSPDG~ 243 (428) T PRK01029 188 ITPTWMHIGSGFPYLYVSYKLGVPKIFLGSLENPAGKKILALQGN------------------------QLMPTFSPRKK 243 (428) T ss_pred ccceEccCCCceEEEEEEccCCCceEEEEECCCCCceEeecCCCC------------------------ccceEECCCCC Confidence 35677777 455554 4699999999999999875111 12346999999 Q ss_pred CCceEEEEEE Q psy3287 98 DGVYRILYEE 107 (122) Q Consensus 98 ~~~~~iay~~ 107 (122) +|||.. T Consensus 244 ----~Laf~s 249 (428) T PRK01029 244 ----LLAFIS 249 (428) T ss_pred ----EEEEEE Confidence 999976 No 9 >PRK04043 tolB translocation protein TolB; Provisional Probab=96.36 E-value=0.022 Score=47.52 Aligned_cols=69 Identities=7% Similarity=0.028 Sum_probs=46.0 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeec Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAG 74 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G 74 (122) .||+.++||.. . .....+.|+++ ++|... .+||+.++.++..++||.. . . + T Consensus 221 ~tg~~~~lt~~-~---------g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~--~------~-~--- 278 (419) T PRK04043 221 YTGKKEKIASS-Q---------GMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNY--P------G-I--- 278 (419) T ss_pred CCCcEEEEecC-C---------CcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccC--C------C-c--- Confidence 46777777752 1 11223456766 777764 6799999988988999865 2 0 0 Q ss_pred cCCccchheecccceEEecCCCCCCceEEEEEEE Q psy3287 75 IPSYVTQEEFSRYQGYWWQPKTTDGVYRILYEEI 108 (122) Q Consensus 75 ~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~~~ 108 (122) ...--|||||+ +|+|..- T Consensus 279 ------------d~~p~~SPDG~----~I~F~Sd 296 (419) T PRK04043 279 ------------DVNGNFVEDDK----RIVFVSD 296 (419) T ss_pred ------------cCccEECCCCC----EEEEEEC Confidence 11124999999 9999863 No 10 >PRK03629 tolB translocation protein TolB; Provisional Probab=96.25 E-value=0.035 Score=45.99 Aligned_cols=56 Identities=14% Similarity=0.287 Sum_probs=40.0 Q ss_pred CCcceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCC Q psy3287 27 GIPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDG 99 (122) Q Consensus 27 Gv~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~ 99 (122) +.+.|+++ ++|+.. .+||+.++.++..++||.. .. ......|||||+ T Consensus 246 ~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~--~~----------------------~~~~~~wSPDG~-- 299 (429) T PRK03629 246 GAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDG--RS----------------------NNTEPTWFPDSQ-- 299 (429) T ss_pred CCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCC--CC----------------------CcCceEECCCCC-- Confidence 34677777 788643 3599999988888888755 21 023457999999 Q ss_pred ceEEEEEEEcC Q psy3287 100 VYRILYEEIDE 110 (122) Q Consensus 100 ~~~iay~~~De 110 (122) +|+|..-+. T Consensus 300 --~I~f~s~~~ 308 (429) T PRK03629 300 --NLAYTSDQA 308 (429) T ss_pred --EEEEEeCCC Confidence 999987543 No 11 >PRK04922 tolB translocation protein TolB; Provisional Probab=96.18 E-value=0.045 Score=45.18 Aligned_cols=54 Identities=20% Similarity=0.282 Sum_probs=38.8 Q ss_pred cceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.|+++ ++|..+. +||+.++.++..++||+. +. ......|||||+ T Consensus 297 ~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~--g~----------------------~~~~~~~SpDG~---- 348 (433) T PRK04922 297 PTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQ--GN----------------------YNARASVSPDGK---- 348 (433) T ss_pred eEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecC--CC----------------------CccCEEECCCCC---- Confidence 466765 7887643 499999888888888865 21 023457999999 Q ss_pred EEEEEEEcC Q psy3287 102 RILYEEIDE 110 (122) Q Consensus 102 ~iay~~~De 110 (122) +|+|...+. T Consensus 349 ~Ia~~~~~~ 357 (433) T PRK04922 349 KIAMVHGSG 357 (433) T ss_pred EEEEEECCC Confidence 999986543 No 12 >PRK04792 tolB translocation protein TolB; Provisional Probab=96.07 E-value=0.061 Score=44.93 Aligned_cols=55 Identities=20% Similarity=0.291 Sum_probs=38.3 Q ss_pred cceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.|+++ ++|..+ .+||+.++.+++.++||+. +. .+....|||||+ T Consensus 311 p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~--g~----------------------~~~~~~~SpDG~---- 362 (448) T PRK04792 311 PSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFE--GE----------------------QNLGGSITPDGR---- 362 (448) T ss_pred eEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecC--CC----------------------CCcCeeECCCCC---- Confidence 456665 777764 3599999988888999876 31 011235899999 Q ss_pred EEEEEEEcCC Q psy3287 102 RILYEEIDES 111 (122) Q Consensus 102 ~iay~~~Des 111 (122) +|+|...+.. T Consensus 363 ~l~~~~~~~g 372 (448) T PRK04792 363 SMIMVNRTNG 372 (448) T ss_pred EEEEEEecCC Confidence 9999776543 No 13 >PRK04792 tolB translocation protein TolB; Provisional Probab=96.04 E-value=0.075 Score=44.40 Aligned_cols=57 Identities=11% Similarity=0.177 Sum_probs=39.8 Q ss_pred Ccceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) .+.|+++ ++|+.+. +||+.++.+|+.++|+... + ......|||||+ T Consensus 222 ~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~~~-g-----------------------~~~~~~wSPDG~--- 274 (448) T PRK04792 222 SPAWSPDGRKLAYVSFENRKAEIFVQDIYTQVREKVTSFP-G-----------------------INGAPRFSPDGK--- 274 (448) T ss_pred CceECCCCCEEEEEEecCCCcEEEEEECCCCCeEEecCCC-C-----------------------CcCCeeECCCCC--- Confidence 5667776 8887643 5999999888877776541 1 012467999999 Q ss_pred eEEEEEEEcCCc Q psy3287 101 YRILYEEIDESD 112 (122) Q Consensus 101 ~~iay~~~Des~ 112 (122) +|+|..-.+.. T Consensus 275 -~La~~~~~~g~ 285 (448) T PRK04792 275 -KLALVLSKDGQ 285 (448) T ss_pred -EEEEEEeCCCC Confidence 99997654443 No 14 >PRK00178 tolB translocation protein TolB; Provisional Probab=95.96 E-value=0.072 Score=43.55 Aligned_cols=55 Identities=11% Similarity=0.116 Sum_probs=39.1 Q ss_pred cceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.|+++ ++|..+. +||+.++.+++.++|++.+.. .....|||||+ T Consensus 292 ~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~~~------------------------~~~~~~Spdg~---- 343 (430) T PRK00178 292 PFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVGNY------------------------NARPRLSADGK---- 343 (430) T ss_pred eEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCCC------------------------ccceEECCCCC---- Confidence 456665 7787643 699999888888888865210 12346999999 Q ss_pred EEEEEEEcCC Q psy3287 102 RILYEEIDES 111 (122) Q Consensus 102 ~iay~~~Des 111 (122) +|+|...+.. T Consensus 344 ~i~~~~~~~~ 353 (430) T PRK00178 344 TLVMVHRQDG 353 (430) T ss_pred EEEEEEccCC Confidence 9999876543 No 15 >PRK00178 tolB translocation protein TolB; Provisional Probab=95.90 E-value=0.067 Score=43.75 Aligned_cols=55 Identities=15% Similarity=0.340 Sum_probs=37.6 Q ss_pred cceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.++++ ++|+... +||+.++.+|+.++|+... + ......|||||+ T Consensus 204 p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~~-g-----------------------~~~~~~~SpDG~---- 255 (430) T PRK00178 204 PRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNFE-G-----------------------LNGAPAWSPDGS---- 255 (430) T ss_pred eeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCCC-C-----------------------CcCCeEECCCCC---- Confidence 455655 8887633 5999999888877776431 1 023467999999 Q ss_pred EEEEEEEcCC Q psy3287 102 RILYEEIDES 111 (122) Q Consensus 102 ~iay~~~Des 111 (122) +|+|..-... T Consensus 256 ~la~~~~~~g 265 (430) T PRK00178 256 KLAFVLSKDG 265 (430) T ss_pred EEEEEEccCC Confidence 9999765443 No 16 >PRK05137 tolB translocation protein TolB; Provisional Probab=95.78 E-value=0.069 Score=44.05 Aligned_cols=56 Identities=9% Similarity=0.138 Sum_probs=39.0 Q ss_pred Ccceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) -+.|+++ ++|+.. ..||+.++.+|..++|+.. . + ...+..|||||+ T Consensus 206 ~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~--~------g----------------~~~~~~~SPDG~--- 258 (435) T PRK05137 206 TPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNF--P------G----------------MTFAPRFSPDGR--- 258 (435) T ss_pred eeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecC--C------C----------------cccCcEECCCCC--- Confidence 3556666 888864 5799999988887777643 1 0 123457999999 Q ss_pred eEEEEEEEcCC Q psy3287 101 YRILYEEIDES 111 (122) Q Consensus 101 ~~iay~~~Des 111 (122) +|+|..-... T Consensus 259 -~la~~~~~~g 268 (435) T PRK05137 259 -KVVMSLSQGG 268 (435) T ss_pred -EEEEEEecCC Confidence 9998765443 No 17 >PRK01029 tolB translocation protein TolB; Provisional Probab=95.64 E-value=0.086 Score=43.89 Aligned_cols=55 Identities=15% Similarity=0.212 Sum_probs=39.0 Q ss_pred Ccceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) .+.++++ ++|+.. .+||+.++.+++.++||.. +. . ....-|||||+ T Consensus 331 ~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~--~~------~----------------~~~p~wSpDG~--- 383 (428) T PRK01029 331 CPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTS--PE------N----------------KESPSWAIDSL--- 383 (428) T ss_pred ceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCC--CC------C----------------ccceEECCCCC--- Confidence 4566776 888864 4699999999988888754 20 0 12256999999 Q ss_pred eEEEEEEEcC Q psy3287 101 YRILYEEIDE 110 (122) Q Consensus 101 ~~iay~~~De 110 (122) +|+|...+. T Consensus 384 -~L~f~~~~~ 392 (428) T PRK01029 384 -HLVYSAGNS 392 (428) T ss_pred -EEEEEECCC Confidence 999876653 No 18 >PRK02889 tolB translocation protein TolB; Provisional Probab=95.43 E-value=0.14 Score=42.38 Aligned_cols=55 Identities=18% Similarity=0.207 Sum_probs=37.8 Q ss_pred cceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.++++ ++|.... +||+.++.++..++||..+. ....-|||||+ T Consensus 333 ~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~~-------------------------~~~p~~spdg~---- 383 (427) T PRK02889 333 PRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTTR-------------------------DESPSFAPNGR---- 383 (427) T ss_pred eEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCCC-------------------------ccCceECCCCC---- Confidence 456665 7777643 48999988888777764311 01235999999 Q ss_pred EEEEEEEcCCc Q psy3287 102 RILYEEIDESD 112 (122) Q Consensus 102 ~iay~~~Des~ 112 (122) .|+|...+... T Consensus 384 ~l~~~~~~~g~ 394 (427) T PRK02889 384 YILYATQQGGR 394 (427) T ss_pred EEEEEEecCCC Confidence 99998876543 No 19 >PRK02889 tolB translocation protein TolB; Provisional Probab=95.39 E-value=0.16 Score=42.04 Aligned_cols=62 Identities=13% Similarity=0.284 Sum_probs=41.9 Q ss_pred Ccceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) -+.|+++ ++|+.+ .+||+.++.+|+..+|+... + ......|||||+ T Consensus 200 ~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~-g-----------------------~~~~~~~SPDG~--- 252 (427) T PRK02889 200 SPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVANFK-G-----------------------SNSAPAWSPDGR--- 252 (427) T ss_pred cceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEeecCC-C-----------------------CccceEECCCCC--- Confidence 3566666 788763 34999999888887886431 1 123467999999 Q ss_pred eEEEEEEEcCCcceeEE Q psy3287 101 YRILYEEIDESDVKIYN 117 (122) Q Consensus 101 ~~iay~~~Des~V~~~~ 117 (122) +|+|..-++.....|. T Consensus 253 -~la~~~~~~g~~~Iy~ 268 (427) T PRK02889 253 -TLAVALSRDGNSQIYT 268 (427) T ss_pred -EEEEEEccCCCceEEE Confidence 9999765554444443 No 20 >PRK01742 tolB translocation protein TolB; Provisional Probab=95.18 E-value=0.2 Score=41.34 Aligned_cols=62 Identities=16% Similarity=0.329 Sum_probs=40.7 Q ss_pred Ccceece---EEEEE--eC--CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSR--YQ--GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr--~~--nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) .+.|+++ ++|.. ++ +||+.++.++..++||.. . .......|||||+ T Consensus 252 ~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~--~----------------------~~~~~~~wSpDG~--- 304 (429) T PRK01742 252 APAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSG--A----------------------GNNTEPSWSPDGQ--- 304 (429) T ss_pred ceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccC--C----------------------CCcCCEEECCCCC--- Confidence 4677776 77764 33 488888877777777653 1 0124567999999 Q ss_pred eEEEEEEEcCCcceeEE Q psy3287 101 YRILYEEIDESDVKIYN 117 (122) Q Consensus 101 ~~iay~~~Des~V~~~~ 117 (122) +|+|..-....+..+. T Consensus 305 -~i~f~s~~~g~~~I~~ 320 (429) T PRK01742 305 -SILFTSDRSGSPQVYR 320 (429) T ss_pred -EEEEEECCCCCceEEE Confidence 9999865444444443 No 21 >PRK01742 tolB translocation protein TolB; Provisional Probab=95.15 E-value=0.21 Score=41.20 Aligned_cols=61 Identities=11% Similarity=0.292 Sum_probs=40.0 Q ss_pred Ccceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) -+.|+++ ++|+... .||+.++.+|..++|+... + ......|||||+ T Consensus 208 ~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~-g-----------------------~~~~~~wSPDG~--- 260 (429) T PRK01742 208 SPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFR-G-----------------------HNGAPAFSPDGS--- 260 (429) T ss_pred cceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCC-C-----------------------ccCceeECCCCC--- Confidence 4667776 8887643 4999998877766665331 1 012457999999 Q ss_pred eEEEEEEEcCCcceeE Q psy3287 101 YRILYEEIDESDVKIY 116 (122) Q Consensus 101 ~~iay~~~Des~V~~~ 116 (122) +|+|..-.+..+.++ T Consensus 261 -~La~~~~~~g~~~Iy 275 (429) T PRK01742 261 -RLAFASSKDGVLNIY 275 (429) T ss_pred -EEEEEEecCCcEEEE Confidence 999986544444433 No 22 >PRK04922 tolB translocation protein TolB; Provisional Probab=94.98 E-value=0.3 Score=40.29 Aligned_cols=53 Identities=13% Similarity=0.224 Sum_probs=35.4 Q ss_pred cceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.|+++ ++|+.. ..||+.++.+|..++|+.. .+. ....-|||||+ T Consensus 209 p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~-~g~-----------------------~~~~~~SpDG~---- 260 (433) T PRK04922 209 PAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELVASF-RGI-----------------------NGAPSFSPDGR---- 260 (433) T ss_pred ccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEeccC-CCC-----------------------ccCceECCCCC---- Confidence 344555 788763 3599999888877777643 110 13456999999 Q ss_pred EEEEEEEc Q psy3287 102 RILYEEID 109 (122) Q Consensus 102 ~iay~~~D 109 (122) +|+|..-. T Consensus 261 ~l~~~~s~ 268 (433) T PRK04922 261 RLALTLSR 268 (433) T ss_pred EEEEEEeC Confidence 99987543 No 23 >TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear. Probab=94.64 E-value=0.34 Score=38.98 Aligned_cols=53 Identities=19% Similarity=0.303 Sum_probs=36.3 Q ss_pred ceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCceE Q psy3287 30 SYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVYR 102 (122) Q Consensus 30 ~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~ 102 (122) .|+++ ++|+.+. +||+.++.++..++|+..+. ......||||++ + T Consensus 284 ~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~~------------------------~~~~~~~spdg~----~ 335 (417) T TIGR02800 284 SWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRGG------------------------YNASPSWSPDGD----L 335 (417) T ss_pred EECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCC------------------------CccCeEECCCCC----E Confidence 45554 6776543 68988887777777776521 123557999999 9 Q ss_pred EEEEEEcC Q psy3287 103 ILYEEIDE 110 (122) Q Consensus 103 iay~~~De 110 (122) |+|...+. T Consensus 336 i~~~~~~~ 343 (417) T TIGR02800 336 IAFVHREG 343 (417) T ss_pred EEEEEccC Confidence 99887665 No 24 >COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion] Probab=94.40 E-value=0.2 Score=42.07 Aligned_cols=69 Identities=16% Similarity=0.176 Sum_probs=49.6 Q ss_pred CCCCCCCcceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecC Q psy3287 22 DPLAAGIPSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQP 94 (122) Q Consensus 22 ~~~~~Gv~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSP 94 (122) .+...+.+.|+++ ++|+.. -+||+.++.++..++||.. . +.+..--||| T Consensus 236 ~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~--~----------------------gi~~~Ps~sp 291 (425) T COG0823 236 FNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNG--F----------------------GINTSPSWSP 291 (425) T ss_pred cCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceecccC--C----------------------ccccCccCCC Confidence 4567778999997 888874 4699999988888887765 2 1123345899 Q ss_pred CCCCCceEEEEEEEcCCcceeEECC Q psy3287 95 KTTDGVYRILYEEIDESDVKIYNFP 119 (122) Q Consensus 95 d~~~~~~~iay~~~Des~V~~~~~~ 119 (122) ||+ +|+|. .|...-+.+.+. T Consensus 292 dG~----~ivf~-Sdr~G~p~I~~~ 311 (425) T COG0823 292 DGS----KIVFT-SDRGGRPQIYLY 311 (425) T ss_pred CCC----EEEEE-eCCCCCcceEEE Confidence 999 99998 566665555443 No 25 >COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion] Probab=92.64 E-value=0.66 Score=39.02 Aligned_cols=54 Identities=22% Similarity=0.417 Sum_probs=41.8 Q ss_pred Ccceece---EEEEEeC----CEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCc Q psy3287 28 IPSYVTQ---EEFSRYQ----GYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGV 100 (122) Q Consensus 28 v~~f~~~---~aFvr~~----nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~ 100 (122) .+.++++ ++|+-+. +||+.+...+..+|||..+... .-=-|||||+ T Consensus 286 ~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~~~------------------------~~p~~SpdG~--- 338 (425) T COG0823 286 SPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFSGGGN------------------------SNPVWSPDGD--- 338 (425) T ss_pred CccCCCCCCEEEEEeCCCCCcceEEECCCCCceeEeeccCCCC------------------------cCccCCCCCC--- Confidence 6777777 8888654 6999998888889999984431 1113999999 Q ss_pred eEEEEEEEc Q psy3287 101 YRILYEEID 109 (122) Q Consensus 101 ~~iay~~~D 109 (122) +|+|+..+ T Consensus 339 -~i~~~~~~ 346 (425) T COG0823 339 -KIVFESSS 346 (425) T ss_pred -EEEEEecc Confidence 99999955 No 26 >PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A .... Probab=92.44 E-value=0.26 Score=26.99 Aligned_cols=19 Identities=16% Similarity=0.342 Sum_probs=14.7 Q ss_pred ceEEecCCCCCCceEEEEEEEcC Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDE 110 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~De 110 (122) ...-|||||+ +|+|...-. T Consensus 12 ~~p~~SpDGk----~i~f~s~~~ 30 (39) T PF07676_consen 12 GSPAWSPDGK----YIYFTSNRN 30 (39) T ss_dssp EEEEE-TTSS----EEEEEEECT T ss_pred cCEEEecCCC----EEEEEecCC Confidence 4456999999 999998765 No 27 >TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear. Probab=92.13 E-value=1.7 Score=34.92 Aligned_cols=52 Identities=10% Similarity=0.192 Sum_probs=32.9 Q ss_pred cceece---EEEEEe----CCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCce Q psy3287 29 PSYVTQ---EEFSRY----QGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVY 101 (122) Q Consensus 29 ~~f~~~---~aFvr~----~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~ 101 (122) +.++++ ++|+.. ..||+.++.++...+++.. . +......|||||+ T Consensus 195 p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~--~----------------------~~~~~~~~spDg~---- 246 (417) T TIGR02800 195 PAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVASF--P----------------------GMNGAPAFSPDGS---- 246 (417) T ss_pred ccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecC--C----------------------CCccceEECCCCC---- Confidence 345555 777754 4588888877766665533 1 0112356999999 Q ss_pred EEEEEEE Q psy3287 102 RILYEEI 108 (122) Q Consensus 102 ~iay~~~ 108 (122) +|+|..- T Consensus 247 ~l~~~~~ 253 (417) T TIGR02800 247 KLAVSLS 253 (417) T ss_pred EEEEEEC Confidence 8888643 No 28 >COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown] Probab=91.67 E-value=0.53 Score=40.85 Aligned_cols=49 Identities=24% Similarity=0.373 Sum_probs=34.9 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceece---EEEEE--------eCCEEEEeCCCCCeeEEeccc Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQ---EEFSR--------YQGYWWQPKTTGSTQRLTFAH 60 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~---~aFvr--------~~nLyv~~~~~~~~~rlT~~~ 60 (122) .+|+.+|||-. -|.- .-+.|+++ +||.| -.+||+.+..+|+.+||||-| T Consensus 67 k~g~~~ritS~-lGVv---------nn~kf~pdGrkvaf~rv~~~ss~~taDly~v~~e~Ge~kRiTyfG 126 (668) T COG4946 67 KDGKPLRITSG-LGVV---------NNPKFSPDGRKVAFSRVMLGSSLQTADLYVVPSEDGEAKRITYFG 126 (668) T ss_pred ccCCeeEEecc-ccee---------ccccCCCCCcEEEEEEEEecCCCccccEEEEeCCCCcEEEEEEec Confidence 46888999874 3331 12334444 67743 679999999999999999993 No 29 >PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A .... Probab=91.08 E-value=0.4 Score=36.12 Aligned_cols=66 Identities=11% Similarity=0.155 Sum_probs=38.5 Q ss_pred eEEEEEeCCEEEEeCCC--------CCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCceEEEE Q psy3287 34 QEEFSRYQGYWWQPKTT--------GSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILY 105 (122) Q Consensus 34 ~~aFvr~~nLyv~~~~~--------~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay 105 (122) ++++..+++||+.+... |...++..+ +. .. +..+.+....|+.||||++ .|.+ T Consensus 90 D~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~--~~------------~~-~~~~~~~~pNGi~~s~dg~----~lyv 150 (246) T PF08450_consen 90 DVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD--GK------------VT-VVADGLGFPNGIAFSPDGK----TLYV 150 (246) T ss_dssp EEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT--SE------------EE-EEEEEESSEEEEEEETTSS----EEEE T ss_pred eEEEcCCCCEEEEecCCCccccccccceEEECCC--Ce------------EE-EEecCcccccceEECCcch----heee Confidence 38999999999987643 223444444 21 11 1123466679999999998 5555 Q ss_pred EEEcCCcceeEEC Q psy3287 106 EEIDESDVKIYNF 118 (122) Q Consensus 106 ~~~Des~V~~~~~ 118 (122) .......|-.+.+ T Consensus 151 ~ds~~~~i~~~~~ 163 (246) T PF08450_consen 151 ADSFNGRIWRFDL 163 (246) T ss_dssp EETTTTEEEEEEE T ss_pred cccccceeEEEec Confidence 5555554444443 No 30 >TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown. Probab=88.89 E-value=1.7 Score=40.10 Aligned_cols=53 Identities=11% Similarity=0.147 Sum_probs=36.9 Q ss_pred cceeceEEEEEe--CCEEEEeCCCCCeeEE-eccccCCcccCCCCeeeccCCccchheecccceEEecCCCCCCceEEEE Q psy3287 29 PSYVTQEEFSRY--QGYWWQPKTTGSTQRL-TFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILY 105 (122) Q Consensus 29 ~~f~~~~aFvr~--~nLyv~~~~~~~~~rl-T~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay 105 (122) -+|...+||+-+ ++|.+.+....+.+.| +.+ . ..++ +. -|||||+ +||| T Consensus 315 ~~f~tkiAfv~~~~~~L~~~D~dG~n~~~ve~~~--~------~~i~--------------sP--~~SPDG~----~vAY 366 (912) T TIGR02171 315 GTYKAKLAFRNDVTGNLAYIDYTKGASRAVEIED--T------ISVY--------------HP--DISPDGK----KVAF 366 (912) T ss_pred ccceeeEEEEEcCCCeEEEEecCCCCceEEEecC--C------Ccee--------------cC--cCCCCCC----EEEE Confidence 457777999986 5899999877777777 555 2 1111 11 3899999 9999 Q ss_pred -EEEc Q psy3287 106 -EEID 109 (122) Q Consensus 106 -~~~D 109 (122) ..+. T Consensus 367 ~ts~e 371 (912) T TIGR02171 367 CTGIE 371 (912) T ss_pred EEeec Confidence 4443 No 31 >PRK13616 lipoprotein LpqB; Provisional Probab=88.21 E-value=2.2 Score=37.46 Aligned_cols=18 Identities=6% Similarity=-0.178 Sum_probs=15.7 Q ss_pred ccceEEecCCCCCCceEEEEEE Q psy3287 86 RYQGYWWQPKTTDGVYRILYEE 107 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~ 107 (122) .-+.+-|||||+ +|||.. T Consensus 449 ~Issl~wSpDG~----RiA~i~ 466 (591) T PRK13616 449 PISELQLSRDGV----RAAMII 466 (591) T ss_pred CcCeEEECCCCC----EEEEEE Confidence 467889999999 999976 No 32 >PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction:Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A .... Probab=87.55 E-value=0.086 Score=42.44 Aligned_cols=39 Identities=33% Similarity=0.746 Sum_probs=28.8 Q ss_pred CCCceeeeeecCCCCCCCCCCCCCCCCcceeceEE-EEEeCCEEEEe Q psy3287 2 VLGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEE-FSRYQGYWWQP 47 (122) Q Consensus 2 ~~~~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~~a-Fvr~~nLyv~~ 47 (122) .+++++|||..+. .....|+++++.+++ |.+...+||++ T Consensus 71 ~~~~~~~lT~dg~-------~~i~nG~~dwvyeEEv~~~~~~~~WSp 110 (353) T PF00930_consen 71 ATGQETQLTTDGE-------PGIYNGVPDWVYEEEVFDRRSAVWWSP 110 (353) T ss_dssp TTSEEEESES--T-------TTEEESB--HHHHHHTSSSSBSEEE-T T ss_pred CCCCeEEeccccc-------eeEEcCccceeccccccccccceEECC Confidence 5789999999873 577899999999876 88899999987 No 33 >PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A. Probab=87.23 E-value=2.8 Score=35.13 Aligned_cols=56 Identities=18% Similarity=0.081 Sum_probs=25.5 Q ss_pred CCC-ceeeeeecCCCCCCCCCCCCCCCCcceeceEEEEE----eCCEEEEeCCCCCeeEEecc Q psy3287 2 VLG-STQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSR----YQGYWWQPKTTGSTQRLTFA 59 (122) Q Consensus 2 ~~~-~~~rlt~~~~~~~~~~~~~~~~Gv~~f~~~~aFvr----~~nLyv~~~~~~~~~rlT~~ 59 (122) .|| +..|||.-..-+|.+=|.. .+...=.-.+.|.- ..|||..++.+++.+|||.. T Consensus 18 ~TG~~VtrLT~~~~~~h~~YF~~--~~ft~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTdg 78 (386) T PF14583_consen 18 DTGHRVTRLTPPDGHSHRLYFYQ--NCFTDDGRKLLFASDFDGNRNLYLLDLATGEITQLTDG 78 (386) T ss_dssp TT--EEEE-S-TTS-EE---TTS----B-TTS-EEEEEE-TTSS-EEEEEETTT-EEEE---S T ss_pred CCCceEEEecCCCCcccceeecC--CCcCCCCCEEEEEeccCCCcceEEEEcccCEEEECccC Confidence 356 7788988654344222221 11111112255543 68999999999999999986 No 34 >KOG0318|consensus Probab=72.07 E-value=15 Score=32.32 Aligned_cols=65 Identities=17% Similarity=0.245 Sum_probs=40.4 Q ss_pred CcceeceEEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchhe----e-cccceEEecCCCCCCceE Q psy3287 28 IPSYVTQEEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEE----F-SRYQGYWWQPKTTDGVYR 102 (122) Q Consensus 28 v~~f~~~~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE----~-~~~~g~wWSPd~~~~~~~ 102 (122) ...|+-.+-|.-++.+|+....+|+ -+.|||+. + +.|.+-| - +.--++-||||++ + T Consensus 189 HskFV~~VRysPDG~~Fat~gsDgk--i~iyDGkt------g-------e~vg~l~~~~aHkGsIfalsWsPDs~----~ 249 (603) T KOG0318|consen 189 HSKFVNCVRYSPDGSRFATAGSDGK--IYIYDGKT------G-------EKVGELEDSDAHKGSIFALSWSPDST----Q 249 (603) T ss_pred cccceeeEEECCCCCeEEEecCCcc--EEEEcCCC------c-------cEEEEecCCCCccccEEEEEECCCCc----e Confidence 5567777888888888887655553 44567552 1 1222211 1 2334568999999 8 Q ss_pred EEEEEEcCC Q psy3287 103 ILYEEIDES 111 (122) Q Consensus 103 iay~~~Des 111 (122) +|-..-|-+ T Consensus 250 ~~T~SaDkt 258 (603) T KOG0318|consen 250 FLTVSADKT 258 (603) T ss_pred EEEecCCce Confidence 887776643 No 35 >PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. Probab=70.65 E-value=7.8 Score=28.69 Aligned_cols=28 Identities=18% Similarity=0.411 Sum_probs=20.1 Q ss_pred cceEEecCCCCCCceEEEEEEEc-----CCcceeEEC Q psy3287 87 YQGYWWQPKTTDGVYRILYEEID-----ESDVKIYNF 118 (122) Q Consensus 87 ~~g~wWSPd~~~~~~~iay~~~D-----es~V~~~~~ 118 (122) -+.+-|||||+ +||.+... +..+.++++ T Consensus 146 ~t~~~WsPdGr----~~~ta~t~~r~~~dng~~Iw~~ 178 (194) T PF08662_consen 146 ATDVEWSPDGR----YLATATTSPRLRVDNGFKIWSF 178 (194) T ss_pred EEEEEEcCCCC----EEEEEEeccceeccccEEEEEe Confidence 46678999999 99988775 444455543 No 36 >PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A. Probab=67.77 E-value=17 Score=30.56 Aligned_cols=52 Identities=13% Similarity=0.068 Sum_probs=26.3 Q ss_pred CCCCceeeeeecCCCCCCCCCCCCCCC-CcceeceEEEEEe-CCEEEEeCCCCCeeEEecc Q psy3287 1 MVLGSTQRLTFAHKGGRNLTDDPLAAG-IPSYVTQEEFSRY-QGYWWQPKTTGSTQRLTFA 59 (122) Q Consensus 1 ~~~~~~~rlt~~~~~~~~~~~~~~~~G-v~~f~~~~aFvr~-~nLyv~~~~~~~~~rlT~~ 59 (122) .+|++..|||... + +..-.+ ++.-.-++.|+++ .+|+-.++.+.+++.|... T Consensus 67 L~t~~i~QLTdg~-g------~~~~g~~~s~~~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~ 120 (386) T PF14583_consen 67 LATGEITQLTDGP-G------DNTFGGFLSPDDRALYYVKNGRSLRRVDLDTLEERVVYEV 120 (386) T ss_dssp TTT-EEEE---SS--------B-TTT-EE-TTSSEEEEEETTTEEEEEETTT--EEEEEE- T ss_pred cccCEEEECccCC-C------CCccceEEecCCCeEEEEECCCeEEEEECCcCcEEEEEEC Confidence 3689999999953 2 111122 2222333778875 4899999999887766544 No 37 >PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues. Probab=66.10 E-value=29 Score=26.73 Aligned_cols=23 Identities=22% Similarity=0.196 Sum_probs=18.8 Q ss_pred ccceEEecCCCCCCceEEEEEEEcCCc Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEIDESD 112 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~Des~ 112 (122) +-+.+-+||||+ |||+..-+... T Consensus 113 ~I~~l~vSpDG~----RvA~v~~~~~~ 135 (253) T PF10647_consen 113 RITALRVSPDGT----RVAVVVEDGGG 135 (253) T ss_pred ceEEEEECCCCc----EEEEEEecCCC Confidence 567899999999 99999865443 No 38 >PRK13616 lipoprotein LpqB; Provisional Probab=64.96 E-value=33 Score=30.18 Aligned_cols=31 Identities=10% Similarity=0.089 Sum_probs=17.6 Q ss_pred Ccceece---EEEEE---------eCCEEEEeCCCCCeeEEecc Q psy3287 28 IPSYVTQ---EEFSR---------YQGYWWQPKTTGSTQRLTFA 59 (122) Q Consensus 28 v~~f~~~---~aFvr---------~~nLyv~~~~~~~~~rlT~~ 59 (122) .+..+++ ++|++ ...||+.+. .+..+++|.. T Consensus 354 spaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~-gg~~~~lt~g 396 (591) T PRK13616 354 SAALSRSGRQVAAVVTLGRGAPDPASSLWVGPL-GGVAVQVLEG 396 (591) T ss_pred cceECCCCCEEEEEEeecCCCCCcceEEEEEeC-CCcceeeecC Confidence 3334444 88887 236777764 3344666543 No 39 >COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown] Probab=56.59 E-value=9.9 Score=33.30 Aligned_cols=25 Identities=20% Similarity=0.280 Sum_probs=23.7 Q ss_pred EEEEEeCCEEEEeCCCCCeeEEecc Q psy3287 35 EEFSRYQGYWWQPKTTGSTQRLTFA 59 (122) Q Consensus 35 ~aFvr~~nLyv~~~~~~~~~rlT~~ 59 (122) +.|+-..+||...+.+|+.+|||.. T Consensus 53 IiFt~~DdlWe~slk~g~~~ritS~ 77 (668) T COG4946 53 IIFTCCDDLWEYSLKDGKPLRITSG 77 (668) T ss_pred EEEEechHHHHhhhccCCeeEEecc Confidence 8999999999999999999999976 No 40 >PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. Probab=55.44 E-value=57 Score=24.03 Aligned_cols=27 Identities=11% Similarity=0.401 Sum_probs=20.0 Q ss_pred ceEEecCCCCCCceEEEEEEEcCC--cceeEEC Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDES--DVKIYNF 118 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~Des--~V~~~~~ 118 (122) ..+.|||+|+ +||..-++.. .+..+.+ T Consensus 104 n~i~wsP~G~----~l~~~g~~n~~G~l~~wd~ 132 (194) T PF08662_consen 104 NTISWSPDGR----FLVLAGFGNLNGDLEFWDV 132 (194) T ss_pred eEEEECCCCC----EEEEEEccCCCcEEEEEEC Confidence 4678999999 9999887744 4555543 No 41 >COG3386 Gluconolactonase [Carbohydrate transport and metabolism] Probab=54.83 E-value=17 Score=29.38 Aligned_cols=31 Identities=10% Similarity=0.317 Sum_probs=22.8 Q ss_pred ecccceEEecCCCCCCceEEEEEEEcCCcceeEEC Q psy3287 84 FSRYQGYWWQPKTTDGVYRILYEEIDESDVKIYNF 118 (122) Q Consensus 84 ~~~~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~ 118 (122) +....|+.||||++ .+.+.......+..+.+ T Consensus 162 ~~~~NGla~SpDg~----tly~aDT~~~~i~r~~~ 192 (307) T COG3386 162 LTIPNGLAFSPDGK----TLYVADTPANRIHRYDL 192 (307) T ss_pred EEecCceEECCCCC----EEEEEeCCCCeEEEEec Confidence 55578999999998 77777666666665554 No 42 >TIGR00667 aat leucyl/phenylalanyl-tRNA--protein transferase. The N-terminal residue controls the biological half-life of many proteins via the N-end rule pathway. This enzyme transfers a Leu or Phe to the amino end of certain proteins to enable degradation. Probab=53.24 E-value=11 Score=28.70 Aligned_cols=26 Identities=23% Similarity=0.634 Sum_probs=17.5 Q ss_pred eccCCccchheecccceEEecCCCCCCceEEEEE Q psy3287 73 AGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILYE 106 (122) Q Consensus 73 ~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~ 106 (122) +|+--|-.++ ..-+|||||-. .|++. T Consensus 13 ~GiFPw~~~~----~pi~WwsP~pR----~vl~p 38 (185) T TIGR00667 13 QGIFPWFDPG----DPILWWSPDPR----AVLWP 38 (185) T ss_pred cCeecCCCCC----CCeeEEcCCCC----eEecc Confidence 5666664332 25679999988 77764 No 43 >COG5169 HSF1 Heat shock transcription factor [Transcription] Probab=50.04 E-value=15 Score=29.67 Aligned_cols=19 Identities=16% Similarity=0.403 Sum_probs=14.2 Q ss_pred cccceEEecCCCCCCceEEEEEE Q psy3287 85 SRYQGYWWQPKTTDGVYRILYEE 107 (122) Q Consensus 85 ~~~~g~wWSPd~~~~~~~iay~~ 107 (122) .-+..+||||||. .++..- T Consensus 25 e~~k~I~Ws~~G~----sfvI~~ 43 (282) T COG5169 25 EYYKLIQWSPDGR----SFVILD 43 (282) T ss_pred ccCCceEECCCCC----EEEEeC Confidence 3468899999999 665543 No 44 >PF09729 Gti1_Pac2: Gti1/Pac2 family; InterPro: IPR018608 In Schizosaccharomyces pombe (Fission yeast) the gti1 protein promotes the onset of gluconate uptake upon glucose starvation []. In S. pombe the Pac2 protein controls the onset of sexual development, by inhibiting the expression of ste11, in a pathway that is independent of the cAMP cascade []. Probab=48.46 E-value=13 Score=27.70 Aligned_cols=33 Identities=30% Similarity=0.642 Sum_probs=24.5 Q ss_pred Cccchhe---ecc-cceEEecCC---CCCCceEEEEEEEcCCcc Q psy3287 77 SYVTQEE---FSR-YQGYWWQPK---TTDGVYRILYEEIDESDV 113 (122) Q Consensus 77 d~v~eEE---~~~-~~g~wWSPd---~~~~~~~iay~~~Des~V 113 (122) -||++|| |.| -.|.-|||- |. +|+|-++|.... T Consensus 45 vfV~~e~~sgikRWTDG~~WspSri~g~----fLiYrEl~~~~~ 84 (168) T PF09729_consen 45 VFVFEEEESGIKRWTDGRSWSPSRILGN----FLIYRELDKPTP 84 (168) T ss_pred EEEEeccccCceeccCCcccCcccccCc----eeEEEEeccCCC Confidence 4677776 545 367789995 77 999999987663 No 45 >PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A .... Probab=44.66 E-value=59 Score=26.23 Aligned_cols=22 Identities=18% Similarity=0.428 Sum_probs=18.5 Q ss_pred ceEEecCCCCCCceEEEEEEEcCCcc Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDESDV 113 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~Des~V 113 (122) +.+-|++|++ .+.|.++|+..- T Consensus 173 ~~~~W~~d~~----~~~y~~~~~~~~ 194 (414) T PF02897_consen 173 SSVSWSDDGK----GFFYTRFDEDQR 194 (414) T ss_dssp EEEEECTTSS----EEEEEECSTTTS T ss_pred ceEEEeCCCC----EEEEEEeCcccc Confidence 4488999999 999999999643 No 46 >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. Probab=42.85 E-value=1.3e+02 Score=21.92 Aligned_cols=29 Identities=14% Similarity=0.164 Sum_probs=20.2 Q ss_pred cceEEecCCCCCCceEEEEEEEcCCcceeEECC Q psy3287 87 YQGYWWQPKTTDGVYRILYEEIDESDVKIYNFP 119 (122) Q Consensus 87 ~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~~ 119 (122) ...+.||||++ +|+...-++..|..+.+. T Consensus 251 ~~~~~~~~~g~----~l~~~~~~~~~i~v~d~~ 279 (300) T TIGR03866 251 VWQLAFTPDEK----YLLTTNGVSNDVSVIDVA 279 (300) T ss_pred cceEEECCCCC----EEEEEcCCCCeEEEEECC Confidence 34668999999 876655556677776653 No 47 >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. Probab=42.18 E-value=1.3e+02 Score=21.86 Aligned_cols=25 Identities=8% Similarity=0.137 Sum_probs=12.4 Q ss_pred eEEecCCCCCCceEEEEEEEcCCcceeEE Q psy3287 89 GYWWQPKTTDGVYRILYEEIDESDVKIYN 117 (122) Q Consensus 89 g~wWSPd~~~~~~~iay~~~Des~V~~~~ 117 (122) +..||||++ .++...-++..|..+. T Consensus 35 ~l~~~~dg~----~l~~~~~~~~~v~~~d 59 (300) T TIGR03866 35 GITLSKDGK----LLYVCASDSDTIQVID 59 (300) T ss_pred ceEECCCCC----EEEEEECCCCeEEEEE Confidence 455677766 5433333444444443 No 48 >PF12566 DUF3748: Protein of unknown function (DUF3748); InterPro: IPR022223 This domain family is found in bacteria and eukaryotes, and is approximately 120 amino acids in length. Probab=41.84 E-value=22 Score=25.19 Aligned_cols=18 Identities=11% Similarity=0.163 Sum_probs=13.9 Q ss_pred ceEEecCCCCCCceEEEEEEEc Q psy3287 88 QGYWWQPKTTDGVYRILYEEID 109 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~D 109 (122) --+-|||||+ +|-|--.| T Consensus 71 HvHvfSpDG~----~lSFTYND 88 (122) T PF12566_consen 71 HVHVFSPDGS----WLSFTYND 88 (122) T ss_pred cceEECCCCC----EEEEEecc Confidence 3457999999 99887654 No 49 >PLN02919 haloacid dehalogenase-like hydrolase family protein Probab=39.65 E-value=1.5e+02 Score=27.94 Aligned_cols=59 Identities=10% Similarity=0.031 Sum_probs=36.1 Q ss_pred EEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCcc----chheecccceEEecCCCC Q psy3287 35 EEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYV----TQEEFSRYQGYWWQPKTT 97 (122) Q Consensus 35 ~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v----~eEE~~~~~g~wWSPd~~ 97 (122) ++|..+++||+++..++..+.+..++.-.. ...-.|.+.|. .+..|.+-.|+..+|||+ T Consensus 809 vavd~dG~LYVADs~N~rIrviD~~tg~v~----tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~ 871 (1057) T PLN02919 809 VLCAKDGQIYVADSYNHKIKKLDPATKRVT----TLAGTGKAGFKDGKALKAQLSEPAGLALGENGR 871 (1057) T ss_pred eeEeCCCcEEEEECCCCEEEEEECCCCeEE----EEeccCCcCCCCCcccccccCCceEEEEeCCCC Confidence 889999999999977776666654311100 00011222221 244577788999999986 No 50 >KOG0643|consensus Probab=37.16 E-value=13 Score=30.30 Aligned_cols=21 Identities=24% Similarity=0.458 Sum_probs=17.5 Q ss_pred CccchheecccceE-------EecCCCC Q psy3287 77 SYVTQEEFSRYQGY-------WWQPKTT 97 (122) Q Consensus 77 d~v~eEE~~~~~g~-------wWSPd~~ 97 (122) +-++||||+|.+|+ ..+|||+ T Consensus 273 h~i~eEEigrvkGHFGPINsvAfhPdGk 300 (327) T KOG0643|consen 273 HLIFEEEIGRVKGHFGPINSVAFHPDGK 300 (327) T ss_pred HHHHHHHhccccccccCcceeEECCCCc Confidence 45889999998775 5899998 No 51 >PRK10115 protease 2; Provisional Probab=36.59 E-value=44 Score=29.81 Aligned_cols=28 Identities=18% Similarity=0.172 Sum_probs=21.1 Q ss_pred ccceEEecCCCCCCceEEEEEEEcCCcceeEEC Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEIDESDVKIYNF 118 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~ 118 (122) .-.++-||||++ +|||. .|...-+.+.| T Consensus 128 ~l~~~~~Spdg~----~la~~-~d~~G~E~~~l 155 (686) T PRK10115 128 TLGGMAITPDNT----IMALA-EDFLSRRQYGI 155 (686) T ss_pred EEeEEEECCCCC----EEEEE-ecCCCcEEEEE Confidence 346778999999 89888 77776666543 No 52 >PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function. Probab=36.51 E-value=1.1e+02 Score=26.57 Aligned_cols=20 Identities=20% Similarity=0.333 Sum_probs=15.5 Q ss_pred ece-EEEEEeCCEEEEeCCCC Q psy3287 32 VTQ-EEFSRYQGYWWQPKTTG 51 (122) Q Consensus 32 ~~~-~aFvr~~nLyv~~~~~~ 51 (122) +|| ++|..+++|||+.-..+ T Consensus 437 sPDNL~~d~~G~LwI~eD~~~ 457 (524) T PF05787_consen 437 SPDNLAFDPDGNLWIQEDGGG 457 (524) T ss_pred CCCceEECCCCCEEEEeCCCC Confidence 344 99999999999975443 No 53 >TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis. Probab=35.84 E-value=2e+02 Score=24.74 Aligned_cols=55 Identities=16% Similarity=0.279 Sum_probs=32.9 Q ss_pred EEEEEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCC Q psy3287 35 EEFSRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPK 95 (122) Q Consensus 35 ~aFvr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd 95 (122) ++|.-+++||+.....|+.++|...+ +. .....+++..+....-+-.-++.++|| T Consensus 35 maflPDG~llVtER~~G~I~~v~~~~-~~-----~~~~~~l~~v~~~~ge~GLlglal~Pd 89 (454) T TIGR03606 35 LLWGPDNQLWVTERATGKILRVNPET-GE-----VKVVFTLPEIVNDAQHNGLLGLALHPD 89 (454) T ss_pred EEEcCCCeEEEEEecCCEEEEEeCCC-Cc-----eeeeecCCceeccCCCCceeeEEECCC Confidence 89999999999987778888875432 11 223345555443211122345566766 No 54 >PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A. Probab=35.32 E-value=47 Score=16.84 Aligned_cols=16 Identities=0% Similarity=-0.112 Sum_probs=12.9 Q ss_pred EEEEEeCCEEEEeCCC Q psy3287 35 EEFSRYQGYWWQPKTT 50 (122) Q Consensus 35 ~aFvr~~nLyv~~~~~ 50 (122) ++...+++||+++... T Consensus 7 vav~~~g~i~VaD~~n 22 (28) T PF01436_consen 7 VAVDSDGNIYVADSGN 22 (28) T ss_dssp EEEETTSEEEEEECCC T ss_pred EEEeCCCCEEEEECCC Confidence 7778999999998543 No 55 >COG3211 PhoX Predicted phosphatase [General function prediction only] Probab=34.95 E-value=54 Score=29.23 Aligned_cols=18 Identities=17% Similarity=0.427 Sum_probs=14.7 Q ss_pred ce-EEEEEeCCEEEEeCCC Q psy3287 33 TQ-EEFSRYQGYWWQPKTT 50 (122) Q Consensus 33 ~~-~aFvr~~nLyv~~~~~ 50 (122) || ++|...++||++.-.. T Consensus 502 PDnl~fD~~GrLWi~TDg~ 520 (616) T COG3211 502 PDNLAFDPWGRLWIQTDGS 520 (616) T ss_pred CCceEECCCCCEEEEecCC Confidence 44 9999999999997543 No 56 >KOG2394|consensus Probab=34.90 E-value=25 Score=31.12 Aligned_cols=31 Identities=29% Similarity=0.462 Sum_probs=18.8 Q ss_pred cCCccchhee---c-ccceE---EecCCCCCCceEEEEEEEc Q psy3287 75 IPSYVTQEEF---S-RYQGY---WWQPKTTDGVYRILYEEID 109 (122) Q Consensus 75 ~~d~v~eEE~---~-~~~g~---wWSPd~~~~~~~iay~~~D 109 (122) +-+|..||=+ . -+-|+ .|||||+ +|+----| T Consensus 316 vF~fdt~eLlg~mkSYFGGLLCvcWSPDGK----yIvtGGED 353 (636) T KOG2394|consen 316 IFDFDTQELLGVMKSYFGGLLCVCWSPDGK----YIVTGGED 353 (636) T ss_pred EeeccHHHHHHHHHhhccceEEEEEcCCcc----EEEecCCc Confidence 4466666632 2 23333 7999999 88754433 No 57 >PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B. Probab=32.98 E-value=87 Score=20.72 Aligned_cols=32 Identities=16% Similarity=0.152 Sum_probs=27.2 Q ss_pred ccceEEecCCCCCCceEEEEEEEcCCcceeEECCCC Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEIDESDVKIYNFPSS 121 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~~~~ 121 (122) -..|+.-|||++ .|++.++--..|..|+|.-| T Consensus 58 fpNGVals~d~~----~vlv~Et~~~Ri~rywl~Gp 89 (89) T PF03088_consen 58 FPNGVALSPDES----FVLVAETGRYRILRYWLKGP 89 (89) T ss_dssp SEEEEEE-TTSS----EEEEEEGGGTEEEEEESSST T ss_pred ccCeEEEcCCCC----EEEEEeccCceEEEEEEeCC Confidence 357888899999 99999999999999998755 No 58 >TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis. Probab=31.71 E-value=2.1e+02 Score=24.51 Aligned_cols=70 Identities=14% Similarity=0.194 Sum_probs=44.1 Q ss_pred cceece-EEEEE---------eCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchheecccceEEecCCCCC Q psy3287 29 PSYVTQ-EEFSR---------YQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEFSRYQGYWWQPKTTD 98 (122) Q Consensus 29 ~~f~~~-~aFvr---------~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~~~~~g~wWSPd~~~ 98 (122) ++-++. ++|.. +++|++..+..+...||..++... . ..|...-+-++ .+|.+-+-=+|||. T Consensus 347 psiapsg~~~y~~~g~~~p~w~g~llv~~L~~~~l~r~~l~~~~~------~-v~~~~~~~~~~-~~RiRdv~~~pDg~- 417 (454) T TIGR03606 347 PTIAPSSAYYYKGGEKGITGWENSLLIPSLKRGVIYRIKLDPDYS------T-VYGDAVPMFKT-NNRYRDVIASPDGN- 417 (454) T ss_pred CCcCCceeEEEecCcccCcccCCCEEEEEcCCCeEEEEEecCCcc------e-ecceeEEeecC-CCeeEEEEECCCCC- Confidence 443444 88887 489999999999999999873320 1 11111111122 46888888899974 Q ss_pred CceEEEEEEEcCC Q psy3287 99 GVYRILYEEIDES 111 (122) Q Consensus 99 ~~~~iay~~~Des 111 (122) .+|+-+|+. T Consensus 418 ----~iy~~td~~ 426 (454) T TIGR03606 418 ----VLYVATDNF 426 (454) T ss_pred ----EEEEEEcCC Confidence 256666643 No 59 >PF10584 Proteasome_A_N: Proteasome subunit A N-terminal signature; InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C .... Probab=29.71 E-value=25 Score=17.83 Aligned_cols=6 Identities=17% Similarity=0.589 Sum_probs=5.1 Q ss_pred ecCCCC Q psy3287 92 WQPKTT 97 (122) Q Consensus 92 WSPd~~ 97 (122) |||+|+ T Consensus 8 FSp~Gr 13 (23) T PF10584_consen 8 FSPDGR 13 (23) T ss_dssp BBTTSS T ss_pred ECCCCe Confidence 799986 No 60 >PF06045 Rhamnogal_lyase: Rhamnogalacturonate lyase family; InterPro: IPR010325 Rhamnogalacturonate lyase degrades the rhamnogalacturonan I (RG-I) backbone of pectin []. This family contains mainly members from plants, but also contains the plant pathogen Erwinia chrysanthemi. Probab=29.66 E-value=70 Score=24.62 Aligned_cols=13 Identities=31% Similarity=1.112 Sum_probs=10.1 Q ss_pred cccceEE---ecCCCC Q psy3287 85 SRYQGYW---WQPKTT 97 (122) Q Consensus 85 ~~~~g~w---WSPd~~ 97 (122) ..+.||| |+..|+ T Consensus 54 e~nrGYwD~~W~~~G~ 69 (203) T PF06045_consen 54 ENNRGYWDLVWNEPGS 69 (203) T ss_pred ccCCceEEEecccCCc Confidence 3578898 998876 No 61 >PF08768 DUF1794: Domain of unknown function (DUF1794); InterPro: IPR014878 This protein forms a beta barrel structure. It is sometimes found on proteins containing a THAP (IPR006612 from INTERPRO) domain. ; PDB: 2FR2_A 2A13_A 2Q4N_A 3EMM_A 3IA8_B 2FWV_A. Probab=27.80 E-value=1.3e+02 Score=21.52 Aligned_cols=29 Identities=21% Similarity=0.416 Sum_probs=20.6 Q ss_pred ecccceEE-ecCCCCCCceEEEEEEEcCCcceeE Q psy3287 84 FSRYQGYW-WQPKTTDGVYRILYEEIDESDVKIY 116 (122) Q Consensus 84 ~~~~~g~w-WSPd~~~~~~~iay~~~Des~V~~~ 116 (122) |.+..||| |.|+++ .|.+.-...+.|..+ T Consensus 60 ~h~E~Gf~~~~~~~~----~v~~~~~~p~G~~~l 89 (154) T PF08768_consen 60 FHDETGFWRVDPATN----EVELSLAHPRGVTEL 89 (154) T ss_dssp EEEEEEEEEESTTSE----EEEEEEEETTTEEEE T ss_pred cccceeEEEEeCCCC----EEEEEEEeCCceEEE Confidence 56779996 777776 677777666666554 No 62 >PF15660 Imm49: Immunity protein 49 Probab=27.60 E-value=94 Score=19.93 Aligned_cols=45 Identities=22% Similarity=0.500 Sum_probs=24.6 Q ss_pred CCCCCcceece---EEEEE-eCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCc Q psy3287 24 LAAGIPSYVTQ---EEFSR-YQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSY 78 (122) Q Consensus 24 ~~~Gv~~f~~~---~aFvr-~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~ 78 (122) .-+|+.+|.-. ++..- ...|-+. ..-|++-||.|- ++-+.|.|.| T Consensus 11 llpgvttfidsvrleaiddkadrlmvt-tsvgeearlvyf---------npdfagtptf 59 (84) T PF15660_consen 11 LLPGVTTFIDSVRLEAIDDKADRLMVT-TSVGEEARLVYF---------NPDFAGTPTF 59 (84) T ss_pred hcCCcceeehheeeeeeccccceEEEE-eccCceeEEEEe---------CCCcCCCcch Confidence 45677777655 22222 2233333 344667777776 4556666655 No 63 >PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A. Probab=27.52 E-value=31 Score=27.01 Aligned_cols=32 Identities=16% Similarity=0.152 Sum_probs=17.6 Q ss_pred CCCCCCCCCCCCcceece---EEEEEeCCEEEEeC Q psy3287 17 RNLTDDPLAAGIPSYVTQ---EEFSRYQGYWWQPK 48 (122) Q Consensus 17 ~~~~~~~~~~Gv~~f~~~---~aFvr~~nLyv~~~ 48 (122) ..++.....+|...-.+| ++|..+++||+.+- T Consensus 206 ~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE 240 (248) T PF06977_consen 206 SSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE 240 (248) T ss_dssp EEEE-STTGGG-SS---SEEEEEE-TT--EEEEET T ss_pred EEEEeCCcccCcccccCCccEEEECCCCCEEEEcC Confidence 445555556666677777 99999999999974 No 64 >smart00415 HSF heat shock factor. Probab=27.30 E-value=12 Score=25.14 Aligned_cols=10 Identities=20% Similarity=0.471 Sum_probs=8.5 Q ss_pred ceEEecCCCC Q psy3287 88 QGYWWQPKTT 97 (122) Q Consensus 88 ~g~wWSPd~~ 97 (122) .-++|||||+ T Consensus 20 ~iI~W~~~G~ 29 (105) T smart00415 20 KIISWSPSGK 29 (105) T ss_pred CEEEECCCCC Confidence 3679999998 No 65 >KOG2139|consensus Probab=27.19 E-value=2.3e+02 Score=24.17 Aligned_cols=22 Identities=14% Similarity=0.339 Sum_probs=17.3 Q ss_pred cccceEEecCCCCCCceEEEEEEEcC Q psy3287 85 SRYQGYWWQPKTTDGVYRILYEEIDE 110 (122) Q Consensus 85 ~~~~g~wWSPd~~~~~~~iay~~~De 110 (122) +..+-+-||||++ +|.-+++|- T Consensus 239 gg~slLkwSPdgd----~lfaAt~da 260 (445) T KOG2139|consen 239 GGFSLLKWSPDGD----VLFAATCDA 260 (445) T ss_pred CceeeEEEcCCCC----EEEEecccc Confidence 3346678999999 888888774 No 66 >PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A .... Probab=27.10 E-value=1.8e+02 Score=23.84 Aligned_cols=27 Identities=11% Similarity=0.205 Sum_probs=17.1 Q ss_pred cceEEecCCCCCCceEEEEEEEcCCcceeEE Q psy3287 87 YQGYWWQPKTTDGVYRILYEEIDESDVKIYN 117 (122) Q Consensus 87 ~~g~wWSPd~~~~~~~iay~~~Des~V~~~~ 117 (122) ..++-.||||+ +|+-.-..+..|..+. T Consensus 80 ~~~i~~s~DG~----~~~v~n~~~~~v~v~D 106 (369) T PF02239_consen 80 PRGIAVSPDGK----YVYVANYEPGTVSVID 106 (369) T ss_dssp EEEEEE--TTT----EEEEEEEETTEEEEEE T ss_pred cceEEEcCCCC----EEEEEecCCCceeEec Confidence 46677899999 8877766666555543 No 67 >KOG2314|consensus Probab=26.95 E-value=1.2e+02 Score=27.22 Aligned_cols=97 Identities=18% Similarity=0.254 Sum_probs=55.2 Q ss_pred ecCCCCCCCCCCCCCCCCcceeceEEEE-----EeCCEEEEeCCCCCee----------------EEeccccCCcccCCC Q psy3287 11 FAHKGGRNLTDDPLAAGIPSYVTQEEFS-----RYQGYWWQPKTTGSTQ----------------RLTFAHKGGRNLTDD 69 (122) Q Consensus 11 ~~~~~~~~~~~~~~~~Gv~~f~~~~aFv-----r~~nLyv~~~~~~~~~----------------rlT~~~~g~~~v~~~ 69 (122) |.|.+.++.++.|-..=+++|++-..-+ -...|.+-++.+|... |-.+|.+=-+++..+ T Consensus 247 F~Hp~Vq~idfSP~EkYLVT~s~~p~~~~~~d~e~~~l~IWDI~tG~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~~ 326 (698) T KOG2314|consen 247 FYHPGVQFIDFSPNEKYLVTYSPEPIIVEEDDNEGQQLIIWDIATGLLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTGN 326 (698) T ss_pred ccCCCceeeecCCccceEEEecCCccccCcccCCCceEEEEEccccchhcceeccCCCccccceEEeccCCceeEEeccc Confidence 4578888888888888888888773222 2356778888887533 223332211222222 Q ss_pred Cee-eccCCccchhe----ecccceEEecCCCCCCceEEEEEEEcCC Q psy3287 70 PLA-AGIPSYVTQEE----FSRYQGYWWQPKTTDGVYRILYEEIDES 111 (122) Q Consensus 70 ~i~-~G~~d~v~eEE----~~~~~g~wWSPd~~~~~~~iay~~~Des 111 (122) .|+ +--|.+..-|- ++--+.|-|||.+. -|||..=-.. T Consensus 327 sisIyEtpsf~lld~Kslki~gIr~FswsP~~~----llAYwtpe~~ 369 (698) T KOG2314|consen 327 SISIYETPSFMLLDKKSLKISGIRDFSWSPTSN----LLAYWTPETN 369 (698) T ss_pred eEEEEecCceeeecccccCCccccCcccCCCcc----eEEEEccccc Confidence 222 12222221111 23346788999999 9999754433 No 68 >PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain Probab=26.37 E-value=1.4e+02 Score=17.25 Aligned_cols=27 Identities=19% Similarity=0.429 Sum_probs=18.4 Q ss_pred cceEEecCCCCCCceEEEEEEEcCCcceeEEC Q psy3287 87 YQGYWWQPKTTDGVYRILYEEIDESDVKIYNF 118 (122) Q Consensus 87 ~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~ 118 (122) -+-+-|||+.+ -||.. .++.+|-.+.+ T Consensus 14 v~~~~w~P~md----LiA~~-t~~g~v~v~Rl 40 (47) T PF12894_consen 14 VSCMSWCPTMD----LIALG-TEDGEVLVYRL 40 (47) T ss_pred EEEEEECCCCC----EEEEE-ECCCeEEEEEC Confidence 34678999998 55544 45666777765 No 69 >TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs. Probab=25.94 E-value=1.4e+02 Score=24.08 Aligned_cols=15 Identities=13% Similarity=0.197 Sum_probs=12.9 Q ss_pred eEEEEEeCCEEEEeC Q psy3287 34 QEEFSRYQGYWWQPK 48 (122) Q Consensus 34 ~~aFvr~~nLyv~~~ 48 (122) +++|.-+++||+... T Consensus 18 ~ia~d~~G~l~V~e~ 32 (367) T TIGR02604 18 AVCFDERGRLWVAEG 32 (367) T ss_pred eeeECCCCCEEEEeC Confidence 488999999999974 No 70 >KOG4497|consensus Probab=25.78 E-value=74 Score=26.83 Aligned_cols=12 Identities=17% Similarity=0.467 Sum_probs=9.5 Q ss_pred ccceEEecCCCC Q psy3287 86 RYQGYWWQPKTT 97 (122) Q Consensus 86 ~~~g~wWSPd~~ 97 (122) .-+...|||||+ T Consensus 93 gls~~~WSPdgr 104 (447) T KOG4497|consen 93 GLSSISWSPDGR 104 (447) T ss_pred cceeeeECCCcc Confidence 346678999997 No 71 >PRK00301 aat leucyl/phenylalanyl-tRNA--protein transferase; Reviewed Probab=25.76 E-value=59 Score=25.53 Aligned_cols=27 Identities=26% Similarity=0.594 Sum_probs=17.8 Q ss_pred eeccCCccchheecccceEEecCCCCCCceEEEEE Q psy3287 72 AAGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILYE 106 (122) Q Consensus 72 ~~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay~ 106 (122) .+|+--|-.+ ...-+|||||-. -|++. T Consensus 40 ~~GiFPw~~~----~~pi~WwsP~pR----~Vl~~ 66 (233) T PRK00301 40 AQGIFPWFSD----GQPILWWSPDPR----AVLPP 66 (233) T ss_pred hcCcccCCCC----CCCceEEcCCCC----EEecc Confidence 3577777442 135679999988 66653 No 72 >PF10656 DUF2483: Hypothetical protein of unknown function (DUF2483); InterPro: IPR018918 This entry is represented by Bacteriophage phi ETA, Orf16. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a family of proteins found in bacteriophage particularly of the SA bacteriophages 11, Mu50B, family, homologous to phi-ETA Orf16. Probab=25.22 E-value=67 Score=20.75 Aligned_cols=36 Identities=11% Similarity=0.210 Sum_probs=24.7 Q ss_pred EEeCCEEEEeCCCCCeeEEeccccCCcccCCCCeeeccCC Q psy3287 38 SRYQGYWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPS 77 (122) Q Consensus 38 vr~~nLyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d 77 (122) .++.+||+.+.++..+..|.|...-. ....++|+-+ T Consensus 11 ~kD~~lyITNkPt~~~~ti~Yst~rr----~Ar~f~Gld~ 46 (72) T PF10656_consen 11 HKDNDLYITNKPTEPEDTIKYSTDRR----DAREFDGLDD 46 (72) T ss_pred eccCCeEEecCCCCCCcccccccccc----cccccCCchh Confidence 46889999999988777777764422 2456666543 No 73 >PF03588 Leu_Phe_trans: Leucyl/phenylalanyl-tRNA protein transferase; InterPro: IPR004616 Leucyl/phenylalanyl-tRNA--protein transferase 2.3.2.6 from EC transfers a Leu or Phe to the amino end of certain proteins to enable degradation. The N-terminal residue controls the biological half-life of many proteins via the N-end rule pathway.; GO: 0008914 leucyltransferase activity, 0030163 protein catabolic process; PDB: 2Z3L_A 2Z3O_A 2Z3P_A 2DPT_B 2Z3M_B 2Z3N_B 2DPS_B 2Z3K_B 2CXA_A. Probab=25.08 E-value=34 Score=25.59 Aligned_cols=25 Identities=24% Similarity=0.612 Sum_probs=10.5 Q ss_pred eccCCccchheecccceEEecCCCCCCceEEEE Q psy3287 73 AGIPSYVTQEEFSRYQGYWWQPKTTDGVYRILY 105 (122) Q Consensus 73 ~G~~d~v~eEE~~~~~g~wWSPd~~~~~~~iay 105 (122) +|+-.|..+ ...-+|||||-. -|++ T Consensus 11 ~GiFPw~~~----~~pilW~sp~pR----~vl~ 35 (173) T PF03588_consen 11 QGIFPWFSE----GDPILWWSPDPR----AVLP 35 (173) T ss_dssp TT-EE---T----TS--EEE--SSE----EEE- T ss_pred cCccCCCCC----CCceeeecCCCc----EEEe Confidence 455566444 245679999987 6654 No 74 >PF12904 Collagen_bind_2: Putative collagen-binding domain of a collagenase ; InterPro: IPR024749 This domain is likely to be the collagen-binding domain of a family of bacterial collagenase enzymes. The structure of one family member, Q8A905 from SWISSPROT, has been characterised. The domain occurs in the C-terminal region of the protein.; PDB: 3KZS_D. Probab=24.88 E-value=2.2e+02 Score=18.99 Aligned_cols=30 Identities=13% Similarity=0.543 Sum_probs=15.9 Q ss_pred ccceEEecC-CCCCCceEEEEEEEcCCcceeEECC Q psy3287 86 RYQGYWWQP-KTTDGVYRILYEEIDESDVKIYNFP 119 (122) Q Consensus 86 ~~~g~wWSP-d~~~~~~~iay~~~Des~V~~~~~~ 119 (122) +.++.|.+| +|+ +-..-.+....+..+.-| T Consensus 48 ~~~a~WfdPR~G~----~~~~g~~~~~~~~~F~pP 78 (93) T PF12904_consen 48 KVKAWWFDPRTGK----YTYIGEFSNKGIQTFTPP 78 (93) T ss_dssp EEEEEEEETTT-B----EEEEEEEES-SEEEE--S T ss_pred ceeEEEEcCCCCC----EEEeeeecCCcceEecCC Confidence 458899999 454 444445555455555433 No 75 >COG2360 Aat Leu/Phe-tRNA-protein transferase [Posttranslational modification, protein turnover, chaperones] Probab=24.68 E-value=50 Score=25.74 Aligned_cols=21 Identities=24% Similarity=0.634 Sum_probs=14.1 Q ss_pred eccCCccchheecccceEEecCCCC Q psy3287 73 AGIPSYVTQEEFSRYQGYWWQPKTT 97 (122) Q Consensus 73 ~G~~d~v~eEE~~~~~g~wWSPd~~ 97 (122) +|+--|-. ....-+|||||.. T Consensus 34 ~GiFPw~~----~~~~~~W~spdpR 54 (221) T COG2360 34 QGIFPWAD----DGDPLLWWSPDPR 54 (221) T ss_pred cCcccccc----CCCceeEECCCCC Confidence 46665544 2346679999988 No 76 >PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal Probab=24.10 E-value=2.1e+02 Score=23.16 Aligned_cols=20 Identities=25% Similarity=0.584 Sum_probs=15.9 Q ss_pred ecccceEEecCCCCCCceEEEEEE Q psy3287 84 FSRYQGYWWQPKTTDGVYRILYEE 107 (122) Q Consensus 84 ~~~~~g~wWSPd~~~~~~~iay~~ 107 (122) +...+-..||||++ -|||++ T Consensus 43 ~PQWRkl~WSpD~t----lLa~a~ 62 (282) T PF15492_consen 43 NPQWRKLAWSPDCT----LLAYAE 62 (282) T ss_pred CchheEEEECCCCc----EEEEEc Confidence 44567788999999 888875 No 77 >KOG2919|consensus Probab=23.50 E-value=59 Score=27.29 Aligned_cols=21 Identities=29% Similarity=0.446 Sum_probs=15.6 Q ss_pred ccceEEecCCCCCCceEEEEEEEcC Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEIDE 110 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~De 110 (122) -.+|-.|||||+ -||-..-|. T Consensus 51 f~kgckWSPDGS----ciL~~sedn 71 (406) T KOG2919|consen 51 FLKGCKWSPDGS----CILSLSEDN 71 (406) T ss_pred hhccceeCCCCc----eEEeecccC Confidence 357888999999 777665544 No 78 >KOG0645|consensus Probab=23.37 E-value=68 Score=26.16 Aligned_cols=22 Identities=18% Similarity=0.369 Sum_probs=18.4 Q ss_pred ceEEecCCCCCCceEEEEEEEcCCcc Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDESDV 113 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~Des~V 113 (122) +-..|||.|+ +||-+.||-+-+ T Consensus 65 RsvAwsp~g~----~La~aSFD~t~~ 86 (312) T KOG0645|consen 65 RSVAWSPHGR----YLASASFDATVV 86 (312) T ss_pred eeeeecCCCc----EEEEeeccceEE Confidence 5568999999 999999997644 No 79 >PF10596 U6-snRNA_bdg: U6-snRNA interacting domain of PrP8; InterPro: IPR019580 This entry represents the interacting site for U6-snRNA, which is part of U4/U6. U5 tri-snRNPs complex of the spliceosome is a prime candidate for the role of cofactor in the spliceosome's RNA core. The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor []. Probab=23.24 E-value=74 Score=23.62 Aligned_cols=29 Identities=17% Similarity=0.511 Sum_probs=17.9 Q ss_pred cccceEEecCCCCCCceEEEE-EEEcCCcc Q psy3287 85 SRYQGYWWQPKTTDGVYRILY-EEIDESDV 113 (122) Q Consensus 85 ~~~~g~wWSPd~~~~~~~iay-~~~Des~V 113 (122) .|.=.+||||+=+-+--++.| .++|.+.| T Consensus 90 NRrf~LWWsPtINR~nvyvGfqvqldlTgI 119 (160) T PF10596_consen 90 NRRFTLWWSPTINRANVYVGFQVQLDLTGI 119 (160) T ss_pred ccccccccCCccccCCceEEEEeeccccce Confidence 344567999993311125555 57888877 No 80 >PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H. Probab=23.15 E-value=2.8e+02 Score=22.04 Aligned_cols=73 Identities=10% Similarity=0.137 Sum_probs=39.3 Q ss_pred eeceEEEEEeCC-EEEEeCCCCCeeEEeccccCCcccCCCCeeeccCCccchhee--cccceEEecCCCCCCceEEEEEE Q psy3287 31 YVTQEEFSRYQG-YWWQPKTTGSTQRLTFAHKGGRNLTDDPLAAGIPSYVTQEEF--SRYQGYWWQPKTTDGVYRILYEE 107 (122) Q Consensus 31 f~~~~aFvr~~n-Lyv~~~~~~~~~rlT~~~~g~~~v~~~~i~~G~~d~v~eEE~--~~~~g~wWSPd~~~~~~~iay~~ 107 (122) ....+++..+++ ||+++...+....+..+.. .|....+-.-.. ..-+.+..|||++ +|+..- T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~-----------~g~l~~~~~~~~~G~~Pr~~~~s~~g~----~l~Va~ 310 (345) T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPA-----------TGTLTLVQTVPTGGKFPRHFAFSPDGR----YLYVAN 310 (345) T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTT-----------TTTEEEEEEEEESSSSEEEEEE-TTSS----EEEEEE T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecC-----------CCceEEEEEEeCCCCCccEEEEeCCCC----EEEEEe Confidence 445577776665 6777755544333333211 111111211111 1246778899999 888887 Q ss_pred EcCCcceeEEC Q psy3287 108 IDESDVKIYNF 118 (122) Q Consensus 108 ~Des~V~~~~~ 118 (122) .+...|..+.+ T Consensus 311 ~~s~~v~vf~~ 321 (345) T PF10282_consen 311 QDSNTVSVFDI 321 (345) T ss_dssp TTTTEEEEEEE T ss_pred cCCCeEEEEEE Confidence 77777877765 No 81 >PRK11028 6-phosphogluconolactonase; Provisional Probab=22.34 E-value=3.5e+02 Score=20.86 Aligned_cols=26 Identities=4% Similarity=-0.074 Sum_probs=15.4 Q ss_pred ceEEecCCCCCCceEEEEEEEcCCcceeEE Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDESDVKIYN 117 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~Des~V~~~~ 117 (122) ..+-++||++ +|.-....+..|..+. T Consensus 83 ~~i~~~~~g~----~l~v~~~~~~~v~v~~ 108 (330) T PRK11028 83 THISTDHQGR----FLFSASYNANCVSVSP 108 (330) T ss_pred eEEEECCCCC----EEEEEEcCCCeEEEEE Confidence 4567888887 6655554444444443 No 82 >COG1770 PtrB Protease II [Amino acid transport and metabolism] Probab=21.06 E-value=4e+02 Score=24.30 Aligned_cols=29 Identities=17% Similarity=0.511 Sum_probs=23.3 Q ss_pred chhee-cccceEEecCCCCCCceEEEEEEEcCCc Q psy3287 80 TQEEF-SRYQGYWWQPKTTDGVYRILYEEIDESD 112 (122) Q Consensus 80 ~eEE~-~~~~g~wWSPd~~~~~~~iay~~~Des~ 112 (122) +.|++ +...++.|.+|++ .+-|.+.|+.. T Consensus 168 ~~d~i~~~~~~~~Wa~d~~----~lfYt~~d~~~ 197 (682) T COG1770 168 LPDEITNTSGSFAWAADGK----TLFYTRLDENH 197 (682) T ss_pred cchhhcccccceEEecCCC----eEEEEEEcCCC Confidence 44555 4577889999999 99999999985 No 83 >KOG2139|consensus Probab=21.00 E-value=1.5e+02 Score=25.34 Aligned_cols=30 Identities=20% Similarity=0.341 Sum_probs=22.2 Q ss_pred ccceEEecCCCCCCceEEEEEEEcCCcceeEECC Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEIDESDVKIYNFP 119 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~Des~V~~~~~~ 119 (122) |-.+-.|||+|. +|||.-.-++.+-..+++ T Consensus 282 rvqtacWspcGs----fLLf~~sgsp~lysl~f~ 311 (445) T KOG2139|consen 282 RVQTACWSPCGS----FLLFACSGSPRLYSLTFD 311 (445) T ss_pred ceeeeeecCCCC----EEEEEEcCCceEEEEeec Confidence 334456999999 999998887776555543 No 84 >KOG0293|consensus Probab=20.84 E-value=56 Score=28.20 Aligned_cols=20 Identities=25% Similarity=0.403 Sum_probs=17.1 Q ss_pred ceEEecCCCCCCceEEEEEEEcCC Q psy3287 88 QGYWWQPKTTDGVYRILYEEIDES 111 (122) Q Consensus 88 ~g~wWSPd~~~~~~~iay~~~Des 111 (122) ..+-||||.+ +|+-+-+||. T Consensus 273 ~yi~wSPDdr----yLlaCg~~e~ 292 (519) T KOG0293|consen 273 SYIMWSPDDR----YLLACGFDEV 292 (519) T ss_pred EEEEECCCCC----eEEecCchHh Confidence 3457999999 9999999987 No 85 >PF14315 DUF4380: Domain of unknown function (DUF4380) Probab=20.71 E-value=2.6e+02 Score=21.80 Aligned_cols=46 Identities=20% Similarity=0.245 Sum_probs=29.9 Q ss_pred CCeeeccCCccchheecccceEEecCC----CCCCceEEEEEEEcCCcceeEECC Q psy3287 69 DPLAAGIPSYVTQEEFSRYQGYWWQPK----TTDGVYRILYEEIDESDVKIYNFP 119 (122) Q Consensus 69 ~~i~~G~~d~v~eEE~~~~~g~wWSPd----~~~~~~~iay~~~Des~V~~~~~~ 119 (122) ...+-|=--|+..|.+.|.+..-|.|| +. .+.+.. ++..|..-.=+ T Consensus 56 ~~~~GGhrlW~~Pe~~~r~~~~~~~Pd~~ld~~----p~~~~~-~~~~v~L~s~~ 105 (274) T PF14315_consen 56 WINYGGHRLWPSPENPPRTSKWVWPPDPVLDNG----PYEVEI-DDDGVRLTSPP 105 (274) T ss_pred ccCCCcceeecCCCCccccccccCCCcccccCC----ceeEEE-cCCEEEEecCC Confidence 345556677888888866633349999 76 666666 66666554433 No 86 >PTZ00152 cofilin/actin-depolymerizing factor 1-like protein; Provisional Probab=20.34 E-value=34 Score=23.98 Aligned_cols=24 Identities=8% Similarity=0.275 Sum_probs=15.9 Q ss_pred ccceEEecCCCCCCceEEEEEEEc Q psy3287 86 RYQGYWWQPKTTDGVYRILYEEID 109 (122) Q Consensus 86 ~~~g~wWSPd~~~~~~~iay~~~D 109 (122) +.-=+.|+||+.+.-.+.+|+..- T Consensus 72 klvFI~w~Pd~a~ik~KMlYASsK 95 (122) T PTZ00152 72 KIHFFMYARESSNSRDRMTYASSK 95 (122) T ss_pred CEEEEEECCCCCChHHhhhhHhHH Confidence 444558999998555566666543 Done!