Query psy4773
Match_columns 160
No_of_seqs 129 out of 569
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 17:20:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4773.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4773hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1520|consensus 100.0 5.3E-40 1.2E-44 277.4 13.0 141 3-159 235-376 (376)
2 COG3386 Gluconolactonase [Carb 99.1 8.1E-10 1.8E-14 92.8 11.2 93 2-150 178-277 (307)
3 PF08450 SGL: SMP-30/Gluconola 99.0 2.8E-09 6E-14 85.3 9.5 88 3-148 150-245 (246)
4 KOG4499|consensus 98.0 4.2E-05 9.2E-10 62.5 9.1 88 2-147 173-273 (310)
5 COG4257 Vgb Streptogramin lyas 96.3 0.027 5.8E-07 47.3 8.3 92 3-157 119-216 (353)
6 PF03088 Str_synth: Strictosid 95.8 0.0094 2E-07 41.5 3.0 43 7-53 35-77 (89)
7 TIGR03032 conserved hypothetic 95.1 0.09 2E-06 44.7 7.1 48 3-56 217-265 (335)
8 PF08450 SGL: SMP-30/Gluconola 93.7 1.2 2.7E-05 35.2 10.6 88 20-158 68-163 (246)
9 PLN02919 haloacid dehalogenase 91.9 2.6 5.7E-05 41.2 11.9 30 130-159 858-888 (1057)
10 COG3386 Gluconolactonase [Carb 89.9 4.6 9.9E-05 34.1 10.2 95 21-159 94-193 (307)
11 TIGR02604 Piru_Ver_Nterm putat 87.9 10 0.00022 32.3 11.1 46 8-53 89-144 (367)
12 PLN02919 haloacid dehalogenase 85.4 22 0.00047 35.0 13.1 44 3-50 584-642 (1057)
13 PF07494 Reg_prop: Two compone 84.2 0.98 2.1E-05 23.5 1.9 13 38-50 10-22 (24)
14 PF05787 DUF839: Bacterial pro 82.8 1.9 4E-05 39.1 4.4 45 9-53 391-456 (524)
15 PF07995 GSDH: Glucose / Sorbo 82.6 1.8 4E-05 36.5 4.0 46 8-53 81-134 (331)
16 PF06977 SdiA-regulated: SdiA- 82.2 3.4 7.3E-05 33.9 5.3 49 100-148 185-240 (248)
17 PF01436 NHL: NHL repeat; Int 80.1 0.91 2E-05 24.4 0.9 19 34-52 3-21 (28)
18 COG4257 Vgb Streptogramin lyas 79.1 17 0.00037 30.9 8.4 29 19-52 217-252 (353)
19 PRK11028 6-phosphogluconolacto 79.0 34 0.00074 28.0 10.6 39 8-48 56-95 (330)
20 PF07995 GSDH: Glucose / Sorbo 77.6 1.7 3.7E-05 36.6 2.3 49 3-51 275-324 (331)
21 COG3204 Uncharacterized protei 72.6 11 0.00023 32.1 5.6 52 99-150 246-304 (316)
22 TIGR03606 non_repeat_PQQ dehyd 68.8 26 0.00055 31.4 7.5 46 8-53 115-166 (454)
23 KOG0639|consensus 66.7 33 0.00072 31.4 7.7 63 6-75 436-505 (705)
24 KOG1520|consensus 66.4 6.1 0.00013 34.4 3.1 45 5-53 195-239 (376)
25 COG3211 PhoX Predicted phospha 64.6 5.8 0.00012 36.5 2.7 45 9-53 455-520 (616)
26 KOG0404|consensus 63.8 3.5 7.6E-05 34.1 1.1 16 24-39 53-68 (322)
27 cd08033 LARP_6 La RNA-binding 63.3 3 6.6E-05 28.2 0.5 14 108-121 63-76 (77)
28 PRK11028 6-phosphogluconolacto 62.1 86 0.0019 25.6 11.7 40 8-49 196-245 (330)
29 COG2088 SpoVG Uncharacterized 60.8 26 0.00056 24.5 4.7 69 7-77 4-74 (95)
30 PF03022 MRJP: Major royal jel 55.8 65 0.0014 26.7 7.4 64 31-149 184-255 (287)
31 PHA02865 MHC-like TNF binding 53.7 12 0.00025 32.3 2.6 23 31-53 264-289 (338)
32 PF06977 SdiA-regulated: SdiA- 52.8 44 0.00095 27.3 5.8 54 101-156 37-91 (248)
33 cd08030 LA_like_plant La-motif 50.7 19 0.00042 25.0 2.9 14 108-121 76-89 (90)
34 cd08031 LARP_4_5_like La RNA-b 48.8 18 0.00039 24.3 2.4 14 108-121 61-74 (75)
35 TIGR02604 Piru_Ver_Nterm putat 46.3 32 0.0007 29.2 4.2 50 3-52 286-340 (367)
36 cd08035 LARP_4 La RNA-binding 44.5 8.1 0.00018 26.0 0.2 14 108-121 61-74 (75)
37 COG3292 Predicted periplasmic 43.0 58 0.0013 30.3 5.4 48 110-157 227-274 (671)
38 COG3292 Predicted periplasmic 41.6 87 0.0019 29.2 6.3 33 115-149 362-395 (671)
39 cd08036 LARP_5 La RNA-binding 40.9 11 0.00023 25.5 0.3 14 108-121 61-74 (75)
40 COG2133 Glucose/sorbosone dehy 40.0 84 0.0018 27.7 5.8 44 10-53 148-197 (399)
41 COG3204 Uncharacterized protei 39.6 68 0.0015 27.3 5.0 51 101-151 101-160 (316)
42 PF09154 DUF1939: Domain of un 37.0 69 0.0015 20.2 3.7 38 12-53 12-49 (57)
43 PF00058 Ldl_recept_b: Low-den 35.5 53 0.0011 19.0 2.8 36 3-41 5-41 (42)
44 COG4294 Uve UV damage repair e 32.7 10 0.00022 32.3 -0.9 36 1-39 167-205 (347)
45 cd08032 LARP_7 La RNA-binding 32.6 20 0.00044 24.5 0.7 14 108-121 68-81 (82)
46 PF13570 PQQ_3: PQQ-like domai 32.4 95 0.0021 17.3 4.2 20 133-152 14-33 (40)
47 PF06739 SBBP: Beta-propeller 32.3 42 0.00091 19.2 1.9 20 34-53 14-33 (38)
48 cd06481 ACD_HspB9_like Alpha c 32.0 30 0.00066 23.4 1.5 28 22-50 7-36 (87)
49 cd08029 LA_like_fungal La-moti 31.9 42 0.00091 22.5 2.1 14 108-121 62-75 (76)
50 PF11208 DUF2992: Protein of u 31.8 27 0.00059 26.0 1.3 12 43-54 6-17 (132)
51 PF06236 MelC1: Tyrosinase co- 31.4 43 0.00092 24.8 2.3 47 2-52 44-95 (125)
52 TIGR03032 conserved hypothetic 31.1 2.9E+02 0.0064 23.8 7.5 30 130-159 202-231 (335)
53 PF06079 Apyrase: Apyrase; In 30.2 2.6E+02 0.0056 23.7 6.9 53 100-153 66-123 (291)
54 PTZ00486 apyrase Superfamily; 29.3 3.4E+02 0.0073 23.6 7.6 53 100-152 127-185 (352)
55 PF13449 Phytase-like: Esteras 29.2 92 0.002 26.0 4.3 18 36-53 88-105 (326)
56 COG2706 3-carboxymuconate cycl 28.9 3.9E+02 0.0084 23.2 11.3 52 108-159 66-119 (346)
57 PHA02598 denA endonuclease II; 28.6 71 0.0015 24.0 3.1 39 110-148 13-51 (138)
58 PF13600 DUF4140: N-terminal d 27.9 1.1E+02 0.0024 20.9 3.9 26 20-45 20-47 (104)
59 PF03022 MRJP: Major royal jel 27.7 3.5E+02 0.0076 22.3 10.2 52 108-159 34-96 (287)
60 KOG0639|consensus 26.6 72 0.0016 29.3 3.3 44 3-52 525-572 (705)
61 TIGR03606 non_repeat_PQQ dehyd 26.5 4.7E+02 0.01 23.4 9.2 29 22-51 20-48 (454)
62 PRK11138 outer membrane biogen 25.4 2.5E+02 0.0054 23.7 6.3 43 107-151 344-387 (394)
63 TIGR02276 beta_rpt_yvtn 40-res 24.4 1.3E+02 0.0029 16.3 3.5 19 140-158 3-21 (42)
64 PF14583 Pectate_lyase22: Olig 24.3 2.5E+02 0.0055 24.7 6.1 83 9-127 102-188 (386)
65 PF03088 Str_synth: Strictosid 24.0 2.5E+02 0.0053 19.3 8.1 51 105-158 34-86 (89)
66 PF01731 Arylesterase: Arylest 23.6 1.1E+02 0.0024 20.9 3.1 30 22-52 44-74 (86)
67 PF06089 Asparaginase_II: L-as 23.3 86 0.0019 26.9 3.0 24 105-128 13-36 (324)
68 cd07323 LAM LA motif RNA-bindi 22.3 41 0.00089 22.3 0.7 15 63-77 36-50 (75)
69 PF07463 NUMOD4: NUMOD4 motif; 22.3 77 0.0017 19.2 1.9 23 25-48 1-23 (51)
70 TIGR03300 assembly_YfgL outer 22.2 2.5E+02 0.0054 23.4 5.7 42 106-150 73-115 (377)
71 PF05411 Peptidase_C32: Equine 21.7 44 0.00095 24.7 0.8 42 107-148 58-102 (127)
72 TIGR03300 assembly_YfgL outer 21.0 3.2E+02 0.007 22.7 6.1 44 106-151 328-372 (377)
73 PF01502 PRA-CH: Phosphoribosy 20.8 2.1E+02 0.0046 19.2 3.9 33 13-53 28-60 (75)
74 PF04398 DUF538: Protein of un 20.7 86 0.0019 22.4 2.2 24 30-53 10-39 (110)
75 PHA02122 hypothetical protein 20.7 2.4E+02 0.0053 18.0 4.0 33 114-147 28-60 (65)
76 PF13360 PQQ_2: PQQ-like domai 20.1 3.9E+02 0.0084 20.1 7.1 47 108-158 46-93 (238)
No 1
>KOG1520|consensus
Probab=100.00 E-value=5.3e-40 Score=277.38 Aligned_cols=141 Identities=33% Similarity=0.603 Sum_probs=123.5
Q ss_pred cccCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhh
Q psy4773 3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLL 82 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~ 82 (160)
+|||+++||+||||+|+|+|++|+|++|||||||||+++++|+||||+.+.|+ .+.+++.+|||+||++.++|..+
T Consensus 235 ~~Et~~~ri~rywi~g~k~gt~EvFa~~LPG~PDNIR~~~~G~fWVal~~~~~----~~~~~~~~~p~vr~~~~~~~~~~ 310 (376)
T KOG1520|consen 235 VAETTTARIKRYWIKGPKAGTSEVFAEGLPGYPDNIRRDSTGHFWVALHSKRS----TLWRLLMKYPWVRKFIAKLPKYM 310 (376)
T ss_pred EEeeccceeeeeEecCCccCchhhHhhcCCCCCcceeECCCCCEEEEEecccc----hHHHhhhcChHHHHHHHhhccch
Confidence 79999999999999999999999999999999999999999999999999854 48999999999999999998765
Q ss_pred ccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCC-CcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773 83 EKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGS-LKGSSDVEEYNGAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~-~~~iS~v~e~~g~LylGS~~~~~i~~~~l~ 159 (160)
... .++. .....++.|.++|++|+|+++|||++++ +..+|+|.|++|+||+||+.++||+|++|+
T Consensus 311 ~~~------~~~~------~~~~p~~~V~~~d~~G~il~~lhD~~g~~~~~~sev~E~dg~LyiGS~~~p~i~~lkl~ 376 (376)
T KOG1520|consen 311 ELL------YFLN------NGGKPHSAVKLSDETGKILESLHDKEGKVITLVSEVGEHDGHLYIGSLFNPYIARLKLP 376 (376)
T ss_pred hhh------hhhh------ccCCCceEEEEecCCCcEEEEEecCCCCceEEEEEEeecCCeEEEcccCcceeEEEecC
Confidence 431 1110 0112237788888999999999999974 799999999999999999999999999985
No 2
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.11 E-value=8.1e-10 Score=92.78 Aligned_cols=93 Identities=26% Similarity=0.330 Sum_probs=76.0
Q ss_pred CcccCCCcEEEEEEec---CCCCCcc-eEecccCCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773 2 GQTDYCNFSFWRYYLK---GPKQGKS-EVFIDGLPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR 77 (160)
Q Consensus 2 ~~~Et~~~Ri~Ry~l~---G~k~G~~-evfid~LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~ 77 (160)
=++||..+||+||++. |+..+.. .++.+.-||+||+++.|++|++|++....
T Consensus 178 y~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~------------------------ 233 (307)
T COG3386 178 YVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWG------------------------ 233 (307)
T ss_pred EEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccC------------------------
Confidence 3789999999999998 7765542 45565678999999999999999733221
Q ss_pred hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEEC---CEEEEecCCC
Q psy4773 78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGSPIS 150 (160)
Q Consensus 78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~~~ 150 (160)
-+.|.+++++|+.+..+..|. ..+|.+...| ++||+.|...
T Consensus 234 -----------------------------g~~v~~~~pdG~l~~~i~lP~---~~~t~~~FgG~~~~~L~iTs~~~ 277 (307)
T COG3386 234 -----------------------------GGRVVRFNPDGKLLGEIKLPV---KRPTNPAFGGPDLNTLYITSARS 277 (307)
T ss_pred -----------------------------CceEEEECCCCcEEEEEECCC---CCCccceEeCCCcCEEEEEecCC
Confidence 046889999999999999997 5688888887 9999999888
No 3
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.00 E-value=2.8e-09 Score=85.28 Aligned_cols=88 Identities=24% Similarity=0.403 Sum_probs=70.4
Q ss_pred cccCCCcEEEEEEecCC--CCCcceEecccCC---CCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773 3 QTDYCNFSFWRYYLKGP--KQGKSEVFIDGLP---GLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR 77 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~--k~G~~evfid~LP---G~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~ 77 (160)
++||.++||+||.++.+ ..+..++|++ ++ |+||++..|++|++|||....
T Consensus 150 v~ds~~~~i~~~~~~~~~~~~~~~~~~~~-~~~~~g~pDG~~vD~~G~l~va~~~~------------------------ 204 (246)
T PF08450_consen 150 VADSFNGRIWRFDLDADGGELSNRRVFID-FPGGPGYPDGLAVDSDGNLWVADWGG------------------------ 204 (246)
T ss_dssp EEETTTTEEEEEEEETTTCCEEEEEEEEE--SSSSCEEEEEEEBTTS-EEEEEETT------------------------
T ss_pred ecccccceeEEEeccccccceeeeeeEEE-cCCCCcCCCcceEcCCCCEEEEEcCC------------------------
Confidence 48999999999999754 3466788876 54 469999999999999998763
Q ss_pred hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEEC---CEEEEecC
Q psy4773 78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGSP 148 (160)
Q Consensus 78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~ 148 (160)
+.|.++|++|+++..++.|. +.+|.+...+ ++||+.|-
T Consensus 205 ------------------------------~~I~~~~p~G~~~~~i~~p~---~~~t~~~fgg~~~~~L~vTta 245 (246)
T PF08450_consen 205 ------------------------------GRIVVFDPDGKLLREIELPV---PRPTNCAFGGPDGKTLYVTTA 245 (246)
T ss_dssp ------------------------------TEEEEEETTSCEEEEEE-SS---SSEEEEEEESTTSSEEEEEEB
T ss_pred ------------------------------CEEEEECCCccEEEEEcCCC---CCEEEEEEECCCCCEEEEEeC
Confidence 45789999999999999884 4789999853 88999874
No 4
>KOG4499|consensus
Probab=98.02 E-value=4.2e-05 Score=62.53 Aligned_cols=88 Identities=18% Similarity=0.207 Sum_probs=70.4
Q ss_pred CcccCCCcEE--EEEEecCCCCCcceEecccC-------CCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHH
Q psy4773 2 GQTDYCNFSF--WRYYLKGPKQGKSEVFIDGL-------PGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIR 72 (160)
Q Consensus 2 ~~~Et~~~Ri--~Ry~l~G~k~G~~evfid~L-------PG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lR 72 (160)
-|.||..+.| ..|+..+-.....++.++ | +=.||++..|.+|.+|||.++.
T Consensus 173 Y~iDsln~~V~a~dyd~~tG~~snr~~i~d-lrk~~~~e~~~PDGm~ID~eG~L~Va~~ng------------------- 232 (310)
T KOG4499|consen 173 YYIDSLNYEVDAYDYDCPTGDLSNRKVIFD-LRKSQPFESLEPDGMTIDTEGNLYVATFNG------------------- 232 (310)
T ss_pred EEEccCceEEeeeecCCCcccccCcceeEE-eccCCCcCCCCCCcceEccCCcEEEEEecC-------------------
Confidence 4789999999 555577666666666665 3 2389999999999999999885
Q ss_pred HHHHHhhhhhccCCccccceeccceeeeeeecCCeEEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEEC---CEEEEec
Q psy4773 73 KFVARFLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGS 147 (160)
Q Consensus 73 k~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS 147 (160)
+.|.++|+ +||++..+.-|. +.||++.+.| +-||...
T Consensus 233 -----------------------------------~~V~~~dp~tGK~L~eiklPt---~qitsccFgGkn~d~~yvT~ 273 (310)
T KOG4499|consen 233 -----------------------------------GTVQKVDPTTGKILLEIKLPT---PQITSCCFGGKNLDILYVTT 273 (310)
T ss_pred -----------------------------------cEEEEECCCCCcEEEEEEcCC---CceEEEEecCCCccEEEEEe
Confidence 66899995 699999999996 6799999887 5677654
No 5
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=96.30 E-value=0.027 Score=47.34 Aligned_cols=92 Identities=20% Similarity=0.251 Sum_probs=70.4
Q ss_pred cccCCCcEEEEEEecCCCCCcceEecccCCC--CCCcee---ecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773 3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPG--LPDNVK---RDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR 77 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG--~PDNI~---~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~ 77 (160)
+|||.. -|.|+ ++|.++.+.|- ||+ -++|+. .|++|++|-.-...
T Consensus 119 itd~~~-aI~R~---dpkt~evt~f~--lp~~~a~~nlet~vfD~~G~lWFt~q~G------------------------ 168 (353)
T COG4257 119 ITDTGL-AIGRL---DPKTLEVTRFP--LPLEHADANLETAVFDPWGNLWFTGQIG------------------------ 168 (353)
T ss_pred EecCcc-eeEEe---cCcccceEEee--cccccCCCcccceeeCCCccEEEeeccc------------------------
Confidence 578877 89998 56778888886 675 688885 68999999754322
Q ss_pred hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEE-ECCEEEEecCCCCeEEEE
Q psy4773 78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEE-YNGAYYFGSPISKHLARV 156 (160)
Q Consensus 78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e-~~g~LylGS~~~~~i~~~ 156 (160)
-++ |+|+.-..++.+..|.|. ....++. -+|.+|..|+..+||+++
T Consensus 169 ----------------------------~yG---rLdPa~~~i~vfpaPqG~--gpyGi~atpdGsvwyaslagnaiari 215 (353)
T COG4257 169 ----------------------------AYG---RLDPARNVISVFPAPQGG--GPYGICATPDGSVWYASLAGNAIARI 215 (353)
T ss_pred ----------------------------cce---ecCcccCceeeeccCCCC--CCcceEECCCCcEEEEeccccceEEc
Confidence 123 789888999999998864 2345554 479999999999999998
Q ss_pred e
Q psy4773 157 P 157 (160)
Q Consensus 157 ~ 157 (160)
+
T Consensus 216 d 216 (353)
T COG4257 216 D 216 (353)
T ss_pred c
Confidence 7
No 6
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=95.81 E-value=0.0094 Score=41.52 Aligned_cols=43 Identities=21% Similarity=0.324 Sum_probs=34.1
Q ss_pred CCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773 7 CNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 7 ~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~ 53 (160)
...|++||..+ .++.++++++|. ||.+|..++||.|.+-.-+.
T Consensus 35 ~~GRll~ydp~---t~~~~vl~~~L~-fpNGVals~d~~~vlv~Et~ 77 (89)
T PF03088_consen 35 PTGRLLRYDPS---TKETTVLLDGLY-FPNGVALSPDESFVLVAETG 77 (89)
T ss_dssp --EEEEEEETT---TTEEEEEEEEES-SEEEEEE-TTSSEEEEEEGG
T ss_pred CCcCEEEEECC---CCeEEEehhCCC-ccCeEEEcCCCCEEEEEecc
Confidence 46899999755 588999999999 89999999999987755443
No 7
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=95.12 E-value=0.09 Score=44.74 Aligned_cols=48 Identities=17% Similarity=0.093 Sum_probs=40.1
Q ss_pred cccCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCC-eEEEeecCCCC
Q psy4773 3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGN-FLVSLVCPVDE 56 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~-fWval~~~r~~ 56 (160)
++|+.+.++.+++.+ .|+.|+.++ +||+|-+|-.. |. ..|+++.+|.+
T Consensus 217 vldsgtGev~~vD~~---~G~~e~Va~-vpG~~rGL~f~--G~llvVgmSk~R~~ 265 (335)
T TIGR03032 217 LLNSGRGELGYVDPQ---AGKFQPVAF-LPGFTRGLAFA--GDFAFVGLSKLRES 265 (335)
T ss_pred EEECCCCEEEEEcCC---CCcEEEEEE-CCCCCccccee--CCEEEEEeccccCC
Confidence 678899999999754 478999997 99999999988 55 56999999753
No 8
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=93.71 E-value=1.2 Score=35.16 Aligned_cols=88 Identities=19% Similarity=0.265 Sum_probs=59.2
Q ss_pred CCCcceEecccCC------CCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCcccccee
Q psy4773 20 KQGKSEVFIDGLP------GLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHV 93 (160)
Q Consensus 20 k~G~~evfid~LP------G~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~ 93 (160)
+.|+.+.+++ ++ -.|..+..|++|++|++-..... ..
T Consensus 68 ~~g~~~~~~~-~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~---~~--------------------------------- 110 (246)
T PF08450_consen 68 DTGKVTVLAD-LPDGGVPFNRPNDVAVDPDGNLYVTDSGGGG---AS--------------------------------- 110 (246)
T ss_dssp TTTEEEEEEE-EETTCSCTEEEEEEEE-TTS-EEEEEECCBC---TT---------------------------------
T ss_pred CCCcEEEEee-ccCCCcccCCCceEEEcCCCCEEEEecCCCc---cc---------------------------------
Confidence 4578888887 42 36777889999999999876521 00
Q ss_pred ccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE--CCEEEEecCCCCeEEEEeC
Q psy4773 94 VGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY--NGAYYFGSPISKHLARVPL 158 (160)
Q Consensus 94 ~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~--~g~LylGS~~~~~i~~~~l 158 (160)
....+.+.+++.+|++.....+- ...-.+... +..||+.......|.++++
T Consensus 111 ----------~~~~g~v~~~~~~~~~~~~~~~~----~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~ 163 (246)
T PF08450_consen 111 ----------GIDPGSVYRIDPDGKVTVVADGL----GFPNGIAFSPDGKTLYVADSFNGRIWRFDL 163 (246)
T ss_dssp ----------CGGSEEEEEEETTSEEEEEEEEE----SSEEEEEEETTSSEEEEEETTTTEEEEEEE
T ss_pred ----------cccccceEEECCCCeEEEEecCc----ccccceEECCcchheeecccccceeEEEec
Confidence 00117889999998877766542 233444444 4579999999999998886
No 9
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=91.91 E-value=2.6 Score=41.20 Aligned_cols=30 Identities=13% Similarity=0.196 Sum_probs=22.6
Q ss_pred CcCceEEEEE-CCEEEEecCCCCeEEEEeCC
Q psy4773 130 LKGSSDVEEY-NGAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 130 ~~~iS~v~e~-~g~LylGS~~~~~i~~~~l~ 159 (160)
+...+++... +|+||+....++-|-++++.
T Consensus 858 l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~ 888 (1057)
T PLN02919 858 LSEPAGLALGENGRLFVADTNNSLIRYLDLN 888 (1057)
T ss_pred cCCceEEEEeCCCCEEEEECCCCEEEEEECC
Confidence 4456666654 68899999999999888764
No 10
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=89.94 E-value=4.6 Score=34.08 Aligned_cols=95 Identities=15% Similarity=0.157 Sum_probs=58.3
Q ss_pred CCcc-eEecccCCCCCCc----eeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCccccceecc
Q psy4773 21 QGKS-EVFIDGLPGLPDN----VKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHVVG 95 (160)
Q Consensus 21 ~G~~-evfid~LPG~PDN----I~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~~~ 95 (160)
.|.. +.+++.-+|.|+| .+.+++|.||++-... .++..
T Consensus 94 ~~~~~t~~~~~~~~~~~~r~ND~~v~pdG~~wfgt~~~-~~~~~------------------------------------ 136 (307)
T COG3386 94 TGGKITLLAEPEDGLPLNRPNDGVVDPDGRIWFGDMGY-FDLGK------------------------------------ 136 (307)
T ss_pred CCceeEEeccccCCCCcCCCCceeEcCCCCEEEeCCCc-cccCc------------------------------------
Confidence 3444 6777756665554 5688999999988652 00000
Q ss_pred ceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773 96 HFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 96 ~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l~ 159 (160)
.-.++.+.+.++|++|.+++.+.+.- .++.--.....+..||+.--..+.|-|++++
T Consensus 137 ------~~~~~~G~lyr~~p~g~~~~l~~~~~-~~~NGla~SpDg~tly~aDT~~~~i~r~~~d 193 (307)
T COG3386 137 ------SEERPTGSLYRVDPDGGVVRLLDDDL-TIPNGLAFSPDGKTLYVADTPANRIHRYDLD 193 (307)
T ss_pred ------cccCCcceEEEEcCCCCEEEeecCcE-EecCceEECCCCCEEEEEeCCCCeEEEEecC
Confidence 01233478999999899988887621 1111111222345899988888888888764
No 11
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=87.87 E-value=10 Score=32.27 Aligned_cols=46 Identities=15% Similarity=0.195 Sum_probs=33.7
Q ss_pred CcEEEEEEe-cCC--CCCcceEecccCCCC-------CCceeecCCCCeEEEeecC
Q psy4773 8 NFSFWRYYL-KGP--KQGKSEVFIDGLPGL-------PDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 8 ~~Ri~Ry~l-~G~--k~G~~evfid~LPG~-------PDNI~~~~~G~fWval~~~ 53 (160)
..+|+||.- .|. ..|+.|++++.+++- |.++..+++|.+|++....
T Consensus 89 ~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpDG~LYv~~G~~ 144 (367)
T TIGR02604 89 PPDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPDGWLYFNHGNT 144 (367)
T ss_pred CCeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCCCCEEEecccC
Confidence 457888843 343 234788999877652 6789999999999988753
No 12
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=85.40 E-value=22 Score=35.00 Aligned_cols=44 Identities=20% Similarity=0.105 Sum_probs=29.1
Q ss_pred cccCCCcEEEEEEecCCCCCcceEeccc--------------CCCCCCceeecCCCC-eEEEe
Q psy4773 3 QTDYCNFSFWRYYLKGPKQGKSEVFIDG--------------LPGLPDNVKRDSKGN-FLVSL 50 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~--------------LPG~PDNI~~~~~G~-fWval 50 (160)
++|+..+||+++++.|. ...++.. .-..|-+|..|++|+ +||+=
T Consensus 584 VaDs~n~rI~v~d~~G~----~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaD 642 (1057)
T PLN02919 584 ISDSNHNRIVVTDLDGN----FIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVAD 642 (1057)
T ss_pred EEECCCCeEEEEeCCCC----EEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEe
Confidence 57899999999998874 2222221 112477888887765 66754
No 13
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.19 E-value=0.98 Score=23.46 Aligned_cols=13 Identities=31% Similarity=0.583 Sum_probs=10.0
Q ss_pred eeecCCCCeEEEe
Q psy4773 38 VKRDSKGNFLVSL 50 (160)
Q Consensus 38 I~~~~~G~fWval 50 (160)
|..|++|.+|||-
T Consensus 10 i~~D~~G~lWigT 22 (24)
T PF07494_consen 10 IYEDSDGNLWIGT 22 (24)
T ss_dssp EEE-TTSCEEEEE
T ss_pred EEEcCCcCEEEEe
Confidence 4578999999985
No 14
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=82.84 E-value=1.9 Score=39.08 Aligned_cols=45 Identities=27% Similarity=0.301 Sum_probs=31.6
Q ss_pred cEEEEEEecCCC----CCcceEecccC-----------------CCCCCceeecCCCCeEEEeecC
Q psy4773 9 FSFWRYYLKGPK----QGKSEVFIDGL-----------------PGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 9 ~Ri~Ry~l~G~k----~G~~evfid~L-----------------PG~PDNI~~~~~G~fWval~~~ 53 (160)
..|+||..++.. ..+.++|+..- -..||||..|++|++||+-=..
T Consensus 391 G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~~~~~~~f~sPDNL~~d~~G~LwI~eD~~ 456 (524)
T PF05787_consen 391 GQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSNKCDDNGFASPDNLAFDPDGNLWIQEDGG 456 (524)
T ss_pred cEEEEecccCCccccceeEEEEEEEecCcccccccccCcccCCCcCCCCceEECCCCCEEEEeCCC
Confidence 478898887641 23556666521 1279999999999999986443
No 15
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=82.62 E-value=1.8 Score=36.46 Aligned_cols=46 Identities=22% Similarity=0.326 Sum_probs=33.1
Q ss_pred CcEEEEEEecCC--CCCcceEecccCCCCCCc------eeecCCCCeEEEeecC
Q psy4773 8 NFSFWRYYLKGP--KQGKSEVFIDGLPGLPDN------VKRDSKGNFLVSLVCP 53 (160)
Q Consensus 8 ~~Ri~Ry~l~G~--k~G~~evfid~LPG~PDN------I~~~~~G~fWval~~~ 53 (160)
..||.||-++.. ..++.+++++.+|..+.+ |..+++|.+||++-..
T Consensus 81 ~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgpDG~LYvs~G~~ 134 (331)
T PF07995_consen 81 DNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGPDGKLYVSVGDG 134 (331)
T ss_dssp EEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-TTSEEEEEEB-T
T ss_pred ceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCCCCcEEEEeCCC
Confidence 468999999765 456677777768864444 6788999999999876
No 16
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=82.17 E-value=3.4 Score=33.85 Aligned_cols=49 Identities=24% Similarity=0.377 Sum_probs=30.5
Q ss_pred eeeecCCeEEEEEEeCCCeEEEEEEcCCC------CCcCceEEEEE-CCEEEEecC
Q psy4773 100 ILFLQGTRYTLLVISSQGEIVDALHSVDG------SLKGSSDVEEY-NGAYYFGSP 148 (160)
Q Consensus 100 ~~~~~~~~~~vv~vd~~G~il~~l~d~~g------~~~~iS~v~e~-~g~LylGS~ 148 (160)
++.++.+..+++++|.+|+++..+.-..+ .++..-.+... +|.||+-|=
T Consensus 185 lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE 240 (248)
T PF06977_consen 185 LLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE 240 (248)
T ss_dssp EEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET
T ss_pred EEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC
Confidence 45567777899999999999999987653 35666666664 599999983
No 17
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=80.09 E-value=0.91 Score=24.39 Aligned_cols=19 Identities=37% Similarity=0.452 Sum_probs=16.3
Q ss_pred CCCceeecCCCCeEEEeec
Q psy4773 34 LPDNVKRDSKGNFLVSLVC 52 (160)
Q Consensus 34 ~PDNI~~~~~G~fWval~~ 52 (160)
.|-+|..+++|.+||+=..
T Consensus 3 ~P~gvav~~~g~i~VaD~~ 21 (28)
T PF01436_consen 3 YPHGVAVDSDGNIYVADSG 21 (28)
T ss_dssp SEEEEEEETTSEEEEEECC
T ss_pred CCcEEEEeCCCCEEEEECC
Confidence 5889999999999998744
No 18
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=79.07 E-value=17 Score=30.93 Aligned_cols=29 Identities=24% Similarity=0.373 Sum_probs=18.2
Q ss_pred CCCCcceEecccCCCCCCc-------eeecCCCCeEEEeec
Q psy4773 19 PKQGKSEVFIDGLPGLPDN-------VKRDSKGNFLVSLVC 52 (160)
Q Consensus 19 ~k~G~~evfid~LPG~PDN-------I~~~~~G~fWval~~ 52 (160)
|++|..|+|.. |+. |-.|+-|+.|+.-..
T Consensus 217 p~~~~aev~p~-----P~~~~~gsRriwsdpig~~wittwg 252 (353)
T COG4257 217 PFAGHAEVVPQ-----PNALKAGSRRIWSDPIGRAWITTWG 252 (353)
T ss_pred cccCCcceecC-----CCcccccccccccCccCcEEEeccC
Confidence 45666676654 555 345677888887544
No 19
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=79.00 E-value=34 Score=27.99 Aligned_cols=39 Identities=28% Similarity=0.601 Sum_probs=20.7
Q ss_pred CcEEEEEEecCCCCCcceEecc-cCCCCCCceeecCCCCeEE
Q psy4773 8 NFSFWRYYLKGPKQGKSEVFID-GLPGLPDNVKRDSKGNFLV 48 (160)
Q Consensus 8 ~~Ri~Ry~l~G~k~G~~evfid-~LPG~PDNI~~~~~G~fWv 48 (160)
..+|.-|.+++ .|+.+..-. .++|-|..|..+++|++..
T Consensus 56 ~~~i~~~~~~~--~g~l~~~~~~~~~~~p~~i~~~~~g~~l~ 95 (330)
T PRK11028 56 EFRVLSYRIAD--DGALTFAAESPLPGSPTHISTDHQGRFLF 95 (330)
T ss_pred CCcEEEEEECC--CCceEEeeeecCCCCceEEEECCCCCEEE
Confidence 35566666652 233221111 2556778888888876433
No 20
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=77.56 E-value=1.7 Score=36.61 Aligned_cols=49 Identities=29% Similarity=0.468 Sum_probs=39.0
Q ss_pred cccCCCcEEEEEEec-CCCCCcceEecccCCCCCCceeecCCCCeEEEee
Q psy4773 3 QTDYCNFSFWRYYLK-GPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLV 51 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~-G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~ 51 (160)
+++-..+||+++.++ +.+..+.+.|+.+..+.|-.|..++||.+||+-.
T Consensus 275 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~pDG~Lyv~~d 324 (331)
T PF07995_consen 275 VADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQGPDGALYVSDD 324 (331)
T ss_dssp EEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEETTSEEEEEE-
T ss_pred EecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEcCCCeEEEEEC
Confidence 466778999999997 3345678888887888899999999999999865
No 21
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=72.56 E-value=11 Score=32.12 Aligned_cols=52 Identities=23% Similarity=0.392 Sum_probs=37.3
Q ss_pred eeeeecCCeEEEEEEeCCCeEEEEEEcCCC------CCcCceEEEE-ECCEEEEecCCC
Q psy4773 99 SILFLQGTRYTLLVISSQGEIVDALHSVDG------SLKGSSDVEE-YNGAYYFGSPIS 150 (160)
Q Consensus 99 ~~~~~~~~~~~vv~vd~~G~il~~l~d~~g------~~~~iS~v~e-~~g~LylGS~~~ 150 (160)
+++.++.+..+++++|.+|++++.+....| .++..-.|.- .+|.||+-|=-+
T Consensus 246 ~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvSEPn 304 (316)
T COG3204 246 SLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVSEPN 304 (316)
T ss_pred cEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEecCC
Confidence 456677888999999999999998876544 2344444544 359999987433
No 22
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=68.82 E-value=26 Score=31.36 Aligned_cols=46 Identities=15% Similarity=0.184 Sum_probs=31.8
Q ss_pred CcEEEEEEecC--CCCCcceEecccCCCCC----CceeecCCCCeEEEeecC
Q psy4773 8 NFSFWRYYLKG--PKQGKSEVFIDGLPGLP----DNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 8 ~~Ri~Ry~l~G--~k~G~~evfid~LPG~P----DNI~~~~~G~fWval~~~ 53 (160)
..||.||-+.. .+....++.++.+|... -.|..++||.+||++-..
T Consensus 115 ~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgPDG~LYVs~GD~ 166 (454)
T TIGR03606 115 HTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGPDGKIYYTIGEQ 166 (454)
T ss_pred CcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECCCCcEEEEECCC
Confidence 57999999963 23333444455588532 367789999999998764
No 23
>KOG0639|consensus
Probab=66.70 E-value=33 Score=31.42 Aligned_cols=63 Identities=21% Similarity=0.281 Sum_probs=42.3
Q ss_pred CCCcEEEEEE-ecCCCCCcceEecccCCCC-CCceee----cCCCC-eEEEeecCCCCcChhhhhhcCCChHHHHHH
Q psy4773 6 YCNFSFWRYY-LKGPKQGKSEVFIDGLPGL-PDNVKR----DSKGN-FLVSLVCPVDEYTPQLLHIIGPFPNIRKFV 75 (160)
Q Consensus 6 t~~~Ri~Ry~-l~G~k~G~~evfid~LPG~-PDNI~~----~~~G~-fWval~~~r~~~~~~~~~~l~~~P~lRk~i 75 (160)
|+..++.++| |+++ |..-... .|+.+ +||--+ -+||+ +.||--.. +-+++|+.++.|.++--+
T Consensus 436 TgGkgcVKVWdis~p--g~k~Pvs-qLdcl~rdnyiRSckL~pdgrtLivGGeas----tlsiWDLAapTprikael 505 (705)
T KOG0639|consen 436 TGGKGCVKVWDISQP--GNKSPVS-QLDCLNRDNYIRSCKLLPDGRTLIVGGEAS----TLSIWDLAAPTPRIKAEL 505 (705)
T ss_pred ecCCCeEEEeeccCC--CCCCccc-cccccCcccceeeeEecCCCceEEeccccc----eeeeeeccCCCcchhhhc
Confidence 6777888887 4555 4444444 48887 999754 47776 55555432 446899999999887543
No 24
>KOG1520|consensus
Probab=66.35 E-value=6.1 Score=34.42 Aligned_cols=45 Identities=20% Similarity=0.365 Sum_probs=35.8
Q ss_pred cCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773 5 DYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 5 Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~ 53 (160)
-....|++||+-+ +-++++++|+|- ||.++..++++.|-+-+-+.
T Consensus 195 g~~~GRl~~YD~~---tK~~~VLld~L~-F~NGlaLS~d~sfvl~~Et~ 239 (376)
T KOG1520|consen 195 GDPTGRLFRYDPS---TKVTKVLLDGLY-FPNGLALSPDGSFVLVAETT 239 (376)
T ss_pred CCCccceEEecCc---ccchhhhhhccc-ccccccCCCCCCEEEEEeec
Confidence 3467899999754 467889999998 79999999999987655443
No 25
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=64.65 E-value=5.8 Score=36.47 Aligned_cols=45 Identities=18% Similarity=0.203 Sum_probs=30.0
Q ss_pred cEEEEEEecCC----CCCcceEeccc-----C------------CCCCCceeecCCCCeEEEeecC
Q psy4773 9 FSFWRYYLKGP----KQGKSEVFIDG-----L------------PGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 9 ~Ri~Ry~l~G~----k~G~~evfid~-----L------------PG~PDNI~~~~~G~fWval~~~ 53 (160)
.+|.||--.+. ..++-|+|+.- + =+.||||..|+.|++||+--..
T Consensus 455 G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~~~~f~~PDnl~fD~~GrLWi~TDg~ 520 (616)
T COG3211 455 GQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANINANWFNSPDNLAFDPWGRLWIQTDGS 520 (616)
T ss_pred cceEEEecCCCCccCccceeeeeeecCCccccccccccCcccccccCCCceEECCCCCEEEEecCC
Confidence 46777655432 23467777751 1 1239999999999999987543
No 26
>KOG0404|consensus
Probab=63.84 E-value=3.5 Score=34.13 Aligned_cols=16 Identities=25% Similarity=0.775 Sum_probs=13.0
Q ss_pred ceEecccCCCCCCcee
Q psy4773 24 SEVFIDGLPGLPDNVK 39 (160)
Q Consensus 24 ~evfid~LPG~PDNI~ 39 (160)
+.+=++|.|||||+|.
T Consensus 53 TTT~veNfPGFPdgi~ 68 (322)
T KOG0404|consen 53 TTTDVENFPGFPDGIT 68 (322)
T ss_pred eeeccccCCCCCcccc
Confidence 4455789999999996
No 27
>cd08033 LARP_6 La RNA-binding domain of La-related protein 6. This domain is found in animal and plant proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=63.25 E-value=3 Score=28.17 Aligned_cols=14 Identities=36% Similarity=0.479 Sum_probs=10.7
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+-+++++++|+-|+
T Consensus 63 S~~lev~~d~~~VR 76 (77)
T cd08033 63 SSKLVVSEDGKKVR 76 (77)
T ss_pred CCeEEEcCCCCccC
Confidence 44789999998664
No 28
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=62.07 E-value=86 Score=25.57 Aligned_cols=40 Identities=23% Similarity=0.297 Sum_probs=21.7
Q ss_pred CcEEEEEEecCCCCCcceEecccCCCCCC---------ceeecCCCCe-EEE
Q psy4773 8 NFSFWRYYLKGPKQGKSEVFIDGLPGLPD---------NVKRDSKGNF-LVS 49 (160)
Q Consensus 8 ~~Ri~Ry~l~G~k~G~~evfid~LPG~PD---------NI~~~~~G~f-Wva 49 (160)
...|..|.++. ..|+.+.... +...|+ .|..+++|++ +++
T Consensus 196 ~~~v~v~~~~~-~~~~~~~~~~-~~~~p~~~~~~~~~~~i~~~pdg~~lyv~ 245 (330)
T PRK11028 196 NSSVDVWQLKD-PHGEIECVQT-LDMMPADFSDTRWAADIHITPDGRHLYAC 245 (330)
T ss_pred CCEEEEEEEeC-CCCCEEEEEE-EecCCCcCCCCccceeEEECCCCCEEEEe
Confidence 56777777763 2344444332 333333 4777888874 443
No 29
>COG2088 SpoVG Uncharacterized protein, involved in the regulation of septum location [Cell envelope biogenesis, outer membrane]
Probab=60.78 E-value=26 Score=24.47 Aligned_cols=69 Identities=10% Similarity=0.161 Sum_probs=39.9
Q ss_pred CCcEEEEEEecCCCCCcceEecccCCCCCCcee-ecCCCCeEEEeecCCCCcChhhhhhcCC-ChHHHHHHHH
Q psy4773 7 CNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVK-RDSKGNFLVSLVCPVDEYTPQLLHIIGP-FPNIRKFVAR 77 (160)
Q Consensus 7 ~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~-~~~~G~fWval~~~r~~~~~~~~~~l~~-~P~lRk~i~~ 77 (160)
+.-||.|..=+|.-.--..+=+||-= .--+|+ .+.+-++|||+++.|. ++..+-|..|+ ++..|+-|..
T Consensus 4 TdVRirkv~~dgrmkA~vsvT~D~ef-VvhdirVi~G~~GlfVAMPSrrt-~dgEFrDI~HPI~~~~R~kIq~ 74 (95)
T COG2088 4 TDVRIRKVDTDGRMKAYVSVTLDNEF-VVHDIRVIEGNNGLFVAMPSRRT-PDGEFRDIAHPINSDTREKIQD 74 (95)
T ss_pred eeEEEEEecCCCcEEEEEEEEecceE-EEeccEEEeCCcceEEEccCccC-CCcchhhccCcCCHHHHHHHHH
Confidence 34577777655431111122222111 123444 4566679999999876 55578888887 6788876554
No 30
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=55.79 E-value=65 Score=26.66 Aligned_cols=64 Identities=14% Similarity=0.090 Sum_probs=40.6
Q ss_pred CCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCccccceeccceeeeeeecCCeEEE
Q psy4773 31 LPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHVVGHFESILFLQGTRYTL 110 (160)
Q Consensus 31 LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 110 (160)
-++--|++..|++|.++.+.... ..|
T Consensus 184 k~~~s~g~~~D~~G~ly~~~~~~------------------------------------------------------~aI 209 (287)
T PF03022_consen 184 KGSQSDGMAIDPNGNLYFTDVEQ------------------------------------------------------NAI 209 (287)
T ss_dssp ---SECEEEEETTTEEEEEECCC------------------------------------------------------TEE
T ss_pred cCCCCceEEECCCCcEEEecCCC------------------------------------------------------CeE
Confidence 44567999999999999888663 236
Q ss_pred EEEeCCC-----eEEEEEEcCCCCCcCceEEEEEC---CEEEEecCC
Q psy4773 111 LVISSQG-----EIVDALHSVDGSLKGSSDVEEYN---GAYYFGSPI 149 (160)
Q Consensus 111 v~vd~~G-----~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~~ 149 (160)
.+.|+++ +.--..+|+. .+..+++..... |+||+-|-.
T Consensus 210 ~~w~~~~~~~~~~~~~l~~d~~-~l~~pd~~~i~~~~~g~L~v~snr 255 (287)
T PF03022_consen 210 GCWDPDGPYTPENFEILAQDPR-TLQWPDGLKIDPEGDGYLWVLSNR 255 (287)
T ss_dssp EEEETTTSB-GCCEEEEEE-CC--GSSEEEEEE-T--TS-EEEEE-S
T ss_pred EEEeCCCCcCccchheeEEcCc-eeeccceeeeccccCceEEEEECc
Confidence 7788888 5545556654 355667776655 999997643
No 31
>PHA02865 MHC-like TNF binding protein; Provisional
Probab=53.73 E-value=12 Score=32.30 Aligned_cols=23 Identities=39% Similarity=0.720 Sum_probs=19.5
Q ss_pred CCCCCCceeecCCC-Ce--EEEeecC
Q psy4773 31 LPGLPDNVKRDSKG-NF--LVSLVCP 53 (160)
Q Consensus 31 LPG~PDNI~~~~~G-~f--Wval~~~ 53 (160)
-||||-|++.|.|| +| |+++..+
T Consensus 264 e~~~~~~~~pn~DggTfQ~~~~v~v~ 289 (338)
T PHA02865 264 EPGFPTNTKKDNDKNTFSSTPSVRVP 289 (338)
T ss_pred ccccccceeeCCCCCeeEEEEEEEeC
Confidence 58999999999995 75 9988665
No 32
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=52.78 E-value=44 Score=27.32 Aligned_cols=54 Identities=13% Similarity=0.190 Sum_probs=30.4
Q ss_pred eeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE-CCEEEEecCCCCeEEEE
Q psy4773 101 LFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY-NGAYYFGSPISKHLARV 156 (160)
Q Consensus 101 ~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~-~g~LylGS~~~~~i~~~ 156 (160)
.....+.+.++++|.+|++++.+.-.. +...-++... +|.+.+.+=...-+-++
T Consensus 37 faV~d~~~~i~els~~G~vlr~i~l~g--~~D~EgI~y~g~~~~vl~~Er~~~L~~~ 91 (248)
T PF06977_consen 37 FAVQDEPGEIYELSLDGKVLRRIPLDG--FGDYEGITYLGNGRYVLSEERDQRLYIF 91 (248)
T ss_dssp EEEETTTTEEEEEETT--EEEEEE-SS---SSEEEEEE-STTEEEEEETTTTEEEEE
T ss_pred EEEECCCCEEEEEcCCCCEEEEEeCCC--CCCceeEEEECCCEEEEEEcCCCcEEEE
Confidence 345566788999999999999997543 2233333333 46666666334444433
No 33
>cd08030 LA_like_plant La-motif domain of plant proteins similar to the La autoantigen. This domain is found in plant proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=50.73 E-value=19 Score=25.03 Aligned_cols=14 Identities=43% Similarity=0.529 Sum_probs=10.6
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+-++++++||+-|+
T Consensus 76 S~~levseD~~~VR 89 (90)
T cd08030 76 STLLKVSEDGKRVG 89 (90)
T ss_pred CCEEEEcCCCCccC
Confidence 44789999997653
No 34
>cd08031 LARP_4_5_like La RNA-binding domain of proteins similar to La-related proteins 4 and 5. This domain is found in proteins similar to La-related proteins 4 and 5 (LARP4, LARP5). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=48.84 E-value=18 Score=24.28 Aligned_cols=14 Identities=21% Similarity=0.287 Sum_probs=10.5
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+.+++++++|+-|+
T Consensus 61 S~~lev~ed~~~VR 74 (75)
T cd08031 61 SPNVQVDEKGEKVR 74 (75)
T ss_pred CCeEEEcCCCCccC
Confidence 44788999998653
No 35
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=46.35 E-value=32 Score=29.17 Aligned_cols=50 Identities=24% Similarity=0.197 Sum_probs=35.4
Q ss_pred cccCCCcEEEEEEecCCC---CCcceEecccC-C-CCCCceeecCCCCeEEEeec
Q psy4773 3 QTDYCNFSFWRYYLKGPK---QGKSEVFIDGL-P-GLPDNVKRDSKGNFLVSLVC 52 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k---~G~~evfid~L-P-G~PDNI~~~~~G~fWval~~ 52 (160)
++|-...+|.|+.|+... +++.+.|+... . +-|=.+..++||.+||+=..
T Consensus 286 v~~~~~~~v~~~~l~~~g~~~~~~~~~~l~~~~~~~rp~dv~~~pDG~Lyv~d~~ 340 (367)
T TIGR02604 286 VGDAHGQLIVRYSLEPKGAGFKGERPEFLRSNDTWFRPVNVTVGPDGALYVSDWY 340 (367)
T ss_pred eeeccCCEEEEEEeecCCCccEeecCceEecCCCcccccceeECCCCCEEEEEec
Confidence 467788999999997332 23334555532 2 67888999999999998644
No 36
>cd08035 LARP_4 La RNA-binding domain of La-related protein 4. This domain is found in vertebrate La-related protein 4 (LARP4), also known as c-MPL binding protein. La-type domains often co-occur with RNA-recognition motifs (RRMs). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=44.47 E-value=8.1 Score=26.04 Aligned_cols=14 Identities=21% Similarity=0.389 Sum_probs=10.5
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+..+++|++|+-|+
T Consensus 61 S~~levsedg~kVR 74 (75)
T cd08035 61 SPMVQVDETGEKVR 74 (75)
T ss_pred CCeEEEcCCCCccC
Confidence 34689999998653
No 37
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=43.05 E-value=58 Score=30.26 Aligned_cols=48 Identities=15% Similarity=0.127 Sum_probs=26.4
Q ss_pred EEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEe
Q psy4773 110 LLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVP 157 (160)
Q Consensus 110 vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~ 157 (160)
|...++.|.-...+-++-..-.-...+....|++|+||.+.=.+.|.|
T Consensus 227 v~~~e~~G~~~sn~~~~lp~~~I~ll~qD~qG~lWiGTenGl~r~~l~ 274 (671)
T COG3292 227 VYLQEAEGWRASNWGPMLPSGNILLLVQDAQGELWIGTENGLWRTRLP 274 (671)
T ss_pred eEEEchhhccccccCCCCcchheeeeecccCCCEEEeecccceeEecC
Confidence 445556664444443322111112233344699999998876666665
No 38
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=41.64 E-value=87 Score=29.19 Aligned_cols=33 Identities=30% Similarity=0.497 Sum_probs=23.2
Q ss_pred CCCeEEEEEEcCCCCCcCceEEEEE-CCEEEEecCC
Q psy4773 115 SQGEIVDALHSVDGSLKGSSDVEEY-NGAYYFGSPI 149 (160)
Q Consensus 115 ~~G~il~~l~d~~g~~~~iS~v~e~-~g~LylGS~~ 149 (160)
..|.++.+.+--.+ ..||.+.+. +|.||+||..
T Consensus 362 stG~~v~sv~q~Rg--~nit~~~~d~~g~lWlgs~q 395 (671)
T COG3292 362 STGELVRSVHQLRG--MNITTTLEDSRGRLWLGSMQ 395 (671)
T ss_pred CCCcEEEEeeeccc--cccchhhhccCCcEEEEecc
Confidence 56888888765444 345555543 7999999976
No 39
>cd08036 LARP_5 La RNA-binding domain of La-related protein 5. This domain is found in vertebrate La-related protein 5 (LARP5). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=40.86 E-value=11 Score=25.51 Aligned_cols=14 Identities=29% Similarity=0.498 Sum_probs=10.7
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+..+++|++|+-++
T Consensus 61 S~~vevse~g~kVR 74 (75)
T cd08036 61 LPLVQVDEKGEKVR 74 (75)
T ss_pred CCeEEECCCCCccC
Confidence 44789999998653
No 40
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=40.04 E-value=84 Score=27.71 Aligned_cols=44 Identities=18% Similarity=0.187 Sum_probs=28.5
Q ss_pred EEEEEEec-CC-CCCcceEecccCCCCC----CceeecCCCCeEEEeecC
Q psy4773 10 SFWRYYLK-GP-KQGKSEVFIDGLPGLP----DNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 10 Ri~Ry~l~-G~-k~G~~evfid~LPG~P----DNI~~~~~G~fWval~~~ 53 (160)
|+-+.-+. |. |..+..+++++||+.+ -.|..++||.++|++-..
T Consensus 148 ~~~~~~~~~g~~~l~~~~~i~~~lP~~~~H~g~~l~f~pDG~Lyvs~G~~ 197 (399)
T COG2133 148 RVAIGRLPGGDTKLSEPKVIFRGIPKGGHHFGGRLVFGPDGKLYVTTGSN 197 (399)
T ss_pred EEEEEEcCCCccccccccEEeecCCCCCCcCcccEEECCCCcEEEEeCCC
Confidence 44444443 42 4444455555699754 678899999999998775
No 41
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.63 E-value=68 Score=27.34 Aligned_cols=51 Identities=10% Similarity=0.216 Sum_probs=34.8
Q ss_pred eeecCCeEEEEEEeCCCeEEEEE-----EcCCC-CC---cCceEEEEECCEEEEecCCCC
Q psy4773 101 LFLQGTRYTLLVISSQGEIVDAL-----HSVDG-SL---KGSSDVEEYNGAYYFGSPISK 151 (160)
Q Consensus 101 ~~~~~~~~~vv~vd~~G~il~~l-----~d~~g-~~---~~iS~v~e~~g~LylGS~~~~ 151 (160)
-....+.+.+++++.+|++++.+ +||++ .+ ...--+.|.+..||+-++...
T Consensus 101 Fav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Ieyig~n~fvi~dER~~~l~~~~vd~~ 160 (316)
T COG3204 101 FAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEYIGGNQFVIVDERDRALYLFTVDAD 160 (316)
T ss_pred EEecCCCceEEEEecCCceEEEecccccCChhHeEEecCCEEEEEehhcceEEEEEEcCC
Confidence 34556778999999999999887 45543 11 222334467788888887765
No 42
>PF09154 DUF1939: Domain of unknown function (DUF1939); InterPro: IPR015237 This entry represents a C-terminal domain associated with prokaryotic alpha-amylases. It adopts a secondary structure consisting of an eight-stranded antiparallel beta-sheet containing a Greek key motif. Its exact function has not, as yet, been determined []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 1MXD_A 1MWO_A 1MXG_A 1W9X_A 2DIE_A 1VJS_A 1BPL_B 1BLI_A 1OB0_A 1E3Z_A ....
Probab=36.98 E-value=69 Score=20.18 Aligned_cols=38 Identities=21% Similarity=0.274 Sum_probs=25.8
Q ss_pred EEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773 12 WRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 12 ~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~ 53 (160)
.+.|+....+| ++|.|-+--..+.+..|++| |+-+..+
T Consensus 12 k~~~Vgt~~ag--~~~~D~tGn~~~~vtid~dG--~~~f~v~ 49 (57)
T PF09154_consen 12 KRMWVGTNWAG--KTFYDYTGNSSETVTIDEDG--WGEFPVP 49 (57)
T ss_dssp EEEEEEGGGTT--EEEEETTSSSSSEEEE-TTS--EEEEEE-
T ss_pred EEEEEccccCC--CEEEEccCCCCCeEEECCCe--EEEEEEC
Confidence 35566555565 59999555578899999999 7766554
No 43
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=35.47 E-value=53 Score=18.99 Aligned_cols=36 Identities=25% Similarity=0.270 Sum_probs=26.0
Q ss_pred cccCCCc-EEEEEEecCCCCCcceEecccCCCCCCceeec
Q psy4773 3 QTDYCNF-SFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRD 41 (160)
Q Consensus 3 ~~Et~~~-Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~ 41 (160)
++|.... +|.+-.+.|.. .++++.+--.-|.+|..|
T Consensus 5 WtD~~~~~~I~~a~~dGs~---~~~vi~~~l~~P~giaVD 41 (42)
T PF00058_consen 5 WTDWSQDPSIERANLDGSN---RRTVISDDLQHPEGIAVD 41 (42)
T ss_dssp EEETTTTEEEEEEETTSTS---EEEEEESSTSSEEEEEEE
T ss_pred EEECCCCcEEEEEECCCCC---eEEEEECCCCCcCEEEEC
Confidence 5677888 99999999853 455555334478888776
No 44
>COG4294 Uve UV damage repair endonuclease [DNA replication, recombination, and repair]
Probab=32.68 E-value=10 Score=32.28 Aligned_cols=36 Identities=42% Similarity=0.750 Sum_probs=26.1
Q ss_pred CCcccCCCcEEEEEEe---cCCCCCcceEecccCCCCCCcee
Q psy4773 1 MGQTDYCNFSFWRYYL---KGPKQGKSEVFIDGLPGLPDNVK 39 (160)
Q Consensus 1 ~~~~Et~~~Ri~Ry~l---~G~k~G~~evfid~LPG~PDNI~ 39 (160)
||++| + -++-..+ .|-|.++.|-|+.|.+-+||+|+
T Consensus 167 mgl~~--R-s~~~lhlgg~~gGK~~~~eqf~kni~~LP~~vk 205 (347)
T COG4294 167 MGLAE--R-SVWNLHLGGTHGGKKERLEQFIKNIQRLPDSVK 205 (347)
T ss_pred cCCCc--C-CceEEEeccccCCchhHHHHHHHHHhhcCHHHH
Confidence 67777 2 2333444 45567778999999999999996
No 45
>cd08032 LARP_7 La RNA-binding domain of La-related protein 7. LARP7 is a component of the 7SK snRNP, a key factor in the regulation of RNA polymerase II transcription. 7SK functionality is dependent on the presence of LARP7, which is thought to stabilize the 7SK RNA by interacting with its 3' end. The release of 7SK RNA from P-TEFb/HEXIM/7SK complexes activates the cyclin-dependent kinase P-TEFb, which in turn phosphorylates the C-terminal domain of RNA pol II and mediates a transition into productive transcription elongation.
Probab=32.57 E-value=20 Score=24.47 Aligned_cols=14 Identities=7% Similarity=0.275 Sum_probs=10.6
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+-++++|++|+-|+
T Consensus 68 S~~lev~ed~~~VR 81 (82)
T cd08032 68 SSVVELNLEGTRIR 81 (82)
T ss_pred CCEEEEcCCCCccC
Confidence 45789999998653
No 46
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=32.37 E-value=95 Score=17.26 Aligned_cols=20 Identities=20% Similarity=0.147 Sum_probs=12.3
Q ss_pred ceEEEEECCEEEEecCCCCe
Q psy4773 133 SSDVEEYNGAYYFGSPISKH 152 (160)
Q Consensus 133 iS~v~e~~g~LylGS~~~~~ 152 (160)
.++....+|+||+|+.....
T Consensus 14 ~~~~~v~~g~vyv~~~dg~l 33 (40)
T PF13570_consen 14 WSSPAVAGGRVYVGTGDGNL 33 (40)
T ss_dssp -S--EECTSEEEEE-TTSEE
T ss_pred CcCCEEECCEEEEEcCCCEE
Confidence 35557779999999985543
No 47
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=32.27 E-value=42 Score=19.17 Aligned_cols=20 Identities=25% Similarity=0.315 Sum_probs=16.6
Q ss_pred CCCceeecCCCCeEEEeecC
Q psy4773 34 LPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 34 ~PDNI~~~~~G~fWval~~~ 53 (160)
.|-+|..|++|+++|+-.+-
T Consensus 14 ~~~~IavD~~GNiYv~G~T~ 33 (38)
T PF06739_consen 14 YGNGIAVDSNGNIYVTGYTN 33 (38)
T ss_pred eEEEEEECCCCCEEEEEeec
Confidence 46789999999999987663
No 48
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9 interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=32.04 E-value=30 Score=23.39 Aligned_cols=28 Identities=18% Similarity=0.404 Sum_probs=19.7
Q ss_pred CcceEecccCCCC-CCceeecCCCC-eEEEe
Q psy4773 22 GKSEVFIDGLPGL-PDNVKRDSKGN-FLVSL 50 (160)
Q Consensus 22 G~~evfid~LPG~-PDNI~~~~~G~-fWval 50 (160)
...++-++ |||+ ||+|...-+++ +.|..
T Consensus 7 d~~~v~~d-lpG~~~edI~V~v~~~~L~I~g 36 (87)
T cd06481 7 EGFSLKLD-VRGFSPEDLSVRVDGRKLVVTG 36 (87)
T ss_pred ceEEEEEE-CCCCChHHeEEEEECCEEEEEE
Confidence 34567776 9997 99999876554 55543
No 49
>cd08029 LA_like_fungal La-motif domain of fungal proteins similar to the La autoantigen. This domain is found in fungal proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=31.91 E-value=42 Score=22.46 Aligned_cols=14 Identities=43% Similarity=0.501 Sum_probs=10.6
Q ss_pred EEEEEEeCCCeEEE
Q psy4773 108 YTLLVISSQGEIVD 121 (160)
Q Consensus 108 ~~vv~vd~~G~il~ 121 (160)
+-++++|++|+-|+
T Consensus 62 S~~lev~~d~~~VR 75 (76)
T cd08029 62 SELLEVSEDGENVR 75 (76)
T ss_pred CCeEEEeCCCCccc
Confidence 34789999998664
No 50
>PF11208 DUF2992: Protein of unknown function (DUF2992); InterPro: IPR016787 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=31.80 E-value=27 Score=25.97 Aligned_cols=12 Identities=25% Similarity=0.343 Sum_probs=9.9
Q ss_pred CCCeEEEeecCC
Q psy4773 43 KGNFLVSLVCPV 54 (160)
Q Consensus 43 ~G~fWval~~~r 54 (160)
||.||||++.-.
T Consensus 6 dg~FWvGv~E~~ 17 (132)
T PF11208_consen 6 DGPFWVGVFERH 17 (132)
T ss_pred cCCcEEEEEEEE
Confidence 688999998853
No 51
>PF06236 MelC1: Tyrosinase co-factor MelC1; InterPro: IPR010928 This family consists of several tyrosinase co-factor MELC1 proteins from a number of Streptomyces species. The melanin operon (melC) of Streptomyces antibioticus contains two genes, melC1 and melC2 (apotyrosinase). It is thought that MelC1 forms a transient binary complex with the downstream apotyrosinase MelC2 to facilitate the incorporation of copper ion and the secretion of tyrosinase indicating that MelC1 is a chaperone for the apotyrosinase MelC2 [].; GO: 0005507 copper ion binding, 0042438 melanin biosynthetic process; PDB: 1WX4_B 2ZWD_B 2ZMZ_B 1WX2_B 1WX5_D 3AWX_B 3AWS_B 2ZMY_B 3AX0_B 2ZWG_B ....
Probab=31.43 E-value=43 Score=24.80 Aligned_cols=47 Identities=26% Similarity=0.193 Sum_probs=29.4
Q ss_pred CcccCCCcEEEEEEecCC-----CCCcceEecccCCCCCCceeecCCCCeEEEeec
Q psy4773 2 GQTDYCNFSFWRYYLKGP-----KQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVC 52 (160)
Q Consensus 2 ~~~Et~~~Ri~Ry~l~G~-----k~G~~evfid~LPG~PDNI~~~~~G~fWval~~ 52 (160)
.|.|+.+.|-.+-+.... ..+..+|+|| |-|=-+.+..|| =|+..+.
T Consensus 44 ~FdE~YrGRrI~g~~~~~~~~~~~~~~~~V~ID---Gr~LhvMr~ADG-swlS~V~ 95 (125)
T PF06236_consen 44 SFDEVYRGRRIQGTPAAAGGHAAHGGGYEVTID---GRPLHVMRRADG-SWLSVVN 95 (125)
T ss_dssp -EEEEETTEEEEEEE-----------SEEEEET---TEEE-EEE-TTS--EEETTE
T ss_pred ccceeecceeEEeeeCCCCccccCCCceEEEEC---CeEeeeEEcCCC-CEEeeee
Confidence 488999999888777532 2345689999 666678888888 5776644
No 52
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=31.12 E-value=2.9e+02 Score=23.80 Aligned_cols=30 Identities=17% Similarity=0.122 Sum_probs=24.7
Q ss_pred CcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773 130 LKGSSDVEEYNGAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 130 ~~~iS~v~e~~g~LylGS~~~~~i~~~~l~ 159 (160)
+.+.-+-..++|+||+-......+.+++++
T Consensus 202 LsmPhSPRWhdgrLwvldsgtGev~~vD~~ 231 (335)
T TIGR03032 202 LSMPHSPRWYQGKLWLLNSGRGELGYVDPQ 231 (335)
T ss_pred ccCCcCCcEeCCeEEEEECCCCEEEEEcCC
Confidence 455566778999999999999999988864
No 53
>PF06079 Apyrase: Apyrase; InterPro: IPR009283 This family consists of several eukaryotic apyrase (or adenosine diphosphatase) proteins (3.6.1.5 from EC), and related nucleoside diphosphatases (3.6.1.6 from EC). The salivary apyrases of blood-feeding arthropods are nucleotide hydrolysing enzymes implicated in the inhibition of host platelet aggregation through the hydrolysis of extracellular adenosine diphosphate [].; GO: 0005509 calcium ion binding, 0016462 pyrophosphatase activity; PDB: 2H2N_A 1S18_A 2H2U_A 1S1D_B.
Probab=30.22 E-value=2.6e+02 Score=23.69 Aligned_cols=53 Identities=15% Similarity=0.122 Sum_probs=29.0
Q ss_pred eeeecCCeEEEEEEeCCCeEEEE--EEcCCCCC--cCceE-EEEECCEEEEecCCCCeE
Q psy4773 100 ILFLQGTRYTLLVISSQGEIVDA--LHSVDGSL--KGSSD-VEEYNGAYYFGSPISKHL 153 (160)
Q Consensus 100 ~~~~~~~~~~vv~vd~~G~il~~--l~d~~g~~--~~iS~-v~e~~g~LylGS~~~~~i 153 (160)
++.+--+.|+|.++..+ +.+-- |.|-+|.. .+=++ +...++.||+||....+.
T Consensus 66 Lys~DDrTGiVyeI~~~-~~vPwviL~dGdG~~~kGfK~EWaTVKd~~LyvGs~Gkewt 123 (291)
T PF06079_consen 66 LYSFDDRTGIVYEIKGD-KAVPWVILSDGDGNTSKGFKAEWATVKDDKLYVGSIGKEWT 123 (291)
T ss_dssp EEEEETTT-EEEEEETT-EEEEEEE-BSTTTTESSB----EEEEETTEEEEE--SS-EE
T ss_pred EeeeeCCCceEEEEeCC-ceeceEEEeCCCCCccccccceeeEEeCCeeeeccCCCceE
Confidence 34456677999999865 66554 44444431 11122 557799999999886554
No 54
>PTZ00486 apyrase Superfamily; Provisional
Probab=29.27 E-value=3.4e+02 Score=23.62 Aligned_cols=53 Identities=13% Similarity=0.131 Sum_probs=32.6
Q ss_pred eeeecCCeEEEEEEeCCCe-EE--EEEEcCCCCC--cCceE-EEEECCEEEEecCCCCe
Q psy4773 100 ILFLQGTRYTLLVISSQGE-IV--DALHSVDGSL--KGSSD-VEEYNGAYYFGSPISKH 152 (160)
Q Consensus 100 ~~~~~~~~~~vv~vd~~G~-il--~~l~d~~g~~--~~iS~-v~e~~g~LylGS~~~~~ 152 (160)
+...--+.|.|.+++.+++ .+ ..|.|-+|.. .+=.+ +...+++||+||....+
T Consensus 127 Lys~DDrTGiVy~i~~~~~~~~PwvIL~dGdG~~~kGfK~EWaTVKd~~LyVGs~Gkew 185 (352)
T PTZ00486 127 LYGFDDRTGIVYEIDIDKKKAYPRHILSDGNGNSDKGMKIEWATVYDDKLYVGSIGKEF 185 (352)
T ss_pred EEEEeCCceEEEEEEcCCCcEeeEEEEecCCCCCCCCcceeeEEEECCEEEEeccccee
Confidence 3455667799999985554 33 3445544421 22223 44589999999998443
No 55
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=29.22 E-value=92 Score=26.03 Aligned_cols=18 Identities=17% Similarity=0.289 Sum_probs=14.4
Q ss_pred CceeecCCCCeEEEeecC
Q psy4773 36 DNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 36 DNI~~~~~G~fWval~~~ 53 (160)
-.|...++|.|||+.-..
T Consensus 88 Egi~~~~~g~~~is~E~~ 105 (326)
T PF13449_consen 88 EGIAVPPDGSFWISSEGG 105 (326)
T ss_pred hHeEEecCCCEEEEeCCc
Confidence 378888899999988654
No 56
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=28.94 E-value=3.9e+02 Score=23.21 Aligned_cols=52 Identities=17% Similarity=0.116 Sum_probs=33.9
Q ss_pred EEEEEEeCC-CeEEEEEEcCC-CCCcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773 108 YTLLVISSQ-GEIVDALHSVD-GSLKGSSDVEEYNGAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 108 ~~vv~vd~~-G~il~~l~d~~-g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l~ 159 (160)
..++++|.+ |+.-..=+... |.-+-=.++.+.+.+||..+.....|.++|+.
T Consensus 66 vaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~~ 119 (346)
T COG2706 66 VAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSVSVYPLQ 119 (346)
T ss_pred EEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccCceEEEEEcc
Confidence 456788865 77544333333 33222234556677999999999999999974
No 57
>PHA02598 denA endonuclease II; Provisional
Probab=28.65 E-value=71 Score=24.02 Aligned_cols=39 Identities=13% Similarity=0.143 Sum_probs=26.7
Q ss_pred EEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecC
Q psy4773 110 LLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSP 148 (160)
Q Consensus 110 vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~ 148 (160)
=+.+|.+|.|.+++-........|=.....|..||+|-.
T Consensus 13 ~l~l~~~~~i~~~f~~~~~~~n~VY~~~~~~~viYVGKA 51 (138)
T PHA02598 13 ELELDKNGRIDRSFIKCPNKKNVIYAIAVDDELVYIGKT 51 (138)
T ss_pred EEEecCCCcCcccccCCcccceEEEEEEeCCeEEEEeeh
Confidence 367899999999987543333344444456788999864
No 58
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=27.91 E-value=1.1e+02 Score=20.93 Aligned_cols=26 Identities=31% Similarity=0.727 Sum_probs=21.5
Q ss_pred CCCcceEecccCCC--CCCceeecCCCC
Q psy4773 20 KQGKSEVFIDGLPG--LPDNVKRDSKGN 45 (160)
Q Consensus 20 k~G~~evfid~LPG--~PDNI~~~~~G~ 45 (160)
++|..++.+.+||. -||-|+....|+
T Consensus 20 ~~G~~~i~~~~Lp~~~d~~Sl~V~~~g~ 47 (104)
T PF13600_consen 20 PAGENEIIFEGLPPSLDPDSLRVSGEGG 47 (104)
T ss_pred CCCceEEEEeCCCcccCCCcEEEEecCC
Confidence 46899999999996 677888888776
No 59
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=27.67 E-value=3.5e+02 Score=22.29 Aligned_cols=52 Identities=13% Similarity=0.011 Sum_probs=37.8
Q ss_pred EEEEEEe-CCCeEEEEEEcCCCC---CcCceEEEEEC-------CEEEEecCCCCeEEEEeCC
Q psy4773 108 YTLLVIS-SQGEIVDALHSVDGS---LKGSSDVEEYN-------GAYYFGSPISKHLARVPLA 159 (160)
Q Consensus 108 ~~vv~vd-~~G~il~~l~d~~g~---~~~iS~v~e~~-------g~LylGS~~~~~i~~~~l~ 159 (160)
..++.+| .++++++.++-|... -+.+...+... ++.|+.-...+.|-++++.
T Consensus 34 pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~ 96 (287)
T PF03022_consen 34 PKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLA 96 (287)
T ss_dssp -EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETT
T ss_pred cEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEcc
Confidence 5678899 578999999987643 35666666544 6999999999988888775
No 60
>KOG0639|consensus
Probab=26.61 E-value=72 Score=29.33 Aligned_cols=44 Identities=9% Similarity=0.110 Sum_probs=32.2
Q ss_pred cccCCCcEEEEEEecCCCCCcceEecccCCCCCCcee---ecCCC-CeEEEeec
Q psy4773 3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVK---RDSKG-NFLVSLVC 52 (160)
Q Consensus 3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~---~~~~G-~fWval~~ 52 (160)
|+...-.-|.-++|.. ++.+..++|++|+.+ ...+| ++|-|-..
T Consensus 525 FsccsdGnI~vwDLhn------q~~VrqfqGhtDGascIdis~dGtklWTGGlD 572 (705)
T KOG0639|consen 525 FSCCSDGNIAVWDLHN------QTLVRQFQGHTDGASCIDISKDGTKLWTGGLD 572 (705)
T ss_pred eeeccCCcEEEEEccc------ceeeecccCCCCCceeEEecCCCceeecCCCc
Confidence 5555666677777774 577888999999976 45667 49987644
No 61
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=26.48 E-value=4.7e+02 Score=23.42 Aligned_cols=29 Identities=17% Similarity=0.193 Sum_probs=23.3
Q ss_pred CcceEecccCCCCCCceeecCCCCeEEEee
Q psy4773 22 GKSEVFIDGLPGLPDNVKRDSKGNFLVSLV 51 (160)
Q Consensus 22 G~~evfid~LPG~PDNI~~~~~G~fWval~ 51 (160)
=+.++++++|. .|=.|...++|++||+--
T Consensus 20 f~~~~va~GL~-~Pw~maflPDG~llVtER 48 (454)
T TIGR03606 20 FDKKVLLSGLN-KPWALLWGPDNQLWVTER 48 (454)
T ss_pred cEEEEEECCCC-CceEEEEcCCCeEEEEEe
Confidence 35788888887 588898888898888764
No 62
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=25.44 E-value=2.5e+02 Score=23.73 Aligned_cols=43 Identities=12% Similarity=-0.029 Sum_probs=28.3
Q ss_pred eEEEEEEe-CCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCC
Q psy4773 107 RYTLLVIS-SQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISK 151 (160)
Q Consensus 107 ~~~vv~vd-~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~ 151 (160)
.+.+..+| .+|+++...+-..+. ..++-...+++||+++-...
T Consensus 344 ~G~l~~ld~~tG~~~~~~~~~~~~--~~s~P~~~~~~l~v~t~~G~ 387 (394)
T PRK11138 344 EGYLHWINREDGRFVAQQKVDSSG--FLSEPVVADDKLLIQARDGT 387 (394)
T ss_pred CCEEEEEECCCCCEEEEEEcCCCc--ceeCCEEECCEEEEEeCCce
Confidence 35566777 478888877643221 23444567999999987654
No 63
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=24.37 E-value=1.3e+02 Score=16.29 Aligned_cols=19 Identities=11% Similarity=0.128 Sum_probs=11.0
Q ss_pred CCEEEEecCCCCeEEEEeC
Q psy4773 140 NGAYYFGSPISKHLARVPL 158 (160)
Q Consensus 140 ~g~LylGS~~~~~i~~~~l 158 (160)
+++||+.....+.|..+++
T Consensus 3 ~~~lyv~~~~~~~v~~id~ 21 (42)
T TIGR02276 3 GTKLYVTNSGSNTVSVIDT 21 (42)
T ss_pred CCEEEEEeCCCCEEEEEEC
Confidence 3456666666666665553
No 64
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=24.31 E-value=2.5e+02 Score=24.66 Aligned_cols=83 Identities=17% Similarity=0.205 Sum_probs=41.9
Q ss_pred cEEEEEEecCCCCCcceEecccCC-CCC--CceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccC
Q psy4773 9 FSFWRYYLKGPKQGKSEVFIDGLP-GLP--DNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKI 85 (160)
Q Consensus 9 ~Ri~Ry~l~G~k~G~~evfid~LP-G~P--DNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~ 85 (160)
.++.++.|++- +.++..+ .| ++- .+...+++++..+++-..+. ...-+.++..+++++..
T Consensus 102 ~~l~~vdL~T~---e~~~vy~-~p~~~~g~gt~v~n~d~t~~~g~e~~~~-----d~~~l~~~~~f~e~~~a-------- 164 (386)
T PF14583_consen 102 RSLRRVDLDTL---EERVVYE-VPDDWKGYGTWVANSDCTKLVGIEISRE-----DWKPLTKWKGFREFYEA-------- 164 (386)
T ss_dssp TEEEEEETTT-----EEEEEE---TTEEEEEEEEE-TTSSEEEEEEEEGG-----G-----SHHHHHHHHHC--------
T ss_pred CeEEEEECCcC---cEEEEEE-CCcccccccceeeCCCccEEEEEEEeeh-----hccCccccHHHHHHHhh--------
Confidence 47888888863 3444444 44 222 23446888999888866432 22223456667766543
Q ss_pred CccccceeccceeeeeeecCCeEEEEEEe-CCCeEEEEEEcCC
Q psy4773 86 PSEDVKHVVGHFESILFLQGTRYTLLVIS-SQGEIVDALHSVD 127 (160)
Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~vv~vd-~~G~il~~l~d~~ 127 (160)
.++..++.+| ..|++-..+++..
T Consensus 165 -------------------~p~~~i~~idl~tG~~~~v~~~~~ 188 (386)
T PF14583_consen 165 -------------------RPHCRIFTIDLKTGERKVVFEDTD 188 (386)
T ss_dssp ----------------------EEEEEEETTT--EEEEEEESS
T ss_pred -------------------CCCceEEEEECCCCceeEEEecCc
Confidence 2346778888 4566655555544
No 65
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=23.95 E-value=2.5e+02 Score=19.26 Aligned_cols=51 Identities=14% Similarity=-0.053 Sum_probs=33.4
Q ss_pred CCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE--CCEEEEecCCCCeEEEEeC
Q psy4773 105 GTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY--NGAYYFGSPISKHLARVPL 158 (160)
Q Consensus 105 ~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~--~g~LylGS~~~~~i~~~~l 158 (160)
...|.+++.|+..+.+..|-+. +.....|... +..|.+.--...-|-|+=|
T Consensus 34 ~~~GRll~ydp~t~~~~vl~~~---L~fpNGVals~d~~~vlv~Et~~~Ri~rywl 86 (89)
T PF03088_consen 34 RPTGRLLRYDPSTKETTVLLDG---LYFPNGVALSPDESFVLVAETGRYRILRYWL 86 (89)
T ss_dssp ---EEEEEEETTTTEEEEEEEE---ESSEEEEEE-TTSSEEEEEEGGGTEEEEEES
T ss_pred CCCcCEEEEECCCCeEEEehhC---CCccCeEEEcCCCCEEEEEeccCceEEEEEE
Confidence 3459999999988877777642 3456666665 3568887777766666544
No 66
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=23.65 E-value=1.1e+02 Score=20.89 Aligned_cols=30 Identities=10% Similarity=0.232 Sum_probs=23.2
Q ss_pred CcceEecccCCCCCCceeecCCCC-eEEEeec
Q psy4773 22 GKSEVFIDGLPGLPDNVKRDSKGN-FLVSLVC 52 (160)
Q Consensus 22 G~~evfid~LPG~PDNI~~~~~G~-fWval~~ 52 (160)
++..+.+++++ +|.+|..++++. ++||-..
T Consensus 44 ~~~~~va~g~~-~aNGI~~s~~~k~lyVa~~~ 74 (86)
T PF01731_consen 44 KEVKVVASGFS-FANGIAISPDKKYLYVASSL 74 (86)
T ss_pred CEeEEeeccCC-CCceEEEcCCCCEEEEEecc
Confidence 34778888887 799999998875 6776644
No 67
>PF06089 Asparaginase_II: L-asparaginase II; InterPro: IPR010349 This family consists of several bacterial L-asparaginase II proteins. L-asparaginase (3.5.1.1 from EC) catalyses the hydrolysis of L-asparagine to L-aspartate and ammonium. Rhizobium etli possesses two asparaginases: asparaginase I, which is thermostable and constitutive, and asparaginase II, which is thermolabile, induced by asparagine and repressed by the carbon source [].
Probab=23.27 E-value=86 Score=26.86 Aligned_cols=24 Identities=17% Similarity=0.264 Sum_probs=21.1
Q ss_pred CCeEEEEEEeCCCeEEEEEEcCCC
Q psy4773 105 GTRYTLLVISSQGEIVDALHSVDG 128 (160)
Q Consensus 105 ~~~~~vv~vd~~G~il~~l~d~~g 128 (160)
..++.++.+|.+|+++.++-|++.
T Consensus 13 ~H~G~~vVvd~~G~v~~~~Gd~~~ 36 (324)
T PF06089_consen 13 VHRGHAVVVDADGRVLASAGDPDR 36 (324)
T ss_pred eEEEEEEEECCCCCEEEEecCCCC
Confidence 356889999999999999999984
No 68
>cd07323 LAM LA motif RNA-binding domain. This domain is found at the N-terminus of La RNA-binding proteins as well as in other related proteins. Typically, the domain co-occurs with an RNA-recognition motif (RRM), and together these domains function to bind primary transcripts of RNA polymerase III in the La autoantigen (Lupus La protein, LARP3, or Sjoegren syndrome type B antigen, SS-B). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=22.28 E-value=41 Score=22.34 Aligned_cols=15 Identities=13% Similarity=0.534 Sum_probs=10.1
Q ss_pred hhcCCChHHHHHHHH
Q psy4773 63 HIIGPFPNIRKFVAR 77 (160)
Q Consensus 63 ~~l~~~P~lRk~i~~ 77 (160)
+.+.+++.+|++...
T Consensus 36 ~~i~~F~r~k~l~~~ 50 (75)
T cd07323 36 SLLASFNRVKKLTTD 50 (75)
T ss_pred HHHhCchHHHHHcCC
Confidence 456677888877643
No 69
>PF07463 NUMOD4: NUMOD4 motif; InterPro: IPR010902 NUMOD4 is a putative DNA-binding motif found in homing endonucleases and related proteins [].; GO: 0016788 hydrolase activity, acting on ester bonds; PDB: 1U3E_M.
Probab=22.26 E-value=77 Score=19.20 Aligned_cols=23 Identities=17% Similarity=0.380 Sum_probs=14.9
Q ss_pred eEecccCCCCCCceeecCCCCeEE
Q psy4773 25 EVFIDGLPGLPDNVKRDSKGNFLV 48 (160)
Q Consensus 25 evfid~LPG~PDNI~~~~~G~fWv 48 (160)
|++.+ .||||.--..+..|++.-
T Consensus 1 E~Wk~-I~g~~~~Y~VSn~GrVrs 23 (51)
T PF07463_consen 1 EIWKP-IPGYEGKYEVSNLGRVRS 23 (51)
T ss_dssp --EEE--TTSTTTEEEETTS-EEE
T ss_pred CcceE-cCCCCCcEEEcCCceEEE
Confidence 56666 889998788888887654
No 70
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=22.20 E-value=2.5e+02 Score=23.36 Aligned_cols=42 Identities=14% Similarity=0.162 Sum_probs=29.1
Q ss_pred CeEEEEEEe-CCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCC
Q psy4773 106 TRYTLLVIS-SQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPIS 150 (160)
Q Consensus 106 ~~~~vv~vd-~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~ 150 (160)
..+.+..+| .+|+++=..+.+.. ..+.....++.+|+|+...
T Consensus 73 ~~g~v~a~d~~tG~~~W~~~~~~~---~~~~p~v~~~~v~v~~~~g 115 (377)
T TIGR03300 73 ADGTVVALDAETGKRLWRVDLDER---LSGGVGADGGLVFVGTEKG 115 (377)
T ss_pred CCCeEEEEEccCCcEeeeecCCCC---cccceEEcCCEEEEEcCCC
Confidence 346788999 68999877765442 1234455689999998654
No 71
>PF05411 Peptidase_C32: Equine arteritis virus putative proteinase; InterPro: IPR008742 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases corresponds to MEROPS peptidase family C32 (clan CA). The type example is equine arteritis virus-type cysteine proteinase (porcine reproductive and respiratory syndrome virus), which is involved in viral polyprotein processing [].; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0004252 serine-type endopeptidase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides; PDB: 3MTV_A.
Probab=21.73 E-value=44 Score=24.74 Aligned_cols=42 Identities=21% Similarity=0.170 Sum_probs=25.2
Q ss_pred eEEEEEEeCCCeE-EEEEEcCCCCCcCceEEEE--ECCEEEEecC
Q psy4773 107 RYTLLVISSQGEI-VDALHSVDGSLKGSSDVEE--YNGAYYFGSP 148 (160)
Q Consensus 107 ~~~vv~vd~~G~i-l~~l~d~~g~~~~iS~v~e--~~g~LylGS~ 148 (160)
+|+-..+|++|++ ++.++.++.-+.+|+-+-| ..|-..|+++
T Consensus 58 ~GLraV~d~~G~~~v~af~~~~SwirHv~l~~ep~~pgfv~l~r~ 102 (127)
T PF05411_consen 58 NGLRAVVDPYGPIHVYAFSCPESWIRHVSLADEPVPPGFVRLGRI 102 (127)
T ss_dssp TTEEEEE-TT-SEEEEEESSTT-S-EEEEETTS---TTEEEEEEE
T ss_pred cCeeEEECCCCCEEEEEecCCccceeeeEecCCCCCCCcEEEEEE
Confidence 4677788999995 5677767766677776654 4466666653
No 72
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=20.96 E-value=3.2e+02 Score=22.67 Aligned_cols=44 Identities=14% Similarity=-0.000 Sum_probs=29.5
Q ss_pred CeEEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCC
Q psy4773 106 TRYTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISK 151 (160)
Q Consensus 106 ~~~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~ 151 (160)
..+.+..+|. +|+++..+.-..+ ...++-...+++||+++....
T Consensus 328 ~~G~l~~~d~~tG~~~~~~~~~~~--~~~~sp~~~~~~l~v~~~dG~ 372 (377)
T TIGR03300 328 FEGYLHWLSREDGSFVARLKTDGS--GIASPPVVVGDGLLVQTRDGD 372 (377)
T ss_pred CCCEEEEEECCCCCEEEEEEcCCC--ccccCCEEECCEEEEEeCCce
Confidence 3466777774 5888887764332 124455677889999998754
No 73
>PF01502 PRA-CH: Phosphoribosyl-AMP cyclohydrolase; InterPro: IPR002496 Phosphoribosyl-AMP cyclohydrolase 3.5.4.19 from EC catalyses the third step in the histidine biosynthetic pathway: 5-phosphoribosyl-AMP + H2O = 5-(5-phospho-D-ribosylaminoformimino)-1-(5-phospho-ribosyl) imidazole-4-carboxamide It requires Zn2+ ions for activity [].; GO: 0004635 phosphoribosyl-AMP cyclohydrolase activity, 0000105 histidine biosynthetic process; PDB: 1ZPS_B.
Probab=20.83 E-value=2.1e+02 Score=19.18 Aligned_cols=33 Identities=24% Similarity=0.481 Sum_probs=21.9
Q ss_pred EEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773 13 RYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP 53 (160)
Q Consensus 13 Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~ 53 (160)
+.|.||+..|....+++ |+.|=|+.-.+..+.+
T Consensus 28 ~lW~KGetSG~~q~v~~--------i~~DCD~D~ll~~V~~ 60 (75)
T PF01502_consen 28 RLWRKGETSGNTQKVVE--------IRLDCDGDALLFKVEQ 60 (75)
T ss_dssp EEEETTTTTS--EEEEE--------EEE-TTSSEEEEEEEE
T ss_pred cEeeEECCCCCEEEEEE--------EEecCCCCeEEEEEEe
Confidence 78999999998888776 6677666655544443
No 74
>PF04398 DUF538: Protein of unknown function, DUF538; InterPro: IPR007493 This family consists of several plant proteins of unknown function.; PDB: 1YDU_A.
Probab=20.71 E-value=86 Score=22.41 Aligned_cols=24 Identities=46% Similarity=0.726 Sum_probs=13.7
Q ss_pred cCCC--CCCcee---ecCC-CCeEEEeecC
Q psy4773 30 GLPG--LPDNVK---RDSK-GNFLVSLVCP 53 (160)
Q Consensus 30 ~LPG--~PDNI~---~~~~-G~fWval~~~ 53 (160)
|||. +|.+.. .+++ |.|||-+..+
T Consensus 10 glP~GLLP~~v~~y~l~~~tG~f~v~l~~~ 39 (110)
T PF04398_consen 10 GLPRGLLPLGVTEYGLNRDTGFFWVKLKSP 39 (110)
T ss_dssp S-TT-TTTSSS-EEEE-TTT-SEEEE-SS-
T ss_pred CCCCCcCCCCceEEEEecCCcEEEEEecCC
Confidence 6774 788774 4565 9999999775
No 75
>PHA02122 hypothetical protein
Probab=20.66 E-value=2.4e+02 Score=18.00 Aligned_cols=33 Identities=9% Similarity=0.053 Sum_probs=21.6
Q ss_pred eCCCeEEEEEEcCCCCCcCceEEEEECCEEEEec
Q psy4773 114 SSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGS 147 (160)
Q Consensus 114 d~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS 147 (160)
|-+|-|+.++.|....+ -+---..++|.|++|.
T Consensus 28 ~~~~iiihs~~~~gd~v-~vn~e~~~ng~l~i~q 60 (65)
T PHA02122 28 GCENIIIHSFKDDGDEV-IVNFELVVNGKLIINQ 60 (65)
T ss_pred CCCcEEEEeeccCCCEE-EEEEEEEECCEEEEee
Confidence 56899999998754322 1222234789999985
No 76
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=20.14 E-value=3.9e+02 Score=20.09 Aligned_cols=47 Identities=15% Similarity=0.231 Sum_probs=33.5
Q ss_pred EEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEeC
Q psy4773 108 YTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVPL 158 (160)
Q Consensus 108 ~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l 158 (160)
+.++.+|. +|+++-....+. .+... ....++.+|+++..+ .|-.++.
T Consensus 46 ~~l~~~d~~tG~~~W~~~~~~-~~~~~--~~~~~~~v~v~~~~~-~l~~~d~ 93 (238)
T PF13360_consen 46 GNLYALDAKTGKVLWRFDLPG-PISGA--PVVDGGRVYVGTSDG-SLYALDA 93 (238)
T ss_dssp SEEEEEETTTSEEEEEEECSS-CGGSG--EEEETTEEEEEETTS-EEEEEET
T ss_pred CEEEEEECCCCCEEEEeeccc-cccce--eeeccccccccccee-eeEeccc
Confidence 67889995 999999998743 22222 477889999998444 6666653
Done!