Query psy17622
Match_columns 194
No_of_seqs 113 out of 347
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 18:46:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17622.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17622hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1349|consensus 100.0 2E-62 4.3E-67 423.6 13.2 152 28-179 26-186 (309)
2 KOG1348|consensus 100.0 7.4E-60 1.6E-64 423.6 15.1 152 29-182 44-209 (477)
3 PF01650 Peptidase_C13: Peptid 100.0 5.2E-57 1.1E-61 391.3 14.9 146 31-178 1-153 (256)
4 COG5206 GPI8 Glycosylphosphati 100.0 3.5E-52 7.7E-57 363.1 12.4 153 27-179 25-186 (382)
5 PF01650 Peptidase_C13: Peptid 98.0 4.2E-06 9.1E-11 73.1 2.6 114 6-123 1-141 (256)
6 PF00656 Peptidase_C14: Caspas 98.0 7E-06 1.5E-10 68.1 3.7 84 48-170 21-113 (248)
7 KOG1546|consensus 95.9 0.031 6.8E-07 51.1 7.6 80 28-143 61-150 (362)
8 KOG1349|consensus 94.2 0.01 2.2E-07 52.8 -0.7 119 3-123 26-173 (309)
9 COG5206 GPI8 Glycosylphosphati 92.7 0.025 5.4E-07 50.8 -0.7 117 2-123 25-173 (382)
10 smart00115 CASc Caspase, inter 90.1 0.93 2E-05 38.9 6.3 56 109-164 49-106 (241)
11 cd00032 CASc Caspase, interleu 85.2 1.8 3.9E-05 37.0 5.1 57 109-165 51-108 (243)
12 PF14538 Raptor_N: Raptor N-te 85.1 1.7 3.6E-05 35.5 4.6 54 110-164 70-133 (154)
13 PF12770 CHAT: CHAT domain 74.7 6 0.00013 33.5 4.9 65 105-182 122-201 (287)
14 KOG1348|consensus 73.3 2.9 6.3E-05 39.3 2.7 22 5-26 45-66 (477)
15 PF00381 PTS-HPr: PTS HPr comp 67.8 6.4 0.00014 28.2 3.0 55 99-170 29-83 (84)
16 COG2949 SanA Uncharacterized m 67.5 10 0.00023 33.1 4.7 62 6-70 58-130 (235)
17 KOG1017|consensus 64.7 6.1 0.00013 34.5 2.7 23 50-72 204-226 (267)
18 KOG1654|consensus 60.5 8.1 0.00018 30.4 2.5 35 109-144 52-86 (116)
19 cd00367 PTS-HPr_like Histidine 54.2 17 0.00038 25.4 3.2 39 99-140 25-63 (77)
20 PRK10649 hypothetical protein; 51.6 11 0.00025 36.6 2.5 20 125-144 446-465 (577)
21 cd01612 APG12_C Ubiquitin-like 50.3 20 0.00044 26.5 3.1 30 109-139 23-52 (87)
22 PF10686 DUF2493: Protein of u 44.4 20 0.00043 25.4 2.2 23 33-55 5-27 (71)
23 COG2143 Thioredoxin-related pr 43.4 90 0.0019 26.3 6.2 54 112-175 102-155 (182)
24 PRK10834 vancomycin high tempe 43.3 25 0.00055 30.8 3.2 67 5-77 45-122 (239)
25 COG2194 Predicted membrane-ass 41.7 14 0.00031 36.0 1.5 20 124-143 437-456 (555)
26 TIGR01003 PTS_HPr_family Phosp 41.7 33 0.00071 24.6 3.1 40 98-140 28-67 (82)
27 PRK10897 phosphohistidinoprote 39.8 45 0.00098 24.6 3.6 40 99-141 30-70 (90)
28 COG4566 TtrR Response regulato 39.5 30 0.00066 29.7 3.0 63 102-170 51-122 (202)
29 PF02698 DUF218: DUF218 domain 38.9 37 0.0008 26.2 3.2 35 34-70 41-75 (155)
30 COG4899 Uncharacterized protei 37.5 26 0.00055 29.0 2.1 55 68-122 30-85 (166)
31 cd06259 YdcF-like YdcF-like. Y 37.2 34 0.00074 26.3 2.7 38 31-70 35-72 (150)
32 KOG2091|consensus 36.1 78 0.0017 29.5 5.2 67 95-165 298-369 (392)
33 PF03568 Peptidase_C50: Peptid 35.4 39 0.00084 31.1 3.2 26 109-143 294-319 (383)
34 PF11181 YflT: Heat induced st 34.8 52 0.0011 24.4 3.3 31 46-76 7-37 (103)
35 PRK09598 lipid A phosphoethano 34.7 20 0.00044 34.5 1.3 19 125-143 423-441 (522)
36 COG1925 FruB Phosphotransferas 34.6 59 0.0013 24.2 3.5 51 84-140 17-67 (88)
37 PRK13626 transcriptional regul 33.3 74 0.0016 30.3 4.9 39 133-175 178-216 (552)
38 cd01611 GABARAP Ubiquitin doma 33.2 52 0.0011 25.4 3.2 30 109-139 48-77 (112)
39 PRK13782 phosphocarrier protei 32.2 58 0.0013 23.3 3.1 50 84-139 17-66 (82)
40 cd06183 cyt_b5_reduct_like Cyt 32.0 32 0.0007 28.0 2.0 35 30-71 200-234 (234)
41 PF10116 Host_attach: Protein 31.6 62 0.0013 25.1 3.4 29 155-183 73-101 (138)
42 cd07563 Peptidase_S41_IRBP Int 31.0 1.4E+02 0.0031 25.2 5.8 60 110-178 37-100 (250)
43 PRK10494 hypothetical protein; 30.9 52 0.0011 28.7 3.1 38 31-70 121-158 (259)
44 PF02633 Creatininase: Creatin 30.6 37 0.0008 28.7 2.1 51 106-170 72-126 (237)
45 cd07561 Peptidase_S41_CPP_like 29.6 1.9E+02 0.0041 25.1 6.4 64 110-178 35-99 (256)
46 PTZ00380 microtubule-associate 28.4 52 0.0011 26.1 2.4 29 109-139 48-76 (121)
47 PRK10850 PTS system phosphohis 28.3 80 0.0017 23.0 3.3 39 99-140 29-67 (85)
48 PRK13780 phosphocarrier protei 27.1 74 0.0016 23.3 3.0 55 99-170 29-83 (88)
49 cd08511 PBP2_NikA_DppA_OppA_li 26.9 1.1E+02 0.0024 27.8 4.6 35 133-171 59-93 (467)
50 PF00496 SBP_bac_5: Bacterial 26.7 68 0.0015 27.8 3.1 26 147-172 28-53 (374)
51 PF02991 Atg8: Autophagy prote 25.9 59 0.0013 24.9 2.3 31 109-140 40-70 (104)
52 cd08506 PBP2_clavulanate_OppA2 25.5 63 0.0014 29.4 2.8 36 130-169 61-96 (466)
53 cd00659 Topo_IB_C DNA topoisom 25.2 64 0.0014 27.7 2.6 60 95-168 78-146 (218)
54 TIGR03417 chol_sulfatase choli 24.7 64 0.0014 30.4 2.8 14 130-143 279-292 (500)
55 KOG1552|consensus 24.6 69 0.0015 28.6 2.8 42 48-92 112-161 (258)
56 COG2604 Uncharacterized protei 24.3 78 0.0017 31.4 3.3 24 54-77 258-281 (594)
57 PF04911 ATP-synt_J: ATP synth 24.1 4.7 0.0001 27.8 -3.6 26 61-86 22-47 (54)
58 cd00995 PBP2_NikA_DppA_OppA_li 23.5 1.2E+02 0.0025 27.2 4.1 43 125-171 50-92 (466)
59 cd08507 PBP2_SgrR_like The C-t 22.5 1.4E+02 0.0031 27.2 4.5 67 97-173 34-100 (448)
60 cd08490 PBP2_NikA_DppA_OppA_li 22.2 72 0.0016 28.9 2.5 37 131-171 54-90 (470)
61 PF08259 Periviscerokin: Periv 21.8 44 0.00094 16.0 0.5 10 142-151 1-10 (11)
62 cd08494 PBP2_NikA_DppA_OppA_li 21.6 76 0.0017 28.5 2.5 25 147-171 69-93 (448)
63 PRK11598 putative metal depend 21.3 59 0.0013 31.6 1.9 14 131-144 445-458 (545)
64 PRK13759 arylsulfatase; Provis 20.7 79 0.0017 29.7 2.5 14 130-143 294-307 (485)
65 PRK04452 acetyl-CoA decarbonyl 20.7 47 0.001 30.3 1.0 45 19-70 163-208 (319)
66 cd04508 TUDOR Tudor domains ar 20.6 1.1E+02 0.0023 18.9 2.4 24 125-148 19-42 (48)
67 PF14681 UPRTase: Uracil phosp 20.5 77 0.0017 26.5 2.2 25 49-73 135-159 (207)
68 cd08496 PBP2_NikA_DppA_OppA_li 20.4 82 0.0018 28.6 2.5 64 97-171 29-92 (454)
69 PF00691 OmpA: OmpA family; I 20.0 1.9E+02 0.0041 20.3 3.9 32 44-75 48-80 (97)
No 1
>KOG1349|consensus
Probab=100.00 E-value=2e-62 Score=423.63 Aligned_cols=152 Identities=77% Similarity=1.306 Sum_probs=143.8
Q ss_pred CCCcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceeec
Q psy17622 28 HSNNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVDY 107 (194)
Q Consensus 28 ~~~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~IDY 107 (194)
|++|||||||+||+||||||.||++.+|+.+|+.||||+|||+|++||+|||+|||+||++|++.+.+.|+||+++++||
T Consensus 26 htnNwAVLv~tSRfwfNYRH~aNvl~~YrsvKrlGipDsqIilmladd~acn~RN~~pg~Vy~n~~~~~nlygd~vevdy 105 (309)
T KOG1349|consen 26 HTNNWAVLVCTSRFWFNYRHVANVLSVYRSVKRLGIPDSQIILMLADDMACNSRNPRPGTVYNNENHALNLYGDDVEVDY 105 (309)
T ss_pred ccCceEEEEecchhhhhHHHHHHHHHHHHHHHHcCCCcccEEEEeccccccccCCCCCcceeccccccccccCCcceeec
Confidence 45666666666666699999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCCHHHHHHHHcc---------CccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcceeEE
Q psy17622 108 RGYEVTVENFIRLLTA---------TSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHEVRA 178 (194)
Q Consensus 108 ~g~~Vt~~~fl~vL~G---------k~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~lv~ 178 (194)
+|.+||+|+|+++||| |+|.+++++|||||+|||||+|||||||.|+|+.+||++++++|+|++||+|++|
T Consensus 106 rgyevtvEnflr~LTgR~~~~tprSKrlltDe~SNIlIYmtGHGgd~FlKFqd~eelts~dLadai~qm~e~~Ryneil~ 185 (309)
T KOG1349|consen 106 RGYEVTVENFLRVLTGRHPNNTPRSKRLLTDEGSNILIYLTGHGGDGFLKFQDAEELTSDDLADAIQQMWEKKRYNEILF 185 (309)
T ss_pred ccchhHHHHHHHHHcCCCCCCCchhhhhcccCCCcEEEEEccCCCccceecccHHHhhhHHHHHHHHHHHHhhhhceEEE
Confidence 9999999999999999 7899999999999999999999999999999999999999999999999999998
Q ss_pred E
Q psy17622 179 C 179 (194)
Q Consensus 179 ~ 179 (194)
.
T Consensus 186 m 186 (309)
T KOG1349|consen 186 M 186 (309)
T ss_pred E
Confidence 4
No 2
>KOG1348|consensus
Probab=100.00 E-value=7.4e-60 Score=423.63 Aligned_cols=152 Identities=36% Similarity=0.667 Sum_probs=147.3
Q ss_pred CCcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceeecc
Q psy17622 29 SNNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVDYR 108 (194)
Q Consensus 29 ~~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~IDY~ 108 (194)
+..||||||||+||||||||||+||+||+||++|+|+||||+|||||||.+|.||+||+|+|.|+ +.|||+ ||++||+
T Consensus 44 gt~waVLVAGSngyyNYRHQADvcHAYqiLrkgGikeEnIvv~MYDDIA~~~~NPrpG~iiN~P~-G~DvY~-GvpkDYt 121 (477)
T KOG1348|consen 44 GTRWAVLVAGSNGYYNYRHQADVCHAYQILRKGGIKEENIVVMMYDDIANNEENPRPGVIINRPN-GKDVYQ-GVPKDYT 121 (477)
T ss_pred ceeEEEEEecCCcccchhhhhhHHHHHHHHHhcCCCchhEEEEEehhhhcCCCCCCCceeecCCC-chhhhc-CCCCccc
Confidence 48999999999999999999999999999999999999999999999999999999999999995 899995 8999999
Q ss_pred CCCCCHHHHHHHHcc----------CccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcceeEE
Q psy17622 109 GYEVTVENFIRLLTA----------TSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHEVRA 178 (194)
Q Consensus 109 g~~Vt~~~fl~vL~G----------k~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~lv~ 178 (194)
|++|||+||++||.| |+++|+|+|+|||||+||||+|.|.||+++.|+++||+++|++||+.++|++|||
T Consensus 122 g~~Vt~~Nf~aVllGd~savkGGsGKV~~SgpnDhiFiYytDHG~pGvl~mP~~~~l~akdlnevL~kmhk~k~Y~~mvf 201 (477)
T KOG1348|consen 122 GEDVTPQNFLAVLLGDASAVKGGSGKVLKSGPNDHIFIYYTDHGGPGVLGMPTSPDLYAKDLNEVLKKMHKSKTYKKMVF 201 (477)
T ss_pred CCcCCHHHHHHHHhcccccccCCCceeeccCCCceEEEEEecCCCCceEecCCCcchhHHHHHHHHHHHHhccchheEEE
Confidence 999999999999999 8899999999999999999999999999999999999999999999999999997
Q ss_pred ----Eecc
Q psy17622 179 ----CNRY 182 (194)
Q Consensus 179 ----~~~~ 182 (194)
|+|-
T Consensus 202 YlEACESG 209 (477)
T KOG1348|consen 202 YLEACESG 209 (477)
T ss_pred EeeeccCc
Confidence 7763
No 3
>PF01650 Peptidase_C13: Peptidase C13 family; InterPro: IPR001096 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C13 (legumain family, clan CD). A type example is legumain from Canavalia ensiformis (Jack bean, Horse bean). The blood fluke parasite Schistosoma mansoni has two cysteine proteases in its digestive tract, one a cathepsin B-like protease, the other termed hemoglobinase [, ]. The latter has been hard to purify, free of cathepsin B, and expressed forms in Escherichia coli prove to be inactive, suggesting that hemoglobinase may act in association with cathepsin B [, ]. Plant vacuolar processing enzyme and legumain from legumes [] have been shown to have sequence and functional similarity to hemoglobinase. The catalytic residues of the family are currently unknown, but sequence alignments reveal one totally conserved cysteine and two totally conserved histidines.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis
Probab=100.00 E-value=5.2e-57 Score=391.35 Aligned_cols=146 Identities=54% Similarity=0.974 Sum_probs=141.9
Q ss_pred cEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceeeccCC
Q psy17622 31 NWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVDYRGY 110 (194)
Q Consensus 31 ~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~IDY~g~ 110 (194)
+||||||||++|+|||||||+|++||+|+++|+|+||||||++||+||+|+||+||+||++++ +.|+|. +++|||+|.
T Consensus 1 ~wAvlvagS~~~~NYRh~ad~~~~Y~~l~~~G~~~~~Iil~~~dd~a~~~~Np~~g~i~~~~~-~~n~y~-~~~iDY~g~ 78 (256)
T PF01650_consen 1 NWAVLVAGSNGWFNYRHQADVCHAYQLLKRNGIPDENIILMMYDDIACNPRNPFPGKIFNDPD-GTNVYK-GVEIDYRGE 78 (256)
T ss_pred CEEEEEeccCCceeeeEehHHHHHHHHHHHcCCCCceEEEEecCCccchhhCCCCceEEeCCC-cccccC-Ccccccccc
Confidence 699999999999999999999999999999999999999999999999999999999999996 889996 599999999
Q ss_pred CCCHHHHHHHHcc-------CccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcceeEE
Q psy17622 111 EVTVENFIRLLTA-------TSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHEVRA 178 (194)
Q Consensus 111 ~Vt~~~fl~vL~G-------k~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~lv~ 178 (194)
+||+++|++||+| |+|.|+++|+|||||+||||+|+|+||+.+.|+++||+++|++|++++||+|||+
T Consensus 79 ~v~~~~fl~vL~G~~~~~~~kvl~s~~~D~vfiy~~~HG~~~~l~~~~~~~l~~~~L~~~L~~m~~~~~y~~lv~ 153 (256)
T PF01650_consen 79 DVTPENFLNVLTGDKSVPSGKVLNSTENDNVFIYFTGHGGPGFLKFPDGEELTADDLADALDKMHEKKRYKKLVF 153 (256)
T ss_pred ccCHHHHHHHhcCCCCCCccccccCCCCCeEEEEEeccCCCCcccCCCcccccHHHHHHHHHHHHhhCCcceEEE
Confidence 9999999999998 5789999999999999999999999998899999999999999999999999997
No 4
>COG5206 GPI8 Glycosylphosphatidylinositol transamidase (GPIT), subunit GPI8 [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=3.5e-52 Score=363.05 Aligned_cols=153 Identities=67% Similarity=1.088 Sum_probs=147.1
Q ss_pred cCCCcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceee
Q psy17622 27 KHSNNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVD 106 (194)
Q Consensus 27 ~~~~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~ID 106 (194)
.+++|||||+++||+||||||.|||+.+|..+|+.|+||++||+|.+||.|||.||-+||.+|++.+...++|+++++||
T Consensus 25 t~tnNwAvLlstSRfwfNYRHmANVl~~Yr~vkrlGipDsQIilm~~dd~acnsRnlfpgsvf~N~Dra~dlyge~~eid 104 (382)
T COG5206 25 TNTNNWAVLLSTSRFWFNYRHMANVLVFYRVVKRLGIPDSQIILMSYDDQACNSRNLFPGSVFNNSDRAGDLYGEDSEID 104 (382)
T ss_pred ccCCceEEEEecccceeehhhhhhHHHHHHHHHHcCCCcceEEEEechhhhhhhcccCCcccccCcccccceeCcccccc
Confidence 56788888888888889999999999999999999999999999999999999999999999999888899999999999
Q ss_pred ccCCCCCHHHHHHHHcc---------CccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcceeE
Q psy17622 107 YRGYEVTVENFIRLLTA---------TSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHEVR 177 (194)
Q Consensus 107 Y~g~~Vt~~~fl~vL~G---------k~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~lv 177 (194)
|+|.+||+|+|.+.|+. |++.+++++||||||+||||++||||||-++++++||++++++|+++|||+|++
T Consensus 105 Y~gyevTve~firLLt~r~~en~p~sKrlltdE~SNIfIYmtGHGgd~FlKFqdaeemtseDladai~ql~~~kRyNeIl 184 (382)
T COG5206 105 YSGYEVTVEVFIRLLTARSGENHPKSKRLLTDESSNIFIYMTGHGGDAFLKFQDAEEMTSEDLADAISQLAAKKRYNEIL 184 (382)
T ss_pred cccccchHHHHHHHHHhhccCCChhhhhhcccccCcEEEEEccCCCccceecccHHHhhhHHHHHHHHHHHHhhhhceEE
Confidence 99999999999999997 789999999999999999999999999999999999999999999999999999
Q ss_pred EE
Q psy17622 178 AC 179 (194)
Q Consensus 178 ~~ 179 (194)
|+
T Consensus 185 fm 186 (382)
T COG5206 185 FM 186 (382)
T ss_pred EE
Confidence 84
No 5
>PF01650 Peptidase_C13: Peptidase C13 family; InterPro: IPR001096 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C13 (legumain family, clan CD). A type example is legumain from Canavalia ensiformis (Jack bean, Horse bean). The blood fluke parasite Schistosoma mansoni has two cysteine proteases in its digestive tract, one a cathepsin B-like protease, the other termed hemoglobinase [, ]. The latter has been hard to purify, free of cathepsin B, and expressed forms in Escherichia coli prove to be inactive, suggesting that hemoglobinase may act in association with cathepsin B [, ]. Plant vacuolar processing enzyme and legumain from legumes [] have been shown to have sequence and functional similarity to hemoglobinase. The catalytic residues of the family are currently unknown, but sequence alignments reveal one totally conserved cysteine and two totally conserved histidines.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis
Probab=97.96 E-value=4.2e-06 Score=73.12 Aligned_cols=114 Identities=25% Similarity=0.386 Sum_probs=72.6
Q ss_pred ceEEEEecCcccchhhhhccc----------cCCCcEEEEEeCCC-------CC----ccchhhhhhhhH-HHHHHhcCC
Q psy17622 6 NWAVLVDTSRFWFNYRHVANA----------KHSNNWAVLVDTSR-------FW----FNYRHVANVLSI-YRSVKRLGI 63 (194)
Q Consensus 6 nw~vlv~~s~~w~n~r~~~~~----------~~~~~wavlvagS~-------~w----~NYRHqadv~~~-Y~~L~~~Gi 63 (194)
||||||.+|++|+||||-+++ +..+.-.|++..-. -+ +|-....|+... --..+...+
T Consensus 1 ~wAvlvagS~~~~NYRh~ad~~~~Y~~l~~~G~~~~~Iil~~~dd~a~~~~Np~~g~i~~~~~~~n~y~~~~iDY~g~~v 80 (256)
T PF01650_consen 1 NWAVLVAGSNGWFNYRHQADVCHAYQLLKRNGIPDENIILMMYDDIACNPRNPFPGKIFNDPDGTNVYKGVEIDYRGEDV 80 (256)
T ss_pred CEEEEEeccCCceeeeEehHHHHHHHHHHHcCCCCceEEEEecCCccchhhCCCCceEEeCCCcccccCCcccccccccc
Confidence 899999999999999999995 34444444433221 11 111112222221 001246678
Q ss_pred CCCcEEEEecCCCCCCCCCCCCCeeecCCCcccccc----CCCceeecc-CCCCCHHHHHHHHcc
Q psy17622 64 PDSHIILMIADDMACNPRNPRPATVFNNANQHIDVY----GEDVEVDYR-GYEVTVENFIRLLTA 123 (194)
Q Consensus 64 ~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY----~~~v~IDY~-g~~Vt~~~fl~vL~G 123 (194)
.+++++-+|..+.. - |.+.++-.+++..+.+| |+.-.+.+. ++.++++.|.+.|.-
T Consensus 81 ~~~~fl~vL~G~~~-~---~~~kvl~s~~~D~vfiy~~~HG~~~~l~~~~~~~l~~~~L~~~L~~ 141 (256)
T PF01650_consen 81 TPENFLNVLTGDKS-V---PSGKVLNSTENDNVFIYFTGHGGPGFLKFPDGEELTADDLADALDK 141 (256)
T ss_pred CHHHHHHHhcCCCC-C---CccccccCCCCCeEEEEEeccCCCCcccCCCcccccHHHHHHHHHH
Confidence 99999888888875 2 55555555555555566 333357776 899999999999876
No 6
>PF00656 Peptidase_C14: Caspase domain; InterPro: IPR011600 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of sequences represent the p20 (20kDa) and p10 (10kDa) subunits of caspases, which together form the catalytic domain of the caspase and are derived from the p45 (45 kDa) precursor (IPR002398 from INTERPRO) []. Caspases (Cysteine-dependent ASPartyl-specific proteASE) are cysteine peptidases that belong to the MEROPS peptidase family C14 (caspase family, clan CD) based on the architecture of their catalytic dyad or triad []. Caspases are tightly regulated proteins that require zymogen activation to become active, and once active can be regulated by caspase inhibitors. Activated caspases act as cysteine proteases, using the sulphydryl group of a cysteine side chain for catalysing peptide bond cleavage at aspartyl residues in their substrates. The catalytic cysteine and histidine residues are on the p20 subunit after cleavage of the p45 precursor. Caspases are mainly involved in mediating cell death (apoptosis) [, , ]. They have two main roles within the apoptosis cascade: as initiators that trigger the cell death process, and as effectors of the process itself. Caspase-mediated apoptosis follows two main pathways, one extrinsic and the other intrinsic or mitochondrial-mediated. The extrinsic pathway involves the stimulation of various TNF (tumour necrosis factor) cell surface receptors on cells targeted to die by various TNF cytokines that are produced by cells such as cytotoxic T cells. The activated receptor transmits the signal to the cytoplasm by recruiting FADD, which forms a death-inducing signalling complex (DISC) with caspase-8. The subsequent activation of caspase-8 initiates the apoptosis cascade involving caspases 3, 4, 6, 7, 9 and 10. The intrinsic pathway arises from signals that originate within the cell as a consequence of cellular stress or DNA damage. The stimulation or inhibition of different Bcl-2 family receptors results in the leakage of cytochrome c from the mitochondria, and the formation of an apoptosome composed of cytochrome c, Apaf1 and caspase-9. The subsequent activation of caspase-9 initiates the apoptosis cascade involving caspases 3 and 7, among others. At the end of the cascade, caspases act on a variety of signal transduction proteins, cytoskeletal and nuclear proteins, chromatin-modifying proteins, DNA repair proteins and endonucleases that destroy the cell by disintegrating its contents, including its DNA. The different caspases have different domain architectures depending upon where they fit into the apoptosis cascades, however they all carry the catalytic p10 and p20 subunits. Caspases can have roles other than in apoptosis, such as caspase-1 (interleukin-1 beta convertase) (3.4.22.36 from EC), which is involved in the inflammatory process. The activation of apoptosis can sometimes lead to caspase-1 activation, providing a link between apoptosis and inflammation, such as during the targeting of infected cells. Caspases may also be involved in cell differentiation [].; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 1M72_C 2NN3_C 3V4L_A 3IBF_B 2QLF_D 2QLB_C 3IBC_B 2QL9_A 3R5K_B 3H1P_A ....
Probab=97.95 E-value=7e-06 Score=68.09 Aligned_cols=84 Identities=23% Similarity=0.354 Sum_probs=58.0
Q ss_pred hhhhhhHHHHHHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceeeccCCCCCHHHHHHHHccCccc
Q psy17622 48 VANVLSIYRSVKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVDYRGYEVTVENFIRLLTATSVL 127 (194)
Q Consensus 48 qadv~~~Y~~L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~ 127 (194)
..|+-.+.+.|++.|++.++| ++ ++ +|.+++++.|+--.-.
T Consensus 21 ~~D~~~~~~~L~~~gf~~~~~-l~--~~------------------------------------~t~~~i~~~l~~l~~~ 61 (248)
T PF00656_consen 21 VNDAEAMAEALEKLGFDVENI-LI--DN------------------------------------ATRANILKALRELLQR 61 (248)
T ss_dssp HHHHHHHHHHHHHTTEEEEEE-EE--ES------------------------------------SSHHHHHHHHHHHHTS
T ss_pred HHHHHHHHHHHHHcCCceeec-cc--cc------------------------------------hHHHHHHHHHhhhhcc
Confidence 468899999999999999998 32 22 7888888888652222
Q ss_pred cCCCccEEEEEeCCCCC------ceeecCCCCccCHHH---HHHHHHHHHHc
Q psy17622 128 TDEGSNILIYLTGHGGD------GFLKFQDSEEVTSQE---LGDALEQMWQK 170 (194)
Q Consensus 128 s~~~dnVfvY~tgHGg~------g~i~f~d~~~l~~~d---L~~~l~~M~~~ 170 (194)
..++|-++|||+|||.. +.+.-.|+..+..+. +.+.|..+..+
T Consensus 62 ~~~~D~~~~yfsGHG~~~~~~~~~~~~~~d~~~~~~d~~~~~~~~l~~~~~~ 113 (248)
T PF00656_consen 62 AQPGDSVVFYFSGHGIQVDGEGGDEDSGYDGYLLPLDANLILDDELRDLLCK 113 (248)
T ss_dssp GGTCSEEEEEEESEEETETTCCSTEEEETSSEEEEHHHHEEHHHHTSTTTTG
T ss_pred CCCCCeeEEEEeccccccCCccCcccccccceeeecchhhhHHHHHhhhhhh
Confidence 34889999999999954 222222432344444 67777777665
No 7
>KOG1546|consensus
Probab=95.93 E-value=0.031 Score=51.08 Aligned_cols=80 Identities=25% Similarity=0.433 Sum_probs=56.8
Q ss_pred CCCcEEEEEeCCCCCccchhh--------hhhhhHHHHH-HhcCCCCCcEEEEecCCCCCCC-CCCCCCeeecCCCcccc
Q psy17622 28 HSNNWAVLVDTSRFWFNYRHV--------ANVLSIYRSV-KRLGIPDSHIILMIADDMACNP-RNPRPATVFNNANQHID 97 (194)
Q Consensus 28 ~~~~wavlvagS~~w~NYRHq--------adv~~~Y~~L-~~~Gi~denIIlm~~DD~a~~p-~Np~pG~i~~~~~~~~n 97 (194)
+..+-|||+. -||.-+ .||-+|.+.| .++|++.|+|++| .|+-+ +| +-
T Consensus 61 ~gkrrAvLiG-----INY~gTk~ELrGCINDv~~M~~~Lv~rfGFs~ddI~~L-tDt~~-s~~~~--------------- 118 (362)
T KOG1546|consen 61 AGKRRAVLIG-----INYPGTKNELRGCINDVHRMRKLLVERFGFSEDDILML-TDTDE-SPVRI--------------- 118 (362)
T ss_pred cccceEEEEe-----ecCCCcHHHHhhhHHHHHHHHHHHHHhhCCChhheEEE-ecCCC-ccccc---------------
Confidence 3456788886 455554 3788899887 6999999998654 44432 12 11
Q ss_pred ccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCC
Q psy17622 98 VYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGG 143 (194)
Q Consensus 98 lY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg 143 (194)
-|.+|+++.|+-.+....++|-+|+=|||||+
T Consensus 119 --------------PT~~Nir~Al~wLV~~aq~gD~LvfHYSGHGt 150 (362)
T KOG1546|consen 119 --------------PTGKNIRRALRWLVESAQPGDSLVFHYSGHGT 150 (362)
T ss_pred --------------CcHHHHHHHHHHHHhcCCCCCEEEEEecCCCC
Confidence 24467778886555566788999999999998
No 8
>KOG1349|consensus
Probab=94.18 E-value=0.01 Score=52.76 Aligned_cols=119 Identities=29% Similarity=0.529 Sum_probs=94.1
Q ss_pred CccceEEEEecCcccchhhhhccc----------cCCCcEEEEEeC------CCC------Cccchhhhhhhh--HHHHH
Q psy17622 3 HSNNWAVLVDTSRFWFNYRHVANA----------KHSNNWAVLVDT------SRF------WFNYRHVANVLS--IYRSV 58 (194)
Q Consensus 3 ~~~nw~vlv~~s~~w~n~r~~~~~----------~~~~~wavlvag------S~~------w~NYRHqadv~~--~Y~~L 58 (194)
||||||||||||||||||||++|. |+.+...||+.. ||. |-|-.|.-|+.. ++-..
T Consensus 26 htnNwAVLv~tSRfwfNYRH~aNvl~~YrsvKrlGipDsqIilmladd~acn~RN~~pg~Vy~n~~~~~nlygd~vevdy 105 (309)
T KOG1349|consen 26 HTNNWAVLVCTSRFWFNYRHVANVLSVYRSVKRLGIPDSQIILMLADDMACNSRNPRPGTVYNNENHALNLYGDDVEVDY 105 (309)
T ss_pred ccCceEEEEecchhhhhHHHHHHHHHHHHHHHHcCCCcccEEEEeccccccccCCCCCcceeccccccccccCCcceeec
Confidence 999999999999999999999998 678888888744 443 223334444333 33335
Q ss_pred HhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCcccccc----CCCceeeccC-CCCCHHHHHHHHcc
Q psy17622 59 KRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVY----GEDVEVDYRG-YEVTVENFIRLLTA 123 (194)
Q Consensus 59 ~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY----~~~v~IDY~g-~~Vt~~~fl~vL~G 123 (194)
+.+.+..|++|..+++++. +..|+.+++.+|+++++.+| |++..+.+.. +++|.+.+...++-
T Consensus 106 rgyevtvEnflr~LTgR~~--~~tprSKrlltDe~SNIlIYmtGHGgd~FlKFqd~eelts~dLadai~q 173 (309)
T KOG1349|consen 106 RGYEVTVENFLRVLTGRHP--NNTPRSKRLLTDEGSNILIYLTGHGGDGFLKFQDAEELTSDDLADAIQQ 173 (309)
T ss_pred ccchhHHHHHHHHHcCCCC--CCCchhhhhcccCCCcEEEEEccCCCccceecccHHHhhhHHHHHHHHH
Confidence 7888999999999999985 55799999999998888888 5667788977 88999999888776
No 9
>COG5206 GPI8 Glycosylphosphatidylinositol transamidase (GPIT), subunit GPI8 [Posttranslational modification, protein turnover, chaperones]
Probab=92.73 E-value=0.025 Score=50.84 Aligned_cols=117 Identities=27% Similarity=0.529 Sum_probs=91.7
Q ss_pred CCccceEEEEecCcccchhhhhccc----------cCCCcEEEEEeCCCCCccch------------hhhhhhhHHHH--
Q psy17622 2 KHSNNWAVLVDTSRFWFNYRHVANA----------KHSNNWAVLVDTSRFWFNYR------------HVANVLSIYRS-- 57 (194)
Q Consensus 2 ~~~~nw~vlv~~s~~w~n~r~~~~~----------~~~~~wavlvagS~~w~NYR------------Hqadv~~~Y~~-- 57 (194)
.||||||||++||||||||||+||. +..+...+|++.-+.-.|-| |..| .|..
T Consensus 25 t~tnNwAvLlstSRfwfNYRHmANVl~~Yr~vkrlGipDsQIilm~~dd~acnsRnlfpgsvf~N~Dra~d---lyge~~ 101 (382)
T COG5206 25 TNTNNWAVLLSTSRFWFNYRHMANVLVFYRVVKRLGIPDSQIILMSYDDQACNSRNLFPGSVFNNSDRAGD---LYGEDS 101 (382)
T ss_pred ccCCceEEEEecccceeehhhhhhHHHHHHHHHHcCCCcceEEEEechhhhhhhcccCCcccccCcccccc---eeCccc
Confidence 6999999999999999999999998 67888889886555544444 3333 3332
Q ss_pred ---HHhcCCCCCcEEEEecCCCCCCCCCCCCCeeecCCCcccccc----CCCceeeccC-CCCCHHHHHHHHcc
Q psy17622 58 ---VKRLGIPDSHIILMIADDMACNPRNPRPATVFNNANQHIDVY----GEDVEVDYRG-YEVTVENFIRLLTA 123 (194)
Q Consensus 58 ---L~~~Gi~denIIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY----~~~v~IDY~g-~~Vt~~~fl~vL~G 123 (194)
.+.+.+..|..|.++.+....| .|..+++.++++++..+| |++..+.+.. ++++.|.+...++.
T Consensus 102 eidY~gyevTve~firLLt~r~~en--~p~sKrlltdE~SNIfIYmtGHGgd~FlKFqdaeemtseDladai~q 173 (382)
T COG5206 102 EIDYSGYEVTVEVFIRLLTARSGEN--HPKSKRLLTDESSNIFIYMTGHGGDAFLKFQDAEEMTSEDLADAISQ 173 (382)
T ss_pred ccccccccchHHHHHHHHHhhccCC--ChhhhhhcccccCcEEEEEccCCCccceecccHHHhhhHHHHHHHHH
Confidence 2566778899999998887433 488999999998888888 5567788877 88999999888776
No 10
>smart00115 CASc Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Probab=90.06 E-value=0.93 Score=38.90 Aligned_cols=56 Identities=16% Similarity=0.214 Sum_probs=41.1
Q ss_pred CCCCCHHHHHHHHcc--CccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHH
Q psy17622 109 GYEVTVENFIRLLTA--TSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDAL 164 (194)
Q Consensus 109 g~~Vt~~~fl~vL~G--k~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l 164 (194)
..++|.+.+.+.|.- ++.+-...|-+++|+.+||+.|+|.-.|++.+.-++|.+.|
T Consensus 49 ~~dlt~~em~~~l~~~~~~~~~~~~d~~v~~~~sHG~~~~l~~~D~~~v~l~~i~~~f 106 (241)
T smart00115 49 KNNLTAEEMLEELKEFAERPEHSDSDSFVCVLLSHGEEGGIYGTDHSPLPLDEIFSLF 106 (241)
T ss_pred ecCCCHHHHHHHHHHHHhccccCCCCEEEEEEcCCCCCCeEEEecCCEEEHHHHHHhc
Confidence 346677777777754 22233356778899999999999998887777777777666
No 11
>cd00032 CASc Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Probab=85.16 E-value=1.8 Score=37.00 Aligned_cols=57 Identities=16% Similarity=0.239 Sum_probs=42.0
Q ss_pred CCCCCHHHHHHHHccCc-cccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHH
Q psy17622 109 GYEVTVENFIRLLTATS-VLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALE 165 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~-l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~ 165 (194)
..++|.+.+.+.|.--. -+....|-+++|+.+||..+.|.-.|++.+.-++|.+.|.
T Consensus 51 ~~nlt~~~~~~~l~~f~~~~~~~~d~~v~~~~sHG~~~~l~~~D~~~v~l~~i~~~f~ 108 (243)
T cd00032 51 KNNLTAEEILEELKEFASPDHSDSDSFVCVILSHGEEGGIYGTDGDVVPIDEITSLFN 108 (243)
T ss_pred eCCCCHHHHHHHHHHHHhccCCCCCeeEEEECCCCCCCEEEEecCcEEEHHHHHHhhc
Confidence 56778888888886511 2345566788999999999999988866777777766654
No 12
>PF14538 Raptor_N: Raptor N-terminal CASPase like domain
Probab=85.07 E-value=1.7 Score=35.51 Aligned_cols=54 Identities=22% Similarity=0.315 Sum_probs=37.5
Q ss_pred CCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCc------eeecCCCC----ccCHHHHHHHH
Q psy17622 110 YEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDG------FLKFQDSE----EVTSQELGDAL 164 (194)
Q Consensus 110 ~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g------~i~f~d~~----~l~~~dL~~~l 164 (194)
-+.|+|++.+.+..-|- ..+++.|++-|.|||-|. +-.|.++- .++-.||.+.+
T Consensus 70 ~dpt~e~~~~~~~~~R~-~a~~~RvLFHYnGhGvP~Pt~~GeIw~f~~~~tqyip~si~dL~~~l 133 (154)
T PF14538_consen 70 LDPTVEDLKRLCQSLRR-NAKDERVLFHYNGHGVPRPTENGEIWVFNKNYTQYIPLSIYDLQSWL 133 (154)
T ss_pred cCCCHHHHHHHHHHHHh-hCCCceEEEEECCCCCCCCCCCCeEEEEcCCCCcceEEEHHHHHHhc
Confidence 68899999998875332 234489999999999983 44444432 36777776654
No 13
>PF12770 CHAT: CHAT domain
Probab=74.75 E-value=6 Score=33.47 Aligned_cols=65 Identities=18% Similarity=0.249 Sum_probs=45.8
Q ss_pred eeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCc-------eeecC-----CCCccCHHHHHHHHHHHHHcCC
Q psy17622 105 VDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDG-------FLKFQ-----DSEEVTSQELGDALEQMWQKRR 172 (194)
Q Consensus 105 IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g-------~i~f~-----d~~~l~~~dL~~~l~~M~~~~~ 172 (194)
.-..+.+.|.++|+..|. ...-=.|.|+|||... .|.+. +...+++.||.. + .-.+
T Consensus 122 ~~~~~~~at~~~l~~~l~-------~~~~~ilH~a~Hg~~~~~~~~~~~l~l~~~~~~~~~~l~~~~l~~-l---~l~~- 189 (287)
T PF12770_consen 122 RVLVGPEATKDALLEALE-------RRGPDILHFAGHGTFDPDPPDQSGLVLSDESGQEDGLLSAEELAQ-L---DLRG- 189 (287)
T ss_pred eEeeccCCCHHHHHhhhc-------cCCCCEEEEEcccccCCCCCCCCEEEEeccCCCCCcccCHHHHHh-h---cCCC-
Confidence 345678889999998882 1222378999999987 88887 456899999988 2 1112
Q ss_pred cceeEE---Eecc
Q psy17622 173 YHEVRA---CNRY 182 (194)
Q Consensus 173 Y~~lv~---~~~~ 182 (194)
-++|+ |.+.
T Consensus 190 -~~lVvLsaC~s~ 201 (287)
T PF12770_consen 190 -PRLVVLSACESA 201 (287)
T ss_pred -CCEEEecCcCCc
Confidence 67776 6665
No 14
>KOG1348|consensus
Probab=73.25 E-value=2.9 Score=39.30 Aligned_cols=22 Identities=50% Similarity=0.890 Sum_probs=19.8
Q ss_pred cceEEEEecCcccchhhhhccc
Q psy17622 5 NNWAVLVDTSRFWFNYRHVANA 26 (194)
Q Consensus 5 ~nw~vlv~~s~~w~n~r~~~~~ 26 (194)
.-|||||-.|.=|.||||-|..
T Consensus 45 t~waVLVAGSngyyNYRHQADv 66 (477)
T KOG1348|consen 45 TRWAVLVAGSNGYYNYRHQADV 66 (477)
T ss_pred eeEEEEEecCCcccchhhhhhH
Confidence 3599999999999999998877
No 15
>PF00381 PTS-HPr: PTS HPr component phosphorylation site; InterPro: IPR005698 The histidine-containing phosphocarrier protein (HPr) is a central component of the phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS), which transfers metabolic carbohydrates across the cell membrane in many bacterial species [, ]. PTS catalyses the phosphorylation of incoming sugar substrates concomitant with their translocation across the cell membrane. The general mechanism of the PTS is as follows: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred to Enzyme I (EI) of the PTS, which in turn transfers it to the phosphoryl carrier protein (HPr) [, ]. Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease complex (enzymes EII/EIII). HPr [, ] is a small cytoplasmic protein of 70 to 90 amino acid residues. In some bacteria, HPr is a domain in a larger protein that includes a EIII(Fru) (IIA) domain and in some cases also the EI domain. A conserved histidine in the N-terminal section of HPr serves as an acceptor for the phosphoryl group of EI. In the central part of HPr, there is a conserved serine which (in Gram-positive bacteria only) is phosphorylated by an ATP-dependent protein kinase; a process which probably play a regulatory role in sugar transport. The overall architecture of the HPr domain has been described as an open faced beta-sandwich in which a beta-sheet is packed against three alpha-helices. Regulatory phosphorylation at the conserved Ser residue does not appear to induce large structural changes to the HPr domain, in particular in the region of the active site [, ].; GO: 0005351 sugar:hydrogen symporter activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system; PDB: 1TXE_A 1QR5_A 1RZR_S 2NZU_L 2OEN_L 2NZV_L 1Y51_B 1Y4Y_A 1Y50_A 2HPR_A ....
Probab=67.82 E-value=6.4 Score=28.19 Aligned_cols=55 Identities=18% Similarity=0.377 Sum_probs=43.0
Q ss_pred cCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHc
Q psy17622 99 YGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQK 170 (194)
Q Consensus 99 Y~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~ 170 (194)
|.+++.|.+.+..|++++++.+|+ |...+.+.|-|...|- -+++..+.|+++.++
T Consensus 29 ~~~~i~i~~~~~~vdakSil~l~~---L~~~~G~~i~i~~~G~--------------de~~a~~~i~~~~~~ 83 (84)
T PF00381_consen 29 FDSDITIRKGGKTVDAKSILGLMS---LGAKKGDEIEIEAEGE--------------DEEEALEAIAEFLES 83 (84)
T ss_dssp SSSEEEEEETTEEEETTSHHHHHH---HTBSTTEEEEEEEEST--------------THHHHHHHHHHHHHH
T ss_pred CCCEEEEEeCceeEecCCHHHHhh---hhcCCCCEEEEEEECc--------------CHHHHHHHHHHHHhc
Confidence 567899999999999999999998 8888888898888764 244555666655443
No 16
>COG2949 SanA Uncharacterized membrane protein [Function unknown]
Probab=67.50 E-value=10 Score=33.08 Aligned_cols=62 Identities=21% Similarity=0.315 Sum_probs=43.1
Q ss_pred ceEEEEecCcccch--------hhhhccc---cCCCcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEE
Q psy17622 6 NWAVLVDTSRFWFN--------YRHVANA---KHSNNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIIL 70 (194)
Q Consensus 6 nw~vlv~~s~~w~n--------~r~~~~~---~~~~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIl 70 (194)
+.+|+.+||.|..+ +|--+-. ..++---+|+||.|+=-+|.- .-.|-+.|++.|+|.++|.+
T Consensus 58 ~vgvVLGtsky~~~g~~N~yy~~Ri~aA~~ly~~gKV~~LLlSGDN~~~sYnE---p~tM~kdL~~~GVp~~~i~l 130 (235)
T COG2949 58 QVGVVLGTSKYLAKGPPNRYYTYRIDAAIALYKAGKVNYLLLSGDNATVSYNE---PRTMRKDLIAAGVPAKNIFL 130 (235)
T ss_pred ceEEEEeccccccCCCccHhHHHHHHHHHHHHhcCCeeEEEEecCCCcccccc---hHHHHHHHHHcCCCHHHeee
Confidence 56888999988755 2221111 134444577888877666544 47799999999999999965
No 17
>KOG1017|consensus
Probab=64.74 E-value=6.1 Score=34.48 Aligned_cols=23 Identities=39% Similarity=0.590 Sum_probs=20.6
Q ss_pred hhhhHHHHHHhcCCCCCcEEEEe
Q psy17622 50 NVLSIYRSVKRLGIPDSHIILMI 72 (194)
Q Consensus 50 dv~~~Y~~L~~~Gi~denIIlm~ 72 (194)
-||-+-..|+.+|+||++|||..
T Consensus 204 TV~~Av~VL~EhgVp~s~IiL~s 226 (267)
T KOG1017|consen 204 TVCKAVEVLKEHGVPDSNIILVS 226 (267)
T ss_pred cHHHHHHHHHHcCCCcccEEEEE
Confidence 47889999999999999999864
No 18
>KOG1654|consensus
Probab=60.54 E-value=8.1 Score=30.42 Aligned_cols=35 Identities=11% Similarity=0.339 Sum_probs=31.1
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCC
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGD 144 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~ 144 (194)
..++|...|+.|++ ||++-.|++-+|++..+|--+
T Consensus 52 P~dltvgqfi~iIR-kRiqL~~~kA~flfVn~~~p~ 86 (116)
T KOG1654|consen 52 PDDLTVGQFIKIIR-KRIQLSPEKAFFLFVNNTSPP 86 (116)
T ss_pred cccccHHHHHHHHH-HHhccChhHeEEEEEcCcCCc
Confidence 46899999999999 999999999999999998643
No 19
>cd00367 PTS-HPr_like Histidine-containing phosphocarrier protein (HPr)-like proteins. HPr is a central component of the bacterial phosphoenolpyruvate sugar phosphotransferase system (PTS). The PTS catalyses the phosphorylation of sugar substrates during their translocation across the cell membrane. The phosphoryl group from phosphoenolpyruvate is transferred to HPr by enzyme I (EI). Phospho-HPr then transfers the phosphoryl group to one of several sugar-specific phosphoprotein intermediates. The conserved histidine in the N-terminus of HPr serves as an acceptor for the phosphoryl group of EI. In addition to the phosphotransferase proteins HPr and E1, this family also includes the closely related Carbon Catabolite Repressor (CCR) proteins which use the same phosphorylation mechanism and interact with transcriptional regulators to control expression of genes coding for utilization of less favored carbon sources.
Probab=54.18 E-value=17 Score=25.44 Aligned_cols=39 Identities=15% Similarity=0.405 Sum_probs=34.0
Q ss_pred cCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeC
Q psy17622 99 YGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTG 140 (194)
Q Consensus 99 Y~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tg 140 (194)
|.+++.|-+.+..++.+.++.+|+ |...+.+.|-|...|
T Consensus 25 ~~~~v~i~~~~~~vdakSil~i~~---L~~~~G~~i~i~~~G 63 (77)
T cd00367 25 FKSDITLRKGGRKANAKSILGLMS---LGAKQGDEITLSAEG 63 (77)
T ss_pred CCCEEEEEECCEEEcHHhHHHHHH---cCCCCCCEEEEEEEC
Confidence 667888888899999999999998 888888888888876
No 20
>PRK10649 hypothetical protein; Provisional
Probab=51.60 E-value=11 Score=36.56 Aligned_cols=20 Identities=20% Similarity=0.592 Sum_probs=15.9
Q ss_pred ccccCCCccEEEEEeCCCCC
Q psy17622 125 SVLTDEGSNILIYLTGHGGD 144 (194)
Q Consensus 125 ~l~s~~~dnVfvY~tgHGg~ 144 (194)
.|+...++.++||++|||..
T Consensus 446 ~Lk~~~~nt~iiy~SDHGe~ 465 (577)
T PRK10649 446 DFKATDPNGFLVYFSDHGEE 465 (577)
T ss_pred HHhcCCCCeEEEEECCCCcc
Confidence 45555678999999999975
No 21
>cd01612 APG12_C Ubiquitin-like domain of APG12. APG12_C The carboxy-terminal ubiquitin-like domain of APG12. Autophagy is a process in which cytoplasmic components are delivered to the lysosome/vacuole for degradation. Autophagy requires a ubiquitin-like protein conjugation system, in which APG12 is covalently bound to APG5.
Probab=50.28 E-value=20 Score=26.45 Aligned_cols=30 Identities=10% Similarity=0.318 Sum_probs=26.0
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEe
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLT 139 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~t 139 (194)
..+.|...|+.+|+ |+|+-.+++.||+|..
T Consensus 23 ~~~~tv~~~~~~lr-k~L~l~~~~slflyvn 52 (87)
T cd01612 23 SATQSFQAVIDFLR-KRLKLKASDSLFLYIN 52 (87)
T ss_pred CCCCCHHHHHHHHH-HHhCCCccCeEEEEEC
Confidence 35778999999998 7899889999999983
No 22
>PF10686 DUF2493: Protein of unknown function (DUF2493); InterPro: IPR019627 This entry is represented by Mycobacteriophage D29, Gp61. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family are mainly Proteobacteria. The function is not known.
Probab=44.40 E-value=20 Score=25.45 Aligned_cols=23 Identities=22% Similarity=0.299 Sum_probs=18.6
Q ss_pred EEEEeCCCCCccchhhhhhhhHH
Q psy17622 33 AVLVDTSRFWFNYRHVANVLSIY 55 (194)
Q Consensus 33 avlvagS~~w~NYRHqadv~~~Y 55 (194)
-|||+|||.|.++....++|..+
T Consensus 5 rVli~GgR~~~D~~~i~~~Ld~~ 27 (71)
T PF10686_consen 5 RVLITGGRDWTDHELIWAALDKV 27 (71)
T ss_pred EEEEEECCccccHHHHHHHHHHH
Confidence 58999999999988877776644
No 23
>COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=43.39 E-value=90 Score=26.35 Aligned_cols=54 Identities=11% Similarity=0.250 Sum_probs=38.9
Q ss_pred CCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcce
Q psy17622 112 VTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHE 175 (194)
Q Consensus 112 Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~ 175 (194)
++-+.+.+.+. .+|+|. +.|.||-|.+++..|+ ++-.+++...|+.-. ++.|++
T Consensus 102 ~s~~ELa~kf~---vrstPt----fvFfdk~Gk~Il~lPG--Y~ppe~Fl~vlkYVa-~g~ykd 155 (182)
T COG2143 102 MSTEELAQKFA---VRSTPT----FVFFDKTGKTILELPG--YMPPEQFLAVLKYVA-DGKYKD 155 (182)
T ss_pred ecHHHHHHHhc---cccCce----EEEEcCCCCEEEecCC--CCCHHHHHHHHHHHH-HHHHhh
Confidence 44455555554 788887 4566888899999997 899999999888764 334443
No 24
>PRK10834 vancomycin high temperature exclusion protein; Provisional
Probab=43.33 E-value=25 Score=30.78 Aligned_cols=67 Identities=16% Similarity=0.302 Sum_probs=40.7
Q ss_pred cceEEEEecCcccch--------hhhhcccc---CCCcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEEec
Q psy17622 5 NNWAVLVDTSRFWFN--------YRHVANAK---HSNNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILMIA 73 (194)
Q Consensus 5 ~nw~vlv~~s~~w~n--------~r~~~~~~---~~~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm~~ 73 (194)
...+|+.+|+.+... .|..+... ....-.+|+||.++=-.| ..+-.|.+.|.+.|||++.|++
T Consensus 45 ~d~ivVLGa~~~~~~g~ps~~l~~Rl~~A~~LYk~gk~~~ilvSGg~~~~~~---~Ea~~M~~yLi~~GVp~e~Ii~--- 118 (239)
T PRK10834 45 RQVGVVLGTAKYYRTGVINQYYRYRIQGAINAYNSGKVNYLLLSGDNALQSY---NEPMTMRKDLIAAGVDPSDIVL--- 118 (239)
T ss_pred CCEEEEcCCcccCCCCCcCHHHHHHHHHHHHHHHhCCCCEEEEeCCCCCCCC---CHHHHHHHHHHHcCCCHHHEEe---
Confidence 356777788765421 22222111 122234788887642222 4556799999999999999988
Q ss_pred CCCC
Q psy17622 74 DDMA 77 (194)
Q Consensus 74 DD~a 77 (194)
|+-.
T Consensus 119 e~~s 122 (239)
T PRK10834 119 DYAG 122 (239)
T ss_pred cCCC
Confidence 6643
No 25
>COG2194 Predicted membrane-associated, metal-dependent hydrolase [General function prediction only]
Probab=41.67 E-value=14 Score=35.98 Aligned_cols=20 Identities=25% Similarity=0.542 Sum_probs=15.8
Q ss_pred CccccCCCccEEEEEeCCCC
Q psy17622 124 TSVLTDEGSNILIYLTGHGG 143 (194)
Q Consensus 124 k~l~s~~~dnVfvY~tgHGg 143 (194)
+.|+..+++..+||+||||-
T Consensus 437 ~~Lk~~~~~~~liY~SDHGE 456 (555)
T COG2194 437 DQLKDKKDNTSLIYFSDHGE 456 (555)
T ss_pred HHHHhCCCCeEEEEEcCccH
Confidence 34565556899999999997
No 26
>TIGR01003 PTS_HPr_family Phosphotransferase System HPr (HPr) Family. The HPr family are bacterial proteins (or domains of proteins) which function in phosphoryl transfer system (PTS) systems. They include energy-coupling components which catalyze sugar uptake via a group translocation mechanism. The functions of most of these proteins are not known, but they presumably function in PTS-related regulatory capacities. All seed members are stand-alone HPr proteins, although the model also recognizes HPr domains of PTS fusion proteins. This family includes the related NPr protein.
Probab=41.66 E-value=33 Score=24.62 Aligned_cols=40 Identities=13% Similarity=0.390 Sum_probs=32.4
Q ss_pred ccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeC
Q psy17622 98 VYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTG 140 (194)
Q Consensus 98 lY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tg 140 (194)
=|.+++.+.+.+..++.+..+.+|+ |...+.|.|=|...|
T Consensus 28 ~f~s~I~i~~~~~~~dakSil~ll~---Lg~~~G~~i~i~~~G 67 (82)
T TIGR01003 28 GFDSEITLTKNGKEVNAKSIMGIMM---LGAGQGTEVTVSADG 67 (82)
T ss_pred hCCCEEEEEECCEEEehHhHHHHHh---cCCCCCCEEEEEEeC
Confidence 3677888888889999999999998 777777777776643
No 27
>PRK10897 phosphohistidinoprotein-hexose phosphotransferase component of N-regulated PTS system (Npr); Provisional
Probab=39.77 E-value=45 Score=24.63 Aligned_cols=40 Identities=25% Similarity=0.377 Sum_probs=33.8
Q ss_pred cCCCceeecc-CCCCCHHHHHHHHccCccccCCCccEEEEEeCC
Q psy17622 99 YGEDVEVDYR-GYEVTVENFIRLLTATSVLTDEGSNILIYLTGH 141 (194)
Q Consensus 99 Y~~~v~IDY~-g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgH 141 (194)
|.+.+.+-+. |..++.+.++.||+ |.....|.|-|-..|-
T Consensus 30 f~s~v~l~~~~g~~~~akSil~lm~---Lg~~~G~~i~v~~~G~ 70 (90)
T PRK10897 30 FDAEVLLRNDEGTEAEANSVIALLM---LDSAKGRQIEVEATGP 70 (90)
T ss_pred CCCEEEEEeCCCcEEchHhHHHHHH---hCCCCCCEEEEEEECc
Confidence 6677888885 88999999999998 8888888888887764
No 28
>COG4566 TtrR Response regulator [Signal transduction mechanisms]
Probab=39.53 E-value=30 Score=29.73 Aligned_cols=63 Identities=25% Similarity=0.306 Sum_probs=41.3
Q ss_pred CceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCC---------ceeecCCCCccCHHHHHHHHHHHHHc
Q psy17622 102 DVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGD---------GFLKFQDSEEVTSQELGDALEQMWQK 170 (194)
Q Consensus 102 ~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~---------g~i~f~d~~~l~~~dL~~~l~~M~~~ 170 (194)
.+..|-+-...+--+|.+-|..+ ..+-=-|++||||-- |-+-|=. .++..++|.++++...+.
T Consensus 51 clllDvrMPg~sGlelq~~L~~~-----~~~~PVIfiTGhgDIpmaV~AmK~GAvDFLe-KP~~~q~Lldav~~Al~~ 122 (202)
T COG4566 51 CLLLDVRMPGMSGLELQDRLAER-----GIRLPVIFLTGHGDIPMAVQAMKAGAVDFLE-KPFSEQDLLDAVERALAR 122 (202)
T ss_pred eEEEecCCCCCchHHHHHHHHhc-----CCCCCEEEEeCCCChHHHHHHHHcchhhHHh-CCCchHHHHHHHHHHHHH
Confidence 34556666666666666666431 122235899999973 5555554 378899999999988765
No 29
>PF02698 DUF218: DUF218 domain; InterPro: IPR003848 This domain of unknown function is found in several uncharacterised proteins.; PDB: 3CA8_A.
Probab=38.93 E-value=37 Score=26.24 Aligned_cols=35 Identities=20% Similarity=0.249 Sum_probs=19.0
Q ss_pred EEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEE
Q psy17622 34 VLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIIL 70 (194)
Q Consensus 34 vlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIl 70 (194)
||++|..+...+ .+.+-.+-+.+.+.|+|++.|++
T Consensus 41 il~SGg~~~~~~--~~ea~~~~~~l~~~gvp~~~I~~ 75 (155)
T PF02698_consen 41 ILFSGGYGHGDG--RSEAEAMRDYLIELGVPEERIIL 75 (155)
T ss_dssp EEEE--SSTTHT--S-HHHHHHHHHHHT---GGGEEE
T ss_pred EEECCCCCCCCC--CCHHHHHHHHHHhcccchheeEc
Confidence 677776655543 34444555667778999999877
No 30
>COG4899 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.50 E-value=26 Score=28.98 Aligned_cols=55 Identities=31% Similarity=0.480 Sum_probs=44.0
Q ss_pred EEEEecCCCCCCCCCCCCCeeecCCCccccccCCCceeeccC-CCCCHHHHHHHHc
Q psy17622 68 IILMIADDMACNPRNPRPATVFNNANQHIDVYGEDVEVDYRG-YEVTVENFIRLLT 122 (194)
Q Consensus 68 IIlm~~DD~a~~p~Np~pG~i~~~~~~~~nlY~~~v~IDY~g-~~Vt~~~fl~vL~ 122 (194)
+.-.-+||-.|--+||.|.+-|...+-.+-+=|+.-..||-+ -+|..+.|.+.|.
T Consensus 30 ~AQv~~DdD~Cqi~~p~~~r~FIr~~lpIPi~G~~~~l~yG~WveVe~~~f~~Yl~ 85 (166)
T COG4899 30 DAQVRYDDDMCQILNPKPKRHFIRSNLPIPIDGGKKKLDYGGWVEVEHSDFMTYLN 85 (166)
T ss_pred hhhheecccceeecCCCccceeeeecccceecCccceeccceEEEEeHHHHHHHHH
Confidence 334567888899999999999998765555666667889988 4899999999997
No 31
>cd06259 YdcF-like YdcF-like. YdcF-like is a large family of mainly bacterial proteins, with a few members found in fungi, plants, and archaea. Escherichia coli YdcF has been shown to bind S-adenosyl-L-methionine (AdoMet), but a biochemical function has not been idenitified. The family also includes Escherichia coli sanA and Salmonella typhimurium sfiX, which are involved in vancomycin resistance; sfiX may also be involved in murein synthesis.
Probab=37.17 E-value=34 Score=26.28 Aligned_cols=38 Identities=18% Similarity=0.167 Sum_probs=26.3
Q ss_pred cEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEE
Q psy17622 31 NWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIIL 70 (194)
Q Consensus 31 ~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIl 70 (194)
.--||++|..+-... .+.+-.+.+.+.+.|+|++.|++
T Consensus 35 ~~~ii~sGg~~~~~~--~~ea~~m~~~l~~~gv~~~~I~~ 72 (150)
T cd06259 35 APKLIVSGGQGPGEG--YSEAEAMARYLIELGVPAEAILL 72 (150)
T ss_pred CCEEEEcCCCCCCCC--CCHHHHHHHHHHHcCCCHHHeee
Confidence 335677777665522 34445567888999999999877
No 32
>KOG2091|consensus
Probab=36.13 E-value=78 Score=29.51 Aligned_cols=67 Identities=25% Similarity=0.463 Sum_probs=46.0
Q ss_pred cccccCCCcee-eccCCCCCHHHHHHHHcc--CccccCCC--ccEEEEEeCCCCCceeecCCCCccCHHHHHHHHH
Q psy17622 95 HIDVYGEDVEV-DYRGYEVTVENFIRLLTA--TSVLTDEG--SNILIYLTGHGGDGFLKFQDSEEVTSQELGDALE 165 (194)
Q Consensus 95 ~~nlY~~~v~I-DY~g~~Vt~~~fl~vL~G--k~l~s~~~--dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~ 165 (194)
+.|.||-+..+ | .|..+|...+|.+|+| .++.=+++ .+.|.|=-+--|...+-|| .|++-++.-.+.
T Consensus 298 GlNFYG~d~~~gd-g~~~IT~~rYL~lLk~~k~~~~~Dees~EH~f~~k~n~~gkhivfyP---TL~Sl~~Ri~lA 369 (392)
T KOG2091|consen 298 GLNFYGNDFNLGD-GGEAITAKRYLQLLKGEKSVFKFDEESKEHFFEYKRNDDGKHIVFYP---TLTSLELRIELA 369 (392)
T ss_pred eeeccccccccCC-CCCceeHHHHHHHHhccCcceeeccccchhheeeeccCCCceEEEec---chHhHHHHHHHH
Confidence 56889744444 3 3578999999999999 34444443 4777777566667788888 477766655443
No 33
>PF03568 Peptidase_C50: Peptidase family C50; InterPro: IPR005314 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C50 (separase family, clan CD). The active site residues for members of this family and family C14 occur in the same order in the sequence: H,C. The separases are caspase-like proteases, which plays a central role in the chromosome segregation. In yeast they cleave the rad21 subunit of the cohesin complex at the onset of anaphase. During most of the cell cycle, separase is inactivated by the securin/cut2 protein, which probably covers its active site. ; GO: 0008233 peptidase activity, 0006508 proteolysis, 0005634 nucleus
Probab=35.42 E-value=39 Score=31.07 Aligned_cols=26 Identities=42% Similarity=0.820 Sum_probs=16.4
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEeCCCC
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGG 143 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg 143 (194)
|..-+++.|+..|+ +.++||| .||||
T Consensus 294 g~~P~~~e~~~~l~--------~~dlf~Y-~GHG~ 319 (383)
T PF03568_consen 294 GRAPTEEEFLQALT--------SSDLFLY-CGHGS 319 (383)
T ss_pred CCCCCHHHHHHHHH--------hCCeEEE-ecCCc
Confidence 34455555555553 4668885 59998
No 34
>PF11181 YflT: Heat induced stress protein YflT
Probab=34.84 E-value=52 Score=24.45 Aligned_cols=31 Identities=19% Similarity=0.278 Sum_probs=25.5
Q ss_pred hhhhhhhhHHHHHHhcCCCCCcEEEEecCCC
Q psy17622 46 RHVANVLSIYRSVKRLGIPDSHIILMIADDM 76 (194)
Q Consensus 46 RHqadv~~~Y~~L~~~Gi~denIIlm~~DD~ 76 (194)
.-+..+..+=+.|+..|+..++|.++..|+-
T Consensus 7 ~~~~E~~~~I~~L~~~Gy~~ddI~Vva~d~~ 37 (103)
T PF11181_consen 7 DNEEEALSAIEELKAQGYSEDDIYVVAKDKD 37 (103)
T ss_pred CCHHHHHHHHHHHHHcCCCcccEEEEEcCch
Confidence 3466778888899999999999999886553
No 35
>PRK09598 lipid A phosphoethanolamine transferase; Reviewed
Probab=34.68 E-value=20 Score=34.54 Aligned_cols=19 Identities=26% Similarity=0.525 Sum_probs=14.6
Q ss_pred ccccCCCccEEEEEeCCCC
Q psy17622 125 SVLTDEGSNILIYLTGHGG 143 (194)
Q Consensus 125 ~l~s~~~dnVfvY~tgHGg 143 (194)
.|+..+.+.++||++|||.
T Consensus 423 ~Lk~~~~~t~iIy~SDHGe 441 (522)
T PRK09598 423 MLKNLKQPALMIYLSDHGE 441 (522)
T ss_pred HHHhCCCCeEEEEEccCcc
Confidence 3444444999999999996
No 36
>COG1925 FruB Phosphotransferase system, HPr-related proteins [Carbohydrate transport and metabolism]
Probab=34.61 E-value=59 Score=24.17 Aligned_cols=51 Identities=18% Similarity=0.388 Sum_probs=39.2
Q ss_pred CCCeeecCCCccccccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeC
Q psy17622 84 RPATVFNNANQHIDVYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTG 140 (194)
Q Consensus 84 ~pG~i~~~~~~~~nlY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tg 140 (194)
+|...|...- .=|.+.+.+.|.|..|+.+.++.+|. |...+++.|-|.-.|
T Consensus 17 RPAa~lv~~a---~~f~s~i~l~~~g~~~~akSim~lm~---Lg~~~G~~i~i~a~G 67 (88)
T COG1925 17 RPAAKLVKLA---SKFDSEITLTNNGKEANAKSIMGLMA---LGAKKGDEIELSAEG 67 (88)
T ss_pred hhHHHHHHHH---hcCCceEEEEeCCEEechHhHHHHHH---hCcCCCCEEEEEEeC
Confidence 4555555432 23567899999999999999999998 888888888887654
No 37
>PRK13626 transcriptional regulator SgrR; Provisional
Probab=33.29 E-value=74 Score=30.34 Aligned_cols=39 Identities=10% Similarity=0.188 Sum_probs=29.5
Q ss_pred cEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcce
Q psy17622 133 NILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHE 175 (194)
Q Consensus 133 nVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~ 175 (194)
..+.|.-=-| ++|.||+.|+|+|+..+|+.+.....|..
T Consensus 178 ~~wtF~LR~g----v~FhDG~plTA~DVv~s~~rl~~~~~~~~ 216 (552)
T PRK13626 178 LHWRFYLRPA----IHFHHGRELEMEDVIASLKRLNTLPLYSH 216 (552)
T ss_pred CEEEEEeCCC----CEeCCCCCCcHHHHHHHHHHHhhCccccC
Confidence 5665554333 59999999999999999999876555543
No 38
>cd01611 GABARAP Ubiquitin domain of GABA-receptor-associated protein. GABARAP (GABA-receptor-associated protein) belongs ot a large family of proteins that mediate intracellular membrane trafficking and/or fusion. GABARAP binds not only to GABA, type A but also to tubulin, gephrin, and ULK1. Orthologues of GABARAP include Gate-16 (golgi-associated ATPase enhancer), LC3 (microtubule-associated protein light chain 3), and ATG8 (autophagy protein 8). ATG8 is a ubiquitin-like protein that is conjugated to the membrane phospholipid, phosphatidylethanolamine as part of a ubiquitin-like conjugation system essential for autophagosome-formation.
Probab=33.18 E-value=52 Score=25.42 Aligned_cols=30 Identities=10% Similarity=0.377 Sum_probs=26.2
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEe
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLT 139 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~t 139 (194)
..+.|...|..+|+ ++|.-++++.+|+|..
T Consensus 48 p~~~tv~~f~~~ir-k~l~l~~~~slfl~Vn 77 (112)
T cd01611 48 PSDLTVGQFVYIIR-KRIQLRPEKALFLFVN 77 (112)
T ss_pred cCCCCHHHHHHHHH-HHhCCCccceEEEEEC
Confidence 46889999999998 7788889999999984
No 39
>PRK13782 phosphocarrier protein Chr; Provisional
Probab=32.18 E-value=58 Score=23.29 Aligned_cols=50 Identities=24% Similarity=0.446 Sum_probs=35.3
Q ss_pred CCCeeecCCCccccccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEe
Q psy17622 84 RPATVFNNANQHIDVYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLT 139 (194)
Q Consensus 84 ~pG~i~~~~~~~~nlY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~t 139 (194)
+|...|.+. ..-|.+++.|.+.|..++.+.++.+|+ |....++.|-|-..
T Consensus 17 RPA~~lv~~---a~~f~~~i~l~~~~~~vdaKSil~lls---Lg~~~g~~v~v~~~ 66 (82)
T PRK13782 17 RPAALFVQE---ANRFHADIFIEKDGKKVNAKSIMGLMS---LAIGTGSMITIITE 66 (82)
T ss_pred HHHHHHHHH---HHhCCCEEEEEECCeEEecHhHHHHHh---cCCCCCCEEEEEEe
Confidence 444444432 233667788888889999999999998 77777777766543
No 40
>cd06183 cyt_b5_reduct_like Cytochrome b5 reductase catalyzes the reduction of 2 molecules of cytochrome b5 using NADH as an electron donor. Like ferredoxin reductases, these proteins have an N-terminal FAD binding subdomain and a C-terminal NADH binding subdomain, separated by a cleft, which accepts FAD. The NADH-binding moiety interacts with part of the FAD and resembles a Rossmann fold. However, NAD is bound differently than in canonical Rossmann fold proteins. Nitrate reductases, flavoproteins similar to pyridine nucleotide cytochrome reductases, catalyze the reduction of nitrate to nitrite. The enzyme can be divided into three functional fragments that bind the cofactors molybdopterin, heme-iron, and FAD/NADH.
Probab=31.99 E-value=32 Score=28.02 Aligned_cols=35 Identities=17% Similarity=0.160 Sum_probs=25.7
Q ss_pred CcEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEEE
Q psy17622 30 NNWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIILM 71 (194)
Q Consensus 30 ~~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIlm 71 (194)
..-.+++|||.++-. -.+.+.|++.|+|+++|.+|
T Consensus 200 ~~~~~~icGp~~~~~-------~~~~~~l~~~G~~~~~i~~~ 234 (234)
T cd06183 200 EDTLVLVCGPPPMIE-------GAVKGLLKELGYKKDNVFKF 234 (234)
T ss_pred CCeEEEEECCHHHHH-------HHHHHHHHHcCCCHHHEEeC
Confidence 345678888876643 25666788999999999764
No 41
>PF10116 Host_attach: Protein required for attachment to host cells; InterPro: IPR019291 Members of this family of bacterial proteins are required for the attachment of the bacterium to host cells [, ].
Probab=31.59 E-value=62 Score=25.10 Aligned_cols=29 Identities=17% Similarity=0.265 Sum_probs=24.9
Q ss_pred cCHHHHHHHHHHHHHcCCcceeEEEecce
Q psy17622 155 VTSQELGDALEQMWQKRRYHEVRACNRYR 183 (194)
Q Consensus 155 l~~~dL~~~l~~M~~~~~Y~~lv~~~~~~ 183 (194)
-.+.+|++.|.++..++.|.+|+++-+-+
T Consensus 73 ~Fa~~vA~~L~~~~~~~~~~~LvlvA~p~ 101 (138)
T PF10116_consen 73 RFAREVADRLEKARRAGKFDRLVLVAPPR 101 (138)
T ss_pred HHHHHHHHHHHHHHHhCCCCeEEEEECHH
Confidence 35788999999999999999999986643
No 42
>cd07563 Peptidase_S41_IRBP Interphotoreceptor retinoid-binding protein; serine protease family S41. Interphotoreceptor retinoid-binding protein (IRBP) is a homolog of the S41 protease, C-terminal processing peptidase (CTPase) family. It is thought to facilitate the compartmentalization of the visual cycle that requires poorly soluble and potentially toxic retinoids to cross the aqueous subretinal space between the photoreceptors and the retinal pigment epithelium (RPE). IRBP is secreted by photoreceptors into the interphotoreceptor matrix (IPM) where it is rapidly turned over by a combination of RPE and photoreceptor endocytosis. It is the most abundant soluble protein component of the IPM, consisting of homologous modules, each repeat structure arising through the duplication (as in teleost IRBP) or quadruplication (in tetrapods) of an ancient gene, arisen in the early evolution of the vertebrate eye. IRBP has been shown to promote the release of all-trans retinol from photoreceptors
Probab=31.00 E-value=1.4e+02 Score=25.19 Aligned_cols=60 Identities=13% Similarity=0.190 Sum_probs=43.1
Q ss_pred CCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCc----cCHHHHHHHHHHHHHcCCcceeEE
Q psy17622 110 YEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEE----VTSQELGDALEQMWQKRRYHEVRA 178 (194)
Q Consensus 110 ~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~----l~~~dL~~~l~~M~~~~~Y~~lv~ 178 (194)
...+++.|..+|+ +-|..-.+.|+.|.. -|+|+++.-.. -..+.+.++++++.+. +.+++
T Consensus 37 ~~~~~~~l~~~l~-~~l~~l~D~H~~~~~-----IgYl~i~~f~~~~~~~~~~~~~~~~~~l~~~---~~LII 100 (250)
T cd07563 37 DITSPEELAAVLT-ADLQELGDGHLNVSY-----IGYLRIDSFGGFEIAAAEALLDEALDKLADT---DALII 100 (250)
T ss_pred cCCCHHHHHHHHH-HhhhccCCCcEEEEE-----eEEEEEcccCChhhhhhHHHHHHHHHHhcCC---CeEEE
Confidence 5678899999998 445544666777777 89999997543 3556778888887554 66664
No 43
>PRK10494 hypothetical protein; Provisional
Probab=30.89 E-value=52 Score=28.74 Aligned_cols=38 Identities=21% Similarity=0.206 Sum_probs=28.2
Q ss_pred cEEEEEeCCCCCccchhhhhhhhHHHHHHhcCCCCCcEEE
Q psy17622 31 NWAVLVDTSRFWFNYRHVANVLSIYRSVKRLGIPDSHIIL 70 (194)
Q Consensus 31 ~wavlvagS~~w~NYRHqadv~~~Y~~L~~~Gi~denIIl 70 (194)
.--||++|..+..+ -.+.+-.+-+.+++.|+|++.|++
T Consensus 121 ~~~ii~SGg~~~~~--~~sEA~~~~~~l~~lGVp~~~Ii~ 158 (259)
T PRK10494 121 GAKLIFTGGAAKTN--TVSTAEVGARVAQSLGVPREDIIT 158 (259)
T ss_pred CCEEEEECCCCCCC--CCCHHHHHHHHHHHcCCCHHHeee
Confidence 45678888776444 235666777888999999999977
No 44
>PF02633 Creatininase: Creatinine amidohydrolase; InterPro: IPR003785 This family includes the enzymes creatininase and 2-amino-5-formylamino-6-ribosylaminopyrimidin-4(3H)-one 5'-monophosphate deformylase, also known as formamide hydrolase. Creatinase or creatinine amidohydrolase (3.5.2.10 from EC) catalyses the hydrolysis of creatinine to creatine, which can then be metabolised to urea and sarcosine by creatinase (3.5.3.3 from EC). Creatininase is a member of the urease-related amidohydrolase superfamily []. Formamide hydrolase catalyzes the hydrolysis of the formamide of 2-amino-5-formylamino-6-ribosylamino-4(3H)-pyrimidinone 5'-monophosphate (FAPy) to form 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate (APy) (3.5.1.102 from EC). ; PDB: 3A6K_F 3A6F_A 3A6D_B 1J2U_B 3A6J_C 1J2T_A 3A6G_C 3A6H_F 1Q3K_E 3A6L_C ....
Probab=30.64 E-value=37 Score=28.75 Aligned_cols=51 Identities=25% Similarity=0.432 Sum_probs=32.4
Q ss_pred eccC-CCCCHHHHHHHHcc--Ccc-ccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHc
Q psy17622 106 DYRG-YEVTVENFIRLLTA--TSV-LTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQK 170 (194)
Q Consensus 106 DY~g-~~Vt~~~fl~vL~G--k~l-~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~ 170 (194)
+|.| -.++++.|+++|+- +.| +.|-+ -+|.++|||| + ..-|..+.+++.++
T Consensus 72 ~fpGTisl~~~t~~~~l~di~~sl~~~Gf~--~ivivngHgG-N-----------~~~l~~~~~~l~~~ 126 (237)
T PF02633_consen 72 GFPGTISLSPETLIALLRDILRSLARHGFR--RIVIVNGHGG-N-----------IAALEAAARELRQE 126 (237)
T ss_dssp TSTT-BBB-HHHHHHHHHHHHHHHHHHT----EEEEEESSTT-H-----------HHHHHHHHHHHHHH
T ss_pred CCCCeEEeCHHHHHHHHHHHHHHHHHcCCC--EEEEEECCHh-H-----------HHHHHHHHHHHHhh
Confidence 3555 47888999999875 222 24443 6899999999 1 11266777777766
No 45
>cd07561 Peptidase_S41_CPP_like C-terminal processing peptidase-like; serine protease family S41. Bacterial protease homologs of the S41 family related to C-terminal processing peptidase (CPP). CPP-1 is believed to be important for the degradation of incorrectly synthesized proteins as well as protection from thermal and osmotic stresses. CPP is synthesized with an extension on its carboxyl-terminus and specifically recognizes a C-terminal tripeptide, but cleaves at variable distance from the C-terminus. The CPP active site consists of a serine/lysine catalytic dyad. Conservation of these residues is seen in the CPP-like proteins of this group. CPP proteins contain a PDZ domain that promotes protein-protein interactions and is important for substrate recognition however, most of CPP-like proteins only have an internal fragment or lack the PDZ domain.
Probab=29.63 E-value=1.9e+02 Score=25.08 Aligned_cols=64 Identities=20% Similarity=0.306 Sum_probs=43.3
Q ss_pred CCCCHHHHHHHHccCccccCCCccEEEEEe-CCCCCceeecCCCCccCHHHHHHHHHHHHHcCCcceeEE
Q psy17622 110 YEVTVENFIRLLTATSVLTDEGSNILIYLT-GHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRYHEVRA 178 (194)
Q Consensus 110 ~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~t-gHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y~~lv~ 178 (194)
..-+|+.|+.-|+-+. +.. .-|=|++ +.+.-|+|.+..-..-++++|.++|++|.+++ -+.|++
T Consensus 35 ~~~~p~~~~~~l~~~~---~~~-~~fs~~~~~~~~IGYi~i~~F~~~~~~~l~~a~~~l~~~~-~~~LIl 99 (256)
T cd07561 35 YFDDPEDFLESLLSEK---DGK-DRFSYIVDGGKKVGYLVYNSFTSGYDDELNQAFAEFKAQG-VTELVL 99 (256)
T ss_pred ccCCHHHHHHHhhCcc---ccC-CceEEEeeCCCcEEEEEECccccchHHHHHHHHHHHHHcC-CCeEEE
Confidence 4567899998885211 111 3565555 44556999998755557899999999998764 466664
No 46
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=28.37 E-value=52 Score=26.10 Aligned_cols=29 Identities=10% Similarity=0.127 Sum_probs=24.4
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEe
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLT 139 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~t 139 (194)
..++|..+|..+++ ++|.-.+++ +|+|..
T Consensus 48 P~d~tV~qF~~iIR-krl~l~~~k-~flfVn 76 (121)
T PTZ00380 48 PRDATVAELEAAVR-QALGTSAKK-VTLAIE 76 (121)
T ss_pred CCCCcHHHHHHHHH-HHcCCChhH-EEEEEC
Confidence 57899999999998 678888888 887763
No 47
>PRK10850 PTS system phosphohistidinoprotein-hexose phosphotransferase subunit Hpr; Provisional
Probab=28.30 E-value=80 Score=23.00 Aligned_cols=39 Identities=18% Similarity=0.375 Sum_probs=33.5
Q ss_pred cCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeC
Q psy17622 99 YGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTG 140 (194)
Q Consensus 99 Y~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tg 140 (194)
|.+.+.+-+.|..++.+.++.+|+ |....+|.|=|-..|
T Consensus 29 ~~s~v~l~~~~~~~~akSil~lm~---Lg~~~G~~v~i~~~G 67 (85)
T PRK10850 29 FTSEITVTSNGKSASAKSLFKLQT---LGLTQGTVVTISAEG 67 (85)
T ss_pred CCCEEEEEECCeEEchHhHHHHHh---cCCCCCCEEEEEEeC
Confidence 667788888889999999999998 888888888888766
No 48
>PRK13780 phosphocarrier protein HPr; Provisional
Probab=27.12 E-value=74 Score=23.30 Aligned_cols=55 Identities=20% Similarity=0.464 Sum_probs=42.0
Q ss_pred cCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHc
Q psy17622 99 YGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQK 170 (194)
Q Consensus 99 Y~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~ 170 (194)
|.+.+.+-+.|..++.+.++.+|+ |.....+.|=|...|- .+++..++|..+.++
T Consensus 29 ~~s~i~l~~~~~~vdakSil~lm~---Lg~~~G~~v~i~a~G~--------------De~~Al~~l~~~l~~ 83 (88)
T PRK13780 29 FDSDITLEYNGKSVNLKSIMGVMS---LGVGQGADITISAEGA--------------DAADAIAAIEETMKK 83 (88)
T ss_pred CCCEEEEEECCEEEechhHHHHHh---cCCCCCCEEEEEEeCc--------------CHHHHHHHHHHHHHh
Confidence 667788888889999999999998 8888888888887772 245566666666543
No 49
>cd08511 PBP2_NikA_DppA_OppA_like_5 The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold. This family represents the substrate-binding domain of an uncharacterized ATP-binding cassette (ABC) type nickel/dipeptide/oligopeptide-like transporter. The oligopeptide-binding protein OppA and the dipeptide-binding protein DppA show significant sequence similarity to NikA, the initial nickel receptor. The DppA binds dipeptides and some tripeptides and is involved in chemotaxis toward dipeptides, whereas the OppA binds peptides of a wide range of lengths (2-35 amino acid residues) and plays a role in recycling of cell wall peptides, which precludes any involvement in chemotaxis. Most of other periplasmic binding proteins are comprised of only two globular subdomains corresponding to domains I and III of the dipeptide/oligopeptide binding proteins. The structural topology of these domains is mos
Probab=26.85 E-value=1.1e+02 Score=27.79 Aligned_cols=35 Identities=20% Similarity=0.320 Sum_probs=26.6
Q ss_pred cEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcC
Q psy17622 133 NILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKR 171 (194)
Q Consensus 133 nVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~ 171 (194)
..+.|--=.| ++|.|++.||++|+..+++.+....
T Consensus 59 ~~~~f~Lr~~----~~wsdG~pvTA~Dv~~s~~~~~~~~ 93 (467)
T cd08511 59 KTLTLKLRKG----VKFHDGTPFDAAAVKANLERLLTLP 93 (467)
T ss_pred cEEEEEeCCC----CCccCCCCCCHHHHHHHHHHHhCCC
Confidence 4444444434 4899999999999999999987654
No 50
>PF00496 SBP_bac_5: Bacterial extracellular solute-binding proteins, family 5 Middle; InterPro: IPR000914 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins and a high affinity periplasmic solute-binding protein. The latter are thought to bind the substrate in the vicinity of the inner membrane, and to transfer it to a complex of inner membrane proteins for concentration into Gram-positive bacteria which are surrounded by a single membrane and therefore have no periplasmic region the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homologue proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute throught the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. On the basis of sequence similarities, the vast majority of these solute-binding proteins can be grouped [] into eight families of clusters, which generally correlate with the nature of the solute bound. Family 5 currently includes periplasmic oligopeptide-binding proteins (oppA) of Gram-negative bacteria and homologous lipoproteins in Gram-positive bacteria (oppA, amiA or appA); periplasmic dipeptide-binding proteins of Escherichia coli (dppA) and Bacillus subtilis (dppE); periplasmic murein peptide-binding protein of E. coli (mppA); periplasmic peptide-binding proteins sapA of E. coli, Salmonella typhimurium and Haemophilus influenzae; periplasmic nickel-binding protein (nikA) of E. coli; haem-binding lipoprotein (hbpA or dppA) from H. influenzae; lipoprotein xP55 from Streptomyces lividans; and hypothetical proteins from H. influenzae (HI0213) and Rhizobium sp. (strain NGR234) symbiotic plasmid (y4tO and y4wM).; GO: 0005215 transporter activity, 0006810 transport; PDB: 1B51_A 1B0H_A 1QKA_A 1B9J_A 1B6H_A 1OLA_A 1B3L_C 1JEV_A 1B5H_A 1JET_A ....
Probab=26.70 E-value=68 Score=27.81 Aligned_cols=26 Identities=23% Similarity=0.368 Sum_probs=20.3
Q ss_pred eecCCCCccCHHHHHHHHHHHHHcCC
Q psy17622 147 LKFQDSEEVTSQELGDALEQMWQKRR 172 (194)
Q Consensus 147 i~f~d~~~l~~~dL~~~l~~M~~~~~ 172 (194)
++|.|++.||++|+..+++.+.....
T Consensus 28 ~~wsDG~~lTA~Dv~~s~~~~~~~~~ 53 (374)
T PF00496_consen 28 LKWSDGEPLTAEDVVFSFERLADPDY 53 (374)
T ss_dssp -B-TTSTB-SHHHHHHHHHHHHHCCG
T ss_pred eeecCCCcceeeEEeeeehhcccCCc
Confidence 58999999999999999999886543
No 51
>PF02991 Atg8: Autophagy protein Atg8 ubiquitin like; InterPro: IPR004241 Autophagy is generally known as a process involved in the degradation of bulk cytoplasmic components that are non-specifically sequestered into an autophagosome, where they are sequestered into double-membrane vesicles and delivered to the degradative organelle, the lysosome/vacuole, for breakdown and eventual recycling of the resulting macromolecules. The yeast proteins are involved in the autophagosome, and Atg8 binds Atg19, via its N terminus and the C terminus of Atg19. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity [] Related proteins that belong to this group include the human ganglioside expression factor and a symbiosis-related fungal protein.; PDB: 3ECI_A 3D32_B 1GNU_A 1KM7_A 1KLV_A 1KOT_A 3DOW_A 1KJT_A 1V49_A 2ZJD_C ....
Probab=25.86 E-value=59 Score=24.87 Aligned_cols=31 Identities=10% Similarity=0.363 Sum_probs=23.7
Q ss_pred CCCCCHHHHHHHHccCccccCCCccEEEEEeC
Q psy17622 109 GYEVTVENFIRLLTATSVLTDEGSNILIYLTG 140 (194)
Q Consensus 109 g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tg 140 (194)
..++|..+|+.+++ ++|.-+++..+|+|..+
T Consensus 40 p~~~tv~qf~~~ir-~rl~l~~~~alfl~Vn~ 70 (104)
T PF02991_consen 40 PKDLTVGQFVYIIR-KRLQLSPEQALFLFVNN 70 (104)
T ss_dssp ETTSBHHHHHHHHH-HHTT--TTS-EEEEBTT
T ss_pred cCCCchhhHHHHhh-hhhcCCCCceEEEEEcC
Confidence 56889999999998 77888888999998754
No 52
>cd08506 PBP2_clavulanate_OppA2 The substrate-binding domain of an oligopeptide binding protein (OppA2) from the biosynthesis pathway of the beta-lactamase inhibitor clavulanic acid contains the type 2 periplasmic binding fold. Clavulanic acid (CA), a clinically important beta-lactamase inhibitor, is one of a family of clavams produced as secondary metabolites by fermentation of Streptomyces clavuligeru. The biosynthesis of CA proceeds via multiple steps from the precursors, glyceraldehyde-3-phosphate and arginine. CA possesses a characteristic (3R,5R) stereochemistry essential for reaction with penicillin-binding proteins and beta-lactamases. Two genes (oppA1 and oppA2) in the clavulanic acid gene cluster encode oligopeptide-binding proteins that are required for CA biosynthesis. OppA1/2 is involved in the binding and transport of peptides across the cell membrane of Streptomyces clavuligerus. Most of other periplasmic binding proteins are comprised of only two globular subdomains cor
Probab=25.48 E-value=63 Score=29.42 Aligned_cols=36 Identities=22% Similarity=0.491 Sum_probs=27.5
Q ss_pred CCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHH
Q psy17622 130 EGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQ 169 (194)
Q Consensus 130 ~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~ 169 (194)
++...+.|--=.| ++|.|++.|||+|+..+++.+..
T Consensus 61 ~d~~~~tf~Lr~~----vkf~dG~p~TA~Dv~~s~~~~~~ 96 (466)
T cd08506 61 DDGKTWTYTLRDG----LKFEDGTPITAKDVKYGIERSFA 96 (466)
T ss_pred CCceEEEEEECCC----CEeCCCCeeeHHHHHHhhhheEE
Confidence 3445666655555 49999999999999999997754
No 53
>cd00659 Topo_IB_C DNA topoisomerase IB, C-terminal catalytic domain. Topoisomerase I promotes the relaxation of both positive and negative DNA superhelical tension by introducing a transient single-stranded break in duplex DNA. This function is vital for the processes of replication, transcription, and recombination. Unlike Topo IA enzymes, Topo IB enzymes do not require a single-stranded region of DNA or metal ions for their function. The type IB family of DNA topoisomerases includes eukaryotic nuclear topoisomerase I, topoisomerases of poxviruses and bacterial versions of Topo IB. They belong to the superfamily of DNA breaking-rejoining enzymes, which share the same fold in their C-terminal catalytic domain and the overall reaction mechanism with tyrosine recombinases. The C-terminal catalytic domain in topoisomerases is linked to a divergent N-terminal domain that shows no sequence or structure similarity to the N-terminal domains of tyrosine recombinases.
Probab=25.16 E-value=64 Score=27.74 Aligned_cols=60 Identities=23% Similarity=0.435 Sum_probs=39.2
Q ss_pred cccccCCCceeeccCCC-------CCH-HHHHHHHccCcccc-CCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHH
Q psy17622 95 HIDVYGEDVEVDYRGYE-------VTV-ENFIRLLTATSVLT-DEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALE 165 (194)
Q Consensus 95 ~~nlY~~~v~IDY~g~~-------Vt~-~~fl~vL~Gk~l~s-~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~ 165 (194)
+..+-++.+.++|.|.+ |.. ..+.++|. .+.. .|++.+|-|. +...|++.++++-|+
T Consensus 78 Hv~~~~~~i~f~F~GK~g~~~~~~v~d~~~l~~~l~--~~~~k~pg~~LF~y~------------~~~~v~s~~vN~yl~ 143 (218)
T cd00659 78 HVTLKPNVVRFDFLGKDSIRYENEVEVEPRLFKNLR--KFLDKLPGDDLFQYL------------QVDRLNSSKLNAYLR 143 (218)
T ss_pred heeEeCCEEEEEEECCCCcEEEEEECChHHHHHHHH--HHHHcCCcHHhhccC------------CCCcCCHHHHHHHHH
Confidence 33444555677777752 343 45666664 3433 6778899887 334799999999988
Q ss_pred HHH
Q psy17622 166 QMW 168 (194)
Q Consensus 166 ~M~ 168 (194)
++-
T Consensus 144 e~~ 146 (218)
T cd00659 144 EFM 146 (218)
T ss_pred HHh
Confidence 874
No 54
>TIGR03417 chol_sulfatase choline-sulfatase.
Probab=24.73 E-value=64 Score=30.44 Aligned_cols=14 Identities=21% Similarity=0.662 Sum_probs=12.2
Q ss_pred CCccEEEEEeCCCC
Q psy17622 130 EGSNILIYLTGHGG 143 (194)
Q Consensus 130 ~~dnVfvY~tgHGg 143 (194)
-+++|+||.+|||.
T Consensus 279 ~dnTivvf~sDHG~ 292 (500)
T TIGR03417 279 ADDTIVLFTSDHGD 292 (500)
T ss_pred cCCcEEEEECCCch
Confidence 46789999999995
No 55
>KOG1552|consensus
Probab=24.64 E-value=69 Score=28.56 Aligned_cols=42 Identities=31% Similarity=0.532 Sum_probs=31.0
Q ss_pred hhhhhhHHHHHH-hcCCCCCcEEEEecC-------CCCCCCCCCCCCeeecCC
Q psy17622 48 VANVLSIYRSVK-RLGIPDSHIILMIAD-------DMACNPRNPRPATVFNNA 92 (194)
Q Consensus 48 qadv~~~Y~~L~-~~Gi~denIIlm~~D-------D~a~~p~Np~pG~i~~~~ 92 (194)
.+|+-++|+-|+ ++| ++|.|||+-.= |.| .+.|..|.|...|
T Consensus 112 y~Di~avye~Lr~~~g-~~~~Iil~G~SiGt~~tv~La--sr~~~~alVL~SP 161 (258)
T KOG1552|consen 112 YADIKAVYEWLRNRYG-SPERIILYGQSIGTVPTVDLA--SRYPLAAVVLHSP 161 (258)
T ss_pred hhhHHHHHHHHHhhcC-CCceEEEEEecCCchhhhhHh--hcCCcceEEEecc
Confidence 489999999996 677 99999998542 222 4566777777766
No 56
>COG2604 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.32 E-value=78 Score=31.43 Aligned_cols=24 Identities=21% Similarity=0.277 Sum_probs=21.9
Q ss_pred HHHHHHhcCCCCCcEEEEecCCCC
Q psy17622 54 IYRSVKRLGIPDSHIILMIADDMA 77 (194)
Q Consensus 54 ~Y~~L~~~Gi~denIIlm~~DD~a 77 (194)
+|..|.++||.||=++.+=+||+|
T Consensus 258 al~~L~k~GIkPDyVc~ld~~di~ 281 (594)
T COG2604 258 ALPILAKHGIKPDYVCSLDPDDIA 281 (594)
T ss_pred cchHHHhcCCCCCeEEEecchHHH
Confidence 678999999999999999999877
No 57
>PF04911 ATP-synt_J: ATP synthase j chain; InterPro: IPR006995 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit J found in the F0 complex of F-ATPases from fungal mitochondria. This subunit does not appear to display sequence similarity with subunits of F-ATPases found in other organisms []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o)
Probab=24.11 E-value=4.7 Score=27.78 Aligned_cols=26 Identities=35% Similarity=0.686 Sum_probs=18.7
Q ss_pred cCCCCCcEEEEecCCCCCCCCCCCCC
Q psy17622 61 LGIPDSHIILMIADDMACNPRNPRPA 86 (194)
Q Consensus 61 ~Gi~denIIlm~~DD~a~~p~Np~pG 86 (194)
+|+..-+-..|-+|...+|||||+-+
T Consensus 22 ygv~k~~~a~~ns~E~~NDPRNP~~~ 47 (54)
T PF04911_consen 22 YGVNKAQNAMMNSDEFKNDPRNPRAA 47 (54)
T ss_pred HHHHHHHHHHhcCHHHhcCCCChhhh
Confidence 34444455667788899999999754
No 58
>cd00995 PBP2_NikA_DppA_OppA_like The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold. This family represents the periplasmic substrate-binding domain of nickel/dipeptide/oligopeptide transport systems, which function in the import of nickel and peptides, and other closely related proteins. The oligopeptide-binding protein OppA is a periplasmic component of an ATP-binding cassette (ABC) transport system OppABCDEF consisting of five subunits: two homologous integral membrane proteins OppB and OppF that form the translocation pore; two homologous nucleotide-binding domains OppD and OppF that drive the transport process through binding and hydrolysis of ATP; and the substrate-binding protein or receptor OppA that determines the substrate specificity of the transport system. The dipeptide (DppA) and oligopeptide (OppA) binding proteins differ in several ways. The DppA binds dipeptides and some tripeptides and is inv
Probab=23.49 E-value=1.2e+02 Score=27.15 Aligned_cols=43 Identities=16% Similarity=0.309 Sum_probs=30.9
Q ss_pred ccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcC
Q psy17622 125 SVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKR 171 (194)
Q Consensus 125 ~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~ 171 (194)
.+...++...+.|--=-| ++|.|++.|+++|+.-+++.+...+
T Consensus 50 ~~~~~~d~~~~tf~Lr~~----~~~sDG~p~TA~Dv~~s~~~~~~~~ 92 (466)
T cd00995 50 SWEVSDDGKTYTFKLRDG----VKFHDGTPLTAEDVVFSFERLADPK 92 (466)
T ss_pred eeEEcCCCcEEEEEecCC----CCcCCCCCcCHHHHHhHHHHhcCCc
Confidence 344444444555544444 5999999999999999999987765
No 59
>cd08507 PBP2_SgrR_like The C-terminal solute-binding domain of DNA-binding transcriptional regulator SgrR is related to the ABC-type oligopeptide-binding proteins and contains the type 2 periplasmic-binding fold. A novel family of SgrR transcriptional regulator contains a two-domain structure with an N terminal DNA-binding domain of the winged helix family and a C-terminal solute-binding domain. The C-terminal domain shows strong homology with the ABC-type oligopeptide-binding protein family, a member of the type 2 periplasmic-binding fold protein (PBP2) superfamily that also includes the C-terminal substrate-binding domain of LysR-type transcriptional regulators. SgrR (SugaR transport-related Regulator) is negatively autoregulated and activates transcription of divergent operon SgrS, which encodes a small RNA required for recovery from glucose-phosphate stress. Hence, the small RNA SgrS and SgrR, the transcription factor that controls sgrS expression, are both required for recovery f
Probab=22.48 E-value=1.4e+02 Score=27.21 Aligned_cols=67 Identities=15% Similarity=0.279 Sum_probs=41.5
Q ss_pred cccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcCCc
Q psy17622 97 DVYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKRRY 173 (194)
Q Consensus 97 nlY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~~Y 173 (194)
.+|.+-+.+|..+.++.| . |. +.+...++...+.|----| ++|.|++.++++|+.-+++.|.+...+
T Consensus 34 ~v~e~L~~~d~~~~~~~p----~-LA-~~~~~~~d~~~~tf~LR~g----v~f~DG~pvtA~DV~~s~~r~~~~~~~ 100 (448)
T cd08507 34 QIFDGLVRYDEENGEIEP----D-LA-HHWESNDDLTHWTFYLRKG----VRFHNGRELTAEDVVFTLLRLRELESY 100 (448)
T ss_pred HHhhhheeEccccCCCCh----h-hh-ceeEECCCCcEEEEEECCC----CEecCCCCccHHHHHHHHHHHhccccc
Confidence 466444556652234444 1 21 3344433334555554444 489999999999999999999776543
No 60
>cd08490 PBP2_NikA_DppA_OppA_like_3 The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold. This CD represents the substrate-binding domain of an uncharacterized ATP-binding cassette (ABC) type nickel/dipeptide/oligopeptide-like transporter. The oligopeptide-binding protein OppA and the dipeptide-binding protein DppA show significant sequence similarity to NikA, the initial nickel receptor. The DppA binds dipeptides and some tripeptides and is involved in chemotaxis toward dipeptides, whereas the OppA binds peptides of a wide range of lengths (2-35 amino acid residues) and plays a role in recycling of cell wall peptides, which precludes any involvement in chemotaxis. Most of other periplasmic binding proteins are comprised of only two globular subdomains corresponding to domains I and III of the dipeptide/oligopeptide binding proteins. The structural topology of these domains is most s
Probab=22.24 E-value=72 Score=28.89 Aligned_cols=37 Identities=22% Similarity=0.383 Sum_probs=28.6
Q ss_pred CccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcC
Q psy17622 131 GSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKR 171 (194)
Q Consensus 131 ~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~ 171 (194)
+...+.|---.| ++|.||+.||++|+.-+++.+...+
T Consensus 54 d~~~~tf~Lr~~----~~wsDG~plTA~Dv~~s~~~~~~~~ 90 (470)
T cd08490 54 DDTTWEFTLRDG----VKFHDGTPLTAEAVKASLERALAKS 90 (470)
T ss_pred CCCEEEEEECCC----CCccCCCCCCHHHHHHHHHHHhccC
Confidence 344565555555 4899999999999999999887654
No 61
>PF08259 Periviscerokin: Periviscerokinin family; InterPro: IPR013231 Perviscerokinin neuropeptides are found in the abdominal perisympathetic organs of insects. They mediate visceral muscle contractile activity (myotropic activity). CAPA, which are in the periviscerokinin and pyrokinin peptide families, has potential medical importance. This is due to its myotropic effects on, for example, heart muscles and due to its occurrence in the Ixodoidea (ticks), which are important vectors in the transmission of many animal diseases []. These peptides also have a strong diuretic or anti-diuretic effect, suggesting they have significant medical implications [].
Probab=21.85 E-value=44 Score=15.98 Aligned_cols=10 Identities=30% Similarity=0.826 Sum_probs=7.5
Q ss_pred CCCceeecCC
Q psy17622 142 GGDGFLKFQD 151 (194)
Q Consensus 142 Gg~g~i~f~d 151 (194)
|+.|.|.||-
T Consensus 1 gssGlI~fpR 10 (11)
T PF08259_consen 1 GSSGLIPFPR 10 (11)
T ss_pred CCccccccCC
Confidence 6678888873
No 62
>cd08494 PBP2_NikA_DppA_OppA_like_6 The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold. This CD represents the substrate-binding domain of an uncharacterized ATP-binding cassette (ABC) type nickel/dipeptide/oligopeptide-like transporter. The oligopeptide-binding protein OppA and the dipeptide-binding protein DppA show significant sequence similarity to NikA, the initial nickel receptor. The DppA binds dipeptides and some tripeptides and is involved in chemotaxis toward dipeptides, whereas the OppA binds peptides of a wide range of lengths (2-35 amino acid residues) and plays a role in recycling of cell wall peptides, which precludes any involvement in chemotaxis. Most of other periplasmic binding proteins are comprised of only two globular subdomains corresponding to domains I and III of the dipeptide/oligopeptide binding proteins. The structural topology of these domains is most si
Probab=21.60 E-value=76 Score=28.52 Aligned_cols=25 Identities=12% Similarity=0.260 Sum_probs=22.1
Q ss_pred eecCCCCccCHHHHHHHHHHHHHcC
Q psy17622 147 LKFQDSEEVTSQELGDALEQMWQKR 171 (194)
Q Consensus 147 i~f~d~~~l~~~dL~~~l~~M~~~~ 171 (194)
++|.||+.||++|+..+++.|...+
T Consensus 69 ~~fsdG~pvTA~Dv~~s~~~~~~~~ 93 (448)
T cd08494 69 VTFHDGTPFDAADVKFSLQRARAPD 93 (448)
T ss_pred CEecCcCCCCHHHHHhHHHHHhCCC
Confidence 5899999999999999999987653
No 63
>PRK11598 putative metal dependent hydrolase; Provisional
Probab=21.33 E-value=59 Score=31.62 Aligned_cols=14 Identities=36% Similarity=0.648 Sum_probs=11.8
Q ss_pred CccEEEEEeCCCCC
Q psy17622 131 GSNILIYLTGHGGD 144 (194)
Q Consensus 131 ~dnVfvY~tgHGg~ 144 (194)
++.++||++|||..
T Consensus 445 ~nT~iIy~SDHGe~ 458 (545)
T PRK11598 445 FNTSLVYLSDHGES 458 (545)
T ss_pred CCeEEEEECcCCCc
Confidence 45799999999974
No 64
>PRK13759 arylsulfatase; Provisional
Probab=20.75 E-value=79 Score=29.73 Aligned_cols=14 Identities=21% Similarity=0.703 Sum_probs=12.0
Q ss_pred CCccEEEEEeCCCC
Q psy17622 130 EGSNILIYLTGHGG 143 (194)
Q Consensus 130 ~~dnVfvY~tgHGg 143 (194)
.++.|+||.+|||.
T Consensus 294 ~dnTiiv~tsDHG~ 307 (485)
T PRK13759 294 LDNTIILFVSDHGD 307 (485)
T ss_pred ccCeEEEEECCCcc
Confidence 46789999999996
No 65
>PRK04452 acetyl-CoA decarbonylase/synthase complex subunit delta; Provisional
Probab=20.74 E-value=47 Score=30.31 Aligned_cols=45 Identities=24% Similarity=0.391 Sum_probs=31.5
Q ss_pred hhhhhccccCCCcEEEEEeCCCCCccchhhhhh-hhHHHHHHhcCCCCCcEEE
Q psy17622 19 NYRHVANAKHSNNWAVLVDTSRFWFNYRHVANV-LSIYRSVKRLGIPDSHIIL 70 (194)
Q Consensus 19 n~r~~~~~~~~~~wavlvagS~~w~NYRHqadv-~~~Y~~L~~~Gi~denIIl 70 (194)
||+.+........++|++.++.. -|. -.++..|.+.|+|.++|++
T Consensus 163 n~~~i~~lA~~y~~~Vva~s~~D-------ln~ak~L~~~l~~~Gi~~edIvi 208 (319)
T PRK04452 163 NYKKIAAAAMAYGHAVIAWSPLD-------INLAKQLNILLTELGVPRERIVM 208 (319)
T ss_pred HHHHHHHHHHHhCCeEEEEcHHH-------HHHHHHHHHHHHHcCCCHHHEEE
Confidence 56666666666667777777554 333 3367778899999999987
No 66
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=20.56 E-value=1.1e+02 Score=18.93 Aligned_cols=24 Identities=4% Similarity=0.126 Sum_probs=17.6
Q ss_pred ccccCCCccEEEEEeCCCCCceee
Q psy17622 125 SVLTDEGSNILIYLTGHGGDGFLK 148 (194)
Q Consensus 125 ~l~s~~~dnVfvY~tgHGg~g~i~ 148 (194)
++...++..+.|+|.|.|....+.
T Consensus 19 V~~~~~~~~~~V~f~DyG~~~~v~ 42 (48)
T cd04508 19 ITSILSDGKVEVFFVDYGNTEVVP 42 (48)
T ss_pred EEEECCCCcEEEEEEcCCCcEEEe
Confidence 344445778999999999877654
No 67
>PF14681 UPRTase: Uracil phosphoribosyltransferase; PDB: 1V9S_B 1UPF_A 1UPU_D 1JLR_B 1BD4_A 1BD3_C 1JLS_D 1XTV_C 1XTU_H 3G6W_C ....
Probab=20.55 E-value=77 Score=26.54 Aligned_cols=25 Identities=20% Similarity=0.509 Sum_probs=20.6
Q ss_pred hhhhhHHHHHHhcCCCCCcEEEEec
Q psy17622 49 ANVLSIYRSVKRLGIPDSHIILMIA 73 (194)
Q Consensus 49 adv~~~Y~~L~~~Gi~denIIlm~~ 73 (194)
..++.+-+.|+++|+++++|+++..
T Consensus 135 ~s~~~ai~~L~~~G~~~~~I~~v~~ 159 (207)
T PF14681_consen 135 GSAIAAIEILKEHGVPEENIIIVSV 159 (207)
T ss_dssp HHHHHHHHHHHHTTG-GGEEEEEEE
T ss_pred hhHHHHHHHHHHcCCCcceEEEEEE
Confidence 4678899999999999999998743
No 68
>cd08496 PBP2_NikA_DppA_OppA_like_9 The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold. This CD represents the substrate-binding domain of an uncharacterized ATP-binding cassette (ABC) type nickel/dipeptide/oligopeptide-like transporter. The oligopeptide-binding protein OppA and the dipeptide-binding protein DppA show significant sequence similarity to NikA, the initial nickel receptor. The DppA binds dipeptides and some tripeptides and is involved in chemotaxis toward dipeptides, whereas the OppA can bind peptides of a wide range of lengths (2-35 amino-acid residues) and plays a role in recycling of cell wall peptides, which precludes any involvement in chemotaxis. Most of other periplasmic binding proteins are comprised of only two globular subdomains corresponding to domains I and III of the dipeptide/oligopeptide binding proteins. The structural topology of these domains is most
Probab=20.36 E-value=82 Score=28.60 Aligned_cols=64 Identities=16% Similarity=0.183 Sum_probs=41.3
Q ss_pred cccCCCceeeccCCCCCHHHHHHHHccCccccCCCccEEEEEeCCCCCceeecCCCCccCHHHHHHHHHHHHHcC
Q psy17622 97 DVYGEDVEVDYRGYEVTVENFIRLLTATSVLTDEGSNILIYLTGHGGDGFLKFQDSEEVTSQELGDALEQMWQKR 171 (194)
Q Consensus 97 nlY~~~v~IDY~g~~Vt~~~fl~vL~Gk~l~s~~~dnVfvY~tgHGg~g~i~f~d~~~l~~~dL~~~l~~M~~~~ 171 (194)
.+|.+-+.+|..| ++.|. |. +.+...++...+.|---.|. +|.|++.+|++|+.-+++.+....
T Consensus 29 ~~~e~L~~~d~~~-~~~p~-----LA-~~~~~s~d~~t~tf~Lr~~~----~f~DG~pvTA~Dv~~s~~~~~~~~ 92 (454)
T cd08496 29 LLYDTLIKLDPDG-KLEPG-----LA-ESWEYNADGTTLTLHLREGL----TFSDGTPLDAAAVKANLDRGKSTG 92 (454)
T ss_pred HhcccceeECCCC-CCCcc-----cc-ceeeECCCCeEEEEEeCCCC----CccCCCCcCHHHHHHHHHHHhCCC
Confidence 3554334555443 34442 21 33444455566777666664 899999999999999999987654
No 69
>PF00691 OmpA: OmpA family; InterPro: IPR006665 This entry represents domain with a beta/alpha/beta/alpha-beta(2) structure found in the C-terminal region of many Gram-negative bacterial outer membrane proteins [], such as porin-like integral membrane proteins (such as ompA) [], small lipid-anchored proteins (such as pal) [], and MotB proton channels []. The N-terminal half is variable although some of the proteins in this group have the OmpA-like transmembrane domain IPR000498 from INTERPRO at the N terminus. OmpA from Escherichia coli is required for pathogenesis, and can interact with host receptor molecules []. MotB (and MotA) serves two functions in E. coli, the MotA(4)-MotB(2) complex attaches to the cell wall via MotB to form the stator of the flagellar motor, and the MotA-MotB complex couples the flow of ions across the cell membrane to movement of the rotor [].; GO: 0009279 cell outer membrane; PDB: 1OAP_A 2W8B_G 2HQS_C 4ERH_A 2ZF8_A 2ZOV_A 2ZVZ_B 2ZVY_A 3TD4_B 3TD5_D ....
Probab=20.01 E-value=1.9e+02 Score=20.26 Aligned_cols=32 Identities=16% Similarity=0.171 Sum_probs=23.3
Q ss_pred cchhhhh-hhhHHHHHHhcCCCCCcEEEEecCC
Q psy17622 44 NYRHVAN-VLSIYRSVKRLGIPDSHIILMIADD 75 (194)
Q Consensus 44 NYRHqad-v~~~Y~~L~~~Gi~denIIlm~~DD 75 (194)
|.+-... +-.+-+.|...|+++++|.+..+..
T Consensus 48 n~~LS~~RA~~V~~~L~~~gi~~~ri~~~~~G~ 80 (97)
T PF00691_consen 48 NQELSQRRAEAVKQYLVENGIPPERISVVGYGE 80 (97)
T ss_dssp HHHHHHHHHHHHHHHHHHTTSSGGGEEEEEETT
T ss_pred HhHHHHHHHHHHHHHHHHcCCChHhEEEEEEcc
Confidence 4443333 4557778899999999998887766
Done!