Query psy623
Match_columns 69
No_of_seqs 142 out of 1065
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 21:06:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy623.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/623hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0417|consensus 99.9 1.4E-22 3E-27 117.7 6.6 57 9-66 2-58 (148)
2 COG5078 Ubiquitin-protein liga 99.9 6.7E-22 1.5E-26 116.3 8.8 59 8-66 5-63 (153)
3 KOG0419|consensus 99.8 3.7E-21 8E-26 110.0 6.7 60 5-65 1-60 (152)
4 KOG0426|consensus 99.8 7.6E-21 1.6E-25 109.0 6.3 61 6-66 2-62 (165)
5 PTZ00390 ubiquitin-conjugating 99.8 5.6E-20 1.2E-24 108.3 8.2 57 9-66 3-59 (152)
6 PLN00172 ubiquitin conjugating 99.8 6.6E-20 1.4E-24 107.4 8.2 57 9-66 2-58 (147)
7 KOG0894|consensus 99.8 3.8E-20 8.3E-25 113.0 6.6 60 6-66 3-62 (244)
8 PF00179 UQ_con: Ubiquitin-con 99.7 1.4E-17 3E-22 96.4 7.0 55 12-66 1-55 (140)
9 cd00195 UBCc Ubiquitin-conjuga 99.7 3.4E-17 7.5E-22 94.9 7.6 55 11-66 2-56 (141)
10 KOG0425|consensus 99.7 4.1E-17 8.9E-22 95.7 7.0 58 9-66 6-63 (171)
11 KOG0421|consensus 99.7 1.2E-16 2.5E-21 92.9 7.1 62 4-66 25-86 (175)
12 smart00212 UBCc Ubiquitin-conj 99.7 6.3E-16 1.4E-20 90.0 7.5 56 11-66 1-56 (145)
13 KOG0418|consensus 99.6 5.4E-16 1.2E-20 93.2 6.4 57 9-66 4-63 (200)
14 KOG0424|consensus 99.6 5.4E-15 1.2E-19 85.8 6.9 61 5-65 1-65 (158)
15 KOG0428|consensus 99.6 2.5E-15 5.5E-20 93.8 5.0 58 7-66 10-67 (314)
16 KOG0427|consensus 99.5 4E-14 8.8E-19 81.3 6.7 60 6-67 13-72 (161)
17 KOG0416|consensus 99.3 3.4E-12 7.3E-17 76.0 5.2 55 8-66 3-57 (189)
18 KOG0423|consensus 99.3 1.8E-12 3.8E-17 77.6 2.5 57 7-64 9-65 (223)
19 KOG0422|consensus 99.2 1.4E-11 3E-16 71.3 4.2 57 9-66 3-59 (153)
20 KOG0895|consensus 98.7 2.7E-08 5.9E-13 71.8 6.2 59 7-66 281-339 (1101)
21 KOG0420|consensus 98.5 3.2E-07 7E-12 55.0 5.5 57 7-66 27-86 (184)
22 KOG0429|consensus 98.5 5.3E-07 1.2E-11 56.0 6.0 53 12-66 23-75 (258)
23 KOG0895|consensus 98.4 1.6E-07 3.5E-12 68.0 2.7 55 11-66 854-908 (1101)
24 KOG0896|consensus 98.3 1.1E-06 2.3E-11 50.8 4.2 57 8-64 5-64 (138)
25 smart00340 HALZ homeobox assoc 88.1 0.55 1.2E-05 22.2 1.9 15 9-23 20-34 (44)
26 PF11619 P53_C: Transcription 72.9 8.5 0.00018 19.9 3.2 24 36-66 2-25 (71)
27 PF15241 Cylicin_N: Cylicin N- 64.5 3.7 8.1E-05 23.2 0.9 11 59-69 26-36 (110)
28 PF04881 Adeno_GP19K: Adenovir 60.5 11 0.00023 22.1 2.3 30 35-64 44-74 (139)
29 PF12065 DUF3545: Protein of u 59.6 8.4 0.00018 19.4 1.6 11 11-21 37-47 (59)
30 PF12075 KN_motif: KN motif; 58.8 4.5 9.7E-05 18.7 0.5 12 50-64 5-16 (39)
31 PF08203 RNA_polI_A14: Yeast R 57.6 9 0.00019 20.3 1.6 13 8-20 59-71 (76)
32 PF04904 NCD1: NAB conserved r 53.0 23 0.0005 18.9 2.7 17 8-24 61-77 (82)
33 PF09929 DUF2161: Uncharacteri 49.5 35 0.00077 19.5 3.3 24 4-27 23-46 (118)
34 PF04314 DUF461: Protein of un 48.5 29 0.00064 19.0 2.8 27 40-66 77-103 (110)
35 TIGR02422 protocat_beta protoc 48.1 19 0.00042 22.6 2.2 18 38-55 203-220 (220)
36 PF12011 DUF3503: Domain of un 47.2 23 0.0005 21.6 2.4 46 13-64 23-68 (170)
37 PF05709 Sipho_tail: Phage tai 45.7 67 0.0015 19.4 6.1 57 8-66 52-110 (249)
38 PF12259 DUF3609: Protein of u 45.7 19 0.00041 24.3 2.1 55 9-66 33-87 (361)
39 PF01107 MP: Viral movement pr 44.9 69 0.0015 19.3 4.8 37 26-64 130-167 (189)
40 COG2847 Copper(I)-binding prot 44.8 67 0.0014 19.2 4.1 27 40-66 104-130 (151)
41 KOG3285|consensus 43.3 73 0.0016 19.8 4.1 43 7-50 118-160 (203)
42 PF02970 TBCA: Tubulin binding 40.9 37 0.00081 18.2 2.4 15 7-21 6-20 (90)
43 PF14824 Sirohm_synth_M: Siroh 40.3 36 0.00078 14.8 2.3 16 5-20 14-29 (30)
44 PF14501 HATPase_c_5: GHKL dom 39.2 26 0.00056 18.5 1.6 13 53-65 88-100 (100)
45 PF14436 EndoU_bacteria: Bacte 38.7 46 0.001 18.6 2.7 18 51-68 42-59 (128)
46 PF12915 DUF3833: Protein of u 35.2 85 0.0018 19.0 3.5 25 35-63 107-131 (164)
47 PF06675 DUF1177: Protein of u 34.7 24 0.00052 23.0 1.2 17 48-64 180-196 (276)
48 PF07547 RSD-2: RSD-2 N-termin 34.6 73 0.0016 17.1 2.9 28 38-65 16-44 (84)
49 cd02860 Pullulanase_N_term Pul 33.6 79 0.0017 16.7 3.4 24 37-64 45-68 (100)
50 PF12627 PolyA_pol_RNAbd: Prob 33.4 54 0.0012 15.7 2.2 19 7-25 21-39 (64)
51 PF00868 Transglut_N: Transglu 33.4 85 0.0018 17.5 3.2 21 41-63 95-115 (118)
52 cd02848 Chitinase_N_term Chiti 33.2 47 0.001 18.7 2.1 12 53-64 77-88 (106)
53 TIGR03601 B_an_ocin probable h 31.5 49 0.0011 17.3 1.8 22 13-42 4-25 (79)
54 TIGR01633 phi3626_gp14_N putat 31.1 94 0.002 16.8 7.0 57 9-67 63-121 (124)
55 PF06305 DUF1049: Protein of u 31.0 59 0.0013 15.9 2.1 15 9-23 49-63 (68)
56 CHL00130 rbcS ribulose-1,5-bis 29.9 1.2E+02 0.0027 17.8 4.9 33 9-43 60-93 (138)
57 PF08872 KGK: KGK domain; Int 29.7 1.1E+02 0.0024 17.3 3.8 25 41-65 67-93 (114)
58 PF09943 DUF2175: Uncharacteri 29.0 68 0.0015 17.9 2.3 20 40-61 2-21 (101)
59 PF07691 PA14: PA14 domain; I 28.6 1.1E+02 0.0023 16.9 3.1 20 39-63 48-67 (145)
60 PHA00672 hypothetical protein 27.7 55 0.0012 19.2 1.8 20 48-67 38-57 (152)
61 KOG0680|consensus 26.7 1.8E+02 0.0039 20.0 4.2 44 8-54 332-377 (400)
62 COG5402 Uncharacterized conser 26.7 1.6E+02 0.0036 18.2 3.9 29 40-69 139-173 (194)
63 PF14135 DUF4302: Domain of un 25.3 1.8E+02 0.0039 18.2 5.4 46 8-65 9-54 (235)
64 PF13115 YtkA: YtkA-like 25.0 69 0.0015 16.3 1.8 14 53-66 64-77 (86)
65 PF05015 Plasmid_killer: Plasm 24.7 1.2E+02 0.0027 16.1 5.4 39 8-46 25-70 (93)
66 KOG2193|consensus 24.2 1.1E+02 0.0025 21.7 3.1 40 15-54 431-470 (584)
67 COG3866 PelB Pectate lyase [Ca 23.7 1.1E+02 0.0024 20.7 2.8 25 39-63 197-222 (345)
68 PHA00458 single-stranded DNA-b 23.7 1.3E+02 0.0029 19.2 3.1 16 52-67 33-48 (233)
69 COG2880 Uncharacterized protei 23.7 20 0.00042 18.6 -0.5 9 52-60 10-18 (67)
70 PF15586 Imm47: Immunity prote 23.4 1.5E+02 0.0033 16.7 4.2 28 34-62 22-49 (116)
71 PF03366 YEATS: YEATS family; 23.3 23 0.00049 18.8 -0.3 11 40-50 3-13 (84)
72 PF02922 CBM_48: Carbohydrate- 23.3 1.2E+02 0.0025 15.2 3.7 27 36-65 46-73 (85)
73 PTZ00281 actin; Provisional 23.0 1.8E+02 0.004 19.3 3.8 35 7-42 308-342 (376)
74 PF15590 Imm15: Immunity prote 22.4 1.3E+02 0.0029 15.6 3.1 39 15-57 13-51 (69)
75 PF04447 DUF550: Protein of un 22.0 1.1E+02 0.0023 17.1 2.1 17 8-24 22-38 (100)
76 PF13086 AAA_11: AAA domain; P 21.1 67 0.0014 18.8 1.4 9 43-51 20-28 (236)
77 PF01954 DUF104: Protein of un 21.0 36 0.00079 17.0 0.2 9 52-60 8-16 (60)
78 PF09036 Bcr-Abl_Oligo: Bcr-Ab 21.0 1.2E+02 0.0027 16.0 2.2 12 8-19 40-51 (79)
79 PF01774 UreD: UreD urease acc 20.8 1.5E+02 0.0032 18.2 2.9 23 43-65 2-24 (209)
80 PF13119 DUF3973: Domain of un 20.7 46 0.001 15.4 0.5 12 47-58 15-26 (41)
81 KOG0744|consensus 20.7 1E+02 0.0022 21.3 2.3 16 36-51 171-188 (423)
82 COG3254 Uncharacterized conser 20.6 31 0.00067 19.4 -0.1 12 23-35 36-47 (105)
83 cd03463 3,4-PCD_alpha Protocat 20.6 1.8E+02 0.0038 17.8 3.2 17 37-53 167-183 (185)
84 PF14172 DUF4309: Domain of un 20.4 86 0.0019 18.1 1.7 17 53-69 108-125 (134)
85 TIGR03517 GldM_gliding gliding 20.3 1.1E+02 0.0023 21.9 2.4 28 37-64 231-258 (523)
No 1
>KOG0417|consensus
Probab=99.88 E-value=1.4e-22 Score=117.74 Aligned_cols=57 Identities=32% Similarity=0.641 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+||.||+++++.+++++|++. ++++|+++|+|+|.||.+||||||+|+|.|.||.
T Consensus 2 a~~RI~kE~~~l~~dp~~~~~~~-~~~dnl~~w~a~I~GP~~SpYEgG~F~l~I~~p~ 58 (148)
T KOG0417|consen 2 ASKRIIKELQDLLRDPPPGCSAG-PVGDNLFHWQATILGPPGSPYEGGVFFLEIHFPE 58 (148)
T ss_pred cHHHHHHHHHHHhcCCCCCCccC-CCCCceeeEEEEEECCCCCCcCCCEEEEEEECCC
Confidence 34699999999999999999999 9999999999999999999999999999999974
No 2
>COG5078 Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones]
Probab=99.87 E-value=6.7e-22 Score=116.30 Aligned_cols=59 Identities=49% Similarity=0.849 Sum_probs=54.5
Q ss_pred HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
.+.+||++|+++++++++++|++.+.+++|+++|+++|.||+|||||||+|++.|+||.
T Consensus 5 ~a~~RL~kE~~~l~~~~~~~~~a~p~~d~~l~~w~~~i~GP~dtpYegg~f~~~l~fP~ 63 (153)
T COG5078 5 SALKRLLKELKKLQKDPPPGISAGPVDDDNLFHWEATITGPPDTPYEGGIFKLTLEFPE 63 (153)
T ss_pred hHHHHHHHHHHHHhcCCCCceEEEECCCCcceeEEEEEECCCCCCcCCCEEEEEEECCC
Confidence 38899999999999999999999955555999999999999999999999999999964
No 3
>KOG0419|consensus
Probab=99.85 E-value=3.7e-21 Score=110.05 Aligned_cols=60 Identities=37% Similarity=0.733 Sum_probs=56.7
Q ss_pred CchHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623 5 PTSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWY 65 (69)
Q Consensus 5 ~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp 65 (69)
|+..+.+||++|+++++++++.|++.. +.++|+..|.+.|.||++|||+||+|+|.|+|+
T Consensus 1 MstpArrrLmrDfkrlqedpp~gisa~-P~~~niM~W~a~I~Gp~~tp~e~gtFkLtl~Ft 60 (152)
T KOG0419|consen 1 MSTPARRRLMRDFKRLQEDPPAGISAA-PVENNIMEWNAVIFGPQDTPFEGGTFKLTLEFT 60 (152)
T ss_pred CCchHHHHHHHHHHHhhcCCCCCccCC-CCccceeeeeeeEEcCCCCCcCCceEEEEEEcc
Confidence 456789999999999999999999999 889999999999999999999999999999993
No 4
>KOG0426|consensus
Probab=99.84 E-value=7.6e-21 Score=108.99 Aligned_cols=61 Identities=41% Similarity=0.721 Sum_probs=57.8
Q ss_pred chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
...+.+||++||++|..++++||.+.|.++||++.|.+.|.||++|+|+||+|-.++.||+
T Consensus 2 ~~~AlkRLm~EykqLt~~~P~GIvAgP~~EdnfF~W~cLI~GP~~T~f~~GvfpA~l~FP~ 62 (165)
T KOG0426|consen 2 AGTALKRLMAEYKQLTLNPPEGIVAGPINEDNFFEWECLIQGPEDTCFEGGVFPARLSFPL 62 (165)
T ss_pred chhHHHHHHHHHHHHccCCCCcceeCCCCccceeeeeeeeeCCCCCcccCCccceeeecCC
Confidence 3468999999999999999999999988999999999999999999999999999999986
No 5
>PTZ00390 ubiquitin-conjugating enzyme; Provisional
Probab=99.82 E-value=5.6e-20 Score=108.26 Aligned_cols=57 Identities=23% Similarity=0.504 Sum_probs=53.8
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+||++|++++.+++++++.+. +.++|+++|+++|.||++|||+||.|+++|.||.
T Consensus 3 ~~kRl~~E~~~l~~~~~~~i~~~-~~~~d~~~w~~~i~GP~~tpY~gg~f~~~i~~p~ 59 (152)
T PTZ00390 3 ISKRIEKETQNLANDPPPGIKAE-PDPGNYRHFKILMEGPDGTPYEGGYYKLELFLPE 59 (152)
T ss_pred HHHHHHHHHHHHHhCCCCCeEEE-ECCCCccEEEEEEEcCCCCCCcCcEEEEEEECcc
Confidence 56899999999999999999998 7788999999999999999999999999999975
No 6
>PLN00172 ubiquitin conjugating enzyme; Provisional
Probab=99.82 E-value=6.6e-20 Score=107.40 Aligned_cols=57 Identities=32% Similarity=0.651 Sum_probs=53.5
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+||+||++++++++++++.+. ++++|+++|+++|.||++|||+||.|+++|.||.
T Consensus 2 a~~Rl~kE~~~l~~~~~~~~~~~-~~~~nl~~w~~~i~GP~~tpyegg~f~~~i~fp~ 58 (147)
T PLN00172 2 ATKRIQKEHKDLLKDPPSNCSAG-PSDENLFRWTASIIGPSDSPYAGGVFFLSILFPP 58 (147)
T ss_pred hHHHHHHHHHHHHhCCCCCeEEE-ECCCChheEEEEEECCCCCCCCCCEEEEEEECCc
Confidence 35899999999999999999999 7888999999999999999999999999999974
No 7
>KOG0894|consensus
Probab=99.82 E-value=3.8e-20 Score=113.02 Aligned_cols=60 Identities=32% Similarity=0.666 Sum_probs=57.4
Q ss_pred chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+..+.+||+|||+.|.++|.+.|.+. +.++|+.+||.+|.||+||||+||.|+.+|.||+
T Consensus 3 ~k~a~kRl~keY~~l~k~Pv~~i~A~-P~p~nILEWHYvl~GpedTPy~GG~YhGkl~FP~ 62 (244)
T KOG0894|consen 3 SKAAVKRLQKEYRALCKDPVPYIVAR-PNPNNILEWHYVLRGPEDTPYYGGYYHGKLIFPP 62 (244)
T ss_pred chHHHHHHHHHHHHHHhCCchhhccC-CCccceeeeEEEeeCCCCCCccCceeeeEEeCCC
Confidence 45799999999999999999999999 9999999999999999999999999999999975
No 8
>PF00179 UQ_con: Ubiquitin-conjugating enzyme; InterPro: IPR000608 The post-translational attachment of ubiquitin (IPR000626 from INTERPRO) to proteins (ubiquitinylation) alters the function, location or trafficking of a protein, or targets it to the 26S proteasome for degradation [, , ]. Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade []. The E1 enzyme mediates an ATP-dependent transfer of a thioester-linked ubiquitin molecule to a cysteine residue on the E2 enzyme. The E2 enzyme (6.3.2.19 from EC) then either transfers the ubiquitin moiety directly to a substrate, or to an E3 ligase, which can also ubiquitinylate a substrate. There are several different E2 enzymes (over 30 in humans), which are broadly grouped into four classes, all of which have a core catalytic domain (containing the active site cysteine), and some of which have short N- and C-terminal amino acid extensions: class I enzymes consist of just the catalytic core domain (UBC), class II possess a UBC and a C-terminal extension, class III possess a UBC and an N-terminal extension, and class IV possess a UBC and both N- and C-terminal extensions. These extensions appear to be important for some subfamily function, including E2 localisation and protein-protein interactions []. In addition, there are proteins with an E2-like fold that are devoid of catalytic activity, but which appear to assist in poly-ubiquitin chain formation.; GO: 0016881 acid-amino acid ligase activity; PDB: 2AAK_A 3SY2_C 1FBV_C 3SQV_C 1C4Z_D 1JAT_B 2GMI_B 2H2Y_D 2R0J_A 3E95_B ....
Probab=99.73 E-value=1.4e-17 Score=96.41 Aligned_cols=55 Identities=49% Similarity=0.877 Sum_probs=50.8
Q ss_pred HHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 12 ALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 12 Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
||++|+++++++++.++.+.+.+++|+++|+++|.||++|||+||.|+|.|.||.
T Consensus 1 Rl~~E~~~l~~~~~~~~~~~~~~~~~~~~w~~~i~gp~~t~y~gg~f~~~i~~p~ 55 (140)
T PF00179_consen 1 RLQKELKELQKNPPPGISVQPSEDDNLFEWHVTIFGPPGTPYEGGIFKFRISFPP 55 (140)
T ss_dssp HHHHHHHHHHHSHTTTEEEEEESTTETTEEEEEEEBETTSTTTTSEEEEEEEETT
T ss_pred CHHHHHHHHhhCCCCCEEEEECCCCChheEEEEEeccCccceecccccccccccc
Confidence 8999999999999999999944445999999999999999999999999999975
No 9
>cd00195 UBCc Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Probab=99.72 E-value=3.4e-17 Score=94.89 Aligned_cols=55 Identities=38% Similarity=0.753 Sum_probs=52.1
Q ss_pred HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+||++|+++++++++.++.+. ++++|+++|+++|.||++|||+||.|++.|.||+
T Consensus 2 ~Rl~~E~~~l~~~~~~~~~v~-~~~~~~~~w~~~i~g~~~t~y~g~~~~~~~~~p~ 56 (141)
T cd00195 2 KRLQKELKDLKKDPPSGISAE-PVEENLLEWHGTIRGPPDTPYEGGIFKLDIEFPE 56 (141)
T ss_pred chHHHHHHHHHhCCCCCeEEE-ECCCChhEEEEEEecCCCCCccCCEEEEEEECCC
Confidence 699999999999999999999 7788999999999999999999999999999974
No 10
>KOG0425|consensus
Probab=99.71 E-value=4.1e-17 Score=95.66 Aligned_cols=58 Identities=59% Similarity=1.019 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+..-|.++|++|++++.+++++...++.|+++|.+.|.||+||.|+||.|+..+.||.
T Consensus 6 a~~ll~~qlk~L~~~pv~gf~~glvd~~dif~WeV~i~gppdTlYeGG~FkA~m~FP~ 63 (171)
T KOG0425|consen 6 ASLLLLKQLKELQEEPVEGFSVGLVDDSDIFEWEVAIIGPPDTLYEGGFFKAHMKFPQ 63 (171)
T ss_pred hHHHHHHHHHHHhcCCCCccccccccCCceeEEEEEEEcCCCccccCceeEEEEeCcc
Confidence 6677899999999999999999977788999999999999999999999999999975
No 11
>KOG0421|consensus
Probab=99.69 E-value=1.2e-16 Score=92.88 Aligned_cols=62 Identities=31% Similarity=0.485 Sum_probs=57.0
Q ss_pred CCchHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 4 VPTSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 4 ~~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
.......+||++||..|.....++|++. +++||++.|..+|.||.||+|+|-.|++.+.||.
T Consensus 25 ~~~~~V~KRLq~ELm~Lmms~~~gISAF-P~~dnlf~WvGtItGp~dTvyegl~yklSl~Fp~ 86 (175)
T KOG0421|consen 25 VDGHSVTKRLQSELMGLMMSNTPGISAF-PESDNLFKWVGTITGPKDTVYEGLKYKLSLSFPN 86 (175)
T ss_pred ccCchHHHHHHHHHHHHHhcCCCCcccC-cCcCceeEEeeEeeCCCCccccCcEEEEEEecCC
Confidence 3445678999999999999999999999 8889999999999999999999999999999974
No 12
>smart00212 UBCc Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Probab=99.66 E-value=6.3e-16 Score=89.96 Aligned_cols=56 Identities=39% Similarity=0.732 Sum_probs=50.7
Q ss_pred HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+||++|++++++++++++.+.+.+++|+++|+++|.||++|||+||.|++.|.||.
T Consensus 1 ~Rl~~E~~~~~~~~~~~~~v~~~~~~~~~~w~~~i~gp~~~~y~g~~f~~~l~~p~ 56 (145)
T smart00212 1 KRLLKELKELLKDPPPGISAYPVDEDNLLEWTGTIVGPPGTPYEGGIFKLTIEFPP 56 (145)
T ss_pred ChHHHHHHHHHhCCCCCeEEEECCCCChheEEEEEEcCCCCCcCCcEEEEEEECCc
Confidence 59999999999999999998844445999999999999999999999999999964
No 13
>KOG0418|consensus
Probab=99.65 E-value=5.4e-16 Score=93.20 Aligned_cols=57 Identities=32% Similarity=0.524 Sum_probs=52.3
Q ss_pred HHHHHHHHHHHhhhCC---CCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEP---VEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~---~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+||++|++++..++ ..+|.+. ..++|+.+....|.||+|||||||+|.++|+||-
T Consensus 4 ~~~ri~~e~k~v~~~~eisq~~I~ve-~vn~~~~~ikG~I~GP~~TPYEGG~FeldI~iPe 63 (200)
T KOG0418|consen 4 AFKRINREQKEVLDDPEISQAGIIVE-MVNENLKEIKGHIAGPEDTPYEGGVFELDIKIPE 63 (200)
T ss_pred HHHHHHHHHHHhccChhhhhcceEEE-EccCChhhceeEecCCCCCCCCCceEEEEEecCC
Confidence 7799999999999887 5789998 7778999999999999999999999999999963
No 14
>KOG0424|consensus
Probab=99.60 E-value=5.4e-15 Score=85.78 Aligned_cols=61 Identities=33% Similarity=0.492 Sum_probs=54.3
Q ss_pred CchHHHHHHHHHHHHhhhCCCCCEEEEecCC----CCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623 5 PTSSALRALSMEYKSLQEEPVEGFRVKLVND----DNLFEWEVAIFGPPDTLYQGGYFKVCVVWY 65 (69)
Q Consensus 5 ~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~----~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp 65 (69)
|+..+..||+.|-+.+.++.+-+|.+.|..+ .|+..|++.|.|+++|+||||.|.+.+.||
T Consensus 1 ~s~~~~~rl~eErk~wrk~hp~gf~AkP~~~~dg~~nl~~Wec~IPG~~~t~wEGg~y~l~v~F~ 65 (158)
T KOG0424|consen 1 MSGIALNRLAEERKKWRKDHPFGFYAKPVKNADGTLNLMNWECGIPGKKGTPWEGGLYKLTVNFP 65 (158)
T ss_pred CcchHHHHHHHHHHHHhhcCCCceeeeccCCCCCcceeEEEEeecCCCCCCcCcCceEEEEEeCC
Confidence 3456889999999999999999999996432 379999999999999999999999999993
No 15
>KOG0428|consensus
Probab=99.58 E-value=2.5e-15 Score=93.77 Aligned_cols=58 Identities=34% Similarity=0.680 Sum_probs=53.7
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
.++.+||+||.++|+ +|...+... +-+||+++|+.+|.||.||-|+||+|+.+|.||.
T Consensus 10 npaVkRlmkEa~El~-~Ptd~yha~-plEdNlFEWhFtiRGp~dtdFeGGiYHGRI~lPa 67 (314)
T KOG0428|consen 10 NPAVKRLMKEAAELK-DPTDHYHAQ-PLEDNLFEWHFTIRGPPDTDFEGGIYHGRIVLPA 67 (314)
T ss_pred CHHHHHHHHHHHHhc-Cchhhhhhc-cchhceeeEEEEeeCCCCCCccCceeeeeEecCC
Confidence 478999999999998 788888888 8899999999999999999999999999999974
No 16
>KOG0427|consensus
Probab=99.53 E-value=4e-14 Score=81.29 Aligned_cols=60 Identities=28% Similarity=0.517 Sum_probs=55.0
Q ss_pred chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecCC
Q psy623 6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYKC 67 (69)
Q Consensus 6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~~ 67 (69)
+..+.+||+|||.+++.++++|+... ..||+.+|.+.+.|.++|.|+|.+|.|.++||.+
T Consensus 13 s~~at~RLqKEl~e~q~~pP~G~~~~--v~dnlqqWii~v~Ga~GTLYa~e~~qLq~~F~~~ 72 (161)
T KOG0427|consen 13 SKIATNRLQKELSEWQNNPPTGFKHR--VTDNLQQWIIEVTGAPGTLYANETYQLQVEFPEH 72 (161)
T ss_pred HHHHHHHHHHHHHHHhcCCCCcceee--cccchheeEEEEecCCceeecCcEEEEEEecCCC
Confidence 45678999999999999999999987 4679999999999999999999999999999864
No 17
>KOG0416|consensus
Probab=99.33 E-value=3.4e-12 Score=76.01 Aligned_cols=55 Identities=29% Similarity=0.479 Sum_probs=47.7
Q ss_pred HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
...+||..|+.+|... +..+. ..++++.+++|.+.||.+|||+||+|+++|++|.
T Consensus 3 ~~~rRid~Dv~KL~~s---~yeV~-~ind~m~ef~V~f~GP~ds~YegGvWkv~V~lPd 57 (189)
T KOG0416|consen 3 SGKRRIDTDVMKLLMS---DYEVT-IINDGMQEFYVKFHGPKDSPYEGGVWKVRVELPD 57 (189)
T ss_pred CcccchhhHHHHHHhc---CCeEE-EecCcccEEEEEeeCCCCCcccCceEEEEEECCC
Confidence 3458999999998653 46777 7788999999999999999999999999999974
No 18
>KOG0423|consensus
Probab=99.28 E-value=1.8e-12 Score=77.57 Aligned_cols=57 Identities=32% Similarity=0.601 Sum_probs=53.0
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
....+-+.||++.+..+|+.||.+. ++++|+...++.|.||.+|||++|.|++++.+
T Consensus 9 p~vik~~~kEl~~l~~~PPdGIKV~-~NeeD~tdiqa~IeGP~GTPYa~GlFRmKL~L 65 (223)
T KOG0423|consen 9 PNVIKQLAKELKSLDESPPDGIKVV-VNEEDFTDIQADIEGPVGTPYANGLFRMKLAL 65 (223)
T ss_pred hHHHHHHHHHHHhcccCCCCceEEe-cChHHhHHHHhhccCCCCCccccceeeehhhh
Confidence 3566789999999999999999999 99999999999999999999999999999876
No 19
>KOG0422|consensus
Probab=99.23 E-value=1.4e-11 Score=71.35 Aligned_cols=57 Identities=28% Similarity=0.455 Sum_probs=50.0
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+||.|||.+|+++....+.-...+++|+..|++.|. |+..||..|.|+++|.||.
T Consensus 3 a~~Rl~kEL~dl~~~~~~~~rn~~~~e~nll~wt~lli-pd~ppY~kgaF~l~I~fp~ 59 (153)
T KOG0422|consen 3 APRRLRKELADLQKNKMKFFRNIEVDEANLLKWTGLLI-PDKPPYNKGAFRLEIDFPV 59 (153)
T ss_pred hhHHHHHHHHHHHhccHHHHhhhhcccccceeEEeEec-CCCCCccCcceEEEeeCCC
Confidence 56899999999999988877544367889999999999 9999999999999999964
No 20
>KOG0895|consensus
Probab=98.75 E-value=2.7e-08 Score=71.82 Aligned_cols=59 Identities=22% Similarity=0.322 Sum_probs=55.4
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
....+|+++|++-+.++.++++.+. +.+..+...++.|.||.+|||++|+|.|+|.||.
T Consensus 281 ~~~skrv~ke~~llskdlpEgifvr-p~e~RMd~I~alIig~~gtPy~~glf~Fdiq~P~ 339 (1101)
T KOG0895|consen 281 KNWSKKVAKELKLLSKDLPEGIFVR-PDEGRMDLIKALIIGPDGTPYADGLFLFDIQFPD 339 (1101)
T ss_pred hhhHHHHHHHhhhhcccCCCCcccc-ccccccceeeeEEecCCCCCCcCCceeeEeecCC
Confidence 4577899999999999999999998 8899999999999999999999999999999985
No 21
>KOG0420|consensus
Probab=98.53 E-value=3.2e-07 Score=54.96 Aligned_cols=57 Identities=28% Similarity=0.380 Sum_probs=41.5
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecC-CCCcc--eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVN-DDNLF--EWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~-~~nl~--~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+.+.-||++|+-++. .++.+++..++ .+++. +++++|. |+++.|+||.|+|++++|.
T Consensus 27 s~a~lrl~~di~eln--Lp~t~~~s~~~~~~d~~~~~~elti~-PdEGyY~gGkf~F~~~v~~ 86 (184)
T KOG0420|consen 27 SAALLRLKKDILELN--LPPTCSLSFPDSPDDLNNLEFELTIT-PDEGYYQGGKFRFKFKVPN 86 (184)
T ss_pred cHHHHHHHhhhhhcc--CCCccccccccCCcccccceEEEEEc-cCcceecCceEEEEEECCC
Confidence 456678888887775 45555543233 33333 5999999 9999999999999999964
No 22
>KOG0429|consensus
Probab=98.49 E-value=5.3e-07 Score=56.01 Aligned_cols=53 Identities=26% Similarity=0.412 Sum_probs=46.0
Q ss_pred HHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 12 ALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 12 Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
.|..|+....+.+.+||.+. |+-.|-+.|.++|.+ ..+.|+||+|+|+|-+|.
T Consensus 23 ~llAEf~lV~~ekL~gIyvi-PSyan~l~WFGViFv-r~GiyaggVFRFtIliPd 75 (258)
T KOG0429|consen 23 ALLAEFVLVCREKLDGIYVI-PSYANKLLWFGVIFV-RKGIYAGGVFRFTILIPD 75 (258)
T ss_pred HHHHHHHHHHhccCCceEEc-ccccccceEEEEEEE-ecccccCceEEEEEEcCc
Confidence 46677778888899999999 888999999999994 456899999999999874
No 23
>KOG0895|consensus
Probab=98.42 E-value=1.6e-07 Score=67.96 Aligned_cols=55 Identities=24% Similarity=0.319 Sum_probs=47.4
Q ss_pred HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
+..+.|+.-|..+.+.+|.+. ..++.+.-..+.|.||.+|||++|.|.|.|.||.
T Consensus 854 ~~~~~~~~~~~~~~~~~~~vr-~~e~r~d~~~~~~~g~~~tpy~~~~f~fd~~~~~ 908 (1101)
T KOG0895|consen 854 KKVQTEWKILPLSLPSGIFVR-AYEDRMDLLRAVIVGAAGTPYQDGLFFFDFQFPQ 908 (1101)
T ss_pred HHHHHHHHhhhccCCCceEEE-echHHHHHHHHHhhCCCCCccccceEEEEeecCC
Confidence 334456667777888999999 8888888889999999999999999999999975
No 24
>KOG0896|consensus
Probab=98.33 E-value=1.1e-06 Score=50.82 Aligned_cols=57 Identities=23% Similarity=0.360 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHHhhhCCCC-CEEEEecCCCC--cceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 8 SALRALSMEYKSLQEEPVE-GFRVKLVNDDN--LFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~-~~~~~~~~~~n--l~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
+...||.+|+.+-++.--+ .++...-+++| +..|..+|.||+.|+||+.+|.|+|+.
T Consensus 5 Prnfrlleele~g~kg~g~~~~s~gl~d~~dmtl~rWtg~IiGPprT~yEnRiysLKI~C 64 (138)
T KOG0896|consen 5 PRNFRLLEELEEGEKGIGDGTVSWGLEDDDDMTLTRWTGTIIGPPRTMYENRIYSLKIEC 64 (138)
T ss_pred ccchhhhhhhccccccccCceeeccccCCCcceEeeeccceeCCCCcccccceeeEEEec
Confidence 3556888888887664433 35555223444 679999999999999999999999987
No 25
>smart00340 HALZ homeobox associated leucin zipper.
Probab=88.11 E-value=0.55 Score=22.16 Aligned_cols=15 Identities=27% Similarity=0.242 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhhC
Q psy623 9 ALRALSMEYKSLQEE 23 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~ 23 (69)
-++||++|+++|+.-
T Consensus 20 eNrRL~ke~~eLral 34 (44)
T smart00340 20 ENRRLQKEVQELRAL 34 (44)
T ss_pred HHHHHHHHHHHHHhc
Confidence 468999999999753
No 26
>PF11619 P53_C: Transcription factor P53 - C terminal domain; InterPro: IPR024631 The p53 tumour suppressor [, , , , ] is a protein found in increased amounts in a wide variety of transformed cells. It is also detectable in many proliferating non-transformed cells, but it is undetectable or present at low levels in resting cells. It is frequently mutated or inactivated in many types of cancer. p53 seems to act as a tumour suppressor in some, but probably not all, tumour types. p53 has been implicated in cell cycle regulation, particularly in the monitoring of genomic DNA integrity prior to replication; for this reason it has been dubbed `guardian of the genome'. p53 is a sequence-specific DNA-binding protein and transcription factor. The structure of p53 comprises 4 domains: an N-terminal transactivation domain; a central DNA-binding domain; an oligomerisation domain; and a C-terminal, basic, regulatory domain [, ]. The structure of the oligomerisation domain consists of a dimer of dimers, each dimer consisting of 2 anti-parallel alpha-helices and an anti-parallel beta-sheet. The sheets lie on opposite sides of the tetramer and the helices form an unusual 4-helix bundle [, ]. While the majority of p53 mutations found in human cancers are located in the DNA-binding domain, some are also found in the oligomerisation domain. This entry represents the C-terminal domain of Drosophila transcription factor p53. While the rest of the protein is quite conserved between the different transcription factors such as p53 and p73, the C-terminal domain is highly divergent. The Drosophila p53 structure is characterised by an additional N-terminal beta-strand and a C-terminal helix [].; PDB: 2RP4_B.
Probab=72.90 E-value=8.5 Score=19.87 Aligned_cols=24 Identities=21% Similarity=0.474 Sum_probs=16.7
Q ss_pred CCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 36 DNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 36 ~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
++-.+|.|.-. .+|-|+|.|++|+
T Consensus 2 ~~~~dW~Vsrt-------~dGdYrL~itcp~ 25 (71)
T PF11619_consen 2 NSAADWEVSRT-------LDGDYRLVITCPK 25 (71)
T ss_dssp TTT-S-EEEEE-------TTTCEEEEEEESS
T ss_pred Cccccceeeec-------cCCceEEEEecCc
Confidence 34567888866 2578999999986
No 27
>PF15241 Cylicin_N: Cylicin N-terminus
Probab=64.48 E-value=3.7 Score=23.17 Aligned_cols=11 Identities=9% Similarity=0.168 Sum_probs=8.8
Q ss_pred EEEEEecCCCC
Q psy623 59 KVCVVWYKCSQ 69 (69)
Q Consensus 59 ~l~i~fp~~~~ 69 (69)
+|.+.|||+||
T Consensus 26 hFaLtFPKPpr 36 (110)
T PF15241_consen 26 HFALTFPKPPR 36 (110)
T ss_pred heeeccCCCCC
Confidence 57788999886
No 28
>PF04881 Adeno_GP19K: Adenovirus GP19K; InterPro: IPR006965 This 19 kDa glycoprotein binds the major histocompatibility (MHC) class I antigens in the endoplasmic reticulum (ER). The ER retention signal at the C terminus of Gp19K causes retention of the complex in the ER, preventing lysis of the cell by cytotoxic T-lymphocytes [].; GO: 0005537 mannose binding, 0050690 regulation of defense response to virus by virus
Probab=60.54 E-value=11 Score=22.11 Aligned_cols=30 Identities=13% Similarity=0.275 Sum_probs=21.0
Q ss_pred CCCcceEEEEEECCCCCCCC-CcEEEEEEEe
Q psy623 35 DDNLFEWEVAIFGPPDTLYQ-GGYFKVCVVW 64 (69)
Q Consensus 35 ~~nl~~w~~~i~GP~~tpY~-gg~f~l~i~f 64 (69)
+.|-..|.|++.|+++|++. +-+|-+.+-|
T Consensus 44 PGd~~~ytVtV~G~dGs~~~~n~tf~~~FiF 74 (139)
T PF04881_consen 44 PGDPEWYTVTVQGPDGSIRKSNNTFMYKFIF 74 (139)
T ss_pred CCCCcceEEEEECCCCcceeccccchheeeH
Confidence 44567888999999998865 4455555444
No 29
>PF12065 DUF3545: Protein of unknown function (DUF3545); InterPro: IPR021932 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 60 to 77 amino acids in length. This protein has two completely conserved residues (R and L) that may be functionally important.
Probab=59.64 E-value=8.4 Score=19.42 Aligned_cols=11 Identities=18% Similarity=0.383 Sum_probs=7.8
Q ss_pred HHHHHHHHHhh
Q psy623 11 RALSMEYKSLQ 21 (69)
Q Consensus 11 ~Rl~kEl~~l~ 21 (69)
.||.|||+++.
T Consensus 37 ~rL~kEL~d~D 47 (59)
T PF12065_consen 37 QRLRKELQDMD 47 (59)
T ss_pred HHHHHHHHHcc
Confidence 57777877764
No 30
>PF12075 KN_motif: KN motif; InterPro: IPR021939 This small motif is found at the N terminus of Kank proteins and has been called the KN (for Kank N-terminal) motif. This protein is found in eukaryotes. Proteins in this family are typically between 413 to 1202 amino acids in length. This protein is found associated with PF00023 from PFAM. This protein has two conserved sequence motifs: TPYG and LDLDF. Kank1 was obtained by positional cloning of a tumor suppressor gene in renal cell carcinoma, while the other members were found by homology search. The family is involved in the regulation of actin polymerisation and cell motility through signaling pathways containing PI3K/Akt and/or unidentified modulators/effectors [].
Probab=58.76 E-value=4.5 Score=18.71 Aligned_cols=12 Identities=25% Similarity=0.706 Sum_probs=8.8
Q ss_pred CCCCCCcEEEEEEEe
Q psy623 50 DTLYQGGYFKVCVVW 64 (69)
Q Consensus 50 ~tpY~gg~f~l~i~f 64 (69)
+||| | |++.+.|
T Consensus 5 ~tPY--G-yhiDLDF 16 (39)
T PF12075_consen 5 ETPY--G-YHIDLDF 16 (39)
T ss_pred cCCc--c-eeecchH
Confidence 4788 3 8888877
No 31
>PF08203 RNA_polI_A14: Yeast RNA polymerase I subunit RPA14; InterPro: IPR013239 Saccharomyces cerevisiae RNA polymerase I (Pol I) is a complex consisting of 14 subunits. Subunit RPA14 forms part of a Pol I subcomplex consisting of RPA14 and and RPA43. The RPA14 and RPA43 heterodimer is proposed to play a role in the recruitment of Pol I to the promoter []. ; PDB: 2RF4_F.
Probab=57.56 E-value=9 Score=20.28 Aligned_cols=13 Identities=23% Similarity=0.493 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHh
Q psy623 8 SALRALSMEYKSL 20 (69)
Q Consensus 8 ~~~~Rl~kEl~~l 20 (69)
.-.|||+|+++-|
T Consensus 59 SQLKRiQRdlrGL 71 (76)
T PF08203_consen 59 SQLKRIQRDLRGL 71 (76)
T ss_dssp HHHHHHHHHHHHS
T ss_pred HHHHHHHHhhCCC
Confidence 3568999999876
No 32
>PF04904 NCD1: NAB conserved region 1 (NCD1); InterPro: IPR006988 Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of eukaryotic (metazoa) transcription factors []. This entry represents the N-terminal NAB domain, which interacts with the EGR1 inhibitory domain (R1) []. It may also mediate multimerisation.; GO: 0045892 negative regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=53.02 E-value=23 Score=18.89 Aligned_cols=17 Identities=18% Similarity=0.303 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHhhhCC
Q psy623 8 SALRALSMEYKSLQEEP 24 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~ 24 (69)
...+|++|-+.++..+|
T Consensus 61 LHVrRlqKAL~ew~~~p 77 (82)
T PF04904_consen 61 LHVRRLQKALQEWSTNP 77 (82)
T ss_pred HHHHHHHHHHHHHhcCh
Confidence 57899999999998765
No 33
>PF09929 DUF2161: Uncharacterized conserved protein (DUF2161); InterPro: IPR018679 This family of various hypothetical prokaryotic proteins has no known function.
Probab=49.47 E-value=35 Score=19.55 Aligned_cols=24 Identities=17% Similarity=0.206 Sum_probs=17.8
Q ss_pred CCchHHHHHHHHHHHHhhhCCCCC
Q psy623 4 VPTSSALRALSMEYKSLQEEPVEG 27 (69)
Q Consensus 4 ~~~~~~~~Rl~kEl~~l~~~~~~~ 27 (69)
..+.....||.+|++....++..|
T Consensus 23 Rk~~krr~rLl~Ef~rR~GDpn~G 46 (118)
T PF09929_consen 23 RKNKKRRSRLLREFQRRSGDPNVG 46 (118)
T ss_pred CCCHHHHHHHHHHHHHhcCCCCCC
Confidence 345567789999999988776543
No 34
>PF04314 DUF461: Protein of unknown function (DUF461); InterPro: IPR007410 This entry represents a domain found in of proteins of unknown function, including DR1885 from Deinococcus radiodurans and CC3502 from Caulobacter crescentus (Caulobacter vibrioides), which share a potential metal binding motif H(M)X10MX21HXM. DR1885 was found to bind copper(I) through a histidine and three Mets in a cupredoxin-like fold []. The surface location of the copper-binding site as well as the type of coordination are well poised for metal transfer chemistry, suggesting that DR1885 might transfer copper, taking the role of Cox17 in bacteria (Cox17 being an accessory protein required for correct assembly of eukaryotic cyochrome c oxidase). ; PDB: 2K6W_A 2K6Z_A 2K6Y_A 2K70_A 1X9L_A 2JQA_A.
Probab=48.48 E-value=29 Score=19.04 Aligned_cols=27 Identities=11% Similarity=0.288 Sum_probs=21.3
Q ss_pred eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 40 EWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 40 ~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
-.++.+.|++..+=+|..+.+.+.|-.
T Consensus 77 g~HlmL~g~~~~l~~G~~v~ltL~f~~ 103 (110)
T PF04314_consen 77 GYHLMLMGLKRPLKPGDTVPLTLTFED 103 (110)
T ss_dssp CCEEEEECESS-B-TTEEEEEEEEETT
T ss_pred CEEEEEeCCcccCCCCCEEEEEEEECC
Confidence 368889999999999999999999843
No 35
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit. This model represents the beta chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the alpha chain (TIGR02423), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=48.10 E-value=19 Score=22.65 Aligned_cols=18 Identities=11% Similarity=0.410 Sum_probs=16.0
Q ss_pred cceEEEEEECCCCCCCCC
Q psy623 38 LFEWEVAIFGPPDTLYQG 55 (69)
Q Consensus 38 l~~w~~~i~GP~~tpY~g 55 (69)
..+|.+.+.|..+|+||+
T Consensus 203 ~y~FDI~Lqg~~etvf~~ 220 (220)
T TIGR02422 203 GYRFDIVLRGRRATPFEN 220 (220)
T ss_pred eEEEEEEEcCCCcCcCCC
Confidence 379999999999999985
No 36
>PF12011 DUF3503: Domain of unknown function (DUF3503); InterPro: IPR021892 RNA helicase NPH-II or I8 is found in Poxviridae. It is essential for viral replication. Plays an important role during transcription of early mRNAs, presumably by preventing R-loop formation behind the elongating RNA polymerase. Acts as NTP-dependent helicase that catalyzes unidirectional unwinding of 3'tailed duplex RNAs. It might also play a role in the export of newly synthesized mRNA chains out of the core into the cytoplasm and is required for propagation of viral particles[].; GO: 0017111 nucleoside-triphosphatase activity
Probab=47.22 E-value=23 Score=21.57 Aligned_cols=46 Identities=17% Similarity=0.214 Sum_probs=28.4
Q ss_pred HHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 13 LSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 13 l~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
-++|+.++..+....|+.. .-+=--++|+-... ....|+|+|.+++
T Consensus 23 Sq~E~~~m~~~~~~~Fs~A-vFPliKHRW~~a~v-----~~~~~~ykLn~E~ 68 (170)
T PF12011_consen 23 SQKEFEEMPKEERKAFSYA-VFPLIKHRWKGAYV-----VRDNRVYKLNVES 68 (170)
T ss_pred cHHHHHhcChhhhhcceee-eeeeehhcccCceE-----EEECCEEEEEeeh
Confidence 4678888876555554433 22223478875443 2557899999985
No 37
>PF05709 Sipho_tail: Phage tail protein; InterPro: IPR008841 This family consists of several Siphovirus and other phage tail component proteins as well as some bacterial proteins of unknown function. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.; PDB: 4DIV_X 2X8K_C.
Probab=45.69 E-value=67 Score=19.41 Aligned_cols=57 Identities=18% Similarity=0.195 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC--CCCCCcEEEEEEEecC
Q psy623 8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD--TLYQGGYFKVCVVWYK 66 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~--tpY~gg~f~l~i~fp~ 66 (69)
....++.+++..+.... ....+. .+++.-..|.+.+.+..+ .-...|.|.+.+.+|.
T Consensus 52 ~~~~~~~~~l~~~l~~~-~~~~l~-f~d~p~~~y~~~~~~~~~~~~~~~~~~~ti~f~c~d 110 (249)
T PF05709_consen 52 EDFEQKRRELASWLNPK-EPVKLI-FDDDPDKYYYAKVSGSPDPDEGNNSGTFTITFTCPD 110 (249)
T ss_dssp HHHHHHHHHHHHHH--S-S-EEEE-ETTSTT-EEEEEEEEEEE--SSSSCEEEEEEEEEEE
T ss_pred HHHHHHHHHHHHhhCcC-CCEEEE-EECCCCEEEEEEECCcccccccceeEEEEEEEEECC
Confidence 34456677888776433 337777 566656788888776422 2233446666666543
No 38
>PF12259 DUF3609: Protein of unknown function (DUF3609); InterPro: IPR022048 This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length.
Probab=45.65 E-value=19 Score=24.29 Aligned_cols=55 Identities=9% Similarity=0.028 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
.-++|..|+.++....+.+..+- ++..++....-.+..- ....+...-|.|++|-
T Consensus 33 sP~~L~~em~~V~~~L~~~~~lp-~~~~~m~~ly~l~~v~--~~~~~~~liF~i~VPL 87 (361)
T PF12259_consen 33 SPKQLLDEMKNVSSHLPRDWSLP-LEKSNMHDLYRLIKVH--FITPDNKLIFVIEVPL 87 (361)
T ss_pred CHHHHHHHHHHHHhcCCcccccC-ccccHHHHHhhhEEEE--EEeeCCEEEEEEEece
Confidence 45799999999987776666543 4444444333222110 2233556666666653
No 39
>PF01107 MP: Viral movement protein (MP); InterPro: IPR001022 The movement protein of tobamoviruses is necessary for the initial cell-to-cell movement during the early stages of a viral infection. This movement is active, and involves the interaction of the movement protein with the plasmodesmata. The movement protein possesses the ability to bind to RNA to achieve its role []. The N terminus contains two particularly well-conserved regions, substitutions in one of these results in temperature-sensitive cell-to-cell movement. The C terminus contains three sub-regions characterised by the distributions of charged amino acid residues [].; GO: 0003676 nucleic acid binding, 0006810 transport
Probab=44.92 E-value=69 Score=19.31 Aligned_cols=37 Identities=8% Similarity=0.142 Sum_probs=25.6
Q ss_pred CCEEEEecCC-CCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 26 EGFRVKLVND-DNLFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 26 ~~~~~~~~~~-~nl~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
|++.+. .++ +.-..|.+.+. -.+..++.|..-|.|++
T Consensus 130 Pn~~vs-l~D~~~~~~l~l~~~-~~g~~~~~G~~p~SI~~ 167 (189)
T PF01107_consen 130 PNYSVS-LDDPNLNDSLQLSVK-FEGVDMEDGSHPFSIEY 167 (189)
T ss_pred CCcEEE-ecchhcCCcEEEEEE-ccccccCCCcEeEEEEE
Confidence 455555 434 34567777776 66667999999998876
No 40
>COG2847 Copper(I)-binding protein [Inorganic ion transport and metabolism]
Probab=44.84 E-value=67 Score=19.17 Aligned_cols=27 Identities=22% Similarity=0.370 Sum_probs=23.6
Q ss_pred eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623 40 EWEVAIFGPPDTLYQGGYFKVCVVWYK 66 (69)
Q Consensus 40 ~w~~~i~GP~~tpY~gg~f~l~i~fp~ 66 (69)
-.++.|.|++-..=+|..|.++++|-+
T Consensus 104 gyHvMlm~lK~pl~eGd~v~vtL~f~~ 130 (151)
T COG2847 104 GYHVMLMGLKKPLKEGDKVPVTLKFEK 130 (151)
T ss_pred CEEEEEeccCCCccCCCEEEEEEEEec
Confidence 467889999999999999999999954
No 41
>KOG3285|consensus
Probab=43.27 E-value=73 Score=19.80 Aligned_cols=43 Identities=12% Similarity=0.209 Sum_probs=26.4
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD 50 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~ 50 (69)
.-..+||++|++.+.+.--..++.- |.=+-...|.+.+..-.|
T Consensus 118 ~k~~~~iq~EIraviRQItasVtfL-P~Le~~ctFdvLiyTdkD 160 (203)
T KOG3285|consen 118 VKDLKRIQNEIRAVIRQITASVTFL-PLLEEICTFDVLIYTDKD 160 (203)
T ss_pred hhHHHHHHHHHHHHHHHHhhheeec-ccccceeEEEEEEEeCCC
Confidence 3467899999999876554555554 322333566666654444
No 42
>PF02970 TBCA: Tubulin binding cofactor A; InterPro: IPR004226 The folding pathway of tubulins includes highly specific interactions with a series of cofactors (A, B, C, D and E) after they are released from the eukaryotic chaperonin CCT. Cofactors A and D capture and stabilise tubulin in a quasi-native conformation. Cofactor E binds to the cofactor D-tubulin complex, and interaction with cofactor C then causes the release of tubulin poypeptides in the native state. This family is the tubulin-specific chaperone A.; GO: 0051082 unfolded protein binding, 0007021 tubulin complex assembly, 0005874 microtubule; PDB: 3MXZ_A 1QSD_A 1H7C_A.
Probab=40.86 E-value=37 Score=18.20 Aligned_cols=15 Identities=20% Similarity=0.242 Sum_probs=11.3
Q ss_pred hHHHHHHHHHHHHhh
Q psy623 7 SSALRALSMEYKSLQ 21 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~ 21 (69)
..+.+||.||+....
T Consensus 6 t~~vkRL~KE~~~Y~ 20 (90)
T PF02970_consen 6 TGVVKRLLKEEASYE 20 (90)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 357899999987654
No 43
>PF14824 Sirohm_synth_M: Sirohaem biosynthesis protein central; PDB: 1KYQ_B.
Probab=40.33 E-value=36 Score=14.75 Aligned_cols=16 Identities=19% Similarity=0.237 Sum_probs=10.2
Q ss_pred CchHHHHHHHHHHHHh
Q psy623 5 PTSSALRALSMEYKSL 20 (69)
Q Consensus 5 ~~~~~~~Rl~kEl~~l 20 (69)
++...+++|.+|+++.
T Consensus 14 ~sP~la~~iR~~ie~~ 29 (30)
T PF14824_consen 14 KSPRLARLIRKEIERL 29 (30)
T ss_dssp S-HHHHHHHHHHHHHH
T ss_pred CChHHHHHHHHHHHHh
Confidence 4555667788887753
No 44
>PF14501 HATPase_c_5: GHKL domain
Probab=39.15 E-value=26 Score=18.53 Aligned_cols=13 Identities=31% Similarity=0.388 Sum_probs=8.0
Q ss_pred CCCcEEEEEEEec
Q psy623 53 YQGGYFKVCVVWY 65 (69)
Q Consensus 53 Y~gg~f~l~i~fp 65 (69)
.+++.|.++|.||
T Consensus 88 ~~~~~f~~~i~ip 100 (100)
T PF14501_consen 88 SEDGIFTVKIVIP 100 (100)
T ss_pred EECCEEEEEEEEC
Confidence 3456666766665
No 45
>PF14436 EndoU_bacteria: Bacterial EndoU nuclease
Probab=38.69 E-value=46 Score=18.58 Aligned_cols=18 Identities=11% Similarity=-0.044 Sum_probs=15.2
Q ss_pred CCCCCcEEEEEEEecCCC
Q psy623 51 TLYQGGYFKVCVVWYKCS 68 (69)
Q Consensus 51 tpY~gg~f~l~i~fp~~~ 68 (69)
.++.+|+|.+.+.++...
T Consensus 42 ~~~~~GVy~~~v~i~~~~ 59 (128)
T PF14436_consen 42 PPNPNGVYTAKVEIPGPD 59 (128)
T ss_pred ccCCCCeEEEEEEEecCC
Confidence 489999999999997654
No 46
>PF12915 DUF3833: Protein of unknown function (DUF3833); InterPro: IPR024409 This is a family of uncharacterised proteins found in Proteobacteria.
Probab=35.19 E-value=85 Score=18.97 Aligned_cols=25 Identities=20% Similarity=0.296 Sum_probs=19.1
Q ss_pred CCCcceEEEEEECCCCCCCCCcEEEEEEE
Q psy623 35 DDNLFEWEVAIFGPPDTLYQGGYFKVCVV 63 (69)
Q Consensus 35 ~~nl~~w~~~i~GP~~tpY~gg~f~l~i~ 63 (69)
..|-++|+..+.- |+.|+.+.+++.
T Consensus 107 ~Gnal~w~Y~l~l----p~~g~~~~v~fd 131 (164)
T PF12915_consen 107 AGNALNWRYDLDL----PVDGSTYKVNFD 131 (164)
T ss_pred eccEEEEEEEEEe----eeCCcEEEEEEE
Confidence 4578899999884 567788888764
No 47
>PF06675 DUF1177: Protein of unknown function (DUF1177); InterPro: IPR009561 This family consists of several hypothetical archaeal and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=34.67 E-value=24 Score=23.00 Aligned_cols=17 Identities=18% Similarity=0.327 Sum_probs=13.4
Q ss_pred CCCCCCCCcEEEEEEEe
Q psy623 48 PPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 48 P~~tpY~gg~f~l~i~f 64 (69)
-+=|||.+|+|++.-.+
T Consensus 180 qDITPygNgvyHiNSIl 196 (276)
T PF06675_consen 180 QDITPYGNGVYHINSIL 196 (276)
T ss_pred cccccCCCCceeeeccc
Confidence 34499999999998554
No 48
>PF07547 RSD-2: RSD-2 N-terminal domain; InterPro: IPR011508 This domain is found in three copies at the N terminus of the Caenorhabditis elegans RSD-2 protein. RSD-2 (RNAi spreading defective) is involved in systemic RNAi []. Mutations in the rsd-2 gene do not affect somatic genes but only germline expressed genes [].
Probab=34.57 E-value=73 Score=17.10 Aligned_cols=28 Identities=11% Similarity=0.220 Sum_probs=21.7
Q ss_pred cceEEEEEECCC-CCCCCCcEEEEEEEec
Q psy623 38 LFEWEVAIFGPP-DTLYQGGYFKVCVVWY 65 (69)
Q Consensus 38 l~~w~~~i~GP~-~tpY~gg~f~l~i~fp 65 (69)
...|++.+.-++ .+.=.+++|+..|.+-
T Consensus 16 yad~yG~V~~s~~~~~~~~~vyk~~I~vn 44 (84)
T PF07547_consen 16 YADWYGLVDCSDVESMERGTVYKTEITVN 44 (84)
T ss_pred eccccCeEEcccccccccccEEEEEEEEE
Confidence 347888887676 6777788999999874
No 49
>cd02860 Pullulanase_N_term Pullulanase domain N-terminus. Pullulanase (AKA dextrinase; alpha-dextrin endo-1,6-alpha glucosidase) is an enzyme with action similar to that of isoamylase; it cleaves 1,6-alpha-glucosidic linkages in pullulan, amylopectin, and glycogen, and in alpha-and beta-amylase limit-dextrins of amylopectin and glycogen. The N-terminus of pullulanase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=33.56 E-value=79 Score=16.69 Aligned_cols=24 Identities=25% Similarity=0.328 Sum_probs=16.0
Q ss_pred CcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 37 NLFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 37 nl~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
.--.|++.+.|. .+|..|.+.|.-
T Consensus 45 ~~gvw~~~v~~~----~~g~~Y~y~i~~ 68 (100)
T cd02860 45 ENGVWSVTLDGD----LEGYYYLYEVKV 68 (100)
T ss_pred CCCEEEEEeCCc----cCCcEEEEEEEE
Confidence 345788887753 566677777754
No 50
>PF12627 PolyA_pol_RNAbd: Probable RNA and SrmB- binding site of polymerase A; PDB: 1OU5_B 3H38_A 3H3A_B 3H39_B 3H37_A 3AQN_A 3AQK_A 3AQM_B 3AQL_B 1MIY_A ....
Probab=33.41 E-value=54 Score=15.73 Aligned_cols=19 Identities=11% Similarity=0.170 Sum_probs=13.8
Q ss_pred hHHHHHHHHHHHHhhhCCC
Q psy623 7 SSALRALSMEYKSLQEEPV 25 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~ 25 (69)
..+..||..|+..+...+.
T Consensus 21 ~is~ERi~~El~kil~~~~ 39 (64)
T PF12627_consen 21 KISKERIREELEKILSSPN 39 (64)
T ss_dssp GS-HHHHHHHHHHHHTSTT
T ss_pred cCCHHHHHHHHHHHHcCCC
Confidence 4567899999999876543
No 51
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=33.38 E-value=85 Score=17.54 Aligned_cols=21 Identities=24% Similarity=0.428 Sum_probs=9.1
Q ss_pred EEEEEECCCCCCCCCcEEEEEEE
Q psy623 41 WEVAIFGPPDTLYQGGYFKVCVV 63 (69)
Q Consensus 41 w~~~i~GP~~tpY~gg~f~l~i~ 63 (69)
..+.|.-|.+.+- |.|+|.|.
T Consensus 95 ~tv~V~spa~A~V--G~y~l~v~ 115 (118)
T PF00868_consen 95 VTVSVTSPANAPV--GRYKLSVE 115 (118)
T ss_dssp EEEEEE--TTS----EEEEEEEE
T ss_pred EEEEEECCCCCce--EEEEEEEE
Confidence 3444444555554 66666654
No 52
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=33.24 E-value=47 Score=18.68 Aligned_cols=12 Identities=25% Similarity=0.487 Sum_probs=10.3
Q ss_pred CCCcEEEEEEEe
Q psy623 53 YQGGYFKVCVVW 64 (69)
Q Consensus 53 Y~gg~f~l~i~f 64 (69)
=+||.|.+.|++
T Consensus 77 ~kgG~y~m~V~l 88 (106)
T cd02848 77 GKGGRYQMQVAL 88 (106)
T ss_pred CCCCeEEEEEEE
Confidence 349999999987
No 53
>TIGR03601 B_an_ocin probable heterocycle-containing bacteriocin, BA_2677 family. Numerous bacteria encode systems for producing bacteriocins by extensive modification of ribosomally produced precursors. These proteins are recognizable in part by proximity to the modification proteins, and in part by small size, with leader peptide-like N-terminal sequence followed by low-complexity sequence rich in Cys, Gly, and/or Ser. This protein family represents a probable member of the class, though previously unrecognized because it is not encoded adjacent to its modification proteins.
Probab=31.51 E-value=49 Score=17.29 Aligned_cols=22 Identities=23% Similarity=0.451 Sum_probs=13.2
Q ss_pred HHHHHHHhhhCCCCCEEEEecCCCCcceEE
Q psy623 13 LSMEYKSLQEEPVEGFRVKLVNDDNLFEWE 42 (69)
Q Consensus 13 l~kEl~~l~~~~~~~~~~~~~~~~nl~~w~ 42 (69)
.++||+.|..+. ...+|+..|.
T Consensus 4 FqqELq~L~lnd--------y~~gnvvyWD 25 (79)
T TIGR03601 4 FQQELQSLSLND--------YQTGNVVYWD 25 (79)
T ss_pred HHHHHHhccccc--------ccCCcEEEec
Confidence 578888876543 3344555564
No 54
>TIGR01633 phi3626_gp14_N putative phage tail component, N-terminal domain. This model represents the best-conserved region of about 125 amino acids, toward the N-terminus, of a family of proteins from temperate phage of a number of Gram-positive bacteria. These phage proteins range in length from 230 to 525 amino acids.
Probab=31.10 E-value=94 Score=16.82 Aligned_cols=57 Identities=14% Similarity=-0.020 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC--CCCCCcEEEEEEEecCC
Q psy623 9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD--TLYQGGYFKVCVVWYKC 67 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~--tpY~gg~f~l~i~fp~~ 67 (69)
..+.+.++++.+.... ....+. .+++.=..|.+.+.+..+ .....|.+.+.+.+|.|
T Consensus 63 ~~~~~~~~l~~~L~~~-~~~~L~-f~dePd~yy~a~~~~~~~~~~~~~~~~~titF~c~dP 121 (124)
T TIGR01633 63 DLRELFRELAGWLNSQ-EPVPLI-FSDEPDKTYYARVDEEIDLDEDTTFGKGTLNFICPDP 121 (124)
T ss_pred HHHHHHHHHHHHhCCC-CCcceE-eccCCCcEEEEEEcCccCHHHhhcccEEEEEEEecCC
Confidence 4556777788876533 223444 333333577788776432 22345788888887664
No 55
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=30.96 E-value=59 Score=15.88 Aligned_cols=15 Identities=40% Similarity=0.581 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHhhhC
Q psy623 9 ALRALSMEYKSLQEE 23 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~ 23 (69)
..+|+.+|+++++++
T Consensus 49 ~~~~~~k~l~~le~e 63 (68)
T PF06305_consen 49 RIRRLRKELKKLEKE 63 (68)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456777777777653
No 56
>CHL00130 rbcS ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit; Reviewed
Probab=29.87 E-value=1.2e+02 Score=17.82 Aligned_cols=33 Identities=18% Similarity=0.118 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHhhhCCCCC-EEEEecCCCCcceEEE
Q psy623 9 ALRALSMEYKSLQEEPVEG-FRVKLVNDDNLFEWEV 43 (69)
Q Consensus 9 ~~~Rl~kEl~~l~~~~~~~-~~~~~~~~~nl~~w~~ 43 (69)
....+++|+.++++.-+.. |.+ .--||...|++
T Consensus 60 D~~~Vl~Ei~~CrkayP~~yIRl--~gFDn~rq~Q~ 93 (138)
T CHL00130 60 DPAAVMFEINECRKQKPNGYIKV--NAFDASRGVES 93 (138)
T ss_pred CHHHHHHHHHHHHHHCCCcEEEE--EEeeCCCcEEE
Confidence 3457899999998865443 344 34567777777
No 57
>PF08872 KGK: KGK domain; InterPro: IPR014971 This protein is found in one or two copies in cyanobacterial proteins. It is named after a short sequence motif.
Probab=29.75 E-value=1.1e+02 Score=17.27 Aligned_cols=25 Identities=16% Similarity=0.223 Sum_probs=19.5
Q ss_pred EEEEEECCCCCCCCCcEEEE--EEEec
Q psy623 41 WEVAIFGPPDTLYQGGYFKV--CVVWY 65 (69)
Q Consensus 41 w~~~i~GP~~tpY~gg~f~l--~i~fp 65 (69)
..|.|..+...-|.-|.+++ .++|-
T Consensus 67 i~CeiL~~g~~~W~kGK~ri~~~leF~ 93 (114)
T PF08872_consen 67 IECEILRFGSKGWQKGKVRIKVSLEFI 93 (114)
T ss_pred eeEEEeccCCCCCccceEEEEEEEEEe
Confidence 45667777777899999999 77773
No 58
>PF09943 DUF2175: Uncharacterized protein conserved in archaea (DUF2175); InterPro: IPR018686 This family of various hypothetical archaeal proteins has no known function.
Probab=29.03 E-value=68 Score=17.88 Aligned_cols=20 Identities=30% Similarity=0.823 Sum_probs=15.6
Q ss_pred eEEEEEECCCCCCCCCcEEEEE
Q psy623 40 EWEVAIFGPPDTLYQGGYFKVC 61 (69)
Q Consensus 40 ~w~~~i~GP~~tpY~gg~f~l~ 61 (69)
.|.+.|-| +..|+|-.|.|.
T Consensus 2 kWkC~iCg--~~I~~gqlFTF~ 21 (101)
T PF09943_consen 2 KWKCYICG--KPIYEGQLFTFT 21 (101)
T ss_pred ceEEEecC--CeeeecceEEEe
Confidence 68888876 368999888775
No 59
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=28.56 E-value=1.1e+02 Score=16.85 Aligned_cols=20 Identities=15% Similarity=0.404 Sum_probs=13.3
Q ss_pred ceEEEEEECCCCCCCCCcEEEEEEE
Q psy623 39 FEWEVAIFGPPDTLYQGGYFKVCVV 63 (69)
Q Consensus 39 ~~w~~~i~GP~~tpY~gg~f~l~i~ 63 (69)
.+|...|.-|.+ |.|.|.+.
T Consensus 48 ~~~~G~~~~~~~-----G~y~f~~~ 67 (145)
T PF07691_consen 48 VRWTGYFKPPET-----GTYTFSLT 67 (145)
T ss_dssp EEEEEEEEESSS-----EEEEEEEE
T ss_pred EEEEEEEecccC-----ceEEEEEE
Confidence 478888883333 77777664
No 60
>PHA00672 hypothetical protein
Probab=27.75 E-value=55 Score=19.24 Aligned_cols=20 Identities=15% Similarity=0.325 Sum_probs=15.6
Q ss_pred CCCCCCCCcEEEEEEEecCC
Q psy623 48 PPDTLYQGGYFKVCVVWYKC 67 (69)
Q Consensus 48 P~~tpY~gg~f~l~i~fp~~ 67 (69)
|-.--|++|+|-=+|.+|+-
T Consensus 38 pv~H~Fs~GvYARei~IPkG 57 (152)
T PHA00672 38 PTAHLFHAGVYARTIRIPAG 57 (152)
T ss_pred hhhhhhccceeEEEEeccCc
Confidence 33346889999999999974
No 61
>KOG0680|consensus
Probab=26.69 E-value=1.8e+02 Score=20.00 Aligned_cols=44 Identities=20% Similarity=0.222 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHHhhhCCCCC--EEEEecCCCCcceEEEEEECCCCCCCC
Q psy623 8 SALRALSMEYKSLQEEPVEG--FRVKLVNDDNLFEWEVAIFGPPDTLYQ 54 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~~--~~~~~~~~~nl~~w~~~i~GP~~tpY~ 54 (69)
.-+.||.+|++.+. |.. +++..+.+.-.+-|++.-.-++.+-|+
T Consensus 332 gF~~RL~~Elr~l~---P~d~~v~V~~p~dp~~~~W~~g~~~~~~~~~~ 377 (400)
T KOG0680|consen 332 GFRQRLARELRSLL---PADWEVSVSVPEDPITFAWEGGSEFAKTDSFE 377 (400)
T ss_pred chHHHHHHHHHhhC---CccceEEEecCCCcceeeehhccccccCcchh
Confidence 46789999999874 333 344423222356777654434444343
No 62
>COG5402 Uncharacterized conserved protein [Function unknown]
Probab=26.68 E-value=1.6e+02 Score=18.18 Aligned_cols=29 Identities=21% Similarity=0.468 Sum_probs=19.5
Q ss_pred eEEEEEECCCCCCCC------CcEEEEEEEecCCCC
Q psy623 40 EWEVAIFGPPDTLYQ------GGYFKVCVVWYKCSQ 69 (69)
Q Consensus 40 ~w~~~i~GP~~tpY~------gg~f~l~i~fp~~~~ 69 (69)
.|.++|. |...|+. +|.|.|-++|...||
T Consensus 139 ~~~I~vs-p~a~pgnWl~~~g~g~f~lVltlydtp~ 173 (194)
T COG5402 139 SWVITVS-PDAQPGNWLPVRGEGNFILVLTLYDTPQ 173 (194)
T ss_pred cEEEEEC-CCCCCCceeEecCCccEEEEEEeecCcc
Confidence 3444444 6666654 688999999877765
No 63
>PF14135 DUF4302: Domain of unknown function (DUF4302)
Probab=25.30 E-value=1.8e+02 Score=18.16 Aligned_cols=46 Identities=13% Similarity=0.308 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623 8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWY 65 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp 65 (69)
.+..||...++++++.. .. .-.-|.+....-.+.-| || |.+-++|-
T Consensus 9 s~~eR~~e~~~~~k~~L--------~~--a~~GW~~~yyp~~~~~~-GG-y~f~~kF~ 54 (235)
T PF14135_consen 9 SPAERINEALAEYKKIL--------TS--APNGWKLEYYPKTDQSY-GG-YTFLMKFD 54 (235)
T ss_pred CHHHHHHHHHHHHHHHH--------hc--CCCceEEEEECCCCccC-Cc-EEEEEEEC
Confidence 45678887777665532 01 11236666662222224 33 77777773
No 64
>PF13115 YtkA: YtkA-like
Probab=24.97 E-value=69 Score=16.30 Aligned_cols=14 Identities=14% Similarity=0.195 Sum_probs=11.0
Q ss_pred CCCcEEEEEEEecC
Q psy623 53 YQGGYFKVCVVWYK 66 (69)
Q Consensus 53 Y~gg~f~l~i~fp~ 66 (69)
.+.|.|...+.|+-
T Consensus 64 ~~~G~Y~~~~~f~m 77 (86)
T PF13115_consen 64 TGPGVYEAEVTFSM 77 (86)
T ss_pred CCCCeEEEEeecCC
Confidence 46799999988863
No 65
>PF05015 Plasmid_killer: Plasmid maintenance system killer protein; InterPro: IPR007711 Several plasmids with proteic killer gene systems have been reported. All of them encode a stable toxin and an unstable antidote. Upon loss of the plasmid, the less stable inhibitor is inactivated more rapidly than the toxin, allowing the toxin to be activated. The activation of those systems result in cell filamentation and cessation of viable cell production. It has been verified that both the stable killer and the unstable inhibitor of the systems are short polypeptides. This family corresponds to the toxin.
Probab=24.69 E-value=1.2e+02 Score=16.05 Aligned_cols=39 Identities=15% Similarity=0.251 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHhhhC-------CCCCEEEEecCCCCcceEEEEEE
Q psy623 8 SALRALSMEYKSLQEE-------PVEGFRVKLVNDDNLFEWEVAIF 46 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~-------~~~~~~~~~~~~~nl~~w~~~i~ 46 (69)
...+++.+=|..|... .+++....++..+.--.|.+.|.
T Consensus 25 ~~~~~~~~~L~~L~aa~~~~dl~~~p~~r~h~L~G~~~g~~Si~i~ 70 (93)
T PF05015_consen 25 DIAKKLRRRLDQLDAATSLEDLRSPPSNRLHKLKGDRKGQWSIRIN 70 (93)
T ss_pred HHHHHHHHHHHHHHhCCCHHHHhcCcCCCcccccCCCCCcEEEEeC
Confidence 3455666666555432 23355555344443445555555
No 66
>KOG2193|consensus
Probab=24.17 E-value=1.1e+02 Score=21.66 Aligned_cols=40 Identities=20% Similarity=0.290 Sum_probs=29.9
Q ss_pred HHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCC
Q psy623 15 MEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQ 54 (69)
Q Consensus 15 kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~ 54 (69)
+-++.|.+.....+.+.++.-.+..+-.|+|.||+.--|+
T Consensus 431 ~hIKql~RfagASiKIappE~pdvseRMViItGppeaqfK 470 (584)
T KOG2193|consen 431 QHIKQLSRFAGASIKIAPPEIPDVSERMVIITGPPEAQFK 470 (584)
T ss_pred hhHHHHHHhccceeeecCCCCCCcceeEEEecCChHHHHh
Confidence 3466676666677888877767788888999999875554
No 67
>COG3866 PelB Pectate lyase [Carbohydrate transport and metabolism]
Probab=23.74 E-value=1.1e+02 Score=20.67 Aligned_cols=25 Identities=24% Similarity=0.441 Sum_probs=19.9
Q ss_pred ceEEEEEECCCCC-CCCCcEEEEEEE
Q psy623 39 FEWEVAIFGPPDT-LYQGGYFKVCVV 63 (69)
Q Consensus 39 ~~w~~~i~GP~~t-pY~gg~f~l~i~ 63 (69)
.+|.+.|.|-.|+ -|.+|.+++.+.
T Consensus 197 dh~Kssl~G~sD~~~~~~~~~kvT~h 222 (345)
T COG3866 197 DHDKSSLLGSSDSSNYDDGKYKVTIH 222 (345)
T ss_pred cCCeeeeeccCCcccccCCceeEEEe
Confidence 3789999986654 788999998875
No 68
>PHA00458 single-stranded DNA-binding protein
Probab=23.69 E-value=1.3e+02 Score=19.22 Aligned_cols=16 Identities=19% Similarity=0.243 Sum_probs=13.6
Q ss_pred CCCCcEEEEEEEecCC
Q psy623 52 LYQGGYFKVCVVWYKC 67 (69)
Q Consensus 52 pY~gg~f~l~i~fp~~ 67 (69)
-.+.|.|++.+.|++.
T Consensus 33 ~n~rG~Ykv~lt~~~d 48 (233)
T PHA00458 33 FNPRGVYKVSLTVSND 48 (233)
T ss_pred cCCCceeEEEEEecCC
Confidence 4678999999999864
No 69
>COG2880 Uncharacterized protein conserved in archaea [Function unknown]
Probab=23.66 E-value=20 Score=18.56 Aligned_cols=9 Identities=33% Similarity=0.800 Sum_probs=6.8
Q ss_pred CCCCcEEEE
Q psy623 52 LYQGGYFKV 60 (69)
Q Consensus 52 pY~gg~f~l 60 (69)
.|++|+|+-
T Consensus 10 iYEnGVfKP 18 (67)
T COG2880 10 IYENGVLKP 18 (67)
T ss_pred HHhcccccc
Confidence 588888863
No 70
>PF15586 Imm47: Immunity protein 47
Probab=23.38 E-value=1.5e+02 Score=16.68 Aligned_cols=28 Identities=32% Similarity=0.493 Sum_probs=18.1
Q ss_pred CCCCcceEEEEEECCCCCCCCCcEEEEEE
Q psy623 34 NDDNLFEWEVAIFGPPDTLYQGGYFKVCV 62 (69)
Q Consensus 34 ~~~nl~~w~~~i~GP~~tpY~gg~f~l~i 62 (69)
+.++...|--...||.+..= +-.|.+.|
T Consensus 22 d~~~F~~~l~l~IG~~~~~G-~d~F~v~V 49 (116)
T PF15586_consen 22 DPDNFCFWLELDIGPEGEDG-SDYFQVFV 49 (116)
T ss_pred CCcceEEEEEEEECCCCCCc-cceEEEEE
Confidence 45566666666779988773 34466655
No 71
>PF03366 YEATS: YEATS family; InterPro: IPR005033 Named the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit', this family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3QRL_A 2L7E_A 3FK3_C 3RLS_A.
Probab=23.33 E-value=23 Score=18.84 Aligned_cols=11 Identities=27% Similarity=0.673 Sum_probs=5.2
Q ss_pred eEEEEEECCCC
Q psy623 40 EWEVAIFGPPD 50 (69)
Q Consensus 40 ~w~~~i~GP~~ 50 (69)
+|.+.+.|+.+
T Consensus 3 ~W~v~Vr~~~~ 13 (84)
T PF03366_consen 3 KWTVYVRGLDN 13 (84)
T ss_dssp EEEEEEEECCC
T ss_pred EEEEEEEeCCC
Confidence 45555544443
No 72
>PF02922 CBM_48: Carbohydrate-binding module 48 (Isoamylase N-terminal domain); InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=23.30 E-value=1.2e+02 Score=15.24 Aligned_cols=27 Identities=37% Similarity=0.539 Sum_probs=16.3
Q ss_pred CCcceEEEEEECCCCCCCCCc-EEEEEEEec
Q psy623 36 DNLFEWEVAIFGPPDTLYQGG-YFKVCVVWY 65 (69)
Q Consensus 36 ~nl~~w~~~i~GP~~tpY~gg-~f~l~i~fp 65 (69)
++--.|++.|.+ ..-+|+ .|++.|+.+
T Consensus 46 ~~~G~w~~~~~~---~~~~g~~~Y~y~i~~~ 73 (85)
T PF02922_consen 46 DDDGVWEVTVPG---DLPPGGYYYKYRIDGD 73 (85)
T ss_dssp CTTTEEEEEEEG---CGTTTT-EEEEEEEET
T ss_pred CCCCEEEEEEcC---CcCCCCEEEEEEEEeC
Confidence 344578888875 122234 788887765
No 73
>PTZ00281 actin; Provisional
Probab=22.97 E-value=1.8e+02 Score=19.32 Aligned_cols=35 Identities=9% Similarity=0.157 Sum_probs=20.2
Q ss_pred hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEE
Q psy623 7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWE 42 (69)
Q Consensus 7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~ 42 (69)
..-..||++|++.+.... ..+.+....+.+..-|.
T Consensus 308 ~Gf~~RL~~El~~~~p~~-~~v~v~~~~~r~~~aW~ 342 (376)
T PTZ00281 308 PGIADRMNKELTALAPST-MKIKIIAPPERKYSVWI 342 (376)
T ss_pred cCHHHHHHHHHHHhCCCC-cceEEecCCCCceeEEE
Confidence 456689999999875321 23444422333455565
No 74
>PF15590 Imm15: Immunity protein 15
Probab=22.39 E-value=1.3e+02 Score=15.58 Aligned_cols=39 Identities=18% Similarity=0.350 Sum_probs=20.7
Q ss_pred HHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcE
Q psy623 15 MEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGY 57 (69)
Q Consensus 15 kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~ 57 (69)
+-|+++...+. +......|..+=..|.-+ +..+-|+||-
T Consensus 13 n~Lqkv~~~~d-~We~~y~DP~D~r~W~~~---~~~s~~hGGG 51 (69)
T PF15590_consen 13 NRLQKVASSPD-GWETLYQDPRDGRYWEKS---YPESHMHGGG 51 (69)
T ss_pred HHHHHHhcCCc-chhhhccCCCCCceeEEe---cCcccccCCC
Confidence 34555554332 443332555555666543 6667788773
No 75
>PF04447 DUF550: Protein of unknown function (DUF550); InterPro: IPR007538 This entry represents the N terminus of a protein of unknown function, found in a range of Proteobacteria and a few P22-like dsDNA virus particles.
Probab=21.97 E-value=1.1e+02 Score=17.13 Aligned_cols=17 Identities=29% Similarity=0.374 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHhhhCC
Q psy623 8 SALRALSMEYKSLQEEP 24 (69)
Q Consensus 8 ~~~~Rl~kEl~~l~~~~ 24 (69)
...+.|.||+.++..+|
T Consensus 22 g~lkHl~kE~~E~~~~p 38 (100)
T PF04447_consen 22 GPLKHLSKEALEAEAAP 38 (100)
T ss_pred hHHHHHHHHHHHHHhCC
Confidence 46789999999998775
No 76
>PF13086 AAA_11: AAA domain; PDB: 2XZL_A 2XZO_A 2WJY_A 2WJV_A 2XZP_A 2GK6_A 2GK7_A 2GJK_A.
Probab=21.08 E-value=67 Score=18.83 Aligned_cols=9 Identities=56% Similarity=0.911 Sum_probs=6.2
Q ss_pred EEEECCCCC
Q psy623 43 VAIFGPPDT 51 (69)
Q Consensus 43 ~~i~GP~~t 51 (69)
..|.||+||
T Consensus 20 ~~i~GpPGT 28 (236)
T PF13086_consen 20 TLIQGPPGT 28 (236)
T ss_dssp EEEE-STTS
T ss_pred EEEECCCCC
Confidence 467899987
No 77
>PF01954 DUF104: Protein of unknown function DUF104; InterPro: IPR008203 This family includes short archaebacterial proteins of unknown function. Archaeoglobus fulgidus has twelve copies of this protein, with several being clustered together in the genome.; PDB: 2NWT_A.
Probab=21.04 E-value=36 Score=16.99 Aligned_cols=9 Identities=33% Similarity=0.800 Sum_probs=7.3
Q ss_pred CCCCcEEEE
Q psy623 52 LYQGGYFKV 60 (69)
Q Consensus 52 pY~gg~f~l 60 (69)
.|++|+|+.
T Consensus 8 iYe~GvlkP 16 (60)
T PF01954_consen 8 IYENGVLKP 16 (60)
T ss_dssp EEETTEEEE
T ss_pred EEECCEEEE
Confidence 589999874
No 78
>PF09036 Bcr-Abl_Oligo: Bcr-Abl oncoprotein oligomerisation domain; InterPro: IPR015123 This entry represents the oligomerisation domain of the breakpoint cluster region oncoprotein Bcr, and the Bcr/Abl (Abelson-leukemia-virus) fusion protein created by a reciprocal (9;22) fusion []. Brc displays serine/threonine protein kinase activity (2.7.11.1 from EC), acting as a GTPase-activating protein for RAC1 and CDC42. Brc promotes the exchange of RAC or CDC42-bound GDP by GTP, thereby activating them []. The Bcr/Abl fusion protein loses some of the regulatory function of Bcr with regards to small Rho-like GTPases with negative consequences on cell motility, in particular on the capacity to adhere to endothelial cells []. The Bcr, Bcr/Abl oncoprotein oligomerisation domain consists of a short N-terminal helix (alpha-1), a flexible loop and a long C-terminal helix (alpha-2). Together these form an N-shaped structure, with the loop allowing the two helices to assume a parallel orientation. The monomeric domains associate into a dimer through the formation of an antiparallel coiled coil between the alpha-2 helices and domain swapping of two alpha-1 helices, where one alpha-1 helix swings back and packs against the alpha-2 helix from the second monomer. Two dimers then associate into a tetramer. The oligomerisation domain is essential for the oncogenicity of the Bcr-Abl protein []. ; GO: 0004674 protein serine/threonine kinase activity, 0005096 GTPase activator activity, 0006468 protein phosphorylation, 0007165 signal transduction; PDB: 1K1F_C.
Probab=20.98 E-value=1.2e+02 Score=16.01 Aligned_cols=12 Identities=25% Similarity=0.437 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHH
Q psy623 8 SALRALSMEYKS 19 (69)
Q Consensus 8 ~~~~Rl~kEl~~ 19 (69)
.+.+||.+|+.+
T Consensus 40 ~sirrLeqevnk 51 (79)
T PF09036_consen 40 ASIRRLEQEVNK 51 (79)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 467888888754
No 79
>PF01774 UreD: UreD urease accessory protein; InterPro: IPR002669 UreD is a urease accessory protein. Urease IPR005848 from INTERPRO hydrolyses urea into ammonia and carbamic acid []. UreD is involved in activation of the urease enzyme via the UreD-UreF-UreG-urease complex [] and is required for urease nickel metallocentre assembly []. See also UreF IPR002639 from INTERPRO, UreG IPR004400 from INTERPRO.; GO: 0016151 nickel ion binding, 0006807 nitrogen compound metabolic process; PDB: 3SF5_D.
Probab=20.84 E-value=1.5e+02 Score=18.17 Aligned_cols=23 Identities=17% Similarity=0.320 Sum_probs=13.8
Q ss_pred EEEECCCCCCCCCcEEEEEEEec
Q psy623 43 VAIFGPPDTLYQGGYFKVCVVWY 65 (69)
Q Consensus 43 ~~i~GP~~tpY~gg~f~l~i~fp 65 (69)
+.+..|.+++..|..+.++|++-
T Consensus 2 ~yl~~~~gGl~~GD~l~~~i~v~ 24 (209)
T PF01774_consen 2 VYLMNPSGGLLGGDRLRIDITVG 24 (209)
T ss_dssp EEEEE--S-B-TT-EEEEEEEE-
T ss_pred EEEEeCCCcCcCCCEEEEEEEEC
Confidence 45666888999999999999873
No 80
>PF13119 DUF3973: Domain of unknown function (DUF3973)
Probab=20.74 E-value=46 Score=15.42 Aligned_cols=12 Identities=25% Similarity=0.958 Sum_probs=7.0
Q ss_pred CCCCCCCCCcEE
Q psy623 47 GPPDTLYQGGYF 58 (69)
Q Consensus 47 GP~~tpY~gg~f 58 (69)
++.+..|++|.|
T Consensus 15 ~~~~~iF~ngfY 26 (41)
T PF13119_consen 15 GINDKIFKNGFY 26 (41)
T ss_pred cchhhhhhCcee
Confidence 355556666655
No 81
>KOG0744|consensus
Probab=20.74 E-value=1e+02 Score=21.29 Aligned_cols=16 Identities=44% Similarity=1.099 Sum_probs=11.8
Q ss_pred CCcceEE--EEEECCCCC
Q psy623 36 DNLFEWE--VAIFGPPDT 51 (69)
Q Consensus 36 ~nl~~w~--~~i~GP~~t 51 (69)
.|+-.|+ +.+.||++|
T Consensus 171 tnlIt~NRliLlhGPPGT 188 (423)
T KOG0744|consen 171 TNLITWNRLILLHGPPGT 188 (423)
T ss_pred CceeeeeeEEEEeCCCCC
Confidence 5677787 445799987
No 82
>COG3254 Uncharacterized conserved protein [Function unknown]
Probab=20.63 E-value=31 Score=19.38 Aligned_cols=12 Identities=0% Similarity=0.382 Sum_probs=6.0
Q ss_pred CCCCCEEEEecCC
Q psy623 23 EPVEGFRVKLVND 35 (69)
Q Consensus 23 ~~~~~~~~~~~~~ 35 (69)
....++++. +++
T Consensus 36 agi~nYSIf-Lde 47 (105)
T COG3254 36 AGIRNYSIF-LDE 47 (105)
T ss_pred cCCceeEEE-ecC
Confidence 344555665 443
No 83
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=20.59 E-value=1.8e+02 Score=17.79 Aligned_cols=17 Identities=12% Similarity=0.595 Sum_probs=13.5
Q ss_pred CcceEEEEEECCCCCCC
Q psy623 37 NLFEWEVAIFGPPDTLY 53 (69)
Q Consensus 37 nl~~w~~~i~GP~~tpY 53 (69)
..++|.+++.|..+|+|
T Consensus 167 ~~~~fdIvL~g~~~t~~ 183 (185)
T cd03463 167 GAYRFDIRLQGEGETVF 183 (185)
T ss_pred CeEEEEEEecCCCcccc
Confidence 45788888888888876
No 84
>PF14172 DUF4309: Domain of unknown function (DUF4309)
Probab=20.37 E-value=86 Score=18.11 Aligned_cols=17 Identities=29% Similarity=0.538 Sum_probs=13.0
Q ss_pred CCCcE-EEEEEEecCCCC
Q psy623 53 YQGGY-FKVCVVWYKCSQ 69 (69)
Q Consensus 53 Y~gg~-f~l~i~fp~~~~ 69 (69)
|.-|- |+|.+.|++.++
T Consensus 108 Y~~g~~Y~L~Fvf~~~t~ 125 (134)
T PF14172_consen 108 YKAGSDYKLEFVFPKPTS 125 (134)
T ss_pred EecCCceEEEEEEcCCCC
Confidence 55566 999999988653
No 85
>TIGR03517 GldM_gliding gliding motility-associated protein GldM. This protein family, GldM, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile. The best conserved region, toward the N-terminus, is centered on a highly hydrobobic probable transmembrane helix. Two paralogs are found in Cytophaga hutchinsonii.
Probab=20.27 E-value=1.1e+02 Score=21.94 Aligned_cols=28 Identities=14% Similarity=0.381 Sum_probs=23.1
Q ss_pred CcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623 37 NLFEWEVAIFGPPDTLYQGGYFKVCVVW 64 (69)
Q Consensus 37 nl~~w~~~i~GP~~tpY~gg~f~l~i~f 64 (69)
++.+..+.+.-.....|+|..|..+|.+
T Consensus 231 ~~d~~~a~V~~~s~~v~~Ge~~~a~vvL 258 (523)
T TIGR03517 231 SVDNYQAIVIPKSDAVFAGETYEAEVVL 258 (523)
T ss_pred cccceeEEEEcCCceeecCCeEEEEEEE
Confidence 4567778888677789999999999987
Done!