Query psy405
Match_columns 66
No_of_seqs 105 out of 363
Neff 5.9
Searched_HMMs 46136
Date Sat Aug 17 00:02:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy405.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/405hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00501 BRIGHT BRIGHT, ARID 99.8 2.3E-20 4.9E-25 111.9 4.6 51 15-65 2-52 (93)
2 PF01388 ARID: ARID/BRIGHT DNA 99.8 7.1E-20 1.5E-24 108.9 4.5 55 11-65 2-56 (92)
3 KOG2744|consensus 99.5 3.1E-15 6.7E-20 112.3 3.8 54 12-65 160-213 (512)
4 KOG2510|consensus 98.6 2.5E-08 5.4E-13 75.3 3.8 53 13-65 291-343 (532)
5 PRK02922 glycogen synthesis pr 70.7 2.8 6E-05 24.3 1.3 31 12-43 7-37 (67)
6 cd07268 Glo_EDI_BRP_like_4 Thi 64.6 2.8 6E-05 27.7 0.5 44 15-59 10-53 (149)
7 PF14475 Mso1_Sec1_bdg: Sec1-b 61.2 15 0.00033 19.3 2.9 31 5-35 8-38 (41)
8 COG0043 UbiD 3-polyprenyl-4-hy 56.1 16 0.00036 28.0 3.4 50 2-51 81-134 (477)
9 PRK11700 hypothetical protein; 54.2 4.1 9E-05 27.8 -0.0 39 20-59 53-91 (187)
10 PF06185 YecM: YecM protein; 52.1 16 0.00035 24.8 2.6 39 18-57 46-84 (185)
11 PF10281 Ish1: Putative stress 49.7 20 0.00044 17.7 2.1 18 21-38 7-24 (38)
12 cd07353 harmonin_N N-terminal 48.9 15 0.00033 21.7 1.8 23 10-32 16-38 (79)
13 cd07356 HN_L-whirlin_R1_like F 48.8 23 0.00051 21.0 2.6 20 12-31 17-36 (78)
14 cd01478 Sec23-like Sec23-like: 45.1 36 0.00079 23.7 3.6 48 13-61 203-250 (267)
15 PF02639 DUF188: Uncharacteriz 42.6 41 0.00089 21.3 3.2 25 4-28 105-129 (130)
16 smart00805 AGTRAP Angiotensin 39.4 16 0.00035 24.4 1.0 21 43-63 101-121 (159)
17 cd02879 GH18_plant_chitinase_c 39.1 42 0.00091 23.4 3.1 20 12-31 89-108 (299)
18 PF08971 GlgS: Glycogen synthe 38.9 14 0.00029 21.4 0.5 30 11-41 5-34 (66)
19 PF12496 BNIP2: Bcl2-/adenovir 38.5 6.5 0.00014 25.2 -0.9 26 36-61 95-124 (127)
20 PHA03149 hypothetical protein; 38.5 9.7 0.00021 21.9 -0.1 29 25-57 6-34 (66)
21 PF15643 Tox-PL-2: Papain fold 38.0 33 0.00072 21.3 2.2 35 14-48 19-53 (100)
22 PF14639 YqgF: Holliday-juncti 37.4 36 0.00078 21.9 2.4 41 12-55 44-84 (150)
23 cd00598 GH18_chitinase-like Th 36.4 53 0.0012 20.8 3.1 21 11-31 84-104 (210)
24 PRK00124 hypothetical protein; 36.3 60 0.0013 21.3 3.3 26 4-29 121-146 (151)
25 cd02876 GH18_SI-CLP Stabilin-1 36.1 52 0.0011 23.0 3.2 20 12-31 89-108 (318)
26 cd06548 GH18_chitinase The GH1 35.2 50 0.0011 23.1 3.0 20 12-31 106-125 (322)
27 cd06545 GH18_3CO4_chitinase Th 34.6 53 0.0011 22.1 3.0 20 12-31 80-99 (253)
28 cd02873 GH18_IDGF The IDGF's ( 33.3 58 0.0013 24.0 3.2 20 12-31 102-121 (413)
29 COG1671 Uncharacterized protei 31.7 80 0.0017 20.9 3.3 27 3-29 119-145 (150)
30 PF09441 Abp2: ARS binding pro 31.7 35 0.00076 23.1 1.7 23 35-57 43-65 (175)
31 PF00704 Glyco_hydro_18: Glyco 31.6 47 0.001 22.7 2.4 27 13-47 97-123 (343)
32 cd02874 GH18_CFLE_spore_hydrol 31.5 67 0.0015 22.2 3.2 19 13-31 85-103 (313)
33 smart00636 Glyco_18 Glycosyl h 30.6 67 0.0015 22.2 3.0 19 13-31 89-107 (334)
34 cd02878 GH18_zymocin_alpha Zym 30.1 72 0.0016 22.7 3.2 20 12-31 88-107 (345)
35 cd06549 GH18_trifunctional GH1 29.0 78 0.0017 22.0 3.2 19 13-31 86-104 (298)
36 PRK02363 DNA-directed RNA poly 28.6 79 0.0017 20.1 2.9 34 17-61 3-36 (129)
37 PRK05089 cytochrome C oxidase 28.5 33 0.00072 23.3 1.2 18 45-62 32-49 (188)
38 COG3175 COX11 Cytochrome oxida 28.1 31 0.00067 23.7 1.0 18 44-61 31-48 (195)
39 PTZ00417 lysine-tRNA ligase; P 27.9 1.1E+02 0.0024 24.1 4.0 50 13-63 253-324 (585)
40 PF00586 AIRS: AIR synthase re 27.3 1E+02 0.0023 17.5 3.1 31 8-43 65-95 (96)
41 PF15412 Nse4-Nse3_bdg: Bindin 27.3 61 0.0013 17.5 1.9 22 11-32 23-44 (56)
42 PF03823 Neurokinin_B: Neuroki 26.5 30 0.00064 19.5 0.6 16 40-55 36-51 (59)
43 cd02872 GH18_chitolectin_chito 26.5 93 0.002 21.9 3.2 19 13-31 94-112 (362)
44 KOG4588|consensus 26.3 76 0.0017 22.8 2.7 32 13-44 201-232 (267)
45 PF09153 DUF1938: Domain of un 25.8 78 0.0017 19.1 2.3 24 12-35 29-56 (86)
46 PHA02128 hypothetical protein 25.7 68 0.0015 20.7 2.2 42 24-65 39-80 (151)
47 PF04442 CtaG_Cox11: Cytochrom 25.6 23 0.0005 23.3 0.0 19 45-63 2-20 (152)
48 PTZ00128 cytochrome c oxidase 25.5 42 0.0009 23.6 1.2 20 44-63 74-93 (232)
49 cd06544 GH18_narbonin Narbonin 25.3 97 0.0021 21.5 3.1 18 14-31 96-113 (253)
50 cd02875 GH18_chitobiase Chitob 25.2 93 0.002 22.4 3.1 20 12-31 93-112 (358)
51 COG0317 SpoT Guanosine polypho 25.0 38 0.00083 27.3 1.1 48 7-58 209-256 (701)
52 cd07347 harmonin_N_like N-term 25.0 98 0.0021 18.1 2.6 21 12-32 17-37 (78)
53 PF00727 IL4: Interleukin 4 Th 24.7 65 0.0014 20.4 1.9 16 15-30 100-115 (117)
54 PLN00162 transport protein sec 24.4 95 0.0021 25.0 3.2 49 12-61 323-371 (761)
55 PF09425 CCT_2: Divergent CCT 24.3 56 0.0012 15.7 1.2 14 21-34 6-19 (27)
56 KOG2265|consensus 24.2 60 0.0013 22.1 1.8 20 13-32 151-170 (179)
57 cd06546 GH18_CTS3_chitinase GH 23.7 1.1E+02 0.0024 21.1 3.1 20 12-31 93-112 (256)
58 PF05066 HARE-HTH: HB1, ASXL, 23.6 41 0.00088 18.5 0.8 21 42-62 15-35 (72)
59 PF02337 Gag_p10: Retroviral G 23.3 73 0.0016 19.2 1.9 21 15-35 6-26 (90)
60 PF06396 AGTRAP: Angiotensin I 23.1 42 0.00091 22.3 0.9 21 42-62 100-120 (162)
61 cd01468 trunk_domain trunk dom 23.1 73 0.0016 21.3 2.1 47 14-61 172-218 (239)
62 PF03875 Statherin: Statherin; 23.0 93 0.002 16.2 2.0 20 15-34 3-22 (42)
63 PF07875 Coat_F: Coat F domain 22.7 69 0.0015 17.3 1.6 14 48-61 48-61 (64)
64 cd02871 GH18_chitinase_D-like 22.5 1.1E+02 0.0023 21.5 2.9 20 12-31 91-110 (312)
65 COG1190 LysU Lysyl-tRNA synthe 22.0 1.7E+02 0.0038 22.9 4.1 48 15-63 182-251 (502)
66 PF03808 Glyco_tran_WecB: Glyc 21.8 1.3E+02 0.0029 19.2 3.0 53 2-58 20-72 (172)
67 PRK12445 lysyl-tRNA synthetase 21.8 1.6E+02 0.0035 22.6 3.9 50 13-63 184-255 (505)
68 PF10813 DUF2733: Protein of u 20.6 42 0.00091 16.7 0.4 13 40-52 18-30 (32)
69 cd01479 Sec24-like Sec24-like: 20.4 1.1E+02 0.0023 20.9 2.4 45 16-61 171-215 (244)
70 cd06542 GH18_EndoS-like Endo-b 20.0 1.6E+02 0.0035 19.6 3.3 20 12-31 85-104 (255)
No 1
>smart00501 BRIGHT BRIGHT, ARID (A/T-rich interaction domain) domain. DNA-binding domain containing a helix-turn-helix structure
Probab=99.81 E-value=2.3e-20 Score=111.94 Aligned_cols=51 Identities=45% Similarity=0.812 Sum_probs=49.2
Q ss_pred hHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCcccceec
Q psy405 15 EKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWVKKCT 65 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~~Vt~ 65 (66)
+++.|+++|.+||+++|+|+.++|+|+|++||||+||.+|+++||+++||.
T Consensus 2 ~~~~F~~~L~~F~~~~g~~~~~~P~i~g~~vdL~~Ly~~V~~~GG~~~v~~ 52 (93)
T smart00501 2 ERVLFLDRLYKFMEERGSPLKKIPVIGGKPLDLYRLYRLVQERGGYDQVTK 52 (93)
T ss_pred cHHHHHHHHHHHHHHcCCcCCcCCeECCEeCcHHHHHHHHHHccCHHHHcC
Confidence 678999999999999999999999999999999999999999999999984
No 2
>PF01388 ARID: ARID/BRIGHT DNA binding domain; InterPro: IPR001606 Members of the recently discovered ARID (AT-rich interaction domain; also known as BRIGHT domain)) family of DNA-binding proteins are found in fungi and invertebrate and vertebrate metazoans. ARID-encoding genes are involved in a variety of biological processes including embryonic development, cell lineage gene regulation and cell cycle control. Although the specific roles of this domain and of ARID-containing proteins in transcriptional regulation are yet to be elucidated, they include both positive and negative transcriptional regulation and a likely involvement in the modification of chromatin structure []. The basic structure of the ARID domain domain appears to be a series of six alpha-helices separated by beta-strands, loops, or turns, but the structured region may extend to an additional helix at either or both ends of the basic six. Based on primary sequence homology, they can be partitioned into three structural classes: Minimal ARID proteins that consist of a core domain formed by six alpha helices; ARID proteins that supplement the core domain with an N-terminal alpha-helix; and Extended-ARID proteins, which contain the core domain and additional alpha-helices at their N- and C-termini. The human SWI-SNF complex protein p270 is an ARID family member with non-sequence-specific DNA binding activity. The ARID consensus and other structural features are common to both p270 and yeast SWI1, suggesting that p270 is a human counterpart of SWI1 []. The approximately 100-residue ARID sequence is present in a series of proteins strongly implicated in the regulation of cell growth, development, and tissue-specific gene expression. Although about a dozen ARID proteins can be identified from database searches, to date, only Bright (a regulator of B-cell-specific gene expression), dead ringer (a Drosophila melanogaster gene product required for normal development), and MRF-2 (which represses expression from the Cytomegalovirus enhancer) have been analyzed directly in regard to their DNA binding properties. Each binds preferentially to AT-rich sites. In contrast, p270 shows no sequence preference in its DNA binding activity, thereby demonstrating that AT-rich binding is not an intrinsic property of ARID domains and that ARID family proteins may be involved in a wider range of DNA interactions [].; GO: 0003677 DNA binding, 0005622 intracellular; PDB: 1C20_A 1KQQ_A 2JRZ_A 2LM1_A 2YQE_A 2JXJ_A 2EH9_A 2CXY_A 2LI6_A 1KN5_A ....
Probab=99.79 E-value=7.1e-20 Score=108.88 Aligned_cols=55 Identities=49% Similarity=0.824 Sum_probs=49.9
Q ss_pred cchHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCcccceec
Q psy405 11 AYFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWVKKCT 65 (66)
Q Consensus 11 ~~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~~Vt~ 65 (66)
....+++.|++.|.+||+++|+++.++|.|+|++||||+||.+|.++||+++||.
T Consensus 2 ~~~~~~~~F~~~L~~f~~~~g~~~~~~P~i~g~~vDL~~Ly~~V~~~GG~~~V~~ 56 (92)
T PF01388_consen 2 ANTREREQFLEQLREFHESRGTPIDRPPVIGGKPVDLYKLYKAVMKRGGFDKVTK 56 (92)
T ss_dssp SSCHHHHHHHHHHHHHHHHTTSSSSS-SEETTSE-SHHHHHHHHHHHTSHHHHHH
T ss_pred CcchHHHHHHHHHHHHHHHcCCCCCCCCcCCCEeCcHHHHHHHHHhCcCcccCcc
Confidence 3568899999999999999999999999999999999999999999999999974
No 3
>KOG2744|consensus
Probab=99.55 E-value=3.1e-15 Score=112.27 Aligned_cols=54 Identities=43% Similarity=0.712 Sum_probs=51.2
Q ss_pred chHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCcccceec
Q psy405 12 YFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWVKKCT 65 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~~Vt~ 65 (66)
...+++.||++|+.||+++|+|++++|+|+|++||||.||.+|+++||+++|+.
T Consensus 160 ~~~~~eeF~~dl~~f~~~~~~~~~~iPii~~~~ldL~~Ly~lV~s~GG~~~V~~ 213 (512)
T KOG2744|consen 160 VPKSSEEFMEDLRRFMKKRGTKVKSIPIIGGQPLDLHWLYALVTSRGGLDEVTN 213 (512)
T ss_pred ccccHHHHHHHHHHHHHHhCCcceeccccCCCcchHHHHHHHHhcCCchhHhhh
Confidence 345899999999999999999999999999999999999999999999999975
No 4
>KOG2510|consensus
Probab=98.65 E-value=2.5e-08 Score=75.29 Aligned_cols=53 Identities=32% Similarity=0.622 Sum_probs=50.8
Q ss_pred hHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCcccceec
Q psy405 13 FKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWVKKCT 65 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~~Vt~ 65 (66)
..+++.++++|+.|++++.+|++-.|.++.|+||||+||..|.++||+..|+.
T Consensus 291 qp~r~~wvDR~raF~ee~~Sp~t~~p~~gakPldl~rlYvsvke~gg~~~v~k 343 (532)
T KOG2510|consen 291 QPERKEWVDRLRAFTEERASPMTNLPAVGAKPLDLYRLYVSVKEIGGLTQVNK 343 (532)
T ss_pred CcchhhHHHHHHHHHHhhcCcccccccccccchhHHHHHHHHHHhccceeecc
Confidence 57889999999999999999999999999999999999999999999999975
No 5
>PRK02922 glycogen synthesis protein GlgS; Provisional
Probab=70.65 E-value=2.8 Score=24.34 Aligned_cols=31 Identities=19% Similarity=0.306 Sum_probs=24.7
Q ss_pred chHhHHHHHHHHHHHHHhcCCCCCCCceecCe
Q psy405 12 YFKEKESFLRDLQHFHETRGTPCRHVPKIGGK 43 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk 43 (66)
+.-..-.|+.+-.+||+.+|.++. ++.|.|+
T Consensus 7 ys~~~~DFlAsS~A~Me~Qgr~Id-vd~V~gn 37 (67)
T PRK02922 7 YSLNNFDFLARSFARMHAEGRPVD-IQAVTGN 37 (67)
T ss_pred ccccchhHHHHHHHHHHHcCCCcc-HHHHHhc
Confidence 334566899999999999999998 6666554
No 6
>cd07268 Glo_EDI_BRP_like_4 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping. The proteins of this family share three conserved metal binding amino acids with the type I extradiol dioxygenases, which shows no domain swapping.
Probab=64.63 E-value=2.8 Score=27.70 Aligned_cols=44 Identities=16% Similarity=0.044 Sum_probs=36.2
Q ss_pred hHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCc
Q psy405 15 EKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGG 59 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG 59 (66)
...+..+.+++-+.+.|+-+. .=+|+||||-||+|.+.+.-.|+
T Consensus 10 n~~~~A~~w~~~l~~~G~llS-en~INGRPI~l~~L~qPl~~~~~ 53 (149)
T cd07268 10 NENQTAERWKEGLLQCGELLS-ENEINGRPIALIKLEKPLQFAGW 53 (149)
T ss_pred CCHHHHHHHHHHHHHhchhhh-ccccCCeeEEEEEcCCCceeCCc
Confidence 445667788888888998887 78999999999999988875554
No 7
>PF14475 Mso1_Sec1_bdg: Sec1-binding region of Mso1
Probab=61.23 E-value=15 Score=19.27 Aligned_cols=31 Identities=16% Similarity=0.325 Sum_probs=24.6
Q ss_pred cCCCCccchHhHHHHHHHHHHHHHhcCCCCC
Q psy405 5 TKKDGDAYFKEKESFLRDLQHFHETRGTPCR 35 (66)
Q Consensus 5 ~~~~~~~~~~~~~~Fl~~L~~F~~~rg~p~~ 35 (66)
+..+..+.+.+.-.-...|++|..++|.|+.
T Consensus 8 l~~E~DGdteddT~v~r~l~~yY~~k~~~~P 38 (41)
T PF14475_consen 8 LSSESDGDTEDDTHVHRVLRKYYTEKGRPFP 38 (41)
T ss_pred cccccCCCCcchhHHHHHHHHHHHHcCCCCC
Confidence 4555566677777888999999999998875
No 8
>COG0043 UbiD 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases [Coenzyme metabolism]
Probab=56.14 E-value=16 Score=28.04 Aligned_cols=50 Identities=24% Similarity=0.284 Sum_probs=37.7
Q ss_pred CcccCCCCccchHhHHHHHHHHHHHHHhcCCCCCCCc----eecCeecchhhHH
Q psy405 2 AKVTKKDGDAYFKEKESFLRDLQHFHETRGTPCRHVP----KIGGKEIDLYLLY 51 (66)
Q Consensus 2 ~~~~~~~~~~~~~~~~~Fl~~L~~F~~~rg~p~~~~P----~i~gk~lDL~~Ly 51 (66)
|..+.-+++....+...|+..+.+........+.+.| ++.|..+||.+|-
T Consensus 81 ~~~~~~~~p~~~~~~~~~l~~~~~~~~~~p~~~~~~p~~~~v~~~~~v~L~~LP 134 (477)
T COG0043 81 ALLLEPPPPKSFKDILDFLRKAKDALKMPPKRVSRAPCQENVLEGDEVDLSRLP 134 (477)
T ss_pred hhhcCCCCCccHHHHHHHHHHHHHhccCCCcccccCCccceecccccCCHhhCC
Confidence 4455567777888888888888888777666666667 7888889998774
No 9
>PRK11700 hypothetical protein; Provisional
Probab=54.17 E-value=4.1 Score=27.78 Aligned_cols=39 Identities=15% Similarity=0.009 Sum_probs=26.6
Q ss_pred HHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCc
Q psy405 20 LRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGG 59 (66)
Q Consensus 20 l~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG 59 (66)
.+.+++-+.+.|+-+. .=+|+||||-||+|...+.-.|+
T Consensus 53 Ae~w~~~l~~~G~llS-en~INGRPI~l~~L~qPl~~~~w 91 (187)
T PRK11700 53 AERWRQGFLQCGELLS-ENIINGRPICLFELDQPLQVGHW 91 (187)
T ss_pred HHHHHHHHHHhchhhh-ccccCCeeEEEEEcCCCceeCCc
Confidence 3344444444455553 56899999999999998875554
No 10
>PF06185 YecM: YecM protein; InterPro: IPR010393 This family consists of several bacterial YecM proteins of unknown function.; PDB: 1K4N_A.
Probab=52.13 E-value=16 Score=24.83 Aligned_cols=39 Identities=15% Similarity=0.056 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhc
Q psy405 18 SFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSN 57 (66)
Q Consensus 18 ~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~ 57 (66)
+..+.+.+-+.+.|+-+. .-+|+|||+-+|+|-+.+...
T Consensus 46 ~~A~~~~~~l~~~G~llS-en~INGRPI~l~~L~qPL~~~ 84 (185)
T PF06185_consen 46 ETAERWKQALLQCGELLS-ENMINGRPICLFKLNQPLQFG 84 (185)
T ss_dssp HHHHHHHHHHTTTEEEEE-EEEETTEEEEEEEEEEEEEET
T ss_pred HHHHHHHHHHHHhChhhh-hceeCCeeEEEEEcCCchhcC
Confidence 333444444444453333 458999999999987766433
No 11
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=49.66 E-value=20 Score=17.75 Aligned_cols=18 Identities=22% Similarity=0.508 Sum_probs=14.8
Q ss_pred HHHHHHHHhcCCCCCCCc
Q psy405 21 RDLQHFHETRGTPCRHVP 38 (66)
Q Consensus 21 ~~L~~F~~~rg~p~~~~P 38 (66)
++|++|+.++|.+..+..
T Consensus 7 ~~L~~wL~~~gi~~~~~~ 24 (38)
T PF10281_consen 7 SDLKSWLKSHGIPVPKSA 24 (38)
T ss_pred HHHHHHHHHcCCCCCCCC
Confidence 689999999998877443
No 12
>cd07353 harmonin_N N-terminal protein-binding module of harmonin. Harmonin is a postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold protein, which organizes the Usher protein network of the inner ear and the retina. Harmonin contains a single copy of this domain, which is found at the N-terminus of all three harmonin isoform classes (a, b and c), and which preceeds the first PDZ protein-binding domain, PDZ1. This harmonin_N domain binds specifically to a short internal peptide fragment of the cadherin 23 cytoplasmic domain; cadherin 23 is a component of the Usher protein network.
Probab=48.93 E-value=15 Score=21.69 Aligned_cols=23 Identities=17% Similarity=0.183 Sum_probs=19.2
Q ss_pred ccchHhHHHHHHHHHHHHHhcCC
Q psy405 10 DAYFKEKESFLRDLQHFHETRGT 32 (66)
Q Consensus 10 ~~~~~~~~~Fl~~L~~F~~~rg~ 32 (66)
.+.+.+++.|.+.|+.||++-..
T Consensus 16 ie~EaEkd~lY~~Lr~YHqSm~l 38 (79)
T cd07353 16 IDNEAEKDYLYDVLRMYHQSMNL 38 (79)
T ss_pred hccHHHHHHHHHHHHHHHhccCH
Confidence 34568899999999999998763
No 13
>cd07356 HN_L-whirlin_R1_like First harmonin_N_like domain (repeat 1) of the long isoform of whirlin, and related domains. This subgroup contains the first of two harmonin_N_like domains of the long isoform of whirlin, and related domains. Whirlin is a postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold protein which binds various components of the Usher protein network of the inner ear and the retina: erythrocyte protein p55, usherin, VlGR1, and myosin XVa. The long isoform of whirlin contains two harmonin_N_like domains, and three PDZ protein-binding domains, PDZ1-3. This first harmonin_N_like domain precedes PDZ1, and is a putative protein-binding module based on its sequence similarity to the N-terminal domain of harmonin. This first harmonin_N_like domain has been assayed for interaction with the cytoplasmic domain of cadherin 23 (a component of the Usher network and an interacting partner of the harmonin N-domain), however no interaction could be detected. Th
Probab=48.78 E-value=23 Score=21.03 Aligned_cols=20 Identities=30% Similarity=0.542 Sum_probs=17.8
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
...+++.|...|..|+..|.
T Consensus 17 s~~Er~~f~h~Ln~Y~~~Rn 36 (78)
T cd07356 17 SEAEREEFIHCLNDYHAKRN 36 (78)
T ss_pred cHHHHHHHHHHHHHHHhccc
Confidence 45789999999999999985
No 14
>cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24
Probab=45.11 E-value=36 Score=23.72 Aligned_cols=48 Identities=23% Similarity=0.298 Sum_probs=40.1
Q ss_pred hHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCccc
Q psy405 13 FKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~ 61 (66)
-.....|++.|..=+.+.|..+. +=..+...+||+.+...+..-||.-
T Consensus 203 ~~~a~~fY~~la~~~~~~~vsvD-lF~~s~d~vglaem~~l~~~TGG~v 250 (267)
T cd01478 203 YKKAVKFYDSLAKRLAANGHAVD-IFAGCLDQVGLLEMKVLVNSTGGHV 250 (267)
T ss_pred hhhHHHHHHHHHHHHHhCCeEEE-EEeccccccCHHHHHHHHHhcCcEE
Confidence 34567899999999988998877 5566778899999999999999964
No 15
>PF02639 DUF188: Uncharacterized BCR, YaiI/YqxD family COG1671; InterPro: IPR003791 This entry describes proteins of unknown function.
Probab=42.61 E-value=41 Score=21.28 Aligned_cols=25 Identities=20% Similarity=0.228 Sum_probs=17.5
Q ss_pred ccCCCCccchHhHHHHHHHHHHHHH
Q psy405 4 VTKKDGDAYFKEKESFLRDLQHFHE 28 (66)
Q Consensus 4 ~~~~~~~~~~~~~~~Fl~~L~~F~~ 28 (66)
.++.+++=...+++.|...|..+++
T Consensus 105 ~~~gpk~~~~~d~~~F~~~L~~~l~ 129 (130)
T PF02639_consen 105 RTKGPKKFTKKDRQRFANALDRLLQ 129 (130)
T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHc
Confidence 3455555667788888888887765
No 16
>smart00805 AGTRAP Angiotensin II, type I receptor-associated protein. This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=39.39 E-value=16 Score=24.36 Aligned_cols=21 Identities=24% Similarity=0.425 Sum_probs=18.7
Q ss_pred eecchhhHHHHHHhcCcccce
Q psy405 43 KEIDLYLLYCLVTSNGGWVKK 63 (66)
Q Consensus 43 k~lDL~~Ly~~V~~~GG~~~V 63 (66)
||+.++.||++-.+|||.-.|
T Consensus 101 rP~S~~lly~~~~eRGG~l~v 121 (159)
T smart00805 101 KPLSCCLVYHMYRERGGELPV 121 (159)
T ss_pred HHHHHHHHHHHHHHccCcccc
Confidence 789999999999999997654
No 17
>cd02879 GH18_plant_chitinase_class_V The class V plant chitinases have a glycosyl hydrolase family 18 (GH18) domain, but lack the chitin-binding domain present in other GH18 enzymes. The GH18 domain of the class V chitinases has endochitinase activity in some cases and no catalytic activity in others. Included in this family is a lectin found in black locust (Robinia pseudoacacia) bark, which binds chitin but lacks chitinase activity. Also included is a chitinase-related receptor-like kinase (CHRK1) from tobacco (Nicotiana tabacum), with an N-terminal GH18 domain and a C-terminal kinase domain, which is thought to be part of a plant signaling pathway. The GH18 domain of CHRK1 is expressed extracellularly where it binds chitin but lacks chitinase activity.
Probab=39.11 E-value=42 Score=23.43 Aligned_cols=20 Identities=10% Similarity=0.174 Sum_probs=17.3
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+.+.|
T Consensus 89 ~~~~R~~fi~siv~~l~~~~ 108 (299)
T cd02879 89 DPTARKAFINSSIKVARKYG 108 (299)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 34678999999999999986
No 18
>PF08971 GlgS: Glycogen synthesis protein; InterPro: IPR015065 Members of this protein are involved in glycogen synthesis in Enterobacteria. The structure of the polypeptide chain comprises a bundle of two parallel amphipathic helices, alpha-1 and alpha-3, and a short hydrophobic helix alpha-2 sandwiched between them []. ; GO: 0005978 glycogen biosynthetic process; PDB: 1RRZ_A.
Probab=38.95 E-value=14 Score=21.39 Aligned_cols=30 Identities=23% Similarity=0.227 Sum_probs=19.5
Q ss_pred cchHhHHHHHHHHHHHHHhcCCCCCCCceec
Q psy405 11 AYFKEKESFLRDLQHFHETRGTPCRHVPKIG 41 (66)
Q Consensus 11 ~~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~ 41 (66)
-+.-..-.|+..=..||+.+|.++. +..|.
T Consensus 5 ~ys~~nfDFlAsS~ArMe~qgr~vd-~~~I~ 34 (66)
T PF08971_consen 5 VYSHTNFDFLASSFARMEAQGRKVD-VDAIT 34 (66)
T ss_dssp TTHHHHHHHHHHHHHHHHHHT-----HHHHH
T ss_pred hhccccchHHHHHHHHHHHcCCCCC-HHHHh
Confidence 3455667899999999999999988 44333
No 19
>PF12496 BNIP2: Bcl2-/adenovirus E1B nineteen kDa-interacting protein 2; InterPro: IPR022181 This domain family is found in eukaryotes, and is typically between 119 and 133 amino acids in length. There is a conserved HGGY sequence motif. This family is Bcl2-/adenovirus E1B nineteen kDa-interacting protein 2. It interacts with pro- and anti- apoptotic molecules in the cell.
Probab=38.53 E-value=6.5 Score=25.18 Aligned_cols=26 Identities=35% Similarity=0.554 Sum_probs=17.9
Q ss_pred CCceecC--eecch--hhHHHHHHhcCccc
Q psy405 36 HVPKIGG--KEIDL--YLLYCLVTSNGGWV 61 (66)
Q Consensus 36 ~~P~i~g--k~lDL--~~Ly~~V~~~GG~~ 61 (66)
|.=+|++ +.||+ ..-|+.|.++|||.
T Consensus 95 R~v~iG~qE~rIDMkvIEPYkrViSHGGYy 124 (127)
T PF12496_consen 95 RTVRIGEQEHRIDMKVIEPYKRVISHGGYY 124 (127)
T ss_pred eEEEECCccEEEeeEecccceeeeccCCcc
Confidence 3445554 34665 45699999999994
No 20
>PHA03149 hypothetical protein; Provisional
Probab=38.48 E-value=9.7 Score=21.92 Aligned_cols=29 Identities=21% Similarity=0.218 Sum_probs=21.7
Q ss_pred HHHHhcCCCCCCCceecCeecchhhHHHHHHhc
Q psy405 25 HFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSN 57 (66)
Q Consensus 25 ~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~ 57 (66)
+-..+|..|+. +.|++|||+.=|.++.+.
T Consensus 6 SiCkRr~nPlD----V~G~~Inv~~DFE~fsE~ 34 (66)
T PHA03149 6 SICKRRPNPLD----TEGKVINLPDDFEELSET 34 (66)
T ss_pred eeecCCCCCcc----CCCcEecchhhhhhhCcc
Confidence 33456667775 999999999999887543
No 21
>PF15643 Tox-PL-2: Papain fold toxin 2
Probab=37.97 E-value=33 Score=21.26 Aligned_cols=35 Identities=14% Similarity=0.186 Sum_probs=26.1
Q ss_pred HhHHHHHHHHHHHHHhcCCCCCCCceecCeecchh
Q psy405 14 KEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLY 48 (66)
Q Consensus 14 ~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~ 48 (66)
-++.+|.+.|.+|+.++|.|.+.+-.-.|-.-++|
T Consensus 19 ~qC~~cA~Al~~~L~~~gI~Gk~i~l~T~~~~~~~ 53 (100)
T PF15643_consen 19 FQCVECASALKQFLKQAGIPGKIIRLYTGYHEGPF 53 (100)
T ss_pred eehHHHHHHHHHHHHHCCCCceEEEEEecCCCCce
Confidence 35778999999999999988875555554444544
No 22
>PF14639 YqgF: Holliday-junction resolvase-like of SPT6 ; PDB: 3PSI_A 3PSF_A.
Probab=37.42 E-value=36 Score=21.94 Aligned_cols=41 Identities=24% Similarity=0.253 Sum_probs=27.1
Q ss_pred chHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHH
Q psy405 12 YFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVT 55 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~ 55 (66)
....++.|++.|.+|+.++...+ =.|+|-..+-.+|+..|.
T Consensus 44 ~~~~~~~~~~~l~~~i~~~kP~v---I~v~g~~~~s~~l~~~v~ 84 (150)
T PF14639_consen 44 DRERKEEDMERLKKFIEKHKPDV---IAVGGNSRESRKLYDDVR 84 (150)
T ss_dssp -SS-SHHHHHHHHHHHHHH--SE---EEE--SSTHHHHHHHHHH
T ss_pred chHHHHHHHHHHHHHHHHcCCeE---EEEcCCChhHHHHHHHHH
Confidence 34556799999999999875433 357888888888888764
No 23
>cd00598 GH18_chitinase-like The GH18 (glycosyl hydrolase, family 18) type II chitinases hydrolyze chitin, an abundant polymer of beta-1,4-linked N-acetylglucosamine (GlcNAc) which is a major component of the cell wall of fungi and the exoskeleton of arthropods. Chitinases have been identified in viruses, bacteria, fungi, protozoan parasites, insects, and plants. The structure of the GH18 domain is an eight-stranded beta/alpha barrel with a pronounced active-site cleft at the C-terminal end of the beta-barrel. The GH18 family includes chitotriosidase, chitobiase, hevamine, zymocin-alpha, narbonin, SI-CLP (stabilin-1 interacting chitinase-like protein), IDGF (imaginal disc growth factor), CFLE (cortical fragment-lytic enzyme) spore hydrolase, the type III and type V plant chitinases, the endo-beta-N-acetylglucosaminidases, and the chitolectins. The GH85 (glycosyl hydrolase, family 85) ENGases (endo-beta-N-acetylglucosaminidases) are closely related to the GH18 chitinases and are inclu
Probab=36.41 E-value=53 Score=20.83 Aligned_cols=21 Identities=24% Similarity=0.328 Sum_probs=17.6
Q ss_pred cchHhHHHHHHHHHHHHHhcC
Q psy405 11 AYFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 11 ~~~~~~~~Fl~~L~~F~~~rg 31 (66)
.....+++|.+++.+|+++.|
T Consensus 84 ~~~~~~~~f~~~~~~~v~~~~ 104 (210)
T cd00598 84 SDPASRAAFANSLVSFLKTYG 104 (210)
T ss_pred cCHHHHHHHHHHHHHHHHHcC
Confidence 345678999999999999986
No 24
>PRK00124 hypothetical protein; Validated
Probab=36.34 E-value=60 Score=21.27 Aligned_cols=26 Identities=12% Similarity=0.097 Sum_probs=22.0
Q ss_pred ccCCCCccchHhHHHHHHHHHHHHHh
Q psy405 4 VTKKDGDAYFKEKESFLRDLQHFHET 29 (66)
Q Consensus 4 ~~~~~~~~~~~~~~~Fl~~L~~F~~~ 29 (66)
.++.+++=..++++.|.+.|..++.+
T Consensus 121 ~t~Gp~~~~~~Dr~~F~~~L~~~l~~ 146 (151)
T PRK00124 121 RTGGPKPFTQEDRSRFEAELDKLIRR 146 (151)
T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHH
Confidence 45667777889999999999999986
No 25
>cd02876 GH18_SI-CLP Stabilin-1 interacting chitinase-like protein (SI-CLP) is a eukaryotic chitinase-like protein of unknown function that interacts with the endocytic/sorting transmembrane receptor stabilin-1 and is secreted from the lysosome. SI-CLP has a glycosyl hydrolase family 18 (GH18) domain but lacks a chitin-binding domain. The catalytic amino acids of the GH18 domain are not conserved in SI-CLP, similar to the chitolectins YKL-39, YKL-40, and YM1/2. Human SI-CLP is sorted to late endosomes and secretory lysosomes in alternatively activated macrophages.
Probab=36.12 E-value=52 Score=22.96 Aligned_cols=20 Identities=15% Similarity=0.130 Sum_probs=17.2
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+++.|
T Consensus 89 ~~~~R~~fi~s~~~~~~~~~ 108 (318)
T cd02876 89 DEQEREKLIKLLVTTAKKNH 108 (318)
T ss_pred CHHHHHHHHHHHHHHHHHcC
Confidence 34568899999999999997
No 26
>cd06548 GH18_chitinase The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites. The structure of this domain is an eight-stranded alpha/beta barrel with a pronounced active-site cleft at the C-terminal end of the beta-barrel.
Probab=35.23 E-value=50 Score=23.13 Aligned_cols=20 Identities=15% Similarity=0.179 Sum_probs=17.0
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+++.|
T Consensus 106 ~~~~r~~Fi~siv~~l~~~~ 125 (322)
T cd06548 106 TEASRAKFADSAVDFIRKYG 125 (322)
T ss_pred CHHHHHHHHHHHHHHHHhcC
Confidence 34568899999999999985
No 27
>cd06545 GH18_3CO4_chitinase The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas.
Probab=34.59 E-value=53 Score=22.14 Aligned_cols=20 Identities=0% Similarity=0.112 Sum_probs=16.8
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+++.|
T Consensus 80 ~~~~r~~fi~~lv~~~~~~~ 99 (253)
T cd06545 80 DPAKRKALVDKIINYVVSYN 99 (253)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 44678899999999999876
No 28
>cd02873 GH18_IDGF The IDGF's (imaginal disc growth factors) are a family of growth factors identified in insects that include at least five members, some of which are encoded by genes in a tight cluster. The IDGF's have an eight-stranded alpha/beta barrel fold and are related to the glycosyl hydrolase family 18 (GH18) chitinases, but they have an amino acid substitution known to abolish chitinase catalytic activity. IDGFs may have evolved from chitinases to gain new functions as growth factors, interacting with cell surface glycoproteins involved in growth-promoting processes.
Probab=33.34 E-value=58 Score=23.97 Aligned_cols=20 Identities=15% Similarity=0.341 Sum_probs=17.1
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+.+.|
T Consensus 102 ~~~~R~~Fi~siv~~l~~~~ 121 (413)
T cd02873 102 SSESRNAFINSAHSLLKTYG 121 (413)
T ss_pred CHHHHHHHHHHHHHHHHHcC
Confidence 34668899999999999986
No 29
>COG1671 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.69 E-value=80 Score=20.87 Aligned_cols=27 Identities=15% Similarity=0.172 Sum_probs=21.6
Q ss_pred cccCCCCccchHhHHHHHHHHHHHHHh
Q psy405 3 KVTKKDGDAYFKEKESFLRDLQHFHET 29 (66)
Q Consensus 3 ~~~~~~~~~~~~~~~~Fl~~L~~F~~~ 29 (66)
+..+.+++-..++++.|.+.|..|+.+
T Consensus 119 ~~~~gp~~~~~rDr~~F~~~ldr~~~~ 145 (150)
T COG1671 119 KKTGGPAAFSSRDRSRFANALDRLLSR 145 (150)
T ss_pred cccCCCCccChHHHHHHHHHHHHHHHH
Confidence 345677777889999999999998764
No 30
>PF09441 Abp2: ARS binding protein 2; InterPro: IPR018562 This DNA-binding protein binds to the autonomously replicating sequence (ARS) binding element. It may play a role in regulating the cell cycle response to stress signals [].
Probab=31.66 E-value=35 Score=23.09 Aligned_cols=23 Identities=30% Similarity=0.521 Sum_probs=20.2
Q ss_pred CCCceecCeecchhhHHHHHHhc
Q psy405 35 RHVPKIGGKEIDLYLLYCLVTSN 57 (66)
Q Consensus 35 ~~~P~i~gk~lDL~~Ly~~V~~~ 57 (66)
..+|.-+||..+.|.||..|.+.
T Consensus 43 r~pPkS~Gk~Fs~~~Lf~LI~k~ 65 (175)
T PF09441_consen 43 RSPPKSDGKSFSTFTLFELIRKL 65 (175)
T ss_pred cCCCCcCCccchHHHHHHHHHHH
Confidence 34899999999999999998764
No 31
>PF00704 Glyco_hydro_18: Glycosyl hydrolases family 18; InterPro: IPR001223 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Some members of this family, GH18 from CAZY, belong to the chitinase class II group which includes chitinase, chitodextrinase and the killer toxin of Kluyveromyces lactis. The chitinases hydrolyse chitin oligosaccharides. The family also includes various glycoproteins from mammals; cartilage glycoprotein and the oviduct-specific glycoproteins are two examples.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1ITX_A 3ALG_A 3ALF_A 1NAR_A 3QOK_A 3G6L_A 3G6M_A 2DT1_A 2B31_A 2O92_A ....
Probab=31.57 E-value=47 Score=22.66 Aligned_cols=27 Identities=22% Similarity=0.534 Sum_probs=21.1
Q ss_pred hHhHHHHHHHHHHHHHhcCCCCCCCceecCeecch
Q psy405 13 FKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDL 47 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL 47 (66)
...++.|.+++.+|+++.| ++|-+||.
T Consensus 97 ~~~r~~f~~~i~~~l~~y~--------~DGidiD~ 123 (343)
T PF00704_consen 97 PAKRQNFINNIVSFLKKYG--------FDGIDIDW 123 (343)
T ss_dssp HHHHHHHHHHHHHHHHHHT---------SEEEEEE
T ss_pred HHHHHHHHHhhhhhhcccC--------cceeeeee
Confidence 4568999999999999988 66666653
No 32
>cd02874 GH18_CFLE_spore_hydrolase Cortical fragment-lytic enzyme (CFLE) is a peptidoglycan hydrolase involved in bacterial endospore germination. CFLE is expressed as an inactive preprotein (called SleB) in the forespore compartment of sporulating cells. SleB translocates across the forespore inner membrane and is deposited as a mature enzyme in the cortex layer of the spore. As part of a sensory mechanism capable of initiating germination, CFLE degrades a spore-specific peptidoglycan constituent called muramic-acid delta-lactam that comprises the outer cortex. CFLE has a C-terminal glycosyl hydrolase family 18 (GH18) catalytic domain as well as two N-terminal LysM peptidoglycan-binding domains. In addition to SleB, this family includes YaaH, YdhD, and YvbX from Bacillus subtilis.
Probab=31.47 E-value=67 Score=22.20 Aligned_cols=19 Identities=5% Similarity=0.293 Sum_probs=16.4
Q ss_pred hHhHHHHHHHHHHHHHhcC
Q psy405 13 FKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg 31 (66)
...++.|.+++.+|+++.|
T Consensus 85 ~~~r~~fi~~iv~~l~~~~ 103 (313)
T cd02874 85 PEARQRLINNILALAKKYG 103 (313)
T ss_pred HHHHHHHHHHHHHHHHHhC
Confidence 3567899999999999986
No 33
>smart00636 Glyco_18 Glycosyl hydrolase family 18.
Probab=30.57 E-value=67 Score=22.24 Aligned_cols=19 Identities=16% Similarity=0.392 Sum_probs=16.7
Q ss_pred hHhHHHHHHHHHHHHHhcC
Q psy405 13 FKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg 31 (66)
...++.|.+++.+|+.+.|
T Consensus 89 ~~~r~~fi~~i~~~~~~~~ 107 (334)
T smart00636 89 PASRKKFIDSIVSFLKKYG 107 (334)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 4678899999999999986
No 34
>cd02878 GH18_zymocin_alpha Zymocin, alpha subunit. Zymocin is a heterotrimeric enzyme that inhibits yeast cell cycle progression. The zymocin alpha subunit has a chitinase activity that is essential for holoenzyme action from the cell exterior while the gamma subunit contains the intracellular toxin responsible for G1 phase cell cycle arrest. The zymocin alpha and beta subunits are thought to act from the cell's exterior by docking to the cell wall-associated chitin, thus mediating gamma-toxin translocation. The alpha subunit has an eight-stranded TIM barrel fold similar to that of family 18 glycosyl hydrolases such as hevamine, chitolectin, and chitobiase.
Probab=30.11 E-value=72 Score=22.72 Aligned_cols=20 Identities=10% Similarity=0.267 Sum_probs=17.4
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+++.|
T Consensus 88 ~~~~R~~Fi~si~~~~~~~~ 107 (345)
T cd02878 88 KPANRDTFANNVVNFVNKYN 107 (345)
T ss_pred CHHHHHHHHHHHHHHHHHcC
Confidence 35678999999999999986
No 35
>cd06549 GH18_trifunctional GH18 domain of an uncharacterized family of bacterial proteins, which share a common three-domain architecture: an N-terminal glycosyl hydrolase family 18 (GH18) domain, a glycosyl transferase family 2 domain, and a C-terminal polysaccharide deacetylase domain.
Probab=29.01 E-value=78 Score=22.02 Aligned_cols=19 Identities=11% Similarity=0.321 Sum_probs=16.5
Q ss_pred hHhHHHHHHHHHHHHHhcC
Q psy405 13 FKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg 31 (66)
...++.|++++.+|+++.|
T Consensus 86 ~~~R~~fi~~iv~~~~~~~ 104 (298)
T cd06549 86 PSARAKFIANIAAYLERNQ 104 (298)
T ss_pred HHHHHHHHHHHHHHHHHhC
Confidence 3567889999999999987
No 36
>PRK02363 DNA-directed RNA polymerase subunit delta; Reviewed
Probab=28.56 E-value=79 Score=20.10 Aligned_cols=34 Identities=12% Similarity=0.071 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCccc
Q psy405 17 ESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 17 ~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~ 61 (66)
.++++.-+.++.+++. ++.++.|++.|.+..|+.
T Consensus 3 ~S~idvAy~iL~~~~~-----------~m~f~dL~~ev~~~~~~s 36 (129)
T PRK02363 3 LSLIEVAYEILKEKKE-----------PMSFYDLVNEIQKYLGKS 36 (129)
T ss_pred ccHHHHHHHHHHHcCC-----------cccHHHHHHHHHHHhCCC
Confidence 4567777778877654 455889999998888865
No 37
>PRK05089 cytochrome C oxidase assembly protein; Provisional
Probab=28.53 E-value=33 Score=23.33 Aligned_cols=18 Identities=33% Similarity=0.636 Sum_probs=15.8
Q ss_pred cchhhHHHHHHhcCcccc
Q psy405 45 IDLYLLYCLVTSNGGWVK 62 (66)
Q Consensus 45 lDL~~Ly~~V~~~GG~~~ 62 (66)
|-||++|+.|+..||-.+
T Consensus 32 VPLY~~fC~~TG~~G~t~ 49 (188)
T PRK05089 32 VPLYDVFCEVTGINGTTQ 49 (188)
T ss_pred hHHHHHHHHhhCCCceec
Confidence 569999999999999765
No 38
>COG3175 COX11 Cytochrome oxidase assembly factor [Posttranslational modification, protein turnover, chaperones]
Probab=28.14 E-value=31 Score=23.75 Aligned_cols=18 Identities=44% Similarity=0.835 Sum_probs=15.1
Q ss_pred ecchhhHHHHHHhcCccc
Q psy405 44 EIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 44 ~lDL~~Ly~~V~~~GG~~ 61 (66)
.|-||+||+.|..-||--
T Consensus 31 aVPLY~lfC~vTGygGtt 48 (195)
T COG3175 31 AVPLYKLFCRVTGYGGTT 48 (195)
T ss_pred hhhHHHHHhhhhccCCEe
Confidence 367999999999999853
No 39
>PTZ00417 lysine-tRNA ligase; Provisional
Probab=27.87 E-value=1.1e+02 Score=24.11 Aligned_cols=50 Identities=32% Similarity=0.451 Sum_probs=32.4
Q ss_pred hHhHHHHHHHHHHHHHhcC-----CCCCCCceecCee------------cchh-----hHHHHHHhcCcccce
Q psy405 13 FKEKESFLRDLQHFHETRG-----TPCRHVPKIGGKE------------IDLY-----LLYCLVTSNGGWVKK 63 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg-----~p~~~~P~i~gk~------------lDL~-----~Ly~~V~~~GG~~~V 63 (66)
-..+...+..+++|+.++| ||+- .+.-+|-. .|+| .||..-.-.||+++|
T Consensus 253 fr~RS~Ii~aiR~Ff~~rGFlEVeTPiL-~~~~GGA~a~PF~T~~n~~d~~lYLriSpEL~lKrLlvgG~~rV 324 (585)
T PTZ00417 253 FITRTKIINYLRNFLNDRGFIEVETPTM-NLVAGGANARPFITHHNDLDLDLYLRIATELPLKMLIVGGIDKV 324 (585)
T ss_pred HHHHHHHHHHHHHHHHHCCeEEEeCCee-eccCCcccceeEEecccCCCcceEEeecHHHHHHHHHHhCCCCE
Confidence 3567788999999999999 5554 22223322 1222 466666666888877
No 40
>PF00586 AIRS: AIR synthase related protein, N-terminal domain; InterPro: IPR000728 This family includes Hydrogen expression/formation protein, HypE, which may be involved in the maturation of NifE hydrogenase; AIR synthase and FGAM synthase, which are involved in de novo purine biosynthesis; and selenide, water dikinase, an enzyme which synthesizes selenophosphate from selenide and ATP.; GO: 0003824 catalytic activity; PDB: 3VIU_A 2Z1T_A 2Z1U_A 3C9U_B 3C9S_A 3C9R_A 1VQV_A 3C9T_B 3M84_A 3QTY_A ....
Probab=27.31 E-value=1e+02 Score=17.45 Aligned_cols=31 Identities=16% Similarity=0.340 Sum_probs=22.4
Q ss_pred CCccchHhHHHHHHHHHHHHHhcCCCCCCCceecCe
Q psy405 8 DGDAYFKEKESFLRDLQHFHETRGTPCRHVPKIGGK 43 (66)
Q Consensus 8 ~~~~~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk 43 (66)
+++.....-+++++.+.++.++-| +|+++|+
T Consensus 65 ~~~~~~~~l~~~~~Gi~~~~~~~g-----~~ivGG~ 95 (96)
T PF00586_consen 65 PNPESPEELKEIVKGIAEACREFG-----IPIVGGD 95 (96)
T ss_dssp STTSBHHHHHHHHHHHHHHHHHHT------EEEEEE
T ss_pred CCCCCHHHHHHHHHHHHHHHHHhC-----CcEeCcC
Confidence 334455667888889999999888 6777775
No 41
>PF15412 Nse4-Nse3_bdg: Binding domain of Nse4/EID3 to Nse3-MAGE
Probab=27.30 E-value=61 Score=17.46 Aligned_cols=22 Identities=14% Similarity=0.141 Sum_probs=17.4
Q ss_pred cchHhHHHHHHHHHHHHHhcCC
Q psy405 11 AYFKEKESFLRDLQHFHETRGT 32 (66)
Q Consensus 11 ~~~~~~~~Fl~~L~~F~~~rg~ 32 (66)
...-+...|+..|..||.....
T Consensus 23 ~~~fd~deFv~~l~~fm~~~~~ 44 (56)
T PF15412_consen 23 GSGFDVDEFVSKLKTFMGGNRF 44 (56)
T ss_pred CCccCHHHHHHHHHHHhCcccC
Confidence 3445788999999999987654
No 42
>PF03823 Neurokinin_B: Neurokinin B; InterPro: IPR003635 Tachykinins [, , ] are a group of biologically active peptides which excite neurons, evoke behavioral responses, are potent vasodilatators and contract (directly or indirectly) many smooth muscles. This family includes neurokinins, as well as many other peptides. Like other tachykinins, neurokinins are synthesized as larger protein precursors that are enzymatically converted to their mature forms.; GO: 0007217 tachykinin receptor signaling pathway
Probab=26.53 E-value=30 Score=19.53 Aligned_cols=16 Identities=31% Similarity=0.258 Sum_probs=11.5
Q ss_pred ecCeecchhhHHHHHH
Q psy405 40 IGGKEIDLYLLYCLVT 55 (66)
Q Consensus 40 i~gk~lDL~~Ly~~V~ 55 (66)
-.+|+.|||+|-..+-
T Consensus 36 ~skKdsdLyqLPpslL 51 (59)
T PF03823_consen 36 HSKKDSDLYQLPPSLL 51 (59)
T ss_pred ccccCcchhhCCHHHH
Confidence 3458999999866543
No 43
>cd02872 GH18_chitolectin_chitotriosidase This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the la
Probab=26.47 E-value=93 Score=21.92 Aligned_cols=19 Identities=16% Similarity=0.408 Sum_probs=16.7
Q ss_pred hHhHHHHHHHHHHHHHhcC
Q psy405 13 FKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg 31 (66)
...++.|.+++.+|+++.|
T Consensus 94 ~~~r~~fi~~iv~~l~~~~ 112 (362)
T cd02872 94 PENRKTFIKSAIAFLRKYG 112 (362)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 4668899999999999986
No 44
>KOG4588|consensus
Probab=26.27 E-value=76 Score=22.76 Aligned_cols=32 Identities=16% Similarity=0.103 Sum_probs=26.7
Q ss_pred hHhHHHHHHHHHHHHHhcCCCCCCCceecCee
Q psy405 13 FKEKESFLRDLQHFHETRGTPCRHVPKIGGKE 44 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~ 44 (66)
......|++.++.|..+|.+.+...|+|+.-+
T Consensus 201 ~~~~~k~le~~R~~~~sr~sK~~sa~qidslp 232 (267)
T KOG4588|consen 201 ENFAAKILETLRKASGSRISKLFSAPQIDSLP 232 (267)
T ss_pred hhHHHHHHHHHHHHhhhhhhhcccccccccCC
Confidence 45667899999999999999998899886544
No 45
>PF09153 DUF1938: Domain of unknown function (DUF1938); InterPro: IPR015236 This domain, which is predominantly found in the archaeal protein O6-alkylguanine-DNA alkyltransferase, adopts a secondary structure consisting of a three stranded antiparallel beta-sheet and three alpha helices. The exact function has not, as yet, been defined, though it has been postulated that this domain may confer thermostability to the protein []. ; GO: 0005737 cytoplasm; PDB: 1MGT_A.
Probab=25.83 E-value=78 Score=19.14 Aligned_cols=24 Identities=33% Similarity=0.576 Sum_probs=16.0
Q ss_pred chHhHHHHHH----HHHHHHHhcCCCCC
Q psy405 12 YFKEKESFLR----DLQHFHETRGTPCR 35 (66)
Q Consensus 12 ~~~~~~~Fl~----~L~~F~~~rg~p~~ 35 (66)
+.-+..+|++ .|..|+.+||..+.
T Consensus 29 fslDg~efl~eri~~L~~~L~kRgv~v~ 56 (86)
T PF09153_consen 29 FSLDGEEFLRERISRLIEFLKKRGVSVS 56 (86)
T ss_dssp EESSHHHHHH-HHHHHHHHHHHTT----
T ss_pred EEeccHHHHHHHHHHHHHHHHhcCceeE
Confidence 4456677777 89999999998776
No 46
>PHA02128 hypothetical protein
Probab=25.74 E-value=68 Score=20.67 Aligned_cols=42 Identities=21% Similarity=0.196 Sum_probs=31.2
Q ss_pred HHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCcccceec
Q psy405 24 QHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWVKKCT 65 (66)
Q Consensus 24 ~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~~Vt~ 65 (66)
.+|+.++.+---.+-.+.||.--|..|-.+|...||..-+|.
T Consensus 39 ssflrehrtihlnlgr~tgkt~gl~~lane~~aqgg~r~itm 80 (151)
T PHA02128 39 SSFLREHRTIHLNLGRMTGKTTGLLHLANEVSAQGGARIITM 80 (151)
T ss_pred hHHHhhcceeEeeecccccccchHHHHHHHHHhcCCeEEEEe
Confidence 345555544322367889999999999999999999876653
No 47
>PF04442 CtaG_Cox11: Cytochrome c oxidase assembly protein CtaG/Cox11; InterPro: IPR007533 Cytochrome c oxidase assembly protein is essential for the assembly of functional cytochrome oxidase protein. In eukaryotes it is an integral protein of the mitochondrial inner membrane. Cox11 is essential for the insertion of Cu(I) ions to form the CuB site. This is essential for the stability of other structures in subunit I, for example haems a and a3, and the magnesium/manganese centre. Cox11 is probably only required in sub-stoichiometric amounts relative to the structural units []. The C-terminal region of the protein is known to form a dimer. Each monomer coordinates one Cu(I) ion via three conserved cysteine residues (111, 208 and 210) in Saccharomyces cerevisiae (P19516 from SWISSPROT). Met 224 is also thought to play a role in copper transfer or stabilising the copper site [].; GO: 0005507 copper ion binding; PDB: 1SO9_A 1SP0_A.
Probab=25.60 E-value=23 Score=23.27 Aligned_cols=19 Identities=32% Similarity=0.644 Sum_probs=0.0
Q ss_pred cchhhHHHHHHhcCcccce
Q psy405 45 IDLYLLYCLVTSNGGWVKK 63 (66)
Q Consensus 45 lDL~~Ly~~V~~~GG~~~V 63 (66)
|-||++|+.|...||-.+.
T Consensus 2 VPLY~~fC~vTG~~Gtt~~ 20 (152)
T PF04442_consen 2 VPLYDVFCEVTGFNGTTQR 20 (152)
T ss_dssp -------------------
T ss_pred CchHHHHHHHhCCCCEeCc
Confidence 5699999999999997765
No 48
>PTZ00128 cytochrome c oxidase assembly protein-like; Provisional
Probab=25.47 E-value=42 Score=23.63 Aligned_cols=20 Identities=40% Similarity=0.657 Sum_probs=16.5
Q ss_pred ecchhhHHHHHHhcCcccce
Q psy405 44 EIDLYLLYCLVTSNGGWVKK 63 (66)
Q Consensus 44 ~lDL~~Ly~~V~~~GG~~~V 63 (66)
-|-||++|+.|+..||-.+.
T Consensus 74 ~VPLY~~fC~~TG~~Gtt~~ 93 (232)
T PTZ00128 74 FVPLYRLFCQSTGYGGDADK 93 (232)
T ss_pred HhHHHHHHHHhcCCCccccc
Confidence 36799999999999996544
No 49
>cd06544 GH18_narbonin Narbonin is a plant 2S protein from the globulin fraction of narbon bean (Vicia narbonensis L.) cotyledons with unknown function. Narbonin has a glycosyl hydrolase family 18 (GH18) domain without the conserved catalytic residues and with no known enzymatic activity. Narbonin amounts to up to 3% of the total seed globulins of mature seeds and was thought to be a storage protein but was found to degrade too slowly during germination. This family also includes the VfNOD32 nodulin from Vicia faba.
Probab=25.34 E-value=97 Score=21.46 Aligned_cols=18 Identities=11% Similarity=0.010 Sum_probs=14.9
Q ss_pred HhHHHHHHHHHHHHHhcC
Q psy405 14 KEKESFLRDLQHFHETRG 31 (66)
Q Consensus 14 ~~~~~Fl~~L~~F~~~rg 31 (66)
.-+++|.+++.+|+++.|
T Consensus 96 ~~~~~fv~S~~~~l~~~~ 113 (253)
T cd06544 96 SWVSNAVSSLTSIIQTYN 113 (253)
T ss_pred hHHHHHHHHHHHHHHHhC
Confidence 345677999999999998
No 50
>cd02875 GH18_chitobiase Chitobiase (also known as di-N-acetylchitobiase) is a lysosomal glycosidase that hydrolyzes the reducing-end N-acetylglucosamine from the chitobiose core of oligosaccharides during the ordered degradation of asparagine-linked glycoproteins in eukaryotes. Chitobiase can only do so if the asparagine that joins the oligosaccharide to protein is previously removed by a glycosylasparaginase. Chitobiase is therefore the final step in the lysosomal degradation of the protein/carbohydrate linkage component of asparagine-linked glycoproteins. The catalytic domain of chitobiase is an eight-stranded alpha/beta barrel fold similar to that of other family 18 glycosyl hydrolases such as hevamine and chitotriosidase.
Probab=25.24 E-value=93 Score=22.44 Aligned_cols=20 Identities=0% Similarity=0.016 Sum_probs=17.3
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+++.+|+++.|
T Consensus 93 ~~~~R~~fi~siv~~~~~~g 112 (358)
T cd02875 93 NPTYRTQWIQQKVELAKSQF 112 (358)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 44678899999999999996
No 51
>COG0317 SpoT Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]
Probab=25.04 E-value=38 Score=27.33 Aligned_cols=48 Identities=13% Similarity=0.217 Sum_probs=37.6
Q ss_pred CCCccchHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcC
Q psy405 7 KDGDAYFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNG 58 (66)
Q Consensus 7 ~~~~~~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~G 58 (66)
..-.+...-=..|.+.|..-+++.|.... |.||+=-+|..|+.-+..|
T Consensus 209 e~r~~re~~i~~~~~~l~~~L~~~gi~a~----v~gR~KhiYSIyrKM~~k~ 256 (701)
T COG0317 209 EKRLEREQYIENVVSELREELKAAGIKAE----VSGRPKHIYSIYRKMQKKK 256 (701)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCeEE----EEcCCCcccHHHHHHHHcc
Confidence 33344445557799999999999998855 9999999999999866543
No 52
>cd07347 harmonin_N_like N-terminal protein-binding module of harmonin and similar domains. This domain is found in harmonin, and similar proteins such as delphilin, and whirlin. These are postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold proteins. Harmonin and whirlin are organizers of the Usher protein network of the inner ear and the retina, delphilin is found at the cerebellar parallel fiber-Purkinje cell synapses. This harmonin_N_like domain is found in either one or two copies. Harmonin contains a single copy, which is found at its N-terminus and binds specifically to a short internal peptide fragment of the cadherin 23 cytoplasmic domain; cadherin 23 is a component of the Usher protein network. Whirlin contains two copies of the harmonin_N_like domain; the first of these has been assayed for interaction with the cytoplasmic domain of cadherin 23 and no interaction could be detected.
Probab=24.97 E-value=98 Score=18.14 Aligned_cols=21 Identities=29% Similarity=0.487 Sum_probs=17.7
Q ss_pred chHhHHHHHHHHHHHHHhcCC
Q psy405 12 YFKEKESFLRDLQHFHETRGT 32 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg~ 32 (66)
...++..|...|..|++.++.
T Consensus 17 ~~~Er~~~~~~L~~Y~~~~~V 37 (78)
T cd07347 17 TDAEREQVTRALERYHQERNV 37 (78)
T ss_pred CHHHHHHHHHHHHHHHhcCCH
Confidence 356899999999999999853
No 53
>PF00727 IL4: Interleukin 4 This family is a subset of the SCOP family; InterPro: IPR002354 Cytokines are protein messengers that carry information from cell to cell []. Interleukin is one such molecule, and participates in several B-cell activation processes: e.g., it enhances production and secretion of IgG1 and IgE []; it induces expression of class II major histocompatability complex (MHC) molecules on resting B-cells; and it regulates expression of the low affinity Fc receptor for IgE on lymphocytes and monocytes. Interleukin-4 (IL4) has a compact, globular fold (similar to other cytokines), stabilised by 3 disulphide bonds []. One half of the structure is dominated by a 4 alpha-helix bundle with a left-handed twist []. The helices are anti-parallel, with 2 overhand connections, which fall into a 2-stranded anti-parallel beta-sheet [].; GO: 0005136 interleukin-4 receptor binding, 0008083 growth factor activity, 0006955 immune response, 0005576 extracellular region; PDB: 1HIK_A 1HZI_A 1ITI_A 2INT_A 1RCB_A 1CYL_A 3QB7_A 1BBN_A 2B8Z_A 1ITM_A ....
Probab=24.74 E-value=65 Score=20.43 Aligned_cols=16 Identities=19% Similarity=0.414 Sum_probs=13.0
Q ss_pred hHHHHHHHHHHHHHhc
Q psy405 15 EKESFLRDLQHFHETR 30 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~r 30 (66)
.=+.|++.|..+|+++
T Consensus 100 tLkdFLe~Lk~imq~k 115 (117)
T PF00727_consen 100 TLKDFLERLKTIMQEK 115 (117)
T ss_dssp EHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHhh
Confidence 3578999999999864
No 54
>PLN00162 transport protein sec23; Provisional
Probab=24.42 E-value=95 Score=25.00 Aligned_cols=49 Identities=20% Similarity=0.278 Sum_probs=40.7
Q ss_pred chHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCccc
Q psy405 12 YFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~ 61 (66)
.-.....|+++|..=..+.|..+. +=..+...+||+.+-..+..-||.-
T Consensus 323 ~~~~a~~fY~~la~~~~~~gisvD-lF~~s~dqvglaem~~l~~~TGG~v 371 (761)
T PLN00162 323 YYKKAVKFYEGLAKQLVAQGHVLD-VFACSLDQVGVAEMKVAVERTGGLV 371 (761)
T ss_pred hcchHHHHHHHHHHHHHHcCceEE-EEEccccccCHHHHhhhHhhcCcEE
Confidence 445677899999999999998877 5456678899999999999999964
No 55
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif []. The CCT motif is found in the Arabidopsis circadian rhythm protein TOC1, an autoregulatory response regulator homologue the controls the photoperiodic flowering through its clock function []. ; GO: 0005515 protein binding; PDB: 3OGK_V 3OGL_S 3OGM_W.
Probab=24.28 E-value=56 Score=15.65 Aligned_cols=14 Identities=36% Similarity=0.411 Sum_probs=8.4
Q ss_pred HHHHHHHHhcCCCC
Q psy405 21 RDLQHFHETRGTPC 34 (66)
Q Consensus 21 ~~L~~F~~~rg~p~ 34 (66)
.+|+.|+++|..-+
T Consensus 6 ~SLqRFLeKRK~R~ 19 (27)
T PF09425_consen 6 ASLQRFLEKRKDRL 19 (27)
T ss_dssp HHHHHHHHHH----
T ss_pred HHHHHHHHHHHHhh
Confidence 57899999886443
No 56
>KOG2265|consensus
Probab=24.17 E-value=60 Score=22.07 Aligned_cols=20 Identities=10% Similarity=0.069 Sum_probs=17.6
Q ss_pred hHhHHHHHHHHHHHHHhcCC
Q psy405 13 FKEKESFLRDLQHFHETRGT 32 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg~ 32 (66)
..+...+++.|.+||.+.|.
T Consensus 151 ~sde~~~~d~Lkk~~~~~~~ 170 (179)
T KOG2265|consen 151 TSDELKKHDMLKKFMDQHPE 170 (179)
T ss_pred CCchhhHHHHHHHHHHhCCC
Confidence 56778899999999999986
No 57
>cd06546 GH18_CTS3_chitinase GH18 domain of CTS3 (chitinase 3), an uncharacterized protein from the human fungal pathogen Coccidioides posadasii. CTS3 has a chitinase-like glycosyl hydrolase family 18 (GH18) domain; and has homologs in bacteria as well as fungi.
Probab=23.72 E-value=1.1e+02 Score=21.06 Aligned_cols=20 Identities=20% Similarity=0.310 Sum_probs=16.9
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|...+.+|+.+-|
T Consensus 93 ~~~~r~~f~~s~~~~~~~~~ 112 (256)
T cd06546 93 DDEDFERYYGQLRDMIRRRG 112 (256)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 45678899999999998876
No 58
>PF05066 HARE-HTH: HB1, ASXL, restriction endonuclease HTH domain; InterPro: IPR007759 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The delta protein is a dispensable subunit of Bacillus subtilis RNA polymerase (RNAP) that has major effects on the biochemical properties of the purified enzyme. In the presence of delta, RNAP displays an increased specificity of transcription, a decreased affinity for nucleic acids, and an increased efficiency of RNA synthesis because of enhanced recycling []. The delta protein, contains two distinct regions, an N-terminal domain and a glutamate and aspartate residue-rich C-terminal region [].; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent; PDB: 2KRC_A.
Probab=23.61 E-value=41 Score=18.49 Aligned_cols=21 Identities=14% Similarity=0.333 Sum_probs=15.0
Q ss_pred CeecchhhHHHHHHhcCcccc
Q psy405 42 GKEIDLYLLYCLVTSNGGWVK 62 (66)
Q Consensus 42 gk~lDL~~Ly~~V~~~GG~~~ 62 (66)
|+|+..-.++..+.++|++..
T Consensus 15 ~~pm~~~eI~~~i~~~~~~~~ 35 (72)
T PF05066_consen 15 GRPMTFKEIWEEIQERGLYKK 35 (72)
T ss_dssp -S-EEHHHHHHHHHHHHTS--
T ss_pred CCCcCHHHHHHHHHHhCCCCc
Confidence 466778899999999999874
No 59
>PF02337 Gag_p10: Retroviral GAG p10 protein; InterPro: IPR003322 Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes []. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds, their primary sequences can be very different. This entry represents matrix proteins from beta-retroviruses such as Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) and Mouse mammary tumor virus (MMTV) [, ]. This entry also identifies matrix proteins from several eukaryotic endogenous retroviruses, which arise when one or more copies of the retroviral genome becomes integrated into the host genome [].; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 2F77_X 2F76_X.
Probab=23.25 E-value=73 Score=19.17 Aligned_cols=21 Identities=29% Similarity=0.564 Sum_probs=14.8
Q ss_pred hHHHHHHHHHHHHHhcCCCCC
Q psy405 15 EKESFLRDLQHFHETRGTPCR 35 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~rg~p~~ 35 (66)
.++.|+..|...+..||..++
T Consensus 6 S~~~fv~~Lk~lLk~rGi~v~ 26 (90)
T PF02337_consen 6 SKQPFVSILKHLLKERGIRVK 26 (90)
T ss_dssp -HHHHHHHHHHHHHCCT----
T ss_pred hhhHHHHHHHHHHHHcCeeec
Confidence 337999999999999997776
No 60
>PF06396 AGTRAP: Angiotensin II, type I receptor-associated protein (AGTRAP); InterPro: IPR009436 This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=23.14 E-value=42 Score=22.33 Aligned_cols=21 Identities=29% Similarity=0.490 Sum_probs=17.8
Q ss_pred CeecchhhHHHHHHhcCcccc
Q psy405 42 GKEIDLYLLYCLVTSNGGWVK 62 (66)
Q Consensus 42 gk~lDL~~Ly~~V~~~GG~~~ 62 (66)
=||+-++.||++=.+|||.-.
T Consensus 100 lKP~s~~~l~r~~reRGG~l~ 120 (162)
T PF06396_consen 100 LKPISCFFLYRMYRERGGELS 120 (162)
T ss_pred HHHHHHHHHHHHHHhcCCcee
Confidence 368899999999999999544
No 61
>cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Probab=23.11 E-value=73 Score=21.35 Aligned_cols=47 Identities=23% Similarity=0.222 Sum_probs=38.5
Q ss_pred HhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCccc
Q psy405 14 KEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 14 ~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~ 61 (66)
.....|+++|..-..+.|..+. +=..+...+|+..|-..+..-||.-
T Consensus 172 ~~a~~fY~~la~~~~~~~isvd-lF~~~~~~~dl~~l~~l~~~TGG~v 218 (239)
T cd01468 172 KPATKFYKSLAKECVKSGICVD-LFAFSLDYVDVATLKQLAKSTGGQV 218 (239)
T ss_pred cccHHHHHHHHHHHHHcCeEEE-EEeccccccCHHHhhhhhhcCCceE
Confidence 3457899999999999998777 4455677899999999999989863
No 62
>PF03875 Statherin: Statherin; InterPro: IPR005575 Statherin functions biologically to inhibit the nucleation and growth of calcium phosphate minerals. The N terminus of statherin is highly charged, the glutamic acids of which have been shown to be important in the recognition hydroxyapatite [].
Probab=23.02 E-value=93 Score=16.25 Aligned_cols=20 Identities=40% Similarity=0.566 Sum_probs=14.7
Q ss_pred hHHHHHHHHHHHHHhcCCCC
Q psy405 15 EKESFLRDLQHFHETRGTPC 34 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~rg~p~ 34 (66)
.++.|+..|..|-.-|=-|.
T Consensus 3 seekflrrl~r~~~grygpy 22 (42)
T PF03875_consen 3 SEEKFLRRLGRFFYGRYGPY 22 (42)
T ss_pred hHHHHHHHHHhhcccccCCc
Confidence 45699999999987753333
No 63
>PF07875 Coat_F: Coat F domain; InterPro: IPR012851 The Coat F proteins contribute to the Bacillales spore coat. They occur multiple times in the genomes in which they are found. Bacillus subtilis endospore protein coats protect them and may play a role in their germination []. Spore coat protein F, on the outer surface of the endospore, is one of a suite of proteins that could be used to differentiate between members of the Bacillus genus [].
Probab=22.71 E-value=69 Score=17.27 Aligned_cols=14 Identities=21% Similarity=0.491 Sum_probs=9.5
Q ss_pred hhHHHHHHhcCccc
Q psy405 48 YLLYCLVTSNGGWV 61 (66)
Q Consensus 48 ~~Ly~~V~~~GG~~ 61 (66)
+.||....++|+|+
T Consensus 48 ~~l~~~m~~kGwY~ 61 (64)
T PF07875_consen 48 YELFNYMNQKGWYQ 61 (64)
T ss_pred HHHHHHHHHcCCcC
Confidence 56677777777764
No 64
>cd02871 GH18_chitinase_D-like GH18 domain of Chitinase D (ChiD). ChiD, a chitinase found in Bacillus circulans, hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins. The domain architecture of ChiD includes a catalytic glycosyl hydrolase family 18 (GH18) domain, a chitin-binding domain, and a fibronectin type III domain. The chitin-binding and fibronectin type III domains are located either N-terminal or C-terminal to the catalytic domain. This family includes exochitinase Chi36 from Bacillus cereus.
Probab=22.51 E-value=1.1e+02 Score=21.52 Aligned_cols=20 Identities=10% Similarity=0.267 Sum_probs=17.0
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+.|.+|+.+.|
T Consensus 91 ~~~~~~~fa~sl~~~~~~~g 110 (312)
T cd02871 91 HTAQEDNFVDSIVAIIKEYG 110 (312)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 45678999999999998886
No 65
>COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis]
Probab=22.05 E-value=1.7e+02 Score=22.86 Aligned_cols=48 Identities=33% Similarity=0.635 Sum_probs=32.4
Q ss_pred hHHHHHHHHHHHHHhcC-----CCCCCCceecC------------eecchh-----hHHHHHHhcCcccce
Q psy405 15 EKESFLRDLQHFHETRG-----TPCRHVPKIGG------------KEIDLY-----LLYCLVTSNGGWVKK 63 (66)
Q Consensus 15 ~~~~Fl~~L~~F~~~rg-----~p~~~~P~i~g------------k~lDL~-----~Ly~~V~~~GG~~~V 63 (66)
.+..-++.++.|+..+| ||+- -|+-|| -++||| .||..=.-.||+++|
T Consensus 182 ~Rs~ii~~iR~fl~~~gFlEVETP~l-q~i~GGA~ArPF~ThhNald~dlyLRIApELyLKRliVGG~erV 251 (502)
T COG1190 182 KRSKIIRAIREFLDDRGFLEVETPML-QPIPGGAAARPFITHHNALDMDLYLRIAPELYLKRLIVGGFERV 251 (502)
T ss_pred HHHHHHHHHHHHHHHCCCeEeccccc-cccCCCcccccceeeecccCCceEEeeccHHHHHHHHhcCchhh
Confidence 44555778999999998 6665 333333 234555 477776677999887
No 66
>PF03808 Glyco_tran_WecB: Glycosyl transferase WecB/TagA/CpsF family; InterPro: IPR004629 The WecG member of this superfamily, believed to be UDP-N-acetyl-D-mannosaminuronic acid transferase, plays a role in Enterobacterial common antigen (eca) synthesis in Escherichia coli. Another family member, the Bacillus subtilis TagA protein, is involved in the biosynthesis of the cell wall polymer poly(glycerol phosphate). The third family member, CpsF, CMP-N-acetylneuraminic acid synthetase has a role in the capsular polysaccharide biosynthesis pathway.; GO: 0009058 biosynthetic process
Probab=21.83 E-value=1.3e+02 Score=19.24 Aligned_cols=53 Identities=19% Similarity=0.083 Sum_probs=39.4
Q ss_pred CcccCCCCccchHhHHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcC
Q psy405 2 AKVTKKDGDAYFKEKESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNG 58 (66)
Q Consensus 2 ~~~~~~~~~~~~~~~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~G 58 (66)
++.++.+-+ ..-...+|+..|.+..++++.++. .+||++-.+-++.....+..
T Consensus 20 ~~~~g~~~~-~rv~g~dl~~~l~~~~~~~~~~if---llG~~~~~~~~~~~~l~~~y 72 (172)
T PF03808_consen 20 ARLLGRPLP-ERVTGSDLFPDLLRRAEQRGKRIF---LLGGSEEVLEKAAANLRRRY 72 (172)
T ss_pred HHHcCCCCC-cccCHHHHHHHHHHHHHHcCCeEE---EEeCCHHHHHHHHHHHHHHC
Confidence 455644444 566778899999999999987555 79999988888777766654
No 67
>PRK12445 lysyl-tRNA synthetase; Reviewed
Probab=21.81 E-value=1.6e+02 Score=22.57 Aligned_cols=50 Identities=30% Similarity=0.531 Sum_probs=31.9
Q ss_pred hHhHHHHHHHHHHHHHhcC-----CCCCCCceecC---ee---------cchh-----hHHHHHHhcCcccce
Q psy405 13 FKEKESFLRDLQHFHETRG-----TPCRHVPKIGG---KE---------IDLY-----LLYCLVTSNGGWVKK 63 (66)
Q Consensus 13 ~~~~~~Fl~~L~~F~~~rg-----~p~~~~P~i~g---k~---------lDL~-----~Ly~~V~~~GG~~~V 63 (66)
-..+...+..+++|+.++| ||+- .|+-+| +| .++| .||..-.-.||+++|
T Consensus 184 ~r~Rs~i~~~iR~f~~~~gFiEVeTPiL-~~~~gGa~a~pF~t~~~~~~~~~yL~~SpELylKrlivgG~~rV 255 (505)
T PRK12445 184 FVVRSKILAAIRQFMVARGFMEVETPMM-QVIPGGASARPFITHHNALDLDMYLRIAPELYLKRLVVGGFERV 255 (505)
T ss_pred HHHHHHHHHHHHHHHHHCCCEEeeCCee-EecCCCCcccceecccccCCcceeeecCHHHHHHHHHhccCCcE
Confidence 3567788899999999998 4433 122222 12 2233 377766667888887
No 68
>PF10813 DUF2733: Protein of unknown function (DUF2733); InterPro: IPR024360 The UL11 gene product of herpes simplex virus is a membrane-associated tegument protein that is incorporated into the HSV virion and functions in viral envelopment []. UL11 is acylated, which is crucial for lipid raft association [].
Probab=20.57 E-value=42 Score=16.74 Aligned_cols=13 Identities=31% Similarity=0.588 Sum_probs=10.4
Q ss_pred ecCeecchhhHHH
Q psy405 40 IGGKEIDLYLLYC 52 (66)
Q Consensus 40 i~gk~lDL~~Ly~ 52 (66)
+.|++|||+.=|.
T Consensus 18 v~G~~Inl~~dFe 30 (32)
T PF10813_consen 18 VKGNPINLYKDFE 30 (32)
T ss_pred cCCCEEechhccc
Confidence 8899999987553
No 69
>cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24
Probab=20.37 E-value=1.1e+02 Score=20.88 Aligned_cols=45 Identities=20% Similarity=0.257 Sum_probs=37.8
Q ss_pred HHHHHHHHHHHHHhcCCCCCCCceecCeecchhhHHHHHHhcCccc
Q psy405 16 KESFLRDLQHFHETRGTPCRHVPKIGGKEIDLYLLYCLVTSNGGWV 61 (66)
Q Consensus 16 ~~~Fl~~L~~F~~~rg~p~~~~P~i~gk~lDL~~Ly~~V~~~GG~~ 61 (66)
...|+++|..-+.+.|..+. +=......+||..+-..+..-||.-
T Consensus 171 ~~~fY~~la~~~~~~~isvD-lF~~~~~~~dla~l~~l~~~TGG~v 215 (244)
T cd01479 171 QTDFYKKLALECVKSQISVD-LFLFSNQYVDVATLGCLSRLTGGQV 215 (244)
T ss_pred chHHHHHHHHHHHHcCeEEE-EEEccCcccChhhhhhhhhhcCceE
Confidence 46899999999999998877 4455677899999999999999953
No 70
>cd06542 GH18_EndoS-like Endo-beta-N-acetylglucosaminidases are bacterial chitinases that hydrolyze the chitin core of various asparagine (N)-linked glycans and glycoproteins. The endo-beta-N-acetylglucosaminidases have a glycosyl hydrolase family 18 (GH18) catalytic domain. Some members also have an additional C-terminal glycosyl hydrolase family 20 (GH20) domain while others have an N-terminal domain of unknown function (pfam08522). Members of this family include endo-beta-N-acetylglucosaminidase S (EndoS) from Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and EndoH from Flavobacterium meningosepticum, and EndoE from Enterococcus faecalis. EndoS is a secreted endoglycosidase from Streptococcus pyogenes that specifically hydrolyzes the glycan on human IgG between two core N-acetylglucosamine residues. EndoE is a secreted endoglycosidase, encoded by the ndoE gene in Enterococcus faecalis, that hydrolyzes the glycan on human RNase B.
Probab=20.03 E-value=1.6e+02 Score=19.59 Aligned_cols=20 Identities=5% Similarity=0.110 Sum_probs=16.9
Q ss_pred chHhHHHHHHHHHHHHHhcC
Q psy405 12 YFKEKESFLRDLQHFHETRG 31 (66)
Q Consensus 12 ~~~~~~~Fl~~L~~F~~~rg 31 (66)
....++.|.+.+.+++.+-|
T Consensus 85 ~~~~~~~fa~~l~~~v~~yg 104 (255)
T cd06542 85 SDAAAKAYAKAIVDTVDKYG 104 (255)
T ss_pred CHHHHHHHHHHHHHHHHHhC
Confidence 44668999999999998876
Done!