Query psy13631
Match_columns 81
No_of_seqs 120 out of 626
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 16:12:45 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13631.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13631hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.2 8.8E-11 1.9E-15 86.4 7.2 69 11-80 978-1053(1289)
2 KOG1215|consensus 99.1 5.6E-10 1.2E-14 82.9 8.5 75 2-81 389-466 (877)
3 smart00135 LY Low-density lipo 98.4 1.4E-06 3.1E-11 41.6 4.9 32 49-80 6-37 (43)
4 PF12662 cEGF: Complement Clr- 97.8 9.9E-06 2.1E-10 35.0 1.1 15 2-16 5-20 (24)
5 PF14670 FXa_inhibition: Coagu 97.6 3.1E-05 6.8E-10 36.6 1.3 14 2-15 22-36 (36)
6 PF00058 Ldl_recept_b: Low-den 97.1 0.00076 1.7E-08 32.7 2.8 17 64-80 1-18 (42)
7 KOG1215|consensus 96.7 0.0028 6E-08 47.8 4.3 40 2-42 694-734 (877)
8 PF09064 Tme5_EGF_like: Thromb 96.3 0.0027 6E-08 29.4 1.5 14 2-15 21-34 (34)
9 KOG1214|consensus 96.2 0.011 2.4E-07 45.0 4.8 57 22-80 1038-1096(1289)
10 PF02822 Antistasin: Antistasi 93.6 0.043 9.4E-07 23.9 1.1 12 1-12 8-20 (26)
11 PF08450 SGL: SMP-30/Gluconola 93.5 0.17 3.8E-06 32.3 4.3 25 56-80 4-28 (246)
12 KOG4659|consensus 92.5 0.63 1.4E-05 37.6 6.5 62 19-80 374-435 (1899)
13 PF07645 EGF_CA: Calcium-bindi 91.6 0.088 1.9E-06 25.2 0.8 14 2-15 28-42 (42)
14 PF07995 GSDH: Glucose / Sorbo 91.3 0.88 1.9E-05 30.9 5.6 44 29-74 148-203 (331)
15 PF08450 SGL: SMP-30/Gluconola 88.7 2.9 6.4E-05 26.6 6.3 45 31-79 117-161 (246)
16 COG2133 Glucose/sorbosone dehy 87.4 3.7 7.9E-05 29.1 6.5 26 49-74 236-261 (399)
17 TIGR02604 Piru_Ver_Nterm putat 85.0 4.8 0.0001 27.6 6.1 44 30-77 165-208 (367)
18 PF01436 NHL: NHL repeat; Int 82.7 2.9 6.3E-05 18.0 4.8 26 52-78 2-27 (28)
19 KOG1520|consensus 80.9 6.2 0.00013 27.8 5.3 52 22-74 128-182 (376)
20 TIGR03606 non_repeat_PQQ dehyd 80.7 6 0.00013 28.5 5.3 49 30-79 200-259 (454)
21 PLN02919 haloacid dehalogenase 79.8 13 0.00028 29.5 7.2 57 22-78 581-650 (1057)
22 PF01826 TIL: Trypsin Inhibito 79.3 0.83 1.8E-05 22.8 0.5 15 2-16 36-50 (55)
23 COG3386 Gluconolactonase [Carb 78.2 2.5 5.3E-05 28.8 2.7 23 57-79 30-52 (307)
24 TIGR01492 CPW_WPC Plasmodium f 75.4 1.9 4.1E-05 22.4 1.2 16 2-17 9-25 (62)
25 PF12661 hEGF: Human growth fa 75.4 1.3 2.9E-05 16.0 0.5 7 2-8 3-9 (13)
26 PLN02919 haloacid dehalogenase 73.5 31 0.00068 27.5 7.8 58 21-78 636-709 (1057)
27 COG3386 Gluconolactonase [Carb 72.0 20 0.00043 24.4 5.8 29 51-79 162-190 (307)
28 PF01731 Arylesterase: Arylest 69.2 17 0.00038 20.1 4.8 31 48-78 50-80 (86)
29 PF07995 GSDH: Glucose / Sorbo 69.1 33 0.00072 23.3 7.2 55 23-78 273-331 (331)
30 PF04942 CC: CC domain; Inter 69.0 3.6 7.9E-05 19.2 1.3 14 2-15 19-33 (36)
31 KOG0315|consensus 68.1 15 0.00032 25.0 4.4 47 20-67 52-99 (311)
32 PF02425 GBP_PSP: Paralytic/GB 66.7 4.1 8.8E-05 17.0 1.0 14 2-16 6-20 (23)
33 smart00032 CCP Domain abundant 65.6 2.6 5.7E-05 20.1 0.4 15 2-16 29-45 (57)
34 PF13948 DUF4215: Domain of un 64.4 5.1 0.00011 19.6 1.3 15 2-16 3-18 (47)
35 smart00179 EGF_CA Calcium-bind 64.0 4.2 9.1E-05 18.1 1.0 6 3-8 28-33 (39)
36 PF00396 Granulin: Granulin; 62.8 4 8.6E-05 19.8 0.7 14 3-16 28-42 (43)
37 KOG4499|consensus 58.0 9.2 0.0002 25.8 2.0 23 56-78 19-41 (310)
38 PF09717 CPW_WPC: Plasmodium f 57.4 4.8 0.0001 20.6 0.6 16 2-17 8-23 (60)
39 cd00033 CCP Complement control 56.5 5 0.00011 19.2 0.5 15 2-16 28-44 (57)
40 PF14339 DUF4394: Domain of un 55.9 44 0.00096 22.1 4.9 25 52-77 27-51 (236)
41 PF06977 SdiA-regulated: SdiA- 55.3 54 0.0012 21.7 5.3 33 46-78 15-48 (248)
42 PF14280 DUF4365: Domain of un 50.4 38 0.00082 19.6 3.7 24 53-76 84-107 (138)
43 PF13947 GUB_WAK_bind: Wall-as 50.0 29 0.00063 19.3 3.1 33 4-39 26-59 (106)
44 PF12947 EGF_3: EGF domain; I 49.9 6.9 0.00015 18.1 0.4 12 3-15 25-36 (36)
45 PRK11028 6-phosphogluconolacto 49.0 76 0.0016 21.0 7.6 54 23-76 4-59 (330)
46 smart00494 ChtBD2 Chitin-bindi 46.3 17 0.00038 17.2 1.6 16 1-16 31-47 (56)
47 PF06045 Rhamnogal_lyase: Rham 45.6 48 0.001 21.5 3.8 42 27-69 20-61 (203)
48 PF12075 KN_motif: KN motif; 45.2 13 0.00027 17.7 0.9 15 2-16 5-21 (39)
49 PF03088 Str_synth: Strictosid 44.1 57 0.0012 18.1 4.8 31 49-79 54-84 (89)
50 PF00084 Sushi: Sushi domain ( 42.7 7 0.00015 18.7 -0.2 15 2-16 28-44 (56)
51 PF14761 HPS3_N: Hermansky-Pud 42.7 96 0.0021 20.3 5.6 49 21-70 29-78 (215)
52 PF05098 LEF-4: Late expressio 41.1 70 0.0015 23.3 4.4 58 7-72 238-299 (450)
53 PF03022 MRJP: Major royal jel 40.3 66 0.0014 21.6 4.0 25 53-78 187-211 (287)
54 PF09603 Fib_succ_major: Fibro 39.3 9.9 0.00021 23.2 0.0 9 2-10 69-77 (184)
55 PF08366 LLGL: LLGL2; InterPr 38.7 80 0.0017 18.3 4.3 11 19-29 27-37 (105)
56 smart00181 EGF Epidermal growt 38.0 17 0.00036 15.7 0.7 7 3-9 24-30 (35)
57 smart00284 OLF Olfactomedin-li 37.2 91 0.002 20.9 4.2 41 34-74 205-246 (255)
58 PTZ00444 hypothetical protein; 36.5 18 0.0004 23.0 0.9 14 2-16 104-118 (184)
59 PF00622 SPRY: SPRY domain; I 35.2 82 0.0018 17.4 3.9 21 52-72 67-87 (124)
60 PF02191 OLF: Olfactomedin-lik 34.8 1.4E+02 0.003 19.8 6.7 44 34-77 200-244 (250)
61 smart00104 ANATO Anaphylatoxin 34.8 16 0.00035 16.4 0.4 14 3-16 2-16 (35)
62 PF01607 CBM_14: Chitin bindin 34.7 19 0.00041 17.0 0.6 15 2-16 30-45 (53)
63 PHA02885 putative interleukin 33.9 1E+02 0.0022 18.1 4.3 15 58-72 55-69 (135)
64 KOG4649|consensus 32.6 1.7E+02 0.0037 20.3 5.2 19 59-77 38-56 (354)
65 PF10282 Lactonase: Lactonase, 32.0 1.6E+02 0.0035 19.9 9.1 58 19-77 255-317 (345)
66 KOG0650|consensus 31.2 68 0.0015 24.5 3.1 51 19-72 577-637 (733)
67 PF05281 Secretogranin_V: Neur 31.1 22 0.00047 23.1 0.6 15 2-18 102-116 (208)
68 PF12191 stn_TNFRSF12A: Tumour 28.8 34 0.00074 20.6 1.1 15 2-16 35-50 (129)
69 PF01623 Carla_C4: Carlavirus 28.7 43 0.00093 19.0 1.4 8 11-18 73-80 (91)
70 PF12093 Corona_NS8: Coronavir 27.3 32 0.0007 19.9 0.8 19 19-37 43-61 (118)
71 PF02664 LuxS: S-Ribosylhomocy 27.3 36 0.00077 21.2 1.0 10 1-10 80-89 (157)
72 PF00008 EGF: EGF-like domain 26.9 30 0.00065 15.2 0.5 7 2-8 23-29 (32)
73 smart00289 WR1 Worm-specific r 25.4 34 0.00073 15.2 0.6 9 2-10 21-29 (38)
74 KOG1446|consensus 24.3 2.5E+02 0.0055 19.5 5.5 48 18-71 111-158 (311)
75 PF10015 DUF2258: Uncharacteri 24.2 80 0.0017 17.2 1.9 15 54-68 61-75 (75)
76 PF06387 Calcyon: D1 dopamine 24.2 39 0.00084 21.5 0.8 14 2-16 112-125 (186)
77 KOG4283|consensus 23.8 2.7E+02 0.0058 19.7 6.7 53 22-76 158-213 (397)
78 smart00449 SPRY Domain in SPla 23.6 1.4E+02 0.0031 16.4 3.8 24 52-75 66-89 (122)
79 PRK02260 S-ribosylhomocysteina 23.1 42 0.00092 20.9 0.8 10 1-10 80-89 (158)
80 PHA02639 EEV host range protei 22.9 43 0.00093 22.8 0.9 15 2-16 177-193 (295)
81 PF00020 TNFR_c6: TNFR/NGFR cy 22.5 59 0.0013 14.7 1.1 14 3-16 1-16 (39)
82 KOG1007|consensus 22.5 91 0.002 21.8 2.4 47 22-69 186-232 (370)
83 COG1854 LuxS LuxS protein invo 22.5 52 0.0011 20.5 1.1 12 1-12 80-92 (161)
84 PF02655 ATP-grasp_3: ATP-gras 22.0 82 0.0018 18.9 1.9 25 49-73 132-156 (161)
85 TIGR02145 Fib_succ_major Fibro 21.6 31 0.00067 21.5 0.0 9 2-10 59-67 (171)
86 PF08310 LGFP: LGFP repeat; I 21.3 81 0.0017 15.5 1.5 12 61-72 21-32 (54)
87 PF12860 PAS_7: PAS fold 20.9 1.3E+02 0.0028 16.5 2.5 19 50-69 3-21 (115)
88 PF13619 KTSC: KTSC domain 20.7 1.2E+02 0.0025 15.2 2.1 16 52-67 4-19 (60)
89 PF03302 VSP: Giardia variant- 20.1 59 0.0013 23.0 1.1 15 2-16 3-18 (397)
No 1
>KOG1214|consensus
Probab=99.18 E-value=8.8e-11 Score=86.38 Aligned_cols=69 Identities=26% Similarity=0.349 Sum_probs=56.9
Q ss_pred CCCCCCCC---CCcEEEEEeCCcEEEEeCCCCCcee----eEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631 11 DNFTCANG---SQEMLLLVQRTDICKISLDTPDHTN----VMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES 80 (81)
Q Consensus 11 d~~tC~~~---~~~~Ll~s~~~~I~~i~l~~~~~~~----~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s 80 (81)
++.+|... ...||||+++..|.++++++..+.. .++.+++ +-+++||||+++++|||+|+...+|.|+|
T Consensus 978 p~~~~~v~p~~~gt~LL~aqg~~I~~lplng~~~~K~~ak~~l~~p~-~IiVGidfDC~e~mvyWtDv~g~SI~ras 1053 (1289)
T KOG1214|consen 978 PTPRPDVTPPSVGTFLLYAQGQQIGYLPLNGTRLQKDAAKTLLSLPG-SIIVGIDFDCRERMVYWTDVAGRSISRAS 1053 (1289)
T ss_pred CCCCCCCcCCCCcceEEEeccceEEEeecCcchhchhhhhceEeccc-ceeeeeecccccceEEEeecCCCcccccc
Confidence 66777722 3569999999999999999887543 2244455 77899999999999999999999999987
No 2
>KOG1215|consensus
Probab=99.10 E-value=5.6e-10 Score=82.89 Aligned_cols=75 Identities=33% Similarity=0.603 Sum_probs=66.3
Q ss_pred CCCCceEEc-CCCCCC--CCCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 2 FHFYGVRLV-DNFTCA--NGSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~--~~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
-|++||.|. ++ |+ ..+++||+++++.+|++++++.. ....|+.++.+++++|+|..++++||+|...+.|.+
T Consensus 389 ~c~~g~~~~~~~--c~~~~~~~~~l~~s~~~~ir~~~~~~~---~~~~p~~~~~~~~~~d~d~~~~~i~~~d~~~~~i~~ 463 (877)
T KOG1215|consen 389 ACSPGYELRLDK--CEASDQPEAFLLFSNRHDIRRISLDCS---DVSRPLEGIKNAVALDFDVLNNRIYWADLSDEKICR 463 (877)
T ss_pred ecCCCcEeccCC--ceecCCCCcEEEEecCccceecccCCC---cceEEccCCccceEEEEEecCCEEEEEeccCCeEee
Confidence 499999999 87 76 44899999999999999999986 456677788899999999999999999999999998
Q ss_pred ecC
Q psy13631 79 ESH 81 (81)
Q Consensus 79 ~s~ 81 (81)
+++
T Consensus 464 ~~~ 466 (877)
T KOG1215|consen 464 ASQ 466 (877)
T ss_pred ecc
Confidence 763
No 3
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.38 E-value=1.4e-06 Score=41.61 Aligned_cols=32 Identities=28% Similarity=0.361 Sum_probs=28.0
Q ss_pred CCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631 49 VGVKHAIAIDWDPVEQHLYWTDDEARNKSGES 80 (81)
Q Consensus 49 ~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s 80 (81)
.++..+.++++|+.++++||+|.....|.|+.
T Consensus 6 ~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~ 37 (43)
T smart00135 6 EGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37 (43)
T ss_pred CCCCCcCEEEEeecCCEEEEEeCCCCEEEEEe
Confidence 46777888999999999999999999999864
No 4
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.81 E-value=9.9e-06 Score=35.05 Aligned_cols=15 Identities=27% Similarity=0.310 Sum_probs=14.1
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
.|++||+|. |+++|.
T Consensus 5 ~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 5 SCPPGYQLSPDGRSCE 20 (24)
T ss_pred eCCCCCcCCCCCCccc
Confidence 499999999 999998
No 5
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.61 E-value=3.1e-05 Score=36.61 Aligned_cols=14 Identities=36% Similarity=0.411 Sum_probs=11.1
Q ss_pred CCCCceEEc-CCCCC
Q psy13631 2 FHFYGVRLV-DNFTC 15 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC 15 (81)
.||+||+|+ |+++|
T Consensus 22 ~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 22 SCPPGYKLAEDGRTC 36 (36)
T ss_dssp E-STTEEE-TTSSSE
T ss_pred ECCCCCEECcCCCCC
Confidence 499999999 99998
No 6
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=97.07 E-value=0.00076 Score=32.66 Aligned_cols=17 Identities=24% Similarity=0.491 Sum_probs=15.0
Q ss_pred CEEEEEECCCC-cEEeec
Q psy13631 64 QHLYWTDDEAR-NKSGES 80 (81)
Q Consensus 64 ~~iyW~d~~~~-~I~r~s 80 (81)
++|||+|...+ .|.++.
T Consensus 1 ~~iYWtD~~~~~~I~~a~ 18 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERAN 18 (42)
T ss_dssp TEEEEEETTTTEEEEEEE
T ss_pred CEEEEEECCCCcEEEEEE
Confidence 58999999999 999874
No 7
>KOG1215|consensus
Probab=96.70 E-value=0.0028 Score=47.80 Aligned_cols=40 Identities=35% Similarity=0.411 Sum_probs=34.1
Q ss_pred CCCCceEEc-CCCCCCCCCCcEEEEEeCCcEEEEeCCCCCce
Q psy13631 2 FHFYGVRLV-DNFTCANGSQEMLLLVQRTDICKISLDTPDHT 42 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~~~~~~~Ll~s~~~~I~~i~l~~~~~~ 42 (81)
-||+|+.|. ++++|.. +..+++++....|..+..+.....
T Consensus 694 ~c~~~~~l~~~~~~C~~-~~~~~~~~~~~~~~~~~~~~~~~~ 734 (877)
T KOG1215|consen 694 ACPEGYRLSPDGKSCSS-PEGYLLITSRTGIPCISLDSELSP 734 (877)
T ss_pred eCCCCCeecCCCCeecC-cccccccccccccceeecCcccCC
Confidence 499999999 9999997 777889999999999888775443
No 8
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=96.30 E-value=0.0027 Score=29.44 Aligned_cols=14 Identities=29% Similarity=0.349 Sum_probs=10.4
Q ss_pred CCCCceEEcCCCCC
Q psy13631 2 FHFYGVRLVDNFTC 15 (81)
Q Consensus 2 ~~~~Gy~L~d~~tC 15 (81)
+||+||.|+++.-|
T Consensus 21 ~CPeGyIlde~~~c 34 (34)
T PF09064_consen 21 FCPEGYILDEGSMC 34 (34)
T ss_pred eCCCceEecCCccC
Confidence 79999999844333
No 9
>KOG1214|consensus
Probab=96.19 E-value=0.011 Score=45.01 Aligned_cols=57 Identities=19% Similarity=0.353 Sum_probs=43.5
Q ss_pred EEEEEe--CCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631 22 MLLLVQ--RTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES 80 (81)
Q Consensus 22 ~Ll~s~--~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s 80 (81)
.+.++. +.+|++-+|++.+.+.++- +++.++-+|+.|....-|||+|...++|.-|+
T Consensus 1038 mvyWtDv~g~SI~rasL~G~Ep~ti~n--~~L~SPEGiAVDh~~Rn~ywtDS~lD~IevA~ 1096 (1289)
T KOG1214|consen 1038 MVYWTDVAGRSISRASLEGAEPETIVN--SGLISPEGIAVDHIRRNMYWTDSVLDKIEVAL 1096 (1289)
T ss_pred eEEEeecCCCccccccccCCCCceeec--ccCCCccceeeeeccceeeeeccccchhheee
Confidence 444444 5688898998776544332 58888888888889999999999999987664
No 10
>PF02822 Antistasin: Antistasin family; InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=93.64 E-value=0.043 Score=23.93 Aligned_cols=12 Identities=25% Similarity=0.553 Sum_probs=6.1
Q ss_pred CCCCCceEEc-CC
Q psy13631 1 MFHFYGVRLV-DN 12 (81)
Q Consensus 1 ~~~~~Gy~L~-d~ 12 (81)
|.||.||+++ +|
T Consensus 8 ~~C~~Gf~~D~~G 20 (26)
T PF02822_consen 8 MYCPYGFKTDENG 20 (26)
T ss_dssp ---TT-EEE-TTS
T ss_pred CcCCCcccCCCCC
Confidence 6899999998 66
No 11
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=93.51 E-value=0.17 Score=32.32 Aligned_cols=25 Identities=28% Similarity=0.555 Sum_probs=22.0
Q ss_pred EEEEeCCCCEEEEEECCCCcEEeec
Q psy13631 56 AIDWDPVEQHLYWTDDEARNKSGES 80 (81)
Q Consensus 56 ald~d~~~~~iyW~d~~~~~I~r~s 80 (81)
+.-||..++.+||+|+..+.|.|..
T Consensus 4 gp~~d~~~g~l~~~D~~~~~i~~~~ 28 (246)
T PF08450_consen 4 GPVWDPRDGRLYWVDIPGGRIYRVD 28 (246)
T ss_dssp EEEEETTTTEEEEEETTTTEEEEEE
T ss_pred ceEEECCCCEEEEEEcCCCEEEEEE
Confidence 4678999999999999999999864
No 12
>KOG4659|consensus
Probab=92.50 E-value=0.63 Score=37.60 Aligned_cols=62 Identities=24% Similarity=0.235 Sum_probs=48.9
Q ss_pred CCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631 19 SQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES 80 (81)
Q Consensus 19 ~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s 80 (81)
+.-.|++..-+-||+|.-+++..+-..+.+.+...-.=++.++.++.||-+|...+.|.|.+
T Consensus 374 ~DGSl~VGDfNyIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp~s~qv~rv~ 435 (1899)
T KOG4659|consen 374 PDGSLIVGDFNYIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDPLSKQVWRVS 435 (1899)
T ss_pred CCCcEEEccchheeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCCCcceEEEec
Confidence 34456667778999999998765545554456666778999999999999999999999976
No 13
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=91.61 E-value=0.088 Score=25.17 Aligned_cols=14 Identities=36% Similarity=0.325 Sum_probs=11.7
Q ss_pred CCCCceEEc-CCCCC
Q psy13631 2 FHFYGVRLV-DNFTC 15 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC 15 (81)
.|++||++. ++++|
T Consensus 28 ~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 28 SCPPGYELNDDGTTC 42 (42)
T ss_dssp EESTTEEECTTSSEE
T ss_pred eCCCCcEECCCCCcC
Confidence 389999988 88776
No 14
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=91.26 E-value=0.88 Score=30.92 Aligned_cols=44 Identities=18% Similarity=0.400 Sum_probs=30.4
Q ss_pred CcEEEEeCCCCC------------ceeeEEEeCCccceEEEEEeCCCCEEEEEECCCC
Q psy13631 29 TDICKISLDTPD------------HTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEAR 74 (81)
Q Consensus 29 ~~I~~i~l~~~~------------~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~ 74 (81)
-.|.+++.+++. ...+. ..|+.|+.+++||+.++.||-+|...+
T Consensus 148 G~ilri~~dG~~p~dnP~~~~~~~~~~i~--A~GlRN~~~~~~d~~tg~l~~~d~G~~ 203 (331)
T PF07995_consen 148 GKILRIDPDGSIPADNPFVGDDGADSEIY--AYGLRNPFGLAFDPNTGRLWAADNGPD 203 (331)
T ss_dssp TEEEEEETTSSB-TTSTTTTSTTSTTTEE--EE--SEEEEEEEETTTTEEEEEEE-SS
T ss_pred ceEEEecccCcCCCCCccccCCCceEEEE--EeCCCccccEEEECCCCcEEEEccCCC
Confidence 368888887651 11222 359999999999999999999997654
No 15
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=88.69 E-value=2.9 Score=26.63 Aligned_cols=45 Identities=20% Similarity=0.231 Sum_probs=33.5
Q ss_pred EEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631 31 ICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGE 79 (81)
Q Consensus 31 I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~ 79 (81)
|.+++.+. . ...+. .++..+-+|.|++.++.+|++|...+.|.+.
T Consensus 117 v~~~~~~~-~-~~~~~--~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~ 161 (246)
T PF08450_consen 117 VYRIDPDG-K-VTVVA--DGLGFPNGIAFSPDGKTLYVADSFNGRIWRF 161 (246)
T ss_dssp EEEEETTS-E-EEEEE--EEESSEEEEEEETTSSEEEEEETTTTEEEEE
T ss_pred eEEECCCC-e-EEEEe--cCcccccceEECCcchheeecccccceeEEE
Confidence 66677662 2 22222 4677778999999999999999999999874
No 16
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=87.41 E-value=3.7 Score=29.12 Aligned_cols=26 Identities=23% Similarity=0.538 Sum_probs=23.2
Q ss_pred CCccceEEEEEeCCCCEEEEEECCCC
Q psy13631 49 VGVKHAIAIDWDPVEQHLYWTDDEAR 74 (81)
Q Consensus 49 ~~~~~~~ald~d~~~~~iyW~d~~~~ 74 (81)
-|+.|+++++||+.++.+|-++...+
T Consensus 236 ~G~RN~qGl~w~P~tg~Lw~~e~g~d 261 (399)
T COG2133 236 YGHRNPQGLAWHPVTGALWTTEHGPD 261 (399)
T ss_pred eccCCccceeecCCCCcEEEEecCCC
Confidence 48999999999999999999987763
No 17
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=85.05 E-value=4.8 Score=27.65 Aligned_cols=44 Identities=11% Similarity=0.207 Sum_probs=30.3
Q ss_pred cEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631 30 DICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKS 77 (81)
Q Consensus 30 ~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~ 77 (81)
.|.+++.++...+ ++ ..++.++.+++||. ++.+|.+|.......
T Consensus 165 ~i~r~~pdg~~~e-~~--a~G~rnp~Gl~~d~-~G~l~~tdn~~~~~~ 208 (367)
T TIGR02604 165 GLFRYNPDGGKLR-VV--AHGFQNPYGHSVDS-WGDVFFCDNDDPPLC 208 (367)
T ss_pred eEEEEecCCCeEE-EE--ecCcCCCccceECC-CCCEEEEccCCCcee
Confidence 4777777765432 22 36899999999998 567888887544333
No 18
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=82.72 E-value=2.9 Score=17.99 Aligned_cols=26 Identities=12% Similarity=0.166 Sum_probs=20.3
Q ss_pred cceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 52 KHAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 52 ~~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
..+.+|..| .++.||=+|.....|..
T Consensus 2 ~~P~gvav~-~~g~i~VaD~~n~rV~v 27 (28)
T PF01436_consen 2 NYPHGVAVD-SDGNIYVADSGNHRVQV 27 (28)
T ss_dssp SSEEEEEEE-TTSEEEEEECCCTEEEE
T ss_pred cCCcEEEEe-CCCCEEEEECCCCEEEE
Confidence 356677788 78899999988887753
No 19
>KOG1520|consensus
Probab=80.92 E-value=6.2 Score=27.81 Aligned_cols=52 Identities=19% Similarity=0.241 Sum_probs=32.9
Q ss_pred EEEEEeCC-cEEEEeCCCCCceeeEEEeCCcc--ceEEEEEeCCCCEEEEEECCCC
Q psy13631 22 MLLLVQRT-DICKISLDTPDHTNVMLPVVGVK--HAIAIDWDPVEQHLYWTDDEAR 74 (81)
Q Consensus 22 ~Ll~s~~~-~I~~i~l~~~~~~~~~~~~~~~~--~~~ald~d~~~~~iyW~d~~~~ 74 (81)
=|++|... .+..++..+...+....++.|.. -.=++|.|. ++.|||+|...+
T Consensus 128 dL~VaDAYlGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk 182 (376)
T KOG1520|consen 128 DLYVADAYLGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSK 182 (376)
T ss_pred eEEEEecceeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccc
Confidence 46667664 67777777665333333333422 234688888 899999998653
No 20
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=80.68 E-value=6 Score=28.50 Aligned_cols=49 Identities=12% Similarity=0.224 Sum_probs=33.5
Q ss_pred cEEEEeCCCCCce--------eeEEEeCCccceEEEEEeCCCCEEEEEECCC---CcEEee
Q psy13631 30 DICKISLDTPDHT--------NVMLPVVGVKHAIAIDWDPVEQHLYWTDDEA---RNKSGE 79 (81)
Q Consensus 30 ~I~~i~l~~~~~~--------~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~---~~I~r~ 79 (81)
.|.||+.+++.-. ..-+-.-|++|+.+++||+ ++.||.+|... +.|.+.
T Consensus 200 kILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp~~~DEiN~I 259 (454)
T TIGR03606 200 KVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGPNSDDELNII 259 (454)
T ss_pred EEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCCCCCcEEEEe
Confidence 6888888865210 0112235899999999999 78999999865 445543
No 21
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=79.80 E-value=13 Score=29.54 Aligned_cols=57 Identities=14% Similarity=0.153 Sum_probs=36.1
Q ss_pred EEEEEeC--CcEEEEeCCCCCceeeEE-----------EeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 22 MLLLVQR--TDICKISLDTPDHTNVML-----------PVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 22 ~Ll~s~~--~~I~~i~l~~~~~~~~~~-----------~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
-|+++.. +.|+.+++++........ ....+..+.+|.+|..++.||++|.....|.+
T Consensus 581 ~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n~~Ir~ 650 (1057)
T PLN02919 581 RLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTENHALRE 650 (1057)
T ss_pred eEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCCceEEE
Confidence 4555553 578888877542111100 00113457899999999999999998877765
No 22
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=79.32 E-value=0.83 Score=22.80 Aligned_cols=15 Identities=33% Similarity=0.281 Sum_probs=12.0
Q ss_pred CCCCceEEcCCCCCC
Q psy13631 2 FHFYGVRLVDNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~d~~tC~ 16 (81)
+|++||.+.++..|.
T Consensus 36 ~C~~G~v~~~~~~CV 50 (55)
T PF01826_consen 36 FCPPGYVRNDNGRCV 50 (55)
T ss_dssp EETTTEEEETTSEEE
T ss_pred CCCCCeeEcCCCCEE
Confidence 699999988447776
No 23
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=78.16 E-value=2.5 Score=28.77 Aligned_cols=23 Identities=26% Similarity=0.388 Sum_probs=19.9
Q ss_pred EEEeCCCCEEEEEECCCCcEEee
Q psy13631 57 IDWDPVEQHLYWTDDEARNKSGE 79 (81)
Q Consensus 57 ld~d~~~~~iyW~d~~~~~I~r~ 79 (81)
--||...+.+||+|+..++|.|.
T Consensus 30 P~w~~~~~~L~w~DI~~~~i~r~ 52 (307)
T COG3386 30 PVWDPDRGALLWVDILGGRIHRL 52 (307)
T ss_pred ccCcCCCCEEEEEeCCCCeEEEe
Confidence 34889999999999999999874
No 24
>TIGR01492 CPW_WPC Plasmodium falciparum CPW-WPC domain. This model represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats, and is known so far only in Plasmodium falciparum. It is named for motifs of CPxxW and (less well conserved) WPC.
Probab=75.38 E-value=1.9 Score=22.39 Aligned_cols=16 Identities=13% Similarity=0.011 Sum_probs=13.7
Q ss_pred CCCCceEEc-CCCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCAN 17 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~~ 17 (81)
.||.|+.+. |+..|..
T Consensus 9 ~CP~~W~~~~~~~~C~a 25 (62)
T TIGR01492 9 PCPENWIQKNDKYGCIA 25 (62)
T ss_pred cCCccceecCCCCeeCC
Confidence 699999999 8777993
No 25
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=75.38 E-value=1.3 Score=15.98 Aligned_cols=7 Identities=14% Similarity=-0.330 Sum_probs=4.7
Q ss_pred CCCCceE
Q psy13631 2 FHFYGVR 8 (81)
Q Consensus 2 ~~~~Gy~ 8 (81)
.|++||.
T Consensus 3 ~C~~G~~ 9 (13)
T PF12661_consen 3 QCPPGWT 9 (13)
T ss_dssp EE-TTEE
T ss_pred cCcCCCc
Confidence 3899996
No 26
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=73.55 E-value=31 Score=27.46 Aligned_cols=58 Identities=9% Similarity=0.143 Sum_probs=36.7
Q ss_pred cEEEEEeC--CcEEEEeCCCCCceeeEE--------------EeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 21 EMLLLVQR--TDICKISLDTPDHTNVML--------------PVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 21 ~~Ll~s~~--~~I~~i~l~~~~~~~~~~--------------~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
..|.++.. +.|+.+++.+..-....- ..+.+..+.+|.+|+.++.+|++|.....|.+
T Consensus 636 n~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v 709 (1057)
T PLN02919 636 NLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWE 709 (1057)
T ss_pred CEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEE
Confidence 45667665 568888876542211110 00013456789999989999999988777754
No 27
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=72.00 E-value=20 Score=24.44 Aligned_cols=29 Identities=17% Similarity=0.236 Sum_probs=25.1
Q ss_pred ccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631 51 VKHAIAIDWDPVEQHLYWTDDEARNKSGE 79 (81)
Q Consensus 51 ~~~~~ald~d~~~~~iyW~d~~~~~I~r~ 79 (81)
+..+=+|.|++.++.+|++|.....|.|.
T Consensus 162 ~~~~NGla~SpDg~tly~aDT~~~~i~r~ 190 (307)
T COG3386 162 LTIPNGLAFSPDGKTLYVADTPANRIHRY 190 (307)
T ss_pred EEecCceEECCCCCEEEEEeCCCCeEEEE
Confidence 56666788999999999999999999885
No 28
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=69.24 E-value=17 Score=20.11 Aligned_cols=31 Identities=23% Similarity=0.215 Sum_probs=26.8
Q ss_pred eCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 48 VVGVKHAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 48 ~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
+.++..+-+|.+++..++||=++...++|.-
T Consensus 50 a~g~~~aNGI~~s~~~k~lyVa~~~~~~I~v 80 (86)
T PF01731_consen 50 ASGFSFANGIAISPDKKYLYVASSLAHSIHV 80 (86)
T ss_pred eccCCCCceEEEcCCCCEEEEEeccCCeEEE
Confidence 3588888899999999999999999888764
No 29
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=69.08 E-value=33 Score=23.30 Aligned_cols=55 Identities=18% Similarity=0.207 Sum_probs=35.3
Q ss_pred EEEEe--CCcEEEEeCCCCCce-eeEEEeCCcc-ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 23 LLLVQ--RTDICKISLDTPDHT-NVMLPVVGVK-HAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 23 Ll~s~--~~~I~~i~l~~~~~~-~~~~~~~~~~-~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
++++. +..|.++.++..... ....-+.+.. +++.|.+++. +.||-++-..++|+|
T Consensus 273 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~pD-G~Lyv~~d~~G~iyR 331 (331)
T PF07995_consen 273 LFVADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQGPD-GALYVSDDSDGKIYR 331 (331)
T ss_dssp EEEEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEETT-SEEEEEE-TTTTEEE
T ss_pred EEEecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEcCC-CeEEEEECCCCeEeC
Confidence 44555 358888888754221 1111224555 7889999984 789999889999987
No 30
>PF04942 CC: CC domain; InterPro: IPR007026 This short domain contains four conserved cysteines that are probably required for the formation of two disulphide bonds. The domain is only found in proteins from Caenorhabditis species. The domain is named after the characteristic CC motif.
Probab=68.96 E-value=3.6 Score=19.15 Aligned_cols=14 Identities=21% Similarity=0.311 Sum_probs=10.3
Q ss_pred CCCCceEEc-CCCCC
Q psy13631 2 FHFYGVRLV-DNFTC 15 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC 15 (81)
.||.||.|- .+.=|
T Consensus 19 ~CP~G~~~i~g~~CC 33 (36)
T PF04942_consen 19 VCPSGYTVITGNGCC 33 (36)
T ss_pred cCCCCCEEECCCccC
Confidence 599999998 55433
No 31
>KOG0315|consensus
Probab=68.11 E-value=15 Score=24.98 Aligned_cols=47 Identities=15% Similarity=0.228 Sum_probs=33.0
Q ss_pred CcEEEEEeCCcEEEEeCCCCCceeeEEEeCC-ccceEEEEEeCCCCEEE
Q psy13631 20 QEMLLLVQRTDICKISLDTPDHTNVMLPVVG-VKHAIAIDWDPVEQHLY 67 (81)
Q Consensus 20 ~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~-~~~~~ald~d~~~~~iy 67 (81)
..+|..+-...||..++++.+..+++. ..+ -+|+.++.|.+....+|
T Consensus 52 k~~LAaa~~qhvRlyD~~S~np~Pv~t-~e~h~kNVtaVgF~~dgrWMy 99 (311)
T KOG0315|consen 52 KKDLAAAGNQHVRLYDLNSNNPNPVAT-FEGHTKNVTAVGFQCDGRWMY 99 (311)
T ss_pred cchhhhccCCeeEEEEccCCCCCceeE-EeccCCceEEEEEeecCeEEE
Confidence 345655556689999998876544332 233 38899999999888777
No 32
>PF02425 GBP_PSP: Paralytic/GBP/PSP peptide; InterPro: IPR003463 This family includes insect peptides that are short (23 amino acids) and contain 1 disulphide bridge. The family includes growth-blocking peptide (GBP) of Pseudaletia separata (Oriental armyworm) and the paralytic peptides from Manduca sexta (Tobacco hawkmoth), Heliothis virescens (Noctuid moth), and Spodoptera exigua (Beet armyworm) [] as well as plasmatocyte-spreading peptide (PSP1) []. These peptides function to halt metamorphosis from larvae to pupae.; PDB: 1V28_A 2DJC_A 2EQQ_A 2EQH_A 2EQT_A 1BQF_A 2DJ9_A 1HRL_A 1IRR_A 1B5N_A ....
Probab=66.66 E-value=4.1 Score=16.95 Aligned_cols=14 Identities=21% Similarity=0.166 Sum_probs=11.1
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
-|.+||+-. || .||
T Consensus 6 gc~~gy~rtadg-rck 20 (23)
T PF02425_consen 6 GCATGYMRTADG-RCK 20 (23)
T ss_dssp SSSTTEEEETTT-EEE
T ss_pred cccccceEcCCc-ccc
Confidence 499999888 76 576
No 33
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=65.62 E-value=2.6 Score=20.12 Aligned_cols=15 Identities=27% Similarity=0.377 Sum_probs=11.7
Q ss_pred CCCCceEEc-C-CCCCC
Q psy13631 2 FHFYGVRLV-D-NFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d-~~tC~ 16 (81)
.|.+||+|. . ..+|.
T Consensus 29 ~C~~Gy~l~g~~~~~C~ 45 (57)
T smart00032 29 SCNPGYTLIGSSTITCL 45 (57)
T ss_pred EcCCCCEEcCCCeeEEC
Confidence 589999988 4 56787
No 34
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=64.43 E-value=5.1 Score=19.59 Aligned_cols=15 Identities=27% Similarity=0.240 Sum_probs=13.4
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
.|..||.|. ...+|.
T Consensus 3 ~C~~G~~l~~~~~~C~ 18 (47)
T PF13948_consen 3 QCQDGYYLNQNNNKCE 18 (47)
T ss_pred cccCccEEcCCCCccc
Confidence 589999999 888997
No 35
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=64.01 E-value=4.2 Score=18.05 Aligned_cols=6 Identities=17% Similarity=-0.158 Sum_probs=3.0
Q ss_pred CCCceE
Q psy13631 3 HFYGVR 8 (81)
Q Consensus 3 ~~~Gy~ 8 (81)
|+.||.
T Consensus 28 C~~g~~ 33 (39)
T smart00179 28 CPPGYT 33 (39)
T ss_pred CCCCCc
Confidence 455554
No 36
>PF00396 Granulin: Granulin; InterPro: IPR000118 Metazoan granulins [] are a family of cysteine-rich peptides of about 6 Kd which may have multiple biological activity. A precursor protein (known as acrogranin) potentially encodes seven different forms of granulin (grnA to grnG) which are probably released by post-translational proteolytic processing. Granulins are evolutionary related to a PMP-D1, a peptide extracted from the pars intercerebralis of migratory locusts []. A schematic representation of the structure of a granulin is shown below: xxxCxxxxxCxxxxxCCxxxxxxxxCCxxxxxxCCxxxxxCCxxxxxCxxxxxxCx 'C': conserved cysteine probably involved in a disulphide bond. In plants a granulin domain is often associated with the C terminus of cysteine proteases belong to the MEROPS peptidase family C1, subfamily C1A (papain).; PDB: 1I8Y_A 1QGM_A 1I8X_A 2JYT_A 2JYU_A 1FWO_A 2JYV_A 2JYE_A 1G26_A.
Probab=62.83 E-value=4 Score=19.82 Aligned_cols=14 Identities=21% Similarity=0.138 Sum_probs=11.0
Q ss_pred CCCceEEc-CCCCCC
Q psy13631 3 HFYGVRLV-DNFTCA 16 (81)
Q Consensus 3 ~~~Gy~L~-d~~tC~ 16 (81)
||.||.=+ ++.+|.
T Consensus 28 CP~G~~C~~~~~~C~ 42 (43)
T PF00396_consen 28 CPHGYTCDPDGGSCI 42 (43)
T ss_dssp SSTTSEEECTTTEEE
T ss_pred CCCcCEECCCCCEEc
Confidence 89999877 777774
No 37
>KOG4499|consensus
Probab=58.01 E-value=9.2 Score=25.82 Aligned_cols=23 Identities=30% Similarity=0.413 Sum_probs=17.2
Q ss_pred EEEEeCCCCEEEEEECCCCcEEe
Q psy13631 56 AIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 56 ald~d~~~~~iyW~d~~~~~I~r 78 (81)
+..||..++.+||+|+..+.+.|
T Consensus 19 gp~w~~~~~sLl~VDi~ag~v~r 41 (310)
T KOG4499|consen 19 GPHWDVERQSLLYVDIEAGEVHR 41 (310)
T ss_pred CCceEEecceEEEEEeccCceeh
Confidence 34577888889998887776654
No 38
>PF09717 CPW_WPC: Plasmodium falciparum domain of unknown function (CPW_WPC); InterPro: IPR006387 This entry represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats. It is named for motifs of CPxxW and (less well conserved) WPC. Its function is unknown.
Probab=57.40 E-value=4.8 Score=20.57 Aligned_cols=16 Identities=25% Similarity=0.140 Sum_probs=12.7
Q ss_pred CCCCceEEcCCCCCCC
Q psy13631 2 FHFYGVRLVDNFTCAN 17 (81)
Q Consensus 2 ~~~~Gy~L~d~~tC~~ 17 (81)
.||.|+.+.++..|..
T Consensus 8 ~CP~~W~~~~~~~C~a 23 (60)
T PF09717_consen 8 PCPEGWILSDDGSCIA 23 (60)
T ss_pred ccccccccCCCCeEcC
Confidence 5999999666677993
No 39
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=56.54 E-value=5 Score=19.23 Aligned_cols=15 Identities=33% Similarity=0.432 Sum_probs=11.6
Q ss_pred CCCCceEEc-C-CCCCC
Q psy13631 2 FHFYGVRLV-D-NFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d-~~tC~ 16 (81)
.|.+||+|. . ..+|.
T Consensus 28 ~C~~Gy~~~g~~~~~C~ 44 (57)
T cd00033 28 SCNEGYTLVGSSTITCT 44 (57)
T ss_pred ECCCCCeEeCCCeeEEC
Confidence 488999988 4 46787
No 40
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=55.88 E-value=44 Score=22.14 Aligned_cols=25 Identities=20% Similarity=0.360 Sum_probs=18.5
Q ss_pred cceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631 52 KHAIAIDWDPVEQHLYWTDDEARNKS 77 (81)
Q Consensus 52 ~~~~ald~d~~~~~iyW~d~~~~~I~ 77 (81)
...++|||.+.++.+|=.. ....|+
T Consensus 27 e~l~GID~Rpa~G~LYgl~-~~g~lY 51 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLG-STGRLY 51 (236)
T ss_pred CeEEEEEeecCCCCEEEEe-CCCcEE
Confidence 5689999999999999663 334444
No 41
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=55.35 E-value=54 Score=21.67 Aligned_cols=33 Identities=18% Similarity=0.340 Sum_probs=20.8
Q ss_pred EEeCCcc-ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 46 LPVVGVK-HAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 46 ~~~~~~~-~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
.+++++. ++-+|.|++.++++|=+.-....|..
T Consensus 15 ~~l~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~e 48 (248)
T PF06977_consen 15 KPLPGILDELSGLTYNPDTGTLFAVQDEPGEIYE 48 (248)
T ss_dssp EE-TT--S-EEEEEEETTTTEEEEEETTTTEEEE
T ss_pred eECCCccCCccccEEcCCCCeEEEEECCCCEEEE
Confidence 3667766 48899999988887666555555554
No 42
>PF14280 DUF4365: Domain of unknown function (DUF4365)
Probab=50.36 E-value=38 Score=19.60 Aligned_cols=24 Identities=21% Similarity=0.303 Sum_probs=18.6
Q ss_pred ceEEEEEeCCCCEEEEEECCCCcE
Q psy13631 53 HAIAIDWDPVEQHLYWTDDEARNK 76 (81)
Q Consensus 53 ~~~ald~d~~~~~iyW~d~~~~~I 76 (81)
.++.+=+|..++.+||.+.....+
T Consensus 84 PvilV~~~~~~~~~yW~~l~~~~~ 107 (138)
T PF14280_consen 84 PVILVLVDPDSDCAYWVSLKGYPI 107 (138)
T ss_pred CEEEEEEeCCCCEEEEEEcccCcc
Confidence 355667899999999999875544
No 43
>PF13947 GUB_WAK_bind: Wall-associated receptor kinase galacturonan-binding
Probab=49.98 E-value=29 Score=19.33 Aligned_cols=33 Identities=24% Similarity=0.284 Sum_probs=18.0
Q ss_pred CCceEEc-CCCCCCCCCCcEEEEEeCCcEEEEeCCCC
Q psy13631 4 FYGVRLV-DNFTCANGSQEMLLLVQRTDICKISLDTP 39 (81)
Q Consensus 4 ~~Gy~L~-d~~tC~~~~~~~Ll~s~~~~I~~i~l~~~ 39 (81)
.+||+|. +..+ .+..+++.....+|..|++...
T Consensus 26 ~~~F~L~C~~~~---~~~~l~l~~~~~~V~~I~~~~~ 59 (106)
T PF13947_consen 26 DPGFELTCNNNT---SPPKLLLSSGNYEVLSISYENG 59 (106)
T ss_pred CCCcEEECCCCC---CCceeEecCCcEEEEEEecCCC
Confidence 3678876 4322 2333444344457777887654
No 44
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=49.86 E-value=6.9 Score=18.05 Aligned_cols=12 Identities=33% Similarity=0.423 Sum_probs=6.5
Q ss_pred CCCceEEcCCCCC
Q psy13631 3 HFYGVRLVDNFTC 15 (81)
Q Consensus 3 ~~~Gy~L~d~~tC 15 (81)
|++||.. ||..|
T Consensus 25 C~~Gy~G-dG~~C 36 (36)
T PF12947_consen 25 CKPGYEG-DGFFC 36 (36)
T ss_dssp E-CEEEC-CSTCE
T ss_pred CCCCCcc-CCcCC
Confidence 7788753 45444
No 45
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=49.04 E-value=76 Score=20.98 Aligned_cols=54 Identities=13% Similarity=0.103 Sum_probs=29.6
Q ss_pred EEEEe--CCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcE
Q psy13631 23 LLLVQ--RTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNK 76 (81)
Q Consensus 23 Ll~s~--~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I 76 (81)
+++++ ...|..++++....-..+-.++...++..+.+++..+++|.+....+.|
T Consensus 4 ~y~~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i 59 (330)
T PRK11028 4 VYIASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRV 59 (330)
T ss_pred EEEEcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcE
Confidence 44443 3577777775322111111112234566777888888888876655555
No 46
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=46.28 E-value=17 Score=17.15 Aligned_cols=16 Identities=19% Similarity=0.069 Sum_probs=12.4
Q ss_pred CCCCCceEEc-CCCCCC
Q psy13631 1 MFHFYGVRLV-DNFTCA 16 (81)
Q Consensus 1 ~~~~~Gy~L~-d~~tC~ 16 (81)
+.||.|+.-. ..+.|.
T Consensus 31 ~~C~~g~~f~~~~~~C~ 47 (56)
T smart00494 31 GSCPAGLVFDPATQTCD 47 (56)
T ss_pred ccCcCCCeECCCCCCcC
Confidence 4688888777 777887
No 47
>PF06045 Rhamnogal_lyase: Rhamnogalacturonate lyase family; InterPro: IPR010325 Rhamnogalacturonate lyase degrades the rhamnogalacturonan I (RG-I) backbone of pectin []. This family contains mainly members from plants, but also contains the plant pathogen Erwinia chrysanthemi.
Probab=45.56 E-value=48 Score=21.49 Aligned_cols=42 Identities=14% Similarity=0.379 Sum_probs=25.6
Q ss_pred eCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEE
Q psy13631 27 QRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWT 69 (81)
Q Consensus 27 ~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~ 69 (81)
-.+.|..+.+......-.-+...|+.+...+. +...++-||-
T Consensus 20 ldNGiVqVtls~p~G~VtgIkYnGi~NLle~~-n~e~nrGYwD 61 (203)
T PF06045_consen 20 LDNGIVQVTLSKPGGIVTGIKYNGIDNLLEVA-NKENNRGYWD 61 (203)
T ss_pred EECCEEEEEEcCCCceEEEEEECCEehhhccc-CcccCCceEE
Confidence 44568888887643222222235666665555 7788889995
No 48
>PF12075 KN_motif: KN motif; InterPro: IPR021939 This small motif is found at the N terminus of Kank proteins and has been called the KN (for Kank N-terminal) motif. This protein is found in eukaryotes. Proteins in this family are typically between 413 to 1202 amino acids in length. This protein is found associated with PF00023 from PFAM. This protein has two conserved sequence motifs: TPYG and LDLDF. Kank1 was obtained by positional cloning of a tumor suppressor gene in renal cell carcinoma, while the other members were found by homology search. The family is involved in the regulation of actin polymerisation and cell motility through signaling pathways containing PI3K/Akt and/or unidentified modulators/effectors [].
Probab=45.22 E-value=13 Score=17.73 Aligned_cols=15 Identities=20% Similarity=0.051 Sum_probs=12.0
Q ss_pred CCCCceEEc-CC-CCCC
Q psy13631 2 FHFYGVRLV-DN-FTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~-~tC~ 16 (81)
.||-||.|+ |- +.|.
T Consensus 5 ~tPYGyhiDLDFvkyve 21 (39)
T PF12075_consen 5 ETPYGYHIDLDFVKYVE 21 (39)
T ss_pred cCCcceeecchHHHHHH
Confidence 479999999 84 6677
No 49
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=44.13 E-value=57 Score=18.14 Aligned_cols=31 Identities=10% Similarity=0.045 Sum_probs=25.7
Q ss_pred CCccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631 49 VGVKHAIAIDWDPVEQHLYWTDDEARNKSGE 79 (81)
Q Consensus 49 ~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~ 79 (81)
.++.-+-+|.....+..|..++....+|.|.
T Consensus 54 ~~L~fpNGVals~d~~~vlv~Et~~~Ri~ry 84 (89)
T PF03088_consen 54 DGLYFPNGVALSPDESFVLVAETGRYRILRY 84 (89)
T ss_dssp EEESSEEEEEE-TTSSEEEEEEGGGTEEEEE
T ss_pred hCCCccCeEEEcCCCCEEEEEeccCceEEEE
Confidence 5788888888899999999999988888873
No 50
>PF00084 Sushi: Sushi domain (SCR repeat); InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=42.73 E-value=7 Score=18.72 Aligned_cols=15 Identities=27% Similarity=0.321 Sum_probs=11.1
Q ss_pred CCCCceEEc--CCCCCC
Q psy13631 2 FHFYGVRLV--DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~--d~~tC~ 16 (81)
.|.+||+|. ...+|.
T Consensus 28 ~C~~Gy~~~g~~~~~C~ 44 (56)
T PF00084_consen 28 SCNPGYELSGSSTITCQ 44 (56)
T ss_dssp EESTTEEEESSSEEEEE
T ss_pred cCCCCCEecCCCEEEEC
Confidence 488999988 446784
No 51
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=42.67 E-value=96 Score=20.31 Aligned_cols=49 Identities=8% Similarity=0.163 Sum_probs=31.0
Q ss_pred cEEEE-EeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEE
Q psy13631 21 EMLLL-VQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTD 70 (81)
Q Consensus 21 ~~Ll~-s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d 70 (81)
.+|++ +.+..|...+|..+.. ......+.++.+..+.|...+++|-=.+
T Consensus 29 d~Lfva~~g~~Vev~~l~~~~~-~~~~~F~Tv~~V~~l~y~~~GDYlvTlE 78 (215)
T PF14761_consen 29 DALFVAASGCKVEVYDLEQEEC-PLLCTFSTVGRVLQLVYSEAGDYLVTLE 78 (215)
T ss_pred ceEEEEcCCCEEEEEEcccCCC-ceeEEEcchhheeEEEeccccceEEEEE
Confidence 45544 6677888888884433 2333335677777888887777765443
No 52
>PF05098 LEF-4: Late expression factor 4 (LEF-4); InterPro: IPR007790 The baculovirus Autographa californica nuclear polyhedrosis virus (AcMNPV) virus encodes a DNA-dependent RNA polymerase that is required for transcription of viral late genes. This polymerase is composed of four equimolar subunits, LEF-8, LEF-4, LEF-9, and p47. LEF-4 carries out all the enzymatic functions related to mRNA capping []. ; GO: 0006355 regulation of transcription, DNA-dependent
Probab=41.07 E-value=70 Score=23.30 Aligned_cols=58 Identities=14% Similarity=0.204 Sum_probs=31.3
Q ss_pred eEEc-CCCCCC--CCCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCC-CEEEEEECC
Q psy13631 7 VRLV-DNFTCA--NGSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVE-QHLYWTDDE 72 (81)
Q Consensus 7 y~L~-d~~tC~--~~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~-~~iyW~d~~ 72 (81)
|-|. ||.+|| -...-++++......+...+++.. -.+.+++|+..-..+ +.+|-+|+.
T Consensus 238 WAlKLDGvRGkG~~~~~~~~i~~DDMq~fsg~l~~~~--------f~~Nnvv~fQcE~i~~~~~YITDlL 299 (450)
T PF05098_consen 238 WALKLDGVRGKGYFTNGFIIIQMDDMQMFSGKLDPSP--------FSLNNVVAFQCELIDDETFYITDLL 299 (450)
T ss_pred EEEeecccceeeEEeccEEEEEEchhhhhhcccccch--------hhcccEEEEEEEEecCceEEEeeee
Confidence 4577 998888 112333344444444444442221 134556666655555 778888864
No 53
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=40.31 E-value=66 Score=21.55 Aligned_cols=25 Identities=28% Similarity=0.431 Sum_probs=21.7
Q ss_pred ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631 53 HAIAIDWDPVEQHLYWTDDEARNKSG 78 (81)
Q Consensus 53 ~~~ald~d~~~~~iyW~d~~~~~I~r 78 (81)
+..++.+|. ++.||++++....|.+
T Consensus 187 ~s~g~~~D~-~G~ly~~~~~~~aI~~ 211 (287)
T PF03022_consen 187 QSDGMAIDP-NGNLYFTDVEQNAIGC 211 (287)
T ss_dssp SECEEEEET-TTEEEEEECCCTEEEE
T ss_pred CCceEEECC-CCcEEEecCCCCeEEE
Confidence 567888998 8899999999999876
No 54
>PF09603 Fib_succ_major: Fibrobacter succinogenes major domain (Fib_succ_major); InterPro: IPR011871 This domain of about 175 to 200 amino acids is found, in from one to five copies, in over 50 proteins in Fibrobacter succinogenes subsp. succinogenes S85, an obligate anaerobe of the rumen. Many members of this family have an apparent lipoprotein signal sequence. Conserved cysteine residues, suggestive of disulphide bond formation, are also consistent with an extracytoplasmic location for this domain. This domain can also be found in small numbers of proteins in Chlorobium tepidum and Bacteroides thetaiotaomicron.
Probab=39.34 E-value=9.9 Score=23.21 Aligned_cols=9 Identities=33% Similarity=0.154 Sum_probs=7.7
Q ss_pred CCCCceEEc
Q psy13631 2 FHFYGVRLV 10 (81)
Q Consensus 2 ~~~~Gy~L~ 10 (81)
-||+||+|-
T Consensus 69 ~CP~GWrlP 77 (184)
T PF09603_consen 69 ICPEGWRLP 77 (184)
T ss_pred CCCCCCCCC
Confidence 499999887
No 55
>PF08366 LLGL: LLGL2; InterPro: IPR013577 This domain is found in lethal giant larvae homologue 2 (LLGL2) proteins and syntaxin-binding proteins like tomosyn []. It has been identified in eukaryotes and tends to be found together with WD repeats (IPR001680 from INTERPRO).
Probab=38.66 E-value=80 Score=18.25 Aligned_cols=11 Identities=0% Similarity=0.220 Sum_probs=7.5
Q ss_pred CCcEEEEEeCC
Q psy13631 19 SQEMLLLVQRT 29 (81)
Q Consensus 19 ~~~~Ll~s~~~ 29 (81)
.++|++|+-+.
T Consensus 27 ~~~~iiFsGGm 37 (105)
T PF08366_consen 27 GEPFIIFSGGM 37 (105)
T ss_pred CCcEEEEeCCc
Confidence 56777777664
No 56
>smart00181 EGF Epidermal growth factor-like domain.
Probab=38.04 E-value=17 Score=15.70 Aligned_cols=7 Identities=14% Similarity=-0.344 Sum_probs=3.4
Q ss_pred CCCceEE
Q psy13631 3 HFYGVRL 9 (81)
Q Consensus 3 ~~~Gy~L 9 (81)
|+.||.+
T Consensus 24 C~~g~~g 30 (35)
T smart00181 24 CPPGYTG 30 (35)
T ss_pred CCCCCcc
Confidence 4555543
No 57
>smart00284 OLF Olfactomedin-like domains.
Probab=37.21 E-value=91 Score=20.88 Aligned_cols=41 Identities=15% Similarity=0.302 Sum_probs=25.8
Q ss_pred EeCCCCCceeeEEEeCCc-cceEEEEEeCCCCEEEEEECCCC
Q psy13631 34 ISLDTPDHTNVMLPVVGV-KHAIAIDWDPVEQHLYWTDDEAR 74 (81)
Q Consensus 34 i~l~~~~~~~~~~~~~~~-~~~~ald~d~~~~~iyW~d~~~~ 74 (81)
.++..+....+-+|.++. +....|||++.+++||==|-+..
T Consensus 205 yDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~~LY~wdng~~ 246 (255)
T smart00284 205 YDTNTGKEGHLDIPFENMYEYISMLDYNPNDRKLYAWNNGHL 246 (255)
T ss_pred EECCCCccceeeeeeccccccceeceeCCCCCeEEEEeCCeE
Confidence 345544434444555543 45678999999999996564433
No 58
>PTZ00444 hypothetical protein; Provisional
Probab=36.54 E-value=18 Score=23.00 Aligned_cols=14 Identities=14% Similarity=-0.035 Sum_probs=11.8
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
.||.||... || .|.
T Consensus 104 ~CP~GW~k~~dg-~Ci 118 (184)
T PTZ00444 104 PCPEGWTKNSNG-QCW 118 (184)
T ss_pred cCCccceecCCC-eee
Confidence 599999888 76 687
No 59
>PF00622 SPRY: SPRY domain; InterPro: IPR003877 The SPRY domain is of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin []. Ca2+-release from the sarcoplasmic or endoplasmic reticulum, the intracellular Ca2+ store, is mediated by the ryanodine receptor (RyR) and/or the inositol trisphosphate receptor (IP3R).; GO: 0005515 protein binding; PDB: 2V24_A 3EK9_A 2AFJ_A 2IWG_E 3EMW_A 2WL1_A 3TOJ_B 2VOK_A 2VOL_B 2FNJ_A ....
Probab=35.15 E-value=82 Score=17.37 Aligned_cols=21 Identities=14% Similarity=0.124 Sum_probs=18.6
Q ss_pred cceEEEEEeCCCCEEEEEECC
Q psy13631 52 KHAIAIDWDPVEQHLYWTDDE 72 (81)
Q Consensus 52 ~~~~ald~d~~~~~iyW~d~~ 72 (81)
..++++-+|...+.|.|++..
T Consensus 67 ~dvIG~~lD~~~g~l~F~~ng 87 (124)
T PF00622_consen 67 GDVIGCGLDLDNGELSFYKNG 87 (124)
T ss_dssp TSEEEEEEETTTTEEEEEETT
T ss_pred CcEEEEEEeecccEEEEEECC
Confidence 678999999999999999754
No 60
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=34.83 E-value=1.4e+02 Score=19.80 Aligned_cols=44 Identities=16% Similarity=0.256 Sum_probs=28.4
Q ss_pred EeCCCCCceeeEEEeCCc-cceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631 34 ISLDTPDHTNVMLPVVGV-KHAIAIDWDPVEQHLYWTDDEARNKS 77 (81)
Q Consensus 34 i~l~~~~~~~~~~~~~~~-~~~~ald~d~~~~~iyW~d~~~~~I~ 77 (81)
.++.......+-+|..+. .....|+|++.+++||==|-+...++
T Consensus 200 fDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY~wd~G~~v~Y 244 (250)
T PF02191_consen 200 FDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLYAWDNGYQVTY 244 (250)
T ss_pred EECCCCceeceeeeeccccCceEeeeECCCCCeEEEEECCeEEEE
Confidence 455544444444555543 46789999999999997665544443
No 61
>smart00104 ANATO Anaphylatoxin homologous domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.
Probab=34.81 E-value=16 Score=16.38 Aligned_cols=14 Identities=36% Similarity=0.567 Sum_probs=11.7
Q ss_pred CCCceEEc-CCCCCC
Q psy13631 3 HFYGVRLV-DNFTCA 16 (81)
Q Consensus 3 ~~~Gy~L~-d~~tC~ 16 (81)
|.+|+++. .+.+|.
T Consensus 2 C~dG~~~~~~~~tC~ 16 (35)
T smart00104 2 CADGMRLAPEGETCE 16 (35)
T ss_pred cccccccCCCCCCHH
Confidence 77888888 788897
No 62
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=34.70 E-value=19 Score=16.97 Aligned_cols=15 Identities=13% Similarity=0.111 Sum_probs=10.3
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
.||+|+.-+ +...|.
T Consensus 30 ~C~~g~~fd~~~~~C~ 45 (53)
T PF01607_consen 30 RCPEGLYFDPSSQRCV 45 (53)
T ss_dssp E-TTS-EE-TTTSSEE
T ss_pred CCcCCCEECcCcCEEc
Confidence 599998888 888898
No 63
>PHA02885 putative interleukin binding protein; Provisional
Probab=33.86 E-value=1e+02 Score=18.06 Aligned_cols=15 Identities=13% Similarity=0.463 Sum_probs=11.0
Q ss_pred EEeCCCCEEEEEECC
Q psy13631 58 DWDPVEQHLYWTDDE 72 (81)
Q Consensus 58 d~d~~~~~iyW~d~~ 72 (81)
-...++++|||.--.
T Consensus 55 ~lnsedgyiywi~pn 69 (135)
T PHA02885 55 CLNSEDGYIYWIGPN 69 (135)
T ss_pred eecCCCceEEEEeCC
Confidence 356688999998543
No 64
>KOG4649|consensus
Probab=32.56 E-value=1.7e+02 Score=20.27 Aligned_cols=19 Identities=21% Similarity=0.282 Sum_probs=9.7
Q ss_pred EeCCCCEEEEEECCCCcEE
Q psy13631 59 WDPVEQHLYWTDDEARNKS 77 (81)
Q Consensus 59 ~d~~~~~iyW~d~~~~~I~ 77 (81)
.|+.++.+||-.+...+|.
T Consensus 38 vd~~sG~~~We~ilg~RiE 56 (354)
T KOG4649|consen 38 VDPQSGNLIWEAILGVRIE 56 (354)
T ss_pred ecCCCCcEEeehhhCceee
Confidence 3445555666555444443
No 65
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=32.05 E-value=1.6e+02 Score=19.87 Aligned_cols=58 Identities=22% Similarity=0.340 Sum_probs=39.2
Q ss_pred CCcEEEEEeCC--cEEEEeCCCCCc--ee-eEEEeCCccceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631 19 SQEMLLLVQRT--DICKISLDTPDH--TN-VMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKS 77 (81)
Q Consensus 19 ~~~~Ll~s~~~--~I~~i~l~~~~~--~~-~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~ 77 (81)
...||.++++. .|-.+.+++... .. ..++..+ ..+..+.+++..++||=++...+.|.
T Consensus 255 dg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G-~~Pr~~~~s~~g~~l~Va~~~s~~v~ 317 (345)
T PF10282_consen 255 DGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGG-KFPRHFAFSPDGRYLYVANQDSNTVS 317 (345)
T ss_dssp TSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESS-SSEEEEEE-TTSSEEEEEETTTTEEE
T ss_pred CCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCC-CCccEEEEeCCCCEEEEEecCCCeEE
Confidence 35689999874 677777754322 11 2233333 56889999999999999988877765
No 66
>KOG0650|consensus
Probab=31.21 E-value=68 Score=24.47 Aligned_cols=51 Identities=16% Similarity=0.322 Sum_probs=31.1
Q ss_pred CCcEEEEEeCCcEEEEeCCCCCceeeEE---------Ee-CCccceEEEEEeCCCCEEEEEECC
Q psy13631 19 SQEMLLLVQRTDICKISLDTPDHTNVML---------PV-VGVKHAIAIDWDPVEQHLYWTDDE 72 (81)
Q Consensus 19 ~~~~Ll~s~~~~I~~i~l~~~~~~~~~~---------~~-~~~~~~~ald~d~~~~~iyW~d~~ 72 (81)
..|+|++|+++.+|--+|-.......++ .+ ++--|.+...|| +++.|-|..
T Consensus 577 s~p~lfVaTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d---~k~~WfDld 637 (733)
T KOG0650|consen 577 SKPYLFVATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYD---KKMCWFDLD 637 (733)
T ss_pred CCceEEEEeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCC---CeeEEEEcc
Confidence 4689999999999888776532111111 01 122355555554 689999874
No 67
>PF05281 Secretogranin_V: Neuroendocrine protein 7B2 precursor (Secretogranin V); InterPro: IPR007945 Mature peptide hormones and neuropeptides are typically synthesised from much larger precursors and require several post-translational processing steps--including proteolytic cleavage--for the formation of the bioactive species. The subtilisin-related proteolytic enzymes that accomplish neuroendocrine-specific cleavages are known as prohormone convertases 1 and 2 (PC1 and PC2), which belong to MEROPS peptidase family S8B. The cell biology of these proteases within the regulated secretory pathway of neuroendocrine cells is complex, and they are themselves initially synthesised as inactive precursor molecules. ProPC1 propeptide cleavage occurs rapidly in the endoplasmic reticulum, yet its major site of action on prohormones takes place later in the secretory pathway. PC1 undergoes an interesting carboxyl terminal processing event whose function appears to be to activate the enzyme. ProPC2, on the other hand, exhibits comparatively long initial folding times and exits the endoplasmic reticulum without propeptide cleavage, in association with the neuroendocrine-specific protein 7B2. Once the proPC2/7B2 complex arrives at the trans-Golgi network, 7B2 is internally cleaved into two domains, the 21kDa fragment and a carboxy-terminal 31 residue peptide. PC2 propeptide removal occurs in the maturing secretory granule, most likely through autocatalysis, and 7B2 association does not appear to be directly required for this cleavage event. However, if proPC2 has not encountered 7B2 intracellularly, it cannot generate a catalytically active mature species. The molecular mechanism behind the intriguing intracellular association of 7B2 and proPC2 is still unknown, but may involve conformational rearrangement or stabilisation of a proPC2 conformer mediated by a 36-residue internal segment of 21kDa 7B2. This family represents, 7B2 (secretogranin V), which is the molecular escort protein for PC2. 7B2 is a bifunctional protein with an N-terminal activation domain and a C-terminal inhibitory domain (MEROPS inhibitor family I21, clan I-) separated by a furin cleavage site []. Although 7B2 represents a potent inhibitor of PC2, there is an absolute requirement of 7B2 for the activation of PC2, which is synthesised as a zymogen. Both the full length, 27 kDa, and the C-terminal peptide (CT domain) derived from intramolecular cleavage of 7B2 are potent inhibitors of PC2. Studies have shown that the active peptide in the CT domain to be LLRVHK, active in the nanomolar range not only against PC2 but also PC1 [, ]. Knock-out studies have shown that the PC2 nulls are not phenotypically equivalent to the 7B2 nulls, which suggests that 7B2 may have other activities in addition to being the activator of PC2 []. 7B2 exhibits both structural and functional homology to proSAAS (IPR010832 from INTERPRO), which is the PC1 binding protein. The CT domain of proSAAS contains the same inhibitor hexapeptide as 7B2, consequently both 7B2 and proSAAS are two members of a homologous family of prohormone convertase inhibitor proteins. ; GO: 0007218 neuropeptide signaling pathway, 0030141 stored secretory granule
Probab=31.14 E-value=22 Score=23.12 Aligned_cols=15 Identities=20% Similarity=0.200 Sum_probs=10.8
Q ss_pred CCCCceEEcCCCCCCCC
Q psy13631 2 FHFYGVRLVDNFTCANG 18 (81)
Q Consensus 2 ~~~~Gy~L~d~~tC~~~ 18 (81)
-||.||.-+|| |...
T Consensus 102 PCPvGyt~~dg--Cle~ 116 (208)
T PF05281_consen 102 PCPVGYTAEDG--CLEN 116 (208)
T ss_pred CCCCCcccccC--cccC
Confidence 59999993277 8743
No 68
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=28.83 E-value=34 Score=20.57 Aligned_cols=15 Identities=20% Similarity=0.027 Sum_probs=11.5
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
+||.|+.-+ |...|.
T Consensus 35 ~Cp~G~sWs~dLdkCm 50 (129)
T PF12191_consen 35 PCPRGSSWSADLDKCM 50 (129)
T ss_dssp SS-SSEEEETTTTEEE
T ss_pred CCCCCCcCcccccccc
Confidence 799999999 876664
No 69
>PF01623 Carla_C4: Carlavirus putative nucleic acid binding protein; InterPro: IPR002568 This family of carlavirus nucleic acid binding proteins includes a motif for a potential C-4 type zinc finger this has four highly conserved cysteine residues and is a conserved feature of the carlaviruses 3' terminal ORF []. These proteins may function as viral transcriptional regulators. The carlavirus family includes Garlic latent virus and Potato virus S and Potato virus M, these viruses are positive strand, ssRNA with no DNA stage.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent
Probab=28.73 E-value=43 Score=18.96 Aligned_cols=8 Identities=63% Similarity=1.041 Sum_probs=6.6
Q ss_pred CCCCCCCC
Q psy13631 11 DNFTCANG 18 (81)
Q Consensus 11 d~~tC~~~ 18 (81)
||+||..|
T Consensus 73 DnkTC~PG 80 (91)
T PF01623_consen 73 DNKTCVPG 80 (91)
T ss_pred CCCcccCC
Confidence 99999954
No 70
>PF12093 Corona_NS8: Coronavirus NS8 protein; InterPro: IPR022722 This family of proteins is functionally uncharacterised. This protein is found in coronaviruses. Proteins in this family are typically between 39 to 121 amino acids in length. This protein has two conserved sequence motifs: EDPCP and INCQ.
Probab=27.32 E-value=32 Score=19.89 Aligned_cols=19 Identities=16% Similarity=0.088 Sum_probs=11.3
Q ss_pred CCcEEEEEeCCcEEEEeCC
Q psy13631 19 SQEMLLLVQRTDICKISLD 37 (81)
Q Consensus 19 ~~~~Ll~s~~~~I~~i~l~ 37 (81)
++.|+=+..+++-|...|-
T Consensus 43 ~dW~ik~~~R~sArL~~Lc 61 (118)
T PF12093_consen 43 SDWFIKIGTRKSARLVQLC 61 (118)
T ss_pred ccceEEEcCccchhHhhhc
Confidence 5666666666655555553
No 71
>PF02664 LuxS: S-Ribosylhomocysteinase (LuxS); InterPro: IPR003815 In bacteria, the regulation of gene expression in response to changes in cell density is called quorum sensing. Quorum-sensing bacteria produce, release, and respond to hormone-like molecules (autoinducers) that accumulate in the external environment as the cell population grows. For example, enteric bacteria use quorum sensing to regulate several traits that allow them to establish and maintain infection in their host, including motility, biofilm formation, and virulence-specific genes []. The LuxS/AI-2 system is one of several quorum sensing mechanisms. AI-2 (autoinducer-2) is a signalling molecule that functions in interspecies communication by regulating niche-specific genes with diverse functions in various bacteria, often in response to population density. LuxS (S-ribosylhomocysteinase; 4.4.1.21 from EC) is an autoinducer-production protein that has a metabolic function as a component of the activated methyl cycle. LuxS converts S-ribosylhomocysteine to homocysteine and 4,5-dihydroxy-2,3-pentanedione (DPD); DPD can then spontaneously cyclise to active AI-2 [, ]. LuxS is a homodimeric iron-dependent metalloenzyme containing two identical tetrahedral metal-binding sites similar to those found in peptidases and amidases []. ; GO: 0005506 iron ion binding, 0009372 quorum sensing; PDB: 1J6X_B 1VGX_A 1INN_B 1VJE_B 1J6V_A 1VH2_A 1J6W_B 1JOE_B 1J98_A 1IE0_A ....
Probab=27.27 E-value=36 Score=21.20 Aligned_cols=10 Identities=30% Similarity=0.158 Sum_probs=7.7
Q ss_pred CCCCCceEEc
Q psy13631 1 MFHFYGVRLV 10 (81)
Q Consensus 1 ~~~~~Gy~L~ 10 (81)
|-|.+||.|-
T Consensus 80 MGCrTGFYli 89 (157)
T PF02664_consen 80 MGCRTGFYLI 89 (157)
T ss_dssp -TTSSEEEEE
T ss_pred cccccccEEE
Confidence 7799999876
No 72
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=26.92 E-value=30 Score=15.18 Aligned_cols=7 Identities=14% Similarity=-0.320 Sum_probs=5.3
Q ss_pred CCCCceE
Q psy13631 2 FHFYGVR 8 (81)
Q Consensus 2 ~~~~Gy~ 8 (81)
.|++||.
T Consensus 23 ~C~~G~~ 29 (32)
T PF00008_consen 23 ECPPGYT 29 (32)
T ss_dssp EEBTTEE
T ss_pred ECCCCCc
Confidence 3888886
No 73
>smart00289 WR1 Worm-specific repeat type 1. Worm-specific repeat type 1. Cysteine-rich domain apparently unique (so far) to C. elegans. Often appears with KU domains. About 3 dozen worm proteins contain this domain.
Probab=25.45 E-value=34 Score=15.24 Aligned_cols=9 Identities=11% Similarity=-0.274 Sum_probs=6.3
Q ss_pred CCCCceEEc
Q psy13631 2 FHFYGVRLV 10 (81)
Q Consensus 2 ~~~~Gy~L~ 10 (81)
.||.||+-.
T Consensus 21 ~CP~g~~C~ 29 (38)
T smart00289 21 SCPSGYSCQ 29 (38)
T ss_pred CCCCCCEEe
Confidence 588888644
No 74
>KOG1446|consensus
Probab=24.32 E-value=2.5e+02 Score=19.51 Aligned_cols=48 Identities=13% Similarity=0.102 Sum_probs=30.5
Q ss_pred CCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEEC
Q psy13631 18 GSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDD 71 (81)
Q Consensus 18 ~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~ 71 (81)
.++.||--+....||.-++...... ..+ .+...-..+||+.+ ++++-.
T Consensus 111 ~~d~FlS~S~D~tvrLWDlR~~~cq-g~l---~~~~~pi~AfDp~G--LifA~~ 158 (311)
T KOG1446|consen 111 KDDTFLSSSLDKTVRLWDLRVKKCQ-GLL---NLSGRPIAAFDPEG--LIFALA 158 (311)
T ss_pred CCCeEEecccCCeEEeeEecCCCCc-eEE---ecCCCcceeECCCC--cEEEEe
Confidence 3577888777788999888855433 233 23444456788866 555544
No 75
>PF10015 DUF2258: Uncharacterized protein conserved in archaea (DUF2258); InterPro: IPR017140 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=24.21 E-value=80 Score=17.22 Aligned_cols=15 Identities=20% Similarity=0.501 Sum_probs=12.0
Q ss_pred eEEEEEeCCCCEEEE
Q psy13631 54 AIAIDWDPVEQHLYW 68 (81)
Q Consensus 54 ~~ald~d~~~~~iyW 68 (81)
.+.++||..++.+-|
T Consensus 61 RI~v~y~v~~~kl~~ 75 (75)
T PF10015_consen 61 RIRVEYDVEDGKLVF 75 (75)
T ss_pred EEEEEEEecCCEeeC
Confidence 357889999988876
No 76
>PF06387 Calcyon: D1 dopamine receptor-interacting protein (calcyon); InterPro: IPR009431 This family consists of several D1 dopamine receptor-interacting (calcyon) proteins. D1/D5 dopamine receptors in the basal ganglia, hippocampus, and cerebral cortex modulate motor, reward, and cognitive behaviour. D1-like dopamine receptors likely modulate neocortical and hippocampal neuronal excitability and synaptic function via Ca2+ as well as cAMP-dependent signalling []. Defective calcyon proteins have been implicated in both attention-deficit/hyperactivity disorder (ADHD) [] and schizophrenia.; GO: 0050780 dopamine receptor binding, 0007212 dopamine receptor signaling pathway, 0016021 integral to membrane
Probab=24.20 E-value=39 Score=21.51 Aligned_cols=14 Identities=21% Similarity=0.147 Sum_probs=11.2
Q ss_pred CCCCceEEcCCCCCC
Q psy13631 2 FHFYGVRLVDNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~d~~tC~ 16 (81)
.||+||.+. .+.|.
T Consensus 112 sCPdGFv~k-hk~C~ 125 (186)
T PF06387_consen 112 SCPDGFVLK-HKRCT 125 (186)
T ss_pred cCCCcceee-ccccc
Confidence 499999887 35688
No 77
>KOG4283|consensus
Probab=23.83 E-value=2.7e+02 Score=19.66 Aligned_cols=53 Identities=11% Similarity=0.217 Sum_probs=36.8
Q ss_pred EEEEEeCC---cEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcE
Q psy13631 22 MLLLVQRT---DICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNK 76 (81)
Q Consensus 22 ~Ll~s~~~---~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I 76 (81)
.-|+|.+. .|+.-++.+..+...+. ..-.++.|+++-+..++|..+-...+.|
T Consensus 158 HcLiA~gtr~~~VrLCDi~SGs~sH~Ls--GHr~~vlaV~Wsp~~e~vLatgsaDg~i 213 (397)
T KOG4283|consen 158 HCLIAAGTRDVQVRLCDIASGSFSHTLS--GHRDGVLAVEWSPSSEWVLATGSADGAI 213 (397)
T ss_pred ceEEEEecCCCcEEEEeccCCcceeeec--cccCceEEEEeccCceeEEEecCCCceE
Confidence 45556543 67777787777665443 2345688999999999998876655553
No 78
>smart00449 SPRY Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Probab=23.56 E-value=1.4e+02 Score=16.44 Aligned_cols=24 Identities=8% Similarity=-0.051 Sum_probs=19.7
Q ss_pred cceEEEEEeCCCCEEEEEECCCCc
Q psy13631 52 KHAIAIDWDPVEQHLYWTDDEARN 75 (81)
Q Consensus 52 ~~~~ald~d~~~~~iyW~d~~~~~ 75 (81)
...+++-+|...+.|++.+.....
T Consensus 66 gd~iGv~lD~~~g~l~F~~ng~~~ 89 (122)
T smart00449 66 GDVIGCFLDLEAGTISFYKNGKYL 89 (122)
T ss_pred CCEEEEEEECCCCEEEEEECCCEe
Confidence 668899999999999999876544
No 79
>PRK02260 S-ribosylhomocysteinase; Provisional
Probab=23.12 E-value=42 Score=20.88 Aligned_cols=10 Identities=30% Similarity=0.158 Sum_probs=8.4
Q ss_pred CCCCCceEEc
Q psy13631 1 MFHFYGVRLV 10 (81)
Q Consensus 1 ~~~~~Gy~L~ 10 (81)
|-|.+||.|-
T Consensus 80 MGCrTGFYli 89 (158)
T PRK02260 80 MGCRTGFYLI 89 (158)
T ss_pred CccccccEEE
Confidence 6799999877
No 80
>PHA02639 EEV host range protein; Provisional
Probab=22.90 E-value=43 Score=22.80 Aligned_cols=15 Identities=33% Similarity=0.324 Sum_probs=12.3
Q ss_pred CCCCceEEc--CCCCCC
Q psy13631 2 FHFYGVRLV--DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~--d~~tC~ 16 (81)
.|.+||.|. ...+|.
T Consensus 177 sC~~GY~L~Gs~~~tC~ 193 (295)
T PHA02639 177 SCKSGFDLVGEKYSTCN 193 (295)
T ss_pred EcCCCCeEcCCCcEEEC
Confidence 499999998 467896
No 81
>PF00020 TNFR_c6: TNFR/NGFR cysteine-rich region; InterPro: IPR001368 A number of proteins, some of which are known to be receptors for growth factors have been found to contain a cysteine-rich domain at the N-terminal region that can be subdivided into four (or in some cases, three) repeats containing six conserved cysteines all of which are involved in intrachain disulphide bonds []. CD27 (also called S152 or T14) mediates a co-stimulatory signal for T and B cell activation and is involved in murine T cell development. Tyrosine-phosphorylation of ZAP-70 following CD27 ligation of T cells has been reported [], but not confirmed independently. CD30 was originally identified as Ki-1, an antigen expressed on Reed-Sternberg cells in Hodgkin's lymphomas and other non-Hodgkin's lymphomas, particularly diffuse large-cell lymphoma and immunoblastic lymphoma. CD30 has pleiotropic effects on CD30-positive lymphoma cell lines ranging from cell proliferation to cell death. It is thought to be involved in negative selection of T-cells in the thymus and is involved in TCR-mediated cell death. CD30 is a member of the TNFR family of molecules, activate NFkB through interaction with TRAF2 and TRAF5. CD40 (Bp50) plays a central role in the regulation of cell-mediated immunity as well as antibody mediated immunity. It is central to T cell dependent (TD)-responses and may influence survival of B cell lymphomas. CD95 (also called APO-1, fas antigen, Fas tumour necrosis factor receptor superfamily, member 6, TNFRSF6 or apoptosis antigen 1, APT1) is expressed, typically at high levels, on activated T and B cells. It is involved in the mediation of apoptosis-inducing signals. Other proteins known to belong to this family [, , , ] are, tumour Necrosis Factor type I and type II receptors (TNFR), Rabbit fibroma virus soluble TNF receptor (protein T2), lymphotoxin alpha/beta receptor, low-affinity nerve growth factor receptor (LA-NGFR) (p75), T-cell antigen OX40, Wsl-1, a receptor (for a yet undefined ligand) that mediates apoptosis and Vaccinia virus protein A53 (SalF19R). CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0005488 binding; PDB: 3TJE_F 3QD6_S 3ALQ_U 3IJ2_Y 3BUK_D 1SG1_X 1NCF_B 1EXT_A 1FT4_B 1TNR_R ....
Probab=22.54 E-value=59 Score=14.73 Aligned_cols=14 Identities=21% Similarity=0.112 Sum_probs=9.6
Q ss_pred CCCceEEc-CC-CCCC
Q psy13631 3 HFYGVRLV-DN-FTCA 16 (81)
Q Consensus 3 ~~~Gy~L~-d~-~tC~ 16 (81)
|++|+.++ .+ ..|.
T Consensus 1 C~~g~y~~~~~~~~C~ 16 (39)
T PF00020_consen 1 CPPGTYSDSENHPQCL 16 (39)
T ss_dssp ECTTEEEESSCSSSEE
T ss_pred CccCcccCCCCCCcCC
Confidence 78888887 55 4553
No 82
>KOG1007|consensus
Probab=22.54 E-value=91 Score=21.79 Aligned_cols=47 Identities=17% Similarity=0.207 Sum_probs=25.1
Q ss_pred EEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEE
Q psy13631 22 MLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWT 69 (81)
Q Consensus 22 ~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~ 69 (81)
.+..++...++..++.+......+-- ...+.+..+||++..+++.=+
T Consensus 186 qv~tt~d~tl~~~D~RT~~~~~sI~d-AHgq~vrdlDfNpnkq~~lvt 232 (370)
T KOG1007|consen 186 QVATTSDSTLQFWDLRTMKKNNSIED-AHGQRVRDLDFNPNKQHILVT 232 (370)
T ss_pred eEEEeCCCcEEEEEccchhhhcchhh-hhcceeeeccCCCCceEEEEE
Confidence 34445555566666654322111111 122567788999888876543
No 83
>COG1854 LuxS LuxS protein involved in autoinducer AI2 synthesis [Signal transduction mechanisms]
Probab=22.53 E-value=52 Score=20.54 Aligned_cols=12 Identities=17% Similarity=0.005 Sum_probs=9.4
Q ss_pred CCCCCceEEc-CC
Q psy13631 1 MFHFYGVRLV-DN 12 (81)
Q Consensus 1 ~~~~~Gy~L~-d~ 12 (81)
|-|.+||.|. -|
T Consensus 80 MGCrTGFYm~l~G 92 (161)
T COG1854 80 MGCRTGFYMILIG 92 (161)
T ss_pred cccccceEEEEEC
Confidence 6789999887 54
No 84
>PF02655 ATP-grasp_3: ATP-grasp domain; InterPro: IPR003806 The ATP-grasp fold is one of several distinct ATP-binding folds, and is found in enzymes that catalyze the formation of amide bonds, catalyzing the ATP-dependent ligation of a carboxylate-containing molecule to an amino or thiol group-containing molecule []. This fold is found in many different enzyme families, including various peptide synthetases, biotin carboxylase, synapsin, succinyl-CoA synthetase, pyruvate phosphate dikinase, and glutathione synthetase, amongst others []. These enzymes contribute predominantly to macromolecular synthesis, using ATP-hydrolysis to activate their substrates. The ATP-grasp fold shares functional and structural similarities with the PIPK (phosphatidylinositol phosphate kinase) and protein kinase superfamilies. The ATP-grasp domain consists of two subdomains with different alpha+beta folds, which grasp the ATP molecule between them. Each subdomain provides a variable loop that forms part of the active site, with regions from other domains also contributing to the active site, even though these other domains are not conserved between the various ATP-grasp enzymes []. This entry describes a type of ATP-grasp fold that is found in a set of proteins of unknown function.; GO: 0005524 ATP binding, 0046872 metal ion binding; PDB: 3DF7_A.
Probab=21.98 E-value=82 Score=18.93 Aligned_cols=25 Identities=16% Similarity=0.206 Sum_probs=18.9
Q ss_pred CCccceEEEEEeCCCCEEEEEECCC
Q psy13631 49 VGVKHAIAIDWDPVEQHLYWTDDEA 73 (81)
Q Consensus 49 ~~~~~~~ald~d~~~~~iyW~d~~~ 73 (81)
+++...+++||-..++.+|..+++.
T Consensus 132 ~gl~G~~giD~I~~~~~~~viEINP 156 (161)
T PF02655_consen 132 PGLRGYVGIDFILDDGGPYVIEINP 156 (161)
T ss_dssp TT--EEEEEEEEESS-SEEEEEEES
T ss_pred CCCeeeEeEEEEEeCCcEEEEEEcC
Confidence 4778889999999999999998764
No 85
>TIGR02145 Fib_succ_major Fibrobacter succinogenes major paralogous domain. This domain of about 175 to 200 amino acids is found, in from one to five copies, in over 50 proteins in Fibrobacter succinogenes S85, an obligate anaerobe of the rumen. Many members of this family have an apparent lipoprotein signal sequence. Conserved cysteine residues, suggestive of disulfide bond formation, are also consistent with an extracytoplasmic location for this domain. This domain can also be found in small numbers of proteins in Chlorobium tepidum and Bacteroides thetaiotaomicron.
Probab=21.58 E-value=31 Score=21.54 Aligned_cols=9 Identities=22% Similarity=0.003 Sum_probs=7.6
Q ss_pred CCCCceEEc
Q psy13631 2 FHFYGVRLV 10 (81)
Q Consensus 2 ~~~~Gy~L~ 10 (81)
-||+|+.|-
T Consensus 59 ~cP~GWhlP 67 (171)
T TIGR02145 59 ICPEGWHLP 67 (171)
T ss_pred cCCCCCCCC
Confidence 499999987
No 86
>PF08310 LGFP: LGFP repeat; InterPro: IPR013207 This 54 amino acid repeat is found in many hypothetical proteins. Several hypothetical proteins from Corynebacterium glutamicum (Brevibacterium flavum) and Corynebacterium efficiens along with PS1 protein contain this repeat region. The N-terminal region of PS1 contains an esterase domain which transfers corynomycolic acid. The C-terminal region consists of 4 tandem LGFP repeats. It is hypothesised that the PS1 proteins in Corynebacterium, when associated with the cell wall, may be anchored via the LGFP tandem repeats that may be important for maintaining cell wall integrity. Deletion of Q01377 from SWISSPROT protein results in a 10-fold increase in the cell volume of the organism and infers the corresponding involvement of the protein in the cell shape formation []. The secondary structure of each repeat is predicted to comprise two beta-strands and one alpha-helix.
Probab=21.31 E-value=81 Score=15.48 Aligned_cols=12 Identities=17% Similarity=0.542 Sum_probs=8.8
Q ss_pred CCCCEEEEEECC
Q psy13631 61 PVEQHLYWTDDE 72 (81)
Q Consensus 61 ~~~~~iyW~d~~ 72 (81)
.....|||+...
T Consensus 21 F~~G~Iywsp~t 32 (54)
T PF08310_consen 21 FQNGTIYWSPAT 32 (54)
T ss_pred cCCeEEEEeCCC
Confidence 356899999654
No 87
>PF12860 PAS_7: PAS fold
Probab=20.86 E-value=1.3e+02 Score=16.48 Aligned_cols=19 Identities=16% Similarity=0.669 Sum_probs=12.3
Q ss_pred CccceEEEEEeCCCCEEEEE
Q psy13631 50 GVKHAIAIDWDPVEQHLYWT 69 (81)
Q Consensus 50 ~~~~~~ald~d~~~~~iyW~ 69 (81)
.+...+++ ||....-+||=
T Consensus 3 ~l~~Gv~v-~D~~~rl~~~N 21 (115)
T PF12860_consen 3 SLPQGVAV-FDSDGRLVFWN 21 (115)
T ss_pred CcCceEEE-EcCCCeEEeEc
Confidence 34445555 78888778773
No 88
>PF13619 KTSC: KTSC domain
Probab=20.71 E-value=1.2e+02 Score=15.24 Aligned_cols=16 Identities=19% Similarity=0.403 Sum_probs=10.8
Q ss_pred cceEEEEEeCCCCEEE
Q psy13631 52 KHAIAIDWDPVEQHLY 67 (81)
Q Consensus 52 ~~~~ald~d~~~~~iy 67 (81)
+++.+++||..++.|.
T Consensus 4 s~I~~v~Yd~~~~~L~ 19 (60)
T PF13619_consen 4 SNIRSVGYDPETRTLE 19 (60)
T ss_pred CcccEEeECCCCCEEE
Confidence 4566778887776654
No 89
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=20.12 E-value=59 Score=22.98 Aligned_cols=15 Identities=27% Similarity=0.408 Sum_probs=13.3
Q ss_pred CCCCceEEc-CCCCCC
Q psy13631 2 FHFYGVRLV-DNFTCA 16 (81)
Q Consensus 2 ~~~~Gy~L~-d~~tC~ 16 (81)
.|.+||.|. |+..|.
T Consensus 3 ~C~~gy~~~~~~t~C~ 18 (397)
T PF03302_consen 3 ECTSGYKLSTDKTSCV 18 (397)
T ss_pred cccCCceECCCCCccc
Confidence 489999999 988887
Done!