Query psy3970
Match_columns 104
No_of_seqs 104 out of 245
Neff 4.5
Searched_HMMs 46136
Date Fri Aug 16 22:00:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3970.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3970hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd01396 MeCP2_MBD MeCP2, MBD1, 99.9 1.1E-27 2.3E-32 159.5 6.8 70 24-94 2-75 (77)
2 smart00391 MBD Methyl-CpG bind 99.9 3.3E-26 7.2E-31 152.2 7.2 69 22-91 2-74 (77)
3 PF01429 MBD: Methyl-CpG bindi 99.9 1.1E-25 2.3E-30 148.5 7.3 71 19-89 1-76 (77)
4 cd00122 MBD MeCP2, MBD1, MBD2, 99.9 2.6E-24 5.7E-29 136.9 6.0 57 24-82 2-62 (62)
5 cd01397 HAT_MBD Methyl-CpG bin 99.9 2.2E-24 4.8E-29 143.3 5.5 62 28-90 5-70 (73)
6 KOG4161|consensus 99.8 1.3E-20 2.9E-25 149.4 5.0 88 15-102 5-92 (272)
7 cd01395 HMT_MBD Methyl-CpG bin 99.6 5.1E-16 1.1E-20 99.9 4.2 53 28-82 5-60 (60)
8 KOG4161|consensus 95.0 0.019 4.2E-07 46.0 2.7 82 18-102 166-254 (272)
9 PF00397 WW: WW domain; Inter 86.9 0.86 1.9E-05 24.9 2.7 24 29-59 1-25 (31)
10 PF14030 DUF4245: Protein of u 67.1 14 0.0003 27.4 4.7 38 25-62 59-96 (169)
11 PF12528 DUF3728: Prepilin pep 66.3 13 0.00028 24.9 4.0 35 27-61 40-74 (84)
12 PRK10332 hypothetical protein; 63.7 14 0.00029 26.3 3.9 33 27-60 63-95 (107)
13 smart00456 WW Domain with 2 co 63.2 7.8 0.00017 20.5 2.1 21 29-57 1-21 (32)
14 PF12874 zf-met: Zinc-finger o 60.2 18 0.00039 17.9 3.1 22 52-73 1-22 (25)
15 cd01259 PH_Apbb1ip Apbb1ip (Am 50.9 28 0.00061 25.1 3.8 34 30-72 13-47 (114)
16 KOG1141|consensus 44.5 10 0.00022 36.0 0.9 45 47-91 617-665 (1262)
17 PF00096 zf-C2H2: Zinc finger, 42.7 37 0.00079 16.3 2.5 17 56-72 5-21 (23)
18 PF07624 PSD2: Protein of unkn 42.2 20 0.00043 23.1 1.8 19 56-74 1-19 (76)
19 KOG3259|consensus 41.0 17 0.00037 27.7 1.5 22 28-56 6-27 (163)
20 smart00497 IENR1 Intron encode 38.8 44 0.00096 19.0 2.8 24 50-73 3-29 (53)
21 PF13894 zf-C2H2_4: C2H2-type 38.1 47 0.001 15.4 2.6 16 57-72 6-21 (24)
22 PF12680 SnoaL_2: SnoaL-like d 31.4 77 0.0017 18.9 3.2 26 48-74 21-46 (102)
23 cd00201 WW Two conserved trypt 30.7 51 0.0011 16.8 2.0 19 30-56 1-19 (31)
24 PHA00616 hypothetical protein 26.9 33 0.00071 20.8 0.8 17 57-73 7-23 (44)
25 PF03633 Glyco_hydro_65C: Glyc 25.9 26 0.00055 20.8 0.2 9 27-35 1-9 (54)
26 COG4957 Predicted transcriptio 25.6 34 0.00074 25.7 0.9 37 37-80 66-104 (148)
27 PF14657 Integrase_AP2: AP2-li 24.2 1.4E+02 0.0031 17.1 3.3 25 49-73 4-36 (46)
28 PF07676 PD40: WD40-like Beta 23.9 89 0.0019 16.7 2.2 14 48-61 10-23 (39)
29 PF07453 NUMOD1: NUMOD1 domain 23.8 63 0.0014 17.6 1.6 16 59-74 14-29 (37)
30 KOG1891|consensus 22.9 56 0.0012 26.6 1.7 14 28-41 93-106 (271)
31 KOG1141|consensus 22.4 77 0.0017 30.5 2.7 53 28-85 806-861 (1262)
32 PF12171 zf-C2H2_jaz: Zinc-fin 21.9 33 0.00071 17.6 0.1 21 52-72 2-22 (27)
33 PF10411 DsbC_N: Disulfide bon 21.3 79 0.0017 19.2 1.8 12 50-61 34-45 (57)
34 PF07894 DUF1669: Protein of u 20.2 1.1E+02 0.0023 25.2 2.8 47 21-72 97-143 (284)
No 1
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=99.94 E-value=1.1e-27 Score=159.51 Aligned_cols=70 Identities=44% Similarity=0.743 Sum_probs=64.5
Q ss_pred ccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhC----CCCCcceeeecCcccchhhhh
Q psy3970 24 YECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRG----NIPGVSFDFRSGKLNSLQKHR 94 (104)
Q Consensus 24 ~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~----~l~~e~FdF~~~k~~~~~~~k 94 (104)
.+||+||+||+||+++|++| ++++.||||++|+|++|||+.||++||+. .+++++|||+++++..++..+
T Consensus 2 ~~~~~lp~GW~r~~~~R~~g-s~~k~DvyY~sP~Gkk~RS~~ev~~yL~~~~~~~~~~~~FdF~~~k~~~~~~~~ 75 (77)
T cd01396 2 PEDPRLPPGWKRELVPRKSG-SAGKFDVYYISPTGKKFRSKVELARYLEKNGPTSLDLSDFDFTVPKKLGLGSPR 75 (77)
T ss_pred CCCCCCCCCCEEEEEEecCC-CCCcceEEEECCCCCEEECHHHHHHHHHhCCCCCCcHhHcccCCCcccccccCC
Confidence 57899999999999999999 78999999999999999999999999998 499999999999988776543
No 2
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=99.93 E-value=3.3e-26 Score=152.20 Aligned_cols=69 Identities=42% Similarity=0.729 Sum_probs=63.0
Q ss_pred ccccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC----CCCcceeeecCcccchh
Q psy3970 22 KRYECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN----IPGVSFDFRSGKLNSLQ 91 (104)
Q Consensus 22 ~~~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~----l~~e~FdF~~~k~~~~~ 91 (104)
..+++| ||+||+|++++|++|.++++.||||+||||++|||+.||++||.++ +++++|||+++.++...
T Consensus 2 ~~~~~P-lp~GW~R~~~~r~~g~~~~~~dV~Y~sP~GkklRs~~ev~~YL~~~~~~~~~~~~F~F~~~~~~~~~ 74 (77)
T smart00391 2 DPLRLP-LPCGWRRETKQRKSGRSAGKFDVYYISPCGKKLRSKSELARYLHKNGDLSLDLECFDFNATVPVGPK 74 (77)
T ss_pred CcccCC-CCCCcEEEEEEecCCCCCCcccEEEECCCCCeeeCHHHHHHHHHhCCCcccccccccCcCCcccccc
Confidence 457887 9999999999999999999999999999999999999999999974 78899999999887654
No 3
>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=99.93 E-value=1.1e-25 Score=148.49 Aligned_cols=71 Identities=39% Similarity=0.734 Sum_probs=60.1
Q ss_pred cccccccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC-----CCCcceeeecCcccc
Q psy3970 19 PTKKRYECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN-----IPGVSFDFRSGKLNS 89 (104)
Q Consensus 19 ~~~~~~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~-----l~~e~FdF~~~k~~~ 89 (104)
|+.....+++||+||+||+++|++|++++++||||+||+|++|||+.||.+||..+ +++++|+|.+.+++.
T Consensus 1 ~~~~~~~~~~Lp~GW~re~~~R~~g~~~~~~dv~Y~sP~Gk~~RS~~eV~~yL~~~~~~~~l~~~~F~F~~~~~~~ 76 (77)
T PF01429_consen 1 IHRISPLDPPLPDGWKREVVVRKSGSSAGKKDVYYYSPCGKRFRSKKEVVRYLKENPSEHDLKPENFSFSKRLIML 76 (77)
T ss_dssp --SSECEBTTSTTT-EEEEEESSSSTTTTSEEEEEEETTSEEESSHHHHHHHHTTSS---SS-CTTBBTTTTB---
T ss_pred CCccccccCCCCCCCEEEEEEecCCCcCCceEEEEECCCCCEEeCHHHHHHHHHhCCCcccCCHhHCCCCCCcccC
Confidence 35567788899999999999999999999999999999999999999999999874 679999999887653
No 4
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=99.90 E-value=2.6e-24 Score=136.86 Aligned_cols=57 Identities=51% Similarity=0.938 Sum_probs=52.7
Q ss_pred ccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC----CCCcceee
Q psy3970 24 YECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN----IPGVSFDF 82 (104)
Q Consensus 24 ~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~----l~~e~FdF 82 (104)
+.+ |+|+||+|++++|++| ++++.||||++|||++|||+.||++||.++ +++++|||
T Consensus 2 l~~-P~p~GW~R~~~~r~~g-~~~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~~~~~l~~~~F~F 62 (62)
T cd00122 2 LRD-PLPPGWKRELVIRKSG-SAGKGDVYYYSPCGKKLRSKPEVARYLEKTGPSSLDLENFSF 62 (62)
T ss_pred CCC-CCCCCeEEEEEEcCCC-CCCcceEEEECCCCceecCHHHHHHHHHhCCCCCCcHHHCCC
Confidence 345 4699999999999999 779999999999999999999999999974 99999998
No 5
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=99.90 E-value=2.2e-24 Score=143.35 Aligned_cols=62 Identities=34% Similarity=0.554 Sum_probs=57.5
Q ss_pred CCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhC----CCCCcceeeecCcccch
Q psy3970 28 QLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRG----NIPGVSFDFRSGKLNSL 90 (104)
Q Consensus 28 ~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~----~l~~e~FdF~~~k~~~~ 90 (104)
||+.||+||+++|+.| ++.++|||||+|||||+||++||++||.. .+++|||+|++..++++
T Consensus 5 Pl~~GW~Re~vir~~~-~~~~~dV~Y~aPcGKklRs~~ev~~yL~~~~~~~Lt~dnFsF~~~~~vg~ 70 (73)
T cd01397 5 PLELGWRRETRIRGLG-GRIQGEVAYYAPCGKKLRQYPEVIKYLSKNGISLLSRENFSFSARAPVGD 70 (73)
T ss_pred CCCCCceeEEEeccCC-CCccceEEEECCCCcccccHHHHHHHHHhCCccCccHhHccccCCccccc
Confidence 6999999999999998 67999999999999999999999999996 48999999998887764
No 6
>KOG4161|consensus
Probab=99.81 E-value=1.3e-20 Score=149.37 Aligned_cols=88 Identities=42% Similarity=0.728 Sum_probs=83.8
Q ss_pred cccccccccccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCCCCCcceeeecCcccchhhhh
Q psy3970 15 LKMLPTKKRYECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGNIPGVSFDFRSGKLNSLQKHR 94 (104)
Q Consensus 15 ~~~~~~~~~~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~l~~e~FdF~~~k~~~~~~~k 94 (104)
...+++.++++||.||+||+|++++|++|.++|++||||+||.|++|||+.+++.||+...+...|+|-++++..+..++
T Consensus 5 ~~~~~~~~~~~c~~lp~GW~~~~~~r~~~~~~g~~dv~~~sp~g~~frsk~~l~~~~~~~~~~s~~~~v~~k~~~~~~~~ 84 (272)
T KOG4161|consen 5 SSATIAGKRSDCPALPPGWTREEVQRSSGLSAGKSDVYYISPSGKKFRSKPQLARYLGKVGDLSLFDFVTGKMSPSERQK 84 (272)
T ss_pred cccccccCcccCCCCCCCcchhhhcccCCCcccccceEEeCCcccccccccHHHHHhccccccccCcccccccccccccc
Confidence 45789999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhcccCC
Q psy3970 95 NKQRGNQF 102 (104)
Q Consensus 95 ~k~r~~~~ 102 (104)
++|+.+.+
T Consensus 85 ~~q~~~~~ 92 (272)
T KOG4161|consen 85 NKQRKPED 92 (272)
T ss_pred cCCCCCCc
Confidence 99888754
No 7
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=99.61 E-value=5.1e-16 Score=99.88 Aligned_cols=53 Identities=34% Similarity=0.534 Sum_probs=48.8
Q ss_pred CCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC---CCCcceee
Q psy3970 28 QLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN---IPGVSFDF 82 (104)
Q Consensus 28 ~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~---l~~e~FdF 82 (104)
||-.||+|+...|+.|+ .+.||+|.+||||++|+..||++||.++ +++|+|+|
T Consensus 5 Pll~gw~R~~~~~~~~~--~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t~~~L~~d~FsF 60 (60)
T cd01395 5 PLLCGFQRMKYRARVGK--VKKHVIYKAPCGRSLRNMSEVHRYLRETCSFLTVDNFSF 60 (60)
T ss_pred ccccCeEEEEEeccCCC--cccceEEECCcchhhhcHHHHHHHHHhccccceeecccC
Confidence 67899999999998873 5899999999999999999999999975 88999998
No 8
>KOG4161|consensus
Probab=94.99 E-value=0.019 Score=45.97 Aligned_cols=82 Identities=17% Similarity=0.174 Sum_probs=66.3
Q ss_pred ccccccccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC-------CCCcceeeecCcccch
Q psy3970 18 LPTKKRYECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN-------IPGVSFDFRSGKLNSL 90 (104)
Q Consensus 18 ~~~~~~~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~-------l~~e~FdF~~~k~~~~ 90 (104)
..+.+.+..+.-+.||.+....+..- ...+.||..|+++.+++..+++.++..+ ..+.+|.|....+..+
T Consensus 166 ~e~~d~~~l~~~~~g~~~~~~~~s~~---~~~~~~~~~p~~~~~~~~~~~~~~~~~n~~~~~~~p~l~~~~~~~~~ir~~ 242 (272)
T KOG4161|consen 166 GESIDKLSLPKTPQGSGRSSAGESLL---SSVATYLETPSGKKHGESPEAVAWKNANGPSETEQPLLGDFIVTEPDIRRQ 242 (272)
T ss_pred ccccCccccCcCCCccCccccccccc---cccCcccccCCCcccccchhhhhcccCCCCCcccCCCcccccccCCCcCcc
Confidence 45566677778899999988666543 5789999999999999999999998764 5778999999999999
Q ss_pred hhhhhhhcccCC
Q psy3970 91 QKHRNKQRGNQF 102 (104)
Q Consensus 91 ~~~k~k~r~~~~ 102 (104)
..+..+.|++++
T Consensus 243 ~~~~~~~r~~~~ 254 (272)
T KOG4161|consen 243 ESRVKNVRRSLF 254 (272)
T ss_pred ccchhhhhhccc
Confidence 887666565543
No 9
>PF00397 WW: WW domain; InterPro: IPR001202 Synonym(s): Rsp5 or WWP domain The WW domain is a short conserved region in a number of unrelated proteins, which folds as a stable, triple stranded beta-sheet. This short domain of approximately 40 amino acids, may be repeated up to four times in some proteins [, , , ]. The name WW or WWP derives from the presence of two signature tryptophan residues that are spaced 20-23 amino acids apart and are present in most WW domains known to date, as well as that of a conserved Pro. The WW domain binds to proteins with particular proline-motifs, [AP]-P-P-[AP]-Y, and/or phosphoserine- phosphothreonine-containing motifs [, ]. It is frequently associated with other domains typical for proteins in signal transduction processes. A large variety of proteins containing the WW domain are known. These include; dystrophin, a multidomain cytoskeletal protein; utrophin, a dystrophin-like protein of unknown function; vertebrate YAP protein, substrate of an unknown serine kinase; Mus musculus (Mouse) NEDD-4, involved in the embryonic development and differentiation of the central nervous system; Saccharomyces cerevisiae (Baker's yeast) RSP5, similar to NEDD-4 in its molecular organisation; Rattus norvegicus (Rat) FE65, a transcription-factor activator expressed preferentially in liver; Nicotiana tabacum (Common tobacco) DB10 protein, amongst others.; GO: 0005515 protein binding; PDB: 2JXW_A 2DK1_A 2JOC_A 2JO9_A 1YIU_A 1O6W_A 2JMF_A 1TK7_A 2KYK_A 2L5F_A ....
Probab=86.92 E-value=0.86 Score=24.89 Aligned_cols=24 Identities=33% Similarity=0.699 Sum_probs=15.0
Q ss_pred CCCCcEEEEEEccCCCCCCceeEEEEcC-CCc
Q psy3970 29 LPPGWYREEVTRNKGLSVGKVDVFYYSP-CGK 59 (104)
Q Consensus 29 Lp~GW~Rev~~R~~g~s~gk~DVYY~sP-~Gk 59 (104)
||+||+...- .. .+.+|||+. +|.
T Consensus 1 LP~gW~~~~~--~~-----~g~~YY~N~~t~~ 25 (31)
T PF00397_consen 1 LPPGWEEYFD--PD-----SGRPYYYNHETGE 25 (31)
T ss_dssp SSTTEEEEEE--TT-----TSEEEEEETTTTE
T ss_pred CCcCCEEEEc--CC-----CCCEEEEeCCCCC
Confidence 7999973332 11 346899987 443
No 10
>PF14030 DUF4245: Protein of unknown function (DUF4245)
Probab=67.06 E-value=14 Score=27.45 Aligned_cols=38 Identities=29% Similarity=0.498 Sum_probs=29.5
Q ss_pred cCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceec
Q psy3970 25 ECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLR 62 (104)
Q Consensus 25 ~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfR 62 (104)
..|.||+||+---..|........+.|=|++|.|.-.+
T Consensus 59 ~~P~lP~gW~~nSar~~~~~g~~~w~vG~vt~~~~yv~ 96 (169)
T PF14030_consen 59 RAPELPEGWKANSARRQGVGGVPAWHVGYVTPDGQYVQ 96 (169)
T ss_pred eCCCCCCCceeeeEEecCCCCcceEEEEEEcCCCCEEE
Confidence 35789999988888666554446899999999987654
No 11
>PF12528 DUF3728: Prepilin peptidase dependent protein C (DUF3728); InterPro: IPR022204 This family of proteins is found in bacteria. Proteins in this family are typically between 106 and 121 amino acids in length. The family is found in association with PF07963 from PFAM. There are two completely conserved C residues that may be functionally important. This family is frequently annotated as prepilin peptidase dependent protein C however there is little accompanying literature to confirm this.
Probab=66.27 E-value=13 Score=24.91 Aligned_cols=35 Identities=17% Similarity=0.302 Sum_probs=28.3
Q ss_pred CCCCCCcEEEEEEccCCCCCCceeEEEEcCCCcee
Q psy3970 27 PQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKL 61 (104)
Q Consensus 27 p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~Gkkf 61 (104)
+.+|+||+.+.........+...-|-..+|.|+..
T Consensus 40 ~~~~~gWq~~~~~~~~~~~C~~itvtv~tP~~~~a 74 (84)
T PF12528_consen 40 SGPPPGWQYSRQQTSIQGGCRSITVTVTTPQNQQA 74 (84)
T ss_pred cCCCCCceeeeeeeccCCCeEEEEEEEecCCCccc
Confidence 36899999998888755567788899999987754
No 12
>PRK10332 hypothetical protein; Provisional
Probab=63.73 E-value=14 Score=26.32 Aligned_cols=33 Identities=18% Similarity=0.347 Sum_probs=25.5
Q ss_pred CCCCCCcEEEEEEccCCCCCCceeEEEEcCCCce
Q psy3970 27 PQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKK 60 (104)
Q Consensus 27 p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~Gkk 60 (104)
+++|+||+.+-.+. ....+....|-..+|.|++
T Consensus 63 ~~~~~gWq~~~~~~-~~~gC~~itv~v~~P~~~~ 95 (107)
T PRK10332 63 ISPPANWQVNRMQT-SQAGCVSISVTLVSPGGRQ 95 (107)
T ss_pred CCCCCCceeeeeec-CCCCcEEEEEEEecCCCch
Confidence 57899998876666 3345788999999998765
No 13
>smart00456 WW Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Probab=63.22 E-value=7.8 Score=20.47 Aligned_cols=21 Identities=33% Similarity=0.779 Sum_probs=13.6
Q ss_pred CCCCcEEEEEEccCCCCCCceeEEEEcCC
Q psy3970 29 LPPGWYREEVTRNKGLSVGKVDVFYYSPC 57 (104)
Q Consensus 29 Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~ 57 (104)
||.||....-. .+.+||++..
T Consensus 1 lp~gW~~~~~~--------~g~~yy~n~~ 21 (32)
T smart00456 1 LPPGWEERKDP--------DGRPYYYNHE 21 (32)
T ss_pred CCCCCEEEECC--------CCCEEEEECC
Confidence 68999654431 1468998763
No 14
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=60.22 E-value=18 Score=17.91 Aligned_cols=22 Identities=27% Similarity=0.613 Sum_probs=19.1
Q ss_pred EEEcCCCceeccHHHHHHHHhC
Q psy3970 52 FYYSPCGKKLRTRSEVLQVIRG 73 (104)
Q Consensus 52 YY~sP~GkkfRSk~eV~ryL~~ 73 (104)
||-+.+++.|.|...+...|.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 6778899999999999988764
No 15
>cd01259 PH_Apbb1ip Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip consists of a Ras-associated domain and a PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=50.87 E-value=28 Score=25.11 Aligned_cols=34 Identities=24% Similarity=0.379 Sum_probs=25.7
Q ss_pred CCCc-EEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHh
Q psy3970 30 PPGW-YREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIR 72 (104)
Q Consensus 30 p~GW-~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~ 72 (104)
-..| ++-.++|.+| .||+|-| +-++..+|+.+..
T Consensus 13 kKsWKk~~f~LR~SG--------LYy~~Kg-ksk~srdL~cl~~ 47 (114)
T cd01259 13 KKSWKKYYFVLRSSG--------LYYFPKE-KTKNTRDLACLNL 47 (114)
T ss_pred CccceEEEEEEeCCe--------eEEccCC-CcCCHHHHHHHHh
Confidence 3579 6678899998 4888888 4567788887765
No 16
>KOG1141|consensus
Probab=44.50 E-value=10 Score=36.01 Aligned_cols=45 Identities=24% Similarity=0.341 Sum_probs=36.9
Q ss_pred CceeEEEEcCCCceeccHHHHHHHHhCC----CCCcceeeecCcccchh
Q psy3970 47 GKVDVFYYSPCGKKLRTRSEVLQVIRGN----IPGVSFDFRSGKLNSLQ 91 (104)
Q Consensus 47 gk~DVYY~sP~GkkfRSk~eV~ryL~~~----l~~e~FdF~~~k~~~~~ 91 (104)
.-.-|-|-.|||.-+|.+.||.|||-.+ +-++.|+|..-.+...+
T Consensus 617 ~~fhv~yktpcg~~lr~~~el~ryL~et~c~flf~~~f~~~~yV~~~r~ 665 (1262)
T KOG1141|consen 617 RDFHVEYKTPCGMPLRMRIELYRYLVETRCKFLFVIGFDRAFYVVRHRA 665 (1262)
T ss_pred hcceeeccCCCccchHHHHHHHHHHHHhcCcEEEEeecccchheeeccc
Confidence 4567899999999999999999999864 67788888876665554
No 17
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=42.67 E-value=37 Score=16.35 Aligned_cols=17 Identities=29% Similarity=0.673 Sum_probs=14.4
Q ss_pred CCCceeccHHHHHHHHh
Q psy3970 56 PCGKKLRTRSEVLQVIR 72 (104)
Q Consensus 56 P~GkkfRSk~eV~ryL~ 72 (104)
-+|+.|.++.++.+.+.
T Consensus 5 ~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 5 ICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp TTTEEESSHHHHHHHHH
T ss_pred CCCCccCCHHHHHHHHh
Confidence 38999999999988764
No 18
>PF07624 PSD2: Protein of unknown function (DUF1585); InterPro: IPR011478 This entry represents a conserved region at the C terminus of a family of cytochrome-like proteins found in bacteria such as Rhodopirellula baltica and Solibacter usitatus. These proteins also contain IPR013036 from INTERPRO, IPR013039 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=42.25 E-value=20 Score=23.13 Aligned_cols=19 Identities=21% Similarity=0.345 Sum_probs=17.7
Q ss_pred CCCceeccHHHHHHHHhCC
Q psy3970 56 PCGKKLRTRSEVLQVIRGN 74 (104)
Q Consensus 56 P~GkkfRSk~eV~ryL~~~ 74 (104)
|+|..|....|+..+|.++
T Consensus 1 pdG~~f~~~~eLk~~L~~~ 19 (76)
T PF07624_consen 1 PDGTSFEGAAELKQYLAER 19 (76)
T ss_pred CCCCccCCHHHHHHHHHHC
Confidence 8999999999999999875
No 19
>KOG3259|consensus
Probab=41.03 E-value=17 Score=27.71 Aligned_cols=22 Identities=32% Similarity=0.881 Sum_probs=14.6
Q ss_pred CCCCCcEEEEEEccCCCCCCceeEEEEcC
Q psy3970 28 QLPPGWYREEVTRNKGLSVGKVDVFYYSP 56 (104)
Q Consensus 28 ~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP 56 (104)
.||+||.+..- |. ..-+|||++
T Consensus 6 ~LP~~Wekr~S-rs------~gr~YyfN~ 27 (163)
T KOG3259|consen 6 KLPPGWEKRMS-RS------SGRPYYFNT 27 (163)
T ss_pred cCCchhheecc-cc------CCCcceecc
Confidence 69999955432 33 345799976
No 20
>smart00497 IENR1 Intron encoded nuclease repeat motif. Repeat of unknown function, but possibly DNA-binding via helix-turn-helix motif (Ponting, unpublished).
Probab=38.77 E-value=44 Score=19.04 Aligned_cols=24 Identities=17% Similarity=0.265 Sum_probs=19.0
Q ss_pred eEEEEcCCCc---eeccHHHHHHHHhC
Q psy3970 50 DVFYYSPCGK---KLRTRSEVLQVIRG 73 (104)
Q Consensus 50 DVYY~sP~Gk---kfRSk~eV~ryL~~ 73 (104)
-||-++.+|. .|.|..|.+++|+-
T Consensus 3 ~V~~~d~~~~~i~~f~S~~eAa~~lg~ 29 (53)
T smart00497 3 PVYVYDLDGNLIGEFSSIREAAKYLGI 29 (53)
T ss_pred cEEEEeCCCCEEEEecCHHHHHHHhCC
Confidence 3677777763 59999999999985
No 21
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=38.07 E-value=47 Score=15.40 Aligned_cols=16 Identities=50% Similarity=0.949 Sum_probs=13.2
Q ss_pred CCceeccHHHHHHHHh
Q psy3970 57 CGKKLRTRSEVLQVIR 72 (104)
Q Consensus 57 ~GkkfRSk~eV~ryL~ 72 (104)
+|+.|+++.++...+.
T Consensus 6 C~~~~~~~~~l~~H~~ 21 (24)
T PF13894_consen 6 CGKSFRSKSELRQHMR 21 (24)
T ss_dssp TS-EESSHHHHHHHHH
T ss_pred CCCcCCcHHHHHHHHH
Confidence 8999999999988764
No 22
>PF12680 SnoaL_2: SnoaL-like domain; PDB: 3F40_A 3RGA_A 3G8Z_A 3DMC_A 3FH1_A 1TUH_A 3F14_A 3ER7_A 1Z1S_A 3F7X_A ....
Probab=31.39 E-value=77 Score=18.88 Aligned_cols=26 Identities=23% Similarity=0.411 Sum_probs=21.7
Q ss_pred ceeEEEEcCCCceeccHHHHHHHHhCC
Q psy3970 48 KVDVFYYSPCGKKLRTRSEVLQVIRGN 74 (104)
Q Consensus 48 k~DVYY~sP~GkkfRSk~eV~ryL~~~ 74 (104)
.-|+.|..| +..++.+.++..++...
T Consensus 21 ~~d~~~~~~-~~~~~g~~~~~~~~~~~ 46 (102)
T PF12680_consen 21 APDAVFHDP-GGTLRGREAIREFFEEF 46 (102)
T ss_dssp EEEEEEEET-TSEEESHHHHHHHHHHH
T ss_pred CCCEEEEeC-CCcccCHHHHHHHHHHH
Confidence 358889999 66699999999999863
No 23
>cd00201 WW Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Probab=30.70 E-value=51 Score=16.80 Aligned_cols=19 Identities=37% Similarity=0.838 Sum_probs=11.9
Q ss_pred CCCcEEEEEEccCCCCCCceeEEEEcC
Q psy3970 30 PPGWYREEVTRNKGLSVGKVDVFYYSP 56 (104)
Q Consensus 30 p~GW~Rev~~R~~g~s~gk~DVYY~sP 56 (104)
|.||...... .+.+||++.
T Consensus 1 p~~W~~~~~~--------~g~~yy~n~ 19 (31)
T cd00201 1 PPGWEERWDP--------DGRVYYYNH 19 (31)
T ss_pred CCCCEEEECC--------CCCEEEEEC
Confidence 6789644331 146888876
No 24
>PHA00616 hypothetical protein
Probab=26.92 E-value=33 Score=20.77 Aligned_cols=17 Identities=29% Similarity=0.686 Sum_probs=16.1
Q ss_pred CCceeccHHHHHHHHhC
Q psy3970 57 CGKKLRTRSEVLQVIRG 73 (104)
Q Consensus 57 ~GkkfRSk~eV~ryL~~ 73 (104)
||+.|+.+.++.+++..
T Consensus 7 CG~~F~~~s~l~~H~r~ 23 (44)
T PHA00616 7 CGGIFRKKKEVIEHLLS 23 (44)
T ss_pred hhHHHhhHHHHHHHHHH
Confidence 89999999999999976
No 25
>PF03633 Glyco_hydro_65C: Glycosyl hydrolase family 65, C-terminal domain ; InterPro: IPR005194 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This family of glycosyl hydrolases (GH65 from CAZY) contains this domain and includes vacuolar acid trehalase and maltose phosphorylases. Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phosphate into beta-D-glucose-1-phosphate and glucose. The C-terminal domain forms a two layered jelly roll motif. This domain is situated at the base of the catalytic domain, however its function remains unknown [].; PDB: 1H54_A.
Probab=25.90 E-value=26 Score=20.81 Aligned_cols=9 Identities=44% Similarity=1.154 Sum_probs=3.0
Q ss_pred CCCCCCcEE
Q psy3970 27 PQLPPGWYR 35 (104)
Q Consensus 27 p~Lp~GW~R 35 (104)
|.||.+|.+
T Consensus 1 P~LP~~w~~ 9 (54)
T PF03633_consen 1 PRLPKQWSS 9 (54)
T ss_dssp ----TT-SE
T ss_pred CcCCCccCE
Confidence 679999954
No 26
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=25.56 E-value=34 Score=25.70 Aligned_cols=37 Identities=11% Similarity=0.152 Sum_probs=23.8
Q ss_pred EEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC--CCCcce
Q psy3970 37 EVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN--IPGVSF 80 (104)
Q Consensus 37 v~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~--l~~e~F 80 (104)
+-+|++ .+-|-..+=-+||+|.| |.|+|..+ +++++|
T Consensus 66 V~~kKS----VtpD~IicLEDGkkfKS---LKRHL~t~~gmTPd~Y 104 (148)
T COG4957 66 VPIKKS----VTPDYIICLEDGKKFKS---LKRHLTTHYGLTPDEY 104 (148)
T ss_pred CCcccc----cCCCeEEEeccCcchHH---HHHHHhcccCCCHHHH
Confidence 344554 35565555569999987 67899874 455443
No 27
>PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain
Probab=24.23 E-value=1.4e+02 Score=17.09 Aligned_cols=25 Identities=24% Similarity=0.503 Sum_probs=17.0
Q ss_pred eeEE-EEcCCCce-------eccHHHHHHHHhC
Q psy3970 49 VDVF-YYSPCGKK-------LRTRSEVLQVIRG 73 (104)
Q Consensus 49 ~DVY-Y~sP~Gkk-------fRSk~eV~ryL~~ 73 (104)
..|| |...+|++ |+++.|...++..
T Consensus 4 ~~v~g~~~~~Gkrk~~~k~GF~TkkeA~~~~~~ 36 (46)
T PF14657_consen 4 YRVYGYDDETGKRKQKTKRGFKTKKEAEKALAK 36 (46)
T ss_pred EEEEEEECCCCCEEEEEcCCCCcHHHHHHHHHH
Confidence 3454 44448866 8888888888754
No 28
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=23.91 E-value=89 Score=16.67 Aligned_cols=14 Identities=29% Similarity=0.285 Sum_probs=8.1
Q ss_pred ceeEEEEcCCCcee
Q psy3970 48 KVDVFYYSPCGKKL 61 (104)
Q Consensus 48 k~DVYY~sP~Gkkf 61 (104)
.----.+||+|+++
T Consensus 10 ~~~~p~~SpDGk~i 23 (39)
T PF07676_consen 10 DDGSPAWSPDGKYI 23 (39)
T ss_dssp SEEEEEE-TTSSEE
T ss_pred cccCEEEecCCCEE
Confidence 34445678888765
No 29
>PF07453 NUMOD1: NUMOD1 domain; InterPro: IPR010896 This helix-turn-helix-containing DNA-binding domain is found associated in homing nucleases [].
Probab=23.78 E-value=63 Score=17.64 Aligned_cols=16 Identities=13% Similarity=0.106 Sum_probs=14.0
Q ss_pred ceeccHHHHHHHHhCC
Q psy3970 59 KKLRTRSEVLQVIRGN 74 (104)
Q Consensus 59 kkfRSk~eV~ryL~~~ 74 (104)
+.|.|..|.+++|+-+
T Consensus 14 ~~F~Si~eAa~~l~i~ 29 (37)
T PF07453_consen 14 KSFDSIREAARYLGIS 29 (37)
T ss_pred EEEcCHHHHHHHhCCC
Confidence 7899999999999753
No 30
>KOG1891|consensus
Probab=22.87 E-value=56 Score=26.63 Aligned_cols=14 Identities=50% Similarity=0.919 Sum_probs=12.1
Q ss_pred CCCCCcEEEEEEcc
Q psy3970 28 QLPPGWYREEVTRN 41 (104)
Q Consensus 28 ~Lp~GW~Rev~~R~ 41 (104)
|||+||-.+..++.
T Consensus 93 PLPpgWav~~T~~g 106 (271)
T KOG1891|consen 93 PLPPGWAVEFTTEG 106 (271)
T ss_pred CCCCCcceeeEecC
Confidence 89999999988774
No 31
>KOG1141|consensus
Probab=22.36 E-value=77 Score=30.49 Aligned_cols=53 Identities=21% Similarity=0.295 Sum_probs=38.8
Q ss_pred CCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHhCC---CCCcceeeecC
Q psy3970 28 QLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIRGN---IPGVSFDFRSG 85 (104)
Q Consensus 28 ~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~~~---l~~e~FdF~~~ 85 (104)
+.-.||.|...-=..| +-..|+.|+|--.+...|+..|+..+ ++|++|+|..+
T Consensus 806 t~~kGWg~rclddi~~-----g~fVciy~g~~l~~~~sdks~~~~~~~~~~~id~~~f~~~ 861 (1262)
T KOG1141|consen 806 TIHKGWGRRCLDDITG-----GNFVCIYPGGALLHQISDKSEYIHVTRSLLTIDCFSFDAR 861 (1262)
T ss_pred ccccccceEeeeecCC-----ceEEEEecchhhhhhhchhhhhcccchhhhcccccchhcc
Confidence 4557997665544433 25677889999999999999888754 78888888754
No 32
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=21.93 E-value=33 Score=17.60 Aligned_cols=21 Identities=24% Similarity=0.630 Sum_probs=16.5
Q ss_pred EEEcCCCceeccHHHHHHHHh
Q psy3970 52 FYYSPCGKKLRTRSEVLQVIR 72 (104)
Q Consensus 52 YY~sP~GkkfRSk~eV~ryL~ 72 (104)
||...|++.|.+...+...+.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 567789999999988776554
No 33
>PF10411 DsbC_N: Disulfide bond isomerase protein N-terminus; InterPro: IPR018950 This is the N-terminal domain of the disulphide bond isomerase DsbC. The whole molecule is V-shaped, where each arm is a DsbC monomer of two domains linked by a hinge; and the N-termini of each monomer join to form the dimer interface at the base of the V, so are vital for dimerisation []. DsbC is required for disulphide bond formation and functions as a disulphide bond isomerase during oxidative protein-folding in bacterial periplasm. It also has chaperone activity []. ; PDB: 1EEJ_B 2IYJ_A 1TJD_A 1JZD_B 1JZO_A 1G0T_B 1T3B_A.
Probab=21.28 E-value=79 Score=19.23 Aligned_cols=12 Identities=25% Similarity=0.553 Sum_probs=10.4
Q ss_pred eEEEEcCCCcee
Q psy3970 50 DVFYYSPCGKKL 61 (104)
Q Consensus 50 DVYY~sP~Gkkf 61 (104)
.++|++++|+-|
T Consensus 34 ~i~Y~~~dg~yl 45 (57)
T PF10411_consen 34 GILYVDEDGRYL 45 (57)
T ss_dssp EEEEEETTSSEE
T ss_pred eEEEEcCCCCEE
Confidence 599999999765
No 34
>PF07894 DUF1669: Protein of unknown function (DUF1669); InterPro: IPR012461 This family is composed of sequences derived from hypothetical eukaryotic proteins of unknown function. Some members of this family are annotated as being potential phospholipases but no literature was found to support this.
Probab=20.16 E-value=1.1e+02 Score=25.20 Aligned_cols=47 Identities=23% Similarity=0.406 Sum_probs=29.6
Q ss_pred cccccCCCCCCCcEEEEEEccCCCCCCceeEEEEcCCCceeccHHHHHHHHh
Q psy3970 21 KKRYECPQLPPGWYREEVTRNKGLSVGKVDVFYYSPCGKKLRTRSEVLQVIR 72 (104)
Q Consensus 21 ~~~~~~p~Lp~GW~Rev~~R~~g~s~gk~DVYY~sP~GkkfRSk~eV~ryL~ 72 (104)
..-.+.|.|..||--.-..| | ...+.||+..|. -+--+-+||+|-+-
T Consensus 97 ~SD~~~P~LdLGWP~~~~~~--g--~Tr~~vy~qPp~-~~~p~IKE~vR~~I 143 (284)
T PF07894_consen 97 QSDTEPPPLDLGWPETPSYK--G--VTRATVYFQPPK-DGQPHIKEVVRRMI 143 (284)
T ss_pred cCCCCCCCCCCCCCCCCccc--C--CceEEEEeCCCC-CCCCCHHHHHHHHH
Confidence 44556789999996632222 2 247888888876 44556666666543
Done!