Query psy8172
Match_columns 98
No_of_seqs 26 out of 28
Neff 2.4
Searched_HMMs 46136
Date Fri Aug 16 20:28:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8172.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8172hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00391 MBD Methyl-CpG bind 99.5 2.5E-14 5.5E-19 94.4 4.7 56 20-82 8-73 (77)
2 cd01397 HAT_MBD Methyl-CpG bin 98.7 1.5E-08 3.2E-13 67.9 4.2 54 20-80 6-68 (73)
3 PF01429 MBD: Methyl-CpG bindi 98.6 6E-08 1.3E-12 63.0 4.3 54 20-79 11-74 (77)
4 cd00122 MBD MeCP2, MBD1, MBD2, 98.6 7.3E-08 1.6E-12 60.4 4.1 36 21-56 7-51 (62)
5 cd01396 MeCP2_MBD MeCP2, MBD1, 98.2 1.8E-06 3.8E-11 57.2 4.0 51 20-77 7-66 (77)
6 cd01395 HMT_MBD Methyl-CpG bin 97.9 1.1E-05 2.5E-10 52.2 3.0 34 22-55 8-49 (60)
7 KOG4161|consensus 93.0 0.061 1.3E-06 42.7 1.9 39 19-57 18-66 (272)
8 PF12043 DUF3527: Domain of un 78.5 1.8 3.9E-05 36.3 2.5 47 12-68 227-276 (346)
9 PF12104 Tcell_CD4_Cterm: T ce 68.5 2.8 6.1E-05 24.3 1.0 19 43-61 5-23 (28)
10 cd03142 GATase1_ThuA Type 1 gl 62.4 7.9 0.00017 29.8 2.7 34 20-53 173-209 (215)
11 PF07624 PSD2: Protein of unkn 60.0 8.4 0.00018 24.7 2.1 19 37-55 1-19 (76)
12 PF06085 Rz1: Lipoprotein Rz1 59.1 4 8.8E-05 25.3 0.5 29 9-40 10-39 (40)
13 cd03078 GST_N_Metaxin1_like GS 57.4 14 0.00031 23.2 2.8 28 29-56 45-73 (73)
14 PF06283 ThuA: Trehalose utili 53.1 7.5 0.00016 28.1 1.2 26 14-39 169-195 (217)
15 cd00525 AE_Prim_S_like AE_Prim 48.6 31 0.00067 22.8 3.5 38 41-78 22-65 (136)
16 KOG3859|consensus 44.5 13 0.00028 32.1 1.4 17 33-49 153-169 (406)
17 PF01473 CW_binding_1: Putativ 42.7 20 0.00042 17.6 1.4 18 22-40 1-18 (19)
18 cd01774 Faf1_like2_UBX Faf1 ik 42.5 58 0.0013 21.5 4.1 44 35-84 10-61 (85)
19 PF12677 DUF3797: Domain of un 37.2 14 0.0003 23.7 0.4 7 56-62 40-46 (49)
20 PF08671 SinI: Anti-repressor 33.9 31 0.00068 19.7 1.5 11 44-54 18-28 (30)
21 PF10568 Tom37: Outer mitochon 33.5 40 0.00087 21.6 2.2 38 16-53 33-72 (72)
22 PF10379 nec1: Virulence prote 33.0 24 0.00051 27.6 1.2 20 69-88 156-175 (184)
23 smart00504 Ubox Modified RING 32.9 51 0.0011 19.1 2.4 25 33-58 14-38 (63)
24 COG3828 Uncharacterized protei 32.3 31 0.00068 28.1 1.8 30 14-43 167-196 (239)
25 PF00397 WW: WW domain; Inter 31.6 40 0.00087 18.2 1.7 18 21-38 2-22 (31)
26 smart00034 CLECT C-type lectin 31.2 66 0.0014 19.4 2.7 31 21-56 2-33 (126)
27 PF10281 Ish1: Putative stress 31.0 44 0.00096 18.9 1.8 14 43-56 4-17 (38)
28 PF11307 DUF3109: Protein of u 30.8 12 0.00025 29.3 -0.8 12 65-76 126-137 (183)
29 cd00173 SH2 Src homology 2 dom 30.7 1E+02 0.0022 18.7 3.6 33 33-66 58-90 (94)
30 PHA01635 hypothetical protein 30.7 1.3E+02 0.0029 24.5 5.0 61 13-76 125-192 (231)
31 cd00246 RabGEF Nucleotide exch 30.2 30 0.00064 24.9 1.2 37 1-37 54-97 (103)
32 PF13409 GST_N_2: Glutathione 29.8 40 0.00086 20.3 1.6 31 25-55 38-70 (70)
33 smart00456 WW Domain with 2 co 29.6 37 0.00081 17.7 1.3 18 20-37 1-20 (32)
34 cd03593 CLECT_NK_receptors_lik 26.6 1.1E+02 0.0025 19.0 3.3 31 21-56 2-33 (116)
35 COG3139 Uncharacterized protei 25.5 13 0.00029 26.4 -1.2 22 18-44 9-35 (90)
36 PF02943 FeThRed_B: Ferredoxin 24.3 30 0.00064 24.6 0.3 11 54-64 77-87 (108)
37 TIGR01204 bioW 6-carboxyhexano 23.8 1.2E+02 0.0025 24.5 3.6 34 18-55 184-227 (232)
38 PF00799 Gemini_AL1: Geminivir 22.6 48 0.001 23.5 1.1 23 35-57 81-103 (114)
39 PF13417 GST_N_3: Glutathione 22.1 47 0.001 20.0 0.9 31 24-54 38-69 (75)
40 PF03128 CXCXC: CXCXC repeat; 21.8 38 0.00083 16.3 0.4 8 54-61 5-12 (14)
No 1
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=99.50 E-value=2.5e-14 Score=94.38 Aligned_cols=56 Identities=23% Similarity=0.398 Sum_probs=50.8
Q ss_pred ecccceeeec---------cce-eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCCccc
Q psy8172 20 TTTDYTSEVG---------FWK-QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRERA 82 (98)
Q Consensus 20 vp~gw~r~v~---------k~~-~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~~P 82 (98)
+|.||+|++- ++- +|+||+|..|+|..||..||...|.|+++++| |+|++.++.-+
T Consensus 8 lp~GW~R~~~~r~~g~~~~~~dV~Y~sP~GkklRs~~ev~~YL~~~~~~~~~~~~-------F~F~~~~~~~~ 73 (77)
T smart00391 8 LPCGWRRETKQRKSGRSAGKFDVYYISPCGKKLRSKSELARYLHKNGDLSLDLEC-------FDFNATVPVGP 73 (77)
T ss_pred CCCCcEEEEEEecCCCCCCcccEEEECCCCCeeeCHHHHHHHHHhCCCccccccc-------ccCcCCccccc
Confidence 6899999986 333 89999999999999999999999999999998 99999998754
No 2
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=98.73 E-value=1.5e-08 Score=67.86 Aligned_cols=54 Identities=24% Similarity=0.330 Sum_probs=42.8
Q ss_pred ecccceeee------ccce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCCc
Q psy8172 20 TTTDYTSEV------GFWK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRE 80 (98)
Q Consensus 20 vp~gw~r~v------~k~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~ 80 (98)
.++||+|++ .+.+ +|.||+|..|+|..||..||...+ .|-|..| -|.|+++++.
T Consensus 6 l~~GW~Re~vir~~~~~~~~dV~Y~aPcGKklRs~~ev~~yL~~~~------~~~Lt~d-nFsF~~~~~v 68 (73)
T cd01397 6 LELGWRRETRIRGLGGRIQGEVAYYAPCGKKLRQYPEVIKYLSKNG------ISLLSRE-NFSFSARAPV 68 (73)
T ss_pred CCCCceeEEEeccCCCCccceEEEECCCCcccccHHHHHHHHHhCC------ccCccHh-HccccCCccc
Confidence 469999998 2333 899999999999999999999876 2344444 5899988765
No 3
>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=98.61 E-value=6e-08 Score=63.00 Aligned_cols=54 Identities=20% Similarity=0.149 Sum_probs=38.7
Q ss_pred ecccceeeec--c-----ce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCC
Q psy8172 20 TTTDYTSEVG--F-----WK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPR 79 (98)
Q Consensus 20 vp~gw~r~v~--k-----~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~ 79 (98)
+|.||+|++- + ++ +|+||+|..++|+.||..||...+- +...--.-|.|++...
T Consensus 11 Lp~GW~re~~~R~~g~~~~~~dv~Y~sP~Gk~~RS~~eV~~yL~~~~~------~~~l~~~~F~F~~~~~ 74 (77)
T PF01429_consen 11 LPDGWKREVVVRKSGSSAGKKDVYYYSPCGKRFRSKKEVVRYLKENPS------EHDLKPENFSFSKRLI 74 (77)
T ss_dssp STTT-EEEEEESSSSTTTTSEEEEEEETTSEEESSHHHHHHHHTTSS---------SS-CTTBBTTTTB-
T ss_pred CCCCCEEEEEEecCCCcCCceEEEEECCCCCEEeCHHHHHHHHHhCCC------cccCCHhHCCCCCCcc
Confidence 6899999876 1 23 9999999999999999999999876 2222334577777654
No 4
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=98.59 E-value=7.3e-08 Score=60.42 Aligned_cols=36 Identities=25% Similarity=0.345 Sum_probs=32.1
Q ss_pred cccceeeecc------ce---eEeCCCccccccHHHHHHHHhhCc
Q psy8172 21 TTDYTSEVGF------WK---QTTRPSNTALSSMEQVKAYLQTQG 56 (98)
Q Consensus 21 p~gw~r~v~k------~~---~YiSPsgt~L~S~eqv~~YL~t~G 56 (98)
|.||+|++-. ++ +|+||.|..|+|+.||..||...|
T Consensus 7 p~GW~R~~~~r~~g~~~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~~ 51 (62)
T cd00122 7 PPGWKRELVIRKSGSAGKGDVYYYSPCGKKLRSKPEVARYLEKTG 51 (62)
T ss_pred CCCeEEEEEEcCCCCCCcceEEEECCCCceecCHHHHHHHHHhCC
Confidence 8999998742 33 899999999999999999999988
No 5
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=98.22 E-value=1.8e-06 Score=57.19 Aligned_cols=51 Identities=22% Similarity=0.299 Sum_probs=39.1
Q ss_pred ecccceeeecc------ce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCC
Q psy8172 20 TTTDYTSEVGF------WK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAK 77 (98)
Q Consensus 20 vp~gw~r~v~k------~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~ 77 (98)
.|+||+|++-. ++ +|+||.|..++|.-||..||...+ ++-+.+ .-|+|.+.
T Consensus 7 lp~GW~r~~~~R~~gs~~k~DvyY~sP~Gkk~RS~~ev~~yL~~~~------~~~~~~-~~FdF~~~ 66 (77)
T cd01396 7 LPPGWKRELVPRKSGSAGKFDVYYISPTGKKFRSKVELARYLEKNG------PTSLDL-SDFDFTVP 66 (77)
T ss_pred CCCCCEEEEEEecCCCCCcceEEEECCCCCEEECHHHHHHHHHhCC------CCCCcH-hHcccCCC
Confidence 79999998721 22 899999999999999999999987 223333 35777764
No 6
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=97.89 E-value=1.1e-05 Score=52.20 Aligned_cols=34 Identities=18% Similarity=0.246 Sum_probs=30.6
Q ss_pred ccceeeeccce--------eEeCCCccccccHHHHHHHHhhC
Q psy8172 22 TDYTSEVGFWK--------QTTRPSNTALSSMEQVKAYLQTQ 55 (98)
Q Consensus 22 ~gw~r~v~k~~--------~YiSPsgt~L~S~eqv~~YL~t~ 55 (98)
.||+|+..+.+ +|.+|.|..|+++.||..||...
T Consensus 8 ~gw~R~~~~~~~~~~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t 49 (60)
T cd01395 8 CGFQRMKYRARVGKVKKHVIYKAPCGRSLRNMSEVHRYLRET 49 (60)
T ss_pred cCeEEEEEeccCCCcccceEEECCcchhhhcHHHHHHHHHhc
Confidence 59999986655 89999999999999999999876
No 7
>KOG4161|consensus
Probab=92.98 E-value=0.061 Score=42.69 Aligned_cols=39 Identities=26% Similarity=0.288 Sum_probs=33.0
Q ss_pred Eecccceeee---------ccce-eEeCCCccccccHHHHHHHHhhCce
Q psy8172 19 LTTTDYTSEV---------GFWK-QTTRPSNTALSSMEQVKAYLQTQGT 57 (98)
Q Consensus 19 ~vp~gw~r~v---------~k~~-~YiSPsgt~L~S~eqv~~YL~t~GT 57 (98)
.+|-||+|.+ +++- +|+||+|..++|--+.+.||-..|+
T Consensus 18 ~lp~GW~~~~~~r~~~~~~g~~dv~~~sp~g~~frsk~~l~~~~~~~~~ 66 (272)
T KOG4161|consen 18 ALPPGWTREEVQRSSGLSAGKSDVYYISPSGKKFRSKPQLARYLGKVGD 66 (272)
T ss_pred CCCCCcchhhhcccCCCcccccceEEeCCcccccccccHHHHHhccccc
Confidence 5799999874 3444 9999999999999999999987765
No 8
>PF12043 DUF3527: Domain of unknown function (DUF3527); InterPro: IPR021916 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 120 amino acids in length. This domain has a conserved CDCGGWD sequence motif.
Probab=78.55 E-value=1.8 Score=36.29 Aligned_cols=47 Identities=23% Similarity=0.487 Sum_probs=35.4
Q ss_pred CCCCCeeEecccceeeeccceeEeCCCccccccHHHHHHHHhhCceeccC---CCccccc
Q psy8172 12 PDPKSTVLTTTDYTSEVGFWKQTTRPSNTALSSMEQVKAYLQTQGTCKCG---LECPLQC 68 (98)
Q Consensus 12 p~p~s~v~vp~gw~r~v~k~~~YiSPsgt~L~S~eqv~~YL~t~GTCKCG---LeCPl~~ 68 (98)
.+-+.+|++|.|+-.+-..+- -.|| -+.+=..+.|-|-|| |-|||.+
T Consensus 227 ~~~s~tVvlP~G~Hg~P~~g~--~~PS--------~LI~RWrSGG~CDCGGWDmgC~L~V 276 (346)
T PF12043_consen 227 SSCSTTVVLPSGVHGLPNTGN--GGPS--------SLIDRWRSGGSCDCGGWDMGCPLRV 276 (346)
T ss_pred CCccEEEEecCCccCCCCCCC--CCCh--------HHHHHhhhCCCcCCCCcccCCCeEE
Confidence 345678999999998876651 1233 356778899999998 8999975
No 9
>PF12104 Tcell_CD4_Cterm: T cell CD4 receptor C terminal region; InterPro: IPR021963 This domain is the C-terminal domain of the CD4 T cell receptor. The C-terminal domain is the cytoplasmic domain which relays the signal for T cell activation. This process involves co-receptor internalisation. This domain is involved in binding to the N-terminal of Lck co-receptor in a Zn2+ clasp structure. ; PDB: 2JKR_Q 3B71_D 1Q68_A 2JKT_Q 2KLU_A.
Probab=68.48 E-value=2.8 Score=24.34 Aligned_cols=19 Identities=37% Similarity=0.751 Sum_probs=13.3
Q ss_pred ccHHHHHHHHhhCceeccC
Q psy8172 43 SSMEQVKAYLQTQGTCKCG 61 (98)
Q Consensus 43 ~S~eqv~~YL~t~GTCKCG 61 (98)
.-+.|++.||.---||.|.
T Consensus 5 ~Rmsqikr~LsekkTCqC~ 23 (28)
T PF12104_consen 5 ERMSQIKRLLSEKKTCQCP 23 (28)
T ss_dssp TCHHCHHHHHHTTSS----
T ss_pred HHHHHHHHHHhhcccCCCc
Confidence 3478999999999999985
No 10
>cd03142 GATase1_ThuA Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA). Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA). This group includes proteins similar to SmThuA which plays a role in a major pathway for trehalose catabolism. SmThuA is induced by trehalose but not by related structurally similar disaccharides like sucrose or maltose. Proteins in this group lack the catalytic triad of typical GATase1 domains: a His replaces the reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. S. meliloti Rm1021 thuA mutants are impaired in competitive colonization of Medicago sativa roots but are more competitive than the wild-type Rml021 in infecting alfalfa roots and forming nitrogen-fixing nodules.
Probab=62.38 E-value=7.9 Score=29.76 Aligned_cols=34 Identities=15% Similarity=0.100 Sum_probs=24.6
Q ss_pred ecccceeeeccce-eEeCCCccccc--cHHHHHHHHh
Q psy8172 20 TTTDYTSEVGFWK-QTTRPSNTALS--SMEQVKAYLQ 53 (98)
Q Consensus 20 vp~gw~r~v~k~~-~YiSPsgt~L~--S~eqv~~YL~ 53 (98)
-|.+|++..|+|| .|.+|-.+.-. .-.++++-|.
T Consensus 173 ~p~~W~~~~G~GRVFyta~GH~~~~~~~~p~~~~ll~ 209 (215)
T cd03142 173 FRSGCTYRRGAGRIFYFSPGHETYPVYHHPDVQKVLA 209 (215)
T ss_pred ceEEEEEEECCcCEEEECCCCCCcccccCHhHHHHHH
Confidence 4899999999999 88888775422 3355555443
No 11
>PF07624 PSD2: Protein of unknown function (DUF1585); InterPro: IPR011478 This entry represents a conserved region at the C terminus of a family of cytochrome-like proteins found in bacteria such as Rhodopirellula baltica and Solibacter usitatus. These proteins also contain IPR013036 from INTERPRO, IPR013039 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=59.96 E-value=8.4 Score=24.74 Aligned_cols=19 Identities=26% Similarity=0.588 Sum_probs=17.9
Q ss_pred CCccccccHHHHHHHHhhC
Q psy8172 37 PSNTALSSMEQVKAYLQTQ 55 (98)
Q Consensus 37 Psgt~L~S~eqv~~YL~t~ 55 (98)
|+|+...+..++++||..+
T Consensus 1 pdG~~f~~~~eLk~~L~~~ 19 (76)
T PF07624_consen 1 PDGTSFEGAAELKQYLAER 19 (76)
T ss_pred CCCCccCCHHHHHHHHHHC
Confidence 8999999999999999876
No 12
>PF06085 Rz1: Lipoprotein Rz1 precursor; InterPro: IPR010346 This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda, which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces [].; GO: 0019064 viral envelope fusion with host membrane, 0019867 outer membrane
Probab=59.05 E-value=4 Score=25.26 Aligned_cols=29 Identities=17% Similarity=0.313 Sum_probs=22.3
Q ss_pred CCCCCCCCeeEec-ccceeeeccceeEeCCCcc
Q psy8172 9 LQPPDPKSTVLTT-TDYTSEVGFWKQTTRPSNT 40 (98)
Q Consensus 9 ~~~p~p~s~v~vp-~gw~r~v~k~~~YiSPsgt 40 (98)
++||+|..=+|-| -+|+.+.+.= ||+|++
T Consensus 10 ~kpppPpAW~M~p~~n~~~~Ld~i---is~S~~ 39 (40)
T PF06085_consen 10 VKPPPPPAWAMQPPPNWQQLLDGI---ISVSET 39 (40)
T ss_pred cCCCCCchhhccCChhhHHHhhce---ecccCC
Confidence 5667776666666 7999999887 888875
No 13
>cd03078 GST_N_Metaxin1_like GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals.
Probab=57.44 E-value=14 Score=23.16 Aligned_cols=28 Identities=32% Similarity=0.345 Sum_probs=24.0
Q ss_pred ccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172 29 GFWK-QTTRPSNTALSSMEQVKAYLQTQG 56 (98)
Q Consensus 29 ~k~~-~YiSPsgt~L~S~eqv~~YL~t~G 56 (98)
.+|+ +||-.+|+.+...+.|.+||...|
T Consensus 45 p~gkLP~l~~~~~~i~d~~~Ii~~L~~~~ 73 (73)
T cd03078 45 PTGKLPALLTSGTKISGPEKIIEYLRKQG 73 (73)
T ss_pred CCCccCEEEECCEEecChHHHHHHHHHcC
Confidence 4677 788889999999999999998654
No 14
>PF06283 ThuA: Trehalose utilisation; PDB: 4E5V_A 1T0B_A.
Probab=53.05 E-value=7.5 Score=28.10 Aligned_cols=26 Identities=15% Similarity=-0.041 Sum_probs=18.0
Q ss_pred CCCeeEecccceeeeccce-eEeCCCc
Q psy8172 14 PKSTVLTTTDYTSEVGFWK-QTTRPSN 39 (98)
Q Consensus 14 p~s~v~vp~gw~r~v~k~~-~YiSPsg 39 (98)
+..-...|..|.+.+||+| .|.++-.
T Consensus 169 ~~~~~~~Pv~W~~~~GkGRvf~~~lGH 195 (217)
T PF06283_consen 169 GGEGGDHPVAWTREYGKGRVFYTTLGH 195 (217)
T ss_dssp TTTSSEEEEEEEEECTTEEEEEE----
T ss_pred ccCCCeEEEEEEEEeCCeeEEEECCCC
Confidence 3555667889999999999 7777744
No 15
>cd00525 AE_Prim_S_like AE_Prim_S_like: primase domain similar to that found in the small subunit of archaeal and eukaryotic (A/E) DNA primases. The replication machineries of A/Es are distinct from that of bacteria. Primases are DNA-dependent RNA polymerases which synthesis the short RNA primers required for DNA replication. In eukaryotes, this small catalytically active primase subunit (p50) and a larger primase subunit (p60), referred to jointly as the core primase, associate with the B subunit and the DNA polymerase alpha subunit in a complex, called Pol alpha-pri. In addition to its catalytic role in replication, eukaryotic DNA primase may play a role in coupling replication to DNA damage repair and in checkpoint control during S phase. Pfu41 and Pfu46 comprise the primase complex of the archaea Pyrococcus furiosus; these proteins have sequence identity to the eukaryotic p50 and p60 primase proteins respectively. Pfu41 preferentially uses dNTPs as substrate. Pfu46 regulates the pri
Probab=48.57 E-value=31 Score=22.80 Aligned_cols=38 Identities=21% Similarity=0.194 Sum_probs=32.5
Q ss_pred ccccHHHHHHHHhhCceeccCCCccc------cccceeecCCCC
Q psy8172 41 ALSSMEQVKAYLQTQGTCKCGLECPL------QCDSVFSFDAKP 78 (98)
Q Consensus 41 ~L~S~eqv~~YL~t~GTCKCGLeCPl------~~~~vFnFdp~v 78 (98)
+..+.+|+++|+...+.+--|+.|.. .-.-||++|+..
T Consensus 22 ~~~d~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~iv~DiD~~~ 65 (136)
T cd00525 22 ATTDDAEILAWLANLPPGNIGLSLGRYDKLWKPDLLVFDLDPDD 65 (136)
T ss_pred cccCcHHHHHHHHhcCCeEEEcCccccCccCCCCEEEEECCCCC
Confidence 67899999999999999888988887 467788888764
No 16
>KOG3859|consensus
Probab=44.50 E-value=13 Score=32.12 Aligned_cols=17 Identities=24% Similarity=0.380 Sum_probs=15.5
Q ss_pred eEeCCCccccccHHHHH
Q psy8172 33 QTTRPSNTALSSMEQVK 49 (98)
Q Consensus 33 ~YiSPsgt~L~S~eqv~ 49 (98)
|||||+|+-|.|++.|-
T Consensus 153 YFI~PTGH~LKslDLvt 169 (406)
T KOG3859|consen 153 YFISPTGHSLKSLDLVT 169 (406)
T ss_pred EEecCCCcchhHHHHHH
Confidence 89999999999999874
No 17
>PF01473 CW_binding_1: Putative cell wall binding repeat; InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include: Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan. Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis. Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall. The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=42.68 E-value=20 Score=17.65 Aligned_cols=18 Identities=11% Similarity=0.305 Sum_probs=12.6
Q ss_pred ccceeeeccceeEeCCCcc
Q psy8172 22 TDYTSEVGFWKQTTRPSNT 40 (98)
Q Consensus 22 ~gw~r~v~k~~~YiSPsgt 40 (98)
+||..+-++| +|+..+|.
T Consensus 1 n~W~~~~~~w-Yy~~~~G~ 18 (19)
T PF01473_consen 1 NGWVQDNGNW-YYFDSDGY 18 (19)
T ss_dssp SEEEEETTEE-EEETTTSB
T ss_pred CcCEEECCEE-EEeCCCcc
Confidence 4677774333 99999885
No 18
>cd01774 Faf1_like2_UBX Faf1 ike-2 UBX domain. Faf1_like2 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=42.53 E-value=58 Score=21.50 Aligned_cols=44 Identities=20% Similarity=0.306 Sum_probs=29.7
Q ss_pred eCCCcccc-------ccHHHHHHHHhhCceeccCCCccccccceeecCCCCCc-cccc
Q psy8172 35 TRPSNTAL-------SSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRE-RATE 84 (98)
Q Consensus 35 iSPsgt~L-------~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~-~P~~ 84 (98)
.=|+|+.| .+++.|.+|+.+.| ..|-...-+.||-.++.. .|.+
T Consensus 10 Rlp~G~Rl~rrF~~~~tl~~l~~fv~~~~------~~~~~f~L~t~FPrr~~~~~~~~ 61 (85)
T cd01774 10 KLPNGTRVERRFLFTQSLRVIHDFLFSLK------ETPEKFQIVTNFPRRVLPCLPSE 61 (85)
T ss_pred ECCCCCEEEEEeCCCCcHHHHHHHHHhCC------CCCCcEEEecCCCCccccccccc
Confidence 33555554 78999999996433 466667777888777665 3444
No 19
>PF12677 DUF3797: Domain of unknown function (DUF3797); InterPro: IPR024256 This presumed domain is functionally uncharacterised. This domain family is found in bacteria and viruses, and is approximately 50 amino acids in length. There is a conserved CGN sequence motif.
Probab=37.19 E-value=14 Score=23.73 Aligned_cols=7 Identities=71% Similarity=1.725 Sum_probs=5.6
Q ss_pred ceeccCC
Q psy8172 56 GTCKCGL 62 (98)
Q Consensus 56 GTCKCGL 62 (98)
-|||||.
T Consensus 40 RtCkCGf 46 (49)
T PF12677_consen 40 RTCKCGF 46 (49)
T ss_pred eeecccc
Confidence 4899995
No 20
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=33.94 E-value=31 Score=19.72 Aligned_cols=11 Identities=27% Similarity=0.622 Sum_probs=8.9
Q ss_pred cHHHHHHHHhh
Q psy8172 44 SMEQVKAYLQT 54 (98)
Q Consensus 44 S~eqv~~YL~t 54 (98)
|.||||+||.+
T Consensus 18 s~eeir~FL~~ 28 (30)
T PF08671_consen 18 SKEEIREFLEF 28 (30)
T ss_dssp -HHHHHHHHHH
T ss_pred CHHHHHHHHHh
Confidence 78999999975
No 21
>PF10568 Tom37: Outer mitochondrial membrane transport complex protein; InterPro: IPR019564 Tom37 is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with Tom70, it guides peptides without an mitochondrial targeting sequence (MTS) into Tom40, the protein that forms the passage through the outer membrane []. It has homology with metaxin, also part of the outer mitochondrial membrane beta-barrel protein transport complex []. This entry represents outer mitochondrial membrane transport complex proteins Tom37 and metaxin.; GO: 0006626 protein targeting to mitochondrion, 0005741 mitochondrial outer membrane
Probab=33.55 E-value=40 Score=21.60 Aligned_cols=38 Identities=11% Similarity=0.041 Sum_probs=29.3
Q ss_pred CeeEecccceeeeccce-eEeCC-CccccccHHHHHHHHh
Q psy8172 16 STVLTTTDYTSEVGFWK-QTTRP-SNTALSSMEQVKAYLQ 53 (98)
Q Consensus 16 s~v~vp~gw~r~v~k~~-~YiSP-sgt~L~S~eqv~~YL~ 53 (98)
.--||+...+.+.-.|+ +++-- .++.++..++|.+||.
T Consensus 33 ~~~vv~s~n~~~Sptg~LP~L~~~~~~~vsg~~~Iv~yL~ 72 (72)
T PF10568_consen 33 QFKVVPSNNPWLSPTGELPALIDSGGTWVSGFRNIVEYLR 72 (72)
T ss_pred eEEEEEcCCCCcCCCCCCCEEEECCCcEEECHHHHHHhhC
Confidence 36677777777777777 55555 7888999999999984
No 22
>PF10379 nec1: Virulence protein nec1; InterPro: IPR018836 The Nec1 protein has necrogenic activity on excised potato tuber tissue, and the encoding gene is highly conserved in plant-pathogenic Streptomyces spp. The G+C content of nec1 indicates lateral transfer from an unrelated taxon, but its origins are unclear. Deletion analysis of nec1 demonstrated that the 151-amino-acid C-terminal region of the Nec1 protein is sufficient to confer necrogenic activity. Streptomyces turgidiscabies containing a nec1 deletion was greatly compromised in virulence on Arabidopsis thaliana (Mouse-ear cress), Nicotiana tabacum (Common tobacco), and Raphanus sativus (Radish) seedlings. The wild-type strain, S. turgidiscabies Car8, aggressively colonized and infected the root meristem of radish, whereas the delta-nec1 mutant Car811 did not. Taken together, the data suggest that Nec1 is a secreted virulence protein with a conserved plant cell target that acts early in plant infection [].
Probab=33.02 E-value=24 Score=27.56 Aligned_cols=20 Identities=15% Similarity=0.101 Sum_probs=16.2
Q ss_pred cceeecCCCCCcccccCCCC
Q psy8172 69 DSVFSFDAKPRERATEVHEP 88 (98)
Q Consensus 69 ~~vFnFdp~v~~~P~~~~~~ 88 (98)
..-||||-++++.||+-+-+
T Consensus 156 aqrfnfdqgipnlpwndqvs 175 (184)
T PF10379_consen 156 AQRFNFDQGIPNLPWNDQVS 175 (184)
T ss_pred hhhcccccCCCCCCcccccc
Confidence 45699999999999985443
No 23
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=32.88 E-value=51 Score=19.05 Aligned_cols=25 Identities=20% Similarity=0.291 Sum_probs=18.3
Q ss_pred eEeCCCccccccHHHHHHHHhhCcee
Q psy8172 33 QTTRPSNTALSSMEQVKAYLQTQGTC 58 (98)
Q Consensus 33 ~YiSPsgt~L~S~eqv~~YL~t~GTC 58 (98)
+.++|+|+ .-..+-+.+|+...++|
T Consensus 14 Pv~~~~G~-v~~~~~i~~~~~~~~~c 38 (63)
T smart00504 14 PVILPSGQ-TYERRAIEKWLLSHGTD 38 (63)
T ss_pred CEECCCCC-EEeHHHHHHHHHHCCCC
Confidence 46778884 34788899999886554
No 24
>COG3828 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.32 E-value=31 Score=28.12 Aligned_cols=30 Identities=13% Similarity=-0.012 Sum_probs=23.7
Q ss_pred CCCeeEecccceeeeccceeEeCCCccccc
Q psy8172 14 PKSTVLTTTDYTSEVGFWKQTTRPSNTALS 43 (98)
Q Consensus 14 p~s~v~vp~gw~r~v~k~~~YiSPsgt~L~ 43 (98)
=+--|++|-=|+|-.||||+|-|-=|+.-+
T Consensus 167 ~~~~v~mPVvWtR~~GkGRvFYssLGHta~ 196 (239)
T COG3828 167 LIDGVVMPVVWTRRYGKGRVFYSSLGHTAS 196 (239)
T ss_pred cccCeeeeeEEEeeecCceEEEeccCCCcc
Confidence 344599999999999999977777776544
No 25
>PF00397 WW: WW domain; InterPro: IPR001202 Synonym(s): Rsp5 or WWP domain The WW domain is a short conserved region in a number of unrelated proteins, which folds as a stable, triple stranded beta-sheet. This short domain of approximately 40 amino acids, may be repeated up to four times in some proteins [, , , ]. The name WW or WWP derives from the presence of two signature tryptophan residues that are spaced 20-23 amino acids apart and are present in most WW domains known to date, as well as that of a conserved Pro. The WW domain binds to proteins with particular proline-motifs, [AP]-P-P-[AP]-Y, and/or phosphoserine- phosphothreonine-containing motifs [, ]. It is frequently associated with other domains typical for proteins in signal transduction processes. A large variety of proteins containing the WW domain are known. These include; dystrophin, a multidomain cytoskeletal protein; utrophin, a dystrophin-like protein of unknown function; vertebrate YAP protein, substrate of an unknown serine kinase; Mus musculus (Mouse) NEDD-4, involved in the embryonic development and differentiation of the central nervous system; Saccharomyces cerevisiae (Baker's yeast) RSP5, similar to NEDD-4 in its molecular organisation; Rattus norvegicus (Rat) FE65, a transcription-factor activator expressed preferentially in liver; Nicotiana tabacum (Common tobacco) DB10 protein, amongst others.; GO: 0005515 protein binding; PDB: 2JXW_A 2DK1_A 2JOC_A 2JO9_A 1YIU_A 1O6W_A 2JMF_A 1TK7_A 2KYK_A 2L5F_A ....
Probab=31.60 E-value=40 Score=18.25 Aligned_cols=18 Identities=0% Similarity=-0.180 Sum_probs=13.1
Q ss_pred cccceeeec-c-ce-eEeCCC
Q psy8172 21 TTDYTSEVG-F-WK-QTTRPS 38 (98)
Q Consensus 21 p~gw~r~v~-k-~~-~YiSPs 38 (98)
|-||+..++ . || +|+...
T Consensus 2 P~gW~~~~~~~~g~~YY~N~~ 22 (31)
T PF00397_consen 2 PPGWEEYFDPDSGRPYYYNHE 22 (31)
T ss_dssp STTEEEEEETTTSEEEEEETT
T ss_pred CcCCEEEEcCCCCCEEEEeCC
Confidence 779998885 3 77 776553
No 26
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=31.20 E-value=66 Score=19.40 Aligned_cols=31 Identities=16% Similarity=0.154 Sum_probs=17.8
Q ss_pred cccceeeeccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172 21 TTDYTSEVGFWK-QTTRPSNTALSSMEQVKAYLQTQG 56 (98)
Q Consensus 21 p~gw~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t~G 56 (98)
|.||... .++ ++++... .+.++.++|+...|
T Consensus 2 p~~w~~~--~~~Cy~~~~~~---~~~~~A~~~C~~~~ 33 (126)
T smart00034 2 PSGWISY--GGKCYKFSTEK---KTWADAQAFCQSLG 33 (126)
T ss_pred Ccchhhh--CCEEEEEECCc---cCHHHHHHHHHhcC
Confidence 6788887 444 5555542 45555555555443
No 27
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=31.00 E-value=44 Score=18.87 Aligned_cols=14 Identities=29% Similarity=0.524 Sum_probs=11.9
Q ss_pred ccHHHHHHHHhhCc
Q psy8172 43 SSMEQVKAYLQTQG 56 (98)
Q Consensus 43 ~S~eqv~~YL~t~G 56 (98)
+|.+|+++||.+-|
T Consensus 4 Ws~~~L~~wL~~~g 17 (38)
T PF10281_consen 4 WSDSDLKSWLKSHG 17 (38)
T ss_pred CCHHHHHHHHHHcC
Confidence 57899999998865
No 28
>PF11307 DUF3109: Protein of unknown function (DUF3109); InterPro: IPR021458 This bacterial family of proteins has no known function.
Probab=30.78 E-value=12 Score=29.27 Aligned_cols=12 Identities=33% Similarity=0.642 Sum_probs=7.1
Q ss_pred cccccceeecCC
Q psy8172 65 PLQCDSVFSFDA 76 (98)
Q Consensus 65 Pl~~~~vFnFdp 76 (98)
|+++.+.|.||.
T Consensus 126 PIRv~~~~~~~~ 137 (183)
T PF11307_consen 126 PIRVKKYSGFDA 137 (183)
T ss_pred ceEEEeecCcCE
Confidence 555566666664
No 29
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=30.66 E-value=1e+02 Score=18.73 Aligned_cols=33 Identities=15% Similarity=0.099 Sum_probs=24.2
Q ss_pred eEeCCCccccccHHHHHHHHhhCceeccCCCccc
Q psy8172 33 QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPL 66 (98)
Q Consensus 33 ~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl 66 (98)
.++...+..-.|++|+.+|...... .-++.|.|
T Consensus 58 ~~~~~~~~~f~sl~eLv~~y~~~~~-~~~~~~~L 90 (94)
T cd00173 58 YYLLGEGRSFPSLPELIEHYQKNPL-SDGLGVKL 90 (94)
T ss_pred eEEecCCCccCCHHHHHHHHhhCcc-CCCcccEe
Confidence 4444478889999999999998876 44455544
No 30
>PHA01635 hypothetical protein
Probab=30.65 E-value=1.3e+02 Score=24.51 Aligned_cols=61 Identities=20% Similarity=0.336 Sum_probs=47.4
Q ss_pred CCCCeeEecc-cceeeeccce--eEeCCCcc----ccccHHHHHHHHhhCceeccCCCccccccceeecCC
Q psy8172 13 DPKSTVLTTT-DYTSEVGFWK--QTTRPSNT----ALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDA 76 (98)
Q Consensus 13 ~p~s~v~vp~-gw~r~v~k~~--~YiSPsgt----~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp 76 (98)
||+|+|-||. ||.-+|.-+- .=+.|-|. ..-+.+||.+|=+..|. .+.||=.+-.||+.--
T Consensus 125 pP~svi~vPEYGF~nLvs~~~rLleVkpg~in~Y~iiYdy~EvveYe~e~g~---nv~~pPdPy~v~Sy~~ 192 (231)
T PHA01635 125 PPNSVIHVPEYGFVNLVSLSGRLIEVKPGSLNLYAIIYDYAEVVEYEQEAGI---NVASPPDPYAVFSYQF 192 (231)
T ss_pred CCCCEEEccccceeeeeeecceEEEeecCCceEEEEEecHHHHhhHHHHhCc---ccCCCCCCeEEEEEEe
Confidence 7999999996 7888776443 66777764 34689999999999997 6788888888886443
No 31
>cd00246 RabGEF Nucleotide exchange factor for Rab-like small GTPases (RabGEF), Mss4 type; RabGEF positely regulates the function of Rab GTPase by promoting exchange of GDP for GTP; members of the Rab subfamily of Ras GTPases are important in vesicular transport;
Probab=30.16 E-value=30 Score=24.86 Aligned_cols=37 Identities=14% Similarity=0.047 Sum_probs=27.8
Q ss_pred CcccccCCCCCCCCCCeeEe-------cccceeeeccceeEeCC
Q psy8172 1 MFALMNSNLQPPDPKSTVLT-------TTDYTSEVGFWKQTTRP 37 (98)
Q Consensus 1 ~~~~~~~~~~~p~p~s~v~v-------p~gw~r~v~k~~~YiSP 37 (98)
||+.+|-...-+.+.--.++ |+||..++++..-|++.
T Consensus 54 m~~FeNigfs~~~~~~k~L~CadCe~GPiG~~~~~~~~~~yva~ 97 (103)
T cd00246 54 MFTFENVGFSKDVGNLKYLVCADCEKGPIGYHCLDDKNSEYVAL 97 (103)
T ss_pred cceeeeeeeccCCCCeeEEEeccCCCCceEeEeecccchhhhhh
Confidence 67788877766666666777 89999988777667764
No 32
>PF13409 GST_N_2: Glutathione S-transferase, N-terminal domain; PDB: 3C8E_B 3M1G_A 3R3E_A 3O3T_A 1RK4_A 1K0O_B 1K0N_A 3QR6_A 3SWL_A 3TGZ_B ....
Probab=29.77 E-value=40 Score=20.32 Aligned_cols=31 Identities=19% Similarity=0.150 Sum_probs=23.6
Q ss_pred eeeeccce--eEeCCCccccccHHHHHHHHhhC
Q psy8172 25 TSEVGFWK--QTTRPSNTALSSMEQVKAYLQTQ 55 (98)
Q Consensus 25 ~r~v~k~~--~YiSPsgt~L~S~eqv~~YL~t~ 55 (98)
.++-.+++ +-+-++|+++....+|-+||-++
T Consensus 38 ~~~~p~~~VP~L~~~~g~vi~eS~~I~~yL~~~ 70 (70)
T PF13409_consen 38 LALNPRGKVPVLVDPDGTVINESLAILEYLEEQ 70 (70)
T ss_dssp HHHSTT-SSSEEEETTTEEEESHHHHHHHHHHT
T ss_pred hccCcCeEEEEEEECCCCEeeCHHHHHHHHhcC
Confidence 45566777 45558999999999999999753
No 33
>smart00456 WW Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Probab=29.64 E-value=37 Score=17.71 Aligned_cols=18 Identities=0% Similarity=-0.193 Sum_probs=12.4
Q ss_pred ecccceeeeccc-e-eEeCC
Q psy8172 20 TTTDYTSEVGFW-K-QTTRP 37 (98)
Q Consensus 20 vp~gw~r~v~k~-~-~YiSP 37 (98)
.|.||+...+.. + +|+..
T Consensus 1 lp~gW~~~~~~~g~~yy~n~ 20 (32)
T smart00456 1 LPPGWEERKDPDGRPYYYNH 20 (32)
T ss_pred CCCCCEEEECCCCCEEEEEC
Confidence 378998887754 4 66653
No 34
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=26.58 E-value=1.1e+02 Score=18.99 Aligned_cols=31 Identities=10% Similarity=0.045 Sum_probs=19.5
Q ss_pred cccceeeeccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172 21 TTDYTSEVGFWK-QTTRPSNTALSSMEQVKAYLQTQG 56 (98)
Q Consensus 21 p~gw~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t~G 56 (98)
|.||.+.-+ + +|++... .+.++.+.|....|
T Consensus 2 p~gw~~~~~--~Cy~~~~~~---~~w~~A~~~C~~~g 33 (116)
T cd03593 2 PKDWICYGN--KCYYFSMEK---KTWNESKEACSSKN 33 (116)
T ss_pred CCcCEEeCC--EEEEEEcCC---CCHHHHHHHHHhCC
Confidence 678877543 4 7777764 45666666655554
No 35
>COG3139 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.48 E-value=13 Score=26.37 Aligned_cols=22 Identities=32% Similarity=0.625 Sum_probs=18.3
Q ss_pred eEecccceee-----eccceeEeCCCcccccc
Q psy8172 18 VLTTTDYTSE-----VGFWKQTTRPSNTALSS 44 (98)
Q Consensus 18 v~vp~gw~r~-----v~k~~~YiSPsgt~L~S 44 (98)
.|+|.-|||+ .||| |+|++|..
T Consensus 9 ~mtPEiYQrL~~AvElGKW-----PdG~~Ltq 35 (90)
T COG3139 9 SMTPEIYQRLSTAVELGKW-----PDGVALTQ 35 (90)
T ss_pred hcCHHHHHHHHHHHHhcCC-----CCCCcCCH
Confidence 4789999996 4788 99999964
No 36
>PF02943 FeThRed_B: Ferredoxin thioredoxin reductase catalytic beta chain; InterPro: IPR004209 Ferredoxin thioredoxin reductase is a [4FE-4S] protein present in organisms performing oxygenic photosynthesis, and plays an important role in the ferredoxin/thioredoxin regulatory chain. It converts an electron signal (photoreduced ferredoxin) to a thiol signal (reduced thioredoxin), regulating enzymes by reduction of specific disulphide groups. It catalyses the light-dependent activation of several photosynthetis enzymes. Ferredoxin thioredoxin reductase is a heterodimer of subunit alpha and subunit beta. Subunit alpha is the variable subunit, and beta is the catalytic chain []. The structure of the beta subunit has been determined and found to fold around the FeS cluster [].; GO: 0008937 ferredoxin-NAD(P) reductase activity, 0055114 oxidation-reduction process; PDB: 2PUK_E 2PVD_A 2PVG_A 2PUO_A 2PU9_A 1DJ7_A 2PVO_A.
Probab=24.28 E-value=30 Score=24.58 Aligned_cols=11 Identities=45% Similarity=1.053 Sum_probs=5.0
Q ss_pred hCceeccCCCc
Q psy8172 54 TQGTCKCGLEC 64 (98)
Q Consensus 54 t~GTCKCGLeC 64 (98)
..|.|.|||=+
T Consensus 77 ~~G~ChC~Lf~ 87 (108)
T PF02943_consen 77 EYGHCHCGLFV 87 (108)
T ss_dssp -----TTSSSB
T ss_pred ccccceEeeeE
Confidence 78999999965
No 37
>TIGR01204 bioW 6-carboxyhexanoate--CoA ligase. Alternate name: pimeloyl-CoA synthase.
Probab=23.75 E-value=1.2e+02 Score=24.49 Aligned_cols=34 Identities=9% Similarity=-0.039 Sum_probs=24.8
Q ss_pred eEecc-cceeee--------ccce-eEeCCCccccccHHHHHHHHhhC
Q psy8172 18 VLTTT-DYTSEV--------GFWK-QTTRPSNTALSSMEQVKAYLQTQ 55 (98)
Q Consensus 18 v~vp~-gw~r~v--------~k~~-~YiSPsgt~L~S~eqv~~YL~t~ 55 (98)
|-+.. ||.||- .-|| +|++++ + +++++.+||...
T Consensus 184 VA~~~~gY~RI~~lK~~G~~~GGRvffv~~~--~--~l~~~i~yLE~~ 227 (232)
T TIGR01204 184 VSGKEIGYVRITPLKEKGDELGGRVFFVSRK--N--ELSEYIHCLEQK 227 (232)
T ss_pred EEeCCCCeEeCccccccCCCCCCEEEEEeCC--C--CHHHHHHHHhcC
Confidence 55565 898874 2267 777776 3 999999999864
No 38
>PF00799 Gemini_AL1: Geminivirus Rep catalytic domain; InterPro: IPR022690 Geminiviruses are characterised by a genome of circular single-stranded DNA encapsidated in twinned (geminate) quasi-isometric particles, from which the group derives its name []. Most geminiviruses can be divided into two subgroups on the basis of host range and/or insect vector: i.e. those that infect dicotyledenous plants and are transmitted by the same whitefly species, and those that infect monocotyledenous plants and are transmitted by different leafhopper vectors. The genomes of the whitefly-transmitted African cassava mosaic virus, Tomato golden mosaic virus (TGMV) and Bean golden mosaic virus (BGMV) possess a bipartite genome. By contrast, only a single DNA component has been identified for the leafhopper-transmitted Maize streak virus (MSV) and Wheat dwarf virus (WDV) [, ]. Beet curly top virus (BCTV), and Tobacco yellow dwarf virus belong to a third possible subgroup. Like MSV and WDV, BCTV is transmitted by a specific leafhopper species, yet like the whitefly-transmitted geminiviruses it has a host range confined to dicotyledenous plants. Sequence comparison of the whitefly-transmitted Squash leaf curl virus (SqLCV) and Tomato yellow leaf curl virus (TYLCV) with the genomic components of TGMV and BGMV reveals a close evolutionary relationship [, , ]. Amino acid sequence alignments of Potato yellow mosaic virus (PYMV) proteins with those encoded by other geminiviruses show that PYMV is closely related to geminiviruses isolated from the New World, especially in the putative coat protein gene regions []. Comparison of MSV DNA-encoded proteins with those of other geminiviruses infecting monocotyledonous plants, including Panicum streak virus [] and Miscanthus streak virus (MiSV) [], reveal high levels of similarity. The AL1 proteins encodes the replication initiator protein (Rep) of geminiviruses, which is a replicon-specific initiator enzyme and is an essential component of the replisome []. For geminivirus Rep protein, this N-terminal region is crucial for origin recognition and DNA cleavage and nucleotidyl transfer []. It is found in association with PF08283 from PFAM. ; GO: 0006260 DNA replication; PDB: 1L5I_A 1L2M_A.
Probab=22.61 E-value=48 Score=23.54 Aligned_cols=23 Identities=26% Similarity=0.479 Sum_probs=15.2
Q ss_pred eCCCccccccHHHHHHHHhhCce
Q psy8172 35 TRPSNTALSSMEQVKAYLQTQGT 57 (98)
Q Consensus 35 iSPsgt~L~S~eqv~~YL~t~GT 57 (98)
..|--...+|.++|++|+..||.
T Consensus 81 fHPNIq~aKs~s~vk~YI~KDgd 103 (114)
T PF00799_consen 81 FHPNIQGAKSSSDVKSYIEKDGD 103 (114)
T ss_dssp E--EEEEESSSTHHHHHHH--SS
T ss_pred cccccccccCHHHHHHHhhcCCC
Confidence 34555566889999999999993
No 39
>PF13417 GST_N_3: Glutathione S-transferase, N-terminal domain; PDB: 3ERG_B 3IBH_A 3ERF_A 3UBL_A 3UBK_A 3IR4_A 3M8N_B 2R4V_A 2PER_A 2R5G_A ....
Probab=22.07 E-value=47 Score=19.98 Aligned_cols=31 Identities=13% Similarity=0.096 Sum_probs=25.4
Q ss_pred ceeeeccce-eEeCCCccccccHHHHHHHHhh
Q psy8172 24 YTSEVGFWK-QTTRPSNTALSSMEQVKAYLQT 54 (98)
Q Consensus 24 w~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t 54 (98)
+.++..+++ +.+-++|..+.....|.+||-.
T Consensus 38 ~~~~~p~~~vPvL~~~g~~l~dS~~I~~yL~~ 69 (75)
T PF13417_consen 38 FLKLNPKGKVPVLVDDGEVLTDSAAIIEYLEE 69 (75)
T ss_dssp HHHHSTTSBSSEEEETTEEEESHHHHHHHHHH
T ss_pred HHhhcccccceEEEECCEEEeCHHHHHHHHHH
Confidence 455667788 6666889999999999999965
No 40
>PF03128 CXCXC: CXCXC repeat; InterPro: IPR004153 This repeat contains the conserved pattern CXCXC where X can be any amino acid. The repeat is found in up to five copies in Vascular endothelial growth factor C []. In the salivary glands of the dipteran Chironomus tentans, a specific messenger ribonucleoprotein (mRNP) particle, the Balbiani ring (BR) granule, can be visualized during its assembly on the gene and during its nucleocytoplasmic transport. This repeat is found over 70 copies in the balbiani ring protein 3 (Q03376 from SWISSPROT). It is also found in some silk proteins [].
Probab=21.85 E-value=38 Score=16.27 Aligned_cols=8 Identities=38% Similarity=1.153 Sum_probs=5.9
Q ss_pred hCceeccC
Q psy8172 54 TQGTCKCG 61 (98)
Q Consensus 54 t~GTCKCG 61 (98)
.+.||+|+
T Consensus 5 n~~tC~C~ 12 (14)
T PF03128_consen 5 NDDTCQCE 12 (14)
T ss_pred cCCCcCcc
Confidence 46789886
Done!