Query psy10394
Match_columns 82
No_of_seqs 105 out of 117
Neff 3.0
Searched_HMMs 46136
Date Fri Aug 16 16:44:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10394.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10394hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0985|consensus 100.0 1.5E-34 3.3E-39 254.2 1.0 68 15-82 664-731 (1666)
2 PF02178 AT_hook: AT hook moti 93.4 0.03 6.4E-07 26.3 0.4 9 6-14 1-9 (13)
3 smart00384 AT_hook DNA binding 90.7 0.11 2.4E-06 28.5 0.8 10 6-15 1-10 (26)
4 smart00299 CLH Clathrin heavy 89.0 0.047 1E-06 35.6 -1.9 44 39-82 4-48 (140)
5 cd01025 TOPRIM_recR TOPRIM_rec 64.8 5.5 0.00012 27.5 2.0 25 48-72 11-35 (112)
6 COG2345 Predicted transcriptio 55.6 5 0.00011 30.6 0.6 13 5-17 60-72 (218)
7 PRK13844 recombination protein 49.1 13 0.00028 28.1 1.9 25 48-72 93-117 (200)
8 PF09998 DUF2239: Uncharacteri 47.8 9.9 0.00022 28.9 1.1 11 4-14 77-87 (187)
9 KOG1547|consensus 47.7 12 0.00027 30.6 1.7 48 2-53 255-318 (336)
10 PF13565 HTH_32: Homeodomain-l 45.5 33 0.00072 20.2 3.0 23 7-29 24-51 (77)
11 PRK07122 RNA polymerase sigma 45.1 30 0.00066 25.8 3.3 36 26-61 47-91 (264)
12 PRK00076 recR recombination pr 44.0 18 0.00039 27.2 2.0 25 47-71 88-112 (196)
13 PF00637 Clathrin: Region in C 41.8 1.4 3E-05 28.5 -3.7 42 38-79 3-46 (143)
14 TIGR00615 recR recombination p 41.8 20 0.00043 27.0 1.9 25 47-71 88-112 (195)
15 COG0568 RpoD DNA-directed RNA 41.6 16 0.00036 29.6 1.6 23 15-41 102-124 (342)
16 COG3644 Uncharacterized protei 40.6 13 0.00029 28.3 0.9 10 5-14 70-79 (194)
17 KOG1473|consensus 39.0 16 0.00035 34.8 1.3 19 1-19 1-19 (1414)
18 PF05414 DUF1717: Viral domain 37.6 33 0.00071 23.9 2.3 35 45-79 28-62 (101)
19 PF04701 Pox_D2: Pox virus D2 35.4 20 0.00043 26.3 1.0 15 47-61 120-134 (143)
20 PF15508 NAAA-beta: beta subun 34.6 39 0.00084 21.7 2.2 41 10-52 12-53 (95)
21 PF14621 RFX5_DNA_bdg: RFX5 DN 33.8 15 0.00032 28.6 0.1 12 4-15 66-77 (219)
22 KOG2003|consensus 32.6 68 0.0015 28.8 4.0 31 27-57 556-586 (840)
23 PF02001 DUF134: Protein of un 32.0 21 0.00046 24.5 0.7 14 6-19 5-19 (106)
24 PRK04966 hypothetical protein; 31.7 31 0.00067 22.5 1.4 25 41-65 6-30 (72)
25 PF07797 DUF1639: Protein of u 29.4 29 0.00062 21.5 0.9 13 3-15 13-25 (50)
26 PF03374 ANT: Phage antirepres 28.6 90 0.002 19.8 3.2 45 11-64 9-53 (111)
27 PRK05658 RNA polymerase sigma 26.1 78 0.0017 26.6 3.2 41 16-60 381-430 (619)
28 PRK05901 RNA polymerase sigma 25.9 73 0.0016 27.0 2.9 35 26-60 278-321 (509)
29 PF07568 HisKA_2: Histidine ki 25.7 1.1E+02 0.0023 18.9 3.1 29 25-53 6-34 (76)
30 PRK04217 hypothetical protein; 25.5 40 0.00087 23.1 1.2 48 1-50 1-66 (110)
31 PRK11675 LexA regulated protei 24.6 35 0.00075 23.1 0.7 22 6-27 17-40 (90)
32 COG1191 FliA DNA-directed RNA 23.9 1.1E+02 0.0023 23.7 3.3 36 26-61 30-75 (247)
33 PF04441 Pox_VERT_large: Poxvi 22.9 43 0.00092 30.0 1.1 21 59-80 34-54 (700)
34 COG1829 Predicted archaeal kin 22.1 1E+02 0.0022 24.9 2.9 45 14-61 191-237 (283)
35 PF04155 Ground-like: Ground-l 21.8 73 0.0016 19.6 1.7 27 15-41 9-36 (76)
36 smart00674 CENPB Putative DNA- 21.8 1.4E+02 0.003 17.2 2.9 30 15-44 13-42 (66)
37 TIGR02393 RpoD_Cterm RNA polym 21.6 1.1E+02 0.0025 21.9 3.0 35 26-60 7-50 (238)
38 PHA02646 virion protein; Provi 21.5 59 0.0013 24.3 1.4 19 47-65 121-144 (156)
39 smart00528 HNS Domain in histo 21.3 51 0.0011 19.5 0.9 15 8-23 23-37 (46)
40 COG1312 UxuA D-mannonate dehyd 20.8 57 0.0012 27.2 1.4 49 30-78 143-207 (362)
41 KOG0831|consensus 20.6 1.1E+02 0.0023 25.3 2.8 24 37-61 302-325 (334)
42 PF06794 UPF0270: Uncharacteri 20.5 59 0.0013 21.0 1.1 25 40-64 5-29 (70)
43 PHA02852 putative virion struc 20.1 66 0.0014 23.9 1.4 20 46-65 116-142 (153)
No 1
>KOG0985|consensus
Probab=100.00 E-value=1.5e-34 Score=254.19 Aligned_cols=68 Identities=60% Similarity=1.026 Sum_probs=67.3
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHHhhhCHHHHHHHHHhcCccceeeeeeccccccccCccceeeeC
Q psy10394 15 RYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYI 82 (82)
Q Consensus 15 ~~L~elL~~N~rqNlQiVVqVA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYI 82 (82)
-||++||++|+|||||+|||||+|||||||+++||+|||+||||||||||||||||+|||||||||||
T Consensus 664 eclkaml~~NirqNlQi~VQvatky~eqlg~~~li~lFE~fks~eGL~yfLgSivn~seDpevh~KYI 731 (1666)
T KOG0985|consen 664 ECLKAMLSANIRQNLQIVVQVATKYHEQLGAQALIELFESFKSYEGLYYFLGSIVNFSEDPEVHFKYI 731 (1666)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHhCHHHHHHHHHhhccchhHHHHHHHHhccccCchHHHHHH
Confidence 59999999999999999999999999999999999999999999999999999999999999999998
No 2
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions. These motifs are found in a variety of proteins, including the high mobility group (HMG) proteins [], in DNA-binding proteins from plants [] and in hBRG1 protein, a central ATPase of the human switching/sucrose non-fermenting (SWI/SNF) remodeling complex []. High mobility group (HMG) proteins are a family of relatively low molecular weight non-histone components in chromatin []. HMG-I and HMG-Y (HMGA) are proteins of about 100 amino acid residues which are produced by the alternative splicing of a single gene. HMG-I/Y proteins bind preferentially to the minor groove of AT-rich regions in double-stranded DNA in a non-sequence specific manner [, ]. It is suggested that these proteins could function in nucleosome phasing and in the 3' end processing of mRNA transcripts. They are also involved in the transcription regulation of genes containing, or in close proximity to, AT-rich regions. ; GO: 0003677 DNA binding; PDB: 2EZE_A 2EZD_A 2EZF_A 2EZG_A.
Probab=93.38 E-value=0.03 Score=26.32 Aligned_cols=9 Identities=67% Similarity=1.287 Sum_probs=4.0
Q ss_pred ccCCCCccc
Q psy10394 6 KRDRGRPKT 14 (82)
Q Consensus 6 ~~~~~~~~~ 14 (82)
+|.||||++
T Consensus 1 ~r~RGRP~k 9 (13)
T PF02178_consen 1 KRKRGRPRK 9 (13)
T ss_dssp S--SS--TT
T ss_pred CCcCCCCcc
Confidence 578999986
No 3
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions. Small DNA-binding motif first described in the high mobility group non-histone chromosomal protein HMG-I(Y).
Probab=90.74 E-value=0.11 Score=28.54 Aligned_cols=10 Identities=60% Similarity=1.139 Sum_probs=8.4
Q ss_pred ccCCCCccch
Q psy10394 6 KRDRGRPKTR 15 (82)
Q Consensus 6 ~~~~~~~~~~ 15 (82)
+|+||||++.
T Consensus 1 kRkRGRPrK~ 10 (26)
T smart00384 1 KRKRGRPRKA 10 (26)
T ss_pred CCCCCCCCCC
Confidence 6899999874
No 4
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=89.05 E-value=0.047 Score=35.59 Aligned_cols=44 Identities=23% Similarity=0.401 Sum_probs=37.0
Q ss_pred HHhhhCHHHHHHHHHhcCccceeeeeecccccc-ccCccceeeeC
Q psy10394 39 YHEQLTTKALIDLFESFKSYEGLFYFLGSIVNY-SQDQEVHFKYI 82 (82)
Q Consensus 39 Ysd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~-seDpdVhFKYI 82 (82)
..+.+.++.+|..|++.+..+-++-||-+++.. +.++++|-.+|
T Consensus 4 ~~~~~~~~~vv~~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~li 48 (140)
T smart00299 4 VSDPIDVSEVVELFEKRNLLEELIPYLESALKLNSENPALQTKLI 48 (140)
T ss_pred CCCcCCHHHHHHHHHhCCcHHHHHHHHHHHHccCccchhHHHHHH
Confidence 467889999999999999999999999999877 57777776543
No 5
>cd01025 TOPRIM_recR TOPRIM_recR: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in Escherichia coli RecR. RecR participates in the RecFOR pathway of homologous recombinational repair in prokaryotes. This pathway provides a single-stranded DNA molecule coated with RecA to allow invasion of a homologous molecule. The RecFOR system directs the loading of RecA onto gapped DNA coated with SSB protein. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). In RecR sequences this glutamate in the first turn of the TOPRIM domain is semiconserved, the DXD motif is not conserved.
Probab=64.81 E-value=5.5 Score=27.46 Aligned_cols=25 Identities=24% Similarity=0.393 Sum_probs=20.6
Q ss_pred HHHHHHhcCccceeeeeeccccccc
Q psy10394 48 LIDLFESFKSYEGLFYFLGSIVNYS 72 (82)
Q Consensus 48 lI~lFE~~ks~EGLyyyLgsiVn~s 72 (82)
=+..+|+-+.|.|+|+.|+..++-.
T Consensus 11 Dv~~iE~~~~y~G~Y~VL~G~ispl 35 (112)
T cd01025 11 DVLAIEESGEYRGLYHVLGGLISPL 35 (112)
T ss_pred HHHHHHhhCccceEEEEeCCCcCCC
Confidence 3567899999999999998887653
No 6
>COG2345 Predicted transcriptional regulator [Transcription]
Probab=55.61 E-value=5 Score=30.61 Aligned_cols=13 Identities=38% Similarity=0.687 Sum_probs=11.0
Q ss_pred CccCCCCccchhH
Q psy10394 5 GKRDRGRPKTRYM 17 (82)
Q Consensus 5 ~~~~~~~~~~~~L 17 (82)
.+.|+|||.+.|-
T Consensus 60 ~~~g~GRP~~~y~ 72 (218)
T COG2345 60 QQGGRGRPAKLYR 72 (218)
T ss_pred ccCCCCCCceeee
Confidence 4778999999885
No 7
>PRK13844 recombination protein RecR; Provisional
Probab=49.06 E-value=13 Score=28.13 Aligned_cols=25 Identities=16% Similarity=0.232 Sum_probs=20.6
Q ss_pred HHHHHHhcCccceeeeeeccccccc
Q psy10394 48 LIDLFESFKSYEGLFYFLGSIVNYS 72 (82)
Q Consensus 48 lI~lFE~~ks~EGLyyyLgsiVn~s 72 (82)
=+..+|+-+.|.|+|+.||..++--
T Consensus 93 Dv~aiE~t~~y~G~YhVL~G~ispl 117 (200)
T PRK13844 93 DMIAIEEAGIYRGKYFVLNGRISPL 117 (200)
T ss_pred HHHHHHhhCccceEEEEccCccCcc
Confidence 3567899999999999998886543
No 8
>PF09998 DUF2239: Uncharacterized protein conserved in bacteria (DUF2239); InterPro: IPR018715 This family of uncharacterised bacterial proteins has no known function.
Probab=47.79 E-value=9.9 Score=28.93 Aligned_cols=11 Identities=55% Similarity=0.930 Sum_probs=9.0
Q ss_pred CCccCCCCccc
Q psy10394 4 GGKRDRGRPKT 14 (82)
Q Consensus 4 ~~~~~~~~~~~ 14 (82)
...|||||||-
T Consensus 77 ~~prgrGRPKL 87 (187)
T PF09998_consen 77 AAPRGRGRPKL 87 (187)
T ss_pred CCCCCCCCCcc
Confidence 45799999994
No 9
>KOG1547|consensus
Probab=47.74 E-value=12 Score=30.56 Aligned_cols=48 Identities=29% Similarity=0.533 Sum_probs=35.0
Q ss_pred ccCCccCCCCccchh---------------HHHhcccCcchhHHHHHHHHHHHH-hhhCHHHHHHHHH
Q psy10394 2 EVGGKRDRGRPKTRY---------------MHTVFPQDLKEKKLEVSQIATKYH-EQLTTKALIDLFE 53 (82)
Q Consensus 2 ~~~~~~~~~~~~~~~---------------L~elL~~N~rqNlQiVVqVA~kYs-d~lg~~~lI~lFE 53 (82)
.|+|+|=+|| |+|| |+++| ||-.||=+..+..-+| |..-+..|.++=|
T Consensus 255 ~vnG~~vlGR-ktrWg~IeVen~nhCeF~~LRdfl---iRtHlQdlke~Ts~iHyE~yR~krLn~~~~ 318 (336)
T KOG1547|consen 255 QVNGRRVLGR-KTRWGIIEVENLNHCEFVLLRDFL---IRTHLQDLKETTSNIHYETYRAKRLNELKE 318 (336)
T ss_pred EEcCeEeecc-ccccceEEecccccchhHHHHHHH---HHHHHHHHHHHhhhhhHhHHHHHHHhhhcC
Confidence 5899999999 7776 77777 4778888888877665 4455666665544
No 10
>PF13565 HTH_32: Homeodomain-like domain
Probab=45.49 E-value=33 Score=20.17 Aligned_cols=23 Identities=13% Similarity=0.279 Sum_probs=12.8
Q ss_pred cCCCCccc-----hhHHHhcccCcchhH
Q psy10394 7 RDRGRPKT-----RYMHTVFPQDLKEKK 29 (82)
Q Consensus 7 ~~~~~~~~-----~~L~elL~~N~rqNl 29 (82)
+..|||.+ .++.+++..+.+-++
T Consensus 24 ~~~Grp~~~~e~~~~i~~~~~~~p~wt~ 51 (77)
T PF13565_consen 24 PRPGRPRKDPEQRERIIALIEEHPRWTP 51 (77)
T ss_pred CCCCCCCCcHHHHHHHHHHHHhCCCCCH
Confidence 56888866 444555554443333
No 11
>PRK07122 RNA polymerase sigma factor SigF; Reviewed
Probab=45.09 E-value=30 Score=25.76 Aligned_cols=36 Identities=14% Similarity=0.238 Sum_probs=27.4
Q ss_pred chhHHHHHHHHHHHHhh---------hCHHHHHHHHHhcCcccee
Q psy10394 26 KEKKLEVSQIATKYHEQ---------LTTKALIDLFESFKSYEGL 61 (82)
Q Consensus 26 rqNlQiVVqVA~kYsd~---------lg~~~lI~lFE~~ks~EGL 61 (82)
.+|+.+|+.||.+|+.. -|.--||+-.|+|....|.
T Consensus 47 ~~~l~LV~~iA~~y~~~g~~~~DLiQeG~iGLi~AierFDp~~G~ 91 (264)
T PRK07122 47 TRCLPLADHIARRFDGRGEPRDDLVQVARVGLVNAVNRFDVETGS 91 (264)
T ss_pred HHhHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHcCCCCCC
Confidence 46799999999999754 3445677888888877664
No 12
>PRK00076 recR recombination protein RecR; Reviewed
Probab=44.05 E-value=18 Score=27.18 Aligned_cols=25 Identities=28% Similarity=0.438 Sum_probs=20.0
Q ss_pred HHHHHHHhcCccceeeeeecccccc
Q psy10394 47 ALIDLFESFKSYEGLFYFLGSIVNY 71 (82)
Q Consensus 47 ~lI~lFE~~ks~EGLyyyLgsiVn~ 71 (82)
+=+..||+-+.|.|+|+.||..++-
T Consensus 88 ~Dv~aiE~s~~y~G~YhVL~G~isp 112 (196)
T PRK00076 88 ADVLAIERTGEYRGLYHVLGGLLSP 112 (196)
T ss_pred HHHHHHHhhCcCceEEEEecCCcCC
Confidence 3456789999999999999966543
No 13
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=41.83 E-value=1.4 Score=28.53 Aligned_cols=42 Identities=21% Similarity=0.362 Sum_probs=31.7
Q ss_pred HHHhhhCHHHHHHHHHhcCccceeeeeecccccc--ccCcccee
Q psy10394 38 KYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNY--SQDQEVHF 79 (82)
Q Consensus 38 kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~--seDpdVhF 79 (82)
+..+.+.++.+|..|++-+..+-+.-||-.++.- ..++++|-
T Consensus 3 ~~~~~~~~~~vi~~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~ 46 (143)
T PF00637_consen 3 ESTDPLEISEVISAFEERNQPEELIEYLEALVKENKENNPDLHT 46 (143)
T ss_dssp HHCTTSCSCCCHHHCTTTT-GGGCTCCHHHHHHTSTC-SHHHHH
T ss_pred cCCCccCHHHHHHHHHhCCCHHHHHHHHHHHHhcccccCHHHHH
Confidence 4567788899999999999999999999998842 35566654
No 14
>TIGR00615 recR recombination protein RecR. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=41.81 E-value=20 Score=27.02 Aligned_cols=25 Identities=20% Similarity=0.301 Sum_probs=19.9
Q ss_pred HHHHHHHhcCccceeeeeecccccc
Q psy10394 47 ALIDLFESFKSYEGLFYFLGSIVNY 71 (82)
Q Consensus 47 ~lI~lFE~~ks~EGLyyyLgsiVn~ 71 (82)
+=+..+|+-+.|.|+|+.||..++-
T Consensus 88 ~Dv~aiE~~~~y~G~YhVL~G~iSP 112 (195)
T TIGR00615 88 KDVFALEKTKEFRGRYHVLGGHISP 112 (195)
T ss_pred HHHHHHHhhCccceEEEEccCccCc
Confidence 3356789999999999999866543
No 15
>COG0568 RpoD DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]
Probab=41.59 E-value=16 Score=29.56 Aligned_cols=23 Identities=22% Similarity=0.163 Sum_probs=17.7
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHHh
Q psy10394 15 RYMHTVFPQDLKEKKLEVSQIATKYHE 41 (82)
Q Consensus 15 ~~L~elL~~N~rqNlQiVVqVA~kYsd 41 (82)
...+.|... ||..||.||.+|..
T Consensus 102 ~Ak~klv~s----nLRlVvsIAk~Y~~ 124 (342)
T COG0568 102 DAKKKLVES----NLRLVVSIAKKYTG 124 (342)
T ss_pred HHHHHHHHH----HHHHHHHHHHHhhc
Confidence 355666655 59999999999974
No 16
>COG3644 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.60 E-value=13 Score=28.35 Aligned_cols=10 Identities=60% Similarity=1.022 Sum_probs=8.5
Q ss_pred CccCCCCccc
Q psy10394 5 GKRDRGRPKT 14 (82)
Q Consensus 5 ~~~~~~~~~~ 14 (82)
|.|++||||-
T Consensus 70 ~prg~GRPKL 79 (194)
T COG3644 70 GPRGPGRPKL 79 (194)
T ss_pred CCCCCCCCcc
Confidence 5799999994
No 17
>KOG1473|consensus
Probab=39.01 E-value=16 Score=34.81 Aligned_cols=19 Identities=42% Similarity=0.524 Sum_probs=15.4
Q ss_pred CccCCccCCCCccchhHHH
Q psy10394 1 MEVGGKRDRGRPKTRYMHT 19 (82)
Q Consensus 1 ~~~~~~~~~~~~~~~~L~e 19 (82)
|+-++-|.||||++|==.+
T Consensus 1 m~g~~arpRGRp~k~p~~e 19 (1414)
T KOG1473|consen 1 MEGKVARPRGRPRKRPRSE 19 (1414)
T ss_pred CCCCCCCCCCCCCCCCCcc
Confidence 7778899999999886544
No 18
>PF05414 DUF1717: Viral domain of unknown function (DUF1717); InterPro: IPR008745 The domain is found towards the N terminus of the polyprotein of Apple stem grooving virus (strain P-209) (ASGV), Citrus tatter leaf virus and from Apple stem grooving virus (strain Korea) (ASGV) (Pear black necrotic leaf spot virus). Its function is unknown [, ].
Probab=37.63 E-value=33 Score=23.91 Aligned_cols=35 Identities=20% Similarity=0.456 Sum_probs=30.7
Q ss_pred HHHHHHHHHhcCccceeeeeeccccccccCcccee
Q psy10394 45 TKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHF 79 (82)
Q Consensus 45 ~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhF 79 (82)
+.+++++--.|-.|+-+-||=-||+-++.||+.|-
T Consensus 28 SskllE~cc~flgyrVhiyyGdSiiklnD~~nMha 62 (101)
T PF05414_consen 28 SSKLLEICCKFLGYRVHIYYGDSIIKLNDNPNMHA 62 (101)
T ss_pred cHHHHHHHHhhcccEEEEEecceEEEecCCcchhe
Confidence 45788888889999999999999999999998873
No 19
>PF04701 Pox_D2: Pox virus D2 protein; InterPro: IPR006791 This entry represents the Pox virus D2 proteins.
Probab=35.36 E-value=20 Score=26.30 Aligned_cols=15 Identities=33% Similarity=0.485 Sum_probs=12.5
Q ss_pred HHHHHHHhcCcccee
Q psy10394 47 ALIDLFESFKSYEGL 61 (82)
Q Consensus 47 ~lI~lFE~~ks~EGL 61 (82)
.|.+||.+|++..+.
T Consensus 120 iLleLFNsFr~gk~~ 134 (143)
T PF04701_consen 120 ILLELFNSFRFGKKD 134 (143)
T ss_pred HHHHHHHHhhcCCCC
Confidence 588999999987665
No 20
>PF15508 NAAA-beta: beta subunit of N-acylethanolamine-hydrolyzing acid amidase
Probab=34.59 E-value=39 Score=21.73 Aligned_cols=41 Identities=15% Similarity=0.346 Sum_probs=26.3
Q ss_pred CCccchhHHHhcccCcchhHHHHHHHHHHHHhhhCHH-HHHHHH
Q psy10394 10 GRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTK-ALIDLF 52 (82)
Q Consensus 10 ~~~~~~~L~elL~~N~rqNlQiVVqVA~kYsd~lg~~-~lI~lF 52 (82)
+.|..||.+-+ + +.+..++.+++...+.-+.+.+. .+..+.
T Consensus 12 lpP~eRw~~i~-~-~~k~~i~~l~~~~~~~~~~~~~~~~~~~~v 53 (95)
T PF15508_consen 12 LPPEERWVQIA-K-DYKDEIRELIEVLKDLLQSFVPSGKVLDFV 53 (95)
T ss_pred CCHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHhCCchHHHHHH
Confidence 56899998753 3 44677777888777776654443 444443
No 21
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain
Probab=33.76 E-value=15 Score=28.63 Aligned_cols=12 Identities=50% Similarity=1.057 Sum_probs=10.1
Q ss_pred CCccCCCCccch
Q psy10394 4 GGKRDRGRPKTR 15 (82)
Q Consensus 4 ~~~~~~~~~~~~ 15 (82)
|.||.||||.+.
T Consensus 66 dAKRKRGRPRKK 77 (219)
T PF14621_consen 66 DAKRKRGRPRKK 77 (219)
T ss_pred hhhhhcCCCccC
Confidence 679999999863
No 22
>KOG2003|consensus
Probab=32.59 E-value=68 Score=28.79 Aligned_cols=31 Identities=23% Similarity=0.172 Sum_probs=26.3
Q ss_pred hhHHHHHHHHHHHHhhhCHHHHHHHHHhcCc
Q psy10394 27 EKKLEVSQIATKYHEQLTTKALIDLFESFKS 57 (82)
Q Consensus 27 qNlQiVVqVA~kYsd~lg~~~lI~lFE~~ks 57 (82)
.|.||.||+|.-|.-.=.+..-|+++-+-.|
T Consensus 556 nn~evl~qianiye~led~aqaie~~~q~~s 586 (840)
T KOG2003|consen 556 NNAEVLVQIANIYELLEDPAQAIELLMQANS 586 (840)
T ss_pred hhHHHHHHHHHHHHHhhCHHHHHHHHHHhcc
Confidence 3699999999999988888888988876554
No 23
>PF02001 DUF134: Protein of unknown function DUF134; InterPro: IPR002852 The bacterial and archaeal proteins in this family have no known function.
Probab=32.03 E-value=21 Score=24.51 Aligned_cols=14 Identities=43% Similarity=0.831 Sum_probs=9.2
Q ss_pred ccCCCCccc-hhHHH
Q psy10394 6 KRDRGRPKT-RYMHT 19 (82)
Q Consensus 6 ~~~~~~~~~-~~L~e 19 (82)
+|+||||++ |++..
T Consensus 5 r~~~~RP~k~R~v~~ 19 (106)
T PF02001_consen 5 RRGRGRPRKPRRVCF 19 (106)
T ss_pred cCcCCCCCCCEEecc
Confidence 578899865 55443
No 24
>PRK04966 hypothetical protein; Provisional
Probab=31.72 E-value=31 Score=22.51 Aligned_cols=25 Identities=36% Similarity=0.388 Sum_probs=22.4
Q ss_pred hhhCHHHHHHHHHhcCccceeeeee
Q psy10394 41 EQLTTKALIDLFESFKSYEGLFYFL 65 (82)
Q Consensus 41 d~lg~~~lI~lFE~~ks~EGLyyyL 65 (82)
++|-++.|-.|.|+|=+-||-.|=.
T Consensus 6 ~~L~~eTL~nLIeefv~ReGTdyG~ 30 (72)
T PRK04966 6 QDLAPETLENLIESFVLREGTDYGE 30 (72)
T ss_pred HhCCHHHHHHHHHHHHhccCccCCc
Confidence 5899999999999999999988843
No 25
>PF07797 DUF1639: Protein of unknown function (DUF1639); InterPro: IPR012438 This approximately 50-residue region is found in a number of sequences derived from hypothetical plant proteins. This region features a highly basic 5 amino-acid stretch towards its centre.
Probab=29.37 E-value=29 Score=21.51 Aligned_cols=13 Identities=38% Similarity=0.708 Sum_probs=10.6
Q ss_pred cCCccCCCCccch
Q psy10394 3 VGGKRDRGRPKTR 15 (82)
Q Consensus 3 ~~~~~~~~~~~~~ 15 (82)
+-|.|-..|||+|
T Consensus 13 m~G~kpprRPkKR 25 (50)
T PF07797_consen 13 MTGSKPPRRPKKR 25 (50)
T ss_pred HhCCCCCCCCCcc
Confidence 4578888999988
No 26
>PF03374 ANT: Phage antirepressor protein KilAC domain; InterPro: IPR005039 This entry is represented by Bacteriophage P1, Ant1 C-terminal domain, which represents the processed Ant2 chain. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Prophages P1 and P7 exist as unit copy DNA plasmids in the bacterial cell. Maintenance of the prophage state requires the continuous expression of two repressors: (i) C1 is a protein which negatively regulates the expression of lytic genes including the C1 inactivator gene coi, and (ii) C4 is an antisense RNA which specifically inhibits the synthesis of an anti-repressor Ant.; GO: 0003677 DNA binding
Probab=28.58 E-value=90 Score=19.82 Aligned_cols=45 Identities=16% Similarity=0.333 Sum_probs=36.6
Q ss_pred CccchhHHHhcccCcchhHHHHHHHHHHHHhhhCHHHHHHHHHhcCccceeeee
Q psy10394 11 RPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYF 64 (82)
Q Consensus 11 ~~~~~~L~elL~~N~rqNlQiVVqVA~kYsd~lg~~~lI~lFE~~ks~EGLyyy 64 (82)
.||--+.+.++..+ ++=-+-++|+.. .+|+..|++...+ +|..|-
T Consensus 9 ~PKa~~~d~~~~~~---~~~ti~~~AK~L--~i~~~~l~~~Lr~----~g~l~~ 53 (111)
T PF03374_consen 9 APKAEFYDAFVDSD---GLYTIREAAKLL--GIGRNKLFQWLRE----KGWLYR 53 (111)
T ss_pred ChhhHHHHHHHcCC---CCccHHHHHHHh--CCCHHHHHHHHHh----CCceEE
Confidence 69999999999865 666778889888 8999999999887 455444
No 27
>PRK05658 RNA polymerase sigma factor RpoD; Validated
Probab=26.12 E-value=78 Score=26.64 Aligned_cols=41 Identities=20% Similarity=0.233 Sum_probs=27.8
Q ss_pred hHHHhcccCcchhHHHHHHHHHHHHhh---------hCHHHHHHHHHhcCccce
Q psy10394 16 YMHTVFPQDLKEKKLEVSQIATKYHEQ---------LTTKALIDLFESFKSYEG 60 (82)
Q Consensus 16 ~L~elL~~N~rqNlQiVVqVA~kYsd~---------lg~~~lI~lFE~~ks~EG 60 (82)
...+|.. .||.+|+.||.||+.. -|.--||+--++|.-.-|
T Consensus 381 a~~~Li~----~nlrlV~~iA~ky~~~gl~~~DLiQeG~iGL~~Av~kfd~~~G 430 (619)
T PRK05658 381 AKKEMVE----ANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRG 430 (619)
T ss_pred HHHHHHH----HHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHHHHhcCccCC
Confidence 3445554 4699999999999842 355556777777765555
No 28
>PRK05901 RNA polymerase sigma factor; Provisional
Probab=25.86 E-value=73 Score=27.02 Aligned_cols=35 Identities=23% Similarity=0.316 Sum_probs=25.7
Q ss_pred chhHHHHHHHHHHHHhh---------hCHHHHHHHHHhcCccce
Q psy10394 26 KEKKLEVSQIATKYHEQ---------LTTKALIDLFESFKSYEG 60 (82)
Q Consensus 26 rqNlQiVVqVA~kYsd~---------lg~~~lI~lFE~~ks~EG 60 (82)
..||.+|+.||.+|... -|.--||+-.|+|...-|
T Consensus 278 ~sNLrLVvsIAkrY~~~Gl~~eDLIQEGnIGLikAvekFDp~rG 321 (509)
T PRK05901 278 EANLRLVVSLAKRYTNRGLSFLDLIQEGNLGLIKAVEKFDYTKG 321 (509)
T ss_pred HHhHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHhCcccC
Confidence 34699999999999643 344557777788876666
No 29
>PF07568 HisKA_2: Histidine kinase; InterPro: IPR011495 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [, ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily. HKs can be roughly divided into two classes: orthodox and hybrid kinases [, ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain. This is the dimerisation and phosphoacceptor domain of a subfamily of histidine kinases. It shares sequence similarity with IPR003661 from INTERPRO and IPR011102 from INTERPRO. It is usually found adjacent to a C-terminal ATPase domain (IPR003594 from INTERPRO). This domain is found in a wide range of bacteria and also several archaea.
Probab=25.69 E-value=1.1e+02 Score=18.89 Aligned_cols=29 Identities=10% Similarity=0.062 Sum_probs=20.0
Q ss_pred cchhHHHHHHHHHHHHhhhCHHHHHHHHH
Q psy10394 25 LKEKKLEVSQIATKYHEQLTTKALIDLFE 53 (82)
Q Consensus 25 ~rqNlQiVVqVA~kYsd~lg~~~lI~lFE 53 (82)
++.|||+|..+..--......+..-+.++
T Consensus 6 VkNnLq~i~sll~lq~~~~~~~e~~~~L~ 34 (76)
T PF07568_consen 6 VKNNLQIISSLLRLQARRSEDPEAREALE 34 (76)
T ss_pred HHhHHHHHHHHHHHHHhcCCCHHHHHHHH
Confidence 45689999888877777766655444444
No 30
>PRK04217 hypothetical protein; Provisional
Probab=25.49 E-value=40 Score=23.08 Aligned_cols=48 Identities=25% Similarity=0.180 Sum_probs=27.1
Q ss_pred CccCCccCCCCccc-h-----------------hHHHhcccCcchhHHHHHHHHHHHHhhhCHHHHHH
Q psy10394 1 MEVGGKRDRGRPKT-R-----------------YMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALID 50 (82)
Q Consensus 1 ~~~~~~~~~~~~~~-~-----------------~L~elL~~N~rqNlQiVVqVA~kYsd~lg~~~lI~ 50 (82)
|-.|=-||||||.+ | |++++--..+...-+-++ ...|.+.+..+.+-+
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~Lt~eereai--~l~~~eGlS~~EIAk 66 (110)
T PRK04217 1 MPFGWGRGRGRRRKMRMIGFIPQVRHFYPAIPPVGPPKPPIFMTYEEFEAL--RLVDYEGLTQEEAGK 66 (110)
T ss_pred CCcccccCCCCCCCCeEeeccCCcceEeCCCCCccCCCCcccCCHHHHHHH--HHHHHcCCCHHHHHH
Confidence 45566799999987 2 666655555555444333 233434454444443
No 31
>PRK11675 LexA regulated protein; Provisional
Probab=24.58 E-value=35 Score=23.12 Aligned_cols=22 Identities=27% Similarity=0.471 Sum_probs=14.0
Q ss_pred ccCCCCccchhH--HHhcccCcch
Q psy10394 6 KRDRGRPKTRYM--HTVFPQDLKE 27 (82)
Q Consensus 6 ~~~~~~~~~~~L--~elL~~N~rq 27 (82)
-+.+||||+.=| ++=|+-|-|.
T Consensus 17 ~~~~grp~tnpl~r~~ql~inkrn 40 (90)
T PRK11675 17 ERRPGRPKTNPLSRDEQLRINKRN 40 (90)
T ss_pred CCCCCCCCCCCCChHHHHHhhHHH
Confidence 467899998644 3445666443
No 32
>COG1191 FliA DNA-directed RNA polymerase specialized sigma subunit [Transcription]
Probab=23.90 E-value=1.1e+02 Score=23.65 Aligned_cols=36 Identities=25% Similarity=0.337 Sum_probs=28.7
Q ss_pred chhHHHHHHHHHHHHh----------hhCHHHHHHHHHhcCcccee
Q psy10394 26 KEKKLEVSQIATKYHE----------QLTTKALIDLFESFKSYEGL 61 (82)
Q Consensus 26 rqNlQiVVqVA~kYsd----------~lg~~~lI~lFE~~ks~EGL 61 (82)
++|+.+|-.+|.+|+. ++|.--||+-.|.|...-|.
T Consensus 30 ~~ylpLV~~ia~k~~~r~~~~~dDLiqiG~iGLi~Aieryd~~kg~ 75 (247)
T COG1191 30 ERYLPLVKSIARKFENRGPSEYDDLIQIGMIGLIKAIERYDPSKGT 75 (247)
T ss_pred HHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHHHHHHHcCcccCc
Confidence 5679999999999993 45677788888888776653
No 33
>PF04441 Pox_VERT_large: Poxvirus early transcription factor (VETF), large subunit ; InterPro: IPR007532 The poxvirus early transcription factor (VETF), in addition to the viral RNA polymerase, is required for efficient transcription of early genes in vitro. VETF is a heterodimeric protein that binds specifically to early gene promoters. The heterodimer is comprised of an 82 kDa (this family) subunit and a 70 kDa subunit.; GO: 0045893 positive regulation of transcription, DNA-dependent
Probab=22.91 E-value=43 Score=30.02 Aligned_cols=21 Identities=33% Similarity=0.670 Sum_probs=18.8
Q ss_pred ceeeeeeccccccccCccceee
Q psy10394 59 EGLFYFLGSIVNYSQDQEVHFK 80 (82)
Q Consensus 59 EGLyyyLgsiVn~seDpdVhFK 80 (82)
-++||||.+..+. ++||||=+
T Consensus 34 s~iYYfl~~hl~i-~~pe~~~r 54 (700)
T PF04441_consen 34 SPIYYFLSSHLKI-TNPEVHNR 54 (700)
T ss_pred Cceeeehhhcccc-CCcceEee
Confidence 6899999999999 99999854
No 34
>COG1829 Predicted archaeal kinase (sugar kinase superfamily) [General function prediction only]
Probab=22.12 E-value=1e+02 Score=24.90 Aligned_cols=45 Identities=13% Similarity=0.212 Sum_probs=35.2
Q ss_pred chhHHHhcccCcchhHHHHHHHHHHHHhhhC--HHHHHHHHHhcCcccee
Q psy10394 14 TRYMHTVFPQDLKEKKLEVSQIATKYHEQLT--TKALIDLFESFKSYEGL 61 (82)
Q Consensus 14 ~~~L~elL~~N~rqNlQiVVqVA~kYsd~lg--~~~lI~lFE~~ks~EGL 61 (82)
+|||.+||+ |.. +.-.+..|..|.+.+| ++.+.+..|+-.+.-++
T Consensus 191 ~~~l~ellk-~Pt--le~f~~~a~~FA~~~Gl~~~~v~~~~~~~~~~g~l 237 (283)
T COG1829 191 KRALAELLK-NPT--LENFMEEAREFARRTGLMSEEVKEAAEALHSVGGL 237 (283)
T ss_pred HHHHHHHHh-CCC--HHHHHHHHHHHHHHhCcCHHHHHHHHHHhhhcCch
Confidence 689999998 444 8888999999999988 45777788877753333
No 35
>PF04155 Ground-like: Ground-like domain; InterPro: IPR007284 This group of proteins contain one or more copies of the ground-like domain, which are specific to Caenorhabditis elegans and Caenorhabditis briggsae. It has been proposed that the ground-like domain containing proteins may bind and modulate the activity of Patched-like membrane molecules, reminiscent of the modulating activities of neuropeptides [].
Probab=21.83 E-value=73 Score=19.62 Aligned_cols=27 Identities=11% Similarity=0.081 Sum_probs=19.4
Q ss_pred hhHHHhcccCcc-hhHHHHHHHHHHHHh
Q psy10394 15 RYMHTVFPQDLK-EKKLEVSQIATKYHE 41 (82)
Q Consensus 15 ~~L~elL~~N~r-qNlQiVVqVA~kYsd 41 (82)
+=|+++|..||. .|++...+.-.+-.+
T Consensus 9 ~~L~~ii~~~~~~~~~~~s~~~Iq~~~e 36 (76)
T PF04155_consen 9 EELRKIILKNMKECNLSISKRAIQKAAE 36 (76)
T ss_pred HHHHHHHHHHhccCCHHHHHHHHHHHHH
Confidence 348888999999 888887665555443
No 36
>smart00674 CENPB Putative DNA-binding domain in centromere protein B, mouse jerky and transposases.
Probab=21.80 E-value=1.4e+02 Score=17.15 Aligned_cols=30 Identities=7% Similarity=0.047 Sum_probs=26.1
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHHhhhC
Q psy10394 15 RYMHTVFPQDLKEKKLEVSQIATKYHEQLT 44 (82)
Q Consensus 15 ~~L~elL~~N~rqNlQiVVqVA~kYsd~lg 44 (82)
.|+.++...++.-+-+.+.+.|......+|
T Consensus 13 ~wi~~~~~~g~~it~~~i~~~A~~i~~~~~ 42 (66)
T smart00674 13 EWILRQEALGIPISGEQIREKALEILQRLG 42 (66)
T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHcC
Confidence 588888888888889999999999999887
No 37
>TIGR02393 RpoD_Cterm RNA polymerase sigma factor RpoD, C-terminal domain. This model represents the well-conserved C-terminal region of the major, essential sigma factor of most bacteria. Members of this clade show considerable variability in domain architecture and molecular weight, as well as in nomenclature: RpoD in E. coli and other Proteobacteria, SigA in Bacillus subtilis and many other Gram-positive bacteria, HrdB in Streptomyces, MysA in Mycobacterium smegmatis, etc.
Probab=21.58 E-value=1.1e+02 Score=21.90 Aligned_cols=35 Identities=29% Similarity=0.332 Sum_probs=23.7
Q ss_pred chhHHHHHHHHHHHHh---------hhCHHHHHHHHHhcCccce
Q psy10394 26 KEKKLEVSQIATKYHE---------QLTTKALIDLFESFKSYEG 60 (82)
Q Consensus 26 rqNlQiVVqVA~kYsd---------~lg~~~lI~lFE~~ks~EG 60 (82)
.+|+..|..+|.+|.. |-|.-.|++-.++|...-|
T Consensus 7 ~~~~~lv~~ia~~~~~~~~~~eDLiQeG~igL~~A~~~fd~~~g 50 (238)
T TIGR02393 7 ESNLRLVVSIAKKYTNRGLSFLDLIQEGNIGLMKAVEKFDYRKG 50 (238)
T ss_pred HHhHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHhCCCCC
Confidence 4679999999999974 2344455666666655444
No 38
>PHA02646 virion protein; Provisional
Probab=21.46 E-value=59 Score=24.28 Aligned_cols=19 Identities=42% Similarity=0.707 Sum_probs=14.2
Q ss_pred HHHHHHHhcCccce-----eeeee
Q psy10394 47 ALIDLFESFKSYEG-----LFYFL 65 (82)
Q Consensus 47 ~lI~lFE~~ks~EG-----LyyyL 65 (82)
.|.+||.+||...+ -||||
T Consensus 121 iLLeLFNsFr~gk~~~~~~PyYyL 144 (156)
T PHA02646 121 VLLNLFNSFKYGELCKKSLPYYYS 144 (156)
T ss_pred HHHHHHHHhhcCCCCCCCCceeec
Confidence 58999999987543 37776
No 39
>smart00528 HNS Domain in histone-like proteins of HNS family.
Probab=21.32 E-value=51 Score=19.48 Aligned_cols=15 Identities=27% Similarity=0.651 Sum_probs=11.0
Q ss_pred CCCCccchhHHHhccc
Q psy10394 8 DRGRPKTRYMHTVFPQ 23 (82)
Q Consensus 8 ~~~~~~~~~L~elL~~ 23 (82)
|||| +.+||++.|..
T Consensus 23 GrGr-~P~W~~~~l~~ 37 (46)
T smart00528 23 GRGR-TPRWLAAALDS 37 (46)
T ss_pred CCCC-CCHHHHHHHHc
Confidence 6676 46799998854
No 40
>COG1312 UxuA D-mannonate dehydratase [Carbohydrate transport and metabolism]
Probab=20.82 E-value=57 Score=27.17 Aligned_cols=49 Identities=20% Similarity=0.379 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHhhhC----------HHHHHHHHHhcCc------cceeeeeeccccccccCccce
Q psy10394 30 LEVSQIATKYHEQLT----------TKALIDLFESFKS------YEGLFYFLGSIVNYSQDQEVH 78 (82)
Q Consensus 30 QiVVqVA~kYsd~lg----------~~~lI~lFE~~ks------~EGLyyyLgsiVn~seDpdVh 78 (82)
|-.+.-|.++...+. -+++-++||.|+. +|.|=|||-.|++..|+-+|+
T Consensus 143 ~~~~~~~~~~~~~m~glPG~~~~~tl~~~~~~~~~y~~Id~~~L~~nL~yFL~~ViPVAEe~gVk 207 (362)
T COG1312 143 QEEIARAEERFAEMSGLPGWEEGYTLDQFRELLELYGGIDEEKLWENLAYFLKEVIPVAEEVGVK 207 (362)
T ss_pred HHHHHHHHHhhhcccCCCCCcccccHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhcchHHhcCce
Confidence 444445666666666 7899999999974 789999999999999988775
No 41
>KOG0831|consensus
Probab=20.60 E-value=1.1e+02 Score=25.30 Aligned_cols=24 Identities=38% Similarity=0.571 Sum_probs=18.3
Q ss_pred HHHHhhhCHHHHHHHHHhcCcccee
Q psy10394 37 TKYHEQLTTKALIDLFESFKSYEGL 61 (82)
Q Consensus 37 ~kYsd~lg~~~lI~lFE~~ks~EGL 61 (82)
.|||++. .++|-+|||++|.-+|.
T Consensus 302 d~~H~~y-~~~L~~LF~~hK~k~g~ 325 (334)
T KOG0831|consen 302 DKYHGLY-IDALRKLFDEHKTKYGV 325 (334)
T ss_pred HHHHHHH-HHHHHHHHHhhccccCC
Confidence 4555554 56899999999998874
No 42
>PF06794 UPF0270: Uncharacterised protein family (UPF0270); InterPro: IPR010648 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 1Y0N_A.
Probab=20.49 E-value=59 Score=20.98 Aligned_cols=25 Identities=36% Similarity=0.385 Sum_probs=15.4
Q ss_pred HhhhCHHHHHHHHHhcCccceeeee
Q psy10394 40 HEQLTTKALIDLFESFKSYEGLFYF 64 (82)
Q Consensus 40 sd~lg~~~lI~lFE~~ks~EGLyyy 64 (82)
.++|-++.|-.|.|+|=+-||-.|=
T Consensus 5 ~~~L~~eTL~nLIeefv~ReGTdyG 29 (70)
T PF06794_consen 5 YQQLPPETLNNLIEEFVLREGTDYG 29 (70)
T ss_dssp GGGS-HHHHHHHHHHHHH-------
T ss_pred hHHCCHHHHHHHHHHHHHccCcccC
Confidence 3689999999999999999998873
No 43
>PHA02852 putative virion structural protein; Provisional
Probab=20.06 E-value=66 Score=23.90 Aligned_cols=20 Identities=40% Similarity=0.743 Sum_probs=14.6
Q ss_pred HHHHHHHHhcCccce-------eeeee
Q psy10394 46 KALIDLFESFKSYEG-------LFYFL 65 (82)
Q Consensus 46 ~~lI~lFE~~ks~EG-------LyyyL 65 (82)
-.|.+||.+||...+ =||||
T Consensus 116 ~iLLeLFNsFr~gk~~~~~~~~PyYyL 142 (153)
T PHA02852 116 MILLELFNSFKFGKKSNYLVKYPYYYL 142 (153)
T ss_pred HHHHHHHHHhhccCCCCcccCCCeEec
Confidence 358999999987543 36665
Done!