Query psy15915
Match_columns 99
No_of_seqs 94 out of 120
Neff 3.0
Searched_HMMs 46136
Date Fri Aug 16 16:26:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15915.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15915hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0985|consensus 100.0 6.1E-43 1.3E-47 316.1 4.4 84 14-97 663-746 (1666)
2 smart00299 CLH Clathrin heavy 96.8 0.0005 1.1E-08 46.5 1.2 60 39-98 4-64 (140)
3 PF00637 Clathrin: Region in C 92.8 0.03 6.5E-07 37.6 -0.1 59 38-96 3-63 (143)
4 PF02178 AT_hook: AT hook moti 92.3 0.051 1.1E-06 26.3 0.4 10 6-15 1-10 (13)
5 smart00384 AT_hook DNA binding 88.9 0.2 4.3E-06 28.4 0.8 10 6-15 1-10 (26)
6 cd01025 TOPRIM_recR TOPRIM_rec 87.6 0.59 1.3E-05 33.3 2.8 47 48-94 11-60 (112)
7 smart00671 SEL1 Sel1-like repe 87.4 0.62 1.3E-05 24.4 2.2 29 60-90 8-36 (36)
8 TIGR01408 Ube1 ubiquitin-activ 79.2 4 8.6E-05 38.1 5.2 74 15-88 641-723 (1008)
9 PRK13844 recombination protein 77.3 2.3 5.1E-05 33.2 2.7 47 48-94 93-142 (200)
10 TIGR00615 recR recombination p 75.9 2.7 5.9E-05 32.6 2.7 48 47-94 88-138 (195)
11 PF14559 TPR_19: Tetratricopep 63.0 2.6 5.6E-05 24.5 0.1 27 70-96 20-46 (68)
12 PRK00076 recR recombination pr 61.8 7.5 0.00016 30.2 2.5 47 47-94 88-137 (196)
13 PF04781 DUF627: Protein of un 61.1 5.6 0.00012 28.6 1.6 25 61-88 42-66 (111)
14 PF04700 Baculo_gp41: Structur 55.7 24 0.00052 27.8 4.4 71 15-90 91-186 (186)
15 PF11846 DUF3366: Domain of un 52.0 5.8 0.00012 28.5 0.4 30 69-98 138-167 (193)
16 KOG4014|consensus 51.8 22 0.00049 29.0 3.7 64 17-90 40-104 (248)
17 PF08238 Sel1: Sel1 repeat; I 51.3 8.5 0.00018 20.3 0.9 20 71-90 20-39 (39)
18 KOG3785|consensus 49.2 8.6 0.00019 34.1 1.1 71 26-96 343-414 (557)
19 COG2345 Predicted transcriptio 46.0 33 0.00072 27.1 3.8 52 5-56 60-113 (218)
20 PF01535 PPR: PPR repeat; Int 45.7 5.2 0.00011 19.9 -0.5 17 81-97 6-22 (31)
21 PF03374 ANT: Phage antirepres 43.3 36 0.00077 22.5 3.2 45 11-64 9-53 (111)
22 KOG1547|consensus 43.0 15 0.00034 31.0 1.7 48 2-53 255-318 (336)
23 PF13146 TRL: TRL-like protein 42.9 16 0.00034 24.5 1.4 16 82-97 42-57 (78)
24 PF09998 DUF2239: Uncharacteri 42.1 14 0.0003 29.1 1.2 11 4-14 77-87 (187)
25 PF13565 HTH_32: Homeodomain-l 41.7 42 0.00091 20.4 3.1 23 7-29 24-51 (77)
26 PF04190 DUF410: Protein of un 39.6 29 0.00062 27.0 2.6 65 32-96 36-111 (260)
27 PF08311 Mad3_BUB1_I: Mad3/BUB 37.6 85 0.0019 21.8 4.5 60 16-84 31-91 (126)
28 COG3644 Uncharacterized protei 35.7 18 0.00039 28.5 0.9 10 5-14 70-79 (194)
29 PF04701 Pox_D2: Pox virus D2 34.6 20 0.00044 27.1 1.0 15 47-61 120-134 (143)
30 PRK07122 RNA polymerase sigma 32.8 62 0.0013 24.9 3.5 36 26-61 47-91 (264)
31 PF11464 Rbsn: Rabenosyn Rab b 29.3 21 0.00045 21.9 0.2 19 78-96 8-26 (42)
32 PF02134 UBACT: Repeat in ubiq 29.2 46 0.00099 21.1 1.9 18 71-88 2-19 (67)
33 KOG1473|consensus 27.5 32 0.0007 34.0 1.3 19 1-19 1-19 (1414)
34 COG0568 RpoD DNA-directed RNA 27.4 39 0.00084 28.4 1.6 41 15-59 102-151 (342)
35 KOG2085|consensus 26.5 48 0.001 29.3 2.1 22 33-54 167-188 (457)
36 PF14621 RFX5_DNA_bdg: RFX5 DN 26.5 22 0.00047 28.5 0.0 13 4-16 66-78 (219)
37 PF05414 DUF1717: Viral domain 25.8 38 0.00083 24.3 1.1 35 45-79 28-62 (101)
38 COG1816 Add Adenosine deaminas 25.6 80 0.0017 26.4 3.2 80 9-91 9-100 (345)
39 PF02001 DUF134: Protein of un 25.3 33 0.00072 24.3 0.8 13 6-18 5-18 (106)
40 PRK04966 hypothetical protein; 24.8 50 0.0011 22.3 1.5 25 41-65 6-30 (72)
41 COG4263 NosZ Nitrous oxide red 24.6 33 0.00072 31.2 0.8 54 35-96 239-294 (637)
42 PF03597 CcoS: Cytochrome oxid 24.6 33 0.00071 21.0 0.5 14 84-97 23-36 (45)
43 KOG2003|consensus 24.4 1.1E+02 0.0024 28.3 4.0 32 26-57 555-586 (840)
44 PF11753 DUF3310: Protein of u 23.8 24 0.00051 22.3 -0.2 40 44-93 11-50 (60)
45 PF13812 PPR_3: Pentatricopept 23.6 24 0.00051 17.8 -0.2 17 81-97 7-23 (34)
46 PF15508 NAAA-beta: beta subun 23.3 87 0.0019 20.8 2.5 42 10-53 12-54 (95)
47 TIGR00233 trpS tryptophanyl-tR 23.1 85 0.0019 25.4 2.8 61 24-89 148-222 (328)
48 COG0353 RecR Recombinational D 23.1 88 0.0019 24.8 2.8 45 49-93 91-138 (198)
49 PF13428 TPR_14: Tetratricopep 22.5 41 0.00088 18.7 0.6 23 75-97 1-23 (44)
50 PRK07688 thiamine/molybdopteri 21.8 35 0.00075 27.7 0.3 65 9-89 77-141 (339)
51 PRK08223 hypothetical protein; 21.6 26 0.00057 28.4 -0.4 67 7-89 76-144 (287)
52 PF13877 RPAP3_C: Potential Mo 21.5 1.3E+02 0.0029 19.3 3.0 58 35-92 23-82 (94)
53 TIGR02552 LcrH_SycD type III s 21.4 2.3E+02 0.005 18.0 4.9 18 74-91 84-101 (135)
54 COG1312 UxuA D-mannonate dehyd 21.2 74 0.0016 27.4 2.2 49 30-78 143-207 (362)
55 PF05158 RNA_pol_Rpc34: RNA po 21.1 68 0.0015 26.3 1.9 38 52-89 57-100 (327)
56 TIGR02355 moeB molybdopterin s 21.1 35 0.00076 26.2 0.2 67 7-89 73-139 (240)
57 PRK04217 hypothetical protein; 21.0 56 0.0012 23.1 1.2 50 1-52 1-68 (110)
58 PRK11675 LexA regulated protei 20.9 46 0.00099 23.3 0.7 21 6-26 17-39 (90)
59 PF12569 NARP1: NMDA receptor- 20.8 66 0.0014 28.0 1.8 71 15-87 147-222 (517)
60 PF07797 DUF1639: Protein of u 20.6 52 0.0011 21.0 0.9 14 3-16 13-26 (50)
61 PF05030 SSXT: SSXT protein (N 20.0 2.5E+02 0.0054 18.6 4.1 60 7-74 1-60 (65)
No 1
>KOG0985|consensus
Probab=100.00 E-value=6.1e-43 Score=316.11 Aligned_cols=84 Identities=52% Similarity=0.850 Sum_probs=82.9
Q ss_pred chhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHHhhhcCCcee
Q psy15915 14 TRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQGKMHYTTK 93 (99)
Q Consensus 14 ~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAak~gQ~kE 93 (99)
-+||++||++|+|||||+|||||+||||+||+++||+|||+||||||||||||||||+|||||||||||||||||||++|
T Consensus 663 ~eclkaml~~NirqNlQi~VQvatky~eqlg~~~li~lFE~fks~eGL~yfLgSivn~seDpevh~KYIqAA~kt~QikE 742 (1666)
T KOG0985|consen 663 LECLKAMLSANIRQNLQIVVQVATKYHEQLGAQALIELFESFKSYEGLYYFLGSIVNFSEDPEVHFKYIQAACKTGQIKE 742 (1666)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCHHHHHHHHHhhccchhHHHHHHHHhccccCchHHHHHHHHHHhhccHHH
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccc
Q psy15915 94 VGDL 97 (99)
Q Consensus 94 VERv 97 (99)
||||
T Consensus 743 vERi 746 (1666)
T KOG0985|consen 743 VERI 746 (1666)
T ss_pred HHHH
Confidence 9998
No 2
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=96.83 E-value=0.0005 Score=46.46 Aligned_cols=60 Identities=20% Similarity=0.348 Sum_probs=53.1
Q ss_pred HhhhhChHHHHHHHHhccccchhhheehhhccc-cCCchhHHHHHHHhhhcCCceeecccC
Q psy15915 39 YHEQLTTKALIDLFESFKSYEGLFYFLGSIVNY-SQDQEVHFKYIQVQGKMHYTTKVGDLR 98 (99)
Q Consensus 39 Ysd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~-seDpdVhFKYIeAAak~gQ~kEVERv~ 98 (99)
..+.+.++.+|..|++.+..+-++-||-+++.. +.+|++|-.+|+..++..+-+.++.++
T Consensus 4 ~~~~~~~~~vv~~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~li~ly~~~~~~~ll~~l~ 64 (140)
T smart00299 4 VSDPIDVSEVVELFEKRNLLEELIPYLESALKLNSENPALQTKLIELYAKYDPQKEIERLD 64 (140)
T ss_pred CCCcCCHHHHHHHHHhCCcHHHHHHHHHHHHccCccchhHHHHHHHHHHHHCHHHHHHHHH
Confidence 467889999999999999999999999999877 789999999999999988877766553
No 3
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=92.77 E-value=0.03 Score=37.63 Aligned_cols=59 Identities=19% Similarity=0.260 Sum_probs=50.1
Q ss_pred HHhhhhChHHHHHHHHhccccchhhheehhhcc--ccCCchhHHHHHHHhhhcCCceeecc
Q psy15915 38 KYHEQLTTKALIDLFESFKSYEGLFYFLGSIVN--YSQDQEVHFKYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 38 kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn--~seDpdVhFKYIeAAak~gQ~kEVER 96 (99)
+..+.+.++.+|..|++-+..+-+.-||-.++. ...++++|-.+|+..++.+..+++++
T Consensus 3 ~~~~~~~~~~vi~~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~~L~~ly~~~~~~~~l~~ 63 (143)
T PF00637_consen 3 ESTDPLEISEVISAFEERNQPEELIEYLEALVKENKENNPDLHTLLLELYIKYDPYEKLLE 63 (143)
T ss_dssp HHCTTSCSCCCHHHCTTTT-GGGCTCCHHHHHHTSTC-SHHHHHHHHHHHHCTTTCCHHHH
T ss_pred cCCCccCHHHHHHHHHhCCCHHHHHHHHHHHHhcccccCHHHHHHHHHHHHhcCCchHHHH
Confidence 456788899999999999999999999999985 34789999999999999988777665
No 4
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions. These motifs are found in a variety of proteins, including the high mobility group (HMG) proteins [], in DNA-binding proteins from plants [] and in hBRG1 protein, a central ATPase of the human switching/sucrose non-fermenting (SWI/SNF) remodeling complex []. High mobility group (HMG) proteins are a family of relatively low molecular weight non-histone components in chromatin []. HMG-I and HMG-Y (HMGA) are proteins of about 100 amino acid residues which are produced by the alternative splicing of a single gene. HMG-I/Y proteins bind preferentially to the minor groove of AT-rich regions in double-stranded DNA in a non-sequence specific manner [, ]. It is suggested that these proteins could function in nucleosome phasing and in the 3' end processing of mRNA transcripts. They are also involved in the transcription regulation of genes containing, or in close proximity to, AT-rich regions. ; GO: 0003677 DNA binding; PDB: 2EZE_A 2EZD_A 2EZF_A 2EZG_A.
Probab=92.34 E-value=0.051 Score=26.26 Aligned_cols=10 Identities=60% Similarity=1.169 Sum_probs=4.0
Q ss_pred ccCCCCccch
Q psy15915 6 KRDRGRPKTR 15 (99)
Q Consensus 6 ~~~~~~~~~~ 15 (99)
+|.||||++-
T Consensus 1 ~r~RGRP~k~ 10 (13)
T PF02178_consen 1 KRKRGRPRKN 10 (13)
T ss_dssp S--SS--TT-
T ss_pred CCcCCCCccc
Confidence 5789999863
No 5
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions. Small DNA-binding motif first described in the high mobility group non-histone chromosomal protein HMG-I(Y).
Probab=88.86 E-value=0.2 Score=28.42 Aligned_cols=10 Identities=60% Similarity=1.139 Sum_probs=8.3
Q ss_pred ccCCCCccch
Q psy15915 6 KRDRGRPKTR 15 (99)
Q Consensus 6 ~~~~~~~~~~ 15 (99)
+|+||||++-
T Consensus 1 kRkRGRPrK~ 10 (26)
T smart00384 1 KRKRGRPRKA 10 (26)
T ss_pred CCCCCCCCCC
Confidence 5899999874
No 6
>cd01025 TOPRIM_recR TOPRIM_recR: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in Escherichia coli RecR. RecR participates in the RecFOR pathway of homologous recombinational repair in prokaryotes. This pathway provides a single-stranded DNA molecule coated with RecA to allow invasion of a homologous molecule. The RecFOR system directs the loading of RecA onto gapped DNA coated with SSB protein. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). In RecR sequences this glutamate in the first turn of the TOPRIM domain is semiconserved, the DXD motif is not conserved.
Probab=87.65 E-value=0.59 Score=33.31 Aligned_cols=47 Identities=15% Similarity=0.212 Sum_probs=35.1
Q ss_pred HHHHHHhccccchhhheehhhccccCCc---hhHHHHHHHhhhcCCceee
Q psy15915 48 LIDLFESFKSYEGLFYFLGSIVNYSQDQ---EVHFKYIQVQGKMHYTTKV 94 (99)
Q Consensus 48 lI~lFE~~ks~EGLyyyLgsiVn~seDp---dVhFKYIeAAak~gQ~kEV 94 (99)
=+..||+-+.|.|+|+.|+..++--..- +..++=+...++.++++||
T Consensus 11 Dv~~iE~~~~y~G~Y~VL~G~ispl~gi~p~~l~i~~L~~ri~~~~i~EV 60 (112)
T cd01025 11 DVLAIEESGEYRGLYHVLGGLISPLDGIGPDDLNIDKLLERIAKGQVKEV 60 (112)
T ss_pred HHHHHHhhCccceEEEEeCCCcCCCCCCCccccCHHHHHHHHhcCCCcEE
Confidence 3567899999999999998887654322 4566666677777788876
No 7
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=87.35 E-value=0.62 Score=24.38 Aligned_cols=29 Identities=21% Similarity=0.366 Sum_probs=22.8
Q ss_pred hhhheehhhccccCCchhHHHHHHHhhhcCC
Q psy15915 60 GLFYFLGSIVNYSQDQEVHFKYIQVQGKMHY 90 (99)
Q Consensus 60 GLyyyLgsiVn~seDpdVhFKYIeAAak~gQ 90 (99)
|.+|+-|. ...+|++--++|.+.||+.|.
T Consensus 8 g~~~~~G~--g~~~d~~~A~~~~~~Aa~~g~ 36 (36)
T smart00671 8 GQMYEYGL--GVKKDLEKALEYYKKAAELGN 36 (36)
T ss_pred HHHHHcCC--CCCcCHHHHHHHHHHHHHccC
Confidence 44555565 567999999999999999874
No 8
>TIGR01408 Ube1 ubiquitin-activating enzyme E1. This model represents the full length, over a thousand amino acids, of a multicopy family of eukaryotic proteins, many of which are designated ubiquitin-activating enzyme E1. Members have two copies of the ThiF family domain (pfam00899), a repeat found in ubiquitin-activating proteins (pfam02134), and other regions.
Probab=79.21 E-value=4 Score=38.14 Aligned_cols=74 Identities=14% Similarity=0.279 Sum_probs=53.0
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHhhhh---ChHHHHHHH-Hhccccchhhheeh-----hhccccCCchhHHHHHHHh
Q psy15915 15 RYMHTVFPQDLKEKKLEVSQIATKYHEQL---TTKALIDLF-ESFKSYEGLFYFLG-----SIVNYSQDQEVHFKYIQVQ 85 (99)
Q Consensus 15 ~~l~~lL~~N~rqNlQiVVqvA~kYsd~l---g~~~lI~lF-E~~ks~EGLyyyLg-----siVn~seDpdVhFKYIeAA 85 (99)
+.++..|..+...+.+-+|..|..--+.+ .+..|...| ...+..+|.-|+-| ..+.+..|-+.|+.||.||
T Consensus 641 ~~i~~~l~~~~p~~~~~cv~~a~~~f~~~F~~~I~qLl~~fP~d~~~~~G~~fWs~~kr~P~pl~Fd~~~~~h~~Fi~aa 720 (1008)
T TIGR01408 641 EQIIKLLSKEKPRNFSQCVEWARLKFEKYFNNKALQLLHCFPLDIRTSTGSPFWSSPKRPPSPLKFDLNEPLHLSFIQAA 720 (1008)
T ss_pred HHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccCCCccccCCCCCCCceeeCCCCHHHHHHHHHH
Confidence 44555555555678888888887655555 455555555 45678899999887 2567777777899999999
Q ss_pred hhc
Q psy15915 86 GKM 88 (99)
Q Consensus 86 ak~ 88 (99)
|++
T Consensus 721 anL 723 (1008)
T TIGR01408 721 AKL 723 (1008)
T ss_pred HHH
Confidence 874
No 9
>PRK13844 recombination protein RecR; Provisional
Probab=77.34 E-value=2.3 Score=33.16 Aligned_cols=47 Identities=15% Similarity=0.165 Sum_probs=34.0
Q ss_pred HHHHHHhccccchhhheehhhccccCC---chhHHHHHHHhhhcCCceee
Q psy15915 48 LIDLFESFKSYEGLFYFLGSIVNYSQD---QEVHFKYIQVQGKMHYTTKV 94 (99)
Q Consensus 48 lI~lFE~~ks~EGLyyyLgsiVn~seD---pdVhFKYIeAAak~gQ~kEV 94 (99)
=+.-+|+-+.|.|+|+.||..++--+. .++.++-+..=.+.++++||
T Consensus 93 Dv~aiE~t~~y~G~YhVL~G~ispl~gi~p~~l~i~~L~~Ri~~~~v~EV 142 (200)
T PRK13844 93 DMIAIEEAGIYRGKYFVLNGRISPLDGIGPSELKLDILQQIIADRKIDEV 142 (200)
T ss_pred HHHHHHhhCccceEEEEccCccCccCCCChhhcCHHHHHHHHhcCCCcEE
Confidence 356789999999999999888765443 25566655555666777776
No 10
>TIGR00615 recR recombination protein RecR. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=75.90 E-value=2.7 Score=32.64 Aligned_cols=48 Identities=13% Similarity=0.176 Sum_probs=35.7
Q ss_pred HHHHHHHhccccchhhheehhhccccCC---chhHHHHHHHhhhcCCceee
Q psy15915 47 ALIDLFESFKSYEGLFYFLGSIVNYSQD---QEVHFKYIQVQGKMHYTTKV 94 (99)
Q Consensus 47 ~lI~lFE~~ks~EGLyyyLgsiVn~seD---pdVhFKYIeAAak~gQ~kEV 94 (99)
.=+..||+-+.|.|+|+.||..++--+. .|+.++-+..=.+.++++||
T Consensus 88 ~Dv~aiE~~~~y~G~YhVL~G~iSPldgigp~~l~i~~L~~Ri~~~~v~EV 138 (195)
T TIGR00615 88 KDVFALEKTKEFRGRYHVLGGHISPLDGIGPEDLTIAALLKRLQEESVKEV 138 (195)
T ss_pred HHHHHHHhhCccceEEEEccCccCccCCCChhhcCHHHHHHHHhcCCCcEE
Confidence 3456789999999999999866554333 26777777776777888886
No 11
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=63.04 E-value=2.6 Score=24.51 Aligned_cols=27 Identities=11% Similarity=0.035 Sum_probs=22.1
Q ss_pred cccCCchhHHHHHHHhhhcCCceeecc
Q psy15915 70 NYSQDQEVHFKYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 70 n~seDpdVhFKYIeAAak~gQ~kEVER 96 (99)
-..+|+++.+.+.++..+.||+.+.+.
T Consensus 20 ~~p~~~~~~~~la~~~~~~g~~~~A~~ 46 (68)
T PF14559_consen 20 RNPDNPEARLLLAQCYLKQGQYDEAEE 46 (68)
T ss_dssp HTTTSHHHHHHHHHHHHHTT-HHHHHH
T ss_pred HCCCCHHHHHHHHHHHHHcCCHHHHHH
Confidence 345789999999999999999988654
No 12
>PRK00076 recR recombination protein RecR; Reviewed
Probab=61.80 E-value=7.5 Score=30.16 Aligned_cols=47 Identities=17% Similarity=0.228 Sum_probs=29.6
Q ss_pred HHHHHHHhccccchhhheehhhccccCC---chhHHHHHHHhhhcCCceee
Q psy15915 47 ALIDLFESFKSYEGLFYFLGSIVNYSQD---QEVHFKYIQVQGKMHYTTKV 94 (99)
Q Consensus 47 ~lI~lFE~~ks~EGLyyyLgsiVn~seD---pdVhFKYIeAAak~gQ~kEV 94 (99)
.=+..||+-+.|.|+|+.||..++--+. .|+.++=+..-. .++++||
T Consensus 88 ~Dv~aiE~s~~y~G~YhVL~G~ispl~gi~p~~l~i~~L~~ri-~~~v~EV 137 (196)
T PRK00076 88 ADVLAIERTGEYRGLYHVLGGLLSPLDGIGPEDLNIDELLERL-DGEVKEV 137 (196)
T ss_pred HHHHHHHhhCcCceEEEEecCCcCCCCCCCccccCHHHHHHHH-hCCCCEE
Confidence 3456789999999999999866543221 244444333333 4667775
No 13
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=61.14 E-value=5.6 Score=28.63 Aligned_cols=25 Identities=24% Similarity=0.551 Sum_probs=19.4
Q ss_pred hhheehhhccccCCchhHHHHHHHhhhc
Q psy15915 61 LFYFLGSIVNYSQDQEVHFKYIQVQGKM 88 (99)
Q Consensus 61 LyyyLgsiVn~seDpdVhFKYIeAAak~ 88 (99)
+||-|+.. ++|||+-|.|.-+|.-+
T Consensus 42 if~~lA~~---ten~d~k~~yLl~sve~ 66 (111)
T PF04781_consen 42 IFYKLAKK---TENPDVKFRYLLGSVEC 66 (111)
T ss_pred HHHHHHHh---ccCchHHHHHHHHhHHH
Confidence 45566666 89999999999887643
No 14
>PF04700 Baculo_gp41: Structural glycoprotein p40/gp41 conserved region; InterPro: IPR006790 This is a family of viral structural glycoproteins [] from the baculoviridae.; GO: 0005198 structural molecule activity, 0019012 virion
Probab=55.74 E-value=24 Score=27.77 Aligned_cols=71 Identities=14% Similarity=0.305 Sum_probs=41.4
Q ss_pred hhHHHhc-ccCcchhHHHHHHHHHHHhhhhChHHHHHHHH----h-ccccchhhhe------------------ehhhcc
Q psy15915 15 RYMHTVF-PQDLKEKKLEVSQIATKYHEQLTTKALIDLFE----S-FKSYEGLFYF------------------LGSIVN 70 (99)
Q Consensus 15 ~~l~~lL-~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE----~-~ks~EGLyyy------------------LgsiVn 70 (99)
+||+-|+ +.++..|+|=+|+- .+| .+|. -|.++|. + |....|-||| ++-.++
T Consensus 91 eYl~lL~~ka~ip~Niq~ai~~-r~~-~~ln--~~~diiN~vV~d~f~g~~~~YY~~~~Ln~~~r~rV~~fkeNi~yL~P 166 (186)
T PF04700_consen 91 EYLTLLLQKAQIPPNIQQAIES-RNN-PQLN--MCNDIINNVVDDVFAGTHNDYYYNYCLNEKNRARVISFKENITYLAP 166 (186)
T ss_pred HHHHHHHHhccCChHHHHHHHh-Ccc-hHHH--HHHHHHHHHHHHHHcCCCCCchhhhhcCHHHHHHHHHHHhhhhhccc
Confidence 5777666 46888899887762 222 2222 3333332 2 4454556666 334456
Q ss_pred ccCCchhHHHHHHH-hhhcCC
Q psy15915 71 YSQDQEVHFKYIQV-QGKMHY 90 (99)
Q Consensus 71 ~seDpdVhFKYIeA-Aak~gQ 90 (99)
+|..-+| |.||+. |++-||
T Consensus 167 Ls~stnI-F~~ia~~At~~Gk 186 (186)
T PF04700_consen 167 LSASTNI-FEYIAELATRNGK 186 (186)
T ss_pred cchhhhH-HHHHHHHHHhcCC
Confidence 6765555 999974 566664
No 15
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=51.97 E-value=5.8 Score=28.52 Aligned_cols=30 Identities=10% Similarity=0.067 Sum_probs=25.3
Q ss_pred ccccCCchhHHHHHHHhhhcCCceeecccC
Q psy15915 69 VNYSQDQEVHFKYIQVQGKMHYTTKVGDLR 98 (99)
Q Consensus 69 Vn~seDpdVhFKYIeAAak~gQ~kEVERv~ 98 (99)
+...-||.+..+||.+++..|+..|.++++
T Consensus 138 l~~~P~~~~~~~~a~~l~~~G~~~eA~~~~ 167 (193)
T PF11846_consen 138 LRRRPDPNVYQRYALALALLGDPEEARQWL 167 (193)
T ss_pred HHhCCCHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 455678999999999999999999887653
No 16
>KOG4014|consensus
Probab=51.77 E-value=22 Score=28.95 Aligned_cols=64 Identities=6% Similarity=0.150 Sum_probs=50.0
Q ss_pred HHHhcccCcchhHHHHHHHHHHHhhhhC-hHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHHhhhcCC
Q psy15915 17 MHTVFPQDLKEKKLEVSQIATKYHEQLT-TKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQGKMHY 90 (99)
Q Consensus 17 l~~lL~~N~rqNlQiVVqvA~kYsd~lg-~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAak~gQ 90 (99)
|-+-|. -|++|+|-.|.|-++.-|.-+ +++|-++ |.|+++|.- -.+.||.+-.+|.+-||..+.
T Consensus 40 LgdYlE-gi~knF~~A~kv~K~nCden~y~kSCyKy--------G~y~~~GKg-G~~~~l~~a~r~~~~aC~~n~ 104 (248)
T KOG4014|consen 40 LGDYLE-GIQKNFQAAVKVFKKNCDENSYPKSCYKY--------GMYMLAGKG-GDDASLSKAIRPMKIACDANI 104 (248)
T ss_pred HHHHHH-HHHHHHHHHHHHHHhcccccCCcHHHHHh--------hhhhhcccC-CCccCHHHHHHHHHHHhccCC
Confidence 344454 489999999999888777654 4555553 899999987 888999999999999887654
No 17
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=51.28 E-value=8.5 Score=20.29 Aligned_cols=20 Identities=20% Similarity=0.258 Sum_probs=17.2
Q ss_pred ccCCchhHHHHHHHhhhcCC
Q psy15915 71 YSQDQEVHFKYIQVQGKMHY 90 (99)
Q Consensus 71 ~seDpdVhFKYIeAAak~gQ 90 (99)
+.+|++--++|.+.||+.|+
T Consensus 20 ~~~d~~~A~~~~~~Aa~~g~ 39 (39)
T PF08238_consen 20 VPKDYEKAFKWYEKAAEQGH 39 (39)
T ss_dssp SCHHHHHHHHHHHHHHHTT-
T ss_pred ccccccchHHHHHHHHHccC
Confidence 56789999999999999884
No 18
>KOG3785|consensus
Probab=49.21 E-value=8.6 Score=34.12 Aligned_cols=71 Identities=11% Similarity=0.159 Sum_probs=52.4
Q ss_pred chhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhcc-ccCCchhHHHHHHHhhhcCCceeecc
Q psy15915 26 KEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVN-YSQDQEVHFKYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 26 rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn-~seDpdVhFKYIeAAak~gQ~kEVER 96 (99)
+||+|+|=.-|+.--..-|-+..-.-|==+.-++-..-||.||-. ++.|.+..|.|-||-+.+|.+.|.|.
T Consensus 343 qqffqlVG~Sa~ecDTIpGRQsmAs~fFL~~qFddVl~YlnSi~sYF~NdD~Fn~N~AQAk~atgny~eaEe 414 (557)
T KOG3785|consen 343 QQFFQLVGESALECDTIPGRQSMASYFFLSFQFDDVLTYLNSIESYFTNDDDFNLNLAQAKLATGNYVEAEE 414 (557)
T ss_pred HHHHHHhcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhHHHHHHHHhcChHHHHH
Confidence 566777766666655555666655555445556777778888854 57888999999999999999999885
No 19
>COG2345 Predicted transcriptional regulator [Transcription]
Probab=46.04 E-value=33 Score=27.08 Aligned_cols=52 Identities=21% Similarity=0.151 Sum_probs=26.3
Q ss_pred CccCCCCccchhHHHhcccC--cchhHHHHHHHHHHHhhhhChHHHHHHHHhcc
Q psy15915 5 GKRDRGRPKTRYMHTVFPQD--LKEKKLEVSQIATKYHEQLTTKALIDLFESFK 56 (99)
Q Consensus 5 ~~~~~~~~~~~~l~~lL~~N--~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~k 56 (99)
.+.|+|||++.|--.==... ...-.-+.++.+.-=++..|.+.+-.++++..
T Consensus 60 ~~~g~GRP~~~y~Lt~~g~~~f~~~y~~l~~~~l~~l~~~~G~~~l~~~l~~r~ 113 (218)
T COG2345 60 QQGGRGRPAKLYRLTEKGREQFPKRYGELALALLDALEETGGEEALNAFLEKRA 113 (218)
T ss_pred ccCCCCCCceeeeecccchhhcchhhHHHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 47789999998853211000 01123344444444455555555555555443
No 20
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=45.69 E-value=5.2 Score=19.86 Aligned_cols=17 Identities=18% Similarity=0.016 Sum_probs=14.2
Q ss_pred HHHHhhhcCCceeeccc
Q psy15915 81 YIQVQGKMHYTTKVGDL 97 (99)
Q Consensus 81 YIeAAak~gQ~kEVERv 97 (99)
-|.+.++.|++.+++++
T Consensus 6 li~~~~~~~~~~~a~~~ 22 (31)
T PF01535_consen 6 LISGYCKMGQFEEALEV 22 (31)
T ss_pred HHHHHHccchHHHHHHH
Confidence 47889999999988765
No 21
>PF03374 ANT: Phage antirepressor protein KilAC domain; InterPro: IPR005039 This entry is represented by Bacteriophage P1, Ant1 C-terminal domain, which represents the processed Ant2 chain. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Prophages P1 and P7 exist as unit copy DNA plasmids in the bacterial cell. Maintenance of the prophage state requires the continuous expression of two repressors: (i) C1 is a protein which negatively regulates the expression of lytic genes including the C1 inactivator gene coi, and (ii) C4 is an antisense RNA which specifically inhibits the synthesis of an anti-repressor Ant.; GO: 0003677 DNA binding
Probab=43.31 E-value=36 Score=22.47 Aligned_cols=45 Identities=16% Similarity=0.333 Sum_probs=37.2
Q ss_pred CccchhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhhe
Q psy15915 11 RPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYF 64 (99)
Q Consensus 11 ~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyy 64 (99)
.||..+.+.++..+ ++=-+-++|+.+ .+|+..|+++..+ .|..|-
T Consensus 9 ~PKa~~~d~~~~~~---~~~ti~~~AK~L--~i~~~~l~~~Lr~----~g~l~~ 53 (111)
T PF03374_consen 9 APKAEFYDAFVDSD---GLYTIREAAKLL--GIGRNKLFQWLRE----KGWLYR 53 (111)
T ss_pred ChhhHHHHHHHcCC---CCccHHHHHHHh--CCCHHHHHHHHHh----CCceEE
Confidence 69999999999876 677778889888 8999999999987 455444
No 22
>KOG1547|consensus
Probab=42.99 E-value=15 Score=30.96 Aligned_cols=48 Identities=29% Similarity=0.533 Sum_probs=33.4
Q ss_pred ccCCccCCCCccchh---------------HHHhcccCcchhHHHHHHHHHHHh-hhhChHHHHHHHH
Q psy15915 2 EVGGKRDRGRPKTRY---------------MHTVFPQDLKEKKLEVSQIATKYH-EQLTTKALIDLFE 53 (99)
Q Consensus 2 ~~~~~~~~~~~~~~~---------------l~~lL~~N~rqNlQiVVqvA~kYs-d~lg~~~lI~lFE 53 (99)
.|+|+|=+|| |+|| |+++| ||-.||=+..+..-+| |..-+..|.++=|
T Consensus 255 ~vnG~~vlGR-ktrWg~IeVen~nhCeF~~LRdfl---iRtHlQdlke~Ts~iHyE~yR~krLn~~~~ 318 (336)
T KOG1547|consen 255 QVNGRRVLGR-KTRWGIIEVENLNHCEFVLLRDFL---IRTHLQDLKETTSNIHYETYRAKRLNELKE 318 (336)
T ss_pred EEcCeEeecc-ccccceEEecccccchhHHHHHHH---HHHHHHHHHHHhhhhhHhHHHHHHHhhhcC
Confidence 5899999999 7776 77777 4777888888776665 3455555555433
No 23
>PF13146 TRL: TRL-like protein family
Probab=42.92 E-value=16 Score=24.48 Aligned_cols=16 Identities=25% Similarity=0.264 Sum_probs=14.1
Q ss_pred HHHhhhcCCceeeccc
Q psy15915 82 IQVQGKMHYTTKVGDL 97 (99)
Q Consensus 82 IeAAak~gQ~kEVERv 97 (99)
||+|+|-|.+++|-.|
T Consensus 42 i~~Aak~ggI~kI~~V 57 (78)
T PF13146_consen 42 IEAAAKNGGITKISSV 57 (78)
T ss_pred HHHHHHhCCCcEEEEE
Confidence 7999999999998654
No 24
>PF09998 DUF2239: Uncharacterized protein conserved in bacteria (DUF2239); InterPro: IPR018715 This family of uncharacterised bacterial proteins has no known function.
Probab=42.15 E-value=14 Score=29.10 Aligned_cols=11 Identities=55% Similarity=0.930 Sum_probs=9.0
Q ss_pred CCccCCCCccc
Q psy15915 4 GGKRDRGRPKT 14 (99)
Q Consensus 4 ~~~~~~~~~~~ 14 (99)
...|||||||-
T Consensus 77 ~~prgrGRPKL 87 (187)
T PF09998_consen 77 AAPRGRGRPKL 87 (187)
T ss_pred CCCCCCCCCcc
Confidence 45799999994
No 25
>PF13565 HTH_32: Homeodomain-like domain
Probab=41.70 E-value=42 Score=20.40 Aligned_cols=23 Identities=13% Similarity=0.279 Sum_probs=12.9
Q ss_pred cCCCCccc-----hhHHHhcccCcchhH
Q psy15915 7 RDRGRPKT-----RYMHTVFPQDLKEKK 29 (99)
Q Consensus 7 ~~~~~~~~-----~~l~~lL~~N~rqNl 29 (99)
+..|||.+ .++.+++..+.+-++
T Consensus 24 ~~~Grp~~~~e~~~~i~~~~~~~p~wt~ 51 (77)
T PF13565_consen 24 PRPGRPRKDPEQRERIIALIEEHPRWTP 51 (77)
T ss_pred CCCCCCCCcHHHHHHHHHHHHhCCCCCH
Confidence 56889876 444555554443333
No 26
>PF04190 DUF410: Protein of unknown function (DUF410) ; InterPro: IPR007317 This is a family of conserved eukaryotic proteins with undetermined function.; PDB: 3LKU_E 2WPV_G.
Probab=39.56 E-value=29 Score=26.99 Aligned_cols=65 Identities=18% Similarity=0.211 Sum_probs=38.1
Q ss_pred HHHHHHHHhhhhCh---HHHHHHHHhccccc-hhhheehhhcccc-------CCchhHHHHHHHhhhcCCceeecc
Q psy15915 32 VSQIATKYHEQLTT---KALIDLFESFKSYE-GLFYFLGSIVNYS-------QDQEVHFKYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 32 VVqvA~kYsd~lg~---~~lI~lFE~~ks~E-GLyyyLgsiVn~s-------eDpdVhFKYIeAAak~gQ~kEVER 96 (99)
+|+|-++-+..... .+|++++..+.+.+ ..-=|+...++-| -||++|.-.=+.-.+-|++.+.|+
T Consensus 36 liev~~~~~~~~~~~~~~rl~~l~~~~~~~~p~r~~fi~~ai~WS~~~~~~~Gdp~LH~~~a~~~~~e~~~~~A~~ 111 (260)
T PF04190_consen 36 LIEVYEKSEDPVDEESIARLIELISLFPPEEPERKKFIKAAIKWSKFGSYKFGDPELHHLLAEKLWKEGNYYEAER 111 (260)
T ss_dssp HHHHHHHTT---SHHHHHHHHHHHHHS-TT-TTHHHHHHHHHHHHHTSS-TT--HHHHHHHHHHHHHTT-HHHHHH
T ss_pred HHHHHHHcCCCCCHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHHccCCCCCCCHHHHHHHHHHHHhhccHHHHHH
Confidence 34554444333333 57888888887655 6666776666666 678888877777777777766553
No 27
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=37.59 E-value=85 Score=21.84 Aligned_cols=60 Identities=17% Similarity=0.266 Sum_probs=34.1
Q ss_pred hHHHhccc-CcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHH
Q psy15915 16 YMHTVFPQ-DLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQV 84 (99)
Q Consensus 16 ~l~~lL~~-N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeA 84 (99)
|+.+-... +..++|+-+.+=|+++.... +.|+..+=..-..=..+.++.||.-.|+|..+
T Consensus 31 w~~~~~p~~~~~~~L~~lLer~~~~f~~~---------~~Y~nD~RylkiWi~ya~~~~~~~~if~~l~~ 91 (126)
T PF08311_consen 31 WIEENYPSGGKQSGLLELLERCIRKFKDD---------ERYKNDERYLKIWIKYADLSSDPREIFKFLYS 91 (126)
T ss_dssp HHHHHCTTCCCCHHHHHHHHHHHHHHTTS---------GGGTT-HHHHHHHHHHHTTBSHHHHHHHHHHH
T ss_pred HHHHHCCCCCchhHHHHHHHHHHHHHhhh---------HhhcCCHHHHHHHHHHHHHccCHHHHHHHHHH
Confidence 45554444 45677776766555543221 33444444444444456677799999998765
No 28
>COG3644 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.72 E-value=18 Score=28.54 Aligned_cols=10 Identities=60% Similarity=1.022 Sum_probs=8.6
Q ss_pred CccCCCCccc
Q psy15915 5 GKRDRGRPKT 14 (99)
Q Consensus 5 ~~~~~~~~~~ 14 (99)
|.||+||||-
T Consensus 70 ~prg~GRPKL 79 (194)
T COG3644 70 GPRGPGRPKL 79 (194)
T ss_pred CCCCCCCCcc
Confidence 5799999994
No 29
>PF04701 Pox_D2: Pox virus D2 protein; InterPro: IPR006791 This entry represents the Pox virus D2 proteins.
Probab=34.60 E-value=20 Score=27.07 Aligned_cols=15 Identities=33% Similarity=0.485 Sum_probs=12.6
Q ss_pred HHHHHHHhccccchh
Q psy15915 47 ALIDLFESFKSYEGL 61 (99)
Q Consensus 47 ~lI~lFE~~ks~EGL 61 (99)
.|.+||.+|++..+.
T Consensus 120 iLleLFNsFr~gk~~ 134 (143)
T PF04701_consen 120 ILLELFNSFRFGKKD 134 (143)
T ss_pred HHHHHHHHhhcCCCC
Confidence 588999999987765
No 30
>PRK07122 RNA polymerase sigma factor SigF; Reviewed
Probab=32.81 E-value=62 Score=24.95 Aligned_cols=36 Identities=14% Similarity=0.238 Sum_probs=28.3
Q ss_pred chhHHHHHHHHHHHhhh---------hChHHHHHHHHhccccchh
Q psy15915 26 KEKKLEVSQIATKYHEQ---------LTTKALIDLFESFKSYEGL 61 (99)
Q Consensus 26 rqNlQiVVqvA~kYsd~---------lg~~~lI~lFE~~ks~EGL 61 (99)
.+|+.+|+.||.+|+.. -|.--||+-.++|....|.
T Consensus 47 ~~~l~LV~~iA~~y~~~g~~~~DLiQeG~iGLi~AierFDp~~G~ 91 (264)
T PRK07122 47 TRCLPLADHIARRFDGRGEPRDDLVQVARVGLVNAVNRFDVETGS 91 (264)
T ss_pred HHhHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHcCCCCCC
Confidence 46799999999999654 3455678888999888774
No 31
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=29.32 E-value=21 Score=21.94 Aligned_cols=19 Identities=16% Similarity=0.092 Sum_probs=15.1
Q ss_pred HHHHHHHhhhcCCceeecc
Q psy15915 78 HFKYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 78 hFKYIeAAak~gQ~kEVER 96 (99)
.=.||+-|.+.|.+.||+-
T Consensus 8 I~~~I~qAk~~~r~dEV~~ 26 (42)
T PF11464_consen 8 IESYIKQAKAARRFDEVAT 26 (42)
T ss_dssp HHHHHHHHHHTT-HHHHHH
T ss_pred HHHHHHHHHHhcCcHHHHH
Confidence 3579999999999999863
No 32
>PF02134 UBACT: Repeat in ubiquitin-activating (UBA) protein; InterPro: IPR000127 The post-translational attachment of ubiquitin (IPR000626 from INTERPRO) to proteins (ubiquitinylation) alters the function, location or trafficking of a protein, or targets it to the 26S proteasome for degradation [, , ]. Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade []. The E1 enzyme is responsible for activating ubiquitin, the first step in ubiquitinylation. The E1 enzyme hydrolyses ATP and adenylates the C-terminal glycine residue of ubiquitin, and then links this residue to the active site cysteine of E1, yielding a ubiquitin-thioester and free AMP. To be fully active, E1 must non-covalently bind to and adenylate a second ubiquitin molecule. The E1 enzyme can then transfer the thioester-linked ubiquitin molecule to a cysteine residue on the ubiquitin-conjugating enzyme, E2, in an ATP-dependent reaction. This domain is found 2 times in each member of the ubiquitin activating enzymes and is located downstream of the active site cysteine [].; GO: 0005524 ATP binding, 0008641 small protein activating enzyme activity, 0006464 protein modification process; PDB: 1Z7L_A 3CMM_A 3GZN_D 3DBL_F 1R4N_H 1R4M_D 2NVU_B 1TT5_D 3DBR_D 3DBH_H ....
Probab=29.20 E-value=46 Score=21.11 Aligned_cols=18 Identities=17% Similarity=0.628 Sum_probs=13.8
Q ss_pred ccCCchhHHHHHHHhhhc
Q psy15915 71 YSQDQEVHFKYIQVQGKM 88 (99)
Q Consensus 71 ~seDpdVhFKYIeAAak~ 88 (99)
+..|.+.|+.||-|||..
T Consensus 2 Fd~dd~~h~~fI~a~anL 19 (67)
T PF02134_consen 2 FDKDDPLHLDFIYAAANL 19 (67)
T ss_dssp --TTSHHHHHHHHHHHHH
T ss_pred CCCCcHHHHHHHHHHHHH
Confidence 346788999999999864
No 33
>KOG1473|consensus
Probab=27.46 E-value=32 Score=33.96 Aligned_cols=19 Identities=42% Similarity=0.524 Sum_probs=14.7
Q ss_pred CccCCccCCCCccchhHHH
Q psy15915 1 MEVGGKRDRGRPKTRYMHT 19 (99)
Q Consensus 1 ~~~~~~~~~~~~~~~~l~~ 19 (99)
|+-++-|.||||++|==.+
T Consensus 1 m~g~~arpRGRp~k~p~~e 19 (1414)
T KOG1473|consen 1 MEGKVARPRGRPRKRPRSE 19 (1414)
T ss_pred CCCCCCCCCCCCCCCCCcc
Confidence 6778889999998875443
No 34
>COG0568 RpoD DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]
Probab=27.35 E-value=39 Score=28.39 Aligned_cols=41 Identities=20% Similarity=0.169 Sum_probs=25.7
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHhh-------h--hChHHHHHHHHhccccc
Q psy15915 15 RYMHTVFPQDLKEKKLEVSQIATKYHE-------Q--LTTKALIDLFESFKSYE 59 (99)
Q Consensus 15 ~~l~~lL~~N~rqNlQiVVqvA~kYsd-------~--lg~~~lI~lFE~~ks~E 59 (99)
...+.|... ||..||.||.+|.. + =|.--||+--|+|.-.-
T Consensus 102 ~Ak~klv~s----nLRlVvsIAk~Y~~rGL~~~DLIQEGniGLmkAVekFdp~r 151 (342)
T COG0568 102 DAKKKLVES----NLRLVVSIAKKYTGRGLPFLDLIQEGNIGLMKAVEKFDPEK 151 (342)
T ss_pred HHHHHHHHH----HHHHHHHHHHHhhcCCCcHHHHHhcccHHHHHHHHhcCccc
Confidence 355566655 58899999999974 2 24444555555554333
No 35
>KOG2085|consensus
Probab=26.53 E-value=48 Score=29.29 Aligned_cols=22 Identities=41% Similarity=0.622 Sum_probs=19.9
Q ss_pred HHHHHHHhhhhChHHHHHHHHh
Q psy15915 33 SQIATKYHEQLTTKALIDLFES 54 (99)
Q Consensus 33 VqvA~kYsd~lg~~~lI~lFE~ 54 (99)
-+||++|-|+-=.-+|++||.+
T Consensus 167 ~~vaK~yid~~FvlkLLdLFdS 188 (457)
T KOG2085|consen 167 PSVAKKYIDQKFVLKLLDLFDS 188 (457)
T ss_pred HHHHHHHhhHHHHHHHHHHhcC
Confidence 3789999999999999999983
No 36
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain
Probab=26.49 E-value=22 Score=28.54 Aligned_cols=13 Identities=46% Similarity=0.928 Sum_probs=10.6
Q ss_pred CCccCCCCccchh
Q psy15915 4 GGKRDRGRPKTRY 16 (99)
Q Consensus 4 ~~~~~~~~~~~~~ 16 (99)
|.||.||||.+..
T Consensus 66 dAKRKRGRPRKKs 78 (219)
T PF14621_consen 66 DAKRKRGRPRKKS 78 (219)
T ss_pred hhhhhcCCCccCC
Confidence 7799999998643
No 37
>PF05414 DUF1717: Viral domain of unknown function (DUF1717); InterPro: IPR008745 The domain is found towards the N terminus of the polyprotein of Apple stem grooving virus (strain P-209) (ASGV), Citrus tatter leaf virus and from Apple stem grooving virus (strain Korea) (ASGV) (Pear black necrotic leaf spot virus). Its function is unknown [, ].
Probab=25.79 E-value=38 Score=24.32 Aligned_cols=35 Identities=20% Similarity=0.456 Sum_probs=30.8
Q ss_pred hHHHHHHHHhccccchhhheehhhccccCCchhHH
Q psy15915 45 TKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHF 79 (99)
Q Consensus 45 ~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhF 79 (99)
+.+++++-=.|-.|+-+-||=-||+-++.||+.|-
T Consensus 28 SskllE~cc~flgyrVhiyyGdSiiklnD~~nMha 62 (101)
T PF05414_consen 28 SSKLLEICCKFLGYRVHIYYGDSIIKLNDNPNMHA 62 (101)
T ss_pred cHHHHHHHHhhcccEEEEEecceEEEecCCcchhe
Confidence 45778888889999999999999999999999874
No 38
>COG1816 Add Adenosine deaminase [Nucleotide transport and metabolism]
Probab=25.63 E-value=80 Score=26.40 Aligned_cols=80 Identities=18% Similarity=0.334 Sum_probs=55.1
Q ss_pred CCCccchhHHHhcccCcchhHHHHHHHHHHHhhhhC---------hHHHHHH--HHhccccchhhheehhhccccCCc-h
Q psy15915 9 RGRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLT---------TKALIDL--FESFKSYEGLFYFLGSIVNYSQDQ-E 76 (99)
Q Consensus 9 ~~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg---------~~~lI~l--FE~~ks~EGLyyyLgsiVn~seDp-d 76 (99)
+..||.+ ++-=|..|++. +.|...|.+|.=.+. ++-.... |.+..++...||...++.-+-+|. +
T Consensus 9 ~~~pkae-lH~HL~g~l~p--~~v~~la~r~gi~~~~~~~~~~~~~~~~~~~~~~~~l~~fl~~~y~~~~v~~~~~~~~~ 85 (345)
T COG1816 9 RHLPKAE-LHRHLEGSLRP--ELVLELARRYGIALPPAEFDETILEELRAEYNKFNDLQEFLEKYYRGASVLRTEEDFYR 85 (345)
T ss_pred hhchhhH-hhhcccCCcCH--HHHHHHHHHhCccCCcccccchhHHHHHHHHhccccHHHHHHHHHHHHHhhccHHHHHH
Confidence 4456654 44446777774 456788888766555 3333444 377888999999988886665665 6
Q ss_pred hHHHHHHHhhhcCCc
Q psy15915 77 VHFKYIQVQGKMHYT 91 (99)
Q Consensus 77 VhFKYIeAAak~gQ~ 91 (99)
+-+-||+.|++-|-.
T Consensus 86 la~~~~~~~~~~~~v 100 (345)
T COG1816 86 LAYEYLEDAAADNVV 100 (345)
T ss_pred HHHHHHHHHHhcCCe
Confidence 788999999987753
No 39
>PF02001 DUF134: Protein of unknown function DUF134; InterPro: IPR002852 The bacterial and archaeal proteins in this family have no known function.
Probab=25.30 E-value=33 Score=24.35 Aligned_cols=13 Identities=46% Similarity=0.951 Sum_probs=8.8
Q ss_pred ccCCCCccc-hhHH
Q psy15915 6 KRDRGRPKT-RYMH 18 (99)
Q Consensus 6 ~~~~~~~~~-~~l~ 18 (99)
+|++|||++ |++.
T Consensus 5 r~~~~RP~k~R~v~ 18 (106)
T PF02001_consen 5 RRGRGRPRKPRRVC 18 (106)
T ss_pred cCcCCCCCCCEEec
Confidence 578899875 4443
No 40
>PRK04966 hypothetical protein; Provisional
Probab=24.85 E-value=50 Score=22.26 Aligned_cols=25 Identities=36% Similarity=0.388 Sum_probs=22.5
Q ss_pred hhhChHHHHHHHHhccccchhhhee
Q psy15915 41 EQLTTKALIDLFESFKSYEGLFYFL 65 (99)
Q Consensus 41 d~lg~~~lI~lFE~~ks~EGLyyyL 65 (99)
++|-++.|-.|.|+|=+-||-.|=.
T Consensus 6 ~~L~~eTL~nLIeefv~ReGTdyG~ 30 (72)
T PRK04966 6 QDLAPETLENLIESFVLREGTDYGE 30 (72)
T ss_pred HhCCHHHHHHHHHHHHhccCccCCc
Confidence 5899999999999999999998844
No 41
>COG4263 NosZ Nitrous oxide reductase [Energy production and conversion]
Probab=24.59 E-value=33 Score=31.17 Aligned_cols=54 Identities=24% Similarity=0.388 Sum_probs=40.6
Q ss_pred HHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCchhHH--HHHHHhhhcCCceeecc
Q psy15915 35 IATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHF--KYIQVQGKMHYTTKVGD 96 (99)
Q Consensus 35 vA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhF--KYIeAAak~gQ~kEVER 96 (99)
|++.|+-.+-...| |+|-||. -|+-....-+|--|.| |=||||.+.|.|+|+--
T Consensus 239 vd~dy~Gk~aa~tc------YNsE~g~--~l~dMMssEqDWvVvFNlkRIEeAVa~G~fKeiG~ 294 (637)
T COG4263 239 VDADYQGKYAAATC------YNSEEGF--TLADMMSSERDWVVVFNLKRIEAAVKAGRFKTIGD 294 (637)
T ss_pred cccccccceeeeee------ccccccc--CHHHHhhcccceEEEEehHHHHHHHhcCCceeeCC
Confidence 46677766666654 6777774 4566777778887776 78999999999999854
No 42
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=24.56 E-value=33 Score=20.99 Aligned_cols=14 Identities=7% Similarity=-0.242 Sum_probs=11.3
Q ss_pred HhhhcCCceeeccc
Q psy15915 84 VQGKMHYTTKVGDL 97 (99)
Q Consensus 84 AAak~gQ~kEVERv 97 (99)
-|.|.|||...|+-
T Consensus 23 Wavk~GQfdD~e~~ 36 (45)
T PF03597_consen 23 WAVKSGQFDDLEGP 36 (45)
T ss_pred HHHccCCCCCCcch
Confidence 37899999988863
No 43
>KOG2003|consensus
Probab=24.43 E-value=1.1e+02 Score=28.33 Aligned_cols=32 Identities=22% Similarity=0.145 Sum_probs=25.3
Q ss_pred chhHHHHHHHHHHHhhhhChHHHHHHHHhccc
Q psy15915 26 KEKKLEVSQIATKYHEQLTTKALIDLFESFKS 57 (99)
Q Consensus 26 rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks 57 (99)
+.|.|+.||+|.-|.-.=.+..-|+++-+-.|
T Consensus 555 ~nn~evl~qianiye~led~aqaie~~~q~~s 586 (840)
T KOG2003|consen 555 LNNAEVLVQIANIYELLEDPAQAIELLMQANS 586 (840)
T ss_pred HhhHHHHHHHHHHHHHhhCHHHHHHHHHHhcc
Confidence 34799999999999888888888887765443
No 44
>PF11753 DUF3310: Protein of unknwon function (DUF3310); InterPro: IPR021739 This entry is represented by Bacteriophage T7, Gp1.7. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=23.80 E-value=24 Score=22.26 Aligned_cols=40 Identities=28% Similarity=0.334 Sum_probs=29.0
Q ss_pred ChHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHHhhhcCCcee
Q psy15915 44 TTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQGKMHYTTK 93 (99)
Q Consensus 44 g~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAak~gQ~kE 93 (99)
|.-.+|++.+.+...+.+=|.+|++ +||+=-|-+-+-.+.
T Consensus 11 ~~ie~id~~~~~~~~~~~~f~~gna----------iKY~~R~~~K~~~eD 50 (60)
T PF11753_consen 11 GGIECIDFIEQFTEEQFLGFCLGNA----------IKYLWRAGKKNGIED 50 (60)
T ss_pred CCccHHHHHHHhcchhhhhHHHHHH----------HHHHHHHcccCcHHH
Confidence 4456777777777778888899999 899877665544333
No 45
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=23.56 E-value=24 Score=17.77 Aligned_cols=17 Identities=12% Similarity=0.024 Sum_probs=11.8
Q ss_pred HHHHhhhcCCceeeccc
Q psy15915 81 YIQVQGKMHYTTKVGDL 97 (99)
Q Consensus 81 YIeAAak~gQ~kEVERv 97 (99)
.|.|+++.|+++.+.++
T Consensus 7 ll~a~~~~g~~~~a~~~ 23 (34)
T PF13812_consen 7 LLRACAKAGDPDAALQL 23 (34)
T ss_pred HHHHHHHCCCHHHHHHH
Confidence 47788888887766543
No 46
>PF15508 NAAA-beta: beta subunit of N-acylethanolamine-hydrolyzing acid amidase
Probab=23.27 E-value=87 Score=20.76 Aligned_cols=42 Identities=14% Similarity=0.359 Sum_probs=25.4
Q ss_pred CCccchhHHHhcccCcchhHHHHHHHHHHHhhhhChH-HHHHHHH
Q psy15915 10 GRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTK-ALIDLFE 53 (99)
Q Consensus 10 ~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~-~lI~lFE 53 (99)
+.|..||.+-+ + +.+..++.+++...+.-+.+.+. .++.+..
T Consensus 12 lpP~eRw~~i~-~-~~k~~i~~l~~~~~~~~~~~~~~~~~~~~v~ 54 (95)
T PF15508_consen 12 LPPEERWVQIA-K-DYKDEIRELIEVLKDLLQSFVPSGKVLDFVD 54 (95)
T ss_pred CCHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHhCCchHHHHHHH
Confidence 56888997743 2 34667777777766665554443 4444443
No 47
>TIGR00233 trpS tryptophanyl-tRNA synthetase. This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this model.
Probab=23.08 E-value=85 Score=25.43 Aligned_cols=61 Identities=13% Similarity=0.164 Sum_probs=42.8
Q ss_pred CcchhHHHHHHHHHHHhhhhC-----hHHHHHHH--------H-hccccchhhheehhhccccCCchhHHHHHHHhhhcC
Q psy15915 24 DLKEKKLEVSQIATKYHEQLT-----TKALIDLF--------E-SFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQGKMH 89 (99)
Q Consensus 24 N~rqNlQiVVqvA~kYsd~lg-----~~~lI~lF--------E-~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAak~g 89 (99)
..+|++.+.-+||.++....| |..++.-| . +-....+- |.++++.+||...|-|..|+-.+
T Consensus 148 DQ~~h~elaRdia~r~n~~~~~~f~~P~~l~~~~~~~l~gl~~~KMSKS~~~-----s~I~L~D~~e~I~~KI~~a~td~ 222 (328)
T TIGR00233 148 DQDQHLELTRDLAERFNKKFKNFFPKPESLISKFFPRLMGLSGKKMSKSDPN-----SAIFLTDTPKQIKKKIRKAATDG 222 (328)
T ss_pred ccHHHHHHHHHHHHHhhhhcCcccCCChhhhccccCCCCCCCCCcCCCCCCC-----CeEeecCCHHHHHHHHHhcCCCC
Confidence 458999999999999998776 55555333 1 22222222 57799999999999999765443
No 48
>COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]
Probab=23.06 E-value=88 Score=24.78 Aligned_cols=45 Identities=11% Similarity=0.223 Sum_probs=27.3
Q ss_pred HHHHHhccccchhhheehhhccccCC---chhHHHHHHHhhhcCCcee
Q psy15915 49 IDLFESFKSYEGLFYFLGSIVNYSQD---QEVHFKYIQVQGKMHYTTK 93 (99)
Q Consensus 49 I~lFE~~ks~EGLyyyLgsiVn~seD---pdVhFKYIeAAak~gQ~kE 93 (99)
+-.+|+-+.|-|+|+.|+..++--.- .|+..+=+..=...+++.|
T Consensus 91 v~a~E~~~~f~G~YhVL~G~lspl~gigpe~l~i~~L~~Rl~~~~~~E 138 (198)
T COG0353 91 VLALEKTGEFRGLYHVLGGLLSPLDGIGPEDLNIDELLQRLAEGSIKE 138 (198)
T ss_pred HHHHHHhcccCeeEEEecCccCcccCCCcccccHHHHHHHHhcCCCce
Confidence 45679999999999999965433221 2445554444444444444
No 49
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=22.52 E-value=41 Score=18.73 Aligned_cols=23 Identities=4% Similarity=0.082 Sum_probs=17.6
Q ss_pred chhHHHHHHHhhhcCCceeeccc
Q psy15915 75 QEVHFKYIQVQGKMHYTTKVGDL 97 (99)
Q Consensus 75 pdVhFKYIeAAak~gQ~kEVERv 97 (99)
|++.+.+-++-...||+.+-+++
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~ 23 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERL 23 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHH
Confidence 67778888888888888776653
No 50
>PRK07688 thiamine/molybdopterin biosynthesis ThiF/MoeB-like protein; Validated
Probab=21.78 E-value=35 Score=27.67 Aligned_cols=65 Identities=14% Similarity=0.165 Sum_probs=41.7
Q ss_pred CCCccchhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHHhhhc
Q psy15915 9 RGRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQGKM 88 (99)
Q Consensus 9 ~~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAak~ 88 (99)
+|+||...+.+-|+. +..++.+- -+.+.+.++.+-++|++|. -||..+.+++..+.--++|.+.
T Consensus 77 ~g~~Ka~aa~~~l~~-inp~v~v~-----~~~~~~~~~~~~~~~~~~D----------lVid~~Dn~~~r~~ln~~~~~~ 140 (339)
T PRK07688 77 NNLPKAVAAKKRLEE-INSDVRVE-----AIVQDVTAEELEELVTGVD----------LIIDATDNFETRFIVNDAAQKY 140 (339)
T ss_pred CCCcHHHHHHHHHHH-HCCCcEEE-----EEeccCCHHHHHHHHcCCC----------EEEEcCCCHHHHHHHHHHHHHh
Confidence 478999988776652 22333321 2344566777777776542 3677788888888766777766
Q ss_pred C
Q psy15915 89 H 89 (99)
Q Consensus 89 g 89 (99)
|
T Consensus 141 ~ 141 (339)
T PRK07688 141 G 141 (339)
T ss_pred C
Confidence 5
No 51
>PRK08223 hypothetical protein; Validated
Probab=21.57 E-value=26 Score=28.42 Aligned_cols=67 Identities=10% Similarity=0.089 Sum_probs=44.4
Q ss_pred cCCCCccchhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCc--hhHHHHHHH
Q psy15915 7 RDRGRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQ--EVHFKYIQV 84 (99)
Q Consensus 7 ~~~~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDp--dVhFKYIeA 84 (99)
.+=|+||.+.+++-|+ .+-.+.+|. .|.+.+.++++-++|+.+.- ||+-+.++ +..+.--++
T Consensus 76 ~diG~~Kve~a~~~l~-~iNP~v~V~-----~~~~~l~~~n~~~ll~~~Dl----------VvD~~D~~~~~~r~~ln~~ 139 (287)
T PRK08223 76 STLGRPKAEVLAEMVR-DINPELEIR-----AFPEGIGKENADAFLDGVDV----------YVDGLDFFEFDARRLVFAA 139 (287)
T ss_pred hHCCCcHHHHHHHHHH-HHCCCCEEE-----EEecccCccCHHHHHhCCCE----------EEECCCCCcHHHHHHHHHH
Confidence 4568999999888775 234445543 25667778888888876531 56766665 566666677
Q ss_pred hhhcC
Q psy15915 85 QGKMH 89 (99)
Q Consensus 85 Aak~g 89 (99)
|.+.|
T Consensus 140 c~~~~ 144 (287)
T PRK08223 140 CQQRG 144 (287)
T ss_pred HHHcC
Confidence 77665
No 52
>PF13877 RPAP3_C: Potential Monad-binding region of RPAP3
Probab=21.46 E-value=1.3e+02 Score=19.33 Aligned_cols=58 Identities=17% Similarity=0.216 Sum_probs=37.0
Q ss_pred HHHHHhhhhChHHHHHHHHhccccchhhheehhhcc--ccCCchhHHHHHHHhhhcCCce
Q psy15915 35 IATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVN--YSQDQEVHFKYIQVQGKMHYTT 92 (99)
Q Consensus 35 vA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn--~seDpdVhFKYIeAAak~gQ~k 92 (99)
-.-+|=..++++.+=++|..-=+.|=|=-+|..+-. .+.|++..+.++++=++++.|.
T Consensus 23 ~~~~yL~~i~p~~l~~if~~~l~~~~L~~il~~l~~~~~~~~~~~i~~~L~~L~~~~RF~ 82 (94)
T PF13877_consen 23 ERYEYLKSIPPDSLPKIFKNSLEPEFLSEILEALNEHFIPEDPEFIFEILEALSKVKRFD 82 (94)
T ss_pred HHHHHHHhCChHHHHHHHHccCCHHHHHHHHHHHHHHHccCCHHHHHHHHHHhcCCCCHH
Confidence 344566666777777777663334433334433322 3468899999999999988764
No 53
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=21.44 E-value=2.3e+02 Score=18.01 Aligned_cols=18 Identities=11% Similarity=-0.005 Sum_probs=7.1
Q ss_pred CchhHHHHHHHhhhcCCc
Q psy15915 74 DQEVHFKYIQVQGKMHYT 91 (99)
Q Consensus 74 DpdVhFKYIeAAak~gQ~ 91 (99)
++++++..=.+-...|++
T Consensus 84 ~~~~~~~la~~~~~~g~~ 101 (135)
T TIGR02552 84 DPRPYFHAAECLLALGEP 101 (135)
T ss_pred ChHHHHHHHHHHHHcCCH
Confidence 334444433333334443
No 54
>COG1312 UxuA D-mannonate dehydratase [Carbohydrate transport and metabolism]
Probab=21.20 E-value=74 Score=27.38 Aligned_cols=49 Identities=20% Similarity=0.362 Sum_probs=37.6
Q ss_pred HHHHHHHHHHhhhhC----------hHHHHHHHHhcc------ccchhhheehhhccccCCchhH
Q psy15915 30 LEVSQIATKYHEQLT----------TKALIDLFESFK------SYEGLFYFLGSIVNYSQDQEVH 78 (99)
Q Consensus 30 QiVVqvA~kYsd~lg----------~~~lI~lFE~~k------s~EGLyyyLgsiVn~seDpdVh 78 (99)
|-.+.-|.++...+. -+.+-++||.|+ .++.|-|||-.|++..|+-+|.
T Consensus 143 ~~~~~~~~~~~~~m~glPG~~~~~tl~~~~~~~~~y~~Id~~~L~~nL~yFL~~ViPVAEe~gVk 207 (362)
T COG1312 143 QEEIARAEERFAEMSGLPGWEEGYTLDQFRELLELYGGIDEEKLWENLAYFLKEVIPVAEEVGVK 207 (362)
T ss_pred HHHHHHHHHhhhcccCCCCCcccccHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhcchHHhcCce
Confidence 444445556666665 788999999997 4889999999999998876663
No 55
>PF05158 RNA_pol_Rpc34: RNA polymerase Rpc34 subunit; InterPro: IPR007832 The family comprises a subunit specific to RNA Pol III, the tRNA specific polymerase. The C34 subunit of Saccharomyces cerevisiae RNA Pol III is part of a subcomplex of three subunits which have no counterpart in the other two nuclear RNA polymerases. This subunit interacts with TFIIIB70 and therefore participates in Pol III recruitment [].; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2DK8_A 2DK5_A 2YU3_A.
Probab=21.09 E-value=68 Score=26.25 Aligned_cols=38 Identities=18% Similarity=0.295 Sum_probs=25.7
Q ss_pred HHhccccchhhheeh------hhccccCCchhHHHHHHHhhhcC
Q psy15915 52 FESFKSYEGLFYFLG------SIVNYSQDQEVHFKYIQVQGKMH 89 (99)
Q Consensus 52 FE~~ks~EGLyyyLg------siVn~seDpdVhFKYIeAAak~g 89 (99)
|+=++...++.|=+- .+-+++.|..++|.+||+|...|
T Consensus 57 ~~~~~~~~~l~~~~~~~~~a~k~~~l~~~e~lvy~~I~~ag~~G 100 (327)
T PF05158_consen 57 LKLLKKGGGLSYKAVSEEEAKKLKGLSDEERLVYQLIEEAGNKG 100 (327)
T ss_dssp EEEEE-SSSEEEEE--SSS-----SSSCCHHHHHHHHHHHTTT-
T ss_pred EEEEEcCCEEEEEEeCHHHHhhhcCCCHHHHHHHHHHHHhCCCC
Confidence 344556677777654 36788999999999999998887
No 56
>TIGR02355 moeB molybdopterin synthase sulfurylase MoeB. This model describes the molybdopterin biosynthesis protein MoeB in E. coli and related species. The enzyme covalently modifies the molybdopterin synthase MoaD by sulfurylation. This enzyme is closely related to ThiF, a thiamine biosynthesis enzyme that modifies ThiS by an analogous adenylation. Both MoeB and ThiF belong to the HesA/MoeB/ThiF family (pfam00899).
Probab=21.08 E-value=35 Score=26.19 Aligned_cols=67 Identities=15% Similarity=0.176 Sum_probs=43.6
Q ss_pred cCCCCccchhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCCchhHHHHHHHhh
Q psy15915 7 RDRGRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQDQEVHFKYIQVQG 86 (99)
Q Consensus 7 ~~~~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seDpdVhFKYIeAAa 86 (99)
.+=|+||...+.+.|+. +-.++.+. -+.+.+..+.+-++|+++. -||..+.+++..+.--++|.
T Consensus 73 ~diG~~Ka~~a~~~l~~-inp~v~i~-----~~~~~i~~~~~~~~~~~~D----------lVvd~~D~~~~r~~ln~~~~ 136 (240)
T TIGR02355 73 ANIGQPKVESAKDALTQ-INPHIAIN-----PINAKLDDAELAALIAEHD----------IVVDCTDNVEVRNQLNRQCF 136 (240)
T ss_pred hhCCCcHHHHHHHHHHH-HCCCcEEE-----EEeccCCHHHHHHHhhcCC----------EEEEcCCCHHHHHHHHHHHH
Confidence 46699999988877752 22333322 2455666677777777654 36777788888777767776
Q ss_pred hcC
Q psy15915 87 KMH 89 (99)
Q Consensus 87 k~g 89 (99)
+.|
T Consensus 137 ~~~ 139 (240)
T TIGR02355 137 AAK 139 (240)
T ss_pred HcC
Confidence 654
No 57
>PRK04217 hypothetical protein; Provisional
Probab=21.00 E-value=56 Score=23.14 Aligned_cols=50 Identities=24% Similarity=0.169 Sum_probs=27.3
Q ss_pred CccCCccCCCCccc-h-----------------hHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHH
Q psy15915 1 MEVGGKRDRGRPKT-R-----------------YMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLF 52 (99)
Q Consensus 1 ~~~~~~~~~~~~~~-~-----------------~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lF 52 (99)
|-.|=-||||||.+ + |++++-...+...-+.++ ...|.+.+..+.+-+.+
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~Lt~eereai--~l~~~eGlS~~EIAk~L 68 (110)
T PRK04217 1 MPFGWGRGRGRRRKMRMIGFIPQVRHFYPAIPPVGPPKPPIFMTYEEFEAL--RLVDYEGLTQEEAGKRM 68 (110)
T ss_pred CCcccccCCCCCCCCeEeeccCCcceEeCCCCCccCCCCcccCCHHHHHHH--HHHHHcCCCHHHHHHHH
Confidence 45666799999987 2 555555555444444332 22344445555444433
No 58
>PRK11675 LexA regulated protein; Provisional
Probab=20.95 E-value=46 Score=23.29 Aligned_cols=21 Identities=29% Similarity=0.479 Sum_probs=13.4
Q ss_pred ccCCCCccchhH--HHhcccCcc
Q psy15915 6 KRDRGRPKTRYM--HTVFPQDLK 26 (99)
Q Consensus 6 ~~~~~~~~~~~l--~~lL~~N~r 26 (99)
-+.+||||+.=| ++=|+-|-|
T Consensus 17 ~~~~grp~tnpl~r~~ql~inkr 39 (90)
T PRK11675 17 ERRPGRPKTNPLSRDEQLRINKR 39 (90)
T ss_pred CCCCCCCCCCCCChHHHHHhhHH
Confidence 467899998644 334555644
No 59
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=20.84 E-value=66 Score=27.96 Aligned_cols=71 Identities=17% Similarity=0.265 Sum_probs=43.0
Q ss_pred hhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHH--hccccc---hhhheehhhccccCCchhHHHHHHHhhh
Q psy15915 15 RYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFE--SFKSYE---GLFYFLGSIVNYSQDQEVHFKYIQVQGK 87 (99)
Q Consensus 15 ~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE--~~ks~E---GLyyyLgsiVn~seDpdVhFKYIeAAak 87 (99)
.-|+.|... ....+++.+++.+|.+.+.+..-..--. +-..-. =.+|||+++-+...|.+--..||+.|-.
T Consensus 147 ~~lk~Ly~d--~~K~~~i~~l~~~~~~~l~~~~~~~~~~~~~~~~p~~~lw~~~~lAqhyd~~g~~~~Al~~Id~aI~ 222 (517)
T PF12569_consen 147 SNLKPLYKD--PEKAAIIESLVEEYVNSLESNGSFSNGDDEEKEPPSTLLWTLYFLAQHYDYLGDYEKALEYIDKAIE 222 (517)
T ss_pred HHHHHHHcC--hhHHHHHHHHHHHHHHhhcccCCCCCccccccCCchHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHh
Confidence 345555542 4557778888888776553321100000 000001 1679999999999999999999998754
No 60
>PF07797 DUF1639: Protein of unknown function (DUF1639); InterPro: IPR012438 This approximately 50-residue region is found in a number of sequences derived from hypothetical plant proteins. This region features a highly basic 5 amino-acid stretch towards its centre.
Probab=20.60 E-value=52 Score=21.00 Aligned_cols=14 Identities=36% Similarity=0.584 Sum_probs=11.0
Q ss_pred cCCccCCCCccchh
Q psy15915 3 VGGKRDRGRPKTRY 16 (99)
Q Consensus 3 ~~~~~~~~~~~~~~ 16 (99)
+-|.|-..|||+|-
T Consensus 13 m~G~kpprRPkKRp 26 (50)
T PF07797_consen 13 MTGSKPPRRPKKRP 26 (50)
T ss_pred HhCCCCCCCCCccc
Confidence 45788889999873
No 61
>PF05030 SSXT: SSXT protein (N-terminal region); InterPro: IPR007726 SSXT (also known as SYT or SS18) appears to function synergistically with RBM14 as a transcriptional coactivator []. The SSXT protein is involved in synovial sarcoma in humans. A SYT-SSX fusion gene resulting from the chromosomal translocation t(X;18) (p11;q11) is characteristic of synovial sarcomas. This translocation fuses the SSXT (SYT) gene from chromosome 18 to either of two homologous genes at Xp11, SSX1 or SSX2 []. This entry also includes SS18-like protein 1, a transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons [],[].
Probab=20.03 E-value=2.5e+02 Score=18.55 Aligned_cols=60 Identities=13% Similarity=0.309 Sum_probs=39.1
Q ss_pred cCCCCccchhHHHhcccCcchhHHHHHHHHHHHhhhhChHHHHHHHHhccccchhhheehhhccccCC
Q psy15915 7 RDRGRPKTRYMHTVFPQDLKEKKLEVSQIATKYHEQLTTKALIDLFESFKSYEGLFYFLGSIVNYSQD 74 (99)
Q Consensus 7 ~~~~~~~~~~l~~lL~~N~rqNlQiVVqvA~kYsd~lg~~~lI~lFE~~ks~EGLyyyLgsiVn~seD 74 (99)
||+..|-++-++.||..| -|+| +...+|..+--...++..++..- .. ..||++|.+.+.+
T Consensus 1 r~~~~~~~~~IQk~LdEN----~~LI-~~I~e~qn~Gr~~Ec~qyq~~Lh--rN-L~yLA~iAD~~~~ 60 (65)
T PF05030_consen 1 RGKAQITTEQIQKMLDEN----DQLI-QCIQEYQNKGRAQECVQYQQILH--RN-LVYLATIADASQN 60 (65)
T ss_pred CCCCCCCHHHHHHHHHHH----HHHH-HHHHHHHHcCCHHHHHHHHHHHH--HH-HHHHHHHHcCCcc
Confidence 788889999999999986 5554 44456766666666666554321 22 3467777655443
Done!