Query 009621
Match_columns 530
No_of_seqs 116 out of 144
Neff 3.4
Searched_HMMs 46136
Date Thu Mar 28 15:20:03 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/009621.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/009621hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11891 DUF3411: Domain of un 100.0 5.3E-70 1.1E-74 513.1 12.7 170 245-414 1-180 (180)
2 PF06524 NOA36: NOA36 protein; 89.7 0.32 6.9E-06 50.3 3.6 12 102-113 218-229 (314)
3 PF14812 PBP1_TM: Transmembran 83.1 0.37 8E-06 41.8 0.0 14 148-161 35-48 (81)
4 PF02084 Bindin: Bindin; Inte 80.5 1.5 3.3E-05 44.4 3.2 30 181-210 104-133 (238)
5 PF02979 NHase_alpha: Nitrile 71.5 2.5 5.5E-05 41.8 2.0 48 205-255 17-68 (188)
6 KOG0943 Predicted ubiquitin-pr 70.9 1.2 2.5E-05 54.1 -0.5 72 177-266 1796-1869(3015)
7 COG4907 Predicted membrane pro 70.8 2.6 5.6E-05 46.8 2.1 6 116-121 565-570 (595)
8 PLN03138 Protein TOC75; Provis 68.8 7.6 0.00016 45.6 5.3 10 448-457 497-506 (796)
9 COG4907 Predicted membrane pro 56.3 8.1 0.00018 43.2 2.5 16 124-139 576-591 (595)
10 PF11705 RNA_pol_3_Rpc31: DNA- 54.2 7.7 0.00017 38.4 1.8 7 66-72 90-96 (233)
11 PLN03138 Protein TOC75; Provis 52.4 10 0.00022 44.5 2.7 15 197-211 187-201 (796)
12 KOG3540 Beta amyloid precursor 51.2 18 0.00039 40.7 4.1 21 171-192 255-275 (615)
13 PF09849 DUF2076: Uncharacteri 51.1 17 0.00037 37.2 3.8 9 68-76 157-165 (247)
14 PF14812 PBP1_TM: Transmembran 47.7 6.1 0.00013 34.5 0.0 11 150-160 33-43 (81)
15 TIGR02877 spore_yhbH sporulati 45.1 22 0.00047 38.6 3.6 11 459-470 335-345 (371)
16 PF10446 DUF2457: Protein of u 44.4 12 0.00027 41.3 1.6 14 321-334 294-307 (458)
17 TIGR01323 nitrile_alph nitrile 40.5 32 0.0007 34.2 3.7 46 205-253 11-60 (185)
18 PF09026 CENP-B_dimeris: Centr 39.7 9.7 0.00021 34.5 0.0 18 185-202 51-68 (101)
19 PF04889 Cwf_Cwc_15: Cwf15/Cwc 39.3 24 0.00052 36.0 2.7 6 221-226 186-191 (244)
20 PLN00151 potassium transporter 39.3 38 0.00082 40.3 4.6 10 487-496 431-440 (852)
21 PF04931 DNA_pol_phi: DNA poly 38.3 43 0.00092 38.9 4.8 6 69-74 559-564 (784)
22 PHA00458 single-stranded DNA-b 34.8 40 0.00087 34.5 3.4 7 121-127 180-186 (233)
23 PF11705 RNA_pol_3_Rpc31: DNA- 31.7 29 0.00062 34.4 1.8 8 152-159 213-220 (233)
24 PF08595 RXT2_N: RXT2-like, N- 30.6 28 0.0006 33.3 1.5 7 221-227 103-109 (149)
25 PF07415 Herpes_LMP2: Gammaher 29.5 18 0.00039 39.5 0.0 12 328-339 257-268 (489)
26 KOG3074 Transcriptional regula 28.5 34 0.00074 35.4 1.8 19 198-216 91-109 (263)
27 PRK05325 hypothetical protein; 27.9 81 0.0018 34.6 4.5 10 460-470 358-367 (401)
28 COG2818 Tag 3-methyladenine DN 27.8 30 0.00066 34.4 1.2 125 167-300 50-177 (188)
29 KOG2023 Nuclear transport rece 27.2 34 0.00074 40.1 1.6 26 467-492 774-799 (885)
30 KOG2023 Nuclear transport rece 27.0 90 0.002 36.9 4.9 40 89-128 302-344 (885)
31 PF04285 DUF444: Protein of un 25.8 61 0.0013 35.7 3.2 7 407-413 319-325 (421)
32 PF04931 DNA_pol_phi: DNA poly 25.6 39 0.00084 39.2 1.8 8 209-216 728-735 (784)
33 KOG0921 Dosage compensation co 24.9 67 0.0014 39.1 3.4 6 30-35 1119-1124(1282)
34 COG1512 Beta-propeller domains 24.2 95 0.0021 32.4 4.1 16 93-108 220-235 (271)
35 PLN03083 E3 UFM1-protein ligas 23.0 1.2E+02 0.0026 36.2 5.0 35 176-210 484-522 (803)
36 PF08671 SinI: Anti-repressor 22.8 84 0.0018 22.9 2.4 21 201-222 9-29 (30)
37 PF14407 Frankia_peptide: Ribo 22.4 57 0.0012 27.4 1.7 22 229-253 4-25 (61)
38 KOG3241 Uncharacterized conser 22.1 63 0.0014 32.5 2.2 6 135-140 183-188 (227)
39 PF11332 DUF3134: Protein of u 20.8 53 0.0011 28.3 1.2 7 149-155 60-66 (73)
40 COG3685 Uncharacterized protei 20.1 65 0.0014 31.7 1.8 52 173-226 57-118 (167)
No 1
>PF11891 DUF3411: Domain of unknown function (DUF3411); InterPro: IPR021825 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 168 to 186 amino acids in length. This domain has a conserved RYQ sequence motif.
Probab=100.00 E-value=5.3e-70 Score=513.09 Aligned_cols=170 Identities=45% Similarity=0.732 Sum_probs=163.8
Q ss_pred hhhccCchhhHHHHHHHHhhhhhhhhhhhhhcchhhhhHHHHHHHHHHHHHHHhHHHHhhcccccccCCCcc----cchH
Q 009621 245 GRMLADPSFLYKLILEQAATIGCTVLWELENRKERIKQEWDLALINVLTVTACNAFVVWSLAPCRSYGNTFR----FDLQ 320 (530)
Q Consensus 245 eRlLADP~FLfKL~iE~~I~i~~~v~aE~~~Rge~F~~ElDfV~sdvv~gsi~nfaLVwLLAPt~S~g~~~~----~~lq 320 (530)
|||||||+|||||++||+||++|+++|||++|||+||+|||||+||+++++|+||+||||||||++++++++ +.+|
T Consensus 1 ~RllADP~Fl~Kl~~E~~i~i~~~~~~e~~~R~e~f~~E~d~v~~d~v~~~i~n~~lv~llAPt~s~~~~~~~~~~~~~~ 80 (180)
T PF11891_consen 1 ERLLADPSFLFKLAIEEVIGIGCATAAEYAKRGERFWNELDFVFSDVVVGSIVNFALVWLLAPTRSFGSPAASSPGGGLQ 80 (180)
T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHcccchHHHHHHHHHHHHHHHHHHHHHHHhccchHhhCcccccccchHHH
Confidence 799999999999999999999999999999999999999999999999999999999999999999999887 6799
Q ss_pred HHhhcCCCccccccCCCCCcchhhhHHHHHhhchhhhhhhhhHHhhHHHHHHHHhh--ccCC----CCCCCCccchhhhh
Q 009621 321 NTLQKLPNNIFERSYPFREFDLQKRIHSLFYKAAELCMVGLSAGAVQGSLSNYLAG--KKDR----LSVTIPSVSTSALG 394 (530)
Q Consensus 321 ~~l~klP~N~Fq~s~pg~~fsL~qRiga~v~KGa~l~~VG~~aGlvG~glSN~L~~--kK~~----~s~~vPpv~~SAl~ 394 (530)
+++++||+||||+++||++||++||++||+|||++|++|||+||++|+++||+|++ |+++ +++++|||++||++
T Consensus 81 ~~~~~~P~n~Fq~~~~g~~fsl~qR~~~~~~kg~~l~~VG~~ag~vg~~lsn~L~~~rk~~~~~~e~~~~~ppv~~ta~~ 160 (180)
T PF11891_consen 81 KFLGSLPNNAFQKGYPGRSFSLAQRIGAFVYKGAKLAAVGFIAGLVGTGLSNALIAARKKVDPSFEPSVPVPPVLKTALG 160 (180)
T ss_pred HHHHhChHHHhccCCCCCcccHHHHHHHHHHcchHhhhhHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCCCHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999998 4444 45789999999999
Q ss_pred HHHHhhchhhhHHHHHHhHH
Q 009621 395 YGAFLGLYANMRYQLLCGFD 414 (530)
Q Consensus 395 wG~fMGvSSNLRYQlL~GvE 414 (530)
||+|||+|||+|||+|||+|
T Consensus 161 ~g~fmGvSsNlRYQil~GiE 180 (180)
T PF11891_consen 161 WGAFMGVSSNLRYQILNGIE 180 (180)
T ss_pred HHHHHhhhHhHHHHHHcCCC
Confidence 99999999999999999997
No 2
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=89.68 E-value=0.32 Score=50.32 Aligned_cols=12 Identities=25% Similarity=0.210 Sum_probs=5.5
Q ss_pred eeeeecccccee
Q 009621 102 VTLEKGKLDTTQ 113 (530)
Q Consensus 102 ~tleks~l~~~q 113 (530)
.|-|-..|.+|-
T Consensus 218 et~eTkdLSmSt 229 (314)
T PF06524_consen 218 ETQETKDLSMST 229 (314)
T ss_pred cccccccceeee
Confidence 344444455543
No 3
>PF14812 PBP1_TM: Transmembrane domain of transglycosylase PBP1 at N-terminal; PDB: 3FWL_A 3VMA_A.
Probab=83.08 E-value=0.37 Score=41.81 Aligned_cols=14 Identities=71% Similarity=0.902 Sum_probs=0.0
Q ss_pred CCCCCCCCCCCCCC
Q 009621 148 DDDDYFDDFDDGDE 161 (530)
Q Consensus 148 ddddy~d~~ddgd~ 161 (530)
++|||+||++++||
T Consensus 35 ~ddd~~DDD~dDde 48 (81)
T PF14812_consen 35 YDDDYEDDDDDDDE 48 (81)
T ss_dssp --------------
T ss_pred cccccccccccchh
Confidence 34445555444444
No 4
>PF02084 Bindin: Bindin; InterPro: IPR000775 Bindin, the major protein component of the acrosome granule of sea urchin sperm, mediates species-specific adhesion of sperm to the egg surface during fertilisation [, ]. The protein coats the acrosomal process after externalisation by the acrosome reaction; it binds to sulphated, fucose-containing polysaccharides on the vitelline-layer receptor proteoglycans that cover the egg plasma membrane. Bindins from different genera show high levels of sequence similarity in both the mature bindin domain and in the probindin precursor region. The most highly conserved region is a 42-residue segment in the central portion of the mature bindin protein. This domain may be responsible for conserved functions of bindin, while the more highly divergent flanking regions may be responsible for its species-specific properties [].; GO: 0007342 fusion of sperm to egg plasma membrane
Probab=80.52 E-value=1.5 Score=44.44 Aligned_cols=30 Identities=27% Similarity=0.554 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHhhccCchHHHHHHHHcCCc
Q 009621 181 KFVDAVLNEWMKTMMDLPAGFRQAYEMGLV 210 (530)
Q Consensus 181 ~~v~aVL~e~~rt~~sLPaDl~~A~e~Glv 210 (530)
|-.+.+.+-.+.|-.+||-||-+=++.||+
T Consensus 104 Kvm~~ikavLgaTKiDLPVDINDPYDlGLL 133 (238)
T PF02084_consen 104 KVMEDIKAVLGATKIDLPVDINDPYDLGLL 133 (238)
T ss_pred HHHHHHHHHhcccccccccccCChhhHHHH
Confidence 333444444578999999999999999863
No 5
>PF02979 NHase_alpha: Nitrile hydratase, alpha chain; InterPro: IPR004232 Nitrile hydratases (4.2.1.84 from EC) are bacterial enzymes that catalyse the hydration of nitrile compounds to the corresponding amides. They are used as biocatalysts in acrylamide production, one of the few commercial scale bioprocesses, as well as in environmental remediation for the removal of nitriles from waste streams. Nitrile hydratases are composed of two subunits, alpha and beta, and are normally active as a tetramer, alpha(2)beta(2). Nitrile hydratases contain either a non-haem iron or a non-corrinoid cobalt centre, both types sharing a highly conserved peptide sequence in the alpha subunit (CXLCSC) that provides all the residues involved in coordinating the metal ion. Each type of nitrile hydratase specifically incorporated its metal with the help of activator proteins encoded by flanking regions of the nitrile hydratase genes that are necessary for metal insertion. The Fe-containing enzyme is photo-regulated: in the dark the enzyme is inactivated due to the association of nitric oxide (NO) to the iron, while in the light the enzyme is active by photo-dissociation of NO. The NO is held in place by a claw setting formed through specific oxygen atoms in two modified cysteines and a serine residue in the active site [, ]. The cobalt-containing enzyme is unaffected by NO, but was shown to undergo a similar effect with carbon monoxide [, ]. Fe- and cobalt-containing enzymes also display different inhibition patterns with nitrophenols. Thiocyanate hydrolase (SCNase) is a cobalt-containing metalloenzyme with a cysteine-sulphinic acid ligand that hydrolyses thiocyanate to carbonyl sulphide and ammonia []. The two enzymes, nitrile hydratase and SCNase, are homologous over regions corresponding to almost the entire coding regions of the genes: the beta and alpha subunits of thiocyanate hydrolase were homologous to the amino- and carboxyl-terminal halves of the beta subunit of nitrile hydratase, and the gamma subunit of thiocyanate hydrolase was homologous to the alpha subunit of nitrile hydratase []. This entry represents the structural domain of the alpha subunit of both iron- and cobalt-containing nitrile hydratases; the alpha subunit is a duplication of two structural repeats, each consisting of 4 layers, alpha/beta/beta/alpha []. This structure is also found in the related protein, the gamma subunit of thiocyanate hydrolase (SCNase).; GO: 0003824 catalytic activity, 0046914 transition metal ion binding, 0006807 nitrogen compound metabolic process; PDB: 2DPP_A 3HHT_A 1V29_A 2ZZD_I 2DXC_F 2DXB_F 2DD5_C 2DD4_C 2ZPH_A 2CYZ_A ....
Probab=71.55 E-value=2.5 Score=41.76 Aligned_cols=48 Identities=27% Similarity=0.479 Sum_probs=35.5
Q ss_pred HHcCCcCHHHHHHHHHhccCC----chhHHHHhhcCcchhhhhhhhhccCchhhH
Q 009621 205 YEMGLVSSAQMVKFLAINARP----TTTRFISRSLPQGISRAFIGRMLADPSFLY 255 (530)
Q Consensus 205 ~e~Glvssa~L~rFl~L~~~P----~~~~~l~R~lp~~lsrgfReRlLADP~FLf 255 (530)
+|.|+|+++.+.++.+...+- +-++.+.|.+-. .+||.|||+||.=..
T Consensus 17 ~ekg~~~~~~~~~~~~~~~~~~~P~~GarvVArAW~D---p~FK~rLLaD~~aA~ 68 (188)
T PF02979_consen 17 IEKGLITPAEVDRIIETYESRVGPRNGARVVARAWTD---PAFKARLLADPTAAI 68 (188)
T ss_dssp HHTTSS-HHHHHHHHHHHHHTSSHHHHHHHHHHHHH----HHHHHHHHHSHHHHH
T ss_pred HHcCCCCHHHHHHHHHHHHhccCccccceeehhhhCC---HHHHHHHHHCHHHHH
Confidence 689999999999988876543 236777777644 699999999997443
No 6
>KOG0943 consensus Predicted ubiquitin-protein ligase/hyperplastic discs protein, HECT superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=70.86 E-value=1.2 Score=54.13 Aligned_cols=72 Identities=25% Similarity=0.154 Sum_probs=39.7
Q ss_pred hhcHHHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHHHHHHHHhccCCchhHHHHhhcCcchhhhhhh--hhccCchhh
Q 009621 177 LFDRKFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQMVKFLAINARPTTTRFISRSLPQGISRAFIG--RMLADPSFL 254 (530)
Q Consensus 177 ~fdR~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~L~rFl~L~~~P~~~~~l~R~lp~~lsrgfRe--RlLADP~FL 254 (530)
.=||-+|.++..|..+--+.+|+-+.+|--.-.-+.. +-+.+-. | ..+.++ =|+|||.-+
T Consensus 1796 ~~dRieVq~ataeSE~aaepV~~afe~adpqp~dpd~------dDnSSds----------Q--e~~a~e~aFmiad~ahp 1857 (3015)
T KOG0943|consen 1796 SGDRIEVQAATAESEAAAEPVPAAFEEADPQPNDPDD------DDNSSDS----------Q--EDDAEEEAFMIADPAHP 1857 (3015)
T ss_pred ccceeEEeecccccccccCCcccccCcCCCCCCCCCc------ccccccc----------c--cccccccceeecCCcch
Confidence 3466677778888888778888766554322111100 0000000 0 011111 189999999
Q ss_pred HHHHHHHHhhhh
Q 009621 255 YKLILEQAATIG 266 (530)
Q Consensus 255 fKL~iE~~I~i~ 266 (530)
..|+.|+-.-+.
T Consensus 1858 leva~aqpaN~p 1869 (3015)
T KOG0943|consen 1858 LEVALAQPANSP 1869 (3015)
T ss_pred hhHhhcccCCCc
Confidence 999999876543
No 7
>COG4907 Predicted membrane protein [Function unknown]
Probab=70.79 E-value=2.6 Score=46.84 Aligned_cols=6 Identities=17% Similarity=0.158 Sum_probs=2.2
Q ss_pred cccCce
Q 009621 116 SETTPE 121 (530)
Q Consensus 116 ~~~~p~ 121 (530)
+..+|.
T Consensus 565 sa~a~S 570 (595)
T COG4907 565 SAIASS 570 (595)
T ss_pred hccccc
Confidence 333333
No 8
>PLN03138 Protein TOC75; Provisional
Probab=68.79 E-value=7.6 Score=45.57 Aligned_cols=10 Identities=20% Similarity=0.292 Sum_probs=6.8
Q ss_pred hHhcCCCCCC
Q 009621 448 AWLGVEADPL 457 (530)
Q Consensus 448 ~wlGv~~~p~ 457 (530)
-||.-..||+
T Consensus 497 PWi~g~~d~~ 506 (796)
T PLN03138 497 PYLDGVYNPR 506 (796)
T ss_pred ccccCCCCcc
Confidence 4887666774
No 9
>COG4907 Predicted membrane protein [Function unknown]
Probab=56.30 E-value=8.1 Score=43.16 Aligned_cols=16 Identities=50% Similarity=0.896 Sum_probs=7.0
Q ss_pred cCCCCCCCCCCCCCCC
Q 009621 124 TGGGGGDIGKKINHGG 139 (530)
Q Consensus 124 ~ggggg~~g~~~~~GG 139 (530)
+|||+|..|||.||||
T Consensus 576 ~GGG~G~~gGg~GGGG 591 (595)
T COG4907 576 SGGGGGFSGGGSGGGG 591 (595)
T ss_pred CCCCCCcCCCCCCCCC
Confidence 3444444454433333
No 10
>PF11705 RNA_pol_3_Rpc31: DNA-directed RNA polymerase III subunit Rpc31; InterPro: IPR024661 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase III contains seventeen subunits in yeasts and in human cells. Twelve of these are akin to RNA polymerase I or II and the other five are RNA polymerase III-specific, and form the functionally distinct groups: (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34 and Rpc82 form a cluster of enzyme-specific subunits that contribute to transcription initiation in Saccharomyces cerevisiae and Homo sapiens. There is evidence that these subunits are anchored at or near the N-terminal Zn-fold of Rpc1, itself prolonged by a highly conserved but RNA polymerase III-specific domain []. This entry represents the Rpc31 subunit.
Probab=54.21 E-value=7.7 Score=38.41 Aligned_cols=7 Identities=29% Similarity=0.183 Sum_probs=4.8
Q ss_pred hhhhhhh
Q 009621 66 VVLLERC 72 (530)
Q Consensus 66 ~~~~er~ 72 (530)
...+||+
T Consensus 90 ~~~ierY 96 (233)
T PF11705_consen 90 FDDIERY 96 (233)
T ss_pred cccHHHH
Confidence 4567886
No 11
>PLN03138 Protein TOC75; Provisional
Probab=52.38 E-value=10 Score=44.54 Aligned_cols=15 Identities=13% Similarity=0.237 Sum_probs=6.7
Q ss_pred chHHHHHHHHcCCcC
Q 009621 197 LPAGFRQAYEMGLVS 211 (530)
Q Consensus 197 LPaDl~~A~e~Glvs 211 (530)
|=+|+..-.+.|...
T Consensus 187 Lq~dv~~I~~tG~F~ 201 (796)
T PLN03138 187 LQKELETLASCGMFE 201 (796)
T ss_pred HHHHHHHHHhcCCce
Confidence 334444444445444
No 12
>KOG3540 consensus Beta amyloid precursor protein [General function prediction only]
Probab=51.24 E-value=18 Score=40.74 Aligned_cols=21 Identities=43% Similarity=0.664 Sum_probs=14.6
Q ss_pred hhHHHHhhcHHHHHHHHHHHHh
Q 009621 171 RKILEELFDRKFVDAVLNEWMK 192 (530)
Q Consensus 171 r~~~~e~fdR~~v~aVL~e~~r 192 (530)
..-++| =-|+-++.||+||+.
T Consensus 255 kmrlee-khr~rmd~VmkEW~~ 275 (615)
T KOG3540|consen 255 KMRLEE-KHRKRMDKVMKEWEE 275 (615)
T ss_pred HHHHHH-HHHHHHHHHHHHHHH
Confidence 333443 357889999999973
No 13
>PF09849 DUF2076: Uncharacterized protein conserved in bacteria (DUF2076); InterPro: IPR018648 This family of hypothetical prokaryotic proteins has no known function but includes putative perimplasmic ligand-binding sensor proteins.
Probab=51.06 E-value=17 Score=37.17 Aligned_cols=9 Identities=11% Similarity=0.139 Sum_probs=4.5
Q ss_pred hhhhhhcCC
Q 009621 68 LLERCYKAP 76 (530)
Q Consensus 68 ~~er~~~~~ 76 (530)
+||=-|...
T Consensus 157 ~i~~lF~~~ 165 (247)
T PF09849_consen 157 GIESLFGGH 165 (247)
T ss_pred HHHHHhcCC
Confidence 355555543
No 14
>PF14812 PBP1_TM: Transmembrane domain of transglycosylase PBP1 at N-terminal; PDB: 3FWL_A 3VMA_A.
Probab=47.70 E-value=6.1 Score=34.47 Aligned_cols=11 Identities=64% Similarity=1.192 Sum_probs=0.0
Q ss_pred CCCCCCCCCCC
Q 009621 150 DDYFDDFDDGD 160 (530)
Q Consensus 150 ddy~d~~ddgd 160 (530)
|||+||.+|+|
T Consensus 33 DD~ddd~~DDD 43 (81)
T PF14812_consen 33 DDYDDDYEDDD 43 (81)
T ss_dssp -----------
T ss_pred hcccccccccc
Confidence 33333333333
No 15
>TIGR02877 spore_yhbH sporulation protein YhbH. This protein family, typified by YhbH in Bacillus subtilis, is found in nearly every endospore-forming bacterium and in no other genome (but note that the trusted cutoff score is set high to exclude a single high-scoring sequence from Nitrosococcus oceani ATCC 19707, which is classified in the Gammaproteobacteria). The gene in Bacillus subtilis was shown to be in the regulon of the sporulation sigma factor, sigma-E, and its mutation was shown to create a sporulation defect.
Probab=45.09 E-value=22 Score=38.57 Aligned_cols=11 Identities=18% Similarity=0.341 Sum_probs=6.2
Q ss_pred CchhHHHHHhcC
Q 009621 459 QSDDLLKKAYNR 470 (530)
Q Consensus 459 ~~~~ll~~~y~~ 470 (530)
.+.+|. .+|++
T Consensus 335 ~~~~l~-~~y~~ 345 (371)
T TIGR02877 335 YSNTLK-NKFKN 345 (371)
T ss_pred CcchHH-HHHHh
Confidence 344555 77755
No 16
>PF10446 DUF2457: Protein of unknown function (DUF2457); InterPro: IPR018853 This entry represents a family of uncharacterised proteins.
Probab=44.40 E-value=12 Score=41.33 Aligned_cols=14 Identities=14% Similarity=0.370 Sum_probs=7.5
Q ss_pred HHhhcCCCcccccc
Q 009621 321 NTLQKLPNNIFERS 334 (530)
Q Consensus 321 ~~l~klP~N~Fq~s 334 (530)
+.++.=|-..|...
T Consensus 294 k~~g~SPrrlf~sp 307 (458)
T PF10446_consen 294 KLRGRSPRRLFRSP 307 (458)
T ss_pred hccCCCCcccccCC
Confidence 34445566666554
No 17
>TIGR01323 nitrile_alph nitrile hydratase, alpha subunit. This model describes both iron- and cobalt-containing nitrile hydratase alpha chains. It excludes the thiocyanate hydrolase gamma subunit of Thiobacillus thioparus, a sequence that appears to have evolved from within the family of nitrile hydratase alpha subunits but which differs by several indels and a more rapid accumulation of point mutations.
Probab=40.50 E-value=32 Score=34.15 Aligned_cols=46 Identities=13% Similarity=0.323 Sum_probs=35.4
Q ss_pred HHcCCcCHHHHHHHHHhccC---C-chhHHHHhhcCcchhhhhhhhhccCchh
Q 009621 205 YEMGLVSSAQMVKFLAINAR---P-TTTRFISRSLPQGISRAFIGRMLADPSF 253 (530)
Q Consensus 205 ~e~Glvssa~L~rFl~L~~~---P-~~~~~l~R~lp~~lsrgfReRlLADP~F 253 (530)
+|.|+|+++.+.+.++.... | +-++.+.|.+-. .+||.|||+|..=
T Consensus 11 ~eKGli~~~~id~~i~~~~~~~gP~nGA~vVArAW~D---p~fk~~Ll~d~~a 60 (185)
T TIGR01323 11 KSKGLIPEGAVDQLTSLYENEWGPENGAKVVAKAWVD---PEFRALLLKDATA 60 (185)
T ss_pred HHcCCCCHHHHHHHHHHHHhccCCcchhhhhhHHhcC---HHHHHHHHhChHH
Confidence 67899999998888876543 2 236778887654 6999999999863
No 18
>PF09026 CENP-B_dimeris: Centromere protein B dimerisation domain; InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. CENP-B may organise arrays of centromere satellite DNA into a higher order structure, which then directs centromere formation and kinetochore assembly in mammalian chromosomes. The CENP-B dimerisation domain is composed of two alpha-helices, which are folded into an antiparallel configuration. Dimerisation of CENP-B is mediated by this domain, in which monomers dimerise to form a symmetrical, antiparallel, four-helix bundle structure with a large hydrophobic patch in which 23 residues of one monomer form van der Waals contacts with the other monomer. This CENP-B dimer configuration may be suitable for capturing two distant CENP-B boxes during centromeric heterochromatin formation []. ; GO: 0003677 DNA binding, 0003682 chromatin binding, 0006355 regulation of transcription, DNA-dependent, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 1UFI_A.
Probab=39.71 E-value=9.7 Score=34.49 Aligned_cols=18 Identities=11% Similarity=0.157 Sum_probs=8.6
Q ss_pred HHHHHHHhhccCchHHHH
Q 009621 185 AVLNEWMKTMMDLPAGFR 202 (530)
Q Consensus 185 aVL~e~~rt~~sLPaDl~ 202 (530)
+-|..-.|-+-|+|-|=+
T Consensus 51 ~~~~~v~rYltSf~id~~ 68 (101)
T PF09026_consen 51 AYFTMVKRYLTSFPIDDK 68 (101)
T ss_dssp HHHHHHHHHHCTS---HH
T ss_pred hhcchHhhhhhccchhHh
Confidence 344445556777776654
No 19
>PF04889 Cwf_Cwc_15: Cwf15/Cwc15 cell cycle control protein; InterPro: IPR006973 This family represents Cwf15/Cwc15 (from Schizosaccharomyces pombe and Saccharomyces cerevisiae respectively) and their homologues. The function of these proteins is unknown, but they form part of the spliceosome and are thus thought to be involved in mRNA splicing [].; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005681 spliceosomal complex
Probab=39.33 E-value=24 Score=35.96 Aligned_cols=6 Identities=17% Similarity=0.600 Sum_probs=2.7
Q ss_pred hccCCc
Q 009621 221 INARPT 226 (530)
Q Consensus 221 L~~~P~ 226 (530)
+...|+
T Consensus 186 ~~~NpL 191 (244)
T PF04889_consen 186 LSGNPL 191 (244)
T ss_pred HhCCcc
Confidence 344554
No 20
>PLN00151 potassium transporter; Provisional
Probab=39.28 E-value=38 Score=40.33 Aligned_cols=10 Identities=20% Similarity=0.371 Sum_probs=6.2
Q ss_pred eehhhhhhcc
Q 009621 487 ISKNAIVSGL 496 (530)
Q Consensus 487 ~s~~~~~~gl 496 (530)
|.-.|++||-
T Consensus 431 IASQA~ISgt 440 (852)
T PLN00151 431 IASRAMTTAT 440 (852)
T ss_pred HHHHHHHHHH
Confidence 5566666664
No 21
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=38.29 E-value=43 Score=38.88 Aligned_cols=6 Identities=50% Similarity=1.320 Sum_probs=3.0
Q ss_pred hhhhhc
Q 009621 69 LERCYK 74 (530)
Q Consensus 69 ~er~~~ 74 (530)
|.-|+.
T Consensus 559 l~~c~~ 564 (784)
T PF04931_consen 559 LQICYE 564 (784)
T ss_pred HHHHHH
Confidence 445554
No 22
>PHA00458 single-stranded DNA-binding protein
Probab=34.83 E-value=40 Score=34.55 Aligned_cols=7 Identities=71% Similarity=0.852 Sum_probs=3.1
Q ss_pred eeecCCC
Q 009621 121 ELATGGG 127 (530)
Q Consensus 121 ~~~~ggg 127 (530)
+|...+|
T Consensus 180 eL~ef~g 186 (233)
T PHA00458 180 ELAEFGG 186 (233)
T ss_pred EEEEecC
Confidence 4444433
No 23
>PF11705 RNA_pol_3_Rpc31: DNA-directed RNA polymerase III subunit Rpc31; InterPro: IPR024661 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase III contains seventeen subunits in yeasts and in human cells. Twelve of these are akin to RNA polymerase I or II and the other five are RNA polymerase III-specific, and form the functionally distinct groups: (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34 and Rpc82 form a cluster of enzyme-specific subunits that contribute to transcription initiation in Saccharomyces cerevisiae and Homo sapiens. There is evidence that these subunits are anchored at or near the N-terminal Zn-fold of Rpc1, itself prolonged by a highly conserved but RNA polymerase III-specific domain []. This entry represents the Rpc31 subunit.
Probab=31.67 E-value=29 Score=34.44 Aligned_cols=8 Identities=50% Similarity=0.999 Sum_probs=4.2
Q ss_pred CCCCCCCC
Q 009621 152 YFDDFDDG 159 (530)
Q Consensus 152 y~d~~ddg 159 (530)
|||++||+
T Consensus 213 YFDnGedD 220 (233)
T PF11705_consen 213 YFDNGEDD 220 (233)
T ss_pred cCCCCCcc
Confidence 36665543
No 24
>PF08595 RXT2_N: RXT2-like, N-terminal; InterPro: IPR013904 The entry represents the N-terminal region of RXT2-like proteins. In Saccharomyces cerevisiae (Baker's yeast), RXT2 has been demonstrated to be involved in conjugation with cellular fusion (mating) and invasive growth []. A high throughput localisation study has localised RXT2 to the nucleus [].
Probab=30.62 E-value=28 Score=33.30 Aligned_cols=7 Identities=14% Similarity=0.510 Sum_probs=3.3
Q ss_pred hccCCch
Q 009621 221 INARPTT 227 (530)
Q Consensus 221 L~~~P~~ 227 (530)
|-.+|.+
T Consensus 103 l~tHPai 109 (149)
T PF08595_consen 103 LPTHPAI 109 (149)
T ss_pred hcCCccc
Confidence 3445543
No 25
>PF07415 Herpes_LMP2: Gammaherpesvirus latent membrane protein (LMP2) protein; InterPro: IPR010881 This family consists of several Gammaherpesvirus latent membrane protein (LMP2) proteins. Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4) is a human gammaherpesvirus that infects and establishes latency in B lymphocytes in vivo. The latent membrane protein 2 (LMP2) gene is expressed in latently infected B cells and encodes two protein isoforms, LMP2A and LMP2B, that are identical except for an additional N-terminal 119 aa cytoplasmic domain which is present in the LMP2A isoform. LMP2A is thought to play a key role in either the establishment or the maintenance of latency and/or the reactivation of productive infection from the latent state. The significance of LMP2B and its role in pathogenesis remain unclear [].; GO: 0019042 latent virus infection, 0033644 host cell membrane; PDB: 2JO9_B 1UXW_C.
Probab=29.46 E-value=18 Score=39.45 Aligned_cols=12 Identities=17% Similarity=0.119 Sum_probs=0.0
Q ss_pred CccccccCCCCC
Q 009621 328 NNIFERSYPFRE 339 (530)
Q Consensus 328 ~N~Fq~s~pg~~ 339 (530)
+.+||.+.-+++
T Consensus 257 daI~Q~SpLLg~ 268 (489)
T PF07415_consen 257 DAIIQLSPLLGA 268 (489)
T ss_dssp ------------
T ss_pred HHHHhhchHhhH
Confidence 445555444433
No 26
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription]
Probab=28.54 E-value=34 Score=35.42 Aligned_cols=19 Identities=16% Similarity=0.198 Sum_probs=8.0
Q ss_pred hHHHHHHHHcCCcCHHHHH
Q 009621 198 PAGFRQAYEMGLVSSAQMV 216 (530)
Q Consensus 198 PaDl~~A~e~Glvssa~L~ 216 (530)
|.+++...|...+-++.|+
T Consensus 91 ~~~~~~~~ee~~lkSe~L~ 109 (263)
T KOG3074|consen 91 PPELAAPSEEHELKSEELQ 109 (263)
T ss_pred CcccccchhhhhHHHHHHh
Confidence 3344444444433334343
No 27
>PRK05325 hypothetical protein; Provisional
Probab=27.92 E-value=81 Score=34.64 Aligned_cols=10 Identities=30% Similarity=0.670 Sum_probs=6.5
Q ss_pred chhHHHHHhcC
Q 009621 460 SDDLLKKAYNR 470 (530)
Q Consensus 460 ~~~ll~~~y~~ 470 (530)
...|. ++|.+
T Consensus 358 ~~~l~-~~y~~ 367 (401)
T PRK05325 358 HQTLW-REYER 367 (401)
T ss_pred chHHH-HHHHH
Confidence 45566 88874
No 28
>COG2818 Tag 3-methyladenine DNA glycosylase [DNA replication, recombination, and repair]
Probab=27.79 E-value=30 Score=34.43 Aligned_cols=125 Identities=10% Similarity=0.021 Sum_probs=79.7
Q ss_pred chhhhhHHHHhhcHHHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHH--HHHHHHhccC-CchhHHHHhhcCcchhhhh
Q 009621 167 LFRRRKILEELFDRKFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQ--MVKFLAINAR-PTTTRFISRSLPQGISRAF 243 (530)
Q Consensus 167 ~~r~r~~~~e~fdR~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~--L~rFl~L~~~-P~~~~~l~R~lp~~lsrgf 243 (530)
++++|...||.|..=+++.|.+.-..-++.|=+|-.=.--.|+|-+.+ =+.+++|+.. .....|+--+++. +.-
T Consensus 50 VL~KRe~freaF~~Fd~~kVA~~~~~dverLl~d~gIIR~r~KI~A~i~NA~~~l~l~~e~Gsf~~flWsf~~~---~~~ 126 (188)
T COG2818 50 VLKKREAFREAFHGFDPEKVAAMTEEDVERLLADAGIIRNRGKIKATINNARAVLELQKEFGSFSEFLWSFVGG---KPS 126 (188)
T ss_pred HHHhHHHHHHHHhcCCHHHHHcCCHHHHHHHHhCcchhhhHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHhcCC---Ccc
Confidence 467888889998887777776654444444333321111233332222 3568888876 6666666555544 455
Q ss_pred hhhhccCchhhHHHHHHHHhhhhhhhhhhhhhcchhhhhHHHHHHHHHHHHHHHhHH
Q 009621 244 IGRMLADPSFLYKLILEQAATIGCTVLWELENRKERIKQEWDLALINVLTVTACNAF 300 (530)
Q Consensus 244 ReRlLADP~FLfKL~iE~~I~i~~~v~aE~~~Rge~F~~ElDfV~sdvv~gsi~nfa 300 (530)
+.....=+.|+.+ ..++..+..++++||-.|.-..=.+.-...+|.|.|-+
T Consensus 127 ~~~~~~~~~~pa~------t~~S~~mskaLKkrGf~fvGpTt~yafmqA~G~vndH~ 177 (188)
T COG2818 127 RNQVNDGSEVPAS------TELSDAMSKALKKRGFKFVGPTTVYAFMQATGLVNDHA 177 (188)
T ss_pred cccccchhhcccc------chhHHHHHHHHHHccCeecCcHHHHHHHHHHcchHHHH
Confidence 5655666677776 34455788899999999988877777777777777544
No 29
>KOG2023 consensus Nuclear transport receptor Karyopherin-beta2/Transportin (importin beta superfamily) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=27.17 E-value=34 Score=40.14 Aligned_cols=26 Identities=19% Similarity=0.201 Sum_probs=16.0
Q ss_pred HhcCCCCCcCCCCCCCCcceeehhhh
Q 009621 467 AYNRPSQDVNGSTQKSPRWLISKNAI 492 (530)
Q Consensus 467 ~y~~~~~~~~~~~~~~~~~~~s~~~~ 492 (530)
.|-+|.+.+...++--..|+.|-++|
T Consensus 774 g~~~Pe~vAp~l~~f~~pWc~sl~~i 799 (885)
T KOG2023|consen 774 GYICPEEVAPHLDSFMRPWCTSLRNI 799 (885)
T ss_pred hccCHHhcchhHHHHHHHHHHHhccc
Confidence 47777666655566666677654443
No 30
>KOG2023 consensus Nuclear transport receptor Karyopherin-beta2/Transportin (importin beta superfamily) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=27.03 E-value=90 Score=36.90 Aligned_cols=40 Identities=15% Similarity=0.327 Sum_probs=18.5
Q ss_pred CCccCc-ccCcccceeeeeccccceecc--cccCceeecCCCC
Q 009621 89 PLMKGG-QYGAFGAVTLEKGKLDTTQQQ--SETTPELATGGGG 128 (530)
Q Consensus 89 ~~~~g~-~~g~~ga~tleks~l~~~q~~--~~~~p~~~~gggg 128 (530)
||.-++ .|.----+.|+.-.=|-++.- ...+|+.+.+--.
T Consensus 302 PvLl~~M~Ysd~D~~LL~~~eeD~~vpDreeDIkPRfhksk~~ 344 (885)
T KOG2023|consen 302 PVLLSGMVYSDDDIILLKNNEEDESVPDREEDIKPRFHKSKEH 344 (885)
T ss_pred HHHHccCccccccHHHhcCccccccCCchhhhccchhhhchhc
Confidence 665554 444333333332333333333 3457887765543
No 31
>PF04285 DUF444: Protein of unknown function (DUF444); InterPro: IPR006698 This entry is represented by Thermus phage phiYS40, Orf56. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches [].
Probab=25.78 E-value=61 Score=35.71 Aligned_cols=7 Identities=29% Similarity=0.524 Sum_probs=3.2
Q ss_pred HHHHHhH
Q 009621 407 YQLLCGF 413 (530)
Q Consensus 407 YQlL~Gv 413 (530)
|++.+=|
T Consensus 319 ~~l~~~i 325 (421)
T PF04285_consen 319 YELALEI 325 (421)
T ss_pred HHHHHHH
Confidence 4444444
No 32
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=25.60 E-value=39 Score=39.19 Aligned_cols=8 Identities=25% Similarity=0.302 Sum_probs=3.5
Q ss_pred CcCHHHHH
Q 009621 209 LVSSAQMV 216 (530)
Q Consensus 209 lvssa~L~ 216 (530)
-+.-++|-
T Consensus 728 ~mdDe~m~ 735 (784)
T PF04931_consen 728 DMDDEQMM 735 (784)
T ss_pred ccCHHHHH
Confidence 34444443
No 33
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=24.87 E-value=67 Score=39.13 Aligned_cols=6 Identities=0% Similarity=0.097 Sum_probs=2.2
Q ss_pred CCCCCC
Q 009621 30 APRYST 35 (530)
Q Consensus 30 ~~~~~~ 35 (530)
|.+|-+
T Consensus 1119 IsqLdp 1124 (1282)
T KOG0921|consen 1119 ISQLDP 1124 (1282)
T ss_pred hhccCc
Confidence 333333
No 34
>COG1512 Beta-propeller domains of methanol dehydrogenase type [General function prediction only]
Probab=24.24 E-value=95 Score=32.36 Aligned_cols=16 Identities=19% Similarity=0.167 Sum_probs=9.6
Q ss_pred CcccCcccceeeeecc
Q 009621 93 GGQYGAFGAVTLEKGK 108 (530)
Q Consensus 93 g~~~g~~ga~tleks~ 108 (530)
+.++..|+..++++.+
T Consensus 220 ~~~~~~~~~~~~~~~~ 235 (271)
T COG1512 220 GRQPDRWLNGVLGRRR 235 (271)
T ss_pred ccccccccceeEEeee
Confidence 3356666666666655
No 35
>PLN03083 E3 UFM1-protein ligase 1 homolog; Provisional
Probab=23.02 E-value=1.2e+02 Score=36.16 Aligned_cols=35 Identities=14% Similarity=0.253 Sum_probs=23.0
Q ss_pred HhhcHHHHHHHHHHHHhhccC----chHHHHHHHHcCCc
Q 009621 176 ELFDRKFVDAVLNEWMKTMMD----LPAGFRQAYEMGLV 210 (530)
Q Consensus 176 e~fdR~~v~aVL~e~~rt~~s----LPaDl~~A~e~Glv 210 (530)
.+|.=++|..++.+|...+++ -|..|...+..-+.
T Consensus 484 ~i~~~e~i~~~l~~~~~d~~e~~~~~~~~ll~~lA~~l~ 522 (803)
T PLN03083 484 NIPPEEWVMKKILEWVPDLEEDGTEDPGSILKHLADHLR 522 (803)
T ss_pred ccccHHHHHHHHHHHcccchhcccccHHHHHHHHHHHHH
Confidence 345667888999988887544 66666666544433
No 36
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=22.81 E-value=84 Score=22.89 Aligned_cols=21 Identities=38% Similarity=0.635 Sum_probs=15.6
Q ss_pred HHHHHHcCCcCHHHHHHHHHhc
Q 009621 201 FRQAYEMGLVSSAQMVKFLAIN 222 (530)
Q Consensus 201 l~~A~e~Glvssa~L~rFl~L~ 222 (530)
|.+|.++|+ |.+++.+||..+
T Consensus 9 i~eA~~~Gl-s~eeir~FL~~~ 29 (30)
T PF08671_consen 9 IKEAKESGL-SKEEIREFLEFN 29 (30)
T ss_dssp HHHHHHTT---HHHHHHHHHHH
T ss_pred HHHHHHcCC-CHHHHHHHHHhC
Confidence 567888874 689999999865
No 37
>PF14407 Frankia_peptide: Ribosomally synthesized peptide prototyped by Frankia Franean1_4349.
Probab=22.41 E-value=57 Score=27.39 Aligned_cols=22 Identities=41% Similarity=0.688 Sum_probs=16.9
Q ss_pred HHHHhhcCcchhhhhhhhhccCchh
Q 009621 229 RFISRSLPQGISRAFIGRMLADPSF 253 (530)
Q Consensus 229 ~~l~R~lp~~lsrgfReRlLADP~F 253 (530)
++|-|.+-. .+||.|+++||.-
T Consensus 4 ~lIgrAv~D---~~FRqqllad~~~ 25 (61)
T PF14407_consen 4 RLIGRAVTD---EAFRQQLLADPEE 25 (61)
T ss_pred HHHHHHHcC---HHHHHHHhcCHHH
Confidence 456666655 5899999999976
No 38
>KOG3241 consensus Uncharacterized conserved protein [Function unknown]
Probab=22.14 E-value=63 Score=32.50 Aligned_cols=6 Identities=50% Similarity=0.783 Sum_probs=2.5
Q ss_pred CCCCCC
Q 009621 135 INHGGG 140 (530)
Q Consensus 135 ~~~GGG 140 (530)
+++||.
T Consensus 183 i~~~~~ 188 (227)
T KOG3241|consen 183 IGHGSV 188 (227)
T ss_pred cccCCC
Confidence 344443
No 39
>PF11332 DUF3134: Protein of unknown function (DUF3134); InterPro: IPR021481 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=20.79 E-value=53 Score=28.28 Aligned_cols=7 Identities=86% Similarity=1.730 Sum_probs=3.0
Q ss_pred CCCCCCC
Q 009621 149 DDDYFDD 155 (530)
Q Consensus 149 dddy~d~ 155 (530)
||+|+|+
T Consensus 60 dd~~~d~ 66 (73)
T PF11332_consen 60 DDDYYDD 66 (73)
T ss_pred Ccccccc
Confidence 3444444
No 40
>COG3685 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.07 E-value=65 Score=31.67 Aligned_cols=52 Identities=25% Similarity=0.437 Sum_probs=45.3
Q ss_pred HHHHhhcH----------HHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHHHHHHHHhccCCc
Q 009621 173 ILEELFDR----------KFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQMVKFLAINARPT 226 (530)
Q Consensus 173 ~~~e~fdR----------~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~L~rFl~L~~~P~ 226 (530)
.++|||+| ..++-++.|...-++..|.+ ++++.|++.+++.+.-+++.-+.+
T Consensus 57 rLe~Vfe~~g~~~~~~~cda~~giiaegq~i~~~~~~~--evlda~L~~aaq~vEhyEIA~Ygt 118 (167)
T COG3685 57 RLEQVFERLGKKARRVTCDAMEGLIAEGQEIMEEFKSN--EVLDAGLIGAAQKVEHYEIACYGT 118 (167)
T ss_pred HHHHHHHHhCcccccchHHHHHHHHHHHHHHHHhcccc--HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47899999 56778888888889999999 999999999999999999877654
Done!