Query         009621
Match_columns 530
No_of_seqs    116 out of 144
Neff          3.4 
Searched_HMMs 46136
Date          Thu Mar 28 15:20:03 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/009621.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/009621hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF11891 DUF3411:  Domain of un 100.0 5.3E-70 1.1E-74  513.1  12.7  170  245-414     1-180 (180)
  2 PF06524 NOA36:  NOA36 protein;  89.7    0.32 6.9E-06   50.3   3.6   12  102-113   218-229 (314)
  3 PF14812 PBP1_TM:  Transmembran  83.1    0.37   8E-06   41.8   0.0   14  148-161    35-48  (81)
  4 PF02084 Bindin:  Bindin;  Inte  80.5     1.5 3.3E-05   44.4   3.2   30  181-210   104-133 (238)
  5 PF02979 NHase_alpha:  Nitrile   71.5     2.5 5.5E-05   41.8   2.0   48  205-255    17-68  (188)
  6 KOG0943 Predicted ubiquitin-pr  70.9     1.2 2.5E-05   54.1  -0.5   72  177-266  1796-1869(3015)
  7 COG4907 Predicted membrane pro  70.8     2.6 5.6E-05   46.8   2.1    6  116-121   565-570 (595)
  8 PLN03138 Protein TOC75; Provis  68.8     7.6 0.00016   45.6   5.3   10  448-457   497-506 (796)
  9 COG4907 Predicted membrane pro  56.3     8.1 0.00018   43.2   2.5   16  124-139   576-591 (595)
 10 PF11705 RNA_pol_3_Rpc31:  DNA-  54.2     7.7 0.00017   38.4   1.8    7   66-72     90-96  (233)
 11 PLN03138 Protein TOC75; Provis  52.4      10 0.00022   44.5   2.7   15  197-211   187-201 (796)
 12 KOG3540 Beta amyloid precursor  51.2      18 0.00039   40.7   4.1   21  171-192   255-275 (615)
 13 PF09849 DUF2076:  Uncharacteri  51.1      17 0.00037   37.2   3.8    9   68-76    157-165 (247)
 14 PF14812 PBP1_TM:  Transmembran  47.7     6.1 0.00013   34.5   0.0   11  150-160    33-43  (81)
 15 TIGR02877 spore_yhbH sporulati  45.1      22 0.00047   38.6   3.6   11  459-470   335-345 (371)
 16 PF10446 DUF2457:  Protein of u  44.4      12 0.00027   41.3   1.6   14  321-334   294-307 (458)
 17 TIGR01323 nitrile_alph nitrile  40.5      32  0.0007   34.2   3.7   46  205-253    11-60  (185)
 18 PF09026 CENP-B_dimeris:  Centr  39.7     9.7 0.00021   34.5   0.0   18  185-202    51-68  (101)
 19 PF04889 Cwf_Cwc_15:  Cwf15/Cwc  39.3      24 0.00052   36.0   2.7    6  221-226   186-191 (244)
 20 PLN00151 potassium transporter  39.3      38 0.00082   40.3   4.6   10  487-496   431-440 (852)
 21 PF04931 DNA_pol_phi:  DNA poly  38.3      43 0.00092   38.9   4.8    6   69-74    559-564 (784)
 22 PHA00458 single-stranded DNA-b  34.8      40 0.00087   34.5   3.4    7  121-127   180-186 (233)
 23 PF11705 RNA_pol_3_Rpc31:  DNA-  31.7      29 0.00062   34.4   1.8    8  152-159   213-220 (233)
 24 PF08595 RXT2_N:  RXT2-like, N-  30.6      28  0.0006   33.3   1.5    7  221-227   103-109 (149)
 25 PF07415 Herpes_LMP2:  Gammaher  29.5      18 0.00039   39.5   0.0   12  328-339   257-268 (489)
 26 KOG3074 Transcriptional regula  28.5      34 0.00074   35.4   1.8   19  198-216    91-109 (263)
 27 PRK05325 hypothetical protein;  27.9      81  0.0018   34.6   4.5   10  460-470   358-367 (401)
 28 COG2818 Tag 3-methyladenine DN  27.8      30 0.00066   34.4   1.2  125  167-300    50-177 (188)
 29 KOG2023 Nuclear transport rece  27.2      34 0.00074   40.1   1.6   26  467-492   774-799 (885)
 30 KOG2023 Nuclear transport rece  27.0      90   0.002   36.9   4.9   40   89-128   302-344 (885)
 31 PF04285 DUF444:  Protein of un  25.8      61  0.0013   35.7   3.2    7  407-413   319-325 (421)
 32 PF04931 DNA_pol_phi:  DNA poly  25.6      39 0.00084   39.2   1.8    8  209-216   728-735 (784)
 33 KOG0921 Dosage compensation co  24.9      67  0.0014   39.1   3.4    6   30-35   1119-1124(1282)
 34 COG1512 Beta-propeller domains  24.2      95  0.0021   32.4   4.1   16   93-108   220-235 (271)
 35 PLN03083 E3 UFM1-protein ligas  23.0 1.2E+02  0.0026   36.2   5.0   35  176-210   484-522 (803)
 36 PF08671 SinI:  Anti-repressor   22.8      84  0.0018   22.9   2.4   21  201-222     9-29  (30)
 37 PF14407 Frankia_peptide:  Ribo  22.4      57  0.0012   27.4   1.7   22  229-253     4-25  (61)
 38 KOG3241 Uncharacterized conser  22.1      63  0.0014   32.5   2.2    6  135-140   183-188 (227)
 39 PF11332 DUF3134:  Protein of u  20.8      53  0.0011   28.3   1.2    7  149-155    60-66  (73)
 40 COG3685 Uncharacterized protei  20.1      65  0.0014   31.7   1.8   52  173-226    57-118 (167)

No 1  
>PF11891 DUF3411:  Domain of unknown function (DUF3411);  InterPro: IPR021825  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 168 to 186 amino acids in length. This domain has a conserved RYQ sequence motif. 
Probab=100.00  E-value=5.3e-70  Score=513.09  Aligned_cols=170  Identities=45%  Similarity=0.732  Sum_probs=163.8

Q ss_pred             hhhccCchhhHHHHHHHHhhhhhhhhhhhhhcchhhhhHHHHHHHHHHHHHHHhHHHHhhcccccccCCCcc----cchH
Q 009621          245 GRMLADPSFLYKLILEQAATIGCTVLWELENRKERIKQEWDLALINVLTVTACNAFVVWSLAPCRSYGNTFR----FDLQ  320 (530)
Q Consensus       245 eRlLADP~FLfKL~iE~~I~i~~~v~aE~~~Rge~F~~ElDfV~sdvv~gsi~nfaLVwLLAPt~S~g~~~~----~~lq  320 (530)
                      |||||||+|||||++||+||++|+++|||++|||+||+|||||+||+++++|+||+||||||||++++++++    +.+|
T Consensus         1 ~RllADP~Fl~Kl~~E~~i~i~~~~~~e~~~R~e~f~~E~d~v~~d~v~~~i~n~~lv~llAPt~s~~~~~~~~~~~~~~   80 (180)
T PF11891_consen    1 ERLLADPSFLFKLAIEEVIGIGCATAAEYAKRGERFWNELDFVFSDVVVGSIVNFALVWLLAPTRSFGSPAASSPGGGLQ   80 (180)
T ss_pred             CcccccchHHHHHHHHHHHHHHHHHHHHHHHcccchHHHHHHHHHHHHHHHHHHHHHHHhccchHhhCcccccccchHHH
Confidence            799999999999999999999999999999999999999999999999999999999999999999999887    6799


Q ss_pred             HHhhcCCCccccccCCCCCcchhhhHHHHHhhchhhhhhhhhHHhhHHHHHHHHhh--ccCC----CCCCCCccchhhhh
Q 009621          321 NTLQKLPNNIFERSYPFREFDLQKRIHSLFYKAAELCMVGLSAGAVQGSLSNYLAG--KKDR----LSVTIPSVSTSALG  394 (530)
Q Consensus       321 ~~l~klP~N~Fq~s~pg~~fsL~qRiga~v~KGa~l~~VG~~aGlvG~glSN~L~~--kK~~----~s~~vPpv~~SAl~  394 (530)
                      +++++||+||||+++||++||++||++||+|||++|++|||+||++|+++||+|++  |+++    +++++|||++||++
T Consensus        81 ~~~~~~P~n~Fq~~~~g~~fsl~qR~~~~~~kg~~l~~VG~~ag~vg~~lsn~L~~~rk~~~~~~e~~~~~ppv~~ta~~  160 (180)
T PF11891_consen   81 KFLGSLPNNAFQKGYPGRSFSLAQRIGAFVYKGAKLAAVGFIAGLVGTGLSNALIAARKKVDPSFEPSVPVPPVLKTALG  160 (180)
T ss_pred             HHHHhChHHHhccCCCCCcccHHHHHHHHHHcchHhhhhHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCCCHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999998  4444    45789999999999


Q ss_pred             HHHHhhchhhhHHHHHHhHH
Q 009621          395 YGAFLGLYANMRYQLLCGFD  414 (530)
Q Consensus       395 wG~fMGvSSNLRYQlL~GvE  414 (530)
                      ||+|||+|||+|||+|||+|
T Consensus       161 ~g~fmGvSsNlRYQil~GiE  180 (180)
T PF11891_consen  161 WGAFMGVSSNLRYQILNGIE  180 (180)
T ss_pred             HHHHHhhhHhHHHHHHcCCC
Confidence            99999999999999999997


No 2  
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=89.68  E-value=0.32  Score=50.32  Aligned_cols=12  Identities=25%  Similarity=0.210  Sum_probs=5.5

Q ss_pred             eeeeecccccee
Q 009621          102 VTLEKGKLDTTQ  113 (530)
Q Consensus       102 ~tleks~l~~~q  113 (530)
                      .|-|-..|.+|-
T Consensus       218 et~eTkdLSmSt  229 (314)
T PF06524_consen  218 ETQETKDLSMST  229 (314)
T ss_pred             cccccccceeee
Confidence            344444455543


No 3  
>PF14812 PBP1_TM:  Transmembrane domain of transglycosylase PBP1 at N-terminal; PDB: 3FWL_A 3VMA_A.
Probab=83.08  E-value=0.37  Score=41.81  Aligned_cols=14  Identities=71%  Similarity=0.902  Sum_probs=0.0

Q ss_pred             CCCCCCCCCCCCCC
Q 009621          148 DDDDYFDDFDDGDE  161 (530)
Q Consensus       148 ddddy~d~~ddgd~  161 (530)
                      ++|||+||++++||
T Consensus        35 ~ddd~~DDD~dDde   48 (81)
T PF14812_consen   35 YDDDYEDDDDDDDE   48 (81)
T ss_dssp             --------------
T ss_pred             cccccccccccchh
Confidence            34445555444444


No 4  
>PF02084 Bindin:  Bindin;  InterPro: IPR000775 Bindin, the major protein component of the acrosome granule of sea urchin sperm, mediates species-specific adhesion of sperm to the egg surface during fertilisation [, ]. The protein coats the acrosomal process after externalisation by the acrosome reaction; it binds to sulphated, fucose-containing polysaccharides on the vitelline-layer receptor proteoglycans that cover the egg plasma membrane. Bindins from different genera show high levels of sequence similarity in both the mature bindin domain and in the probindin precursor region. The most highly conserved region is a 42-residue segment in the central portion of the mature bindin protein. This domain may be responsible for conserved functions of bindin, while the more highly divergent flanking regions may be responsible for its species-specific properties [].; GO: 0007342 fusion of sperm to egg plasma membrane
Probab=80.52  E-value=1.5  Score=44.44  Aligned_cols=30  Identities=27%  Similarity=0.554  Sum_probs=22.5

Q ss_pred             HHHHHHHHHHHhhccCchHHHHHHHHcCCc
Q 009621          181 KFVDAVLNEWMKTMMDLPAGFRQAYEMGLV  210 (530)
Q Consensus       181 ~~v~aVL~e~~rt~~sLPaDl~~A~e~Glv  210 (530)
                      |-.+.+.+-.+.|-.+||-||-+=++.||+
T Consensus       104 Kvm~~ikavLgaTKiDLPVDINDPYDlGLL  133 (238)
T PF02084_consen  104 KVMEDIKAVLGATKIDLPVDINDPYDLGLL  133 (238)
T ss_pred             HHHHHHHHHhcccccccccccCChhhHHHH
Confidence            333444444578999999999999999863


No 5  
>PF02979 NHase_alpha:  Nitrile hydratase, alpha chain;  InterPro: IPR004232 Nitrile hydratases (4.2.1.84 from EC) are bacterial enzymes that catalyse the hydration of nitrile compounds to the corresponding amides. They are used as biocatalysts in acrylamide production, one of the few commercial scale bioprocesses, as well as in environmental remediation for the removal of nitriles from waste streams. Nitrile hydratases are composed of two subunits, alpha and beta, and are normally active as a tetramer, alpha(2)beta(2). Nitrile hydratases contain either a non-haem iron or a non-corrinoid cobalt centre, both types sharing a highly conserved peptide sequence in the alpha subunit (CXLCSC) that provides all the residues involved in coordinating the metal ion. Each type of nitrile hydratase specifically incorporated its metal with the help of activator proteins encoded by flanking regions of the nitrile hydratase genes that are necessary for metal insertion. The Fe-containing enzyme is photo-regulated: in the dark the enzyme is inactivated due to the association of nitric oxide (NO) to the iron, while in the light the enzyme is active by photo-dissociation of NO. The NO is held in place by a claw setting formed through specific oxygen atoms in two modified cysteines and a serine residue in the active site [, ]. The cobalt-containing enzyme is unaffected by NO, but was shown to undergo a similar effect with carbon monoxide [, ]. Fe- and cobalt-containing enzymes also display different inhibition patterns with nitrophenols. Thiocyanate hydrolase (SCNase) is a cobalt-containing metalloenzyme with a cysteine-sulphinic acid ligand that hydrolyses thiocyanate to carbonyl sulphide and ammonia []. The two enzymes, nitrile hydratase and SCNase, are homologous over regions corresponding to almost the entire coding regions of the genes: the beta and alpha subunits of thiocyanate hydrolase were homologous to the amino- and carboxyl-terminal halves of the beta subunit of nitrile hydratase, and the gamma subunit of thiocyanate hydrolase was homologous to the alpha subunit of nitrile hydratase [].  This entry represents the structural domain of the alpha subunit of both iron- and cobalt-containing nitrile hydratases; the alpha subunit is a duplication of two structural repeats, each consisting of 4 layers, alpha/beta/beta/alpha []. This structure is also found in the related protein, the gamma subunit of thiocyanate hydrolase (SCNase).; GO: 0003824 catalytic activity, 0046914 transition metal ion binding, 0006807 nitrogen compound metabolic process; PDB: 2DPP_A 3HHT_A 1V29_A 2ZZD_I 2DXC_F 2DXB_F 2DD5_C 2DD4_C 2ZPH_A 2CYZ_A ....
Probab=71.55  E-value=2.5  Score=41.76  Aligned_cols=48  Identities=27%  Similarity=0.479  Sum_probs=35.5

Q ss_pred             HHcCCcCHHHHHHHHHhccCC----chhHHHHhhcCcchhhhhhhhhccCchhhH
Q 009621          205 YEMGLVSSAQMVKFLAINARP----TTTRFISRSLPQGISRAFIGRMLADPSFLY  255 (530)
Q Consensus       205 ~e~Glvssa~L~rFl~L~~~P----~~~~~l~R~lp~~lsrgfReRlLADP~FLf  255 (530)
                      +|.|+|+++.+.++.+...+-    +-++.+.|.+-.   .+||.|||+||.=..
T Consensus        17 ~ekg~~~~~~~~~~~~~~~~~~~P~~GarvVArAW~D---p~FK~rLLaD~~aA~   68 (188)
T PF02979_consen   17 IEKGLITPAEVDRIIETYESRVGPRNGARVVARAWTD---PAFKARLLADPTAAI   68 (188)
T ss_dssp             HHTTSS-HHHHHHHHHHHHHTSSHHHHHHHHHHHHH----HHHHHHHHHSHHHHH
T ss_pred             HHcCCCCHHHHHHHHHHHHhccCccccceeehhhhCC---HHHHHHHHHCHHHHH
Confidence            689999999999988876543    236777777644   699999999997443


No 6  
>KOG0943 consensus Predicted ubiquitin-protein ligase/hyperplastic discs protein, HECT superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=70.86  E-value=1.2  Score=54.13  Aligned_cols=72  Identities=25%  Similarity=0.154  Sum_probs=39.7

Q ss_pred             hhcHHHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHHHHHHHHhccCCchhHHHHhhcCcchhhhhhh--hhccCchhh
Q 009621          177 LFDRKFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQMVKFLAINARPTTTRFISRSLPQGISRAFIG--RMLADPSFL  254 (530)
Q Consensus       177 ~fdR~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~L~rFl~L~~~P~~~~~l~R~lp~~lsrgfRe--RlLADP~FL  254 (530)
                      .=||-+|.++..|..+--+.+|+-+.+|--.-.-+..      +-+.+-.          |  ..+.++  =|+|||.-+
T Consensus      1796 ~~dRieVq~ataeSE~aaepV~~afe~adpqp~dpd~------dDnSSds----------Q--e~~a~e~aFmiad~ahp 1857 (3015)
T KOG0943|consen 1796 SGDRIEVQAATAESEAAAEPVPAAFEEADPQPNDPDD------DDNSSDS----------Q--EDDAEEEAFMIADPAHP 1857 (3015)
T ss_pred             ccceeEEeecccccccccCCcccccCcCCCCCCCCCc------ccccccc----------c--cccccccceeecCCcch
Confidence            3466677778888888778888766554322111100      0000000          0  011111  189999999


Q ss_pred             HHHHHHHHhhhh
Q 009621          255 YKLILEQAATIG  266 (530)
Q Consensus       255 fKL~iE~~I~i~  266 (530)
                      ..|+.|+-.-+.
T Consensus      1858 leva~aqpaN~p 1869 (3015)
T KOG0943|consen 1858 LEVALAQPANSP 1869 (3015)
T ss_pred             hhHhhcccCCCc
Confidence            999999876543


No 7  
>COG4907 Predicted membrane protein [Function unknown]
Probab=70.79  E-value=2.6  Score=46.84  Aligned_cols=6  Identities=17%  Similarity=0.158  Sum_probs=2.2

Q ss_pred             cccCce
Q 009621          116 SETTPE  121 (530)
Q Consensus       116 ~~~~p~  121 (530)
                      +..+|.
T Consensus       565 sa~a~S  570 (595)
T COG4907         565 SAIASS  570 (595)
T ss_pred             hccccc
Confidence            333333


No 8  
>PLN03138 Protein TOC75; Provisional
Probab=68.79  E-value=7.6  Score=45.57  Aligned_cols=10  Identities=20%  Similarity=0.292  Sum_probs=6.8

Q ss_pred             hHhcCCCCCC
Q 009621          448 AWLGVEADPL  457 (530)
Q Consensus       448 ~wlGv~~~p~  457 (530)
                      -||.-..||+
T Consensus       497 PWi~g~~d~~  506 (796)
T PLN03138        497 PYLDGVYNPR  506 (796)
T ss_pred             ccccCCCCcc
Confidence            4887666774


No 9  
>COG4907 Predicted membrane protein [Function unknown]
Probab=56.30  E-value=8.1  Score=43.16  Aligned_cols=16  Identities=50%  Similarity=0.896  Sum_probs=7.0

Q ss_pred             cCCCCCCCCCCCCCCC
Q 009621          124 TGGGGGDIGKKINHGG  139 (530)
Q Consensus       124 ~ggggg~~g~~~~~GG  139 (530)
                      +|||+|..|||.||||
T Consensus       576 ~GGG~G~~gGg~GGGG  591 (595)
T COG4907         576 SGGGGGFSGGGSGGGG  591 (595)
T ss_pred             CCCCCCcCCCCCCCCC
Confidence            3444444454433333


No 10 
>PF11705 RNA_pol_3_Rpc31:  DNA-directed RNA polymerase III subunit Rpc31;  InterPro: IPR024661 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.   Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase III contains seventeen subunits in yeasts and in human cells. Twelve of these are akin to RNA polymerase I or II and the other five are RNA polymerase III-specific, and form the functionally distinct groups: (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34 and Rpc82 form a cluster of enzyme-specific subunits that contribute to transcription initiation in Saccharomyces cerevisiae and Homo sapiens. There is evidence that these subunits are anchored at or near the N-terminal Zn-fold of Rpc1, itself prolonged by a highly conserved but RNA polymerase III-specific domain []. This entry represents the Rpc31 subunit.
Probab=54.21  E-value=7.7  Score=38.41  Aligned_cols=7  Identities=29%  Similarity=0.183  Sum_probs=4.8

Q ss_pred             hhhhhhh
Q 009621           66 VVLLERC   72 (530)
Q Consensus        66 ~~~~er~   72 (530)
                      ...+||+
T Consensus        90 ~~~ierY   96 (233)
T PF11705_consen   90 FDDIERY   96 (233)
T ss_pred             cccHHHH
Confidence            4567886


No 11 
>PLN03138 Protein TOC75; Provisional
Probab=52.38  E-value=10  Score=44.54  Aligned_cols=15  Identities=13%  Similarity=0.237  Sum_probs=6.7

Q ss_pred             chHHHHHHHHcCCcC
Q 009621          197 LPAGFRQAYEMGLVS  211 (530)
Q Consensus       197 LPaDl~~A~e~Glvs  211 (530)
                      |=+|+..-.+.|...
T Consensus       187 Lq~dv~~I~~tG~F~  201 (796)
T PLN03138        187 LQKELETLASCGMFE  201 (796)
T ss_pred             HHHHHHHHHhcCCce
Confidence            334444444445444


No 12 
>KOG3540 consensus Beta amyloid precursor protein [General function prediction only]
Probab=51.24  E-value=18  Score=40.74  Aligned_cols=21  Identities=43%  Similarity=0.664  Sum_probs=14.6

Q ss_pred             hhHHHHhhcHHHHHHHHHHHHh
Q 009621          171 RKILEELFDRKFVDAVLNEWMK  192 (530)
Q Consensus       171 r~~~~e~fdR~~v~aVL~e~~r  192 (530)
                      ..-++| =-|+-++.||+||+.
T Consensus       255 kmrlee-khr~rmd~VmkEW~~  275 (615)
T KOG3540|consen  255 KMRLEE-KHRKRMDKVMKEWEE  275 (615)
T ss_pred             HHHHHH-HHHHHHHHHHHHHHH
Confidence            333443 357889999999973


No 13 
>PF09849 DUF2076:  Uncharacterized protein conserved in bacteria (DUF2076);  InterPro: IPR018648  This family of hypothetical prokaryotic proteins has no known function but includes putative perimplasmic ligand-binding sensor proteins.
Probab=51.06  E-value=17  Score=37.17  Aligned_cols=9  Identities=11%  Similarity=0.139  Sum_probs=4.5

Q ss_pred             hhhhhhcCC
Q 009621           68 LLERCYKAP   76 (530)
Q Consensus        68 ~~er~~~~~   76 (530)
                      +||=-|...
T Consensus       157 ~i~~lF~~~  165 (247)
T PF09849_consen  157 GIESLFGGH  165 (247)
T ss_pred             HHHHHhcCC
Confidence            355555543


No 14 
>PF14812 PBP1_TM:  Transmembrane domain of transglycosylase PBP1 at N-terminal; PDB: 3FWL_A 3VMA_A.
Probab=47.70  E-value=6.1  Score=34.47  Aligned_cols=11  Identities=64%  Similarity=1.192  Sum_probs=0.0

Q ss_pred             CCCCCCCCCCC
Q 009621          150 DDYFDDFDDGD  160 (530)
Q Consensus       150 ddy~d~~ddgd  160 (530)
                      |||+||.+|+|
T Consensus        33 DD~ddd~~DDD   43 (81)
T PF14812_consen   33 DDYDDDYEDDD   43 (81)
T ss_dssp             -----------
T ss_pred             hcccccccccc
Confidence            33333333333


No 15 
>TIGR02877 spore_yhbH sporulation protein YhbH. This protein family, typified by YhbH in Bacillus subtilis, is found in nearly every endospore-forming bacterium and in no other genome (but note that the trusted cutoff score is set high to exclude a single high-scoring sequence from Nitrosococcus oceani ATCC 19707, which is classified in the Gammaproteobacteria). The gene in Bacillus subtilis was shown to be in the regulon of the sporulation sigma factor, sigma-E, and its mutation was shown to create a sporulation defect.
Probab=45.09  E-value=22  Score=38.57  Aligned_cols=11  Identities=18%  Similarity=0.341  Sum_probs=6.2

Q ss_pred             CchhHHHHHhcC
Q 009621          459 QSDDLLKKAYNR  470 (530)
Q Consensus       459 ~~~~ll~~~y~~  470 (530)
                      .+.+|. .+|++
T Consensus       335 ~~~~l~-~~y~~  345 (371)
T TIGR02877       335 YSNTLK-NKFKN  345 (371)
T ss_pred             CcchHH-HHHHh
Confidence            344555 77755


No 16 
>PF10446 DUF2457:  Protein of unknown function (DUF2457);  InterPro: IPR018853  This entry represents a family of uncharacterised proteins. 
Probab=44.40  E-value=12  Score=41.33  Aligned_cols=14  Identities=14%  Similarity=0.370  Sum_probs=7.5

Q ss_pred             HHhhcCCCcccccc
Q 009621          321 NTLQKLPNNIFERS  334 (530)
Q Consensus       321 ~~l~klP~N~Fq~s  334 (530)
                      +.++.=|-..|...
T Consensus       294 k~~g~SPrrlf~sp  307 (458)
T PF10446_consen  294 KLRGRSPRRLFRSP  307 (458)
T ss_pred             hccCCCCcccccCC
Confidence            34445566666554


No 17 
>TIGR01323 nitrile_alph nitrile hydratase, alpha subunit. This model describes both iron- and cobalt-containing nitrile hydratase alpha chains. It excludes the thiocyanate hydrolase gamma subunit of Thiobacillus thioparus, a sequence that appears to have evolved from within the family of nitrile hydratase alpha subunits but which differs by several indels and a more rapid accumulation of point mutations.
Probab=40.50  E-value=32  Score=34.15  Aligned_cols=46  Identities=13%  Similarity=0.323  Sum_probs=35.4

Q ss_pred             HHcCCcCHHHHHHHHHhccC---C-chhHHHHhhcCcchhhhhhhhhccCchh
Q 009621          205 YEMGLVSSAQMVKFLAINAR---P-TTTRFISRSLPQGISRAFIGRMLADPSF  253 (530)
Q Consensus       205 ~e~Glvssa~L~rFl~L~~~---P-~~~~~l~R~lp~~lsrgfReRlLADP~F  253 (530)
                      +|.|+|+++.+.+.++....   | +-++.+.|.+-.   .+||.|||+|..=
T Consensus        11 ~eKGli~~~~id~~i~~~~~~~gP~nGA~vVArAW~D---p~fk~~Ll~d~~a   60 (185)
T TIGR01323        11 KSKGLIPEGAVDQLTSLYENEWGPENGAKVVAKAWVD---PEFRALLLKDATA   60 (185)
T ss_pred             HHcCCCCHHHHHHHHHHHHhccCCcchhhhhhHHhcC---HHHHHHHHhChHH
Confidence            67899999998888876543   2 236778887654   6999999999863


No 18 
>PF09026 CENP-B_dimeris:  Centromere protein B dimerisation domain;  InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. CENP-B may organise arrays of centromere satellite DNA into a higher order structure, which then directs centromere formation and kinetochore assembly in mammalian chromosomes. The CENP-B dimerisation domain is composed of two alpha-helices, which are folded into an antiparallel configuration. Dimerisation of CENP-B is mediated by this domain, in which monomers dimerise to form a symmetrical, antiparallel, four-helix bundle structure with a large hydrophobic patch in which 23 residues of one monomer form van der Waals contacts with the other monomer. This CENP-B dimer configuration may be suitable for capturing two distant CENP-B boxes during centromeric heterochromatin formation []. ; GO: 0003677 DNA binding, 0003682 chromatin binding, 0006355 regulation of transcription, DNA-dependent, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 1UFI_A.
Probab=39.71  E-value=9.7  Score=34.49  Aligned_cols=18  Identities=11%  Similarity=0.157  Sum_probs=8.6

Q ss_pred             HHHHHHHhhccCchHHHH
Q 009621          185 AVLNEWMKTMMDLPAGFR  202 (530)
Q Consensus       185 aVL~e~~rt~~sLPaDl~  202 (530)
                      +-|..-.|-+-|+|-|=+
T Consensus        51 ~~~~~v~rYltSf~id~~   68 (101)
T PF09026_consen   51 AYFTMVKRYLTSFPIDDK   68 (101)
T ss_dssp             HHHHHHHHHHCTS---HH
T ss_pred             hhcchHhhhhhccchhHh
Confidence            344445556777776654


No 19 
>PF04889 Cwf_Cwc_15:  Cwf15/Cwc15 cell cycle control protein;  InterPro: IPR006973 This family represents Cwf15/Cwc15 (from Schizosaccharomyces pombe and Saccharomyces cerevisiae respectively) and their homologues. The function of these proteins is unknown, but they form part of the spliceosome and are thus thought to be involved in mRNA splicing [].; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005681 spliceosomal complex
Probab=39.33  E-value=24  Score=35.96  Aligned_cols=6  Identities=17%  Similarity=0.600  Sum_probs=2.7

Q ss_pred             hccCCc
Q 009621          221 INARPT  226 (530)
Q Consensus       221 L~~~P~  226 (530)
                      +...|+
T Consensus       186 ~~~NpL  191 (244)
T PF04889_consen  186 LSGNPL  191 (244)
T ss_pred             HhCCcc
Confidence            344554


No 20 
>PLN00151 potassium transporter; Provisional
Probab=39.28  E-value=38  Score=40.33  Aligned_cols=10  Identities=20%  Similarity=0.371  Sum_probs=6.2

Q ss_pred             eehhhhhhcc
Q 009621          487 ISKNAIVSGL  496 (530)
Q Consensus       487 ~s~~~~~~gl  496 (530)
                      |.-.|++||-
T Consensus       431 IASQA~ISgt  440 (852)
T PLN00151        431 IASRAMTTAT  440 (852)
T ss_pred             HHHHHHHHHH
Confidence            5566666664


No 21 
>PF04931 DNA_pol_phi:  DNA polymerase phi;  InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=38.29  E-value=43  Score=38.88  Aligned_cols=6  Identities=50%  Similarity=1.320  Sum_probs=3.0

Q ss_pred             hhhhhc
Q 009621           69 LERCYK   74 (530)
Q Consensus        69 ~er~~~   74 (530)
                      |.-|+.
T Consensus       559 l~~c~~  564 (784)
T PF04931_consen  559 LQICYE  564 (784)
T ss_pred             HHHHHH
Confidence            445554


No 22 
>PHA00458 single-stranded DNA-binding protein
Probab=34.83  E-value=40  Score=34.55  Aligned_cols=7  Identities=71%  Similarity=0.852  Sum_probs=3.1

Q ss_pred             eeecCCC
Q 009621          121 ELATGGG  127 (530)
Q Consensus       121 ~~~~ggg  127 (530)
                      +|...+|
T Consensus       180 eL~ef~g  186 (233)
T PHA00458        180 ELAEFGG  186 (233)
T ss_pred             EEEEecC
Confidence            4444433


No 23 
>PF11705 RNA_pol_3_Rpc31:  DNA-directed RNA polymerase III subunit Rpc31;  InterPro: IPR024661 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.   Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase III contains seventeen subunits in yeasts and in human cells. Twelve of these are akin to RNA polymerase I or II and the other five are RNA polymerase III-specific, and form the functionally distinct groups: (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34 and Rpc82 form a cluster of enzyme-specific subunits that contribute to transcription initiation in Saccharomyces cerevisiae and Homo sapiens. There is evidence that these subunits are anchored at or near the N-terminal Zn-fold of Rpc1, itself prolonged by a highly conserved but RNA polymerase III-specific domain []. This entry represents the Rpc31 subunit.
Probab=31.67  E-value=29  Score=34.44  Aligned_cols=8  Identities=50%  Similarity=0.999  Sum_probs=4.2

Q ss_pred             CCCCCCCC
Q 009621          152 YFDDFDDG  159 (530)
Q Consensus       152 y~d~~ddg  159 (530)
                      |||++||+
T Consensus       213 YFDnGedD  220 (233)
T PF11705_consen  213 YFDNGEDD  220 (233)
T ss_pred             cCCCCCcc
Confidence            36665543


No 24 
>PF08595 RXT2_N:  RXT2-like, N-terminal;  InterPro: IPR013904  The entry represents the N-terminal region of RXT2-like proteins. In Saccharomyces cerevisiae (Baker's yeast), RXT2 has been demonstrated to be involved in conjugation with cellular fusion (mating) and invasive growth []. A high throughput localisation study has localised RXT2 to the nucleus []. 
Probab=30.62  E-value=28  Score=33.30  Aligned_cols=7  Identities=14%  Similarity=0.510  Sum_probs=3.3

Q ss_pred             hccCCch
Q 009621          221 INARPTT  227 (530)
Q Consensus       221 L~~~P~~  227 (530)
                      |-.+|.+
T Consensus       103 l~tHPai  109 (149)
T PF08595_consen  103 LPTHPAI  109 (149)
T ss_pred             hcCCccc
Confidence            3445543


No 25 
>PF07415 Herpes_LMP2:  Gammaherpesvirus latent membrane protein (LMP2) protein;  InterPro: IPR010881 This family consists of several Gammaherpesvirus latent membrane protein (LMP2) proteins. Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4) is a human gammaherpesvirus that infects and establishes latency in B lymphocytes in vivo. The latent membrane protein 2 (LMP2) gene is expressed in latently infected B cells and encodes two protein isoforms, LMP2A and LMP2B, that are identical except for an additional N-terminal 119 aa cytoplasmic domain which is present in the LMP2A isoform. LMP2A is thought to play a key role in either the establishment or the maintenance of latency and/or the reactivation of productive infection from the latent state. The significance of LMP2B and its role in pathogenesis remain unclear [].; GO: 0019042 latent virus infection, 0033644 host cell membrane; PDB: 2JO9_B 1UXW_C.
Probab=29.46  E-value=18  Score=39.45  Aligned_cols=12  Identities=17%  Similarity=0.119  Sum_probs=0.0

Q ss_pred             CccccccCCCCC
Q 009621          328 NNIFERSYPFRE  339 (530)
Q Consensus       328 ~N~Fq~s~pg~~  339 (530)
                      +.+||.+.-+++
T Consensus       257 daI~Q~SpLLg~  268 (489)
T PF07415_consen  257 DAIIQLSPLLGA  268 (489)
T ss_dssp             ------------
T ss_pred             HHHHhhchHhhH
Confidence            445555444433


No 26 
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription]
Probab=28.54  E-value=34  Score=35.42  Aligned_cols=19  Identities=16%  Similarity=0.198  Sum_probs=8.0

Q ss_pred             hHHHHHHHHcCCcCHHHHH
Q 009621          198 PAGFRQAYEMGLVSSAQMV  216 (530)
Q Consensus       198 PaDl~~A~e~Glvssa~L~  216 (530)
                      |.+++...|...+-++.|+
T Consensus        91 ~~~~~~~~ee~~lkSe~L~  109 (263)
T KOG3074|consen   91 PPELAAPSEEHELKSEELQ  109 (263)
T ss_pred             CcccccchhhhhHHHHHHh
Confidence            3344444444433334343


No 27 
>PRK05325 hypothetical protein; Provisional
Probab=27.92  E-value=81  Score=34.64  Aligned_cols=10  Identities=30%  Similarity=0.670  Sum_probs=6.5

Q ss_pred             chhHHHHHhcC
Q 009621          460 SDDLLKKAYNR  470 (530)
Q Consensus       460 ~~~ll~~~y~~  470 (530)
                      ...|. ++|.+
T Consensus       358 ~~~l~-~~y~~  367 (401)
T PRK05325        358 HQTLW-REYER  367 (401)
T ss_pred             chHHH-HHHHH
Confidence            45566 88874


No 28 
>COG2818 Tag 3-methyladenine DNA glycosylase [DNA replication, recombination, and repair]
Probab=27.79  E-value=30  Score=34.43  Aligned_cols=125  Identities=10%  Similarity=0.021  Sum_probs=79.7

Q ss_pred             chhhhhHHHHhhcHHHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHH--HHHHHHhccC-CchhHHHHhhcCcchhhhh
Q 009621          167 LFRRRKILEELFDRKFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQ--MVKFLAINAR-PTTTRFISRSLPQGISRAF  243 (530)
Q Consensus       167 ~~r~r~~~~e~fdR~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~--L~rFl~L~~~-P~~~~~l~R~lp~~lsrgf  243 (530)
                      ++++|...||.|..=+++.|.+.-..-++.|=+|-.=.--.|+|-+.+  =+.+++|+.. .....|+--+++.   +.-
T Consensus        50 VL~KRe~freaF~~Fd~~kVA~~~~~dverLl~d~gIIR~r~KI~A~i~NA~~~l~l~~e~Gsf~~flWsf~~~---~~~  126 (188)
T COG2818          50 VLKKREAFREAFHGFDPEKVAAMTEEDVERLLADAGIIRNRGKIKATINNARAVLELQKEFGSFSEFLWSFVGG---KPS  126 (188)
T ss_pred             HHHhHHHHHHHHhcCCHHHHHcCCHHHHHHHHhCcchhhhHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHhcCC---Ccc
Confidence            467888889998887777776654444444333321111233332222  3568888876 6666666555544   455


Q ss_pred             hhhhccCchhhHHHHHHHHhhhhhhhhhhhhhcchhhhhHHHHHHHHHHHHHHHhHH
Q 009621          244 IGRMLADPSFLYKLILEQAATIGCTVLWELENRKERIKQEWDLALINVLTVTACNAF  300 (530)
Q Consensus       244 ReRlLADP~FLfKL~iE~~I~i~~~v~aE~~~Rge~F~~ElDfV~sdvv~gsi~nfa  300 (530)
                      +.....=+.|+.+      ..++..+..++++||-.|.-..=.+.-...+|.|.|-+
T Consensus       127 ~~~~~~~~~~pa~------t~~S~~mskaLKkrGf~fvGpTt~yafmqA~G~vndH~  177 (188)
T COG2818         127 RNQVNDGSEVPAS------TELSDAMSKALKKRGFKFVGPTTVYAFMQATGLVNDHA  177 (188)
T ss_pred             cccccchhhcccc------chhHHHHHHHHHHccCeecCcHHHHHHHHHHcchHHHH
Confidence            5655666677776      34455788899999999988877777777777777544


No 29 
>KOG2023 consensus Nuclear transport receptor Karyopherin-beta2/Transportin (importin beta superfamily) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=27.17  E-value=34  Score=40.14  Aligned_cols=26  Identities=19%  Similarity=0.201  Sum_probs=16.0

Q ss_pred             HhcCCCCCcCCCCCCCCcceeehhhh
Q 009621          467 AYNRPSQDVNGSTQKSPRWLISKNAI  492 (530)
Q Consensus       467 ~y~~~~~~~~~~~~~~~~~~~s~~~~  492 (530)
                      .|-+|.+.+...++--..|+.|-++|
T Consensus       774 g~~~Pe~vAp~l~~f~~pWc~sl~~i  799 (885)
T KOG2023|consen  774 GYICPEEVAPHLDSFMRPWCTSLRNI  799 (885)
T ss_pred             hccCHHhcchhHHHHHHHHHHHhccc
Confidence            47777666655566666677654443


No 30 
>KOG2023 consensus Nuclear transport receptor Karyopherin-beta2/Transportin (importin beta superfamily) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=27.03  E-value=90  Score=36.90  Aligned_cols=40  Identities=15%  Similarity=0.327  Sum_probs=18.5

Q ss_pred             CCccCc-ccCcccceeeeeccccceecc--cccCceeecCCCC
Q 009621           89 PLMKGG-QYGAFGAVTLEKGKLDTTQQQ--SETTPELATGGGG  128 (530)
Q Consensus        89 ~~~~g~-~~g~~ga~tleks~l~~~q~~--~~~~p~~~~gggg  128 (530)
                      ||.-++ .|.----+.|+.-.=|-++.-  ...+|+.+.+--.
T Consensus       302 PvLl~~M~Ysd~D~~LL~~~eeD~~vpDreeDIkPRfhksk~~  344 (885)
T KOG2023|consen  302 PVLLSGMVYSDDDIILLKNNEEDESVPDREEDIKPRFHKSKEH  344 (885)
T ss_pred             HHHHccCccccccHHHhcCccccccCCchhhhccchhhhchhc
Confidence            665554 444333333332333333333  3457887765543


No 31 
>PF04285 DUF444:  Protein of unknown function (DUF444);  InterPro: IPR006698 This entry is represented by Thermus phage phiYS40, Orf56. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches [].
Probab=25.78  E-value=61  Score=35.71  Aligned_cols=7  Identities=29%  Similarity=0.524  Sum_probs=3.2

Q ss_pred             HHHHHhH
Q 009621          407 YQLLCGF  413 (530)
Q Consensus       407 YQlL~Gv  413 (530)
                      |++.+=|
T Consensus       319 ~~l~~~i  325 (421)
T PF04285_consen  319 YELALEI  325 (421)
T ss_pred             HHHHHHH
Confidence            4444444


No 32 
>PF04931 DNA_pol_phi:  DNA polymerase phi;  InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=25.60  E-value=39  Score=39.19  Aligned_cols=8  Identities=25%  Similarity=0.302  Sum_probs=3.5

Q ss_pred             CcCHHHHH
Q 009621          209 LVSSAQMV  216 (530)
Q Consensus       209 lvssa~L~  216 (530)
                      -+.-++|-
T Consensus       728 ~mdDe~m~  735 (784)
T PF04931_consen  728 DMDDEQMM  735 (784)
T ss_pred             ccCHHHHH
Confidence            34444443


No 33 
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=24.87  E-value=67  Score=39.13  Aligned_cols=6  Identities=0%  Similarity=0.097  Sum_probs=2.2

Q ss_pred             CCCCCC
Q 009621           30 APRYST   35 (530)
Q Consensus        30 ~~~~~~   35 (530)
                      |.+|-+
T Consensus      1119 IsqLdp 1124 (1282)
T KOG0921|consen 1119 ISQLDP 1124 (1282)
T ss_pred             hhccCc
Confidence            333333


No 34 
>COG1512 Beta-propeller domains of methanol dehydrogenase type [General function prediction only]
Probab=24.24  E-value=95  Score=32.36  Aligned_cols=16  Identities=19%  Similarity=0.167  Sum_probs=9.6

Q ss_pred             CcccCcccceeeeecc
Q 009621           93 GGQYGAFGAVTLEKGK  108 (530)
Q Consensus        93 g~~~g~~ga~tleks~  108 (530)
                      +.++..|+..++++.+
T Consensus       220 ~~~~~~~~~~~~~~~~  235 (271)
T COG1512         220 GRQPDRWLNGVLGRRR  235 (271)
T ss_pred             ccccccccceeEEeee
Confidence            3356666666666655


No 35 
>PLN03083 E3 UFM1-protein ligase 1 homolog; Provisional
Probab=23.02  E-value=1.2e+02  Score=36.16  Aligned_cols=35  Identities=14%  Similarity=0.253  Sum_probs=23.0

Q ss_pred             HhhcHHHHHHHHHHHHhhccC----chHHHHHHHHcCCc
Q 009621          176 ELFDRKFVDAVLNEWMKTMMD----LPAGFRQAYEMGLV  210 (530)
Q Consensus       176 e~fdR~~v~aVL~e~~rt~~s----LPaDl~~A~e~Glv  210 (530)
                      .+|.=++|..++.+|...+++    -|..|...+..-+.
T Consensus       484 ~i~~~e~i~~~l~~~~~d~~e~~~~~~~~ll~~lA~~l~  522 (803)
T PLN03083        484 NIPPEEWVMKKILEWVPDLEEDGTEDPGSILKHLADHLR  522 (803)
T ss_pred             ccccHHHHHHHHHHHcccchhcccccHHHHHHHHHHHHH
Confidence            345667888999988887544    66666666544433


No 36 
>PF08671 SinI:  Anti-repressor SinI;  InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=22.81  E-value=84  Score=22.89  Aligned_cols=21  Identities=38%  Similarity=0.635  Sum_probs=15.6

Q ss_pred             HHHHHHcCCcCHHHHHHHHHhc
Q 009621          201 FRQAYEMGLVSSAQMVKFLAIN  222 (530)
Q Consensus       201 l~~A~e~Glvssa~L~rFl~L~  222 (530)
                      |.+|.++|+ |.+++.+||..+
T Consensus         9 i~eA~~~Gl-s~eeir~FL~~~   29 (30)
T PF08671_consen    9 IKEAKESGL-SKEEIREFLEFN   29 (30)
T ss_dssp             HHHHHHTT---HHHHHHHHHHH
T ss_pred             HHHHHHcCC-CHHHHHHHHHhC
Confidence            567888874 689999999865


No 37 
>PF14407 Frankia_peptide:  Ribosomally synthesized peptide prototyped by Frankia Franean1_4349.
Probab=22.41  E-value=57  Score=27.39  Aligned_cols=22  Identities=41%  Similarity=0.688  Sum_probs=16.9

Q ss_pred             HHHHhhcCcchhhhhhhhhccCchh
Q 009621          229 RFISRSLPQGISRAFIGRMLADPSF  253 (530)
Q Consensus       229 ~~l~R~lp~~lsrgfReRlLADP~F  253 (530)
                      ++|-|.+-.   .+||.|+++||.-
T Consensus         4 ~lIgrAv~D---~~FRqqllad~~~   25 (61)
T PF14407_consen    4 RLIGRAVTD---EAFRQQLLADPEE   25 (61)
T ss_pred             HHHHHHHcC---HHHHHHHhcCHHH
Confidence            456666655   5899999999976


No 38 
>KOG3241 consensus Uncharacterized conserved protein [Function unknown]
Probab=22.14  E-value=63  Score=32.50  Aligned_cols=6  Identities=50%  Similarity=0.783  Sum_probs=2.5

Q ss_pred             CCCCCC
Q 009621          135 INHGGG  140 (530)
Q Consensus       135 ~~~GGG  140 (530)
                      +++||.
T Consensus       183 i~~~~~  188 (227)
T KOG3241|consen  183 IGHGSV  188 (227)
T ss_pred             cccCCC
Confidence            344443


No 39 
>PF11332 DUF3134:  Protein of unknown function (DUF3134);  InterPro: IPR021481  This family of proteins with unknown function appears to be restricted to Cyanobacteria. 
Probab=20.79  E-value=53  Score=28.28  Aligned_cols=7  Identities=86%  Similarity=1.730  Sum_probs=3.0

Q ss_pred             CCCCCCC
Q 009621          149 DDDYFDD  155 (530)
Q Consensus       149 dddy~d~  155 (530)
                      ||+|+|+
T Consensus        60 dd~~~d~   66 (73)
T PF11332_consen   60 DDDYYDD   66 (73)
T ss_pred             Ccccccc
Confidence            3444444


No 40 
>COG3685 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.07  E-value=65  Score=31.67  Aligned_cols=52  Identities=25%  Similarity=0.437  Sum_probs=45.3

Q ss_pred             HHHHhhcH----------HHHHHHHHHHHhhccCchHHHHHHHHcCCcCHHHHHHHHHhccCCc
Q 009621          173 ILEELFDR----------KFVDAVLNEWMKTMMDLPAGFRQAYEMGLVSSAQMVKFLAINARPT  226 (530)
Q Consensus       173 ~~~e~fdR----------~~v~aVL~e~~rt~~sLPaDl~~A~e~Glvssa~L~rFl~L~~~P~  226 (530)
                      .++|||+|          ..++-++.|...-++..|.+  ++++.|++.+++.+.-+++.-+.+
T Consensus        57 rLe~Vfe~~g~~~~~~~cda~~giiaegq~i~~~~~~~--evlda~L~~aaq~vEhyEIA~Ygt  118 (167)
T COG3685          57 RLEQVFERLGKKARRVTCDAMEGLIAEGQEIMEEFKSN--EVLDAGLIGAAQKVEHYEIACYGT  118 (167)
T ss_pred             HHHHHHHHhCcccccchHHHHHHHHHHHHHHHHhcccc--HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            47899999          56778888888889999999  999999999999999999877654


Done!