Query         psy16911
Match_columns 194
No_of_seqs    103 out of 247
Neff          6.6 
Searched_HMMs 46136
Date          Fri Aug 16 16:50:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16911.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16911hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4024|consensus              100.0 9.9E-48 2.1E-52  314.4  22.1  191    1-194    70-265 (266)
  2 PF02330 MAM33:  Mitochondrial  100.0 1.4E-44   3E-49  298.6  15.6  170    8-192     1-204 (204)
  3 KOG2536|consensus              100.0 3.9E-42 8.6E-47  288.6  19.2  182    2-194    61-263 (263)
  4 PF08513 LisH:  LisH;  InterPro  75.7     5.4 0.00012   22.2   3.3   25  146-170     1-25  (27)
  5 TIGR01881 cas_Cmr5 CRISPR-asso  73.7      11 0.00025   28.9   5.7   43  147-189    70-121 (127)
  6 KOG4233|consensus               61.7      14 0.00029   26.5   3.6   35  154-188    33-67  (90)
  7 PF10929 DUF2811:  Protein of u  58.8       9  0.0002   25.6   2.2   32  142-173     4-42  (57)
  8 PHA01623 hypothetical protein   54.5      41 0.00088   22.1   4.8   30  143-174    19-48  (56)
  9 PHA02360 hypothetical protein   53.2      14 0.00031   25.2   2.4   17  146-162    29-45  (70)
 10 PF14265 DUF4355:  Domain of un  52.5      44 0.00095   24.9   5.4   40  147-187    69-108 (125)
 11 PF00123 Hormone_2:  Peptide ho  49.7      31 0.00067   19.6   3.1   22  165-186     6-27  (28)
 12 PF13154 DUF3991:  Protein of u  49.3      11 0.00023   26.1   1.4   18  154-171     1-19  (77)
 13 PF02961 BAF:  Barrier to autoi  49.0      28 0.00061   25.3   3.6   34  152-188    31-67  (89)
 14 PF08275 Toprim_N:  DNA primase  45.0      16 0.00036   27.7   2.0   18  150-167    12-29  (128)
 15 PF13376 OmdA:  Bacteriocin-pro  43.9      55  0.0012   21.6   4.3   39  143-189     5-43  (63)
 16 PF07063 DUF1338:  Domain of un  37.0      68  0.0015   28.3   4.9   46  142-187   100-180 (302)
 17 PF00070 Pyr_redox:  Pyridine n  37.0      29 0.00063   23.5   2.1   21  142-162    36-56  (80)
 18 PRK11814 cysteine desulfurase   36.8      66  0.0014   30.2   5.0   44  141-188   428-476 (486)
 19 PF13794 MiaE_2:  tRNA-(MS[2]IO  35.6      31 0.00067   28.4   2.3   45  148-192    51-98  (185)
 20 smart00777 Mad3_BUB1_I Mad3/BU  32.2      61  0.0013   24.8   3.4   30  150-179    83-112 (125)
 21 COG4003 Uncharacterized protei  31.9      72  0.0016   23.1   3.4   26  151-176    52-78  (98)
 22 PF09442 DUF2018:  Domain of un  30.4 1.5E+02  0.0032   21.1   4.8   39  148-188    31-76  (81)
 23 PF13434 K_oxygenase:  L-lysine  29.9      52  0.0011   29.1   3.0   33  151-186    70-105 (341)
 24 COG3592 Uncharacterized conser  29.8      38 0.00083   23.5   1.7   15   44-58      9-23  (74)
 25 PF04026 SpoVG:  SpoVG;  InterP  29.5 1.3E+02  0.0029   21.4   4.5   35   86-130     4-40  (84)
 26 PF02268 TFIIA_gamma_N:  Transc  29.0   1E+02  0.0022   19.9   3.4   30  142-171     9-38  (49)
 27 smart00070 GLUCA Glucagon like  28.8      76  0.0016   17.8   2.5   16  170-185    11-26  (27)
 28 cd06206 bifunctional_CYPOR The  28.3      70  0.0015   28.6   3.6   41  146-186    79-120 (384)
 29 smart00667 LisH Lissencephaly   27.4   1E+02  0.0023   16.5   3.1   27  146-172     4-30  (34)
 30 TIGR02793 nikR nickel-responsi  26.9      59  0.0013   25.0   2.5   20  143-162     6-25  (129)
 31 PF12122 DUF3582:  Protein of u  26.7      62  0.0013   23.8   2.5   21  145-165     9-29  (101)
 32 PF10281 Ish1:  Putative stress  22.8      70  0.0015   18.9   1.8   15  150-164     7-21  (38)
 33 PRK13031 preprotein translocas  22.2 1.7E+02  0.0037   23.3   4.3   30   85-124    58-88  (149)
 34 PF08311 Mad3_BUB1_I:  Mad3/BUB  21.8      80  0.0017   23.8   2.4   29  151-179    84-112 (126)
 35 PF08747 DUF1788:  Domain of un  21.7      69  0.0015   24.5   2.0   17  147-163     5-21  (126)
 36 PHA03416 hypothetical E4 prote  21.4 1.9E+02  0.0042   21.0   4.1   28   29-56     64-91  (92)
 37 PRK04460 nickel responsive reg  21.3 2.1E+02  0.0046   22.2   4.7   20  143-162     8-27  (137)
 38 PF09868 DUF2095:  Uncharacteri  21.0 1.3E+02  0.0028   23.1   3.2   23  151-173    82-105 (128)
 39 PRK13654 magnesium-protoporphy  21.0      79  0.0017   28.5   2.4   33  137-170    69-101 (355)
 40 PHA01748 hypothetical protein   20.8 2.6E+02  0.0056   18.4   5.1   20  143-162     8-27  (60)
 41 PF10437 Lip_prot_lig_C:  Bacte  20.4 1.4E+02  0.0031   20.6   3.3   23   37-59      4-26  (86)
 42 PF01316 Arg_repressor:  Argini  20.1   1E+02  0.0022   21.2   2.4   17  149-165    22-38  (70)
 43 TIGR02029 AcsF magnesium-proto  20.1      87  0.0019   28.0   2.5   31  139-170    61-91  (337)

No 1  
>KOG4024|consensus
Probab=100.00  E-value=9.9e-48  Score=314.45  Aligned_cols=191  Identities=42%  Similarity=0.655  Sum_probs=176.5

Q ss_pred             ChHHHHHHHHHHHHHHhcCCCCCCCCCCCCcEEEeCCCEEEEEEeeCCeEEEEEEEeCCCCCCCCCCCCCCCCCCc-ccc
Q psy16911          1 EKELFEFLSEEIQAEKKIQKSKSIPSEIDGFQVKLDGPEVSLEKTVGSEKIEIHFNVNHSVDADAEPQIDPNMDEP-EIE   79 (194)
Q Consensus         1 ~~~L~~~L~~EI~~E~~~~~~~~~p~~~~~f~v~~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~~~~e~~~~~~~~-e~~   79 (194)
                      +++|+++|..||+.|+++++...+|..+.||+|+..|++|.|+|+.++|+|.|.||||++++++++.|+++++... ..+
T Consensus        70 ~~alv~fl~~Ei~~erk~qkgkt~Pkt~~Gf~v~l~gaev~Ltkk~~~Eki~V~FNvNhsvd~~e~~e~ePsq~~~~~pe  149 (266)
T KOG4024|consen   70 QQALVRFLEAEIQLERKNQKGKTAPKTFAGFQVTLKGAEVRLTKKNGSEKILVVFNVNHSVDMDEGFEDEPSQAVAPVPE  149 (266)
T ss_pred             HHHHHHHHHHHHHHHHHhhcCCccCccccceEEEecCceeEEeeccCCceEEEEEecCCccCcccCCccCccccCCCCcc
Confidence            4799999999999999999999999999999999999999999999999999999999999998877777776643 348


Q ss_pred             cCCCCceEEEEEeCCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCC----CCCcccccCCCCCCCHHHHHHHHHH
Q psy16911         80 MKSKPSFEVNFIKGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYERE----NGKTTTIACLEEVLDGPLYDLLMNY  155 (194)
Q Consensus        80 ~~~~~~f~V~I~K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~----~~~~~~Y~g~~~~LDe~Lq~~~~~y  155 (194)
                      ++|.|+|+|.|.||+++|.|+|++.++|.  |++++++.|.+..+..+.+.    |++. +|+..++.||+.||++|++|
T Consensus       150 ~tS~P~F~VeiiKG~~~L~f~c~~~~~e~--QedEy~~~~~~~~~a~f~~~gn~~~~d~-vYa~~gd~lD~~LyDlL~~~  226 (266)
T KOG4024|consen  150 ATSMPPFTVEIIKGDQRLCFHCELVPVED--QEDEYDFRVEEFYVAPFAKNGNEDVPDE-VYASSGDYLDPDLYDLLFVY  226 (266)
T ss_pred             cccCCCeEEEEEeCCeeEEEEeecCCccc--CcccccceeeeEEeeecccCCCccCCcc-eeeccCcccCHHHHHHHHHH
Confidence            89999999999999999999999999987  46788988888888888655    4577 99999999999999999999


Q ss_pred             HHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhhcC
Q psy16911        156 LEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFTSQ  194 (194)
Q Consensus       156 LeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv~~  194 (194)
                      |++|||+++||.-|++|++.+||++||.+|+++|+|+++
T Consensus       227 LeerG~d~~Fa~~Lv~laTa~EH~~YIglLeklkkF~~~  265 (266)
T KOG4024|consen  227 LEERGLDARFAKTLVALATAYEHSQYIGLLEKLKKFISK  265 (266)
T ss_pred             HHHcCccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhcC
Confidence            999999999999999999999999999999999999974


No 2  
>PF02330 MAM33:  Mitochondrial glycoprotein;  InterPro: IPR003428 This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q.; GO: 0005759 mitochondrial matrix; PDB: 3QV0_A 1YQF_F 3JV1_A 1P32_A 3RPX_A.
Probab=100.00  E-value=1.4e-44  Score=298.57  Aligned_cols=170  Identities=37%  Similarity=0.537  Sum_probs=128.3

Q ss_pred             HHHHHHHHHhcCCC-----CCCCCCCCCcEEE--eCCCEEEEEEeeCCeEEEEEEEeCCCCCCC--CCCCCCC----CCC
Q psy16911          8 LSEEIQAEKKIQKS-----KSIPSEIDGFQVK--LDGPEVSLEKTVGSEKIEIHFNVNHSVDAD--AEPQIDP----NMD   74 (194)
Q Consensus         8 L~~EI~~E~~~~~~-----~~~p~~~~~f~v~--~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~--~~~e~~~----~~~   74 (194)
                      |++||++|++++..     ..+|+  .+|+++  .|+++|+|+|+||+|+|+|+|||++++++.  ++++.+.    +.+
T Consensus         1 L~~EI~~E~~~~~~~~~~~~~l~~--~~f~i~d~~g~~~V~L~r~~~~E~I~V~f~v~~~~~~~~~~~~~~~~~~e~~~~   78 (204)
T PF02330_consen    1 LKSEIKYEKENYESPDSLKEFLPP--SPFEIEDKPGGSEVTLTREFGGEKIRVTFNVNDSVDPPFDEDEDEEEGEEEEEE   78 (204)
T ss_dssp             HHHHHHHHHHHCT-HHHHCCHHHH--TTTCEEEECCTTEEEEEEEETTEEEEEEEECCCH--------------------
T ss_pred             ChHHHHHHHHcccccHHHHhhCCC--CCCEEEEcCCCcEEEEEEecCCcEEEEEEECCccCCCCCCccccchhccccccc
Confidence            78999999999872     23444  456665  567899999999999999999999876332  1111111    000


Q ss_pred             --Ccc---ccc-----CCCCceEEEEEeCCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecC----------CCCCc
Q psy16911         75 --EPE---IEM-----KSKPSFEVNFIKGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYER----------ENGKT  134 (194)
Q Consensus        75 --~~e---~~~-----~~~~~f~V~I~K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~----------~~~~~  134 (194)
                        ..+   ..+     .+.+.|+|.|+|++++|.|.|++.+           |.|.|++|+++..          +|  +
T Consensus        79 ~~~~~~fe~~~~~~~~~~~~~~~v~~~k~~~~l~f~~~~~~-----------d~~~I~~v~~~~~~~~~~~~~~~e~--~  145 (204)
T PF02330_consen   79 EISEPDFEVEIPKEDEVSFPFFVVVISKPGDGLEFDCSAQD-----------DEFSIDNVSFYPDGDSTKDSAEAEW--E  145 (204)
T ss_dssp             ------CEEECCCCCCCEEEEEEEEEECTCEEEEEEEECCT-----------TEEEEEEEEEESSHHHH--SHHHHH--H
T ss_pred             ccccccchhcccccccccceeEEEEEEcCCCceEEeEeccC-----------ceEEEEEEEEecCccccccccchhc--c
Confidence              000   011     1235688999999888999999954           4499999999853          44  3


Q ss_pred             ccccCC-CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhh
Q psy16911        135 TTIACL-EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFT  192 (194)
Q Consensus       135 ~~Y~g~-~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv  192 (194)
                      ..|.|| |++||++||++|++||++||||++||+||++||++|||++|++||++|++||
T Consensus       146 ~~y~gp~f~~LDe~Lq~~~~~yLeeRGId~~la~fl~~y~~~kEq~eYi~wL~~l~~Fv  204 (204)
T PF02330_consen  146 DAYYGPDFDELDENLQDAFMNYLEERGIDEELANFLHDYSTDKEQREYIRWLKNLKKFV  204 (204)
T ss_dssp             CSS-ECEGGGSBHHHHHHHHHHHHHTT-SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ccccCCccccCCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            479997 9999999999999999999999999999999999999999999999999997


No 3  
>KOG2536|consensus
Probab=100.00  E-value=3.9e-42  Score=288.62  Aligned_cols=182  Identities=25%  Similarity=0.360  Sum_probs=144.8

Q ss_pred             hHHHHHHHHHHHHHHhcCCCCC-----CCCCCCCcEEE--eCCCEEEEEEeeCCeEEEEEEEeCCCCCCCCCCCCC--CC
Q psy16911          2 KELFEFLSEEIQAEKKIQKSKS-----IPSEIDGFQVK--LDGPEVSLEKTVGSEKIEIHFNVNHSVDADAEPQID--PN   72 (194)
Q Consensus         2 ~~L~~~L~~EI~~E~~~~~~~~-----~p~~~~~f~v~--~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~~~~e~~--~~   72 (194)
                      ..|++.+.+||+++++.....+     .|.....|.+.  +|+++|+|+|+|+||+|+|.+++...+..+.+++.+  ..
T Consensus        61 ~~~~r~i~~EI~s~~~~~~~~~~~~~~~~~~~~pf~ied~PG~q~V~L~r~~~GE~i~V~v~v~~~~~~~~~e~~de~e~  140 (263)
T KOG2536|consen   61 QNLLRVILSEIKSEKKHDPPHQVLEEEIPLSKFPFEIEDRPGEQWVVLTRKFEGETIKVEVDVTQLVTDEPEEDEDETEE  140 (263)
T ss_pred             cchHHHHHHHHHHhhhcCCchhhhhhccccCCCCcceecCCCceEEEEEeecCCceEEEEEeeehhccCCcchhcccccc
Confidence            4689999999999999876544     45444668877  455899999999999999999998654432211111  00


Q ss_pred             CC-----Cc--c-cccCCCCceEEEEEeC-CceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCCC--CCcccccCC-
Q psy16911         73 MD-----EP--E-IEMKSKPSFEVNFIKG-NKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYEREN--GKTTTIACL-  140 (194)
Q Consensus        73 ~~-----~~--e-~~~~~~~~f~V~I~K~-~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~~--~~~~~Y~g~-  140 (194)
                      ++     ++  + .+..+.++|.|+|+|+ +.+|.|.|+++||+           |.|++++++.+.-  ..+.+|.|| 
T Consensus       141 ~~~~~~~~~~~~~~~~~~~iplvV~vsK~~~~sLef~cta~pD~-----------i~Idslsv~~~~d~~~~~l~Y~GP~  209 (263)
T KOG2536|consen  141 EDEDEDQEDDDPLSDNISNIPLVVTVSKGDGVSLEFLCTAFPDE-----------IVIDSLSVRPPIDASEDQLAYEGPS  209 (263)
T ss_pred             ccccccccccccCCCcceeeEEEEEEecCCCceEEEEeecCCCc-----------eEEeeeEecCCCCccchhhhccCCC
Confidence            00     00  1 1344668899999995 68999999998754           9999999985432  233599996 


Q ss_pred             CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhhcC
Q psy16911        141 EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFTSQ  194 (194)
Q Consensus       141 ~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv~~  194 (194)
                      |.+||++||++||+|||+|||+++|+.||++||.+|+++||++||++||+||++
T Consensus       210 F~eLDe~Lqd~fh~fLEeRGI~esl~~FL~~ym~~Kd~rEYl~WlksvK~Fv~~  263 (263)
T KOG2536|consen  210 FEELDEELQDSFHRFLEERGIKESLASFLHAYMKNKDSREYLRWLKSVKSFVEQ  263 (263)
T ss_pred             hhHHhHHHHHHHHHHHHHcCCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHHhcC
Confidence            999999999999999999999999999999999999999999999999999975


No 4  
>PF08513 LisH:  LisH;  InterPro: IPR013720 The LisH motif is found in a large number of eukaryotic proteins, from metazoa, fungi and plants that have a wide range of functions. The recently solved structure of the LisH domain in the N-terminal region of LIS1 depicted it as a novel dimerization motif, and that other structural elements are likely to play an important role in dimerisation [, , ].  The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex []. ; PDB: 2XTE_L 2XTC_B 2XTD_A 1UUJ_B.
Probab=75.66  E-value=5.4  Score=22.25  Aligned_cols=25  Identities=24%  Similarity=0.313  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911        146 GPLYDLLMNYLEEKGISNEFVDKLS  170 (194)
Q Consensus       146 e~Lq~~~~~yLeeRGId~ela~fl~  170 (194)
                      ++|..++++||.++|-......|..
T Consensus         1 ~~Ln~lI~~YL~~~Gy~~tA~~f~~   25 (27)
T PF08513_consen    1 EELNQLIYDYLVENGYKETAKAFAK   25 (27)
T ss_dssp             HHHHHHHHHHHHHCT-HHHHHHHHH
T ss_pred             CHHHHHHHHHHHHCCcHHHHHHHHh
Confidence            4678899999999999998888764


No 5  
>TIGR01881 cas_Cmr5 CRISPR-associated protein, Cmr5 family. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This family, represented by TM1791.1 of Thermotoga maritima, is found in both archaeal and bacterial species as part of the 6-gene CRISPR RAMP module.
Probab=73.67  E-value=11  Score=28.91  Aligned_cols=43  Identities=19%  Similarity=0.393  Sum_probs=28.8

Q ss_pred             HHHHHHHHHHHHcCCCHH---HHHHHHH-----hh-hhHHHHHHHHHHHHHh
Q psy16911        147 PLYDLLMNYLEEKGISNE---FVDKLSN-----LS-TGYEHASYINLLSEIQ  189 (194)
Q Consensus       147 ~Lq~~~~~yLeeRGId~e---la~fl~~-----y~-~~kEq~EYi~wL~~vk  189 (194)
                      -|+.++..||.+||+-+.   +...|.+     |+ ...|=-+|..||+.+.
T Consensus        70 ~l~~~l~~~L~~~~~~~~~~~l~~~il~~~~~~y~~~T~eaL~~l~WLKRfA  121 (127)
T TIGR01881        70 KYYAHILYWLKERELVDKFKNALKELMEKDYSDSRIATEEALRYLNWLKRLA  121 (127)
T ss_pred             HHHHHHHHHHHHcccccchHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHH
Confidence            488999999999997654   3333322     32 3346677888887753


No 6  
>KOG4233|consensus
Probab=61.75  E-value=14  Score=26.50  Aligned_cols=35  Identities=17%  Similarity=0.117  Sum_probs=26.2

Q ss_pred             HHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHH
Q psy16911        154 NYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEI  188 (194)
Q Consensus       154 ~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~v  188 (194)
                      .-|.++|+|-...-+=.=+...|...++..||+..
T Consensus        33 ~~L~~~GfdkAYvllGQfLllkKdE~lF~~Wlk~~   67 (90)
T KOG4233|consen   33 IKLVDAGFDKAYVLLGQFLLLKKDEDLFQEWLKET   67 (90)
T ss_pred             hhHHhccccHHHHHHHHHHHhcccHHHHHHHHHHH
Confidence            44778999987554444456789999999999864


No 7  
>PF10929 DUF2811:  Protein of unknown function (DUF2811);  InterPro: IPR021231  This is a bacterial family of uncharacterised proteins. 
Probab=58.82  E-value=9  Score=25.56  Aligned_cols=32  Identities=9%  Similarity=0.141  Sum_probs=26.5

Q ss_pred             CCCCHHHHHHHHHHHHH-------cCCCHHHHHHHHHhh
Q psy16911        142 EVLDGPLYDLLMNYLEE-------KGISNEFVDKLSNLS  173 (194)
Q Consensus       142 ~~LDe~Lq~~~~~yLee-------RGId~ela~fl~~y~  173 (194)
                      .++++.|++++.+||+.       |=+.+.||.||.+-.
T Consensus         4 aeiPe~L~~~m~~fie~hP~WDQ~Rl~~aALa~FL~QnG   42 (57)
T PF10929_consen    4 AEIPEDLHQAMKDFIETHPNWDQYRLFQAALAGFLLQNG   42 (57)
T ss_pred             ccccHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcC
Confidence            46899999999999985       678888999987643


No 8  
>PHA01623 hypothetical protein
Probab=54.48  E-value=41  Score=22.08  Aligned_cols=30  Identities=13%  Similarity=0.223  Sum_probs=25.3

Q ss_pred             CCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhh
Q psy16911        143 VLDGPLYDLLMNYLEEKGISNEFVDKLSNLST  174 (194)
Q Consensus       143 ~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~  174 (194)
                      .||++|...|..|.+..|++  .+++|.+-..
T Consensus        19 rldeel~~~Ld~y~~~~g~~--rSe~IreAI~   48 (56)
T PHA01623         19 YMDKDLKTRLKVYCAKNNLQ--LTQAIEEAIK   48 (56)
T ss_pred             EeCHHHHHHHHHHHHHcCCC--HHHHHHHHHH
Confidence            49999999999999999998  7777765543


No 9  
>PHA02360 hypothetical protein
Probab=53.23  E-value=14  Score=25.22  Aligned_cols=17  Identities=41%  Similarity=0.729  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHcCCC
Q psy16911        146 GPLYDLLMNYLEEKGIS  162 (194)
Q Consensus       146 e~Lq~~~~~yLeeRGId  162 (194)
                      +-||..+++|-|+|||+
T Consensus        29 PklY~~i~k~YEe~gid   45 (70)
T PHA02360         29 PKLYKKIRKYYEEEGID   45 (70)
T ss_pred             HHHHHHHHHHHHHcCCc
Confidence            67999999999999997


No 10 
>PF14265 DUF4355:  Domain of unknown function (DUF4355)
Probab=52.46  E-value=44  Score=24.94  Aligned_cols=40  Identities=25%  Similarity=0.253  Sum_probs=30.8

Q ss_pred             HHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHH
Q psy16911        147 PLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSE  187 (194)
Q Consensus       147 ~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~  187 (194)
                      +|.....+.|.+.|++.+|+.+|. ..+..+-...+..|..
T Consensus        69 e~~~~~~~~l~e~GLp~~l~~~l~-g~d~e~~~~~v~~l~~  108 (125)
T PF14265_consen   69 ELRSEAKKVLAEKGLPAELADFLV-GDDEEETKANVKALKK  108 (125)
T ss_pred             HHHHHHHHHHHHcCCCHHHHHHHh-CCCHHHHHHHHHHHHH
Confidence            456777889999999999999996 5666666666666654


No 11 
>PF00123 Hormone_2:  Peptide hormone;  InterPro: IPR000532 A number of polypeptidic hormones, mainly expressed in the intestine or the pancreas, belong to a group of structurally related peptides [, ]. Once such hormone, glucagon is widely distributed and produced in the alpha-cells of pancreatic islets []. It affects glucose metabolism in the liver [] by inhibiting glycogen synthesis, stimulating glycogenolysis and enchancing gluconeogenesis. It also increases mobilisation of glucose, free fatty acids and ketone bodies, which are metabolites produced in excess in diabetes mellitus. Glucagon is produced, like other peptide hormones, as part of a larger precursor (preproglucagon), which is cleaved to produce glucagon, glucagon-like protein I and glucagon-like protein II []. The structure of glucagon itself is fully conserved in all known mammalian species []. Other members of the structurally similar group include glicentin precursor, secretin, gastric inhibitory protein, vasoactive intestinal peptide (VIP), prealbumin, peptide HI-27 and growth hormone releasing factor.; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1T5Q_A 2OBU_A 2QKH_B 2L70_A 2L71_A 2B4N_A 1GCN_A 1D0R_A 3IOL_B 2RRI_A ....
Probab=49.69  E-value=31  Score=19.62  Aligned_cols=22  Identities=14%  Similarity=0.264  Sum_probs=14.5

Q ss_pred             HHHHHHHhhhhHHHHHHHHHHH
Q psy16911        165 FVDKLSNLSTGYEHASYINLLS  186 (194)
Q Consensus       165 la~fl~~y~~~kEq~EYi~wL~  186 (194)
                      |.+-...|....--|+|++||-
T Consensus         6 ftsdys~~L~~~aak~fl~~L~   27 (28)
T PF00123_consen    6 FTSDYSKYLDQLAAKKFLQWLM   27 (28)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHC
T ss_pred             ccHHHHHHHHHHHHHHHHHHHh
Confidence            4444555555566689999984


No 12 
>PF13154 DUF3991:  Protein of unknown function (DUF3991)
Probab=49.34  E-value=11  Score=26.10  Aligned_cols=18  Identities=50%  Similarity=0.818  Sum_probs=14.7

Q ss_pred             HHHHH-cCCCHHHHHHHHH
Q psy16911        154 NYLEE-KGISNEFVDKLSN  171 (194)
Q Consensus       154 ~yLee-RGId~ela~fl~~  171 (194)
                      +||.+ |||+.++.+.+..
T Consensus         1 ~YL~~~RgI~~~~v~~~~~   19 (77)
T PF13154_consen    1 AYLTEERGIDPEIVDAFIN   19 (77)
T ss_pred             CchhhhcCcCHHHHHHHHH
Confidence            37776 9999999988865


No 13 
>PF02961 BAF:  Barrier to autointegration factor;  InterPro: IPR004122 Barrier-to-autointegration factor (BAF) is an essential protein that is highly conserved in metazoan evolution, and which may act as a DNA-bridging protein []. BAF binds directly to double-stranded DNA, to transcription activators, and to inner nuclear membrane proteins, including lamin A filament proteins that anchor nuclear-pore complexes in place, and nuclear LEM-domain proteins that bind to laminins filaments and chromatin. New findings suggest that BAF has structural roles in nuclear assembly and chromatin organisation, represses gene expression and might interlink chromatin structure, nuclear architecture and gene regulation in metazoans []. BAF can be exploited by retroviruses to act as a host component of pre-integration complexes, which promote the integration of the retroviral DNA into the host chromosome by preventing autointegration of retroviral DNA []. BAF might contribute to the assembly or activity of retroviral pre-integration complexes through direct binding to the retroviral proteins p55 Gag and matrix, as well as to DNA.; GO: 0003677 DNA binding; PDB: 2ODG_A 2BZF_A 2EZX_B 2EZY_B 1QCK_B 1CI4_B 2EZZ_B.
Probab=49.00  E-value=28  Score=25.28  Aligned_cols=34  Identities=24%  Similarity=0.280  Sum_probs=22.9

Q ss_pred             HHHHHHHcCCCHH---HHHHHHHhhhhHHHHHHHHHHHHH
Q psy16911        152 LMNYLEEKGISNE---FVDKLSNLSTGYEHASYINLLSEI  188 (194)
Q Consensus       152 ~~~yLeeRGId~e---la~fl~~y~~~kEq~EYi~wL~~v  188 (194)
                      +-.=|+++|++-.   |++||   ...|.+..+..||++.
T Consensus        31 lg~~L~~~GfdKAy~vLGqfL---ll~kde~~F~~WLk~~   67 (89)
T PF02961_consen   31 LGKRLEEKGFDKAYVVLGQFL---LLKKDEELFQDWLKDT   67 (89)
T ss_dssp             HHHHHHHTT--BHHHHHHHHH---HTTT-HHHHHHHHHHH
T ss_pred             HHHHHHHCCCcHHHHHhhhhh---hccCcHHHHHHHHHHH
Confidence            4456889999874   34444   5788899999999974


No 14 
>PF08275 Toprim_N:  DNA primase catalytic core, N-terminal domain;  InterPro: IPR013264 This is the N-terminal, catalytic core domain of DNA primases. DNA primase (2.7.7 from EC) is a nucleotidyltransferase which synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork. It can also prime the leading strand and has been implicated in cell division []. ; PDB: 1EQN_E 1DD9_A 3B39_B 1DDE_A 2AU3_A.
Probab=44.97  E-value=16  Score=27.73  Aligned_cols=18  Identities=28%  Similarity=0.665  Sum_probs=13.2

Q ss_pred             HHHHHHHHHcCCCHHHHH
Q psy16911        150 DLLMNYLEEKGISNEFVD  167 (194)
Q Consensus       150 ~~~~~yLeeRGId~ela~  167 (194)
                      ..-.+||..|||+.+++.
T Consensus        12 ~~a~~YL~~Rgl~~e~i~   29 (128)
T PF08275_consen   12 KEALEYLKKRGLSDETIK   29 (128)
T ss_dssp             HHHHHHHHHTT--HHHHH
T ss_pred             HHHHHHHHHcCCCHHHHH
Confidence            347899999999999874


No 15 
>PF13376 OmdA:  Bacteriocin-protection, YdeI or OmpD-Associated
Probab=43.90  E-value=55  Score=21.57  Aligned_cols=39  Identities=15%  Similarity=0.146  Sum_probs=27.8

Q ss_pred             CCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHh
Q psy16911        143 VLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQ  189 (194)
Q Consensus       143 ~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk  189 (194)
                      .++++|+.+|..       +.....|-..++ .-.|+.|+.|+...|
T Consensus         5 ~vP~dl~~aL~~-------~p~a~~~f~~l~-~~~rr~~i~wi~~AK   43 (63)
T PF13376_consen    5 EVPEDLEAALEA-------NPEAKEFFESLT-PSYRREYIRWINSAK   43 (63)
T ss_pred             CCCHHHHHHHHC-------CHHHHHHHHHCC-HHHHHHHHHHHHHcC
Confidence            577888888764       455556655554 456999999998755


No 16 
>PF07063 DUF1338:  Domain of unknown function (DUF1338);  InterPro: IPR009770 This domain is found in a variety of bacterial and fungal hypothetical proteins of unknown function. The structure of this domain has been solved by structural genomics. The structure implies a zinc-binding function, so it is a putative metal hydrolase (information derived from TOPSAN for PDB:3iuz).; PDB: 3LHO_A 3IUZ_A 2RJB_C.
Probab=37.05  E-value=68  Score=28.32  Aligned_cols=46  Identities=17%  Similarity=0.242  Sum_probs=33.2

Q ss_pred             CCCCHHHHHHHHHHHHHcCCCHHHHHH-----------------------------------HHHhhhhHHHHHHHHHHH
Q psy16911        142 EVLDGPLYDLLMNYLEEKGISNEFVDK-----------------------------------LSNLSTGYEHASYINLLS  186 (194)
Q Consensus       142 ~~LDe~Lq~~~~~yLeeRGId~ela~f-----------------------------------l~~y~~~kEq~EYi~wL~  186 (194)
                      +.|.+.+|+.+.+||..|.+-..-+.-                                   ..+|-.-++.++|..|+-
T Consensus       100 e~ls~~~q~~i~~~l~~r~~~~~~~~~~l~~~~~~~~l~~~~a~~~v~~~~~~~~W~~~~ps~~~Y~~L~~eSe~aAWi~  179 (302)
T PF07063_consen  100 EELSPELQEIIEKYLSRRDPFSPRPLELLDQAERQGGLSAEQADEFVAEALSGFPWQHRAPSLADYQTLLAESEYAAWIA  179 (302)
T ss_dssp             GGSSHHHHHHHHHHHTTS-GCGHHHHHHHHHHHHHSCEBHHHHHHHHHHHGGGCSS-SB-SBHHHHHHHHHH-HHHHHHH
T ss_pred             hhcCHHHHHHHHHHHhcccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHHHHcccCCCCCCCCHHHHHHHHHHhHHHHhhc
Confidence            678899999999999777776433222                                   235777789999999986


Q ss_pred             H
Q psy16911        187 E  187 (194)
Q Consensus       187 ~  187 (194)
                      -
T Consensus       180 ~  180 (302)
T PF07063_consen  180 A  180 (302)
T ss_dssp             H
T ss_pred             c
Confidence            5


No 17 
>PF00070 Pyr_redox:  Pyridine nucleotide-disulphide oxidoreductase;  InterPro: IPR001327  FAD flavoproteins belonging to the family of pyridine nucleotide-disulphide oxidoreductases (glutathione reductase, trypanothione reductase, lipoamide dehydrogenase, mercuric reductase, thioredoxin reductase, alkyl hydroperoxide reductase) share sequence similarity with a number of other flavoprotein oxidoreductases, in particular with ferredoxin-NAD+ reductases involved in oxidative metabolism of a variety of hydrocarbons (rubredoxin reductase, putidaredoxin reductase, terpredoxin reductase, ferredoxin-NAD+ reductase components of benzene 1,2-dioxygenase, toluene 1,2-dioxygenase, chlorobenzene dioxygenase, biphenyl dioxygenase), NADH oxidase and NADH peroxidase [, , ]. Comparison of the crystal structures of human glutathione reductase and Escherichia coli thioredoxin reductase reveals different locations of their active sites, suggesting that the enzymes diverged from an ancestral FAD/NAD(P)H reductase and acquired their disulphide reductase activities independently [].   Despite functional similarities, oxidoreductases of this family show no sequence similarity with adrenodoxin reductases [] and flavoprotein pyridine nucleotide cytochrome reductases (FPNCR) []. Assuming that disulphide reductase activity emerged later, during divergent evolution, the family can be referred to as FAD-dependent pyridine nucleotide reductases, FADPNR. To date, 3D structures of glutathione reductase [], thioredoxin reductase [], mercuric reductase [], lipoamide dehydrogenase [], trypanothione reductase [] and NADH peroxidase [] have been solved. The enzymes share similar tertiary structures based on a doubly-wound alpha/beta fold, but the relative orientations of their FAD- and NAD(P)H-binding domains may vary significantly. By contrast with the FPNCR family, the folds of the FAD- and NAD(P)H-binding domains are similar, suggesting that the domains evolved by gene duplication [].  This entry describes a small NADH binding domain within a larger FAD binding domain described by IPR023753 from INTERPRO. It is found in both class I and class II oxidoreductases. ; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZKQ_A 3DGZ_A 1ZDL_A 2R9Z_B 2RAB_A 2A87_B 1M6I_A 2YVG_A 2GR1_A 2GQW_A ....
Probab=36.97  E-value=29  Score=23.45  Aligned_cols=21  Identities=24%  Similarity=0.468  Sum_probs=19.5

Q ss_pred             CCCCHHHHHHHHHHHHHcCCC
Q psy16911        142 EVLDGPLYDLLMNYLEEKGIS  162 (194)
Q Consensus       142 ~~LDe~Lq~~~~~yLeeRGId  162 (194)
                      ..+|+.+...+.++|+.+||+
T Consensus        36 ~~~~~~~~~~~~~~l~~~gV~   56 (80)
T PF00070_consen   36 PGFDPDAAKILEEYLRKRGVE   56 (80)
T ss_dssp             TTSSHHHHHHHHHHHHHTTEE
T ss_pred             hhcCHHHHHHHHHHHHHCCCE
Confidence            579999999999999999986


No 18 
>PRK11814 cysteine desulfurase activator complex subunit SufB; Provisional
Probab=36.78  E-value=66  Score=30.19  Aligned_cols=44  Identities=14%  Similarity=0.099  Sum_probs=33.2

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHH-----HHHHHHHHHH
Q psy16911        141 EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEH-----ASYINLLSEI  188 (194)
Q Consensus       141 ~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq-----~EYi~wL~~v  188 (194)
                      ...||++    -.=||..|||+.+-|.-|.-+....|=     -++++|+.++
T Consensus       428 vG~idee----qLFYL~SRGi~e~eA~~Llv~gF~~evi~~ip~e~~~e~~~~  476 (486)
T PRK11814        428 TSKISED----QLFYCRQRGISEEDAVSMIVNGFCKEVFQELPMEFAVEAQKL  476 (486)
T ss_pred             cCCCCHH----HHHHHHHcCCCHHHHHHHHHHHHHHHHHHHCCcHHHHHHHHH
Confidence            3557754    345999999999999988887776663     4788888764


No 19 
>PF13794 MiaE_2:  tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like; PDB: 3EZ0_C.
Probab=35.58  E-value=31  Score=28.39  Aligned_cols=45  Identities=16%  Similarity=0.249  Sum_probs=32.4

Q ss_pred             HHHHHHHHHHHcCCCHH--HHHHHHHhhhhHHHHHHHHHHHH-Hhhhh
Q psy16911        148 LYDLLMNYLEEKGISNE--FVDKLSNLSTGYEHASYINLLSE-IQKFT  192 (194)
Q Consensus       148 Lq~~~~~yLeeRGId~e--la~fl~~y~~~kEq~EYi~wL~~-vk~Fv  192 (194)
                      -++.+...|.+||++..  +.-|...+-.+.....=-.|++. ||.||
T Consensus        51 hf~~L~~~l~~~G~d~~~am~pf~~~ld~f~~rT~P~dW~E~LvKaYV   98 (185)
T PF13794_consen   51 HFERLEARLAERGVDPEEAMEPFVGALDAFHARTRPSDWLESLVKAYV   98 (185)
T ss_dssp             HHHHHHHHHHHTT--HHHHHGGGHHHHHHHHHTT--SSHHHHHHHHHH
T ss_pred             HHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCChHHHHHHHHH
Confidence            46788999999999996  77788888777777777889987 45554


No 20 
>smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p.
Probab=32.18  E-value=61  Score=24.82  Aligned_cols=30  Identities=23%  Similarity=0.255  Sum_probs=25.8

Q ss_pred             HHHHHHHHHcCCCHHHHHHHHHhhhhHHHH
Q psy16911        150 DLLMNYLEEKGISNEFVDKLSNLSTGYEHA  179 (194)
Q Consensus       150 ~~~~~yLeeRGId~ela~fl~~y~~~kEq~  179 (194)
                      ..+..||..+||-..+|.|=..|+...|.+
T Consensus        83 ~~if~~L~~~~IG~~~AlfYe~~A~~lE~~  112 (125)
T smart00777       83 RELFQFLYSKGIGTKLALFYEEWAQLLEAA  112 (125)
T ss_pred             HHHHHHHHHCCcchhhHHHHHHHHHHHHHc
Confidence            457899999999999999999888776654


No 21 
>COG4003 Uncharacterized protein conserved in archaea [Function unknown]
Probab=31.89  E-value=72  Score=23.15  Aligned_cols=26  Identities=31%  Similarity=0.474  Sum_probs=18.4

Q ss_pred             HHHHHHHHcC-CCHHHHHHHHHhhhhH
Q psy16911        151 LLMNYLEEKG-ISNEFVDKLSNLSTGY  176 (194)
Q Consensus       151 ~~~~yLeeRG-Id~ela~fl~~y~~~k  176 (194)
                      .+.+||+.|| |+.+.|.+|..-..+|
T Consensus        52 Eii~yleKrGEi~~E~A~~L~~~~~~k   78 (98)
T COG4003          52 EIINYLEKRGEITPEMAKALRVTLVHK   78 (98)
T ss_pred             HHHHHHHHhCCCCHHHHHHHHhhHHHH
Confidence            4568888888 7888888876544443


No 22 
>PF09442 DUF2018:  Domain of unknown function (DUF2018);  InterPro: IPR018563  This entry represents proteins of unknown function. The crystal structure of HP0242, a hypothetical protein from Helicobacter pylori (Campylobacter pylori) has been determined. It reveals an acid-adaptive protein possibly of physiological significance when H. pylori colonises the human stomach. The protein adopts a unique four alpha-helical triangular conformations. The biologically active form is thought to be a tetramer. The gene exist in an operon along with 6 other genes where the gene products appear to be related to iron storage and haem biosynthesis []. ; PDB: 3MLG_B 3MLI_C 2OUF_A 2BO3_A.
Probab=30.44  E-value=1.5e+02  Score=21.15  Aligned_cols=39  Identities=21%  Similarity=0.395  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHHcCCCHHHHHHHHHhh-------hhHHHHHHHHHHHHH
Q psy16911        148 LYDLLMNYLEEKGISNEFVDKLSNLS-------TGYEHASYINLLSEI  188 (194)
Q Consensus       148 Lq~~~~~yLeeRGId~ela~fl~~y~-------~~kEq~EYi~wL~~v  188 (194)
                      -+.++...|+++|++  +-.+|..|.       ..+-+.-||....++
T Consensus        31 r~aamE~llEe~g~~--~e~~ik~~~~~~~~~i~~~~~dl~Ie~mg~I   76 (81)
T PF09442_consen   31 RLAAMEELLEENGLD--VEEEIKNFIFENEDKIENRKNDLYIESMGNI   76 (81)
T ss_dssp             HHHHHHHHHHHTT-H--HHHHHHGGGT-HHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHcCCC--HHHHHHHHHhccHHHHHHHhcchHHHHHHHH
Confidence            466788999999999  444444443       445567777776654


No 23 
>PF13434 K_oxygenase:  L-lysine 6-monooxygenase (NADPH-requiring); PDB: 3S61_B 3S5W_B.
Probab=29.88  E-value=52  Score=29.14  Aligned_cols=33  Identities=21%  Similarity=0.436  Sum_probs=25.5

Q ss_pred             HHHHHHHHcCCCHHHHHHH---HHhhhhHHHHHHHHHHH
Q psy16911        151 LLMNYLEEKGISNEFVDKL---SNLSTGYEHASYINLLS  186 (194)
Q Consensus       151 ~~~~yLeeRGId~ela~fl---~~y~~~kEq~EYi~wL~  186 (194)
                      .|.+||.++|   .|..|+   .-+....|=..|++|..
T Consensus        70 sflnYL~~~~---rl~~f~~~~~~~p~R~ef~dYl~Wva  105 (341)
T PF13434_consen   70 SFLNYLHEHG---RLYEFYNRGYFFPSRREFNDYLRWVA  105 (341)
T ss_dssp             SHHHHHHHTT----HHHHHHH--SS-BHHHHHHHHHHHH
T ss_pred             cHHHHHHHcC---ChhhhhhcCCCCCCHHHHHHHHHHHH
Confidence            5899999998   666666   46678899999999974


No 24 
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=29.84  E-value=38  Score=23.53  Aligned_cols=15  Identities=40%  Similarity=0.598  Sum_probs=13.8

Q ss_pred             EeeCCeEEEEEEEeC
Q psy16911         44 KTVGSEKIEIHFNVN   58 (194)
Q Consensus        44 r~~~~E~I~V~f~v~   58 (194)
                      |+|.+|+|-|.||.+
T Consensus         9 r~y~Gekidi~fn~~   23 (74)
T COG3592           9 RKYRGEKIDIYFNTA   23 (74)
T ss_pred             ceeccceEEEEeccc
Confidence            789999999999976


No 25 
>PF04026 SpoVG:  SpoVG;  InterPro: IPR007170 This is a stage V sporulation protein G. It is essential for sporulation and specific to stage V sporulation in Bacillus megaterium and Bacillus subtilis []. In B. subtilis, expression decreases after 30-60 minutes of cold shock [].; GO: 0030435 sporulation resulting in formation of a cellular spore; PDB: 2IA9_F 2I9X_B 2I9Z_A.
Probab=29.50  E-value=1.3e+02  Score=21.37  Aligned_cols=35  Identities=26%  Similarity=0.359  Sum_probs=25.4

Q ss_pred             eEEEEEe--CCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCC
Q psy16911         86 FEVNFIK--GNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYERE  130 (194)
Q Consensus        86 f~V~I~K--~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~  130 (194)
                      ..|.|.+  +.+.|.--|+..          .+|.|.|.+|.+.+|+
T Consensus         4 tdVri~~~~~~~~lka~asV~----------~dd~f~I~~ikVieg~   40 (84)
T PF04026_consen    4 TDVRIRKIEPEGKLKAFASVT----------FDDCFVIHDIKVIEGE   40 (84)
T ss_dssp             EEEEEEETTSSSSEEEEEEEE----------ETTTEEEEEEEEEEET
T ss_pred             EEEEEEEecCCCCEEEEEEEE----------ECCEEEEEeEEEEECC
Confidence            3677877  447777777773          3456999999998764


No 26 
>PF02268 TFIIA_gamma_N:  Transcription initiation factor IIA, gamma subunit, helical domain;  InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=29.01  E-value=1e+02  Score=19.87  Aligned_cols=30  Identities=23%  Similarity=0.253  Sum_probs=24.9

Q ss_pred             CCCCHHHHHHHHHHHHHcCCCHHHHHHHHH
Q psy16911        142 EVLDGPLYDLLMNYLEEKGISNEFVDKLSN  171 (194)
Q Consensus       142 ~~LDe~Lq~~~~~yLeeRGId~ela~fl~~  171 (194)
                      ..|=-.|+++|.+...+.-|++.||..+..
T Consensus         9 stlG~aL~dtLDeli~~~~I~p~La~kVL~   38 (49)
T PF02268_consen    9 STLGIALTDTLDELIQEGKITPQLAMKVLE   38 (49)
T ss_dssp             SHHHHHHHHHHHHHHHTTSS-HHHHHHHHH
T ss_pred             chHHHHHHHHHHHHHHcCCCCHHHHHHHHH
Confidence            345578999999999999999999998864


No 27 
>smart00070 GLUCA Glucagon like hormones.
Probab=28.78  E-value=76  Score=17.83  Aligned_cols=16  Identities=13%  Similarity=0.225  Sum_probs=9.6

Q ss_pred             HHhhhhHHHHHHHHHH
Q psy16911        170 SNLSTGYEHASYINLL  185 (194)
Q Consensus       170 ~~y~~~kEq~EYi~wL  185 (194)
                      ..|....--+.|++||
T Consensus        11 sk~L~~~~ar~fl~~L   26 (27)
T smart00070       11 SKYLDQLAAKKFLQWL   26 (27)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3344444457788887


No 28 
>cd06206 bifunctional_CYPOR These bifunctional proteins fuse N-terminal cytochrome p450 with a cytochrome p450 reductase (CYPOR). NADPH cytochrome p450 reductase serves as an electron donor in several oxygenase systems and is a component of nitric oxide synthases and methionine synthase reductases. CYPOR transfers two electrons from NADPH to the heme of cytochrome p450 via FAD and FMN. Ferredoxin-NADP+ (oxido)reductase is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of these flavoproteins (i.e. having a non-covalently bound FAD as a prosthetic group) are present in chloroplasts, mitochondria, and bacteria in which they participate in a wide variety of redox metabolic pathways. The C-terminal domain contains most of the NADP(H) binding residues and the N-terminal domain interacts non-covalently with the isoalloxazine rings of the flavin molecule which lies largely in a la
Probab=28.30  E-value=70  Score=28.61  Aligned_cols=41  Identities=15%  Similarity=0.199  Sum_probs=36.2

Q ss_pred             HHHHHHHHHHHHHcCC-CHHHHHHHHHhhhhHHHHHHHHHHH
Q psy16911        146 GPLYDLLMNYLEEKGI-SNEFVDKLSNLSTGYEHASYINLLS  186 (194)
Q Consensus       146 e~Lq~~~~~yLeeRGI-d~ela~fl~~y~~~kEq~EYi~wL~  186 (194)
                      -.|.++|..||.=+|+ ...|-..|..|+++.++++-+..+.
T Consensus        79 ~tl~~~l~~~~Di~~~p~~~~l~~la~~~~~~~~k~~l~~~~  120 (384)
T cd06206          79 ISVSELLSSYVELSQPATRRQLAALAEATRCPDTKALLERLA  120 (384)
T ss_pred             EEHHHHHHhhccccCCCCHHHHHHHHHHCCCHHHHHHHHHhh
Confidence            5899999999999999 8999999999999998887665553


No 29 
>smart00667 LisH Lissencephaly type-1-like homology motif. Alpha-helical motif present in Lis1, treacle, Nopp140, some katanin p60 subunits, muskelin, tonneau, LEUNIG and numerous WD40 repeat-containing proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerisation, or else by binding cytoplasmic dynein heavy chain or microtubules directly.
Probab=27.42  E-value=1e+02  Score=16.55  Aligned_cols=27  Identities=19%  Similarity=0.258  Sum_probs=22.0

Q ss_pred             HHHHHHHHHHHHHcCCCHHHHHHHHHh
Q psy16911        146 GPLYDLLMNYLEEKGISNEFVDKLSNL  172 (194)
Q Consensus       146 e~Lq~~~~~yLeeRGId~ela~fl~~y  172 (194)
                      ..|..++.+||-..|-......|..+.
T Consensus         4 ~~l~~lI~~yL~~~g~~~ta~~l~~e~   30 (34)
T smart00667        4 SELNRLILEYLLRNGYEETAETLQKES   30 (34)
T ss_pred             HHHHHHHHHHHHHcCHHHHHHHHHHHh
Confidence            467888999999999998887776543


No 30 
>TIGR02793 nikR nickel-responsive transcriptional regulator NikR. Three members of the seed for this model, from Escherichia coli, Pseudomonas putida, and Brucella melitensis, are found associated with a nickel ABC transporter operon that acts to import nickel for use as a cofactor in urease or hydrogenase. These proteins, with characterized nickel-binding and DNA-binding domains, act as nickel-responsive transcriptional regulators. In the larger family of full-length homologs, most others both lack proximity to the nickel ABC transporter operon and form a separate clade. Several of the homologs not within the scope of this model, but rather scoring between the trusted and noise cutoffs, have been shown to bind nickel, copper, or both, and to regulate genes in response to nickel.
Probab=26.90  E-value=59  Score=25.00  Aligned_cols=20  Identities=25%  Similarity=0.448  Sum_probs=18.7

Q ss_pred             CCCHHHHHHHHHHHHHcCCC
Q psy16911        143 VLDGPLYDLLMNYLEEKGIS  162 (194)
Q Consensus       143 ~LDe~Lq~~~~~yLeeRGId  162 (194)
                      .||++|.+.|.++.++||.+
T Consensus         6 slp~~ll~~lD~~~~~~g~~   25 (129)
T TIGR02793         6 TLDDDLLETLDRLIARRGYQ   25 (129)
T ss_pred             EcCHHHHHHHHHHHHHcCCC
Confidence            48999999999999999986


No 31 
>PF12122 DUF3582:  Protein of unknown function (DUF3582);  InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].  This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important.  The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=26.71  E-value=62  Score=23.85  Aligned_cols=21  Identities=19%  Similarity=0.425  Sum_probs=16.0

Q ss_pred             CHHHHHHHHHHHHHcCCCHHH
Q psy16911        145 DGPLYDLLMNYLEEKGISNEF  165 (194)
Q Consensus       145 De~Lq~~~~~yLeeRGId~el  165 (194)
                      ++.+-.+|.+||..+||+..+
T Consensus         9 n~r~AqaF~DYl~sqgI~~~i   29 (101)
T PF12122_consen    9 NPRAAQAFIDYLASQGIELQI   29 (101)
T ss_dssp             SHHHHHHHHHHHHHTT--EEE
T ss_pred             CHHHHHHHHHHHHHCCCeEEE
Confidence            467889999999999998654


No 32 
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=22.78  E-value=70  Score=18.93  Aligned_cols=15  Identities=27%  Similarity=0.410  Sum_probs=12.1

Q ss_pred             HHHHHHHHHcCCCHH
Q psy16911        150 DLLMNYLEEKGISNE  164 (194)
Q Consensus       150 ~~~~~yLeeRGId~e  164 (194)
                      +.|.+||.+.||...
T Consensus         7 ~~L~~wL~~~gi~~~   21 (38)
T PF10281_consen    7 SDLKSWLKSHGIPVP   21 (38)
T ss_pred             HHHHHHHHHcCCCCC
Confidence            568899999999643


No 33 
>PRK13031 preprotein translocase subunit SecB; Provisional
Probab=22.15  E-value=1.7e+02  Score=23.25  Aligned_cols=30  Identities=7%  Similarity=0.305  Sum_probs=19.5

Q ss_pred             ceEEEEE-eCCceEEEEEEecCCCCCCCCCCCCCceEEeee
Q psy16911         85 SFEVNFI-KGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEV  124 (194)
Q Consensus        85 ~f~V~I~-K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v  124 (194)
                      .++|+++ |.++...|-|..          .+.++|.|.++
T Consensus        58 ~L~vtvta~~~~~t~FlvEv----------~qaGIF~i~ni   88 (149)
T PRK13031         58 VLTLEVKVENDGMVAFEAEV----------KQAGIFTVANM   88 (149)
T ss_pred             EEEEEEEEEeCCceEEEEEE----------eEeeEEEEcCC
Confidence            4455555 334455588877          35688999887


No 34 
>PF08311 Mad3_BUB1_I:  Mad3/BUB1 homology region 1;  InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=21.83  E-value=80  Score=23.83  Aligned_cols=29  Identities=17%  Similarity=0.226  Sum_probs=25.8

Q ss_pred             HHHHHHHHcCCCHHHHHHHHHhhhhHHHH
Q psy16911        151 LLMNYLEEKGISNEFVDKLSNLSTGYEHA  179 (194)
Q Consensus       151 ~~~~yLeeRGId~ela~fl~~y~~~kEq~  179 (194)
                      .+.+||..+||-..+|.|=..|+...|++
T Consensus        84 ~if~~l~~~~IG~~~A~fY~~wA~~le~~  112 (126)
T PF08311_consen   84 EIFKFLYSKGIGTKLALFYEEWAEFLEKR  112 (126)
T ss_dssp             HHHHHHHHHTTSTTBHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHcCccHHHHHHHHHHHHHHHHc
Confidence            46789999999999999999999887765


No 35 
>PF08747 DUF1788:  Domain of unknown function (DUF1788);  InterPro: IPR014858 This entry represents a putative uncharacterised protein of length around 200 amino acids. 
Probab=21.71  E-value=69  Score=24.50  Aligned_cols=17  Identities=41%  Similarity=0.901  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHHcCCCH
Q psy16911        147 PLYDLLMNYLEEKGISN  163 (194)
Q Consensus       147 ~Lq~~~~~yLeeRGId~  163 (194)
                      +|++.+.++|++||+=+
T Consensus         5 nL~~~~l~~l~~~~~~d   21 (126)
T PF08747_consen    5 NLYEIFLEILEERGILD   21 (126)
T ss_pred             EHHHHHHHHHHhcChHH
Confidence            69999999999999743


No 36 
>PHA03416 hypothetical E4 protein; Provisional
Probab=21.38  E-value=1.9e+02  Score=21.05  Aligned_cols=28  Identities=14%  Similarity=0.188  Sum_probs=22.5

Q ss_pred             CCcEEEeCCCEEEEEEeeCCeEEEEEEE
Q psy16911         29 DGFQVKLDGPEVSLEKTVGSEKIEIHFN   56 (194)
Q Consensus        29 ~~f~v~~~g~~v~L~r~~~~E~I~V~f~   56 (194)
                      ..|.|...|..|+|+.+-.|-.|.|++.
T Consensus        64 ~pWTV~tt~~svtlt~t~~GttVtvtlh   91 (92)
T PHA03416         64 GPWTVTATPYSLELRTTQQGTQITLTVS   91 (92)
T ss_pred             CCcEEeccceEEEEEEecCCcEEEEEEe
Confidence            5799999999999986656667888764


No 37 
>PRK04460 nickel responsive regulator; Provisional
Probab=21.31  E-value=2.1e+02  Score=22.16  Aligned_cols=20  Identities=35%  Similarity=0.521  Sum_probs=18.7

Q ss_pred             CCCHHHHHHHHHHHHHcCCC
Q psy16911        143 VLDGPLYDLLMNYLEEKGIS  162 (194)
Q Consensus       143 ~LDe~Lq~~~~~yLeeRGId  162 (194)
                      .||++|.+.|.++.++||.+
T Consensus         8 slp~~ll~~lD~~~~~~gy~   27 (137)
T PRK04460          8 SLDSDLLEKFDELIEEKGYQ   27 (137)
T ss_pred             EeCHHHHHHHHHHHHHcCCC
Confidence            48999999999999999987


No 38 
>PF09868 DUF2095:  Uncharacterized protein conserved in archaea (DUF2095);  InterPro: IPR018662  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=21.02  E-value=1.3e+02  Score=23.13  Aligned_cols=23  Identities=35%  Similarity=0.619  Sum_probs=18.4

Q ss_pred             HHHHHHHHcC-CCHHHHHHHHHhh
Q psy16911        151 LLMNYLEEKG-ISNEFVDKLSNLS  173 (194)
Q Consensus       151 ~~~~yLeeRG-Id~ela~fl~~y~  173 (194)
                      .+.+||+.|| |+.+.|.+|..-.
T Consensus        82 EVInylek~GEIt~e~A~eLr~~L  105 (128)
T PF09868_consen   82 EVINYLEKRGEITPEEAKELRSIL  105 (128)
T ss_pred             HHHHHHHHhCCCCHHHHHHHHHHH
Confidence            3578999999 8999999887543


No 39 
>PRK13654 magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional
Probab=20.97  E-value=79  Score=28.51  Aligned_cols=33  Identities=21%  Similarity=0.232  Sum_probs=26.0

Q ss_pred             ccCCCCCCCHHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911        137 IACLEEVLDGPLYDLLMNYLEEKGISNEFVDKLS  170 (194)
Q Consensus       137 Y~g~~~~LDe~Lq~~~~~yLeeRGId~ela~fl~  170 (194)
                      +.+..+.||+++-..|.+||+ |.+.+||.-||.
T Consensus        69 f~~~~d~l~~e~r~~FidFLe-rSctaEFSGflL  101 (355)
T PRK13654         69 FDQDWDHLDPETRKEFIDFLE-RSCTAEFSGFLL  101 (355)
T ss_pred             hhhchhhCCHHHHHHHHHHHH-HHhhhhhhhHHH
Confidence            334457899999999999995 778888877663


No 40 
>PHA01748 hypothetical protein
Probab=20.78  E-value=2.6e+02  Score=18.37  Aligned_cols=20  Identities=25%  Similarity=0.506  Sum_probs=18.7

Q ss_pred             CCCHHHHHHHHHHHHHcCCC
Q psy16911        143 VLDGPLYDLLMNYLEEKGIS  162 (194)
Q Consensus       143 ~LDe~Lq~~~~~yLeeRGId  162 (194)
                      .||++|...|.+|.+..|++
T Consensus         8 rLp~el~~eld~~a~~~g~~   27 (60)
T PHA01748          8 KIEEDLLELLDRYAIKHGLN   27 (60)
T ss_pred             ECCHHHHHHHHHHHHHhCCC
Confidence            48999999999999999997


No 41 
>PF10437 Lip_prot_lig_C:  Bacterial lipoate protein ligase C-terminus;  InterPro: IPR019491  This is the C-terminal domain of a bacterial lipoate protein ligase. There is no conservation between this C terminus and that of vertebrate lipoate protein ligase C-termini, but both are associated with IPR004143 from INTERPRO, further upstream. This C-terminal domain is more stable than IPR004143 from INTERPRO and the hypothesis is that the C-terminal domain has a role in recognising the lipoyl domain and/or transferring the lipoyl group onto it from the lipoyl-AMP intermediate. C-terminal fragments of length 172 to 193 amino acid residues are observed in the eubacterial enzymes whereas in their archaeal counterparts the C-terminal segment is significantly smaller, ranging in size from 87 to 107 amino acid residues. ; PDB: 1X2G_A 3A7R_A 3A7A_A 1X2H_C 1VQZ_A 3R07_C.
Probab=20.43  E-value=1.4e+02  Score=20.56  Aligned_cols=23  Identities=26%  Similarity=0.336  Sum_probs=18.8

Q ss_pred             CCEEEEEEeeCCeEEEEEEEeCC
Q psy16911         37 GPEVSLEKTVGSEKIEIHFNVNH   59 (194)
Q Consensus        37 g~~v~L~r~~~~E~I~V~f~v~~   59 (194)
                      .-.+.+++.|...+|.|.++|.+
T Consensus         4 ~f~~~~~~rf~~G~v~v~~~V~~   26 (86)
T PF10437_consen    4 EFTFSKERRFPWGTVEVHLNVKN   26 (86)
T ss_dssp             CESEEEEEEETTEEEEEEEEEET
T ss_pred             CCcEeeeeEcCCceEEEEEEEEC
Confidence            34678899999999999998873


No 42 
>PF01316 Arg_repressor:  Arginine repressor, DNA binding domain;  InterPro: IPR020900 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes, an example of the latter being Giardia lamblia (Giardia intestinalis) []. The three-enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 10% of total cell protein []. Most prokaryotic arginine deiminase pathways are under the control of a repressor gene, termed ArgR []. This is a negative regulator, and will only release the arginine deiminase operon for expression in the presence of arginine []. The crystal structure of apo-ArgR from Bacillus stearothermophilus has been determined to 2.5A by means of X-ray crystallography []. The protein exists as a hexamer of identical subunits, and is shown to have six DNA-binding domains, clustered around a central oligomeric core when bound to arginine. It predominantly interacts with A.T residues in ARG boxes. This hexameric protein binds DNA at its N terminus to repress arginine biosyntheis or activate arginine catabolism. Some species have several ArgR paralogs. In a neighbour-joining tree, some of these paralogous sequences show long branches and differ significantly from the well-conserved C-terminal region. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0006525 arginine metabolic process; PDB: 1AOY_A 3V4G_A 3LAJ_D 3FHZ_A 3LAP_B 3ERE_D 2P5L_C 1F9N_D 2P5K_A 1B4A_A ....
Probab=20.14  E-value=1e+02  Score=21.19  Aligned_cols=17  Identities=35%  Similarity=0.452  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHcCCCHHH
Q psy16911        149 YDLLMNYLEEKGISNEF  165 (194)
Q Consensus       149 q~~~~~yLeeRGId~el  165 (194)
                      |..|.++|+++||.-.-
T Consensus        22 Q~eL~~~L~~~Gi~vTQ   38 (70)
T PF01316_consen   22 QEELVELLEEEGIEVTQ   38 (70)
T ss_dssp             HHHHHHHHHHTT-T--H
T ss_pred             HHHHHHHHHHcCCCcch
Confidence            77889999999998543


No 43 
>TIGR02029 AcsF magnesium-protoporphyrin IX monomethyl ester aerobic oxidative cyclase. This model respresents the oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under aerobic conditions. This enzyme is believed to utilize a binuclear iron center and molecular oxygen. There are two isoforms of this enzyme in some plants and cyanobacterai which are differentially regulated based on the levels of copper and oxygen. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under aerobic conditions (a separate enzyme, BchE, acts under anaerobic conditions). This enzyme is found in plants, cyanobacteria and other photosynthetic bacteria.
Probab=20.10  E-value=87  Score=28.05  Aligned_cols=31  Identities=29%  Similarity=0.422  Sum_probs=25.2

Q ss_pred             CCCCCCCHHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911        139 CLEEVLDGPLYDLLMNYLEEKGISNEFVDKLS  170 (194)
Q Consensus       139 g~~~~LDe~Lq~~~~~yLeeRGId~ela~fl~  170 (194)
                      +..+.||+++-..|.+||+ |.+.+||.-||.
T Consensus        61 ~~~~~l~~e~r~~FidFLe-rScTaEFSGflL   91 (337)
T TIGR02029        61 QSWEHIDGELRQAFIEFLE-RSCTSEFSGFLL   91 (337)
T ss_pred             cchhhCCHHHHHHHHHHHH-HHhhhhhhhhHH
Confidence            3346799999999999995 788888877763


Done!