Query psy16911
Match_columns 194
No_of_seqs 103 out of 247
Neff 6.6
Searched_HMMs 46136
Date Fri Aug 16 16:50:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16911.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16911hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4024|consensus 100.0 9.9E-48 2.1E-52 314.4 22.1 191 1-194 70-265 (266)
2 PF02330 MAM33: Mitochondrial 100.0 1.4E-44 3E-49 298.6 15.6 170 8-192 1-204 (204)
3 KOG2536|consensus 100.0 3.9E-42 8.6E-47 288.6 19.2 182 2-194 61-263 (263)
4 PF08513 LisH: LisH; InterPro 75.7 5.4 0.00012 22.2 3.3 25 146-170 1-25 (27)
5 TIGR01881 cas_Cmr5 CRISPR-asso 73.7 11 0.00025 28.9 5.7 43 147-189 70-121 (127)
6 KOG4233|consensus 61.7 14 0.00029 26.5 3.6 35 154-188 33-67 (90)
7 PF10929 DUF2811: Protein of u 58.8 9 0.0002 25.6 2.2 32 142-173 4-42 (57)
8 PHA01623 hypothetical protein 54.5 41 0.00088 22.1 4.8 30 143-174 19-48 (56)
9 PHA02360 hypothetical protein 53.2 14 0.00031 25.2 2.4 17 146-162 29-45 (70)
10 PF14265 DUF4355: Domain of un 52.5 44 0.00095 24.9 5.4 40 147-187 69-108 (125)
11 PF00123 Hormone_2: Peptide ho 49.7 31 0.00067 19.6 3.1 22 165-186 6-27 (28)
12 PF13154 DUF3991: Protein of u 49.3 11 0.00023 26.1 1.4 18 154-171 1-19 (77)
13 PF02961 BAF: Barrier to autoi 49.0 28 0.00061 25.3 3.6 34 152-188 31-67 (89)
14 PF08275 Toprim_N: DNA primase 45.0 16 0.00036 27.7 2.0 18 150-167 12-29 (128)
15 PF13376 OmdA: Bacteriocin-pro 43.9 55 0.0012 21.6 4.3 39 143-189 5-43 (63)
16 PF07063 DUF1338: Domain of un 37.0 68 0.0015 28.3 4.9 46 142-187 100-180 (302)
17 PF00070 Pyr_redox: Pyridine n 37.0 29 0.00063 23.5 2.1 21 142-162 36-56 (80)
18 PRK11814 cysteine desulfurase 36.8 66 0.0014 30.2 5.0 44 141-188 428-476 (486)
19 PF13794 MiaE_2: tRNA-(MS[2]IO 35.6 31 0.00067 28.4 2.3 45 148-192 51-98 (185)
20 smart00777 Mad3_BUB1_I Mad3/BU 32.2 61 0.0013 24.8 3.4 30 150-179 83-112 (125)
21 COG4003 Uncharacterized protei 31.9 72 0.0016 23.1 3.4 26 151-176 52-78 (98)
22 PF09442 DUF2018: Domain of un 30.4 1.5E+02 0.0032 21.1 4.8 39 148-188 31-76 (81)
23 PF13434 K_oxygenase: L-lysine 29.9 52 0.0011 29.1 3.0 33 151-186 70-105 (341)
24 COG3592 Uncharacterized conser 29.8 38 0.00083 23.5 1.7 15 44-58 9-23 (74)
25 PF04026 SpoVG: SpoVG; InterP 29.5 1.3E+02 0.0029 21.4 4.5 35 86-130 4-40 (84)
26 PF02268 TFIIA_gamma_N: Transc 29.0 1E+02 0.0022 19.9 3.4 30 142-171 9-38 (49)
27 smart00070 GLUCA Glucagon like 28.8 76 0.0016 17.8 2.5 16 170-185 11-26 (27)
28 cd06206 bifunctional_CYPOR The 28.3 70 0.0015 28.6 3.6 41 146-186 79-120 (384)
29 smart00667 LisH Lissencephaly 27.4 1E+02 0.0023 16.5 3.1 27 146-172 4-30 (34)
30 TIGR02793 nikR nickel-responsi 26.9 59 0.0013 25.0 2.5 20 143-162 6-25 (129)
31 PF12122 DUF3582: Protein of u 26.7 62 0.0013 23.8 2.5 21 145-165 9-29 (101)
32 PF10281 Ish1: Putative stress 22.8 70 0.0015 18.9 1.8 15 150-164 7-21 (38)
33 PRK13031 preprotein translocas 22.2 1.7E+02 0.0037 23.3 4.3 30 85-124 58-88 (149)
34 PF08311 Mad3_BUB1_I: Mad3/BUB 21.8 80 0.0017 23.8 2.4 29 151-179 84-112 (126)
35 PF08747 DUF1788: Domain of un 21.7 69 0.0015 24.5 2.0 17 147-163 5-21 (126)
36 PHA03416 hypothetical E4 prote 21.4 1.9E+02 0.0042 21.0 4.1 28 29-56 64-91 (92)
37 PRK04460 nickel responsive reg 21.3 2.1E+02 0.0046 22.2 4.7 20 143-162 8-27 (137)
38 PF09868 DUF2095: Uncharacteri 21.0 1.3E+02 0.0028 23.1 3.2 23 151-173 82-105 (128)
39 PRK13654 magnesium-protoporphy 21.0 79 0.0017 28.5 2.4 33 137-170 69-101 (355)
40 PHA01748 hypothetical protein 20.8 2.6E+02 0.0056 18.4 5.1 20 143-162 8-27 (60)
41 PF10437 Lip_prot_lig_C: Bacte 20.4 1.4E+02 0.0031 20.6 3.3 23 37-59 4-26 (86)
42 PF01316 Arg_repressor: Argini 20.1 1E+02 0.0022 21.2 2.4 17 149-165 22-38 (70)
43 TIGR02029 AcsF magnesium-proto 20.1 87 0.0019 28.0 2.5 31 139-170 61-91 (337)
No 1
>KOG4024|consensus
Probab=100.00 E-value=9.9e-48 Score=314.45 Aligned_cols=191 Identities=42% Similarity=0.655 Sum_probs=176.5
Q ss_pred ChHHHHHHHHHHHHHHhcCCCCCCCCCCCCcEEEeCCCEEEEEEeeCCeEEEEEEEeCCCCCCCCCCCCCCCCCCc-ccc
Q psy16911 1 EKELFEFLSEEIQAEKKIQKSKSIPSEIDGFQVKLDGPEVSLEKTVGSEKIEIHFNVNHSVDADAEPQIDPNMDEP-EIE 79 (194)
Q Consensus 1 ~~~L~~~L~~EI~~E~~~~~~~~~p~~~~~f~v~~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~~~~e~~~~~~~~-e~~ 79 (194)
+++|+++|..||+.|+++++...+|..+.||+|+..|++|.|+|+.++|+|.|.||||++++++++.|+++++... ..+
T Consensus 70 ~~alv~fl~~Ei~~erk~qkgkt~Pkt~~Gf~v~l~gaev~Ltkk~~~Eki~V~FNvNhsvd~~e~~e~ePsq~~~~~pe 149 (266)
T KOG4024|consen 70 QQALVRFLEAEIQLERKNQKGKTAPKTFAGFQVTLKGAEVRLTKKNGSEKILVVFNVNHSVDMDEGFEDEPSQAVAPVPE 149 (266)
T ss_pred HHHHHHHHHHHHHHHHHhhcCCccCccccceEEEecCceeEEeeccCCceEEEEEecCCccCcccCCccCccccCCCCcc
Confidence 4799999999999999999999999999999999999999999999999999999999999998877777776643 348
Q ss_pred cCCCCceEEEEEeCCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCC----CCCcccccCCCCCCCHHHHHHHHHH
Q psy16911 80 MKSKPSFEVNFIKGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYERE----NGKTTTIACLEEVLDGPLYDLLMNY 155 (194)
Q Consensus 80 ~~~~~~f~V~I~K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~----~~~~~~Y~g~~~~LDe~Lq~~~~~y 155 (194)
++|.|+|+|.|.||+++|.|+|++.++|. |++++++.|.+..+..+.+. |++. +|+..++.||+.||++|++|
T Consensus 150 ~tS~P~F~VeiiKG~~~L~f~c~~~~~e~--QedEy~~~~~~~~~a~f~~~gn~~~~d~-vYa~~gd~lD~~LyDlL~~~ 226 (266)
T KOG4024|consen 150 ATSMPPFTVEIIKGDQRLCFHCELVPVED--QEDEYDFRVEEFYVAPFAKNGNEDVPDE-VYASSGDYLDPDLYDLLFVY 226 (266)
T ss_pred cccCCCeEEEEEeCCeeEEEEeecCCccc--CcccccceeeeEEeeecccCCCccCCcc-eeeccCcccCHHHHHHHHHH
Confidence 89999999999999999999999999987 46788988888888888655 4577 99999999999999999999
Q ss_pred HHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhhcC
Q psy16911 156 LEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFTSQ 194 (194)
Q Consensus 156 LeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv~~ 194 (194)
|++|||+++||.-|++|++.+||++||.+|+++|+|+++
T Consensus 227 LeerG~d~~Fa~~Lv~laTa~EH~~YIglLeklkkF~~~ 265 (266)
T KOG4024|consen 227 LEERGLDARFAKTLVALATAYEHSQYIGLLEKLKKFISK 265 (266)
T ss_pred HHHcCccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhcC
Confidence 999999999999999999999999999999999999974
No 2
>PF02330 MAM33: Mitochondrial glycoprotein; InterPro: IPR003428 This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q.; GO: 0005759 mitochondrial matrix; PDB: 3QV0_A 1YQF_F 3JV1_A 1P32_A 3RPX_A.
Probab=100.00 E-value=1.4e-44 Score=298.57 Aligned_cols=170 Identities=37% Similarity=0.537 Sum_probs=128.3
Q ss_pred HHHHHHHHHhcCCC-----CCCCCCCCCcEEE--eCCCEEEEEEeeCCeEEEEEEEeCCCCCCC--CCCCCCC----CCC
Q psy16911 8 LSEEIQAEKKIQKS-----KSIPSEIDGFQVK--LDGPEVSLEKTVGSEKIEIHFNVNHSVDAD--AEPQIDP----NMD 74 (194)
Q Consensus 8 L~~EI~~E~~~~~~-----~~~p~~~~~f~v~--~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~--~~~e~~~----~~~ 74 (194)
|++||++|++++.. ..+|+ .+|+++ .|+++|+|+|+||+|+|+|+|||++++++. ++++.+. +.+
T Consensus 1 L~~EI~~E~~~~~~~~~~~~~l~~--~~f~i~d~~g~~~V~L~r~~~~E~I~V~f~v~~~~~~~~~~~~~~~~~~e~~~~ 78 (204)
T PF02330_consen 1 LKSEIKYEKENYESPDSLKEFLPP--SPFEIEDKPGGSEVTLTREFGGEKIRVTFNVNDSVDPPFDEDEDEEEGEEEEEE 78 (204)
T ss_dssp HHHHHHHHHHHCT-HHHHCCHHHH--TTTCEEEECCTTEEEEEEEETTEEEEEEEECCCH--------------------
T ss_pred ChHHHHHHHHcccccHHHHhhCCC--CCCEEEEcCCCcEEEEEEecCCcEEEEEEECCccCCCCCCccccchhccccccc
Confidence 78999999999872 23444 456665 567899999999999999999999876332 1111111 000
Q ss_pred --Ccc---ccc-----CCCCceEEEEEeCCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecC----------CCCCc
Q psy16911 75 --EPE---IEM-----KSKPSFEVNFIKGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYER----------ENGKT 134 (194)
Q Consensus 75 --~~e---~~~-----~~~~~f~V~I~K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~----------~~~~~ 134 (194)
..+ ..+ .+.+.|+|.|+|++++|.|.|++.+ |.|.|++|+++.. +| +
T Consensus 79 ~~~~~~fe~~~~~~~~~~~~~~~v~~~k~~~~l~f~~~~~~-----------d~~~I~~v~~~~~~~~~~~~~~~e~--~ 145 (204)
T PF02330_consen 79 EISEPDFEVEIPKEDEVSFPFFVVVISKPGDGLEFDCSAQD-----------DEFSIDNVSFYPDGDSTKDSAEAEW--E 145 (204)
T ss_dssp ------CEEECCCCCCCEEEEEEEEEECTCEEEEEEEECCT-----------TEEEEEEEEEESSHHHH--SHHHHH--H
T ss_pred ccccccchhcccccccccceeEEEEEEcCCCceEEeEeccC-----------ceEEEEEEEEecCccccccccchhc--c
Confidence 000 011 1235688999999888999999954 4499999999853 44 3
Q ss_pred ccccCC-CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhh
Q psy16911 135 TTIACL-EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFT 192 (194)
Q Consensus 135 ~~Y~g~-~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv 192 (194)
..|.|| |++||++||++|++||++||||++||+||++||++|||++|++||++|++||
T Consensus 146 ~~y~gp~f~~LDe~Lq~~~~~yLeeRGId~~la~fl~~y~~~kEq~eYi~wL~~l~~Fv 204 (204)
T PF02330_consen 146 DAYYGPDFDELDENLQDAFMNYLEERGIDEELANFLHDYSTDKEQREYIRWLKNLKKFV 204 (204)
T ss_dssp CSS-ECEGGGSBHHHHHHHHHHHHHTT-SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccCCccccCCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 479997 9999999999999999999999999999999999999999999999999997
No 3
>KOG2536|consensus
Probab=100.00 E-value=3.9e-42 Score=288.62 Aligned_cols=182 Identities=25% Similarity=0.360 Sum_probs=144.8
Q ss_pred hHHHHHHHHHHHHHHhcCCCCC-----CCCCCCCcEEE--eCCCEEEEEEeeCCeEEEEEEEeCCCCCCCCCCCCC--CC
Q psy16911 2 KELFEFLSEEIQAEKKIQKSKS-----IPSEIDGFQVK--LDGPEVSLEKTVGSEKIEIHFNVNHSVDADAEPQID--PN 72 (194)
Q Consensus 2 ~~L~~~L~~EI~~E~~~~~~~~-----~p~~~~~f~v~--~~g~~v~L~r~~~~E~I~V~f~v~~~~~~~~~~e~~--~~ 72 (194)
..|++.+.+||+++++.....+ .|.....|.+. +|+++|+|+|+|+||+|+|.+++...+..+.+++.+ ..
T Consensus 61 ~~~~r~i~~EI~s~~~~~~~~~~~~~~~~~~~~pf~ied~PG~q~V~L~r~~~GE~i~V~v~v~~~~~~~~~e~~de~e~ 140 (263)
T KOG2536|consen 61 QNLLRVILSEIKSEKKHDPPHQVLEEEIPLSKFPFEIEDRPGEQWVVLTRKFEGETIKVEVDVTQLVTDEPEEDEDETEE 140 (263)
T ss_pred cchHHHHHHHHHHhhhcCCchhhhhhccccCCCCcceecCCCceEEEEEeecCCceEEEEEeeehhccCCcchhcccccc
Confidence 4689999999999999876544 45444668877 455899999999999999999998654432211111 00
Q ss_pred CC-----Cc--c-cccCCCCceEEEEEeC-CceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCCC--CCcccccCC-
Q psy16911 73 MD-----EP--E-IEMKSKPSFEVNFIKG-NKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYEREN--GKTTTIACL- 140 (194)
Q Consensus 73 ~~-----~~--e-~~~~~~~~f~V~I~K~-~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~~--~~~~~Y~g~- 140 (194)
++ ++ + .+..+.++|.|+|+|+ +.+|.|.|+++||+ |.|++++++.+.- ..+.+|.||
T Consensus 141 ~~~~~~~~~~~~~~~~~~~iplvV~vsK~~~~sLef~cta~pD~-----------i~Idslsv~~~~d~~~~~l~Y~GP~ 209 (263)
T KOG2536|consen 141 EDEDEDQEDDDPLSDNISNIPLVVTVSKGDGVSLEFLCTAFPDE-----------IVIDSLSVRPPIDASEDQLAYEGPS 209 (263)
T ss_pred ccccccccccccCCCcceeeEEEEEEecCCCceEEEEeecCCCc-----------eEEeeeEecCCCCccchhhhccCCC
Confidence 00 00 1 1344668899999995 68999999998754 9999999985432 233599996
Q ss_pred CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHhhhhcC
Q psy16911 141 EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQKFTSQ 194 (194)
Q Consensus 141 ~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk~Fv~~ 194 (194)
|.+||++||++||+|||+|||+++|+.||++||.+|+++||++||++||+||++
T Consensus 210 F~eLDe~Lqd~fh~fLEeRGI~esl~~FL~~ym~~Kd~rEYl~WlksvK~Fv~~ 263 (263)
T KOG2536|consen 210 FEELDEELQDSFHRFLEERGIKESLASFLHAYMKNKDSREYLRWLKSVKSFVEQ 263 (263)
T ss_pred hhHHhHHHHHHHHHHHHHcCCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHHhcC
Confidence 999999999999999999999999999999999999999999999999999975
No 4
>PF08513 LisH: LisH; InterPro: IPR013720 The LisH motif is found in a large number of eukaryotic proteins, from metazoa, fungi and plants that have a wide range of functions. The recently solved structure of the LisH domain in the N-terminal region of LIS1 depicted it as a novel dimerization motif, and that other structural elements are likely to play an important role in dimerisation [, , ]. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex []. ; PDB: 2XTE_L 2XTC_B 2XTD_A 1UUJ_B.
Probab=75.66 E-value=5.4 Score=22.25 Aligned_cols=25 Identities=24% Similarity=0.313 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911 146 GPLYDLLMNYLEEKGISNEFVDKLS 170 (194)
Q Consensus 146 e~Lq~~~~~yLeeRGId~ela~fl~ 170 (194)
++|..++++||.++|-......|..
T Consensus 1 ~~Ln~lI~~YL~~~Gy~~tA~~f~~ 25 (27)
T PF08513_consen 1 EELNQLIYDYLVENGYKETAKAFAK 25 (27)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHH
T ss_pred CHHHHHHHHHHHHCCcHHHHHHHHh
Confidence 4678899999999999998888764
No 5
>TIGR01881 cas_Cmr5 CRISPR-associated protein, Cmr5 family. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This family, represented by TM1791.1 of Thermotoga maritima, is found in both archaeal and bacterial species as part of the 6-gene CRISPR RAMP module.
Probab=73.67 E-value=11 Score=28.91 Aligned_cols=43 Identities=19% Similarity=0.393 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHcCCCHH---HHHHHHH-----hh-hhHHHHHHHHHHHHHh
Q psy16911 147 PLYDLLMNYLEEKGISNE---FVDKLSN-----LS-TGYEHASYINLLSEIQ 189 (194)
Q Consensus 147 ~Lq~~~~~yLeeRGId~e---la~fl~~-----y~-~~kEq~EYi~wL~~vk 189 (194)
-|+.++..||.+||+-+. +...|.+ |+ ...|=-+|..||+.+.
T Consensus 70 ~l~~~l~~~L~~~~~~~~~~~l~~~il~~~~~~y~~~T~eaL~~l~WLKRfA 121 (127)
T TIGR01881 70 KYYAHILYWLKERELVDKFKNALKELMEKDYSDSRIATEEALRYLNWLKRLA 121 (127)
T ss_pred HHHHHHHHHHHHcccccchHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHH
Confidence 488999999999997654 3333322 32 3346677888887753
No 6
>KOG4233|consensus
Probab=61.75 E-value=14 Score=26.50 Aligned_cols=35 Identities=17% Similarity=0.117 Sum_probs=26.2
Q ss_pred HHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHH
Q psy16911 154 NYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEI 188 (194)
Q Consensus 154 ~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~v 188 (194)
.-|.++|+|-...-+=.=+...|...++..||+..
T Consensus 33 ~~L~~~GfdkAYvllGQfLllkKdE~lF~~Wlk~~ 67 (90)
T KOG4233|consen 33 IKLVDAGFDKAYVLLGQFLLLKKDEDLFQEWLKET 67 (90)
T ss_pred hhHHhccccHHHHHHHHHHHhcccHHHHHHHHHHH
Confidence 44778999987554444456789999999999864
No 7
>PF10929 DUF2811: Protein of unknown function (DUF2811); InterPro: IPR021231 This is a bacterial family of uncharacterised proteins.
Probab=58.82 E-value=9 Score=25.56 Aligned_cols=32 Identities=9% Similarity=0.141 Sum_probs=26.5
Q ss_pred CCCCHHHHHHHHHHHHH-------cCCCHHHHHHHHHhh
Q psy16911 142 EVLDGPLYDLLMNYLEE-------KGISNEFVDKLSNLS 173 (194)
Q Consensus 142 ~~LDe~Lq~~~~~yLee-------RGId~ela~fl~~y~ 173 (194)
.++++.|++++.+||+. |=+.+.||.||.+-.
T Consensus 4 aeiPe~L~~~m~~fie~hP~WDQ~Rl~~aALa~FL~QnG 42 (57)
T PF10929_consen 4 AEIPEDLHQAMKDFIETHPNWDQYRLFQAALAGFLLQNG 42 (57)
T ss_pred ccccHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcC
Confidence 46899999999999985 678888999987643
No 8
>PHA01623 hypothetical protein
Probab=54.48 E-value=41 Score=22.08 Aligned_cols=30 Identities=13% Similarity=0.223 Sum_probs=25.3
Q ss_pred CCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhh
Q psy16911 143 VLDGPLYDLLMNYLEEKGISNEFVDKLSNLST 174 (194)
Q Consensus 143 ~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~ 174 (194)
.||++|...|..|.+..|++ .+++|.+-..
T Consensus 19 rldeel~~~Ld~y~~~~g~~--rSe~IreAI~ 48 (56)
T PHA01623 19 YMDKDLKTRLKVYCAKNNLQ--LTQAIEEAIK 48 (56)
T ss_pred EeCHHHHHHHHHHHHHcCCC--HHHHHHHHHH
Confidence 49999999999999999998 7777765543
No 9
>PHA02360 hypothetical protein
Probab=53.23 E-value=14 Score=25.22 Aligned_cols=17 Identities=41% Similarity=0.729 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHcCCC
Q psy16911 146 GPLYDLLMNYLEEKGIS 162 (194)
Q Consensus 146 e~Lq~~~~~yLeeRGId 162 (194)
+-||..+++|-|+|||+
T Consensus 29 PklY~~i~k~YEe~gid 45 (70)
T PHA02360 29 PKLYKKIRKYYEEEGID 45 (70)
T ss_pred HHHHHHHHHHHHHcCCc
Confidence 67999999999999997
No 10
>PF14265 DUF4355: Domain of unknown function (DUF4355)
Probab=52.46 E-value=44 Score=24.94 Aligned_cols=40 Identities=25% Similarity=0.253 Sum_probs=30.8
Q ss_pred HHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHH
Q psy16911 147 PLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSE 187 (194)
Q Consensus 147 ~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~ 187 (194)
+|.....+.|.+.|++.+|+.+|. ..+..+-...+..|..
T Consensus 69 e~~~~~~~~l~e~GLp~~l~~~l~-g~d~e~~~~~v~~l~~ 108 (125)
T PF14265_consen 69 ELRSEAKKVLAEKGLPAELADFLV-GDDEEETKANVKALKK 108 (125)
T ss_pred HHHHHHHHHHHHcCCCHHHHHHHh-CCCHHHHHHHHHHHHH
Confidence 456777889999999999999996 5666666666666654
No 11
>PF00123 Hormone_2: Peptide hormone; InterPro: IPR000532 A number of polypeptidic hormones, mainly expressed in the intestine or the pancreas, belong to a group of structurally related peptides [, ]. Once such hormone, glucagon is widely distributed and produced in the alpha-cells of pancreatic islets []. It affects glucose metabolism in the liver [] by inhibiting glycogen synthesis, stimulating glycogenolysis and enchancing gluconeogenesis. It also increases mobilisation of glucose, free fatty acids and ketone bodies, which are metabolites produced in excess in diabetes mellitus. Glucagon is produced, like other peptide hormones, as part of a larger precursor (preproglucagon), which is cleaved to produce glucagon, glucagon-like protein I and glucagon-like protein II []. The structure of glucagon itself is fully conserved in all known mammalian species []. Other members of the structurally similar group include glicentin precursor, secretin, gastric inhibitory protein, vasoactive intestinal peptide (VIP), prealbumin, peptide HI-27 and growth hormone releasing factor.; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1T5Q_A 2OBU_A 2QKH_B 2L70_A 2L71_A 2B4N_A 1GCN_A 1D0R_A 3IOL_B 2RRI_A ....
Probab=49.69 E-value=31 Score=19.62 Aligned_cols=22 Identities=14% Similarity=0.264 Sum_probs=14.5
Q ss_pred HHHHHHHhhhhHHHHHHHHHHH
Q psy16911 165 FVDKLSNLSTGYEHASYINLLS 186 (194)
Q Consensus 165 la~fl~~y~~~kEq~EYi~wL~ 186 (194)
|.+-...|....--|+|++||-
T Consensus 6 ftsdys~~L~~~aak~fl~~L~ 27 (28)
T PF00123_consen 6 FTSDYSKYLDQLAAKKFLQWLM 27 (28)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHC
T ss_pred ccHHHHHHHHHHHHHHHHHHHh
Confidence 4444555555566689999984
No 12
>PF13154 DUF3991: Protein of unknown function (DUF3991)
Probab=49.34 E-value=11 Score=26.10 Aligned_cols=18 Identities=50% Similarity=0.818 Sum_probs=14.7
Q ss_pred HHHHH-cCCCHHHHHHHHH
Q psy16911 154 NYLEE-KGISNEFVDKLSN 171 (194)
Q Consensus 154 ~yLee-RGId~ela~fl~~ 171 (194)
+||.+ |||+.++.+.+..
T Consensus 1 ~YL~~~RgI~~~~v~~~~~ 19 (77)
T PF13154_consen 1 AYLTEERGIDPEIVDAFIN 19 (77)
T ss_pred CchhhhcCcCHHHHHHHHH
Confidence 37776 9999999988865
No 13
>PF02961 BAF: Barrier to autointegration factor; InterPro: IPR004122 Barrier-to-autointegration factor (BAF) is an essential protein that is highly conserved in metazoan evolution, and which may act as a DNA-bridging protein []. BAF binds directly to double-stranded DNA, to transcription activators, and to inner nuclear membrane proteins, including lamin A filament proteins that anchor nuclear-pore complexes in place, and nuclear LEM-domain proteins that bind to laminins filaments and chromatin. New findings suggest that BAF has structural roles in nuclear assembly and chromatin organisation, represses gene expression and might interlink chromatin structure, nuclear architecture and gene regulation in metazoans []. BAF can be exploited by retroviruses to act as a host component of pre-integration complexes, which promote the integration of the retroviral DNA into the host chromosome by preventing autointegration of retroviral DNA []. BAF might contribute to the assembly or activity of retroviral pre-integration complexes through direct binding to the retroviral proteins p55 Gag and matrix, as well as to DNA.; GO: 0003677 DNA binding; PDB: 2ODG_A 2BZF_A 2EZX_B 2EZY_B 1QCK_B 1CI4_B 2EZZ_B.
Probab=49.00 E-value=28 Score=25.28 Aligned_cols=34 Identities=24% Similarity=0.280 Sum_probs=22.9
Q ss_pred HHHHHHHcCCCHH---HHHHHHHhhhhHHHHHHHHHHHHH
Q psy16911 152 LMNYLEEKGISNE---FVDKLSNLSTGYEHASYINLLSEI 188 (194)
Q Consensus 152 ~~~yLeeRGId~e---la~fl~~y~~~kEq~EYi~wL~~v 188 (194)
+-.=|+++|++-. |++|| ...|.+..+..||++.
T Consensus 31 lg~~L~~~GfdKAy~vLGqfL---ll~kde~~F~~WLk~~ 67 (89)
T PF02961_consen 31 LGKRLEEKGFDKAYVVLGQFL---LLKKDEELFQDWLKDT 67 (89)
T ss_dssp HHHHHHHTT--BHHHHHHHHH---HTTT-HHHHHHHHHHH
T ss_pred HHHHHHHCCCcHHHHHhhhhh---hccCcHHHHHHHHHHH
Confidence 4456889999874 34444 5788899999999974
No 14
>PF08275 Toprim_N: DNA primase catalytic core, N-terminal domain; InterPro: IPR013264 This is the N-terminal, catalytic core domain of DNA primases. DNA primase (2.7.7 from EC) is a nucleotidyltransferase which synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork. It can also prime the leading strand and has been implicated in cell division []. ; PDB: 1EQN_E 1DD9_A 3B39_B 1DDE_A 2AU3_A.
Probab=44.97 E-value=16 Score=27.73 Aligned_cols=18 Identities=28% Similarity=0.665 Sum_probs=13.2
Q ss_pred HHHHHHHHHcCCCHHHHH
Q psy16911 150 DLLMNYLEEKGISNEFVD 167 (194)
Q Consensus 150 ~~~~~yLeeRGId~ela~ 167 (194)
..-.+||..|||+.+++.
T Consensus 12 ~~a~~YL~~Rgl~~e~i~ 29 (128)
T PF08275_consen 12 KEALEYLKKRGLSDETIK 29 (128)
T ss_dssp HHHHHHHHHTT--HHHHH
T ss_pred HHHHHHHHHcCCCHHHHH
Confidence 347899999999999874
No 15
>PF13376 OmdA: Bacteriocin-protection, YdeI or OmpD-Associated
Probab=43.90 E-value=55 Score=21.57 Aligned_cols=39 Identities=15% Similarity=0.146 Sum_probs=27.8
Q ss_pred CCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHHHHHHHHHHHHh
Q psy16911 143 VLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEHASYINLLSEIQ 189 (194)
Q Consensus 143 ~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq~EYi~wL~~vk 189 (194)
.++++|+.+|.. +.....|-..++ .-.|+.|+.|+...|
T Consensus 5 ~vP~dl~~aL~~-------~p~a~~~f~~l~-~~~rr~~i~wi~~AK 43 (63)
T PF13376_consen 5 EVPEDLEAALEA-------NPEAKEFFESLT-PSYRREYIRWINSAK 43 (63)
T ss_pred CCCHHHHHHHHC-------CHHHHHHHHHCC-HHHHHHHHHHHHHcC
Confidence 577888888764 455556655554 456999999998755
No 16
>PF07063 DUF1338: Domain of unknown function (DUF1338); InterPro: IPR009770 This domain is found in a variety of bacterial and fungal hypothetical proteins of unknown function. The structure of this domain has been solved by structural genomics. The structure implies a zinc-binding function, so it is a putative metal hydrolase (information derived from TOPSAN for PDB:3iuz).; PDB: 3LHO_A 3IUZ_A 2RJB_C.
Probab=37.05 E-value=68 Score=28.32 Aligned_cols=46 Identities=17% Similarity=0.242 Sum_probs=33.2
Q ss_pred CCCCHHHHHHHHHHHHHcCCCHHHHHH-----------------------------------HHHhhhhHHHHHHHHHHH
Q psy16911 142 EVLDGPLYDLLMNYLEEKGISNEFVDK-----------------------------------LSNLSTGYEHASYINLLS 186 (194)
Q Consensus 142 ~~LDe~Lq~~~~~yLeeRGId~ela~f-----------------------------------l~~y~~~kEq~EYi~wL~ 186 (194)
+.|.+.+|+.+.+||..|.+-..-+.- ..+|-.-++.++|..|+-
T Consensus 100 e~ls~~~q~~i~~~l~~r~~~~~~~~~~l~~~~~~~~l~~~~a~~~v~~~~~~~~W~~~~ps~~~Y~~L~~eSe~aAWi~ 179 (302)
T PF07063_consen 100 EELSPELQEIIEKYLSRRDPFSPRPLELLDQAERQGGLSAEQADEFVAEALSGFPWQHRAPSLADYQTLLAESEYAAWIA 179 (302)
T ss_dssp GGSSHHHHHHHHHHHTTS-GCGHHHHHHHHHHHHHSCEBHHHHHHHHHHHGGGCSS-SB-SBHHHHHHHHHH-HHHHHHH
T ss_pred hhcCHHHHHHHHHHHhcccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHHHHcccCCCCCCCCHHHHHHHHHHhHHHHhhc
Confidence 678899999999999777776433222 235777789999999986
Q ss_pred H
Q psy16911 187 E 187 (194)
Q Consensus 187 ~ 187 (194)
-
T Consensus 180 ~ 180 (302)
T PF07063_consen 180 A 180 (302)
T ss_dssp H
T ss_pred c
Confidence 5
No 17
>PF00070 Pyr_redox: Pyridine nucleotide-disulphide oxidoreductase; InterPro: IPR001327 FAD flavoproteins belonging to the family of pyridine nucleotide-disulphide oxidoreductases (glutathione reductase, trypanothione reductase, lipoamide dehydrogenase, mercuric reductase, thioredoxin reductase, alkyl hydroperoxide reductase) share sequence similarity with a number of other flavoprotein oxidoreductases, in particular with ferredoxin-NAD+ reductases involved in oxidative metabolism of a variety of hydrocarbons (rubredoxin reductase, putidaredoxin reductase, terpredoxin reductase, ferredoxin-NAD+ reductase components of benzene 1,2-dioxygenase, toluene 1,2-dioxygenase, chlorobenzene dioxygenase, biphenyl dioxygenase), NADH oxidase and NADH peroxidase [, , ]. Comparison of the crystal structures of human glutathione reductase and Escherichia coli thioredoxin reductase reveals different locations of their active sites, suggesting that the enzymes diverged from an ancestral FAD/NAD(P)H reductase and acquired their disulphide reductase activities independently []. Despite functional similarities, oxidoreductases of this family show no sequence similarity with adrenodoxin reductases [] and flavoprotein pyridine nucleotide cytochrome reductases (FPNCR) []. Assuming that disulphide reductase activity emerged later, during divergent evolution, the family can be referred to as FAD-dependent pyridine nucleotide reductases, FADPNR. To date, 3D structures of glutathione reductase [], thioredoxin reductase [], mercuric reductase [], lipoamide dehydrogenase [], trypanothione reductase [] and NADH peroxidase [] have been solved. The enzymes share similar tertiary structures based on a doubly-wound alpha/beta fold, but the relative orientations of their FAD- and NAD(P)H-binding domains may vary significantly. By contrast with the FPNCR family, the folds of the FAD- and NAD(P)H-binding domains are similar, suggesting that the domains evolved by gene duplication []. This entry describes a small NADH binding domain within a larger FAD binding domain described by IPR023753 from INTERPRO. It is found in both class I and class II oxidoreductases. ; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZKQ_A 3DGZ_A 1ZDL_A 2R9Z_B 2RAB_A 2A87_B 1M6I_A 2YVG_A 2GR1_A 2GQW_A ....
Probab=36.97 E-value=29 Score=23.45 Aligned_cols=21 Identities=24% Similarity=0.468 Sum_probs=19.5
Q ss_pred CCCCHHHHHHHHHHHHHcCCC
Q psy16911 142 EVLDGPLYDLLMNYLEEKGIS 162 (194)
Q Consensus 142 ~~LDe~Lq~~~~~yLeeRGId 162 (194)
..+|+.+...+.++|+.+||+
T Consensus 36 ~~~~~~~~~~~~~~l~~~gV~ 56 (80)
T PF00070_consen 36 PGFDPDAAKILEEYLRKRGVE 56 (80)
T ss_dssp TTSSHHHHHHHHHHHHHTTEE
T ss_pred hhcCHHHHHHHHHHHHHCCCE
Confidence 579999999999999999986
No 18
>PRK11814 cysteine desulfurase activator complex subunit SufB; Provisional
Probab=36.78 E-value=66 Score=30.19 Aligned_cols=44 Identities=14% Similarity=0.099 Sum_probs=33.2
Q ss_pred CCCCCHHHHHHHHHHHHHcCCCHHHHHHHHHhhhhHHH-----HHHHHHHHHH
Q psy16911 141 EEVLDGPLYDLLMNYLEEKGISNEFVDKLSNLSTGYEH-----ASYINLLSEI 188 (194)
Q Consensus 141 ~~~LDe~Lq~~~~~yLeeRGId~ela~fl~~y~~~kEq-----~EYi~wL~~v 188 (194)
...||++ -.=||..|||+.+-|.-|.-+....|= -++++|+.++
T Consensus 428 vG~idee----qLFYL~SRGi~e~eA~~Llv~gF~~evi~~ip~e~~~e~~~~ 476 (486)
T PRK11814 428 TSKISED----QLFYCRQRGISEEDAVSMIVNGFCKEVFQELPMEFAVEAQKL 476 (486)
T ss_pred cCCCCHH----HHHHHHHcCCCHHHHHHHHHHHHHHHHHHHCCcHHHHHHHHH
Confidence 3557754 345999999999999988887776663 4788888764
No 19
>PF13794 MiaE_2: tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like; PDB: 3EZ0_C.
Probab=35.58 E-value=31 Score=28.39 Aligned_cols=45 Identities=16% Similarity=0.249 Sum_probs=32.4
Q ss_pred HHHHHHHHHHHcCCCHH--HHHHHHHhhhhHHHHHHHHHHHH-Hhhhh
Q psy16911 148 LYDLLMNYLEEKGISNE--FVDKLSNLSTGYEHASYINLLSE-IQKFT 192 (194)
Q Consensus 148 Lq~~~~~yLeeRGId~e--la~fl~~y~~~kEq~EYi~wL~~-vk~Fv 192 (194)
-++.+...|.+||++.. +.-|...+-.+.....=-.|++. ||.||
T Consensus 51 hf~~L~~~l~~~G~d~~~am~pf~~~ld~f~~rT~P~dW~E~LvKaYV 98 (185)
T PF13794_consen 51 HFERLEARLAERGVDPEEAMEPFVGALDAFHARTRPSDWLESLVKAYV 98 (185)
T ss_dssp HHHHHHHHHHHTT--HHHHHGGGHHHHHHHHHTT--SSHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCChHHHHHHHHH
Confidence 46788999999999996 77788888777777777889987 45554
No 20
>smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p.
Probab=32.18 E-value=61 Score=24.82 Aligned_cols=30 Identities=23% Similarity=0.255 Sum_probs=25.8
Q ss_pred HHHHHHHHHcCCCHHHHHHHHHhhhhHHHH
Q psy16911 150 DLLMNYLEEKGISNEFVDKLSNLSTGYEHA 179 (194)
Q Consensus 150 ~~~~~yLeeRGId~ela~fl~~y~~~kEq~ 179 (194)
..+..||..+||-..+|.|=..|+...|.+
T Consensus 83 ~~if~~L~~~~IG~~~AlfYe~~A~~lE~~ 112 (125)
T smart00777 83 RELFQFLYSKGIGTKLALFYEEWAQLLEAA 112 (125)
T ss_pred HHHHHHHHHCCcchhhHHHHHHHHHHHHHc
Confidence 457899999999999999999888776654
No 21
>COG4003 Uncharacterized protein conserved in archaea [Function unknown]
Probab=31.89 E-value=72 Score=23.15 Aligned_cols=26 Identities=31% Similarity=0.474 Sum_probs=18.4
Q ss_pred HHHHHHHHcC-CCHHHHHHHHHhhhhH
Q psy16911 151 LLMNYLEEKG-ISNEFVDKLSNLSTGY 176 (194)
Q Consensus 151 ~~~~yLeeRG-Id~ela~fl~~y~~~k 176 (194)
.+.+||+.|| |+.+.|.+|..-..+|
T Consensus 52 Eii~yleKrGEi~~E~A~~L~~~~~~k 78 (98)
T COG4003 52 EIINYLEKRGEITPEMAKALRVTLVHK 78 (98)
T ss_pred HHHHHHHHhCCCCHHHHHHHHhhHHHH
Confidence 4568888888 7888888876544443
No 22
>PF09442 DUF2018: Domain of unknown function (DUF2018); InterPro: IPR018563 This entry represents proteins of unknown function. The crystal structure of HP0242, a hypothetical protein from Helicobacter pylori (Campylobacter pylori) has been determined. It reveals an acid-adaptive protein possibly of physiological significance when H. pylori colonises the human stomach. The protein adopts a unique four alpha-helical triangular conformations. The biologically active form is thought to be a tetramer. The gene exist in an operon along with 6 other genes where the gene products appear to be related to iron storage and haem biosynthesis []. ; PDB: 3MLG_B 3MLI_C 2OUF_A 2BO3_A.
Probab=30.44 E-value=1.5e+02 Score=21.15 Aligned_cols=39 Identities=21% Similarity=0.395 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHcCCCHHHHHHHHHhh-------hhHHHHHHHHHHHHH
Q psy16911 148 LYDLLMNYLEEKGISNEFVDKLSNLS-------TGYEHASYINLLSEI 188 (194)
Q Consensus 148 Lq~~~~~yLeeRGId~ela~fl~~y~-------~~kEq~EYi~wL~~v 188 (194)
-+.++...|+++|++ +-.+|..|. ..+-+.-||....++
T Consensus 31 r~aamE~llEe~g~~--~e~~ik~~~~~~~~~i~~~~~dl~Ie~mg~I 76 (81)
T PF09442_consen 31 RLAAMEELLEENGLD--VEEEIKNFIFENEDKIENRKNDLYIESMGNI 76 (81)
T ss_dssp HHHHHHHHHHHTT-H--HHHHHHGGGT-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCC--HHHHHHHHHhccHHHHHHHhcchHHHHHHHH
Confidence 466788999999999 444444443 445567777776654
No 23
>PF13434 K_oxygenase: L-lysine 6-monooxygenase (NADPH-requiring); PDB: 3S61_B 3S5W_B.
Probab=29.88 E-value=52 Score=29.14 Aligned_cols=33 Identities=21% Similarity=0.436 Sum_probs=25.5
Q ss_pred HHHHHHHHcCCCHHHHHHH---HHhhhhHHHHHHHHHHH
Q psy16911 151 LLMNYLEEKGISNEFVDKL---SNLSTGYEHASYINLLS 186 (194)
Q Consensus 151 ~~~~yLeeRGId~ela~fl---~~y~~~kEq~EYi~wL~ 186 (194)
.|.+||.++| .|..|+ .-+....|=..|++|..
T Consensus 70 sflnYL~~~~---rl~~f~~~~~~~p~R~ef~dYl~Wva 105 (341)
T PF13434_consen 70 SFLNYLHEHG---RLYEFYNRGYFFPSRREFNDYLRWVA 105 (341)
T ss_dssp SHHHHHHHTT----HHHHHHH--SS-BHHHHHHHHHHHH
T ss_pred cHHHHHHHcC---ChhhhhhcCCCCCCHHHHHHHHHHHH
Confidence 5899999998 666666 46678899999999974
No 24
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=29.84 E-value=38 Score=23.53 Aligned_cols=15 Identities=40% Similarity=0.598 Sum_probs=13.8
Q ss_pred EeeCCeEEEEEEEeC
Q psy16911 44 KTVGSEKIEIHFNVN 58 (194)
Q Consensus 44 r~~~~E~I~V~f~v~ 58 (194)
|+|.+|+|-|.||.+
T Consensus 9 r~y~Gekidi~fn~~ 23 (74)
T COG3592 9 RKYRGEKIDIYFNTA 23 (74)
T ss_pred ceeccceEEEEeccc
Confidence 789999999999976
No 25
>PF04026 SpoVG: SpoVG; InterPro: IPR007170 This is a stage V sporulation protein G. It is essential for sporulation and specific to stage V sporulation in Bacillus megaterium and Bacillus subtilis []. In B. subtilis, expression decreases after 30-60 minutes of cold shock [].; GO: 0030435 sporulation resulting in formation of a cellular spore; PDB: 2IA9_F 2I9X_B 2I9Z_A.
Probab=29.50 E-value=1.3e+02 Score=21.37 Aligned_cols=35 Identities=26% Similarity=0.359 Sum_probs=25.4
Q ss_pred eEEEEEe--CCceEEEEEEecCCCCCCCCCCCCCceEEeeeeeecCC
Q psy16911 86 FEVNFIK--GNKTLGFTCSFIPPESEPSEDGYNDLFSIDEVDTYERE 130 (194)
Q Consensus 86 f~V~I~K--~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v~~~e~~ 130 (194)
..|.|.+ +.+.|.--|+.. .+|.|.|.+|.+.+|+
T Consensus 4 tdVri~~~~~~~~lka~asV~----------~dd~f~I~~ikVieg~ 40 (84)
T PF04026_consen 4 TDVRIRKIEPEGKLKAFASVT----------FDDCFVIHDIKVIEGE 40 (84)
T ss_dssp EEEEEEETTSSSSEEEEEEEE----------ETTTEEEEEEEEEEET
T ss_pred EEEEEEEecCCCCEEEEEEEE----------ECCEEEEEeEEEEECC
Confidence 3677877 447777777773 3456999999998764
No 26
>PF02268 TFIIA_gamma_N: Transcription initiation factor IIA, gamma subunit, helical domain; InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=29.01 E-value=1e+02 Score=19.87 Aligned_cols=30 Identities=23% Similarity=0.253 Sum_probs=24.9
Q ss_pred CCCCHHHHHHHHHHHHHcCCCHHHHHHHHH
Q psy16911 142 EVLDGPLYDLLMNYLEEKGISNEFVDKLSN 171 (194)
Q Consensus 142 ~~LDe~Lq~~~~~yLeeRGId~ela~fl~~ 171 (194)
..|=-.|+++|.+...+.-|++.||..+..
T Consensus 9 stlG~aL~dtLDeli~~~~I~p~La~kVL~ 38 (49)
T PF02268_consen 9 STLGIALTDTLDELIQEGKITPQLAMKVLE 38 (49)
T ss_dssp SHHHHHHHHHHHHHHHTTSS-HHHHHHHHH
T ss_pred chHHHHHHHHHHHHHHcCCCCHHHHHHHHH
Confidence 345578999999999999999999998864
No 27
>smart00070 GLUCA Glucagon like hormones.
Probab=28.78 E-value=76 Score=17.83 Aligned_cols=16 Identities=13% Similarity=0.225 Sum_probs=9.6
Q ss_pred HHhhhhHHHHHHHHHH
Q psy16911 170 SNLSTGYEHASYINLL 185 (194)
Q Consensus 170 ~~y~~~kEq~EYi~wL 185 (194)
..|....--+.|++||
T Consensus 11 sk~L~~~~ar~fl~~L 26 (27)
T smart00070 11 SKYLDQLAAKKFLQWL 26 (27)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3344444457788887
No 28
>cd06206 bifunctional_CYPOR These bifunctional proteins fuse N-terminal cytochrome p450 with a cytochrome p450 reductase (CYPOR). NADPH cytochrome p450 reductase serves as an electron donor in several oxygenase systems and is a component of nitric oxide synthases and methionine synthase reductases. CYPOR transfers two electrons from NADPH to the heme of cytochrome p450 via FAD and FMN. Ferredoxin-NADP+ (oxido)reductase is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of these flavoproteins (i.e. having a non-covalently bound FAD as a prosthetic group) are present in chloroplasts, mitochondria, and bacteria in which they participate in a wide variety of redox metabolic pathways. The C-terminal domain contains most of the NADP(H) binding residues and the N-terminal domain interacts non-covalently with the isoalloxazine rings of the flavin molecule which lies largely in a la
Probab=28.30 E-value=70 Score=28.61 Aligned_cols=41 Identities=15% Similarity=0.199 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHcCC-CHHHHHHHHHhhhhHHHHHHHHHHH
Q psy16911 146 GPLYDLLMNYLEEKGI-SNEFVDKLSNLSTGYEHASYINLLS 186 (194)
Q Consensus 146 e~Lq~~~~~yLeeRGI-d~ela~fl~~y~~~kEq~EYi~wL~ 186 (194)
-.|.++|..||.=+|+ ...|-..|..|+++.++++-+..+.
T Consensus 79 ~tl~~~l~~~~Di~~~p~~~~l~~la~~~~~~~~k~~l~~~~ 120 (384)
T cd06206 79 ISVSELLSSYVELSQPATRRQLAALAEATRCPDTKALLERLA 120 (384)
T ss_pred EEHHHHHHhhccccCCCCHHHHHHHHHHCCCHHHHHHHHHhh
Confidence 5899999999999999 8999999999999998887665553
No 29
>smart00667 LisH Lissencephaly type-1-like homology motif. Alpha-helical motif present in Lis1, treacle, Nopp140, some katanin p60 subunits, muskelin, tonneau, LEUNIG and numerous WD40 repeat-containing proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerisation, or else by binding cytoplasmic dynein heavy chain or microtubules directly.
Probab=27.42 E-value=1e+02 Score=16.55 Aligned_cols=27 Identities=19% Similarity=0.258 Sum_probs=22.0
Q ss_pred HHHHHHHHHHHHHcCCCHHHHHHHHHh
Q psy16911 146 GPLYDLLMNYLEEKGISNEFVDKLSNL 172 (194)
Q Consensus 146 e~Lq~~~~~yLeeRGId~ela~fl~~y 172 (194)
..|..++.+||-..|-......|..+.
T Consensus 4 ~~l~~lI~~yL~~~g~~~ta~~l~~e~ 30 (34)
T smart00667 4 SELNRLILEYLLRNGYEETAETLQKES 30 (34)
T ss_pred HHHHHHHHHHHHHcCHHHHHHHHHHHh
Confidence 467888999999999998887776543
No 30
>TIGR02793 nikR nickel-responsive transcriptional regulator NikR. Three members of the seed for this model, from Escherichia coli, Pseudomonas putida, and Brucella melitensis, are found associated with a nickel ABC transporter operon that acts to import nickel for use as a cofactor in urease or hydrogenase. These proteins, with characterized nickel-binding and DNA-binding domains, act as nickel-responsive transcriptional regulators. In the larger family of full-length homologs, most others both lack proximity to the nickel ABC transporter operon and form a separate clade. Several of the homologs not within the scope of this model, but rather scoring between the trusted and noise cutoffs, have been shown to bind nickel, copper, or both, and to regulate genes in response to nickel.
Probab=26.90 E-value=59 Score=25.00 Aligned_cols=20 Identities=25% Similarity=0.448 Sum_probs=18.7
Q ss_pred CCCHHHHHHHHHHHHHcCCC
Q psy16911 143 VLDGPLYDLLMNYLEEKGIS 162 (194)
Q Consensus 143 ~LDe~Lq~~~~~yLeeRGId 162 (194)
.||++|.+.|.++.++||.+
T Consensus 6 slp~~ll~~lD~~~~~~g~~ 25 (129)
T TIGR02793 6 TLDDDLLETLDRLIARRGYQ 25 (129)
T ss_pred EcCHHHHHHHHHHHHHcCCC
Confidence 48999999999999999986
No 31
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=26.71 E-value=62 Score=23.85 Aligned_cols=21 Identities=19% Similarity=0.425 Sum_probs=16.0
Q ss_pred CHHHHHHHHHHHHHcCCCHHH
Q psy16911 145 DGPLYDLLMNYLEEKGISNEF 165 (194)
Q Consensus 145 De~Lq~~~~~yLeeRGId~el 165 (194)
++.+-.+|.+||..+||+..+
T Consensus 9 n~r~AqaF~DYl~sqgI~~~i 29 (101)
T PF12122_consen 9 NPRAAQAFIDYLASQGIELQI 29 (101)
T ss_dssp SHHHHHHHHHHHHHTT--EEE
T ss_pred CHHHHHHHHHHHHHCCCeEEE
Confidence 467889999999999998654
No 32
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=22.78 E-value=70 Score=18.93 Aligned_cols=15 Identities=27% Similarity=0.410 Sum_probs=12.1
Q ss_pred HHHHHHHHHcCCCHH
Q psy16911 150 DLLMNYLEEKGISNE 164 (194)
Q Consensus 150 ~~~~~yLeeRGId~e 164 (194)
+.|.+||.+.||...
T Consensus 7 ~~L~~wL~~~gi~~~ 21 (38)
T PF10281_consen 7 SDLKSWLKSHGIPVP 21 (38)
T ss_pred HHHHHHHHHcCCCCC
Confidence 568899999999643
No 33
>PRK13031 preprotein translocase subunit SecB; Provisional
Probab=22.15 E-value=1.7e+02 Score=23.25 Aligned_cols=30 Identities=7% Similarity=0.305 Sum_probs=19.5
Q ss_pred ceEEEEE-eCCceEEEEEEecCCCCCCCCCCCCCceEEeee
Q psy16911 85 SFEVNFI-KGNKTLGFTCSFIPPESEPSEDGYNDLFSIDEV 124 (194)
Q Consensus 85 ~f~V~I~-K~~~~L~f~c~~~~~e~~~~e~~~~d~f~I~~v 124 (194)
.++|+++ |.++...|-|.. .+.++|.|.++
T Consensus 58 ~L~vtvta~~~~~t~FlvEv----------~qaGIF~i~ni 88 (149)
T PRK13031 58 VLTLEVKVENDGMVAFEAEV----------KQAGIFTVANM 88 (149)
T ss_pred EEEEEEEEEeCCceEEEEEE----------eEeeEEEEcCC
Confidence 4455555 334455588877 35688999887
No 34
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=21.83 E-value=80 Score=23.83 Aligned_cols=29 Identities=17% Similarity=0.226 Sum_probs=25.8
Q ss_pred HHHHHHHHcCCCHHHHHHHHHhhhhHHHH
Q psy16911 151 LLMNYLEEKGISNEFVDKLSNLSTGYEHA 179 (194)
Q Consensus 151 ~~~~yLeeRGId~ela~fl~~y~~~kEq~ 179 (194)
.+.+||..+||-..+|.|=..|+...|++
T Consensus 84 ~if~~l~~~~IG~~~A~fY~~wA~~le~~ 112 (126)
T PF08311_consen 84 EIFKFLYSKGIGTKLALFYEEWAEFLEKR 112 (126)
T ss_dssp HHHHHHHHHTTSTTBHHHHHHHHHHHHHT
T ss_pred HHHHHHHHcCccHHHHHHHHHHHHHHHHc
Confidence 46789999999999999999999887765
No 35
>PF08747 DUF1788: Domain of unknown function (DUF1788); InterPro: IPR014858 This entry represents a putative uncharacterised protein of length around 200 amino acids.
Probab=21.71 E-value=69 Score=24.50 Aligned_cols=17 Identities=41% Similarity=0.901 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHcCCCH
Q psy16911 147 PLYDLLMNYLEEKGISN 163 (194)
Q Consensus 147 ~Lq~~~~~yLeeRGId~ 163 (194)
+|++.+.++|++||+=+
T Consensus 5 nL~~~~l~~l~~~~~~d 21 (126)
T PF08747_consen 5 NLYEIFLEILEERGILD 21 (126)
T ss_pred EHHHHHHHHHHhcChHH
Confidence 69999999999999743
No 36
>PHA03416 hypothetical E4 protein; Provisional
Probab=21.38 E-value=1.9e+02 Score=21.05 Aligned_cols=28 Identities=14% Similarity=0.188 Sum_probs=22.5
Q ss_pred CCcEEEeCCCEEEEEEeeCCeEEEEEEE
Q psy16911 29 DGFQVKLDGPEVSLEKTVGSEKIEIHFN 56 (194)
Q Consensus 29 ~~f~v~~~g~~v~L~r~~~~E~I~V~f~ 56 (194)
..|.|...|..|+|+.+-.|-.|.|++.
T Consensus 64 ~pWTV~tt~~svtlt~t~~GttVtvtlh 91 (92)
T PHA03416 64 GPWTVTATPYSLELRTTQQGTQITLTVS 91 (92)
T ss_pred CCcEEeccceEEEEEEecCCcEEEEEEe
Confidence 5799999999999986656667888764
No 37
>PRK04460 nickel responsive regulator; Provisional
Probab=21.31 E-value=2.1e+02 Score=22.16 Aligned_cols=20 Identities=35% Similarity=0.521 Sum_probs=18.7
Q ss_pred CCCHHHHHHHHHHHHHcCCC
Q psy16911 143 VLDGPLYDLLMNYLEEKGIS 162 (194)
Q Consensus 143 ~LDe~Lq~~~~~yLeeRGId 162 (194)
.||++|.+.|.++.++||.+
T Consensus 8 slp~~ll~~lD~~~~~~gy~ 27 (137)
T PRK04460 8 SLDSDLLEKFDELIEEKGYQ 27 (137)
T ss_pred EeCHHHHHHHHHHHHHcCCC
Confidence 48999999999999999987
No 38
>PF09868 DUF2095: Uncharacterized protein conserved in archaea (DUF2095); InterPro: IPR018662 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=21.02 E-value=1.3e+02 Score=23.13 Aligned_cols=23 Identities=35% Similarity=0.619 Sum_probs=18.4
Q ss_pred HHHHHHHHcC-CCHHHHHHHHHhh
Q psy16911 151 LLMNYLEEKG-ISNEFVDKLSNLS 173 (194)
Q Consensus 151 ~~~~yLeeRG-Id~ela~fl~~y~ 173 (194)
.+.+||+.|| |+.+.|.+|..-.
T Consensus 82 EVInylek~GEIt~e~A~eLr~~L 105 (128)
T PF09868_consen 82 EVINYLEKRGEITPEEAKELRSIL 105 (128)
T ss_pred HHHHHHHHhCCCCHHHHHHHHHHH
Confidence 3578999999 8999999887543
No 39
>PRK13654 magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional
Probab=20.97 E-value=79 Score=28.51 Aligned_cols=33 Identities=21% Similarity=0.232 Sum_probs=26.0
Q ss_pred ccCCCCCCCHHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911 137 IACLEEVLDGPLYDLLMNYLEEKGISNEFVDKLS 170 (194)
Q Consensus 137 Y~g~~~~LDe~Lq~~~~~yLeeRGId~ela~fl~ 170 (194)
+.+..+.||+++-..|.+||+ |.+.+||.-||.
T Consensus 69 f~~~~d~l~~e~r~~FidFLe-rSctaEFSGflL 101 (355)
T PRK13654 69 FDQDWDHLDPETRKEFIDFLE-RSCTAEFSGFLL 101 (355)
T ss_pred hhhchhhCCHHHHHHHHHHHH-HHhhhhhhhHHH
Confidence 334457899999999999995 778888877663
No 40
>PHA01748 hypothetical protein
Probab=20.78 E-value=2.6e+02 Score=18.37 Aligned_cols=20 Identities=25% Similarity=0.506 Sum_probs=18.7
Q ss_pred CCCHHHHHHHHHHHHHcCCC
Q psy16911 143 VLDGPLYDLLMNYLEEKGIS 162 (194)
Q Consensus 143 ~LDe~Lq~~~~~yLeeRGId 162 (194)
.||++|...|.+|.+..|++
T Consensus 8 rLp~el~~eld~~a~~~g~~ 27 (60)
T PHA01748 8 KIEEDLLELLDRYAIKHGLN 27 (60)
T ss_pred ECCHHHHHHHHHHHHHhCCC
Confidence 48999999999999999997
No 41
>PF10437 Lip_prot_lig_C: Bacterial lipoate protein ligase C-terminus; InterPro: IPR019491 This is the C-terminal domain of a bacterial lipoate protein ligase. There is no conservation between this C terminus and that of vertebrate lipoate protein ligase C-termini, but both are associated with IPR004143 from INTERPRO, further upstream. This C-terminal domain is more stable than IPR004143 from INTERPRO and the hypothesis is that the C-terminal domain has a role in recognising the lipoyl domain and/or transferring the lipoyl group onto it from the lipoyl-AMP intermediate. C-terminal fragments of length 172 to 193 amino acid residues are observed in the eubacterial enzymes whereas in their archaeal counterparts the C-terminal segment is significantly smaller, ranging in size from 87 to 107 amino acid residues. ; PDB: 1X2G_A 3A7R_A 3A7A_A 1X2H_C 1VQZ_A 3R07_C.
Probab=20.43 E-value=1.4e+02 Score=20.56 Aligned_cols=23 Identities=26% Similarity=0.336 Sum_probs=18.8
Q ss_pred CCEEEEEEeeCCeEEEEEEEeCC
Q psy16911 37 GPEVSLEKTVGSEKIEIHFNVNH 59 (194)
Q Consensus 37 g~~v~L~r~~~~E~I~V~f~v~~ 59 (194)
.-.+.+++.|...+|.|.++|.+
T Consensus 4 ~f~~~~~~rf~~G~v~v~~~V~~ 26 (86)
T PF10437_consen 4 EFTFSKERRFPWGTVEVHLNVKN 26 (86)
T ss_dssp CESEEEEEEETTEEEEEEEEEET
T ss_pred CCcEeeeeEcCCceEEEEEEEEC
Confidence 34678899999999999998873
No 42
>PF01316 Arg_repressor: Arginine repressor, DNA binding domain; InterPro: IPR020900 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes, an example of the latter being Giardia lamblia (Giardia intestinalis) []. The three-enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 10% of total cell protein []. Most prokaryotic arginine deiminase pathways are under the control of a repressor gene, termed ArgR []. This is a negative regulator, and will only release the arginine deiminase operon for expression in the presence of arginine []. The crystal structure of apo-ArgR from Bacillus stearothermophilus has been determined to 2.5A by means of X-ray crystallography []. The protein exists as a hexamer of identical subunits, and is shown to have six DNA-binding domains, clustered around a central oligomeric core when bound to arginine. It predominantly interacts with A.T residues in ARG boxes. This hexameric protein binds DNA at its N terminus to repress arginine biosyntheis or activate arginine catabolism. Some species have several ArgR paralogs. In a neighbour-joining tree, some of these paralogous sequences show long branches and differ significantly from the well-conserved C-terminal region. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0006525 arginine metabolic process; PDB: 1AOY_A 3V4G_A 3LAJ_D 3FHZ_A 3LAP_B 3ERE_D 2P5L_C 1F9N_D 2P5K_A 1B4A_A ....
Probab=20.14 E-value=1e+02 Score=21.19 Aligned_cols=17 Identities=35% Similarity=0.452 Sum_probs=11.9
Q ss_pred HHHHHHHHHHcCCCHHH
Q psy16911 149 YDLLMNYLEEKGISNEF 165 (194)
Q Consensus 149 q~~~~~yLeeRGId~el 165 (194)
|..|.++|+++||.-.-
T Consensus 22 Q~eL~~~L~~~Gi~vTQ 38 (70)
T PF01316_consen 22 QEELVELLEEEGIEVTQ 38 (70)
T ss_dssp HHHHHHHHHHTT-T--H
T ss_pred HHHHHHHHHHcCCCcch
Confidence 77889999999998543
No 43
>TIGR02029 AcsF magnesium-protoporphyrin IX monomethyl ester aerobic oxidative cyclase. This model respresents the oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under aerobic conditions. This enzyme is believed to utilize a binuclear iron center and molecular oxygen. There are two isoforms of this enzyme in some plants and cyanobacterai which are differentially regulated based on the levels of copper and oxygen. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under aerobic conditions (a separate enzyme, BchE, acts under anaerobic conditions). This enzyme is found in plants, cyanobacteria and other photosynthetic bacteria.
Probab=20.10 E-value=87 Score=28.05 Aligned_cols=31 Identities=29% Similarity=0.422 Sum_probs=25.2
Q ss_pred CCCCCCCHHHHHHHHHHHHHcCCCHHHHHHHH
Q psy16911 139 CLEEVLDGPLYDLLMNYLEEKGISNEFVDKLS 170 (194)
Q Consensus 139 g~~~~LDe~Lq~~~~~yLeeRGId~ela~fl~ 170 (194)
+..+.||+++-..|.+||+ |.+.+||.-||.
T Consensus 61 ~~~~~l~~e~r~~FidFLe-rScTaEFSGflL 91 (337)
T TIGR02029 61 QSWEHIDGELRQAFIEFLE-RSCTSEFSGFLL 91 (337)
T ss_pred cchhhCCHHHHHHHHHHHH-HHhhhhhhhhHH
Confidence 3346799999999999995 788888877763
Done!