Query psy8609
Match_columns 182
No_of_seqs 103 out of 1055
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 23:10:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8609.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8609hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05649 Peptidase_M13_N: Pept 100.0 1.4E-28 3.1E-33 200.8 16.9 175 4-181 154-329 (390)
2 COG3590 PepO Predicted metallo 99.9 6.4E-27 1.4E-31 192.0 14.1 176 2-182 162-340 (654)
3 KOG3624|consensus 99.8 1.7E-18 3.7E-23 151.5 15.8 153 9-182 211-363 (687)
4 PRK08311 putative RNA polymera 52.1 98 0.0021 23.9 7.2 31 101-132 201-231 (237)
5 KOG1076|consensus 46.6 12 0.00026 33.5 1.5 97 79-179 217-324 (843)
6 KOG1088|consensus 43.8 51 0.0011 22.6 3.9 48 75-122 43-93 (124)
7 PF15200 KRTDAP: Keratinocyte 43.7 19 0.0004 22.3 1.7 17 71-87 48-64 (77)
8 PF05044 HPD: Homeo-prospero d 41.8 21 0.00045 25.4 1.9 102 65-181 34-143 (158)
9 PRK11032 hypothetical protein; 38.3 1.1E+02 0.0023 22.3 5.2 107 12-154 3-109 (160)
10 COG5579 Uncharacterized conser 34.5 48 0.001 22.8 2.7 16 121-136 58-73 (143)
11 KOG3932|consensus 28.6 50 0.0011 26.3 2.3 52 121-177 269-320 (357)
12 PF10523 BEN: BEN domain; Int 27.7 1.1E+02 0.0024 18.5 3.5 29 149-179 45-73 (79)
13 PF06543 Lac_bphage_repr: Lact 26.3 39 0.00085 19.1 1.0 23 66-89 4-26 (49)
14 PF02083 Urotensin_II: Urotens 25.5 17 0.00038 14.3 -0.4 6 166-171 7-12 (12)
15 PF12086 DUF3563: Protein of u 25.5 1E+02 0.0022 18.4 2.7 21 33-53 24-44 (59)
16 TIGR02895 spore_sigI RNA polym 23.4 2.4E+02 0.0052 21.5 5.2 15 39-53 120-134 (218)
17 TIGR01834 PHA_synth_III_E poly 22.4 3.4E+02 0.0073 22.2 6.0 40 14-53 269-309 (320)
18 PF11554 DUF3232: Protein of u 22.2 1.7E+02 0.0038 20.8 3.8 51 10-60 19-77 (152)
19 KOG3779|consensus 21.5 4.8E+02 0.01 22.5 6.8 97 66-176 612-715 (737)
20 PF15372 DUF4600: Domain of un 21.2 2E+02 0.0044 20.1 3.9 38 37-74 19-56 (129)
21 COG5552 Uncharacterized conser 20.7 2.1E+02 0.0046 17.9 3.5 23 141-163 15-37 (88)
22 PHA03043 hypothetical protein; 20.6 64 0.0014 22.5 1.4 12 123-134 10-21 (130)
No 1
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=99.96 E-value=1.4e-28 Score=200.84 Aligned_cols=175 Identities=32% Similarity=0.646 Sum_probs=149.5
Q ss_pred cccCCC-cHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHhhccCChHhhcchhhhcCccCHHHHHhhCCCCC
Q psy8609 4 STLAGL-DEKIVRAYYKYMLDIAVLMGADREAAVEELTESLNFEIALANISLPQEERRNATKLYNPMKISELQERYPSIP 82 (182)
Q Consensus 4 ~~~~~~-~~~~~~~y~~~i~~~~~~~g~~~~~~~~~~~~i~~fE~~la~~~~~~~~~~~~~~~~~~~t~~~l~~~~p~i~ 82 (182)
++.++. .......|++++.+++..+|.+...+.+.+++|++||++|+++..+..+++++...|+++|+++|++.+|++|
T Consensus 154 ~~~~~~~~~~~~~~y~~~v~~~l~~~g~~~~~~~~~~~~i~~~e~~L~~~~~~~~~~~~~~~~~~~~tl~eL~~~~p~i~ 233 (390)
T PF05649_consen 154 YYRDPHSSKEYLQAYREYVREVLKLLGVDDDEASKLADDIVKFEKQLAKLSSEDEERRDPQKLYNPMTLSELQSQYPQID 233 (390)
T ss_dssp GGCTCGGCHHHHHHHHHHHHHHHHHCSS-HHHHHHHHHHHHHHHHHHHHHS--GGGCT-HHHH-EEEEHHHHHHHSTTS-
T ss_pred HhhcchhhHHHHHHHHHHHHHHHhhccCChHHHHHHHHHHHHHHHHHHHHhhhhHhhccHHHhcCcccHHHHHHHHhhhh
Confidence 344433 3888999999999999999998899999999999999999999999988999999999999999999999999
Q ss_pred HHHHHHHhcCCCCCCCCCCeEEEcChHHHHHHHHHHccCChhHHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhCCCCC
Q psy8609 83 WTEYINTILSPNAQLKDDETIIVTEPEYIHDLEKLLSTTPKRTMANYVMWRVTAASVGFFTEAIGARRLAFITAVTGVSE 162 (182)
Q Consensus 83 w~~~l~~l~~~~~~~~~~~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~~~~l~~~~~~~~~~~~~~~~g~~~ 162 (182)
|..+|+.++++ ++.++.|+|.+|+||++|..+++++++++|+||++|+++..++++++.+++....+|..++.|..+
T Consensus 234 w~~~l~~~~~~---~~~~~~V~v~~~~y~~~l~~ll~~~~~~~l~nYl~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~~~ 310 (390)
T PF05649_consen 234 WDRYLKALFGE---VTESDEVIVRSPDYFKKLSELLKQTSKETLANYLGWRLIRELSPYLSSELRDLYFEFYRTLSGVEE 310 (390)
T ss_dssp HHHHHHHHCCC---CCTTSEEEES-HHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHGGGS-HHHHHHHHHHHTHHH--SS
T ss_pred HHHHHHHHccc---cCCCceEEEcCcHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCCcc
Confidence 99999999875 345679999999999999999999999999999999999999999999999999999999999999
Q ss_pred CchhHhhHHHHHhcccccC
Q psy8609 163 EEARWKECVGRVKGGFSLA 181 (182)
Q Consensus 163 ~~~r~~~C~~~v~~~~~~~ 181 (182)
..|||+.|+..+.+.||++
T Consensus 311 ~~~~~~~C~~~~~~~l~~~ 329 (390)
T PF05649_consen 311 SRPRWQRCVRHVEKLLPFA 329 (390)
T ss_dssp ---HHHHHHHHHHHHHHHH
T ss_pred chhhHHHHHHHHHHHHHHH
Confidence 9999999999999999876
No 2
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=99.95 E-value=6.4e-27 Score=192.04 Aligned_cols=176 Identities=23% Similarity=0.393 Sum_probs=164.5
Q ss_pred CCcccCCCcHHHHHHHHHHHHHHHHHhCCCHH--HHHHHHHHHHHHHHHHhhccCChHhhcchhhhcCccCHHHHHhhCC
Q psy8609 2 ASSTLAGLDEKIVRAYYKYMLDIAVLMGADRE--AAVEELTESLNFEIALANISLPQEERRNATKLYNPMKISELQERYP 79 (182)
Q Consensus 2 ~~~~~~~~~~~~~~~y~~~i~~~~~~~g~~~~--~~~~~~~~i~~fE~~la~~~~~~~~~~~~~~~~~~~t~~~l~~~~p 79 (182)
++||+++++.+.+.+|+.++.+++..+|.+++ .+.+.+.+|+.||++||..+.+..++||-...||+.|+.+|+...|
T Consensus 162 ~~YY~de~~~e~l~ay~~hv~rml~~~~~~~~~~d~a~~A~~v~a~et~lA~~~~~~~~~rd~~~~Yn~~tf~~l~~~~p 241 (654)
T COG3590 162 TTYYRDEQHAELLAAYKEHVARMLGLFGFSEEEEDAAKHALRVVALETKLANASWEVVKYRDLYHTYNPATFAELQPELP 241 (654)
T ss_pred hhhhhhhhHHHHHHHHHHHHHHHHHHhCCchhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHhhhcCcChhhHHHhcccCC
Confidence 46899999999999999999999999999775 6889999999999999999999999999999999999999999999
Q ss_pred CCCHHHHHHHhcCCCCCCCCCCeEEEcChHHHHHHHHHHccCChhHHhHHHHHHHHHHHHhhhhHHHHHHHHHH-HHhhC
Q psy8609 80 SIPWTEYINTILSPNAQLKDDETIIVTEPEYIHDLEKLLSTTPKRTMANYVMWRVTAASVGFFTEAIGARRLAF-ITAVT 158 (182)
Q Consensus 80 ~i~w~~~l~~l~~~~~~~~~~~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~~~~l~~~~~~~~~~~-~~~~~ 158 (182)
..+|..++..+ +.-++..|++.+|.|++.+..++.+++...++.|+.|+++...+|||+++++...++| .++++
T Consensus 242 ~~~w~~~~~~~-----G~~PD~~iv~~~p~y~~~~~~~~~e~~~~~wKawl~~~~~~~~ap~Lt~dl~~~~f~fy~r~Ls 316 (654)
T COG3590 242 GDDWSLLFSAL-----GQLPDKVIVVENPFYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYGRTLS 316 (654)
T ss_pred CccHHHHHHHh-----cCCCCeeEeecCchHHHHHHHHHhcCcHHHHHHHHHHHHHhhcchhhhhHHHHhhhHHHhhhcc
Confidence 99999999987 4223578888999999999999999999999999999999999999999999999966 69999
Q ss_pred CCCCCchhHhhHHHHHhcccccCC
Q psy8609 159 GVSEEEARWKECVGRVKGGFSLAI 182 (182)
Q Consensus 159 g~~~~~~r~~~C~~~v~~~~~~~l 182 (182)
|+++..||||+.+.++++.||.+|
T Consensus 317 G~~E~r~rwKr~~~l~~~~~geai 340 (654)
T COG3590 317 GQPEARDRWKRAVNLAERLFGEAI 340 (654)
T ss_pred CChhhhHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999999874
No 3
>KOG3624|consensus
Probab=99.80 E-value=1.7e-18 Score=151.46 Aligned_cols=153 Identities=33% Similarity=0.668 Sum_probs=136.4
Q ss_pred CcHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHhhccCChHhhcchhhhcCccCHHHHHhhCCCCCHHHHHH
Q psy8609 9 LDEKIVRAYYKYMLDIAVLMGADREAAVEELTESLNFEIALANISLPQEERRNATKLYNPMKISELQERYPSIPWTEYIN 88 (182)
Q Consensus 9 ~~~~~~~~y~~~i~~~~~~~g~~~~~~~~~~~~i~~fE~~la~~~~~~~~~~~~~~~~~~~t~~~l~~~~p~i~w~~~l~ 88 (182)
...+.+.+|..++..+...+|.+++.+.+.++.+++||+++++.+.++..+++....++.+++.+|++.+|.++|..++.
T Consensus 211 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~ 290 (687)
T KOG3624|consen 211 SDSEKLRAYLLLANELLQLLGLDSDEAEEYARLVIELERQLANITKPDENRRELQALYNSVNLAELQKKFPSIDWKQLLR 290 (687)
T ss_pred cHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHhccchhhhhhhHHHHcccccHHHHHHhcccccHHHHhh
Confidence 33467889999999999999998999999999999999999999998888999999999999999999999999999999
Q ss_pred HhcCCCCCCCCCCeEEEcChHHHHHHHHHHccCChhHHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhCCCCCCchhHh
Q psy8609 89 TILSPNAQLKDDETIIVTEPEYIHDLEKLLSTTPKRTMANYVMWRVTAASVGFFTEAIGARRLAFITAVTGVSEEEARWK 168 (182)
Q Consensus 89 ~l~~~~~~~~~~~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~r~~ 168 (182)
.+... ... ...+++.+++|+.+|..+|.+|++++|+|||+|+++..+...++...+ ||.
T Consensus 291 ~~~~~--~~~-~~~~~~~~~~y~~~l~~ii~~t~~~~laNYl~~~~~~~~~~~~~~~~~------------------~~~ 349 (687)
T KOG3624|consen 291 SVLGE--ILP-HNEVVVFNPDYLVNLSAIIQSTPKRTLANYLIWRLLDDLLDLLPKPRK------------------RWA 349 (687)
T ss_pred hhccc--ccc-CCceEEecHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcccchhhH------------------HHH
Confidence 99776 222 338999999999999999999999999999999999988874443322 899
Q ss_pred hHHHHHhcccccCC
Q psy8609 169 ECVGRVKGGFSLAI 182 (182)
Q Consensus 169 ~C~~~v~~~~~~~l 182 (182)
.|++.|.++||.|+
T Consensus 350 ~Cv~~~~~~~p~a~ 363 (687)
T KOG3624|consen 350 DCVELVRTLLPLAL 363 (687)
T ss_pred HHHHHHHHhhhhhh
Confidence 99999999999874
No 4
>PRK08311 putative RNA polymerase sigma factor SigI; Reviewed
Probab=52.05 E-value=98 Score=23.89 Aligned_cols=31 Identities=13% Similarity=0.217 Sum_probs=20.8
Q ss_pred CeEEEcChHHHHHHHHHHccCChhHHhHHHHH
Q psy8609 101 ETIIVTEPEYIHDLEKLLSTTPKRTMANYVMW 132 (182)
Q Consensus 101 ~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~ 132 (182)
..++-++-+|+-.+.-++ ..+...|+.|+.+
T Consensus 201 rktier~rkyIia~~li~-~~~~~~l~~y~~~ 231 (237)
T PRK08311 201 RKTLERNRKYIIAVAIIL-AGDYPYLKEYIRG 231 (237)
T ss_pred HHHHHhhhHHHHHHHHHH-cCCcHHHHHHHhh
Confidence 455556777777777777 5566777777765
No 5
>KOG1076|consensus
Probab=46.59 E-value=12 Score=33.54 Aligned_cols=97 Identities=13% Similarity=0.301 Sum_probs=56.0
Q ss_pred CCCCHHHHHHHhcCCCCCCCCCCeEEEcChHHHHHHHHHHccCChhHHhHHHHHHHHHHHHhhhhHHH-----------H
Q psy8609 79 PSIPWTEYINTILSPNAQLKDDETIIVTEPEYIHDLEKLLSTTPKRTMANYVMWRVTAASVGFFTEAI-----------G 147 (182)
Q Consensus 79 p~i~w~~~l~~l~~~~~~~~~~~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~~~~l~~~~-----------~ 147 (182)
.+..|...+..-+.. ....++++|. .+..|++|..++..-.+.....-....++..+......+. -
T Consensus 217 d~~~we~v~s~~~~~-~~~~~~eEit--~~~v~kkl~eI~~ARGKK~tdr~~~i~~L~~L~~IA~~~~~~~~v~i~f~iI 293 (843)
T KOG1076|consen 217 DGVTWELVVSGKEKP-KMFQKPEEIT--HADVFKKLNEIMSARGKKTTDRQELIDLLEELLTIAETPGQGVKVKIKFNII 293 (843)
T ss_pred CCcceeeeccccccc-cccCCchhcC--HHHHHHHHHHHHHhhccccccHHHHHHHHHHHHHHHcccCcceEEEEEEEEe
Confidence 467788877654333 1233345555 6788999999996654444444444444444433333331 1
Q ss_pred HHHHHHHHhhCCCCCCchhHhhHHHHHhcccc
Q psy8609 148 ARRLAFITAVTGVSEEEARWKECVGRVKGGFS 179 (182)
Q Consensus 148 ~~~~~~~~~~~g~~~~~~r~~~C~~~v~~~~~ 179 (182)
.+.|+++..++|- .....|+.|+.-++++|.
T Consensus 294 sa~fD~N~~~s~y-Mpid~W~~~l~~~~slLd 324 (843)
T KOG1076|consen 294 SARFDYNAGLSGY-MPIDQWKKCLNNMNSLLD 324 (843)
T ss_pred ehhhccCcCcccC-cCHHHHHHHHHHHHHHHH
Confidence 2334444333332 235699999999998775
No 6
>KOG1088|consensus
Probab=43.79 E-value=51 Score=22.57 Aligned_cols=48 Identities=23% Similarity=0.465 Sum_probs=31.5
Q ss_pred HhhCCCCCHHHHHHHhcCCC-CCCCC--CCeEEEcChHHHHHHHHHHccCC
Q psy8609 75 QERYPSIPWTEYINTILSPN-AQLKD--DETIIVTEPEYIHDLEKLLSTTP 122 (182)
Q Consensus 75 ~~~~p~i~w~~~l~~l~~~~-~~~~~--~~~i~v~~~~y~~~l~~ll~~t~ 122 (182)
....|.+||.+++....+-. ..+++ .+.+.-.+..+++.+--++-+++
T Consensus 43 ~~m~~kidW~All~~A~~l~~~~lP~~kPe~~~e~de~~Lr~lHhlLle~~ 93 (124)
T KOG1088|consen 43 IRMLPKIDWSALLEAARSLGNTELPDEKPEKLVEDDEEFLRKLHHLLLEID 93 (124)
T ss_pred HHhhhhccHHHHHHHHHHhccccCCCCCccccccccHHHHHHHHHHHhhhh
Confidence 34578999999998864321 22332 34444447788898888876654
No 7
>PF15200 KRTDAP: Keratinocyte differentiation-associated
Probab=43.74 E-value=19 Score=22.27 Aligned_cols=17 Identities=12% Similarity=0.628 Sum_probs=11.7
Q ss_pred HHHHHhhCCCCCHHHHH
Q psy8609 71 ISELQERYPSIPWTEYI 87 (182)
Q Consensus 71 ~~~l~~~~p~i~w~~~l 87 (182)
++-+++..|.+||+.|-
T Consensus 48 fe~iK~kLPFlNWdafP 64 (77)
T PF15200_consen 48 FEAIKRKLPFLNWDAFP 64 (77)
T ss_pred HHHHHHhCcccchhhhh
Confidence 45566677888887763
No 8
>PF05044 HPD: Homeo-prospero domain; InterPro: IPR007738 The homeobox gene Prox1 is expressed in a subpopulation of endothelial cells that, after budding from veins, gives rise to the mammalian lymphatic system []. Prox1 has been found to be an early specific marker for the developing liver and pancreas in the mammalian foregut endoderm []. This family contains an atypical homeobox domain.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2LMD_A 1XPX_A 1MIJ_A.
Probab=41.84 E-value=21 Score=25.39 Aligned_cols=102 Identities=13% Similarity=0.222 Sum_probs=53.4
Q ss_pred hcCccCHHHHHhhCCCC------CHHHHHHHhcCCCCCCCCCCeEEE-cChHHHHHHHHHHccCC-hhHHhHHHHHHHHH
Q psy8609 65 LYNPMKISELQERYPSI------PWTEYINTILSPNAQLKDDETIIV-TEPEYIHDLEKLLSTTP-KRTMANYVMWRVTA 136 (182)
Q Consensus 65 ~~~~~t~~~l~~~~p~i------~w~~~l~~l~~~~~~~~~~~~i~v-~~~~y~~~l~~ll~~t~-~~~l~nYl~~~~i~ 136 (182)
..|+.+.+||-+.+.++ .-+.|-+..... |+.+.+.|.| .+..-|+.|+-=.+..+ .++=.+++..-
T Consensus 34 ~Fnr~~TsQLiKWFSNFREFyYiQMEK~ARqa~~e--Gv~~~~~l~V~rdsELfr~LN~HYNk~N~~evP~~Fl~v~--- 108 (158)
T PF05044_consen 34 KFNRCNTSQLIKWFSNFREFYYIQMEKFARQAVSE--GVKNADDLRVTRDSELFRVLNMHYNKNNDFEVPDRFLEVV--- 108 (158)
T ss_dssp ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--T-S-GGGSSSS-TTCHHHHHHHHHHTT-STT--HHHHHHH---
T ss_pred hhhhhhHHHHHHHhccchhHHHHHHHHHHHHHHHc--cCCcHHHeeeccchHHHHHHHhhcccCCCccCchhHHHHH---
Confidence 34555566666543322 234444444444 5655455555 67778888887776543 34434443321
Q ss_pred HHHhhhhHHHHHHHHHHHHhhCCCCCCchhHhhHHHHHhcccccC
Q psy8609 137 ASVGFFTEAIGARRLAFITAVTGVSEEEARWKECVGRVKGGFSLA 181 (182)
Q Consensus 137 ~~~~~l~~~~~~~~~~~~~~~~g~~~~~~r~~~C~~~v~~~~~~~ 181 (182)
+....||..++...+...|-|+..+-.|..-|..+
T Consensus 109 ----------~~tLrEFf~AI~~gkD~dpsWKK~IYKvI~kLD~~ 143 (158)
T PF05044_consen 109 ----------QITLREFFNAIQAGKDSDPSWKKPIYKVISKLDDP 143 (158)
T ss_dssp ----------HHHHHHHHHHHHCTCCCSTCCCHHHHHHHHCC-SS
T ss_pred ----------HHHHHHHHHHHHcCCCCCccccHHHHHHHHHccCc
Confidence 12334776666655666789999988887655443
No 9
>PRK11032 hypothetical protein; Provisional
Probab=38.25 E-value=1.1e+02 Score=22.26 Aligned_cols=107 Identities=9% Similarity=0.056 Sum_probs=66.7
Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHhhccCChHhhcchhhhcCccCHHHHHhhCCCCCHHHHHHHhc
Q psy8609 12 KIVRAYYKYMLDIAVLMGADREAAVEELTESLNFEIALANISLPQEERRNATKLYNPMKISELQERYPSIPWTEYINTIL 91 (182)
Q Consensus 12 ~~~~~y~~~i~~~~~~~g~~~~~~~~~~~~i~~fE~~la~~~~~~~~~~~~~~~~~~~t~~~l~~~~p~i~w~~~l~~l~ 91 (182)
+...+|.+++..+-..+...+..+++.++...++......++.+..+. ++..-+. |...|.+..-
T Consensus 3 k~~~~Y~~ll~~v~~~l~~~~~~l~~~ve~a~~~~~~~~elT~dEl~l-----------v~~ylkR----DL~ef~~~~~ 67 (160)
T PRK11032 3 KVAQYYRELVASLTERLRNGERDIDALVESARKRVDAAGELTRDEVDL-----------ITRAVRR----DLEEFARSYE 67 (160)
T ss_pred hHHHHHHHHHHHHHHHHHhCHHHHHHHHHHHHHHHHHHHhcCHHHHHH-----------HHHHHHH----HHHHHHHHHH
Confidence 567889999999988887667677777777777777776665433221 1222222 5555555431
Q ss_pred CCCCCCCCCCeEEEcChHHHHHHHHHHccCChhHHhHHHHHHHHHHHHhhhhHHHHHHHHHHH
Q psy8609 92 SPNAQLKDDETIIVTEPEYIHDLEKLLSTTPKRTMANYVMWRVTAASVGFFTEAIGARRLAFI 154 (182)
Q Consensus 92 ~~~~~~~~~~~i~v~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~~~~l~~~~~~~~~~~~ 154 (182)
.. .+.+. ..|. ...|.+-+ |.-+...+...--++++....+.
T Consensus 68 ~~------~~~~~-~s~~-------------~~~i~~sl-w~~L~~ItDrTqvEw~el~~dl~ 109 (160)
T PRK11032 68 ES------KEEFS-DSVF-------------MRVIKESL-WQELADITDKTQLEWREVFQDLN 109 (160)
T ss_pred hc------ccccc-ccHH-------------HHHHHHHH-HHHHHHHHHHhHHHHHHHHHHhh
Confidence 11 11111 1221 23566666 88888888888888888888884
No 10
>COG5579 Uncharacterized conserved protein [Function unknown]
Probab=34.48 E-value=48 Score=22.83 Aligned_cols=16 Identities=6% Similarity=0.163 Sum_probs=11.1
Q ss_pred CChhHHhHHHHHHHHH
Q psy8609 121 TPKRTMANYVMWRVTA 136 (182)
Q Consensus 121 t~~~~l~nYl~~~~i~ 136 (182)
.+.+.-+.|+-|+++-
T Consensus 58 ssL~EAqAYLAh~iLG 73 (143)
T COG5579 58 SSLEEAQAYLAHDLLG 73 (143)
T ss_pred ccHHHHHHHHhhhhhc
Confidence 3456677888888754
No 11
>KOG3932|consensus
Probab=28.64 E-value=50 Score=26.31 Aligned_cols=52 Identities=12% Similarity=0.282 Sum_probs=31.2
Q ss_pred CChhHHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhCCCCCCchhHhhHHHHHhcc
Q psy8609 121 TPKRTMANYVMWRVTAASVGFFTEAIGARRLAFITAVTGVSEEEARWKECVGRVKGG 177 (182)
Q Consensus 121 t~~~~l~nYl~~~~i~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~r~~~C~~~v~~~ 177 (182)
.+.+.|+.++.-++..+|+ |++.++.=...-| +-+. ....-|.+|+..|+++
T Consensus 269 ~s~~eLqA~~LtCLYlsYS-YMGNEISYPLKPF---LVe~-dke~FWdRCl~~vn~l 320 (357)
T KOG3932|consen 269 HSLEELQAWILTCLYLSYS-YMGNEISYPLKPF---LVEN-DKETFWDRCLAMVNSL 320 (357)
T ss_pred ccHHHHHHHHHHHHHHHHH-hhcccccccccce---eecc-cHHHHHHHHHHHHHHh
Confidence 4567788888888887775 6555431111111 0111 1245899999999865
No 12
>PF10523 BEN: BEN domain; InterPro: IPR018379 The BEN domain is found in diverse proteins including: SMAR1 (Scaffold/Matrix attachment region-binding protein 1; also known as BANP), a tumour-suppressor MAR-binding protein that down-regulates Cyclin D1 expression by recruiting HDAC1-mSin3A co-repressor complex at Cyclin D1 promoter locus; SMAR1 is the target of prostaglandin A2 (PGA2) induced growth arrest [, ]. NAC1, a novel member of the POZ/BTB (Pox virus and Zinc finger/Bric-a-bracTramtrack Broad complex), but which varies from other proteins of this class in that it lacks the characteristic DNA-binding motif []. Mod(mdg4) isoform C, the modifier of the mdg4 locus in Drosophila melanogaster (Fruit fly), where mdg4 encodes chromatin proteins which are involved in position effect variegation, establishment of chromatin boundaries, nerve path finding, meiotic chromosome pairing and apoptosis []. Trans-splicing of Mod(mdg4) produces at least 26 transcripts. E5R protein from Chordopoxvirus virosomes, which is found in cytoplasmic sites of viral DNA replication []. Several proteins of polydnaviruses. The BEN domain is predicted to function as an adaptor for the higher-order structuring of chromatin, and recruitment of chromatin modifying factors in transcriptional regulation. It has been suggested to mediate protein-DNA and protein-protein interactions during chromatin organisation and transcription. The presence of BEN domains in a poxviral early virosomal protein and in polydnaviral proteins also suggests a possible role in the organisation of viral DNA during replication or transcription. They are generally linked to other globular domains with functions related to transcriptional regulation and chromatin structure, such as BTB, C4DM, and C2H2 fingers []. This domain is predicted to form an all-alpha fold with four conserved helices. Its conservation pattern revealed several conserved residues, most of which have hydrophobic side-chains and are likely to stabilise the fold through helix-helix packing [].
Probab=27.71 E-value=1.1e+02 Score=18.51 Aligned_cols=29 Identities=10% Similarity=0.350 Sum_probs=15.9
Q ss_pred HHHHHHHhhCCCCCCchhHhhHHHHHhcccc
Q psy8609 149 RRLAFITAVTGVSEEEARWKECVGRVKGGFS 179 (182)
Q Consensus 149 ~~~~~~~~~~g~~~~~~r~~~C~~~v~~~~~ 179 (182)
+...|-....|..+ ..|..|...+++.+.
T Consensus 45 ~I~~~v~~~~~~~~--~~w~~~~~~i~~~~~ 73 (79)
T PF10523_consen 45 AIRNYVEERFPSEK--RKWRECRQSINNKCR 73 (79)
T ss_pred HHHHHHHHHCCcch--hhHHHHHHHHHHHHH
Confidence 33333333344433 567788887776553
No 13
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=26.30 E-value=39 Score=19.12 Aligned_cols=23 Identities=17% Similarity=0.768 Sum_probs=17.6
Q ss_pred cCccCHHHHHhhCCCCCHHHHHHH
Q psy8609 66 YNPMKISELQERYPSIPWTEYINT 89 (182)
Q Consensus 66 ~~~~t~~~l~~~~p~i~w~~~l~~ 89 (182)
.+++-+++|... |.+||..++..
T Consensus 4 ~epiDLa~lvDd-~kvdWd~wvSf 26 (49)
T PF06543_consen 4 NEPIDLADLVDD-PKVDWDKWVSF 26 (49)
T ss_pred cCcccHHHHcCC-cccchHHheee
Confidence 467778888744 77999999864
No 14
>PF02083 Urotensin_II: Urotensin II; InterPro: IPR001483 Urotensin II, a small peptide that contains a disulphide bridge, was originally isolated from the caudal portion of the spinal cord of teleost and elasmobranch fish []. The peptide has also been found in the brain of frogs []. Urotensin II seems to be involved in smooth muscle stimulation.; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=25.52 E-value=17 Score=14.29 Aligned_cols=6 Identities=67% Similarity=1.464 Sum_probs=3.3
Q ss_pred hHhhHH
Q psy8609 166 RWKECV 171 (182)
Q Consensus 166 r~~~C~ 171 (182)
-|+.|+
T Consensus 7 FWKYCv 12 (12)
T PF02083_consen 7 FWKYCV 12 (12)
T ss_pred hhhhcC
Confidence 466663
No 15
>PF12086 DUF3563: Protein of unknown function (DUF3563); InterPro: IPR021946 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 50 amino acids in length. This protein has conserved AYL and DLE sequence motifs.
Probab=25.46 E-value=1e+02 Score=18.40 Aligned_cols=21 Identities=19% Similarity=0.311 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHhhcc
Q psy8609 33 EAAVEELTESLNFEIALANIS 53 (182)
Q Consensus 33 ~~~~~~~~~i~~fE~~la~~~ 53 (182)
+.....+.++.++|.++.++-
T Consensus 24 eaYLA~s~D~~DLErRmr~le 44 (59)
T PF12086_consen 24 EAYLAQSTDIYDLERRMRELE 44 (59)
T ss_pred HHHHHhcccHHHHHHHHHHHh
Confidence 345567889999999998875
No 16
>TIGR02895 spore_sigI RNA polymerase sigma-I factor. Members of this sigma factor protein family are strictly limited to endospore-forming species in the Firmicutes lineage of bacteria, but are not universally present among such species. Sigma-I was shown to be induced by heat shock (PubMed:11157964) in Bacillus subtilis and is suggested by its phylogenetic profile to be connected to the program of sporulation (PubMed:16311624).
Probab=23.36 E-value=2.4e+02 Score=21.46 Aligned_cols=15 Identities=20% Similarity=0.266 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHhhcc
Q psy8609 39 LTESLNFEIALANIS 53 (182)
Q Consensus 39 ~~~i~~fE~~la~~~ 53 (182)
.++|..|++.|+.+-
T Consensus 120 ~eEI~~~~~~L~~~g 134 (218)
T TIGR02895 120 RLEILEYKKLLKQFG 134 (218)
T ss_pred HHHHHHHHHHHHHcC
Confidence 467777777776664
No 17
>TIGR01834 PHA_synth_III_E poly(R)-hydroxyalkanoic acid synthase, class III, PhaE subunit. This model represents the PhaE subunit of the heterodimeric class (class III) of polymerase for poly(R)-hydroxyalkanoic acids (PHAs), carbon and energy storage polymers of many bacteria. The most common PHA is polyhydroxybutyrate but about 150 different constituent hydroxyalkanoic acids (HAs) have been identified in various species. This model must be designated subfamily to indicate the heterogeneity of PHAs.
Probab=22.39 E-value=3.4e+02 Score=22.15 Aligned_cols=40 Identities=5% Similarity=0.006 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHhCCC-HHHHHHHHHHHHHHHHHHhhcc
Q psy8609 14 VRAYYKYMLDIAVLMGAD-REAAVEELTESLNFEIALANIS 53 (182)
Q Consensus 14 ~~~y~~~i~~~~~~~g~~-~~~~~~~~~~i~~fE~~la~~~ 53 (182)
+.++.+.+-.++..+|.+ ..+..+.-+.|.++|++++.+.
T Consensus 269 r~~~qe~~e~~L~~LnlPTRsElDe~~krL~ELrR~vr~L~ 309 (320)
T TIGR01834 269 RIQQQEIVEALLKMLNLPTRSELDEAHQRIQQLRREVKSLK 309 (320)
T ss_pred HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555556666653 4556667777777777776654
No 18
>PF11554 DUF3232: Protein of unknown function (DUF3232); InterPro: IPR021618 This bacterial family of proteins has no known function. ; PDB: 2RDC_A.
Probab=22.23 E-value=1.7e+02 Score=20.79 Aligned_cols=51 Identities=18% Similarity=0.105 Sum_probs=29.5
Q ss_pred cHHHHHHHHHHHHHH------HHHhCCCHHHHHHHHHHHHHHHHHHhhc--cCChHhhc
Q psy8609 10 DEKIVRAYYKYMLDI------AVLMGADREAAVEELTESLNFEIALANI--SLPQEERR 60 (182)
Q Consensus 10 ~~~~~~~y~~~i~~~------~~~~g~~~~~~~~~~~~i~~fE~~la~~--~~~~~~~~ 60 (182)
.+..++-|++.+... +.++..-.+.+.+.+..++.||.+|+.. ....++-|
T Consensus 19 ~~~~~~rf~~v~~~~~~D~~~l~liE~~ids~~~Y~~~V~~mE~~l~t~rfrlegeeYR 77 (152)
T PF11554_consen 19 SEETMNRFKRVISITKDDEDALDLIEELIDSFKEYVLIVYRMEDQLQTWRFRLEGEEYR 77 (152)
T ss_dssp TTSHHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCTS-HHHHH
T ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHH
Confidence 345566677555443 1122112355778899999999998754 44444443
No 19
>KOG3779|consensus
Probab=21.50 E-value=4.8e+02 Score=22.55 Aligned_cols=97 Identities=12% Similarity=0.215 Sum_probs=48.9
Q ss_pred cCccCHHHHHhhCCCCC------HHHHHHHhcCCCCCCCCCCeEEE-cChHHHHHHHHHHccCChhHHhHHHHHHHHHHH
Q psy8609 66 YNPMKISELQERYPSIP------WTEYINTILSPNAQLKDDETIIV-TEPEYIHDLEKLLSTTPKRTMANYVMWRVTAAS 138 (182)
Q Consensus 66 ~~~~t~~~l~~~~p~i~------w~~~l~~l~~~~~~~~~~~~i~v-~~~~y~~~l~~ll~~t~~~~l~nYl~~~~i~~~ 138 (182)
+|+.-.++|-+.+.++- -+.|-+..+.. ++.+...|.| .+-.-|+.|+--.+..+.-.+-.=+++-
T Consensus 612 FNr~~TsQLiKWFSNFREFYY~QMEK~ARQAi~d--GvT~~~ei~itrD~EL~r~LN~HYNk~N~~~~PeR~~~V----- 684 (737)
T KOG3779|consen 612 FNRCITSQLIKWFSNFREFYYIQMEKYARQAIND--GVTSTEELSITRDCELYRALNMHYNKANDFEVPERFLEV----- 684 (737)
T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccCchhheeeeccHHHHHHHHhhhcccccccccHHHHHH-----
Confidence 44555566665544331 12222232333 5665444544 5777777777666554321111111111
Q ss_pred HhhhhHHHHHHHHHHHHhhCCCCCCchhHhhHHHHHhc
Q psy8609 139 VGFFTEAIGARRLAFITAVTGVSEEEARWKECVGRVKG 176 (182)
Q Consensus 139 ~~~l~~~~~~~~~~~~~~~~g~~~~~~r~~~C~~~v~~ 176 (182)
.+....+|..++--.+...|-|+..+-.|..
T Consensus 685 -------~Q~TLREF~~AI~~GKDvdPSWKK~IYKVI~ 715 (737)
T KOG3779|consen 685 -------AQITLREFFNAIIAGKDVDPSWKKAIYKVIC 715 (737)
T ss_pred -------HHHHHHHHHHHHhcCCcCCchHHHHHHHHHH
Confidence 1233456665554444567999998887754
No 20
>PF15372 DUF4600: Domain of unknown function (DUF4600)
Probab=21.18 E-value=2e+02 Score=20.06 Aligned_cols=38 Identities=13% Similarity=0.221 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHhhccCChHhhcchhhhcCccCHHHH
Q psy8609 37 EELTESLNFEIALANISLPQEERRNATKLYNPMKISEL 74 (182)
Q Consensus 37 ~~~~~i~~fE~~la~~~~~~~~~~~~~~~~~~~t~~~l 74 (182)
+.-.+|+.++.++.++...+.++..+...|..++...|
T Consensus 19 QLekqi~~l~~kiek~r~n~~drl~siR~ye~Ms~~~l 56 (129)
T PF15372_consen 19 QLEKQIIILREKIEKIRGNPSDRLSSIRRYEQMSVESL 56 (129)
T ss_pred HHHHHHHHHHHHHHHHhCCCccccHHHHHHhhccHHHH
Confidence 45567777888888887666665555555555554444
No 21
>COG5552 Uncharacterized conserved protein [Function unknown]
Probab=20.71 E-value=2.1e+02 Score=17.89 Aligned_cols=23 Identities=13% Similarity=0.275 Sum_probs=18.7
Q ss_pred hhhHHHHHHHHHHHHhhCCCCCC
Q psy8609 141 FFTEAIGARRLAFITAVTGVSEE 163 (182)
Q Consensus 141 ~l~~~~~~~~~~~~~~~~g~~~~ 163 (182)
..+.+.+.+...|-++++|....
T Consensus 15 AT~~EvrdAAlQfVRKlSGtT~P 37 (88)
T COG5552 15 ATPVEVRDAALQFVRKLSGTTHP 37 (88)
T ss_pred CCcHHHHHHHHHHHHHhcCCCCc
Confidence 34668899999999999998765
No 22
>PHA03043 hypothetical protein; Provisional
Probab=20.58 E-value=64 Score=22.48 Aligned_cols=12 Identities=33% Similarity=0.836 Sum_probs=9.9
Q ss_pred hhHHhHHHHHHH
Q psy8609 123 KRTMANYVMWRV 134 (182)
Q Consensus 123 ~~~l~nYl~~~~ 134 (182)
...|++|++||-
T Consensus 10 ~~iL~~Yi~wRG 21 (130)
T PHA03043 10 KDIIKEYIRWRG 21 (130)
T ss_pred HHHHHHHHhhhc
Confidence 467999999993
Done!