Query psy14480
Match_columns 190
No_of_seqs 220 out of 1456
Neff 7.4
Searched_HMMs 46136
Date Fri Aug 16 20:54:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14480.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14480hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2084|consensus 99.6 4.3E-15 9.3E-20 131.8 13.5 113 76-189 169-288 (482)
2 PF00856 SET: SET domain; Int 99.5 4.3E-15 9.4E-20 112.1 4.7 51 97-147 109-162 (162)
3 smart00317 SET SET (Su(var)3-9 99.3 1.3E-12 2.9E-17 94.4 3.9 46 101-146 68-116 (116)
4 KOG2589|consensus 98.8 2.6E-09 5.6E-14 91.9 4.5 80 88-177 176-257 (453)
5 COG2940 Proteins containing SE 98.4 1.2E-07 2.6E-12 85.9 3.1 72 105-176 404-478 (480)
6 KOG4442|consensus 98.3 4.9E-07 1.1E-11 83.3 4.0 51 103-153 189-243 (729)
7 KOG1080|consensus 98.3 6.1E-07 1.3E-11 86.7 3.7 46 106-151 939-987 (1005)
8 KOG1337|consensus 97.8 1.8E-05 3.9E-10 71.6 4.5 88 74-167 197-291 (472)
9 KOG1082|consensus 97.7 2.4E-05 5.1E-10 68.7 3.1 44 107-150 273-323 (364)
10 KOG1141|consensus 97.3 0.00011 2.4E-09 69.3 2.3 58 107-172 1190-1254(1262)
11 KOG1083|consensus 97.0 0.0006 1.3E-08 65.9 3.5 42 108-149 1252-1296(1306)
12 KOG1085|consensus 96.9 0.00061 1.3E-08 57.6 2.9 44 107-150 334-380 (392)
13 KOG1338|consensus 96.8 0.004 8.6E-08 54.9 6.8 87 62-149 162-261 (466)
14 KOG1079|consensus 95.8 0.0094 2E-07 55.5 3.7 44 106-149 665-711 (739)
15 KOG1081|consensus 92.7 0.043 9.2E-07 49.8 0.7 50 101-150 365-418 (463)
16 KOG2461|consensus 76.5 2.2 4.8E-05 38.1 2.7 27 124-150 120-146 (396)
17 TIGR02059 swm_rep_I cyanobacte 75.9 3.9 8.4E-05 29.5 3.3 25 126-150 75-99 (101)
18 PF08666 SAF: SAF domain; Int 69.7 3.1 6.8E-05 26.3 1.6 17 129-145 3-19 (63)
19 KOG2155|consensus 60.2 9.7 0.00021 34.7 3.3 61 103-163 201-266 (631)
20 smart00858 SAF This domain fam 45.9 15 0.00032 23.0 1.6 16 129-144 3-18 (64)
21 KOG2084|consensus 40.0 0.21 4.5E-06 44.3 -10.6 80 99-178 365-452 (482)
22 PF12947 EGF_3: EGF domain; I 24.7 29 0.00064 19.9 0.3 15 110-124 4-18 (36)
23 COG4118 Phd Antitoxin of toxin 23.6 29 0.00063 24.1 0.2 16 133-148 18-33 (84)
24 PF06718 DUF1203: Protein of u 23.3 75 0.0016 23.5 2.4 20 131-150 6-26 (117)
25 COG1188 Ribosome-associated he 22.7 62 0.0014 23.3 1.8 21 130-150 43-63 (100)
26 COG1996 RPC10 DNA-directed RNA 22.3 40 0.00087 21.1 0.6 22 169-190 8-29 (49)
27 PRK00398 rpoP DNA-directed RNA 21.8 53 0.0011 19.6 1.1 21 169-189 5-25 (46)
28 PF14353 CpXC: CpXC protein 20.5 1.5E+02 0.0033 21.6 3.6 33 145-178 17-49 (128)
No 1
>KOG2084|consensus
Probab=99.63 E-value=4.3e-15 Score=131.85 Aligned_cols=113 Identities=35% Similarity=0.620 Sum_probs=97.0
Q ss_pred HHHHHHHHHHHHhcccccCC----CC--CceEEEcccccccCCCCCCCceeEecCCCEEEEEEcccCCCCC-eeeeeccC
Q psy14480 76 QEEILKICGIIQVNAHEMPL----TE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD-HLSICYTD 148 (190)
Q Consensus 76 ~~~~~~~~~~~~~N~f~i~~----~~--~~~~gl~p~~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~Ge-EltisY~~ 148 (190)
.+.+..++..+..+++.... .. ..+.|+||..+++||||.||+...|+ +....+++...+.+++ |++++|++
T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~hsC~pn~~~~~~-~~~~~~~~~~~~~~~~~~l~~~y~~ 247 (482)
T KOG2084|consen 169 ADCISKLFPSLLCNSITNASSLRVPEPLFLGRGLFPGSSLFNHSCFPNISVIFD-GRGLALLVPAGIDAGEEELTISYTD 247 (482)
T ss_pred HHHHHHHHHHHHHhhhhhhhhhhccccccceeeecccchhcccCCCCCeEEEEC-CceeEEEeecccCCCCCEEEEeecc
Confidence 44556667777777776654 11 48999999999999999999998886 6677777888888876 99999999
Q ss_pred CCCCHHHHHHHhccCCCeeeecCCCCCCCCCCccccccccC
Q psy14480 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189 (190)
Q Consensus 149 ~~~~~~~R~~~L~~~~~F~C~C~~C~~~~e~~~~~~~~~C~ 189 (190)
..+++..|++.|+..|.|.|.|++|.++++.+...++++|.
T Consensus 248 ~~~~~~~r~~~l~~~~~f~c~c~rc~d~~~~~~~~~~~~c~ 288 (482)
T KOG2084|consen 248 PLLSTASRQKQLRQSKLFSCQCPRCLDPTELGTFLSSLRCE 288 (482)
T ss_pred cccCHHHHHHHHhhccceeeecCCCCCCCccccchhhhhhc
Confidence 99999999999999999999999999999999999988884
No 2
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.55 E-value=4.3e-15 Score=112.13 Aligned_cols=51 Identities=33% Similarity=0.614 Sum_probs=45.4
Q ss_pred CCceEEEcccccccCCCCCCCceeEec---CCCEEEEEEcccCCCCCeeeeecc
Q psy14480 97 EPSYIAIFDRASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICYT 147 (190)
Q Consensus 97 ~~~~~gl~p~~s~~NHSC~PN~~~~~~---~~~~~~~~a~r~I~~GeEltisY~ 147 (190)
+..+.+|||.++++||||.|||.+.++ .+..+.++|.|+|++||||+|+|+
T Consensus 109 ~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~isYG 162 (162)
T PF00856_consen 109 DRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFISYG 162 (162)
T ss_dssp EEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEEST
T ss_pred cccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEEEC
Confidence 356799999999999999999999996 478999999999999999999997
No 3
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.32 E-value=1.3e-12 Score=94.37 Aligned_cols=46 Identities=35% Similarity=0.636 Sum_probs=40.2
Q ss_pred EEEcccccccCCCCCCCceeEecC-CC--EEEEEEcccCCCCCeeeeec
Q psy14480 101 IAIFDRASFIEHNCYPNLYKSFTD-SG--QVLLRAMKPIAPGDHLSICY 146 (190)
Q Consensus 101 ~gl~p~~s~~NHSC~PN~~~~~~~-~~--~~~~~a~r~I~~GeEltisY 146 (190)
..++|.++++||||.||+.+.+.. ++ .+.++|+|+|++|||||++|
T Consensus 68 ~~~~~~~~~iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 68 RRKGNIARFINHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred CccCcHHHeeCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 447899999999999999988742 22 69999999999999999998
No 4
>KOG2589|consensus
Probab=98.85 E-value=2.6e-09 Score=91.91 Aligned_cols=80 Identities=21% Similarity=0.462 Sum_probs=60.5
Q ss_pred hcccccCCCCC-ceEEEccc-ccccCCCCCCCceeEecCCCEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCC
Q psy14480 88 VNAHEMPLTEP-SYIAIFDR-ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165 (190)
Q Consensus 88 ~N~f~i~~~~~-~~~gl~p~-~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~ 165 (190)
-|-|.|..+.+ .-..||.. ++++||+|.|||.++-.++.++.|+++|||++|||||.=|+..+.+. ..
T Consensus 176 ~nDFSvmyStRk~caqLwLGPaafINHDCrpnCkFvs~g~~tacvkvlRDIePGeEITcFYgs~fFG~----------~N 245 (453)
T KOG2589|consen 176 GNDFSVMYSTRKRCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEITCFYGSGFFGE----------NN 245 (453)
T ss_pred CCceeeeeecccchhhheeccHHhhcCCCCCCceeecCCCceeeeehhhcCCCCceeEEeecccccCC----------CC
Confidence 45566554222 22455543 67889999999998886558999999999999999999999887763 23
Q ss_pred eeeecCCCCCCC
Q psy14480 166 FICQCERCRDPT 177 (190)
Q Consensus 166 F~C~C~~C~~~~ 177 (190)
-.|.|.-|....
T Consensus 246 ~~CeC~TCER~g 257 (453)
T KOG2589|consen 246 EECECVTCERRG 257 (453)
T ss_pred ceeEEeeccccc
Confidence 489999997653
No 5
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=98.44 E-value=1.2e-07 Score=85.89 Aligned_cols=72 Identities=26% Similarity=0.407 Sum_probs=59.3
Q ss_pred ccccccCCCCCCCceeEecCC---CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCCeeeecCCCCCC
Q psy14480 105 DRASFIEHNCYPNLYKSFTDS---GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176 (190)
Q Consensus 105 p~~s~~NHSC~PN~~~~~~~~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~ 176 (190)
..+.++||||.||+....... ..+.++|++||++||||++.|....+....+...+...++..|.|.+|...
T Consensus 404 ~~~r~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (480)
T COG2940 404 DVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHT 478 (480)
T ss_pred cccceeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCC
Confidence 456689999999999887432 388999999999999999999988775554556677788999999999754
No 6
>KOG4442|consensus
Probab=98.31 E-value=4.9e-07 Score=83.35 Aligned_cols=51 Identities=29% Similarity=0.452 Sum_probs=38.9
Q ss_pred Eccccc-ccCCCCCCCceeE-e--cCCCEEEEEEcccCCCCCeeeeeccCCCCCH
Q psy14480 103 IFDRAS-FIEHNCYPNLYKS-F--TDSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153 (190)
Q Consensus 103 l~p~~s-~~NHSC~PN~~~~-~--~~~~~~~~~a~r~I~~GeEltisY~~~~~~~ 153 (190)
.+-..| ++||||+|||..- | .+..+|-+.|.++|++|||||+.|--..++.
T Consensus 189 ~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr 243 (729)
T KOG4442|consen 189 KKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGR 243 (729)
T ss_pred ccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEecccccccc
Confidence 343444 4699999999553 3 2236888999999999999999999776654
No 7
>KOG1080|consensus
Probab=98.26 E-value=6.1e-07 Score=86.71 Aligned_cols=46 Identities=30% Similarity=0.566 Sum_probs=38.0
Q ss_pred cccccCCCCCCCceeEe---cCCCEEEEEEcccCCCCCeeeeeccCCCC
Q psy14480 106 RASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPLW 151 (190)
Q Consensus 106 ~~s~~NHSC~PN~~~~~---~~~~~~~~~a~r~I~~GeEltisY~~~~~ 151 (190)
.|.++||||.|||..-. ++..+++|+|.|+|++|||||..|-....
T Consensus 939 iAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e 987 (1005)
T KOG1080|consen 939 IARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTE 987 (1005)
T ss_pred hhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeecccccc
Confidence 47789999999995432 33479999999999999999999986654
No 8
>KOG1337|consensus
Probab=97.83 E-value=1.8e-05 Score=71.59 Aligned_cols=88 Identities=17% Similarity=0.111 Sum_probs=69.1
Q ss_pred CCHHHHHHHHHHHHhcccccCCC-------CCceEEEcccccccCCCCCCCceeEecCCCEEEEEEcccCCCCCeeeeec
Q psy14480 74 FTQEEILKICGIIQVNAHEMPLT-------EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146 (190)
Q Consensus 74 ~~~~~~~~~~~~~~~N~f~i~~~-------~~~~~gl~p~~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~GeEltisY 146 (190)
++.+.+...++++.+.+|..... ...-.++.|...+.||+|.+........+..+.+++.++|++||||+|+|
T Consensus 197 ~~~~~~~w~~~~~~sr~~~~~~~~~~~~~~~~~~~~L~P~~D~~NH~~~~~~~~~~~~d~~~~l~~~~~v~~geevfi~Y 276 (472)
T KOG1337|consen 197 FTFSAFKWAYSIVNSRAFYLPSLQRLTAGDPDDNEALAPLIDLLNHSPEVIKAGYNQEDEAVELVAERDVSAGEEVFINY 276 (472)
T ss_pred cchHHHHHHHHHHhhhhhccccccccccCCCCcchhhhhhHHhhccCchhccccccCCCCcEEEEEeeeecCCCeEEEec
Confidence 56677889999999999987642 13447899999999999999444444334589999999999999999999
Q ss_pred cCCCCCHHHHHHHhccCCCee
Q psy14480 147 TDPLWGTINRRHHLQTSKYFI 167 (190)
Q Consensus 147 ~~~~~~~~~R~~~L~~~~~F~ 167 (190)
++... ..|...|||.
T Consensus 277 G~~~N------~eLL~~YGFv 291 (472)
T KOG1337|consen 277 GPKSN------AELLLHYGFV 291 (472)
T ss_pred CCCch------HHHHHhcCCC
Confidence 98432 2566689996
No 9
>KOG1082|consensus
Probab=97.71 E-value=2.4e-05 Score=68.69 Aligned_cols=44 Identities=34% Similarity=0.633 Sum_probs=36.0
Q ss_pred ccccCCCCCCCceeEec--CC-----CEEEEEEcccCCCCCeeeeeccCCC
Q psy14480 107 ASFIEHNCYPNLYKSFT--DS-----GQVLLRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 107 ~s~~NHSC~PN~~~~~~--~~-----~~~~~~a~r~I~~GeEltisY~~~~ 150 (190)
+-++||||.||+.+... ++ ..+.+.|+++|++|+|+|..|+...
T Consensus 273 ~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~ 323 (364)
T KOG1082|consen 273 ARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAY 323 (364)
T ss_pred cccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccc
Confidence 44679999999987652 11 4778999999999999999999764
No 10
>KOG1141|consensus
Probab=97.32 E-value=0.00011 Score=69.30 Aligned_cols=58 Identities=28% Similarity=0.461 Sum_probs=42.7
Q ss_pred ccccCCCCCCCceeE--ecC--C---CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCCeeeecCC
Q psy14480 107 ASFIEHNCYPNLYKS--FTD--S---GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCER 172 (190)
Q Consensus 107 ~s~~NHSC~PN~~~~--~~~--~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~F~C~C~~ 172 (190)
+-++||||.||..+. |.+ + .-+.+.|.+-|++|+|||..|....-... ..-..|.|..
T Consensus 1190 GRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~--------~keL~C~CGa 1254 (1262)
T KOG1141|consen 1190 GRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVA--------TKELTCHCGA 1254 (1262)
T ss_pred hhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccc--------cceEEEecCh
Confidence 557899999998653 421 1 46678899999999999999997654332 3456888864
No 11
>KOG1083|consensus
Probab=96.98 E-value=0.0006 Score=65.90 Aligned_cols=42 Identities=29% Similarity=0.502 Sum_probs=33.7
Q ss_pred cccCCCCCCCceeE-ec-CC-CEEEEEEcccCCCCCeeeeeccCC
Q psy14480 108 SFIEHNCYPNLYKS-FT-DS-GQVLLRAMKPIAPGDHLSICYTDP 149 (190)
Q Consensus 108 s~~NHSC~PN~~~~-~~-~~-~~~~~~a~r~I~~GeEltisY~~~ 149 (190)
-++||||.|||... |. +| .++.|.|.|||.+|||||..|-..
T Consensus 1252 RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~k 1296 (1306)
T KOG1083|consen 1252 RFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFK 1296 (1306)
T ss_pred cccccccCCCCccccccccceeeeeeeecCCCCCCceEEEecccc
Confidence 34599999999543 32 23 689999999999999999998754
No 12
>KOG1085|consensus
Probab=96.95 E-value=0.00061 Score=57.64 Aligned_cols=44 Identities=30% Similarity=0.388 Sum_probs=37.0
Q ss_pred ccccCCCCCCCceeE---ecCCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480 107 ASFIEHNCYPNLYKS---FTDSGQVLLRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 107 ~s~~NHSC~PN~~~~---~~~~~~~~~~a~r~I~~GeEltisY~~~~ 150 (190)
.-++|||=.+|+.-- .+....+++.|.+||++||||+..|+|-.
T Consensus 334 GRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS 380 (392)
T KOG1085|consen 334 GRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS 380 (392)
T ss_pred hhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence 567899999998432 34448999999999999999999999865
No 13
>KOG1338|consensus
Probab=96.80 E-value=0.004 Score=54.85 Aligned_cols=87 Identities=13% Similarity=0.090 Sum_probs=69.5
Q ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHhcccccCC------------CCCceEEEcccccccCCCCC-CCceeEecCCCEE
Q psy14480 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPL------------TEPSYIAIFDRASFIEHNCY-PNLYKSFTDSGQV 128 (190)
Q Consensus 62 ~~l~~~~~~~~~~~~~~~~~~~~~~~~N~f~i~~------------~~~~~~gl~p~~s~~NHSC~-PN~~~~~~~~~~~ 128 (190)
.|...+...-..++.|.++.+.+++..-+|.+.. .+..|-..-|.+.++||.=. -|+...++ +..+
T Consensus 162 pf~~~~p~vfs~~slEdF~y~~Al~laysfdve~~~s~~~~eee~e~e~ngk~m~p~ad~lNhd~~k~nanl~y~-~NcL 240 (466)
T KOG1338|consen 162 PFKQHCPIVFSRPSLEDFMYAYALGLAYSFDVEFLLSLDNLEEESEIECNGKLMTPIADFLNHDGLKANANLRYE-DNCL 240 (466)
T ss_pred HHHHhCcchhcccCHHHHHHHHHHHHHHheeeehhcchhhhhhhhccccCcccccchhhhhccchhhcccceecc-Ccce
Confidence 3444444333347899999999999999999864 23678889999999999876 67777775 6788
Q ss_pred EEEEcccCCCCCeeeeeccCC
Q psy14480 129 LLRAMKPIAPGDHLSICYTDP 149 (190)
Q Consensus 129 ~~~a~r~I~~GeEltisY~~~ 149 (190)
.++|.|+|++|+|+.-+|+-.
T Consensus 241 ~mva~r~iekgdev~n~dg~~ 261 (466)
T KOG1338|consen 241 EMVADRNIEKGDEVDNSDGLK 261 (466)
T ss_pred eeeecCCCCCccccccccccC
Confidence 999999999999999999843
No 14
>KOG1079|consensus
Probab=95.77 E-value=0.0094 Score=55.52 Aligned_cols=44 Identities=23% Similarity=0.408 Sum_probs=35.4
Q ss_pred cccccCCCCCCCceeE--ec-CCCEEEEEEcccCCCCCeeeeeccCC
Q psy14480 106 RASFIEHNCYPNLYKS--FT-DSGQVLLRAMKPIAPGDHLSICYTDP 149 (190)
Q Consensus 106 ~~s~~NHSC~PN~~~~--~~-~~~~~~~~a~r~I~~GeEltisY~~~ 149 (190)
.+-+.|||=.|||..- +. +..++-++|.|.|.+|||||+.|...
T Consensus 665 k~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrYs 711 (739)
T KOG1079|consen 665 KIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYS 711 (739)
T ss_pred hhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeeccC
Confidence 4556799999998543 22 33788999999999999999999854
No 15
>KOG1081|consensus
Probab=92.70 E-value=0.043 Score=49.85 Aligned_cols=50 Identities=24% Similarity=0.310 Sum_probs=38.6
Q ss_pred EEEcccccc-cCCCCCCCceeEe---cCCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480 101 IAIFDRASF-IEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 101 ~gl~p~~s~-~NHSC~PN~~~~~---~~~~~~~~~a~r~I~~GeEltisY~~~~ 150 (190)
.+-+...+. .||||.||+.-.- .....+.+.|.+.|++|+|+|.+|....
T Consensus 365 ~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~~~ 418 (463)
T KOG1081|consen 365 AGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNGNC 418 (463)
T ss_pred cccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeeccc
Confidence 455566655 5999999986542 2246788999999999999999999654
No 16
>KOG2461|consensus
Probab=76.52 E-value=2.2 Score=38.11 Aligned_cols=27 Identities=19% Similarity=0.453 Sum_probs=24.3
Q ss_pred CCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480 124 DSGQVLLRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 124 ~~~~~~~~a~r~I~~GeEltisY~~~~ 150 (190)
.+..+..+|+|+|++||||.+=|++..
T Consensus 120 ~~~~Ifyrt~r~I~p~eELlVWY~~e~ 146 (396)
T KOG2461|consen 120 IGENIFYRTIRDIRPNEELLVWYGSEY 146 (396)
T ss_pred ccCceEEEecccCCCCCeEEEEeccch
Confidence 467899999999999999999999764
No 17
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=75.88 E-value=3.9 Score=29.46 Aligned_cols=25 Identities=20% Similarity=0.406 Sum_probs=22.3
Q ss_pred CEEEEEEcccCCCCCeeeeeccCCC
Q psy14480 126 GQVLLRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 126 ~~~~~~a~r~I~~GeEltisY~~~~ 150 (190)
..+.+.-.++|+.|++++++|.++.
T Consensus 75 ktVTLTL~~~V~~Gq~VTVsYt~ps 99 (101)
T TIGR02059 75 TTITLTLAQVVEDGDEVTLSYTKNS 99 (101)
T ss_pred cEEEEEecccccCCCEEEEEeeCCC
Confidence 4888988999999999999998764
No 18
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=69.66 E-value=3.1 Score=26.34 Aligned_cols=17 Identities=18% Similarity=0.296 Sum_probs=12.2
Q ss_pred EEEEcccCCCCCeeeee
Q psy14480 129 LLRAMKPIAPGDHLSIC 145 (190)
Q Consensus 129 ~~~a~r~I~~GeEltis 145 (190)
+++|.++|++|+.|+-.
T Consensus 3 vvVA~~di~~G~~i~~~ 19 (63)
T PF08666_consen 3 VVVAARDIPAGTVITAE 19 (63)
T ss_dssp EEEESSTB-TT-BECTT
T ss_pred EEEEeCccCCCCEEccC
Confidence 57899999999998543
No 19
>KOG2155|consensus
Probab=60.23 E-value=9.7 Score=34.68 Aligned_cols=61 Identities=18% Similarity=0.248 Sum_probs=46.0
Q ss_pred EcccccccCCCCCCCceeE---ecCC--CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccC
Q psy14480 103 IFDRASFIEHNCYPNLYKS---FTDS--GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTS 163 (190)
Q Consensus 103 l~p~~s~~NHSC~PN~~~~---~~~~--~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~ 163 (190)
+.-.+|++.||=.||..+. |-+. .--.+..++++..|||+|-.+.........|...|+-.
T Consensus 201 MDefGsrvrHsdePnf~~aPf~fmPq~vaYsimwp~k~~~tgeE~trDfasg~~~p~~Rk~~l~pW 266 (631)
T KOG2155|consen 201 MDEFGSRVRHSDEPNFRIAPFMFMPQNVAYSIMWPTKPVNTGEEITRDFASGVIHPEWRKYILQPW 266 (631)
T ss_pred HhhhhhhhccCCCCcceeeeheecchhcceeEEeeccCCCCchHHHHHHhhcCCCHHHHHHHhccc
Confidence 3366899999999998664 3222 23457789999999999999887776677787777643
No 20
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=45.90 E-value=15 Score=23.03 Aligned_cols=16 Identities=13% Similarity=0.407 Sum_probs=13.8
Q ss_pred EEEEcccCCCCCeeee
Q psy14480 129 LLRAMKPIAPGDHLSI 144 (190)
Q Consensus 129 ~~~a~r~I~~GeElti 144 (190)
.++|.++|++|+.|+-
T Consensus 3 v~va~~~i~~G~~i~~ 18 (64)
T smart00858 3 VVVAARDLPAGEVITA 18 (64)
T ss_pred EEEEeCccCCCCCcch
Confidence 5778899999999875
No 21
>KOG2084|consensus
Probab=40.03 E-value=0.21 Score=44.28 Aligned_cols=80 Identities=25% Similarity=0.401 Sum_probs=62.5
Q ss_pred ceEEEcccccccC--CCCCCCceeEecCC---CEEEEEEcccCCCCCeeeeeccCCCCCHHH---HHHHhccCCCeeeec
Q psy14480 99 SYIAIFDRASFIE--HNCYPNLYKSFTDS---GQVLLRAMKPIAPGDHLSICYTDPLWGTIN---RRHHLQTSKYFICQC 170 (190)
Q Consensus 99 ~~~gl~p~~s~~N--HSC~PN~~~~~~~~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~---R~~~L~~~~~F~C~C 170 (190)
...++++..+.++ |+|.||........ ....+.+.+++.+|.+.++.|....++... +...+...+.+.|.|
T Consensus 365 ~~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (482)
T KOG2084|consen 365 KLHLLFILGSLLGAFLSCSPNAELERLLNLFECRELLKALRDVKPGEEPLIAYLDYELGKLARELREKVLAEDALKDCKC 444 (482)
T ss_pred HHHHHHHHHHHHhhhhccchhhHHHHHHHhhhhhHHHHHHHhhcchhhhhhHHHHHHHHHHHHHHHHHHhhhcCcchhHH
Confidence 4567788888888 99999998766312 256788899999999999999988765443 445677789999999
Q ss_pred CCCCCCCC
Q psy14480 171 ERCRDPTE 178 (190)
Q Consensus 171 ~~C~~~~e 178 (190)
..|.....
T Consensus 445 ~~~~~~~~ 452 (482)
T KOG2084|consen 445 IMCLARAE 452 (482)
T ss_pred HHHHHHHH
Confidence 99976543
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=24.71 E-value=29 Score=19.93 Aligned_cols=15 Identities=27% Similarity=0.288 Sum_probs=10.2
Q ss_pred cCCCCCCCceeEecC
Q psy14480 110 IEHNCYPNLYKSFTD 124 (190)
Q Consensus 110 ~NHSC~PN~~~~~~~ 124 (190)
-||.|.||+.=...+
T Consensus 4 ~~~~C~~nA~C~~~~ 18 (36)
T PF12947_consen 4 NNGGCHPNATCTNTG 18 (36)
T ss_dssp GGGGS-TTCEEEE-T
T ss_pred CCCCCCCCcEeecCC
Confidence 489999999877653
No 23
>COG4118 Phd Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]
Probab=23.63 E-value=29 Score=24.08 Aligned_cols=16 Identities=13% Similarity=0.385 Sum_probs=12.4
Q ss_pred cccCCCCCeeeeeccC
Q psy14480 133 MKPIAPGDHLSICYTD 148 (190)
Q Consensus 133 ~r~I~~GeEltisY~~ 148 (190)
++.+++||||.|+=-.
T Consensus 18 L~rV~aGEev~IT~~G 33 (84)
T COG4118 18 LRRVRAGEEVIITKRG 33 (84)
T ss_pred HHHHhCCCEEEEeeCC
Confidence 3568999999998654
No 24
>PF06718 DUF1203: Protein of unknown function (DUF1203); InterPro: IPR009593 This family consists of several hypothetical bacterial proteins of around 155 residues in length. Family members are present in Rhizobium, Agrobacterium and Streptomyces species.
Probab=23.30 E-value=75 Score=23.46 Aligned_cols=20 Identities=20% Similarity=0.468 Sum_probs=16.0
Q ss_pred EEcccCCCCCe-eeeeccCCC
Q psy14480 131 RAMKPIAPGDH-LSICYTDPL 150 (190)
Q Consensus 131 ~a~r~I~~GeE-ltisY~~~~ 150 (190)
..++++++||+ |.+||.+..
T Consensus 6 ~cL~~~~~Ge~~lLlsy~p~~ 26 (117)
T PF06718_consen 6 HCLRDAEPGEELLLLSYRPFP 26 (117)
T ss_pred EecccCCCCCeEEEEecCCCC
Confidence 46899999998 568998764
No 25
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=22.68 E-value=62 Score=23.29 Aligned_cols=21 Identities=24% Similarity=0.370 Sum_probs=18.3
Q ss_pred EEEcccCCCCCeeeeeccCCC
Q psy14480 130 LRAMKPIAPGDHLSICYTDPL 150 (190)
Q Consensus 130 ~~a~r~I~~GeEltisY~~~~ 150 (190)
+.+...|+.|++|+|.|+...
T Consensus 43 aKpS~~VK~GD~l~i~~~~~~ 63 (100)
T COG1188 43 AKPSKEVKVGDILTIRFGNKE 63 (100)
T ss_pred cccccccCCCCEEEEEeCCcE
Confidence 488899999999999999753
No 26
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=22.27 E-value=40 Score=21.05 Aligned_cols=22 Identities=36% Similarity=0.907 Sum_probs=16.5
Q ss_pred ecCCCCCCCCCCccccccccCC
Q psy14480 169 QCERCRDPTELNTFYDGVKCPE 190 (190)
Q Consensus 169 ~C~~C~~~~e~~~~~~~~~C~~ 190 (190)
.|.+|...-++.....+++||.
T Consensus 8 ~C~~Cg~~~~~~~~~~~irCp~ 29 (49)
T COG1996 8 KCARCGREVELDQETRGIRCPY 29 (49)
T ss_pred EhhhcCCeeehhhccCceeCCC
Confidence 5888888777666777888873
No 27
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=21.75 E-value=53 Score=19.64 Aligned_cols=21 Identities=43% Similarity=1.110 Sum_probs=11.9
Q ss_pred ecCCCCCCCCCCccccccccC
Q psy14480 169 QCERCRDPTELNTFYDGVKCP 189 (190)
Q Consensus 169 ~C~~C~~~~e~~~~~~~~~C~ 189 (190)
.|+.|-..-+.+.....++||
T Consensus 5 ~C~~CG~~~~~~~~~~~~~Cp 25 (46)
T PRK00398 5 KCARCGREVELDEYGTGVRCP 25 (46)
T ss_pred ECCCCCCEEEECCCCCceECC
Confidence 466666655444444466665
No 28
>PF14353 CpXC: CpXC protein
Probab=20.51 E-value=1.5e+02 Score=21.56 Aligned_cols=33 Identities=12% Similarity=0.158 Sum_probs=23.5
Q ss_pred eccCCCCCHHHHHHHhccCCCeeeecCCCCCCCC
Q psy14480 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178 (190)
Q Consensus 145 sY~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~~e 178 (190)
+.++.......|.+.| +.-.|.-.|+.|.....
T Consensus 17 ~~I~~~~~p~l~e~il-~g~l~~~~CP~Cg~~~~ 49 (128)
T PF14353_consen 17 TSINADEDPELKEKIL-DGSLFSFTCPSCGHKFR 49 (128)
T ss_pred eEEcCcCCHHHHHHHH-cCCcCEEECCCCCCcee
Confidence 3445555666777777 56789999999986543
Done!