Query psy3274
Match_columns 115
No_of_seqs 96 out of 98
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 20:52:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3274.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3274hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13867 SAP30_Sin3_bdg: Sin3 99.9 7.6E-26 1.6E-30 144.3 6.1 53 46-98 1-53 (53)
2 PF13866 zf-SAP30: SAP30 zinc- 99.4 1.5E-14 3.2E-19 99.9 0.6 29 2-30 50-78 (78)
3 PF10281 Ish1: Putative stress 97.3 0.00071 1.5E-08 40.0 4.7 37 43-79 1-38 (38)
4 PF02037 SAP: SAP domain; Int 94.5 0.092 2E-06 30.5 4.0 35 43-79 1-35 (35)
5 smart00513 SAP Putative DNA-bi 90.4 0.92 2E-05 26.0 4.5 34 43-78 1-34 (35)
6 PF07498 Rho_N: Rho terminatio 87.8 0.84 1.8E-05 27.6 3.2 37 42-78 1-37 (43)
7 PF12949 HeH: HeH/LEM domain; 85.0 0.89 1.9E-05 27.0 2.2 33 44-76 2-34 (35)
8 TIGR00578 ku70 ATP-dependent D 83.1 1.7 3.6E-05 38.9 4.1 37 42-80 548-584 (584)
9 PF09124 Endonuc-dimeris: T4 r 61.9 15 0.00032 23.9 3.5 39 42-80 12-50 (54)
10 COG5126 FRQ1 Ca2+-binding prot 55.3 33 0.0007 26.4 5.0 48 40-89 8-58 (160)
11 smart00540 LEM in nuclear memb 55.1 31 0.00067 21.4 4.0 37 42-78 2-40 (44)
12 KOG3093|consensus 52.7 9.7 0.00021 30.7 1.8 46 37-89 133-191 (200)
13 TIGR00988 hip integration host 49.5 31 0.00067 23.1 3.7 34 66-99 1-35 (94)
14 PF02758 PYRIN: PAAD/DAPIN/Pyr 46.4 35 0.00075 22.7 3.5 40 43-82 9-59 (83)
15 PF15008 DUF4518: Domain of un 46.1 43 0.00093 27.8 4.7 40 46-85 61-100 (262)
16 PF05193 Peptidase_M16_C: Pept 45.9 30 0.00066 23.3 3.2 40 46-85 2-47 (184)
17 PF08006 DUF1700: Protein of u 35.9 40 0.00086 25.1 2.8 40 49-93 8-50 (181)
18 PRK00199 ihfB integration host 34.8 72 0.0016 21.4 3.7 34 66-99 1-35 (94)
19 PF11923 DUF3441: Domain of un 32.9 43 0.00093 24.2 2.5 31 47-79 37-68 (112)
20 PLN02365 2-oxoglutarate-depend 31.5 56 0.0012 26.5 3.2 53 37-89 5-63 (300)
21 PF13720 Acetyltransf_11: Udp 30.7 80 0.0017 21.3 3.4 48 46-95 28-75 (83)
22 PF01204 Trehalase: Trehalase; 28.6 52 0.0011 29.0 2.7 51 47-97 18-76 (512)
23 PHA00426 type II holin 27.5 62 0.0013 22.0 2.3 29 70-98 20-48 (67)
24 PF11116 DUF2624: Protein of u 26.9 92 0.002 21.9 3.2 37 43-84 12-48 (85)
25 PLN02984 oxidoreductase, 2OG-F 25.4 79 0.0017 26.4 3.1 53 37-90 38-95 (341)
26 PHA02564 V virion protein; Pro 25.3 2E+02 0.0043 21.7 5.0 25 69-93 111-136 (141)
27 PRK12461 UDP-N-acetylglucosami 24.8 1.2E+02 0.0026 24.2 4.0 48 46-95 200-247 (255)
28 PF08564 CDC37_C: Cdc37 C term 24.1 81 0.0017 22.5 2.5 50 44-94 7-59 (99)
29 PF10746 Phage_holin_6: Phage 24.1 86 0.0019 21.3 2.5 27 71-97 20-46 (66)
30 PF09106 SelB-wing_2: Elongati 23.8 84 0.0018 19.4 2.3 39 49-92 3-42 (59)
31 PF00797 Acetyltransf_2: N-ace 23.1 59 0.0013 24.7 1.8 23 66-89 218-240 (240)
32 cd01104 HTH_MlrA-CarA Helix-Tu 22.3 94 0.002 18.9 2.3 18 45-62 11-28 (68)
33 KOG1946|consensus 21.3 95 0.0021 25.4 2.8 48 49-96 126-192 (240)
34 smart00027 EH Eps15 homology d 21.2 2.5E+02 0.0055 18.4 5.4 32 46-77 4-38 (96)
35 PF04501 Baculo_VP39: Baculovi 20.8 38 0.00083 28.8 0.4 60 5-64 44-110 (300)
36 COG2971 Predicted N-acetylgluc 20.6 84 0.0018 26.7 2.4 34 68-105 175-208 (301)
37 PF12794 MscS_TM: Mechanosensi 20.6 38 0.00081 28.2 0.3 16 37-52 291-306 (340)
38 PF01436 NHL: NHL repeat; Int 20.2 28 0.0006 18.7 -0.4 14 4-17 13-26 (28)
No 1
>PF13867 SAP30_Sin3_bdg: Sin3 binding region of histone deacetylase complex subunit SAP30; PDB: 2LD7_A.
Probab=99.92 E-value=7.6e-26 Score=144.28 Aligned_cols=53 Identities=51% Similarity=0.799 Sum_probs=41.2
Q ss_pred CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHHHHh
Q psy3274 46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFYTVK 98 (115)
Q Consensus 46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy~VK 98 (115)
|+++|||||+++|+|+..|+++|+|||++|++||+++||+|+|||++|||+||
T Consensus 1 L~~~tLrrY~~~~~l~~~~~~sK~qLa~~V~kHF~s~~v~E~evI~~Fly~vk 53 (53)
T PF13867_consen 1 LQTPTLRRYKKHYKLPERPRSSKEQLANAVRKHFNSQPVDENEVIANFLYKVK 53 (53)
T ss_dssp S-HHHHHHHHHHTT----SS--HHHHHHHHHHHHTT----HHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhC
Confidence 68999999999999999999999999999999999999999999999999997
No 2
>PF13866 zf-SAP30: SAP30 zinc-finger; PDB: 2KDP_A 2LD7_A.
Probab=99.45 E-value=1.5e-14 Score=99.87 Aligned_cols=29 Identities=79% Similarity=1.297 Sum_probs=18.1
Q ss_pred CccceeecccchhhhHHhhhhccCCCCCC
Q psy3274 2 KARHIYICEYHKCLIQSVRNKRRRKDSED 30 (115)
Q Consensus 2 ~a~h~yicd~hk~~iqs~r~~r~~~~s~~ 30 (115)
+|+|||||||||++|||||++|+|++|||
T Consensus 50 sa~H~YiCd~Hk~~Iqs~R~kr~rk~~ed 78 (78)
T PF13866_consen 50 SARHIYICDFHKNMIQSVRNKRKRKDSED 78 (78)
T ss_dssp T-S-----HHHHHHHHTTSSS-S-SS--S
T ss_pred ccchhhHHHHHHHHHhcccccccccccCC
Confidence 68999999999999999999999999986
No 3
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=97.29 E-value=0.00071 Score=40.00 Aligned_cols=37 Identities=11% Similarity=0.155 Sum_probs=34.2
Q ss_pred cccCCHHHHHHHHHhcCCCCCCCC-CHHHHHHHHHhhh
Q psy3274 43 LFQLQVNTLRRYKKYFKVTSRPGL-NKAQLAETLLKHF 79 (115)
Q Consensus 43 f~kL~~~tLrRYkr~f~L~~~p~~-sK~QLa~~V~kHF 79 (115)
|+..+.+-|+.|-..+++.+++.. +|+||++.|++|+
T Consensus 1 fdtWs~~~L~~wL~~~gi~~~~~~~~rd~Ll~~~k~~y 38 (38)
T PF10281_consen 1 FDTWSDSDLKSWLKSHGIPVPKSAKTRDELLKLAKKNY 38 (38)
T ss_pred CCCCCHHHHHHHHHHcCCCCCCCCCCHHHHHHHHHHhC
Confidence 677899999999999999999998 9999999999874
No 4
>PF02037 SAP: SAP domain; InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis []. In yeast, SAP is found in the most distal N-terminal region of E3 SUMO-protein ligase SIZ1, where it is involved in nuclear localization [].; GO: 0003676 nucleic acid binding; PDB: 2RNN_A 1JEQ_A 2KW9_A 2KVU_A 2DO1_A 1ZBU_B 1ZBH_A 2DO5_A 2RNO_A 1H1J_S ....
Probab=94.47 E-value=0.092 Score=30.53 Aligned_cols=35 Identities=31% Similarity=0.333 Sum_probs=29.5
Q ss_pred cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhh
Q psy3274 43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHF 79 (115)
Q Consensus 43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF 79 (115)
|+.|.++.|+.+.+..+|++ +-+|++|++-+..||
T Consensus 1 l~~l~v~eLk~~l~~~gL~~--~G~K~~Li~Rl~~~l 35 (35)
T PF02037_consen 1 LSKLTVAELKEELKERGLST--SGKKAELIERLKEHL 35 (35)
T ss_dssp TTTSHHHHHHHHHHHTTS-S--TSSHHHHHHHHHHHH
T ss_pred CCcCcHHHHHHHHHHCCCCC--CCCHHHHHHHHHHhC
Confidence 46788999999999999954 455999999999986
No 5
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
Probab=90.36 E-value=0.92 Score=25.95 Aligned_cols=34 Identities=29% Similarity=0.343 Sum_probs=29.0
Q ss_pred cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhh
Q psy3274 43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKH 78 (115)
Q Consensus 43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kH 78 (115)
++.+..+.|+...+.++|++ .-+|++|+.-+.+|
T Consensus 1 ~~~l~~~~Lk~~l~~~gl~~--~G~K~~Lv~Rl~~~ 34 (35)
T smart00513 1 LAKLKVSELKDELKKRGLST--SGTKAELVDRLLEA 34 (35)
T ss_pred CCcCcHHHHHHHHHHcCCCC--CCCHHHHHHHHHHh
Confidence 45788999999999999975 44699999998876
No 6
>PF07498 Rho_N: Rho termination factor, N-terminal domain; InterPro: IPR011112 The Rho termination factor disengages newly transcribed RNA from its DNA template at certain, specific transcripts. It is thought that two copies of Rho bind to RNA and that Rho functions as a hexamer of protomers []. This domain is found to the N terminus of the RNA binding domain (IPR011113 from INTERPRO).; GO: 0006353 transcription termination, DNA-dependent; PDB: 1A8V_B 1PVO_A 1PV4_D 3ICE_A 1XPU_C 1XPO_D 1XPR_F 2A8V_B 2HT1_B 1A63_A ....
Probab=87.76 E-value=0.84 Score=27.56 Aligned_cols=37 Identities=22% Similarity=0.327 Sum_probs=29.9
Q ss_pred CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhh
Q psy3274 42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKH 78 (115)
Q Consensus 42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kH 78 (115)
||.+++..-|+.+-+.+++.-...+.|.||+.+|.+.
T Consensus 1 eL~~~~~~eL~~iAk~lgI~~~~~~~K~eLI~~Il~~ 37 (43)
T PF07498_consen 1 ELKSMTLSELREIAKELGIEGYSKMRKQELIFAILKA 37 (43)
T ss_dssp HHHCS-HHHHHHHHHCTT-TTGCCS-HHHHHHHHHHH
T ss_pred CcccCCHHHHHHHHHHcCCCCCCcCCHHHHHHHHHHH
Confidence 4678899999999999999878889999999988765
No 7
>PF12949 HeH: HeH/LEM domain; PDB: 2OUT_A.
Probab=85.00 E-value=0.89 Score=27.04 Aligned_cols=33 Identities=21% Similarity=0.184 Sum_probs=22.2
Q ss_pred ccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHH
Q psy3274 44 FQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLL 76 (115)
Q Consensus 44 ~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~ 76 (115)
..|.++-||+.-..+++..+.+..|++|+..+.
T Consensus 2 ~sltV~~Lk~iL~~~~I~~ps~AkKaeLv~L~~ 34 (35)
T PF12949_consen 2 KSLTVAQLKRILDEHGIEFPSNAKKAELVALFN 34 (35)
T ss_dssp TT--SHHHHHHHHHHT---SSS--SHHHHHH--
T ss_pred CcCcHHHHHHHHHHcCCCCCCCCCHHHHHHHHc
Confidence 468899999999999999999999999998764
No 8
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=83.11 E-value=1.7 Score=38.88 Aligned_cols=37 Identities=32% Similarity=0.559 Sum_probs=32.2
Q ss_pred CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc
Q psy3274 42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK 80 (115)
Q Consensus 42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~ 80 (115)
.|++|+++.|+-|.+.++|. .+..|++|++.|.+||.
T Consensus 548 ~l~kltv~~Lk~~l~~~g~~--~~~kKadLi~~i~~~~~ 584 (584)
T TIGR00578 548 TLGKLTVSVLKDFCRAYGLR--SGSKKQELLDALTKHFK 584 (584)
T ss_pred ChhhccHHHHHHHHHHcCCC--ccccHHHHHHHHHHHhC
Confidence 67889999999999999876 44569999999999983
No 9
>PF09124 Endonuc-dimeris: T4 recombination endonuclease VII, dimerisation; InterPro: IPR015208 This entry represents a dimerisation domain predominantly found in Bacteriophage T4 recombination endonuclease VII. It adopts a helical secondary structure, with three alpha helices oriented parallel to each other. As well as mediating dimerisation of the protein, this domain is also involved in binding to the DNA major groove []. ; PDB: 1EN7_B 1E7L_B 2QNF_A 2QNC_A 1E7D_A.
Probab=61.87 E-value=15 Score=23.94 Aligned_cols=39 Identities=18% Similarity=0.095 Sum_probs=28.0
Q ss_pred CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc
Q psy3274 42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK 80 (115)
Q Consensus 42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~ 80 (115)
-|+.|+......=...+++..+++.+|+||+..-++-|.
T Consensus 12 ~FSRl~k~eMiaem~~~G~~y~~~~tK~~Lvk~fkKql~ 50 (54)
T PF09124_consen 12 WFSRLTKPEMIAEMDSYGFEYNEKDTKAQLVKIFKKQLK 50 (54)
T ss_dssp HHHTS-HHHHHHHHHHTT----TTS-HHHHHHHHHHHHH
T ss_pred HHHhcCHHHHHHHHHHhCCcCCccccHHHHHHHHHHHHH
Confidence 477888888888888999999999999999998887664
No 10
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=55.30 E-value=33 Score=26.41 Aligned_cols=48 Identities=15% Similarity=0.202 Sum_probs=37.6
Q ss_pred ccCcccCCHHHHHHHHHhcCCCC---CCCCCHHHHHHHHHhhhcCCCCChHHH
Q psy3274 40 DVDLFQLQVNTLRRYKKYFKVTS---RPGLNKAQLAETLLKHFKTIPVVEKDV 89 (115)
Q Consensus 40 ~vDf~kL~~~tLrRYkr~f~L~~---~p~~sK~QLa~~V~kHF~s~pV~E~ev 89 (115)
.+-|.+|+.+-+.++++.|.|-- .-.++|.+|..+++ |-.+...|.++
T Consensus 8 ~~~~~~~t~~qi~~lkeaF~l~D~d~~G~I~~~el~~ilr--~lg~~~s~~ei 58 (160)
T COG5126 8 LLTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILR--SLGFNPSEAEI 58 (160)
T ss_pred hhhcccCCHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHH--HcCCCCcHHHH
Confidence 34678899999999999999933 34589999999887 66666666554
No 11
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=55.06 E-value=31 Score=21.35 Aligned_cols=37 Identities=22% Similarity=0.240 Sum_probs=29.8
Q ss_pred CcccCCHHHHHHHHHhcCCCCCC--CCCHHHHHHHHHhh
Q psy3274 42 DLFQLQVNTLRRYKKYFKVTSRP--GLNKAQLAETLLKH 78 (115)
Q Consensus 42 Df~kL~~~tLrRYkr~f~L~~~p--~~sK~QLa~~V~kH 78 (115)
|+..|+++-|++=...|+++..| ..||.-+..-+.++
T Consensus 2 d~~~LSd~eL~~~L~~~G~~~gPIt~sTR~vy~kkL~~~ 40 (44)
T smart00540 2 DVDRLSDAELRAELKQYGLPPGPITDTTRKLYEKKLRKL 40 (44)
T ss_pred chhHcCHHHHHHHHHHcCCCCCCcCcchHHHHHHHHHHH
Confidence 78899999999999999997776 57787666655554
No 12
>KOG3093|consensus
Probab=52.68 E-value=9.7 Score=30.67 Aligned_cols=46 Identities=33% Similarity=0.361 Sum_probs=33.4
Q ss_pred CCCccCcccC----------CHHHHHHHHHhcCCCCCC--CCC-HHHHHHHHHhhhcCCCCChHHH
Q psy3274 37 ESPDVDLFQL----------QVNTLRRYKKYFKVTSRP--GLN-KAQLAETLLKHFKTIPVVEKDV 89 (115)
Q Consensus 37 ~~p~vDf~kL----------~~~tLrRYkr~f~L~~~p--~~s-K~QLa~~V~kHF~s~pV~E~ev 89 (115)
..|.|-|+.. =+++|.||..||.-+-+- ++. |||++. .+||.|+|+
T Consensus 133 ivPGvAFd~~g~RlGhGkGYYD~flkry~~~~~~~kp~~vgL~l~EQI~~-------~IPv~~~Dv 191 (200)
T KOG3093|consen 133 IVPGVAFDRKGARLGHGKGYYDDFLKRYQIHAPEQKPLLVGLCLKEQILS-------EIPVEEHDV 191 (200)
T ss_pred EecccccchhhhhccCCcchHHHHHHHHHHhccccCchhhhhhhhHhhcc-------cCCCCccce
Confidence 3588888753 378999999998432221 344 899887 799999886
No 13
>TIGR00988 hip integration host factor, beta subunit. This protein forms a site-specific DNA-binding heterodimer with the homologous integration host factor alpha subunit. It is closely related to the DNA-binding protein HU.
Probab=49.53 E-value=31 Score=23.06 Aligned_cols=34 Identities=18% Similarity=0.338 Sum_probs=25.0
Q ss_pred CCHHHHHHHHHhhhcCCCCChH-HHHHHHHHHHhh
Q psy3274 66 LNKAQLAETLLKHFKTIPVVEK-DVLSFFFYTVKS 99 (115)
Q Consensus 66 ~sK~QLa~~V~kHF~s~pV~E~-evI~~Fiy~VK~ 99 (115)
++|.||++.|.++...++-.+. .+|..|+-.+..
T Consensus 1 m~k~eli~~i~~~~~~~s~~~v~~vv~~~~~~i~~ 35 (94)
T TIGR00988 1 MTKSELIERIATQQSHLPAKDVEDAVKTMLEHMAS 35 (94)
T ss_pred CCHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH
Confidence 5899999999988766554443 677777776644
No 14
>PF02758 PYRIN: PAAD/DAPIN/Pyrin domain; InterPro: IPR004020 Pyrin domain was identified as putative protein-protein interaction domain at the N-terminal region of several proteins thought to function in apoptotic and inflammatory signalling pathways. Using secondary structure prediction and potential-based fold recognition methods, the PYRIN domain is predicted to be a member of the six-helix bundle death domain-fold superfamily that includes death domains (DDs), death effector domains (DEDs), and caspase recruitment domains (CARDs). Members of the death domain-fold superfamily are well established mediators of protein-protein interactions found in many proteins involved in apoptosis and inflammation, indicating further that the PYRIN domains serve a similar function. Comparison of a circular dichroism spectrum of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1 with spectra of several proteins known to adopt the death domain-fold provides experimental support for the structure prediction [] It is found in interferon-inducible proteins, pyrin and myeloid cell nuclear differentiation antigen.; PDB: 2DO9_A 2YU0_A 2KN6_A 1UCP_A 2L6A_A 2KM6_A 1PN5_A 2DBG_A 3QF2_B 2HM2_Q.
Probab=46.37 E-value=35 Score=22.70 Aligned_cols=40 Identities=23% Similarity=0.420 Sum_probs=29.4
Q ss_pred cccCCHHHHHHHHHhcCCCCCC-----------CCCHHHHHHHHHhhhcCC
Q psy3274 43 LFQLQVNTLRRYKKYFKVTSRP-----------GLNKAQLAETLLKHFKTI 82 (115)
Q Consensus 43 f~kL~~~tLrRYkr~f~L~~~p-----------~~sK~QLa~~V~kHF~s~ 82 (115)
|..|+..-|++.|-+......+ +.++.+||+.+-+||...
T Consensus 9 Le~L~~~efk~FK~~L~~~~~~~~~~Ip~~~le~ad~~~la~lLv~~y~~~ 59 (83)
T PF02758_consen 9 LEELSEEEFKRFKWLLKEPVKEGFPPIPRGELEKADREDLADLLVQHYGEQ 59 (83)
T ss_dssp HHTS-HHHHHHHHHHHHSTSSTTTCSSSHCHHHHSSHHHHHHHHHHHTCHH
T ss_pred HHhCCHHHHHHHHHHhcchhhcCCCCCCHHHHhhCCHHHHHHHHHHHcCHH
Confidence 5678899999999887422211 367899999999999754
No 15
>PF15008 DUF4518: Domain of unknown function (DUF4518)
Probab=46.15 E-value=43 Score=27.80 Aligned_cols=40 Identities=20% Similarity=0.211 Sum_probs=35.3
Q ss_pred CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCC
Q psy3274 46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVV 85 (115)
Q Consensus 46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~ 85 (115)
..-+.|=+|--..++.++|+.+|.+|+..+.++..+++..
T Consensus 61 V~RelLFkYLa~kgv~v~p~s~K~~LI~~~l~~W~~~~~~ 100 (262)
T PF15008_consen 61 VKRELLFKYLASKGVIVPPDSEKADLIQRVLQYWSSQLQQ 100 (262)
T ss_pred ccHHHHHHHHHHcCCCCCCCccHHHHHHHHHHHHhccccc
Confidence 5667788899999999999999999999999999887654
No 16
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=45.89 E-value=30 Score=23.34 Aligned_cols=40 Identities=23% Similarity=0.378 Sum_probs=29.5
Q ss_pred CCHHHHHHHHHhcCCCC------CCCCCHHHHHHHHHhhhcCCCCC
Q psy3274 46 LQVNTLRRYKKYFKVTS------RPGLNKAQLAETLLKHFKTIPVV 85 (115)
Q Consensus 46 L~~~tLrRYkr~f~L~~------~p~~sK~QLa~~V~kHF~s~pV~ 85 (115)
++.+.|+.|.+.|--+. --+++-+++...|.++|.++|..
T Consensus 2 it~e~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~~~~l~~~ 47 (184)
T PF05193_consen 2 ITLEDLRAFYKKFYRPSNMTLVIVGDIDPDELEKLIEKYFGSLPKS 47 (184)
T ss_dssp --HHHHHHHHHHHSSGGGEEEEEEESSGHHHHHHHHHHHHTTSSHS
T ss_pred CCHHHHHHHHHHhcCccceEEEEEcCccHHHHHHHHHhhhhhhccc
Confidence 56788899887665432 12588999999999999998854
No 17
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=35.86 E-value=40 Score=25.09 Aligned_cols=40 Identities=20% Similarity=0.346 Sum_probs=25.3
Q ss_pred HHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcC---CCCChHHHHHHH
Q psy3274 49 NTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKT---IPVVEKDVLSFF 93 (115)
Q Consensus 49 ~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s---~pV~E~evI~~F 93 (115)
+.|++|-+. | |...+++..+--..||.. -..+|.|++..+
T Consensus 8 ~~L~~~L~~--l---p~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~L 50 (181)
T PF08006_consen 8 NELEKYLKK--L---PEEEREEILEYYEEYFDDAGEEGKSEEEIIAEL 50 (181)
T ss_pred HHHHHHHHc--C---CHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHc
Confidence 345555542 2 444577788888888874 345688887764
No 18
>PRK00199 ihfB integration host factor subunit beta; Reviewed
Probab=34.76 E-value=72 Score=21.36 Aligned_cols=34 Identities=12% Similarity=0.200 Sum_probs=22.9
Q ss_pred CCHHHHHHHHHhhhcCCCCChH-HHHHHHHHHHhh
Q psy3274 66 LNKAQLAETLLKHFKTIPVVEK-DVLSFFFYTVKS 99 (115)
Q Consensus 66 ~sK~QLa~~V~kHF~s~pV~E~-evI~~Fiy~VK~ 99 (115)
+||.||++.|...-..++-.+. .++..|+..+..
T Consensus 1 mtk~eli~~ia~~~~~~s~~~~~~vv~~~~~~i~~ 35 (94)
T PRK00199 1 MTKSELIERLAARNPHLSAKDVENAVKEILEEMSD 35 (94)
T ss_pred CCHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH
Confidence 5899999999876544443333 667777766543
No 19
>PF11923 DUF3441: Domain of unknown function (DUF3441); InterPro: IPR021846 This presumed domain is functionally uncharacterised. This domain is found in archaea and eukaryotes. This domain is typically between 104 to 119 amino acids in length. This domain is found associated with PF05833 from PFAM, PF05670 from PFAM. This domain has two conserved residues (P and G) that may be functionally important.
Probab=32.93 E-value=43 Score=24.17 Aligned_cols=31 Identities=32% Similarity=0.552 Sum_probs=24.8
Q ss_pred CHHHHHHHHHhcCCCCCCCCC-HHHHHHHHHhhh
Q psy3274 47 QVNTLRRYKKYFKVTSRPGLN-KAQLAETLLKHF 79 (115)
Q Consensus 47 ~~~tLrRYkr~f~L~~~p~~s-K~QLa~~V~kHF 79 (115)
+-.+|.+|+.+-+| -||.. |-..|..+-.||
T Consensus 37 P~sal~~yKYkvKl--~PG~~KKGKaak~il~~f 68 (112)
T PF11923_consen 37 PYSALSKYKYKVKL--QPGNAKKGKAAKEILEYF 68 (112)
T ss_pred CHHHHhhCceeEEE--cCCCcchHHHHHHHHHHH
Confidence 35699999888777 45655 679999999999
No 20
>PLN02365 2-oxoglutarate-dependent dioxygenase
Probab=31.51 E-value=56 Score=26.49 Aligned_cols=53 Identities=21% Similarity=0.343 Sum_probs=37.6
Q ss_pred CCCccCcccC--CHHHHHHHHHhcCC--CCCCCCCHHHHHH--HHHhhhcCCCCChHHH
Q psy3274 37 ESPDVDLFQL--QVNTLRRYKKYFKV--TSRPGLNKAQLAE--TLLKHFKTIPVVEKDV 89 (115)
Q Consensus 37 ~~p~vDf~kL--~~~tLrRYkr~f~L--~~~p~~sK~QLa~--~V~kHF~s~pV~E~ev 89 (115)
..|-|||+.+ ....|+.=++.++. -+..+++.+.+.+ .+.+.|=++|..|+.-
T Consensus 5 ~iPvIDls~~~~~~~~l~~Ac~~~GfF~l~nHGi~~~l~~~~~~~~~~FF~LP~e~K~~ 63 (300)
T PLN02365 5 NIPTIDLEEFPGQIEKLREACERWGCFRVVNHGVSLSLMAEMKKVVRSLFDLPDEVKRR 63 (300)
T ss_pred CCCEEEChhhHHHHHHHHHHHHHCCEEEEEcCCCCHHHHHHHHHHHHHHHcCCHHHHhh
Confidence 5789999987 34667777777776 3567888754433 2346777899999875
No 21
>PF13720 Acetyltransf_11: Udp N-acetylglucosamine O-acyltransferase; Domain 2; PDB: 3I3A_A 3I3X_A 3HSQ_B 2JF2_A 1LXA_A 2AQ9_A 2QIV_X 2QIA_A 2JF3_A 4EQY_F ....
Probab=30.67 E-value=80 Score=21.28 Aligned_cols=48 Identities=15% Similarity=0.265 Sum_probs=36.6
Q ss_pred CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHH
Q psy3274 46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFY 95 (115)
Q Consensus 46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy 95 (115)
++.+++...++.|++=...+.+-+|-++.|..-+.. ..|.+-|..||-
T Consensus 28 fs~~~i~~l~~ayr~l~~~~~~~~~a~~~l~~~~~~--~~~v~~~~~Fi~ 75 (83)
T PF13720_consen 28 FSKEEISALRRAYRILFRSGLTLEEALEELEEEYPD--SPEVREIVDFIR 75 (83)
T ss_dssp S-HHHHHHHHHHHHHHHTSSS-HHHHHHHHHHHTTS--CHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHhccC--CHHHHHHHHHHH
Confidence 677888888888888666778889999999987754 667777778875
No 22
>PF01204 Trehalase: Trehalase; InterPro: IPR001661 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 37 GH37 from CAZY comprises enzymes with only one known activity; trehalase (3.2.1.28 from EC). Trehalase is the enzyme responsible for the degradation of the disaccharide alpha,alpha-trehalose yielding two glucose subunits []. It is an enzyme found in a wide variety of organisms and whose sequence has been highly conserved throughout evolution.; GO: 0004555 alpha,alpha-trehalase activity, 0005991 trehalose metabolic process; PDB: 2JJB_B 2WYN_B 2JG0_A 2JF4_A 3C67_A 3D3I_B 3C69_A 3C68_A 2Z07_B.
Probab=28.56 E-value=52 Score=28.97 Aligned_cols=51 Identities=24% Similarity=0.303 Sum_probs=24.1
Q ss_pred CHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCC--------CCChHHHHHHHHHHH
Q psy3274 47 QVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTI--------PVVEKDVLSFFFYTV 97 (115)
Q Consensus 47 ~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~--------pV~E~evI~~Fiy~V 97 (115)
...+|+.|+..+.-....++++++|..-|.+||..- |.+..+--..|+..+
T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~l~~fv~~~f~~~G~el~~~~~~d~~~~~~~~~~~i 76 (512)
T PF01204_consen 18 PDEILAAFKKLFWNTLTRNISKFDLKQFVEDHFDEPGSELEPWTPPDWKEDPPYFLSRI 76 (512)
T ss_dssp HHHHHHHHHHHHTS------TT--HHHHHHHHEE------------S------------
T ss_pred HHHHHHHHHHhhhhccccCCCHHHHHHHHHHhCcCccccccccCCcccCCCCccccccC
Confidence 567899999998876677889999999999999864 344444334455544
No 23
>PHA00426 type II holin
Probab=27.46 E-value=62 Score=22.01 Aligned_cols=29 Identities=21% Similarity=0.106 Sum_probs=25.6
Q ss_pred HHHHHHHhhhcCCCCChHHHHHHHHHHHh
Q psy3274 70 QLAETLLKHFKTIPVVEKDVLSFFFYTVK 98 (115)
Q Consensus 70 QLa~~V~kHF~s~pV~E~evI~~Fiy~VK 98 (115)
--|+.+.+-|-++..+|==-|+.|+|.|=
T Consensus 20 ~~ad~~~r~F~Glsl~~WfyvAai~YTVv 48 (67)
T PHA00426 20 AGADGAARLFFGLSLNEWFYVAAIAYTVV 48 (67)
T ss_pred chhhHHHHHHhcccHHHHHHHHHHHHHHH
Confidence 35788999999999999999999999873
No 24
>PF11116 DUF2624: Protein of unknown function (DUF2624); InterPro: IPR020277 This entry contains proteins with no known function.
Probab=26.87 E-value=92 Score=21.88 Aligned_cols=37 Identities=22% Similarity=0.395 Sum_probs=24.5
Q ss_pred cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCC
Q psy3274 43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPV 84 (115)
Q Consensus 43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV 84 (115)
+..++.+-|..|.+.|+++. +++| |+.|..+-.+.|+
T Consensus 12 ln~iT~~eLlkyskqy~i~i----t~~Q-A~~I~~~lr~k~i 48 (85)
T PF11116_consen 12 LNNITAKELLKYSKQYNISI----TKKQ-AEQIANILRGKNI 48 (85)
T ss_pred HhcCCHHHHHHHHHHhCCCC----CHHH-HHHHHHHHhcCCC
Confidence 34688999999999999964 3333 3344445555443
No 25
>PLN02984 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=25.42 E-value=79 Score=26.42 Aligned_cols=53 Identities=25% Similarity=0.328 Sum_probs=40.2
Q ss_pred CCCccCcccCCHHHHHHHHHhcCC--CCCCCCCHH---HHHHHHHhhhcCCCCChHHHH
Q psy3274 37 ESPDVDLFQLQVNTLRRYKKYFKV--TSRPGLNKA---QLAETLLKHFKTIPVVEKDVL 90 (115)
Q Consensus 37 ~~p~vDf~kL~~~tLrRYkr~f~L--~~~p~~sK~---QLa~~V~kHF~s~pV~E~evI 90 (115)
..|-|||+.+..+.|++=.+.++. -+..+++.+ ++....+ .|-++|..|+.-+
T Consensus 38 ~IPvIDls~~~~~~l~~A~~~~GFF~v~nHGI~~~li~~~~~~s~-~FF~LP~eeK~k~ 95 (341)
T PLN02984 38 DIPVIDMECLDMEKLREACKDWGIFRLENHGIPLTLMSQLKEISE-SLLSLPFESKREL 95 (341)
T ss_pred CCCeEeCcHHHHHHHHHHHHhCcEEEEECCCCCHHHHHHHHHHHH-HHHcCCHHHHhhh
Confidence 589999999988999999998887 457788753 4444444 4556999998765
No 26
>PHA02564 V virion protein; Provisional
Probab=25.27 E-value=2e+02 Score=21.74 Aligned_cols=25 Identities=16% Similarity=0.263 Sum_probs=17.9
Q ss_pred HHHHHHHHhhhcC-CCCChHHHHHHH
Q psy3274 69 AQLAETLLKHFKT-IPVVEKDVLSFF 93 (115)
Q Consensus 69 ~QLa~~V~kHF~s-~pV~E~evI~~F 93 (115)
..|+.++.+|=.+ +|..+.+|..+|
T Consensus 111 ~~l~~Aii~~EnG~~pys~~~i~~~~ 136 (141)
T PHA02564 111 AALVTAIIRHENGQQPYSAEQIAQGV 136 (141)
T ss_pred HHHHHHHHHHhcCCCCCCHHHHHHHH
Confidence 4677778888777 777776666655
No 27
>PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional
Probab=24.76 E-value=1.2e+02 Score=24.22 Aligned_cols=48 Identities=23% Similarity=0.246 Sum_probs=38.1
Q ss_pred CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHH
Q psy3274 46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFY 95 (115)
Q Consensus 46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy 95 (115)
++.++++..++.|++=.+.+.+.+|.++.++.-+. +-.|.+-|..||-
T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~f~~ 247 (255)
T PRK12461 200 FSSRAIRALKRAYKIIYRSGLSVQQAVAELELQQF--ESPEVEELIDFIK 247 (255)
T ss_pred CCHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhcc--CCHHHHHHHHHHH
Confidence 77888888899999988889999998888887553 4556666777774
No 28
>PF08564 CDC37_C: Cdc37 C terminal domain; InterPro: IPR013873 Cdc37 is a protein required for the activity of numerous eukaryotic protein kinases. This entry corresponds to the C-terminal domain whose function is unclear. It is found C-terminal to the Hsp90 chaperone (heat shock protein 90) binding domain IPR013874 from INTERPRO and the N-terminal kinase binding domain of Cdc37 IPR013855 from INTERPRO []. ; PDB: 1US7_B.
Probab=24.08 E-value=81 Score=22.53 Aligned_cols=50 Identities=18% Similarity=0.189 Sum_probs=21.4
Q ss_pred ccCCHHHHHHHHHhcCCCCCCCCCH---HHHHHHHHhhhcCCCCChHHHHHHHH
Q psy3274 44 FQLQVNTLRRYKKYFKVTSRPGLNK---AQLAETLLKHFKTIPVVEKDVLSFFF 94 (115)
Q Consensus 44 ~kL~~~tLrRYkr~f~L~~~p~~sK---~QLa~~V~kHF~s~pV~E~evI~~Fi 94 (115)
..|+..--++=|.-|. ..+|.+.+ .+=++.|.+-|..+||.|.+-+...+
T Consensus 7 ~~lDp~~~~~~~evFe-slP~emq~Alet~~ld~vnkVl~~M~veeAE~~v~~~ 59 (99)
T PF08564_consen 7 EGLDPEEERKAREVFE-SLPPEMQKALETGDLDEVNKVLGKMPVEEAEYHVERC 59 (99)
T ss_dssp -S--HH------HHHH-HS-TT------------HHHHHT--SSSHHHHHHHHH
T ss_pred CCCCcchhhhHHHHHH-HCCHHHHHHHHcCCHHHHHHHHHhCCHHHHHHHHHHH
Confidence 3455522223334454 55777665 47889999999999999998877654
No 29
>PF10746 Phage_holin_6: Phage holin family 6; InterPro: IPR019682 This entry represents a protein conserved in Caudovirales (known as tailed bacteriophages). Holins are a diverse family of proteins that cause bacterial membrane lysis during late-protein synthesis.
Probab=24.07 E-value=86 Score=21.25 Aligned_cols=27 Identities=22% Similarity=0.194 Sum_probs=24.5
Q ss_pred HHHHHHhhhcCCCCChHHHHHHHHHHH
Q psy3274 71 LAETLLKHFKTIPVVEKDVLSFFFYTV 97 (115)
Q Consensus 71 La~~V~kHF~s~pV~E~evI~~Fiy~V 97 (115)
-|+.+.+-|.+++.+|==.|+.++|.|
T Consensus 20 ~A~~~a~~f~GLslneWfyiati~Ytv 46 (66)
T PF10746_consen 20 GADVVARYFWGLSLNEWFYIATIAYTV 46 (66)
T ss_pred hHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 478889999999999999999999986
No 30
>PF09106 SelB-wing_2: Elongation factor SelB, winged helix ; InterPro: IPR015190 This entry represents a domain with a winged helix-type fold, which consists of a closed 3-helical bundle with a right-handed twist, and a small beta-sheet wing []. Different winged helix domains share a common structure, but can differ in sequence. This entry is designated "type 2". The winged helix motif is involved in both DNA and RNA binding. In the elongation factor SelB, the winged helix domains recognise RNA, allowing the complex to wrap around the small ribosomal subunit. In bacteria, the incorporation of the amino acid selenocysteine into proteins requires elongation factor SelB, which binds both transfer RNA (tRNA) and mRNA. SelB binds to an mRNA hairpin formed by the selenocysteine insertion sequence (SECIS) with extremely high specificity []. ; GO: 0003723 RNA binding, 0003746 translation elongation factor activity, 0005525 GTP binding, 0001514 selenocysteine incorporation, 0005737 cytoplasm; PDB: 2V9V_A 1LVA_A 2PLY_A 2UWM_A.
Probab=23.75 E-value=84 Score=19.42 Aligned_cols=39 Identities=31% Similarity=0.469 Sum_probs=23.5
Q ss_pred HHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc-CCCCChHHHHHH
Q psy3274 49 NTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK-TIPVVEKDVLSF 92 (115)
Q Consensus 49 ~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~-s~pV~E~evI~~ 92 (115)
+.|..|-+.|-+ .+|++|++|.+-+ |. .+|-+..+.+..
T Consensus 3 ~~L~~yH~~~Pl--r~G~~keeLrsrl---~~~~l~~k~~~~ll~ 42 (59)
T PF09106_consen 3 EILAEYHRENPL--RPGMPKEELRSRL---FKPRLPPKLFNALLE 42 (59)
T ss_dssp HHHHHHHHH-TT--SS-EEHHHHHHHC---ST-TS-HCCHHHHHH
T ss_pred HHHHHHHHHCcC--ccCcCHHHHHHHH---hhccCCHHHHHHHHH
Confidence 578888877755 8999999998733 33 445444444433
No 31
>PF00797 Acetyltransf_2: N-acetyltransferase; InterPro: IPR001447 Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30 kDa. It facilitates the transfer of an acetyl group from acetyl coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation of these compounds generally results in inactivation. NAT is found in many species from Mycobacteria (Mycobacterium tuberculosis, Mycobacterium smegmatis etc) to Homo sapiens (Human). It was the first enzyme to be observed to have polymorphic activity amongst human individuals. NAT is responsible for the inactivation of Isoniazid (a drug used to treat tuberculosis) in humans. The NAT protein has also been shown to be involved in the breakdown of folic acid. NAT catalyses the reaction: Acetyl-coA + arylamine = coA + N-acetylarylamine NAT is the target of a common genetic polymorphism of clinical relevance in humans. The N-acetylation polymorphism is determined by low or high NAT activity in liver. NAT has been implicated in the action and toxicity of amine-containing drugs, and in the susceptibility to cancer and systematic lupus erythematosus. Two highly similar human genes for NAT, termed NAT1 and NAT2, encode genetically invariant and variant NAT proteins, respectively. ; GO: 0016407 acetyltransferase activity, 0008152 metabolic process; PDB: 1W6F_A 1W5R_A 1GX3_D 2PQT_A 2IJA_A 1W4T_A 2BSZ_B 3D9W_B 3LTW_A 3LNB_A ....
Probab=23.06 E-value=59 Score=24.72 Aligned_cols=23 Identities=30% Similarity=0.410 Sum_probs=15.6
Q ss_pred CCHHHHHHHHHhhhcCCCCChHHH
Q psy3274 66 LNKAQLAETLLKHFKTIPVVEKDV 89 (115)
Q Consensus 66 ~sK~QLa~~V~kHF~s~pV~E~ev 89 (115)
.+.+|+.++++++| ++.++|.+|
T Consensus 218 ~~~ee~~~iL~~~F-gi~l~~~~~ 240 (240)
T PF00797_consen 218 TSEEELLEILKEYF-GIDLDEEEV 240 (240)
T ss_dssp SSHHHHHHHHHHTS----GSS---
T ss_pred CCHHHHHHHHHHhC-CcccCCCCC
Confidence 67899999999999 588887664
No 32
>cd01104 HTH_MlrA-CarA Helix-Turn-Helix DNA binding domain of the transcription regulators MlrA and CarA. Helix-turn-helix (HTH) transcription regulator MlrA (merR-like regulator A), N-terminal domain. The MlrA protein, also known as YehV, has been shown to control cell-cell aggregation by co-regulating the expression of curli and extracellular matrix production in Escherichia coli and Salmonella typhimurium. Its close homolog, CarA from Myxococcus xanthus, is involved in activation of the carotenoid biosynthesis genes by light. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Many MlrA- and CarA-like proteins in this group appear to lack the long dimerization helix seen i
Probab=22.31 E-value=94 Score=18.88 Aligned_cols=18 Identities=17% Similarity=0.468 Sum_probs=14.5
Q ss_pred cCCHHHHHHHHHhcCCCC
Q psy3274 45 QLQVNTLRRYKKYFKVTS 62 (115)
Q Consensus 45 kL~~~tLrRYkr~f~L~~ 62 (115)
.++..|||+|.+.+++..
T Consensus 11 gvs~~tlr~w~~~~g~~~ 28 (68)
T cd01104 11 GVSPDTLRAWERRYGLPA 28 (68)
T ss_pred CcCHHHHHHHHHhCCCCC
Confidence 478899999998887743
No 33
>KOG1946|consensus
Probab=21.33 E-value=95 Score=25.38 Aligned_cols=48 Identities=17% Similarity=0.135 Sum_probs=34.4
Q ss_pred HHHHHHHHhcCCCCCCC------------------CCHHHHHHHHHhhhcCCC-CChHHHHHHHHHH
Q psy3274 49 NTLRRYKKYFKVTSRPG------------------LNKAQLAETLLKHFKTIP-VVEKDVLSFFFYT 96 (115)
Q Consensus 49 ~tLrRYkr~f~L~~~p~------------------~sK~QLa~~V~kHF~s~p-V~E~evI~~Fiy~ 96 (115)
..||.|-+.++|+.|.+ +.-=++...+..||...+ ....+...+|++.
T Consensus 126 k~iw~YIke~nLqDP~nkr~IlCDekL~~iF~~k~v~~fem~KLL~~H~~~~~d~~~~~~~~~~~~e 192 (240)
T KOG1946|consen 126 KKIWAYIKEHNLQDPKNKREILCDEKLKSIFGKKRVGMFEMLKLLTKHFLKNQDMVGTDLKQGEETE 192 (240)
T ss_pred HHHHHHHHHhccCCccccCeeeeHHHHHHHhccCccceeeHHHHHHHhccCccccccchhhcCcccc
Confidence 46899999999988764 344578899999998643 3334555666654
No 34
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=21.20 E-value=2.5e+02 Score=18.40 Aligned_cols=32 Identities=22% Similarity=0.368 Sum_probs=22.2
Q ss_pred CCHHHHHHHHHhcCC-CC-CC-CCCHHHHHHHHHh
Q psy3274 46 LQVNTLRRYKKYFKV-TS-RP-GLNKAQLAETLLK 77 (115)
Q Consensus 46 L~~~tLrRYkr~f~L-~~-~p-~~sK~QLa~~V~k 77 (115)
++.+-+.+|+..|.+ +. .. .++.++|..+++.
T Consensus 4 ls~~~~~~l~~~F~~~D~d~~G~Is~~el~~~l~~ 38 (96)
T smart00027 4 ISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLK 38 (96)
T ss_pred CCHHHHHHHHHHHHHhCCCCCCeEeHHHHHHHHHH
Confidence 567778888888877 32 22 3777888777765
No 35
>PF04501 Baculo_VP39: Baculovirus major capsid protein VP39; InterPro: IPR007589 This family constitutes the 39 kDa major capsid protein of the Baculoviridae [].; GO: 0005198 structural molecule activity, 0019028 viral capsid
Probab=20.77 E-value=38 Score=28.77 Aligned_cols=60 Identities=17% Similarity=0.231 Sum_probs=29.0
Q ss_pred ceeecccchhhhHHhhhhccCCCCCCCCCCCC--C-----CCccCcccCCHHHHHHHHHhcCCCCCC
Q psy3274 5 HIYICEYHKCLIQSVRNKRRRKDSEDDSNEND--E-----SPDVDLFQLQVNTLRRYKKYFKVTSRP 64 (115)
Q Consensus 5 h~yicd~hk~~iqs~r~~r~~~~s~~~s~~~d--~-----~p~vDf~kL~~~tLrRYkr~f~L~~~p 64 (115)
-.|||+||-+.-=...++.=...|.++-...- + ....+-...=..|=..|..+++++..+
T Consensus 44 G~fIC~yHL~~~FKm~K~~l~Ipdg~~n~~~r~vg~SLv~~~~~~~~RILIPt~~NY~tvlnv~~ms 110 (300)
T PF04501_consen 44 GTFICNYHLSKYFKMEKMVLPIPDGDDNSFKRLVGKSLVQHTATGNKRILIPTKENYETVLNVNSMS 110 (300)
T ss_pred CeEEechhhhhheeeEEEeecCcCCCCceeeeEEeEeccCCCCCCCceEEeecchhHHHHhCccccC
Confidence 37999999875333322222233443311000 1 111122222233556799999986544
No 36
>COG2971 Predicted N-acetylglucosamine kinase [Carbohydrate transport and metabolism]
Probab=20.59 E-value=84 Score=26.65 Aligned_cols=34 Identities=18% Similarity=0.377 Sum_probs=26.8
Q ss_pred HHHHHHHHHhhhcCCCCChHHHHHHHHHHHhhCCcccc
Q psy3274 68 KAQLAETLLKHFKTIPVVEKDVLSFFFYTVKSNTNKLD 105 (115)
Q Consensus 68 K~QLa~~V~kHF~s~pV~E~evI~~Fiy~VK~~k~Kfd 105 (115)
+.-|+++|++||.. +. |-|..|+|+.+-.++++-
T Consensus 175 ~t~L~d~v~~~f~~---d~-edlv~~~y~a~~~~~~ia 208 (301)
T COG2971 175 ATPLTDAVMAEFNL---DP-EDLVAFIYKAGPGDKKIA 208 (301)
T ss_pred CChHHHHHHHHhCC---CH-HHHHHHHHhcCCchHHHH
Confidence 33499999999987 33 668899999988776654
No 37
>PF12794 MscS_TM: Mechanosensitive ion channel inner membrane domain 1
Probab=20.56 E-value=38 Score=28.20 Aligned_cols=16 Identities=44% Similarity=0.746 Sum_probs=13.9
Q ss_pred CCCccCcccCCHHHHH
Q psy3274 37 ESPDVDLFQLQVNTLR 52 (115)
Q Consensus 37 ~~p~vDf~kL~~~tLr 52 (115)
+.|++|+++.+.-++|
T Consensus 291 ee~~ldl~~I~~Qslr 306 (340)
T PF12794_consen 291 EEPELDLEQISQQSLR 306 (340)
T ss_pred CCcccCHHHHHHHHHH
Confidence 7899999999888876
No 38
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=20.15 E-value=28 Score=18.74 Aligned_cols=14 Identities=21% Similarity=0.432 Sum_probs=10.4
Q ss_pred cceeecccchhhhH
Q psy3274 4 RHIYICEYHKCLIQ 17 (115)
Q Consensus 4 ~h~yicd~hk~~iq 17 (115)
-+||+||..-..||
T Consensus 13 g~i~VaD~~n~rV~ 26 (28)
T PF01436_consen 13 GNIYVADSGNHRVQ 26 (28)
T ss_dssp SEEEEEECCCTEEE
T ss_pred CCEEEEECCCCEEE
Confidence 46999998766554
Done!