Query psy12912
Match_columns 185
No_of_seqs 105 out of 161
Neff 4.7
Searched_HMMs 46136
Date Fri Aug 16 22:56:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12912.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12912hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3284|consensus 100.0 6.8E-83 1.5E-87 530.5 14.6 183 3-185 8-196 (213)
2 PF03997 VPS28: VPS28 protein; 100.0 4.9E-77 1.1E-81 495.7 9.7 164 22-185 1-170 (188)
3 PF09454 Vps23_core: Vps23 cor 93.3 0.4 8.7E-06 33.9 6.3 48 10-57 6-53 (65)
4 PF09851 SHOCT: Short C-termin 89.9 0.48 1E-05 28.8 3.1 29 29-57 2-30 (31)
5 PF06464 DMAP_binding: DMAP1-b 61.2 18 0.00039 27.8 4.6 35 28-62 9-43 (111)
6 PF14090 HTH_39: Helix-turn-he 52.4 24 0.00051 24.6 3.6 41 59-100 2-45 (70)
7 cd00155 RasGEF Guanine nucleot 47.9 1.2E+02 0.0026 25.0 7.8 115 19-171 91-222 (237)
8 cd01145 TroA_c Periplasmic bin 47.2 28 0.0006 28.7 3.8 53 49-102 146-202 (203)
9 PF14056 DUF4250: Domain of un 45.2 21 0.00045 24.7 2.3 29 69-97 17-46 (55)
10 PF05120 GvpG: Gas vesicle pro 45.1 89 0.0019 23.0 5.8 48 14-65 23-70 (79)
11 COG3462 Predicted membrane pro 44.2 34 0.00073 27.1 3.6 26 30-55 89-114 (117)
12 cd00672 CysRS_core catalytic c 42.6 29 0.00063 29.3 3.3 60 36-102 73-132 (213)
13 PF08044 DUF1707: Domain of un 40.4 62 0.0014 21.8 4.1 27 31-57 11-37 (53)
14 COG5150 Class 2 transcription 39.0 62 0.0013 26.4 4.5 30 28-66 71-100 (148)
15 PF00157 Pou: Pou domain - N-t 34.4 48 0.001 24.3 3.0 64 74-145 8-72 (75)
16 PRK08719 ribonuclease H; Revie 34.0 47 0.001 26.4 3.1 43 18-62 48-91 (147)
17 PF08365 IGF2_C: Insulin-like 30.2 22 0.00048 24.8 0.6 16 88-103 11-26 (56)
18 PF11740 KfrA_N: Plasmid repli 29.0 68 0.0015 23.8 3.1 59 44-102 1-64 (120)
19 PF07240 Turandot: Stress-indu 28.8 38 0.00083 25.4 1.7 43 52-99 11-55 (85)
20 COG2257 Uncharacterized homolo 27.4 62 0.0013 24.7 2.6 18 16-33 69-86 (92)
21 PF02725 Paramyxo_NS_C: Non-st 27.4 89 0.0019 26.2 3.7 66 73-151 91-162 (165)
22 COG4496 Uncharacterized protei 27.2 1.2E+02 0.0025 23.5 4.0 41 16-56 5-47 (100)
23 PF05816 TelA: Toxic anion res 26.7 1.8E+02 0.0039 26.1 5.9 53 110-166 22-77 (333)
24 PF09756 DDRGK: DDRGK domain; 25.5 1E+02 0.0022 26.1 3.8 47 56-105 100-150 (188)
25 PLN02618 tryptophan synthase, 24.6 94 0.002 29.1 3.8 34 27-61 30-63 (410)
26 COG1725 Predicted transcriptio 24.6 3.3E+02 0.007 21.7 6.3 91 49-145 13-121 (125)
27 cd07977 TFIIE_beta_winged_heli 23.5 87 0.0019 22.5 2.7 36 130-173 18-54 (75)
28 PF11112 PyocinActivator: Pyoc 22.8 1.2E+02 0.0027 21.9 3.4 38 55-99 4-44 (76)
29 PRK04346 tryptophan synthase s 22.7 1.1E+02 0.0024 28.5 3.8 33 27-60 22-54 (397)
30 PF06757 Ins_allergen_rp: Inse 22.5 4.2E+02 0.009 21.5 6.9 85 16-101 2-90 (179)
31 PF12668 DUF3791: Protein of u 20.9 80 0.0017 21.5 2.0 24 75-98 8-32 (62)
32 PRK06548 ribonuclease H; Provi 20.8 83 0.0018 25.7 2.4 61 19-79 41-104 (161)
33 PF04651 Pox_A12: Poxvirus A12 20.7 1E+02 0.0023 26.3 3.0 27 44-70 9-35 (189)
34 cd01019 ZnuA Zinc binding prot 20.2 2.7E+02 0.0058 24.2 5.6 39 49-87 159-201 (286)
No 1
>KOG3284|consensus
Probab=100.00 E-value=6.8e-83 Score=530.45 Aligned_cols=183 Identities=63% Similarity=1.057 Sum_probs=180.4
Q ss_pred CccccccCCCchHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHh
Q psy12912 3 NLEEVKLFTNTKEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKY 82 (185)
Q Consensus 3 ~~~EV~L~~~~~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y 82 (185)
+++|||||+|++|||+|||||||||||+|+|+|||||+||+|++.|||++|.|||.|||+++++++|.+|++||.|+++|
T Consensus 8 l~~Evkl~~N~rerE~~enlseLyaIi~ale~LEKAyirD~is~sey~s~c~kLi~Q~k~~~~~~~~~~f~SiE~Fc~ky 87 (213)
T KOG3284|consen 8 LYEEVKLFNNAREREVYENLSELYAIIKALEQLEKAYIRDCISPSEYTSECSKLIVQYKVAFRSVQGTEFPSIEDFCKKY 87 (213)
T ss_pred HHHHHHHhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHhcccccCcHHHHHHHH
Confidence 57899999999999999999999999999999999999999999999999999999999999999988999999999999
Q ss_pred cCCChHHHhhhhhcCCCceec------CCCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCC
Q psy12912 83 RLDCPSALERIREDRPITIKD------DKGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSG 156 (185)
Q Consensus 83 ~l~cp~A~~Rl~~G~PaTie~------~~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~d 156 (185)
+|+||+|++||++|+|+|++| ++|+++|+|||+||||||+|||||||++|||||||+|+||+.|||+++.+|+|
T Consensus 88 rl~cp~Ai~Ri~~~~piT~e~~ia~s~dk~~~ak~IAe~v~nFIT~mDaLrLn~~A~Dql~PlL~dL~~smnrls~~p~d 167 (213)
T KOG3284|consen 88 RLDCPAAIERIREGRPITVEDRIAPSADKGNSAKCIAEIVQNFITVMDALRLNINAVDQLYPLLSDLSASMNRLSRLPPD 167 (213)
T ss_pred ccCChHHHHHHHcCCCCcccccccccCCcccHHHHHHHHHHHHHHHHHHHHHhhHHHHhhcchHHHHHHHHHhhccCCcc
Confidence 999999999999999999999 78999999999999999999999999999999999999999999999999999
Q ss_pred CcchhhHHHHHHHHhcCCCCccCCccccC
Q psy12912 157 FEGKEKVSGWLTTLESMSASDELNETQFN 185 (185)
Q Consensus 157 f~gr~kl~~Wl~kLn~M~AsdeL~eeq~~ 185 (185)
|+||.||++||++||+|.|||||+|.|+|
T Consensus 168 fe~r~Kv~~Wl~rls~M~asDeL~e~q~R 196 (213)
T KOG3284|consen 168 FEGRTKVKQWLIRLSKMSASDELTEQQVR 196 (213)
T ss_pred chhHHHHHHHHHHHhcccccccccHHHHH
Confidence 99999999999999999999999999987
No 2
>PF03997 VPS28: VPS28 protein; InterPro: IPR007143 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ].; PDB: 2J9W_B 2J9U_C 2G3K_A 2F66_E 2F6M_D 2J9V_A 2CAZ_E 2P22_B.
Probab=100.00 E-value=4.9e-77 Score=495.71 Aligned_cols=164 Identities=54% Similarity=0.908 Sum_probs=142.5
Q ss_pred HHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCce
Q psy12912 22 QADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITI 101 (185)
Q Consensus 22 lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTi 101 (185)
||||||||+|||+||||||||+||++|||++|+|||+|||++++.++++.|+||++|+++|+|+||+|++||++|+|+||
T Consensus 1 LAeLysII~tle~LEkayikD~It~~eYt~~c~kLl~Qyk~~~~~~~~~~~~~le~F~~~y~l~cp~A~~Rl~~G~P~Ti 80 (188)
T PF03997_consen 1 LAELYSIIKTLEHLEKAYIKDSITEKEYTTACNKLLNQYKTILKQLKDDEFPDLEEFMKKYNLDCPAALERLREGVPATI 80 (188)
T ss_dssp HHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHHHHHHHHHTSTTHHHHHHHHHHHHHTTS-HHHHHHHHHCTSS---
T ss_pred ChHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHcccccCCCHHHHHHHhcccCChHHHHHHcCCCCch
Confidence 79999999999999999999999999999999999999999999986547899999999999999999999999999999
Q ss_pred ec------CCCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCCcchhhHHHHHHHHhcCCC
Q psy12912 102 KD------DKGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGFEGKEKVSGWLTTLESMSA 175 (185)
Q Consensus 102 e~------~~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df~gr~kl~~Wl~kLn~M~A 175 (185)
+| +++++||+|||+||||||+|||||||++|||||||+|+||+.||||++.+|+||+||+||++|+++||+|+|
T Consensus 81 e~~~~~~~~~~~~ak~Vae~t~~FIT~mDaLKLn~~a~DqLhPlL~dL~~slnr~~~~~~dfe~r~kl~~Wl~~Ln~m~a 160 (188)
T PF03997_consen 81 EHRISSSSDKGNSAKLVAEATQNFITLMDALKLNYRAKDQLHPLLSDLMQSLNRVTDLPPDFEGRSKLVEWLIKLNGMKA 160 (188)
T ss_dssp -----------CHHHHHHHHHHHHHHHHHHHHTT--BHHHHHHHHHHHHHHHHHCTTS-TT-CCHHHHHHHHHHHHTS-T
T ss_pred hhhcccccCCchHHHHHHHHhChhhhhhHHHhccchhHhhHhhHHHHHHHHHhccCCCCCCCccHHHHHHHHHHHhCCCc
Confidence 99 467789999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CccCCccccC
Q psy12912 176 SDELNETQFN 185 (185)
Q Consensus 176 sdeL~eeq~~ 185 (185)
+|||||+|+|
T Consensus 161 sdeL~e~q~r 170 (188)
T PF03997_consen 161 SDELSEEQAR 170 (188)
T ss_dssp T-B--HHHHH
T ss_pred ccccCHHHHH
Confidence 9999999986
No 3
>PF09454 Vps23_core: Vps23 core domain; InterPro: IPR017916 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ]. This entry represents the Steadiness box domain.; PDB: 2CAZ_A 2F66_D 2F6M_A 2P22_A.
Probab=93.33 E-value=0.4 Score=33.92 Aligned_cols=48 Identities=15% Similarity=0.205 Sum_probs=42.2
Q ss_pred CCCchHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912 10 FTNTKEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLL 57 (185)
Q Consensus 10 ~~~~~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl 57 (185)
-.++..++.||-.|+-.+|==|+.+|.+|+-++.|+-+.|-.....|=
T Consensus 6 ~~~~l~~Ql~el~Aed~AieDtiy~L~~al~~g~I~~d~~lK~vR~La 53 (65)
T PF09454_consen 6 AEDPLSNQLYELVAEDHAIEDTIYYLDRALQRGSIDLDTFLKQVRSLA 53 (65)
T ss_dssp -SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHH
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHH
Confidence 467889999999999999999999999999999999999988876653
No 4
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=89.89 E-value=0.48 Score=28.76 Aligned_cols=29 Identities=24% Similarity=0.458 Sum_probs=25.7
Q ss_pred HHHHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912 29 INTLQNLEKAYIRDCVTPKEYTAACSKLL 57 (185)
Q Consensus 29 I~tle~LEkAyirD~It~~eYt~~c~kLl 57 (185)
.-.|+.|...|-++.||++||...=.+||
T Consensus 2 ~~~L~~L~~l~~~G~IseeEy~~~k~~ll 30 (31)
T PF09851_consen 2 EDRLEKLKELYDKGEISEEEYEQKKARLL 30 (31)
T ss_pred hHHHHHHHHHHHcCCCCHHHHHHHHHHHh
Confidence 45688999999999999999999888876
No 5
>PF06464 DMAP_binding: DMAP1-binding Domain; InterPro: IPR010506 This domain binds DMAP1, a transcriptional co-repressor.; GO: 0008134 transcription factor binding, 0005634 nucleus
Probab=61.20 E-value=18 Score=27.78 Aligned_cols=35 Identities=37% Similarity=0.417 Sum_probs=31.7
Q ss_pred HHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHH
Q psy12912 28 VINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKA 62 (185)
Q Consensus 28 II~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~ 62 (185)
+-..|..||.-|-.+.||.+.|...=.+||.+|-.
T Consensus 9 vq~~L~~L~~el~~GdiT~KGY~kkr~~LL~~yl~ 43 (111)
T PF06464_consen 9 VQNRLQELDLELEEGDITQKGYEKKRSKLLAPYLP 43 (111)
T ss_pred HHHHHHHHHHhhhcCcchHHHHHHHHHHHHHHHHh
Confidence 44578899999999999999999999999999974
No 6
>PF14090 HTH_39: Helix-turn-helix domain
Probab=52.41 E-value=24 Score=24.61 Aligned_cols=41 Identities=22% Similarity=0.297 Sum_probs=25.9
Q ss_pred HHHHHHHhh-cCCCCCCHHHHHHHhcCCChHHH-hhhh-hcCCCc
Q psy12912 59 QYKAAFKQV-QGEEYPTVEHFVKKYRLDCPSAL-ERIR-EDRPIT 100 (185)
Q Consensus 59 Qyk~~~~~v-~~~~~~~l~~F~~~y~l~cp~A~-~Rl~-~G~PaT 100 (185)
|.+.++..+ .+..+.++| +...|+.-||+|. ..|+ .|.|+-
T Consensus 2 Q~~rIL~~L~~~~~it~~e-a~~~~gi~~~~aRI~eLR~~G~~I~ 45 (70)
T PF14090_consen 2 QCKRILAALRRGGSITTLE-ARRELGIMRLAARISELRKKGYPIV 45 (70)
T ss_pred HHHHHHHHHHcCCCcCHHH-HHHHcCCCCHHHHHHHHHHcCCeee
Confidence 555566655 344444444 4588888888774 4664 599983
No 7
>cd00155 RasGEF Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
Probab=47.91 E-value=1.2e+02 Score=24.99 Aligned_cols=115 Identities=18% Similarity=0.259 Sum_probs=62.5
Q ss_pred hhhHHHHHHHHHHHHH-----HHHhhhccCCCh--HHHHHHHHHHHH---HHHHHHHhh-----cCCCCCCHHHHHHHhc
Q psy12912 19 FDNQADLYAVINTLQN-----LEKAYIRDCVTP--KEYTAACSKLLV---QYKAAFKQV-----QGEEYPTVEHFVKKYR 83 (185)
Q Consensus 19 ye~lAelysII~tle~-----LEkAyirD~It~--~eYt~~c~kLl~---Qyk~~~~~v-----~~~~~~~l~~F~~~y~ 83 (185)
.-|+.-+++|+.+|.+ |.+.+ +.++. .++.....++.. .|+.+.+.+ ...-+|-+--|.+..
T Consensus 91 l~Nfns~~aI~~~L~~~~I~RL~~tw--~~l~~~~~~~~~~l~~l~~~~~n~~~yr~~l~~~~~~~p~IP~lg~~l~dl- 167 (237)
T cd00155 91 LNNFNSLMAIVSALSSSPISRLKKTW--EVLSSKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVYLKDL- 167 (237)
T ss_pred hCCHHHHHHHHHHhCCccchhHHHHH--HHCCHHHHHHHHHHHHHhCcHhhHHHHHHHHHhcCCCCCCeeehhHHHHHH-
Confidence 5677778888888864 44432 23443 334444445554 555444333 223456666666653
Q ss_pred CCChHHHhhhhhcCCCceecCCCCc--hhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCCcchh
Q psy12912 84 LDCPSALERIREDRPITIKDDKGNT--SKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGFEGKE 161 (185)
Q Consensus 84 l~cp~A~~Rl~~G~PaTie~~~~~~--~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df~gr~ 161 (185)
.++..|.|.+++..--|- -+.+++.+..|..+ ...|-.|....
T Consensus 168 -------~~~~e~~~~~~~~~~iN~~K~~~i~~~i~~~~~~----------------------------Q~~~Y~~~~~~ 212 (237)
T cd00155 168 -------TFLHEGNPDFLEGNLVNFEKRRKIAEILREIRQL----------------------------QSNSYELNRDE 212 (237)
T ss_pred -------HHHHccCCccCcCCCccHHHHHHHHHHHHHHHHH----------------------------hhCCCCCCCcH
Confidence 457778887765222222 23355555554433 22345566666
Q ss_pred hHHHHHHHHh
Q psy12912 162 KVSGWLTTLE 171 (185)
Q Consensus 162 kl~~Wl~kLn 171 (185)
.+++|+.++.
T Consensus 213 ~iq~~l~~~~ 222 (237)
T cd00155 213 DILAFLWKLL 222 (237)
T ss_pred HHHHHHHhcc
Confidence 7777776665
No 8
>cd01145 TroA_c Periplasmic binding protein TroA_c. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains.
Probab=47.22 E-value=28 Score=28.66 Aligned_cols=53 Identities=19% Similarity=0.282 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCC----CCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912 49 YTAACSKLLVQYKAAFKQVQGEEY----PTVEHFVKKYRLDCPSALERIREDRPITIK 102 (185)
Q Consensus 49 Yt~~c~kLl~Qyk~~~~~v~~~~~----~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie 102 (185)
|......|-.+++..++.+++..| +.+.-|++.|++.. .++-.+..|+|.+..
T Consensus 146 ~~~~l~~l~~~~~~~l~~~~~~~~v~~H~af~Y~~~~yGl~~-~~~~~~~~~~~p~~~ 202 (203)
T cd01145 146 FLAKLNKLLREWERQFEGLKGIQVVAYHPSYQYLADWLGIEV-VASLEPLPELPPTSS 202 (203)
T ss_pred HHHHHHHHHHHHHHHhhccCCCeEEEecccHHHHHHHcCCce-eeeeccCCCCCCCCC
Confidence 444444444445544444433322 56999999999985 555566677776653
No 9
>PF14056 DUF4250: Domain of unknown function (DUF4250)
Probab=45.18 E-value=21 Score=24.70 Aligned_cols=29 Identities=17% Similarity=0.406 Sum_probs=24.9
Q ss_pred CCCCCCHHHHHHHhcCCChHHHhhhhh-cC
Q psy12912 69 GEEYPTVEHFVKKYRLDCPSALERIRE-DR 97 (185)
Q Consensus 69 ~~~~~~l~~F~~~y~l~cp~A~~Rl~~-G~ 97 (185)
.++++|+++||..|+++...=.+||.. |.
T Consensus 17 RD~~~sLd~Lc~~~~id~~~l~~kL~~~Gy 46 (55)
T PF14056_consen 17 RDEYSSLDELCYDYDIDKEELEEKLASIGY 46 (55)
T ss_pred HhccCCHHHHHHHhCCCHHHHHHHHHHcCC
Confidence 477899999999999999999999953 54
No 10
>PF05120 GvpG: Gas vesicle protein G ; InterPro: IPR007804 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in water to access oxygen and/or light. Proteins containing this family are involved in the formation of gas vesicles [].
Probab=45.12 E-value=89 Score=23.01 Aligned_cols=48 Identities=25% Similarity=0.337 Sum_probs=39.9
Q ss_pred hHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHH
Q psy12912 14 KEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFK 65 (185)
Q Consensus 14 ~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~ 65 (185)
.||+.||- =.|=.-|..|+.+|-.+-||++||...=..||.++..+.+
T Consensus 23 Ae~E~~Dp----~~i~~~L~~L~~~~e~GEIseeEf~~~E~eLL~rL~~~~~ 70 (79)
T PF05120_consen 23 AERELYDP----AAIRRELAELQEALEAGEISEEEFERREDELLDRLEEARR 70 (79)
T ss_pred HHHHHcCH----HHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 46666643 2466778899999999999999999999999999987754
No 11
>COG3462 Predicted membrane protein [Function unknown]
Probab=44.19 E-value=34 Score=27.13 Aligned_cols=26 Identities=19% Similarity=0.316 Sum_probs=21.8
Q ss_pred HHHHHHHHhhhccCCChHHHHHHHHH
Q psy12912 30 NTLQNLEKAYIRDCVTPKEYTAACSK 55 (185)
Q Consensus 30 ~tle~LEkAyirD~It~~eYt~~c~k 55 (185)
.|.|-|-..|.|+-||++||+.--..
T Consensus 89 RA~eIlkER~AkGEItEEEY~r~~~~ 114 (117)
T COG3462 89 RAEEILKERYAKGEITEEEYRRIIRT 114 (117)
T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence 46788888999999999999986554
No 12
>cd00672 CysRS_core catalytic core domain of cysteinyl tRNA synthetase. Cysteinyl tRNA synthetase (CysRS) catalytic core domain. This class I enzyme is a monomer which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
Probab=42.60 E-value=29 Score=29.31 Aligned_cols=60 Identities=12% Similarity=0.168 Sum_probs=38.4
Q ss_pred HHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912 36 EKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITIK 102 (185)
Q Consensus 36 EkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie 102 (185)
+++=.+..+++.||+... ..+|+..++.+. =.+| +.|...|++.|.+...+. -|-|..|-
T Consensus 73 ~~~A~~~g~~p~e~~~~~---~~~f~~~~~~l~-i~~~--d~~~rtWh~ec~am~~~~-lg~~~dih 132 (213)
T cd00672 73 IKRAREEGLSWKEVADYY---TKEFFEDMKALN-VLPP--DVVPRVWHIECSAMAMKY-LGETFDIH 132 (213)
T ss_pred HHHHHHcCCCHHHHHHHH---HHHHHHHHHHcC-CCCC--CcceeehhHHHHHHHHHH-cCCCccEE
Confidence 333344568888776655 566777777662 2222 557777999999766653 46677765
No 13
>PF08044 DUF1707: Domain of unknown function (DUF1707); InterPro: IPR012551 This domain is found in a variety of actinomycetales proteins. All of the proteins containing this domain are hypothetical and probably membrane bound or associated. Currently, it is unclear to the function of this domain.
Probab=40.36 E-value=62 Score=21.82 Aligned_cols=27 Identities=11% Similarity=0.203 Sum_probs=24.2
Q ss_pred HHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912 31 TLQNLEKAYIRDCVTPKEYTAACSKLL 57 (185)
Q Consensus 31 tle~LEkAyirD~It~~eYt~~c~kLl 57 (185)
+++.|-.||-.+-++..||..-|.+..
T Consensus 11 ~~~~L~~a~a~GrL~~~Ef~~R~~~a~ 37 (53)
T PF08044_consen 11 AVDLLRAAFAEGRLSLDEFDERLDAAY 37 (53)
T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHH
Confidence 688999999999999999999887654
No 14
>COG5150 Class 2 transcription repressor NC2, beta subunit (Dr1) [Transcription]
Probab=39.01 E-value=62 Score=26.38 Aligned_cols=30 Identities=43% Similarity=0.598 Sum_probs=24.2
Q ss_pred HHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHh
Q psy12912 28 VINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQ 66 (185)
Q Consensus 28 II~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~ 66 (185)
||+||+-|| -++|.+.|.+....||...++
T Consensus 71 viKALenLe---------f~eyi~~~~e~~~n~k~~qK~ 100 (148)
T COG5150 71 VIKALENLE---------FEEYIESCMEEHENYKSYQKQ 100 (148)
T ss_pred HHHHHHhcc---------HHHHHHHHHHHHHHHHHHHhh
Confidence 566666665 578999999999999998765
No 15
>PF00157 Pou: Pou domain - N-terminal to homeobox domain; InterPro: IPR000327 POU proteins are eukaryotic transcription factors containing a bipartite DNA binding domain referred to as the POU domain. The acronym POU (pronounced 'pow') is derived from the names of three mammalian transcription factors, the pituitary-specific Pit-1, the octamer-binding proteins Oct-1 and Oct-2, and the neural Unc-86 from Caenorhabditis elegans. POU domain genes have been identified in diverse organisms including nematodes, flies, amphibians, fish and mammals but have not been yet identified in plants and fungi. The various members of the POU family have a wide variety of functions, all of which are related to the function of the neuroendocrine system [] and the development of an organism []. Some other genes are also regulated, including those for immunoglobulin light and heavy chains (Oct-2) [, ], and trophic hormone genes, such as those for prolactin and growth hormone (Pit-1). The POU domain is a bipartite domain composed of two subunits separated by a non-conserved region of 15-55 aa. The N-terminal subunit is known as the POU-specific (POUs) domain (IPR000327 from INTERPRO), while the C-terminal subunit is a homeobox domain (IPR007103 from INTERPRO). 3D structures of complexes including both POU subdomains bound to DNA are available. Both subdomains contain the structural motif 'helix-turn-helix', which directly associates with the two components of bipartite DNA binding sites, and both are required for high affinity sequence-specific DNA-binding. The domain may also be involved in protein-protein interactions []. The subdomains are connected by a flexible linker [, , ]. In proteins a POU-specific domain is always accompanied by a homeodomain. Despite of the lack of sequence homology, 3D structure of POUs is similar to 3D structure of bacteriophage lambda repressor and other members of HTH_3 family [, ]. This entry represents the POU-specific subunit of the POU domain.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 3D1N_O 1AU7_A 3L1P_A 2XSD_C 1O4X_A 1HF0_B 1GT0_C 1POU_A 1CQT_B 1E3O_C ....
Probab=34.41 E-value=48 Score=24.29 Aligned_cols=64 Identities=20% Similarity=0.319 Sum_probs=43.0
Q ss_pred CHHHHHHHhcCCChHHHhhhhhcCCCceecCCCCchhH-HHhhhhhhHHHHhhhccccchhhhhcchHHHHHH
Q psy12912 74 TVEHFVKKYRLDCPSALERIREDRPITIKDDKGNTSKC-IADIVSLFITTMDKLRLDIKAMDELQPDLRDLMD 145 (185)
Q Consensus 74 ~l~~F~~~y~l~cp~A~~Rl~~G~PaTie~~~~~~~~~-Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~ 145 (185)
.+++|.+.|+ ..||+-|.--+-- +...++. -.+.+|..|.-+-+|-|..+.+=.|.|+|..-+.
T Consensus 8 ele~Fa~~fk------~rRi~LG~TQ~dV--g~al~~~~G~~~SQttI~RFE~L~LS~kn~~klkP~L~kWL~ 72 (75)
T PF00157_consen 8 ELEQFAKEFK------QRRIKLGYTQADV--GAALGRLYGKEFSQTTICRFEALQLSFKNMCKLKPLLEKWLE 72 (75)
T ss_dssp HHHHHHHHHH------HHHHHTT--HHHH--HHHHHHHHSSGGSHHHHHHHHTTTSCHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHH------HhhhhcccCHHHH--hHHHHHhcCccccchhhhhhHhcccCHHHHHHHHHHHHHHHH
Confidence 4889999985 6788887651110 0001111 1235788899999999999999999999976543
No 16
>PRK08719 ribonuclease H; Reviewed
Probab=33.98 E-value=47 Score=26.41 Aligned_cols=43 Identities=16% Similarity=0.239 Sum_probs=34.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhccCCChHHHH-HHHHHHHHHHHH
Q psy12912 18 KFDNQADLYAVINTLQNLEKAYIRDCVTPKEYT-AACSKLLVQYKA 62 (185)
Q Consensus 18 ~ye~lAelysII~tle~LEkAyirD~It~~eYt-~~c~kLl~Qyk~ 62 (185)
...+-|||-+|+.+|+.+.+.- -..|+++|. ..+++-+.+++.
T Consensus 48 ~Tnn~aEl~A~~~aL~~~~~~~--~i~tDS~yvi~~i~~~~~~W~~ 91 (147)
T PRK08719 48 TDNAELELLALIEALEYARDGD--VIYSDSDYCVRGFNEWLDTWKQ 91 (147)
T ss_pred ccHHHHHHHHHHHHHHHcCCCC--EEEechHHHHHHHHHHHHHHHh
Confidence 4678899999999999987763 477889998 566777888864
No 17
>PF08365 IGF2_C: Insulin-like growth factor II E-peptide; InterPro: IPR013576 The insulin family of proteins groups together several evolutionarily related active peptides []: these include insulin [, ], relaxin [, ], insect prothoracicotropic hormone (bombyxin) [], insulin-like growth factors (IGF1 and IGF2) [, ], mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP) (gene INSL4), locust insulin-related peptide (LIRP), molluscan insulin-related peptides (MIP), and Caenorhabditis elegans insulin-like peptides. The 3D structures of a number of family members have been determined [, , ]. The fold comprises two polypeptide chains (A and B) linked by two disulphide bonds: all share a conserved arrangement of 4 cysteines in their A chain, the first of which is linked by a disulphide bond to the third, while the second and fourth are linked by interchain disulphide bonds to cysteines in the B chain. Insulin is found in many animals, and is involved in the regulation of normal glucose homeostasis. It also has other specific physiological effects, such as increasing the permeability of cells to monosaccharides, amino acids and fatty acids, and accelerating glycolysis and glycogen synthesis in the liver []. Insulin exerts its effects by interaction with a cell-surface receptor, which may also result in the promotion of cell growth []. Insulin is synthesised as a prepropeptide from which an endoplasmic reticulum-targeting sequence is cleaved to yield proinsulin. The sequence of prosinsulin contains 2 well-conserved regions (designated A and B), separated by an intervening connecting region (C), which is variable between species []. The connecting region is cleaved, liberating the active protein, which contains the A and B chains, held together by 2 disulphide bonds []. Insulin-like Growth Factor Binding Proteins (IGFBP) are a group of vertebrate secreted proteins, which bind to IGF-I and IGF-II with high affinity and modulate the biological actions of IGFs. The IGFBP family has six distinct subgroups, IGFBP-1 through 6, based on conservation of gene (intron-exon) organisation, structural similarity, and binding affinity for IGFs. Across species, IGFBP-5 exhibits the most sequence conservation, while IGFBP-6 exhibits the least sequence conservation. The IGFBPs contain inhibitor domain homologues, which are related to MEROPS protease inhibitor family I31 (equistatin, clan IX). All IGFBPs share a common domain architecture (IPR000867 from INTERPRO:IPR000716 from INTERPRO). While the N-terminal (IPR000867 from INTERPRO, IGF binding protein domain), and the C-terminal (IPR000716 from INTERPRO, thyroglobulin type-1 repeat) domains are conserved across vertebrate species, the mid-region is highly variable with respect to protease cleavage sites and phosphorylation and glycosylation sites. IGFBPs contain 16-18 conserved cysteines located in the N-terminal and the C-terminal regions, which form 8-9 disulphide bonds []. As demonstrated for human IGFBP-5, the N terminus is the primary binding site for IGF. This region, comprised of Val49, Tyr50, Pro62 and Lys68-Leu75, forms a hydrophobic patch on the surface of the protein []. The C terminus is also required for high affinity IGF binding, as well as for binding to the extracellular matrix [] and for nuclear translocation [, ] of IGFBP-3 and -5. IGFBPs are unusually pleiotropic molecules. Like other binding proteins, IGFBP can prolong the half-life of IGFs via high affinity binding of the ligands. In addition to functioning as simple carrier proteins, serum IGFBPs also serve to regulate the endocrine and paracrine/autocrine actions of IGF by modulating the IGF available to bind to signalling IGF-I receptors [, ]. Furthermore, IGFBPs can function as growth modulators independent of IGFs. For example, IGFBP-5 stimulates markers of bone formation in osteoblasts lacking functional IGFs []. The binding of IGFBP to its putative receptor on the cell membrane may stimulate the signalling pathway independent of an IGF receptor, to mediate the effects of IGFBPs in certain target cell types. IGFBP-1 and -2, but not other IGFBPs, contain a C-terminal Arg-Gly-Asp integrin-binding motif. Thus, IGFBP-1 can also stimulate cell migration of CHO and human trophoblast cells through an action mediated by alpha 5 beta 1 integrin []. Finally, IGFBPs transported into the nucleus (via the nuclear localisation signal) may also exert IGF-independent effects by transcriptional activation of genes. This domain is the C-terminal domain of insulin-like growth factor II proteins (IGF-2, also see IPR004825 from INTERPRO) in vertebrates and seems to represent the E-peptide [, ].
Probab=30.23 E-value=22 Score=24.83 Aligned_cols=16 Identities=19% Similarity=0.493 Sum_probs=13.4
Q ss_pred HHHhhhhhcCCCceec
Q psy12912 88 SALERIREDRPITIKD 103 (185)
Q Consensus 88 ~A~~Rl~~G~PaTie~ 103 (185)
.|.+||+.|+||-...
T Consensus 11 ksaqRLRRG~PaiLRa 26 (56)
T PF08365_consen 11 KSAQRLRRGLPAILRA 26 (56)
T ss_pred HHHHHHHccchHHHHh
Confidence 5889999999987753
No 18
>PF11740 KfrA_N: Plasmid replication region DNA-binding N-term; InterPro: IPR021104 The KfrA family of protiens are encoded on plasmids, generally in or near gene clusters invloved in stable inheritance functions. These proteins are thought to form an all-helical structure, consisting of an N-terminal helix-turn-helix DNA binding domain and an extended coiled-coil tail. The best-characterised KfrA protein, encoded on the broad host-range Plasmid RK2, is a site-specific DNA-binding protein whose operator overlaps its own promoter. The DNA-binding domain is essential for function, while the coiled-coil domain is probably responsible for formation of multimers, and may provide an example of a bridge to host structures required for plasmid partitioning []. This entry represents the N-terminal DNA-binding domain.
Probab=29.00 E-value=68 Score=23.81 Aligned_cols=59 Identities=20% Similarity=0.327 Sum_probs=39.4
Q ss_pred CChHHHHHHHHHHHHHHH-----HHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912 44 VTPKEYTAACSKLLVQYK-----AAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITIK 102 (185)
Q Consensus 44 It~~eYt~~c~kLl~Qyk-----~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie 102 (185)
||.++...+|.+|+.+=+ .+...+-+...++|..+++.|+-.-+....--..++|..+.
T Consensus 1 IT~e~V~~Aa~~L~~~G~~pT~~~Vr~~lG~GS~~ti~~~l~~w~~~~~~~~~~~~~~lP~~l~ 64 (120)
T PF11740_consen 1 ITYEDVIEAADELLAAGKKPTVRAVRERLGGGSMSTISKHLKEWREEREAQVSEAAPDLPEALQ 64 (120)
T ss_pred CcHHHHHHHHHHHHHcCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHhhhccccccccCCChhHH
Confidence 788888899999997643 22233433567889999999987766665222346665553
No 19
>PF07240 Turandot: Stress-inducible humoral factor Turandot; InterPro: IPR010825 This family consists of several Drosophila species specific Turandot proteins. The Turandot A (TotA) gene encodes a humoral factor, which is secreted from the fat body and accumulates in the body fluids. TotA is strongly induced upon bacterial challenge, as well as by other types of stress such as high temperature, mechanical pressure, dehydration, UV irradiation, and oxidative agents. It is also upregulated during metamorphosis and at high age. Flies that overexpress TotA show prolonged survival and retain normal activity at otherwise lethal temperatures. Although TotA is only induced by severe stress, it responds to a much wider range of stimuli than heat shock genes such as hsp70 or immune genes such as Cecropin A1 [].
Probab=28.80 E-value=38 Score=25.41 Aligned_cols=43 Identities=12% Similarity=0.390 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHHhhc--CCCCCCHHHHHHHhcCCChHHHhhhhhcCCC
Q psy12912 52 ACSKLLVQYKAAFKQVQ--GEEYPTVEHFVKKYRLDCPSALERIREDRPI 99 (185)
Q Consensus 52 ~c~kLl~Qyk~~~~~v~--~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~Pa 99 (185)
..-.||.=|+-+-..++ .+.-.+++.|+++|. +-.++--|+|+
T Consensus 11 ni~eLi~fY~ky~~~~~L~~~~r~~~d~~i~~y~-----~~~~lVDGvPa 55 (85)
T PF07240_consen 11 NIQELIAFYEKYSPRLPLTPQDRQRIDRFIRRYK-----EENNLVDGVPA 55 (85)
T ss_pred hHHHHHHHHHHcCccCCCCHHHHHHHHHHHHHHH-----HHhhcccCcCC
Confidence 45578888887654431 111236999999997 55557789996
No 20
>COG2257 Uncharacterized homolog of the cytoplasmic domain of flagellar protein FhlB [Function unknown]
Probab=27.42 E-value=62 Score=24.73 Aligned_cols=18 Identities=22% Similarity=0.495 Sum_probs=15.7
Q ss_pred HHhhhhHHHHHHHHHHHH
Q psy12912 16 REKFDNQADLYAVINTLQ 33 (185)
Q Consensus 16 Re~ye~lAelysII~tle 33 (185)
-+.|+..||+|+.|..+|
T Consensus 69 eelY~vVAEifafi~~~~ 86 (92)
T COG2257 69 EELYEVVAEIFAFIYEVD 86 (92)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 378999999999998776
No 21
>PF02725 Paramyxo_NS_C: Non-structural protein C; InterPro: IPR003875 This family consists of the polymerase accessory protein C from members of the paramyxoviridae.
Probab=27.40 E-value=89 Score=26.21 Aligned_cols=66 Identities=21% Similarity=0.115 Sum_probs=39.1
Q ss_pred CCHHHHHHHhcCCChHHHhhhhhcCCCceecC------CCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHH
Q psy12912 73 PTVEHFVKKYRLDCPSALERIREDRPITIKDD------KGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDT 146 (185)
Q Consensus 73 ~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie~~------~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~s 146 (185)
-|+-.||.+ |+.||++|.|.|..=- -.++. |=..|.-|+|=-|..=+ =.+++|+..||+-+
T Consensus 91 YsviMFmiT-------aVkRLrEskmLT~sWf~Qal~~i~~S~----eE~~~L~~Am~~la~lI--P~e~l~ltGDllp~ 157 (165)
T PF02725_consen 91 YSVIMFMIT-------AVKRLRESKMLTLSWFNQALMVICQSQ----EEKENLHTAMWILANLI--PREMLYLTGDLLPA 157 (165)
T ss_pred hhHHHHHHH-------HHHHHhccccchHHHHHHHHHHHcccH----HHHHHHHHHHHHHHHHh--hHHHHHhhhhHHHH
Confidence 468899997 9999999999997410 00110 00111222222221111 24688999999999
Q ss_pred hhhcC
Q psy12912 147 MNRLS 151 (185)
Q Consensus 147 ln~~~ 151 (185)
|+...
T Consensus 158 l~~~~ 162 (165)
T PF02725_consen 158 LMSTE 162 (165)
T ss_pred Hhhcc
Confidence 98765
No 22
>COG4496 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.16 E-value=1.2e+02 Score=23.45 Aligned_cols=41 Identities=27% Similarity=0.397 Sum_probs=31.5
Q ss_pred HHhhhhHHHHHHHHHHHHHHHHh--hhccCCChHHHHHHHHHH
Q psy12912 16 REKFDNQADLYAVINTLQNLEKA--YIRDCVTPKEYTAACSKL 56 (185)
Q Consensus 16 Re~ye~lAelysII~tle~LEkA--yirD~It~~eYt~~c~kL 56 (185)
|-+-..|.+||--|.+|+-+|.. |.+|..|.+|-.+...||
T Consensus 5 klr~~~Ld~l~dailtL~n~eecy~FfdDlcTinEiqslaqRl 47 (100)
T COG4496 5 KLRGAALDELFDAILTLENLEECYAFFDDLCTINEIQSLAQRL 47 (100)
T ss_pred chhhHHHHHHHHHHHHhccHHHHHHHHHhhcCHHHHHHHHHHH
Confidence 33456788999999999999975 678888888866655443
No 23
>PF05816 TelA: Toxic anion resistance protein (TelA); InterPro: IPR008863 This family consists of several prokaryotic TelA like proteins. TelA and KlA are associated with tellurite resistance [] and plasmid fertility inhibition [].
Probab=26.73 E-value=1.8e+02 Score=26.10 Aligned_cols=53 Identities=25% Similarity=0.345 Sum_probs=33.6
Q ss_pred hHHHhhhhhhH-HHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCC--cchhhHHHH
Q psy12912 110 KCIADIVSLFI-TTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGF--EGKEKVSGW 166 (185)
Q Consensus 110 ~~Iae~t~~FI-T~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df--~gr~kl~~W 166 (185)
+...+-...|- .+++.++-. ..+++--.|.+|+.-++.+. |.+| .++.++-+|
T Consensus 22 ~~~~~~~a~~s~~iL~~v~~~--d~~~vg~~L~~L~~~~~~~d--p~~~~~~~~~~~l~k 77 (333)
T PF05816_consen 22 AEAQEKIAQFSDRILDRVRNK--DSGEVGELLNELRKEMDELD--PSELKDEKKKGFLGK 77 (333)
T ss_pred HHHHHHHHHHHHHHHHHHHHh--ccchHhHHHHHHHHHHHhCC--hhhhhhhhhhhHHHH
Confidence 33444444554 556664433 77788889999999999997 5565 344344444
No 24
>PF09756 DDRGK: DDRGK domain; InterPro: IPR019153 This is a family of proteins of approximately 300 residues. They contain a highly conserved DDRGK motif. The function is unknown. ; PDB: 1WI9_A.
Probab=25.49 E-value=1e+02 Score=26.12 Aligned_cols=47 Identities=23% Similarity=0.421 Sum_probs=32.5
Q ss_pred HHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhh----hcCCCceecCC
Q psy12912 56 LLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIR----EDRPITIKDDK 105 (185)
Q Consensus 56 Ll~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~----~G~PaTie~~~ 105 (185)
||..|-.+++.- .+..|+.....|+|.-+-+++||+ .|.=..|-++.
T Consensus 100 lL~~Fi~yIK~~---Kvv~ledla~~f~l~t~~~i~ri~~L~~~g~ltGv~Ddr 150 (188)
T PF09756_consen 100 LLQEFINYIKEH---KVVNLEDLAAEFGLRTQDVINRIQELEAEGRLTGVIDDR 150 (188)
T ss_dssp HHHHHHHHHHH----SEE-HHHHHHHH-S-HHHHHHHHHHHHHHSSS-EEE-TT
T ss_pred HHHHHHHHHHHc---ceeeHHHHHHHcCCCHHHHHHHHHHHHHCCCceeeEcCC
Confidence 899998777653 367899999999999999999995 36555555543
No 25
>PLN02618 tryptophan synthase, beta chain
Probab=24.60 E-value=94 Score=29.13 Aligned_cols=34 Identities=24% Similarity=0.408 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHH
Q psy12912 27 AVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYK 61 (185)
Q Consensus 27 sII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk 61 (185)
.++.+|+.||++|.+ ...+.+|..+-..++.+|-
T Consensus 30 ~~~~~~~~~~~~~~~-~~~~~~f~~~~~~~l~~~v 63 (410)
T PLN02618 30 TLMTALSELEAAFNA-LATDPEFQEELAGILKDYV 63 (410)
T ss_pred HHHHHHHHHHHHHHH-HhcChhhHHHHHHHHHHhc
Confidence 468899999999976 7788889999999999884
No 26
>COG1725 Predicted transcriptional regulators [Transcription]
Probab=24.57 E-value=3.3e+02 Score=21.66 Aligned_cols=91 Identities=18% Similarity=0.343 Sum_probs=56.3
Q ss_pred HHHHHHHHHHHHHHHHH-hh-cCCCCCCHHHHHHHhcCC---ChHHHhhh-hhcCCCceecCC------CC------chh
Q psy12912 49 YTAACSKLLVQYKAAFK-QV-QGEEYPTVEHFVKKYRLD---CPSALERI-REDRPITIKDDK------GN------TSK 110 (185)
Q Consensus 49 Yt~~c~kLl~Qyk~~~~-~v-~~~~~~~l~~F~~~y~l~---cp~A~~Rl-~~G~PaTie~~~------~~------~~~ 110 (185)
|+.-+++...+- .+ .+ .|+..||+-++....+++ ..+|.+=| +.|+=-|....+ +. ...
T Consensus 13 Y~QI~~qIk~~I---~~g~l~pGdkLPSvRelA~~~~VNpnTv~raY~eLE~eG~i~t~rg~G~fV~~~~~~~~~~~~~~ 89 (125)
T COG1725 13 YEQIANQIKEQI---ASGELKPGDKLPSVRELAKDLGVNPNTVQRAYQELEREGIVETKRGKGTFVTEDAKEILDQLKRE 89 (125)
T ss_pred HHHHHHHHHHHH---HhCCcCCCCCCCcHHHHHHHhCCCHHHHHHHHHHHHHCCCEEEecCeeEEEcCCchhhHHHHHHH
Confidence 666554444332 12 23 478899999999999986 34666666 357765544211 11 234
Q ss_pred HHHhhhhhhHHHHhhhccccchhhhhcchHHHHHH
Q psy12912 111 CIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMD 145 (185)
Q Consensus 111 ~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~ 145 (185)
.+.+.+.+||.-|=++-+ +.++++.++.+...
T Consensus 90 ~~~~~l~~~I~~~~~~G~---s~eei~~~~~~~~~ 121 (125)
T COG1725 90 LAEEELEEFIEEAKALGL---SLEEILELLKEIYE 121 (125)
T ss_pred HHHHHHHHHHHHHHHcCC---CHHHHHHHHHHHHh
Confidence 577778888865444443 57888888877654
No 27
>cd07977 TFIIE_beta_winged_helix TFIIE_beta_winged_helix domain, located at the central core region of TFIIE beta, with double-stranded DNA binding activity. Transcription Factor IIE (TFIIE) beta winged-helix (or forkhead) domain is located at the central core region of TFIIE beta. The winged-helix is a form of helix-turn-helix (HTH) domain which typically binds DNA with the 3rd helix. The winged-helix domain is distinguished by the presence of a C-terminal beta-strand hairpin unit (the wing) that packs against the cleft of the tri-helical core. Although most winged-helix domains are multi-member families, TFIIE beta winged-helix domain is typically found as a single orthologous group. TFIIE is one of the six eukaryotic general transcription factors (TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH) that are required for transcription initiation of protein-coding genes. TFIIE is a heterotetramer consisting of two copies each of alpha and beta subunits. TFIIE beta contains several functional
Probab=23.54 E-value=87 Score=22.49 Aligned_cols=36 Identities=22% Similarity=0.427 Sum_probs=23.9
Q ss_pred cchhhhhcch-HHHHHHHhhhcCCCCCCCcchhhHHHHHHHHhcC
Q psy12912 130 IKAMDELQPD-LRDLMDTMNRLSILPSGFEGKEKVSGWLTTLESM 173 (185)
Q Consensus 130 ~~a~DqLhPl-L~dL~~sln~~~~~p~df~gr~kl~~Wl~kLn~M 173 (185)
|+.++| ||+ +.|++.-++-+. ...++++||......
T Consensus 18 mK~r~~-~Plt~~EIl~~ls~~d-------~~~~~~~~L~~~~~~ 54 (75)
T cd07977 18 MKKRHQ-HPLTLDEILDYLSLLD-------IGPKLKEWLKSEALV 54 (75)
T ss_pred HHhcCC-CCccHHHHHHHHhccC-------ccHHHHHHHHhhhhc
Confidence 457777 886 566766666533 237788999876654
No 28
>PF11112 PyocinActivator: Pyocin activator protein PrtN
Probab=22.82 E-value=1.2e+02 Score=21.95 Aligned_cols=38 Identities=24% Similarity=0.406 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHhhcCCCCCCHHHHHHHh--cCCChHHHhhhhhc-CCC
Q psy12912 55 KLLVQYKAAFKQVQGEEYPTVEHFVKKY--RLDCPSALERIRED-RPI 99 (185)
Q Consensus 55 kLl~Qyk~~~~~v~~~~~~~l~~F~~~y--~l~cp~A~~Rl~~G-~Pa 99 (185)
-|+.||- |+....+++.+++| +|.-..|..+++.| +|.
T Consensus 4 lLma~~~-------~~~~IpL~~v~~~yf~~lt~~~a~rk~~~g~lpl 44 (76)
T PF11112_consen 4 LLMAQYF-------GDPVIPLEEVCEDYFPHLTPKTAKRKANAGELPL 44 (76)
T ss_pred HHHHHHc-------CCCCCcHHHHHHHHHccCCHHHHHHHHHCCCCCC
Confidence 4566662 45567799999999 99777888888764 444
No 29
>PRK04346 tryptophan synthase subunit beta; Validated
Probab=22.72 E-value=1.1e+02 Score=28.52 Aligned_cols=33 Identities=30% Similarity=0.516 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHH
Q psy12912 27 AVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQY 60 (185)
Q Consensus 27 sII~tle~LEkAyirD~It~~eYt~~c~kLl~Qy 60 (185)
.++.+|+.||++|.+ ...+.+|..+-..|+..|
T Consensus 22 ~~~~~~~~~~~~~~~-~~~~~~f~~~~~~~~~~~ 54 (397)
T PRK04346 22 TLMPALEELEEAYEK-AKNDPEFQAELDYLLKNY 54 (397)
T ss_pred HHHHHHHHHHHHHHH-HhcCHHHHHHHHHHHHHh
Confidence 467899999999977 777888999999999998
No 30
>PF06757 Ins_allergen_rp: Insect allergen related repeat, nitrile-specifier detoxification; InterPro: IPR010629 This entry represents several insect specific allergen repeats. These repeats are commonly found in various proteins from cockroaches, fruit flies and mosquitos. It has been suggested that the repeat sequences have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins []. This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the Pieris rapae (Cabbage white butterfly). NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain [].
Probab=22.48 E-value=4.2e+02 Score=21.45 Aligned_cols=85 Identities=18% Similarity=0.212 Sum_probs=56.5
Q ss_pred HHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHH-HHHHHHhhc-CCCCCCHHHHHHHhcCCChHHHhhh
Q psy12912 16 REKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQ-YKAAFKQVQ-GEEYPTVEHFVKKYRLDCPSALERI 93 (185)
Q Consensus 16 Re~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Q-yk~~~~~v~-~~~~~~l~~F~~~y~l~cp~A~~Rl 93 (185)
|...+.+.++..+|..-+ +..-..+=...+.|+-.+..=|-+. |+.++..+. -.++..+-.|.+..+++....++++
T Consensus 2 ~~L~~d~~dfl~lIp~~~-i~~i~~~Y~~~D~efq~~~~yl~s~~f~~l~~~l~~~pE~~~l~~yL~~~gldv~~~i~~i 80 (179)
T PF06757_consen 2 RSLQEDFQDFLDLIPMEE-IQDIVQRYYLEDAEFQAAVRYLNSSEFKQLWQQLEALPEVKALLDYLESAGLDVYYYINQI 80 (179)
T ss_pred hhHHHHHHHHHHhcCHHH-HHHHHHHHHHcCHHHHHHHHHHcChHHHHHHHHHHcCHHHHHHHHHHHHCCCCHHHHHHHH
Confidence 456778888888887554 4433333335566665555444332 555555442 2356679999999999999999999
Q ss_pred h--hcCCCce
Q psy12912 94 R--EDRPITI 101 (185)
Q Consensus 94 ~--~G~PaTi 101 (185)
. .|+|.+.
T Consensus 81 ~~~l~~~~~~ 90 (179)
T PF06757_consen 81 NDLLGLPPLN 90 (179)
T ss_pred HHHHcCCcCC
Confidence 5 4988664
No 31
>PF12668 DUF3791: Protein of unknown function (DUF3791); InterPro: IPR024269 This entry represents proteins of unknown function.
Probab=20.91 E-value=80 Score=21.48 Aligned_cols=24 Identities=21% Similarity=0.318 Sum_probs=20.2
Q ss_pred HHHHHHHhcCCChHHHhhhh-hcCC
Q psy12912 75 VEHFVKKYRLDCPSALERIR-EDRP 98 (185)
Q Consensus 75 l~~F~~~y~l~cp~A~~Rl~-~G~P 98 (185)
|+.|++++++.-+.|.+++. .|+-
T Consensus 8 Ie~~A~~~~~s~~ea~~~~~~~~~~ 32 (62)
T PF12668_consen 8 IEEFAKKLNISGEEAYNYFKRSGVI 32 (62)
T ss_pred HHHHHHHHCcCHHHHHHHHHHcCcH
Confidence 78999999999999999995 3543
No 32
>PRK06548 ribonuclease H; Provisional
Probab=20.80 E-value=83 Score=25.72 Aligned_cols=61 Identities=18% Similarity=0.177 Sum_probs=35.5
Q ss_pred hhhHHHHHHHHHHHHHHHHhhh-ccCCChHHHHHHHH-HHHHHHHHH-HHhhcCCCCCCHHHHH
Q psy12912 19 FDNQADLYAVINTLQNLEKAYI-RDCVTPKEYTAACS-KLLVQYKAA-FKQVQGEEYPTVEHFV 79 (185)
Q Consensus 19 ye~lAelysII~tle~LEkAyi-rD~It~~eYt~~c~-kLl~Qyk~~-~~~v~~~~~~~l~~F~ 79 (185)
-.|-|||.++|.+|+.+..... =-..|+++|.--|- +=+..+|.- ++.-.|..+.+.|-+.
T Consensus 41 TNnraEl~Aii~aL~~~~~~~~~v~I~TDS~yvi~~i~~W~~~Wk~~gWk~s~G~pV~N~dL~~ 104 (161)
T PRK06548 41 TNNIAELTAVRELLIATRHTDRPILILSDSKYVINSLTKWVYSWKMRKWRKADGKPVLNQEIIQ 104 (161)
T ss_pred CHHHHHHHHHHHHHHhhhcCCceEEEEeChHHHHHHHHHHHHHHHHCCCcccCCCccccHHHHH
Confidence 4567999999999987754311 13478899976653 345555421 1122234455555553
No 33
>PF04651 Pox_A12: Poxvirus A12 protein; InterPro: IPR006744 This family contains vaccinia virus protein A12 and its homologues. VVA12 is a virion protein though its function is unknown.
Probab=20.67 E-value=1e+02 Score=26.29 Aligned_cols=27 Identities=19% Similarity=0.399 Sum_probs=23.3
Q ss_pred CChHHHHHHHHHHHHHHHHHHHhhcCC
Q psy12912 44 VTPKEYTAACSKLLVQYKAAFKQVQGE 70 (185)
Q Consensus 44 It~~eYt~~c~kLl~Qyk~~~~~v~~~ 70 (185)
=+.++|.++.+||--|.|+++.++.++
T Consensus 9 SSYDDYIeTinKitPQLrtlLahI~ge 35 (189)
T PF04651_consen 9 SSYDDYIETINKITPQLRTLLAHISGE 35 (189)
T ss_pred ccHHHHHHHHHHhCHHHHHHHHHhccc
Confidence 356899999999999999999998543
No 34
>cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=20.16 E-value=2.7e+02 Score=24.17 Aligned_cols=39 Identities=21% Similarity=0.315 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCC----CCHHHHHHHhcCCCh
Q psy12912 49 YTAACSKLLVQYKAAFKQVQGEEY----PTVEHFVKKYRLDCP 87 (185)
Q Consensus 49 Yt~~c~kLl~Qyk~~~~~v~~~~~----~~l~~F~~~y~l~cp 87 (185)
|.+....|..+|+..++.+++..+ +.+.-|+++|++..-
T Consensus 159 ~~~~L~~l~~~~~~~~~~~~~~~~v~~H~af~Yl~~~~gl~~~ 201 (286)
T cd01019 159 FNARLAELDATIKERLAPVKTKPFFVFHDAYGYFEKRYGLTQA 201 (286)
T ss_pred HHHHHHHHHHHHHHHhhccCCCeEEEecccHHHHHHHcCCcee
Confidence 444444444445555544432222 468999999999743
Done!