Query psy3529
Match_columns 97
No_of_seqs 108 out of 667
Neff 5.6
Searched_HMMs 46136
Date Fri Aug 16 18:58:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3529.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3529hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1692|consensus 99.9 2.2E-23 4.8E-28 154.0 6.9 71 1-73 130-200 (201)
2 KOG1690|consensus 99.8 2.9E-21 6.2E-26 143.6 7.1 70 2-73 145-214 (215)
3 KOG1693|consensus 99.8 1.1E-20 2.4E-25 140.4 7.3 72 1-74 136-207 (209)
4 KOG1691|consensus 99.8 3.3E-20 7.2E-25 138.4 8.2 71 1-73 139-209 (210)
5 KOG3287|consensus 99.7 1.5E-17 3.3E-22 125.2 7.8 76 1-78 158-233 (236)
6 PF01105 EMP24_GP25L: emp24/gp 99.6 7.6E-17 1.7E-21 111.8 -0.3 67 1-69 117-183 (183)
7 PF10805 DUF2730: Protein of u 51.7 21 0.00044 23.8 3.1 29 42-70 6-34 (106)
8 PF09323 DUF1980: Domain of un 49.8 30 0.00065 24.8 3.9 36 34-71 25-60 (182)
9 PF00517 GP41: Retroviral enve 41.9 1.3E+02 0.0029 22.1 6.4 45 44-90 153-197 (204)
10 COG3346 Uncharacterized conser 39.8 9.5 0.00021 29.6 -0.0 38 46-92 20-57 (252)
11 PF10754 DUF2569: Protein of u 38.3 91 0.002 21.6 4.8 30 46-75 56-85 (149)
12 PF06624 RAMP4: Ribosome assoc 34.6 18 0.0004 22.4 0.7 23 42-64 38-60 (63)
13 PHA02148 hypothetical protein 34.3 1.5E+02 0.0032 20.1 6.9 52 9-62 50-102 (110)
14 PF04678 DUF607: Protein of un 33.5 1.8E+02 0.0039 20.8 7.9 38 4-41 58-95 (180)
15 COG5102 SFT2 Membrane protein 32.7 52 0.0011 24.7 2.9 34 39-74 99-134 (201)
16 PF14654 Epiglycanin_C: Mucin, 30.7 1.2E+02 0.0026 20.7 4.1 29 46-74 24-52 (106)
17 KOG1656|consensus 30.4 1.8E+02 0.0038 22.3 5.5 40 2-41 20-63 (221)
18 KOG1924|consensus 24.9 1.6E+02 0.0036 27.1 5.0 83 2-87 364-460 (1102)
19 PF10779 XhlA: Haemolysin XhlA 24.3 1.8E+02 0.0038 17.7 8.0 37 3-39 6-42 (71)
20 PF11446 DUF2897: Protein of u 22.8 1.3E+02 0.0029 18.0 3.0 24 43-66 2-25 (55)
21 PF04728 LPP: Lipoprotein leuc 22.4 1.9E+02 0.0042 17.5 5.4 40 2-41 9-48 (56)
22 COG2864 FdnI Cytochrome b subu 22.3 1.3E+02 0.0029 22.9 3.5 31 35-68 11-41 (218)
23 PF11969 DcpS_C: Scavenger mRN 22.2 1.9E+02 0.004 19.1 4.0 15 79-93 80-94 (116)
24 cd01042 DMQH Demethoxyubiquino 21.8 3.1E+02 0.0068 19.7 5.8 27 13-39 114-140 (165)
25 PF07509 DUF1523: Protein of u 21.6 1.5E+02 0.0033 21.9 3.6 26 43-68 150-175 (175)
26 PF06196 DUF997: Protein of un 21.4 1.8E+02 0.004 18.5 3.6 28 43-70 43-71 (80)
27 TIGR00383 corA magnesium Mg(2+ 20.9 3.7E+02 0.0081 20.2 8.5 36 24-61 242-277 (318)
28 COG2155 Uncharacterized conser 20.8 1.6E+02 0.0035 19.1 3.2 29 45-73 42-70 (79)
29 PF02301 HORMA: HORMA domain; 20.3 1.4E+02 0.003 21.2 3.2 32 46-77 5-36 (208)
30 PRK10884 SH3 domain-containing 20.2 3.7E+02 0.0081 20.0 8.0 23 36-60 167-189 (206)
31 PHA01750 hypothetical protein 20.0 1.7E+02 0.0038 18.6 3.2 28 39-68 4-32 (75)
No 1
>KOG1692|consensus
Probab=99.89 E-value=2.2e-23 Score=153.96 Aligned_cols=71 Identities=18% Similarity=0.202 Sum_probs=68.7
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCccc
Q psy3529 1 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGTQI 73 (97)
Q Consensus 1 me~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~ 73 (97)
++++|.+|++.|..++.||+|+..||.+||++||+||+||. |||++|.++||+++++||||||+||+.|+.
T Consensus 130 L~~~I~eL~~al~~Vk~EQeY~~~Rer~Hr~~nEntn~RVv--~wsife~~vLi~~s~~QVyYLkRfFEvkrv 200 (201)
T KOG1692|consen 130 LEEMIRELSEALTSVKHEQEYMEARERIHRNTNENTNSRVV--LWSIFEALVLIAMSVLQVYYLKRFFEVKRV 200 (201)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhhcccceee--hHHHHHHHHHHHHHHHHHHHHHHhheeeec
Confidence 47899999999999999999999999999999999999999 999999999999999999999999999864
No 2
>KOG1690|consensus
Probab=99.84 E-value=2.9e-21 Score=143.61 Aligned_cols=70 Identities=19% Similarity=0.216 Sum_probs=67.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCccc
Q psy3529 2 ESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGTQI 73 (97)
Q Consensus 2 e~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~ 73 (97)
+..+..|...+.+|+.||+++|.||+++|+++||+|+||+ |||++|+++|++.|+||+.+||+||.++|.
T Consensus 145 ~~Rv~~L~~~~~~IrkEQ~~~R~RE~~FR~tSES~NsRvm--~Wsv~Q~vvL~~tc~wQmrhL~~FFvkqKl 214 (215)
T KOG1690|consen 145 EGRVRQLNSRLESIRKEQNLQREREETFRDTSESANSRVM--WWSVAQLVVLLVTCIWQMRHLKSFFVKQKL 214 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcceee--ehhHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 5567889999999999999999999999999999999999 999999999999999999999999999885
No 3
>KOG1693|consensus
Probab=99.83 E-value=1.1e-20 Score=140.38 Aligned_cols=72 Identities=39% Similarity=0.431 Sum_probs=69.4
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCcccc
Q psy3529 1 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGTQIT 74 (97)
Q Consensus 1 me~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~~ 74 (97)
||+++.+|+++|+.|.+.|+|+|.||+|+|.++||||+||. |||++++++++++|++||+.||.||+.++..
T Consensus 136 mena~~~I~~~L~~I~~~q~y~R~RE~rn~~tv~st~~Rv~--~~Sl~e~~~vv~iSi~Qv~ilk~fFt~~r~~ 207 (209)
T KOG1693|consen 136 MENAIVEIHRALNKIDDTQTYYRLREARNRSTVESTNSRVT--WWSLLEIIAVVVISIAQVFILKFFFTDRRKR 207 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCccchhcccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhccCcC
Confidence 68999999999999999999999999999999999999999 9999999999999999999999999988654
No 4
>KOG1691|consensus
Probab=99.82 E-value=3.3e-20 Score=138.41 Aligned_cols=71 Identities=14% Similarity=0.176 Sum_probs=68.5
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCccc
Q psy3529 1 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGTQI 73 (97)
Q Consensus 1 me~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~ 73 (97)
||-.+++|.+.+++|.++..|+|.||+++|++||+||+||. |+|++.+++++++++||++|||+||++||.
T Consensus 139 ~E~elrrLed~~~sI~~e~~YLr~REeemr~~nesTNsrv~--~fSi~Sl~v~~~va~~QvlyLK~fF~kKKL 209 (210)
T KOG1691|consen 139 LEVELRRLEDLVESIHEEMYYLREREEEMRNTNESTNSRVA--WFSILSLVVLLSVAGWQVLYLKRFFQKKKL 209 (210)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 46789999999999999999999999999999999999999 999999999999999999999999999974
No 5
>KOG3287|consensus
Probab=99.72 E-value=1.5e-17 Score=125.16 Aligned_cols=76 Identities=22% Similarity=0.283 Sum_probs=71.4
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCcccccccc
Q psy3529 1 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGTQITNSRN 78 (97)
Q Consensus 1 me~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~~~~~~ 78 (97)
++++++.++.+|++....|+++|.||+|+|++.|++..||. |||.+|+++||++|+.|||+||++|+.|+...+++
T Consensus 158 i~~~i~~i~~nl~k~~~~q~~lRa~EaRDr~L~esNf~rVN--~WS~vq~~vmi~v~~iQVf~lrslFe~~~~~~t~~ 233 (236)
T KOG3287|consen 158 IEDSIGTIKNNLNKMWQYQALLRAREARDRNLQESNFDRVN--FWSMVQTLVMILVGIIQVFMLRSLFEVKSKSDTKK 233 (236)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHhcccchhh--HHHHHHHHHHHHHhhhhhhhhHHHhcCCCCccccc
Confidence 36789999999999999999999999999999999999999 99999999999999999999999999998775543
No 6
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=99.60 E-value=7.6e-17 Score=111.81 Aligned_cols=67 Identities=24% Similarity=0.221 Sum_probs=5.4
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccC
Q psy3529 1 MESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAALIRLYPT 69 (97)
Q Consensus 1 me~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~ 69 (97)
++..+.+|.+.+..|+++|.|++.||+++++++|+++.||. |||+++++++++++++|+++||+||+
T Consensus 117 ~~~~l~~l~~~l~~i~~~q~~~~~r~~~~~~~~es~~~~i~--~~si~~~~vli~~~~~Qv~~lk~~f~ 183 (183)
T PF01105_consen 117 LEESLEKLESNLKEIKDEQKYLREREERHRQLNESTNSRIM--WWSIIQIVVLILVSVWQVYYLKKFFK 183 (183)
T ss_dssp ---------------------------------------------------------------HHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheEE--hHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 46788999999999999999999999999999999999999 99999999999999999999999995
No 7
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=51.68 E-value=21 Score=23.80 Aligned_cols=29 Identities=14% Similarity=-0.028 Sum_probs=24.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHccCC
Q psy3529 42 RPALSLGARIAKIVPSAADAALIRLYPTG 70 (97)
Q Consensus 42 ~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~ 70 (97)
+.||+++..++.++.++++.+.=|+|-++
T Consensus 6 ~~~w~ii~a~~~~~~~~~~~~l~~~~a~~ 34 (106)
T PF10805_consen 6 KKNWGIIWAVFGIAGGIFWLWLRRTYAKR 34 (106)
T ss_pred HhCcHHHHHHHHHHHHHHHHHHHHhhccH
Confidence 45999999999999999999888877653
No 8
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=49.82 E-value=30 Score=24.76 Aligned_cols=36 Identities=11% Similarity=-0.170 Sum_probs=29.1
Q ss_pred HhHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHccCCc
Q psy3529 34 EDLNERAGRPALSLGARIAKIVPSAADAALIRLYPTGT 71 (97)
Q Consensus 34 EstnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF~~k 71 (97)
--.|-|.. +++.+..++++++++.|++.+-+-...+
T Consensus 25 ~YI~P~~~--~~~~~a~i~l~ilai~q~~~~~~~~~~~ 60 (182)
T PF09323_consen 25 LYIHPRYI--PLLYFAAILLLILAIVQLWRWFRPKRRK 60 (182)
T ss_pred HHhCccHH--HHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 34577888 9999999999999999999766655443
No 9
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=41.88 E-value=1.3e+02 Score=22.13 Aligned_cols=45 Identities=13% Similarity=-0.085 Sum_probs=4.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHccCCcccccccceeeecceeeecC
Q psy3529 44 ALSLGARIAKIVPSAADAALIRLYPTGTQITNSRNLTVDANIRIHAP 90 (97)
Q Consensus 44 ~wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~~~~~~~~~~~~~~~~~~ 90 (97)
|...+-++++|++++.=+..+.-.... ..+-++.|....+.||..
T Consensus 153 W~~~Iki~i~iv~~iI~lri~~~i~~~--~~~~~qgy~~lq~~~~~~ 197 (204)
T PF00517_consen 153 WLWYIKIFIMIVIGIIGLRILMCILRI--CRRVRQGYSPLQTPIHPD 197 (204)
T ss_dssp HHHHHHH----------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhCCCCCccccCCCC
Confidence 445566666666665544444333333 234466666666666643
No 10
>COG3346 Uncharacterized conserved protein [Function unknown]
Probab=39.82 E-value=9.5 Score=29.62 Aligned_cols=38 Identities=16% Similarity=0.134 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcccccccceeeecceeeecCCc
Q psy3529 46 SLGARIAKIVPSAADAALIRLYPTGTQITNSRNLTVDANIRIHAPPS 92 (97)
Q Consensus 46 Si~q~~vlI~vs~~Qv~~LR~fF~~kk~~~~~~~~~~~~~~~~~~~~ 92 (97)
.++-.++++++|.||+ ++...|.... =++..++|+||-
T Consensus 20 ~~~~~~il~~LGtWQl---~Rl~wK~~li------a~ie~r~~appv 57 (252)
T COG3346 20 VLATFAILLGLGTWQL---QRLHWKLALI------ARIEARLHAPPV 57 (252)
T ss_pred HHHHHHHHHhhhhhhh---hhHHHHHHHH------HHHHHHhcCCCC
Confidence 3456677788888887 4444443322 235678899884
No 11
>PF10754 DUF2569: Protein of unknown function (DUF2569); InterPro: IPR019690 This entry represents a protein that is conserved in bacteria. The function is not known, but several members are annotated as being YdgK or a homologue thereof and associated to the inner membrane. This signature also matches proteins that are described as transglutaminase-like enzymes, although this could not be confirmed.
Probab=38.30 E-value=91 Score=21.63 Aligned_cols=30 Identities=10% Similarity=-0.172 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCccccc
Q psy3529 46 SLGARIAKIVPSAADAALIRLYPTGTQITN 75 (97)
Q Consensus 46 Si~q~~vlI~vs~~Qv~~LR~fF~~kk~~~ 75 (97)
-.++.+.-+++.+.-++.+-.||++|+..+
T Consensus 56 ~~~~~~~~~~~~~~~l~~~~lffkr~~~~P 85 (149)
T PF10754_consen 56 WYFEVAINIAMWLFTLWLLYLFFKRKRRFP 85 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccchhH
Confidence 356788888888999999999999988763
No 12
>PF06624 RAMP4: Ribosome associated membrane protein RAMP4; InterPro: IPR010580 This entry contains Serp1/Ramp4, which has been shown to interacts with target proteins during their translocation into the lumen of the endoplasmic reticulum. It has also been shown to protect unfolded target proteins against degradation during ER stress. It may facilitate glycosylation of target proteins after termination of ER stress and may modulate the use of N-glycosylation sites on target proteins [, ].
Probab=34.61 E-value=18 Score=22.36 Aligned_cols=23 Identities=9% Similarity=-0.285 Sum_probs=19.8
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHH
Q psy3529 42 RPALSLGARIAKIVPSAADAALI 64 (97)
Q Consensus 42 ~~~wSi~q~~vlI~vs~~Qv~~L 64 (97)
-|||=.+-++|++|-+++|++-+
T Consensus 38 gp~~L~l~iFVV~Gs~ifqiir~ 60 (63)
T PF06624_consen 38 GPWLLGLFIFVVCGSAIFQIIRS 60 (63)
T ss_pred CHHHHhhhheeeEcHHHHHHHHH
Confidence 47999999999999999999754
No 13
>PHA02148 hypothetical protein
Probab=34.33 E-value=1.5e+02 Score=20.11 Aligned_cols=52 Identities=15% Similarity=0.097 Sum_probs=41.8
Q ss_pred HHHHHHH-HHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHHH
Q psy3529 9 HKSLNTI-IDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADAA 62 (97)
Q Consensus 9 ~~~L~~I-~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv~ 62 (97)
...|+++ +.-|+-+|..|..-|...|+-.-.+. -.|.-|.++-=.+-.+|+.
T Consensus 50 ~~~L~~~A~~R~NEL~~HE~Air~~M~~HS~~M~--~LS~~~~LiSDS~K~~~~~ 102 (110)
T PHA02148 50 QTRLEELAKSRENELRRHEAAIRLAMDSHSIKMD--DLSLQEALISDSLKGGKYL 102 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--cchHHHHHHHHHHHHHHHH
Confidence 3344444 45678899999999999999999999 9999999988887777763
No 14
>PF04678 DUF607: Protein of unknown function, DUF607; InterPro: IPR006769 This entry represents the C-terminal domain of coiled-coil domain containing protein 109.
Probab=33.49 E-value=1.8e+02 Score=20.83 Aligned_cols=38 Identities=13% Similarity=0.089 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhh
Q psy3529 4 STQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAG 41 (97)
Q Consensus 4 si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~ 41 (97)
..+.+.+.++.++++..-+.....+-...++..-.|+.
T Consensus 58 ~~~~l~~~l~~~~~el~~le~~k~~id~~A~~~~~~~~ 95 (180)
T PF04678_consen 58 RERQLRKRLEELRQELAPLEKIKQEIDEKAEKRARRLL 95 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667888888888888888878777777777766766
No 15
>COG5102 SFT2 Membrane protein involved in ER to Golgi transport [Intracellular trafficking and secretion]
Probab=32.66 E-value=52 Score=24.67 Aligned_cols=34 Identities=6% Similarity=-0.025 Sum_probs=26.5
Q ss_pred hhhhhhHHHHHHHHHHHHHHHH--HHHHHHccCCcccc
Q psy3529 39 RAGRPALSLGARIAKIVPSAAD--AALIRLYPTGTQIT 74 (97)
Q Consensus 39 RV~~~~wSi~q~~vlI~vs~~Q--v~~LR~fF~~kk~~ 74 (97)
-+. .|.++..+..++.|+.| +-|+|.++.+++..
T Consensus 99 FiL--lwTmgslLfvl~Fg~l~Gf~ayl~~Lts~erlp 134 (201)
T COG5102 99 FIL--LWTMGSLLFVLMFGFLLGFRAYLEGLTSKERLP 134 (201)
T ss_pred eee--ehhHHHHHHHHHHHHHHhHHHHHHhhhhhhccc
Confidence 355 78888888888888776 67899999888754
No 16
>PF14654 Epiglycanin_C: Mucin, catalytic, TM and cytoplasmic tail region
Probab=30.72 E-value=1.2e+02 Score=20.73 Aligned_cols=29 Identities=10% Similarity=-0.103 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcccc
Q psy3529 46 SLGARIAKIVPSAADAALIRLYPTGTQIT 74 (97)
Q Consensus 46 Si~q~~vlI~vs~~Qv~~LR~fF~~kk~~ 74 (97)
+++..++-+++.++=.+.+|++|.-+...
T Consensus 24 tLasVvvavGl~aGLfFcvR~~lslrn~~ 52 (106)
T PF14654_consen 24 TLASVVVAVGLFAGLFFCVRNSLSLRNTF 52 (106)
T ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccc
Confidence 44455666777778888899988766543
No 17
>KOG1656|consensus
Probab=30.44 E-value=1.8e+02 Score=22.34 Aligned_cols=40 Identities=13% Similarity=0.131 Sum_probs=30.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhHH----HHHhHhhhhh
Q psy3529 2 ESSTQEIHKSLNTIIDYQTHHRLREAQGRK----RAEDLNERAG 41 (97)
Q Consensus 2 e~si~~L~~~L~~I~~~Q~y~R~RE~~~R~----~~EstnsRV~ 41 (97)
++.|++|.+.++-..+.|+|++.+=..... ..-..|.|+.
T Consensus 20 ~eaI~kLrEteemL~KKqe~Le~ki~~e~e~~A~k~~tkNKR~A 63 (221)
T KOG1656|consen 20 QEAIQKLRETEEMLEKKQEFLEKKIEQEVENNARKYGTKNKRMA 63 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHH
Confidence 468999999999999999999977655533 3445677765
No 18
>KOG1924|consensus
Probab=24.92 E-value=1.6e+02 Score=27.13 Aligned_cols=83 Identities=16% Similarity=0.037 Sum_probs=56.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhHHHHHHHHHH--------------HHHHHHHHHHHHc
Q psy3529 2 ESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAGRPALSLGARIAKI--------------VPSAADAALIRLY 67 (97)
Q Consensus 2 e~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI--------------~vs~~Qv~~LR~f 67 (97)
|+.+.+|...|+.|+.+.+-+-.=-.--.+++++|-..=. +.|++|-+.+| =-|+.|+.+=|..
T Consensus 364 e~Dl~el~~rledir~emDd~~~~f~lL~n~vkdT~aE~y--fLSILQhlllirnDy~~rpqYykLIEecISqIvlHr~~ 441 (1102)
T KOG1924|consen 364 EDDLEELSGRLEDIRAEMDDANEVFELLANTVKDTGAEPY--FLSILQHLLLIRNDYYIRPQYYKLIEECISQIVLHRTG 441 (1102)
T ss_pred hhhHHHHHhHHHhhhhhhccHHHHHHHHHHhhhhccccch--HHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHhcCC
Confidence 4567788888888988887666555666788888888888 89999988764 5677787666554
Q ss_pred cCCcccccccceeeecceee
Q psy3529 68 PTGTQITNSRNLTVDANIRI 87 (97)
Q Consensus 68 F~~kk~~~~~~~~~~~~~~~ 87 (97)
-.. ...|.-++-+|...+|
T Consensus 442 ~DP-df~yr~~l~id~~~li 460 (1102)
T KOG1924|consen 442 MDP-DFKYRFRLDIDLTELI 460 (1102)
T ss_pred CCC-CcchhhcccCcHHHHH
Confidence 332 2223334555544433
No 19
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=24.25 E-value=1.8e+02 Score=17.72 Aligned_cols=37 Identities=19% Similarity=0.262 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhh
Q psy3529 3 SSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNER 39 (97)
Q Consensus 3 ~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsR 39 (97)
+.+.+++..++.+.+.-.-+..+.+....-.++.|.+
T Consensus 6 e~l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~ 42 (71)
T PF10779_consen 6 EKLNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQ 42 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444333333344444443333333333
No 20
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=22.76 E-value=1.3e+02 Score=17.97 Aligned_cols=24 Identities=4% Similarity=-0.137 Sum_probs=19.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Q psy3529 43 PALSLGARIAKIVPSAADAALIRL 66 (97)
Q Consensus 43 ~~wSi~q~~vlI~vs~~Qv~~LR~ 66 (97)
+|+..+-+++.+|+-++-+..||.
T Consensus 2 ~~~~wlIIviVlgvIigNia~LK~ 25 (55)
T PF11446_consen 2 TWNPWLIIVIVLGVIIGNIAALKY 25 (55)
T ss_pred cchhhHHHHHHHHHHHhHHHHHHH
Confidence 477777788888888899988884
No 21
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=22.37 E-value=1.9e+02 Score=17.49 Aligned_cols=40 Identities=15% Similarity=0.205 Sum_probs=27.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhHhhhhh
Q psy3529 2 ESSTQEIHKSLNTIIDYQTHHRLREAQGRKRAEDLNERAG 41 (97)
Q Consensus 2 e~si~~L~~~L~~I~~~Q~y~R~RE~~~R~~~EstnsRV~ 41 (97)
...++.|...+.++..+..-+|.--..-..=+...|.|+-
T Consensus 9 s~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRAN~RlD 48 (56)
T PF04728_consen 9 SSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARANQRLD 48 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4557778888888888888877554444555556666653
No 22
>COG2864 FdnI Cytochrome b subunit of formate dehydrogenase [Energy production and conversion]
Probab=22.25 E-value=1.3e+02 Score=22.91 Aligned_cols=31 Identities=10% Similarity=-0.088 Sum_probs=27.8
Q ss_pred hHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHcc
Q psy3529 35 DLNERAGRPALSLGARIAKIVPSAADAALIRLYP 68 (97)
Q Consensus 35 stnsRV~~~~wSi~q~~vlI~vs~~Qv~~LR~fF 68 (97)
+..+|+. -|...-+.++.+++..++++- .|+
T Consensus 11 t~~eR~~--Hw~~ai~f~il~lSG~~~~~~-~F~ 41 (218)
T COG2864 11 TALERIN--HWIVAICFFILALSGLAFFFP-DFM 41 (218)
T ss_pred chHHHHH--HHHHHHHHHHHHHHHHHHHHh-HHH
Confidence 4678999 999999999999999999998 665
No 23
>PF11969 DcpS_C: Scavenger mRNA decapping enzyme C-term binding; PDB: 1VLR_B 1XMM_D 1XML_B 1ST0_A 3BLA_B 3BL9_B 3BL7_B 1ST4_B 1XQU_B.
Probab=22.16 E-value=1.9e+02 Score=19.11 Aligned_cols=15 Identities=33% Similarity=0.510 Sum_probs=11.0
Q ss_pred eeeecceeeecCCcc
Q psy3529 79 LTVDANIRIHAPPSF 93 (97)
Q Consensus 79 ~~~~~~~~~~~~~~~ 93 (97)
..-+.+.-.|+|||+
T Consensus 80 ~~~~~~~gfH~~PS~ 94 (116)
T PF11969_consen 80 DSDDIRLGFHYPPSV 94 (116)
T ss_dssp EGGGEEEEEESS-SS
T ss_pred chhhhcccccCCCCc
Confidence 445688999999986
No 24
>cd01042 DMQH Demethoxyubiquinone hydroxylase, ferritin-like diiron-binding domain. Demethoxyubiquinone hydroxylases (DMQH) are members of the ferritin-like, diiron-carboxylate family which are present in eukaryotes (the CLK-1/CAT5 family) and prokaryotes (the Coq7 family). DMQH participates in one of the last steps of ubiquinone biosysnthesis and is responsible for DMQ hydroxylation, resulting in the formation of hydroxyubiquinone, a precursor of ubiquinone. CLK-1 is a mitochondrial inner membrane protein and Coq7 is a proposed interfacial integral membrane protein. Mutations in the Caenorhabditis elegans gene clk-1 affect biological timing and extend longevity. The conserved residues of a diiron center are present in this domain.
Probab=21.81 E-value=3.1e+02 Score=19.72 Aligned_cols=27 Identities=19% Similarity=0.166 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHhHhhh
Q psy3529 13 NTIIDYQTHHRLREAQGRKRAEDLNER 39 (97)
Q Consensus 13 ~~I~~~Q~y~R~RE~~~R~~~EstnsR 39 (97)
.++++.-.-+|.-|.+|++.+......
T Consensus 114 ~~l~~~l~~~r~DE~~H~d~A~~~ga~ 140 (165)
T cd01042 114 KELRAIIEQFRDDELEHADIAEELGAE 140 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC
Confidence 344455566788899999999999887
No 25
>PF07509 DUF1523: Protein of unknown function (DUF1523); InterPro: IPR011088 This entry is represented by Bacteriophage phiNM3, A0EWY4 from SWISSPROT. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. The members of this family are restricted to the Gammaproteobacteria and Epsilonproteobacteria, the function of these proteins is unknown.
Probab=21.56 E-value=1.5e+02 Score=21.93 Aligned_cols=26 Identities=12% Similarity=-0.088 Sum_probs=19.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcc
Q psy3529 43 PALSLGARIAKIVPSAADAALIRLYP 68 (97)
Q Consensus 43 ~~wSi~q~~vlI~vs~~Qv~~LR~fF 68 (97)
||++++-.+++.++...=.-..|+||
T Consensus 150 Pwfn~i~~~ll~~~~~~~~~~~r~fr 175 (175)
T PF07509_consen 150 PWFNIIFYVLLAALFFAIRRRWRRFR 175 (175)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 68888877777777776666677765
No 26
>PF06196 DUF997: Protein of unknown function (DUF997); InterPro: IPR010398 This is a family of predicted bacterial membrane protein with unknown function.
Probab=21.42 E-value=1.8e+02 Score=18.52 Aligned_cols=28 Identities=7% Similarity=0.026 Sum_probs=17.9
Q ss_pred hhHHHHHHHHHHHH-HHHHHHHHHHccCC
Q psy3529 43 PALSLGARIAKIVP-SAADAALIRLYPTG 70 (97)
Q Consensus 43 ~~wSi~q~~vlI~v-s~~Qv~~LR~fF~~ 70 (97)
|.|=+..+++.-++ .++=.+.+|.+|+.
T Consensus 43 PlWF~~SCi~~~il~~~l~~~~vk~~Fkd 71 (80)
T PF06196_consen 43 PLWFFYSCIGGPILFIILVWLMVKFFFKD 71 (80)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 66777776644444 55555668888854
No 27
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=20.91 E-value=3.7e+02 Score=20.23 Aligned_cols=36 Identities=17% Similarity=0.017 Sum_probs=21.8
Q ss_pred HHHHhhHHHHHhHhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy3529 24 LREAQGRKRAEDLNERAGRPALSLGARIAKIVPSAADA 61 (97)
Q Consensus 24 ~RE~~~R~~~EstnsRV~~~~wSi~q~~vlI~vs~~Qv 61 (97)
..+...-..++.+|+-+. ..+++.++.+...-+.-+
T Consensus 242 l~d~~~~~~s~~~N~~mk--~LTvvt~IflP~t~IaGi 277 (318)
T TIGR00383 242 LMDLYLSLVNNKMNEIMK--ILTVVSTIFIPLTFIAGI 277 (318)
T ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 344555566777888776 666666665555444443
No 28
>COG2155 Uncharacterized conserved protein [Function unknown]
Probab=20.76 E-value=1.6e+02 Score=19.09 Aligned_cols=29 Identities=7% Similarity=-0.224 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCccc
Q psy3529 45 LSLGARIAKIVPSAADAALIRLYPTGTQI 73 (97)
Q Consensus 45 wSi~q~~vlI~vs~~Qv~~LR~fF~~kk~ 73 (97)
||-+--++.-..|++|+..+-+.|.+++.
T Consensus 42 lsri~YiivGlagv~~il~l~k~~~~~~~ 70 (79)
T COG2155 42 LTRIIYIIVGLAGVYQILPLFKVCEKKSR 70 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHhcchhhcchh
Confidence 34444455555677787777777776653
No 29
>PF02301 HORMA: HORMA domain; InterPro: IPR003511 The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. Hop1 is a meiosis-specific protein, Rev7 is required for DNA damage induced mutagenesis, and MAD2 is a spindle checkpoint protein which prevents progression of the cell cycle upon detection of a defect in mitotic spindle integrity [, ].; GO: 0007067 mitosis; PDB: 3ABD_B 3ABE_C 4AEZ_H 2V64_A 2VFX_A 1GO4_D 2QYF_C 3GMH_B 1KLQ_A 1S2H_A ....
Probab=20.30 E-value=1.4e+02 Score=21.18 Aligned_cols=32 Identities=13% Similarity=0.010 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCccccccc
Q psy3529 46 SLGARIAKIVPSAADAALIRLYPTGTQITNSR 77 (97)
Q Consensus 46 Si~q~~vlI~vs~~Qv~~LR~fF~~kk~~~~~ 77 (97)
|..-+--++.+++-|+.|+|.+|-...-...+
T Consensus 5 S~~~v~~~L~~~is~Ily~R~lyP~~~F~~~~ 36 (208)
T PF02301_consen 5 SLELVKEFLEVAISQILYLRGLYPEDCFEDRR 36 (208)
T ss_dssp HHHHHHHHHHHHHHHHHHHTTSS-GGGEEEEE
T ss_pred HHHHHHHHHHHHHHHHHHHhccCCHHHCcEEE
Confidence 55556678889999999999999776554333
No 30
>PRK10884 SH3 domain-containing protein; Provisional
Probab=20.22 E-value=3.7e+02 Score=19.96 Aligned_cols=23 Identities=13% Similarity=-0.133 Sum_probs=15.9
Q ss_pred HhhhhhhhhHHHHHHHHHHHHHHHH
Q psy3529 36 LNERAGRPALSLGARIAKIVPSAAD 60 (97)
Q Consensus 36 tnsRV~~~~wSi~q~~vlI~vs~~Q 60 (97)
.+.+.. ||-.+-.++++|+-+|=
T Consensus 167 ~~~~~~--wf~~Gg~v~~~GlllGl 189 (206)
T PRK10884 167 RTIIMQ--WFMYGGGVAGIGLLLGL 189 (206)
T ss_pred HHHHHH--HHHHchHHHHHHHHHHH
Confidence 344445 99888888888776553
No 31
>PHA01750 hypothetical protein
Probab=20.05 E-value=1.7e+02 Score=18.59 Aligned_cols=28 Identities=18% Similarity=0.043 Sum_probs=21.4
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHH-HHHcc
Q psy3529 39 RAGRPALSLGARIAKIVPSAADAAL-IRLYP 68 (97)
Q Consensus 39 RV~~~~wSi~q~~vlI~vs~~Qv~~-LR~fF 68 (97)
-|. ..++.....-+.+++.|+|+ +|.-|
T Consensus 4 ~VL--vLtlmSTtaTtlFaIiqlYlKIKq~l 32 (75)
T PHA01750 4 TVL--VLTLMSTTATTLFAIIQLYLKIKQAL 32 (75)
T ss_pred hhh--hHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356 77888999999999999986 34443
Done!