Query 044591
Match_columns 241
No_of_seqs 116 out of 430
Neff 6.7
Searched_HMMs 46136
Date Fri Mar 29 04:50:24 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044591.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044591hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04755 PAP_fibrillin: PAP_fi 100.0 7.8E-41 1.7E-45 284.2 20.3 179 47-230 2-198 (198)
2 PF08212 Lipocalin_2: Lipocali 57.2 54 0.0012 26.0 7.0 35 203-237 81-118 (143)
3 PF14869 DUF4488: Domain of un 53.5 1.2E+02 0.0025 24.6 8.1 72 96-183 2-75 (133)
4 COG3040 Blc Bacterial lipocali 48.8 72 0.0016 26.9 6.5 32 205-236 111-145 (174)
5 PF12283 Protein_K: Bacterioph 37.4 67 0.0015 21.6 3.7 36 45-84 5-41 (56)
6 PF05973 Gp49: Phage derived p 31.9 1.4E+02 0.003 21.6 5.2 40 68-108 8-49 (91)
7 PF07624 PSD2: Protein of unkn 29.5 1.4E+02 0.0031 21.2 4.8 26 58-86 35-60 (76)
8 COG3908 Uncharacterized protei 29.2 83 0.0018 22.6 3.3 23 206-231 12-34 (77)
9 PF10047 DUF2281: Protein of u 29.1 1.3E+02 0.0028 20.9 4.4 48 49-105 2-49 (66)
10 PRK03760 hypothetical protein; 29.1 28 0.00062 27.2 1.1 15 206-220 60-74 (117)
11 KOG1513 Nuclear helicase MOP-3 27.6 65 0.0014 33.8 3.5 39 38-82 769-807 (1300)
12 PF03076 GP3: Equine arteritis 23.0 60 0.0013 25.9 1.9 17 203-219 122-138 (160)
13 PHA00097 K protein K 22.7 1.7E+02 0.0036 19.7 3.6 36 45-84 5-41 (56)
14 TIGR02116 toxin_Txe_YoeB toxin 22.6 99 0.0021 22.0 2.9 35 70-106 19-53 (80)
No 1
>PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle. The coat may contain receptors for attachment to the thylakoid membrane as well as regulatory proteins that may function in the transfer of lipids to and from the thylakoid membranes.). This entry also represents a number of putative fibrillin proteins.; GO: 0005198 structural molecule activity, 0009507 chloroplast
Probab=100.00 E-value=7.8e-41 Score=284.17 Aligned_cols=179 Identities=36% Similarity=0.684 Sum_probs=146.5
Q ss_pred HHHHHHHHHHhhcCCCCCCCChhhHHHHHHHHHHHHhcCCCCCCCCCC-CCcCcEEEEEEeecccccccccccCcccccc
Q 044591 47 QIKTELLQVVQGINRGIFGVPSAKKSEIEALVELLESQNPTPHPTANL-DKVGGTWKLVYSTITILGSKRTKLGLRDFIT 125 (241)
Q Consensus 47 ~lK~~LL~~ia~t~rG~~asp~~~r~~I~~li~~LE~~np~~~P~~~~-~lL~G~W~LvyTt~~~l~s~~~~lG~~~~~~ 125 (241)
++|++||+++++++||+.+++ +++++|+++|++||++||+++|+ ++ ++|+|+|+|+|||+...... ...+..++..
T Consensus 2 ~~K~~Ll~~~~~~~rG~~~~~-~~~~~i~~~v~~LE~~np~~~p~-~s~~~L~G~W~Lvytt~~~~~~~-l~~~~~~~~~ 78 (198)
T PF04755_consen 2 DLKQELLQAVAGTNRGLRASP-EDREEIEELVEELEALNPTPDPA-DSLPLLDGRWELVYTTSPEIRSL-LQRGRLPGVR 78 (198)
T ss_pred hHHHHHHHHHhccCCCccCCH-HHHHHHHHHHHHHHHhCCCCCCc-CCchhcCcEEEEEeecCCCcccc-cccccccccc
Confidence 699999999999999999886 58999999999999999999998 55 99999999999998754321 0012224567
Q ss_pred cCeeEEEEECCCCeEEEEEEEeecCccceeeEEEEEEEEEEcCCceEEEEeecceechhhhh--------hhh-------
Q 044591 126 LGDFFQSIDVAKGKAVNVIKFNVRGLNLLNGQLTIEASFKIASKSRVDIAYDNSTITPEQLM--------NMF------- 190 (241)
Q Consensus 126 ~g~v~Q~Id~~~~~~~N~v~f~~~gl~~l~g~v~v~a~~~~~~~~rv~v~F~~~~v~~~~l~--------~l~------- 190 (241)
++++||+||.+++++.|+|+|.. +|.+.|.+.+.+++++.++.|+.++|+++.+.++.+. +.+
T Consensus 79 ~~~v~Q~id~~~~~~~N~v~~~~--~~~~~~~~~v~a~~~~~~~~rv~v~f~~~~l~~~~~l~~~l~~~~~~~~~v~~~~ 156 (198)
T PF04755_consen 79 VGRVFQTIDADNGRVENVVELSG--FPLLEGSVSVRASLEVRSPRRVEVTFERASLKPPSLLKGVLGPLKDALNNVPRGI 156 (198)
T ss_pred ccceEEEEECCCceEEEEEEEec--cCceEEEEEEEEEEEEccccEEEEEEEeeEEcccceeeccchhhhhhhhhccccc
Confidence 99999999999999999999765 5667889999999999999999999999999864421 000
Q ss_pred h--hccccccccCCCceEEEEEEecCCeEEEeCCCCcEEEEE
Q 044591 191 R--KNYDLLLGIFNPDGWLEISYVDDTMRIGRDDKGNIFILE 230 (241)
Q Consensus 191 ~--~~~~~~lp~~~p~g~l~~tYLDedlRI~Rg~~G~~FV~~ 230 (241)
. .+..++++...++|||+||||||||||+||++|++|||.
T Consensus 157 ~~~~~~~~~~~~~~~~g~l~~tYLDedlRI~Rg~~G~~fVl~ 198 (198)
T PF04755_consen 157 SDELPVPLPLPGGSPKGWLDTTYLDEDLRISRGNKGSLFVLK 198 (198)
T ss_pred ccccccccccCCCCCceEEEEEEECCCeEEEEcCCCCEEEeC
Confidence 0 111222333467999999999999999999999999984
No 2
>PF08212 Lipocalin_2: Lipocalin-like domain; InterPro: IPR000566 Proteins which transport small hydrophobic molecules such as steroids, bilins, retinoids, and lipids share limited regions of sequence homology and a common tertiary structure architecture [, , , , ]. This is an eight stranded antiparallel beta-barrel with a repeated + 1 topology enclosing a internal ligand binding site [, ]. The name 'lipocalin' has been proposed [] for this protein family, but cytosolic fatty-acid binding proteins are also included. The sequences of most members of the family, the core or kernal lipocalins, are characterised by three short conserved stretches of residues, while others, the outlier lipocalin group, share only one or two of these [, ]. Proteins known to belong to this family include alpha-1-microglobulin (protein HC); alpha-1-acid glycoprotein (orosomucoid) []; aphrodisin; apolipoprotein D; beta-lactoglobulin; complement component C8 gamma chain []; crustacyanin []; epididymal-retinoic acid binding protein (E-RABP) []; insectacyanin; odorant-binding protein (OBP); human pregnancy-associated endometrial alpha-2 globulin; probasin (PB), a rat prostatic protein; prostaglandin D synthase (5.3.99.2 from EC) []; purpurin; Von Ebner's gland protein (VEGP) []; and lizard epididymal secretory protein IV (LESP IV) [].; GO: 0005488 binding; PDB: 3EBW_B 1QWD_A 2ACO_A 3MBT_A.
Probab=57.19 E-value=54 Score=25.96 Aligned_cols=35 Identities=29% Similarity=0.502 Sum_probs=25.9
Q ss_pred CceEEEEEEecCC---eEEEeCCCCcEEEEEecCCCCC
Q 044591 203 PDGWLEISYVDDT---MRIGRDDKGNIFILERSSSAGE 237 (241)
Q Consensus 203 p~g~l~~tYLDed---lRI~Rg~~G~~FV~~R~~~~~~ 237 (241)
+.+-..++|+|.| .=|+-.+...++||.|.....+
T Consensus 81 ~~~~YwVl~~D~dY~~~iv~~~~~~~~WILsR~p~~~~ 118 (143)
T PF08212_consen 81 PKGNYWVLYTDYDYSWAIVGSPDREYLWILSRTPQLSE 118 (143)
T ss_dssp EEEEEEEEEEBTTSSEEEEEECCCCEEEEEESSSS--H
T ss_pred CCcceEEEEEcCCccEEEEecCCCCEEEEEeCCCCCCH
Confidence 3677789999998 4455667889999999877543
No 3
>PF14869 DUF4488: Domain of unknown function (DUF4488)
Probab=53.50 E-value=1.2e+02 Score=24.56 Aligned_cols=72 Identities=15% Similarity=0.299 Sum_probs=43.7
Q ss_pred CcCcEEEEE-Eeecc-cccccccccCcccccccCeeEEEEECCCCeEEEEEEEeecCccceeeEEEEEEEEEEcCCceEE
Q 044591 96 KVGGTWKLV-YSTIT-ILGSKRTKLGLRDFITLGDFFQSIDVAKGKAVNVIKFNVRGLNLLNGQLTIEASFKIASKSRVD 173 (241)
Q Consensus 96 lL~G~W~Lv-yTt~~-~l~s~~~~lG~~~~~~~g~v~Q~Id~~~~~~~N~v~f~~~gl~~l~g~v~v~a~~~~~~~~rv~ 173 (241)
.|.|-|+|. |-+.. .... . ...+.++-++. ++|++.|..-....+ ..+...++|+..++....
T Consensus 2 ~l~GVWQ~c~~~~~~~~~~g---~------l~~~~~lKilS-~Dgtf~Ni~~~~~~~-----aiIt~~GtY~~~sD~~Y~ 66 (133)
T PF14869_consen 2 SLQGVWQLCHYVSESPEVPG---K------LKPSNVLKILS-DDGTFVNITMIPKSG-----AIITGYGTYEQPSDNIYV 66 (133)
T ss_pred CceEEEEEEEEeecCcccCc---e------EeecccEEEEc-CCCcEEEEEEeCCCC-----cEEEEeEEEEEcCCccce
Confidence 378999999 44332 2111 0 12345677774 678999997643222 346778899998877655
Q ss_pred EEeecceech
Q 044591 174 IAYDNSTITP 183 (241)
Q Consensus 174 v~F~~~~v~~ 183 (241)
-..++ .+.+
T Consensus 67 E~IeK-nv~l 75 (133)
T PF14869_consen 67 ESIEK-NVHL 75 (133)
T ss_pred eeecc-eecC
Confidence 55544 4443
No 4
>COG3040 Blc Bacterial lipocalin [Cell envelope biogenesis, outer membrane]
Probab=48.78 E-value=72 Score=26.89 Aligned_cols=32 Identities=16% Similarity=0.261 Sum_probs=25.1
Q ss_pred eEEEEEEecCCeEEE---eCCCCcEEEEEecCCCC
Q 044591 205 GWLEISYVDDTMRIG---RDDKGNIFILERSSSAG 236 (241)
Q Consensus 205 g~l~~tYLDedlRI~---Rg~~G~~FV~~R~~~~~ 236 (241)
|-..++|+|+|-+.. =.++-.+|++.|.....
T Consensus 111 g~Y~Vl~~d~eYs~aiVgsPdr~ylWlLsRtP~~s 145 (174)
T COG3040 111 GDYWVLALDPEYSWAIVGSPDREYLWLLSRTPTLS 145 (174)
T ss_pred ccEEEEEECCCccEEEEeCCCcceEEEEecCCCCC
Confidence 666799999998865 34577899999987544
No 5
>PF12283 Protein_K: Bacteriophage protein K; InterPro: IPR020962 This family of proteins is found in the microviridae (isometric ssDNA phages) and are approximately 60 amino acids in length. The function of these proteins are unknown. In phi X174 site-directed mutagenesis of gene K produces small plaques on su- hosts. The mutant phage has an identical latent period, but a more reduced burst size than that of the wild-type phi X174. The reduced burst size in the gene K mutant suggests that the gene K protein, although not essential, has a role in increasing infectivity by increasing the burst size three to six fold [].
Probab=37.37 E-value=67 Score=21.57 Aligned_cols=36 Identities=36% Similarity=0.501 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHhhcCC-CCCCCChhhHHHHHHHHHHHHhc
Q 044591 45 LTQIKTELLQVVQGINR-GIFGVPSAKKSEIEALVELLESQ 84 (241)
Q Consensus 45 ~~~lK~~LL~~ia~t~r-G~~asp~~~r~~I~~li~~LE~~ 84 (241)
..-+|++||-+....+| |..+ ..++|.....+||.+
T Consensus 5 ~tli~qellll~yelnrsgllv----ene~i~~~l~~le~l 41 (56)
T PF12283_consen 5 TTLIKQELLLLTYELNRSGLLV----ENEEIQSQLKQLEKL 41 (56)
T ss_pred HHHHHHHHHHHHHHhccccccc----ccHHHHHHHHHHHHH
Confidence 45689999999999997 6543 235899999999987
No 6
>PF05973 Gp49: Phage derived protein Gp49-like (DUF891); InterPro: IPR009241 This entry consists of several hypothetical viral and bacterial proteins some are annotated as addiction module killer proteins.
Probab=31.87 E-value=1.4e+02 Score=21.56 Aligned_cols=40 Identities=20% Similarity=0.287 Sum_probs=28.2
Q ss_pred hhhHHHHHHHHHHHHhcCCCCCCCCCCCCc--CcEEEEEEeec
Q 044591 68 SAKKSEIEALVELLESQNPTPHPTANLDKV--GGTWKLVYSTI 108 (241)
Q Consensus 68 ~~~r~~I~~li~~LE~~np~~~P~~~~~lL--~G~W~LvyTt~ 108 (241)
...+.+|...++.|+..+|...+. ....+ .|-|+|.....
T Consensus 8 ~~~~~~i~~~l~~l~~~G~~l~~~-~~k~l~~~~i~ElR~~~~ 49 (91)
T PF05973_consen 8 DKERAKILAQLERLEEHGPSLGEP-LFKHLKGDGIYELRVRGG 49 (91)
T ss_pred HHHHHHHHHHHHHHHhcCCccCCC-cccccCcCCeEEEEEeec
Confidence 346789999999998888543322 23344 69999998765
No 7
>PF07624 PSD2: Protein of unknown function (DUF1585); InterPro: IPR011478 This entry represents a conserved region at the C terminus of a family of cytochrome-like proteins found in bacteria such as Rhodopirellula baltica and Solibacter usitatus. These proteins also contain IPR013036 from INTERPRO, IPR013039 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=29.52 E-value=1.4e+02 Score=21.21 Aligned_cols=26 Identities=19% Similarity=0.346 Sum_probs=19.4
Q ss_pred hcCCCCCCCChhhHHHHHHHHHHHHhcCC
Q 044591 58 GINRGIFGVPSAKKSEIEALVELLESQNP 86 (241)
Q Consensus 58 ~t~rG~~asp~~~r~~I~~li~~LE~~np 86 (241)
.++|+.. . .|+..|++++++++..+-
T Consensus 35 AlGR~~~--~-~D~~~i~~i~~~~~~~~y 60 (76)
T PF07624_consen 35 ALGRPLE--F-SDRCEIDRIVEAFKANGY 60 (76)
T ss_pred HcCCCCC--c-chHHHHHHHHHHHHHcCC
Confidence 4466664 3 378999999999998764
No 8
>COG3908 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.17 E-value=83 Score=22.58 Aligned_cols=23 Identities=26% Similarity=0.449 Sum_probs=16.0
Q ss_pred EEEEEEecCCeEEEeCCCCcEEEEEe
Q 044591 206 WLEISYVDDTMRIGRDDKGNIFILER 231 (241)
Q Consensus 206 ~l~~tYLDedlRI~Rg~~G~~FV~~R 231 (241)
.-.|.|||.|++|.. -|+ ||++-
T Consensus 12 ea~iryldgdf~vv~--~Gs-fV~CA 34 (77)
T COG3908 12 EAVIRYLDGDFQVVS--PGS-FVLCA 34 (77)
T ss_pred eeEEEEecCceEEEc--CCc-EEEEE
Confidence 346899999999984 443 55443
No 9
>PF10047 DUF2281: Protein of unknown function (DUF2281); InterPro: IPR018739 This domain is found in putative uncharacterised proteins, though some proteins contaning this domain are described as a transcritional regulator of the Xre family.
Probab=29.10 E-value=1.3e+02 Score=20.90 Aligned_cols=48 Identities=25% Similarity=0.426 Sum_probs=32.3
Q ss_pred HHHHHHHHhhcCCCCCCCChhhHHHHHHHHHHHHhcCCCCCCCCCCCCcCcEEEEEE
Q 044591 49 KTELLQVVQGINRGIFGVPSAKKSEIEALVELLESQNPTPHPTANLDKVGGTWKLVY 105 (241)
Q Consensus 49 K~~LL~~ia~t~rG~~asp~~~r~~I~~li~~LE~~np~~~P~~~~~lL~G~W~Lvy 105 (241)
+++|++.+... |++...+|.+.|+=|...++.... ....+.|.|.=.+
T Consensus 2 ~~~l~~~i~~L-------P~~~~~Evldfi~fL~~k~~~~~~--~~~~~~~~~~G~l 49 (66)
T PF10047_consen 2 EEELLEKIQQL-------PEELQQEVLDFIEFLLQKYQQKES--KKKKLLGDWKGKL 49 (66)
T ss_pred hHHHHHHHHHC-------CHHHHHHHHHHHHHHHHhcccccc--cccccccccCCcc
Confidence 46777777765 667889999999999988843322 2344566665333
No 10
>PRK03760 hypothetical protein; Provisional
Probab=29.06 E-value=28 Score=27.23 Aligned_cols=15 Identities=13% Similarity=0.414 Sum_probs=12.5
Q ss_pred EEEEEEecCCeEEEe
Q 044591 206 WLEISYVDDTMRIGR 220 (241)
Q Consensus 206 ~l~~tYLDedlRI~R 220 (241)
-+|+.|||.+-||..
T Consensus 60 pLDiiFld~~g~Vv~ 74 (117)
T PRK03760 60 SIDVIFLDSNRRVVD 74 (117)
T ss_pred eeEEEEECCCCeEEE
Confidence 469999999988874
No 11
>KOG1513 consensus Nuclear helicase MOP-3/SNO (DEAD-box superfamily) [Transcription; Signal transduction mechanisms]
Probab=27.62 E-value=65 Score=33.84 Aligned_cols=39 Identities=26% Similarity=0.525 Sum_probs=31.6
Q ss_pred CCCChhhHHHHHHHHHHHHhhcCCCCCCCChhhHHHHHHHHHHHH
Q 044591 38 DSSDKRTLTQIKTELLQVVQGINRGIFGVPSAKKSEIEALVELLE 82 (241)
Q Consensus 38 ~~~~~~~~~~lK~~LL~~ia~t~rG~~asp~~~r~~I~~li~~LE 82 (241)
+..+++.+..+|++||+.+...+|-+- | +-+++||++|-
T Consensus 769 ~e~~~er~~~mkeeLL~kverLg~~LP--~----NTLDqLIdelG 807 (1300)
T KOG1513|consen 769 DEEEAERVERMKEELLDKVERLGRELP--P----NTLDQLIDELG 807 (1300)
T ss_pred cHHHHHHHHHHHHHHHHHHHHhcccCC--c----chHHHHHHHhC
Confidence 677778899999999999999987763 2 35788888874
No 12
>PF03076 GP3: Equine arteritis virus GP3; InterPro: IPR004310 This entry contains proteins encoded by ORF3 of Equine arteritis virus. They are possible envelope glcoproteins.
Probab=22.99 E-value=60 Score=25.92 Aligned_cols=17 Identities=29% Similarity=0.593 Sum_probs=14.5
Q ss_pred CceEEEEEEecCCeEEE
Q 044591 203 PDGWLEISYVDDTMRIG 219 (241)
Q Consensus 203 p~g~l~~tYLDedlRI~ 219 (241)
.-|.+...|+|||||.-
T Consensus 122 glg~~sfsfidedlrlh 138 (160)
T PF03076_consen 122 GLGQLSFSFIDEDLRLH 138 (160)
T ss_pred CcceEEEEEecccceee
Confidence 36788999999999974
No 13
>PHA00097 K protein K
Probab=22.72 E-value=1.7e+02 Score=19.65 Aligned_cols=36 Identities=33% Similarity=0.447 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHhhcCC-CCCCCChhhHHHHHHHHHHHHhc
Q 044591 45 LTQIKTELLQVVQGINR-GIFGVPSAKKSEIEALVELLESQ 84 (241)
Q Consensus 45 ~~~lK~~LL~~ia~t~r-G~~asp~~~r~~I~~li~~LE~~ 84 (241)
..-++++||-+....+| |..+ ..++|.....+||.+
T Consensus 5 ttli~qelllltyelnrsgllv----eneeiqs~lk~le~l 41 (56)
T PHA00097 5 TTLILQELLLLTYELNRSGLLV----ENEEIQSQLKKLEKL 41 (56)
T ss_pred hHHHHHHHHHHHHhhcccccee----ccHHHHHHHHHHHHH
Confidence 34678999999999997 6553 235899999999987
No 14
>TIGR02116 toxin_Txe_YoeB toxin-antitoxin system, toxin component, Txe/YoeB family. The Axe-Txe pair in Enterococcus faecium and the homologous YefM-YoeB pair in Escherichia coli have been shown to act as an antitoxin-toxin pair. This model describes the toxin component. Nearly every example found is next to an identifiable antitoxin, as indicated by matches to TIGR01552 and/or pfam02604.
Probab=22.59 E-value=99 Score=21.98 Aligned_cols=35 Identities=17% Similarity=0.262 Sum_probs=20.8
Q ss_pred hHHHHHHHHHHHHhcCCCCCCCCCCCCcCcEEEEEEe
Q 044591 70 KKSEIEALVELLESQNPTPHPTANLDKVGGTWKLVYS 106 (241)
Q Consensus 70 ~r~~I~~li~~LE~~np~~~P~~~~~lL~G~W~LvyT 106 (241)
.+..|.++|++|+ .||.+... ....|.|.|.-.|.
T Consensus 19 ~~~~i~~~i~~l~-~~P~~~~~-~~~~L~G~~~g~~r 53 (80)
T TIGR02116 19 LKKKINELIKDVR-RDPFKGKG-KPEPLKGDLSGYWS 53 (80)
T ss_pred HHHHHHHHHHHHH-cCCCCCCC-CcccCCCCCCCcEE
Confidence 5678999998886 45553112 33557776543333
Done!