Query psy6812
Match_columns 207
No_of_seqs 114 out of 408
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 19:03:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6812.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6812hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PHA02748 viral inexin-like pro 100.0 6.3E-60 1.4E-64 429.8 16.9 182 1-183 1-182 (360)
2 PF00876 Innexin: Innexin; In 100.0 8E-47 1.7E-51 341.6 12.9 149 20-176 1-157 (348)
3 PF05402 PqqD: Coenzyme PQQ sy 51.2 48 0.001 22.3 5.0 35 125-159 18-54 (68)
4 PF04088 Peroxin-13_N: Peroxin 38.3 23 0.0005 29.3 2.0 22 111-133 136-157 (158)
5 COG1981 Predicted membrane pro 34.8 32 0.00069 28.3 2.3 28 106-133 5-37 (149)
6 PF08038 Tom7: TOM7 family; I 29.5 30 0.00064 22.5 1.0 16 103-118 11-26 (42)
7 PF10524 NfI_DNAbd_pre-N: Nucl 29.0 33 0.00072 22.4 1.2 26 164-189 12-37 (44)
8 PF14880 COX14: Cytochrome oxi 28.0 30 0.00066 23.7 0.9 22 22-44 9-30 (59)
9 KOG4449|consensus 27.2 30 0.00066 23.6 0.8 16 103-118 18-33 (53)
10 PF06708 DUF1195: Protein of u 26.7 43 0.00094 27.5 1.7 53 106-170 35-101 (157)
11 PF01233 NMT: Myristoyl-CoA:pr 24.3 49 0.0011 27.6 1.7 30 5-34 37-66 (162)
No 1
>PHA02748 viral inexin-like protein; Provisional
Probab=100.00 E-value=6.3e-60 Score=429.82 Aligned_cols=182 Identities=35% Similarity=0.721 Sum_probs=171.2
Q ss_pred ChhhHHHHHhhhcccccccCcceeeeehhhHHHHHHHHHHHHhhccccCCCeeeeeCCCCchhhhhheeeeeeeeecCcc
Q psy6812 1 MLKLLGGLKEYLKWQEITTDSTIFRLHNLFTTALLLTCSLIITATQYVGSPIQCIVNGLPAHPINTYCWIMSTFTMPDAF 80 (207)
Q Consensus 1 Ml~~~~~L~~~l~~~~~~~Dd~vdRL~y~~Tv~iLl~~sllvt~kqy~G~PI~C~~~~~~~~y~~~YCwi~~Ty~v~~~~ 80 (207)
|++++++|++++|++++.+||++|||||++|++||++||+++|+|||||+||+||+|+.+++|+|+||||||||++|.+.
T Consensus 1 M~~~~~~l~~~lk~~~v~~Dd~vdrLhy~~Tv~iL~~~silvs~kQy~G~PI~C~~p~~~~~~~n~yCwi~~Ty~v~~~~ 80 (360)
T PHA02748 1 MLDLFSALRGLLKVQSVSIDNNFFRLHYKITVIILLAFSLLVTSRQYFGDPIDCDFPDYPNGSLNTYCYVQSTFLVERKV 80 (360)
T ss_pred ChhHHHHHHHhhCcceeeccchhheehHHHHHHHHHHHHHHHHHHHHCCCCeEeccCCCccchhhceeeeeeEEEeeccc
Confidence 88999999999999999999999999999999999999999999999999999999998899999999999999999988
Q ss_pred cccccccccCCCCCCCCCCccccceeeehhHHHHHHHHHHHHhhhchHHHHHhcCCcHHHHHHhhccCCCCHHHHHHHHH
Q psy6812 81 QRQVGVEVAHPGVANDFGDEDAKKYYSYYQWVCFVLFFQGIMCYTPKWIWDNFEGGLMKTLVMGLRIGICAEEEKCNKKK 160 (207)
Q Consensus 81 ~~~~~~~vp~pg~~~~~~~~~~~~~~~YYQWVpfiL~lQA~lFy~P~~iWk~~~gg~i~~l~~~~~~~~~~~e~r~~~~~ 160 (207)
+.+.++++|+||+++..+ ++++++++|||||||+|++||++||+|+++||.||||++++++.+++.+..++|+|+++++
T Consensus 81 ~~~~~~~~~~~~~~~~~~-~~~~~~i~YYQWVpfvL~lQA~lFYiP~~iWk~~egg~i~~l~~~~~~~~~~~~~~~~~i~ 159 (360)
T PHA02748 81 THTVNSTVPDPGVSGDTE-EDELRYYGYYQWVFITLFLQAVFFYIPHYIWKAWEGGKMKMLAVELASPVLSKDCIEKNTQ 159 (360)
T ss_pred ccccccccCCCCCCCCcc-ccceeEeceeeHHHHHHHHHHHHHHchHHHHHHhccCcHHHHHHhhccccCCHHHHHHHHH
Confidence 888888888998886544 3458899999999999999999999999999999999999999988888889999999999
Q ss_pred HHHHHHHHHHHHhhhhHhhhhhc
Q psy6812 161 AIIDYLLKHIKRTLKFGFQYREV 183 (207)
Q Consensus 161 ~l~~yl~~~l~~h~~ya~~y~~~ 183 (207)
.+++|+.++++.||.||++|+-.
T Consensus 160 ~l~~y~~~~l~~h~~y~~~Y~~~ 182 (360)
T PHA02748 160 PLVDYFFMNLHSHNAYAYKYFTC 182 (360)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHH
Confidence 99999999999999999998743
No 2
>PF00876 Innexin: Innexin; InterPro: IPR000990 The pannexin family combines invertebrate gap junction proteins and their vertebrate homologs. These proteins have been named innexins []. Gap junctions are composed of membrane proteins, which form a channel permeable for ions and small molecules connecting cytoplasm of adjacent cells. Although gap junctions provide similar functions in all multicellular organisms, until recently it was believed that vertebrates and invertebrates use unrelated proteins for this purpose. While the connexins family of gap junction proteins is well- characterised in vertebrates, no homologs have been found in invertebrates. In turn, gap junction molecules with no sequence homology to connexins have been identified in insects and nematodes. It has been suggested that these proteins are specific invertebrate gap junctions, and they were thus named innexins (invertebrate analog of connexins) []. As innexin homologs were recently identified in other taxonomic groups including vertebrates, indicating their ubiquitous distribution in the animal kingdom, they were called pannexins (from the Latin pan-all, throughout, and nexus-connection, bond) [, , ]. Genomes of vertebrates carry probably a conserved set of 3 pannexin paralogs (PANX1, PANX2 and PANX3). Invertebrate genomes may contain more than a dozen pannexin (innexin) genes. Vinnexins, viral homologs of pannexins/innexins, were identified in Polydnaviruses that occur in obligate symbiotic associations with parasitoid wasps. It was suggested that virally encoded vinnexin proteins may function to alter gap junction proteins in infected host cells, possibly modifying cell-cell communication during encapsulation responses in parasitized insects [, ]. Structurally pannexins are simillar to connexins. Both types of protein consist of a cytoplasmic N-terminal domain, followed by four transmembrane segments that delimit two extracellular and one cytoplasmic loops; the C- terminal domain is cytoplasmic.; GO: 0005921 gap junction
Probab=100.00 E-value=8e-47 Score=341.65 Aligned_cols=149 Identities=35% Similarity=0.646 Sum_probs=129.8
Q ss_pred CcceeeeehhhHHHHHHHHHHHHhhccccCCCeeeeeC-CC---CchhhhhheeeeeeeeecCcccccccccccCCCCCC
Q psy6812 20 DSTIFRLHNLFTTALLLTCSLIITATQYVGSPIQCIVN-GL---PAHPINTYCWIMSTFTMPDAFQRQVGVEVAHPGVAN 95 (207)
Q Consensus 20 Dd~vdRL~y~~Tv~iLl~~sllvt~kqy~G~PI~C~~~-~~---~~~y~~~YCwi~~Ty~v~~~~~~~~~~~vp~pg~~~ 95 (207)
||++|||||++||++|+++|+++++|||||+||+||+| ++ |++|+|+||||+|||++|. ++++| |.++
T Consensus 1 Dd~~dRl~~~~Tv~lL~~~a~lv~~kqy~G~PI~C~~P~~f~~~~~~y~~~yCwi~~Ty~v~~------~~~~~--~~~~ 72 (348)
T PF00876_consen 1 DDFVDRLNYKYTVILLLFFALLVSAKQYFGSPIQCWVPAEFTGSWEEYANSYCWIQNTYFVPM------NEDVP--GTDP 72 (348)
T ss_pred CChhhhhHHHHHHHHHHHHHHHHHHHHhcCCCceeeCCCCCCcchHhHHHhhhcccceEeccc------cccCC--CCcc
Confidence 89999999999999999999999999999999999985 55 5899999999999999986 44444 3333
Q ss_pred CCCCccccceeeehhHHHHHHHHHHHHhhhchHHHH---HhcCCcHHHHHHhhccCC-CCHHHHHHHHHHHHHHHHHHHH
Q psy6812 96 DFGDEDAKKYYSYYQWVCFVLFFQGIMCYTPKWIWD---NFEGGLMKTLVMGLRIGI-CAEEEKCNKKKAIIDYLLKHIK 171 (207)
Q Consensus 96 ~~~~~~~~~~~~YYQWVpfiL~lQA~lFy~P~~iWk---~~~gg~i~~l~~~~~~~~-~~~e~r~~~~~~l~~yl~~~l~ 171 (207)
..++.+++++++|||||||+|++||+|||+|+++|| .++|++++++++.++... .++|+|+++++.+++|+.+.++
T Consensus 73 ~~~~~~~~~~i~YYQWVPfiL~lQA~lfylP~~iW~~~~~~~g~~l~~i~~~~~~~~~~~~~~r~~~~~~l~~~l~~~l~ 152 (348)
T PF00876_consen 73 EWRREREKREISYYQWVPFILLLQAILFYLPHLIWRLLNKWSGIDLKSIVEEADKAQNSDPEERKKKIEKLARYLERYLK 152 (348)
T ss_pred ccccccccceEeeehhhHHHHHHHHHHHHhHHHHHHHhhhccCccHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHH
Confidence 322233478999999999999999999999999999 568999999999887554 4459999999999999999999
Q ss_pred Hhhhh
Q psy6812 172 RTLKF 176 (207)
Q Consensus 172 ~h~~y 176 (207)
.|+.|
T Consensus 153 ~~~~~ 157 (348)
T PF00876_consen 153 QHRRY 157 (348)
T ss_pred hhhhh
Confidence 99988
No 3
>PF05402 PqqD: Coenzyme PQQ synthesis protein D (PqqD); InterPro: IPR008792 This family contains several bacterial coenzyme PQQ synthesis protein D (PqqD) sequences. This protein is required for coenzyme pyrrolo-quinoline-quinone (PQQ) biosynthesis.; PDB: 3G2B_A.
Probab=51.17 E-value=48 Score=22.32 Aligned_cols=35 Identities=23% Similarity=0.359 Sum_probs=19.0
Q ss_pred hchHHHHHhcCC-cHHHHHHhhccC-CCCHHHHHHHH
Q psy6812 125 TPKWIWDNFEGG-LMKTLVMGLRIG-ICAEEEKCNKK 159 (207)
Q Consensus 125 ~P~~iWk~~~gg-~i~~l~~~~~~~-~~~~e~r~~~~ 159 (207)
.=..+|+.++|+ .+..|++.+... ..++++.++.+
T Consensus 18 ~a~~Iw~~~~g~~t~~ei~~~l~~~y~~~~~~~~~dv 54 (68)
T PF05402_consen 18 TAAFIWELLDGPRTVEEIVDALAEEYDVDPEEAEEDV 54 (68)
T ss_dssp HHHHHHHH--SSS-HHHHHHHHHHHTT--HHHHHHHH
T ss_pred HHHHHHHHccCCCCHHHHHHHHHHHcCCCHHHHHHHH
Confidence 446899999886 688998876433 33344333333
No 4
>PF04088 Peroxin-13_N: Peroxin 13, N-terminal region; InterPro: IPR007223 Peroxin-13 is a component of the peroxisomal translocation machinery with Peroxin-14 and Peroxin-17. Both termini of Peroxin-13 are oriented to the cytosol. It is required for peroxisomal association of peroxin-14 []. The proteins also contain an SH3 domain (IPR001452 from INTERPRO).; GO: 0016560 protein import into peroxisome matrix, docking, 0005777 peroxisome, 0016021 integral to membrane
Probab=38.33 E-value=23 Score=29.28 Aligned_cols=22 Identities=23% Similarity=0.395 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHhhhchHHHHHh
Q psy6812 111 WVCFVLFFQGIMCYTPKWIWDNF 133 (207)
Q Consensus 111 WVpfiL~lQA~lFy~P~~iWk~~ 133 (207)
|-|+++++-| .|=+|+++||.+
T Consensus 136 ~~PlllF~~~-v~G~PyLi~Kli 157 (158)
T PF04088_consen 136 SKPLLLFLAA-VFGLPYLIWKLI 157 (158)
T ss_pred cccHHHHHHH-HHHHHHHHHHHh
Confidence 6688888766 899999999975
No 5
>COG1981 Predicted membrane protein [Function unknown]
Probab=34.76 E-value=32 Score=28.27 Aligned_cols=28 Identities=25% Similarity=0.470 Sum_probs=21.5
Q ss_pred eeehhHHH-----HHHHHHHHHhhhchHHHHHh
Q psy6812 106 YSYYQWVC-----FVLFFQGIMCYTPKWIWDNF 133 (207)
Q Consensus 106 ~~YYQWVp-----fiL~lQA~lFy~P~~iWk~~ 133 (207)
..||-|+= ++..--|.+||+|+++----
T Consensus 5 ~~~y~W~KafHiiavisWmAglfYLPRlFVyha 37 (149)
T COG1981 5 ADYYLWVKAFHLIAVISWMAGLFYLPRLFVYHA 37 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36888863 55666799999999988754
No 6
>PF08038 Tom7: TOM7 family; InterPro: IPR012621 This family consists of TOM7 family of mitochondrial import receptors. TOM7 forms part of the translocase of the outer mitochondrial membrane (TOM) complex and it appears to function as a modulator of the dynamics of the mitochondrial protein transport machinery by promoting the dissociation of subunits of the outer membrane translocase [].; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane
Probab=29.51 E-value=30 Score=22.51 Aligned_cols=16 Identities=19% Similarity=0.446 Sum_probs=13.2
Q ss_pred cceeeehhHHHHHHHH
Q psy6812 103 KKYYSYYQWVCFVLFF 118 (207)
Q Consensus 103 ~~~~~YYQWVpfiL~l 118 (207)
.+.+-.|-|+||||.+
T Consensus 11 ~k~~~HyGfIP~Ilyl 26 (42)
T PF08038_consen 11 AKTVFHYGFIPLILYL 26 (42)
T ss_pred heeeEEeeehHHHHHH
Confidence 4568899999999865
No 7
>PF10524 NfI_DNAbd_pre-N: Nuclear factor I protein pre-N-terminus; InterPro: IPR019548 Nuclear factor I (NF-I) or CCAAT box-binding transcription factor (CTF) [, ] (also known as TGGCA-binding proteins) are a family of vertebrate nuclear proteins which recognise and bind, as dimers, the palindromic DNA sequence 5'-TGGCANNNTGCCA-3'. CTF/NF-I binding sites are present in viral and cellular promoters and in the origin of DNA replication of Human adenovirus 2 (HAdV-2). The CTF/NF-I proteins were first identified as nuclear factor I, a collection of proteins that activate the replication of several Adenovirus serotypes (together with NF-II and NF-III) []. The family of proteins was also identified as the CTF transcription factors, before the NFI and CTF families were found to be identical []. The CTF/NF-I proteins are individually capable of activating transcription and DNA replication. In a given species, there are a large number of different CTF/NF-I proteins, generated both by alternative splicing and by the occurrence of four different genes. CTF/NF-1 proteins contain 400 to 600 amino acids. The N-terminal 200 amino-acid sequence, almost perfectly conserved in all species and genes sequenced, mediates site-specific DNA recognition, protein dimerisation and Adenovirus DNA replication. The C-terminal 100 amino acids contain the transcriptional activation domain. This activation domain is the target of gene expression regulatory pathways elicited by growth factors and it interacts with basal transcription factors and with histone H3 []. This entry represents the N terminus, of which 200 residues contain the DNA-binding and dimerisation domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerisation-ability and the stimulation of adenovirus DNA replication [].
Probab=28.97 E-value=33 Score=22.43 Aligned_cols=26 Identities=19% Similarity=0.369 Sum_probs=20.3
Q ss_pred HHHHHHHHHhhhhHhhhhhcccccce
Q psy6812 164 DYLLKHIKRTLKFGFQYREVAAHKRK 189 (207)
Q Consensus 164 ~yl~~~l~~h~~ya~~y~~~~~~~~~ 189 (207)
-+++..|-.=+.+||..||..|-|||
T Consensus 12 pFiEalLP~Vk~fay~WfnLQA~KRk 37 (44)
T PF10524_consen 12 PFIEALLPYVKAFAYTWFNLQAAKRK 37 (44)
T ss_pred hHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 34555566667799999999999997
No 8
>PF14880 COX14: Cytochrome oxidase c assembly
Probab=27.99 E-value=30 Score=23.67 Aligned_cols=22 Identities=14% Similarity=-0.138 Sum_probs=17.0
Q ss_pred ceeeeehhhHHHHHHHHHHHHhh
Q psy6812 22 TIFRLHNLFTTALLLTCSLIITA 44 (207)
Q Consensus 22 ~vdRL~y~~Tv~iLl~~sllvt~ 44 (207)
..|++| +.||.-|+.+++..+.
T Consensus 9 ~~D~~H-R~tV~~Lig~T~~~g~ 30 (59)
T PF14880_consen 9 LADIAH-RTTVLGLIGFTVYGGG 30 (59)
T ss_pred HHHHHH-HHHHHHHHHHHHHHHH
Confidence 456776 8899999999887653
No 9
>KOG4449|consensus
Probab=27.19 E-value=30 Score=23.57 Aligned_cols=16 Identities=31% Similarity=0.849 Sum_probs=12.5
Q ss_pred cceeeehhHHHHHHHH
Q psy6812 103 KKYYSYYQWVCFVLFF 118 (207)
Q Consensus 103 ~~~~~YYQWVpfiL~l 118 (207)
.+...-|-||||||-+
T Consensus 18 s~~vih~G~IP~VlyL 33 (53)
T KOG4449|consen 18 SKFVIHYGWIPLVLYL 33 (53)
T ss_pred hheeeeechhhHHHHh
Confidence 3467789999999855
No 10
>PF06708 DUF1195: Protein of unknown function (DUF1195); InterPro: IPR010608 This family consists of several plant specific hypothetical proteins of around 160 residues in length. The function of this family is unknown.
Probab=26.69 E-value=43 Score=27.45 Aligned_cols=53 Identities=17% Similarity=0.363 Sum_probs=35.2
Q ss_pred eeehhHHHHHHHHHHHHhhhchHHHHHhc--------CCcHHHHHHhhccCCCC------HHHHHHHHHHHHHHHHHHH
Q psy6812 106 YSYYQWVCFVLFFQGIMCYTPKWIWDNFE--------GGLMKTLVMGLRIGICA------EEEKCNKKKAIIDYLLKHI 170 (207)
Q Consensus 106 ~~YYQWVpfiL~lQA~lFy~P~~iWk~~~--------gg~i~~l~~~~~~~~~~------~e~r~~~~~~l~~yl~~~l 170 (207)
-.|=-|+--.+++.| +|.++. .|++..+-.+++.++.+ -|+|+| ++++|++..
T Consensus 35 grYK~WaLaAIlLLA--------fWSM~tgsvtLrwS~g~l~~~~~dl~~~~~dDlDvLEmEeREK----vVr~MWDvY 101 (157)
T PF06708_consen 35 GRYKFWALAAILLLA--------FWSMFTGSVTLRWSAGNLNSVSDDLDFPIHDDLDVLEMEEREK----VVRHMWDVY 101 (157)
T ss_pred chhHHHHHHHHHHHH--------HHHHhheeeEEEeccCcccccccccCCcccccccHHHHHHHHH----HHHHHHHHh
Confidence 468889887777777 477754 47777777777766543 355554 566677643
No 11
>PF01233 NMT: Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain; InterPro: IPR022676 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the N-terminal region. ; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 4A33_A 3H5Z_A 4A2Z_A 2WSA_A ....
Probab=24.34 E-value=49 Score=27.61 Aligned_cols=30 Identities=23% Similarity=0.213 Sum_probs=21.3
Q ss_pred HHHHHhhhcccccccCcceeeeehhhHHHH
Q psy6812 5 LGGLKEYLKWQEITTDSTIFRLHNLFTTAL 34 (207)
Q Consensus 5 ~~~L~~~l~~~~~~~Dd~vdRL~y~~Tv~i 34 (207)
+..+-.+|.-+.+.+||..||++|.--.+-
T Consensus 37 l~ely~lL~~nYVEDdd~~fRf~YS~efL~ 66 (162)
T PF01233_consen 37 LKELYELLNENYVEDDDNMFRFDYSKEFLK 66 (162)
T ss_dssp HHHHHHHHHHHSSBTTTSSEEE---HHHHH
T ss_pred HHHHHHHHHhcCccCCcceEEeeCCHHHHh
Confidence 566777887788999999999999865443
Done!