Query psy7201
Match_columns 65
No_of_seqs 105 out of 290
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 20:46:44 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7201.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7201hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd06901 lectin_VIP36_VIPL VIP3 99.8 6.3E-21 1.4E-25 135.0 5.9 57 2-58 175-231 (248)
2 PF03388 Lectin_leg-like: Legu 99.8 3.6E-19 7.8E-24 123.6 4.0 48 4-51 182-229 (229)
3 cd06902 lectin_ERGIC-53_ERGL E 99.7 2.8E-18 6E-23 119.9 5.0 45 6-50 180-224 (225)
4 cd06903 lectin_EMP46_EMP47 EMP 99.7 3E-17 6.4E-22 114.3 4.9 45 2-50 169-214 (215)
5 KOG3839|consensus 99.6 3.9E-17 8.5E-22 121.5 1.4 57 1-57 226-282 (351)
6 KOG3838|consensus 99.6 6.2E-16 1.4E-20 118.1 5.5 56 7-63 212-267 (497)
7 cd07308 lectin_leg-like legume 99.6 4.7E-15 1E-19 100.9 5.3 46 5-50 173-218 (218)
8 cd01951 lectin_L-type legume l 98.0 1E-05 2.2E-10 55.0 4.6 39 9-47 180-221 (223)
9 cd06899 lectin_legume_LecRK_Ar 97.6 6.1E-05 1.3E-09 52.4 3.7 28 20-47 205-232 (236)
10 PF00139 Lectin_legB: Legume l 96.8 0.00084 1.8E-08 46.3 2.2 27 21-47 209-235 (236)
11 cd06900 lectin_VcfQ VcfQ bacte 92.7 0.13 2.8E-06 37.7 3.1 28 20-47 226-253 (255)
12 PF09116 gp45-slide_C: gp45 sl 86.0 0.63 1.4E-05 30.2 2.0 28 4-31 59-87 (112)
13 PHA02545 45 sliding clamp; Pro 82.9 1.3 2.8E-05 31.9 2.7 30 4-33 171-201 (223)
14 PF07013 DUF1314: Protein of u 57.4 8.5 0.00018 27.0 1.9 25 12-36 6-30 (177)
15 PF09175 DUF1944: Domain of un 53.6 15 0.00031 25.3 2.6 44 3-47 38-93 (165)
16 KOG2540|consensus 47.1 26 0.00057 25.9 3.2 49 3-53 141-190 (269)
17 PHA02290 hypothetical protein 34.9 20 0.00044 25.8 1.0 20 21-42 92-111 (234)
18 PF02408 CUB_2: CUB-like domai 32.0 71 0.0015 19.8 3.1 23 11-35 42-66 (120)
19 PF02465 FliD_N: Flagellar hoo 29.0 66 0.0014 19.1 2.5 26 17-42 64-89 (99)
20 PHA03371 circ protein; Provisi 28.9 54 0.0012 24.1 2.4 28 13-40 28-55 (240)
21 PF05015 Plasmid_killer: Plasm 27.7 31 0.00067 20.8 0.8 15 2-18 67-81 (93)
22 KOG2107|consensus 26.6 74 0.0016 22.4 2.6 29 6-34 111-141 (179)
23 COG3549 HigB Plasmid maintenan 25.8 34 0.00073 21.8 0.8 12 7-18 70-81 (94)
24 PF07495 Y_Y_Y: Y_Y_Y domain; 24.8 70 0.0015 17.1 1.9 34 8-41 19-57 (66)
25 PF07472 PA-IIL: Fucose-bindin 22.4 75 0.0016 20.6 1.9 16 19-34 1-16 (107)
No 1
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER). Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=99.83 E-value=6.3e-21 Score=134.95 Aligned_cols=57 Identities=60% Similarity=1.120 Sum_probs=53.8
Q ss_pred cccccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcCC
Q psy7201 2 TDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKLS 58 (65)
Q Consensus 2 ~di~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~~ 58 (65)
+|++++++|+.||++++|.||+++||||||+||+++|+|||++|++|++..+.++++
T Consensus 175 vd~~~~~~w~~Cf~~~~v~LP~~~yfGiSA~Tg~~sd~hdIlsv~~~~l~~~~~~~~ 231 (248)
T cd06901 175 TDIDGKNEWKECFDVTGVRLPTGYYFGASAATGDLSDNHDIISMKLYELDVEETPEE 231 (248)
T ss_pred EecCCCCceeeeEEeCCeecCCCCEEEEEecCCCCCCcEEEEEEEEecCcccccccc
Confidence 678899999999999999999999999999999999999999999999999988743
No 2
>PF03388 Lectin_leg-like: Legume-like lectin family; InterPro: IPR005052 Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins []. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences[]. ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin []. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway [,]. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold [, , , ]. ; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=99.76 E-value=3.6e-19 Score=123.60 Aligned_cols=48 Identities=60% Similarity=1.217 Sum_probs=45.5
Q ss_pred cccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeec
Q psy7201 4 FENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELE 51 (65)
Q Consensus 4 i~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~ 51 (65)
.++.++|+.||++++|.||.++||||||+||+++|||||+||++|+|.
T Consensus 182 ~~~~~~~~~Cf~~~~v~LP~~~yfGvSA~Tg~~~d~hdi~s~~~~~l~ 229 (229)
T PF03388_consen 182 LKNQDDWELCFTTDGVDLPEGYYFGVSAATGELSDNHDILSVKTYSLD 229 (229)
T ss_dssp CSECCTTEEEEEESTEEGGSSBEEEEEEEESSSGGEEEEEEEEEEEEE
T ss_pred ccCCcCCcEEEEcCCeecCCCCEEEEEecCCCCCCcEEEEEEEEeecC
Confidence 567899999999999999999999999999999999999999999974
No 3
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) which may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=99.73 E-value=2.8e-18 Score=119.85 Aligned_cols=45 Identities=51% Similarity=0.851 Sum_probs=42.7
Q ss_pred cccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201 6 NKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50 (65)
Q Consensus 6 ~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l 50 (65)
+.++|+.||++++|.||.++||||||+||+++|+|||+||++|++
T Consensus 180 ~~~~~~~Cf~~~~v~LP~~~yfGiSA~Tg~l~d~hDIls~~~~sl 224 (225)
T cd06902 180 NKDDYELCTRVENMVLPPNGYFGVSAATGGLADDHDVLSFLTFSL 224 (225)
T ss_pred CCCcccEEEecCCeeCCCCCEEEEEecCCCCCCcEeEEEEEEecc
Confidence 357899999999999999999999999999999999999999976
No 4
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=99.69 E-value=3e-17 Score=114.29 Aligned_cols=45 Identities=31% Similarity=0.735 Sum_probs=41.7
Q ss_pred cccccccccccceeecceecC-CCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201 2 TDFENKAAWKECFKVSGVKLP-TGYYFGVSAATGDLSDNHDVLGIRTYEL 50 (65)
Q Consensus 2 ~di~~~~~w~~Cf~~~~v~LP-~~~yfGiSAaTG~lsDnhDIis~~~~~l 50 (65)
||+|+ +.||++++|.|| .+|||||||+||+++|+|||++|++|+.
T Consensus 169 v~vd~----~~Cf~~~~v~lP~~~y~fGiSAaTg~~~d~hdIl~~~~~~~ 214 (215)
T cd06903 169 VQVDN----RLCFQTDKVQLPQGGYRFGITAANADNPESFEILKLKVWNG 214 (215)
T ss_pred EEECC----CEEEecCCeecCCCCCEEEEEEcCCCCCCcEEEEEEEEecC
Confidence 67776 899999999999 8999999999999999999999999974
No 5
>KOG3839|consensus
Probab=99.64 E-value=3.9e-17 Score=121.50 Aligned_cols=57 Identities=46% Similarity=0.876 Sum_probs=53.6
Q ss_pred CcccccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcC
Q psy7201 1 STDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKL 57 (65)
Q Consensus 1 ~~di~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~ 57 (65)
++|+.+.++|..|+.+.+|+||.|||||+||+||+++|+|||+|++.|+|...+++.
T Consensus 226 ~~dl~~~~~~~~c~~~n~v~lp~g~~fg~SasTGdlSd~HdivS~kl~~L~d~~~~~ 282 (351)
T KOG3839|consen 226 SIDLEGPNEWIDCFSLNNVELPLGYFFGVSASTGDLSDSHDIVSLKLFELTDIHVEG 282 (351)
T ss_pred EEecCCCceeeeeeeecceecccceEEeeeeccCccchhhHHHHhhhhhhhcccCCC
Confidence 478999999999999999999999999999999999999999999999999886654
No 6
>KOG3838|consensus
Probab=99.61 E-value=6.2e-16 Score=118.05 Aligned_cols=56 Identities=45% Similarity=0.800 Sum_probs=51.1
Q ss_pred ccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcCCCCCCC
Q psy7201 7 KAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKLSKGPFI 63 (65)
Q Consensus 7 ~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~~~~~~~ 63 (65)
.+.|+.|.+++++.||.+.|||||||||+|+|+|||+|+.||+|..|. ++.++||.
T Consensus 212 ~d~yE~C~rve~~~lp~nGyFGvSAATGgLADDHDVl~FltfsL~e~~-qeppqP~~ 267 (497)
T KOG3838|consen 212 SDDYEFCVRVENLLLPPNGYFGVSAATGGLADDHDVLSFLTFSLSEPG-QEPPQPFL 267 (497)
T ss_pred CCCcceeEeccceeccCCCeeeeeecccccccccceeeeEEeeecCCC-CCCCCchh
Confidence 478999999999999999999999999999999999999999998554 55888875
No 7
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins. This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex. These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein. Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=99.56 E-value=4.7e-15 Score=100.93 Aligned_cols=46 Identities=61% Similarity=1.059 Sum_probs=42.8
Q ss_pred ccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201 5 ENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50 (65)
Q Consensus 5 ~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l 50 (65)
.+.++|+.||+++.+.||.++||||||+||+++|+|||++|++|++
T Consensus 173 ~~~~~~~~c~~~~~~~l~~~~y~G~sA~tg~~~d~~dIls~~~~~~ 218 (218)
T cd07308 173 SEGNNWKECFTVEDVILPSQGYFGFSAQTGDLSDNHDILSVHTYEL 218 (218)
T ss_pred CCCCCccEEEEcCCcccCCCCEEEEEeccCCCcCcEEEEEEEeeeC
Confidence 3467999999999999999999999999999999999999999974
No 8
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=98.01 E-value=1e-05 Score=54.97 Aligned_cols=39 Identities=31% Similarity=0.365 Sum_probs=32.3
Q ss_pred ccccceeeccee---cCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201 9 AWKECFKVSGVK---LPTGYYFGVSAATGDLSDNHDVLGIRT 47 (65)
Q Consensus 9 ~w~~Cf~~~~v~---LP~~~yfGiSAaTG~lsDnhDIis~~~ 47 (65)
.|..|..+.... +|...|+||||+||+.++.|+|+++++
T Consensus 180 ~~~~~l~~~~~l~~~~~~~~yvGFTAsTG~~~~~h~V~~wsf 221 (223)
T cd01951 180 LTSLDITIPVDLIQLGPTKAYFGFTASTGGLTNLHDILNWSF 221 (223)
T ss_pred cccccEEEeeeecccCCCcEEEEEEcccCCCcceeEEEEEEe
Confidence 477899874322 488999999999999999999999865
No 9
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=97.64 E-value=6.1e-05 Score=52.40 Aligned_cols=28 Identities=39% Similarity=0.536 Sum_probs=25.3
Q ss_pred ecCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201 20 KLPTGYYFGVSAATGDLSDNHDVLGIRT 47 (65)
Q Consensus 20 ~LP~~~yfGiSAaTG~lsDnhDIis~~~ 47 (65)
.||+..|+|+||+||++.+.|+|++...
T Consensus 205 ~l~~~~~vGFSasTG~~~~~h~i~sWsF 232 (236)
T cd06899 205 VLPEEVYVGFSASTGLLTELHYILSWSF 232 (236)
T ss_pred hCCCceEEEEEeEcCCCcceEEEEEEEE
Confidence 4788899999999999999999999754
No 10
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=96.80 E-value=0.00084 Score=46.35 Aligned_cols=27 Identities=41% Similarity=0.621 Sum_probs=24.3
Q ss_pred cCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201 21 LPTGYYFGVSAATGDLSDNHDVLGIRT 47 (65)
Q Consensus 21 LP~~~yfGiSAaTG~lsDnhDIis~~~ 47 (65)
||...|+||||+||.....|+|++...
T Consensus 209 l~~~v~vGFsasTG~~~~~h~I~sW~F 235 (236)
T PF00139_consen 209 LPEQVYVGFSASTGGSYQTHDILSWSF 235 (236)
T ss_dssp SCSEEEEEEEEEESSSSEEEEEEEEEE
T ss_pred cCCCcEEEEEeecCCCcceEEEEEEEe
Confidence 677789999999999999999999864
No 11
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein. VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly. VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=92.69 E-value=0.13 Score=37.72 Aligned_cols=28 Identities=18% Similarity=0.605 Sum_probs=24.9
Q ss_pred ecCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201 20 KLPTGYYFGVSAATGDLSDNHDVLGIRT 47 (65)
Q Consensus 20 ~LP~~~yfGiSAaTG~lsDnhDIis~~~ 47 (65)
.+|+..+|+++++||....+|||=.+++
T Consensus 226 avP~~f~lS~TgSTGgstN~HEIdnf~V 253 (255)
T cd06900 226 AIPENFYLSFTGSTGGSTNTHEIDNFQV 253 (255)
T ss_pred CCCccEEEEEEecCCCcccceeecceEe
Confidence 5888999999999999999999977664
No 12
>PF09116 gp45-slide_C: gp45 sliding clamp, C terminal; InterPro: IPR015200 This domain is essential for the interaction of the gp45 sliding clamp with the corresponding polymerase. It adopts a DNA clamp fold, consisting of two alpha helices and two beta sheets - the fold is duplicated and has internal pseudo two-fold symmetry []. ; PDB: 1B8H_A 1B77_B 3U61_F 3U60_G 3U5Z_R 1CZD_B.
Probab=86.01 E-value=0.63 Score=30.22 Aligned_cols=28 Identities=25% Similarity=0.593 Sum_probs=16.8
Q ss_pred cccccccccceeecceecCCCCe-EEEee
Q psy7201 4 FENKAAWKECFKVSGVKLPTGYY-FGVSA 31 (65)
Q Consensus 4 i~~~~~w~~Cf~~~~v~LP~~~y-fGiSA 31 (65)
-++..++..||+++|++|..|-| +-||+
T Consensus 59 ~~~~~~F~f~~k~eNmk~~~gdY~V~is~ 87 (112)
T PF09116_consen 59 YDGDNNFCFCFKMENMKMLPGDYKVKISK 87 (112)
T ss_dssp --SS--EEEEEEGGG----SS-EEEEEEE
T ss_pred cCCCccEEEEEEeceeEeecCCeEEEEEh
Confidence 45566799999999999877777 99998
No 13
>PHA02545 45 sliding clamp; Provisional
Probab=82.92 E-value=1.3 Score=31.91 Aligned_cols=30 Identities=20% Similarity=0.427 Sum_probs=25.3
Q ss_pred cccccccccceeecceecCCCCe-EEEeecC
Q psy7201 4 FENKAAWKECFKVSGVKLPTGYY-FGVSAAT 33 (65)
Q Consensus 4 i~~~~~w~~Cf~~~~v~LP~~~y-fGiSAaT 33 (65)
.++..++..|+++++++|+.|-| +-+|++-
T Consensus 171 ~~g~~~F~f~~~~~Nmk~~~gdY~v~is~~g 201 (223)
T PHA02545 171 YDGDNEFNFIIKIENMKLVPGDYKVSISKKG 201 (223)
T ss_pred ccCCcceEEEEEeceeEecCCCeEEEEEccc
Confidence 45566899999999999988877 9998874
No 14
>PF07013 DUF1314: Protein of unknown function (DUF1314); InterPro: IPR010741 This family consists of several alphaherpesvirus proteins of around 200 residues in length. They belong to the varicellovirus ORF2 family and their function is unknown.
Probab=57.36 E-value=8.5 Score=27.02 Aligned_cols=25 Identities=28% Similarity=0.369 Sum_probs=21.5
Q ss_pred cceeecceecCCCCeEEEeecCCCC
Q psy7201 12 ECFKVSGVKLPTGYYFGVSAATGDL 36 (65)
Q Consensus 12 ~Cf~~~~v~LP~~~yfGiSAaTG~l 36 (65)
.|..=++|.||.|.+++|+|-+|..
T Consensus 6 ~~LaGR~iDLPgG~el~I~~~~g~~ 30 (177)
T PF07013_consen 6 RSLAGRTIDLPGGDELRISANTGRP 30 (177)
T ss_pred hhhcCcceecCCCCeEEEecCCCCc
Confidence 3556678999999999999999986
No 15
>PF09175 DUF1944: Domain of unknown function (DUF1944); InterPro: IPR015258 Vitellinogen precursors provide the major egg yolk proteins that are a source of nutrients during early development of oviparous vertebrates and invertebrates. Vitellinogen precursors are multi-domain apolipoproteins that are cleaved into distinct yolk proteins. Different vitellinogen precursors exist, which are composed of variable combinations of yolk protein components; however, the cleavage sites are conserved [, ]. In vertebrates, a complete vitellinogen is composed of an N-terminal signal peptide for export, followed by four regions that can be cleaved into yolk proteins: lipovitellin-1, phosvitin, lipovitellin-2, and a von Willebrand factor type D domain (YGP40). Vitellinogens are post-translationally glycosylated and phosphorylated in the endoplasmic reticulum and Golgi complex of hepatocytes, before being secreted into the circulatory system to be taken up by oocytes. In the ovary, vitellinogens bind to specific Vtgr receptors on oocyte membranes to become internalised by endocytosis, where they are cleaved into yolk proteins by cathepsin D. YGP40 is released into the yolk plasma before or during compartmentation of lipovitellin-phosvitin complex into the yolk granule. The different yolk proteins have distinct roles. Phosvitins are important in sequestering calcium, iron and other cations for the developing embryo. Phosvitins are one of the most phosphorylated (10%) proteins in nature, the high concentration of phosphate groups providing efficient metal-binding sites in clusters [, ]. Lipovitellins are involved in lipid and metal storage, and contain a heterogeneous mixture of about 16% (w/w) noncovalently bound lipid, most being phospholipid. Lipovitellin-1 contains two chains, LV1N and LV1C [, ]. This entry represents the beta-sheet shell domain found in vitellinogen, which generally corresponds to the lipovitellin-2 peptide product. This domain consists of several large open beta-sheets []. It is often found C-terminal to IPR001747 from INTERPRO and IPR015255 from INTERPRO. ; GO: 0005319 lipid transporter activity, 0006869 lipid transport; PDB: 1LSH_B.
Probab=53.56 E-value=15 Score=25.33 Aligned_cols=44 Identities=20% Similarity=0.442 Sum_probs=26.0
Q ss_pred ccccccccccceeecceecCC-------C----Ce-EEEeecCCCCCCCeeeEEEEE
Q psy7201 3 DFENKAAWKECFKVSGVKLPT-------G----YY-FGVSAATGDLSDNHDVLGIRT 47 (65)
Q Consensus 3 di~~~~~w~~Cf~~~~v~LP~-------~----~y-fGiSAaTG~lsDnhDIis~~~ 47 (65)
++..++.|+.|.+..-..-.. | -| .-++|.||-+ .+|=-+.+++
T Consensus 38 ~l~e~~nWk~CaD~~~ls~hK~~A~l~WG~eCq~Y~~~~~aeTG~~-g~~PA~r~kv 93 (165)
T PF09175_consen 38 NLAENSNWKICADAVLLSKHKAKAKLAWGAECQQYKVSITAETGLL-GSKPAARLKV 93 (165)
T ss_dssp ES-TT---EEEEEEE--SSSEEEEEEEESSSS-SEEEEEEEEEEES-SSSEEEEEEE
T ss_pred ecCcCCCeEEEehhhcccccceEEEEecchhhhheeeeeeehhhcc-cCCcceEEEe
Confidence 567789999999862211110 1 25 8999999999 6777666665
No 16
>KOG2540|consensus
Probab=47.12 E-value=26 Score=25.87 Aligned_cols=49 Identities=18% Similarity=0.347 Sum_probs=36.1
Q ss_pred ccccccccccceeeccee-cCCCCeEEEeecCCCCCCCeeeEEEEEeeecCC
Q psy7201 3 DFENKAAWKECFKVSGVK-LPTGYYFGVSAATGDLSDNHDVLGIRTYELEFP 53 (65)
Q Consensus 3 di~~~~~w~~Cf~~~~v~-LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~ 53 (65)
||.+.-+|+.-=|-+.|. +|...-+++= |.+..-+|+||-|.+|++.+-
T Consensus 141 dV~~~l~Wkf~PqQrEiyV~PGEtALaFY--ta~N~sdkpIiGvstYni~P~ 190 (269)
T KOG2540|consen 141 DVADSLQWKFTPQQREIYVLPGETALAFY--TAENPSDKPIIGVSTYNITPG 190 (269)
T ss_pred cccccCcccccccceEEEEcCCcceeeeE--eccCCCCCCceeeEeeccCcc
Confidence 677788999877766765 6655546653 445566899999999999753
No 17
>PHA02290 hypothetical protein
Probab=34.86 E-value=20 Score=25.77 Aligned_cols=20 Identities=45% Similarity=0.790 Sum_probs=14.9
Q ss_pred cCCCCeEEEeecCCCCCCCeee
Q psy7201 21 LPTGYYFGVSAATGDLSDNHDV 42 (65)
Q Consensus 21 LP~~~yfGiSAaTG~lsDnhDI 42 (65)
+-.|.|+|+| ||-|+|.-|.
T Consensus 92 ~q~GQYIG~S--~G~LSe~ED~ 111 (234)
T PHA02290 92 VQPGQYIGIS--TGLLSESEDF 111 (234)
T ss_pred ecCCceeccc--ccccchhhhh
Confidence 4457799997 7888887663
No 18
>PF02408 CUB_2: CUB-like domain; InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=32.00 E-value=71 Score=19.78 Aligned_cols=23 Identities=35% Similarity=0.784 Sum_probs=17.2
Q ss_pred ccceeecceecCCCCe--EEEeecCCC
Q psy7201 11 KECFKVSGVKLPTGYY--FGVSAATGD 35 (65)
Q Consensus 11 ~~Cf~~~~v~LP~~~y--fGiSAaTG~ 35 (65)
..|.= .+.+|.|+| +-+++...+
T Consensus 42 ~~C~y--~i~iP~G~~a~v~~~~~~~~ 66 (120)
T PF02408_consen 42 QNCTY--QINIPKGYYAKVTLSANLND 66 (120)
T ss_pred CceEE--EEEcCCceEEEEEEEEecCC
Confidence 45653 468999998 788887776
No 19
>PF02465 FliD_N: Flagellar hook-associated protein 2 N-terminus; InterPro: IPR003481 The flagellar hook-associated protein 2 (HAP2 or FliD) is the capping protein for the flagella and forms the distal end of the flagella. The protein plays a role in mucin specific adhesion of the bacteria [].; GO: 0009296 flagellum assembly, 0009288 bacterial-type flagellum
Probab=29.04 E-value=66 Score=19.10 Aligned_cols=26 Identities=12% Similarity=0.134 Sum_probs=19.6
Q ss_pred cceecCCCCeEEEeecCCCCCCCeee
Q psy7201 17 SGVKLPTGYYFGVSAATGDLSDNHDV 42 (65)
Q Consensus 17 ~~v~LP~~~yfGiSAaTG~lsDnhDI 42 (65)
..+......++.+||..|..+.+|+|
T Consensus 64 ~~atsS~~~~~tvta~~~A~~gsy~i 89 (99)
T PF02465_consen 64 KKATSSNESVVTVTATSGASAGSYSI 89 (99)
T ss_pred eeeeccCCceEEEEecCCCcceeEEE
Confidence 34445556789999999988888875
No 20
>PHA03371 circ protein; Provisional
Probab=28.94 E-value=54 Score=24.06 Aligned_cols=28 Identities=25% Similarity=0.277 Sum_probs=23.6
Q ss_pred ceeecceecCCCCeEEEeecCCCCCCCe
Q psy7201 13 CFKVSGVKLPTGYYFGVSAATGDLSDNH 40 (65)
Q Consensus 13 Cf~~~~v~LP~~~yfGiSAaTG~lsDnh 40 (65)
|..=+.|.||.|.++.|.|..|....+.
T Consensus 28 ~LaGR~vDLPgGde~~If~~~g~T~~~~ 55 (240)
T PHA03371 28 SLAGRTVDLPGGDELRIFADCGTTTVNF 55 (240)
T ss_pred HhcCcceecCCCCeEEEeccCCCCccce
Confidence 5556789999999999999999877654
No 21
>PF05015 Plasmid_killer: Plasmid maintenance system killer protein; InterPro: IPR007711 Several plasmids with proteic killer gene systems have been reported. All of them encode a stable toxin and an unstable antidote. Upon loss of the plasmid, the less stable inhibitor is inactivated more rapidly than the toxin, allowing the toxin to be activated. The activation of those systems result in cell filamentation and cessation of viable cell production. It has been verified that both the stable killer and the unstable inhibitor of the systems are short polypeptides. This family corresponds to the toxin.
Probab=27.69 E-value=31 Score=20.80 Aligned_cols=15 Identities=13% Similarity=0.539 Sum_probs=10.0
Q ss_pred cccccccccccceeecc
Q psy7201 2 TDFENKAAWKECFKVSG 18 (65)
Q Consensus 2 ~di~~~~~w~~Cf~~~~ 18 (65)
++|++ +|+.||+..+
T Consensus 67 i~i~~--~~RliF~~~~ 81 (93)
T PF05015_consen 67 IRING--NWRLIFRFED 81 (93)
T ss_pred EEeCC--CEEEEEEEeC
Confidence 34444 7999998643
No 22
>KOG2107|consensus
Probab=26.63 E-value=74 Score=22.42 Aligned_cols=29 Identities=24% Similarity=0.596 Sum_probs=21.9
Q ss_pred cccccccceeecc--eecCCCCeEEEeecCC
Q psy7201 6 NKAAWKECFKVSG--VKLPTGYYFGVSAATG 34 (65)
Q Consensus 6 ~~~~w~~Cf~~~~--v~LP~~~yfGiSAaTG 34 (65)
.+++|--||--.+ |.||.|-|.-||..+.
T Consensus 111 ~dd~WIRi~vekGDlivlPaGiyHRFTtt~~ 141 (179)
T KOG2107|consen 111 KDDQWIRIFVEKGDLIVLPAGIYHRFTTTPS 141 (179)
T ss_pred CCCCEEEEEEecCCEEEecCcceeeeecCch
Confidence 4588998886543 7899999988876543
No 23
>COG3549 HigB Plasmid maintenance system killer protein [General function prediction only]
Probab=25.77 E-value=34 Score=21.81 Aligned_cols=12 Identities=25% Similarity=0.758 Sum_probs=9.6
Q ss_pred ccccccceeecc
Q psy7201 7 KAAWKECFKVSG 18 (65)
Q Consensus 7 ~~~w~~Cf~~~~ 18 (65)
.++|+.||.-++
T Consensus 70 N~QwrLiF~~~~ 81 (94)
T COG3549 70 NQQWRLIFRWEG 81 (94)
T ss_pred CCEEEEEEEeCC
Confidence 579999998654
No 24
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=24.81 E-value=70 Score=17.07 Aligned_cols=34 Identities=26% Similarity=0.357 Sum_probs=18.1
Q ss_pred cccccceeec-c---eecCCCCe-EEEeecCCCCCCCee
Q psy7201 8 AAWKECFKVS-G---VKLPTGYY-FGVSAATGDLSDNHD 41 (65)
Q Consensus 8 ~~w~~Cf~~~-~---v~LP~~~y-fGiSAaTG~lsDnhD 41 (65)
++|..--.-. . ..||.|-| |-+-|....-.-+-+
T Consensus 19 ~~W~~~~~~~~~~~~~~L~~G~Y~l~V~a~~~~~~~~~~ 57 (66)
T PF07495_consen 19 DEWITLGSYSNSISYTNLPPGKYTLEVRAKDNNGKWSSD 57 (66)
T ss_dssp SSEEEESSTS-EEEEES--SEEEEEEEEEEETTS-B-SS
T ss_pred CeEEECCCCcEEEEEEeCCCEEEEEEEEEECCCCCcCcc
Confidence 5565543332 2 25999988 999998754433333
No 25
>PF07472 PA-IIL: Fucose-binding lectin II (PA-IIL); InterPro: IPR010907 This entry represents calcium-mediated lectins. Structures have been determined for both fucose-binding lectin II (PA-IIL) [] and mannose-specific lectin II (RS-IIL) []. These proteins have homologous structures, their monomers consisting of a 9-stranded beta sandwich with Greek-key topology. Each monomer contains two calcium ions that mediate an exceptionally high binding affinity to the monosaccharide ligand in a recognition mode unique among carbohydrate-protein interactions. In Pseudomonas aeruginosa, PA-IIL contributes to the pathogenic virulence of the bacterium, functioning as a tetramer when binding fucose []. In the plant pathogen Ralstonia solanacearum (Pseudomonas solanacearum), RS-IIL recognises fucose, but displays much higher affinity to mannose and fructose, which is opposite to the preference of PA-IIL. ; PDB: 2WRA_A 2WR9_C 1OUX_C 2VUC_B 1GZT_C 2BOJ_D 2JDM_D 2JDH_D 1W8F_D 1UZV_A ....
Probab=22.40 E-value=75 Score=20.61 Aligned_cols=16 Identities=38% Similarity=0.665 Sum_probs=10.4
Q ss_pred eecCCCCeEEEeecCC
Q psy7201 19 VKLPTGYYFGVSAATG 34 (65)
Q Consensus 19 v~LP~~~yfGiSAaTG 34 (65)
+.||.+--||++|-+-
T Consensus 1 f~lP~~~~fg~ta~~N 16 (107)
T PF07472_consen 1 FQLPPNIRFGVTALAN 16 (107)
T ss_dssp EE--TT-EEEEEEEEE
T ss_pred CCCCCcceEeEEEEEc
Confidence 3699999999998754
Done!