Query         psy7201
Match_columns 65
No_of_seqs    105 out of 290
Neff          4.8 
Searched_HMMs 46136
Date          Fri Aug 16 20:46:44 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7201.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7201hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd06901 lectin_VIP36_VIPL VIP3  99.8 6.3E-21 1.4E-25  135.0   5.9   57    2-58    175-231 (248)
  2 PF03388 Lectin_leg-like:  Legu  99.8 3.6E-19 7.8E-24  123.6   4.0   48    4-51    182-229 (229)
  3 cd06902 lectin_ERGIC-53_ERGL E  99.7 2.8E-18   6E-23  119.9   5.0   45    6-50    180-224 (225)
  4 cd06903 lectin_EMP46_EMP47 EMP  99.7   3E-17 6.4E-22  114.3   4.9   45    2-50    169-214 (215)
  5 KOG3839|consensus               99.6 3.9E-17 8.5E-22  121.5   1.4   57    1-57    226-282 (351)
  6 KOG3838|consensus               99.6 6.2E-16 1.4E-20  118.1   5.5   56    7-63    212-267 (497)
  7 cd07308 lectin_leg-like legume  99.6 4.7E-15   1E-19  100.9   5.3   46    5-50    173-218 (218)
  8 cd01951 lectin_L-type legume l  98.0   1E-05 2.2E-10   55.0   4.6   39    9-47    180-221 (223)
  9 cd06899 lectin_legume_LecRK_Ar  97.6 6.1E-05 1.3E-09   52.4   3.7   28   20-47    205-232 (236)
 10 PF00139 Lectin_legB:  Legume l  96.8 0.00084 1.8E-08   46.3   2.2   27   21-47    209-235 (236)
 11 cd06900 lectin_VcfQ VcfQ bacte  92.7    0.13 2.8E-06   37.7   3.1   28   20-47    226-253 (255)
 12 PF09116 gp45-slide_C:  gp45 sl  86.0    0.63 1.4E-05   30.2   2.0   28    4-31     59-87  (112)
 13 PHA02545 45 sliding clamp; Pro  82.9     1.3 2.8E-05   31.9   2.7   30    4-33    171-201 (223)
 14 PF07013 DUF1314:  Protein of u  57.4     8.5 0.00018   27.0   1.9   25   12-36      6-30  (177)
 15 PF09175 DUF1944:  Domain of un  53.6      15 0.00031   25.3   2.6   44    3-47     38-93  (165)
 16 KOG2540|consensus               47.1      26 0.00057   25.9   3.2   49    3-53    141-190 (269)
 17 PHA02290 hypothetical protein   34.9      20 0.00044   25.8   1.0   20   21-42     92-111 (234)
 18 PF02408 CUB_2:  CUB-like domai  32.0      71  0.0015   19.8   3.1   23   11-35     42-66  (120)
 19 PF02465 FliD_N:  Flagellar hoo  29.0      66  0.0014   19.1   2.5   26   17-42     64-89  (99)
 20 PHA03371 circ protein; Provisi  28.9      54  0.0012   24.1   2.4   28   13-40     28-55  (240)
 21 PF05015 Plasmid_killer:  Plasm  27.7      31 0.00067   20.8   0.8   15    2-18     67-81  (93)
 22 KOG2107|consensus               26.6      74  0.0016   22.4   2.6   29    6-34    111-141 (179)
 23 COG3549 HigB Plasmid maintenan  25.8      34 0.00073   21.8   0.8   12    7-18     70-81  (94)
 24 PF07495 Y_Y_Y:  Y_Y_Y domain;   24.8      70  0.0015   17.1   1.9   34    8-41     19-57  (66)
 25 PF07472 PA-IIL:  Fucose-bindin  22.4      75  0.0016   20.6   1.9   16   19-34      1-16  (107)

No 1  
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER).  Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=99.83  E-value=6.3e-21  Score=134.95  Aligned_cols=57  Identities=60%  Similarity=1.120  Sum_probs=53.8

Q ss_pred             cccccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcCC
Q psy7201           2 TDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKLS   58 (65)
Q Consensus         2 ~di~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~~   58 (65)
                      +|++++++|+.||++++|.||+++||||||+||+++|+|||++|++|++..+.++++
T Consensus       175 vd~~~~~~w~~Cf~~~~v~LP~~~yfGiSA~Tg~~sd~hdIlsv~~~~l~~~~~~~~  231 (248)
T cd06901         175 TDIDGKNEWKECFDVTGVRLPTGYYFGASAATGDLSDNHDIISMKLYELDVEETPEE  231 (248)
T ss_pred             EecCCCCceeeeEEeCCeecCCCCEEEEEecCCCCCCcEEEEEEEEecCcccccccc
Confidence            678899999999999999999999999999999999999999999999999988743


No 2  
>PF03388 Lectin_leg-like:  Legume-like lectin family;  InterPro: IPR005052  Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins []. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences[].  ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin []. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway [,]. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold [, , , ]. ; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=99.76  E-value=3.6e-19  Score=123.60  Aligned_cols=48  Identities=60%  Similarity=1.217  Sum_probs=45.5

Q ss_pred             cccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeec
Q psy7201           4 FENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELE   51 (65)
Q Consensus         4 i~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~   51 (65)
                      .++.++|+.||++++|.||.++||||||+||+++|||||+||++|+|.
T Consensus       182 ~~~~~~~~~Cf~~~~v~LP~~~yfGvSA~Tg~~~d~hdi~s~~~~~l~  229 (229)
T PF03388_consen  182 LKNQDDWELCFTTDGVDLPEGYYFGVSAATGELSDNHDILSVKTYSLD  229 (229)
T ss_dssp             CSECCTTEEEEEESTEEGGSSBEEEEEEEESSSGGEEEEEEEEEEEEE
T ss_pred             ccCCcCCcEEEEcCCeecCCCCEEEEEecCCCCCCcEEEEEEEEeecC
Confidence            567899999999999999999999999999999999999999999974


No 3  
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC).  ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain.  ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) which may act as a regulator of ERGIC-53.  In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII.  Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=99.73  E-value=2.8e-18  Score=119.85  Aligned_cols=45  Identities=51%  Similarity=0.851  Sum_probs=42.7

Q ss_pred             cccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201           6 NKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL   50 (65)
Q Consensus         6 ~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l   50 (65)
                      +.++|+.||++++|.||.++||||||+||+++|+|||+||++|++
T Consensus       180 ~~~~~~~Cf~~~~v~LP~~~yfGiSA~Tg~l~d~hDIls~~~~sl  224 (225)
T cd06902         180 NKDDYELCTRVENMVLPPNGYFGVSAATGGLADDHDVLSFLTFSL  224 (225)
T ss_pred             CCCcccEEEecCCeeCCCCCEEEEEecCCCCCCcEeEEEEEEecc
Confidence            357899999999999999999999999999999999999999976


No 4  
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins.  EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=99.69  E-value=3e-17  Score=114.29  Aligned_cols=45  Identities=31%  Similarity=0.735  Sum_probs=41.7

Q ss_pred             cccccccccccceeecceecC-CCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201           2 TDFENKAAWKECFKVSGVKLP-TGYYFGVSAATGDLSDNHDVLGIRTYEL   50 (65)
Q Consensus         2 ~di~~~~~w~~Cf~~~~v~LP-~~~yfGiSAaTG~lsDnhDIis~~~~~l   50 (65)
                      ||+|+    +.||++++|.|| .+|||||||+||+++|+|||++|++|+.
T Consensus       169 v~vd~----~~Cf~~~~v~lP~~~y~fGiSAaTg~~~d~hdIl~~~~~~~  214 (215)
T cd06903         169 VQVDN----RLCFQTDKVQLPQGGYRFGITAANADNPESFEILKLKVWNG  214 (215)
T ss_pred             EEECC----CEEEecCCeecCCCCCEEEEEEcCCCCCCcEEEEEEEEecC
Confidence            67776    899999999999 8999999999999999999999999974


No 5  
>KOG3839|consensus
Probab=99.64  E-value=3.9e-17  Score=121.50  Aligned_cols=57  Identities=46%  Similarity=0.876  Sum_probs=53.6

Q ss_pred             CcccccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcC
Q psy7201           1 STDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKL   57 (65)
Q Consensus         1 ~~di~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~   57 (65)
                      ++|+.+.++|..|+.+.+|+||.|||||+||+||+++|+|||+|++.|+|...+++.
T Consensus       226 ~~dl~~~~~~~~c~~~n~v~lp~g~~fg~SasTGdlSd~HdivS~kl~~L~d~~~~~  282 (351)
T KOG3839|consen  226 SIDLEGPNEWIDCFSLNNVELPLGYFFGVSASTGDLSDSHDIVSLKLFELTDIHVEG  282 (351)
T ss_pred             EEecCCCceeeeeeeecceecccceEEeeeeccCccchhhHHHHhhhhhhhcccCCC
Confidence            478999999999999999999999999999999999999999999999999886654


No 6  
>KOG3838|consensus
Probab=99.61  E-value=6.2e-16  Score=118.05  Aligned_cols=56  Identities=45%  Similarity=0.800  Sum_probs=51.1

Q ss_pred             ccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeeecCCCCcCCCCCCC
Q psy7201           7 KAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFPGEKLSKGPFI   63 (65)
Q Consensus         7 ~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~~~~~~~~~~~   63 (65)
                      .+.|+.|.+++++.||.+.|||||||||+|+|+|||+|+.||+|..|. ++.++||.
T Consensus       212 ~d~yE~C~rve~~~lp~nGyFGvSAATGgLADDHDVl~FltfsL~e~~-qeppqP~~  267 (497)
T KOG3838|consen  212 SDDYEFCVRVENLLLPPNGYFGVSAATGGLADDHDVLSFLTFSLSEPG-QEPPQPFL  267 (497)
T ss_pred             CCCcceeEeccceeccCCCeeeeeecccccccccceeeeEEeeecCCC-CCCCCchh
Confidence            478999999999999999999999999999999999999999998554 55888875


No 7  
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins.  This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex.  These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein.  Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=99.56  E-value=4.7e-15  Score=100.93  Aligned_cols=46  Identities=61%  Similarity=1.059  Sum_probs=42.8

Q ss_pred             ccccccccceeecceecCCCCeEEEeecCCCCCCCeeeEEEEEeee
Q psy7201           5 ENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL   50 (65)
Q Consensus         5 ~~~~~w~~Cf~~~~v~LP~~~yfGiSAaTG~lsDnhDIis~~~~~l   50 (65)
                      .+.++|+.||+++.+.||.++||||||+||+++|+|||++|++|++
T Consensus       173 ~~~~~~~~c~~~~~~~l~~~~y~G~sA~tg~~~d~~dIls~~~~~~  218 (218)
T cd07308         173 SEGNNWKECFTVEDVILPSQGYFGFSAQTGDLSDNHDILSVHTYEL  218 (218)
T ss_pred             CCCCCccEEEEcCCcccCCCCEEEEEeccCCCcCcEEEEEEEeeeC
Confidence            3467999999999999999999999999999999999999999974


No 8  
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=98.01  E-value=1e-05  Score=54.97  Aligned_cols=39  Identities=31%  Similarity=0.365  Sum_probs=32.3

Q ss_pred             ccccceeeccee---cCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201           9 AWKECFKVSGVK---LPTGYYFGVSAATGDLSDNHDVLGIRT   47 (65)
Q Consensus         9 ~w~~Cf~~~~v~---LP~~~yfGiSAaTG~lsDnhDIis~~~   47 (65)
                      .|..|..+....   +|...|+||||+||+.++.|+|+++++
T Consensus       180 ~~~~~l~~~~~l~~~~~~~~yvGFTAsTG~~~~~h~V~~wsf  221 (223)
T cd01951         180 LTSLDITIPVDLIQLGPTKAYFGFTASTGGLTNLHDILNWSF  221 (223)
T ss_pred             cccccEEEeeeecccCCCcEEEEEEcccCCCcceeEEEEEEe
Confidence            477899874322   488999999999999999999999865


No 9  
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=97.64  E-value=6.1e-05  Score=52.40  Aligned_cols=28  Identities=39%  Similarity=0.536  Sum_probs=25.3

Q ss_pred             ecCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201          20 KLPTGYYFGVSAATGDLSDNHDVLGIRT   47 (65)
Q Consensus        20 ~LP~~~yfGiSAaTG~lsDnhDIis~~~   47 (65)
                      .||+..|+|+||+||++.+.|+|++...
T Consensus       205 ~l~~~~~vGFSasTG~~~~~h~i~sWsF  232 (236)
T cd06899         205 VLPEEVYVGFSASTGLLTELHYILSWSF  232 (236)
T ss_pred             hCCCceEEEEEeEcCCCcceEEEEEEEE
Confidence            4788899999999999999999999754


No 10 
>PF00139 Lectin_legB:  Legume lectin domain;  InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=96.80  E-value=0.00084  Score=46.35  Aligned_cols=27  Identities=41%  Similarity=0.621  Sum_probs=24.3

Q ss_pred             cCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201          21 LPTGYYFGVSAATGDLSDNHDVLGIRT   47 (65)
Q Consensus        21 LP~~~yfGiSAaTG~lsDnhDIis~~~   47 (65)
                      ||...|+||||+||.....|+|++...
T Consensus       209 l~~~v~vGFsasTG~~~~~h~I~sW~F  235 (236)
T PF00139_consen  209 LPEQVYVGFSASTGGSYQTHDILSWSF  235 (236)
T ss_dssp             SCSEEEEEEEEEESSSSEEEEEEEEEE
T ss_pred             cCCCcEEEEEeecCCCcceEEEEEEEe
Confidence            677789999999999999999999864


No 11 
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein.  VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly.  VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=92.69  E-value=0.13  Score=37.72  Aligned_cols=28  Identities=18%  Similarity=0.605  Sum_probs=24.9

Q ss_pred             ecCCCCeEEEeecCCCCCCCeeeEEEEE
Q psy7201          20 KLPTGYYFGVSAATGDLSDNHDVLGIRT   47 (65)
Q Consensus        20 ~LP~~~yfGiSAaTG~lsDnhDIis~~~   47 (65)
                      .+|+..+|+++++||....+|||=.+++
T Consensus       226 avP~~f~lS~TgSTGgstN~HEIdnf~V  253 (255)
T cd06900         226 AIPENFYLSFTGSTGGSTNTHEIDNFQV  253 (255)
T ss_pred             CCCccEEEEEEecCCCcccceeecceEe
Confidence            5888999999999999999999977664


No 12 
>PF09116 gp45-slide_C:  gp45 sliding clamp, C terminal;  InterPro: IPR015200 This domain is essential for the interaction of the gp45 sliding clamp with the corresponding polymerase. It adopts a DNA clamp fold, consisting of two alpha helices and two beta sheets - the fold is duplicated and has internal pseudo two-fold symmetry []. ; PDB: 1B8H_A 1B77_B 3U61_F 3U60_G 3U5Z_R 1CZD_B.
Probab=86.01  E-value=0.63  Score=30.22  Aligned_cols=28  Identities=25%  Similarity=0.593  Sum_probs=16.8

Q ss_pred             cccccccccceeecceecCCCCe-EEEee
Q psy7201           4 FENKAAWKECFKVSGVKLPTGYY-FGVSA   31 (65)
Q Consensus         4 i~~~~~w~~Cf~~~~v~LP~~~y-fGiSA   31 (65)
                      -++..++..||+++|++|..|-| +-||+
T Consensus        59 ~~~~~~F~f~~k~eNmk~~~gdY~V~is~   87 (112)
T PF09116_consen   59 YDGDNNFCFCFKMENMKMLPGDYKVKISK   87 (112)
T ss_dssp             --SS--EEEEEEGGG----SS-EEEEEEE
T ss_pred             cCCCccEEEEEEeceeEeecCCeEEEEEh
Confidence            45566799999999999877777 99998


No 13 
>PHA02545 45 sliding clamp; Provisional
Probab=82.92  E-value=1.3  Score=31.91  Aligned_cols=30  Identities=20%  Similarity=0.427  Sum_probs=25.3

Q ss_pred             cccccccccceeecceecCCCCe-EEEeecC
Q psy7201           4 FENKAAWKECFKVSGVKLPTGYY-FGVSAAT   33 (65)
Q Consensus         4 i~~~~~w~~Cf~~~~v~LP~~~y-fGiSAaT   33 (65)
                      .++..++..|+++++++|+.|-| +-+|++-
T Consensus       171 ~~g~~~F~f~~~~~Nmk~~~gdY~v~is~~g  201 (223)
T PHA02545        171 YDGDNEFNFIIKIENMKLVPGDYKVSISKKG  201 (223)
T ss_pred             ccCCcceEEEEEeceeEecCCCeEEEEEccc
Confidence            45566899999999999988877 9998874


No 14 
>PF07013 DUF1314:  Protein of unknown function (DUF1314);  InterPro: IPR010741 This family consists of several alphaherpesvirus proteins of around 200 residues in length. They belong to the varicellovirus ORF2 family and their function is unknown.
Probab=57.36  E-value=8.5  Score=27.02  Aligned_cols=25  Identities=28%  Similarity=0.369  Sum_probs=21.5

Q ss_pred             cceeecceecCCCCeEEEeecCCCC
Q psy7201          12 ECFKVSGVKLPTGYYFGVSAATGDL   36 (65)
Q Consensus        12 ~Cf~~~~v~LP~~~yfGiSAaTG~l   36 (65)
                      .|..=++|.||.|.+++|+|-+|..
T Consensus         6 ~~LaGR~iDLPgG~el~I~~~~g~~   30 (177)
T PF07013_consen    6 RSLAGRTIDLPGGDELRISANTGRP   30 (177)
T ss_pred             hhhcCcceecCCCCeEEEecCCCCc
Confidence            3556678999999999999999986


No 15 
>PF09175 DUF1944:  Domain of unknown function (DUF1944);  InterPro: IPR015258 Vitellinogen precursors provide the major egg yolk proteins that are a source of nutrients during early development of oviparous vertebrates and invertebrates. Vitellinogen precursors are multi-domain apolipoproteins that are cleaved into distinct yolk proteins. Different vitellinogen precursors exist, which are composed of variable combinations of yolk protein components; however, the cleavage sites are conserved [, ]. In vertebrates, a complete vitellinogen is composed of an N-terminal signal peptide for export, followed by four regions that can be cleaved into yolk proteins: lipovitellin-1, phosvitin, lipovitellin-2, and a von Willebrand factor type D domain (YGP40). Vitellinogens are post-translationally glycosylated and phosphorylated in the endoplasmic reticulum and Golgi complex of hepatocytes, before being secreted into the circulatory system to be taken up by oocytes. In the ovary, vitellinogens bind to specific Vtgr receptors on oocyte membranes to become internalised by endocytosis, where they are cleaved into yolk proteins by cathepsin D. YGP40 is released into the yolk plasma before or during compartmentation of lipovitellin-phosvitin complex into the yolk granule. The different yolk proteins have distinct roles. Phosvitins are important in sequestering calcium, iron and other cations for the developing embryo. Phosvitins are one of the most phosphorylated (10%) proteins in nature, the high concentration of phosphate groups providing efficient metal-binding sites in clusters [, ]. Lipovitellins are involved in lipid and metal storage, and contain a heterogeneous mixture of about 16% (w/w) noncovalently bound lipid, most being phospholipid. Lipovitellin-1 contains two chains, LV1N and LV1C [, ]. This entry represents the beta-sheet shell domain found in vitellinogen, which generally corresponds to the lipovitellin-2 peptide product. This domain consists of several large open beta-sheets []. It is often found C-terminal to IPR001747 from INTERPRO and IPR015255 from INTERPRO. ; GO: 0005319 lipid transporter activity, 0006869 lipid transport; PDB: 1LSH_B.
Probab=53.56  E-value=15  Score=25.33  Aligned_cols=44  Identities=20%  Similarity=0.442  Sum_probs=26.0

Q ss_pred             ccccccccccceeecceecCC-------C----Ce-EEEeecCCCCCCCeeeEEEEE
Q psy7201           3 DFENKAAWKECFKVSGVKLPT-------G----YY-FGVSAATGDLSDNHDVLGIRT   47 (65)
Q Consensus         3 di~~~~~w~~Cf~~~~v~LP~-------~----~y-fGiSAaTG~lsDnhDIis~~~   47 (65)
                      ++..++.|+.|.+..-..-..       |    -| .-++|.||-+ .+|=-+.+++
T Consensus        38 ~l~e~~nWk~CaD~~~ls~hK~~A~l~WG~eCq~Y~~~~~aeTG~~-g~~PA~r~kv   93 (165)
T PF09175_consen   38 NLAENSNWKICADAVLLSKHKAKAKLAWGAECQQYKVSITAETGLL-GSKPAARLKV   93 (165)
T ss_dssp             ES-TT---EEEEEEE--SSSEEEEEEEESSSS-SEEEEEEEEEEES-SSSEEEEEEE
T ss_pred             ecCcCCCeEEEehhhcccccceEEEEecchhhhheeeeeeehhhcc-cCCcceEEEe
Confidence            567789999999862211110       1    25 8999999999 6777666665


No 16 
>KOG2540|consensus
Probab=47.12  E-value=26  Score=25.87  Aligned_cols=49  Identities=18%  Similarity=0.347  Sum_probs=36.1

Q ss_pred             ccccccccccceeeccee-cCCCCeEEEeecCCCCCCCeeeEEEEEeeecCC
Q psy7201           3 DFENKAAWKECFKVSGVK-LPTGYYFGVSAATGDLSDNHDVLGIRTYELEFP   53 (65)
Q Consensus         3 di~~~~~w~~Cf~~~~v~-LP~~~yfGiSAaTG~lsDnhDIis~~~~~l~~~   53 (65)
                      ||.+.-+|+.-=|-+.|. +|...-+++=  |.+..-+|+||-|.+|++.+-
T Consensus       141 dV~~~l~Wkf~PqQrEiyV~PGEtALaFY--ta~N~sdkpIiGvstYni~P~  190 (269)
T KOG2540|consen  141 DVADSLQWKFTPQQREIYVLPGETALAFY--TAENPSDKPIIGVSTYNITPG  190 (269)
T ss_pred             cccccCcccccccceEEEEcCCcceeeeE--eccCCCCCCceeeEeeccCcc
Confidence            677788999877766765 6655546653  445566899999999999753


No 17 
>PHA02290 hypothetical protein
Probab=34.86  E-value=20  Score=25.77  Aligned_cols=20  Identities=45%  Similarity=0.790  Sum_probs=14.9

Q ss_pred             cCCCCeEEEeecCCCCCCCeee
Q psy7201          21 LPTGYYFGVSAATGDLSDNHDV   42 (65)
Q Consensus        21 LP~~~yfGiSAaTG~lsDnhDI   42 (65)
                      +-.|.|+|+|  ||-|+|.-|.
T Consensus        92 ~q~GQYIG~S--~G~LSe~ED~  111 (234)
T PHA02290         92 VQPGQYIGIS--TGLLSESEDF  111 (234)
T ss_pred             ecCCceeccc--ccccchhhhh
Confidence            4457799997  7888887663


No 18 
>PF02408 CUB_2:  CUB-like domain;  InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=32.00  E-value=71  Score=19.78  Aligned_cols=23  Identities=35%  Similarity=0.784  Sum_probs=17.2

Q ss_pred             ccceeecceecCCCCe--EEEeecCCC
Q psy7201          11 KECFKVSGVKLPTGYY--FGVSAATGD   35 (65)
Q Consensus        11 ~~Cf~~~~v~LP~~~y--fGiSAaTG~   35 (65)
                      ..|.=  .+.+|.|+|  +-+++...+
T Consensus        42 ~~C~y--~i~iP~G~~a~v~~~~~~~~   66 (120)
T PF02408_consen   42 QNCTY--QINIPKGYYAKVTLSANLND   66 (120)
T ss_pred             CceEE--EEEcCCceEEEEEEEEecCC
Confidence            45653  468999998  788887776


No 19 
>PF02465 FliD_N:  Flagellar hook-associated protein 2 N-terminus;  InterPro: IPR003481 The flagellar hook-associated protein 2 (HAP2 or FliD) is the capping protein for the flagella and forms the distal end of the flagella. The protein plays a role in mucin specific adhesion of the bacteria [].; GO: 0009296 flagellum assembly, 0009288 bacterial-type flagellum
Probab=29.04  E-value=66  Score=19.10  Aligned_cols=26  Identities=12%  Similarity=0.134  Sum_probs=19.6

Q ss_pred             cceecCCCCeEEEeecCCCCCCCeee
Q psy7201          17 SGVKLPTGYYFGVSAATGDLSDNHDV   42 (65)
Q Consensus        17 ~~v~LP~~~yfGiSAaTG~lsDnhDI   42 (65)
                      ..+......++.+||..|..+.+|+|
T Consensus        64 ~~atsS~~~~~tvta~~~A~~gsy~i   89 (99)
T PF02465_consen   64 KKATSSNESVVTVTATSGASAGSYSI   89 (99)
T ss_pred             eeeeccCCceEEEEecCCCcceeEEE
Confidence            34445556789999999988888875


No 20 
>PHA03371 circ protein; Provisional
Probab=28.94  E-value=54  Score=24.06  Aligned_cols=28  Identities=25%  Similarity=0.277  Sum_probs=23.6

Q ss_pred             ceeecceecCCCCeEEEeecCCCCCCCe
Q psy7201          13 CFKVSGVKLPTGYYFGVSAATGDLSDNH   40 (65)
Q Consensus        13 Cf~~~~v~LP~~~yfGiSAaTG~lsDnh   40 (65)
                      |..=+.|.||.|.++.|.|..|....+.
T Consensus        28 ~LaGR~vDLPgGde~~If~~~g~T~~~~   55 (240)
T PHA03371         28 SLAGRTVDLPGGDELRIFADCGTTTVNF   55 (240)
T ss_pred             HhcCcceecCCCCeEEEeccCCCCccce
Confidence            5556789999999999999999877654


No 21 
>PF05015 Plasmid_killer:  Plasmid maintenance system killer protein;  InterPro: IPR007711 Several plasmids with proteic killer gene systems have been reported. All of them encode a stable toxin and an unstable antidote. Upon loss of the plasmid, the less stable inhibitor is inactivated more rapidly than the toxin, allowing the toxin to be activated. The activation of those systems result in cell filamentation and cessation of viable cell production. It has been verified that both the stable killer and the unstable inhibitor of the systems are short polypeptides. This family corresponds to the toxin.
Probab=27.69  E-value=31  Score=20.80  Aligned_cols=15  Identities=13%  Similarity=0.539  Sum_probs=10.0

Q ss_pred             cccccccccccceeecc
Q psy7201           2 TDFENKAAWKECFKVSG   18 (65)
Q Consensus         2 ~di~~~~~w~~Cf~~~~   18 (65)
                      ++|++  +|+.||+..+
T Consensus        67 i~i~~--~~RliF~~~~   81 (93)
T PF05015_consen   67 IRING--NWRLIFRFED   81 (93)
T ss_pred             EEeCC--CEEEEEEEeC
Confidence            34444  7999998643


No 22 
>KOG2107|consensus
Probab=26.63  E-value=74  Score=22.42  Aligned_cols=29  Identities=24%  Similarity=0.596  Sum_probs=21.9

Q ss_pred             cccccccceeecc--eecCCCCeEEEeecCC
Q psy7201           6 NKAAWKECFKVSG--VKLPTGYYFGVSAATG   34 (65)
Q Consensus         6 ~~~~w~~Cf~~~~--v~LP~~~yfGiSAaTG   34 (65)
                      .+++|--||--.+  |.||.|-|.-||..+.
T Consensus       111 ~dd~WIRi~vekGDlivlPaGiyHRFTtt~~  141 (179)
T KOG2107|consen  111 KDDQWIRIFVEKGDLIVLPAGIYHRFTTTPS  141 (179)
T ss_pred             CCCCEEEEEEecCCEEEecCcceeeeecCch
Confidence            4588998886543  7899999988876543


No 23 
>COG3549 HigB Plasmid maintenance system killer protein [General function prediction only]
Probab=25.77  E-value=34  Score=21.81  Aligned_cols=12  Identities=25%  Similarity=0.758  Sum_probs=9.6

Q ss_pred             ccccccceeecc
Q psy7201           7 KAAWKECFKVSG   18 (65)
Q Consensus         7 ~~~w~~Cf~~~~   18 (65)
                      .++|+.||.-++
T Consensus        70 N~QwrLiF~~~~   81 (94)
T COG3549          70 NQQWRLIFRWEG   81 (94)
T ss_pred             CCEEEEEEEeCC
Confidence            579999998654


No 24 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=24.81  E-value=70  Score=17.07  Aligned_cols=34  Identities=26%  Similarity=0.357  Sum_probs=18.1

Q ss_pred             cccccceeec-c---eecCCCCe-EEEeecCCCCCCCee
Q psy7201           8 AAWKECFKVS-G---VKLPTGYY-FGVSAATGDLSDNHD   41 (65)
Q Consensus         8 ~~w~~Cf~~~-~---v~LP~~~y-fGiSAaTG~lsDnhD   41 (65)
                      ++|..--.-. .   ..||.|-| |-+-|....-.-+-+
T Consensus        19 ~~W~~~~~~~~~~~~~~L~~G~Y~l~V~a~~~~~~~~~~   57 (66)
T PF07495_consen   19 DEWITLGSYSNSISYTNLPPGKYTLEVRAKDNNGKWSSD   57 (66)
T ss_dssp             SSEEEESSTS-EEEEES--SEEEEEEEEEEETTS-B-SS
T ss_pred             CeEEECCCCcEEEEEEeCCCEEEEEEEEEECCCCCcCcc
Confidence            5565543332 2   25999988 999998754433333


No 25 
>PF07472 PA-IIL:  Fucose-binding lectin II (PA-IIL);  InterPro: IPR010907 This entry represents calcium-mediated lectins. Structures have been determined for both fucose-binding lectin II (PA-IIL) [] and mannose-specific lectin II (RS-IIL) []. These proteins have homologous structures, their monomers consisting of a 9-stranded beta sandwich with Greek-key topology. Each monomer contains two calcium ions that mediate an exceptionally high binding affinity to the monosaccharide ligand in a recognition mode unique among carbohydrate-protein interactions. In Pseudomonas aeruginosa, PA-IIL contributes to the pathogenic virulence of the bacterium, functioning as a tetramer when binding fucose []. In the plant pathogen Ralstonia solanacearum (Pseudomonas solanacearum), RS-IIL recognises fucose, but displays much higher affinity to mannose and fructose, which is opposite to the preference of PA-IIL. ; PDB: 2WRA_A 2WR9_C 1OUX_C 2VUC_B 1GZT_C 2BOJ_D 2JDM_D 2JDH_D 1W8F_D 1UZV_A ....
Probab=22.40  E-value=75  Score=20.61  Aligned_cols=16  Identities=38%  Similarity=0.665  Sum_probs=10.4

Q ss_pred             eecCCCCeEEEeecCC
Q psy7201          19 VKLPTGYYFGVSAATG   34 (65)
Q Consensus        19 v~LP~~~yfGiSAaTG   34 (65)
                      +.||.+--||++|-+-
T Consensus         1 f~lP~~~~fg~ta~~N   16 (107)
T PF07472_consen    1 FQLPPNIRFGVTALAN   16 (107)
T ss_dssp             EE--TT-EEEEEEEEE
T ss_pred             CCCCCcceEeEEEEEc
Confidence            3699999999998754


Done!