Query         psy13631
Match_columns 81
No_of_seqs    120 out of 626
Neff          8.6 
Searched_HMMs 46136
Date          Fri Aug 16 16:12:45 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy13631.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13631hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.2 8.8E-11 1.9E-15   86.4   7.2   69   11-80    978-1053(1289)
  2 KOG1215|consensus               99.1 5.6E-10 1.2E-14   82.9   8.5   75    2-81    389-466 (877)
  3 smart00135 LY Low-density lipo  98.4 1.4E-06 3.1E-11   41.6   4.9   32   49-80      6-37  (43)
  4 PF12662 cEGF:  Complement Clr-  97.8 9.9E-06 2.1E-10   35.0   1.1   15    2-16      5-20  (24)
  5 PF14670 FXa_inhibition:  Coagu  97.6 3.1E-05 6.8E-10   36.6   1.3   14    2-15     22-36  (36)
  6 PF00058 Ldl_recept_b:  Low-den  97.1 0.00076 1.7E-08   32.7   2.8   17   64-80      1-18  (42)
  7 KOG1215|consensus               96.7  0.0028   6E-08   47.8   4.3   40    2-42    694-734 (877)
  8 PF09064 Tme5_EGF_like:  Thromb  96.3  0.0027   6E-08   29.4   1.5   14    2-15     21-34  (34)
  9 KOG1214|consensus               96.2   0.011 2.4E-07   45.0   4.8   57   22-80   1038-1096(1289)
 10 PF02822 Antistasin:  Antistasi  93.6   0.043 9.4E-07   23.9   1.1   12    1-12      8-20  (26)
 11 PF08450 SGL:  SMP-30/Gluconola  93.5    0.17 3.8E-06   32.3   4.3   25   56-80      4-28  (246)
 12 KOG4659|consensus               92.5    0.63 1.4E-05   37.6   6.5   62   19-80    374-435 (1899)
 13 PF07645 EGF_CA:  Calcium-bindi  91.6   0.088 1.9E-06   25.2   0.8   14    2-15     28-42  (42)
 14 PF07995 GSDH:  Glucose / Sorbo  91.3    0.88 1.9E-05   30.9   5.6   44   29-74    148-203 (331)
 15 PF08450 SGL:  SMP-30/Gluconola  88.7     2.9 6.4E-05   26.6   6.3   45   31-79    117-161 (246)
 16 COG2133 Glucose/sorbosone dehy  87.4     3.7 7.9E-05   29.1   6.5   26   49-74    236-261 (399)
 17 TIGR02604 Piru_Ver_Nterm putat  85.0     4.8  0.0001   27.6   6.1   44   30-77    165-208 (367)
 18 PF01436 NHL:  NHL repeat;  Int  82.7     2.9 6.3E-05   18.0   4.8   26   52-78      2-27  (28)
 19 KOG1520|consensus               80.9     6.2 0.00013   27.8   5.3   52   22-74    128-182 (376)
 20 TIGR03606 non_repeat_PQQ dehyd  80.7       6 0.00013   28.5   5.3   49   30-79    200-259 (454)
 21 PLN02919 haloacid dehalogenase  79.8      13 0.00028   29.5   7.2   57   22-78    581-650 (1057)
 22 PF01826 TIL:  Trypsin Inhibito  79.3    0.83 1.8E-05   22.8   0.5   15    2-16     36-50  (55)
 23 COG3386 Gluconolactonase [Carb  78.2     2.5 5.3E-05   28.8   2.7   23   57-79     30-52  (307)
 24 TIGR01492 CPW_WPC Plasmodium f  75.4     1.9 4.1E-05   22.4   1.2   16    2-17      9-25  (62)
 25 PF12661 hEGF:  Human growth fa  75.4     1.3 2.9E-05   16.0   0.5    7    2-8       3-9   (13)
 26 PLN02919 haloacid dehalogenase  73.5      31 0.00068   27.5   7.8   58   21-78    636-709 (1057)
 27 COG3386 Gluconolactonase [Carb  72.0      20 0.00043   24.4   5.8   29   51-79    162-190 (307)
 28 PF01731 Arylesterase:  Arylest  69.2      17 0.00038   20.1   4.8   31   48-78     50-80  (86)
 29 PF07995 GSDH:  Glucose / Sorbo  69.1      33 0.00072   23.3   7.2   55   23-78    273-331 (331)
 30 PF04942 CC:  CC domain;  Inter  69.0     3.6 7.9E-05   19.2   1.3   14    2-15     19-33  (36)
 31 KOG0315|consensus               68.1      15 0.00032   25.0   4.4   47   20-67     52-99  (311)
 32 PF02425 GBP_PSP:  Paralytic/GB  66.7     4.1 8.8E-05   17.0   1.0   14    2-16      6-20  (23)
 33 smart00032 CCP Domain abundant  65.6     2.6 5.7E-05   20.1   0.4   15    2-16     29-45  (57)
 34 PF13948 DUF4215:  Domain of un  64.4     5.1 0.00011   19.6   1.3   15    2-16      3-18  (47)
 35 smart00179 EGF_CA Calcium-bind  64.0     4.2 9.1E-05   18.1   1.0    6    3-8      28-33  (39)
 36 PF00396 Granulin:  Granulin;    62.8       4 8.6E-05   19.8   0.7   14    3-16     28-42  (43)
 37 KOG4499|consensus               58.0     9.2  0.0002   25.8   2.0   23   56-78     19-41  (310)
 38 PF09717 CPW_WPC:  Plasmodium f  57.4     4.8  0.0001   20.6   0.6   16    2-17      8-23  (60)
 39 cd00033 CCP Complement control  56.5       5 0.00011   19.2   0.5   15    2-16     28-44  (57)
 40 PF14339 DUF4394:  Domain of un  55.9      44 0.00096   22.1   4.9   25   52-77     27-51  (236)
 41 PF06977 SdiA-regulated:  SdiA-  55.3      54  0.0012   21.7   5.3   33   46-78     15-48  (248)
 42 PF14280 DUF4365:  Domain of un  50.4      38 0.00082   19.6   3.7   24   53-76     84-107 (138)
 43 PF13947 GUB_WAK_bind:  Wall-as  50.0      29 0.00063   19.3   3.1   33    4-39     26-59  (106)
 44 PF12947 EGF_3:  EGF domain;  I  49.9     6.9 0.00015   18.1   0.4   12    3-15     25-36  (36)
 45 PRK11028 6-phosphogluconolacto  49.0      76  0.0016   21.0   7.6   54   23-76      4-59  (330)
 46 smart00494 ChtBD2 Chitin-bindi  46.3      17 0.00038   17.2   1.6   16    1-16     31-47  (56)
 47 PF06045 Rhamnogal_lyase:  Rham  45.6      48   0.001   21.5   3.8   42   27-69     20-61  (203)
 48 PF12075 KN_motif:  KN motif;    45.2      13 0.00027   17.7   0.9   15    2-16      5-21  (39)
 49 PF03088 Str_synth:  Strictosid  44.1      57  0.0012   18.1   4.8   31   49-79     54-84  (89)
 50 PF00084 Sushi:  Sushi domain (  42.7       7 0.00015   18.7  -0.2   15    2-16     28-44  (56)
 51 PF14761 HPS3_N:  Hermansky-Pud  42.7      96  0.0021   20.3   5.6   49   21-70     29-78  (215)
 52 PF05098 LEF-4:  Late expressio  41.1      70  0.0015   23.3   4.4   58    7-72    238-299 (450)
 53 PF03022 MRJP:  Major royal jel  40.3      66  0.0014   21.6   4.0   25   53-78    187-211 (287)
 54 PF09603 Fib_succ_major:  Fibro  39.3     9.9 0.00021   23.2   0.0    9    2-10     69-77  (184)
 55 PF08366 LLGL:  LLGL2;  InterPr  38.7      80  0.0017   18.3   4.3   11   19-29     27-37  (105)
 56 smart00181 EGF Epidermal growt  38.0      17 0.00036   15.7   0.7    7    3-9      24-30  (35)
 57 smart00284 OLF Olfactomedin-li  37.2      91   0.002   20.9   4.2   41   34-74    205-246 (255)
 58 PTZ00444 hypothetical protein;  36.5      18  0.0004   23.0   0.9   14    2-16    104-118 (184)
 59 PF00622 SPRY:  SPRY domain;  I  35.2      82  0.0018   17.4   3.9   21   52-72     67-87  (124)
 60 PF02191 OLF:  Olfactomedin-lik  34.8 1.4E+02   0.003   19.8   6.7   44   34-77    200-244 (250)
 61 smart00104 ANATO Anaphylatoxin  34.8      16 0.00035   16.4   0.4   14    3-16      2-16  (35)
 62 PF01607 CBM_14:  Chitin bindin  34.7      19 0.00041   17.0   0.6   15    2-16     30-45  (53)
 63 PHA02885 putative interleukin   33.9   1E+02  0.0022   18.1   4.3   15   58-72     55-69  (135)
 64 KOG4649|consensus               32.6 1.7E+02  0.0037   20.3   5.2   19   59-77     38-56  (354)
 65 PF10282 Lactonase:  Lactonase,  32.0 1.6E+02  0.0035   19.9   9.1   58   19-77    255-317 (345)
 66 KOG0650|consensus               31.2      68  0.0015   24.5   3.1   51   19-72    577-637 (733)
 67 PF05281 Secretogranin_V:  Neur  31.1      22 0.00047   23.1   0.6   15    2-18    102-116 (208)
 68 PF12191 stn_TNFRSF12A:  Tumour  28.8      34 0.00074   20.6   1.1   15    2-16     35-50  (129)
 69 PF01623 Carla_C4:  Carlavirus   28.7      43 0.00093   19.0   1.4    8   11-18     73-80  (91)
 70 PF12093 Corona_NS8:  Coronavir  27.3      32  0.0007   19.9   0.8   19   19-37     43-61  (118)
 71 PF02664 LuxS:  S-Ribosylhomocy  27.3      36 0.00077   21.2   1.0   10    1-10     80-89  (157)
 72 PF00008 EGF:  EGF-like domain   26.9      30 0.00065   15.2   0.5    7    2-8      23-29  (32)
 73 smart00289 WR1 Worm-specific r  25.4      34 0.00073   15.2   0.6    9    2-10     21-29  (38)
 74 KOG1446|consensus               24.3 2.5E+02  0.0055   19.5   5.5   48   18-71    111-158 (311)
 75 PF10015 DUF2258:  Uncharacteri  24.2      80  0.0017   17.2   1.9   15   54-68     61-75  (75)
 76 PF06387 Calcyon:  D1 dopamine   24.2      39 0.00084   21.5   0.8   14    2-16    112-125 (186)
 77 KOG4283|consensus               23.8 2.7E+02  0.0058   19.7   6.7   53   22-76    158-213 (397)
 78 smart00449 SPRY Domain in SPla  23.6 1.4E+02  0.0031   16.4   3.8   24   52-75     66-89  (122)
 79 PRK02260 S-ribosylhomocysteina  23.1      42 0.00092   20.9   0.8   10    1-10     80-89  (158)
 80 PHA02639 EEV host range protei  22.9      43 0.00093   22.8   0.9   15    2-16    177-193 (295)
 81 PF00020 TNFR_c6:  TNFR/NGFR cy  22.5      59  0.0013   14.7   1.1   14    3-16      1-16  (39)
 82 KOG1007|consensus               22.5      91   0.002   21.8   2.4   47   22-69    186-232 (370)
 83 COG1854 LuxS LuxS protein invo  22.5      52  0.0011   20.5   1.1   12    1-12     80-92  (161)
 84 PF02655 ATP-grasp_3:  ATP-gras  22.0      82  0.0018   18.9   1.9   25   49-73    132-156 (161)
 85 TIGR02145 Fib_succ_major Fibro  21.6      31 0.00067   21.5   0.0    9    2-10     59-67  (171)
 86 PF08310 LGFP:  LGFP repeat;  I  21.3      81  0.0017   15.5   1.5   12   61-72     21-32  (54)
 87 PF12860 PAS_7:  PAS fold        20.9 1.3E+02  0.0028   16.5   2.5   19   50-69      3-21  (115)
 88 PF13619 KTSC:  KTSC domain      20.7 1.2E+02  0.0025   15.2   2.1   16   52-67      4-19  (60)
 89 PF03302 VSP:  Giardia variant-  20.1      59  0.0013   23.0   1.1   15    2-16      3-18  (397)

No 1  
>KOG1214|consensus
Probab=99.18  E-value=8.8e-11  Score=86.38  Aligned_cols=69  Identities=26%  Similarity=0.349  Sum_probs=56.9

Q ss_pred             CCCCCCCC---CCcEEEEEeCCcEEEEeCCCCCcee----eEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631         11 DNFTCANG---SQEMLLLVQRTDICKISLDTPDHTN----VMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES   80 (81)
Q Consensus        11 d~~tC~~~---~~~~Ll~s~~~~I~~i~l~~~~~~~----~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s   80 (81)
                      ++.+|...   ...||||+++..|.++++++..+..    .++.+++ +-+++||||+++++|||+|+...+|.|+|
T Consensus       978 p~~~~~v~p~~~gt~LL~aqg~~I~~lplng~~~~K~~ak~~l~~p~-~IiVGidfDC~e~mvyWtDv~g~SI~ras 1053 (1289)
T KOG1214|consen  978 PTPRPDVTPPSVGTFLLYAQGQQIGYLPLNGTRLQKDAAKTLLSLPG-SIIVGIDFDCRERMVYWTDVAGRSISRAS 1053 (1289)
T ss_pred             CCCCCCCcCCCCcceEEEeccceEEEeecCcchhchhhhhceEeccc-ceeeeeecccccceEEEeecCCCcccccc
Confidence            66777722   3569999999999999999887543    2244455 77899999999999999999999999987


No 2  
>KOG1215|consensus
Probab=99.10  E-value=5.6e-10  Score=82.89  Aligned_cols=75  Identities=33%  Similarity=0.603  Sum_probs=66.3

Q ss_pred             CCCCceEEc-CCCCCC--CCCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631          2 FHFYGVRLV-DNFTCA--NGSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~--~~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      -|++||.|. ++  |+  ..+++||+++++.+|++++++..   ....|+.++.+++++|+|..++++||+|...+.|.+
T Consensus       389 ~c~~g~~~~~~~--c~~~~~~~~~l~~s~~~~ir~~~~~~~---~~~~p~~~~~~~~~~d~d~~~~~i~~~d~~~~~i~~  463 (877)
T KOG1215|consen  389 ACSPGYELRLDK--CEASDQPEAFLLFSNRHDIRRISLDCS---DVSRPLEGIKNAVALDFDVLNNRIYWADLSDEKICR  463 (877)
T ss_pred             ecCCCcEeccCC--ceecCCCCcEEEEecCccceecccCCC---cceEEccCCccceEEEEEecCCEEEEEeccCCeEee
Confidence            499999999 87  76  44899999999999999999986   456677788899999999999999999999999998


Q ss_pred             ecC
Q psy13631         79 ESH   81 (81)
Q Consensus        79 ~s~   81 (81)
                      +++
T Consensus       464 ~~~  466 (877)
T KOG1215|consen  464 ASQ  466 (877)
T ss_pred             ecc
Confidence            763


No 3  
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.38  E-value=1.4e-06  Score=41.61  Aligned_cols=32  Identities=28%  Similarity=0.361  Sum_probs=28.0

Q ss_pred             CCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631         49 VGVKHAIAIDWDPVEQHLYWTDDEARNKSGES   80 (81)
Q Consensus        49 ~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s   80 (81)
                      .++..+.++++|+.++++||+|.....|.|+.
T Consensus         6 ~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~   37 (43)
T smart00135        6 EGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN   37 (43)
T ss_pred             CCCCCcCEEEEeecCCEEEEEeCCCCEEEEEe
Confidence            46777888999999999999999999999864


No 4  
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.81  E-value=9.9e-06  Score=35.05  Aligned_cols=15  Identities=27%  Similarity=0.310  Sum_probs=14.1

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      .|++||+|. |+++|.
T Consensus         5 ~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    5 SCPPGYQLSPDGRSCE   20 (24)
T ss_pred             eCCCCCcCCCCCCccc
Confidence            499999999 999998


No 5  
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.61  E-value=3.1e-05  Score=36.61  Aligned_cols=14  Identities=36%  Similarity=0.411  Sum_probs=11.1

Q ss_pred             CCCCceEEc-CCCCC
Q psy13631          2 FHFYGVRLV-DNFTC   15 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC   15 (81)
                      .||+||+|+ |+++|
T Consensus        22 ~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen   22 SCPPGYKLAEDGRTC   36 (36)
T ss_dssp             E-STTEEE-TTSSSE
T ss_pred             ECCCCCEECcCCCCC
Confidence            499999999 99998


No 6  
>PF00058 Ldl_recept_b:  Low-density lipoprotein receptor repeat class B;  InterPro: IPR000033  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=97.07  E-value=0.00076  Score=32.66  Aligned_cols=17  Identities=24%  Similarity=0.491  Sum_probs=15.0

Q ss_pred             CEEEEEECCCC-cEEeec
Q psy13631         64 QHLYWTDDEAR-NKSGES   80 (81)
Q Consensus        64 ~~iyW~d~~~~-~I~r~s   80 (81)
                      ++|||+|...+ .|.++.
T Consensus         1 ~~iYWtD~~~~~~I~~a~   18 (42)
T PF00058_consen    1 GKIYWTDWSQDPSIERAN   18 (42)
T ss_dssp             TEEEEEETTTTEEEEEEE
T ss_pred             CEEEEEECCCCcEEEEEE
Confidence            58999999999 999874


No 7  
>KOG1215|consensus
Probab=96.70  E-value=0.0028  Score=47.80  Aligned_cols=40  Identities=35%  Similarity=0.411  Sum_probs=34.1

Q ss_pred             CCCCceEEc-CCCCCCCCCCcEEEEEeCCcEEEEeCCCCCce
Q psy13631          2 FHFYGVRLV-DNFTCANGSQEMLLLVQRTDICKISLDTPDHT   42 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~~~~~~~Ll~s~~~~I~~i~l~~~~~~   42 (81)
                      -||+|+.|. ++++|.. +..+++++....|..+..+.....
T Consensus       694 ~c~~~~~l~~~~~~C~~-~~~~~~~~~~~~~~~~~~~~~~~~  734 (877)
T KOG1215|consen  694 ACPEGYRLSPDGKSCSS-PEGYLLITSRTGIPCISLDSELSP  734 (877)
T ss_pred             eCCCCCeecCCCCeecC-cccccccccccccceeecCcccCC
Confidence            499999999 9999997 777889999999999888775443


No 8  
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=96.30  E-value=0.0027  Score=29.44  Aligned_cols=14  Identities=29%  Similarity=0.349  Sum_probs=10.4

Q ss_pred             CCCCceEEcCCCCC
Q psy13631          2 FHFYGVRLVDNFTC   15 (81)
Q Consensus         2 ~~~~Gy~L~d~~tC   15 (81)
                      +||+||.|+++.-|
T Consensus        21 ~CPeGyIlde~~~c   34 (34)
T PF09064_consen   21 FCPEGYILDEGSMC   34 (34)
T ss_pred             eCCCceEecCCccC
Confidence            79999999844333


No 9  
>KOG1214|consensus
Probab=96.19  E-value=0.011  Score=45.01  Aligned_cols=57  Identities=19%  Similarity=0.353  Sum_probs=43.5

Q ss_pred             EEEEEe--CCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631         22 MLLLVQ--RTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES   80 (81)
Q Consensus        22 ~Ll~s~--~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s   80 (81)
                      .+.++.  +.+|++-+|++.+.+.++-  +++.++-+|+.|....-|||+|...++|.-|+
T Consensus      1038 mvyWtDv~g~SI~rasL~G~Ep~ti~n--~~L~SPEGiAVDh~~Rn~ywtDS~lD~IevA~ 1096 (1289)
T KOG1214|consen 1038 MVYWTDVAGRSISRASLEGAEPETIVN--SGLISPEGIAVDHIRRNMYWTDSVLDKIEVAL 1096 (1289)
T ss_pred             eEEEeecCCCccccccccCCCCceeec--ccCCCccceeeeeccceeeeeccccchhheee
Confidence            444444  5688898998776544332  58888888888889999999999999987664


No 10 
>PF02822 Antistasin:  Antistasin family;  InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=93.64  E-value=0.043  Score=23.93  Aligned_cols=12  Identities=25%  Similarity=0.553  Sum_probs=6.1

Q ss_pred             CCCCCceEEc-CC
Q psy13631          1 MFHFYGVRLV-DN   12 (81)
Q Consensus         1 ~~~~~Gy~L~-d~   12 (81)
                      |.||.||+++ +|
T Consensus         8 ~~C~~Gf~~D~~G   20 (26)
T PF02822_consen    8 MYCPYGFKTDENG   20 (26)
T ss_dssp             ---TT-EEE-TTS
T ss_pred             CcCCCcccCCCCC
Confidence            6899999998 66


No 11 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=93.51  E-value=0.17  Score=32.32  Aligned_cols=25  Identities=28%  Similarity=0.555  Sum_probs=22.0

Q ss_pred             EEEEeCCCCEEEEEECCCCcEEeec
Q psy13631         56 AIDWDPVEQHLYWTDDEARNKSGES   80 (81)
Q Consensus        56 ald~d~~~~~iyW~d~~~~~I~r~s   80 (81)
                      +.-||..++.+||+|+..+.|.|..
T Consensus         4 gp~~d~~~g~l~~~D~~~~~i~~~~   28 (246)
T PF08450_consen    4 GPVWDPRDGRLYWVDIPGGRIYRVD   28 (246)
T ss_dssp             EEEEETTTTEEEEEETTTTEEEEEE
T ss_pred             ceEEECCCCEEEEEEcCCCEEEEEE
Confidence            4678999999999999999999864


No 12 
>KOG4659|consensus
Probab=92.50  E-value=0.63  Score=37.60  Aligned_cols=62  Identities=24%  Similarity=0.235  Sum_probs=48.9

Q ss_pred             CCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEeec
Q psy13631         19 SQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGES   80 (81)
Q Consensus        19 ~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~s   80 (81)
                      +.-.|++..-+-||+|.-+++..+-..+.+.+...-.=++.++.++.||-+|...+.|.|.+
T Consensus       374 ~DGSl~VGDfNyIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp~s~qv~rv~  435 (1899)
T KOG4659|consen  374 PDGSLIVGDFNYIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDPLSKQVWRVS  435 (1899)
T ss_pred             CCCcEEEccchheeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCCCcceEEEec
Confidence            34456667778999999998765545554456666778999999999999999999999976


No 13 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=91.61  E-value=0.088  Score=25.17  Aligned_cols=14  Identities=36%  Similarity=0.325  Sum_probs=11.7

Q ss_pred             CCCCceEEc-CCCCC
Q psy13631          2 FHFYGVRLV-DNFTC   15 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC   15 (81)
                      .|++||++. ++++|
T Consensus        28 ~C~~Gy~~~~~~~~C   42 (42)
T PF07645_consen   28 SCPPGYELNDDGTTC   42 (42)
T ss_dssp             EESTTEEECTTSSEE
T ss_pred             eCCCCcEECCCCCcC
Confidence            389999988 88776


No 14 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=91.26  E-value=0.88  Score=30.92  Aligned_cols=44  Identities=18%  Similarity=0.400  Sum_probs=30.4

Q ss_pred             CcEEEEeCCCCC------------ceeeEEEeCCccceEEEEEeCCCCEEEEEECCCC
Q psy13631         29 TDICKISLDTPD------------HTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEAR   74 (81)
Q Consensus        29 ~~I~~i~l~~~~------------~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~   74 (81)
                      -.|.+++.+++.            ...+.  ..|+.|+.+++||+.++.||-+|...+
T Consensus       148 G~ilri~~dG~~p~dnP~~~~~~~~~~i~--A~GlRN~~~~~~d~~tg~l~~~d~G~~  203 (331)
T PF07995_consen  148 GKILRIDPDGSIPADNPFVGDDGADSEIY--AYGLRNPFGLAFDPNTGRLWAADNGPD  203 (331)
T ss_dssp             TEEEEEETTSSB-TTSTTTTSTTSTTTEE--EE--SEEEEEEEETTTTEEEEEEE-SS
T ss_pred             ceEEEecccCcCCCCCccccCCCceEEEE--EeCCCccccEEEECCCCcEEEEccCCC
Confidence            368888887651            11222  359999999999999999999997654


No 15 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=88.69  E-value=2.9  Score=26.63  Aligned_cols=45  Identities=20%  Similarity=0.231  Sum_probs=33.5

Q ss_pred             EEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631         31 ICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKSGE   79 (81)
Q Consensus        31 I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~   79 (81)
                      |.+++.+. . ...+.  .++..+-+|.|++.++.+|++|...+.|.+.
T Consensus       117 v~~~~~~~-~-~~~~~--~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~  161 (246)
T PF08450_consen  117 VYRIDPDG-K-VTVVA--DGLGFPNGIAFSPDGKTLYVADSFNGRIWRF  161 (246)
T ss_dssp             EEEEETTS-E-EEEEE--EEESSEEEEEEETTSSEEEEEETTTTEEEEE
T ss_pred             eEEECCCC-e-EEEEe--cCcccccceEECCcchheeecccccceeEEE
Confidence            66677662 2 22222  4677778999999999999999999999874


No 16 
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=87.41  E-value=3.7  Score=29.12  Aligned_cols=26  Identities=23%  Similarity=0.538  Sum_probs=23.2

Q ss_pred             CCccceEEEEEeCCCCEEEEEECCCC
Q psy13631         49 VGVKHAIAIDWDPVEQHLYWTDDEAR   74 (81)
Q Consensus        49 ~~~~~~~ald~d~~~~~iyW~d~~~~   74 (81)
                      -|+.|+++++||+.++.+|-++...+
T Consensus       236 ~G~RN~qGl~w~P~tg~Lw~~e~g~d  261 (399)
T COG2133         236 YGHRNPQGLAWHPVTGALWTTEHGPD  261 (399)
T ss_pred             eccCCccceeecCCCCcEEEEecCCC
Confidence            48999999999999999999987763


No 17 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=85.05  E-value=4.8  Score=27.65  Aligned_cols=44  Identities=11%  Similarity=0.207  Sum_probs=30.3

Q ss_pred             cEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631         30 DICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKS   77 (81)
Q Consensus        30 ~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~   77 (81)
                      .|.+++.++...+ ++  ..++.++.+++||. ++.+|.+|.......
T Consensus       165 ~i~r~~pdg~~~e-~~--a~G~rnp~Gl~~d~-~G~l~~tdn~~~~~~  208 (367)
T TIGR02604       165 GLFRYNPDGGKLR-VV--AHGFQNPYGHSVDS-WGDVFFCDNDDPPLC  208 (367)
T ss_pred             eEEEEecCCCeEE-EE--ecCcCCCccceECC-CCCEEEEccCCCcee
Confidence            4777777765432 22  36899999999998 567888887544333


No 18 
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=82.72  E-value=2.9  Score=17.99  Aligned_cols=26  Identities=12%  Similarity=0.166  Sum_probs=20.3

Q ss_pred             cceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         52 KHAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        52 ~~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      ..+.+|..| .++.||=+|.....|..
T Consensus         2 ~~P~gvav~-~~g~i~VaD~~n~rV~v   27 (28)
T PF01436_consen    2 NYPHGVAVD-SDGNIYVADSGNHRVQV   27 (28)
T ss_dssp             SSEEEEEEE-TTSEEEEEECCCTEEEE
T ss_pred             cCCcEEEEe-CCCCEEEEECCCCEEEE
Confidence            356677788 78899999988887753


No 19 
>KOG1520|consensus
Probab=80.92  E-value=6.2  Score=27.81  Aligned_cols=52  Identities=19%  Similarity=0.241  Sum_probs=32.9

Q ss_pred             EEEEEeCC-cEEEEeCCCCCceeeEEEeCCcc--ceEEEEEeCCCCEEEEEECCCC
Q psy13631         22 MLLLVQRT-DICKISLDTPDHTNVMLPVVGVK--HAIAIDWDPVEQHLYWTDDEAR   74 (81)
Q Consensus        22 ~Ll~s~~~-~I~~i~l~~~~~~~~~~~~~~~~--~~~ald~d~~~~~iyW~d~~~~   74 (81)
                      =|++|... .+..++..+...+....++.|..  -.=++|.|. ++.|||+|...+
T Consensus       128 dL~VaDAYlGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk  182 (376)
T KOG1520|consen  128 DLYVADAYLGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSK  182 (376)
T ss_pred             eEEEEecceeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccc
Confidence            46667664 67777777665333333333422  234688888 899999998653


No 20 
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=80.68  E-value=6  Score=28.50  Aligned_cols=49  Identities=12%  Similarity=0.224  Sum_probs=33.5

Q ss_pred             cEEEEeCCCCCce--------eeEEEeCCccceEEEEEeCCCCEEEEEECCC---CcEEee
Q psy13631         30 DICKISLDTPDHT--------NVMLPVVGVKHAIAIDWDPVEQHLYWTDDEA---RNKSGE   79 (81)
Q Consensus        30 ~I~~i~l~~~~~~--------~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~---~~I~r~   79 (81)
                      .|.||+.+++.-.        ..-+-.-|++|+.+++||+ ++.||.+|...   +.|.+.
T Consensus       200 kILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp~~~DEiN~I  259 (454)
T TIGR03606       200 KVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGPNSDDELNII  259 (454)
T ss_pred             EEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCCCCCcEEEEe
Confidence            6888888865210        0112235899999999999 78999999865   445543


No 21 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=79.80  E-value=13  Score=29.54  Aligned_cols=57  Identities=14%  Similarity=0.153  Sum_probs=36.1

Q ss_pred             EEEEEeC--CcEEEEeCCCCCceeeEE-----------EeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         22 MLLLVQR--TDICKISLDTPDHTNVML-----------PVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        22 ~Ll~s~~--~~I~~i~l~~~~~~~~~~-----------~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      -|+++..  +.|+.+++++........           ....+..+.+|.+|..++.||++|.....|.+
T Consensus       581 ~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n~~Ir~  650 (1057)
T PLN02919        581 RLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTENHALRE  650 (1057)
T ss_pred             eEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCCceEEE
Confidence            4555553  578888877542111100           00113457899999999999999998877765


No 22 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=79.32  E-value=0.83  Score=22.80  Aligned_cols=15  Identities=33%  Similarity=0.281  Sum_probs=12.0

Q ss_pred             CCCCceEEcCCCCCC
Q psy13631          2 FHFYGVRLVDNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~d~~tC~   16 (81)
                      +|++||.+.++..|.
T Consensus        36 ~C~~G~v~~~~~~CV   50 (55)
T PF01826_consen   36 FCPPGYVRNDNGRCV   50 (55)
T ss_dssp             EETTTEEEETTSEEE
T ss_pred             CCCCCeeEcCCCCEE
Confidence            699999988447776


No 23 
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=78.16  E-value=2.5  Score=28.77  Aligned_cols=23  Identities=26%  Similarity=0.388  Sum_probs=19.9

Q ss_pred             EEEeCCCCEEEEEECCCCcEEee
Q psy13631         57 IDWDPVEQHLYWTDDEARNKSGE   79 (81)
Q Consensus        57 ld~d~~~~~iyW~d~~~~~I~r~   79 (81)
                      --||...+.+||+|+..++|.|.
T Consensus        30 P~w~~~~~~L~w~DI~~~~i~r~   52 (307)
T COG3386          30 PVWDPDRGALLWVDILGGRIHRL   52 (307)
T ss_pred             ccCcCCCCEEEEEeCCCCeEEEe
Confidence            34889999999999999999874


No 24 
>TIGR01492 CPW_WPC Plasmodium falciparum CPW-WPC domain. This model represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats, and is known so far only in Plasmodium falciparum. It is named for motifs of CPxxW and (less well conserved) WPC.
Probab=75.38  E-value=1.9  Score=22.39  Aligned_cols=16  Identities=13%  Similarity=0.011  Sum_probs=13.7

Q ss_pred             CCCCceEEc-CCCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCAN   17 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~~   17 (81)
                      .||.|+.+. |+..|..
T Consensus         9 ~CP~~W~~~~~~~~C~a   25 (62)
T TIGR01492         9 PCPENWIQKNDKYGCIA   25 (62)
T ss_pred             cCCccceecCCCCeeCC
Confidence            699999999 8777993


No 25 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=75.38  E-value=1.3  Score=15.98  Aligned_cols=7  Identities=14%  Similarity=-0.330  Sum_probs=4.7

Q ss_pred             CCCCceE
Q psy13631          2 FHFYGVR    8 (81)
Q Consensus         2 ~~~~Gy~    8 (81)
                      .|++||.
T Consensus         3 ~C~~G~~    9 (13)
T PF12661_consen    3 QCPPGWT    9 (13)
T ss_dssp             EE-TTEE
T ss_pred             cCcCCCc
Confidence            3899996


No 26 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=73.55  E-value=31  Score=27.46  Aligned_cols=58  Identities=9%  Similarity=0.143  Sum_probs=36.7

Q ss_pred             cEEEEEeC--CcEEEEeCCCCCceeeEE--------------EeCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         21 EMLLLVQR--TDICKISLDTPDHTNVML--------------PVVGVKHAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        21 ~~Ll~s~~--~~I~~i~l~~~~~~~~~~--------------~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      ..|.++..  +.|+.+++.+..-....-              ..+.+..+.+|.+|+.++.+|++|.....|.+
T Consensus       636 n~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v  709 (1057)
T PLN02919        636 NLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWE  709 (1057)
T ss_pred             CEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEE
Confidence            45667665  568888876542211110              00013456789999989999999988777754


No 27 
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=72.00  E-value=20  Score=24.44  Aligned_cols=29  Identities=17%  Similarity=0.236  Sum_probs=25.1

Q ss_pred             ccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631         51 VKHAIAIDWDPVEQHLYWTDDEARNKSGE   79 (81)
Q Consensus        51 ~~~~~ald~d~~~~~iyW~d~~~~~I~r~   79 (81)
                      +..+=+|.|++.++.+|++|.....|.|.
T Consensus       162 ~~~~NGla~SpDg~tly~aDT~~~~i~r~  190 (307)
T COG3386         162 LTIPNGLAFSPDGKTLYVADTPANRIHRY  190 (307)
T ss_pred             EEecCceEECCCCCEEEEEeCCCCeEEEE
Confidence            56666788999999999999999999885


No 28 
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity [].   Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity.   Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL.   Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo [].  This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=69.24  E-value=17  Score=20.11  Aligned_cols=31  Identities=23%  Similarity=0.215  Sum_probs=26.8

Q ss_pred             eCCccceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         48 VVGVKHAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        48 ~~~~~~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      +.++..+-+|.+++..++||=++...++|.-
T Consensus        50 a~g~~~aNGI~~s~~~k~lyVa~~~~~~I~v   80 (86)
T PF01731_consen   50 ASGFSFANGIAISPDKKYLYVASSLAHSIHV   80 (86)
T ss_pred             eccCCCCceEEEcCCCCEEEEEeccCCeEEE
Confidence            3588888899999999999999999888764


No 29 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=69.08  E-value=33  Score=23.30  Aligned_cols=55  Identities=18%  Similarity=0.207  Sum_probs=35.3

Q ss_pred             EEEEe--CCcEEEEeCCCCCce-eeEEEeCCcc-ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         23 LLLVQ--RTDICKISLDTPDHT-NVMLPVVGVK-HAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        23 Ll~s~--~~~I~~i~l~~~~~~-~~~~~~~~~~-~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      ++++.  +..|.++.++..... ....-+.+.. +++.|.+++. +.||-++-..++|+|
T Consensus       273 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~pD-G~Lyv~~d~~G~iyR  331 (331)
T PF07995_consen  273 LFVADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQGPD-GALYVSDDSDGKIYR  331 (331)
T ss_dssp             EEEEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEETT-SEEEEEE-TTTTEEE
T ss_pred             EEEecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEcCC-CeEEEEECCCCeEeC
Confidence            44555  358888888754221 1111224555 7889999984 789999889999987


No 30 
>PF04942 CC:  CC domain;  InterPro: IPR007026 This short domain contains four conserved cysteines that are probably required for the formation of two disulphide bonds. The domain is only found in proteins from Caenorhabditis species. The domain is named after the characteristic CC motif.
Probab=68.96  E-value=3.6  Score=19.15  Aligned_cols=14  Identities=21%  Similarity=0.311  Sum_probs=10.3

Q ss_pred             CCCCceEEc-CCCCC
Q psy13631          2 FHFYGVRLV-DNFTC   15 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC   15 (81)
                      .||.||.|- .+.=|
T Consensus        19 ~CP~G~~~i~g~~CC   33 (36)
T PF04942_consen   19 VCPSGYTVITGNGCC   33 (36)
T ss_pred             cCCCCCEEECCCccC
Confidence            599999998 55433


No 31 
>KOG0315|consensus
Probab=68.11  E-value=15  Score=24.98  Aligned_cols=47  Identities=15%  Similarity=0.228  Sum_probs=33.0

Q ss_pred             CcEEEEEeCCcEEEEeCCCCCceeeEEEeCC-ccceEEEEEeCCCCEEE
Q psy13631         20 QEMLLLVQRTDICKISLDTPDHTNVMLPVVG-VKHAIAIDWDPVEQHLY   67 (81)
Q Consensus        20 ~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~-~~~~~ald~d~~~~~iy   67 (81)
                      ..+|..+-...||..++++.+..+++. ..+ -+|+.++.|.+....+|
T Consensus        52 k~~LAaa~~qhvRlyD~~S~np~Pv~t-~e~h~kNVtaVgF~~dgrWMy   99 (311)
T KOG0315|consen   52 KKDLAAAGNQHVRLYDLNSNNPNPVAT-FEGHTKNVTAVGFQCDGRWMY   99 (311)
T ss_pred             cchhhhccCCeeEEEEccCCCCCceeE-EeccCCceEEEEEeecCeEEE
Confidence            345655556689999998876544332 233 38899999999888777


No 32 
>PF02425 GBP_PSP:  Paralytic/GBP/PSP peptide;  InterPro: IPR003463 This family includes insect peptides that are short (23 amino acids) and contain 1 disulphide bridge. The family includes growth-blocking peptide (GBP) of Pseudaletia separata (Oriental armyworm) and the paralytic peptides from Manduca sexta (Tobacco hawkmoth), Heliothis virescens (Noctuid moth), and Spodoptera exigua (Beet armyworm) [] as well as plasmatocyte-spreading peptide (PSP1) []. These peptides function to halt metamorphosis from larvae to pupae.; PDB: 1V28_A 2DJC_A 2EQQ_A 2EQH_A 2EQT_A 1BQF_A 2DJ9_A 1HRL_A 1IRR_A 1B5N_A ....
Probab=66.66  E-value=4.1  Score=16.95  Aligned_cols=14  Identities=21%  Similarity=0.166  Sum_probs=11.1

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      -|.+||+-. || .||
T Consensus         6 gc~~gy~rtadg-rck   20 (23)
T PF02425_consen    6 GCATGYMRTADG-RCK   20 (23)
T ss_dssp             SSSTTEEEETTT-EEE
T ss_pred             cccccceEcCCc-ccc
Confidence            499999888 76 576


No 33 
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=65.62  E-value=2.6  Score=20.12  Aligned_cols=15  Identities=27%  Similarity=0.377  Sum_probs=11.7

Q ss_pred             CCCCceEEc-C-CCCCC
Q psy13631          2 FHFYGVRLV-D-NFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d-~~tC~   16 (81)
                      .|.+||+|. . ..+|.
T Consensus        29 ~C~~Gy~l~g~~~~~C~   45 (57)
T smart00032       29 SCNPGYTLIGSSTITCL   45 (57)
T ss_pred             EcCCCCEEcCCCeeEEC
Confidence            589999988 4 56787


No 34 
>PF13948 DUF4215:  Domain of unknown function (DUF4215)
Probab=64.43  E-value=5.1  Score=19.59  Aligned_cols=15  Identities=27%  Similarity=0.240  Sum_probs=13.4

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      .|..||.|. ...+|.
T Consensus         3 ~C~~G~~l~~~~~~C~   18 (47)
T PF13948_consen    3 QCQDGYYLNQNNNKCE   18 (47)
T ss_pred             cccCccEEcCCCCccc
Confidence            589999999 888997


No 35 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=64.01  E-value=4.2  Score=18.05  Aligned_cols=6  Identities=17%  Similarity=-0.158  Sum_probs=3.0

Q ss_pred             CCCceE
Q psy13631          3 HFYGVR    8 (81)
Q Consensus         3 ~~~Gy~    8 (81)
                      |+.||.
T Consensus        28 C~~g~~   33 (39)
T smart00179       28 CPPGYT   33 (39)
T ss_pred             CCCCCc
Confidence            455554


No 36 
>PF00396 Granulin:  Granulin;  InterPro: IPR000118 Metazoan granulins [] are a family of cysteine-rich peptides of about 6 Kd which may have multiple biological activity. A precursor protein (known as acrogranin) potentially encodes seven different forms of granulin (grnA to grnG) which are probably released by post-translational proteolytic processing. Granulins are evolutionary related to a PMP-D1, a peptide extracted from the pars intercerebralis of migratory locusts []. A schematic representation of the structure of a granulin is shown below:  xxxCxxxxxCxxxxxCCxxxxxxxxCCxxxxxxCCxxxxxCCxxxxxCxxxxxxCx 'C': conserved cysteine probably involved in a disulphide bond.   In plants a granulin domain is often associated with the C terminus of cysteine proteases belong to the MEROPS peptidase family C1, subfamily C1A (papain).; PDB: 1I8Y_A 1QGM_A 1I8X_A 2JYT_A 2JYU_A 1FWO_A 2JYV_A 2JYE_A 1G26_A.
Probab=62.83  E-value=4  Score=19.82  Aligned_cols=14  Identities=21%  Similarity=0.138  Sum_probs=11.0

Q ss_pred             CCCceEEc-CCCCCC
Q psy13631          3 HFYGVRLV-DNFTCA   16 (81)
Q Consensus         3 ~~~Gy~L~-d~~tC~   16 (81)
                      ||.||.=+ ++.+|.
T Consensus        28 CP~G~~C~~~~~~C~   42 (43)
T PF00396_consen   28 CPHGYTCDPDGGSCI   42 (43)
T ss_dssp             SSTTSEEECTTTEEE
T ss_pred             CCCcCEECCCCCEEc
Confidence            89999877 777774


No 37 
>KOG4499|consensus
Probab=58.01  E-value=9.2  Score=25.82  Aligned_cols=23  Identities=30%  Similarity=0.413  Sum_probs=17.2

Q ss_pred             EEEEeCCCCEEEEEECCCCcEEe
Q psy13631         56 AIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        56 ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      +..||..++.+||+|+..+.+.|
T Consensus        19 gp~w~~~~~sLl~VDi~ag~v~r   41 (310)
T KOG4499|consen   19 GPHWDVERQSLLYVDIEAGEVHR   41 (310)
T ss_pred             CCceEEecceEEEEEeccCceeh
Confidence            34577888889998887776654


No 38 
>PF09717 CPW_WPC:  Plasmodium falciparum domain of unknown function (CPW_WPC);  InterPro: IPR006387 This entry represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats. It is named for motifs of CPxxW and (less well conserved) WPC. Its function is unknown.
Probab=57.40  E-value=4.8  Score=20.57  Aligned_cols=16  Identities=25%  Similarity=0.140  Sum_probs=12.7

Q ss_pred             CCCCceEEcCCCCCCC
Q psy13631          2 FHFYGVRLVDNFTCAN   17 (81)
Q Consensus         2 ~~~~Gy~L~d~~tC~~   17 (81)
                      .||.|+.+.++..|..
T Consensus         8 ~CP~~W~~~~~~~C~a   23 (60)
T PF09717_consen    8 PCPEGWILSDDGSCIA   23 (60)
T ss_pred             ccccccccCCCCeEcC
Confidence            5999999666677993


No 39 
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=56.54  E-value=5  Score=19.23  Aligned_cols=15  Identities=33%  Similarity=0.432  Sum_probs=11.6

Q ss_pred             CCCCceEEc-C-CCCCC
Q psy13631          2 FHFYGVRLV-D-NFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d-~~tC~   16 (81)
                      .|.+||+|. . ..+|.
T Consensus        28 ~C~~Gy~~~g~~~~~C~   44 (57)
T cd00033          28 SCNEGYTLVGSSTITCT   44 (57)
T ss_pred             ECCCCCeEeCCCeeEEC
Confidence            488999988 4 46787


No 40 
>PF14339 DUF4394:  Domain of unknown function (DUF4394)
Probab=55.88  E-value=44  Score=22.14  Aligned_cols=25  Identities=20%  Similarity=0.360  Sum_probs=18.5

Q ss_pred             cceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631         52 KHAIAIDWDPVEQHLYWTDDEARNKS   77 (81)
Q Consensus        52 ~~~~ald~d~~~~~iyW~d~~~~~I~   77 (81)
                      ...++|||.+.++.+|=.. ....|+
T Consensus        27 e~l~GID~Rpa~G~LYgl~-~~g~lY   51 (236)
T PF14339_consen   27 ESLVGIDFRPANGQLYGLG-STGRLY   51 (236)
T ss_pred             CeEEEEEeecCCCCEEEEe-CCCcEE
Confidence            5689999999999999663 334444


No 41 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=55.35  E-value=54  Score=21.67  Aligned_cols=33  Identities=18%  Similarity=0.340  Sum_probs=20.8

Q ss_pred             EEeCCcc-ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         46 LPVVGVK-HAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        46 ~~~~~~~-~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      .+++++. ++-+|.|++.++++|=+.-....|..
T Consensus        15 ~~l~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~e   48 (248)
T PF06977_consen   15 KPLPGILDELSGLTYNPDTGTLFAVQDEPGEIYE   48 (248)
T ss_dssp             EE-TT--S-EEEEEEETTTTEEEEEETTTTEEEE
T ss_pred             eECCCccCCccccEEcCCCCeEEEEECCCCEEEE
Confidence            3667766 48899999988887666555555554


No 42 
>PF14280 DUF4365:  Domain of unknown function (DUF4365)
Probab=50.36  E-value=38  Score=19.60  Aligned_cols=24  Identities=21%  Similarity=0.303  Sum_probs=18.6

Q ss_pred             ceEEEEEeCCCCEEEEEECCCCcE
Q psy13631         53 HAIAIDWDPVEQHLYWTDDEARNK   76 (81)
Q Consensus        53 ~~~ald~d~~~~~iyW~d~~~~~I   76 (81)
                      .++.+=+|..++.+||.+.....+
T Consensus        84 PvilV~~~~~~~~~yW~~l~~~~~  107 (138)
T PF14280_consen   84 PVILVLVDPDSDCAYWVSLKGYPI  107 (138)
T ss_pred             CEEEEEEeCCCCEEEEEEcccCcc
Confidence            355667899999999999875544


No 43 
>PF13947 GUB_WAK_bind:  Wall-associated receptor kinase galacturonan-binding
Probab=49.98  E-value=29  Score=19.33  Aligned_cols=33  Identities=24%  Similarity=0.284  Sum_probs=18.0

Q ss_pred             CCceEEc-CCCCCCCCCCcEEEEEeCCcEEEEeCCCC
Q psy13631          4 FYGVRLV-DNFTCANGSQEMLLLVQRTDICKISLDTP   39 (81)
Q Consensus         4 ~~Gy~L~-d~~tC~~~~~~~Ll~s~~~~I~~i~l~~~   39 (81)
                      .+||+|. +..+   .+..+++.....+|..|++...
T Consensus        26 ~~~F~L~C~~~~---~~~~l~l~~~~~~V~~I~~~~~   59 (106)
T PF13947_consen   26 DPGFELTCNNNT---SPPKLLLSSGNYEVLSISYENG   59 (106)
T ss_pred             CCCcEEECCCCC---CCceeEecCCcEEEEEEecCCC
Confidence            3678876 4322   2333444344457777887654


No 44 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=49.86  E-value=6.9  Score=18.05  Aligned_cols=12  Identities=33%  Similarity=0.423  Sum_probs=6.5

Q ss_pred             CCCceEEcCCCCC
Q psy13631          3 HFYGVRLVDNFTC   15 (81)
Q Consensus         3 ~~~Gy~L~d~~tC   15 (81)
                      |++||.. ||..|
T Consensus        25 C~~Gy~G-dG~~C   36 (36)
T PF12947_consen   25 CKPGYEG-DGFFC   36 (36)
T ss_dssp             E-CEEEC-CSTCE
T ss_pred             CCCCCcc-CCcCC
Confidence            7788753 45444


No 45 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=49.04  E-value=76  Score=20.98  Aligned_cols=54  Identities=13%  Similarity=0.103  Sum_probs=29.6

Q ss_pred             EEEEe--CCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcE
Q psy13631         23 LLLVQ--RTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNK   76 (81)
Q Consensus        23 Ll~s~--~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I   76 (81)
                      +++++  ...|..++++....-..+-.++...++..+.+++..+++|.+....+.|
T Consensus         4 ~y~~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i   59 (330)
T PRK11028          4 VYIASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRV   59 (330)
T ss_pred             EEEEcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcE
Confidence            44443  3577777775322111111112234566777888888888876655555


No 46 
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=46.28  E-value=17  Score=17.15  Aligned_cols=16  Identities=19%  Similarity=0.069  Sum_probs=12.4

Q ss_pred             CCCCCceEEc-CCCCCC
Q psy13631          1 MFHFYGVRLV-DNFTCA   16 (81)
Q Consensus         1 ~~~~~Gy~L~-d~~tC~   16 (81)
                      +.||.|+.-. ..+.|.
T Consensus        31 ~~C~~g~~f~~~~~~C~   47 (56)
T smart00494       31 GSCPAGLVFDPATQTCD   47 (56)
T ss_pred             ccCcCCCeECCCCCCcC
Confidence            4688888777 777887


No 47 
>PF06045 Rhamnogal_lyase:  Rhamnogalacturonate lyase family;  InterPro: IPR010325 Rhamnogalacturonate lyase degrades the rhamnogalacturonan I (RG-I) backbone of pectin []. This family contains mainly members from plants, but also contains the plant pathogen Erwinia chrysanthemi.
Probab=45.56  E-value=48  Score=21.49  Aligned_cols=42  Identities=14%  Similarity=0.379  Sum_probs=25.6

Q ss_pred             eCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEE
Q psy13631         27 QRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWT   69 (81)
Q Consensus        27 ~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~   69 (81)
                      -.+.|..+.+......-.-+...|+.+...+. +...++-||-
T Consensus        20 ldNGiVqVtls~p~G~VtgIkYnGi~NLle~~-n~e~nrGYwD   61 (203)
T PF06045_consen   20 LDNGIVQVTLSKPGGIVTGIKYNGIDNLLEVA-NKENNRGYWD   61 (203)
T ss_pred             EECCEEEEEEcCCCceEEEEEECCEehhhccc-CcccCCceEE
Confidence            44568888887643222222235666665555 7788889995


No 48 
>PF12075 KN_motif:  KN motif;  InterPro: IPR021939  This small motif is found at the N terminus of Kank proteins and has been called the KN (for Kank N-terminal) motif. This protein is found in eukaryotes. Proteins in this family are typically between 413 to 1202 amino acids in length. This protein is found associated with PF00023 from PFAM. This protein has two conserved sequence motifs: TPYG and LDLDF. Kank1 was obtained by positional cloning of a tumor suppressor gene in renal cell carcinoma, while the other members were found by homology search. The family is involved in the regulation of actin polymerisation and cell motility through signaling pathways containing PI3K/Akt and/or unidentified modulators/effectors []. 
Probab=45.22  E-value=13  Score=17.73  Aligned_cols=15  Identities=20%  Similarity=0.051  Sum_probs=12.0

Q ss_pred             CCCCceEEc-CC-CCCC
Q psy13631          2 FHFYGVRLV-DN-FTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~-~tC~   16 (81)
                      .||-||.|+ |- +.|.
T Consensus         5 ~tPYGyhiDLDFvkyve   21 (39)
T PF12075_consen    5 ETPYGYHIDLDFVKYVE   21 (39)
T ss_pred             cCCcceeecchHHHHHH
Confidence            479999999 84 6677


No 49 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=44.13  E-value=57  Score=18.14  Aligned_cols=31  Identities=10%  Similarity=0.045  Sum_probs=25.7

Q ss_pred             CCccceEEEEEeCCCCEEEEEECCCCcEEee
Q psy13631         49 VGVKHAIAIDWDPVEQHLYWTDDEARNKSGE   79 (81)
Q Consensus        49 ~~~~~~~ald~d~~~~~iyW~d~~~~~I~r~   79 (81)
                      .++.-+-+|.....+..|..++....+|.|.
T Consensus        54 ~~L~fpNGVals~d~~~vlv~Et~~~Ri~ry   84 (89)
T PF03088_consen   54 DGLYFPNGVALSPDESFVLVAETGRYRILRY   84 (89)
T ss_dssp             EEESSEEEEEE-TTSSEEEEEEGGGTEEEEE
T ss_pred             hCCCccCeEEEcCCCCEEEEEeccCceEEEE
Confidence            5788888888899999999999988888873


No 50 
>PF00084 Sushi:  Sushi domain (SCR repeat);  InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13.  Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=42.73  E-value=7  Score=18.72  Aligned_cols=15  Identities=27%  Similarity=0.321  Sum_probs=11.1

Q ss_pred             CCCCceEEc--CCCCCC
Q psy13631          2 FHFYGVRLV--DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~--d~~tC~   16 (81)
                      .|.+||+|.  ...+|.
T Consensus        28 ~C~~Gy~~~g~~~~~C~   44 (56)
T PF00084_consen   28 SCNPGYELSGSSTITCQ   44 (56)
T ss_dssp             EESTTEEEESSSEEEEE
T ss_pred             cCCCCCEecCCCEEEEC
Confidence            488999988  446784


No 51 
>PF14761 HPS3_N:  Hermansky-Pudlak syndrome 3
Probab=42.67  E-value=96  Score=20.31  Aligned_cols=49  Identities=8%  Similarity=0.163  Sum_probs=31.0

Q ss_pred             cEEEE-EeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEE
Q psy13631         21 EMLLL-VQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTD   70 (81)
Q Consensus        21 ~~Ll~-s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d   70 (81)
                      .+|++ +.+..|...+|..+.. ......+.++.+..+.|...+++|-=.+
T Consensus        29 d~Lfva~~g~~Vev~~l~~~~~-~~~~~F~Tv~~V~~l~y~~~GDYlvTlE   78 (215)
T PF14761_consen   29 DALFVAASGCKVEVYDLEQEEC-PLLCTFSTVGRVLQLVYSEAGDYLVTLE   78 (215)
T ss_pred             ceEEEEcCCCEEEEEEcccCCC-ceeEEEcchhheeEEEeccccceEEEEE
Confidence            45544 6677888888884433 2333335677777888887777765443


No 52 
>PF05098 LEF-4:  Late expression factor 4 (LEF-4);  InterPro: IPR007790 The baculovirus Autographa californica nuclear polyhedrosis virus (AcMNPV) virus encodes a DNA-dependent RNA polymerase that is required for transcription of viral late genes. This polymerase is composed of four equimolar subunits, LEF-8, LEF-4, LEF-9, and p47. LEF-4 carries out all the enzymatic functions related to mRNA capping []. ; GO: 0006355 regulation of transcription, DNA-dependent
Probab=41.07  E-value=70  Score=23.30  Aligned_cols=58  Identities=14%  Similarity=0.204  Sum_probs=31.3

Q ss_pred             eEEc-CCCCCC--CCCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCC-CEEEEEECC
Q psy13631          7 VRLV-DNFTCA--NGSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVE-QHLYWTDDE   72 (81)
Q Consensus         7 y~L~-d~~tC~--~~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~-~~iyW~d~~   72 (81)
                      |-|. ||.+||  -...-++++......+...+++..        -.+.+++|+..-..+ +.+|-+|+.
T Consensus       238 WAlKLDGvRGkG~~~~~~~~i~~DDMq~fsg~l~~~~--------f~~Nnvv~fQcE~i~~~~~YITDlL  299 (450)
T PF05098_consen  238 WALKLDGVRGKGYFTNGFIIIQMDDMQMFSGKLDPSP--------FSLNNVVAFQCELIDDETFYITDLL  299 (450)
T ss_pred             EEEeecccceeeEEeccEEEEEEchhhhhhcccccch--------hhcccEEEEEEEEecCceEEEeeee
Confidence            4577 998888  112333344444444444442221        134556666655555 778888864


No 53 
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=40.31  E-value=66  Score=21.55  Aligned_cols=25  Identities=28%  Similarity=0.431  Sum_probs=21.7

Q ss_pred             ceEEEEEeCCCCEEEEEECCCCcEEe
Q psy13631         53 HAIAIDWDPVEQHLYWTDDEARNKSG   78 (81)
Q Consensus        53 ~~~ald~d~~~~~iyW~d~~~~~I~r   78 (81)
                      +..++.+|. ++.||++++....|.+
T Consensus       187 ~s~g~~~D~-~G~ly~~~~~~~aI~~  211 (287)
T PF03022_consen  187 QSDGMAIDP-NGNLYFTDVEQNAIGC  211 (287)
T ss_dssp             SECEEEEET-TTEEEEEECCCTEEEE
T ss_pred             CCceEEECC-CCcEEEecCCCCeEEE
Confidence            567888998 8899999999999876


No 54 
>PF09603 Fib_succ_major:  Fibrobacter succinogenes major domain (Fib_succ_major);  InterPro: IPR011871  This domain of about 175 to 200 amino acids is found, in from one to five copies, in over 50 proteins in Fibrobacter succinogenes subsp. succinogenes S85, an obligate anaerobe of the rumen. Many members of this family have an apparent lipoprotein signal sequence. Conserved cysteine residues, suggestive of disulphide bond formation, are also consistent with an extracytoplasmic location for this domain. This domain can also be found in small numbers of proteins in Chlorobium tepidum and Bacteroides thetaiotaomicron.
Probab=39.34  E-value=9.9  Score=23.21  Aligned_cols=9  Identities=33%  Similarity=0.154  Sum_probs=7.7

Q ss_pred             CCCCceEEc
Q psy13631          2 FHFYGVRLV   10 (81)
Q Consensus         2 ~~~~Gy~L~   10 (81)
                      -||+||+|-
T Consensus        69 ~CP~GWrlP   77 (184)
T PF09603_consen   69 ICPEGWRLP   77 (184)
T ss_pred             CCCCCCCCC
Confidence            499999887


No 55 
>PF08366 LLGL:  LLGL2;  InterPro: IPR013577 This domain is found in lethal giant larvae homologue 2 (LLGL2) proteins and syntaxin-binding proteins like tomosyn []. It has been identified in eukaryotes and tends to be found together with WD repeats (IPR001680 from INTERPRO). 
Probab=38.66  E-value=80  Score=18.25  Aligned_cols=11  Identities=0%  Similarity=0.220  Sum_probs=7.5

Q ss_pred             CCcEEEEEeCC
Q psy13631         19 SQEMLLLVQRT   29 (81)
Q Consensus        19 ~~~~Ll~s~~~   29 (81)
                      .++|++|+-+.
T Consensus        27 ~~~~iiFsGGm   37 (105)
T PF08366_consen   27 GEPFIIFSGGM   37 (105)
T ss_pred             CCcEEEEeCCc
Confidence            56777777664


No 56 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=38.04  E-value=17  Score=15.70  Aligned_cols=7  Identities=14%  Similarity=-0.344  Sum_probs=3.4

Q ss_pred             CCCceEE
Q psy13631          3 HFYGVRL    9 (81)
Q Consensus         3 ~~~Gy~L    9 (81)
                      |+.||.+
T Consensus        24 C~~g~~g   30 (35)
T smart00181       24 CPPGYTG   30 (35)
T ss_pred             CCCCCcc
Confidence            4555543


No 57 
>smart00284 OLF Olfactomedin-like domains.
Probab=37.21  E-value=91  Score=20.88  Aligned_cols=41  Identities=15%  Similarity=0.302  Sum_probs=25.8

Q ss_pred             EeCCCCCceeeEEEeCCc-cceEEEEEeCCCCEEEEEECCCC
Q psy13631         34 ISLDTPDHTNVMLPVVGV-KHAIAIDWDPVEQHLYWTDDEAR   74 (81)
Q Consensus        34 i~l~~~~~~~~~~~~~~~-~~~~ald~d~~~~~iyW~d~~~~   74 (81)
                      .++..+....+-+|.++. +....|||++.+++||==|-+..
T Consensus       205 yDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~~LY~wdng~~  246 (255)
T smart00284      205 YDTNTGKEGHLDIPFENMYEYISMLDYNPNDRKLYAWNNGHL  246 (255)
T ss_pred             EECCCCccceeeeeeccccccceeceeCCCCCeEEEEeCCeE
Confidence            345544434444555543 45678999999999996564433


No 58 
>PTZ00444 hypothetical protein; Provisional
Probab=36.54  E-value=18  Score=23.00  Aligned_cols=14  Identities=14%  Similarity=-0.035  Sum_probs=11.8

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      .||.||... || .|.
T Consensus       104 ~CP~GW~k~~dg-~Ci  118 (184)
T PTZ00444        104 PCPEGWTKNSNG-QCW  118 (184)
T ss_pred             cCCccceecCCC-eee
Confidence            599999888 76 687


No 59 
>PF00622 SPRY:  SPRY domain;  InterPro: IPR003877 The SPRY domain is of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin []. Ca2+-release from the sarcoplasmic or endoplasmic reticulum, the intracellular Ca2+ store, is mediated by the ryanodine receptor (RyR) and/or the inositol trisphosphate receptor (IP3R).; GO: 0005515 protein binding; PDB: 2V24_A 3EK9_A 2AFJ_A 2IWG_E 3EMW_A 2WL1_A 3TOJ_B 2VOK_A 2VOL_B 2FNJ_A ....
Probab=35.15  E-value=82  Score=17.37  Aligned_cols=21  Identities=14%  Similarity=0.124  Sum_probs=18.6

Q ss_pred             cceEEEEEeCCCCEEEEEECC
Q psy13631         52 KHAIAIDWDPVEQHLYWTDDE   72 (81)
Q Consensus        52 ~~~~ald~d~~~~~iyW~d~~   72 (81)
                      ..++++-+|...+.|.|++..
T Consensus        67 ~dvIG~~lD~~~g~l~F~~ng   87 (124)
T PF00622_consen   67 GDVIGCGLDLDNGELSFYKNG   87 (124)
T ss_dssp             TSEEEEEEETTTTEEEEEETT
T ss_pred             CcEEEEEEeecccEEEEEECC
Confidence            678999999999999999754


No 60 
>PF02191 OLF:  Olfactomedin-like domain;  InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=34.83  E-value=1.4e+02  Score=19.80  Aligned_cols=44  Identities=16%  Similarity=0.256  Sum_probs=28.4

Q ss_pred             EeCCCCCceeeEEEeCCc-cceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631         34 ISLDTPDHTNVMLPVVGV-KHAIAIDWDPVEQHLYWTDDEARNKS   77 (81)
Q Consensus        34 i~l~~~~~~~~~~~~~~~-~~~~ald~d~~~~~iyW~d~~~~~I~   77 (81)
                      .++.......+-+|..+. .....|+|++.+++||==|-+...++
T Consensus       200 fDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY~wd~G~~v~Y  244 (250)
T PF02191_consen  200 FDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLYAWDNGYQVTY  244 (250)
T ss_pred             EECCCCceeceeeeeccccCceEeeeECCCCCeEEEEECCeEEEE
Confidence            455544444444555543 46789999999999997665544443


No 61 
>smart00104 ANATO Anaphylatoxin homologous domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.
Probab=34.81  E-value=16  Score=16.38  Aligned_cols=14  Identities=36%  Similarity=0.567  Sum_probs=11.7

Q ss_pred             CCCceEEc-CCCCCC
Q psy13631          3 HFYGVRLV-DNFTCA   16 (81)
Q Consensus         3 ~~~Gy~L~-d~~tC~   16 (81)
                      |.+|+++. .+.+|.
T Consensus         2 C~dG~~~~~~~~tC~   16 (35)
T smart00104        2 CADGMRLAPEGETCE   16 (35)
T ss_pred             cccccccCCCCCCHH
Confidence            77888888 788897


No 62 
>PF01607 CBM_14:  Chitin binding Peritrophin-A domain;  InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=34.70  E-value=19  Score=16.97  Aligned_cols=15  Identities=13%  Similarity=0.111  Sum_probs=10.3

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      .||+|+.-+ +...|.
T Consensus        30 ~C~~g~~fd~~~~~C~   45 (53)
T PF01607_consen   30 RCPEGLYFDPSSQRCV   45 (53)
T ss_dssp             E-TTS-EE-TTTSSEE
T ss_pred             CCcCCCEECcCcCEEc
Confidence            599998888 888898


No 63 
>PHA02885 putative interleukin binding protein; Provisional
Probab=33.86  E-value=1e+02  Score=18.06  Aligned_cols=15  Identities=13%  Similarity=0.463  Sum_probs=11.0

Q ss_pred             EEeCCCCEEEEEECC
Q psy13631         58 DWDPVEQHLYWTDDE   72 (81)
Q Consensus        58 d~d~~~~~iyW~d~~   72 (81)
                      -...++++|||.--.
T Consensus        55 ~lnsedgyiywi~pn   69 (135)
T PHA02885         55 CLNSEDGYIYWIGPN   69 (135)
T ss_pred             eecCCCceEEEEeCC
Confidence            356688999998543


No 64 
>KOG4649|consensus
Probab=32.56  E-value=1.7e+02  Score=20.27  Aligned_cols=19  Identities=21%  Similarity=0.282  Sum_probs=9.7

Q ss_pred             EeCCCCEEEEEECCCCcEE
Q psy13631         59 WDPVEQHLYWTDDEARNKS   77 (81)
Q Consensus        59 ~d~~~~~iyW~d~~~~~I~   77 (81)
                      .|+.++.+||-.+...+|.
T Consensus        38 vd~~sG~~~We~ilg~RiE   56 (354)
T KOG4649|consen   38 VDPQSGNLIWEAILGVRIE   56 (354)
T ss_pred             ecCCCCcEEeehhhCceee
Confidence            3445555666555444443


No 65 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=32.05  E-value=1.6e+02  Score=19.87  Aligned_cols=58  Identities=22%  Similarity=0.340  Sum_probs=39.2

Q ss_pred             CCcEEEEEeCC--cEEEEeCCCCCc--ee-eEEEeCCccceEEEEEeCCCCEEEEEECCCCcEE
Q psy13631         19 SQEMLLLVQRT--DICKISLDTPDH--TN-VMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNKS   77 (81)
Q Consensus        19 ~~~~Ll~s~~~--~I~~i~l~~~~~--~~-~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I~   77 (81)
                      ...||.++++.  .|-.+.+++...  .. ..++..+ ..+..+.+++..++||=++...+.|.
T Consensus       255 dg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G-~~Pr~~~~s~~g~~l~Va~~~s~~v~  317 (345)
T PF10282_consen  255 DGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGG-KFPRHFAFSPDGRYLYVANQDSNTVS  317 (345)
T ss_dssp             TSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESS-SSEEEEEE-TTSSEEEEEETTTTEEE
T ss_pred             CCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCC-CCccEEEEeCCCCEEEEEecCCCeEE
Confidence            35689999874  677777754322  11 2233333 56889999999999999988877765


No 66 
>KOG0650|consensus
Probab=31.21  E-value=68  Score=24.47  Aligned_cols=51  Identities=16%  Similarity=0.322  Sum_probs=31.1

Q ss_pred             CCcEEEEEeCCcEEEEeCCCCCceeeEE---------Ee-CCccceEEEEEeCCCCEEEEEECC
Q psy13631         19 SQEMLLLVQRTDICKISLDTPDHTNVML---------PV-VGVKHAIAIDWDPVEQHLYWTDDE   72 (81)
Q Consensus        19 ~~~~Ll~s~~~~I~~i~l~~~~~~~~~~---------~~-~~~~~~~ald~d~~~~~iyW~d~~   72 (81)
                      ..|+|++|+++.+|--+|-.......++         .+ ++--|.+...||   +++.|-|..
T Consensus       577 s~p~lfVaTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d---~k~~WfDld  637 (733)
T KOG0650|consen  577 SKPYLFVATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYD---KKMCWFDLD  637 (733)
T ss_pred             CCceEEEEeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCC---CeeEEEEcc
Confidence            4689999999999888776532111111         01 122355555554   689999874


No 67 
>PF05281 Secretogranin_V:  Neuroendocrine protein 7B2 precursor (Secretogranin V);  InterPro: IPR007945 Mature peptide hormones and neuropeptides are typically synthesised from much larger precursors and require several post-translational processing steps--including proteolytic cleavage--for the formation of the bioactive species. The subtilisin-related proteolytic enzymes that accomplish neuroendocrine-specific cleavages are known as prohormone convertases 1 and 2 (PC1 and PC2), which belong to MEROPS peptidase family S8B. The cell biology of these proteases within the regulated secretory pathway of neuroendocrine cells is complex, and they are themselves initially synthesised as inactive precursor molecules. ProPC1 propeptide cleavage occurs rapidly in the endoplasmic reticulum, yet its major site of action on prohormones takes place later in the secretory pathway. PC1 undergoes an interesting carboxyl terminal processing event whose function appears to be to activate the enzyme. ProPC2, on the other hand, exhibits comparatively long initial folding times and exits the endoplasmic reticulum without propeptide cleavage, in association with the neuroendocrine-specific protein 7B2. Once the proPC2/7B2 complex arrives at the trans-Golgi network, 7B2 is internally cleaved into two domains, the 21kDa fragment and a carboxy-terminal 31 residue peptide. PC2 propeptide removal occurs in the maturing secretory granule, most likely through autocatalysis, and 7B2 association does not appear to be directly required for this cleavage event. However, if proPC2 has not encountered 7B2 intracellularly, it cannot generate a catalytically active mature species. The molecular mechanism behind the intriguing intracellular association of 7B2 and proPC2 is still unknown, but may involve conformational rearrangement or stabilisation of a proPC2 conformer mediated by a 36-residue internal segment of 21kDa 7B2. This family represents, 7B2 (secretogranin V), which is the molecular escort protein for PC2. 7B2 is a bifunctional protein with an N-terminal activation domain and a C-terminal inhibitory domain (MEROPS inhibitor family I21, clan I-) separated by a furin cleavage site []. Although 7B2 represents a potent inhibitor of PC2, there is an absolute requirement of 7B2 for the activation of PC2, which is synthesised as a zymogen. Both the full length, 27 kDa, and the C-terminal peptide (CT domain) derived from intramolecular cleavage of 7B2 are potent inhibitors of PC2. Studies have shown that the active peptide in the CT domain to be LLRVHK, active in the nanomolar range not only against PC2 but also PC1 [, ]. Knock-out studies have shown that the PC2 nulls are not phenotypically equivalent to the 7B2 nulls, which suggests that 7B2 may have other activities in addition to being the activator of PC2 [].  7B2 exhibits both structural and functional homology to proSAAS (IPR010832 from INTERPRO), which is the PC1 binding protein. The CT domain of proSAAS contains the same inhibitor hexapeptide as 7B2, consequently both 7B2 and proSAAS are two members of a homologous family of prohormone convertase inhibitor proteins. ; GO: 0007218 neuropeptide signaling pathway, 0030141 stored secretory granule
Probab=31.14  E-value=22  Score=23.12  Aligned_cols=15  Identities=20%  Similarity=0.200  Sum_probs=10.8

Q ss_pred             CCCCceEEcCCCCCCCC
Q psy13631          2 FHFYGVRLVDNFTCANG   18 (81)
Q Consensus         2 ~~~~Gy~L~d~~tC~~~   18 (81)
                      -||.||.-+||  |...
T Consensus       102 PCPvGyt~~dg--Cle~  116 (208)
T PF05281_consen  102 PCPVGYTAEDG--CLEN  116 (208)
T ss_pred             CCCCCcccccC--cccC
Confidence            59999993277  8743


No 68 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=28.83  E-value=34  Score=20.57  Aligned_cols=15  Identities=20%  Similarity=0.027  Sum_probs=11.5

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      +||.|+.-+ |...|.
T Consensus        35 ~Cp~G~sWs~dLdkCm   50 (129)
T PF12191_consen   35 PCPRGSSWSADLDKCM   50 (129)
T ss_dssp             SS-SSEEEETTTTEEE
T ss_pred             CCCCCCcCcccccccc
Confidence            799999999 876664


No 69 
>PF01623 Carla_C4:  Carlavirus putative nucleic acid binding protein;  InterPro: IPR002568 This family of carlavirus nucleic acid binding proteins includes a motif for a potential C-4 type zinc finger this has four highly conserved cysteine residues and is a conserved feature of the carlaviruses 3' terminal ORF []. These proteins may function as viral transcriptional regulators. The carlavirus family includes Garlic latent virus and Potato virus S and Potato virus M, these viruses are positive strand, ssRNA with no DNA stage.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent
Probab=28.73  E-value=43  Score=18.96  Aligned_cols=8  Identities=63%  Similarity=1.041  Sum_probs=6.6

Q ss_pred             CCCCCCCC
Q psy13631         11 DNFTCANG   18 (81)
Q Consensus        11 d~~tC~~~   18 (81)
                      ||+||..|
T Consensus        73 DnkTC~PG   80 (91)
T PF01623_consen   73 DNKTCVPG   80 (91)
T ss_pred             CCCcccCC
Confidence            99999954


No 70 
>PF12093 Corona_NS8:  Coronavirus NS8 protein;  InterPro: IPR022722  This family of proteins is functionally uncharacterised. This protein is found in coronaviruses. Proteins in this family are typically between 39 to 121 amino acids in length. This protein has two conserved sequence motifs: EDPCP and INCQ. 
Probab=27.32  E-value=32  Score=19.89  Aligned_cols=19  Identities=16%  Similarity=0.088  Sum_probs=11.3

Q ss_pred             CCcEEEEEeCCcEEEEeCC
Q psy13631         19 SQEMLLLVQRTDICKISLD   37 (81)
Q Consensus        19 ~~~~Ll~s~~~~I~~i~l~   37 (81)
                      ++.|+=+..+++-|...|-
T Consensus        43 ~dW~ik~~~R~sArL~~Lc   61 (118)
T PF12093_consen   43 SDWFIKIGTRKSARLVQLC   61 (118)
T ss_pred             ccceEEEcCccchhHhhhc
Confidence            5666666666655555553


No 71 
>PF02664 LuxS:  S-Ribosylhomocysteinase (LuxS);  InterPro: IPR003815 In bacteria, the regulation of gene expression in response to changes in cell density is called quorum sensing. Quorum-sensing bacteria produce, release, and respond to hormone-like molecules (autoinducers) that accumulate in the external environment as the cell population grows. For example, enteric bacteria use quorum sensing to regulate several traits that allow them to establish and maintain infection in their host, including motility, biofilm formation, and virulence-specific genes []. The LuxS/AI-2 system is one of several quorum sensing mechanisms. AI-2 (autoinducer-2) is a signalling molecule that functions in interspecies communication by regulating niche-specific genes with diverse functions in various bacteria, often in response to population density. LuxS (S-ribosylhomocysteinase; 4.4.1.21 from EC) is an autoinducer-production protein that has a metabolic function as a component of the activated methyl cycle. LuxS converts S-ribosylhomocysteine to homocysteine and 4,5-dihydroxy-2,3-pentanedione (DPD); DPD can then spontaneously cyclise to active AI-2 [, ]. LuxS is a homodimeric iron-dependent metalloenzyme containing two identical tetrahedral metal-binding sites similar to those found in peptidases and amidases []. ; GO: 0005506 iron ion binding, 0009372 quorum sensing; PDB: 1J6X_B 1VGX_A 1INN_B 1VJE_B 1J6V_A 1VH2_A 1J6W_B 1JOE_B 1J98_A 1IE0_A ....
Probab=27.27  E-value=36  Score=21.20  Aligned_cols=10  Identities=30%  Similarity=0.158  Sum_probs=7.7

Q ss_pred             CCCCCceEEc
Q psy13631          1 MFHFYGVRLV   10 (81)
Q Consensus         1 ~~~~~Gy~L~   10 (81)
                      |-|.+||.|-
T Consensus        80 MGCrTGFYli   89 (157)
T PF02664_consen   80 MGCRTGFYLI   89 (157)
T ss_dssp             -TTSSEEEEE
T ss_pred             cccccccEEE
Confidence            7799999876


No 72 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=26.92  E-value=30  Score=15.18  Aligned_cols=7  Identities=14%  Similarity=-0.320  Sum_probs=5.3

Q ss_pred             CCCCceE
Q psy13631          2 FHFYGVR    8 (81)
Q Consensus         2 ~~~~Gy~    8 (81)
                      .|++||.
T Consensus        23 ~C~~G~~   29 (32)
T PF00008_consen   23 ECPPGYT   29 (32)
T ss_dssp             EEBTTEE
T ss_pred             ECCCCCc
Confidence            3888886


No 73 
>smart00289 WR1 Worm-specific repeat type 1. Worm-specific repeat type 1. Cysteine-rich domain apparently unique (so far) to C. elegans. Often appears with KU domains. About 3 dozen worm proteins contain this domain.
Probab=25.45  E-value=34  Score=15.24  Aligned_cols=9  Identities=11%  Similarity=-0.274  Sum_probs=6.3

Q ss_pred             CCCCceEEc
Q psy13631          2 FHFYGVRLV   10 (81)
Q Consensus         2 ~~~~Gy~L~   10 (81)
                      .||.||+-.
T Consensus        21 ~CP~g~~C~   29 (38)
T smart00289       21 SCPSGYSCQ   29 (38)
T ss_pred             CCCCCCEEe
Confidence            588888644


No 74 
>KOG1446|consensus
Probab=24.32  E-value=2.5e+02  Score=19.51  Aligned_cols=48  Identities=13%  Similarity=0.102  Sum_probs=30.5

Q ss_pred             CCCcEEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEEC
Q psy13631         18 GSQEMLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDD   71 (81)
Q Consensus        18 ~~~~~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~   71 (81)
                      .++.||--+....||.-++...... ..+   .+...-..+||+.+  ++++-.
T Consensus       111 ~~d~FlS~S~D~tvrLWDlR~~~cq-g~l---~~~~~pi~AfDp~G--LifA~~  158 (311)
T KOG1446|consen  111 KDDTFLSSSLDKTVRLWDLRVKKCQ-GLL---NLSGRPIAAFDPEG--LIFALA  158 (311)
T ss_pred             CCCeEEecccCCeEEeeEecCCCCc-eEE---ecCCCcceeECCCC--cEEEEe
Confidence            3577888777788999888855433 233   23444456788866  555544


No 75 
>PF10015 DUF2258:  Uncharacterized protein conserved in archaea (DUF2258);  InterPro: IPR017140 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=24.21  E-value=80  Score=17.22  Aligned_cols=15  Identities=20%  Similarity=0.501  Sum_probs=12.0

Q ss_pred             eEEEEEeCCCCEEEE
Q psy13631         54 AIAIDWDPVEQHLYW   68 (81)
Q Consensus        54 ~~ald~d~~~~~iyW   68 (81)
                      .+.++||..++.+-|
T Consensus        61 RI~v~y~v~~~kl~~   75 (75)
T PF10015_consen   61 RIRVEYDVEDGKLVF   75 (75)
T ss_pred             EEEEEEEecCCEeeC
Confidence            357889999988876


No 76 
>PF06387 Calcyon:  D1 dopamine receptor-interacting protein (calcyon);  InterPro: IPR009431 This family consists of several D1 dopamine receptor-interacting (calcyon) proteins. D1/D5 dopamine receptors in the basal ganglia, hippocampus, and cerebral cortex modulate motor, reward, and cognitive behaviour. D1-like dopamine receptors likely modulate neocortical and hippocampal neuronal excitability and synaptic function via Ca2+ as well as cAMP-dependent signalling []. Defective calcyon proteins have been implicated in both attention-deficit/hyperactivity disorder (ADHD) [] and schizophrenia.; GO: 0050780 dopamine receptor binding, 0007212 dopamine receptor signaling pathway, 0016021 integral to membrane
Probab=24.20  E-value=39  Score=21.51  Aligned_cols=14  Identities=21%  Similarity=0.147  Sum_probs=11.2

Q ss_pred             CCCCceEEcCCCCCC
Q psy13631          2 FHFYGVRLVDNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~d~~tC~   16 (81)
                      .||+||.+. .+.|.
T Consensus       112 sCPdGFv~k-hk~C~  125 (186)
T PF06387_consen  112 SCPDGFVLK-HKRCT  125 (186)
T ss_pred             cCCCcceee-ccccc
Confidence            499999887 35688


No 77 
>KOG4283|consensus
Probab=23.83  E-value=2.7e+02  Score=19.66  Aligned_cols=53  Identities=11%  Similarity=0.217  Sum_probs=36.8

Q ss_pred             EEEEEeCC---cEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEEECCCCcE
Q psy13631         22 MLLLVQRT---DICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWTDDEARNK   76 (81)
Q Consensus        22 ~Ll~s~~~---~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~d~~~~~I   76 (81)
                      .-|+|.+.   .|+.-++.+..+...+.  ..-.++.|+++-+..++|..+-...+.|
T Consensus       158 HcLiA~gtr~~~VrLCDi~SGs~sH~Ls--GHr~~vlaV~Wsp~~e~vLatgsaDg~i  213 (397)
T KOG4283|consen  158 HCLIAAGTRDVQVRLCDIASGSFSHTLS--GHRDGVLAVEWSPSSEWVLATGSADGAI  213 (397)
T ss_pred             ceEEEEecCCCcEEEEeccCCcceeeec--cccCceEEEEeccCceeEEEecCCCceE
Confidence            45556543   67777787777665443  2345688999999999998876655553


No 78 
>smart00449 SPRY Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Probab=23.56  E-value=1.4e+02  Score=16.44  Aligned_cols=24  Identities=8%  Similarity=-0.051  Sum_probs=19.7

Q ss_pred             cceEEEEEeCCCCEEEEEECCCCc
Q psy13631         52 KHAIAIDWDPVEQHLYWTDDEARN   75 (81)
Q Consensus        52 ~~~~ald~d~~~~~iyW~d~~~~~   75 (81)
                      ...+++-+|...+.|++.+.....
T Consensus        66 gd~iGv~lD~~~g~l~F~~ng~~~   89 (122)
T smart00449       66 GDVIGCFLDLEAGTISFYKNGKYL   89 (122)
T ss_pred             CCEEEEEEECCCCEEEEEECCCEe
Confidence            668899999999999999876544


No 79 
>PRK02260 S-ribosylhomocysteinase; Provisional
Probab=23.12  E-value=42  Score=20.88  Aligned_cols=10  Identities=30%  Similarity=0.158  Sum_probs=8.4

Q ss_pred             CCCCCceEEc
Q psy13631          1 MFHFYGVRLV   10 (81)
Q Consensus         1 ~~~~~Gy~L~   10 (81)
                      |-|.+||.|-
T Consensus        80 MGCrTGFYli   89 (158)
T PRK02260         80 MGCRTGFYLI   89 (158)
T ss_pred             CccccccEEE
Confidence            6799999877


No 80 
>PHA02639 EEV host range protein; Provisional
Probab=22.90  E-value=43  Score=22.80  Aligned_cols=15  Identities=33%  Similarity=0.324  Sum_probs=12.3

Q ss_pred             CCCCceEEc--CCCCCC
Q psy13631          2 FHFYGVRLV--DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~--d~~tC~   16 (81)
                      .|.+||.|.  ...+|.
T Consensus       177 sC~~GY~L~Gs~~~tC~  193 (295)
T PHA02639        177 SCKSGFDLVGEKYSTCN  193 (295)
T ss_pred             EcCCCCeEcCCCcEEEC
Confidence            499999998  467896


No 81 
>PF00020 TNFR_c6:  TNFR/NGFR cysteine-rich region;  InterPro: IPR001368 A number of proteins, some of which are known to be receptors for growth factors have been found to contain a cysteine-rich domain at the N-terminal region that can be subdivided into four (or in some cases, three) repeats containing six conserved cysteines all of which are involved in intrachain disulphide bonds [].  CD27 (also called S152 or T14) mediates a co-stimulatory signal for T and B cell activation and is involved in murine T cell development. Tyrosine-phosphorylation of ZAP-70 following CD27 ligation of T cells has been reported [], but not confirmed independently. CD30 was originally identified as Ki-1, an antigen expressed on Reed-Sternberg cells in Hodgkin's lymphomas and other non-Hodgkin's lymphomas, particularly diffuse large-cell lymphoma and immunoblastic lymphoma. CD30 has pleiotropic effects on CD30-positive lymphoma cell lines ranging from cell proliferation to cell death. It is thought to be involved in negative selection of T-cells in the thymus and is involved in TCR-mediated cell death. CD30 is a member of the TNFR family of molecules, activate NFkB through interaction with TRAF2 and TRAF5. CD40 (Bp50) plays a central role in the regulation of cell-mediated immunity as well as antibody mediated immunity. It is central to T cell dependent (TD)-responses and may influence survival of B cell lymphomas. CD95 (also called APO-1, fas antigen, Fas tumour necrosis factor receptor superfamily, member 6, TNFRSF6 or apoptosis antigen 1, APT1) is expressed, typically at high levels, on activated T and B cells. It is involved in the mediation of apoptosis-inducing signals.  Other proteins known to belong to this family [, , , ] are, tumour Necrosis Factor type I and type II receptors (TNFR), Rabbit fibroma virus soluble TNF receptor (protein T2), lymphotoxin alpha/beta receptor, low-affinity nerve growth factor receptor (LA-NGFR) (p75), T-cell antigen OX40, Wsl-1, a receptor (for a yet undefined ligand) that mediates apoptosis and Vaccinia virus protein A53 (SalF19R). CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0005488 binding; PDB: 3TJE_F 3QD6_S 3ALQ_U 3IJ2_Y 3BUK_D 1SG1_X 1NCF_B 1EXT_A 1FT4_B 1TNR_R ....
Probab=22.54  E-value=59  Score=14.73  Aligned_cols=14  Identities=21%  Similarity=0.112  Sum_probs=9.6

Q ss_pred             CCCceEEc-CC-CCCC
Q psy13631          3 HFYGVRLV-DN-FTCA   16 (81)
Q Consensus         3 ~~~Gy~L~-d~-~tC~   16 (81)
                      |++|+.++ .+ ..|.
T Consensus         1 C~~g~y~~~~~~~~C~   16 (39)
T PF00020_consen    1 CPPGTYSDSENHPQCL   16 (39)
T ss_dssp             ECTTEEEESSCSSSEE
T ss_pred             CccCcccCCCCCCcCC
Confidence            78888887 55 4553


No 82 
>KOG1007|consensus
Probab=22.54  E-value=91  Score=21.79  Aligned_cols=47  Identities=17%  Similarity=0.207  Sum_probs=25.1

Q ss_pred             EEEEEeCCcEEEEeCCCCCceeeEEEeCCccceEEEEEeCCCCEEEEE
Q psy13631         22 MLLLVQRTDICKISLDTPDHTNVMLPVVGVKHAIAIDWDPVEQHLYWT   69 (81)
Q Consensus        22 ~Ll~s~~~~I~~i~l~~~~~~~~~~~~~~~~~~~ald~d~~~~~iyW~   69 (81)
                      .+..++...++..++.+......+-- ...+.+..+||++..+++.=+
T Consensus       186 qv~tt~d~tl~~~D~RT~~~~~sI~d-AHgq~vrdlDfNpnkq~~lvt  232 (370)
T KOG1007|consen  186 QVATTSDSTLQFWDLRTMKKNNSIED-AHGQRVRDLDFNPNKQHILVT  232 (370)
T ss_pred             eEEEeCCCcEEEEEccchhhhcchhh-hhcceeeeccCCCCceEEEEE
Confidence            34445555566666654322111111 122567788999888876543


No 83 
>COG1854 LuxS LuxS protein involved in autoinducer AI2 synthesis [Signal transduction mechanisms]
Probab=22.53  E-value=52  Score=20.54  Aligned_cols=12  Identities=17%  Similarity=0.005  Sum_probs=9.4

Q ss_pred             CCCCCceEEc-CC
Q psy13631          1 MFHFYGVRLV-DN   12 (81)
Q Consensus         1 ~~~~~Gy~L~-d~   12 (81)
                      |-|.+||.|. -|
T Consensus        80 MGCrTGFYm~l~G   92 (161)
T COG1854          80 MGCRTGFYMILIG   92 (161)
T ss_pred             cccccceEEEEEC
Confidence            6789999887 54


No 84 
>PF02655 ATP-grasp_3:  ATP-grasp domain;  InterPro: IPR003806  The ATP-grasp fold is one of several distinct ATP-binding folds, and is found in enzymes that catalyze the formation of amide bonds, catalyzing the ATP-dependent ligation of a carboxylate-containing molecule to an amino or thiol group-containing molecule []. This fold is found in many different enzyme families, including various peptide synthetases, biotin carboxylase, synapsin, succinyl-CoA synthetase, pyruvate phosphate dikinase, and glutathione synthetase, amongst others []. These enzymes contribute predominantly to macromolecular synthesis, using ATP-hydrolysis to activate their substrates.  The ATP-grasp fold shares functional and structural similarities with the PIPK (phosphatidylinositol phosphate kinase) and protein kinase superfamilies. The ATP-grasp domain consists of two subdomains with different alpha+beta folds, which grasp the ATP molecule between them. Each subdomain provides a variable loop that forms part of the active site, with regions from other domains also contributing to the active site, even though these other domains are not conserved between the various ATP-grasp enzymes []. This entry describes a type of ATP-grasp fold that is found in a set of proteins of unknown function.; GO: 0005524 ATP binding, 0046872 metal ion binding; PDB: 3DF7_A.
Probab=21.98  E-value=82  Score=18.93  Aligned_cols=25  Identities=16%  Similarity=0.206  Sum_probs=18.9

Q ss_pred             CCccceEEEEEeCCCCEEEEEECCC
Q psy13631         49 VGVKHAIAIDWDPVEQHLYWTDDEA   73 (81)
Q Consensus        49 ~~~~~~~ald~d~~~~~iyW~d~~~   73 (81)
                      +++...+++||-..++.+|..+++.
T Consensus       132 ~gl~G~~giD~I~~~~~~~viEINP  156 (161)
T PF02655_consen  132 PGLRGYVGIDFILDDGGPYVIEINP  156 (161)
T ss_dssp             TT--EEEEEEEEESS-SEEEEEEES
T ss_pred             CCCeeeEeEEEEEeCCcEEEEEEcC
Confidence            4778889999999999999998764


No 85 
>TIGR02145 Fib_succ_major Fibrobacter succinogenes major paralogous domain. This domain of about 175 to 200 amino acids is found, in from one to five copies, in over 50 proteins in Fibrobacter succinogenes S85, an obligate anaerobe of the rumen. Many members of this family have an apparent lipoprotein signal sequence. Conserved cysteine residues, suggestive of disulfide bond formation, are also consistent with an extracytoplasmic location for this domain. This domain can also be found in small numbers of proteins in Chlorobium tepidum and Bacteroides thetaiotaomicron.
Probab=21.58  E-value=31  Score=21.54  Aligned_cols=9  Identities=22%  Similarity=0.003  Sum_probs=7.6

Q ss_pred             CCCCceEEc
Q psy13631          2 FHFYGVRLV   10 (81)
Q Consensus         2 ~~~~Gy~L~   10 (81)
                      -||+|+.|-
T Consensus        59 ~cP~GWhlP   67 (171)
T TIGR02145        59 ICPEGWHLP   67 (171)
T ss_pred             cCCCCCCCC
Confidence            499999987


No 86 
>PF08310 LGFP:  LGFP repeat;  InterPro: IPR013207 This 54 amino acid repeat is found in many hypothetical proteins. Several hypothetical proteins from Corynebacterium glutamicum (Brevibacterium flavum) and Corynebacterium efficiens along with PS1 protein contain this repeat region. The N-terminal region of PS1 contains an esterase domain which transfers corynomycolic acid. The C-terminal region consists of 4 tandem LGFP repeats. It is hypothesised that the PS1 proteins in Corynebacterium, when associated with the cell wall, may be anchored via the LGFP tandem repeats that may be important for maintaining cell wall integrity. Deletion of Q01377 from SWISSPROT protein results in a 10-fold increase in the cell volume of the organism and infers the corresponding involvement of the protein in the cell shape formation []. The secondary structure of each repeat is predicted to comprise two beta-strands and one alpha-helix.
Probab=21.31  E-value=81  Score=15.48  Aligned_cols=12  Identities=17%  Similarity=0.542  Sum_probs=8.8

Q ss_pred             CCCCEEEEEECC
Q psy13631         61 PVEQHLYWTDDE   72 (81)
Q Consensus        61 ~~~~~iyW~d~~   72 (81)
                      .....|||+...
T Consensus        21 F~~G~Iywsp~t   32 (54)
T PF08310_consen   21 FQNGTIYWSPAT   32 (54)
T ss_pred             cCCeEEEEeCCC
Confidence            356899999654


No 87 
>PF12860 PAS_7:  PAS fold
Probab=20.86  E-value=1.3e+02  Score=16.48  Aligned_cols=19  Identities=16%  Similarity=0.669  Sum_probs=12.3

Q ss_pred             CccceEEEEEeCCCCEEEEE
Q psy13631         50 GVKHAIAIDWDPVEQHLYWT   69 (81)
Q Consensus        50 ~~~~~~ald~d~~~~~iyW~   69 (81)
                      .+...+++ ||....-+||=
T Consensus         3 ~l~~Gv~v-~D~~~rl~~~N   21 (115)
T PF12860_consen    3 SLPQGVAV-FDSDGRLVFWN   21 (115)
T ss_pred             CcCceEEE-EcCCCeEEeEc
Confidence            34445555 78888778773


No 88 
>PF13619 KTSC:  KTSC domain
Probab=20.71  E-value=1.2e+02  Score=15.24  Aligned_cols=16  Identities=19%  Similarity=0.403  Sum_probs=10.8

Q ss_pred             cceEEEEEeCCCCEEE
Q psy13631         52 KHAIAIDWDPVEQHLY   67 (81)
Q Consensus        52 ~~~~ald~d~~~~~iy   67 (81)
                      +++.+++||..++.|.
T Consensus         4 s~I~~v~Yd~~~~~L~   19 (60)
T PF13619_consen    4 SNIRSVGYDPETRTLE   19 (60)
T ss_pred             CcccEEeECCCCCEEE
Confidence            4566778887776654


No 89 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=20.12  E-value=59  Score=22.98  Aligned_cols=15  Identities=27%  Similarity=0.408  Sum_probs=13.3

Q ss_pred             CCCCceEEc-CCCCCC
Q psy13631          2 FHFYGVRLV-DNFTCA   16 (81)
Q Consensus         2 ~~~~Gy~L~-d~~tC~   16 (81)
                      .|.+||.|. |+..|.
T Consensus         3 ~C~~gy~~~~~~t~C~   18 (397)
T PF03302_consen    3 ECTSGYKLSTDKTSCV   18 (397)
T ss_pred             cccCCceECCCCCccc
Confidence            489999999 988887


Done!