Query         010708
Match_columns 503
No_of_seqs    272 out of 2350
Neff          7.9 
Searched_HMMs 46136
Date          Fri Mar 29 03:39:47 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/010708.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/010708hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF01453 B_lectin:  D-mannose b 100.0 6.9E-30 1.5E-34  219.7   3.7  110   70-180     2-114 (114)
  2 cd00028 B_lectin Bulb-type man  99.9 3.2E-25 6.8E-30  191.9  15.2  115   30-148     2-116 (116)
  3 PF00954 S_locus_glycop:  S-loc  99.9 5.8E-25 1.3E-29  188.3   9.7   99  206-304     1-110 (110)
  4 smart00108 B_lectin Bulb-type   99.9 8.2E-24 1.8E-28  182.5  14.5  113   30-147     2-114 (114)
  5 PF08276 PAN_2:  PAN-like domai  99.4 1.1E-13 2.4E-18  107.0   5.0   57  337-395     4-66  (66)
  6 cd00129 PAN_APPLE PAN/APPLE-li  99.3 1.4E-12   3E-17  104.3   5.4   67  337-408     8-79  (80)
  7 cd01098 PAN_AP_plant Plant PAN  99.3   6E-12 1.3E-16  101.9   6.9   72  337-410    11-84  (84)
  8 smart00108 B_lectin Bulb-type   98.9 7.8E-09 1.7E-13   89.0   9.4   89   88-209    23-112 (114)
  9 cd00028 B_lectin Bulb-type man  98.8 2.6E-08 5.6E-13   86.0   8.9   90   88-210    23-114 (116)
 10 PF01453 B_lectin:  D-mannose b  98.7 1.2E-07 2.5E-12   81.6  11.5  100   36-149    12-114 (114)
 11 smart00473 PAN_AP divergent su  98.3 1.6E-06 3.4E-11   68.6   6.5   70  337-408     3-77  (78)
 12 KOG1187 Serine/threonine prote  98.0 3.2E-06   7E-11   87.4   3.5   37  467-503    61-97  (361)
 13 cd01100 APPLE_Factor_XI_like S  97.2  0.0003 6.5E-09   55.3   3.3   32  358-389    25-57  (73)
 14 PLN00113 leucine-rich repeat r  94.9   0.036 7.8E-07   64.9   5.9   29  471-502   683-711 (968)
 15 KOG3653 Transforming growth fa  92.9   0.035 7.6E-07   58.0   0.6   17  487-503   216-232 (534)
 16 PF00024 PAN_1:  PAN domain Thi  90.0     0.4 8.8E-06   37.3   3.8   49  340-389     4-56  (79)
 17 PLN03224 probable serine/threo  89.3    0.16 3.5E-06   55.0   1.2   24  479-502   143-166 (507)
 18 KOG1025 Epidermal growth facto  89.1    0.34 7.4E-06   54.2   3.5   19  484-502   699-717 (1177)
 19 PF07645 EGF_CA:  Calcium-bindi  86.8    0.18 3.8E-06   34.9  -0.3   32  272-303     3-36  (42)
 20 PRK11138 outer membrane biogen  86.6      15 0.00032   38.3  14.0   77   67-144    86-186 (394)
 21 cd00053 EGF Epidermal growth f  83.5    0.81 1.8E-05   29.3   1.8   30  274-303     2-32  (36)
 22 PF14295 PAN_4:  PAN domain; PD  82.4    0.96 2.1E-05   32.1   2.0   23  358-380    16-38  (51)
 23 PLN03225 Serine/threonine-prot  82.2    0.53 1.2E-05   51.9   0.8   24  479-502   130-153 (566)
 24 smart00223 APPLE APPLE domain.  82.2     1.4 3.1E-05   35.1   3.0   32  358-389    22-57  (79)
 25 smart00179 EGF_CA Calcium-bind  79.7     1.5 3.2E-05   29.0   2.0   30  272-301     3-33  (39)
 26 KOG0196 Tyrosine kinase, EPH (  77.0    0.87 1.9E-05   50.8   0.4   34  470-503   609-651 (996)
 27 PF07974 EGF_2:  EGF-like domai  76.4     1.9 4.1E-05   28.0   1.7   25  278-303     6-30  (32)
 28 cd01099 PAN_AP_HGF Subfamily o  76.1     4.1 8.8E-05   32.4   3.9   32  358-389    25-59  (80)
 29 PF12662 cEGF:  Complement Clr-  75.2       2 4.3E-05   26.0   1.4   12  292-303     2-13  (24)
 30 PF12661 hEGF:  Human growth fa  73.3    0.95 2.1E-05   23.2  -0.2   11  293-303     1-11  (13)
 31 cd00054 EGF_CA Calcium-binding  73.0     2.9 6.3E-05   27.1   2.1   31  272-302     3-34  (38)
 32 PF01683 EB:  EB module;  Inter  70.4     3.4 7.3E-05   29.8   2.0   33  268-303    16-48  (52)
 33 KOG1219 Uncharacterized conser  68.7     6.7 0.00015   48.8   4.9   27  278-304  3870-3898(4289)
 34 PTZ00284 protein kinase; Provi  68.2     1.3 2.9E-05   47.3  -0.8   30  473-502   121-150 (467)
 35 PF08277 PAN_3:  PAN-like domai  67.1      15 0.00032   28.0   5.2   30  358-388    20-49  (71)
 36 KOG0193 Serine/threonine prote  67.0     2.5 5.5E-05   45.9   1.0   16  487-502   398-413 (678)
 37 PF00008 EGF:  EGF-like domain   66.4     1.3 2.8E-05   28.7  -0.8   26  278-303     4-31  (32)
 38 PF13360 PQQ_2:  PQQ-like domai  65.9      64  0.0014   30.3  10.6   78   67-144    53-148 (238)
 39 smart00605 CW CW domain.        65.7      18 0.00039   29.5   5.8   53  358-410    22-75  (94)
 40 PF12947 EGF_3:  EGF domain;  I  64.1     1.5 3.2E-05   29.3  -0.9   27  277-303     5-32  (36)
 41 PF09064 Tme5_EGF_like:  Thromb  63.7     4.2 9.2E-05   26.7   1.2   18  285-302    11-28  (34)
 42 KOG0194 Protein tyrosine kinas  60.7     4.4 9.5E-05   43.4   1.5   16  487-502   163-178 (474)
 43 KOG1035 eIF-2alpha kinase GCN2  59.8       3 6.6E-05   48.8   0.0   21  481-501   479-499 (1351)
 44 PRK11138 outer membrane biogen  58.4      50  0.0011   34.3   9.0   18   93-110   302-320 (394)
 45 TIGR03300 assembly_YfgL outer   56.7      53  0.0011   33.8   8.7   51   93-143    73-130 (377)
 46 smart00181 EGF Epidermal growt  56.0     8.9 0.00019   24.6   1.8   24  278-302     6-30  (35)
 47 cd05622 STKc_ROCK1 Catalytic d  54.7     4.1   9E-05   42.1   0.1   30  473-502    35-64  (371)
 48 cd05106 PTKc_CSF-1R Catalytic   54.1       6 0.00013   40.9   1.2   19  483-501    40-58  (374)
 49 PF13360 PQQ_2:  PQQ-like domai  54.0      82  0.0018   29.5   9.0   77   68-144    11-102 (238)
 50 TIGR03300 assembly_YfgL outer   53.5      74  0.0016   32.7   9.2   56   89-144   245-305 (377)
 51 PTZ00036 glycogen synthase kin  53.4     5.3 0.00011   42.5   0.6   23  480-502    65-87  (440)
 52 PF04478 Mid2:  Mid2 like cell   51.5     8.3 0.00018   34.6   1.4   12  423-434    54-65  (154)
 53 PRK09605 bifunctional UGMP fam  51.3       6 0.00013   43.2   0.6   30  473-502   325-354 (535)
 54 cd05621 STKc_ROCK2 Catalytic d  50.4     5.2 0.00011   41.4  -0.0   25  478-502    40-64  (370)
 55 PHA03210 serine/threonine kina  50.2     8.1 0.00018   41.8   1.4   23  480-502   147-169 (501)
 56 PF01436 NHL:  NHL repeat;  Int  49.3      31 0.00068   21.3   3.4   21   88-108     6-26  (28)
 57 PHA02887 EGF-like protein; Pro  48.7      14 0.00031   31.5   2.3   33  270-303    82-119 (126)
 58 KOG0663 Protein kinase PITSLRE  47.9     5.4 0.00012   40.6  -0.4   20  482-501    77-96  (419)
 59 PF01102 Glycophorin_A:  Glycop  47.3     8.1 0.00018   33.5   0.7    7  425-431    71-77  (122)
 60 PF08693 SKG6:  Transmembrane a  47.2      12 0.00027   25.6   1.4    6  440-445    33-38  (40)
 61 PHA03209 serine/threonine kina  46.5     9.8 0.00021   39.0   1.3   25  478-502    63-87  (357)
 62 PF13956 Ibs_toxin:  Toxin Ibs,  46.4      11 0.00024   21.0   0.9   15    1-15      2-16  (19)
 63 cd05596 STKc_ROCK Catalytic do  45.8     4.7  0.0001   41.6  -1.2   23  480-502    42-64  (370)
 64 KOG0600 Cdc2-related protein k  43.1     6.5 0.00014   42.0  -0.7   15  487-501   123-137 (560)
 65 PLN00034 mitogen-activated pro  42.1      11 0.00023   38.5   0.8   16  487-502    80-95  (353)
 66 KOG4649 PQQ (pyrrolo-quinoline  39.2      55  0.0012   32.3   5.0   46   68-113   167-217 (354)
 67 KOG0192 Tyrosine kinase specif  38.9      14  0.0003   38.4   1.0   16  487-502    47-62  (362)
 68 PHA03211 serine/threonine kina  38.5      15 0.00033   39.4   1.3   23  480-502   168-190 (461)
 69 cd00216 PQQ_DH Dehydrogenases   38.1 1.1E+02  0.0023   33.1   7.8   76   68-144    37-136 (488)
 70 KOG2052 Activin A type IB rece  34.9      20 0.00044   37.9   1.4   17  486-502   216-232 (513)
 71 COG3117 Uncharacterized protei  33.9   2E+02  0.0044   26.8   7.6   73   39-136    42-114 (188)
 72 PF15102 TMEM154:  TMEM154 prot  33.8      19 0.00042   32.0   0.9    8  423-430    61-68  (146)
 73 PF05935 Arylsulfotrans:  Aryls  33.3   1E+02  0.0022   33.2   6.6   54   94-150   127-187 (477)
 74 PF01011 PQQ:  PQQ enzyme repea  32.9      64  0.0014   21.3   3.2   19  125-143     7-25  (38)
 75 KOG0291 WD40-repeat-containing  32.9 7.1E+02   0.015   28.4  12.6   85   87-193   354-448 (893)
 76 PF02009 Rifin_STEVOR:  Rifin/s  32.8      33 0.00072   34.5   2.6   13  436-448   273-285 (299)
 77 PHA03212 serine/threonine kina  32.1      20 0.00044   37.3   0.9   20  482-501    93-112 (391)
 78 KOG1214 Nidogen and related ba  30.0      36 0.00079   38.5   2.4   32  272-304   828-860 (1289)
 79 PF05454 DAG1:  Dystroglycan (D  28.9      19  0.0004   36.1   0.0   10  437-446   165-174 (290)
 80 KOG1095 Protein tyrosine kinas  28.9      28  0.0006   40.9   1.4   16  487-502   698-713 (1025)
 81 cd05845 Ig2_L1-CAM_like Second  26.7 1.2E+02  0.0025   25.0   4.4   33   68-101    32-64  (95)
 82 KOG0986 G protein-coupled rece  26.4      33 0.00072   36.5   1.3   21  481-501   185-205 (591)
 83 KOG0605 NDR and related serine  26.1      35 0.00076   36.8   1.4   23  479-501   139-161 (550)
 84 KOG4257 Focal adhesion tyrosin  26.1      30 0.00066   38.3   1.0   17  486-502   394-410 (974)
 85 KOG0694 Serine/threonine prote  25.8      31 0.00066   38.3   1.0   23  480-502   367-389 (694)
 86 KOG1214 Nidogen and related ba  25.7      61  0.0013   36.9   3.2   28  277-304   741-769 (1289)
 87 PF06247 Plasmod_Pvs28:  Plasmo  25.5      19  0.0004   33.5  -0.6   28  276-303    48-81  (197)
 88 KOG1166 Mitotic checkpoint ser  24.8      29 0.00062   40.6   0.5   24  478-501   695-718 (974)
 89 KOG1024 Receptor-like protein   24.7 1.1E+02  0.0024   32.1   4.6   28  475-502   278-305 (563)
 90 PF14670 FXa_inhibition:  Coagu  24.2      17 0.00036   24.3  -0.9   14  290-303    17-30  (36)
 91 PF14610 DUF4448:  Protein of u  24.1      48   0.001   30.9   1.9   22  423-444   162-183 (189)
 92 KOG4258 Insulin/growth factor   24.0      41 0.00089   38.3   1.5   19  483-501   996-1014(1025)
 93 PF02402 Lysis_col:  Lysis prot  23.9      26 0.00057   24.4   0.0   21    1-21      1-21  (46)
 94 PF12946 EGF_MSP1_1:  MSP1 EGF   23.8      15 0.00033   24.7  -1.1   26  278-303     5-32  (37)
 95 PF01034 Syndecan:  Syndecan do  23.0      26 0.00057   26.6  -0.1    8  437-444    29-36  (64)
 96 TIGR02513 type_III_yscB type I  22.4 1.1E+02  0.0024   26.9   3.5   17  119-135    79-95  (139)
 97 PF11920 DUF3438:  Protein of u  22.1 4.9E+02   0.011   26.1   8.5   90    1-101     1-97  (288)
 98 smart00564 PQQ beta-propeller   21.2 1.6E+02  0.0035   18.1   3.5   17  127-143    15-31  (33)
 99 COG2143 Thioredoxin-related pr  20.9      78  0.0017   28.7   2.4   20   36-55     35-54  (182)
100 TIGR01478 STEVOR variant surfa  20.6      56  0.0012   32.4   1.6    6  292-297   143-148 (295)
101 PF12690 BsuPI:  Intracellular   20.4 4.1E+02  0.0089   21.1   6.3   16   96-111    27-42  (82)
102 PHA03207 serine/threonine kina  20.3      45 0.00097   34.6   0.9   19  483-501    94-112 (392)
103 KOG4289 Cadherin EGF LAG seven  20.2      54  0.0012   39.6   1.5   41  278-321  1245-1286(2531)
104 PTZ00370 STEVOR; Provisional    20.1      59  0.0013   32.3   1.6    6  292-297   143-148 (296)

No 1  
>PF01453 B_lectin:  D-mannose binding lectin;  InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Aliaceae) contains a ~115-residue-long domain whose overall three dimensional fold is very similar to that of [, ]:  Dictyostelium discoideum comitin, an actin binding protein Curculigo latifolia curculin, a sweet tasting and taste-modifying protein   This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity.  Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparrallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=99.96  E-value=6.9e-30  Score=219.75  Aligned_cols=110  Identities=45%  Similarity=0.763  Sum_probs=81.1

Q ss_pred             CeEEEEeCCCCCCcC--CccEEEEecCCcEEEEeCCCcEEEee-cCcCCccccceEEEccCCCEEEEeccCCcEEEeecC
Q 010708           70 KAVIWVANRDNPLKD--SSGIITISEDGNLVLVNGQKEVLWSS-NVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQ  146 (503)
Q Consensus        70 ~~vvW~anr~~p~~~--~~~~l~l~~~G~Lvl~~~~~~~~Wss-~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd  146 (503)
                      ++|||+|||+.|+..  ...+|.|+.||||+|++..+.++|++ ++.+....+..|+|+|+|||||++.. +.++|||||
T Consensus         2 ~tvvW~an~~~p~~~~s~~~~L~l~~dGnLvl~~~~~~~iWss~~t~~~~~~~~~~~L~~~GNlvl~d~~-~~~lW~Sf~   80 (114)
T PF01453_consen    2 RTVVWVANRNSPLTSSSGNYTLILQSDGNLVLYDSNGSVIWSSNNTSGRGNSGCYLVLQDDGNLVLYDSS-GNVLWQSFD   80 (114)
T ss_dssp             --------TTEEEEECETTEEEEEETTSEEEEEETTTEEEEE--S-TTSS-SSEEEEEETTSEEEEEETT-SEEEEESTT
T ss_pred             cccccccccccccccccccccceECCCCeEEEEcCCCCEEEEecccCCccccCeEEEEeCCCCEEEEeec-ceEEEeecC
Confidence            689999999999954  24799999999999999998999999 55543213789999999999999976 999999999


Q ss_pred             CCCCCcCCCCeecccccCCceEEEEecCCCCCCC
Q 010708          147 EPTDSFLPGMHHGIDQRTGKKVQLTSWKSLSDPS  180 (503)
Q Consensus       147 ~PTDTlLpgq~l~~~~~tg~~~~L~Sw~s~~dps  180 (503)
                      |||||+||||+|+.+..+|.+..++||++.+|||
T Consensus        81 ~ptdt~L~~q~l~~~~~~~~~~~~~sw~s~~dps  114 (114)
T PF01453_consen   81 YPTDTLLPGQKLGDGNVTGKNDSLTSWSSNTDPS  114 (114)
T ss_dssp             SSS-EEEEEET--TSEEEEESTSSEEEESS----
T ss_pred             CCccEEEeccCcccCCCccccceEEeECCCCCCC
Confidence            9999999999999977776666799999999996


No 2  
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=99.93  E-value=3.2e-25  Score=191.90  Aligned_cols=115  Identities=43%  Similarity=0.717  Sum_probs=100.6

Q ss_pred             CccCCCCeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEe
Q 010708           30 QLIRDPDAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWS  109 (503)
Q Consensus        30 ~~l~~~~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Ws  109 (503)
                      +.|..|++|+|+++.|++|||........+..|||...+ +++||+||++.|. ...+.|.|+.||+|+|+|.++.++|+
T Consensus         2 ~~l~~~~~l~s~~~~f~~G~~~~~~q~~dgnlv~~~~~~-~~~vW~snt~~~~-~~~~~l~l~~dGnLvl~~~~g~~vW~   79 (116)
T cd00028           2 NPLSSGQTLVSSGSLFELGFFKLIMQSRDYNLILYKGSS-RTVVWVANRDNPS-GSSCTLTLQSDGNLVIYDGSGTVVWS   79 (116)
T ss_pred             cCcCCCCEEEeCCCcEEEecccCCCCCCeEEEEEEeCCC-CeEEEECCCCCCC-CCCEEEEEecCCCeEEEcCCCcEEEE
Confidence            568899999999999999999987543378889998765 7899999999984 35689999999999999999999999


Q ss_pred             ecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCC
Q 010708          110 SNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEP  148 (503)
Q Consensus       110 s~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~P  148 (503)
                      |++.+ ......|+|+|||||||++.+ +.++|||||||
T Consensus        80 S~~~~-~~~~~~~~L~ddGnlvl~~~~-~~~~W~Sf~~P  116 (116)
T cd00028          80 SNTTR-VNGNYVLVLLDDGNLVLYDSD-GNFLWQSFDYP  116 (116)
T ss_pred             ecccC-CCCceEEEEeCCCCEEEECCC-CCEEEcCCCCC
Confidence            99875 224788999999999999987 89999999999


No 3  
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=99.92  E-value=5.8e-25  Score=188.34  Aligned_cols=99  Identities=49%  Similarity=1.022  Sum_probs=88.1

Q ss_pred             eecCCCCCeeeeecccccceeeeeEEE---------EEEEeCCe--eEEEEecCCcEEEEEEeCCCCCeeEEeccccccc
Q 010708          206 WRSGPWNGQIFIGIPELKSVYLFRHNF---------TFGFANDW--TFFALTAQGILEERIWIKWKDNWEVGFLNLRTEC  274 (503)
Q Consensus       206 w~sg~~~g~~~~~~~~~~~~~~~~~~~---------~~~~~~~~--~rl~L~~dG~l~~~~~~~~~~~W~~~~~~p~~~C  274 (503)
                      ||+|+|||..|+++|+|.....+.+.|         +|.+.+.+  +|++||++|+||+|.|++..++|.+.|.+|.|+|
T Consensus         1 wrsG~WnG~~f~g~p~~~~~~~~~~~fv~~~~e~~~t~~~~~~s~~~r~~ld~~G~l~~~~w~~~~~~W~~~~~~p~d~C   80 (110)
T PF00954_consen    1 WRSGPWNGQRFSGIPEMSSNSLYNYSFVSNNEEVYYTYSLSNSSVLSRLVLDSDGQLQRYIWNESTQSWSVFWSAPKDQC   80 (110)
T ss_pred             CCccccCCeEECCcccccccceeEEEEEECCCeEEEEEecCCCceEEEEEEeeeeEEEEEEEecCCCcEEEEEEecccCC
Confidence            899999999999999998656666665         34444554  9999999999999999999999999999999999


Q ss_pred             ccccccCCCcccCCCCCCccccCCCCccCC
Q 010708          275 DVYGKCGAFGICNSQEKPICSCLEGFEPKN  304 (503)
Q Consensus       275 ~v~g~CG~~giC~~~~~~~C~C~~Gf~~~~  304 (503)
                      |+|+.||+||+|+.+..+.|+|++||+|++
T Consensus        81 d~y~~CG~~g~C~~~~~~~C~Cl~GF~P~n  110 (110)
T PF00954_consen   81 DVYGFCGPNGICNSNNSPKCSCLPGFEPKN  110 (110)
T ss_pred             CCccccCCccEeCCCCCCceECCCCcCCCc
Confidence            999999999999887788999999999874


No 4  
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=99.91  E-value=8.2e-24  Score=182.47  Aligned_cols=113  Identities=41%  Similarity=0.686  Sum_probs=98.8

Q ss_pred             CccCCCCeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEe
Q 010708           30 QLIRDPDAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWS  109 (503)
Q Consensus        30 ~~l~~~~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Ws  109 (503)
                      +.|..|+.|+|+++.|++|||.... ...+.+|||...+ .++||+|||+.|+.. ++.|.|++||+|+|++.++.++|+
T Consensus         2 ~~l~~~~~l~s~~~~f~~G~~~~~~-q~dgnlV~~~~~~-~~~vW~snt~~~~~~-~~~l~l~~dGnLvl~~~~g~~vW~   78 (114)
T smart00108        2 NTLSSGQTLVSGNSLFELGFFTLIM-QNDYNLILYKSSS-RTVVWVANRDNPVSD-SCTLTLQSDGNLVLYDGDGRVVWS   78 (114)
T ss_pred             cccCCCCEEecCCCcEeeeccccCC-CCCEEEEEEECCC-CcEEEECCCCCCCCC-CEEEEEeCCCCEEEEeCCCCEEEE
Confidence            5678899999999999999998764 3468889998765 789999999999765 389999999999999999999999


Q ss_pred             ecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCC
Q 010708          110 SNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQE  147 (503)
Q Consensus       110 s~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~  147 (503)
                      |++.... ....|+|+|+|||||++.. ++++||||||
T Consensus        79 S~t~~~~-~~~~~~L~ddGnlvl~~~~-~~~~W~Sf~~  114 (114)
T smart00108       79 SNTTGAN-GNYVLVLLDDGNLVIYDSD-GNFLWQSFDY  114 (114)
T ss_pred             ecccCCC-CceEEEEeCCCCEEEECCC-CCEEeCCCCC
Confidence            9986222 4688999999999999987 8899999997


No 5  
>PF08276 PAN_2:  PAN-like domain;  InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=99.44  E-value=1.1e-13  Score=107.02  Aligned_cols=57  Identities=46%  Similarity=1.011  Sum_probs=49.2

Q ss_pred             CCCeEEEeeeeCCCCccccCc---cchhhHHHhhcccCceeeeeecC---CCceeeecccccccc
Q 010708          337 EDGFSKLNKMKVPDFTEWTSP---ATEDECREQCLKNCSCIAYAFDG---GIGCMVWRSINLIDI  395 (503)
Q Consensus       337 ~~~f~~~~~~~~p~~~~~~~~---~~~~~C~~~Cl~nCsC~a~~y~~---g~gC~~~~~~~l~~~  395 (503)
                      +++|+++++|++|+ +...++   .++++|++.||+||||+||+|.+   +++|++|++ +|+|+
T Consensus         4 ~d~F~~l~~~~~p~-~~~~~~~~~~s~~~C~~~Cl~nCsC~Ayay~~~~~~~~C~lW~~-~L~d~   66 (66)
T PF08276_consen    4 GDGFLKLPNMKLPD-FDNAIVDSSVSLEECEKACLSNCSCTAYAYSNLSGGGGCLLWYG-DLVDL   66 (66)
T ss_pred             CCEEEEECCeeCCC-CcceeeecCCCHHHHHhhcCCCCCEeeEEeeccCCCCEEEEEcC-EeecC
Confidence            78999999999999 544442   78999999999999999999984   466999999 88874


No 6  
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins,  plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.33  E-value=1.4e-12  Score=104.30  Aligned_cols=67  Identities=27%  Similarity=0.385  Sum_probs=59.7

Q ss_pred             CCCeEEEeeeeCCCCccccCccchhhHHHhhcc---cCceeeeeecC-CCceeeeccccc-ccceEecCCcceeEEE
Q 010708          337 EDGFSKLNKMKVPDFTEWTSPATEDECREQCLK---NCSCIAYAFDG-GIGCMVWRSINL-IDIQRLPFGGTDLYIR  408 (503)
Q Consensus       337 ~~~f~~~~~~~~p~~~~~~~~~~~~~C~~~Cl~---nCsC~a~~y~~-g~gC~~~~~~~l-~~~~~~~~~~~~~~ir  408 (503)
                      +..|+++.++++|+ +..   .+.++|++.|++   ||||.||+|.+ +.||.+|++ +| +|+++....+.++|+|
T Consensus         8 ~g~fl~~~~~klpd-~~~---~s~~eC~~~Cl~~~~nCsC~Aya~~~~~~gC~~W~~-~l~~d~~~~~~~g~~Ly~r   79 (80)
T cd00129           8 AGTTLIKIALKIKT-TKA---NTADECANRCEKNGLPFSCKAFVFAKARKQCLWFPF-NSMSGVRKEFSHGFDLYEN   79 (80)
T ss_pred             CCeEEEeecccCCc-ccc---cCHHHHHHHHhcCCCCCCceeeeccCCCCCeEEecC-cchhhHHhccCCCceeEeE
Confidence            67899999999999 655   579999999999   99999999975 568999999 99 9998887778999997


No 7  
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=99.29  E-value=6e-12  Score=101.93  Aligned_cols=72  Identities=44%  Similarity=0.931  Sum_probs=58.7

Q ss_pred             CCCeEEEeeeeCCCCccccCc-cchhhHHHhhcccCceeeeeecC-CCceeeecccccccceEecCCcceeEEEee
Q 010708          337 EDGFSKLNKMKVPDFTEWTSP-ATEDECREQCLKNCSCIAYAFDG-GIGCMVWRSINLIDIQRLPFGGTDLYIRVA  410 (503)
Q Consensus       337 ~~~f~~~~~~~~p~~~~~~~~-~~~~~C~~~Cl~nCsC~a~~y~~-g~gC~~~~~~~l~~~~~~~~~~~~~~ir~~  410 (503)
                      .++|+++.++++|+ ...... .++++|++.||+||+|.||+|.+ +++|++|.. ++.+.+.....+..+|||++
T Consensus        11 ~~~f~~~~~~~~~~-~~~~~~~~s~~~C~~~Cl~nCsC~a~~~~~~~~~C~~~~~-~~~~~~~~~~~~~~~yiKv~   84 (84)
T cd01098          11 TDGFLKLPDVKLPD-NASAITAISLEECREACLSNCSCTAYAYNNGSGGCLLWNG-LLNNLRSLSSGGGTLYLRLA   84 (84)
T ss_pred             CCEEEEeCCeeCCC-chhhhccCCHHHHHHHHhcCCCcceeeecCCCCeEEEEec-eecceEeecCCCcEEEEEeC
Confidence            36899999999998 433223 68999999999999999999986 345999998 88887765555689999985


No 8  
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=98.90  E-value=7.8e-09  Score=88.97  Aligned_cols=89  Identities=28%  Similarity=0.454  Sum_probs=67.0

Q ss_pred             EEEEecCCcEEEEeCC-CcEEEeecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCeecccccCCc
Q 010708           88 IITISEDGNLVLVNGQ-KEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMHHGIDQRTGK  166 (503)
Q Consensus        88 ~l~l~~~G~Lvl~~~~-~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~l~~~~~tg~  166 (503)
                      ++.++.||+||+++.. ..++|++++..+......+.|+++|||||++.+ +.++|+|=..                   
T Consensus        23 ~~~~q~dgnlV~~~~~~~~~vW~snt~~~~~~~~~l~l~~dGnLvl~~~~-g~~vW~S~t~-------------------   82 (114)
T smart00108       23 TLIMQNDYNLILYKSSSRTVVWVANRDNPVSDSCTLTLQSDGNLVLYDGD-GRVVWSSNTT-------------------   82 (114)
T ss_pred             ccCCCCCEEEEEEECCCCcEEEECCCCCCCCCCEEEEEeCCCCEEEEeCC-CCEEEEeccc-------------------
Confidence            5677889999999875 579999998654323478999999999999987 8899997220                   


Q ss_pred             eEEEEecCCCCCCCCceEEEEEeeCCCceEEEecCCeeeeecC
Q 010708          167 KVQLTSWKSLSDPSTGSFSAGLIHQNIPEIFVWNVSRPYWRSG  209 (503)
Q Consensus       167 ~~~L~Sw~s~~dps~G~y~l~~~~~g~~~~~~~~~~~~yw~sg  209 (503)
                                  ...+.|.+.|+++|+..++- ..+++.|.|.
T Consensus        83 ------------~~~~~~~~~L~ddGnlvl~~-~~~~~~W~Sf  112 (114)
T smart00108       83 ------------GANGNYVLVLLDDGNLVIYD-SDGNFLWQSF  112 (114)
T ss_pred             ------------CCCCceEEEEeCCCCEEEEC-CCCCEEeCCC
Confidence                        12356889999999865542 2346788764


No 9  
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=98.79  E-value=2.6e-08  Score=86.01  Aligned_cols=90  Identities=26%  Similarity=0.436  Sum_probs=67.7

Q ss_pred             EEEEec-CCcEEEEeCC-CcEEEeecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCeecccccCC
Q 010708           88 IITISE-DGNLVLVNGQ-KEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMHHGIDQRTG  165 (503)
Q Consensus        88 ~l~l~~-~G~Lvl~~~~-~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~l~~~~~tg  165 (503)
                      .+.++. ||+||+++.. ..++|++++..+......+.|+++|||||++.+ +.++|+|--..                 
T Consensus        23 ~~~~q~~dgnlv~~~~~~~~~vW~snt~~~~~~~~~l~l~~dGnLvl~~~~-g~~vW~S~~~~-----------------   84 (116)
T cd00028          23 KLIMQSRDYNLILYKGSSRTVVWVANRDNPSGSSCTLTLQSDGNLVIYDGS-GTVVWSSNTTR-----------------   84 (116)
T ss_pred             cCCCCCCeEEEEEEeCCCCeEEEECCCCCCCCCCEEEEEecCCCeEEEcCC-CcEEEEecccC-----------------
Confidence            466776 9999999875 479999998653224678999999999999987 88999874310                 


Q ss_pred             ceEEEEecCCCCCCCCceEEEEEeeCCCceEEEecCCeeeeecCC
Q 010708          166 KKVQLTSWKSLSDPSTGSFSAGLIHQNIPEIFVWNVSRPYWRSGP  210 (503)
Q Consensus       166 ~~~~L~Sw~s~~dps~G~y~l~~~~~g~~~~~~~~~~~~yw~sg~  210 (503)
                                    ..+.+.+.|+++|+..++- ..+.+.|.|..
T Consensus        85 --------------~~~~~~~~L~ddGnlvl~~-~~~~~~W~Sf~  114 (116)
T cd00028          85 --------------VNGNYVLVLLDDGNLVLYD-SDGNFLWQSFD  114 (116)
T ss_pred             --------------CCCceEEEEeCCCCEEEEC-CCCCEEEcCCC
Confidence                          2356889999999855543 23567898764


No 10 
>PF01453 B_lectin:  D-mannose binding lectin;  InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Aliaceae) contains a ~115-residue-long domain whose overall three dimensional fold is very similar to that of [, ]:  Dictyostelium discoideum comitin, an actin binding protein Curculigo latifolia curculin, a sweet tasting and taste-modifying protein   This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity.  Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparrallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=98.74  E-value=1.2e-07  Score=81.63  Aligned_cols=100  Identities=27%  Similarity=0.450  Sum_probs=70.5

Q ss_pred             CeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEe-CCCCCCcCCccEEEEecCCcEEEEeCCCcEEEeecCcC
Q 010708           36 DAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVA-NRDNPLKDSSGIITISEDGNLVLVNGQKEVLWSSNVSN  114 (503)
Q Consensus        36 ~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~a-nr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Wss~~~~  114 (503)
                      +.+.+.+|.+.|-|+.+|+     | +.|..  ...+||.+ +...... ..+.+.|++||||||+|..+.++|+|... 
T Consensus        12 ~p~~~~s~~~~L~l~~dGn-----L-vl~~~--~~~~iWss~~t~~~~~-~~~~~~L~~~GNlvl~d~~~~~lW~Sf~~-   81 (114)
T PF01453_consen   12 SPLTSSSGNYTLILQSDGN-----L-VLYDS--NGSVIWSSNNTSGRGN-SGCYLVLQDDGNLVLYDSSGNVLWQSFDY-   81 (114)
T ss_dssp             EEEEECETTEEEEEETTSE-----E-EEEET--TTEEEEE--S-TTSS--SSEEEEEETTSEEEEEETTSEEEEESTTS-
T ss_pred             cccccccccccceECCCCe-----E-EEEcC--CCCEEEEecccCCccc-cCeEEEEeCCCCEEEEeecceEEEeecCC-
Confidence            4566656999999999985     2 33432  36789999 5454432 35899999999999999999999999532 


Q ss_pred             CccccceEEEcc--CCCEEEEeccCCcEEEeecCCCC
Q 010708          115 LVNNSTSAQLLD--SGNLVLRDNINRAIVWESFQEPT  149 (503)
Q Consensus       115 ~~~~~~~~~L~d--~GNlVl~~~~~~~~~WqSFd~PT  149 (503)
                      ++  .+.+.+++  +||++ +... ..+.|.|=+.|.
T Consensus        82 pt--dt~L~~q~l~~~~~~-~~~~-~~~sw~s~~dps  114 (114)
T PF01453_consen   82 PT--DTLLPGQKLGDGNVT-GKND-SLTSWSSNTDPS  114 (114)
T ss_dssp             SS---EEEEEET--TSEEE-EEST-SSEEEESS----
T ss_pred             Cc--cEEEeccCcccCCCc-cccc-eEEeECCCCCCC
Confidence            33  57788888  99999 7654 678999987773


No 11 
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=98.31  E-value=1.6e-06  Score=68.58  Aligned_cols=70  Identities=37%  Similarity=0.835  Sum_probs=52.3

Q ss_pred             CCCeEEEeeeeCCCCccccCc--cchhhHHHhhcc-cCceeeeeecC-CCceeeec-ccccccceEecCCcceeEEE
Q 010708          337 EDGFSKLNKMKVPDFTEWTSP--ATEDECREQCLK-NCSCIAYAFDG-GIGCMVWR-SINLIDIQRLPFGGTDLYIR  408 (503)
Q Consensus       337 ~~~f~~~~~~~~p~~~~~~~~--~~~~~C~~~Cl~-nCsC~a~~y~~-g~gC~~~~-~~~l~~~~~~~~~~~~~~ir  408 (503)
                      ...|.+++++.+++ ......  .+.++|++.|++ +|+|.||.|.. +.+|.+|. + ++.+.......+..+|.|
T Consensus         3 ~~~f~~~~~~~l~~-~~~~~~~~~s~~~C~~~C~~~~~~C~s~~y~~~~~~C~l~~~~-~~~~~~~~~~~~~~~y~~   77 (78)
T smart00473        3 DDCFVRLPNTKLPG-FSRIVISVASLEECASKCLNSNCSCRSFTYNNGTKGCLLWSES-SLGDARLFPSGGVDLYEK   77 (78)
T ss_pred             CceeEEecCccCCC-CcceeEcCCCHHHHHHHhCCCCCceEEEEEcCCCCEEEEeeCC-ccccceecccCCceeEEe
Confidence            45688888888876 332212  689999999999 99999999985 44599999 6 777776444455666654


No 12 
>KOG1187 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=98.02  E-value=3.2e-06  Score=87.42  Aligned_cols=37  Identities=46%  Similarity=0.774  Sum_probs=34.8

Q ss_pred             CCCcccHHHHHHHhcCCCcCCcccccCCccEEEEEcC
Q 010708          467 DLPLFQFEELATATNNFQLSSKLGQGGFGPVYWVMVH  503 (503)
Q Consensus       467 ~~~~fs~~el~~AT~nFs~~n~LG~GgfG~VYKG~L~  503 (503)
                      ..+.|+|.+|.+||++|+++++||+||||.||||+|+
T Consensus        61 ~~~~fs~~el~~AT~~Fs~~~~ig~Ggfg~VYkG~l~   97 (361)
T KOG1187|consen   61 PLRSFSYDELRKATNNFSESNLIGEGGFGTVYKGVLS   97 (361)
T ss_pred             CcceeeHHHHHHHHhCCchhcceecCCCeEEEEEEEC
Confidence            5677999999999999999999999999999999984


No 13 
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=97.20  E-value=0.0003  Score=55.29  Aligned_cols=32  Identities=19%  Similarity=0.699  Sum_probs=27.7

Q ss_pred             cchhhHHHhhcccCceeeeeecCCCc-eeeecc
Q 010708          358 ATEDECREQCLKNCSCIAYAFDGGIG-CMVWRS  389 (503)
Q Consensus       358 ~~~~~C~~~Cl~nCsC~a~~y~~g~g-C~~~~~  389 (503)
                      .+.++|++.|+.+|+|.||.|..+.+ |+++..
T Consensus        25 ~s~~~Cq~~C~~~~~C~afT~~~~~~~C~lk~~   57 (73)
T cd01100          25 SSAEQCQAACTADPGCLAFTYNTKSKKCFLKSS   57 (73)
T ss_pred             CCHHHHHHHcCCCCCceEEEEECCCCeEEcccC
Confidence            57899999999999999999986545 999765


No 14 
>PLN00113 leucine-rich repeat receptor-like protein kinase; Provisional
Probab=94.92  E-value=0.036  Score=64.94  Aligned_cols=29  Identities=10%  Similarity=0.237  Sum_probs=22.6

Q ss_pred             ccHHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708          471 FQFEELATATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       471 fs~~el~~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ++++++.   ..+.+.++||+|+||.||||+.
T Consensus       683 ~~~~~~~---~~~~~~~~ig~G~~g~Vy~~~~  711 (968)
T PLN00113        683 ITINDIL---SSLKEENVISRGKKGASYKGKS  711 (968)
T ss_pred             hhHHHHH---hhCCcccEEccCCCeeEEEEEE
Confidence            4455544   4577889999999999999974


No 15 
>KOG3653 consensus Transforming growth factor beta/activin receptor subfamily of serine/threonine kinases [Signal transduction mechanisms]
Probab=92.95  E-value=0.035  Score=57.98  Aligned_cols=17  Identities=29%  Similarity=0.597  Sum_probs=15.4

Q ss_pred             CcccccCCccEEEEEcC
Q 010708          487 SKLGQGGFGPVYWVMVH  503 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L~  503 (503)
                      .+||+|+||.||||.|.
T Consensus       216 eli~~Grfg~V~KaqL~  232 (534)
T KOG3653|consen  216 ELIGRGRFGCVWKAQLD  232 (534)
T ss_pred             HHhhcCccceeehhhcc
Confidence            68999999999999873


No 16 
>PF00024 PAN_1:  PAN domain This Prosite entry concerns apple domains, a subset of PAN domains;  InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=90.02  E-value=0.4  Score=37.31  Aligned_cols=49  Identities=18%  Similarity=0.506  Sum_probs=35.0

Q ss_pred             eEEEeeeeCCCCccccCc--cchhhHHHhhcccCc-eeeeeecCCCc-eeeecc
Q 010708          340 FSKLNKMKVPDFTEWTSP--ATEDECREQCLKNCS-CIAYAFDGGIG-CMVWRS  389 (503)
Q Consensus       340 f~~~~~~~~p~~~~~~~~--~~~~~C~~~Cl~nCs-C~a~~y~~g~g-C~~~~~  389 (503)
                      |.++.+..+.. ......  .++++|.+.|+.+=. |.+|.|....+ |.+...
T Consensus         4 f~~~~~~~l~~-~~~~~~~v~s~~~C~~~C~~~~~~C~s~~y~~~~~~C~L~~~   56 (79)
T PF00024_consen    4 FERIPGYRLSG-HSIKEINVPSLEECAQLCLNEPRRCKSFNYDPSSKTCYLSSS   56 (79)
T ss_dssp             EEEEEEEEEES-CEEEEEEESSHHHHHHHHHHSTT-ESEEEEETTTTEEEEECS
T ss_pred             eEEECCEEEeC-CcceEEcCCCHHHHHhhcCcCcccCCeEEEECCCCEEEEcCC
Confidence            55666665544 221111  479999999999999 99999986544 998755


No 17 
>PLN03224 probable serine/threonine protein kinase; Provisional
Probab=89.27  E-value=0.16  Score=55.04  Aligned_cols=24  Identities=38%  Similarity=0.696  Sum_probs=21.0

Q ss_pred             HhcCCCcCCcccccCCccEEEEEc
Q 010708          479 ATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       479 AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ..++|...++||+|+||+||||.+
T Consensus       143 ~~d~F~i~~~LG~GgFG~VYkG~~  166 (507)
T PLN03224        143 SSDDFQLRDKLGGGNFGITFEGLR  166 (507)
T ss_pred             cccCceEeeEeecCCCeEEEEEEe
Confidence            367899999999999999999963


No 18 
>KOG1025 consensus Epidermal growth factor receptor EGFR and related tyrosine kinases [Signal transduction mechanisms]
Probab=89.15  E-value=0.34  Score=54.20  Aligned_cols=19  Identities=37%  Similarity=0.601  Sum_probs=15.8

Q ss_pred             CcCCcccccCCccEEEEEc
Q 010708          484 QLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       484 s~~n~LG~GgfG~VYKG~L  502 (503)
                      .+..+||+|.||+||||+.
T Consensus       699 kk~kvLGsgAfGtV~kGiw  717 (1177)
T KOG1025|consen  699 KKDKVLGSGAFGTVYKGIW  717 (1177)
T ss_pred             hhhceeccccceeEEeeeE
Confidence            4447899999999999963


No 19 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.79  E-value=0.18  Score=34.93  Aligned_cols=32  Identities=31%  Similarity=0.669  Sum_probs=25.0

Q ss_pred             cccccc-cccCCCcccC-CCCCCccccCCCCccC
Q 010708          272 TECDVY-GKCGAFGICN-SQEKPICSCLEGFEPK  303 (503)
Q Consensus       272 ~~C~v~-g~CG~~giC~-~~~~~~C~C~~Gf~~~  303 (503)
                      |.|... ..|..++.|. ..++..|.|++||+..
T Consensus         3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             STTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            567764 5899999995 3467899999999843


No 20 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=86.57  E-value=15  Score=38.31  Aligned_cols=77  Identities=19%  Similarity=0.359  Sum_probs=48.8

Q ss_pred             CCCCeEEEEeCCCC----------------CCcCCccEEEEe-cCCcEEEEeC-CCcEEEeecCcCCc------cccceE
Q 010708           67 PSEKAVIWVANRDN----------------PLKDSSGIITIS-EDGNLVLVNG-QKEVLWSSNVSNLV------NNSTSA  122 (503)
Q Consensus        67 ~~~~~vvW~anr~~----------------p~~~~~~~l~l~-~~G~Lvl~~~-~~~~~Wss~~~~~~------~~~~~~  122 (503)
                      ..+..++|..+-..                |+.. ...+.+. .+|.|+-+|. +|.++|+....+..      .+....
T Consensus        86 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~ssP~v~~~~v~  164 (394)
T PRK11138         86 ADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVA-GGKVYIGSEKGQVYALNAEDGEVAWQTKVAGEALSRPVVSDGLVL  164 (394)
T ss_pred             CCCCcEeeEEcCCCcccccccccccccccccEEE-CCEEEEEcCCCEEEEEECCCCCCcccccCCCceecCCEEECCEEE
Confidence            34678899876433                1111 2344444 5688888885 68999998754321      012223


Q ss_pred             EEccCCCEEEEeccCCcEEEee
Q 010708          123 QLLDSGNLVLRDNINRAIVWES  144 (503)
Q Consensus       123 ~L~d~GNlVl~~~~~~~~~WqS  144 (503)
                      ....+|.|+-.|..+|+++|+-
T Consensus       165 v~~~~g~l~ald~~tG~~~W~~  186 (394)
T PRK11138        165 VHTSNGMLQALNESDGAVKWTV  186 (394)
T ss_pred             EECCCCEEEEEEccCCCEeeee
Confidence            3456788888888779999986


No 21 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=83.47  E-value=0.81  Score=29.33  Aligned_cols=30  Identities=23%  Similarity=0.615  Sum_probs=22.3

Q ss_pred             cccccccCCCcccCCC-CCCccccCCCCccC
Q 010708          274 CDVYGKCGAFGICNSQ-EKPICSCLEGFEPK  303 (503)
Q Consensus       274 C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~~  303 (503)
                      |.....|..++.|... +...|.|++||...
T Consensus         2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            4435678888999653 56789999999743


No 22 
>PF14295 PAN_4:  PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=82.36  E-value=0.96  Score=32.05  Aligned_cols=23  Identities=26%  Similarity=0.810  Sum_probs=17.5

Q ss_pred             cchhhHHHhhcccCceeeeeecC
Q 010708          358 ATEDECREQCLKNCSCIAYAFDG  380 (503)
Q Consensus       358 ~~~~~C~~~Cl~nCsC~a~~y~~  380 (503)
                      .+.++|.++|..+=.|.++.|..
T Consensus        16 ~s~~~C~~~C~~~~~C~~~~~~~   38 (51)
T PF14295_consen   16 SSPEECQAACAADPGCQAFTFNP   38 (51)
T ss_dssp             --HHHHHHHHHTSTT--EEEEET
T ss_pred             CCHHHHHHHccCCCCCCEEEEEC
Confidence            57899999999999999999874


No 23 
>PLN03225 Serine/threonine-protein kinase SNT7; Provisional
Probab=82.22  E-value=0.53  Score=51.88  Aligned_cols=24  Identities=42%  Similarity=0.787  Sum_probs=21.2

Q ss_pred             HhcCCCcCCcccccCCccEEEEEc
Q 010708          479 ATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       479 AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ..++|...++||+|+||.||||++
T Consensus       130 ~~~~y~l~~~LG~G~FG~VYka~~  153 (566)
T PLN03225        130 KKDDFVLGKKLGEGAFGVVYKASL  153 (566)
T ss_pred             ccCCeEEeEEEeeCCCeEEEEEEE
Confidence            567788889999999999999975


No 24 
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=82.16  E-value=1.4  Score=35.07  Aligned_cols=32  Identities=19%  Similarity=0.613  Sum_probs=26.2

Q ss_pred             cchhhHHHhhcccCceeeeeecC-CC---ceeeecc
Q 010708          358 ATEDECREQCLKNCSCIAYAFDG-GI---GCMVWRS  389 (503)
Q Consensus       358 ~~~~~C~~~Cl~nCsC~a~~y~~-g~---gC~~~~~  389 (503)
                      .+.++|+++|..+=.|.+|.|.. ..   .|+++..
T Consensus        22 ~~~~~Cq~~Ct~~~~C~~FTf~~~~~~~~~C~LK~s   57 (79)
T smart00223       22 PSAQVCQKRCTSHPRCLFFTFSTNEPPEEKCLLKDS   57 (79)
T ss_pred             CCHHHHHHhhcCCCCccEEEeeCCCCCCCEeEeCcC
Confidence            56899999999999999999974 23   5888643


No 25 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=79.70  E-value=1.5  Score=29.03  Aligned_cols=30  Identities=27%  Similarity=0.613  Sum_probs=22.4

Q ss_pred             cccccccccCCCcccCC-CCCCccccCCCCc
Q 010708          272 TECDVYGKCGAFGICNS-QEKPICSCLEGFE  301 (503)
Q Consensus       272 ~~C~v~g~CG~~giC~~-~~~~~C~C~~Gf~  301 (503)
                      +.|.....|...+.|.. .+...|.|++||.
T Consensus         3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179        3 DECASGNPCQNGGTCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             ccCcCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence            45665467888889964 3467899999997


No 26 
>KOG0196 consensus Tyrosine kinase, EPH (ephrin) receptor family [Signal transduction mechanisms]
Probab=76.96  E-value=0.87  Score=50.80  Aligned_cols=34  Identities=24%  Similarity=0.426  Sum_probs=25.1

Q ss_pred             cccHHHHHHHhcCCC---------cCCcccccCCccEEEEEcC
Q 010708          470 LFQFEELATATNNFQ---------LSSKLGQGGFGPVYWVMVH  503 (503)
Q Consensus       470 ~fs~~el~~AT~nFs---------~~n~LG~GgfG~VYKG~L~  503 (503)
                      .++|+|=-+|...|.         -+.+||.|-||-||+|+|+
T Consensus       609 P~TYEDPnqAvreFakEId~s~i~Ie~VIGaGEFGEVc~GrLk  651 (996)
T KOG0196|consen  609 PHTYEDPNQAVREFAKEIDPSCVKIEKVIGAGEFGEVCSGRLK  651 (996)
T ss_pred             CccccCccHHHHHhhhhcChhheEEEEEEecccccceeccccc
Confidence            467776555555543         2378999999999999984


No 27 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=76.44  E-value=1.9  Score=28.02  Aligned_cols=25  Identities=28%  Similarity=0.650  Sum_probs=19.8

Q ss_pred             cccCCCcccCCCCCCccccCCCCccC
Q 010708          278 GKCGAFGICNSQEKPICSCLEGFEPK  303 (503)
Q Consensus       278 g~CG~~giC~~~~~~~C~C~~Gf~~~  303 (503)
                      .+|..+|+|... ...|.|.+||...
T Consensus         6 ~~C~~~G~C~~~-~g~C~C~~g~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSP-CGRCVCDSGYTGP   30 (32)
T ss_pred             CccCCCCEEeCC-CCEEECCCCCcCC
Confidence            479999999753 4689999999753


No 28 
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=76.06  E-value=4.1  Score=32.35  Aligned_cols=32  Identities=22%  Similarity=0.663  Sum_probs=26.3

Q ss_pred             cchhhHHHhhcc--cCceeeeeecCCCc-eeeecc
Q 010708          358 ATEDECREQCLK--NCSCIAYAFDGGIG-CMVWRS  389 (503)
Q Consensus       358 ~~~~~C~~~Cl~--nCsC~a~~y~~g~g-C~~~~~  389 (503)
                      .+.++|.+.|++  +=.|.+|.|....+ |.+-..
T Consensus        25 ~s~~~C~~~C~~~~~f~CrSf~y~~~~~~C~L~~~   59 (80)
T cd01099          25 ASLEECLRKCLEETEFTCRSFNYNYKSKECILSDE   59 (80)
T ss_pred             CCHHHHHHHhCCCCCceEeEEEEEcCCCEEEEeCC
Confidence            578999999999  88999999975444 988543


No 29 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=75.23  E-value=2  Score=26.01  Aligned_cols=12  Identities=33%  Similarity=0.830  Sum_probs=10.2

Q ss_pred             CccccCCCCccC
Q 010708          292 PICSCLEGFEPK  303 (503)
Q Consensus       292 ~~C~C~~Gf~~~  303 (503)
                      ..|.|++||+..
T Consensus         2 y~C~C~~Gy~l~   13 (24)
T PF12662_consen    2 YTCSCPPGYQLS   13 (24)
T ss_pred             EEeeCCCCCcCC
Confidence            479999999965


No 30 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=73.30  E-value=0.95  Score=23.17  Aligned_cols=11  Identities=27%  Similarity=0.839  Sum_probs=7.8

Q ss_pred             ccccCCCCccC
Q 010708          293 ICSCLEGFEPK  303 (503)
Q Consensus       293 ~C~C~~Gf~~~  303 (503)
                      .|.|++||...
T Consensus         1 ~C~C~~G~~G~   11 (13)
T PF12661_consen    1 TCQCPPGWTGP   11 (13)
T ss_dssp             EEEE-TTEETT
T ss_pred             CccCcCCCcCC
Confidence            48999999754


No 31 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=72.97  E-value=2.9  Score=27.08  Aligned_cols=31  Identities=29%  Similarity=0.613  Sum_probs=22.3

Q ss_pred             cccccccccCCCcccCCC-CCCccccCCCCcc
Q 010708          272 TECDVYGKCGAFGICNSQ-EKPICSCLEGFEP  302 (503)
Q Consensus       272 ~~C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~  302 (503)
                      +.|.....|...+.|... +...|.|++||..
T Consensus         3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054           3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             ccCCCCCCcCCCCEeECCCCCeEeECCCCCcC
Confidence            456554578888899643 4568999999874


No 32 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=70.40  E-value=3.4  Score=29.79  Aligned_cols=33  Identities=18%  Similarity=0.510  Sum_probs=27.2

Q ss_pred             cccccccccccccCCCcccCCCCCCccccCCCCccC
Q 010708          268 LNLRTECDVYGKCGAFGICNSQEKPICSCLEGFEPK  303 (503)
Q Consensus       268 ~~p~~~C~v~g~CG~~giC~~~~~~~C~C~~Gf~~~  303 (503)
                      ..|.+.|....-|-.+++|..   ..|.|++||.+.
T Consensus        16 ~~~g~~C~~~~qC~~~s~C~~---g~C~C~~g~~~~   48 (52)
T PF01683_consen   16 VQPGESCESDEQCIGGSVCVN---GRCQCPPGYVEV   48 (52)
T ss_pred             CCCCCCCCCcCCCCCcCEEcC---CEeECCCCCEec
Confidence            346678999999999999953   589999999765


No 33 
>KOG1219 consensus Uncharacterized conserved protein, contains laminin, cadherin and EGF domains [Signal transduction mechanisms]
Probab=68.75  E-value=6.7  Score=48.82  Aligned_cols=27  Identities=26%  Similarity=0.525  Sum_probs=21.2

Q ss_pred             cccCCCcccCCC--CCCccccCCCCccCC
Q 010708          278 GKCGAFGICNSQ--EKPICSCLEGFEPKN  304 (503)
Q Consensus       278 g~CG~~giC~~~--~~~~C~C~~Gf~~~~  304 (503)
                      ..|---|.|+..  ++..|.||+-|..+.
T Consensus      3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~ 3898 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNH 3898 (4289)
T ss_pred             CcccCCCEecCCCCCceEEeCcccccCcc
Confidence            567788899754  577999999998764


No 34 
>PTZ00284 protein kinase; Provisional
Probab=68.16  E-value=1.3  Score=47.33  Aligned_cols=30  Identities=33%  Similarity=0.473  Sum_probs=23.5

Q ss_pred             HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708          473 FEELATATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      .+++...++.|...++||+|+||+||+++.
T Consensus       121 ~~~~~~~~~~y~i~~~lG~G~fg~V~~a~~  150 (467)
T PTZ00284        121 GEDIDVSTQRFKILSLLGEGTFGKVVEAWD  150 (467)
T ss_pred             CCccccCCCcEEEEEEEEeccCEEEEEEEE
Confidence            344555677788778999999999999864


No 35 
>PF08277 PAN_3:  PAN-like domain;  InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=67.11  E-value=15  Score=28.04  Aligned_cols=30  Identities=23%  Similarity=0.703  Sum_probs=25.8

Q ss_pred             cchhhHHHhhcccCceeeeeecCCCceeeec
Q 010708          358 ATEDECREQCLKNCSCIAYAFDGGIGCMVWR  388 (503)
Q Consensus       358 ~~~~~C~~~Cl~nCsC~a~~y~~g~gC~~~~  388 (503)
                      .+-++|-+.|.++=.|..+.+. ...|.++.
T Consensus        20 ~sw~~Cv~~C~~~~~C~la~~~-~~~C~~y~   49 (71)
T PF08277_consen   20 TSWDDCVQKCYNDENCVLAYFD-SGKCYLYN   49 (71)
T ss_pred             CCHHHHhHHhCCCCEEEEEEeC-CCCEEEEE
Confidence            5679999999999999998887 56798864


No 36 
>KOG0193 consensus Serine/threonine protein kinase RAF [Signal transduction mechanisms]
Probab=67.03  E-value=2.5  Score=45.92  Aligned_cols=16  Identities=38%  Similarity=0.734  Sum_probs=14.4

Q ss_pred             CcccccCCccEEEEEc
Q 010708          487 SKLGQGGFGPVYWVMV  502 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L  502 (503)
                      ..||+|.||+||||.-
T Consensus       398 ~rIGsGsFGtV~Rg~w  413 (678)
T KOG0193|consen  398 ERIGSGSFGTVYRGRW  413 (678)
T ss_pred             ceeccccccceeeccc
Confidence            6899999999999963


No 37 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=66.41  E-value=1.3  Score=28.65  Aligned_cols=26  Identities=27%  Similarity=0.613  Sum_probs=19.3

Q ss_pred             cccCCCcccCC-C-CCCccccCCCCccC
Q 010708          278 GKCGAFGICNS-Q-EKPICSCLEGFEPK  303 (503)
Q Consensus       278 g~CG~~giC~~-~-~~~~C~C~~Gf~~~  303 (503)
                      ..|...|.|.. . ....|.|++||...
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            36777888853 2 56789999999753


No 38 
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=65.88  E-value=64  Score=30.27  Aligned_cols=78  Identities=23%  Similarity=0.384  Sum_probs=48.9

Q ss_pred             CCCCeEEEEeCCCCCCcCC----cc-EEEEecCCcEEEEe-CCCcEEEee-cCcC---C--cc------ccceEEEccCC
Q 010708           67 PSEKAVIWVANRDNPLKDS----SG-IITISEDGNLVLVN-GQKEVLWSS-NVSN---L--VN------NSTSAQLLDSG  128 (503)
Q Consensus        67 ~~~~~vvW~anr~~p~~~~----~~-~l~l~~~G~Lvl~~-~~~~~~Wss-~~~~---~--~~------~~~~~~L~d~G  128 (503)
                      ....+++|....+.++...    .. .+....+|.|..+| .+|.++|.. ....   .  ..      +...+....+|
T Consensus        53 ~~tG~~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g  132 (238)
T PF13360_consen   53 AKTGKVLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSG  132 (238)
T ss_dssp             TTTSEEEEEEECSSCGGSGEEEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCS
T ss_pred             CCCCCEEEEeeccccccceeeecccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccC
Confidence            3568899998876654322    13 34455678888888 689999994 3221   0  00      01122233488


Q ss_pred             CEEEEeccCCcEEEee
Q 010708          129 NLVLRDNINRAIVWES  144 (503)
Q Consensus       129 NlVl~~~~~~~~~WqS  144 (503)
                      .|+..|..+|+++|+-
T Consensus       133 ~l~~~d~~tG~~~w~~  148 (238)
T PF13360_consen  133 KLVALDPKTGKLLWKY  148 (238)
T ss_dssp             EEEEEETTTTEEEEEE
T ss_pred             cEEEEecCCCcEEEEe
Confidence            9999987669999986


No 39 
>smart00605 CW CW domain.
Probab=65.69  E-value=18  Score=29.47  Aligned_cols=53  Identities=17%  Similarity=0.473  Sum_probs=34.2

Q ss_pred             cchhhHHHhhcccCceeeeeecCCCceeeecccccccceEecC-CcceeEEEee
Q 010708          358 ATEDECREQCLKNCSCIAYAFDGGIGCMVWRSINLIDIQRLPF-GGTDLYIRVA  410 (503)
Q Consensus       358 ~~~~~C~~~Cl~nCsC~a~~y~~g~gC~~~~~~~l~~~~~~~~-~~~~~~ir~~  410 (503)
                      .+-++|...|..+..|..+.......|++..-..+..+++... .+..+=+|+.
T Consensus        22 ~sw~~Ci~~C~~~~~Cvlay~~~~~~C~~f~~~~~~~v~~~~~~~~~~VAfK~~   75 (94)
T smart00605       22 LSWDECIQKCYEDSNCVLAYGNSSETCYLFSYGTVLTVKKLSSSSGKKVAFKVS   75 (94)
T ss_pred             CCHHHHHHHHhCCCceEEEecCCCCceEEEEcCCeEEEEEccCCCCcEEEEEEe
Confidence            5689999999999999987654335698753214555555443 3344555554


No 40 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=64.11  E-value=1.5  Score=29.34  Aligned_cols=27  Identities=26%  Similarity=0.690  Sum_probs=19.1

Q ss_pred             ccccCCCcccCCC-CCCccccCCCCccC
Q 010708          277 YGKCGAFGICNSQ-EKPICSCLEGFEPK  303 (503)
Q Consensus       277 ~g~CG~~giC~~~-~~~~C~C~~Gf~~~  303 (503)
                      .+-|.++..|... +...|.|.+||...
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCccC
Confidence            4678999999653 46789999999864


No 41 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=63.68  E-value=4.2  Score=26.65  Aligned_cols=18  Identities=28%  Similarity=0.654  Sum_probs=13.9

Q ss_pred             ccCCCCCCccccCCCCcc
Q 010708          285 ICNSQEKPICSCLEGFEP  302 (503)
Q Consensus       285 iC~~~~~~~C~C~~Gf~~  302 (503)
                      .|..+....|.||.||-.
T Consensus        11 ~CDpn~~~~C~CPeGyIl   28 (34)
T PF09064_consen   11 DCDPNSPGQCFCPEGYIL   28 (34)
T ss_pred             ccCCCCCCceeCCCceEe
Confidence            566555679999999974


No 42 
>KOG0194 consensus Protein tyrosine kinase [Signal transduction mechanisms]
Probab=60.72  E-value=4.4  Score=43.42  Aligned_cols=16  Identities=44%  Similarity=0.790  Sum_probs=15.2

Q ss_pred             CcccccCCccEEEEEc
Q 010708          487 SKLGQGGFGPVYWVMV  502 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L  502 (503)
                      .+||+|.||.||+|+|
T Consensus       163 kkLGeGaFGeV~~G~l  178 (474)
T KOG0194|consen  163 KKLGEGAFGEVFKGKL  178 (474)
T ss_pred             ceeecccccEEEEEEE
Confidence            7999999999999987


No 43 
>KOG1035 consensus eIF-2alpha kinase GCN2 [Translation, ribosomal structure and biogenesis]
Probab=59.75  E-value=3  Score=48.84  Aligned_cols=21  Identities=48%  Similarity=0.757  Sum_probs=17.1

Q ss_pred             cCCCcCCcccccCCccEEEEE
Q 010708          481 NNFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       481 ~nFs~~n~LG~GgfG~VYKG~  501 (503)
                      +.|-+-..||+||||.|||.+
T Consensus       479 ~DFEEL~lLGkGGFG~VvkVR  499 (1351)
T KOG1035|consen  479 NDFEELELLGKGGFGSVVKVR  499 (1351)
T ss_pred             hhhHHHHHhcCCCCceEEEEe
Confidence            445555789999999999985


No 44 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=58.38  E-value=50  Score=34.32  Aligned_cols=18  Identities=17%  Similarity=0.398  Sum_probs=8.5

Q ss_pred             cCCcEEEEeC-CCcEEEee
Q 010708           93 EDGNLVLVNG-QKEVLWSS  110 (503)
Q Consensus        93 ~~G~Lvl~~~-~~~~~Wss  110 (503)
                      .+|.|+..|. +|..+|+.
T Consensus       302 ~~g~l~ald~~tG~~~W~~  320 (394)
T PRK11138        302 QNDRVYALDTRGGVELWSQ  320 (394)
T ss_pred             CCCeEEEEECCCCcEEEcc
Confidence            3455554443 34455544


No 45 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=56.70  E-value=53  Score=33.77  Aligned_cols=51  Identities=16%  Similarity=0.382  Sum_probs=27.8

Q ss_pred             cCCcEEEEe-CCCcEEEeecCcCCc------cccceEEEccCCCEEEEeccCCcEEEe
Q 010708           93 EDGNLVLVN-GQKEVLWSSNVSNLV------NNSTSAQLLDSGNLVLRDNINRAIVWE  143 (503)
Q Consensus        93 ~~G~Lvl~~-~~~~~~Wss~~~~~~------~~~~~~~L~d~GNlVl~~~~~~~~~Wq  143 (503)
                      .+|.|+-+| .+|.++|........      .+.....-..+|+|+-.|..+|+++|+
T Consensus        73 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~  130 (377)
T TIGR03300        73 ADGTVVALDAETGKRLWRVDLDERLSGGVGADGGLVFVGTEKGEVIALDAEDGKELWR  130 (377)
T ss_pred             CCCeEEEEEccCCcEeeeecCCCCcccceEEcCCEEEEEcCCCEEEEEECCCCcEeee
Confidence            446676666 467777876543211      011122224466666666655777775


No 46 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=55.99  E-value=8.9  Score=24.62  Aligned_cols=24  Identities=29%  Similarity=0.683  Sum_probs=18.1

Q ss_pred             cccCCCcccCC-CCCCccccCCCCcc
Q 010708          278 GKCGAFGICNS-QEKPICSCLEGFEP  302 (503)
Q Consensus       278 g~CG~~giC~~-~~~~~C~C~~Gf~~  302 (503)
                      ..|... .|.. .+...|.|++||+.
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCcc
Confidence            457666 8854 35788999999975


No 47 
>cd05622 STKc_ROCK1 Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase 1. Serine/Threonine Kinases (STKs), ROCK subfamily, ROCK1 (or ROK-beta) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK contains an N-terminal extension, a catalytic kinase domain, and a C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain, and is activated via interaction with Rho GTPases. ROCK1 is preferentially expressed in the liver, lung, spleen, testes, an
Probab=54.73  E-value=4.1  Score=42.08  Aligned_cols=30  Identities=23%  Similarity=0.534  Sum_probs=23.6

Q ss_pred             HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708          473 FEELATATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      +.++....++|....+||+|+||.||+++.
T Consensus        35 ~~~~~~~~~~y~i~~~lG~G~fg~Vy~~~~   64 (371)
T cd05622          35 IRDLRMKAEDYEVVKVIGRGAFGEVQLVRH   64 (371)
T ss_pred             HhhcCcchhhcEEEEEEeecCCeEEEEEEE
Confidence            445555667788778999999999999863


No 48 
>cd05106 PTKc_CSF-1R Catalytic domain of the Protein Tyrosine Kinase, Colony-Stimulating Factor-1 Receptor. Protein Tyrosine Kinase (PTK) family; Colony-Stimulating Factor-1 Receptor (CSF-1R); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CSF-1R, also called c-Fms, is a member of the Platelet Derived Growth Factor Receptor (PDGFR) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with five immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of CSF-1R to its ligand, CSF-1, leads to receptor dimerization, trans phosphorylation and activation, and intracellular signaling. CSF-1R signaling is criti
Probab=54.06  E-value=6  Score=40.91  Aligned_cols=19  Identities=37%  Similarity=0.620  Sum_probs=15.8

Q ss_pred             CCcCCcccccCCccEEEEE
Q 010708          483 FQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       483 Fs~~n~LG~GgfG~VYKG~  501 (503)
                      |.-.++||+|+||.||+++
T Consensus        40 ~~~~~~LG~G~fg~V~~~~   58 (374)
T cd05106          40 LQFGKTLGAGAFGKVVEAT   58 (374)
T ss_pred             ceehheecCCCcccEEEEE
Confidence            4445889999999999875


No 49 
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=53.99  E-value=82  Score=29.52  Aligned_cols=77  Identities=19%  Similarity=0.401  Sum_probs=46.3

Q ss_pred             CCCeEEEEeCC----CCCC--cC-CccEEEE-ecCCcEEEEeC-CCcEEEeecCcCCc------cccceEEEccCCCEEE
Q 010708           68 SEKAVIWVANR----DNPL--KD-SSGIITI-SEDGNLVLVNG-QKEVLWSSNVSNLV------NNSTSAQLLDSGNLVL  132 (503)
Q Consensus        68 ~~~~vvW~anr----~~p~--~~-~~~~l~l-~~~G~Lvl~~~-~~~~~Wss~~~~~~------~~~~~~~L~d~GNlVl  132 (503)
                      .....+|..+-    ..++  .- ....+.+ +.+|.|+.+|. +|..+|........      .+........+|.|..
T Consensus        11 ~tG~~~W~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~   90 (238)
T PF13360_consen   11 RTGKELWSYDLGPGIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYA   90 (238)
T ss_dssp             TTTEEEEEEECSSSCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSCGGSGEEEETTEEEEEETTSEEEE
T ss_pred             CCCCEEEEEECCCCCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeeccccccceeeecccccccccceeeeEe
Confidence            46778888753    2222  00 1233444 48899999996 89999998753321      0112222334556777


Q ss_pred             EeccCCcEEEee
Q 010708          133 RDNINRAIVWES  144 (503)
Q Consensus       133 ~~~~~~~~~WqS  144 (503)
                      .|..+|+++|+.
T Consensus        91 ~d~~tG~~~W~~  102 (238)
T PF13360_consen   91 LDAKTGKVLWSI  102 (238)
T ss_dssp             EETTTSCEEEEE
T ss_pred             cccCCcceeeee
Confidence            775569999995


No 50 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=53.47  E-value=74  Score=32.68  Aligned_cols=56  Identities=18%  Similarity=0.328  Sum_probs=37.0

Q ss_pred             EEEecCCcEEEEeC-CCcEEEeecCcCCc----cccceEEEccCCCEEEEeccCCcEEEee
Q 010708           89 ITISEDGNLVLVNG-QKEVLWSSNVSNLV----NNSTSAQLLDSGNLVLRDNINRAIVWES  144 (503)
Q Consensus        89 l~l~~~G~Lvl~~~-~~~~~Wss~~~~~~----~~~~~~~L~d~GNlVl~~~~~~~~~WqS  144 (503)
                      +....+|.|+.+|. +|..+|........    .+.......++|.|+..|..+++.+|+.
T Consensus       245 y~~~~~g~l~a~d~~tG~~~W~~~~~~~~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~W~~  305 (377)
T TIGR03300       245 YAVSYQGRVAALDLRSGRVLWKRDASSYQGPAVDDNRLYVTDADGVVVALDRRSGSELWKN  305 (377)
T ss_pred             EEEEcCCEEEEEECCCCcEEEeeccCCccCceEeCCEEEEECCCCeEEEEECCCCcEEEcc
Confidence            33456788998886 67899987632211    0123334466788888887668899974


No 51 
>PTZ00036 glycogen synthase kinase; Provisional
Probab=53.35  E-value=5.3  Score=42.47  Aligned_cols=23  Identities=30%  Similarity=0.731  Sum_probs=18.6

Q ss_pred             hcCCCcCCcccccCCccEEEEEc
Q 010708          480 TNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       480 T~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ++.|...++||+|+||.||+|+.
T Consensus        65 ~~~y~~~~~LG~G~fg~Vy~~~~   87 (440)
T PTZ00036         65 NKSYKLGNIIGNGSFGVVYEAIC   87 (440)
T ss_pred             CCeEEEeEEEEeCCCEEEEEEEE
Confidence            34566668999999999999864


No 52 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=51.51  E-value=8.3  Score=34.55  Aligned_cols=12  Identities=8%  Similarity=-0.153  Sum_probs=5.1

Q ss_pred             EEechhHHHHHH
Q 010708          423 FVSPLIKGMFAL  434 (503)
Q Consensus       423 i~i~~~~~~~~l  434 (503)
                      +++++.++++++
T Consensus        54 vVVGVGg~ill~   65 (154)
T PF04478_consen   54 VVVGVGGPILLG   65 (154)
T ss_pred             EEecccHHHHHH
Confidence            444444444333


No 53 
>PRK09605 bifunctional UGMP family protein/serine/threonine protein kinase; Validated
Probab=51.29  E-value=6  Score=43.25  Aligned_cols=30  Identities=7%  Similarity=-0.045  Sum_probs=21.5

Q ss_pred             HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708          473 FEELATATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      +......+..+...++||+|+||.||||+.
T Consensus       325 ~~~~~~~~~~~~~~~~iG~G~~g~Vy~~~~  354 (535)
T PRK09605        325 WIKEEEVKRRKIPDHLIGKGAEADIKKGEY  354 (535)
T ss_pred             eccccccccccCccceeccCCcEEEEEEee
Confidence            333334445556679999999999999764


No 54 
>cd05621 STKc_ROCK2 Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase 2. Serine/Threonine Kinases (STKs), ROCK subfamily, ROCK2 (or ROK-alpha) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK contains an N-terminal extension, a catalytic kinase domain, and a C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain, and is activated via interaction with Rho GTPases. ROCK2 was the first identified target of activated RhoA, and was found 
Probab=50.37  E-value=5.2  Score=41.35  Aligned_cols=25  Identities=24%  Similarity=0.530  Sum_probs=19.5

Q ss_pred             HHhcCCCcCCcccccCCccEEEEEc
Q 010708          478 TATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       478 ~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ...++|....+||+|+||.||+++.
T Consensus        40 ~~~~~y~~~~~lG~G~fg~Vy~~~~   64 (370)
T cd05621          40 MKAEDYDVVKVIGRGAFGEVQLVRH   64 (370)
T ss_pred             CCHHHCeEEEEEEecCCeEEEEEEE
Confidence            3345566668999999999998863


No 55 
>PHA03210 serine/threonine kinase US3; Provisional
Probab=50.17  E-value=8.1  Score=41.83  Aligned_cols=23  Identities=22%  Similarity=0.564  Sum_probs=18.7

Q ss_pred             hcCCCcCCcccccCCccEEEEEc
Q 010708          480 TNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       480 T~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      .+.|....+||+|+||.||++.+
T Consensus       147 ~~~Y~ii~~LG~G~fG~Vyl~~~  169 (501)
T PHA03210        147 LAHFRVIDDLPAGAFGKIFICAL  169 (501)
T ss_pred             hhccEEEeEecCCCCcceEEEEE
Confidence            45677678999999999998753


No 56 
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=49.28  E-value=31  Score=21.29  Aligned_cols=21  Identities=14%  Similarity=0.344  Sum_probs=16.0

Q ss_pred             EEEEecCCcEEEEeCCCcEEE
Q 010708           88 IITISEDGNLVLVNGQKEVLW  108 (503)
Q Consensus        88 ~l~l~~~G~Lvl~~~~~~~~W  108 (503)
                      -+.+..+|++++.|..+..||
T Consensus         6 gvav~~~g~i~VaD~~n~rV~   26 (28)
T PF01436_consen    6 GVAVDSDGNIYVADSGNHRVQ   26 (28)
T ss_dssp             EEEEETTSEEEEEECCCTEEE
T ss_pred             EEEEeCCCCEEEEECCCCEEE
Confidence            377778899999887766655


No 57 
>PHA02887 EGF-like protein; Provisional
Probab=48.69  E-value=14  Score=31.45  Aligned_cols=33  Identities=21%  Similarity=0.557  Sum_probs=24.4

Q ss_pred             cccccc--cccccCCCcccCCC---CCCccccCCCCccC
Q 010708          270 LRTECD--VYGKCGAFGICNSQ---EKPICSCLEGFEPK  303 (503)
Q Consensus       270 p~~~C~--v~g~CG~~giC~~~---~~~~C~C~~Gf~~~  303 (503)
                      -.++|.  ..+.|= +|.|.+-   +.+.|.|++||...
T Consensus        82 hf~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~  119 (126)
T PHA02887         82 FFEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI  119 (126)
T ss_pred             CccccChHhhCEee-CCEEEccccCCCceeECCCCcccC
Confidence            345675  467887 7999752   57899999999753


No 58 
>KOG0663 consensus Protein kinase PITSLRE and related kinases [General function prediction only]
Probab=47.91  E-value=5.4  Score=40.57  Aligned_cols=20  Identities=25%  Similarity=0.607  Sum_probs=16.6

Q ss_pred             CCCcCCcccccCCccEEEEE
Q 010708          482 NFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       482 nFs~~n~LG~GgfG~VYKG~  501 (503)
                      +|-.-|+|++|.||.||||+
T Consensus        77 efe~lnrI~EGtyGiVYRak   96 (419)
T KOG0663|consen   77 EFEKLNRIEEGTYGVVYRAK   96 (419)
T ss_pred             HHHHHhhcccCcceeEEEec
Confidence            34445899999999999986


No 59 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=47.26  E-value=8.1  Score=33.45  Aligned_cols=7  Identities=14%  Similarity=0.368  Sum_probs=2.6

Q ss_pred             echhHHH
Q 010708          425 SPLIKGM  431 (503)
Q Consensus       425 i~~~~~~  431 (503)
                      +++++++
T Consensus        71 ~gv~aGv   77 (122)
T PF01102_consen   71 FGVMAGV   77 (122)
T ss_dssp             HHHHHHH
T ss_pred             HHHHHHH
Confidence            3333333


No 60 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=47.24  E-value=12  Score=25.61  Aligned_cols=6  Identities=33%  Similarity=0.966  Sum_probs=2.3

Q ss_pred             hhhhhh
Q 010708          440 FLWRWI  445 (503)
Q Consensus       440 ~~~~~~  445 (503)
                      ++|+||
T Consensus        33 ~~~~rR   38 (40)
T PF08693_consen   33 FFWYRR   38 (40)
T ss_pred             heEEec
Confidence            334443


No 61 
>PHA03209 serine/threonine kinase US3; Provisional
Probab=46.54  E-value=9.8  Score=38.95  Aligned_cols=25  Identities=20%  Similarity=0.292  Sum_probs=20.4

Q ss_pred             HHhcCCCcCCcccccCCccEEEEEc
Q 010708          478 TATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       478 ~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ....+|....+||+|+||.||+|+.
T Consensus        63 ~~~~~y~~~~~lg~G~~g~Vy~~~~   87 (357)
T PHA03209         63 VASLGYTVIKTLTPGSEGRVFVATK   87 (357)
T ss_pred             hhhcCcEEEEEecCCCCeEEEEEEE
Confidence            3445687778999999999999864


No 62 
>PF13956 Ibs_toxin:  Toxin Ibs, type I toxin-antitoxin system
Probab=46.43  E-value=11  Score=20.97  Aligned_cols=15  Identities=27%  Similarity=0.366  Sum_probs=7.8

Q ss_pred             ChhHHHHHHHHHHHh
Q 010708            1 MIPIALLIILLSCFC   15 (503)
Q Consensus         1 ~~~~~~~~~~~~~~~   15 (503)
                      |..+++|+++|++++
T Consensus         2 Mk~vIIlvvLLliSf   16 (19)
T PF13956_consen    2 MKLVIILVVLLLISF   16 (19)
T ss_pred             ceehHHHHHHHhccc
Confidence            444555555555444


No 63 
>cd05596 STKc_ROCK Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase. Serine/Threonine Kinases (STKs), Rho-associated coiled-coil containing protein kinase (ROCK) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK is also referred to as Rho-associated kinase or simply as Rho kinase. It contains an N-terminal extension, a catalytic kinase domain, and a long C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain. It is activated via in
Probab=45.77  E-value=4.7  Score=41.63  Aligned_cols=23  Identities=30%  Similarity=0.606  Sum_probs=18.6

Q ss_pred             hcCCCcCCcccccCCccEEEEEc
Q 010708          480 TNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       480 T~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      .++|....+||+|+||.||+++.
T Consensus        42 ~~~y~~~~~lg~G~~g~Vy~~~~   64 (370)
T cd05596          42 AEDFDVIKVIGRGAFGEVQLVRH   64 (370)
T ss_pred             HHHcEEEEEEeeCCCEEEEEEEE
Confidence            44566668999999999999863


No 64 
>KOG0600 consensus Cdc2-related protein kinase [Cell cycle control, cell division, chromosome partitioning]
Probab=43.07  E-value=6.5  Score=41.97  Aligned_cols=15  Identities=40%  Similarity=0.835  Sum_probs=13.8

Q ss_pred             CcccccCCccEEEEE
Q 010708          487 SKLGQGGFGPVYWVM  501 (503)
Q Consensus       487 n~LG~GgfG~VYKG~  501 (503)
                      .+||+|.||.|||++
T Consensus       123 ~kIGeGTyg~VYkAr  137 (560)
T KOG0600|consen  123 EKIGEGTYGQVYKAR  137 (560)
T ss_pred             HHhcCcchhheeEee
Confidence            689999999999985


No 65 
>PLN00034 mitogen-activated protein kinase kinase; Provisional
Probab=42.14  E-value=11  Score=38.46  Aligned_cols=16  Identities=38%  Similarity=0.696  Sum_probs=14.3

Q ss_pred             CcccccCCccEEEEEc
Q 010708          487 SKLGQGGFGPVYWVMV  502 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L  502 (503)
                      ++||+|+||.||+++.
T Consensus        80 ~~lg~G~~g~V~~~~~   95 (353)
T PLN00034         80 NRIGSGAGGTVYKVIH   95 (353)
T ss_pred             hhccCCCCeEEEEEEE
Confidence            6899999999999863


No 66 
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=39.22  E-value=55  Score=32.28  Aligned_cols=46  Identities=22%  Similarity=0.419  Sum_probs=34.9

Q ss_pred             CCCeEEEEeCCCCCCcCCc-----cEEEEecCCcEEEEeCCCcEEEeecCc
Q 010708           68 SEKAVIWVANRDNPLKDSS-----GIITISEDGNLVLVNGQKEVLWSSNVS  113 (503)
Q Consensus        68 ~~~~vvW~anr~~p~~~~~-----~~l~l~~~G~Lvl~~~~~~~~Wss~~~  113 (503)
                      .+.++.|.+.|..|+-.+.     ....=+.||+|.-.|+.|+.||.-.|.
T Consensus       167 ~~~~~~w~~~~~~PiF~splcv~~sv~i~~VdG~l~~f~~sG~qvwr~~t~  217 (354)
T KOG4649|consen  167 YSSTEFWAATRFGPIFASPLCVGSSVIITTVDGVLTSFDESGRQVWRPATK  217 (354)
T ss_pred             CCcceehhhhcCCccccCceeccceEEEEEeccEEEEEcCCCcEEEeecCC
Confidence            4458899999999997541     233345899999999999999976553


No 67 
>KOG0192 consensus Tyrosine kinase specific for activated (GTP-bound) p21cdc42Hs [Signal transduction mechanisms]
Probab=38.87  E-value=14  Score=38.37  Aligned_cols=16  Identities=44%  Similarity=0.634  Sum_probs=14.0

Q ss_pred             CcccccCCccEEEEEc
Q 010708          487 SKLGQGGFGPVYWVMV  502 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L  502 (503)
                      ..||+|+||+||||.-
T Consensus        47 ~~iG~G~~g~V~~~~~   62 (362)
T KOG0192|consen   47 EVLGSGSFGTVYKGKW   62 (362)
T ss_pred             hhcccCCceeEEEEEe
Confidence            4599999999999974


No 68 
>PHA03211 serine/threonine kinase US3; Provisional
Probab=38.51  E-value=15  Score=39.37  Aligned_cols=23  Identities=22%  Similarity=0.208  Sum_probs=18.9

Q ss_pred             hcCCCcCCcccccCCccEEEEEc
Q 010708          480 TNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       480 T~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      ..+|.-..+||+|+||.||+++.
T Consensus       168 ~~gy~i~~~Lg~G~~G~Vy~a~~  190 (461)
T PHA03211        168 GLGFAIHRALTPGSEGCVFESSH  190 (461)
T ss_pred             cCCeEEEEEEccCCCeEEEEEEE
Confidence            44566678999999999999874


No 69 
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=38.11  E-value=1.1e+02  Score=33.08  Aligned_cols=76  Identities=22%  Similarity=0.310  Sum_probs=46.5

Q ss_pred             CCCeEEEEeCCC-------CCCcCCccEEEEe-cCCcEEEEeC-CCcEEEeecCcCC--------cc------c-cceEE
Q 010708           68 SEKAVIWVANRD-------NPLKDSSGIITIS-EDGNLVLVNG-QKEVLWSSNVSNL--------VN------N-STSAQ  123 (503)
Q Consensus        68 ~~~~vvW~anr~-------~p~~~~~~~l~l~-~~G~Lvl~~~-~~~~~Wss~~~~~--------~~------~-~~~~~  123 (503)
                      ....++|..+-.       .|+.. ..++.+. .+|.|+-+|. +|.++|.......        ..      . .....
T Consensus        37 ~~~~~~W~~~~~~~~~~~~sPvv~-~g~vy~~~~~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v  115 (488)
T cd00216          37 KKLKVAWTFSTGDERGQEGTPLVV-DGDMYFTTSHSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFF  115 (488)
T ss_pred             hcceeeEEEECCCCCCcccCCEEE-CCEEEEeCCCCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEE
Confidence            446678887654       24332 2344444 5788888885 6889999764321        00      0 11122


Q ss_pred             EccCCCEEEEeccCCcEEEee
Q 010708          124 LLDSGNLVLRDNINRAIVWES  144 (503)
Q Consensus       124 L~d~GNlVl~~~~~~~~~WqS  144 (503)
                      -..+|.|+-.|.++++++|+-
T Consensus       116 ~~~~g~v~AlD~~TG~~~W~~  136 (488)
T cd00216         116 GTFDGRLVALDAETGKQVWKF  136 (488)
T ss_pred             ecCCCeEEEEECCCCCEeeee
Confidence            235788888888779999994


No 70 
>KOG2052 consensus Activin A type IB receptor, serine/threonine protein kinase [Signal transduction mechanisms]
Probab=34.88  E-value=20  Score=37.86  Aligned_cols=17  Identities=29%  Similarity=0.515  Sum_probs=14.6

Q ss_pred             CCcccccCCccEEEEEc
Q 010708          486 SSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       486 ~n~LG~GgfG~VYKG~L  502 (503)
                      ...||+|.||-|.||.-
T Consensus       216 ~e~IGkGRyGEVwrG~w  232 (513)
T KOG2052|consen  216 QEIIGKGRFGEVWRGRW  232 (513)
T ss_pred             EEEecCccccceeeccc
Confidence            36899999999999863


No 71 
>COG3117 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.88  E-value=2e+02  Score=26.82  Aligned_cols=73  Identities=16%  Similarity=0.223  Sum_probs=42.6

Q ss_pred             EeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEeecCcCCccc
Q 010708           39 LSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWSSNVSNLVNN  118 (503)
Q Consensus        39 ~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Wss~~~~~~~~  118 (503)
                      ....+.=...|.+.|-.+....|+-..+.+.....|+.-   |              -|.|++. +.+-|.-.       
T Consensus        42 Y~~e~~~~~~~de~G~~~y~l~a~~~eh~~~~~~t~ft~---P--------------~l~lf~e-~~q~w~v~-------   96 (188)
T COG3117          42 YTMEGLDTTVYDEQGKLKYRLTAQHVEHYSDSSDTHFTA---P--------------VLTLFRE-NRQTWTVQ-------   96 (188)
T ss_pred             eeecCcceeEECCCcceeEEeehhhhhhcccccceEEec---c--------------eEEEEcC-CCccEEEE-------
Confidence            455555566676665443345566555544443444322   1              2444433 55667753       


Q ss_pred             cceEEEccCCCEEEEecc
Q 010708          119 STSAQLLDSGNLVLRDNI  136 (503)
Q Consensus       119 ~~~~~L~d~GNlVl~~~~  136 (503)
                      ...+.|.++|.|+|+..+
T Consensus        97 Ad~a~l~~d~~l~l~~~v  114 (188)
T COG3117          97 ADRAKLTKDGKLYLLGDV  114 (188)
T ss_pred             hhhheecCCceEEEEece
Confidence            467899999999999754


No 72 
>PF15102 TMEM154:  TMEM154 protein family
Probab=33.85  E-value=19  Score=32.03  Aligned_cols=8  Identities=25%  Similarity=0.384  Sum_probs=3.2

Q ss_pred             EEechhHH
Q 010708          423 FVSPLIKG  430 (503)
Q Consensus       423 i~i~~~~~  430 (503)
                      |+++.+++
T Consensus        61 IlIP~VLL   68 (146)
T PF15102_consen   61 ILIPLVLL   68 (146)
T ss_pred             EeHHHHHH
Confidence            34443333


No 73 
>PF05935 Arylsulfotrans:  Arylsulfotransferase (ASST);  InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=33.30  E-value=1e+02  Score=33.25  Aligned_cols=54  Identities=20%  Similarity=0.420  Sum_probs=31.6

Q ss_pred             CCcEEEEeCCCcEEEeecCcCCccccceEEEccCCCEEEEecc-------CCcEEEeecCCCCC
Q 010708           94 DGNLVLVNGQKEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNI-------NRAIVWESFQEPTD  150 (503)
Q Consensus        94 ~G~Lvl~~~~~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~-------~~~~~WqSFd~PTD  150 (503)
                      .+..+++|..|.++|.-......  ...+..+++|+|......       -|+++|+ ++.|..
T Consensus       127 ~~~~~~iD~~G~Vrw~~~~~~~~--~~~~~~l~nG~ll~~~~~~~~e~D~~G~v~~~-~~l~~~  187 (477)
T PF05935_consen  127 SSYTYLIDNNGDVRWYLPLDSGS--DNSFKQLPNGNLLIGSGNRLYEIDLLGKVIWE-YDLPGG  187 (477)
T ss_dssp             EEEEEEEETTS-EEEEE-GGGT----SSEEE-TTS-EEEEEBTEEEEE-TT--EEEE-EE--TT
T ss_pred             CceEEEECCCccEEEEEccCccc--cceeeEcCCCCEEEecCCceEEEcCCCCEEEe-eecCCc
Confidence            46788899999999997754322  222778899999865441       4888888 777763


No 74 
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=32.94  E-value=64  Score=21.31  Aligned_cols=19  Identities=21%  Similarity=0.438  Sum_probs=16.1

Q ss_pred             ccCCCEEEEeccCCcEEEe
Q 010708          125 LDSGNLVLRDNINRAIVWE  143 (503)
Q Consensus       125 ~d~GNlVl~~~~~~~~~Wq  143 (503)
                      ..+|.|+-.|..+|+++|+
T Consensus         7 ~~~g~l~AlD~~TG~~~W~   25 (38)
T PF01011_consen    7 TPDGYLYALDAKTGKVLWK   25 (38)
T ss_dssp             TTTSEEEEEETTTTSEEEE
T ss_pred             CCCCEEEEEECCCCCEEEe
Confidence            6788888888878999997


No 75 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=32.94  E-value=7.1e+02  Score=28.43  Aligned_cols=85  Identities=18%  Similarity=0.406  Sum_probs=54.9

Q ss_pred             cEEEEecCCcEEEEeC-CCcE-EEeecCc-------CCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCe
Q 010708           87 GIITISEDGNLVLVNG-QKEV-LWSSNVS-------NLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMH  157 (503)
Q Consensus        87 ~~l~l~~~G~Lvl~~~-~~~~-~Wss~~~-------~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~  157 (503)
                      ..+.-..||.++.... ++.+ ||.+...       ..+++++.++..-+||.+|..+--|.|                 
T Consensus       354 ~~l~YSpDgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtV-----------------  416 (893)
T KOG0291|consen  354 TSLAYSPDGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTV-----------------  416 (893)
T ss_pred             eeEEECCCCcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeE-----------------
Confidence            4688889999988865 4555 8998641       122257788889999999986521332                 


Q ss_pred             ecccccCCceEEEEecCCCCCCCCceEE-EEEeeCCC
Q 010708          158 HGIDQRTGKKVQLTSWKSLSDPSTGSFS-AGLIHQNI  193 (503)
Q Consensus       158 l~~~~~tg~~~~L~Sw~s~~dps~G~y~-l~~~~~g~  193 (503)
                      =-+|.+     +....++.+-|.|-.|+ +..|+.|.
T Consensus       417 RAwDlk-----RYrNfRTft~P~p~QfscvavD~sGe  448 (893)
T KOG0291|consen  417 RAWDLK-----RYRNFRTFTSPEPIQFSCVAVDPSGE  448 (893)
T ss_pred             Eeeeec-----ccceeeeecCCCceeeeEEEEcCCCC
Confidence            222333     23334456667888887 66677764


No 76 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=32.83  E-value=33  Score=34.52  Aligned_cols=13  Identities=15%  Similarity=0.442  Sum_probs=5.7

Q ss_pred             HHHhhhhhhhhhh
Q 010708          436 ICTLFLWRWIAKR  448 (503)
Q Consensus       436 ~~~~~~~~~~~~~  448 (503)
                      +++++++|.||++
T Consensus       273 vIIYLILRYRRKK  285 (299)
T PF02009_consen  273 VIIYLILRYRRKK  285 (299)
T ss_pred             HHHHHHHHHHHHh
Confidence            3444445444433


No 77 
>PHA03212 serine/threonine kinase US3; Provisional
Probab=32.13  E-value=20  Score=37.34  Aligned_cols=20  Identities=15%  Similarity=0.325  Sum_probs=16.5

Q ss_pred             CCCcCCcccccCCccEEEEE
Q 010708          482 NFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       482 nFs~~n~LG~GgfG~VYKG~  501 (503)
                      .|....+||+|+||.||+++
T Consensus        93 ~y~~~~~lg~G~~g~V~~~~  112 (391)
T PHA03212         93 GFSILETFTPGAEGFAFACI  112 (391)
T ss_pred             CcEEEEEEcCCCCeEEEEEE
Confidence            35555789999999999985


No 78 
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=30.03  E-value=36  Score=38.52  Aligned_cols=32  Identities=22%  Similarity=0.597  Sum_probs=24.1

Q ss_pred             cccccccccCCCcccCCC-CCCccccCCCCccCC
Q 010708          272 TECDVYGKCGAFGICNSQ-EKPICSCLEGFEPKN  304 (503)
Q Consensus       272 ~~C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~~~  304 (503)
                      |+|. +..|-++..|-+. ++-.|.|.|||....
T Consensus       828 DeC~-psrChp~A~CyntpgsfsC~C~pGy~GDG  860 (1289)
T KOG1214|consen  828 DECS-PSRCHPAATCYNTPGSFSCRCQPGYYGDG  860 (1289)
T ss_pred             cccC-ccccCCCceEecCCCcceeecccCccCCC
Confidence            6676 7888888889543 456799999998653


No 79 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=28.93  E-value=19  Score=36.12  Aligned_cols=10  Identities=10%  Similarity=0.069  Sum_probs=0.0

Q ss_pred             HHhhhhhhhh
Q 010708          437 CTLFLWRWIA  446 (503)
Q Consensus       437 ~~~~~~~~~~  446 (503)
                      ++++++||||
T Consensus       165 Ia~icyrrkR  174 (290)
T PF05454_consen  165 IACICYRRKR  174 (290)
T ss_dssp             ----------
T ss_pred             HHHHhhhhhh
Confidence            3344444433


No 80 
>KOG1095 consensus Protein tyrosine kinase [Signal transduction mechanisms]
Probab=28.89  E-value=28  Score=40.87  Aligned_cols=16  Identities=44%  Similarity=0.673  Sum_probs=15.0

Q ss_pred             CcccccCCccEEEEEc
Q 010708          487 SKLGQGGFGPVYWVMV  502 (503)
Q Consensus       487 n~LG~GgfG~VYKG~L  502 (503)
                      +.||+|.||.||+|++
T Consensus       698 ~~lG~G~FG~VY~g~~  713 (1025)
T KOG1095|consen  698 RVLGKGAFGEVYEGTY  713 (1025)
T ss_pred             eeeccccccceEEEEE
Confidence            7899999999999986


No 81 
>cd05845 Ig2_L1-CAM_like Second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig2_L1-CAM_like: domain similar to the second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=26.71  E-value=1.2e+02  Score=25.03  Aligned_cols=33  Identities=15%  Similarity=0.377  Sum_probs=19.9

Q ss_pred             CCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEe
Q 010708           68 SEKAVIWVANRDNPLKDSSGIITISEDGNLVLVN  101 (503)
Q Consensus        68 ~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~  101 (503)
                      +..++.|+-+....+. ....+.+..+|||.+.+
T Consensus        32 P~P~i~W~~~~~~~i~-~~~Ri~~~~~GnL~fs~   64 (95)
T cd05845          32 VPLRIYWMNSDLLHIT-QDERVSMGQNGNLYFAN   64 (95)
T ss_pred             CCCEEEEECCCCcccc-ccccEEECCCceEEEEE
Confidence            4566778844333333 24567777778887654


No 82 
>KOG0986 consensus G protein-coupled receptor kinase [Signal transduction mechanisms]
Probab=26.45  E-value=33  Score=36.48  Aligned_cols=21  Identities=43%  Similarity=0.754  Sum_probs=17.5

Q ss_pred             cCCCcCCcccccCCccEEEEE
Q 010708          481 NNFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       481 ~nFs~~n~LG~GgfG~VYKG~  501 (503)
                      +.|+.-++||+||||-||-.+
T Consensus       185 n~F~~~RvlGkGGFGEV~acq  205 (591)
T KOG0986|consen  185 NTFRVYRVLGKGGFGEVCACQ  205 (591)
T ss_pred             cceeeeEEEecccccceeEEE
Confidence            448888999999999999543


No 83 
>KOG0605 consensus NDR and related serine/threonine kinases [General function prediction only]
Probab=26.11  E-value=35  Score=36.81  Aligned_cols=23  Identities=30%  Similarity=0.632  Sum_probs=18.0

Q ss_pred             HhcCCCcCCcccccCCccEEEEE
Q 010708          479 ATNNFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       479 AT~nFs~~n~LG~GgfG~VYKG~  501 (503)
                      ..+.|..-.+||+|+||.||-++
T Consensus       139 ~~~DFe~Lk~IgkGAfGeVrLar  161 (550)
T KOG0605|consen  139 SLDDFELLKVIGKGAFGEVRLAR  161 (550)
T ss_pred             CcccchhheeeccccceeEEEEE
Confidence            34556666799999999999764


No 84 
>KOG4257 consensus Focal adhesion tyrosine kinase FAK, contains FERM domain [Signal transduction mechanisms]
Probab=26.07  E-value=30  Score=38.26  Aligned_cols=17  Identities=35%  Similarity=0.634  Sum_probs=15.2

Q ss_pred             CCcccccCCccEEEEEc
Q 010708          486 SSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       486 ~n~LG~GgfG~VYKG~L  502 (503)
                      ..+||+|-||-||+|+.
T Consensus       394 ~r~iG~GqFGdVy~gvY  410 (974)
T KOG4257|consen  394 KRLIGEGQFGDVYKGVY  410 (974)
T ss_pred             HHhhcCCcccceeeeEe
Confidence            38999999999999974


No 85 
>KOG0694 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=25.80  E-value=31  Score=38.34  Aligned_cols=23  Identities=35%  Similarity=0.690  Sum_probs=18.8

Q ss_pred             hcCCCcCCcccccCCccEEEEEc
Q 010708          480 TNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       480 T~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      .++|.--.+||+|.||+|+.+.+
T Consensus       367 l~~F~~l~vLGkGsFGkV~lae~  389 (694)
T KOG0694|consen  367 LDDFRLLAVLGRGSFGKVLLAEL  389 (694)
T ss_pred             ccceEEEEEeccCcCceEEEEEE
Confidence            34566668999999999998876


No 86 
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=25.71  E-value=61  Score=36.87  Aligned_cols=28  Identities=25%  Similarity=0.732  Sum_probs=23.2

Q ss_pred             ccccCCCcccCCC-CCCccccCCCCccCC
Q 010708          277 YGKCGAFGICNSQ-EKPICSCLEGFEPKN  304 (503)
Q Consensus       277 ~g~CG~~giC~~~-~~~~C~C~~Gf~~~~  304 (503)
                      +..||++++|.+. ....|.|-.||++.+
T Consensus       741 ~~~CGp~s~Cin~pg~~rceC~~gy~F~d  769 (1289)
T KOG1214|consen  741 FHRCGPNSVCINLPGSYRCECRSGYEFAD  769 (1289)
T ss_pred             CCCCCCCceeecCCCceeEEEeecceecc
Confidence            5689999999765 467899999998764


No 87 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=25.54  E-value=19  Score=33.51  Aligned_cols=28  Identities=32%  Similarity=0.783  Sum_probs=20.3

Q ss_pred             cccccCCCcccCCC------CCCccccCCCCccC
Q 010708          276 VYGKCGAFGICNSQ------EKPICSCLEGFEPK  303 (503)
Q Consensus       276 v~g~CG~~giC~~~------~~~~C~C~~Gf~~~  303 (503)
                      +.-.||.|+.|...      ....|.|.+||...
T Consensus        48 ~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~   81 (197)
T PF06247_consen   48 VNKPCGDYAKCINQANKGEERAYKCDCINGYILK   81 (197)
T ss_dssp             TTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES
T ss_pred             cCccccchhhhhcCCCcccceeEEEecccCceee
Confidence            35689999999642      23479999999865


No 88 
>KOG1166 consensus Mitotic checkpoint serine/threonine protein kinase [Cell cycle control, cell division, chromosome partitioning]
Probab=24.79  E-value=29  Score=40.62  Aligned_cols=24  Identities=29%  Similarity=0.699  Sum_probs=19.1

Q ss_pred             HHhcCCCcCCcccccCCccEEEEE
Q 010708          478 TATNNFQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       478 ~AT~nFs~~n~LG~GgfG~VYKG~  501 (503)
                      .-...|.-+..||+|++|+||+|+
T Consensus       695 ~~~~~~~I~~e~G~g~y~~vy~a~  718 (974)
T KOG1166|consen  695 VGGEKFCISKEIGEGSYGSVYVAT  718 (974)
T ss_pred             ecceeEEEEeeeccccceEEEEee
Confidence            334456667899999999999986


No 89 
>KOG1024 consensus Receptor-like protein tyrosine kinase RYK/derailed [Signal transduction mechanisms]
Probab=24.70  E-value=1.1e+02  Score=32.12  Aligned_cols=28  Identities=32%  Similarity=0.488  Sum_probs=20.0

Q ss_pred             HHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708          475 ELATATNNFQLSSKLGQGGFGPVYWVMV  502 (503)
Q Consensus       475 el~~AT~nFs~~n~LG~GgfG~VYKG~L  502 (503)
                      +|..--.-|....++=+|.||.||+|++
T Consensus       278 ~l~v~r~Rv~l~~llqEGtFGri~~gI~  305 (563)
T KOG1024|consen  278 ELTVQRCRVRLSCLLQEGTFGRIYRGIW  305 (563)
T ss_pred             hhhhhhhheechhhhhcCchhheeeeee
Confidence            3333333366667899999999999965


No 90 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=24.24  E-value=17  Score=24.30  Aligned_cols=14  Identities=29%  Similarity=0.629  Sum_probs=10.1

Q ss_pred             CCCccccCCCCccC
Q 010708          290 EKPICSCLEGFEPK  303 (503)
Q Consensus       290 ~~~~C~C~~Gf~~~  303 (503)
                      .+..|.|++||...
T Consensus        17 g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   17 GSYRCSCPPGYKLA   30 (36)
T ss_dssp             TSEEEE-STTEEE-
T ss_pred             CceEeECCCCCEEC
Confidence            46789999999865


No 91 
>PF14610 DUF4448:  Protein of unknown function (DUF4448)
Probab=24.08  E-value=48  Score=30.86  Aligned_cols=22  Identities=23%  Similarity=0.614  Sum_probs=9.4

Q ss_pred             EEechhHHHHHHHHHHhhhhhh
Q 010708          423 FVSPLIKGMFALAICTLFLWRW  444 (503)
Q Consensus       423 i~i~~~~~~~~l~~~~~~~~~~  444 (503)
                      |++.++++++++++++++++.|
T Consensus       162 I~lPvvv~~~~~~~~~~~~~~R  183 (189)
T PF14610_consen  162 IALPVVVVVLALIMYGFFFWNR  183 (189)
T ss_pred             EEccHHHHHHHHHHHhhheeec
Confidence            3444444444444444444433


No 92 
>KOG4258 consensus Insulin/growth factor receptor (contains protein kinase domain) [Signal transduction mechanisms]
Probab=24.00  E-value=41  Score=38.32  Aligned_cols=19  Identities=47%  Similarity=0.690  Sum_probs=15.8

Q ss_pred             CCcCCcccccCCccEEEEE
Q 010708          483 FQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       483 Fs~~n~LG~GgfG~VYKG~  501 (503)
                      ....+.||+|+||+||-|.
T Consensus       996 it~~relg~gsfg~Vy~g~ 1014 (1025)
T KOG4258|consen  996 ITLGRELGQGSFGMVYEGN 1014 (1025)
T ss_pred             HhhhhhhccCccceEEEec
Confidence            3445899999999999885


No 93 
>PF02402 Lysis_col:  Lysis protein;  InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveal extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively [].  Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=23.94  E-value=26  Score=24.38  Aligned_cols=21  Identities=29%  Similarity=0.246  Sum_probs=13.5

Q ss_pred             ChhHHHHHHHHHHHhhhcccc
Q 010708            1 MIPIALLIILLSCFCLDFAVA   21 (503)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~   21 (503)
                      |..|++++++++.+++++|.+
T Consensus         1 MkKi~~~~i~~~~~~L~aCQa   21 (46)
T PF02402_consen    1 MKKIIFIGIFLLTMLLAACQA   21 (46)
T ss_pred             CcEEEEeHHHHHHHHHHHhhh
Confidence            556666666566566677776


No 94 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=23.76  E-value=15  Score=24.68  Aligned_cols=26  Identities=31%  Similarity=0.583  Sum_probs=16.4

Q ss_pred             cccCCCcccC-C-CCCCccccCCCCccC
Q 010708          278 GKCGAFGICN-S-QEKPICSCLEGFEPK  303 (503)
Q Consensus       278 g~CG~~giC~-~-~~~~~C~C~~Gf~~~  303 (503)
                      ..|-.++-|- + ++...|.|++||+..
T Consensus         5 ~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    5 TKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             S---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             ccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            4677888883 3 357789999999854


No 95 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=23.00  E-value=26  Score=26.58  Aligned_cols=8  Identities=13%  Similarity=0.509  Sum_probs=0.0

Q ss_pred             HHhhhhhh
Q 010708          437 CTLFLWRW  444 (503)
Q Consensus       437 ~~~~~~~~  444 (503)
                      ++++++|.
T Consensus        29 Ilf~iyR~   36 (64)
T PF01034_consen   29 ILFLIYRM   36 (64)
T ss_dssp             --------
T ss_pred             HHHHHHHH
Confidence            33334443


No 96 
>TIGR02513 type_III_yscB type III secretion system chaperone, YscB family. Members of this family include YscB of Yersinia and functionally equivalent (but differently named) proteins from type III secretion systems of other pathogens that affect animal cells. YscB acts, along with SycN (TIGR02503), as a chaperone for YopN, a key part of a complex that regulates type III secretion so it responds to contact with the eukaryotic target cell.
Probab=22.42  E-value=1.1e+02  Score=26.86  Aligned_cols=17  Identities=24%  Similarity=0.350  Sum_probs=14.1

Q ss_pred             cceEEEccCCCEEEEec
Q 010708          119 STSAQLLDSGNLVLRDN  135 (503)
Q Consensus       119 ~~~~~L~d~GNlVl~~~  135 (503)
                      +.+++|.|+|||+|...
T Consensus        79 p~aLvld~~~qLiLear   95 (139)
T TIGR02513        79 PQALVLDADGQLILEAR   95 (139)
T ss_pred             CceEEEcCccchhHHHH
Confidence            56799999999999743


No 97 
>PF11920 DUF3438:  Protein of unknown function (DUF3438);  InterPro: IPR021844  Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in a region flanked by markers of conjugative transfer and/or transposition. 
Probab=22.05  E-value=4.9e+02  Score=26.08  Aligned_cols=90  Identities=20%  Similarity=0.261  Sum_probs=43.6

Q ss_pred             ChhHHHHHHHHHHHhhhccccc----cccccCCCccCCCC-eEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEE
Q 010708            1 MIPIALLIILLSCFCLDFAVAI----DSSITSSQLIRDPD-AILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWV   75 (503)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~----~~~l~~g~~l~~~~-~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~   75 (503)
                      |..+++++++++ ++.+.+.|.    =+.++.-=.|..|| .++.-+.+-..||=..-+..   |-|   ....+++-|.
T Consensus         1 mk~~~~~~~l~~-~~~~~a~a~E~~~Wer~Pl~V~L~VgqERiV~~~~nVrVg~P~~l~~k---Lrv---qS~gGavYL~   73 (288)
T PF11920_consen    1 MKKLLLLLLLLL-LLASAAQAVELMKWERIPLAVPLPVGQERIVFFDKNVRVGVPASLNGK---LRV---QSAGGAVYLK   73 (288)
T ss_pred             ChhhHHHHHHHH-HhcccccchHhhhccCCceeeecCCCcEEEEEcCcCeEEcCChhhhhc---EEE---eeeCCEEEEe
Confidence            666555443333 333333331    13444455566664 46766666666663322111   111   1124566677


Q ss_pred             eCCCCCCcCCccEEEEe--cCCcEEEEe
Q 010708           76 ANRDNPLKDSSGIITIS--EDGNLVLVN  101 (503)
Q Consensus        76 anr~~p~~~~~~~l~l~--~~G~Lvl~~  101 (503)
                      |+...|.    ..|.++  ++|.++|+|
T Consensus        74 A~e~f~~----tRLqlq~~~~GeiILLD   97 (288)
T PF11920_consen   74 ASEPFPP----TRLQLQDVESGEIILLD   97 (288)
T ss_pred             ccCCCCc----ceEEEEEcCCCcEEEEE
Confidence            7765542    234433  567777776


No 98 
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=21.24  E-value=1.6e+02  Score=18.10  Aligned_cols=17  Identities=29%  Similarity=0.565  Sum_probs=10.5

Q ss_pred             CCCEEEEeccCCcEEEe
Q 010708          127 SGNLVLRDNINRAIVWE  143 (503)
Q Consensus       127 ~GNlVl~~~~~~~~~Wq  143 (503)
                      +|.|+-.|..+|+++|+
T Consensus        15 ~g~l~a~d~~~G~~~W~   31 (33)
T smart00564       15 DGTLYALDAKTGEILWT   31 (33)
T ss_pred             CCEEEEEEcccCcEEEE
Confidence            45666666545677776


No 99 
>COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=20.88  E-value=78  Score=28.73  Aligned_cols=20  Identities=15%  Similarity=0.305  Sum_probs=16.0

Q ss_pred             CeEEeCCCeeEEEeeCCCCC
Q 010708           36 DAILSNGSNFKLGFFNPADS   55 (503)
Q Consensus        36 ~~L~S~~g~f~~gf~~~~~~   55 (503)
                      -...++++.|.+-+|...++
T Consensus        35 ~ksi~~~~Kylllmfes~~C   54 (182)
T COG2143          35 NKSISPNDKYLLLMFESNGC   54 (182)
T ss_pred             HHhcCccCcEEEEEEcCCCC
Confidence            35689999999999886654


No 100
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=20.65  E-value=56  Score=32.42  Aligned_cols=6  Identities=33%  Similarity=1.082  Sum_probs=3.5

Q ss_pred             CccccC
Q 010708          292 PICSCL  297 (503)
Q Consensus       292 ~~C~C~  297 (503)
                      ..|.|-
T Consensus       143 s~cect  148 (295)
T TIGR01478       143 KSCECT  148 (295)
T ss_pred             Cceeee
Confidence            456664


No 101
>PF12690 BsuPI:  Intracellular proteinase inhibitor;  InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=20.44  E-value=4.1e+02  Score=21.06  Aligned_cols=16  Identities=13%  Similarity=0.461  Sum_probs=8.7

Q ss_pred             cEEEEeCCCcEEEeec
Q 010708           96 NLVLVNGQKEVLWSSN  111 (503)
Q Consensus        96 ~Lvl~~~~~~~~Wss~  111 (503)
                      +++|+|..|..||.-.
T Consensus        27 D~~v~d~~g~~vwrwS   42 (82)
T PF12690_consen   27 DFVVKDKEGKEVWRWS   42 (82)
T ss_dssp             EEEEE-TT--EEEETT
T ss_pred             EEEEECCCCCEEEEec
Confidence            5777777777777643


No 102
>PHA03207 serine/threonine kinase US3; Provisional
Probab=20.33  E-value=45  Score=34.59  Aligned_cols=19  Identities=26%  Similarity=0.457  Sum_probs=15.7

Q ss_pred             CCcCCcccccCCccEEEEE
Q 010708          483 FQLSSKLGQGGFGPVYWVM  501 (503)
Q Consensus       483 Fs~~n~LG~GgfG~VYKG~  501 (503)
                      |....+||+|+||.||+++
T Consensus        94 y~i~~~Lg~G~~g~Vy~~~  112 (392)
T PHA03207         94 YNILSSLTPGSEGEVFVCT  112 (392)
T ss_pred             eEEEEeecCCCCeEEEEEE
Confidence            4555789999999999875


No 103
>KOG4289 consensus Cadherin EGF LAG seven-pass G-type receptor [Signal transduction mechanisms]
Probab=20.18  E-value=54  Score=39.58  Aligned_cols=41  Identities=27%  Similarity=0.653  Sum_probs=0.0

Q ss_pred             cccCCCccc-CCCCCCccccCCCCccCCcccccCCCCCCCceeCC
Q 010708          278 GKCGAFGIC-NSQEKPICSCLEGFEPKNAEEWNRGNWTSGCIRRS  321 (503)
Q Consensus       278 g~CG~~giC-~~~~~~~C~C~~Gf~~~~~~~~~~~~~~~GC~~~~  321 (503)
                      +.||++|-| ...+.++|.|-|||....=+   ....++-|++..
T Consensus      1245 ~pC~nng~C~srEggYtCeCrpg~tGehCE---vs~~agrCvpGv 1286 (2531)
T KOG4289|consen 1245 GPCGNNGRCRSREGGYTCECRPGFTGEHCE---VSARAGRCVPGV 1286 (2531)
T ss_pred             CCCCCCCceEEecCceeEEecCCcccccee---eecccCccccce


No 104
>PTZ00370 STEVOR; Provisional
Probab=20.14  E-value=59  Score=32.34  Aligned_cols=6  Identities=33%  Similarity=1.204  Sum_probs=3.6

Q ss_pred             CccccC
Q 010708          292 PICSCL  297 (503)
Q Consensus       292 ~~C~C~  297 (503)
                      ..|.|-
T Consensus       143 s~cect  148 (296)
T PTZ00370        143 STCECT  148 (296)
T ss_pred             Cceeee
Confidence            466665


Done!