Query         psy7019
Match_columns 128
No_of_seqs    126 out of 692
Neff          6.7 
Searched_HMMs 46136
Date          Sat Aug 17 00:43:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7019.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7019hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0196|consensus               99.2 9.8E-12 2.1E-16  109.7   6.1   72    3-74    465-537 (996)
  2 PF00041 fn3:  Fibronectin type  98.8   1E-08 2.2E-13   65.3   5.2   59    6-64     25-85  (85)
  3 KOG3513|consensus               98.6 4.7E-07   1E-11   82.5  10.2   75    4-78    843-920 (1051)
  4 cd00063 FN3 Fibronectin type 3  98.5 1.2E-06 2.7E-11   54.4   7.9   62    9-70     29-92  (93)
  5 KOG4221|consensus               98.5   2E-06 4.3E-11   79.2  11.8   84    3-87    542-627 (1381)
  6 KOG4367|consensus               98.4 1.4E-07 3.1E-12   79.3   2.5   62   12-73    470-531 (699)
  7 KOG4221|consensus               98.3 3.3E-06 7.2E-11   77.7   8.2   70    7-76    644-715 (1381)
  8 KOG3513|consensus               97.9 5.5E-05 1.2E-09   69.3   9.3   73    6-78    639-718 (1051)
  9 PF09294 Interfer-bind:  Interf  97.8 7.4E-05 1.6E-09   50.3   5.8   61   11-71     43-105 (106)
 10 smart00060 FN3 Fibronectin typ  97.6  0.0008 1.7E-08   40.2   8.1   36   25-60     46-82  (83)
 11 PF07495 Y_Y_Y:  Y_Y_Y domain;   96.3    0.02 4.4E-07   35.1   5.9   46   13-64     11-57  (66)
 12 KOG4258|consensus               96.2   0.011 2.4E-07   53.7   6.1   70    2-72    850-923 (1025)
 13 KOG4152|consensus               96.2  0.0064 1.4E-07   52.9   4.1   37   36-72    648-684 (830)
 14 KOG4802|consensus               96.0   0.021 4.5E-07   48.4   6.4   63   24-86    197-263 (516)
 15 PF01108 Tissue_fac:  Tissue fa  95.9   0.032   7E-07   37.9   5.9   52    8-59     47-103 (107)
 16 KOG4222|consensus               95.7   0.076 1.6E-06   49.7   9.1   63    6-69    776-841 (1281)
 17 PLN02533 probable purple acid   91.7     0.3 6.5E-06   41.2   4.7   35   34-74    102-136 (427)
 18 KOG0196|consensus               90.9    0.55 1.2E-05   43.0   5.6   46   33-78    396-445 (996)
 19 PF10179 DUF2369:  Uncharacteri  90.5    0.53 1.2E-05   38.3   4.7   29   29-57      9-37  (300)
 20 PF09423 PhoD:  PhoD-like phosp  90.3    0.52 1.1E-05   39.6   4.8   34   36-72     65-98  (453)
 21 KOG4222|consensus               90.3    0.17 3.7E-06   47.5   2.0   64   11-74    558-624 (1281)
 22 PF10179 DUF2369:  Uncharacteri  85.9     4.9 0.00011   32.7   7.6   76   35-114   128-211 (300)
 23 KOG1378|consensus               78.9     3.9 8.4E-05   35.1   4.7   38   31-73    105-142 (452)
 24 KOG4806|consensus               61.9      12 0.00027   31.5   3.9   29   29-57    164-192 (454)
 25 cd05762 Ig8_MLCK Eighth immuno  61.5      33 0.00071   22.5   5.4   44   33-76     54-97  (98)
 26 KOG4258|consensus               59.7      15 0.00033   34.2   4.4   43   36-78    567-613 (1025)
 27 PF11344 DUF3146:  Protein of u  57.7     7.4 0.00016   25.6   1.6   15   41-55     66-80  (80)
 28 PF07353 Uroplakin_II:  Uroplak  57.4      19 0.00042   27.0   3.9   36   35-71    102-137 (184)
 29 COG4733 Phage-related protein,  55.4      29 0.00063   32.3   5.4   38   24-62    649-686 (952)
 30 KOG4802|consensus               51.6      28 0.00062   30.0   4.5   36   37-72    316-353 (516)
 31 TIGR03000 plancto_dom_1 Planct  49.9      23  0.0005   23.1   2.9   25   31-55     25-49  (75)
 32 PF04775 Bile_Hydr_Trans:  Acyl  48.5      30 0.00065   24.2   3.7   25   35-59      5-29  (126)
 33 COG3540 PhoD Phosphodiesterase  47.5      26 0.00057   30.6   3.7   36   35-75    101-136 (522)
 34 PF02922 CBM_48:  Carbohydrate-  47.5      55  0.0012   20.4   4.5   20   37-56     53-74  (85)
 35 PF02010 REJ:  REJ domain;  Int  46.2     6.7 0.00014   32.3   0.0   32   40-72    267-298 (440)
 36 smart00112 CA Cadherin repeats  44.7      71  0.0015   19.4   5.7   47   11-60     10-59  (79)
 37 PF13754 Big_3_4:  Bacterial Ig  44.4      66  0.0014   19.0   5.2   33   22-64     10-42  (54)
 38 cd05735 Ig8_DSCAM Eight immuno  42.3      56  0.0012   20.8   4.0   41   35-75     47-87  (88)
 39 cd05748 Ig_Titin_like Immunogl  42.3      75  0.0016   19.0   5.1   28   33-60     38-65  (74)
 40 cd02856 Glycogen_debranching_e  40.7      37  0.0008   22.4   3.0   19   36-54     48-66  (103)
 41 PF15417 DUF4624:  Domain of un  39.8      61  0.0013   23.0   4.0   32   23-54     73-108 (132)
 42 PF01299 Lamp:  Lysosome-associ  37.9      14 0.00031   29.6   0.7   34   35-69    231-264 (306)
 43 COG4733 Phage-related protein,  37.4      31 0.00067   32.2   2.8   25   40-64    762-786 (952)
 44 KOG0095|consensus               36.6      23 0.00049   26.7   1.6   26    1-27     45-70  (213)
 45 PHA02579 7 baseplate wedge sub  35.8      53  0.0012   30.6   3.9   51    9-60     29-89  (1030)
 46 cd02852 Isoamylase_N_term Isoa  35.7      48   0.001   22.4   3.0   19   36-54     52-70  (119)
 47 PF13750 Big_3_3:  Bacterial Ig  33.7 1.4E+02  0.0031   21.7   5.4   21   40-60    117-137 (158)
 48 COG3979 Uncharacterized protei  33.7      89  0.0019   23.4   4.4   41   33-73     51-92  (181)
 49 KOG4152|consensus               33.3      74  0.0016   28.5   4.3   59    8-66    716-791 (830)
 50 cd02860 Pullulanase_N_term Pul  32.6      69  0.0015   20.9   3.3   20   36-55     50-69  (100)
 51 PF00801 PKD:  PKD domain;  Int  31.6 1.2E+02  0.0026   18.2   5.2   45    8-60     22-66  (69)
 52 cd05894 Ig_C5_MyBP-C C5 immuno  30.6 1.4E+02  0.0031   18.7   4.8   27   34-60     51-77  (86)
 53 PF05345 He_PIG:  Putative Ig d  30.4      61  0.0013   18.9   2.5   14   46-59     36-49  (49)
 54 cd05851 Ig3_Contactin-1 Third   30.0      80  0.0017   20.0   3.2   31   34-64     52-82  (88)
 55 COG3422 Uncharacterized conser  30.0      42 0.00092   20.9   1.7   14   44-57     11-24  (59)
 56 cd02853 MTHase_N_term Maltooli  29.7      65  0.0014   20.4   2.7   17   36-53     43-59  (85)
 57 PF13860 FlgD_ig:  FlgD Ig-like  29.6 1.5E+02  0.0033   18.7   4.8   16   42-57     65-80  (81)
 58 PF00907 T-box:  T-box;  InterP  28.7      70  0.0015   23.5   3.1   27   33-59     31-57  (184)
 59 PF00028 Cadherin:  Cadherin do  28.5      50  0.0011   20.8   2.0   48    9-59     28-78  (93)
 60 PF12514 DUF3718:  Protein of u  27.1      19 0.00042   22.8  -0.2   15  109-123    25-39  (68)
 61 PRK12634 flgD flagellar basal   25.9 2.9E+02  0.0063   21.3   6.2   17   43-59    163-179 (221)
 62 PF02018 CBM_4_9:  Carbohydrate  25.6      73  0.0016   20.8   2.5   20   40-59     56-75  (131)
 63 COG5584 Predicted small secret  25.1 2.1E+02  0.0045   19.7   4.6   44    9-56     51-94  (103)
 64 cd04972 Ig_TrkABC_d4 Fourth do  24.8 1.2E+02  0.0027   18.9   3.4   27   35-61     56-82  (90)
 65 COG3900 Predicted periplasmic   24.1 2.7E+02  0.0058   22.2   5.6   62    7-71    171-238 (262)
 66 PF05738 Cna_B:  Cna protein B-  23.7      87  0.0019   18.7   2.4   25   29-54     21-45  (70)
 67 cd05893 Ig_Palladin_C C-termin  23.7      52  0.0011   20.4   1.4   27   36-62     42-68  (75)
 68 KOG4806|consensus               22.8 1.1E+02  0.0024   26.0   3.5   25   36-60    411-435 (454)
 69 KOG1948|consensus               22.3 2.9E+02  0.0063   26.4   6.2   22   34-55    945-966 (1165)
 70 cd05765 Ig_3 Subgroup of the i  21.9 1.5E+02  0.0032   17.7   3.3   28   34-61     46-73  (81)
 71 PLN02960 alpha-amylase          21.8 1.3E+02  0.0029   28.3   4.0   41    5-54    306-346 (897)
 72 PLN03144 Carbon catabolite rep  21.5 2.3E+02   0.005   25.4   5.3   65    6-72    151-218 (606)
 73 PRK06655 flgD flagellar basal   21.3 3.9E+02  0.0084   20.7   6.1   13   43-55    167-179 (225)

No 1  
>KOG0196|consensus
Probab=99.24  E-value=9.8e-12  Score=109.73  Aligned_cols=72  Identities=22%  Similarity=0.327  Sum_probs=63.8

Q ss_pred             ccceeCCcEEEEEEEeeCCC-CCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCC
Q psy7019           3 KTTEFNQASFKASHSLSDRP-DDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTA   74 (128)
Q Consensus         3 ~~~~~~~~~i~Y~le~~~~~-~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~   74 (128)
                      ++-+-||.++.|+|++++++ ++........+.++.++++|+|++.|.|||||++.+|+|+||...+|+|.+.
T Consensus       465 ~p~~png~ildYEvky~ek~~~e~~~~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~~~~fqT~~~  537 (996)
T KOG0196|consen  465 EPDQPNGVILDYEVKYYEKDEDERSYSTLKTKTTTATITGLKPGTVYVFQVRARTAAGYGPYSGKHEFQTLPS  537 (996)
T ss_pred             CCCCCCCcceeEEEEEeeccccccceeEEecccceEEeeccCCCcEEEEEEEEecccCCCCCCCceeeeecCc
Confidence            56678999999999999875 4555566666889999999999999999999999999999999999999875


No 2  
>PF00041 fn3:  Fibronectin type III domain;  InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=98.80  E-value=1e-08  Score=65.28  Aligned_cols=59  Identities=27%  Similarity=0.270  Sum_probs=47.2

Q ss_pred             eeCCcEEEEEEEeeCCCCCC-eE-EEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019           6 EFNQASFKASHSLSDRPDDR-KH-VVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYS   64 (128)
Q Consensus         6 ~~~~~~i~Y~le~~~~~~~~-~~-~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S   64 (128)
                      ..++...+|.|++....+.. +. ....+..+.+.+.+|.|++.|.|+|+|.|..|.|++|
T Consensus        25 ~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~~~~g~g~~S   85 (85)
T PF00041_consen   25 SGNGPITGYRVEYRSVNSTSDWQEVTVPGNETSYTITGLQPGTTYEFRVRAVNSDGEGPPS   85 (85)
T ss_dssp             STSSSESEEEEEEEETTSSSEEEEEEEETTSSEEEEESCCTTSEEEEEEEEEETTEEEEEE
T ss_pred             CCCCCeeEEEEEEEecccceeeeeeeeeeeeeeeeeccCCCCCEEEEEEEEEeCCcCcCCC
Confidence            34666779999998655554 43 3455566799999999999999999999999998875


No 3  
>KOG3513|consensus
Probab=98.56  E-value=4.7e-07  Score=82.48  Aligned_cols=75  Identities=28%  Similarity=0.296  Sum_probs=62.3

Q ss_pred             cceeCCcEEEEEEEeeCCCCC---CeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCCCC
Q psy7019           4 TTEFNQASFKASHSLSDRPDD---RKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPPAP   78 (128)
Q Consensus         4 ~~~~~~~~i~Y~le~~~~~~~---~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp~~   78 (128)
                      +.+-||-..+|+|++...++.   .......+..++.++++|+|++.|.|.|||.|.+|.||.|......|..+||.+
T Consensus       843 ~~~~nG~l~gY~v~Y~~~~~~~~~~~~~~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~tt~k~pPs~  920 (1051)
T KOG3513|consen  843 PLWDNGKLTGYEVKYWKINEKEGSLSRVQIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENVTTKKAPPSQ  920 (1051)
T ss_pred             cCccCCccceeEEEEEEcCCCcccccceeecCCcceEeeeCCCCCceEEEEEEEecCCCCCCCccceeccccCCCCcc
Confidence            456788889999999753332   233344578899999999999999999999999999999999999998888875


No 4  
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.48  E-value=1.2e-06  Score=54.44  Aligned_cols=62  Identities=21%  Similarity=0.203  Sum_probs=49.7

Q ss_pred             CcEEEEEEEeeCCCCCCeEEEEec--CCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeee
Q psy7019           9 QASFKASHSLSDRPDDRKHVVYQG--TNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFT   70 (128)
Q Consensus         9 ~~~i~Y~le~~~~~~~~~~~v~~g--~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~   70 (128)
                      +....|.+++.......|..+...  ..+.+.+.+|.|++.|.|+|+|.+..|.|.+|....|.
T Consensus        29 ~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~~~~~~~~~s~~~~~~   92 (93)
T cd00063          29 GPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVTVT   92 (93)
T ss_pred             CcceeEEEEEeeCCCCCCEEeeccCCcccEEEEccccCCCEEEEEEEEECCCccCCCccccccc
Confidence            556789999876445667766654  67889999999999999999999999988888755543


No 5  
>KOG4221|consensus
Probab=98.47  E-value=2e-06  Score=79.18  Aligned_cols=84  Identities=17%  Similarity=0.145  Sum_probs=67.1

Q ss_pred             ccce-eCCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCCCCCC-
Q psy7019           3 KTTE-FNQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPPAPLK-   80 (128)
Q Consensus         3 ~~~~-~~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp~~~~-   80 (128)
                      |+++ -||.+++|.+-+.+.+.+.|. .+....++++|.+|+|.+.|.|||.|.|.+|.|..|..+.|.|..+-|+.|. 
T Consensus       542 epP~~~n~~I~~yk~~ys~~~~~~~~-~~~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  542 EPPPFGNGPITGYKLFYSEDDTGKEL-RVENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             cCCCCCCCCceEEEEEEEcCCCCceE-EEecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            4445 788899999999865455554 3444788999999999999999999999999999999999999988776533 


Q ss_pred             CceEEEe
Q psy7019          81 GKTYFSL   87 (128)
Q Consensus        81 ~p~v~~~   87 (128)
                      +..+...
T Consensus       621 Nl~lev~  627 (1381)
T KOG4221|consen  621 NLSLEVV  627 (1381)
T ss_pred             ceEEEec
Confidence            3555543


No 6  
>KOG4367|consensus
Probab=98.40  E-value=1.4e-07  Score=79.31  Aligned_cols=62  Identities=27%  Similarity=0.351  Sum_probs=58.7

Q ss_pred             EEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCC
Q psy7019          12 FKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCT   73 (128)
Q Consensus        12 i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~   73 (128)
                      =+|.||..+.++++|..+|.|+++-|+|++|+-++.|.-||+|+|++|.++||+.+.++|+.
T Consensus       470 dg~~leld~g~~g~frevy~g~etmctvdglhfns~y~arvka~n~tg~s~ys~tl~lqts~  531 (699)
T KOG4367|consen  470 DGYILELDDGNGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQTSE  531 (699)
T ss_pred             cceEEEeecCCCCceeEEEecCceeEEecceecchhHHHHHHHhhccCCCcccceeEeeecc
Confidence            37999999888899999999999999999999999999999999999999999999999974


No 7  
>KOG4221|consensus
Probab=98.27  E-value=3.3e-06  Score=77.74  Aligned_cols=70  Identities=24%  Similarity=0.205  Sum_probs=57.9

Q ss_pred             eCCcEEEEEEEeeCCC-CCC-eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCC
Q psy7019           7 FNQASFKASHSLSDRP-DDR-KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPP   76 (128)
Q Consensus         7 ~~~~~i~Y~le~~~~~-~~~-~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp   76 (128)
                      -||-..+|.++++... +.+ ...+..|..+.+.+.+|+|++.|.||+.|.|..|.|++|+++.+.|+...+
T Consensus       644 ~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~  715 (1381)
T KOG4221|consen  644 QNGQITGYKIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDL  715 (1381)
T ss_pred             ccceEEEEEEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccc
Confidence            3777889999997432 223 356677788899999999999999999999999999999999999975433


No 8  
>KOG3513|consensus
Probab=97.93  E-value=5.5e-05  Score=69.29  Aligned_cols=73  Identities=22%  Similarity=0.142  Sum_probs=58.4

Q ss_pred             eeCCc-EEEEEEEeeCCCCCCeEEEEec--CC---ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCce-eeeeCCCCCCC
Q psy7019           6 EFNQA-SFKASHSLSDRPDDRKHVVYQG--TN---FSCKVNKLQELTTYRFYITATNDAGQGPYSEAY-PFTTCTAPPAP   78 (128)
Q Consensus         6 ~~~~~-~i~Y~le~~~~~~~~~~~v~~g--~~---t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~-~~~T~~~pp~~   78 (128)
                      ..|+. +..|.+|.++...+.|..+..-  ..   .+..+.+|.|-..|.|||.|.|..|.|++|.++ ...|.++.|..
T Consensus       639 ~dn~SpI~~Y~iq~rt~~~~~W~~v~~vp~~~~~~~sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~  718 (1051)
T KOG3513|consen  639 SDNNSPIEKYTIQFRTPFPGKWKAVTTVPGNITGDESATVVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSV  718 (1051)
T ss_pred             CCCCCCceEEeEEecCCCCCcceEeeECCCcccCccceeEEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCcc
Confidence            34544 5689999998888899876642  22   367899999999999999999999999999876 56777776653


No 9  
>PF09294 Interfer-bind:  Interferon-alpha/beta receptor, fibronectin type III;  InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.78  E-value=7.4e-05  Score=50.28  Aligned_cols=61  Identities=21%  Similarity=0.130  Sum_probs=43.0

Q ss_pred             EEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcC--CCCCCCCCceeeee
Q psy7019          11 SFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATND--AGQGPYSEAYPFTT   71 (128)
Q Consensus        11 ~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~--~G~G~~S~~~~~~T   71 (128)
                      .+.|.|.+.+.....-........+.+.+.+|.|++.|.++|+|...  .-.|.+|++...+|
T Consensus        43 ~~~Y~v~~~~~~~~~~~~~~~~~~~~~~l~~L~p~t~YCv~V~~~~~~~~~~s~~S~~~C~~t  105 (106)
T PF09294_consen   43 SLSYNVSYWKNGSNEKKKEIETKNSSVTLSDLKPGTNYCVSVQAFSPSQNKNSQPSEPQCITT  105 (106)
T ss_dssp             G-EEEEEEEETTTSCEEEEEESSSEEEEEES--TTSEEEEEEEEEECSSTEEEEEBSEEEEE-
T ss_pred             CeEEEEEEEeCCCccceEEEeecCCEEEEeCCCCCCCEEEEEEEEeccCCCcCCCCCCEeEeC
Confidence            47899998865555344556667778899999999999999999433  33577888777666


No 10 
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition  sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=97.62  E-value=0.0008  Score=40.18  Aligned_cols=36  Identities=28%  Similarity=0.302  Sum_probs=28.1

Q ss_pred             CeEEEEecC-CceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019          25 RKHVVYQGT-NFSCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        25 ~~~~v~~g~-~t~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      .|....... ...+.+.+|.|++.|.|+|+|.+..|.
T Consensus        46 ~~~~~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~~~g~   82 (83)
T smart00060       46 SWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE   82 (83)
T ss_pred             ccEEEEecCCccEEEEeCcCCCCEEEEEEEEEcccCC
Confidence            455444333 478999999999999999999998654


No 11 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=96.29  E-value=0.02  Score=35.10  Aligned_cols=46  Identities=15%  Similarity=0.094  Sum_probs=31.3

Q ss_pred             EEEEEeeCCCCCCeEEEEecCCc-eEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019          13 KASHSLSDRPDDRKHVVYQGTNF-SCKVNKLQELTTYRFYITATNDAGQGPYS   64 (128)
Q Consensus        13 ~Y~le~~~~~~~~~~~v~~g~~t-~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S   64 (128)
                      .|.|+  . .+++|.....  .. ++..+.|.|| .|.|+|+|.|..|.....
T Consensus        11 ~Y~l~--g-~d~~W~~~~~--~~~~~~~~~L~~G-~Y~l~V~a~~~~~~~~~~   57 (66)
T PF07495_consen   11 RYRLE--G-FDDEWITLGS--YSNSISYTNLPPG-KYTLEVRAKDNNGKWSSD   57 (66)
T ss_dssp             EEEEE--T-TESSEEEESS--TS-EEEEES--SE-EEEEEEEEEETTS-B-SS
T ss_pred             EEEEE--C-CCCeEEECCC--CcEEEEEEeCCCE-EEEEEEEEECCCCCcCcc
Confidence            45554  2 4577876554  33 8999999988 899999999998877654


No 12 
>KOG4258|consensus
Probab=96.23  E-value=0.011  Score=53.67  Aligned_cols=70  Identities=17%  Similarity=0.185  Sum_probs=54.6

Q ss_pred             cccceeCCcEEEEEEEeeCCCCCCeEEE----EecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019           2 IKTTEFNQASFKASHSLSDRPDDRKHVV----YQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC   72 (128)
Q Consensus         2 ~~~~~~~~~~i~Y~le~~~~~~~~~~~v----~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~   72 (128)
                      .|+.+-||-+++|.++++...+++-..+    +..+.-.+.++.|.|| .|..||||.+-+|.|++-+.+.|--.
T Consensus       850 ~EP~~pNGli~~Y~Vk~r~~~~et~v~cvsR~~~~k~~gv~l~~l~~G-~y~~~vratSlaGng~~t~~~~~~v~  923 (1025)
T KOG4258|consen  850 LEPKEPNGLILNYEVKYRRNGDETHVECVSRMDYAKAGGVYLKRLNPG-NYSVRVRATSLAGNGSWTESVIFYVS  923 (1025)
T ss_pred             CCCCCCCccEEEEEEEEeeccCcchhhhhhhhhhhhcCceEEEecCCC-cEEEEEEEEeeccCCCCcCCcceEEe
Confidence            4788999999999999986666554332    2224566889999988 89999999999999999887665443


No 13 
>KOG4152|consensus
Probab=96.17  E-value=0.0064  Score=52.95  Aligned_cols=37  Identities=35%  Similarity=0.511  Sum_probs=33.3

Q ss_pred             eEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019          36 SCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC   72 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~   72 (128)
                      .-+-..|.+|+.|.|||.|.|..|.|++|+..+|.|.
T Consensus       648 ~~~k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc  684 (830)
T KOG4152|consen  648 KPVKTSLVTGQAYRFRVTAINGKGPGPASTILKLKTC  684 (830)
T ss_pred             cccccccccccceeeeeeeeeccCCCchhhheeeeec
Confidence            3445678999999999999999999999999999996


No 14 
>KOG4802|consensus
Probab=96.03  E-value=0.021  Score=48.36  Aligned_cols=63  Identities=16%  Similarity=0.204  Sum_probs=44.3

Q ss_pred             CCeEEEEecC-CceEEEccCCCCCEEEEEEEEEcCCCCCCCCCcee-eeeC--CCCCCCCCCceEEE
Q psy7019          24 DRKHVVYQGT-NFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYP-FTTC--TAPPAPLKGKTYFS   86 (128)
Q Consensus        24 ~~~~~v~~g~-~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~-~~T~--~~pp~~~~~p~v~~   86 (128)
                      -+|..+.... ...+...++.||-=|.|||.|.|..|.=.||++.. |..+  +.+|++|+...+..
T Consensus       197 thwQtv~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~  263 (516)
T KOG4802|consen  197 THWQTVEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIG  263 (516)
T ss_pred             ccceeeeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeee
Confidence            3477766543 34788899999999999999999999888887764 3333  33344455556654


No 15 
>PF01108 Tissue_fac:  Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=95.89  E-value=0.032  Score=37.87  Aligned_cols=52  Identities=12%  Similarity=0.051  Sum_probs=40.6

Q ss_pred             CCcEEEEEEEeeCCCCCCeEEE---EecCCceEEEccCC--CCCEEEEEEEEEcCCC
Q psy7019           8 NQASFKASHSLSDRPDDRKHVV---YQGTNFSCKVNKLQ--ELTTYRFYITATNDAG   59 (128)
Q Consensus         8 ~~~~i~Y~le~~~~~~~~~~~v---~~g~~t~~~V~~L~--p~t~Y~FRVrA~N~~G   59 (128)
                      +.....|.+|+....++.|..+   .....++|.+++..  +...|.+||||.-...
T Consensus        47 ~~~~~~ytVq~~~~~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~rV~A~~~~~  103 (107)
T PF01108_consen   47 SPPNVTYTVQYKKYGSSSWKDVPGCQNITETSCDLTDETSDPSESYYARVRAEVGNQ  103 (107)
T ss_dssp             SSSTEEEEEEEEESSTSCEEEECCEEEESSSEEECTTCCTTTTSEEEEEEEEEETTE
T ss_pred             CCCCeEEEEEEEecCCcceeeccceecccccceeCcchhhcCcCCEEEEEEEEeCCc
Confidence            4456899999985577889876   44467899998865  7899999999986543


No 16 
>KOG4222|consensus
Probab=95.72  E-value=0.076  Score=49.73  Aligned_cols=63  Identities=19%  Similarity=0.128  Sum_probs=48.2

Q ss_pred             eeCCcEEEEEEEeeCCCCCC---eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceee
Q psy7019           6 EFNQASFKASHSLSDRPDDR---KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPF   69 (128)
Q Consensus         6 ~~~~~~i~Y~le~~~~~~~~---~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~   69 (128)
                      +.||...+|.++....+.-.   ...+. ....+.+|.+|.+|..|.|+|-|+|++|.|..|+..-.
T Consensus       776 ~~ng~~qg~ki~~~~~e~tr~h~n~t~~-a~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s~p~~~  841 (1281)
T KOG4222|consen  776 VQNGILQGYKIECSGGEKTRIHINKTTN-ARTGSVTIGNLVTGIAYSVTVAARTGAGVGVKSPPQPI  841 (1281)
T ss_pred             ccCCcccceeEEeecCcccccccccccc-CCCCceEeccccccceEEEEEeeecCCccCCCCCCeee
Confidence            46888889999986433222   22223 45678999999999999999999999999999876433


No 17 
>PLN02533 probable purple acid phosphatase
Probab=91.74  E-value=0.3  Score=41.16  Aligned_cols=35  Identities=14%  Similarity=0.184  Sum_probs=27.3

Q ss_pred             CceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCC
Q psy7019          34 NFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTA   74 (128)
Q Consensus        34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~   74 (128)
                      ...++|++|+|++.|.+||.      ....|+...|.|++.
T Consensus       102 iH~v~l~~L~p~T~Y~Yrvg------~~~~s~~~~F~T~p~  136 (427)
T PLN02533        102 INDVVIGPLKPNTVYYYKCG------GPSSTQEFSFRTPPS  136 (427)
T ss_pred             EEEEEeCCCCCCCEEEEEEC------CCCCccceEEECCCC
Confidence            35689999999999999994      223477788998754


No 18 
>KOG0196|consensus
Probab=90.90  E-value=0.55  Score=43.04  Aligned_cols=46  Identities=33%  Similarity=0.486  Sum_probs=34.5

Q ss_pred             CCceEEEccCCCCCEEEEEEEEEcCCCC-CCC---CCceeeeeCCCCCCC
Q psy7019          33 TNFSCKVNKLQELTTYRFYITATNDAGQ-GPY---SEAYPFTTCTAPPAP   78 (128)
Q Consensus        33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~-G~~---S~~~~~~T~~~pp~~   78 (128)
                      .++++.|.+|.|.+.|.|.|.|.|+... +++   ...+.+.|..+.|++
T Consensus       396 t~~~V~v~~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~  445 (996)
T KOG0196|consen  396 TETSVTVSDLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSP  445 (996)
T ss_pred             ccceEEEeccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCc
Confidence            3577999999999999999999998543 332   234677777666655


No 19 
>PF10179 DUF2369:  Uncharacterised conserved protein (DUF2369);  InterPro: IPR019326  This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=90.45  E-value=0.53  Score=38.27  Aligned_cols=29  Identities=28%  Similarity=0.331  Sum_probs=26.2

Q ss_pred             EEecCCceEEEccCCCCCEEEEEEEEEcC
Q psy7019          29 VYQGTNFSCKVNKLQELTTYRFYITATND   57 (128)
Q Consensus        29 v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~   57 (128)
                      +-.|..+.+.|.+|+|++.|.|.|.|+|.
T Consensus         9 vCvg~~t~~t~~~L~p~t~YyfdVF~vn~   37 (300)
T PF10179_consen    9 VCVGQKTNQTLSGLKPDTTYYFDVFVVNQ   37 (300)
T ss_pred             EEcCCCceEEeccCCCCCeEEEEEEEEEC
Confidence            44567899999999999999999999998


No 20 
>PF09423 PhoD:  PhoD-like phosphatase;  InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction:  A phosphate monoester + H(2)O = an alcohol + phosphate  ; PDB: 2YEQ_B.
Probab=90.34  E-value=0.52  Score=39.59  Aligned_cols=34  Identities=29%  Similarity=0.339  Sum_probs=19.7

Q ss_pred             eEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019          36 SCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC   72 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~   72 (128)
                      .+.|++|+|++.|.+|+..   .+.+..|....|+|.
T Consensus        65 ~v~v~gL~p~t~Y~Y~~~~---~~~~~~s~~g~~rT~   98 (453)
T PF09423_consen   65 KVDVTGLQPGTRYYYRFVV---DGGGQTSPVGRFRTA   98 (453)
T ss_dssp             EEEE-S--TT-EEEEEEEE-----TTEE---EEEE--
T ss_pred             ecccCCCCCCceEEEEEEE---ecCCCCCCceEEEcC
Confidence            5779999999999999998   333566777899998


No 21 
>KOG4222|consensus
Probab=90.27  E-value=0.17  Score=47.47  Aligned_cols=64  Identities=14%  Similarity=0.092  Sum_probs=47.3

Q ss_pred             EEEEEEEeeCCC-CCCeEEEE-ecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCcee-eeeCCC
Q psy7019          11 SFKASHSLSDRP-DDRKHVVY-QGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYP-FTTCTA   74 (128)
Q Consensus        11 ~i~Y~le~~~~~-~~~~~~v~-~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~-~~T~~~   74 (128)
                      ...|.+|++-.. .+.|..+- .-+.+.+.|.+|+|++.|.|-|||.|..|-+.+|.... +.|.++
T Consensus       558 ~s~yiieafs~~~~etw~~ta~~v~~t~~~I~gL~P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a  624 (1281)
T KOG4222|consen  558 ASGYIIEAFSPDLGETWQTTAGRVKTTTYAIRGLKPNLSYLFLVRAENEQGISDPSTSSDPVRTAPA  624 (1281)
T ss_pred             cchhHHHHhhhhhcccccccccccccceeeecCcCccceeeeeeeccccccccCCcccCCccccCCC
Confidence            457888876433 35576544 23578899999999999999999999999988876553 344433


No 22 
>PF10179 DUF2369:  Uncharacterised conserved protein (DUF2369);  InterPro: IPR019326  This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=85.90  E-value=4.9  Score=32.73  Aligned_cols=76  Identities=18%  Similarity=0.343  Sum_probs=51.8

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeee-eCC-----CCCCCCCCceEEEeeccccEEEEehhhhhhhhcce
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFT-TCT-----APPAPLKGKTYFSLVLSRGVCLRAGFLWVFASGRA  108 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~-T~~-----~pp~~~~~p~v~~~~~~~~v~~~~g~~~~~~~~~~  108 (128)
                      ..+.++++.||..|..|+.+.|... +..  .+.+. ++.     +-|.-|..+.+...+.-+ -|..+-++|+-+-++.
T Consensus       128 ~~f~l~~~~~g~~Yliri~~~~~~e-~~~--~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lr-tC~SvTIAW~~s~d~~  203 (300)
T PF10179_consen  128 RHFRLSGVKPGERYLIRIQISNSDE-GPS--TFKVQAASTNPSKQPYPQLPDDTSIKEFNKLR-TCNSVTIAWLGSPDRS  203 (300)
T ss_pred             EEEEECCCCCCCeEEEEEEccCCCC-Cce--EEEEEEecCCcccCCCCCCCCCCceeEEcCCc-ccceEEEEEecCCCCC
Confidence            5688999999999999998766433 222  23333 221     223345667776554334 6888999999888774


Q ss_pred             --eeeeee
Q psy7019         109 --FCVSAR  114 (128)
Q Consensus       109 --~~~~~~  114 (128)
                        .|+|++
T Consensus       204 ~kYCvy~~  211 (300)
T PF10179_consen  204 IKYCVYRR  211 (300)
T ss_pred             ceEEEEEE
Confidence              899987


No 23 
>KOG1378|consensus
Probab=78.88  E-value=3.9  Score=35.14  Aligned_cols=38  Identities=21%  Similarity=0.332  Sum_probs=30.8

Q ss_pred             ecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCC
Q psy7019          31 QGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCT   73 (128)
Q Consensus        31 ~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~   73 (128)
                      .|....+.+++|+|++.|.+||-..     ..+|++..|.|++
T Consensus       105 sg~ih~~~~~~L~~~t~YyY~~Gs~-----~~wS~~f~F~t~p  142 (452)
T KOG1378|consen  105 SGYIHDAVMKNLEPNTRYYYQVGSD-----LKWSEIFSFKTPP  142 (452)
T ss_pred             eeeEeeeeecCCCCCceEEEEeCCC-----CCcccceEeECCC
Confidence            3445678899999999999999642     2289999999987


No 24 
>KOG4806|consensus
Probab=61.95  E-value=12  Score=31.47  Aligned_cols=29  Identities=24%  Similarity=0.260  Sum_probs=25.3

Q ss_pred             EEecCCceEEEccCCCCCEEEEEEEEEcC
Q psy7019          29 VYQGTNFSCKVNKLQELTTYRFYITATND   57 (128)
Q Consensus        29 v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~   57 (128)
                      +-.|+.+.+.|.+|+|+++|.|.|..+|.
T Consensus       164 iCig~~n~~~v~~L~pdt~Y~~dvFvv~~  192 (454)
T KOG4806|consen  164 ICIGNKNIFTVSDLKPDTQYYFDVFVVNI  192 (454)
T ss_pred             EEcCCccEEEhhhcCCCceEEEEEEEEec
Confidence            44567779999999999999999999985


No 25 
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II.  Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin.  This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=61.51  E-value=33  Score=22.45  Aligned_cols=44  Identities=11%  Similarity=0.117  Sum_probs=31.4

Q ss_pred             CCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCC
Q psy7019          33 TNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPP   76 (128)
Q Consensus        33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp   76 (128)
                      ......|.+....-.=.|.+.|.|..|.-..+-.+.+.-.+.||
T Consensus        54 ~~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~~~~l~V~~~P~pP   97 (98)
T cd05762          54 NSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP   97 (98)
T ss_pred             CeeEEEECCCChhhCEEEEEEEEcCCCceeEEEEEEEecCCCCC
Confidence            34567888888777778899999999987665555555444444


No 26 
>KOG4258|consensus
Probab=59.73  E-value=15  Score=34.24  Aligned_cols=43  Identities=16%  Similarity=0.210  Sum_probs=34.3

Q ss_pred             eEEEccCCCCCEEEEEEEEEcCCCC----CCCCCceeeeeCCCCCCC
Q psy7019          36 SCKVNKLQELTTYRFYITATNDAGQ----GPYSEAYPFTTCTAPPAP   78 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~N~~G~----G~~S~~~~~~T~~~pp~~   78 (128)
                      ...+.+|+|.|+|.+-|++....-+    -..|+...++|.+..|.+
T Consensus       567 ~~~l~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~Psp  613 (1025)
T KOG4258|consen  567 GFLLDGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSP  613 (1025)
T ss_pred             ceehhcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCCCC
Confidence            5789999999999999999844222    246889999999877765


No 27 
>PF11344 DUF3146:  Protein of unknown function (DUF3146);  InterPro: IPR021492  This family of proteins with unknown function appear to be restricted to Cyanobacteria. 
Probab=57.70  E-value=7.4  Score=25.60  Aligned_cols=15  Identities=33%  Similarity=0.450  Sum_probs=13.5

Q ss_pred             cCCCCCEEEEEEEEE
Q psy7019          41 KLQELTTYRFYITAT   55 (128)
Q Consensus        41 ~L~p~t~Y~FRVrA~   55 (128)
                      .|+||..|.|-|||.
T Consensus        66 ~LEpGgdY~Ftirak   80 (80)
T PF11344_consen   66 QLEPGGDYSFTIRAK   80 (80)
T ss_pred             eccCCCceEEEEecC
Confidence            589999999999984


No 28 
>PF07353 Uroplakin_II:  Uroplakin II;  InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=57.44  E-value=19  Score=27.02  Aligned_cols=36  Identities=25%  Similarity=0.262  Sum_probs=24.7

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeee
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTT   71 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T   71 (128)
                      ..+.+.+|.||++|.|+-...++.+ .+.|.++...|
T Consensus       102 saYqVtNL~pGTkY~isY~Vtkgts-tESS~~i~msT  137 (184)
T PF07353_consen  102 SAYQVTNLQPGTKYYISYLVTKGTS-TESSNEIPMST  137 (184)
T ss_pred             eeEEeeccCCCcEEEEEEEEecCcc-ceecceecccc
Confidence            3488999999999999988866522 23333444444


No 29 
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=55.43  E-value=29  Score=32.30  Aligned_cols=38  Identities=11%  Similarity=0.049  Sum_probs=32.2

Q ss_pred             CCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCC
Q psy7019          24 DRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGP   62 (128)
Q Consensus        24 ~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~   62 (128)
                      +.|...-++..+.+.+.+|-+ ..|..||||.|..|...
T Consensus       649 ~~~vta~rt~~~~~~~~gi~~-GqY~i~VrAiN~~g~~~  686 (952)
T COG4733         649 GNWITAPRTSAAGFDVEGIPA-GQYAIRVRAINVFEPNS  686 (952)
T ss_pred             CceEecccccccceeecCcCc-cceEEEEEEeeccCCCC
Confidence            467777777889999999998 59999999999988743


No 30 
>KOG4802|consensus
Probab=51.63  E-value=28  Score=29.96  Aligned_cols=36  Identities=19%  Similarity=0.219  Sum_probs=29.1

Q ss_pred             EEEccCCCCCEEEEEEEEEcCCCCCCCC-C-ceeeeeC
Q psy7019          37 CKVNKLQELTTYRFYITATNDAGQGPYS-E-AYPFTTC   72 (128)
Q Consensus        37 ~~V~~L~p~t~Y~FRVrA~N~~G~G~~S-~-~~~~~T~   72 (128)
                      ..|++|.|+..|..+|+|...-|.+..- + .+.|.|.
T Consensus       316 ~si~~L~Pns~Y~VevqAi~y~g~~rLksek~~~F~~T  353 (516)
T KOG4802|consen  316 FSIKELLPNSSYYVEVQAISYLGSRRLKSEKWMLFNTT  353 (516)
T ss_pred             hhhhhcCCCCeEEEEEEEEEeccCcccccceEEEEeec
Confidence            4499999999999999999999999874 3 2455553


No 31 
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=49.87  E-value=23  Score=23.12  Aligned_cols=25  Identities=24%  Similarity=0.316  Sum_probs=21.2

Q ss_pred             ecCCceEEEccCCCCCEEEEEEEEE
Q psy7019          31 QGTNFSCKVNKLQELTTYRFYITAT   55 (128)
Q Consensus        31 ~g~~t~~~V~~L~p~t~Y~FRVrA~   55 (128)
                      .|...++.-..|.+|..|.|+|+|.
T Consensus        25 ~G~~R~F~T~~L~~G~~y~Y~v~a~   49 (75)
T TIGR03000        25 TGTVRTFTTPPLEAGKEYEYTVTAE   49 (75)
T ss_pred             CccEEEEECCCCCCCCEEEEEEEEE
Confidence            3456778889999999999999994


No 32 
>PF04775 Bile_Hydr_Trans:  Acyl-CoA thioester hydrolase/BAAT N-terminal region;  InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=48.49  E-value=30  Score=24.20  Aligned_cols=25  Identities=20%  Similarity=0.223  Sum_probs=17.6

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCC
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAG   59 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G   59 (128)
                      -..+|++|.|+..|.++.+..+..|
T Consensus         5 ~~I~v~GL~p~~~vtl~a~~~~~~g   29 (126)
T PF04775_consen    5 VDIRVSGLPPGQEVTLRARLTDDNG   29 (126)
T ss_dssp             -EEEEES--TT-EEEEEEEEE-TTS
T ss_pred             eEEEEeCCCCCCEEEEEEEEEeCCC
Confidence            4678999999999999999997655


No 33 
>COG3540 PhoD Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]
Probab=47.55  E-value=26  Score=30.59  Aligned_cols=36  Identities=19%  Similarity=0.217  Sum_probs=27.1

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCC
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAP   75 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~p   75 (128)
                      -++.+.+|.|++.|.+|+.+-+..     |....++|.+++
T Consensus       101 v~v~~~gL~P~~~yfYRf~~~~~~-----spvGrtrTapa~  136 (522)
T COG3540         101 VHVDLRGLSPDQDYFYRFKAGDER-----SPVGRTRTAPAP  136 (522)
T ss_pred             EEEeccCCCCCceEEEEEeeCCcc-----ccccccccCCCC
Confidence            357789999999999999986653     345566776654


No 34 
>PF02922 CBM_48:  Carbohydrate-binding module 48 (Isoamylase N-terminal domain);  InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=47.51  E-value=55  Score=20.38  Aligned_cols=20  Identities=20%  Similarity=0.326  Sum_probs=14.6

Q ss_pred             EEEc-cCCCCC-EEEEEEEEEc
Q psy7019          37 CKVN-KLQELT-TYRFYITATN   56 (128)
Q Consensus        37 ~~V~-~L~p~t-~Y~FRVrA~N   56 (128)
                      +.+. ++.+|. .|.|+|.-.+
T Consensus        53 ~~~~~~~~~g~~~Y~y~i~~~~   74 (85)
T PF02922_consen   53 VTVPGDLPPGGYYYKYRIDGDD   74 (85)
T ss_dssp             EEEEGCGTTTT-EEEEEEEETT
T ss_pred             EEEcCCcCCCCEEEEEEEEeCC
Confidence            3444 788875 9999998654


No 35 
>PF02010 REJ:  REJ domain;  InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=46.25  E-value=6.7  Score=32.32  Aligned_cols=32  Identities=28%  Similarity=0.420  Sum_probs=0.0

Q ss_pred             ccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019          40 NKLQELTTYRFYITATNDAGQGPYSEAYPFTTC   72 (128)
Q Consensus        40 ~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~   72 (128)
                      ..|++|..|.|||.+.+..|...+ ..+.|.+.
T Consensus       267 ~~l~~g~~Y~~~l~v~~~~~~~~~-a~~~~~~n  298 (440)
T PF02010_consen  267 GVLEPGSTYTFRLTVTDSSGSSGS-ASISFTVN  298 (440)
T ss_dssp             ---------------------------------
T ss_pred             cccccccccccccccccccccccc-cccccccc
Confidence            369999999999999999887553 34555553


No 36 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=44.70  E-value=71  Score=19.39  Aligned_cols=47  Identities=15%  Similarity=0.117  Sum_probs=26.4

Q ss_pred             EEEEEEEeeCCCCCCeEEEEecCCceEEE---ccCCCCCEEEEEEEEEcCCCC
Q psy7019          11 SFKASHSLSDRPDDRKHVVYQGTNFSCKV---NKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        11 ~i~Y~le~~~~~~~~~~~v~~g~~t~~~V---~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      .+.|.|.-.. ..+.|..-... . ...+   -+-+....|.|.|+|.+..+.
T Consensus        10 ~i~Y~i~~~~-~~~~F~i~~~t-g-~i~~~~~LD~e~~~~y~l~v~a~D~~~~   59 (79)
T smart00112       10 KVTYSILSGN-EDGLFSIDPET-G-EITTTKPLDREEQPEYTLTVEATDGGGP   59 (79)
T ss_pred             EEEEEEecCC-CCCEEEEeCCc-c-EEEeCCccCeeCCCeEEEEEEEEECCCC
Confidence            4889888543 22334322111 1 1222   223356899999999987654


No 37 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=44.38  E-value=66  Score=18.96  Aligned_cols=33  Identities=33%  Similarity=0.343  Sum_probs=23.4

Q ss_pred             CCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019          22 PDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYS   64 (128)
Q Consensus        22 ~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S   64 (128)
                      .++.|..-         +..+ ..-.|.+.|+|.+.+|--..+
T Consensus        10 ~~G~Ws~t---------~~~~-~dG~y~itv~a~D~AGN~s~~   42 (54)
T PF13754_consen   10 SDGNWSFT---------VPAL-ADGTYTITVTATDAAGNTSTS   42 (54)
T ss_pred             CCCcEEEe---------CCCC-CCccEEEEEEEEeCCCCCCCc
Confidence            45677643         3344 477999999999999976544


No 38 
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM:  the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=42.30  E-value=56  Score=20.76  Aligned_cols=41  Identities=12%  Similarity=-0.013  Sum_probs=26.6

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCC
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAP   75 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~p   75 (128)
                      ....|.++.+.-.=.|...|.|..|....+-.+.++..+.|
T Consensus        47 s~L~I~~~~~~D~G~YtC~A~N~~G~~~~~~~L~V~~~P~~   87 (88)
T cd05735          47 STLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDP   87 (88)
T ss_pred             EEEEECCCCcccCEEEEEEEEcCCCcceEEEEEEEeCCCCC
Confidence            34667776655555588999999998765545555444333


No 39 
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone.  It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=42.30  E-value=75  Score=18.99  Aligned_cols=28  Identities=18%  Similarity=0.288  Sum_probs=23.9

Q ss_pred             CCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019          33 TNFSCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      ......|.++.+.-.-.|.+.|.|..|.
T Consensus        38 ~~~~L~I~~~~~~D~G~Y~C~a~N~~G~   65 (74)
T cd05748          38 SSTSLVIKNAERSDSGKYTLTLKNPAGE   65 (74)
T ss_pred             CeEEEEECCCCcCcCEEEEEEEECCCcc
Confidence            3567889999988888999999999875


No 40 
>cd02856 Glycogen_debranching_enzyme_N_term Glycogen_debranching_enzyme N-terminal domain.  Glycogen debranching enzymes have both 4-alpha-glucanotransferase and amylo-1,6-glucosidase activities. As a transferase it transfers a segment of a 1,4-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or another 1,4-alpha-D-glucan. As a glucosidase it catalyzes the endohydrolysis of 1,6-alpha-D-glucoside linkages at points of branching in chains of 1,4-linked alpha-D-glucose residues.  The N-terminus of the glycogen debranching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=40.72  E-value=37  Score=22.45  Aligned_cols=19  Identities=11%  Similarity=0.165  Sum_probs=16.1

Q ss_pred             eEEEccCCCCCEEEEEEEE
Q psy7019          36 SCKVNKLQELTTYRFYITA   54 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA   54 (128)
                      +..|.++.+|..|.|||..
T Consensus        48 ~~~v~~~~~g~~Y~y~i~g   66 (103)
T cd02856          48 HGFLPGIKAGQRYGFRVHG   66 (103)
T ss_pred             EEEECCCCCCCEEEEEECC
Confidence            4677888899999999975


No 41 
>PF15417 DUF4624:  Domain of unknown function (DUF4624)
Probab=39.80  E-value=61  Score=23.01  Aligned_cols=32  Identities=22%  Similarity=0.335  Sum_probs=24.3

Q ss_pred             CCCeEEEEec----CCceEEEccCCCCCEEEEEEEE
Q psy7019          23 DDRKHVVYQG----TNFSCKVNKLQELTTYRFYITA   54 (128)
Q Consensus        23 ~~~~~~v~~g----~~t~~~V~~L~p~t~Y~FRVrA   54 (128)
                      +-.|...+.|    ...+.-+++|+.+..|..|.++
T Consensus        73 evLWsn~~~~~V~~dt~tisL~nlqk~kEY~V~ftG  108 (132)
T PF15417_consen   73 EVLWSNTWNGKVSGDTFTISLNNLQKEKEYVVCFTG  108 (132)
T ss_pred             ceeeccccccccccceEEEEhhhcccCceEEEEEec
Confidence            3447666654    4556889999999999999885


No 42 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=37.89  E-value=14  Score=29.56  Aligned_cols=34  Identities=12%  Similarity=0.034  Sum_probs=23.1

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceee
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPF   69 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~   69 (128)
                      ..+.+.+-..=.....||+|++..+.+ |+...+.
T Consensus       231 q~i~lt~~~~l~~~~lqvQaF~~~~~~-Fg~a~~C  264 (306)
T PF01299_consen  231 QSINLTDNVSLNLSDLQVQAFRVKNNT-FGTAEEC  264 (306)
T ss_pred             CEEEecCCceEEEeEEEEEEEEecCCC-CCChhcC
Confidence            345555544455677899999988877 8876554


No 43 
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=37.41  E-value=31  Score=32.16  Aligned_cols=25  Identities=16%  Similarity=0.060  Sum_probs=22.1

Q ss_pred             ccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019          40 NKLQELTTYRFYITATNDAGQGPYS   64 (128)
Q Consensus        40 ~~L~p~t~Y~FRVrA~N~~G~G~~S   64 (128)
                      .+|.||..|.|++|++|..|-..+=
T Consensus       762 ~~i~~g~~~~F~~R~Vn~vG~~~~~  786 (952)
T COG4733         762 VGIQAGLTFWFRNRNVDLVGNNDKW  786 (952)
T ss_pred             cCcCCCceEEEEeeecccccccccc
Confidence            5899999999999999999976553


No 44 
>KOG0095|consensus
Probab=36.61  E-value=23  Score=26.67  Aligned_cols=26  Identities=31%  Similarity=0.286  Sum_probs=17.6

Q ss_pred             CcccceeCCcEEEEEEEeeCCCCCCeE
Q psy7019           1 MIKTTEFNQASFKASHSLSDRPDDRKH   27 (128)
Q Consensus         1 ~~~~~~~~~~~i~Y~le~~~~~~~~~~   27 (128)
                      ||||+|.||.-+...+= .-.++++|+
T Consensus        45 miktvev~gekiklqiw-dtagqerfr   70 (213)
T KOG0095|consen   45 MIKTVEVNGEKIKLQIW-DTAGQERFR   70 (213)
T ss_pred             EEEEEEECCeEEEEEEe-eccchHHHH
Confidence            89999999998866543 223444443


No 45 
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=35.79  E-value=53  Score=30.56  Aligned_cols=51  Identities=24%  Similarity=0.182  Sum_probs=31.5

Q ss_pred             CcEEEEEEEeeCCCC--CC--------eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019           9 QASFKASHSLSDRPD--DR--------KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus         9 ~~~i~Y~le~~~~~~--~~--------~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      |+.+.|.+|+.+...  +.        |...--..++..=-+.+.|.+.|+|||+.. ..|+
T Consensus        29 G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~~~p~t~Yk~Rv~~~-~qGF   89 (1030)
T PHA02579         29 GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDKLQPNTYYKFRVAVA-AQGF   89 (1030)
T ss_pred             CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhccCCcceEEEEEEee-ccCC
Confidence            677889999874322  22        543222234444445599999999999863 3444


No 46 
>cd02852 Isoamylase_N_term Isoamylase N-terminus domain. Isoamylase (aka glycogen 6-glucanohydrolase) is one of the starch-debranching enzymes that catalyzes the hydrolysis of alpha-1,6-glucosidic linkages specific in alpha-glucans such as amylopectin or glycogen. Isoamylase contains a bound calcium ion, but this is not in the same position as the conserved calcium ion that has been reported in other alpha-amylase family enzymes. The N-terminus of isoamylase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=35.73  E-value=48  Score=22.43  Aligned_cols=19  Identities=16%  Similarity=0.224  Sum_probs=16.4

Q ss_pred             eEEEccCCCCCEEEEEEEE
Q psy7019          36 SCKVNKLQELTTYRFYITA   54 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA   54 (128)
                      ++.|.++.+|..|.|||..
T Consensus        52 ~~~v~~~~~g~~Y~y~v~g   70 (119)
T cd02852          52 HVFVEGLKPGQLYGYRVDG   70 (119)
T ss_pred             EEEECCCCCCCEEEEEECC
Confidence            4678889999999999984


No 47 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=33.73  E-value=1.4e+02  Score=21.66  Aligned_cols=21  Identities=29%  Similarity=0.509  Sum_probs=18.9

Q ss_pred             ccCCCCCEEEEEEEEEcCCCC
Q psy7019          40 NKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        40 ~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      -.|+++..|.+.|.|.+.+|=
T Consensus       117 psle~~~~YtLtV~a~D~aGN  137 (158)
T PF13750_consen  117 PSLEADDSYTLTVSATDKAGN  137 (158)
T ss_pred             CCcCCCCeEEEEEEEEecCCC
Confidence            478899999999999999985


No 48 
>COG3979 Uncharacterized protein contain chitin-binding domain type 3 [General function prediction only]
Probab=33.66  E-value=89  Score=23.39  Aligned_cols=41  Identities=22%  Similarity=0.289  Sum_probs=29.0

Q ss_pred             CCceEEEccCCCCCEEEEEEEEEcCCCC-CCCCCceeeeeCC
Q psy7019          33 TNFSCKVNKLQELTTYRFYITATNDAGQ-GPYSEAYPFTTCT   73 (128)
Q Consensus        33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~-G~~S~~~~~~T~~   73 (128)
                      ......+++|.+.+.|.|.+-+....|- +.++......+..
T Consensus        51 ~~~~~~~~g~~~et~y~~~~~a~~t~g~~s~~~~~~~~~~~~   92 (181)
T COG3979          51 TGTVVTVEGLAPETEYWTLVEAPDTSGNWSAWSRLLTVSTSG   92 (181)
T ss_pred             ccceEEecCcceeeEEEEeeeccCCCCccccceeeeeeccCc
Confidence            4677899999999999999999886332 3344444444443


No 49 
>KOG4152|consensus
Probab=33.30  E-value=74  Score=28.54  Aligned_cols=59  Identities=22%  Similarity=0.239  Sum_probs=39.6

Q ss_pred             CCcEEEEEEEeeCC--------CCCCeEEEEecCCceEEEc---------cCCCCCEEEEEEEEEcCCCCCCCCCc
Q psy7019           8 NQASFKASHSLSDR--------PDDRKHVVYQGTNFSCKVN---------KLQELTTYRFYITATNDAGQGPYSEA   66 (128)
Q Consensus         8 ~~~~i~Y~le~~~~--------~~~~~~~v~~g~~t~~~V~---------~L~p~t~Y~FRVrA~N~~G~G~~S~~   66 (128)
                      .|.+++|+.-..-.        .+-.|..+|.|-.+.|.|+         +...+..-.||+.|.|..|+|+.-..
T Consensus       716 sg~Iieys~ylAi~tq~~~~~~sql~fmr~ycg~~t~c~vt~g~l~tA~~d~t~~paivfri~aknekGyGpatqv  791 (830)
T KOG4152|consen  716 SGTIIEYSPYLAIITQAGATGVSQLPFMRSYCGIITTCVVTSGKLITAVPDITTGPAIVFRIVAKNEKGYGPATQV  791 (830)
T ss_pred             CcceEEeehhhHhhhhhhccCccccceeeeecccceeeEEecccceeeccccccCcceEEEEEecCCCCCCcceEE
Confidence            45677886543211        1123667777776666543         44567788999999999999987654


No 50 
>cd02860 Pullulanase_N_term Pullulanase domain N-terminus. Pullulanase (AKA dextrinase; alpha-dextrin endo-1,6-alpha glucosidase) is an enzyme with action similar to that of isoamylase; it cleaves 1,6-alpha-glucosidic linkages in pullulan, amylopectin, and glycogen, and in alpha-and beta-amylase limit-dextrins of amylopectin and glycogen.  The N-terminus of pullulanase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=32.62  E-value=69  Score=20.91  Aligned_cols=20  Identities=15%  Similarity=0.076  Sum_probs=16.4

Q ss_pred             eEEEccCCCCCEEEEEEEEE
Q psy7019          36 SCKVNKLQELTTYRFYITAT   55 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~   55 (128)
                      +..|.++.+|..|.|||...
T Consensus        50 ~~~v~~~~~g~~Y~y~i~~~   69 (100)
T cd02860          50 SVTLDGDLEGYYYLYEVKVY   69 (100)
T ss_pred             EEEeCCccCCcEEEEEEEEe
Confidence            45677888999999999864


No 51 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=31.63  E-value=1.2e+02  Score=18.16  Aligned_cols=45  Identities=24%  Similarity=0.221  Sum_probs=26.4

Q ss_pred             CCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019           8 NQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus         8 ~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      +|..+.|...+.+     .....  ...+..-.-..+| .|..+|.+.|..|.
T Consensus        22 ~g~~~~y~W~fgd-----~~~~~--~~~~~t~ty~~~G-~y~V~ltv~n~~g~   66 (69)
T PF00801_consen   22 DGSPVTYSWDFGD-----NGTVS--TGSSVTHTYSSPG-TYTVTLTVTNGVGS   66 (69)
T ss_dssp             TSSECEEEEE-SS-----ESEEE--CSSEEEEEESSSE-EEEEEEEEEETTSE
T ss_pred             CCCCeEEEEEECC-----CCccc--cCCCEEEEcCCCe-EEEEEEEEEECCCC
Confidence            4667778777664     11111  2333444444455 89999999998773


No 52 
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP_C exist and are included in this group: cardiac(c), and fast and slow skeletal muscle (s) MyBP_C. cMYBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C  an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=30.63  E-value=1.4e+02  Score=18.75  Aligned_cols=27  Identities=22%  Similarity=0.158  Sum_probs=23.7

Q ss_pred             CceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019          34 NFSCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      .....|.++.+.-.-.|.+.|.|..|.
T Consensus        51 ~~~L~I~~~~~~D~G~Y~c~a~N~~G~   77 (86)
T cd05894          51 LSSFVIEGAEREDEGVYTITVTNPVGE   77 (86)
T ss_pred             eEEEEECCCccCcCEEEEEEEEeCCCc
Confidence            456899999999999999999999885


No 53 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=30.38  E-value=61  Score=18.91  Aligned_cols=14  Identities=50%  Similarity=0.824  Sum_probs=12.1

Q ss_pred             CEEEEEEEEEcCCC
Q psy7019          46 TTYRFYITATNDAG   59 (128)
Q Consensus        46 t~Y~FRVrA~N~~G   59 (128)
                      -.|.|.|.|.+..|
T Consensus        36 G~y~~~vtatd~~G   49 (49)
T PF05345_consen   36 GTYTFTVTATDGSG   49 (49)
T ss_pred             cEEEEEEEEEcCCC
Confidence            38999999999876


No 54 
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=30.00  E-value=80  Score=19.97  Aligned_cols=31  Identities=16%  Similarity=0.069  Sum_probs=25.0

Q ss_pred             CceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019          34 NFSCKVNKLQELTTYRFYITATNDAGQGPYS   64 (128)
Q Consensus        34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S   64 (128)
                      .....|.++.+.-.-.|.+.|.|..|....+
T Consensus        52 ~~~L~I~~v~~~D~G~Y~C~A~N~~G~~~~~   82 (88)
T cd05851          52 GAVLKIFNIQPEDEGTYECEAENIKGKDKHQ   82 (88)
T ss_pred             CCEEEECcCChhhCEEEEEEEEcCCCceEEE
Confidence            3468899999888888999999999965433


No 55 
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=29.97  E-value=42  Score=20.88  Aligned_cols=14  Identities=36%  Similarity=0.627  Sum_probs=11.6

Q ss_pred             CCCEEEEEEEEEcC
Q psy7019          44 ELTTYRFYITATND   57 (128)
Q Consensus        44 p~t~Y~FRVrA~N~   57 (128)
                      .+.+|.||++|.|.
T Consensus        11 k~Ge~rfrlkA~N~   24 (59)
T COG3422          11 KAGEYRFRLKAANG   24 (59)
T ss_pred             CCCcEEEEEEccCc
Confidence            36689999999885


No 56 
>cd02853 MTHase_N_term Maltooligosyl trehalose synthase (MTSase) N-terminus domain. MTSase and maltooligosyl trehalose trehalohydrolase (MTHase) work together to produce trehalose. MTSase is responsible for converting the alpha-1,4-glucosidic linkage to an alpha,alpha-1,1-glucosidic linkage at the reducing end of the maltooligosaccharide through an intramolecular transglucosylation reaction, while MTHase hydrolyzes the penultimate alpha-1,4 linkage of the reducing end, resulting in the release of trehalose. The N-terminus of MTSase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=29.72  E-value=65  Score=20.42  Aligned_cols=17  Identities=24%  Similarity=0.366  Sum_probs=14.1

Q ss_pred             eEEEccCCCCCEEEEEEE
Q psy7019          36 SCKVNKLQELTTYRFYIT   53 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVr   53 (128)
                      +..+.++ +|..|.|+|.
T Consensus        43 ~~~v~~~-~g~~Y~y~v~   59 (85)
T cd02853          43 EAEVPGA-AGTRYRYRLD   59 (85)
T ss_pred             EEEeCCC-CCCeEEEEEC
Confidence            3556778 9999999997


No 57 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=29.56  E-value=1.5e+02  Score=18.69  Aligned_cols=16  Identities=38%  Similarity=0.391  Sum_probs=10.0

Q ss_pred             CCCCCEEEEEEEEEcC
Q psy7019          42 LQELTTYRFYITATND   57 (128)
Q Consensus        42 L~p~t~Y~FRVrA~N~   57 (128)
                      ..|.-.|.|+|.|.+.
T Consensus        65 ~~~~G~Y~~~v~a~~~   80 (81)
T PF13860_consen   65 PVPDGTYTFRVTATDG   80 (81)
T ss_dssp             B--SEEEEEEEEEEET
T ss_pred             CCCCCCEEEEEEEEeC
Confidence            3455678888888764


No 58 
>PF00907 T-box:  T-box;  InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=28.65  E-value=70  Score=23.50  Aligned_cols=27  Identities=11%  Similarity=0.132  Sum_probs=20.8

Q ss_pred             CCceEEEccCCCCCEEEEEEEEEcCCC
Q psy7019          33 TNFSCKVNKLQELTTYRFYITATNDAG   59 (128)
Q Consensus        33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G   59 (128)
                      +...+.|++|.|...|.+-++-.-...
T Consensus        31 P~l~y~vsGL~p~~~Y~i~l~~~~~d~   57 (184)
T PF00907_consen   31 PTLEYSVSGLDPDSLYSISLHFERVDN   57 (184)
T ss_dssp             S-EEEEEESS-TTSEEEEEEEEEESCS
T ss_pred             cccEEEecCCCCCcceEEEEEEEEecC
Confidence            467899999999999999988776533


No 59 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=28.49  E-value=50  Score=20.81  Aligned_cols=48  Identities=15%  Similarity=0.189  Sum_probs=26.7

Q ss_pred             CcEEEEEEEeeCCCCCCeEEEEecCCceEEEc---cCCCCCEEEEEEEEEcCCC
Q psy7019           9 QASFKASHSLSDRPDDRKHVVYQGTNFSCKVN---KLQELTTYRFYITATNDAG   59 (128)
Q Consensus         9 ~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~---~L~p~t~Y~FRVrA~N~~G   59 (128)
                      +..+.|.+.-.+ ..+.|..-..  .-...+.   +-+....|.|.|+|.+..|
T Consensus        28 n~~i~y~i~~~~-~~~~F~I~~~--tg~i~~~~~LD~E~~~~y~l~v~a~D~~~   78 (93)
T PF00028_consen   28 NSQITYSILGGN-PDGLFSIDPN--TGEISLKKPLDRETQSSYQLTVRATDSGG   78 (93)
T ss_dssp             TSSEEEEEEETT-STTSEEEETT--TTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred             CceEEEEEecCc-ccCceEEeee--eeccccceecCcccCCEEEEEEEEEECCC
Confidence            345667776543 2344443221  1123333   2345789999999999833


No 60 
>PF12514 DUF3718:  Protein of unknown function (DUF3718);  InterPro: IPR022193 This entry is represented by Bacteriophage Aaphi23, Orf19. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This domain family is found in bacteria and viruses, and is approximately 70 amino acids in length. There is a single completely conserved residue C that may be functionally important. 
Probab=27.13  E-value=19  Score=22.76  Aligned_cols=15  Identities=20%  Similarity=0.297  Sum_probs=12.7

Q ss_pred             eeeeeeecCCCcccc
Q psy7019         109 FCVSARSPCPGQTLS  123 (128)
Q Consensus       109 ~~~~~~~~~~~~~~~  123 (128)
                      +-++..+.||||.|.
T Consensus        25 r~i~~~l~CNG~sl~   39 (68)
T PF12514_consen   25 RNIYDGLVCNGESLL   39 (68)
T ss_pred             hhcccCCeECCcCHH
Confidence            567889999999885


No 61 
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.90  E-value=2.9e+02  Score=21.29  Aligned_cols=17  Identities=29%  Similarity=0.335  Sum_probs=12.0

Q ss_pred             CCCCEEEEEEEEEcCCC
Q psy7019          43 QELTTYRFYITATNDAG   59 (128)
Q Consensus        43 ~p~t~Y~FRVrA~N~~G   59 (128)
                      .|.-.|.|+|.|.+..|
T Consensus       163 ~~~G~Yt~~v~a~~~~G  179 (221)
T PRK12634        163 MAAGKYGVTATQTDTAG  179 (221)
T ss_pred             CCCeeeEEEEEEEeCCC
Confidence            45567888888876655


No 62 
>PF02018 CBM_4_9:  Carbohydrate binding domain;  InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=25.63  E-value=73  Score=20.83  Aligned_cols=20  Identities=25%  Similarity=0.270  Sum_probs=17.2

Q ss_pred             ccCCCCCEEEEEEEEEcCCC
Q psy7019          40 NKLQELTTYRFYITATNDAG   59 (128)
Q Consensus        40 ~~L~p~t~Y~FRVrA~N~~G   59 (128)
                      ..|++|..|.|.+.+....+
T Consensus        56 ~~l~~G~~Y~~s~~vk~~~~   75 (131)
T PF02018_consen   56 ISLKPGKTYTVSFWVKADSG   75 (131)
T ss_dssp             EEE-TTSEEEEEEEEEESSS
T ss_pred             eEecCCCEEEEEEEEEeCCC
Confidence            68999999999999988777


No 63 
>COG5584 Predicted small secreted protein [Function unknown]
Probab=25.09  E-value=2.1e+02  Score=19.72  Aligned_cols=44  Identities=25%  Similarity=0.317  Sum_probs=29.1

Q ss_pred             CcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEc
Q psy7019           9 QASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATN   56 (128)
Q Consensus         9 ~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N   56 (128)
                      |.|+.|+-|..+ ..+....+|+|.-++.+-.   +..+|.|-+.|.+
T Consensus        51 GSwI~~~pe~~~-k~~~~~~VY~gGIT~i~~g---e~~qyef~aDA~T   94 (103)
T COG5584          51 GSWIVYEPEVYP-KTDKYRLVYQGGITTIKEG---ENIQYEFYADANT   94 (103)
T ss_pred             eeEEEEeccccc-CCCcEEEEEecceeEecCC---CceEEEEEEecCC
Confidence            556666666553 6677888998865554333   3568888888744


No 64 
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=24.82  E-value=1.2e+02  Score=18.89  Aligned_cols=27  Identities=22%  Similarity=0.310  Sum_probs=23.0

Q ss_pred             ceEEEccCCCCCEEEEEEEEEcCCCCC
Q psy7019          35 FSCKVNKLQELTTYRFYITATNDAGQG   61 (128)
Q Consensus        35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G   61 (128)
                      ....|.++.+.-...|++.|.|..|.-
T Consensus        56 ~~L~I~~v~~~D~g~Y~C~A~N~~G~~   82 (90)
T cd04972          56 YNLQLSNITSETQTTVTCTAENPVGQA   82 (90)
T ss_pred             EEEEEecCCcccCEEEEEEEECCCCce
Confidence            468899999888889999999999864


No 65 
>COG3900 Predicted periplasmic protein [Function unknown]
Probab=24.06  E-value=2.7e+02  Score=22.20  Aligned_cols=62  Identities=11%  Similarity=0.193  Sum_probs=38.7

Q ss_pred             eCCcEEEEEEEeeCCCCCCeEE-EEecCCc---eEEE--ccCCCCCEEEEEEEEEcCCCCCCCCCceeeee
Q psy7019           7 FNQASFKASHSLSDRPDDRKHV-VYQGTNF---SCKV--NKLQELTTYRFYITATNDAGQGPYSEAYPFTT   71 (128)
Q Consensus         7 ~~~~~i~Y~le~~~~~~~~~~~-v~~g~~t---~~~V--~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T   71 (128)
                      .||+.+ |+|-.+ +++-.|.+ +..|...   .+.|  +.+.-+.+|.++.+-.+ .|.+-.|..+.|..
T Consensus       171 Vgg~ec-d~lafr-~ddvDfqIWIsqGeqpvp~k~VITsk~v~g~PqYtv~fsnwk-sgdev~s~~F~Fkp  238 (262)
T COG3900         171 VGGEEC-DQLAFR-KDDVDFQIWISQGEQPVPLKYVITSKDVPGEPQYTVVFSNWK-SGDEVPSSDFTFKP  238 (262)
T ss_pred             cCceee-hhhhcc-CCCCceEEEeecCCCCcceeEEEEecccCCCCcEEEEEcccc-cCCCccccceeecC
Confidence            455555 555555 36666654 4455432   2333  45667789999999876 56677777777753


No 66 
>PF05738 Cna_B:  Cna protein B-type domain;  InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=23.74  E-value=87  Score=18.72  Aligned_cols=25  Identities=24%  Similarity=0.133  Sum_probs=18.0

Q ss_pred             EEecCCceEEEccCCCCCEEEEEEEE
Q psy7019          29 VYQGTNFSCKVNKLQELTTYRFYITA   54 (128)
Q Consensus        29 v~~g~~t~~~V~~L~p~t~Y~FRVrA   54 (128)
                      .....+-.+.+.+|.+|. |.++-..
T Consensus        21 ~~Td~~G~~~f~~L~~G~-Y~l~E~~   45 (70)
T PF05738_consen   21 VTTDENGKYTFKNLPPGT-YTLKETK   45 (70)
T ss_dssp             EEGGTTSEEEEEEEESEE-EEEEEEE
T ss_pred             EEECCCCEEEEeecCCeE-EEEEEEE
Confidence            444556678999999998 7666554


No 67 
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=23.69  E-value=52  Score=20.40  Aligned_cols=27  Identities=15%  Similarity=0.155  Sum_probs=23.6

Q ss_pred             eEEEccCCCCCEEEEEEEEEcCCCCCC
Q psy7019          36 SCKVNKLQELTTYRFYITATNDAGQGP   62 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~   62 (128)
                      ...|.++++.-.-.+.+.|.|..|.-.
T Consensus        42 ~L~I~~v~~~D~G~Y~C~A~N~~G~~~   68 (75)
T cd05893          42 CLLIQGATKEDAGWYTVSAKNEAGIVS   68 (75)
T ss_pred             EEEECCCCHHHCEEEEEEEEcCCCEEE
Confidence            578999999988999999999998653


No 68 
>KOG4806|consensus
Probab=22.80  E-value=1.1e+02  Score=25.97  Aligned_cols=25  Identities=20%  Similarity=0.227  Sum_probs=20.2

Q ss_pred             eEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019          36 SCKVNKLQELTTYRFYITATNDAGQ   60 (128)
Q Consensus        36 ~~~V~~L~p~t~Y~FRVrA~N~~G~   60 (128)
                      .-+|.+|.||+.|...|-|.-..|.
T Consensus       411 teTI~gL~PgssYlldv~a~~~~g~  435 (454)
T KOG4806|consen  411 TETILGLMPGSSYLLDVTANLSMGK  435 (454)
T ss_pred             hhhhcccccCceEEEEEEEcccCCc
Confidence            3569999999999999998655443


No 69 
>KOG1948|consensus
Probab=22.30  E-value=2.9e+02  Score=26.39  Aligned_cols=22  Identities=14%  Similarity=0.176  Sum_probs=19.8

Q ss_pred             CceEEEccCCCCCEEEEEEEEE
Q psy7019          34 NFSCKVNKLQELTTYRFYITAT   55 (128)
Q Consensus        34 ~t~~~V~~L~p~t~Y~FRVrA~   55 (128)
                      +-.+++.+|.||..|..+|.+.
T Consensus       945 nG~yRiRGL~Pdc~Y~V~vk~~  966 (1165)
T KOG1948|consen  945 NGTYRIRGLLPDCEYQVHVKSY  966 (1165)
T ss_pred             CCcEEEeccCCCceEEEEEeec
Confidence            4569999999999999999986


No 70 
>cd05765 Ig_3 Subgroup of the immunoglobulin (Ig) superfamily. Ig_3: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=21.94  E-value=1.5e+02  Score=17.70  Aligned_cols=28  Identities=18%  Similarity=-0.027  Sum_probs=23.7

Q ss_pred             CceEEEccCCCCCEEEEEEEEEcCCCCC
Q psy7019          34 NFSCKVNKLQELTTYRFYITATNDAGQG   61 (128)
Q Consensus        34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G   61 (128)
                      ..+..|.++++.-.-.+...|.|..|..
T Consensus        46 ~~~L~I~~~~~~D~G~Y~C~a~N~~G~~   73 (81)
T cd05765          46 IGQLVIYNAQPQDAGLYTCTARNSGGLL   73 (81)
T ss_pred             ccEEEEccCCcccCEEEEEEEecCCceE
Confidence            3568899999888889999999998853


No 71 
>PLN02960 alpha-amylase
Probab=21.80  E-value=1.3e+02  Score=28.27  Aligned_cols=41  Identities=7%  Similarity=0.012  Sum_probs=27.9

Q ss_pred             ceeCCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEE
Q psy7019           5 TEFNQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITA   54 (128)
Q Consensus         5 ~~~~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA   54 (128)
                      ++.--..+.|++++.+ ..+.|...        .|..+.+|+.|+|+|.-
T Consensus       306 ~~~~~~~~~~~~~~~k-~~~gw~~~--------~ip~~~hG~~Yky~v~~  346 (897)
T PLN02960        306 FLEKKPPLPYWEETRK-GRKAWLKK--------YIPAIPHGSKYRVYFNT  346 (897)
T ss_pred             HhccCCCCcceeeeee-cCCcEEEE--------EccCCCCCCEEEEEEEe
Confidence            4444556789999885 44444322        35578889999999974


No 72 
>PLN03144 Carbon catabolite repressor protein 4 homolog; Provisional
Probab=21.45  E-value=2.3e+02  Score=25.43  Aligned_cols=65  Identities=8%  Similarity=-0.056  Sum_probs=42.8

Q ss_pred             eeCCcEEEEEEEeeCCC---CCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019           6 EFNQASFKASHSLSDRP---DDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC   72 (128)
Q Consensus         6 ~~~~~~i~Y~le~~~~~---~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~   72 (128)
                      +.||....|...+..+.   ++.|..+-  ....++-+.-.=|..-+|++.+.+....++..+..++.|.
T Consensus       151 ~~~~~~~~~~~~~~~~~~~~~~~w~~vg--~~~~y~p~~~d~g~~lk~~~~~~~~~~~~~~~~~~~~~t~  218 (606)
T PLN03144        151 LTNGSVPLYPSGIEQKTQVGGETWIEVG--RSKTYTPTADDVGHVLKFECVVVDAETGLPVGHPQSILTS  218 (606)
T ss_pred             cccCcccccccccccccCCCCCceEEeC--CCccccCChhhCCceEEEEEEEcccccCCCCccceeeccc
Confidence            45665555655544333   56686654  4556666666668889999999988887777666666554


No 73 
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=21.26  E-value=3.9e+02  Score=20.65  Aligned_cols=13  Identities=15%  Similarity=0.115  Sum_probs=9.5

Q ss_pred             CCCCEEEEEEEEE
Q psy7019          43 QELTTYRFYITAT   55 (128)
Q Consensus        43 ~p~t~Y~FRVrA~   55 (128)
                      .|.-.|.|+|.|.
T Consensus       167 lp~G~Yt~~V~A~  179 (225)
T PRK06655        167 LPDGNYTIKASAS  179 (225)
T ss_pred             CCCeeEEEEEEEE
Confidence            3556788888886


Done!