Query         psy4958
Match_columns 65
No_of_seqs    69 out of 71
Neff          4.3 
Searched_HMMs 46136
Date          Fri Aug 16 22:14:24 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4958.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4958hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00041 fn3:  Fibronectin type  95.9  0.0065 1.4E-07   33.8   2.0   24    2-25     61-84  (85)
  2 KOG4221|consensus               95.1   0.064 1.4E-06   47.0   6.1   58    2-63    579-643 (1381)
  3 PF00041 fn3:  Fibronectin type  94.8   0.026 5.7E-07   31.3   2.2   23   36-60      2-24  (85)
  4 smart00060 FN3 Fibronectin typ  94.3   0.062 1.3E-06   27.7   2.8   27   35-63      2-28  (83)
  5 KOG3513|consensus               94.2    0.12 2.5E-06   44.5   5.5   55    4-61    586-640 (1051)
  6 KOG1225|consensus               93.6   0.045 9.8E-07   43.7   2.0   60    2-63    423-482 (525)
  7 cd00063 FN3 Fibronectin type 3  93.2     0.1 2.3E-06   27.7   2.5   26   34-61      1-26  (93)
  8 cd00063 FN3 Fibronectin type 3  91.7    0.25 5.3E-06   26.2   2.8   22    3-24     63-84  (93)
  9 KOG0196|consensus               90.8    0.21 4.6E-06   42.6   2.7   36   23-61    321-356 (996)
 10 KOG4221|consensus               90.7    0.57 1.2E-05   41.4   5.3   57    2-61    679-744 (1381)
 11 PF14054 DUF4249:  Domain of un  90.4    0.77 1.7E-05   31.9   4.9   58    3-65     95-160 (298)
 12 KOG0196|consensus               86.6     1.5 3.2E-05   37.7   5.0   58    2-62    403-469 (996)
 13 PF02014 Reeler:  Reeler domain  86.2     5.6 0.00012   25.2   6.5   57    4-62     30-112 (132)
 14 cd08544 Reeler Reeler, the N-t  85.3     6.1 0.00013   25.0   6.4   58    4-62     30-108 (135)
 15 PF10179 DUF2369:  Uncharacteri  84.0     2.9 6.4E-05   31.4   5.1   60    3-62    134-202 (300)
 16 KOG4802|consensus               83.6     2.8   6E-05   33.8   5.0   59    3-63    215-283 (516)
 17 PF09294 Interfer-bind:  Interf  78.4     2.4 5.1E-05   25.2   2.4   15    2-16     72-86  (106)
 18 KOG3513|consensus               75.3     6.3 0.00014   34.3   4.9   58    1-60    678-741 (1051)
 19 PF13205 Big_5:  Bacterial Ig-l  66.0     7.3 0.00016   22.9   2.5   12    3-14     71-82  (107)
 20 PF09423 PhoD:  PhoD-like phosp  64.6     8.6 0.00019   28.9   3.2   39    2-42     69-108 (453)
 21 PF01108 Tissue_fac:  Tissue fa  62.6      12 0.00027   22.7   3.2   28   33-63     21-48  (107)
 22 PF14250 AbrB-like:  AbrB-like   59.8     5.5 0.00012   24.6   1.2   14    1-14     50-63  (71)
 23 PF02018 CBM_4_9:  Carbohydrate  55.6      15 0.00033   21.6   2.6   20    2-21     56-75  (131)
 24 KOG3632|consensus               53.5      19 0.00041   32.1   3.7   61    2-63    723-797 (1335)
 25 KOG4258|consensus               45.3      33 0.00071   30.1   3.9   57    2-60    571-635 (1025)
 26 KOG4222|consensus               43.2      33 0.00072   30.7   3.7   58    3-62    707-775 (1281)
 27 PF07691 PA14:  PA14 domain;  I  40.9      78  0.0017   19.2   4.2   30   30-62    109-138 (145)
 28 PF10179 DUF2369:  Uncharacteri  40.9      28  0.0006   26.3   2.6   18    1-18    264-281 (300)
 29 smart00758 PA14 domain in bact  38.3      94   0.002   19.0   4.5   32   29-64    100-131 (136)
 30 KOG4152|consensus               37.8      44 0.00095   28.3   3.4   52    4-59    654-711 (830)
 31 PF03684 UPF0179:  Uncharacteri  36.6      21 0.00046   24.3   1.3   17    3-21     37-53  (142)
 32 PF07532 Big_4:  Bacterial Ig-l  36.3      77  0.0017   17.4   3.4   14   45-58     25-38  (59)
 33 cd03869 M14_CPX_like Peptidase  36.1      24 0.00052   27.4   1.6   13    3-16    368-380 (405)
 34 COG1860 Uncharacterized protei  35.4      23  0.0005   24.5   1.3   12    2-13     38-49  (147)
 35 PF03442 CBM_X2:  Carbohydrate   35.3      20 0.00044   21.8   0.9   11    3-13     39-49  (84)
 36 PRK01177 hypothetical protein;  30.1      31 0.00067   23.5   1.2   17    3-21     39-55  (140)
 37 PF07495 Y_Y_Y:  Y_Y_Y domain;   29.4      52  0.0011   17.7   1.9   17    4-21     36-52  (66)
 38 TIGR02148 Fibro_Slime fibro-sl  28.6      61  0.0013   20.6   2.3   17    2-18     58-75  (90)
 39 COG4026 Uncharacterized protei  27.2      40 0.00087   25.4   1.5   16    3-18     28-43  (290)
 40 PF05935 Arylsulfotrans:  Aryls  27.2      71  0.0015   24.6   2.9   35    1-36     56-91  (477)
 41 KOG4258|consensus               27.1      29 0.00063   30.4   0.8   58    3-61    788-854 (1025)
 42 PF15404 PH_4:  Pleckstrin homo  24.9      89  0.0019   22.0   2.8   36    2-37     20-57  (185)
 43 cd03863 M14_CPD_II The second   24.6      54  0.0012   25.0   1.8   15    3-18    337-351 (375)
 44 cd00604 IPT_CGTD IPT domain (d  24.4 1.2E+02  0.0027   18.1   3.1   22    4-27     55-76  (81)
 45 PF07198 DUF1410:  Protein of u  21.6      61  0.0013   18.2   1.3   17    3-19     44-60  (66)
 46 PF15206 FAM209:  FAM209 family  21.3      81  0.0018   21.9   2.0   19    5-23    131-150 (150)
 47 PF07867 DUF1654:  Protein of u  21.2 1.4E+02   0.003   18.3   2.8   14   49-62     60-73  (73)
 48 COG5294 Uncharacterized protei  21.0      84  0.0018   20.9   2.0   41    4-44     50-90  (113)
 49 KOG4806|consensus               20.4 2.6E+02  0.0057   22.5   4.9   59    3-61    290-354 (454)

No 1  
>PF00041 fn3:  Fibronectin type III domain;  InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=95.88  E-value=0.0065  Score=33.81  Aligned_cols=24  Identities=25%  Similarity=0.444  Sum_probs=20.0

Q ss_pred             eeccCCCceEEEEEEecCCccCcc
Q psy4958           2 LIIGTGRAYNISVQTVSEDEISTP   25 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S~p   25 (65)
                      .+|.||+.|.|.|+++.+...|.+
T Consensus        61 ~~L~p~t~Y~~~v~a~~~~g~g~~   84 (85)
T PF00041_consen   61 TGLQPGTTYEFRVRAVNSDGEGPP   84 (85)
T ss_dssp             ESCCTTSEEEEEEEEEETTEEEEE
T ss_pred             ccCCCCCEEEEEEEEEeCCcCcCC
Confidence            479999999999999997765543


No 2  
>KOG4221|consensus
Probab=95.09  E-value=0.064  Score=47.00  Aligned_cols=58  Identities=26%  Similarity=0.395  Sum_probs=41.1

Q ss_pred             eeccCCCceEEEEEEecCC---ccCcceeeeeec---CCCC-ceeeEEecCccccceEEEEeeCCCCCC
Q psy4958           2 LIIGTGRAYNISVQTVSED---EISTPTTAQYRT---IPLR-PLSFTYDKASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~---e~S~p~~~~~RT---~Pl~-p~~L~f~~~~vt~~S~rV~W~pp~G~~   63 (65)
                      -+|.|-++|.|.|-+-...   ..|+.  +.-||   +|.+ |.||+.  .-++.+++||+|++|....
T Consensus       579 ~gL~k~TeY~~~vvA~N~~G~g~sS~~--i~V~Tlsd~PsaPP~Nl~l--ev~sStsVrVsW~pP~~~t  643 (1381)
T KOG4221|consen  579 NGLEKYTEYSIRVVAYNSAGSGVSSAD--ITVRTLSDVPSAPPQNLSL--EVVSSTSVRVSWLPPPSET  643 (1381)
T ss_pred             ecCCCccceEEEEEEecCCCCCCCCCc--eEEEeccCCCCCCCcceEE--EecCCCeEEEEccCCCccc
Confidence            3789999999998765432   33333  23344   5666 677987  5577999999999998654


No 3  
>PF00041 fn3:  Fibronectin type III domain;  InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=94.83  E-value=0.026  Score=31.29  Aligned_cols=23  Identities=30%  Similarity=0.765  Sum_probs=20.4

Q ss_pred             CceeeEEecCccccceEEEEeeCCC
Q psy4958          36 RPLSFTYDKASITSNSLRVVWEPPK   60 (65)
Q Consensus        36 ~p~~L~f~~~~vt~~S~rV~W~pp~   60 (65)
                      +|.+|++  .+++.+++.|+|.+|.
T Consensus         2 ~P~~l~v--~~~~~~sv~v~W~~~~   24 (85)
T PF00041_consen    2 APENLSV--SNISPTSVTVSWKPPS   24 (85)
T ss_dssp             SSEEEEE--EEECSSEEEEEEEESS
T ss_pred             cCcCeEE--EECCCCEEEEEEECCC
Confidence            5899988  7788999999999995


No 4  
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition  sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=94.28  E-value=0.062  Score=27.68  Aligned_cols=27  Identities=30%  Similarity=0.575  Sum_probs=17.9

Q ss_pred             CCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958          35 LRPLSFTYDKASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus        35 l~p~~L~f~~~~vt~~S~rV~W~pp~G~~   63 (65)
                      .+|.++..  .....+++.|+|++|.+..
T Consensus         2 ~~p~~~~~--~~~~~~~~~v~W~~~~~~~   28 (83)
T smart00060        2 SPPSNLRV--TDVTSTSVTLSWEPPPDDG   28 (83)
T ss_pred             CCCCcEEE--EEEeCCEEEEEECCCCCCC
Confidence            34555554  3345669999999887653


No 5  
>KOG3513|consensus
Probab=94.16  E-value=0.12  Score=44.48  Aligned_cols=55  Identities=22%  Similarity=0.329  Sum_probs=48.6

Q ss_pred             ccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      |.+.-.|.-.|||.- +++|+-..-+-|..|.+|-+|..  .+++.++++|+|.+..-
T Consensus       586 l~~~G~Y~C~aqT~~-Ds~s~~A~l~V~gpPgpP~~v~~--~~i~~t~~~lsW~~g~d  640 (1051)
T KOG3513|consen  586 LEDSGKYTCVAQTAL-DSASARADLLVRGPPGPPPDVHV--DDISDTTARLSWSPGSD  640 (1051)
T ss_pred             cccCceEEEEEEEee-cchhcccceEEecCCCCCCceeE--eeeccceEEEEeecCCC
Confidence            566778999999966 99999999999999999999977  88999999999997653


No 6  
>KOG1225|consensus
Probab=93.57  E-value=0.045  Score=43.71  Aligned_cols=60  Identities=25%  Similarity=0.151  Sum_probs=54.1

Q ss_pred             eeccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958           2 LIIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~   63 (65)
                      .+|.||..|+++|.++.+..-|.|..+...|+---+..+.|  ...+.+|+.+.|..|.-..
T Consensus       423 ~~~~~g~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~g~~~v--~~~~~~s~e~~g~~~s~~~  482 (525)
T KOG1225|consen  423 QGLEPGDEYNCSVNTVAANIGSLPKDKSETTVLCWNGGLCV--DGETESSLEVGGPCPSSGT  482 (525)
T ss_pred             eeecCCcceeeehhhhhhhhccCCcccccceEeecCCceee--eeeeeccccccCCCCCccc
Confidence            46899999999999999999999999999999888999999  6688999999999987553


No 7  
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=93.25  E-value=0.1  Score=27.71  Aligned_cols=26  Identities=31%  Similarity=0.732  Sum_probs=19.0

Q ss_pred             CCCceeeEEecCccccceEEEEeeCCCC
Q psy4958          34 PLRPLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus        34 Pl~p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      |.+|.++++  .....++++|+|++|..
T Consensus         1 p~~p~~~~~--~~~~~~~~~v~W~~~~~   26 (93)
T cd00063           1 PSPPTNLRV--TDVTSTSVTLSWTPPED   26 (93)
T ss_pred             CcCCCCcEE--EEecCCEEEEEECCCCC
Confidence            456777766  33447999999999865


No 8  
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=91.73  E-value=0.25  Score=26.22  Aligned_cols=22  Identities=23%  Similarity=0.321  Sum_probs=18.0

Q ss_pred             eccCCCceEEEEEEecCCccCc
Q psy4958           3 IIGTGRAYNISVQTVSEDEIST   24 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e~S~   24 (65)
                      +|.||..|.+.|.++..+..|.
T Consensus        63 ~l~p~~~Y~~~v~a~~~~~~~~   84 (93)
T cd00063          63 GLKPGTEYEFRVRAVNGGGESP   84 (93)
T ss_pred             cccCCCEEEEEEEEECCCccCC
Confidence            5899999999999998655553


No 9  
>KOG0196|consensus
Probab=90.76  E-value=0.21  Score=42.63  Aligned_cols=36  Identities=25%  Similarity=0.509  Sum_probs=30.8

Q ss_pred             CcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958          23 STPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus        23 S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      ++|...-=..+|.+|+||.+  + |+.+|+.+.|.+|+.
T Consensus       321 ~Dp~~mpCT~PPSaP~nlis--~-vn~Ts~~L~W~~P~d  356 (996)
T KOG0196|consen  321 SDPPSMPCTRPPSAPRNLIS--N-VNGTSLILEWSPPAD  356 (996)
T ss_pred             CCCCCCCCCCCCCccceeee--e-cccceEEEEecCCcc
Confidence            56667777889999999988  5 889999999999974


No 10 
>KOG4221|consensus
Probab=90.75  E-value=0.57  Score=41.40  Aligned_cols=57  Identities=19%  Similarity=0.319  Sum_probs=38.8

Q ss_pred             eeccCCCceEEEEEEecC---CccCcce------eeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958           2 LIIGTGRAYNISVQTVSE---DEISTPT------TAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~---~e~S~p~------~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      -+|.|++.|+|-|.++--   |-+|+..      +..+--+|.+|..|...   --+++|.|+|+||.-
T Consensus       679 ~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~~~---~g~~si~vsW~Pp~~  744 (1381)
T KOG4221|consen  679 NGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELHVH---PGSNSIVVSWTPPPH  744 (1381)
T ss_pred             hcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCceeeec---cCceeEEEEeCCCCC
Confidence            369999999988776632   2233333      22333467788888764   338999999999863


No 11 
>PF14054 DUF4249:  Domain of unknown function (DUF4249)
Probab=90.40  E-value=0.77  Score=31.86  Aligned_cols=58  Identities=22%  Similarity=0.355  Sum_probs=35.2

Q ss_pred             eccCCCceEEEEEEecCCccCcceeeeeecCCCC--ceeeEEec---Cccccce---EEEEeeCCCCCCCC
Q psy4958           3 IIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR--PLSFTYDK---ASITSNS---LRVVWEPPKGFSPF   65 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~--p~~L~f~~---~~vt~~S---~rV~W~pp~G~~~~   65 (65)
                      .+.||+.|.+.|.+=-+..+|+..     ++|.+  +.++.+..   .......   ++++|.-|.+...|
T Consensus        95 ~~~~G~~Y~L~V~~~~~~~~sa~~-----~vp~~~~i~~v~~~~~~~~~~~~~~~~~i~~~~~D~~~~~ny  160 (298)
T PF14054_consen   95 RGRPGRTYRLEVETPGGKTYSAET-----TVPPPPPIDSVSYEFKKIGDGDEGEYYRISITFQDPPGEDNY  160 (298)
T ss_pred             cccCCCEEEEEEEECCCCEEEEEE-----EECCCCceeEEEEEEecCCCCcccceeEEEEEEeCCCCCceE
Confidence            578999999999976444444432     34444  34554431   1112222   89999998877643


No 12 
>KOG0196|consensus
Probab=86.55  E-value=1.5  Score=37.73  Aligned_cols=58  Identities=19%  Similarity=0.247  Sum_probs=42.5

Q ss_pred             eeccCCCceEEEEEEecCCccC---ccee------eeeecCCCCceeeEEecCccccceEEEEeeCCCCC
Q psy4958           2 LIIGTGRAYNISVQTVSEDEIS---TPTT------AQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGF   62 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S---~p~~------~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~   62 (65)
                      -+|.|-..|...|+++- |+-.   .|..      ......|.++.+|+.  ...+.+|+.|+|..|.-.
T Consensus       403 ~~L~ah~~YTFeV~AvN-gVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~--~~~~~~sitlsW~~p~~p  469 (996)
T KOG0196|consen  403 SDLLAHTNYTFEVEAVN-GVSDLSPFPRQFASVNITTNQAAPSPVSVLRQ--VSRTSDSITLSWSEPDQP  469 (996)
T ss_pred             eccccccccEEEEEEee-cccccCCCCCcceeEEeeccccCCCccceEEE--eeeccCceEEecCCCCCC
Confidence            47999999999999987 3322   1111      124567888999987  446789999999988654


No 13 
>PF02014 Reeler:  Reeler domain Schematic picture including Reeler domain;  InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=86.16  E-value=5.6  Score=25.22  Aligned_cols=57  Identities=18%  Similarity=0.267  Sum_probs=30.7

Q ss_pred             ccCCCceEEEEEEecCCccCcceeeeeecCCCC----c----------------------eeeEEecCccccceEEEEee
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR----P----------------------LSFTYDKASITSNSLRVVWE   57 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~----p----------------------~~L~f~~~~vt~~S~rV~W~   57 (65)
                      ..||..|+|++ ..+++..-.--..|+|-.-..    +                      ..++ ..+...-+++.+.|.
T Consensus        30 y~pg~~~~Vtl-~~~~~~~F~GFllqAr~~~~~~~~~vG~f~~~~~~~~~~~~~C~~~~~~avT-H~~~~~K~~v~~~W~  107 (132)
T PF02014_consen   30 YEPGQTYTVTL-SSSGSSSFRGFLLQARDANNSSPAPVGTFQLPPDSDTTQLLDCSGGDNNAVT-HSNNSPKTSVTFTWT  107 (132)
T ss_dssp             B-TTBEEEEEE-EETTTEEBSEEEEEEEETT--SSSBSSEEEES-TTTEEEETTE----EEEEE-ESS-S-BSEEEEEEE
T ss_pred             EcCCCEEEEEE-ECCCCCceeEEEEEEEeCCCCCcccccccCcCCcccceEeecccccccceEE-ecCCCCccEEEEEEc
Confidence            57999999999 455565555555555532222    1                      1111 123333478999999


Q ss_pred             CCCCC
Q psy4958          58 PPKGF   62 (65)
Q Consensus        58 pp~G~   62 (65)
                      +|...
T Consensus       108 AP~~~  112 (132)
T PF02014_consen  108 APADF  112 (132)
T ss_dssp             --STT
T ss_pred             CCCCC
Confidence            99863


No 14 
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=85.30  E-value=6.1  Score=24.99  Aligned_cols=58  Identities=17%  Similarity=0.213  Sum_probs=33.0

Q ss_pred             ccCCCceEEEEEEecCCccCcceeeeeecCC--CCceeeEEec-------------Cccc------cceEEEEeeCCCCC
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIP--LRPLSFTYDK-------------ASIT------SNSLRVVWEPPKGF   62 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~P--l~p~~L~f~~-------------~~vt------~~S~rV~W~pp~G~   62 (65)
                      -.||..|+|+++.-.. ....--..|+|-.-  .++..+....             +.++      -+++.+.|.+|...
T Consensus        30 y~pG~~~~Vtl~~~~~-~~F~GF~lqAr~~~~~~~vG~f~~~~~~~tk~~~c~~~~~avtH~~~~~k~~v~~~W~AP~~~  108 (135)
T cd08544          30 YVPGETYTVTLSGSSP-SPFRGFLLQARDASSNEPVGTFQLPPDSGTKLLTCCGSDNAVTHSNNSKKTSVTFTWTAPSNG  108 (135)
T ss_pred             ECCCCEEEEEEECCCC-CceeEEEEEEEcCCCCcceEEEEeCCCcccEEecCCCCCCcceeCCCCCCceeEEEEECCCCC
Confidence            4799999999986542 23344455555531  1122211110             2233      45789999999864


No 15 
>PF10179 DUF2369:  Uncharacterised conserved protein (DUF2369);  InterPro: IPR019326  This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=84.01  E-value=2.9  Score=31.40  Aligned_cols=60  Identities=22%  Similarity=0.265  Sum_probs=42.1

Q ss_pred             eccCCCceEEEEEEecCCccCcceeee-------eecCCCCceeeEEecCc--cccceEEEEeeCCCCC
Q psy4958           3 IIGTGRAYNISVQTVSEDEISTPTTAQ-------YRTIPLRPLSFTYDKAS--ITSNSLRVVWEPPKGF   62 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~-------~RT~Pl~p~~L~f~~~~--vt~~S~rV~W~pp~G~   62 (65)
                      ++.||-.|-|.++..+.+|...-..++       .+-+|+-|.++++..-+  -+=+|+.|.|.+....
T Consensus       134 ~~~~g~~Yliri~~~~~~e~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~  202 (300)
T PF10179_consen  134 GVKPGERYLIRIQISNSDEGPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDR  202 (300)
T ss_pred             CCCCCCeEEEEEEccCCCCCceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCC
Confidence            568999999999877777766555555       34466668777763222  4558999999975543


No 16 
>KOG4802|consensus
Probab=83.61  E-value=2.8  Score=33.80  Aligned_cols=59  Identities=27%  Similarity=0.351  Sum_probs=33.6

Q ss_pred             eccCCCceEEEEEEecC------CccCcceeeeee-cCCCCceeeEEec---CccccceEEEEeeCCCCCC
Q psy4958           3 IIGTGRAYNISVQTVSE------DEISTPTTAQYR-TIPLRPLSFTYDK---ASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~------~e~S~p~~~~~R-T~Pl~p~~L~f~~---~~vt~~S~rV~W~pp~G~~   63 (65)
                      -+.|||-|++-|.+|+-      .+.|.|-....- -.|-+|.+|+.-+   ++. -+ +.|.|-|+..|-
T Consensus       215 ~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~-~~-~~v~w~P~~sdl  283 (516)
T KOG4802|consen  215 DMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGR-YM-LKVVWCPSKSDL  283 (516)
T ss_pred             ecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecce-EE-EEEEeCCCCCCC
Confidence            36899999999999931      122322222221 1233355554321   222 23 889999998763


No 17 
>PF09294 Interfer-bind:  Interferon-alpha/beta receptor, fibronectin type III;  InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=78.43  E-value=2.4  Score=25.20  Aligned_cols=15  Identities=33%  Similarity=0.446  Sum_probs=11.8

Q ss_pred             eeccCCCceEEEEEE
Q psy4958           2 LIIGTGRAYNISVQT   16 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~t   16 (65)
                      -.|.||+.|=|+|++
T Consensus        72 ~~L~p~t~YCv~V~~   86 (106)
T PF09294_consen   72 SDLKPGTNYCVSVQA   86 (106)
T ss_dssp             ES--TTSEEEEEEEE
T ss_pred             eCCCCCCCEEEEEEE
Confidence            368999999999998


No 18 
>KOG3513|consensus
Probab=75.32  E-value=6.3  Score=34.34  Aligned_cols=58  Identities=21%  Similarity=0.319  Sum_probs=43.1

Q ss_pred             CeeccCCCceEEEEEEecCCccCcce--eeeeecCCC---C-ceeeEEecCccccceEEEEeeCCC
Q psy4958           1 MLIIGTGRAYNISVQTVSEDEISTPT--TAQYRTIPL---R-PLSFTYDKASITSNSLRVVWEPPK   60 (65)
Q Consensus         1 ~~~L~PGR~Y~ItV~tvs~~e~S~p~--~~~~RT~Pl---~-p~~L~f~~~~vt~~S~rV~W~pp~   60 (65)
                      ++.|.|=-.|+..|.++-.-..++|.  +-..||-+.   . |+||+-  .+..-+.|.|+|+|=.
T Consensus       678 vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g--~g~~~~eLvItW~Pl~  741 (1051)
T KOG3513|consen  678 VVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKG--GGGSPTELVITWEPLP  741 (1051)
T ss_pred             EEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccc--cCCCCceEEEEeccCC
Confidence            36799999999999998654444432  456777654   4 799965  6667899999999854


No 19 
>PF13205 Big_5:  Bacterial Ig-like domain
Probab=65.99  E-value=7.3  Score=22.86  Aligned_cols=12  Identities=17%  Similarity=0.617  Sum_probs=10.8

Q ss_pred             eccCCCceEEEE
Q psy4958           3 IIGTGRAYNISV   14 (65)
Q Consensus         3 ~L~PGR~Y~ItV   14 (65)
                      .|.||+.|.|+|
T Consensus        71 ~L~~~t~Y~v~i   82 (107)
T PF13205_consen   71 PLKPGTTYTVTI   82 (107)
T ss_pred             cCCCCCEEEEEE
Confidence            488999999998


No 20 
>PF09423 PhoD:  PhoD-like phosphatase;  InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction:  A phosphate monoester + H(2)O = an alcohol + phosphate  ; PDB: 2YEQ_B.
Probab=64.60  E-value=8.6  Score=28.91  Aligned_cols=39  Identities=23%  Similarity=0.392  Sum_probs=17.4

Q ss_pred             eeccCCCceEEEEEEecCCccCcceeeeeecCCCC-ceeeEE
Q psy4958           2 LIIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR-PLSFTY   42 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~-p~~L~f   42 (65)
                      -+|.||+.|...|..-.++..| + .-+-||.|.. +..++|
T Consensus        69 ~gL~p~t~Y~Y~~~~~~~~~~s-~-~g~~rT~p~~~~~~~r~  108 (453)
T PF09423_consen   69 TGLQPGTRYYYRFVVDGGGQTS-P-VGRFRTAPDGDPDPFRF  108 (453)
T ss_dssp             -S--TT-EEEEEEEE--TTEE-----EEEE--TT-----EEE
T ss_pred             CCCCCCceEEEEEEEecCCCCC-C-ceEEEcCCCCCCCceEE
Confidence            4799999998888862222333 3 3567888877 666776


No 21 
>PF01108 Tissue_fac:  Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=62.62  E-value=12  Score=22.70  Aligned_cols=28  Identities=21%  Similarity=0.313  Sum_probs=19.5

Q ss_pred             CCCCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958          33 IPLRPLSFTYDKASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus        33 ~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~   63 (65)
                      ...+|.||+|+..   .-.-.++|+++.+..
T Consensus        21 ~lp~P~nv~~~s~---nf~~iL~W~~~~~~~   48 (107)
T PF01108_consen   21 SLPAPQNVTVDSV---NFKHILRWDPGPGSP   48 (107)
T ss_dssp             SGSSCEEEEEEEE---TTEEEEEEEESTTSS
T ss_pred             cCCCCCeeEEEEE---CCceEEEeCCCCCCC
Confidence            3344999999543   345789999966643


No 22 
>PF14250 AbrB-like:  AbrB-like transcriptional regulator
Probab=59.81  E-value=5.5  Score=24.58  Aligned_cols=14  Identities=14%  Similarity=0.413  Sum_probs=12.2

Q ss_pred             CeeccCCCceEEEE
Q psy4958           1 MLIIGTGRAYNISV   14 (65)
Q Consensus         1 ~~~L~PGR~Y~ItV   14 (65)
                      +++|.||.++.|.+
T Consensus        50 ~m~L~PGdEFeI~L   63 (71)
T PF14250_consen   50 QMGLKPGDEFEIKL   63 (71)
T ss_pred             HhCCCCCCEEEEEe
Confidence            47899999999986


No 23 
>PF02018 CBM_4_9:  Carbohydrate binding domain;  InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=55.60  E-value=15  Score=21.62  Aligned_cols=20  Identities=15%  Similarity=0.426  Sum_probs=15.5

Q ss_pred             eeccCCCceEEEEEEecCCc
Q psy4958           2 LIIGTGRAYNISVQTVSEDE   21 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e   21 (65)
                      ..|.||..|.|++|.-....
T Consensus        56 ~~l~~G~~Y~~s~~vk~~~~   75 (131)
T PF02018_consen   56 ISLKPGKTYTVSFWVKADSG   75 (131)
T ss_dssp             EEE-TTSEEEEEEEEEESSS
T ss_pred             eEecCCCEEEEEEEEEeCCC
Confidence            46899999999999777444


No 24 
>KOG3632|consensus
Probab=53.54  E-value=19  Score=32.08  Aligned_cols=61  Identities=26%  Similarity=0.439  Sum_probs=41.4

Q ss_pred             eeccCCCceEEEEEEecCCccC----------cceeeeeecCC----CCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958           2 LIIGTGRAYNISVQTVSEDEIS----------TPTTAQYRTIP----LRPLSFTYDKASITSNSLRVVWEPPKGFS   63 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S----------~p~~~~~RT~P----l~p~~L~f~~~~vt~~S~rV~W~pp~G~~   63 (65)
                      ..|.||..|+++|-.----..+          -..+.+-||+|    .+|..++... +++-..++|+|.||.=||
T Consensus       723 ~~lrpgt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~-g~spg~l~vswrPptlds  797 (1335)
T KOG3632|consen  723 MRLRPGTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVET-GGSPGRLEVSWRPPTLDS  797 (1335)
T ss_pred             hccCCCCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeec-CCCCceeeeeccCceecc
Confidence            3689999999999643310000          12344567765    4578888764 488899999999997665


No 25 
>KOG4258|consensus
Probab=45.27  E-value=33  Score=30.07  Aligned_cols=57  Identities=25%  Similarity=0.374  Sum_probs=35.7

Q ss_pred             eeccCCCceEEEEEEecCCccCcceeee-----eecCCCC---ceeeEEecCccccceEEEEeeCCC
Q psy4958           2 LIIGTGRAYNISVQTVSEDEISTPTTAQ-----YRTIPLR---PLSFTYDKASITSNSLRVVWEPPK   60 (65)
Q Consensus         2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~-----~RT~Pl~---p~~L~f~~~~vt~~S~rV~W~pp~   60 (65)
                      -+|-|-+.|-|=|.|+--.++-+...|+     -||.|..   |...-. .+ -+++-|-|+|.||.
T Consensus       571 ~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls-~s-nsSsqi~l~W~pP~  635 (1025)
T KOG4258|consen  571 DGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLS-KS-NSSSQILLKWKPPS  635 (1025)
T ss_pred             hcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCCCCcchhhh-cc-CcchheeEEecCCC
Confidence            5799999999999999434443333222     3666555   443321 12 22457999999985


No 26 
>KOG4222|consensus
Probab=43.20  E-value=33  Score=30.74  Aligned_cols=58  Identities=22%  Similarity=0.279  Sum_probs=37.1

Q ss_pred             eccCCCceEEEEEEecC------CccCcceeeeee-cCCCC-ce---eeEEecCccccceEEEEeeCCCCC
Q psy4958           3 IIGTGRAYNISVQTVSE------DEISTPTTAQYR-TIPLR-PL---SFTYDKASITSNSLRVVWEPPKGF   62 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~------~e~S~p~~~~~R-T~Pl~-p~---~L~f~~~~vt~~S~rV~W~pp~G~   62 (65)
                      -|.||+.|++.|.-...      +--|+++.+.-- -.|.. |.   +++.  ..-+.+++.|+|.+|..|
T Consensus       707 nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~--~~~n~Ta~~Vsw~~pp~d  775 (1281)
T KOG4222|consen  707 NLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSK--GSYNGTAGSVSWAPPPAD  775 (1281)
T ss_pred             ccCCCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCcccccc--ccCCCceeeEEecCCccc
Confidence            48899999999876654      333444443211 12222 34   4644  557789999999999655


No 27 
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=40.88  E-value=78  Score=19.22  Aligned_cols=30  Identities=23%  Similarity=0.461  Sum_probs=15.1

Q ss_pred             eecCCCCceeeEEecCccccceEEEEeeCCCCC
Q psy4958          30 YRTIPLRPLSFTYDKASITSNSLRVVWEPPKGF   62 (65)
Q Consensus        30 ~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~   62 (65)
                      ++.++.++.....  . .....|+|.|.+|.+.
T Consensus       109 g~~y~i~i~y~~~--~-~~~~~~~l~w~~p~~~  138 (145)
T PF07691_consen  109 GGKYPIRIEYFNR--G-GDASLIQLSWTPPGGA  138 (145)
T ss_dssp             T-EEEEEEEEEEC--S-CSSCCEEEEEE-TTS-
T ss_pred             CeeEEEEEEEEEC--C-CCCEEEEEEEECCCCc
Confidence            3455555544432  2 2223369999998764


No 28 
>PF10179 DUF2369:  Uncharacterised conserved protein (DUF2369);  InterPro: IPR019326  This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=40.88  E-value=28  Score=26.26  Aligned_cols=18  Identities=17%  Similarity=0.204  Sum_probs=15.2

Q ss_pred             CeeccCCCceEEEEEEec
Q psy4958           1 MLIIGTGRAYNISVQTVS   18 (65)
Q Consensus         1 ~~~L~PGR~Y~ItV~tvs   18 (65)
                      +.+|.||+.|=|-|+..-
T Consensus       264 I~~L~PG~~Yl~dV~~~~  281 (300)
T PF10179_consen  264 IKGLKPGTTYLFDVYVNG  281 (300)
T ss_pred             cccCCCCcEEEEEEEEec
Confidence            368999999999998764


No 29 
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=38.33  E-value=94  Score=18.98  Aligned_cols=32  Identities=25%  Similarity=0.325  Sum_probs=19.3

Q ss_pred             eeecCCCCceeeEEecCccccceEEEEeeCCCCCCC
Q psy4958          29 QYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGFSP   64 (65)
Q Consensus        29 ~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~~   64 (65)
                      .++.++.++.....    ....++++.|..|.+..|
T Consensus       100 ~g~~~~i~v~y~~~----~~~~~~~l~w~~~~~~~~  131 (136)
T smart00758      100 AGGTYPIRIEYFEA----GTGGLLKLGWTTPDAAKE  131 (136)
T ss_pred             CCcEEEEEEEEEeC----CCCEEEEEEEECCCCCcE
Confidence            34455555544422    234588999999987653


No 30 
>KOG4152|consensus
Probab=37.78  E-value=44  Score=28.26  Aligned_cols=52  Identities=27%  Similarity=0.553  Sum_probs=0.0

Q ss_pred             ccCCCceEEEEEEecCCccCcceeeeeec------CCCCceeeEEecCccccceEEEEeeCC
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTTAQYRT------IPLRPLSFTYDKASITSNSLRVVWEPP   59 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT------~Pl~p~~L~f~~~~vt~~S~rV~W~pp   59 (65)
                      |+||..|..-|.++- |.---|-..-...      +|-.|+.++..++   +..+.+.||||
T Consensus       654 lv~Gq~yrfrV~aIn-g~G~gp~s~i~~~kTc~pG~P~apS~~ri~k~---~eGi~l~wepp  711 (830)
T KOG4152|consen  654 LVTGQAYRFRVTAIN-GKGPGPASTILKLKTCAPGKPTAPSGARIKKT---IEGISLVWEPP  711 (830)
T ss_pred             cccccceeeeeeeee-ccCCCchhhheeeeeccCCCCCCccccccccc---ccceeecccCC


No 31 
>PF03684 UPF0179:  Uncharacterised protein family (UPF0179);  InterPro: IPR005369 The function of this family is unknown, however the proteins contain two cysteine clusters that may be iron sulphur redox centres.
Probab=36.60  E-value=21  Score=24.31  Aligned_cols=17  Identities=35%  Similarity=0.606  Sum_probs=13.7

Q ss_pred             eccCCCceEEEEEEecCCc
Q psy4958           3 IIGTGRAYNISVQTVSEDE   21 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e   21 (65)
                      .|.|||-|.|+  .|....
T Consensus        37 nLe~Gr~YrI~--~VR~~~   53 (142)
T PF03684_consen   37 NLEPGRRYRIV--EVRNNE   53 (142)
T ss_pred             cCCCCceEEEE--EEcCCC
Confidence            69999999997  566555


No 32 
>PF07532 Big_4:  Bacterial Ig-like domain (group 4);  InterPro: IPR011081 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.
Probab=36.28  E-value=77  Score=17.39  Aligned_cols=14  Identities=21%  Similarity=0.715  Sum_probs=11.6

Q ss_pred             CccccceEEEEeeC
Q psy4958          45 ASITSNSLRVVWEP   58 (65)
Q Consensus        45 ~~vt~~S~rV~W~p   58 (65)
                      ++.+...+.|.|+.
T Consensus        25 ~dGs~~~~~V~W~~   38 (59)
T PF07532_consen   25 SDGSTEEVPVTWDP   38 (59)
T ss_pred             CCCCEEEEEeEeCC
Confidence            66777889999995


No 33 
>cd03869 M14_CPX_like Peptidase M14-like domain of carboxypeptidase (CP)-like protein X (CPX), CPX forms a distinct subgroup of the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Proteins belonging to this subgroup include CP-like protein X1 (CPX1), CP-like protein X2 (CPX2),  and aortic CP-like protein (ACLP) and its isoform adipocyte enhancer binding protein-1 (AEBP1). AEBP1 is a truncated form of ACLP, which may arise from alternative splicing of the gene. These proteins are inactive towards standard CP substrates because they lack one or more critical active site and substrate-binding residues that are necessary for activity. They may function as binding proteins rather than as active CPs or display catalytic activity toward other substrates.  Pro
Probab=36.11  E-value=24  Score=27.42  Aligned_cols=13  Identities=15%  Similarity=0.710  Sum_probs=10.9

Q ss_pred             eccCCCceEEEEEE
Q psy4958           3 IIGTGRAYNISVQT   16 (65)
Q Consensus         3 ~L~PGR~Y~ItV~t   16 (65)
                      .|.||+ |+|+|++
T Consensus       368 ll~pG~-y~v~~~a  380 (405)
T cd03869         368 LLNPGE-YRVTAHA  380 (405)
T ss_pred             ecCCce-EEEEEEe
Confidence            489995 9999985


No 34 
>COG1860 Uncharacterized protein conserved in archaea [Function unknown]
Probab=35.42  E-value=23  Score=24.53  Aligned_cols=12  Identities=33%  Similarity=0.681  Sum_probs=10.3

Q ss_pred             eeccCCCceEEE
Q psy4958           2 LIIGTGRAYNIS   13 (65)
Q Consensus         2 ~~L~PGR~Y~It   13 (65)
                      +-|+|||-|.|+
T Consensus        38 ~nLe~GrrYkVv   49 (147)
T COG1860          38 LNLEEGRRYKVV   49 (147)
T ss_pred             eccCCCcEEEEE
Confidence            369999999986


No 35 
>PF03442 CBM_X2:  Carbohydrate binding domain X2;  InterPro: IPR005102  The structure of this module is known [] and consists of an Ig-like fold. The function of this domain is unknown, but might be involved in mediating interaction with carbohydrates.; PDB: 1EHX_A.
Probab=35.34  E-value=20  Score=21.76  Aligned_cols=11  Identities=27%  Similarity=0.561  Sum_probs=6.9

Q ss_pred             eccCCCceEEE
Q psy4958           3 IIGTGRAYNIS   13 (65)
Q Consensus         3 ~L~PGR~Y~It   13 (65)
                      .|++|++|.|+
T Consensus        39 ~L~~g~DYtvs   49 (84)
T PF03442_consen   39 TLVEGTDYTVS   49 (84)
T ss_dssp             --G-GGGEEEE
T ss_pred             cccCCCCEEEe
Confidence            48999999883


No 36 
>PRK01177 hypothetical protein; Provisional
Probab=30.12  E-value=31  Score=23.54  Aligned_cols=17  Identities=41%  Similarity=0.534  Sum_probs=13.2

Q ss_pred             eccCCCceEEEEEEecCCc
Q psy4958           3 IIGTGRAYNISVQTVSEDE   21 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e   21 (65)
                      .|.|||-|.|+  .|...+
T Consensus        39 nLe~GrrYrI~--~VR~~~   55 (140)
T PRK01177         39 NLEPGRRYKIL--KVREIE   55 (140)
T ss_pred             cCCCCceEEEE--EEecCC
Confidence            69999999997  566433


No 37 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=29.43  E-value=52  Score=17.74  Aligned_cols=17  Identities=18%  Similarity=0.499  Sum_probs=11.2

Q ss_pred             ccCCCceEEEEEEecCCc
Q psy4958           4 IGTGRAYNISVQTVSEDE   21 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e   21 (65)
                      |.|| .|.+.|++.....
T Consensus        36 L~~G-~Y~l~V~a~~~~~   52 (66)
T PF07495_consen   36 LPPG-KYTLEVRAKDNNG   52 (66)
T ss_dssp             --SE-EEEEEEEEEETTS
T ss_pred             CCCE-EEEEEEEEECCCC
Confidence            5565 5999999887443


No 38 
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=28.58  E-value=61  Score=20.61  Aligned_cols=17  Identities=18%  Similarity=0.358  Sum_probs=14.4

Q ss_pred             eeccCCCceEEEE-EEec
Q psy4958           2 LIIGTGRAYNISV-QTVS   18 (65)
Q Consensus         2 ~~L~PGR~Y~ItV-~tvs   18 (65)
                      |+|++|+.|.+.+ +|-.
T Consensus        58 lgl~~g~~Y~~d~F~~ER   75 (90)
T TIGR02148        58 LGLKEGKTYPFDIFYCER   75 (90)
T ss_pred             cCCccCcEeeEEEEEEee
Confidence            6899999999999 6555


No 39 
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=27.24  E-value=40  Score=25.44  Aligned_cols=16  Identities=25%  Similarity=0.609  Sum_probs=13.2

Q ss_pred             eccCCCceEEEEEEec
Q psy4958           3 IIGTGRAYNISVQTVS   18 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs   18 (65)
                      .|.=|-.|||||+++-
T Consensus        28 r~aLG~eYnITisSIi   43 (290)
T COG4026          28 RLALGSEYNITISSII   43 (290)
T ss_pred             HhhhcccceeEEEeec
Confidence            3556899999999887


No 40 
>PF05935 Arylsulfotrans:  Arylsulfotransferase (ASST);  InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=27.20  E-value=71  Score=24.63  Aligned_cols=35  Identities=14%  Similarity=0.157  Sum_probs=18.9

Q ss_pred             CeeccCCCceEEEEEEec-CCccCcceeeeeecCCCC
Q psy4958           1 MLIIGTGRAYNISVQTVS-EDEISTPTTAQYRTIPLR   36 (65)
Q Consensus         1 ~~~L~PGR~Y~ItV~tvs-~~e~S~p~~~~~RT~Pl~   36 (65)
                      ++||.|+..+.|+|..-. .|+. ...+..-.|-+++
T Consensus        56 V~GLyp~~~N~V~v~~~~~~G~~-~~~~~~i~t~~~~   91 (477)
T PF05935_consen   56 VIGLYPDYENTVTVTYTDEDGKT-ETKTYTITTPPLP   91 (477)
T ss_dssp             EE---TTSEEEEEEEEEE-TTEE-EEEEEEEE-----
T ss_pred             EEEEeCCCeEEEEEEEEccCCeE-eeeEEEEECCCcC
Confidence            479999999999999887 3333 4444555555554


No 41 
>KOG4258|consensus
Probab=27.10  E-value=29  Score=30.37  Aligned_cols=58  Identities=26%  Similarity=0.324  Sum_probs=41.3

Q ss_pred             eccCCCceEEEEEEecCCc----cCcceeeeeecCCCC-----ceeeEEecCccccceEEEEeeCCCC
Q psy4958           3 IIGTGRAYNISVQTVSEDE----ISTPTTAQYRTIPLR-----PLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e----~S~p~~~~~RT~Pl~-----p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      +|.+=++|.|..|+..-.-    .|.....++||-|-.     +..+..... +..+++-++|++|+.
T Consensus       788 ~Lrh~tlY~i~l~aCnh~~~~~~cS~a~~v~~RT~~~~~aD~i~g~v~we~~-~~~~~v~l~w~EP~~  854 (1025)
T KOG4258|consen  788 GLRHFTLYRIDLQACNHATPKCGCSHAAFVFARTMPTMGADDIPGPVTWECH-IEMNSVILRWLEPKE  854 (1025)
T ss_pred             ccccchhhhhhHhhhcccccccccchhhhhhhccccccccccCCCceeEecc-cCcceEEEecCCCCC
Confidence            4566678888888765433    489999999998854     232333323 578999999999863


No 42 
>PF15404 PH_4:  Pleckstrin homology domain
Probab=24.91  E-value=89  Score=21.95  Aligned_cols=36  Identities=22%  Similarity=0.302  Sum_probs=27.5

Q ss_pred             eeccCCCc--eEEEEEEecCCccCcceeeeeecCCCCc
Q psy4958           2 LIIGTGRA--YNISVQTVSEDEISTPTTAQYRTIPLRP   37 (65)
Q Consensus         2 ~~L~PGR~--Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p   37 (65)
                      ..|.||.+  |++...+..|-.....-..++.|+|+.=
T Consensus        20 vvL~~G~Li~f~~~~Rs~tG~~~~~~~~~~~~tI~L~d   57 (185)
T PF15404_consen   20 VVLIPGFLILFQLFKRSRTGFAKPTVHYEHYLTIPLRD   57 (185)
T ss_pred             EEEeCCEEEEEEEEeeccCCcccceEEeEeeeeeccCC
Confidence            47899985  7887778776666667777888888763


No 43 
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=24.63  E-value=54  Score=24.99  Aligned_cols=15  Identities=13%  Similarity=0.474  Sum_probs=12.6

Q ss_pred             eccCCCceEEEEEEec
Q psy4958           3 IIGTGRAYNISVQTVS   18 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs   18 (65)
                      .|.||+ |+|+|+.+.
T Consensus       337 ~l~pG~-ytl~vs~~G  351 (375)
T cd03863         337 LLVPGT-YKVTASARG  351 (375)
T ss_pred             ccCCee-EEEEEEEcC
Confidence            478886 999999876


No 44 
>cd00604 IPT_CGTD IPT domain (domain D) of cyclodextrin glycosyltransferase (CGTase) and similar enzymes. These enzymes are involved in the enzymatic hydrolysis of alpha-1,4 linkages of starch polymers and belong to the glycosyl hydrolase family 13. Most consist of three domains (A,B,C) but CGTase is more complex and has two additional domains (D,E). The function of the IPT/D domain is unknown.
Probab=24.43  E-value=1.2e+02  Score=18.09  Aligned_cols=22  Identities=41%  Similarity=0.530  Sum_probs=13.7

Q ss_pred             ccCCCceEEEEEEecCCccCccee
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTT   27 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~   27 (65)
                      +.+| .|+|+|.+-. |..|....
T Consensus        55 ~~~g-~~~i~V~~~~-G~~Sn~~~   76 (81)
T cd00604          55 VAPG-NYNISVTTVD-GVTSNGYN   76 (81)
T ss_pred             CCCC-ceEEEEEECC-CcccCcEe
Confidence            3444 7888888554 66665543


No 45 
>PF07198 DUF1410:  Protein of unknown function (DUF1410);  InterPro: IPR009849 This entry represents a conserved region approximately 100 residues long, multiple copies of which are sometimes found within hypothetical Ureaplasma parvum proteins of unknown function.
Probab=21.63  E-value=61  Score=18.18  Aligned_cols=17  Identities=6%  Similarity=0.210  Sum_probs=13.3

Q ss_pred             eccCCCceEEEEEEecC
Q psy4958           3 IIGTGRAYNISVQTVSE   19 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~   19 (65)
                      .|.||+.|+|.=-+..+
T Consensus        44 ~L~~n~~Y~l~~I~~~n   60 (66)
T PF07198_consen   44 NLPPNHKYKLKKIVLNN   60 (66)
T ss_pred             CCCCCCEEEEEEEEECC
Confidence            58999999997665553


No 46 
>PF15206 FAM209:  FAM209 family
Probab=21.27  E-value=81  Score=21.94  Aligned_cols=19  Identities=21%  Similarity=0.176  Sum_probs=14.6

Q ss_pred             cCCCceE-EEEEEecCCccC
Q psy4958           5 GTGRAYN-ISVQTVSEDEIS   23 (65)
Q Consensus         5 ~PGR~Y~-ItV~tvs~~e~S   23 (65)
                      .|.--|| ||+|-|.|+|-|
T Consensus       131 ~PaDp~nniTIYeiWgeeds  150 (150)
T PF15206_consen  131 VPADPQNNITIYEIWGEEDS  150 (150)
T ss_pred             CCCCCCcceEEEEeccccCC
Confidence            4666777 999999977654


No 47 
>PF07867 DUF1654:  Protein of unknown function (DUF1654);  InterPro: IPR012449 This entry is represented by Bacteriophage F116 (Pseudomonas phage F116), Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of proteins from the Pseudomonadaceae. 
Probab=21.24  E-value=1.4e+02  Score=18.31  Aligned_cols=14  Identities=43%  Similarity=0.788  Sum_probs=11.1

Q ss_pred             cceEEEEeeCCCCC
Q psy4958          49 SNSLRVVWEPPKGF   62 (65)
Q Consensus        49 ~~S~rV~W~pp~G~   62 (65)
                      +-+++|+|+.+.-|
T Consensus        60 DGsv~i~W~~~~~d   73 (73)
T PF07867_consen   60 DGSVRIRWERPKED   73 (73)
T ss_pred             CCeEEEEEEcccCC
Confidence            46899999988654


No 48 
>COG5294 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.02  E-value=84  Score=20.90  Aligned_cols=41  Identities=17%  Similarity=0.058  Sum_probs=30.7

Q ss_pred             ccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEec
Q psy4958           4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDK   44 (65)
Q Consensus         4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~   44 (65)
                      +.||+.|+|+.+.=.|.+.+.--++..-.-+..=..|+.+.
T Consensus        50 ~~~~y~y~i~ayn~~Gkkk~v~f~a~~~lr~~aYLKlt~kg   90 (113)
T COG5294          50 GSPGYEYTITAYNKNGKKKEVKFTATHNLRKEAYLKLTHKG   90 (113)
T ss_pred             CCccceeeehhhccCCcEEEEEEEecCcCCCccEEEEEecC
Confidence            46899999999999988888777776666665555555533


No 49 
>KOG4806|consensus
Probab=20.44  E-value=2.6e+02  Score=22.50  Aligned_cols=59  Identities=17%  Similarity=0.310  Sum_probs=42.2

Q ss_pred             eccCCCceEEEEEEecCCccCcceeeee------ecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958           3 IIGTGRAYNISVQTVSEDEISTPTTAQY------RTIPLRPLSFTYDKASITSNSLRVVWEPPKG   61 (65)
Q Consensus         3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~~------RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G   61 (65)
                      ++.||+-|-.-++-+-++|+-.-++.-+      |-+|.-|.+-+..+-+-+-+|+.|.|.....
T Consensus       290 vv~~~~~~~~rf~~~ndde~~~~v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~d  354 (454)
T KOG4806|consen  290 VVLPGERYLMRFEPSNDDEALQKVMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSPD  354 (454)
T ss_pred             EecCCCceEEEEEecCCchhhheeEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCcc
Confidence            5789999999999877777654444333      4456667776665555667899999986543


Done!