Query         000547
Match_columns 1432
No_of_seqs    200 out of 215
Neff          3.8 
Searched_HMMs 46136
Date          Mon Apr  1 18:51:32 2013
Command       hhsearch -i /work/01045/syshi/lefta3m/000547.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/leftcdd/000547hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0196 Tyrosine kinase, EPH (  96.6  0.0018 3.9E-08   81.4   4.4   61  796-869   261-331 (996)
  2 PF07699 GCC2_GCC3:  GCC2 and G  96.3   0.002 4.4E-08   55.3   2.0   26  805-832     9-34  (48)
  3 PF10256 Erf4:  Golgin subfamil  92.2    0.22 4.7E-06   50.0   5.1   67  966-1032   21-91  (118)
  4 PHA02637 TNF-alpha-receptor-li  91.9    0.19   4E-06   52.1   4.2   57  797-858    31-92  (127)
  5 KOG0921 Dosage compensation co  91.2     0.5 1.1E-05   61.1   7.7    7  287-293  1272-1278(1282)
  6 PF07562 NCD3G:  Nine Cysteines  91.1   0.062 1.4E-06   47.7  -0.1   36  795-832     6-51  (54)
  7 cd00185 TNFR Tumor necrosis fa  89.9    0.43 9.3E-06   46.7   4.6   66  793-864    12-87  (98)
  8 PHA02637 TNF-alpha-receptor-li  85.9    0.54 1.2E-05   48.8   2.6   41  789-830    38-87  (127)
  9 KOG0921 Dosage compensation co  82.2     3.1 6.7E-05   54.4   7.3   17  557-573  1135-1151(1282)
 10 PF07699 GCC2_GCC3:  GCC2 and G  77.5     1.8 3.9E-05   37.4   2.3   32  791-822     7-43  (48)
 11 cd00185 TNFR Tumor necrosis fa  67.2     6.5 0.00014   38.6   3.8   33  804-836    11-45  (98)
 12 PTZ00382 Variant-specific surf  63.1     4.3 9.3E-05   40.4   1.7   24  805-831     4-27  (96)
 13 KOG4069 Uncharacterized conser  60.6      19  0.0004   38.1   5.7   78  965-1046   45-135 (154)
 14 KOG4289 Cadherin EGF LAG seven  53.4      11 0.00024   51.6   3.4   86 1274-1370 2170-2256(2531)
 15 cd00055 EGF_Lam Laminin-type e  52.6     7.5 0.00016   33.9   1.3   27  789-818    17-43  (50)
 16 smart00180 EGF_Lam Laminin-typ  52.5     7.7 0.00017   33.5   1.3   20  797-816    21-40  (46)
 17 PF05268 GP38:  Phage tail fibr  51.9      65  0.0014   36.9   8.5   37  210-247   114-152 (260)
 18 PF12273 RCR:  Chitin synthesis  45.1      20 0.00043   37.0   3.2   20  881-900     2-22  (130)
 19 KOG3973 Uncharacterized conser  44.7      81  0.0018   38.2   8.2    7  289-295   453-459 (465)
 20 PF14946 DUF4501:  Domain of un  44.4      80  0.0017   34.8   7.5   37  876-912    83-121 (180)
 21 PF15496 DUF4646:  Domain of un  40.1      46 0.00099   34.6   4.9   70  971-1041   43-120 (123)
 22 KOG1836 Extracellular matrix g  40.0      19 0.00041   50.9   2.7   43  790-832   794-838 (1705)
 23 PF07354 Sp38:  Zona-pellucida-  39.1      18  0.0004   42.1   2.0   35  794-831   218-261 (271)
 24 PF15496 DUF4646:  Domain of un  38.2      36 0.00078   35.4   3.8   25 1116-1140   45-69  (123)
 25 PF00053 Laminin_EGF:  Laminin   37.3      13 0.00028   32.0   0.4   22  797-818    21-42  (49)
 26 KOG1836 Extracellular matrix g  36.9      28  0.0006   49.4   3.5   36  796-831   697-733 (1705)
 27 PF10256 Erf4:  Golgin subfamil  34.7      60  0.0013   32.8   4.7   56 1114-1169   26-106 (118)
 28 PRK09677 putative lipopolysacc  34.3      96  0.0021   33.9   6.5   61   87-150    35-95  (192)
 29 KOG2675 Adenylate cyclase-asso  33.6      25 0.00053   43.4   2.0   26  341-366   445-470 (480)
 30 KOG4611 Uncharacterized conser  32.3      36 0.00078   41.0   3.0   48  807-873    52-125 (747)
 31 cd00064 FU Furin-like repeats.  30.5      38 0.00082   29.1   2.1   22  796-817    18-42  (49)
 32 KOG4258 Insulin/growth factor   29.1      35 0.00077   45.1   2.4   46  805-873   304-350 (1025)
 33 KOG4180 Predicted kinase [Gene  29.0      99  0.0021   37.4   5.7   28 1075-1103   96-123 (395)
 34 PF02166 Androgen_recep:  Andro  27.5      20 0.00044   42.8   0.0   26   21-57    335-360 (423)
 35 PF05281 Secretogranin_V:  Neur  26.5      96  0.0021   35.3   4.9   91  761-867    49-144 (208)
 36 KOG4260 Uncharacterized conser  25.3      73  0.0016   37.6   3.8   50  798-866   132-181 (350)
 37 COG4907 Predicted membrane pro  24.7      50  0.0011   41.2   2.5   27  250-293   569-595 (595)
 38 cd04299 GT1_Glycogen_Phosphory  23.5      51  0.0011   43.7   2.4   77 1018-1096  435-522 (778)
 39 KOG0994 Extracellular matrix g  23.4      79  0.0017   43.2   4.0   65  797-866   785-856 (1758)
 40 PF15195 TMEM210:  TMEM210 fami  23.1      51  0.0011   33.5   1.8   10   41-50    101-110 (116)
 41 PF04519 Bactofilin:  Polymer-f  23.0 3.1E+02  0.0067   26.9   7.2   57   71-150     5-61  (101)
 42 PF00020 TNFR_c6:  TNFR/NGFR cy  22.4      39 0.00085   28.0   0.8   19  810-830     1-19  (39)
 43 PF12661 hEGF:  Human growth fa  21.8      28  0.0006   23.6  -0.2   11  797-807     3-13  (13)
 44 PF05268 GP38:  Phage tail fibr  21.6 1.6E+02  0.0035   33.9   5.4   21  245-265   113-134 (260)
 45 PRK06958 single-stranded DNA-b  21.6 5.1E+02   0.011   29.1   9.2   35  108-144    71-108 (182)

No 1  
>KOG0196 consensus Tyrosine kinase, EPH (ephrin) receptor family [Signal transduction mechanisms]
Probab=96.58  E-value=0.0018  Score=81.38  Aligned_cols=61  Identities=30%  Similarity=0.688  Sum_probs=45.1

Q ss_pred             cCCCCCcc----cccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccC------Cc
Q 000547          796 KACPRGLY----GVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHM------PH  865 (1432)
Q Consensus       796 ~~CP~G~~----G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~------p~  865 (1432)
                      =-|.+||.    |.-|+.||.||||...|  ..+|.+||+++..+         ....+.|.  |..++|+.      --
T Consensus       261 C~C~aGye~~~~~~~C~aCp~G~yK~~~~--~~~C~~CP~~S~s~---------~ega~~C~--C~~gyyRA~~Dp~~mp  327 (996)
T KOG0196|consen  261 CVCKAGYEEAENGKACQACPPGTYKASQG--DSLCLPCPPNSHSS---------SEGATSCT--CENGYYRADSDPPSMP  327 (996)
T ss_pred             eeecCCCCcccCCCcceeCCCCcccCCCC--CCCCCCCCCCCCCC---------CCCCCccc--ccCCcccCCCCCCCCC
Confidence            46999984    67799999999999865  48999999865332         22355675  88888873      24


Q ss_pred             ccch
Q 000547          866 CYTT  869 (1432)
Q Consensus       866 C~T~  869 (1432)
                      |.+|
T Consensus       328 CT~P  331 (996)
T KOG0196|consen  328 CTRP  331 (996)
T ss_pred             CCCC
Confidence            7665


No 2  
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=96.34  E-value=0.002  Score=55.27  Aligned_cols=26  Identities=31%  Similarity=0.887  Sum_probs=22.9

Q ss_pred             ccceeCCCCceecccCCCCCCcccCCCC
Q 000547          805 VFCEECPVGTFKNVSGSDRALCRNCSSN  832 (1432)
Q Consensus       805 ~fC~eCP~GtYKn~~Gs~~~~C~pC~~~  832 (1432)
                      .-|++||.||||+..|.  ..|.+||.+
T Consensus         9 ~~C~~Cp~GtYq~~~g~--~~C~~Cp~g   34 (48)
T PF07699_consen    9 NKCQPCPKGTYQDEEGQ--TSCTPCPPG   34 (48)
T ss_pred             CccCCCCCCccCCccCC--ccCccCcCC
Confidence            46999999999999886  589999975


No 3  
>PF10256 Erf4:  Golgin subfamily A member 7/ERF4 family;  InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4. 
Probab=92.24  E-value=0.22  Score=49.97  Aligned_cols=67  Identities=19%  Similarity=0.347  Sum_probs=44.9

Q ss_pred             CCcchhHhhhcHHHHHHHHHHHHHHHhhh--h--hhhhhhheeeheecchhHHHHHHHHHHHHHHHHHHHH
Q 000547          966 SPPEQVIEIVYEDAFNRFADEINALAAYQ--W--WEGSVYSILSVLAYPLAWSWLQLCRKNKLQQLREFVR 1032 (1432)
Q Consensus       966 spP~~i~~ivyed~fn~Fad~IN~laay~--~--We~~i~~iLsvl~YPlaw~~~q~rRrkk~~rL~efv~ 1032 (1432)
                      .-|.++...+.+++|++++++||+..+=.  .  |.-++-.+|.++..=+..++.....||++++|.+|+.
T Consensus        21 ~~P~~L~~~is~~ef~~iI~~IN~~l~~a~~~~~~~~~~~~~l~~lt~~l~~~~~~~~~~~~~~~le~~l~   91 (118)
T PF10256_consen   21 EYPGELSGYISPEEFEEIINTINQILKEAFEPISWRNIIENILGCLTLGLSSLCFKTHYKRKLRELEKYLE   91 (118)
T ss_pred             cCCHhhcCCCCHHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            56777999999999999999999987322  2  4444544555544222122234455677888999997


No 4  
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=91.86  E-value=0.19  Score=52.14  Aligned_cols=57  Identities=23%  Similarity=0.571  Sum_probs=38.6

Q ss_pred             CCCCCcc---cccceeCCCCceecccC--CCCCCcccCCCCCCCCCcceecccCcccCCCCCccccC
Q 000547          797 ACPRGLY---GVFCEECPVGTFKNVSG--SDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVS  858 (1432)
Q Consensus       797 ~CP~G~~---G~fC~eCP~GtYKn~~G--s~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~s  858 (1432)
                      .|..+-|   +.-|.+||.|+|+...=  .....|.|||.++.....++..     ....|.-.|++
T Consensus        31 ~C~e~EY~~~~~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe~~N~~~-----~C~~C~~~Cd~   92 (127)
T PHA02637         31 KCKDNEYKRHNLCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTSHNNHLP-----ACLSCNGRCDR   92 (127)
T ss_pred             CCCCCcCcCCCeEcCCCCCCCEEeCcCCCCCCcccccCCCCCeeccCCCCC-----cccccCCccCc
Confidence            6888865   34589999999987431  1245799999876555444442     35677777765


No 5  
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=91.20  E-value=0.5  Score=61.12  Aligned_cols=7  Identities=57%  Similarity=1.246  Sum_probs=2.9

Q ss_pred             CCCCCCe
Q 000547          287 AGGGGGR  293 (1432)
Q Consensus       287 GGGGGGR  293 (1432)
                      +|++|||
T Consensus      1272 ~G~~Gg~ 1278 (1282)
T KOG0921|consen 1272 GGGRGGN 1278 (1282)
T ss_pred             CCCcccc
Confidence            3344444


No 6  
>PF07562 NCD3G:  Nine Cysteines Domain of family 3 GPCR;  InterPro: IPR011500 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents a conserved sequence, found in the extracellular region, that contains several highly-conserved Cys residues that are predicted to form disulphide bridges.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway; PDB: 2E4X_B 2E4Y_A 2E4U_B 2E4V_B 2E4W_A.
Probab=91.05  E-value=0.062  Score=47.67  Aligned_cols=36  Identities=31%  Similarity=0.744  Sum_probs=20.3

Q ss_pred             ccCCCCCccc----------ccceeCCCCceecccCCCCCCcccCCCC
Q 000547          795 GKACPRGLYG----------VFCEECPVGTFKNVSGSDRALCRNCSSN  832 (1432)
Q Consensus       795 g~~CP~G~~G----------~fC~eCP~GtYKn~~Gs~~~~C~pC~~~  832 (1432)
                      ...|++|++-          +-|++||.|+|.|.+  +...|.+||.+
T Consensus         6 S~~C~pG~~k~~~~~~~~CCw~C~~C~~~~is~~~--~~~~C~~C~~~   51 (54)
T PF07562_consen    6 SEPCPPGQRKKIQKGQPSCCWDCVPCPEGEISNQT--DSTSCTKCPEG   51 (54)
T ss_dssp             S----TTTEEEE--SSS-S--EEEE--TTEEEE----ETTEEEE--TT
T ss_pred             CCCCCCCCEECccCCCcceEEEeecCCCCcEECCC--CccccccCCCc
Confidence            4689999862          349999999999986  45899999965


No 7  
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=89.93  E-value=0.43  Score=46.67  Aligned_cols=66  Identities=24%  Similarity=0.564  Sum_probs=41.1

Q ss_pred             EEccCCCCCcc---------cccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCc-ccCCCCCccccCCCcc
Q 000547          793 VTGKACPRGLY---------GVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGG-VTECPCPYKCVSERYH  862 (1432)
Q Consensus       793 i~g~~CP~G~~---------G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG-~~~~~CpY~C~sdk~~  862 (1432)
                      .-=+.||+|++         ..-|++||.|+|.+... ....|++|..=  + ...++....- ...+.|-  |.+++|-
T Consensus        12 ~cC~~C~pG~~~~~~C~~~~~t~C~~C~~g~ys~~~~-~~~~C~~c~~C--~-~g~~~~~~ct~t~dt~C~--C~~G~y~   85 (98)
T cd00185          12 LCCDQCPPGTYLSKDCTPGSDTVCEPCPPGTYTDSWN-HLPKCLSCRTC--D-SGLVEKAPCTATRNTVCG--CKPGFYC   85 (98)
T ss_pred             CCCCCCCCCccCCCcCCCCCCCeecCCCCCCcccCCC-CCCcCCcCccC--C-CCCEEEccCCCCCCCeEe--CCCCCEe
Confidence            44578999986         25699999999988753 23578888642  2 1222222222 3344675  8888875


Q ss_pred             CC
Q 000547          863 MP  864 (1432)
Q Consensus       863 ~p  864 (1432)
                      ..
T Consensus        86 ~~   87 (98)
T cd00185          86 LT   87 (98)
T ss_pred             cC
Confidence            44


No 8  
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=85.85  E-value=0.54  Score=48.84  Aligned_cols=41  Identities=27%  Similarity=0.665  Sum_probs=30.7

Q ss_pred             CCceEEccCCCCCcc---------cccceeCCCCceecccCCCCCCcccCC
Q 000547          789 GNGTVTGKACPRGLY---------GVFCEECPVGTFKNVSGSDRALCRNCS  830 (1432)
Q Consensus       789 ~nGTi~g~~CP~G~~---------G~fC~eCP~GtYKn~~Gs~~~~C~pC~  830 (1432)
                      ..+.+-=+.||||++         ..-|.+||.|||..... ....|.+|.
T Consensus        38 ~~~~~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe~~N-~~~~C~~C~   87 (127)
T PHA02637         38 KRHNLCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTSHNN-HLPACLSCN   87 (127)
T ss_pred             cCCCeEcCCCCCCCEEeCcCCCCCCcccccCCCCCeeccCC-CCCcccccC
Confidence            445667789999986         56799999999977543 345688776


No 9  
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=82.18  E-value=3.1  Score=54.44  Aligned_cols=17  Identities=29%  Similarity=0.390  Sum_probs=8.3

Q ss_pred             EEEEEEeeEEeeEEEee
Q 000547          557 QICRAEEINIEGIIKGS  573 (1432)
Q Consensus       557 qicrVeditv~g~v~GS  573 (1432)
                      ||||-+-..|+..|.-|
T Consensus      1135 dIs~pSAa~inLmig~~ 1151 (1282)
T KOG0921|consen 1135 DISRPSAADINLMIGDS 1151 (1282)
T ss_pred             HhcccccccccceeccC
Confidence            34555555555444433


No 10 
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=77.48  E-value=1.8  Score=37.40  Aligned_cols=32  Identities=38%  Similarity=0.902  Sum_probs=24.8

Q ss_pred             ceEEccCCCCCcc----c-ccceeCCCCceecccCCC
Q 000547          791 GTVTGKACPRGLY----G-VFCEECPVGTFKNVSGSD  822 (1432)
Q Consensus       791 GTi~g~~CP~G~~----G-~fC~eCP~GtYKn~~Gs~  822 (1432)
                      +.-.=.+||+|+|    | ..|++||.|+|-...|+.
T Consensus         7 ~~~~C~~Cp~GtYq~~~g~~~C~~Cp~g~~T~~~Gs~   43 (48)
T PF07699_consen    7 GNNKCQPCPKGTYQDEEGQTSCTPCPPGSTTSSEGST   43 (48)
T ss_pred             CCCccCCCCCCccCCccCCccCccCcCCCccCCcCCc
Confidence            3344579999987    3 559999999998877763


No 11 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=67.15  E-value=6.5  Score=38.63  Aligned_cols=33  Identities=24%  Similarity=0.796  Sum_probs=24.0

Q ss_pred             cccceeCCCCceecccCC--CCCCcccCCCCCCCC
Q 000547          804 GVFCEECPVGTFKNVSGS--DRALCRNCSSNELPH  836 (1432)
Q Consensus       804 G~fC~eCP~GtYKn~~Gs--~~~~C~pC~~~~lp~  836 (1432)
                      +.-|..||.|+|-.....  ....|.|||.+.+..
T Consensus        11 ~~cC~~C~pG~~~~~~C~~~~~t~C~~C~~g~ys~   45 (98)
T cd00185          11 GLCCDQCPPGTYLSKDCTPGSDTVCEPCPPGTYTD   45 (98)
T ss_pred             CCCCCCCCCCccCCCcCCCCCCCeecCCCCCCccc
Confidence            456999999999876532  245799999865433


No 12 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=63.06  E-value=4.3  Score=40.39  Aligned_cols=24  Identities=25%  Similarity=0.493  Sum_probs=18.4

Q ss_pred             ccceeCCCCceecccCCCCCCcccCCC
Q 000547          805 VFCEECPVGTFKNVSGSDRALCRNCSS  831 (1432)
Q Consensus       805 ~fC~eCP~GtYKn~~Gs~~~~C~pC~~  831 (1432)
                      ..|.+|..|+|++..   ...|.+|+.
T Consensus         4 ~~Ct~C~~g~~~~~~---~~~C~~C~~   27 (96)
T PTZ00382          4 AVCTSCDSDKKPNKD---GSGCVLCSV   27 (96)
T ss_pred             cccCcCCCCCccCCC---CCcCCcCCC
Confidence            468999999988854   357888874


No 13 
>KOG4069 consensus Uncharacterized conserved protein [Function unknown]
Probab=60.59  E-value=19  Score=38.14  Aligned_cols=78  Identities=24%  Similarity=0.343  Sum_probs=54.6

Q ss_pred             CCCcchhHhhhcHHHHHHHHHHHHHHHhh-------hhhhhhhhheeeheecchhHHHHHHHHHHHHHHHHHHHHhh---
Q 000547          965 HSPPEQVIEIVYEDAFNRFADEINALAAY-------QWWEGSVYSILSVLAYPLAWSWLQLCRKNKLQQLREFVRSE--- 1034 (1432)
Q Consensus       965 ~spP~~i~~ivyed~fn~Fad~IN~laay-------~~We~~i~~iLsvl~YPlaw~~~q~rRrkk~~rL~efv~se--- 1034 (1432)
                      ...|..+++.|..+.|+.=+..+|++-|=       ..+|+.+   =++.+|-.|.--.. --+||+++|+||+.++   
T Consensus        45 ~~~pa~le~~i~R~vfE~Ti~rlN~~yAeAE~~~~qty~Egcl---gC~TaY~iy~ctet-hYek~L~klskfl~~qNe~  120 (154)
T KOG4069|consen   45 AEYPARLEEKIPRDVFENTIVRLNRIYAEAEAITPQTYFEGCL---GCFTAYAIYACTET-HYEKKLDKLSKFLNRQNEE  120 (154)
T ss_pred             ecCcHHHhccCcHHHHHHHHHHHHHHHHHHHhcCCcchHHHHH---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhh
Confidence            35789999999999999999999998432       2456554   35666666554333 3357899999999854   


Q ss_pred             -cCc--cccccchhh
Q 000547         1035 -YDH--SCLRSCRSR 1046 (1432)
Q Consensus      1035 -ydh--~c~rs~rsR 1046 (1432)
                       |.|  .=+|++--|
T Consensus       121 IY~~~Gl~l~dP~eR  135 (154)
T KOG4069|consen  121 IYHHVGLHLRDPMER  135 (154)
T ss_pred             hccccceeecCchhh
Confidence             777  355665555


No 14 
>KOG4289 consensus Cadherin EGF LAG seven-pass G-type receptor [Signal transduction mechanisms]
Probab=53.44  E-value=11  Score=51.56  Aligned_cols=86  Identities=23%  Similarity=0.191  Sum_probs=54.8

Q ss_pred             hhhHHHHHHHHhhhhhHHHHHHHHHHHHHhHHHHHHHHHhhcccccCCCcccccccccCCCCCC-chhhhhhhhhhhhhH
Q 000547         1274 QDLVGLLVSVLLLADFSLVLLTLLQMYSISLLNFFLVLFILPLGLLFPFPAGISALFSHGPRRS-AGLARIYALWNITSL 1352 (1432)
Q Consensus      1274 ~~~~~l~i~~~ll~d~s~~ll~llq~y~~s~~~~~~~l~~lpl~~~~p~~ag~nalfs~~~rrs-~~~~r~~~lwn~~s~ 1352 (1432)
                      ...|++-...+||+=+.+++|-.|---+.++..-|++-|-|           .+-+|-||--+. +--+=+-+|--.+++
T Consensus      2170 ~a~~gvslaal~lt~~llls~RsLksn~~~I~~~l~~Al~l-----------~~L~Fv~gi~~nq~~CtvvailLhf~~~ 2238 (2531)
T KOG4289|consen 2170 YAAVGVSLAALLLTFLLLLSLRSLKSNSHGIHFNLAAALGL-----------AQLVFVLGINQNQFYCTVVAILLHFTYL 2238 (2531)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhHHHHHHhH-----------HHHHhhhhcccCchhhHHHHHHHHHHHh
Confidence            35677777778877777777777777777766555443321           122444444443 233455667778888


Q ss_pred             HHHHHHhhhceeeccccC
Q 000547         1353 INVATAFICGYLHYRDHS 1370 (1432)
Q Consensus      1353 ~n~~~a~~~g~~~~~~~~ 1370 (1432)
                      .-.+-+|+.|+--|.|-.
T Consensus      2239 stFaWlfl~gLhlYRml~ 2256 (2531)
T KOG4289|consen 2239 STFAWLFLEGLHLYRMLT 2256 (2531)
T ss_pred             hhHHHHHHHHHHHHHHHh
Confidence            888888888887777654


No 15 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=52.62  E-value=7.5  Score=33.90  Aligned_cols=27  Identities=26%  Similarity=0.628  Sum_probs=22.0

Q ss_pred             CCceEEccCCCCCcccccceeCCCCceecc
Q 000547          789 GNGTVTGKACPRGLYGVFCEECPVGTFKNV  818 (1432)
Q Consensus       789 ~nGTi~g~~CP~G~~G~fC~eCP~GtYKn~  818 (1432)
                      .+|.-   .|++++.|..|++|+.|+|...
T Consensus        17 ~~G~C---~C~~~~~G~~C~~C~~g~~~~~   43 (50)
T cd00055          17 GTGQC---ECKPNTTGRRCDRCAPGYYGLP   43 (50)
T ss_pred             CCCEE---eCCCcCCCCCCCCCCCCCccCC
Confidence            34544   4999999999999999999653


No 16 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=52.55  E-value=7.7  Score=33.50  Aligned_cols=20  Identities=25%  Similarity=0.750  Sum_probs=18.5

Q ss_pred             CCCCCcccccceeCCCCcee
Q 000547          797 ACPRGLYGVFCEECPVGTFK  816 (1432)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYK  816 (1432)
                      .|++++.|..|++|+.|+|-
T Consensus        21 ~C~~~~~G~~C~~C~~g~~g   40 (46)
T smart00180       21 ECKPNVTGRRCDRCAPGYYG   40 (46)
T ss_pred             ECCCCCCCCCCCcCCCCcCC
Confidence            49999999999999999996


No 17 
>PF05268 GP38:  Phage tail fibre adhesin Gp38;  InterPro: IPR007932 This entry is represented by Bacteriophage T2, Gp38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains several Gp38 proteins from T-even-like phages. Gp38, together with a second phage protein, Gp57, catalyses the organisation of Gp37 but is absent from the phage particle. Gp37 is responsible for receptor recognition [].
Probab=51.86  E-value=65  Score=36.94  Aligned_cols=37  Identities=24%  Similarity=0.514  Sum_probs=21.4

Q ss_pred             cccCCCCCCCCCCCCCCCCCcEEEEEEceeEEe--ccEEE
Q 000547          210 SYGSKGGTTFKGENFGGDGGGRIRLEVVNEIEV--NGSLL  247 (1432)
Q Consensus       210 ~~GSGGG~~~~~~~~GG~GGG~I~i~a~~~l~l--~G~i~  247 (1432)
                      .+|-|| +++.....|.+||=+|+=....+|.|  +|.|-
T Consensus       114 ~yGRGG-nGs~~~~~g~~GG~~I~N~iG~rLRI~N~GaIA  152 (260)
T PF05268_consen  114 MYGRGG-NGSGSNSAGAAGGHAIQNDIGGRLRINNNGAIA  152 (260)
T ss_pred             EEecCC-CCCCCCCCccccceeeecCCcceEEEecCCEEe
Confidence            344444 33445567778888887655555554  45554


No 18 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=45.10  E-value=20  Score=37.01  Aligned_cols=20  Identities=35%  Similarity=0.884  Sum_probs=8.7

Q ss_pred             c-hhHHHHHHHHHHHHHHHHH
Q 000547          881 W-LFGLILLGLLILLALVLSV  900 (1432)
Q Consensus       881 ~-~F~lll~~llilLalvls~  900 (1432)
                      | +|+||+++++++|+++..+
T Consensus         2 W~l~~iii~~i~l~~~~~~~~   22 (130)
T PF12273_consen    2 WVLFAIIIVAILLFLFLFYCH   22 (130)
T ss_pred             eeeHHHHHHHHHHHHHHHHHH
Confidence            5 3444444444444444333


No 19 
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=44.69  E-value=81  Score=38.23  Aligned_cols=7  Identities=71%  Similarity=1.341  Sum_probs=3.4

Q ss_pred             CCCCeEE
Q 000547          289 GGGGRVS  295 (1432)
Q Consensus       289 GGGGRI~  295 (1432)
                      |||||-.
T Consensus       453 ggggrg~  459 (465)
T KOG3973|consen  453 GGGGRGG  459 (465)
T ss_pred             CCCCCcc
Confidence            3355543


No 20 
>PF14946 DUF4501:  Domain of unknown function (DUF4501)
Probab=44.43  E-value=80  Score=34.81  Aligned_cols=37  Identities=41%  Similarity=0.630  Sum_probs=23.4

Q ss_pred             HcCCcchhH-HHHHHHHHHHHHHHHHhhhhccC-CCCCC
Q 000547          876 TFGGPWLFG-LILLGLLILLALVLSVARMKYMG-GDELP  912 (1432)
Q Consensus       876 tfGGp~~F~-lll~~llilLalvls~~R~k~~~-~d~~~  912 (1432)
                      .+|||+..+ |||=.|+|-+++++|++-.-|-| +.++|
T Consensus        83 ~~g~P~vAASL~LgTffIS~~LilSvA~FFYLKrs~kLP  121 (180)
T PF14946_consen   83 HTGGPQVAASLFLGTFFISLGLILSVASFFYLKRSSKLP  121 (180)
T ss_pred             cCCChhHHHHHHHHHHHHHHHHHHHHhhheeecccccCC
Confidence            489999544 44446667788888887654444 33444


No 21 
>PF15496 DUF4646:  Domain of unknown function (DUF4646)
Probab=40.09  E-value=46  Score=34.61  Aligned_cols=70  Identities=16%  Similarity=0.244  Sum_probs=46.2

Q ss_pred             hHhhhcHHHHHHHHHHHHHHHhhhhhhhhhhheeeh--------eecchhHHHHHHHHHHHHHHHHHHHHhhcCccccc
Q 000547          971 VIEIVYEDAFNRFADEINALAAYQWWEGSVYSILSV--------LAYPLAWSWLQLCRKNKLQQLREFVRSEYDHSCLR 1041 (1432)
Q Consensus       971 i~~ivyed~fn~Fad~IN~laay~~We~~i~~iLsv--------l~YPlaw~~~q~rRrkk~~rL~efv~seydh~c~r 1041 (1432)
                      ...=|.++++.+|.++++.-++..-+|.++-...-+        ..|=.++.....+.+||-..+.++|. ..++.++|
T Consensus        43 ~~~DVs~eDW~~F~~dl~~aa~ls~~~~~~~~~~~~~~~v~~Gi~~~~v~~~~~~~~~~~k~~~v~~~i~-~WN~~FF~  120 (123)
T PF15496_consen   43 ASHDVSEEDWTRFLNDLSEAASLSPSQSIVAGVGPIVMGVGFGIPAYLVAKAIRKAMKEKKRGEVESTID-QWNEGFFR  120 (123)
T ss_pred             hhcCCCHHHHHHHHHHHHHHHhcCcccceeeeeccccccccccchhhhhhHhhhhcccccchHHHHHHHH-HHHHHhcc
Confidence            344567899999999999998887777543332222        22333444666666777778888887 56665555


No 22 
>KOG1836 consensus Extracellular matrix glycoprotein Laminin subunits alpha and gamma [Extracellular structures]
Probab=39.98  E-value=19  Score=50.87  Aligned_cols=43  Identities=42%  Similarity=0.767  Sum_probs=35.6

Q ss_pred             CceEEccCCCCCcccccceeCCCCceecccCC--CCCCcccCCCC
Q 000547          790 NGTVTGKACPRGLYGVFCEECPVGTFKNVSGS--DRALCRNCSSN  832 (1432)
Q Consensus       790 nGTi~g~~CP~G~~G~fC~eCP~GtYKn~~Gs--~~~~C~pC~~~  832 (1432)
                      .-++.-|.||+||.|..|++|.-|+|=+..+-  +...|++|+-+
T Consensus       794 ~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~  838 (1705)
T KOG1836|consen  794 ILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCN  838 (1705)
T ss_pred             ccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceec
Confidence            56677799999999999999999999886543  23579999975


No 23 
>PF07354 Sp38:  Zona-pellucida-binding protein (Sp38);  InterPro: IPR010857 This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90 kDa family of zona pellucida glycoproteins in a calcium-dependent manner []. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur [].; GO: 0007339 binding of sperm to zona pellucida, 0005576 extracellular region
Probab=39.15  E-value=18  Score=42.08  Aligned_cols=35  Identities=26%  Similarity=0.607  Sum_probs=26.9

Q ss_pred             EccCCCCCccc---------ccceeCCCCceecccCCCCCCcccCCC
Q 000547          794 TGKACPRGLYG---------VFCEECPVGTFKNVSGSDRALCRNCSS  831 (1432)
Q Consensus       794 ~g~~CP~G~~G---------~fC~eCP~GtYKn~~Gs~~~~C~pC~~  831 (1432)
                      +=..|+|||+-         -=|+-|++|||....+   ..|+.|..
T Consensus       218 ~idsC~PGfG~N~~~h~~C~~CCVvCsPgTysp~~~---~~C~~C~~  261 (271)
T PF07354_consen  218 RIDSCRPGFGKNDILHSDCPSCCVVCSPGTYSPDDD---VHCQQCNS  261 (271)
T ss_pred             EeeccCCCCCcCcccCCCCCCeeEECCCcccCCCCC---ceEEecCc
Confidence            34678888751         3599999999988754   58999985


No 24 
>PF15496 DUF4646:  Domain of unknown function (DUF4646)
Probab=38.20  E-value=36  Score=35.36  Aligned_cols=25  Identities=16%  Similarity=0.331  Sum_probs=19.9

Q ss_pred             CcCChhHHHHHHHHHhHhhhheecc
Q 000547         1116 QSVPPTVWYRLVAGVNAQLRLVHCG 1140 (1432)
Q Consensus      1116 q~~~~~~w~rfVa~lN~qlr~V~~g 1140 (1432)
                      -+|..+.|.||+.+|+...++-...
T Consensus        45 ~DVs~eDW~~F~~dl~~aa~ls~~~   69 (123)
T PF15496_consen   45 HDVSEEDWTRFLNDLSEAASLSPSQ   69 (123)
T ss_pred             cCCCHHHHHHHHHHHHHHHhcCccc
Confidence            3699999999999999985554443


No 25 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=37.34  E-value=13  Score=32.03  Aligned_cols=22  Identities=32%  Similarity=0.696  Sum_probs=18.1

Q ss_pred             CCCCCcccccceeCCCCceecc
Q 000547          797 ACPRGLYGVFCEECPVGTFKNV  818 (1432)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYKn~  818 (1432)
                      .|++++.|..|++|..|+|...
T Consensus        21 ~C~~~~~G~~C~~C~~g~~~~~   42 (49)
T PF00053_consen   21 VCKPGTTGPRCDQCKPGYFGLP   42 (49)
T ss_dssp             SBSTTEESTTS-EE-TTEECST
T ss_pred             eccccccCCcCcCCCCcccccc
Confidence            4999999999999999999774


No 26 
>KOG1836 consensus Extracellular matrix glycoprotein Laminin subunits alpha and gamma [Extracellular structures]
Probab=36.92  E-value=28  Score=49.40  Aligned_cols=36  Identities=31%  Similarity=0.629  Sum_probs=27.1

Q ss_pred             cCCCCCcccccceeCCCCceeccc-CCCCCCcccCCC
Q 000547          796 KACPRGLYGVFCEECPVGTFKNVS-GSDRALCRNCSS  831 (1432)
Q Consensus       796 ~~CP~G~~G~fC~eCP~GtYKn~~-Gs~~~~C~pC~~  831 (1432)
                      =.||+||.|-||+.|+.|+.+-.. +.....|.||+-
T Consensus       697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~c  733 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDC  733 (1705)
T ss_pred             ccCCCCcccchhhhcchhhhcccccCCCCCccccccc
Confidence            579999999999999999976643 333345667764


No 27 
>PF10256 Erf4:  Golgin subfamily A member 7/ERF4 family;  InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4. 
Probab=34.68  E-value=60  Score=32.79  Aligned_cols=56  Identities=16%  Similarity=0.246  Sum_probs=40.3

Q ss_pred             ccCcCChhHHHHHHHHHhHhhhhe-eccc------------------------cchhhhhHHHHHhhccCccccccCeEE
Q 000547         1114 MSQSVPPTVWYRLVAGVNAQLRLV-HCGH------------------------LKTTFGHLISWLDTHANPSLCQYGIRV 1168 (1432)
Q Consensus      1114 ~~q~~~~~~w~rfVa~lN~qlr~V-~~g~------------------------lr~tl~~v~~~le~h~n~~l~~~Gvrv 1168 (1432)
                      +...++++.|+++|..+|..|+.- ..-.                        -++.+..+-+||++.|++.++.+|+++
T Consensus        26 L~~~is~~ef~~iI~~IN~~l~~a~~~~~~~~~~~~~l~~lt~~l~~~~~~~~~~~~~~~le~~l~~~N~~~~~~~gi~i  105 (118)
T PF10256_consen   26 LSGYISPEEFEEIINTINQILKEAFEPISWRNIIENILGCLTLGLSSLCFKTHYKRKLRELEKYLEQLNEELFKPRGIKI  105 (118)
T ss_pred             hcCCCCHHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcEE
Confidence            355689999999999999988766 3321                        122334466788888777999999865


Q ss_pred             E
Q 000547         1169 D 1169 (1432)
Q Consensus      1169 ~ 1169 (1432)
                      -
T Consensus       106 i  106 (118)
T PF10256_consen  106 I  106 (118)
T ss_pred             E
Confidence            3


No 28 
>PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional
Probab=34.29  E-value=96  Score=33.91  Aligned_cols=61  Identities=15%  Similarity=0.245  Sum_probs=40.1

Q ss_pred             ceEEEccceeEEcCCceeeecccCcEEEEEeeeeEEeCCCceEEeceEEEEEeeEEEcCCcEEE
Q 000547           87 DIYVEGSGNLHILPGVTLCCPIKGCLLTINVTGEFLLGRNSEIVAGTVYVSALNASFSSGSVVN  150 (1432)
Q Consensus        87 Dv~i~G~g~L~I~~gv~l~c~~~G~~I~v~~sGn~~lg~ns~ivagsi~L~A~nvti~~~g~I~  150 (1432)
                      ..++++.+++++-.+|.+   ..+|.+.+...+.+.+++++.|-.........+++|.++..|.
T Consensus        35 pf~~~~~~~I~iG~~v~i---~~~~ri~~~~~~~i~IG~~v~Ig~~v~I~~~~~v~IG~~v~Ig   95 (192)
T PRK09677         35 PFYIRNDGSINFGEGFTS---GVGLRLDAFGRGKLFFGDNVQVNDYVHIACIESITIGRDTLIA   95 (192)
T ss_pred             CEEEcCCCeEEECCceEE---CCCeEEEecCCCeEEECCCCEECCCcEEccCceEEECCCCEEC
Confidence            346666677777777766   5677777766778888888777655443334556665555553


No 29 
>KOG2675 consensus Adenylate cyclase-associated protein (CAP/Srv2p) [Cytoskeleton; Signal transduction mechanisms]
Probab=33.63  E-value=25  Score=43.43  Aligned_cols=26  Identities=31%  Similarity=0.528  Sum_probs=14.2

Q ss_pred             CCCCccccccccCCCcceeeEEEEec
Q 000547          341 NLPTNTDTLLLEFPKQQLWTNVYIRD  366 (1432)
Q Consensus       341 n~~t~t~Tlll~fp~~~lw~nv~v~~  366 (1432)
                      +.++..|-=+.|||--.=|-|.++..
T Consensus       445 ~vp~~~dgDy~EfpvPEQfkt~~~~~  470 (480)
T KOG2675|consen  445 NVPTNEDGDYVEFPVPEQFKTKFNGG  470 (480)
T ss_pred             ecccCCCCCcccccChHHHhhhccCc
Confidence            34444555667776655555554443


No 30 
>KOG4611 consensus Uncharacterized conserved protein [Function unknown]
Probab=32.35  E-value=36  Score=40.98  Aligned_cols=48  Identities=42%  Similarity=0.980  Sum_probs=0.0

Q ss_pred             ceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCcc---------------------ccCCCccC--
Q 000547          807 CEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYK---------------------CVSERYHM--  863 (1432)
Q Consensus       807 C~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~---------------------C~sdk~~~--  863 (1432)
                      |+|||.||...   .+.--|..|-+.                ...||-+                     |-+++|++  
T Consensus        52 ceecpegtlss---pdqtgclncnng----------------tchcpsqstlifrdasgnlltndafcgncasgfyrndn  112 (747)
T KOG4611|consen   52 CEECPEGTLSS---PDQTGCLNCNNG----------------TCHCPSQSTLIFRDASGNLLTNDAFCGNCASGFYRNDN  112 (747)
T ss_pred             cccCCCcccCC---CccCCceecCCC----------------ccCCCCcceEEEEcCCCCeeccccccccccccceECCC


Q ss_pred             ---CcccchHHHH
Q 000547          864 ---PHCYTTLEEL  873 (1432)
Q Consensus       864 ---p~C~T~leel  873 (1432)
                         ..|-|...|.
T Consensus       113 gyctkcetscsem  125 (747)
T KOG4611|consen  113 GYCTKCETSCSEM  125 (747)
T ss_pred             cccccccccHhhh


No 31 
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=30.47  E-value=38  Score=29.11  Aligned_cols=22  Identities=41%  Similarity=0.995  Sum_probs=13.5

Q ss_pred             cCCCCCc--ccccce-eCCCCceec
Q 000547          796 KACPRGL--YGVFCE-ECPVGTFKN  817 (1432)
Q Consensus       796 ~~CP~G~--~G~fC~-eCP~GtYKn  817 (1432)
                      ..|+.|+  .+..|+ +||.++|.+
T Consensus        18 ~~C~~~~~~~~~~Cv~~C~~~~~~~   42 (49)
T cd00064          18 TSCRHGFYLDGGTCVSECPEGTYAD   42 (49)
T ss_pred             ccCcCccCCCCCcccccCCCCceec
Confidence            4566666  345566 666666665


No 32 
>KOG4258 consensus Insulin/growth factor receptor (contains protein kinase domain) [Signal transduction mechanisms]
Probab=29.07  E-value=35  Score=45.10  Aligned_cols=46  Identities=26%  Similarity=0.608  Sum_probs=30.7

Q ss_pred             ccce-eCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccCCcccchHHHH
Q 000547          805 VFCE-ECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHMPHCYTTLEEL  873 (1432)
Q Consensus       805 ~fC~-eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C~T~leel  873 (1432)
                      ..|. +||.|+-.|.   +..+|++|..                   .||-.|.++ .+.-+=.+..|+|
T Consensus       304 ~~Cv~~CPsGy~~N~---~~~~C~~C~g-------------------~C~~vC~~~-~~tIds~~~Aq~L  350 (1025)
T KOG4258|consen  304 GQCVAKCPSGYKRNS---SSSECVKCEG-------------------PCPKVCEPG-TKTIDSVADAQNL  350 (1025)
T ss_pred             ccchhhCCCcceecC---cccceeccCC-------------------CCcceeccC-ceEecchhhhHHh
Confidence            3566 8999986664   3589999962                   799999863 3222334555654


No 33 
>KOG4180 consensus Predicted kinase [General function prediction only]
Probab=28.99  E-value=99  Score=37.43  Aligned_cols=28  Identities=32%  Similarity=0.547  Sum_probs=19.6

Q ss_pred             CCCCCcccccCCcEEEEEecccccccCcc
Q 000547         1075 RADLPPRLNQRLPMSLCFGGDGSYMSPFS 1103 (1432)
Q Consensus      1075 r~dl~~~i~q~~Pm~Ilf~GdGSym~Pf~ 1103 (1432)
                      |.+|... ..--.|+|-.||||+|+--=+
T Consensus        96 R~~lsq~-i~waD~VisvGGDGTfL~Aas  123 (395)
T KOG4180|consen   96 RNDLSQP-IRWADMVISVGGDGTFLLAAS  123 (395)
T ss_pred             hhhccCc-CchhhEEEEecCccceeehhh
Confidence            3444333 344579999999999997766


No 34 
>PF02166 Androgen_recep:  Androgen receptor;  InterPro: IPR001103 Steroid or nuclear hormone receptors (NRs) constitute an important super-family of transcription regulators that are involved in diverse physiological functions, including control of embryonic development, cell differentiation and homeostasis. Members include the steroid hormone receptors and receptors for thyroid hormone, retinoids and 1,25-dihydroxy-vitamin D3. The proteins function as dimeric molecules in the nucleus to regulate the transcription of target genes in a ligand-responsive manner [, ].   NRs are extremely important in medical research, a large number of them being implicated in diseases such as cancer, diabetes and hormone resistance syndromes. Many do not yet have a defined ligand and are accordingly termed "orphan" receptors. More than 300 NRs have been described to date and a new system has recently been introduced in an attempt to rationalise the increasingly complex set of names used to describe superfamily members. The androgen receptor (AR) consists of 3 functional and structural domains: an N-terminal (modulatory) domain; a DNA binding domain (IPR001628 from INTERPRO) that mediates specific binding to target DNA sequences (ligand-responsive elements); and a hormone binding domain. The N-terminal domain (NTD) is unique to the androgen receptors and spans approximately the first 530 residues; the highly-conserved DNA-binding domain is smaller (around 65 residues) and occupies the central portion of the protein; and the hormone ligand binding domain (LBD) lies at the receptor C terminus. In the absence of ligand, steroid hormone receptors are thought to be weakly associated with nuclear components; hormone binding greatly increases receptor affinity.  The LBDs of steroid hormone receptors fold into 12 helices that form a ligand-binding pocket. When an agonist is bound, helix 12 folds over the pocket to enclose the ligand []. When an antagonist is unbound, helix 12 is positioned away from the pocket in a way that interferes with the binding of coactivators to a groove in the hormone-binding domain formed after ligand binding. In AR, ligand binding that induces folding of helix 12 to overlie the pocket discloses a groove that binds a region of the NTD. Coactivator molecules can also bind to this groove, but the predominant site for coactivator binding to AR is in the NTD. AR ligand resides in a pocket and primarily contacts helices 4, 5, and 10. The DNA-binding region includes eight cysteine residues that form two coordination complexes, each composed of four cysteines and a Zn2+ ion. These two zinc fingers form the structure that binds to the major groove of DNA. The second zinc finger stabilises the binding complex by hydrophobic interactions with the first finger and contributes to specificity of receptor DNA binding. It is also necessary for receptor dimerisation that occurs during DNA binding Defects in the androgen receptor cause testicular feminisation syndrome, androgen insensibility syndrome (AIS) [, ]. AIS may be complete (CAIS), where external genitalia are phenotypically female; partial (PAIS), where genitalia are substantively ambiguous; or mild (MAIS), where external genitalia are normal male, or nearly so. Defects in the receptor also cause X-linked spinal and bulbar muscular atrophy (also known as Kennedy's disease).; GO: 0003677 DNA binding, 0004882 androgen receptor activity, 0005496 steroid binding, 0006355 regulation of transcription, DNA-dependent, 0030521 androgen receptor signaling pathway, 0005634 nucleus; PDB: 1XOW_B 2Q7K_B 2Q7I_B.
Probab=27.47  E-value=20  Score=42.80  Aligned_cols=26  Identities=50%  Similarity=0.989  Sum_probs=0.0

Q ss_pred             cccccCcceeeeCCCCCcccCCCCCCCCCCCCCCCCC
Q 000547           21 SLDQYNFPVIGFGADSLFHGDYTPPSPPPPIAPPHPP   57 (1432)
Q Consensus        21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~   57 (1432)
                      +-|-||||.-           ..-|+|||||+|||+.
T Consensus       335 sRDyYnF~la-----------Lag~~~p~~~~hph~R  360 (423)
T PF02166_consen  335 SRDYYNFPLA-----------LAGPPPPPPPPHPHAR  360 (423)
T ss_dssp             -------------------------------------
T ss_pred             cccccccccc-----------cccccccccccccccc
Confidence            4567787752           2335556666777655


No 35 
>PF05281 Secretogranin_V:  Neuroendocrine protein 7B2 precursor (Secretogranin V);  InterPro: IPR007945 Mature peptide hormones and neuropeptides are typically synthesised from much larger precursors and require several post-translational processing steps--including proteolytic cleavage--for the formation of the bioactive species. The subtilisin-related proteolytic enzymes that accomplish neuroendocrine-specific cleavages are known as prohormone convertases 1 and 2 (PC1 and PC2), which belong to MEROPS peptidase family S8B. The cell biology of these proteases within the regulated secretory pathway of neuroendocrine cells is complex, and they are themselves initially synthesised as inactive precursor molecules. ProPC1 propeptide cleavage occurs rapidly in the endoplasmic reticulum, yet its major site of action on prohormones takes place later in the secretory pathway. PC1 undergoes an interesting carboxyl terminal processing event whose function appears to be to activate the enzyme. ProPC2, on the other hand, exhibits comparatively long initial folding times and exits the endoplasmic reticulum without propeptide cleavage, in association with the neuroendocrine-specific protein 7B2. Once the proPC2/7B2 complex arrives at the trans-Golgi network, 7B2 is internally cleaved into two domains, the 21kDa fragment and a carboxy-terminal 31 residue peptide. PC2 propeptide removal occurs in the maturing secretory granule, most likely through autocatalysis, and 7B2 association does not appear to be directly required for this cleavage event. However, if proPC2 has not encountered 7B2 intracellularly, it cannot generate a catalytically active mature species. The molecular mechanism behind the intriguing intracellular association of 7B2 and proPC2 is still unknown, but may involve conformational rearrangement or stabilisation of a proPC2 conformer mediated by a 36-residue internal segment of 21kDa 7B2. This family represents, 7B2 (secretogranin V), which is the molecular escort protein for PC2. 7B2 is a bifunctional protein with an N-terminal activation domain and a C-terminal inhibitory domain (MEROPS inhibitor family I21, clan I-) separated by a furin cleavage site []. Although 7B2 represents a potent inhibitor of PC2, there is an absolute requirement of 7B2 for the activation of PC2, which is synthesised as a zymogen. Both the full length, 27 kDa, and the C-terminal peptide (CT domain) derived from intramolecular cleavage of 7B2 are potent inhibitors of PC2. Studies have shown that the active peptide in the CT domain to be LLRVHK, active in the nanomolar range not only against PC2 but also PC1 [, ]. Knock-out studies have shown that the PC2 nulls are not phenotypically equivalent to the 7B2 nulls, which suggests that 7B2 may have other activities in addition to being the activator of PC2 [].  7B2 exhibits both structural and functional homology to proSAAS (IPR010832 from INTERPRO), which is the PC1 binding protein. The CT domain of proSAAS contains the same inhibitor hexapeptide as 7B2, consequently both 7B2 and proSAAS are two members of a homologous family of prohormone convertase inhibitor proteins. ; GO: 0007218 neuropeptide signaling pathway, 0030141 stored secretory granule
Probab=26.55  E-value=96  Score=35.29  Aligned_cols=91  Identities=25%  Similarity=0.454  Sum_probs=53.4

Q ss_pred             CCcccccceeceEEEcCCCCCCCCCcccCCceEEccCCCCCc--ccccce---eCCCCceecccCCCCCCcccCCCCCCC
Q 000547          761 GDEYLPLASVNGSIDARGGLGRGQGLAGGNGTVTGKACPRGL--YGVFCE---ECPVGTFKNVSGSDRALCRNCSSNELP  835 (1432)
Q Consensus       761 Gd~y~pia~~~G~I~~~GG~~~~~g~~G~nGTi~g~~CP~G~--~G~fC~---eCP~GtYKn~~Gs~~~~C~pC~~~~lp  835 (1432)
                      -+||.+-.+.=|.-.++||++.+....+..|.|-...=-+--  -=-+|.   +||+|+- ...     .|.    .+++
T Consensus        49 d~E~l~hssLwG~q~isGGagEG~q~L~~~~~i~n~~~~ktd~~lPaYCnPPNPCPvGyt-~~d-----gCl----e~fE  118 (208)
T PF05281_consen   49 DQEYLQHSSLWGHQYISGGAGEGPQHLGPEGNIPNIKIVKTDNNLPAYCNPPNPCPVGYT-AED-----GCL----ENFE  118 (208)
T ss_pred             ccccccccccCCceeeeCCCCccchhcCccCCcCcccccccccCCCCCCCCCCCCCCCcc-ccc-----Ccc----cCCc
Confidence            357776667788888999999776666666665332110000  015687   8999985 222     254    3466


Q ss_pred             CCcceecccCcccCCCCCccccCCCccCCccc
Q 000547          836 HRALYIPIRGGVTECPCPYKCVSERYHMPHCY  867 (1432)
Q Consensus       836 ~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C~  867 (1432)
                      .-|.|--.-  .....|.  |  |-.||-.|-
T Consensus       119 nTAeFSRey--Qa~QdC~--C--D~EHMf~C~  144 (208)
T PF05281_consen  119 NTAEFSREY--QAAQDCM--C--DTEHMFDCP  144 (208)
T ss_pred             cHHHhhHHH--HhcCccC--C--CcccccCCc
Confidence            666554322  2233443  4  678998887


No 36 
>KOG4260 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.28  E-value=73  Score=37.59  Aligned_cols=50  Identities=36%  Similarity=0.869  Sum_probs=0.0

Q ss_pred             CCCCcccccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccCCcc
Q 000547          798 CPRGLYGVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHMPHC  866 (1432)
Q Consensus       798 CP~G~~G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C  866 (1432)
                      ||+|.||.-|.+||-|+=+.-.|.  ..|+   .+         ..|+|...-.|.     --|.-|+|
T Consensus       132 Cp~gtyGpdCl~Cpggser~C~Gn--G~C~---Gd---------GsR~GsGkCkC~-----~GY~Gp~C  181 (350)
T KOG4260|consen  132 CPDGTYGPDCLQCPGGSERPCFGN--GSCH---GD---------GSREGSGKCKCE-----TGYTGPLC  181 (350)
T ss_pred             cCCCCcCCccccCCCCCcCCcCCC--Cccc---CC---------CCCCCCCccccc-----CCCCCccc


No 37 
>COG4907 Predicted membrane protein [Function unknown]
Probab=24.67  E-value=50  Score=41.15  Aligned_cols=27  Identities=44%  Similarity=0.779  Sum_probs=0.0

Q ss_pred             CCCCCCCCcccCCccEEEEEeeeccCeEEEEecCCCCCCCCCCe
Q 000547          250 GGDVGVKGGGGSGGSIYVKAHRMTGNGKISASGGNGFAGGGGGR  293 (1432)
Q Consensus       250 Gg~g~~~gGGGSGGSI~L~a~~l~g~G~isA~GG~GgGGGGGGR  293 (1432)
                      +..-++.+|||-|++                 ||+-||||||+|
T Consensus       569 ~S~~~~~~GGG~G~~-----------------gGg~GGGGGGar  595 (595)
T COG4907         569 SSRRSSSSGGGGGFS-----------------GGGSGGGGGGAR  595 (595)
T ss_pred             ccccCCCCCCCCCcC-----------------CCCCCCCCCCCC


No 38 
>cd04299 GT1_Glycogen_Phosphorylase_like This family is most closely related to the oligosaccharide phosphorylase domain family and other unidentified sequences. Oligosaccharide phosphorylase catalyzes the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The members of this family are found in bacteria and Archaea.
Probab=23.53  E-value=51  Score=43.67  Aligned_cols=77  Identities=25%  Similarity=0.312  Sum_probs=44.4

Q ss_pred             HHHHHHHH-HHHHHHHhhcCccccccchhhhhhcceEEeecCC-ceEEEEEEecCCCCcCCCCCccccc---------CC
Q 000547         1018 LCRKNKLQ-QLREFVRSEYDHSCLRSCRSRALYEGLKVAATAD-LMLAYIDFFLGGDEKRADLPPRLNQ---------RL 1086 (1432)
Q Consensus      1018 ~rRrkk~~-rL~efv~seydh~c~rs~rsRaly~~lKvs~S~D-ysLaYiD~f~~~d~kr~dl~~~i~q---------~~ 1086 (1432)
                      |..|++.| +|+++|+......+.|.-.+.+.-.....-..|| ++++++==|-  .|||.+|-.+...         .-
T Consensus       435 w~~K~~~K~~L~~~v~~~~~~~~~~~g~~~~~~~~~~~~ldpd~ltigfarRfa--~YKR~~Lil~dl~rl~~il~~~~~  512 (778)
T cd04299         435 WEVRQQLRRRLIEFVRRRLRRQWLRRGASAEEIGEADDVLDPNVLTIGFARRFA--TYKRATLLLRDPERLKRLLNDPER  512 (778)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhhcCCchhhhhhcCCccCCCccEEeeeecch--hhhhHHHHHHHHHHHHHHhhCCCC
Confidence            44444444 5888888655544444222221111222234566 6677766664  6899998443221         25


Q ss_pred             cEEEEEeccc
Q 000547         1087 PMSLCFGGDG 1096 (1432)
Q Consensus      1087 Pm~Ilf~GdG 1096 (1432)
                      |+-+||+|.+
T Consensus       513 pvQ~IfaGKA  522 (778)
T cd04299         513 PVQFIFAGKA  522 (778)
T ss_pred             CeEEEEEEec
Confidence            9999999998


No 39 
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures]
Probab=23.41  E-value=79  Score=43.22  Aligned_cols=65  Identities=25%  Similarity=0.628  Sum_probs=41.5

Q ss_pred             CCCCCcccccceeCCCCceecccCCCCCCcccCCCCCCCCCcceeccc-------CcccCCCCCccccCCCccCCcc
Q 000547          797 ACPRGLYGVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIR-------GGVTECPCPYKCVSERYHMPHC  866 (1432)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~-------gG~~~~~CpY~C~sdk~~~p~C  866 (1432)
                      .|-|+--|-.|.+|-+|||-  +|  .+=|++|.-+..-+-.+|-...       .|..-..|. +|.+++..-|-|
T Consensus       785 qCkPnVVGR~CdqCApGtyG--FG--PsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCn-qCqpG~WgFPeC  856 (1758)
T KOG0994|consen  785 QCKPNVVGRRCDQCAPGTYG--FG--PSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCN-QCQPGYWGFPEC  856 (1758)
T ss_pred             cccCccccccccccCCcccC--cC--CccCccccccccccccccccccccceeeccccchhhcc-ccCCCccCCCcC
Confidence            67777889999999999993  33  4779999876665555554422       234444554 555554444433


No 40 
>PF15195 TMEM210:  TMEM210 family
Probab=23.13  E-value=51  Score=33.49  Aligned_cols=10  Identities=50%  Similarity=1.321  Sum_probs=4.2

Q ss_pred             CCCCCCCCCC
Q 000547           41 DYTPPSPPPP   50 (1432)
Q Consensus        41 ~~~~~~~~p~   50 (1432)
                      +-+|++||||
T Consensus       101 ~~~p~~pppp  110 (116)
T PF15195_consen  101 EASPEEPPPP  110 (116)
T ss_pred             CCCCCCCCcC
Confidence            3444444333


No 41 
>PF04519 Bactofilin:  Polymer-forming cytoskeletal;  InterPro: IPR007607 This family contains several uncharacterised hypothetical proteins.
Probab=22.97  E-value=3.1e+02  Score=26.87  Aligned_cols=57  Identities=18%  Similarity=0.424  Sum_probs=0.0

Q ss_pred             ccCeEEEeeceeeccCceEEEccceeEEcCCceeeecccCcEEEEEeeeeEEeCCCceEEeceEEEEEeeEEEcCCcEEE
Q 000547           71 LETVCLLNSSLTFENDDIYVEGSGNLHILPGVTLCCPIKGCLLTINVTGEFLLGRNSEIVAGTVYVSALNASFSSGSVVN  150 (1432)
Q Consensus        71 ~~t~C~l~~s~~~~~qDv~i~G~g~L~I~~gv~l~c~~~G~~I~v~~sGn~~lg~ns~ivagsi~L~A~nvti~~~g~I~  150 (1432)
                      +...|.+.-++.+.. ++.|.|.              +.|   .+...+.+.|..++.+.+.   +.|.+++|  .|.++
T Consensus         5 I~~~~~i~G~i~~~~-~v~i~G~--------------v~G---~i~~~g~v~i~~~~~v~G~---i~~~~~~i--~G~v~   61 (101)
T PF04519_consen    5 IGKGTKIEGDISSDG-DVRIDGR--------------VEG---NIKAEGKVKIGGNGEVKGD---IKADDVII--SGSVD   61 (101)
T ss_pred             ECCCCEEEEEEEECc-EEEEEEE--------------EEE---EEEEceEEEEcCCCEEEEE---EEEeEEEE--cCEEe


No 42 
>PF00020 TNFR_c6:  TNFR/NGFR cysteine-rich region;  InterPro: IPR001368 A number of proteins, some of which are known to be receptors for growth factors have been found to contain a cysteine-rich domain at the N-terminal region that can be subdivided into four (or in some cases, three) repeats containing six conserved cysteines all of which are involved in intrachain disulphide bonds [].  CD27 (also called S152 or T14) mediates a co-stimulatory signal for T and B cell activation and is involved in murine T cell development. Tyrosine-phosphorylation of ZAP-70 following CD27 ligation of T cells has been reported [], but not confirmed independently. CD30 was originally identified as Ki-1, an antigen expressed on Reed-Sternberg cells in Hodgkin's lymphomas and other non-Hodgkin's lymphomas, particularly diffuse large-cell lymphoma and immunoblastic lymphoma. CD30 has pleiotropic effects on CD30-positive lymphoma cell lines ranging from cell proliferation to cell death. It is thought to be involved in negative selection of T-cells in the thymus and is involved in TCR-mediated cell death. CD30 is a member of the TNFR family of molecules, activate NFkB through interaction with TRAF2 and TRAF5. CD40 (Bp50) plays a central role in the regulation of cell-mediated immunity as well as antibody mediated immunity. It is central to T cell dependent (TD)-responses and may influence survival of B cell lymphomas. CD95 (also called APO-1, fas antigen, Fas tumour necrosis factor receptor superfamily, member 6, TNFRSF6 or apoptosis antigen 1, APT1) is expressed, typically at high levels, on activated T and B cells. It is involved in the mediation of apoptosis-inducing signals.  Other proteins known to belong to this family [, , , ] are, tumour Necrosis Factor type I and type II receptors (TNFR), Rabbit fibroma virus soluble TNF receptor (protein T2), lymphotoxin alpha/beta receptor, low-affinity nerve growth factor receptor (LA-NGFR) (p75), T-cell antigen OX40, Wsl-1, a receptor (for a yet undefined ligand) that mediates apoptosis and Vaccinia virus protein A53 (SalF19R). CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0005488 binding; PDB: 3TJE_F 3QD6_S 3ALQ_U 3IJ2_Y 3BUK_D 1SG1_X 1NCF_B 1EXT_A 1FT4_B 1TNR_R ....
Probab=22.44  E-value=39  Score=28.04  Aligned_cols=19  Identities=37%  Similarity=0.947  Sum_probs=13.9

Q ss_pred             CCCCceecccCCCCCCcccCC
Q 000547          810 CPVGTFKNVSGSDRALCRNCS  830 (1432)
Q Consensus       810 CP~GtYKn~~Gs~~~~C~pC~  830 (1432)
                      ||.|+|.+..+.  ..|.||.
T Consensus         1 C~~g~y~~~~~~--~~C~~C~   19 (39)
T PF00020_consen    1 CPPGTYSDSENH--PQCLPCS   19 (39)
T ss_dssp             ECTTEEEESSCS--SSEEEEE
T ss_pred             CccCcccCCCCC--CcCCccC
Confidence            899999997543  6776554


No 43 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=21.80  E-value=28  Score=23.61  Aligned_cols=11  Identities=45%  Similarity=1.066  Sum_probs=8.3

Q ss_pred             CCCCCcccccc
Q 000547          797 ACPRGLYGVFC  807 (1432)
Q Consensus       797 ~CP~G~~G~fC  807 (1432)
                      .||+||.|..|
T Consensus         3 ~C~~G~~G~~C   13 (13)
T PF12661_consen    3 QCPPGWTGPNC   13 (13)
T ss_dssp             EE-TTEETTTT
T ss_pred             cCcCCCcCCCC
Confidence            48999998877


No 44 
>PF05268 GP38:  Phage tail fibre adhesin Gp38;  InterPro: IPR007932 This entry is represented by Bacteriophage T2, Gp38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains several Gp38 proteins from T-even-like phages. Gp38, together with a second phage protein, Gp57, catalyses the organisation of Gp37 but is absent from the phage particle. Gp37 is responsible for receptor recognition [].
Probab=21.58  E-value=1.6e+02  Score=33.94  Aligned_cols=21  Identities=29%  Similarity=0.703  Sum_probs=11.0

Q ss_pred             EEEecCCCC-CCCCcccCCccE
Q 000547          245 SLLADGGDV-GVKGGGGSGGSI  265 (1432)
Q Consensus       245 ~i~AnGg~g-~~~gGGGSGGSI  265 (1432)
                      .+.-.|+++ ....-|..||..
T Consensus       113 t~yGRGGnGs~~~~~g~~GG~~  134 (260)
T PF05268_consen  113 TMYGRGGNGSGSNSAGAAGGHA  134 (260)
T ss_pred             EEEecCCCCCCCCCCcccccee
Confidence            344446665 344556666663


No 45 
>PRK06958 single-stranded DNA-binding protein; Provisional
Probab=21.56  E-value=5.1e+02  Score=29.08  Aligned_cols=35  Identities=14%  Similarity=0.013  Sum_probs=19.9

Q ss_pred             ccCcEEEEEeeeeEEeCCCc---eEEeceEEEEEeeEEEc
Q 000547          108 IKGCLLTINVTGEFLLGRNS---EIVAGTVYVSALNASFS  144 (1432)
Q Consensus       108 ~~G~~I~v~~sGn~~lg~ns---~ivagsi~L~A~nvti~  144 (1432)
                      .+|+.|.|  .|.+....-.   -...-.+.|.|.+|.+.
T Consensus        71 kKGs~V~V--eGrL~~~~yeDkdG~kr~~~eVvA~~V~fL  108 (182)
T PRK06958         71 KKGSSVYI--EGRIRTRKWQGQDGQDRYSTEIVADQMQML  108 (182)
T ss_pred             CCCCEEEE--EEEEEeCceECCCCcEEEEEEEEEeEEEEC
Confidence            55655443  4666665322   12334677788888774


Done!