Query         000543
Match_columns 1433
No_of_seqs    200 out of 215
Neff          3.8 
Searched_HMMs 46136
Date          Mon Apr  1 18:43:00 2013
Command       hhsearch -i /work/01045/syshi/lefta3m/000543.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/leftcdd/000543hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0196 Tyrosine kinase, EPH (  96.6  0.0019   4E-08   81.2   4.4   61  796-869   261-331 (996)
  2 PF07699 GCC2_GCC3:  GCC2 and G  96.3  0.0021 4.5E-08   55.2   2.0   26  805-832     9-34  (48)
  3 PF10256 Erf4:  Golgin subfamil  92.0    0.22 4.7E-06   49.9   4.9   67  966-1032   21-91  (118)
  4 PHA02637 TNF-alpha-receptor-li  91.7     0.2 4.4E-06   51.9   4.2   57  797-858    31-92  (127)
  5 PF07562 NCD3G:  Nine Cysteines  91.0   0.065 1.4E-06   47.6  -0.1   36  795-832     6-51  (54)
  6 KOG0921 Dosage compensation co  90.8    0.42 9.1E-06   61.8   6.5    7  287-293  1272-1278(1282)
  7 cd00185 TNFR Tumor necrosis fa  90.1    0.41 8.9E-06   46.8   4.6   66  793-864    12-87  (98)
  8 PHA02637 TNF-alpha-receptor-li  85.9    0.54 1.2E-05   48.9   2.6   41  789-830    38-87  (127)
  9 KOG0921 Dosage compensation co  82.9     2.8   6E-05   54.9   7.3   15  557-571  1135-1149(1282)
 10 PF07699 GCC2_GCC3:  GCC2 and G  77.0     1.9   4E-05   37.3   2.3   32  791-822     7-43  (48)
 11 cd00185 TNFR Tumor necrosis fa  66.4     6.8 0.00015   38.5   3.8   33  804-836    11-45  (98)
 12 PTZ00382 Variant-specific surf  62.4     4.5 9.7E-05   40.3   1.7   24  805-831     4-27  (96)
 13 KOG4069 Uncharacterized conser  60.0      18 0.00039   38.3   5.5   78  965-1046   45-135 (154)
 14 KOG4289 Cadherin EGF LAG seven  53.4      11 0.00024   51.6   3.4   86 1275-1371 2170-2256(2531)
 15 smart00180 EGF_Lam Laminin-typ  52.6     7.7 0.00017   33.5   1.3   20  797-816    21-40  (46)
 16 cd00055 EGF_Lam Laminin-type e  52.3     7.6 0.00016   33.9   1.3   27  789-818    17-43  (50)
 17 PF05268 GP38:  Phage tail fibr  50.8      69  0.0015   36.7   8.5   37  210-247   114-152 (260)
 18 KOG3973 Uncharacterized conser  46.9      72  0.0016   38.6   8.3    7  289-295   453-459 (465)
 19 PF12273 RCR:  Chitin synthesis  45.5      19 0.00042   37.1   3.2   20  881-900     2-22  (130)
 20 PF14946 DUF4501:  Domain of un  44.2      80  0.0017   34.8   7.5   37  876-912    83-121 (180)
 21 PF10256 Erf4:  Golgin subfamil  42.4      36 0.00078   34.4   4.5   56 1114-1169   26-106 (118)
 22 KOG1836 Extracellular matrix g  39.9      19 0.00041   50.9   2.7   43  790-832   794-838 (1705)
 23 PF15496 DUF4646:  Domain of un  39.6      36 0.00078   35.3   4.1   23 1117-1139   46-68  (123)
 24 PF07354 Sp38:  Zona-pellucida-  39.3      18 0.00039   42.1   2.0   36  793-831   217-261 (271)
 25 PF15496 DUF4646:  Domain of un  38.2      51  0.0011   34.3   4.9   70  971-1041   43-120 (123)
 26 PF00053 Laminin_EGF:  Laminin   37.6      13 0.00028   32.1   0.4   22  797-818    21-42  (49)
 27 KOG1836 Extracellular matrix g  36.5      28 0.00061   49.3   3.5   64  796-863   697-768 (1705)
 28 KOG2675 Adenylate cyclase-asso  36.3      21 0.00045   44.0   2.0   26  341-366   445-470 (480)
 29 PRK09677 putative lipopolysacc  34.6      95  0.0021   34.0   6.5   61   87-150    35-95  (192)
 30 KOG4611 Uncharacterized conser  31.7      37 0.00081   40.8   3.0   48  807-873    52-125 (747)
 31 cd00064 FU Furin-like repeats.  30.0      39 0.00085   29.0   2.1   22  796-817    18-42  (49)
 32 KOG4258 Insulin/growth factor   28.3      37 0.00081   44.9   2.4   46  805-873   304-350 (1025)
 33 PF02166 Androgen_recep:  Andro  27.5      20 0.00044   42.8   0.0   26   21-57    335-360 (423)
 34 KOG4180 Predicted kinase [Gene  25.7 1.2E+02  0.0027   36.7   5.7   28 1075-1103   96-123 (395)
 35 KOG4260 Uncharacterized conser  25.4      73  0.0016   37.6   3.8   50  798-866   132-181 (350)
 36 PF15195 TMEM210:  TMEM210 fami  25.3      43 0.00094   33.9   1.8   11   41-51    101-111 (116)
 37 COG4907 Predicted membrane pro  24.5      51  0.0011   41.1   2.5    7  287-293   589-595 (595)
 38 KOG0994 Extracellular matrix g  23.4      80  0.0017   43.2   4.0   65  797-866   785-856 (1758)
 39 cd04299 GT1_Glycogen_Phosphory  22.8      54  0.0012   43.5   2.4   77 1018-1096  435-522 (778)
 40 PF04519 Bactofilin:  Polymer-f  22.7 3.1E+02  0.0068   26.8   7.1   57   71-150     5-61  (101)
 41 PF00020 TNFR_c6:  TNFR/NGFR cy  22.1      40 0.00087   28.0   0.8   19  810-830     1-19  (39)
 42 PF12661 hEGF:  Human growth fa  22.0      27 0.00059   23.6  -0.2   11  797-807     3-13  (13)
 43 PF05281 Secretogranin_V:  Neur  21.3 1.4E+02   0.003   34.1   4.8   91  761-867    49-144 (208)
 44 PF05268 GP38:  Phage tail fibr  21.0 1.7E+02  0.0037   33.8   5.4   21  245-265   113-134 (260)
 45 COG4907 Predicted membrane pro  20.9      64  0.0014   40.3   2.4   10  221-230   584-593 (595)
 46 PRK06958 single-stranded DNA-b  20.8 5.5E+02   0.012   28.9   9.2   35  108-144    71-108 (182)

No 1  
>KOG0196 consensus Tyrosine kinase, EPH (ephrin) receptor family [Signal transduction mechanisms]
Probab=96.57  E-value=0.0019  Score=81.24  Aligned_cols=61  Identities=30%  Similarity=0.688  Sum_probs=45.0

Q ss_pred             cCCCCCcc----cccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccC------Cc
Q 000543          796 KACPRGLY----GVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHM------PH  865 (1433)
Q Consensus       796 ~~CP~G~~----G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~------p~  865 (1433)
                      =-|.+||.    |.-|+.||.||||...|  ..+|.+||.++..+         ....+.|.  |..++|+.      --
T Consensus       261 C~C~aGye~~~~~~~C~aCp~G~yK~~~~--~~~C~~CP~~S~s~---------~ega~~C~--C~~gyyRA~~Dp~~mp  327 (996)
T KOG0196|consen  261 CVCKAGYEEAENGKACQACPPGTYKASQG--DSLCLPCPPNSHSS---------SEGATSCT--CENGYYRADSDPPSMP  327 (996)
T ss_pred             eeecCCCCcccCCCcceeCCCCcccCCCC--CCCCCCCCCCCCCC---------CCCCCccc--ccCCcccCCCCCCCCC
Confidence            47999984    67799999999999865  48999999865332         12345675  88888873      24


Q ss_pred             ccch
Q 000543          866 CYTT  869 (1433)
Q Consensus       866 C~T~  869 (1433)
                      |.+|
T Consensus       328 CT~P  331 (996)
T KOG0196|consen  328 CTRP  331 (996)
T ss_pred             CCCC
Confidence            7665


No 2  
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=96.32  E-value=0.0021  Score=55.21  Aligned_cols=26  Identities=31%  Similarity=0.887  Sum_probs=22.9

Q ss_pred             ccceeCCCCceecccCCCCCCcccCCCC
Q 000543          805 VFCEECPVGTFKNVSGSDRALCRNCSSN  832 (1433)
Q Consensus       805 ~fC~eCP~GtYKn~~Gs~~~~C~pC~~~  832 (1433)
                      .-|++||.||||+..|.  ..|.+||.+
T Consensus         9 ~~C~~Cp~GtYq~~~g~--~~C~~Cp~g   34 (48)
T PF07699_consen    9 NKCQPCPKGTYQDEEGQ--TSCTPCPPG   34 (48)
T ss_pred             CccCCCCCCccCCccCC--ccCccCcCC
Confidence            46999999999999886  589999975


No 3  
>PF10256 Erf4:  Golgin subfamily A member 7/ERF4 family;  InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4. 
Probab=92.05  E-value=0.22  Score=49.94  Aligned_cols=67  Identities=19%  Similarity=0.347  Sum_probs=44.8

Q ss_pred             CCcchhhhhhcHHHHHHHHHHHHHHHhhh--h--hhhhhhheeeeeecchhHHHHHHHHHHHHHHHHHHHH
Q 000543          966 SPPEQVIEIVYEDAFNRFADEINALAAYQ--W--WEGSVYSILSVLAYPLAWSWLQLCRKNKLQQLREFVR 1032 (1433)
Q Consensus       966 spP~~i~~ivyed~fn~Fad~IN~laay~--~--We~~i~~iLsvl~YPlaw~~~q~rRrkk~~rL~efv~ 1032 (1433)
                      .-|.++...+.+++|++++++||+..+=.  .  |.-++-.+|.++..=+..++.....||++++|.+|+.
T Consensus        21 ~~P~~L~~~is~~ef~~iI~~IN~~l~~a~~~~~~~~~~~~~l~~lt~~l~~~~~~~~~~~~~~~le~~l~   91 (118)
T PF10256_consen   21 EYPGELSGYISPEEFEEIINTINQILKEAFEPISWRNIIENILGCLTLGLSSLCFKTHYKRKLRELEKYLE   91 (118)
T ss_pred             cCCHhhcCCCCHHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            56777999999999999999999987322  2  4344444555544222122234455677888999997


No 4  
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=91.67  E-value=0.2  Score=51.90  Aligned_cols=57  Identities=23%  Similarity=0.571  Sum_probs=38.4

Q ss_pred             CCCCCcc---cccceeCCCCceecccC--CCCCCcccCCCCCCCCCcceecccCcccCCCCCccccC
Q 000543          797 ACPRGLY---GVFCEECPVGTFKNVSG--SDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVS  858 (1433)
Q Consensus       797 ~CP~G~~---G~fC~eCP~GtYKn~~G--s~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~s  858 (1433)
                      .|..+-|   +.-|.+||.|+|+...=  .....|.|||.++.....++..     ....|.-.|++
T Consensus        31 ~C~e~EY~~~~~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe~~N~~~-----~C~~C~~~Cd~   92 (127)
T PHA02637         31 KCKDNEYKRHNLCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTSHNNHLP-----ACLSCNGRCDR   92 (127)
T ss_pred             CCCCCcCcCCCeEcCCCCCCCEEeCcCCCCCCcccccCCCCCeeccCCCCC-----cccccCCccCc
Confidence            6888865   34589999999987431  1245799999876555444442     35677777765


No 5  
>PF07562 NCD3G:  Nine Cysteines Domain of family 3 GPCR;  InterPro: IPR011500 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents a conserved sequence, found in the extracellular region, that contains several highly-conserved Cys residues that are predicted to form disulphide bridges.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway; PDB: 2E4X_B 2E4Y_A 2E4U_B 2E4V_B 2E4W_A.
Probab=90.96  E-value=0.065  Score=47.57  Aligned_cols=36  Identities=31%  Similarity=0.744  Sum_probs=20.4

Q ss_pred             ccCCCCCccc----------ccceeCCCCceecccCCCCCCcccCCCC
Q 000543          795 GKACPRGLYG----------VFCEECPVGTFKNVSGSDRALCRNCSSN  832 (1433)
Q Consensus       795 g~~CP~G~~G----------~fC~eCP~GtYKn~~Gs~~~~C~pC~~~  832 (1433)
                      ...|++|++-          .-|++||.|+|.|.+  +...|.+||.+
T Consensus         6 S~~C~pG~~k~~~~~~~~CCw~C~~C~~~~is~~~--~~~~C~~C~~~   51 (54)
T PF07562_consen    6 SEPCPPGQRKKIQKGQPSCCWDCVPCPEGEISNQT--DSTSCTKCPEG   51 (54)
T ss_dssp             S----TTTEEEE--SSS-S--EEEE--TTEEEE----ETTEEEE--TT
T ss_pred             CCCCCCCCEECccCCCcceEEEeecCCCCcEECCC--CccccccCCCc
Confidence            4689999862          349999999999986  45899999965


No 6  
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=90.78  E-value=0.42  Score=61.78  Aligned_cols=7  Identities=57%  Similarity=1.246  Sum_probs=2.9

Q ss_pred             CCCCCCe
Q 000543          287 AGGGGGR  293 (1433)
Q Consensus       287 GGGGGGR  293 (1433)
                      +|++||+
T Consensus      1272 ~G~~Gg~ 1278 (1282)
T KOG0921|consen 1272 GGGRGGN 1278 (1282)
T ss_pred             CCCcccc
Confidence            3344444


No 7  
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=90.11  E-value=0.41  Score=46.83  Aligned_cols=66  Identities=24%  Similarity=0.564  Sum_probs=41.2

Q ss_pred             EEccCCCCCcc---------cccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCc-ccCCCCCccccCCCcc
Q 000543          793 VTGKACPRGLY---------GVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGG-VTECPCPYKCVSERYH  862 (1433)
Q Consensus       793 i~g~~CP~G~~---------G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG-~~~~~CpY~C~sdk~~  862 (1433)
                      .-=+.||+|++         ..-|++||.|+|.+... ....|++|..=  + ...++....- ...+.|-  |.+++|-
T Consensus        12 ~cC~~C~pG~~~~~~C~~~~~t~C~~C~~g~ys~~~~-~~~~C~~c~~C--~-~g~~~~~~ct~t~dt~C~--C~~G~y~   85 (98)
T cd00185          12 LCCDQCPPGTYLSKDCTPGSDTVCEPCPPGTYTDSWN-HLPKCLSCRTC--D-SGLVEKAPCTATRNTVCG--CKPGFYC   85 (98)
T ss_pred             CCCCCCCCCccCCCcCCCCCCCeecCCCCCCcccCCC-CCCcCCcCccC--C-CCCEEEccCCCCCCCeEe--CCCCCEe
Confidence            44578999986         25699999999988753 23578888642  2 1222222222 3344675  8888875


Q ss_pred             CC
Q 000543          863 MP  864 (1433)
Q Consensus       863 ~p  864 (1433)
                      ..
T Consensus        86 ~~   87 (98)
T cd00185          86 LT   87 (98)
T ss_pred             cC
Confidence            44


No 8  
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=85.94  E-value=0.54  Score=48.88  Aligned_cols=41  Identities=27%  Similarity=0.665  Sum_probs=30.8

Q ss_pred             CCceEEccCCCCCcc---------cccceeCCCCceecccCCCCCCcccCC
Q 000543          789 GNGTVTGKACPRGLY---------GVFCEECPVGTFKNVSGSDRALCRNCS  830 (1433)
Q Consensus       789 ~nGTi~g~~CP~G~~---------G~fC~eCP~GtYKn~~Gs~~~~C~pC~  830 (1433)
                      ..+.+-=+.||||++         ..-|.+||.|||..... ....|.+|.
T Consensus        38 ~~~~~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe~~N-~~~~C~~C~   87 (127)
T PHA02637         38 KRHNLCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTSHNN-HLPACLSCN   87 (127)
T ss_pred             cCCCeEcCCCCCCCEEeCcCCCCCCcccccCCCCCeeccCC-CCCcccccC
Confidence            445667789999986         56799999999977543 345688776


No 9  
>KOG0921 consensus Dosage compensation complex, subunit MLE [Transcription]
Probab=82.94  E-value=2.8  Score=54.85  Aligned_cols=15  Identities=27%  Similarity=0.390  Sum_probs=7.5

Q ss_pred             EEEEEEeeEEeeEEE
Q 000543          557 QICRAEEINIEGIIK  571 (1433)
Q Consensus       557 qicrVeditv~g~v~  571 (1433)
                      ||||-+-..|+..|.
T Consensus      1135 dIs~pSAa~inLmig 1149 (1282)
T KOG0921|consen 1135 DISRPSAADINLMIG 1149 (1282)
T ss_pred             Hhcccccccccceec
Confidence            345555555554444


No 10 
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=77.04  E-value=1.9  Score=37.29  Aligned_cols=32  Identities=38%  Similarity=0.902  Sum_probs=24.8

Q ss_pred             ceEEccCCCCCcc----c-ccceeCCCCceecccCCC
Q 000543          791 GTVTGKACPRGLY----G-VFCEECPVGTFKNVSGSD  822 (1433)
Q Consensus       791 GTi~g~~CP~G~~----G-~fC~eCP~GtYKn~~Gs~  822 (1433)
                      +.-.=.+||+|+|    | ..|++||.|+|-...|+.
T Consensus         7 ~~~~C~~Cp~GtYq~~~g~~~C~~Cp~g~~T~~~Gs~   43 (48)
T PF07699_consen    7 GNNKCQPCPKGTYQDEEGQTSCTPCPPGSTTSSEGST   43 (48)
T ss_pred             CCCccCCCCCCccCCccCCccCccCcCCCccCCcCCc
Confidence            3344579999987    3 559999999998877763


No 11 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=66.37  E-value=6.8  Score=38.45  Aligned_cols=33  Identities=24%  Similarity=0.796  Sum_probs=23.9

Q ss_pred             cccceeCCCCceecccCC--CCCCcccCCCCCCCC
Q 000543          804 GVFCEECPVGTFKNVSGS--DRALCRNCSSNELPH  836 (1433)
Q Consensus       804 G~fC~eCP~GtYKn~~Gs--~~~~C~pC~~~~lp~  836 (1433)
                      +.-|..||.|+|-.....  ....|.|||.+.+..
T Consensus        11 ~~cC~~C~pG~~~~~~C~~~~~t~C~~C~~g~ys~   45 (98)
T cd00185          11 GLCCDQCPPGTYLSKDCTPGSDTVCEPCPPGTYTD   45 (98)
T ss_pred             CCCCCCCCCCccCCCcCCCCCCCeecCCCCCCccc
Confidence            456999999999876532  245799999865433


No 12 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=62.45  E-value=4.5  Score=40.27  Aligned_cols=24  Identities=25%  Similarity=0.493  Sum_probs=18.4

Q ss_pred             ccceeCCCCceecccCCCCCCcccCCC
Q 000543          805 VFCEECPVGTFKNVSGSDRALCRNCSS  831 (1433)
Q Consensus       805 ~fC~eCP~GtYKn~~Gs~~~~C~pC~~  831 (1433)
                      ..|.+|..|+|++..   ...|.+|+.
T Consensus         4 ~~Ct~C~~g~~~~~~---~~~C~~C~~   27 (96)
T PTZ00382          4 AVCTSCDSDKKPNKD---GSGCVLCSV   27 (96)
T ss_pred             cccCcCCCCCccCCC---CCcCCcCCC
Confidence            468999999988854   357888874


No 13 
>KOG4069 consensus Uncharacterized conserved protein [Function unknown]
Probab=60.04  E-value=18  Score=38.26  Aligned_cols=78  Identities=24%  Similarity=0.343  Sum_probs=54.4

Q ss_pred             CCCcchhhhhhcHHHHHHHHHHHHHHHhh-------hhhhhhhhheeeeeecchhHHHHHHHHHHHHHHHHHHHHhh---
Q 000543          965 HSPPEQVIEIVYEDAFNRFADEINALAAY-------QWWEGSVYSILSVLAYPLAWSWLQLCRKNKLQQLREFVRSE--- 1034 (1433)
Q Consensus       965 ~spP~~i~~ivyed~fn~Fad~IN~laay-------~~We~~i~~iLsvl~YPlaw~~~q~rRrkk~~rL~efv~se--- 1034 (1433)
                      ...|..+++.|..+.|+.=+..+|++-|=       ..+|+.+   =++.+|-.|.--.. --+||+++|+||+.++   
T Consensus        45 ~~~pa~le~~i~R~vfE~Ti~rlN~~yAeAE~~~~qty~Egcl---gC~TaY~iy~ctet-hYek~L~klskfl~~qNe~  120 (154)
T KOG4069|consen   45 AEYPARLEEKIPRDVFENTIVRLNRIYAEAEAITPQTYFEGCL---GCFTAYAIYACTET-HYEKKLDKLSKFLNRQNEE  120 (154)
T ss_pred             ecCcHHHhccCcHHHHHHHHHHHHHHHHHHHhcCCcchHHHHH---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhh
Confidence            35789999999999999999999998432       2456654   35556655543333 3357899999999854   


Q ss_pred             -cCc--cccccchhh
Q 000543         1035 -YDH--SCLRSCRSR 1046 (1433)
Q Consensus      1035 -ydh--~c~rs~rsR 1046 (1433)
                       |.|  .=+|++--|
T Consensus       121 IY~~~Gl~l~dP~eR  135 (154)
T KOG4069|consen  121 IYHHVGLHLRDPMER  135 (154)
T ss_pred             hccccceeecCchhh
Confidence             777  355665555


No 14 
>KOG4289 consensus Cadherin EGF LAG seven-pass G-type receptor [Signal transduction mechanisms]
Probab=53.40  E-value=11  Score=51.55  Aligned_cols=86  Identities=23%  Similarity=0.191  Sum_probs=55.2

Q ss_pred             hhhHHHHHHHHhhhhhHHHHHHHHHHHHHhHHHHHHHHHhhcccccCCCcccccccccCCCCCC-chhhhhhhhhhhhhH
Q 000543         1275 QDLVGLLVSVLLLADFSLVLLTLLQMYSISLLNFFLVLFILPLGLLFPFPAGISALFSHGPRRS-AGLARIYALWNITSL 1353 (1433)
Q Consensus      1275 ~~~~~l~i~~~ll~d~s~~ll~llq~y~~s~~~~~~~l~~lpl~~~~p~~ag~nalfs~~~rrs-~~~~r~~~lwn~~s~ 1353 (1433)
                      ...|++....+||+=+.+++|-.|---+.++..-|++-|-|           .+-+|-||--+. +-.+=+-+|--.+++
T Consensus      2170 ~a~~gvslaal~lt~~llls~RsLksn~~~I~~~l~~Al~l-----------~~L~Fv~gi~~nq~~CtvvailLhf~~~ 2238 (2531)
T KOG4289|consen 2170 YAAVGVSLAALLLTFLLLLSLRSLKSNSHGIHFNLAAALGL-----------AQLVFVLGINQNQFYCTVVAILLHFTYL 2238 (2531)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhHHHHHHhH-----------HHHHhhhhcccCchhhHHHHHHHHHHHh
Confidence            35677777778877777777777777777766555443321           122344444443 334556677788888


Q ss_pred             HHHHHHhhhceeeccccC
Q 000543         1354 INVATAFICGYLHYRDHS 1371 (1433)
Q Consensus      1354 ~n~~~a~~~g~~~~~~~~ 1371 (1433)
                      .-.+-+|+.|+--|.|-.
T Consensus      2239 stFaWlfl~gLhlYRml~ 2256 (2531)
T KOG4289|consen 2239 STFAWLFLEGLHLYRMLT 2256 (2531)
T ss_pred             hhHHHHHHHHHHHHHHHh
Confidence            888888888887777544


No 15 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=52.59  E-value=7.7  Score=33.51  Aligned_cols=20  Identities=25%  Similarity=0.750  Sum_probs=18.5

Q ss_pred             CCCCCcccccceeCCCCcee
Q 000543          797 ACPRGLYGVFCEECPVGTFK  816 (1433)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYK  816 (1433)
                      .|++++.|..|++|+.|+|-
T Consensus        21 ~C~~~~~G~~C~~C~~g~~g   40 (46)
T smart00180       21 ECKPNVTGRRCDRCAPGYYG   40 (46)
T ss_pred             ECCCCCCCCCCCcCCCCcCC
Confidence            49999999999999999996


No 16 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=52.35  E-value=7.6  Score=33.86  Aligned_cols=27  Identities=26%  Similarity=0.628  Sum_probs=22.0

Q ss_pred             CCceEEccCCCCCcccccceeCCCCceecc
Q 000543          789 GNGTVTGKACPRGLYGVFCEECPVGTFKNV  818 (1433)
Q Consensus       789 ~nGTi~g~~CP~G~~G~fC~eCP~GtYKn~  818 (1433)
                      .+|.-   .|++++.|..|++|+.|+|...
T Consensus        17 ~~G~C---~C~~~~~G~~C~~C~~g~~~~~   43 (50)
T cd00055          17 GTGQC---ECKPNTTGRRCDRCAPGYYGLP   43 (50)
T ss_pred             CCCEE---eCCCcCCCCCCCCCCCCCccCC
Confidence            34544   4999999999999999999653


No 17 
>PF05268 GP38:  Phage tail fibre adhesin Gp38;  InterPro: IPR007932 This entry is represented by Bacteriophage T2, Gp38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains several Gp38 proteins from T-even-like phages. Gp38, together with a second phage protein, Gp57, catalyses the organisation of Gp37 but is absent from the phage particle. Gp37 is responsible for receptor recognition [].
Probab=50.83  E-value=69  Score=36.75  Aligned_cols=37  Identities=24%  Similarity=0.514  Sum_probs=21.0

Q ss_pred             cccCCCCCCCCCCCCCCCCCcEEEEEEceeEEe--ccEEE
Q 000543          210 SYGSKGGTTFKGENFGGDGGGRIRLEVVNEIEV--NGSLL  247 (1433)
Q Consensus       210 ~~GSGGG~~~~~~~~GG~GGG~I~i~a~~~l~l--~G~i~  247 (1433)
                      .+|-|| +++.....|.+||=+|+=....+|.|  +|.|-
T Consensus       114 ~yGRGG-nGs~~~~~g~~GG~~I~N~iG~rLRI~N~GaIA  152 (260)
T PF05268_consen  114 MYGRGG-NGSGSNSAGAAGGHAIQNDIGGRLRINNNGAIA  152 (260)
T ss_pred             EEecCC-CCCCCCCCccccceeeecCCcceEEEecCCEEe
Confidence            344433 33445567778788887655555554  44554


No 18 
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=46.85  E-value=72  Score=38.63  Aligned_cols=7  Identities=71%  Similarity=1.341  Sum_probs=3.4

Q ss_pred             CCCCeEE
Q 000543          289 GGGGRVS  295 (1433)
Q Consensus       289 GGGGRI~  295 (1433)
                      |||||-.
T Consensus       453 ggggrg~  459 (465)
T KOG3973|consen  453 GGGGRGG  459 (465)
T ss_pred             CCCCCcc
Confidence            3366643


No 19 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=45.46  E-value=19  Score=37.08  Aligned_cols=20  Identities=35%  Similarity=0.884  Sum_probs=8.7

Q ss_pred             c-hhHHHHHHHHHHHHHHHHH
Q 000543          881 W-LFGLILLGLLILLALVLSV  900 (1433)
Q Consensus       881 ~-~F~lll~~llilLalv~s~  900 (1433)
                      | +|+||+++++++|+++..+
T Consensus         2 W~l~~iii~~i~l~~~~~~~~   22 (130)
T PF12273_consen    2 WVLFAIIIVAILLFLFLFYCH   22 (130)
T ss_pred             eeeHHHHHHHHHHHHHHHHHH
Confidence            5 3444444444444444333


No 20 
>PF14946 DUF4501:  Domain of unknown function (DUF4501)
Probab=44.22  E-value=80  Score=34.80  Aligned_cols=37  Identities=41%  Similarity=0.630  Sum_probs=23.3

Q ss_pred             HcCCcchhH-HHHHHHHHHHHHHHHHhhhhccC-CCCCC
Q 000543          876 TFGGPWLFG-LILLGLLILLALVLSVARMKYMG-GDELP  912 (1433)
Q Consensus       876 tfGGp~~F~-lll~~llilLalv~s~~R~k~~~-~d~l~  912 (1433)
                      .+|||+..+ |||=.|+|-+++++|++-.-|-| +.++|
T Consensus        83 ~~g~P~vAASL~LgTffIS~~LilSvA~FFYLKrs~kLP  121 (180)
T PF14946_consen   83 HTGGPQVAASLFLGTFFISLGLILSVASFFYLKRSSKLP  121 (180)
T ss_pred             cCCChhHHHHHHHHHHHHHHHHHHHHhhheeecccccCC
Confidence            489999544 44446667788888887654444 33444


No 21 
>PF10256 Erf4:  Golgin subfamily A member 7/ERF4 family;  InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4. 
Probab=42.40  E-value=36  Score=34.36  Aligned_cols=56  Identities=16%  Similarity=0.242  Sum_probs=40.0

Q ss_pred             ccCCCChHHHHHHHHHHHHhhhhh-ccccc------------------------chhhhhHHHHHhhcCCccccccCeEE
Q 000543         1114 MSQSVPPTVWYRLVAGVNAQLRLV-HCGHL------------------------KTTFGHLISWLDTHANPSLCQYGIRV 1168 (1433)
Q Consensus      1114 ~~Q~~~~~~w~rfVa~lN~qlr~v-~~g~l------------------------r~tl~~v~~~le~h~n~~l~~~gvrv 1168 (1433)
                      +...++++.|+++|..+|..|+.- ..-..                        ++.+..+-+||++.|++.++.+||++
T Consensus        26 L~~~is~~ef~~iI~~IN~~l~~a~~~~~~~~~~~~~l~~lt~~l~~~~~~~~~~~~~~~le~~l~~~N~~~~~~~gi~i  105 (118)
T PF10256_consen   26 LSGYISPEEFEEIINTINQILKEAFEPISWRNIIENILGCLTLGLSSLCFKTHYKRKLRELEKYLEQLNEELFKPRGIKI  105 (118)
T ss_pred             hcCCCCHHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcEE
Confidence            345688999999999999987654 22111                        22333466789888887999999975


Q ss_pred             E
Q 000543         1169 D 1169 (1433)
Q Consensus      1169 d 1169 (1433)
                      =
T Consensus       106 i  106 (118)
T PF10256_consen  106 I  106 (118)
T ss_pred             E
Confidence            3


No 22 
>KOG1836 consensus Extracellular matrix glycoprotein Laminin subunits alpha and gamma [Extracellular structures]
Probab=39.88  E-value=19  Score=50.85  Aligned_cols=43  Identities=42%  Similarity=0.767  Sum_probs=35.6

Q ss_pred             CceEEccCCCCCcccccceeCCCCceecccCC--CCCCcccCCCC
Q 000543          790 NGTVTGKACPRGLYGVFCEECPVGTFKNVSGS--DRALCRNCSSN  832 (1433)
Q Consensus       790 nGTi~g~~CP~G~~G~fC~eCP~GtYKn~~Gs--~~~~C~pC~~~  832 (1433)
                      .-++.-|.||+||.|..|++|.-|+|=+..+-  +...|++|+-+
T Consensus       794 ~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~  838 (1705)
T KOG1836|consen  794 ILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCN  838 (1705)
T ss_pred             ccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceec
Confidence            56677799999999999999999999886643  23579999975


No 23 
>PF15496 DUF4646:  Domain of unknown function (DUF4646)
Probab=39.60  E-value=36  Score=35.34  Aligned_cols=23  Identities=17%  Similarity=0.356  Sum_probs=18.4

Q ss_pred             CCChHHHHHHHHHHHHhhhhhcc
Q 000543         1117 SVPPTVWYRLVAGVNAQLRLVHC 1139 (1433)
Q Consensus      1117 ~~~~~~w~rfVa~lN~qlr~v~~ 1139 (1433)
                      +|..+.|.||+.+|+.-.++-..
T Consensus        46 DVs~eDW~~F~~dl~~aa~ls~~   68 (123)
T PF15496_consen   46 DVSEEDWTRFLNDLSEAASLSPS   68 (123)
T ss_pred             CCCHHHHHHHHHHHHHHHhcCcc
Confidence            59999999999999997444333


No 24 
>PF07354 Sp38:  Zona-pellucida-binding protein (Sp38);  InterPro: IPR010857 This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90 kDa family of zona pellucida glycoproteins in a calcium-dependent manner []. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur [].; GO: 0007339 binding of sperm to zona pellucida, 0005576 extracellular region
Probab=39.28  E-value=18  Score=42.11  Aligned_cols=36  Identities=28%  Similarity=0.622  Sum_probs=27.3

Q ss_pred             EEccCCCCCccc---------ccceeCCCCceecccCCCCCCcccCCC
Q 000543          793 VTGKACPRGLYG---------VFCEECPVGTFKNVSGSDRALCRNCSS  831 (1433)
Q Consensus       793 i~g~~CP~G~~G---------~fC~eCP~GtYKn~~Gs~~~~C~pC~~  831 (1433)
                      ++=..|+|||+-         -=|+-|++|||....+   ..|+.|..
T Consensus       217 v~idsC~PGfG~N~~~h~~C~~CCVvCsPgTysp~~~---~~C~~C~~  261 (271)
T PF07354_consen  217 VRIDSCRPGFGKNDILHSDCPSCCVVCSPGTYSPDDD---VHCQQCNS  261 (271)
T ss_pred             EEeeccCCCCCcCcccCCCCCCeeEECCCcccCCCCC---ceEEecCc
Confidence            334678888761         3599999999988754   58999985


No 25 
>PF15496 DUF4646:  Domain of unknown function (DUF4646)
Probab=38.19  E-value=51  Score=34.27  Aligned_cols=70  Identities=16%  Similarity=0.244  Sum_probs=46.1

Q ss_pred             hhhhhcHHHHHHHHHHHHHHHhhhhhhhhhhheeee--------eecchhHHHHHHHHHHHHHHHHHHHHhhcCccccc
Q 000543          971 VIEIVYEDAFNRFADEINALAAYQWWEGSVYSILSV--------LAYPLAWSWLQLCRKNKLQQLREFVRSEYDHSCLR 1041 (1433)
Q Consensus       971 i~~ivyed~fn~Fad~IN~laay~~We~~i~~iLsv--------l~YPlaw~~~q~rRrkk~~rL~efv~seydh~c~r 1041 (1433)
                      ...=|.++++.+|.++++.-++..-+|.++-...-+        ..|=.++.....+.+||-..+.++|. ..++.++|
T Consensus        43 ~~~DVs~eDW~~F~~dl~~aa~ls~~~~~~~~~~~~~~~v~~Gi~~~~v~~~~~~~~~~~k~~~v~~~i~-~WN~~FF~  120 (123)
T PF15496_consen   43 ASHDVSEEDWTRFLNDLSEAASLSPSQSIVAGVGPIVMGVGFGIPAYLVAKAIRKAMKEKKRGEVESTID-QWNEGFFR  120 (123)
T ss_pred             hhcCCCHHHHHHHHHHHHHHHhcCcccceeeeeccccccccccchhhhhhHhhhhcccccchHHHHHHHH-HHHHHhcc
Confidence            334467899999999999998887777543332222        22333444666666777778888887 56665555


No 26 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=37.63  E-value=13  Score=32.08  Aligned_cols=22  Identities=32%  Similarity=0.696  Sum_probs=18.1

Q ss_pred             CCCCCcccccceeCCCCceecc
Q 000543          797 ACPRGLYGVFCEECPVGTFKNV  818 (1433)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYKn~  818 (1433)
                      .|++++.|..|++|..|+|...
T Consensus        21 ~C~~~~~G~~C~~C~~g~~~~~   42 (49)
T PF00053_consen   21 VCKPGTTGPRCDQCKPGYFGLP   42 (49)
T ss_dssp             SBSTTEESTTS-EE-TTEECST
T ss_pred             eccccccCCcCcCCCCcccccc
Confidence            4999999999999999999774


No 27 
>KOG1836 consensus Extracellular matrix glycoprotein Laminin subunits alpha and gamma [Extracellular structures]
Probab=36.55  E-value=28  Score=49.31  Aligned_cols=64  Identities=30%  Similarity=0.564  Sum_probs=38.8

Q ss_pred             cCCCCCcccccceeCCCCceeccc-CCCCCCcccCCCCCCCCCcceecccCccc-------CCCCCccccCCCccC
Q 000543          796 KACPRGLYGVFCEECPVGTFKNVS-GSDRALCRNCSSNELPHRALYIPIRGGVT-------ECPCPYKCVSERYHM  863 (1433)
Q Consensus       796 ~~CP~G~~G~fC~eCP~GtYKn~~-Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~-------~~~CpY~C~sdk~~~  863 (1433)
                      =.||+||.|-||+.|+.|+.+-.. +.....|.||+-+  -+ ..--..+.|.-       .+.|. +|..++|.+
T Consensus       697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cn--gh-~~~Cd~~tG~C~C~~~t~G~~C~-~C~~GfYg~  768 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCN--GH-SNICDPRTGQCKCKHNTFGGQCA-QCVDGFYGL  768 (1705)
T ss_pred             ccCCCCcccchhhhcchhhhcccccCCCCCcccccccC--Cc-cccccCCCCceecccCCCCCchh-hhcCCCCCc
Confidence            579999999999999999976643 3333456677643  12 33333333321       12333 566677765


No 28 
>KOG2675 consensus Adenylate cyclase-associated protein (CAP/Srv2p) [Cytoskeleton; Signal transduction mechanisms]
Probab=36.33  E-value=21  Score=43.99  Aligned_cols=26  Identities=31%  Similarity=0.528  Sum_probs=13.8

Q ss_pred             CCCCccccccccCCCcceeeEEEEec
Q 000543          341 NLPTNTDTLLLEFPKQQLWTNVYIRD  366 (1433)
Q Consensus       341 n~~t~t~tlll~fp~~~lw~nv~v~~  366 (1433)
                      +.++..|-=+.|||--.=|-|.++..
T Consensus       445 ~vp~~~dgDy~EfpvPEQfkt~~~~~  470 (480)
T KOG2675|consen  445 NVPTNEDGDYVEFPVPEQFKTKFNGG  470 (480)
T ss_pred             ecccCCCCCcccccChHHHhhhccCc
Confidence            34444555567776555555544433


No 29 
>PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional
Probab=34.56  E-value=95  Score=33.95  Aligned_cols=61  Identities=15%  Similarity=0.245  Sum_probs=41.1

Q ss_pred             ceEEEccceeEEcCCceeeecccCcEEEEEeeeeEEeCCCceEEeceEEEEEeeEEEcCCcEEE
Q 000543           87 DIYVEGSGNLHILPGVTLCCPIKGCLLTINVTGEFLLGRNSEIVAGTVYVSALNASFSSGSVVN  150 (1433)
Q Consensus        87 Dv~i~G~g~L~I~~gv~l~c~~~G~~I~v~~sGn~~lg~ns~ivagsi~L~A~nvti~~~g~I~  150 (1433)
                      ..++++.+++++-.+|.+   ..+|.+.+...+.+.+++++.|-.........+++|.++..|.
T Consensus        35 pf~~~~~~~I~iG~~v~i---~~~~ri~~~~~~~i~IG~~v~Ig~~v~I~~~~~v~IG~~v~Ig   95 (192)
T PRK09677         35 PFYIRNDGSINFGEGFTS---GVGLRLDAFGRGKLFFGDNVQVNDYVHIACIESITIGRDTLIA   95 (192)
T ss_pred             CEEEcCCCeEEECCceEE---CCCeEEEecCCCeEEECCCCEECCCcEEccCceEEECCCCEEC
Confidence            456667778888888766   6677777766778888888887655443334556666555554


No 30 
>KOG4611 consensus Uncharacterized conserved protein [Function unknown]
Probab=31.72  E-value=37  Score=40.84  Aligned_cols=48  Identities=42%  Similarity=0.980  Sum_probs=0.0

Q ss_pred             ceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCcc---------------------ccCCCccC--
Q 000543          807 CEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYK---------------------CVSERYHM--  863 (1433)
Q Consensus       807 C~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~---------------------C~sdk~~~--  863 (1433)
                      |+|||.||...   .+.--|..|-+.                ...||-+                     |-+++|++  
T Consensus        52 ceecpegtlss---pdqtgclncnng----------------tchcpsqstlifrdasgnlltndafcgncasgfyrndn  112 (747)
T KOG4611|consen   52 CEECPEGTLSS---PDQTGCLNCNNG----------------TCHCPSQSTLIFRDASGNLLTNDAFCGNCASGFYRNDN  112 (747)
T ss_pred             cccCCCcccCC---CccCCceecCCC----------------ccCCCCcceEEEEcCCCCeeccccccccccccceECCC


Q ss_pred             ---CcccchHHHH
Q 000543          864 ---PHCYTTLEEL  873 (1433)
Q Consensus       864 ---p~C~T~leel  873 (1433)
                         ..|-|...|.
T Consensus       113 gyctkcetscsem  125 (747)
T KOG4611|consen  113 GYCTKCETSCSEM  125 (747)
T ss_pred             cccccccccHhhh


No 31 
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=29.99  E-value=39  Score=29.03  Aligned_cols=22  Identities=41%  Similarity=0.995  Sum_probs=13.4

Q ss_pred             cCCCCCc--ccccce-eCCCCceec
Q 000543          796 KACPRGL--YGVFCE-ECPVGTFKN  817 (1433)
Q Consensus       796 ~~CP~G~--~G~fC~-eCP~GtYKn  817 (1433)
                      ..|+.|+  .+..|+ +||.++|.+
T Consensus        18 ~~C~~~~~~~~~~Cv~~C~~~~~~~   42 (49)
T cd00064          18 TSCRHGFYLDGGTCVSECPEGTYAD   42 (49)
T ss_pred             ccCcCccCCCCCcccccCCCCceec
Confidence            4566666  345566 666666665


No 32 
>KOG4258 consensus Insulin/growth factor receptor (contains protein kinase domain) [Signal transduction mechanisms]
Probab=28.33  E-value=37  Score=44.92  Aligned_cols=46  Identities=26%  Similarity=0.608  Sum_probs=30.2

Q ss_pred             ccce-eCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccCCcccchHHHH
Q 000543          805 VFCE-ECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHMPHCYTTLEEL  873 (1433)
Q Consensus       805 ~fC~-eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C~T~leel  873 (1433)
                      ..|. +||.|+-.|.   +..+|++|..                   .||-.|.++ .+.-+=.+..|+|
T Consensus       304 ~~Cv~~CPsGy~~N~---~~~~C~~C~g-------------------~C~~vC~~~-~~tIds~~~Aq~L  350 (1025)
T KOG4258|consen  304 GQCVAKCPSGYKRNS---SSSECVKCEG-------------------PCPKVCEPG-TKTIDSVADAQNL  350 (1025)
T ss_pred             ccchhhCCCcceecC---cccceeccCC-------------------CCcceeccC-ceEecchhhhHHh
Confidence            3566 8999986664   3589999962                   799999863 2222334455554


No 33 
>PF02166 Androgen_recep:  Androgen receptor;  InterPro: IPR001103 Steroid or nuclear hormone receptors (NRs) constitute an important super-family of transcription regulators that are involved in diverse physiological functions, including control of embryonic development, cell differentiation and homeostasis. Members include the steroid hormone receptors and receptors for thyroid hormone, retinoids and 1,25-dihydroxy-vitamin D3. The proteins function as dimeric molecules in the nucleus to regulate the transcription of target genes in a ligand-responsive manner [, ].   NRs are extremely important in medical research, a large number of them being implicated in diseases such as cancer, diabetes and hormone resistance syndromes. Many do not yet have a defined ligand and are accordingly termed "orphan" receptors. More than 300 NRs have been described to date and a new system has recently been introduced in an attempt to rationalise the increasingly complex set of names used to describe superfamily members. The androgen receptor (AR) consists of 3 functional and structural domains: an N-terminal (modulatory) domain; a DNA binding domain (IPR001628 from INTERPRO) that mediates specific binding to target DNA sequences (ligand-responsive elements); and a hormone binding domain. The N-terminal domain (NTD) is unique to the androgen receptors and spans approximately the first 530 residues; the highly-conserved DNA-binding domain is smaller (around 65 residues) and occupies the central portion of the protein; and the hormone ligand binding domain (LBD) lies at the receptor C terminus. In the absence of ligand, steroid hormone receptors are thought to be weakly associated with nuclear components; hormone binding greatly increases receptor affinity.  The LBDs of steroid hormone receptors fold into 12 helices that form a ligand-binding pocket. When an agonist is bound, helix 12 folds over the pocket to enclose the ligand []. When an antagonist is unbound, helix 12 is positioned away from the pocket in a way that interferes with the binding of coactivators to a groove in the hormone-binding domain formed after ligand binding. In AR, ligand binding that induces folding of helix 12 to overlie the pocket discloses a groove that binds a region of the NTD. Coactivator molecules can also bind to this groove, but the predominant site for coactivator binding to AR is in the NTD. AR ligand resides in a pocket and primarily contacts helices 4, 5, and 10. The DNA-binding region includes eight cysteine residues that form two coordination complexes, each composed of four cysteines and a Zn2+ ion. These two zinc fingers form the structure that binds to the major groove of DNA. The second zinc finger stabilises the binding complex by hydrophobic interactions with the first finger and contributes to specificity of receptor DNA binding. It is also necessary for receptor dimerisation that occurs during DNA binding Defects in the androgen receptor cause testicular feminisation syndrome, androgen insensibility syndrome (AIS) [, ]. AIS may be complete (CAIS), where external genitalia are phenotypically female; partial (PAIS), where genitalia are substantively ambiguous; or mild (MAIS), where external genitalia are normal male, or nearly so. Defects in the receptor also cause X-linked spinal and bulbar muscular atrophy (also known as Kennedy's disease).; GO: 0003677 DNA binding, 0004882 androgen receptor activity, 0005496 steroid binding, 0006355 regulation of transcription, DNA-dependent, 0030521 androgen receptor signaling pathway, 0005634 nucleus; PDB: 1XOW_B 2Q7K_B 2Q7I_B.
Probab=27.45  E-value=20  Score=42.80  Aligned_cols=26  Identities=50%  Similarity=0.989  Sum_probs=0.0

Q ss_pred             cccccCcceeeeCCCCCcccCCCCCCCCCCCCCCCCC
Q 000543           21 SLDQYNFPVIGFGADSLFHGDYTPPSPPPPIAPPHPP   57 (1433)
Q Consensus        21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~   57 (1433)
                      +-|-||||.-           ..-|+|||||+|||+.
T Consensus       335 sRDyYnF~la-----------Lag~~~p~~~~hph~R  360 (423)
T PF02166_consen  335 SRDYYNFPLA-----------LAGPPPPPPPPHPHAR  360 (423)
T ss_dssp             -------------------------------------
T ss_pred             cccccccccc-----------cccccccccccccccc
Confidence            4577888752           2335556666777665


No 34 
>KOG4180 consensus Predicted kinase [General function prediction only]
Probab=25.75  E-value=1.2e+02  Score=36.70  Aligned_cols=28  Identities=32%  Similarity=0.547  Sum_probs=19.5

Q ss_pred             CCCCCcccccCCcEEEEEecccccccCcc
Q 000543         1075 RADLPPRLNQRLPMSLCFGGDGSYMSPFS 1103 (1433)
Q Consensus      1075 r~dl~~~i~q~~Pm~Ilf~GdGSym~Pf~ 1103 (1433)
                      |.+|... ..--.|+|-.||||+|+--=+
T Consensus        96 R~~lsq~-i~waD~VisvGGDGTfL~Aas  123 (395)
T KOG4180|consen   96 RNDLSQP-IRWADMVISVGGDGTFLLAAS  123 (395)
T ss_pred             hhhccCc-CchhhEEEEecCccceeehhh
Confidence            3444333 344579999999999987766


No 35 
>KOG4260 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.42  E-value=73  Score=37.61  Aligned_cols=50  Identities=36%  Similarity=0.869  Sum_probs=0.0

Q ss_pred             CCCCcccccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecccCcccCCCCCccccCCCccCCcc
Q 000543          798 CPRGLYGVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPIRGGVTECPCPYKCVSERYHMPHC  866 (1433)
Q Consensus       798 CP~G~~G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C  866 (1433)
                      ||+|.||.-|.+||-|+=+.-.|.  ..|+   .+         ..|+|...-.|.     --|.-|+|
T Consensus       132 Cp~gtyGpdCl~Cpggser~C~Gn--G~C~---Gd---------GsR~GsGkCkC~-----~GY~Gp~C  181 (350)
T KOG4260|consen  132 CPDGTYGPDCLQCPGGSERPCFGN--GSCH---GD---------GSREGSGKCKCE-----TGYTGPLC  181 (350)
T ss_pred             cCCCCcCCccccCCCCCcCCcCCC--Cccc---CC---------CCCCCCCccccc-----CCCCCccc


No 36 
>PF15195 TMEM210:  TMEM210 family
Probab=25.32  E-value=43  Score=33.94  Aligned_cols=11  Identities=45%  Similarity=1.123  Sum_probs=4.7

Q ss_pred             CCCCCCCCCCC
Q 000543           41 DYTPPSPPPPI   51 (1433)
Q Consensus        41 ~~~~~~~~p~~   51 (1433)
                      +-+|++|||||
T Consensus       101 ~~~p~~ppppp  111 (116)
T PF15195_consen  101 EASPEEPPPPP  111 (116)
T ss_pred             CCCCCCCCcCc
Confidence            34444444443


No 37 
>COG4907 Predicted membrane protein [Function unknown]
Probab=24.45  E-value=51  Score=41.10  Aligned_cols=7  Identities=71%  Similarity=1.526  Sum_probs=3.4

Q ss_pred             CCCCCCe
Q 000543          287 AGGGGGR  293 (1433)
Q Consensus       287 GGGGGGR  293 (1433)
                      |||||+|
T Consensus       589 GGGGGar  595 (595)
T COG4907         589 GGGGGAR  595 (595)
T ss_pred             CCCCCCC
Confidence            4445554


No 38 
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures]
Probab=23.41  E-value=80  Score=43.22  Aligned_cols=65  Identities=26%  Similarity=0.664  Sum_probs=41.4

Q ss_pred             CCCCCcccccceeCCCCceecccCCCCCCcccCCCCCCCCCcceecc-------cCcccCCCCCccccCCCccCCcc
Q 000543          797 ACPRGLYGVFCEECPVGTFKNVSGSDRALCRNCSSNELPHRALYIPI-------RGGVTECPCPYKCVSERYHMPHC  866 (1433)
Q Consensus       797 ~CP~G~~G~fC~eCP~GtYKn~~Gs~~~~C~pC~~~~lp~ra~yiy~-------~gG~~~~~CpY~C~sdk~~~p~C  866 (1433)
                      .|-|+--|-.|.+|-+|||-  +|  .+=|++|.-+..-+-.+|-..       |.|..-..|. +|.+++..-|-|
T Consensus       785 qCkPnVVGR~CdqCApGtyG--FG--PsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCn-qCqpG~WgFPeC  856 (1758)
T KOG0994|consen  785 QCKPNVVGRRCDQCAPGTYG--FG--PSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCN-QCQPGYWGFPEC  856 (1758)
T ss_pred             cccCccccccccccCCcccC--cC--CccCccccccccccccccccccccceeeccccchhhcc-ccCCCccCCCcC
Confidence            67777889999999999993  33  477999987666555555432       2234444554 555554444433


No 39 
>cd04299 GT1_Glycogen_Phosphorylase_like This family is most closely related to the oligosaccharide phosphorylase domain family and other unidentified sequences. Oligosaccharide phosphorylase catalyzes the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The members of this family are found in bacteria and Archaea.
Probab=22.78  E-value=54  Score=43.47  Aligned_cols=77  Identities=25%  Similarity=0.312  Sum_probs=44.4

Q ss_pred             HHHHHHHH-HHHHHHHhhcCccccccchhhhhhcceEEeecCC-ceEEEEEEecCCCCcCCCCCccccc---------CC
Q 000543         1018 LCRKNKLQ-QLREFVRSEYDHSCLRSCRSRALYEGLKVAATAD-LMLAYIDFFLGGDEKRADLPPRLNQ---------RL 1086 (1433)
Q Consensus      1018 ~rRrkk~~-rL~efv~seydh~c~rs~rsRaly~~lKvs~S~D-ysLaYiD~f~~~d~kr~dl~~~i~q---------~~ 1086 (1433)
                      |..|++.| +|+++|+......+.|.-.+.+.-.....-..|| ++++++==|-  .|||.+|-.+...         .-
T Consensus       435 w~~K~~~K~~L~~~v~~~~~~~~~~~g~~~~~~~~~~~~ldpd~ltigfarRfa--~YKR~~Lil~dl~rl~~il~~~~~  512 (778)
T cd04299         435 WEVRQQLRRRLIEFVRRRLRRQWLRRGASAEEIGEADDVLDPNVLTIGFARRFA--TYKRATLLLRDPERLKRLLNDPER  512 (778)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhhcCCchhhhhhcCCccCCCccEEeeeecch--hhhhHHHHHHHHHHHHHHhhCCCC
Confidence            44444444 5888888655544444222221111222234566 6677766664  6899998443221         25


Q ss_pred             cEEEEEeccc
Q 000543         1087 PMSLCFGGDG 1096 (1433)
Q Consensus      1087 Pm~Ilf~GdG 1096 (1433)
                      |+-+||+|.+
T Consensus       513 pvQ~IfaGKA  522 (778)
T cd04299         513 PVQFIFAGKA  522 (778)
T ss_pred             CeEEEEEEec
Confidence            9999999998


No 40 
>PF04519 Bactofilin:  Polymer-forming cytoskeletal;  InterPro: IPR007607 This family contains several uncharacterised hypothetical proteins.
Probab=22.70  E-value=3.1e+02  Score=26.84  Aligned_cols=57  Identities=18%  Similarity=0.424  Sum_probs=0.0

Q ss_pred             ccCeEEEeeceeeccCceEEEccceeEEcCCceeeecccCcEEEEEeeeeEEeCCCceEEeceEEEEEeeEEEcCCcEEE
Q 000543           71 LETVCLLNSSLTFENDDIYVEGSGNLHILPGVTLCCPIKGCLLTINVTGEFLLGRNSEIVAGTVYVSALNASFSSGSVVN  150 (1433)
Q Consensus        71 ~~t~C~l~~s~~~~~qDv~i~G~g~L~I~~gv~l~c~~~G~~I~v~~sGn~~lg~ns~ivagsi~L~A~nvti~~~g~I~  150 (1433)
                      +...|.+.-++.+.. ++.|.|.              +.|   .+...+.+.|+.++.+.+.   +.|.++.|  .|.++
T Consensus         5 I~~~~~i~G~i~~~~-~v~i~G~--------------v~G---~i~~~g~v~i~~~~~v~G~---i~~~~~~i--~G~v~   61 (101)
T PF04519_consen    5 IGKGTKIEGDISSDG-DVRIDGR--------------VEG---NIKAEGKVKIGGNGEVKGD---IKADDVII--SGSVD   61 (101)
T ss_pred             ECCCCEEEEEEEECc-EEEEEEE--------------EEE---EEEEceEEEEcCCCEEEEE---EEEeEEEE--cCEEe


No 41 
>PF00020 TNFR_c6:  TNFR/NGFR cysteine-rich region;  InterPro: IPR001368 A number of proteins, some of which are known to be receptors for growth factors have been found to contain a cysteine-rich domain at the N-terminal region that can be subdivided into four (or in some cases, three) repeats containing six conserved cysteines all of which are involved in intrachain disulphide bonds [].  CD27 (also called S152 or T14) mediates a co-stimulatory signal for T and B cell activation and is involved in murine T cell development. Tyrosine-phosphorylation of ZAP-70 following CD27 ligation of T cells has been reported [], but not confirmed independently. CD30 was originally identified as Ki-1, an antigen expressed on Reed-Sternberg cells in Hodgkin's lymphomas and other non-Hodgkin's lymphomas, particularly diffuse large-cell lymphoma and immunoblastic lymphoma. CD30 has pleiotropic effects on CD30-positive lymphoma cell lines ranging from cell proliferation to cell death. It is thought to be involved in negative selection of T-cells in the thymus and is involved in TCR-mediated cell death. CD30 is a member of the TNFR family of molecules, activate NFkB through interaction with TRAF2 and TRAF5. CD40 (Bp50) plays a central role in the regulation of cell-mediated immunity as well as antibody mediated immunity. It is central to T cell dependent (TD)-responses and may influence survival of B cell lymphomas. CD95 (also called APO-1, fas antigen, Fas tumour necrosis factor receptor superfamily, member 6, TNFRSF6 or apoptosis antigen 1, APT1) is expressed, typically at high levels, on activated T and B cells. It is involved in the mediation of apoptosis-inducing signals.  Other proteins known to belong to this family [, , , ] are, tumour Necrosis Factor type I and type II receptors (TNFR), Rabbit fibroma virus soluble TNF receptor (protein T2), lymphotoxin alpha/beta receptor, low-affinity nerve growth factor receptor (LA-NGFR) (p75), T-cell antigen OX40, Wsl-1, a receptor (for a yet undefined ligand) that mediates apoptosis and Vaccinia virus protein A53 (SalF19R). CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0005488 binding; PDB: 3TJE_F 3QD6_S 3ALQ_U 3IJ2_Y 3BUK_D 1SG1_X 1NCF_B 1EXT_A 1FT4_B 1TNR_R ....
Probab=22.13  E-value=40  Score=27.97  Aligned_cols=19  Identities=37%  Similarity=0.947  Sum_probs=14.0

Q ss_pred             CCCCceecccCCCCCCcccCC
Q 000543          810 CPVGTFKNVSGSDRALCRNCS  830 (1433)
Q Consensus       810 CP~GtYKn~~Gs~~~~C~pC~  830 (1433)
                      ||.|+|.+..+.  ..|.||.
T Consensus         1 C~~g~y~~~~~~--~~C~~C~   19 (39)
T PF00020_consen    1 CPPGTYSDSENH--PQCLPCS   19 (39)
T ss_dssp             ECTTEEEESSCS--SSEEEEE
T ss_pred             CccCcccCCCCC--CcCCccC
Confidence            899999997543  6776654


No 42 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=22.02  E-value=27  Score=23.64  Aligned_cols=11  Identities=45%  Similarity=1.066  Sum_probs=8.3

Q ss_pred             CCCCCcccccc
Q 000543          797 ACPRGLYGVFC  807 (1433)
Q Consensus       797 ~CP~G~~G~fC  807 (1433)
                      .||+||.|..|
T Consensus         3 ~C~~G~~G~~C   13 (13)
T PF12661_consen    3 QCPPGWTGPNC   13 (13)
T ss_dssp             EE-TTEETTTT
T ss_pred             cCcCCCcCCCC
Confidence            48999999877


No 43 
>PF05281 Secretogranin_V:  Neuroendocrine protein 7B2 precursor (Secretogranin V);  InterPro: IPR007945 Mature peptide hormones and neuropeptides are typically synthesised from much larger precursors and require several post-translational processing steps--including proteolytic cleavage--for the formation of the bioactive species. The subtilisin-related proteolytic enzymes that accomplish neuroendocrine-specific cleavages are known as prohormone convertases 1 and 2 (PC1 and PC2), which belong to MEROPS peptidase family S8B. The cell biology of these proteases within the regulated secretory pathway of neuroendocrine cells is complex, and they are themselves initially synthesised as inactive precursor molecules. ProPC1 propeptide cleavage occurs rapidly in the endoplasmic reticulum, yet its major site of action on prohormones takes place later in the secretory pathway. PC1 undergoes an interesting carboxyl terminal processing event whose function appears to be to activate the enzyme. ProPC2, on the other hand, exhibits comparatively long initial folding times and exits the endoplasmic reticulum without propeptide cleavage, in association with the neuroendocrine-specific protein 7B2. Once the proPC2/7B2 complex arrives at the trans-Golgi network, 7B2 is internally cleaved into two domains, the 21kDa fragment and a carboxy-terminal 31 residue peptide. PC2 propeptide removal occurs in the maturing secretory granule, most likely through autocatalysis, and 7B2 association does not appear to be directly required for this cleavage event. However, if proPC2 has not encountered 7B2 intracellularly, it cannot generate a catalytically active mature species. The molecular mechanism behind the intriguing intracellular association of 7B2 and proPC2 is still unknown, but may involve conformational rearrangement or stabilisation of a proPC2 conformer mediated by a 36-residue internal segment of 21kDa 7B2. This family represents, 7B2 (secretogranin V), which is the molecular escort protein for PC2. 7B2 is a bifunctional protein with an N-terminal activation domain and a C-terminal inhibitory domain (MEROPS inhibitor family I21, clan I-) separated by a furin cleavage site []. Although 7B2 represents a potent inhibitor of PC2, there is an absolute requirement of 7B2 for the activation of PC2, which is synthesised as a zymogen. Both the full length, 27 kDa, and the C-terminal peptide (CT domain) derived from intramolecular cleavage of 7B2 are potent inhibitors of PC2. Studies have shown that the active peptide in the CT domain to be LLRVHK, active in the nanomolar range not only against PC2 but also PC1 [, ]. Knock-out studies have shown that the PC2 nulls are not phenotypically equivalent to the 7B2 nulls, which suggests that 7B2 may have other activities in addition to being the activator of PC2 [].  7B2 exhibits both structural and functional homology to proSAAS (IPR010832 from INTERPRO), which is the PC1 binding protein. The CT domain of proSAAS contains the same inhibitor hexapeptide as 7B2, consequently both 7B2 and proSAAS are two members of a homologous family of prohormone convertase inhibitor proteins. ; GO: 0007218 neuropeptide signaling pathway, 0030141 stored secretory granule
Probab=21.25  E-value=1.4e+02  Score=34.09  Aligned_cols=91  Identities=25%  Similarity=0.454  Sum_probs=52.8

Q ss_pred             CCcccccceeceEEEcCCCCCCCCCcccCCceEEccCCCCCc--ccccce---eCCCCceecccCCCCCCcccCCCCCCC
Q 000543          761 GDEYLPLASVNGSIDARGGLGRGQGLAGGNGTVTGKACPRGL--YGVFCE---ECPVGTFKNVSGSDRALCRNCSSNELP  835 (1433)
Q Consensus       761 Gd~y~pia~~~G~I~~~GG~~~~~g~~G~nGTi~g~~CP~G~--~G~fC~---eCP~GtYKn~~Gs~~~~C~pC~~~~lp  835 (1433)
                      -+||.+-.+.=|.-.++||++.+....+..|.|-...=-+--  -=-+|.   +||+|+- ...     .|.    .+++
T Consensus        49 d~E~l~hssLwG~q~isGGagEG~q~L~~~~~i~n~~~~ktd~~lPaYCnPPNPCPvGyt-~~d-----gCl----e~fE  118 (208)
T PF05281_consen   49 DQEYLQHSSLWGHQYISGGAGEGPQHLGPEGNIPNIKIVKTDNNLPAYCNPPNPCPVGYT-AED-----GCL----ENFE  118 (208)
T ss_pred             ccccccccccCCceeeeCCCCccchhcCccCCcCcccccccccCCCCCCCCCCCCCCCcc-ccc-----Ccc----cCCc
Confidence            357776667788888899998766666666655332110000  015687   8999985 222     254    3466


Q ss_pred             CCcceecccCcccCCCCCccccCCCccCCccc
Q 000543          836 HRALYIPIRGGVTECPCPYKCVSERYHMPHCY  867 (1433)
Q Consensus       836 ~ra~yiy~~gG~~~~~CpY~C~sdk~~~p~C~  867 (1433)
                      .-|.|--.-  .....|.  |  |-.||-.|-
T Consensus       119 nTAeFSRey--Qa~QdC~--C--D~EHMf~C~  144 (208)
T PF05281_consen  119 NTAEFSREY--QAAQDCM--C--DTEHMFDCP  144 (208)
T ss_pred             cHHHhhHHH--HhcCccC--C--CcccccCCc
Confidence            666554322  2233443  4  667888886


No 44 
>PF05268 GP38:  Phage tail fibre adhesin Gp38;  InterPro: IPR007932 This entry is represented by Bacteriophage T2, Gp38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains several Gp38 proteins from T-even-like phages. Gp38, together with a second phage protein, Gp57, catalyses the organisation of Gp37 but is absent from the phage particle. Gp37 is responsible for receptor recognition [].
Probab=20.98  E-value=1.7e+02  Score=33.79  Aligned_cols=21  Identities=29%  Similarity=0.703  Sum_probs=10.9

Q ss_pred             EEEecCCCC-CCCCcccCCccE
Q 000543          245 SLLADGGDV-GVKGGGGSGGSI  265 (1433)
Q Consensus       245 ~i~AnGg~g-~~~gGGGSGGSI  265 (1433)
                      .+.-.|+++ ....-|..||..
T Consensus       113 t~yGRGGnGs~~~~~g~~GG~~  134 (260)
T PF05268_consen  113 TMYGRGGNGSGSNSAGAAGGHA  134 (260)
T ss_pred             EEEecCCCCCCCCCCcccccee
Confidence            344446665 344555666653


No 45 
>COG4907 Predicted membrane protein [Function unknown]
Probab=20.91  E-value=64  Score=40.32  Aligned_cols=10  Identities=60%  Similarity=1.215  Sum_probs=4.4

Q ss_pred             CCCCCCCCCc
Q 000543          221 GENFGGDGGG  230 (1433)
Q Consensus       221 ~~~~GG~GGG  230 (1433)
                      .++.||.|||
T Consensus       584 gGg~GGGGGG  593 (595)
T COG4907         584 GGGSGGGGGG  593 (595)
T ss_pred             CCCCCCCCCC
Confidence            3344444444


No 46 
>PRK06958 single-stranded DNA-binding protein; Provisional
Probab=20.75  E-value=5.5e+02  Score=28.86  Aligned_cols=35  Identities=14%  Similarity=0.013  Sum_probs=20.0

Q ss_pred             ccCcEEEEEeeeeEEeCCCc---eEEeceEEEEEeeEEEc
Q 000543          108 IKGCLLTINVTGEFLLGRNS---EIVAGTVYVSALNASFS  144 (1433)
Q Consensus       108 ~~G~~I~v~~sGn~~lg~ns---~ivagsi~L~A~nvti~  144 (1433)
                      .+|+.|.|  .|.+....-.   -...-.+.|.|.+|.+.
T Consensus        71 kKGs~V~V--eGrL~~~~yeDkdG~kr~~~eVvA~~V~fL  108 (182)
T PRK06958         71 KKGSSVYI--EGRIRTRKWQGQDGQDRYSTEIVADQMQML  108 (182)
T ss_pred             CCCCEEEE--EEEEEeCceECCCCcEEEEEEEEEeEEEEC
Confidence            55655443  4666665322   22334677888888774


Done!