Query         psy14481
Match_columns 179
No_of_seqs    138 out of 325
Neff          4.0 
Searched_HMMs 46136
Date          Fri Aug 16 20:56:35 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14481.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14481hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00517 PolyA C-terminal do 100.0 3.8E-31 8.3E-36  187.7   6.6   64  106-169     1-64  (64)
  2 PF00658 PABP:  Poly-adenylate  100.0 6.3E-30 1.4E-34  184.2   3.7   72   95-166     1-72  (72)
  3 TIGR01628 PABP-1234 polyadenyl 100.0 9.2E-29   2E-33  225.5   8.9   80   88-167   483-562 (562)
  4 KOG0943|consensus               97.5 4.4E-05 9.4E-10   79.1   2.2   66  108-173  2605-2672(3015)
  5 PTZ00382 Variant-specific surf  95.4  0.0092   2E-07   45.1   1.9   33   27-59     60-93  (96)
  6 PF03302 VSP:  Giardia variant-  93.2   0.044 9.5E-07   50.2   1.6   34   26-59    360-394 (397)
  7 PF05808 Podoplanin:  Podoplani  91.7    0.05 1.1E-06   45.3   0.0   37   23-59    119-155 (162)
  8 PF08693 SKG6:  Transmembrane a  87.3    0.68 1.5E-05   30.5   2.8   30   32-61     11-40  (40)
  9 PF01034 Syndecan:  Syndecan do  84.1    0.33 7.1E-06   35.0   0.1   32   27-61      7-38  (64)
 10 PF12301 CD99L2:  CD99 antigen   82.5    0.51 1.1E-05   39.3   0.6   35   26-62    108-143 (169)
 11 PF01102 Glycophorin_A:  Glycop  78.1    0.48   1E-05   37.7  -0.9   16   29-44     60-75  (122)
 12 PF13908 Shisa:  Wnt and FGF in  74.3     4.4 9.6E-05   32.7   3.7   25   28-52     74-98  (179)
 13 PF02009 Rifin_STEVOR:  Rifin/s  72.7    0.71 1.5E-05   41.5  -1.3   29   33-61    256-285 (299)
 14 PF14828 Amnionless:  Amnionles  72.7     4.9 0.00011   37.8   4.1   25   33-57    338-362 (437)
 15 PF10873 DUF2668:  Protein of u  72.4     3.1 6.7E-05   34.6   2.4   29   28-56     56-84  (155)
 16 PF15102 TMEM154:  TMEM154 prot  69.1     9.5 0.00021   31.4   4.6   35   28-62     53-87  (146)
 17 PF06365 CD34_antigen:  CD34/Po  64.7     3.4 7.3E-05   35.5   1.2   36   33-68    101-137 (202)
 18 PF12273 RCR:  Chitin synthesis  64.4       2 4.4E-05   33.3  -0.2   34   40-73      5-41  (130)
 19 PHA03265 envelope glycoprotein  64.0     3.4 7.4E-05   38.7   1.2   39   21-62    339-377 (402)
 20 PF01299 Lamp:  Lysosome-associ  61.7     4.4 9.5E-05   35.6   1.4   28   32-62    273-300 (306)
 21 TIGR01495 ETRAMP Plasmodium ri  60.1     5.1 0.00011   30.0   1.3   32   31-62     50-81  (85)
 22 PF15069 FAM163:  FAM163 family  56.2     7.7 0.00017   31.9   1.8   24   31-54      4-27  (143)
 23 PTZ00046 rifin; Provisional     54.3     3.4 7.4E-05   38.3  -0.6   28   34-61    316-344 (358)
 24 TIGR01477 RIFIN variant surfac  53.2     3.6 7.8E-05   38.1  -0.6   28   34-61    311-339 (353)
 25 PF12768 Rax2:  Cortical protei  47.4     9.8 0.00021   33.7   1.2   30   27-56    221-251 (281)
 26 PF02158 Neuregulin:  Neureguli  45.5       7 0.00015   36.9   0.0   24   32-56      9-33  (404)
 27 PF12191 stn_TNFRSF12A:  Tumour  45.2     6.7 0.00014   31.8  -0.1   34   32-65     75-109 (129)
 28 PF05283 MGC-24:  Multi-glycosy  44.8     7.3 0.00016   33.1   0.0   27   34-60    158-184 (186)
 29 PHA03048 IMV membrane protein;  42.9     6.2 0.00013   30.4  -0.6   29   33-62     44-72  (93)
 30 PF14575 EphA2_TM:  Ephrin type  41.5     9.3  0.0002   27.6   0.1   31   36-67      4-34  (75)
 31 PHA02898 virion envelope prote  40.7     6.7 0.00014   30.1  -0.7   32   33-65     45-77  (92)
 32 PF07213 DAP10:  DAP10 membrane  39.8      18  0.0004   27.1   1.5   43   17-59     18-60  (79)
 33 KOG0123|consensus               36.2      19 0.00041   33.0   1.3   85   86-174   270-354 (369)
 34 PHA02902 putative IMV membrane  34.1      18 0.00039   26.5   0.6   19   44-62     11-29  (70)
 35 PF12768 Rax2:  Cortical protei  31.9      20 0.00044   31.8   0.8   35   32-66    229-266 (281)
 36 PF02480 Herpes_gE:  Alphaherpe  31.8      15 0.00033   34.5   0.0   26   34-59    353-378 (439)
 37 PF04391 DUF533:  Protein of un  31.1      17 0.00038   30.6   0.2  101   41-153    33-142 (188)
 38 TIGR01167 LPXTG_anchor LPXTG-m  30.2      26 0.00056   20.7   0.8   10   51-60     23-32  (34)
 39 PHA02681 ORF089 virion membran  29.2      21 0.00046   27.3   0.4   20   43-62     10-29  (92)
 40 PF05454 DAG1:  Dystroglycan (D  28.9      19  0.0004   32.5   0.0   21   44-64    157-177 (290)
 41 PF14851 FAM176:  FAM176 family  28.5      43 0.00093   27.7   2.0   18   34-51     26-43  (153)
 42 PHA02680 ORF090 IMV phosphoryl  27.8      20 0.00043   27.5   0.0   26   33-59     46-71  (91)
 43 PF04478 Mid2:  Mid2 like cell   27.3      70  0.0015   26.7   3.1   10   30-39     50-59  (154)
 44 PF05337 CSF-1:  Macrophage col  27.0      21 0.00046   32.4   0.0   32   45-76    238-269 (285)
 45 PF03908 Sec20:  Sec20;  InterP  25.6      27 0.00058   25.5   0.3   17   43-59     75-91  (92)
 46 PF09835 DUF2062:  Uncharacteri  25.5      16 0.00035   28.6  -0.9   27   38-64    120-153 (154)
 47 PF05795 Plasmodium_Vir:  Plasm  25.2      33 0.00071   29.2   0.8   23   35-58    281-303 (354)
 48 PF02480 Herpes_gE:  Alphaherpe  24.5      25 0.00054   33.2   0.0   45   19-63    342-386 (439)
 49 PF11897 DUF3417:  Protein of u  24.4   1E+02  0.0022   24.0   3.4   43  130-173    48-90  (118)
 50 PF06809 NPDC1:  Neural prolife  24.2 1.6E+02  0.0034   27.5   5.0   34   24-60    192-225 (341)
 51 PF07235 DUF1427:  Protein of u  23.5      44 0.00095   25.7   1.2   27  113-139    39-76  (90)
 52 PF02468 PsbN:  Photosystem II   23.3      37  0.0008   22.8   0.6   24   36-59      3-26  (43)
 53 PF02962 CHMI:  5-carboxymethyl  23.2      66  0.0014   25.4   2.1   42   98-140    64-106 (124)
 54 PRK11638 lipopolysaccharide bi  22.9      64  0.0014   29.4   2.3   22   40-61    320-341 (342)
 55 PF13396 PLDc_N:  Phospholipase  22.9      75  0.0016   20.1   2.0   22   39-60     25-46  (46)
 56 PF05545 FixQ:  Cbb3-type cytoc  22.8      31 0.00067   22.6   0.2   23   39-61     12-34  (49)
 57 PF14283 DUF4366:  Domain of un  22.8      77  0.0017   27.4   2.6   24   44-68    167-190 (218)
 58 PRK10905 cell division protein  22.5      37 0.00081   31.4   0.7   19   38-56      2-20  (328)
 59 PRK11556 multidrug efflux syst  22.2      20 0.00043   32.7  -1.1   17  123-139    88-104 (415)
 60 KOG0077|consensus               22.1      39 0.00084   29.1   0.7   30  112-153    12-44  (193)
 61 PHA03286 envelope glycoprotein  21.9      23 0.00049   34.3  -0.8   31   34-64    392-422 (492)
 62 KOG1236|consensus               21.8      84  0.0018   30.9   2.9   63   97-167   430-492 (565)
 63 PF11710 Git3:  G protein-coupl  21.7      30 0.00066   28.8  -0.0   33   30-62      3-35  (201)
 64 KOG4836|consensus               21.7      62  0.0013   28.4   1.8   23   30-52     81-103 (215)
 65 PRK15031 5-carboxymethyl-2-hyd  21.3      57  0.0012   25.9   1.4   41   98-139    65-106 (126)
 66 PRK11901 hypothetical protein;  21.2      48   0.001   30.7   1.1   19   38-56     40-58  (327)
 67 PF15065 NCU-G1:  Lysosomal tra  21.1      53  0.0012   30.3   1.4   29   33-61    320-348 (350)
 68 KOG3540|consensus               21.1      46 0.00099   32.8   1.0   54    5-62    523-576 (615)
 69 PF00226 DnaJ:  DnaJ domain;  I  20.9 1.8E+02  0.0039   18.9   3.7   41  115-170    18-58  (64)
 70 CHL00020 psbN photosystem II p  20.8      79  0.0017   21.3   1.8   23   37-59      4-26  (43)
 71 PTZ00208 65 kDa invariant surf  20.4      61  0.0013   31.1   1.7   31   38-68    388-421 (436)
 72 cd00580 CHMI 5-carboxymethyl-2  20.3      75  0.0016   23.8   1.9   37  106-142    71-111 (113)

No 1  
>smart00517 PolyA C-terminal domain of Poly(A)-binding protein. Present also in Drosophila hyperplastics discs protein. Involved in homodimerisation (either directly or indirectly)
Probab=99.97  E-value=3.8e-31  Score=187.71  Aligned_cols=64  Identities=67%  Similarity=0.994  Sum_probs=62.3

Q ss_pred             ChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHH
Q psy14481        106 QPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAH  169 (179)
Q Consensus       106 ~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~  169 (179)
                      +|++|||+|||+|||+|++++|++||||||||||||++|+++||+|+++|++||+||++||++|
T Consensus         1 ~p~~qkq~LGE~Lyp~V~~~~p~~A~KITGMLLEmd~~ell~lle~~~~L~~kv~EA~~vl~~~   64 (64)
T smart00517        1 PPQEQKQALGERLYPKVQALEPELAGKITGMLLEMDNSELLHLLESPELLRSKVDEALEVLKSH   64 (64)
T ss_pred             CHHHHHHHHhHHHhHHHHhhCcccCCcCeeeeeCCCHHHHHHHhcCHHHHHHHHHHHHHHHHhC
Confidence            5899999999999999999999999999999999999999999999999999999999999874


No 2  
>PF00658 PABP:  Poly-adenylate binding protein, unique domain;  InterPro: IPR002004 The polyadenylate-binding protein (PABP) has a conserved C-terminal domain (PABC), which is also found in the hyperplastic discs protein (HYD) family of ubiquitin ligases that contain HECT domains (IPR000569 from INTERPRO) []. PABP recognises the 3' mRNA poly(A) tail and plays an essential role in eukaryotic translation initiation and mRNA stabilisation/degradation. PABC domains of PABP are peptide-binding domains that mediate PABP homo-oligomerisation and protein-protein interactions. In mammals, the PABC domain of PABP functions to recruit several different translation factors to the mRNA poly(A) tail [].; GO: 0003723 RNA binding; PDB: 3KUR_E 1JH4_A 2RQH_B 3KUI_A 3KUS_A 3KUJ_A 3KTR_A 2X04_A 3PTH_A 1JGN_A ....
Probab=99.96  E-value=6.3e-30  Score=184.24  Aligned_cols=72  Identities=67%  Similarity=1.054  Sum_probs=70.6

Q ss_pred             CCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHH
Q psy14481         95 EPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVL  166 (179)
Q Consensus        95 epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL  166 (179)
                      +|+++++|+++++++||++|||+|||+|+.++|++||||||||||||++||++||+|++.|+.||+||++||
T Consensus         1 ~~~~~~~la~~~~~~qk~~LGe~Ly~~V~~~~p~~A~KITGMLLe~~~~ell~ll~~~~~L~~kv~eA~~vL   72 (72)
T PF00658_consen    1 EPLTASALASASPEQQKQILGERLYPLVQAIYPELAGKITGMLLEMDNSELLHLLEDPELLREKVQEAIEVL   72 (72)
T ss_dssp             SSTSHHHHHTSCHHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHTTSCHHHHHHHHHTHHHHHHHHHHHHHHH
T ss_pred             CcchHHHHhcCCHHHHHHHHhccccHHHHHhCcchhHHHHHHHhcCCHHHHHHHhCCHHHHHHHHHHHHHhC
Confidence            578999999999999999999999999999999999999999999999999999999999999999999987


No 3  
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.95  E-value=9.2e-29  Score=225.54  Aligned_cols=80  Identities=58%  Similarity=0.813  Sum_probs=77.0

Q ss_pred             ccccCCCCCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHH
Q psy14481         88 AVHVQGQEPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQ  167 (179)
Q Consensus        88 ~~~~~g~epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~  167 (179)
                      +++.+|++++++++|++++|++||+||||+|||+|++++|++||||||||||||++|||+||||+|+|++||+||++||+
T Consensus       483 ~~~~~~~~~~~~~~la~~~p~~q~~~lg~~~~~~~~~~~~~~~~~~tgm~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  562 (562)
T TIGR01628       483 TASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEAIEPALAAKITGMLLEMDNSELLHLLESPELLKSKVDEALEVLK  562 (562)
T ss_pred             CccccccchhHHHHHhhCCHHHHHHHHHHHhHHHHHhhChhhcCcceEEEecCCHHHHHHHhcCHHHHHHHHHHHHHHhC
Confidence            46677889999999999999999999999999999999999999999999999999999999999999999999999984


No 4  
>KOG0943|consensus
Probab=97.51  E-value=4.4e-05  Score=79.06  Aligned_cols=66  Identities=30%  Similarity=0.459  Sum_probs=61.2

Q ss_pred             HHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHH--HHHHHHHHhH
Q psy14481        108 QEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAV--AVLQAHQAKQ  173 (179)
Q Consensus       108 eeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~--~vL~~~q~~~  173 (179)
                      .-.+|.|||+||+.++.+.|..|.||||||||.....++..|..++.|+..|++|+  +++..|.++-
T Consensus      2605 sAe~Q~la~l~wgqhe~~l~a~~s~~sGfL~dek~~g~llgi~ce~p~eafVdaa~ndd~i~~H~qei 2672 (3015)
T KOG0943|consen 2605 SAERQALAELLWGQHEAMLPAFASKISGFLLDEKPAGLLLGIACEDPLEAFVDAAMNDDLIIAHGQEI 2672 (3015)
T ss_pred             hhhhccchhhcchhhhhHHHHHHHhhhhhhhccchHHHHHHHhhcCcHHHHHHHhcchHHHHHHHHHh
Confidence            34688999999999999999999999999999999999999999999999999999  8888887653


No 5  
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=95.43  E-value=0.0092  Score=45.12  Aligned_cols=33  Identities=27%  Similarity=0.460  Sum_probs=23.0

Q ss_pred             CCCceeeeeechhhhHHHHHHhh-hcceeeeeec
Q psy14481         27 EGGVNVAMISGISFGALALVTLF-GAVSFIMYRR   59 (179)
Q Consensus        27 ~~~~~~~~i~g~~~~~~~i~~~~-~~~~~~~yrr   59 (179)
                      .++++.|+|+||..|+++++..+ |.+-+++++|
T Consensus        60 ~~~ls~gaiagi~vg~~~~v~~lv~~l~w~f~~r   93 (96)
T PTZ00382         60 RSGLSTGAIAGISVAVVAVVGGLVGFLCWWFVCR   93 (96)
T ss_pred             CCCcccccEEEEEeehhhHHHHHHHHHhheeEEe
Confidence            36788999999999988777554 4444444434


No 6  
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=93.18  E-value=0.044  Score=50.21  Aligned_cols=34  Identities=26%  Similarity=0.503  Sum_probs=26.9

Q ss_pred             CCCCceeeeeechhhhHHHHHH-hhhcceeeeeec
Q psy14481         26 GEGGVNVAMISGISFGALALVT-LFGAVSFIMYRR   59 (179)
Q Consensus        26 ~~~~~~~~~i~g~~~~~~~i~~-~~~~~~~~~yrr   59 (179)
                      ..+||..|+|+||+.|++|+|. ++|.|-|+|.-|
T Consensus       360 n~s~LstgaIaGIsvavvvvVgglvGfLcWwf~cr  394 (397)
T PF03302_consen  360 NKSGLSTGAIAGISVAVVVVVGGLVGFLCWWFICR  394 (397)
T ss_pred             ccccccccceeeeeehhHHHHHHHHHHHhhheeec
Confidence            4578999999999999887666 777777776554


No 7  
>PF05808 Podoplanin:  Podoplanin;  InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=91.75  E-value=0.05  Score=45.32  Aligned_cols=37  Identities=19%  Similarity=0.445  Sum_probs=0.0

Q ss_pred             CCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeec
Q psy14481         23 PWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        23 ~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      ++...+|+..+.+.||..|.++-++++|++-|++.||
T Consensus       119 tt~ek~GL~T~tLVGIIVGVLlaIG~igGIIivvvRK  155 (162)
T PF05808_consen  119 TTVEKDGLSTVTLVGIIVGVLLAIGFIGGIIIVVVRK  155 (162)
T ss_dssp             -------------------------------------
T ss_pred             cccccCCcceeeeeeehhhHHHHHHHHhheeeEEeeh
Confidence            3456789999999999999999999999999999998


No 8  
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=87.34  E-value=0.68  Score=30.54  Aligned_cols=30  Identities=30%  Similarity=0.561  Sum_probs=23.8

Q ss_pred             eeeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481         32 VAMISGISFGALALVTLFGAVSFIMYRRYR   61 (179)
Q Consensus        32 ~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~   61 (179)
                      ++.--|......+|+..++.+.|++|||+.
T Consensus        11 vaIa~~VvVPV~vI~~vl~~~l~~~~rR~k   40 (40)
T PF08693_consen   11 VAIAVGVVVPVGVIIIVLGAFLFFWYRRKK   40 (40)
T ss_pred             EEEEEEEEechHHHHHHHHHHhheEEeccC
Confidence            344457788888999999999999999863


No 9  
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=84.10  E-value=0.33  Score=34.99  Aligned_cols=32  Identities=25%  Similarity=0.366  Sum_probs=0.4

Q ss_pred             CCCceeeeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481         27 EGGVNVAMISGISFGALALVTLFGAVSFIMYRRYR   61 (179)
Q Consensus        27 ~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~   61 (179)
                      ..++-+|+|+|+..|.   ++++..+.|++||-|.
T Consensus         7 ~~~vlaavIaG~Vvgl---l~ailLIlf~iyR~rk   38 (64)
T PF01034_consen    7 RSEVLAAVIAGGVVGL---LFAILLILFLIYRMRK   38 (64)
T ss_dssp             ---------------------------------S-
T ss_pred             cchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHh
Confidence            3444488888876544   4444556788888543


No 10 
>PF12301 CD99L2:  CD99 antigen like protein 2;  InterPro: IPR022078  This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum. 
Probab=82.52  E-value=0.51  Score=39.34  Aligned_cols=35  Identities=34%  Similarity=0.648  Sum_probs=23.4

Q ss_pred             CCCCceeeeeechhhhHHHHHHhhhcc-eeeeeecccC
Q psy14481         26 GEGGVNVAMISGISFGALALVTLFGAV-SFIMYRRYRW   62 (179)
Q Consensus        26 ~~~~~~~~~i~g~~~~~~~i~~~~~~~-~~~~yrr~~~   62 (179)
                      ..+....+.|+||.  +.|.++++|++ |||-|.||.|
T Consensus       108 ~~~~~~~g~IaGIv--sav~valvGAvsSyiaYqkKKl  143 (169)
T PF12301_consen  108 QDGEAEAGTIAGIV--SAVVVALVGAVSSYIAYQKKKL  143 (169)
T ss_pred             cccCcccchhhhHH--HHHHHHHHHHHHHHHHHHhhcc
Confidence            34556678889987  44455566654 5666988877


No 11 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=78.10  E-value=0.48  Score=37.67  Aligned_cols=16  Identities=38%  Similarity=0.576  Sum_probs=11.4

Q ss_pred             CceeeeeechhhhHHH
Q psy14481         29 GVNVAMISGISFGALA   44 (179)
Q Consensus        29 ~~~~~~i~g~~~~~~~   44 (179)
                      ++..++|+||+||++.
T Consensus        60 ~fs~~~i~~Ii~gv~a   75 (122)
T PF01102_consen   60 RFSEPAIIGIIFGVMA   75 (122)
T ss_dssp             SSS-TCHHHHHHHHHH
T ss_pred             CccccceeehhHHHHH
Confidence            4556789999998653


No 12 
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=74.26  E-value=4.4  Score=32.74  Aligned_cols=25  Identities=24%  Similarity=0.299  Sum_probs=17.1

Q ss_pred             CCceeeeeechhhhHHHHHHhhhcc
Q psy14481         28 GGVNVAMISGISFGALALVTLFGAV   52 (179)
Q Consensus        28 ~~~~~~~i~g~~~~~~~i~~~~~~~   52 (179)
                      .+..+++|.||.+++++|+.+++.+
T Consensus        74 ~~~~~~iivgvi~~Vi~Iv~~Iv~~   98 (179)
T PF13908_consen   74 IYFITGIIVGVICGVIAIVVLIVCF   98 (179)
T ss_pred             ccceeeeeeehhhHHHHHHHhHhhh
Confidence            4455777888888877776666544


No 13 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.72  E-value=0.71  Score=41.49  Aligned_cols=29  Identities=24%  Similarity=0.414  Sum_probs=24.5

Q ss_pred             eeeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481         33 AMISGISFGALALVTLFGAVSFIM-YRRYR   61 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~-yrr~~   61 (179)
                      -+|.+|..+.++||..+.++-+|| |||+.
T Consensus       256 t~I~aSiiaIliIVLIMvIIYLILRYRRKK  285 (299)
T PF02009_consen  256 TAIIASIIAILIIVLIMVIIYLILRYRRKK  285 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            457888999999999999999988 88843


No 14 
>PF14828 Amnionless:  Amnionless
Probab=72.72  E-value=4.9  Score=37.76  Aligned_cols=25  Identities=32%  Similarity=0.497  Sum_probs=20.1

Q ss_pred             eeeechhhhHHHHHHhhhcceeeee
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMY   57 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~y   57 (179)
                      +.|.|+.+++++++++++++.|..|
T Consensus       338 ~~v~~~vl~~Lllv~ll~~~~ll~~  362 (437)
T PF14828_consen  338 GTVVGIVLGCLLLVALLFGVILLYR  362 (437)
T ss_pred             ceeeeehHHHHHHHHHHHHhheEEe
Confidence            3678999999999999988776554


No 15 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=72.43  E-value=3.1  Score=34.58  Aligned_cols=29  Identities=17%  Similarity=0.403  Sum_probs=23.4

Q ss_pred             CCceeeeeechhhhHHHHHHhhhcceeee
Q psy14481         28 GGVNVAMISGISFGALALVTLFGAVSFIM   56 (179)
Q Consensus        28 ~~~~~~~i~g~~~~~~~i~~~~~~~~~~~   56 (179)
                      ..+..++|+||.||+|.|+..++++.+-+
T Consensus        56 ~~lsgtAIaGIVfgiVfimgvva~i~icv   84 (155)
T PF10873_consen   56 DVLSGTAIAGIVFGIVFIMGVVAGIAICV   84 (155)
T ss_pred             cccccceeeeeehhhHHHHHHHHHHHHHH
Confidence            45567789999999999999888776554


No 16 
>PF15102 TMEM154:  TMEM154 protein family
Probab=69.15  E-value=9.5  Score=31.43  Aligned_cols=35  Identities=23%  Similarity=0.241  Sum_probs=26.5

Q ss_pred             CCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         28 GGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        28 ~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      ...++.++-+|-+..++|+.+++.+.++.|||++-
T Consensus        53 ~q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~   87 (146)
T PF15102_consen   53 SQLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRT   87 (146)
T ss_pred             CCcceEEEEeHHHHHHHHHHHHHHHheeEEeeccc
Confidence            45566778888888888888888777777877665


No 17 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=64.66  E-value=3.4  Score=35.53  Aligned_cols=36  Identities=22%  Similarity=0.468  Sum_probs=29.2

Q ss_pred             eeeechhhhHHHHHHhhhcceeeeeecccCCC-Cccc
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMYRRYRWNK-PQTL   68 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n~-pq~Y   68 (179)
                      ..|+=.++|++.++.+++..+|++..||.|++ .|++
T Consensus       101 ~lI~lv~~g~~lLla~~~~~~Y~~~~Rrs~~~~~~rl  137 (202)
T PF06365_consen  101 TLIALVTSGSFLLLAILLGAGYCCHQRRSWSKKGQRL  137 (202)
T ss_pred             EEEehHHhhHHHHHHHHHHHHHHhhhhccCCcchhhh
Confidence            56777888888899999999999999999954 4443


No 18 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=64.35  E-value=2  Score=33.28  Aligned_cols=34  Identities=18%  Similarity=0.128  Sum_probs=15.6

Q ss_pred             hhHHHHHHhhhcceeeeeecccC---CCCcccccCCC
Q psy14481         40 FGALALVTLFGAVSFIMYRRYRW---NKPQTLSDKCS   73 (179)
Q Consensus        40 ~~~~~i~~~~~~~~~~~yrr~~~---n~pq~YS~~vR   73 (179)
                      |++++++.++..+++++.+||+.   .+|..+|+=+.
T Consensus         5 ~~iii~~i~l~~~~~~~~~rRR~r~G~~P~~gt~w~~   41 (130)
T PF12273_consen    5 FAIIIVAILLFLFLFYCHNRRRRRRGLQPIYGTRWMA   41 (130)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCceecC
Confidence            44444444444444444222222   35766665444


No 19 
>PHA03265 envelope glycoprotein D; Provisional
Probab=63.98  E-value=3.4  Score=38.70  Aligned_cols=39  Identities=26%  Similarity=0.430  Sum_probs=28.6

Q ss_pred             CCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         21 SNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        21 ~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      -.+.|++.+-++|+|-|.+.+++|+|   |+|.|++.|||.-
T Consensus       339 ~~~~s~~~~~~~g~~ig~~i~glv~v---g~il~~~~rr~k~  377 (402)
T PHA03265        339 PPPTSKSNSTFVGISVGLGIAGLVLV---GVILYVCLRRKKE  377 (402)
T ss_pred             CCCCCCCCCcccceEEccchhhhhhh---hHHHHHHhhhhhh
Confidence            34556666677888888887777764   6788999888654


No 20 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=61.67  E-value=4.4  Score=35.56  Aligned_cols=28  Identities=25%  Similarity=0.547  Sum_probs=20.3

Q ss_pred             eeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         32 VAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        32 ~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      +-.|-|++|++++|+.+   ++|+++|||.-
T Consensus       273 vPIaVG~~La~lvlivL---iaYli~Rrr~~  300 (306)
T PF01299_consen  273 VPIAVGAALAGLVLIVL---IAYLIGRRRSR  300 (306)
T ss_pred             HHHHHHHHHHHHHHHHH---HhheeEecccc
Confidence            55567888887777654   57999888754


No 21 
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=60.05  E-value=5.1  Score=30.00  Aligned_cols=32  Identities=16%  Similarity=0.169  Sum_probs=26.8

Q ss_pred             eeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         31 NVAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        31 ~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      ....||++..|...++++..|+++..||+..-
T Consensus        50 K~i~iS~ias~la~lv~t~~G~g~y~~~kk~~   81 (85)
T TIGR01495        50 KIILYSSIASGLALLVGAGVGLGYYYKKKNKD   81 (85)
T ss_pred             ceeehHHHHHHHHHHHHHHHHHhhhhhccccc
Confidence            36789999999999998888999988887543


No 22 
>PF15069 FAM163:  FAM163 family
Probab=56.20  E-value=7.7  Score=31.93  Aligned_cols=24  Identities=13%  Similarity=0.400  Sum_probs=19.5

Q ss_pred             eeeeeechhhhHHHHHHhhhccee
Q psy14481         31 NVAMISGISFGALALVTLFGAVSF   54 (179)
Q Consensus        31 ~~~~i~g~~~~~~~i~~~~~~~~~   54 (179)
                      ..++|||..||.|.|++++..+-|
T Consensus         4 GTvVItGgILAtVILLcIIaVLCY   27 (143)
T PF15069_consen    4 GTVVITGGILATVILLCIIAVLCY   27 (143)
T ss_pred             eeEEEechHHHHHHHHHHHHHHHH
Confidence            367899999999999988765544


No 23 
>PTZ00046 rifin; Provisional
Probab=54.25  E-value=3.4  Score=38.34  Aligned_cols=28  Identities=21%  Similarity=0.395  Sum_probs=23.1

Q ss_pred             eeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481         34 MISGISFGALALVTLFGAVSFIM-YRRYR   61 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~~~~~~-yrr~~   61 (179)
                      +|..+..|.+|||..+.++=.|| |||+.
T Consensus       316 aIiaSiiAIvVIVLIMvIIYLILRYRRKK  344 (358)
T PTZ00046        316 AIIASIVAIVVIVLIMVIIYLILRYRRKK  344 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            46677889999999999888888 88853


No 24 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=53.15  E-value=3.6  Score=38.11  Aligned_cols=28  Identities=25%  Similarity=0.401  Sum_probs=23.1

Q ss_pred             eeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481         34 MISGISFGALALVTLFGAVSFIM-YRRYR   61 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~~~~~~-yrr~~   61 (179)
                      +|..+..|.+|||..+.++=.|| |||+.
T Consensus       311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK  339 (353)
T TIGR01477       311 PIIASIIAILIIVLIMVIIYLILRYRRKK  339 (353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            46677889999999999888888 88854


No 25 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=47.45  E-value=9.8  Score=33.73  Aligned_cols=30  Identities=20%  Similarity=0.253  Sum_probs=17.5

Q ss_pred             CCCceeeeeechhhhH-HHHHHhhhcceeee
Q psy14481         27 EGGVNVAMISGISFGA-LALVTLFGAVSFIM   56 (179)
Q Consensus        27 ~~~~~~~~i~g~~~~~-~~i~~~~~~~~~~~   56 (179)
                      ..-++.|.+-+|+||+ ++++.++|.+++++
T Consensus       221 ~~~l~~G~VVlIslAiALG~v~ll~l~Gii~  251 (281)
T PF12768_consen  221 GKKLSRGFVVLISLAIALGTVFLLVLIGIIL  251 (281)
T ss_pred             cccccceEEEEEehHHHHHHHHHHHHHHHHH
Confidence            4666667777766664 34455555555555


No 26 
>PF02158 Neuregulin:  Neuregulin family;  InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers.  The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission.   The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=45.48  E-value=7  Score=36.89  Aligned_cols=24  Identities=29%  Similarity=0.441  Sum_probs=0.0

Q ss_pred             eeeeechhhhHHHHHHhhhcc-eeee
Q psy14481         32 VAMISGISFGALALVTLFGAV-SFIM   56 (179)
Q Consensus        32 ~~~i~g~~~~~~~i~~~~~~~-~~~~   56 (179)
                      |-.|||||+|.+| |+++|.+ .|.=
T Consensus         9 VLTITgIcvaLlV-VGi~Cvv~aYCK   33 (404)
T PF02158_consen    9 VLTITGICVALLV-VGIVCVVDAYCK   33 (404)
T ss_dssp             --------------------------
T ss_pred             hhhhhhhhHHHHH-HHHHHHHHHHHH
Confidence            4568999998764 6777777 6655


No 27 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=45.24  E-value=6.7  Score=31.81  Aligned_cols=34  Identities=24%  Similarity=0.426  Sum_probs=1.6

Q ss_pred             eeeeechhhhHHHHHHhhhcc-eeeeeecccCCCC
Q psy14481         32 VAMISGISFGALALVTLFGAV-SFIMYRRYRWNKP   65 (179)
Q Consensus        32 ~~~i~g~~~~~~~i~~~~~~~-~~~~yrr~~~n~p   65 (179)
                      +..+..|.+++++++...+.+ +|+.|||.|-.++
T Consensus        75 ~~l~~pi~~sal~v~lVl~llsg~lv~rrcrrr~~  109 (129)
T PF12191_consen   75 FPLLWPILGSALSVVLVLALLSGFLVWRRCRRREK  109 (129)
T ss_dssp             SSSS-------------------------------
T ss_pred             cceehhhhhhHHHHHHHHHHHHHHHHHhhhhcccc
Confidence            334455555544444444444 5555666554333


No 28 
>PF05283 MGC-24:  Multi-glycosylated core protein 24 (MGC-24);  InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein [].  Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution [].  CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments. 
Probab=44.77  E-value=7.3  Score=33.05  Aligned_cols=27  Identities=19%  Similarity=0.262  Sum_probs=21.9

Q ss_pred             eeechhhhHHHHHHhhhcceeeeeecc
Q psy14481         34 MISGISFGALALVTLFGAVSFIMYRRY   60 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~~~~~~yrr~   60 (179)
                      .=+|+.+|+.||+..+=+|.||+||-+
T Consensus       158 FD~~SFiGGIVL~LGv~aI~ff~~KF~  184 (186)
T PF05283_consen  158 FDAASFIGGIVLTLGVLAIIFFLYKFC  184 (186)
T ss_pred             CchhhhhhHHHHHHHHHHHHHHHhhhc
Confidence            346788899999988889999999743


No 29 
>PHA03048 IMV membrane protein; Provisional
Probab=42.87  E-value=6.2  Score=30.39  Aligned_cols=29  Identities=34%  Similarity=0.641  Sum_probs=22.2

Q ss_pred             eeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      =+++++|| .++++.++|.+-|-+|+|+--
T Consensus        44 Ralsii~F-Ilgivl~lG~~ifsmy~r~C~   72 (93)
T PHA03048         44 RALSGIAF-VLGIVMTIGMLIYSMWGRYCT   72 (93)
T ss_pred             HHHHHHHH-HHHHHHHHHHHHHHHHhcccC
Confidence            35678887 577888889888989987543


No 30 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=41.47  E-value=9.3  Score=27.58  Aligned_cols=31  Identities=26%  Similarity=0.562  Sum_probs=10.8

Q ss_pred             echhhhHHHHHHhhhcceeeeeecccCCCCcc
Q psy14481         36 SGISFGALALVTLFGAVSFIMYRRYRWNKPQT   67 (179)
Q Consensus        36 ~g~~~~~~~i~~~~~~~~~~~yrr~~~n~pq~   67 (179)
                      .+++.|.++|+.++. +.++..||+.+.|...
T Consensus         4 ~~~~~g~~~ll~~v~-~~~~~~rr~~~~~~~~   34 (75)
T PF14575_consen    4 ASIIVGVLLLLVLVI-IVIVCFRRCKYSKAKS   34 (75)
T ss_dssp             HHHHHHHHHHHHHHH-HHHCCCTT--------
T ss_pred             ehHHHHHHHHHHhhe-eEEEEEeeEcCCCCcc
Confidence            344445444444433 3444555555545433


No 31 
>PHA02898 virion envelope protein; Provisional
Probab=40.66  E-value=6.7  Score=30.15  Aligned_cols=32  Identities=34%  Similarity=0.466  Sum_probs=23.0

Q ss_pred             eeeechhhhHHHHHHhhhcceeeeeecccC-CCC
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMYRRYRW-NKP   65 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~-n~p   65 (179)
                      =+++++|| .++++.++|.+-|-+|+|+-- |+|
T Consensus        45 RalSii~F-Ilgivl~lG~~ifs~y~r~C~~~~~   77 (92)
T PHA02898         45 RSISIISF-ILAIILILGIIFFKGYNMFCGGNTT   77 (92)
T ss_pred             HHHHHHHH-HHHHHHHHHHHHHHHHhhhcCCCcc
Confidence            35678887 567788888888888887543 444


No 32 
>PF07213 DAP10:  DAP10 membrane protein;  InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=39.80  E-value=18  Score=27.07  Aligned_cols=43  Identities=21%  Similarity=0.220  Sum_probs=34.5

Q ss_pred             hccCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeec
Q psy14481         17 AEDLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      +.+.+.+-++=+.++.+..+||.+|=.++-.++...-|..-|.
T Consensus        18 aq~~~gscs~C~~ls~g~LaGiV~~D~vlTLLIv~~vy~car~   60 (79)
T PF07213_consen   18 AQTQPGSCSGCYPLSPGLLAGIVAADAVLTLLIVLVVYYCARP   60 (79)
T ss_pred             hcCCCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence            4566678888899999999999999888877777777777443


No 33 
>KOG0123|consensus
Probab=36.23  E-value=19  Score=33.04  Aligned_cols=85  Identities=15%  Similarity=0.004  Sum_probs=73.1

Q ss_pred             ccccccCCCCCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHH
Q psy14481         86 LRAVHVQGQEPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAV  165 (179)
Q Consensus        86 ~~~~~~~g~epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~v  165 (179)
                      .+.+++++    ...++......++.+.+||....+|...+...+.....+..+.+.+....|.+-.+.+..+....+++
T Consensus       270 ~~nl~vkn----ld~~~~~e~L~~~f~~~GeI~s~kv~~~~~g~skG~gfV~fs~~eeA~~A~~~~n~~~i~~k~l~vav  345 (369)
T KOG0123|consen  270 GANLYVKN----LDETLSDEKLRKIFSSFGEITSAKVMVDENGKSKGFGFVEFSSPEEAKKAMTEMNGRLIGGKPLYVAV  345 (369)
T ss_pred             cccccccc----CccccchhHHHHHHhcccceeeEEEEeccCCCccceEEEEcCCHHHHHHHHHhhChhhhcCCchhhhH
Confidence            36677777    44566777888999999999999999999999999999999999999999999999899999999999


Q ss_pred             HHHHHHhHH
Q psy14481        166 LQAHQAKQA  174 (179)
Q Consensus       166 L~~~q~~~~  174 (179)
                      .+.++....
T Consensus       346 ~qr~~~r~~  354 (369)
T KOG0123|consen  346 AQRKEDRRA  354 (369)
T ss_pred             Hhhhccchh
Confidence            987766544


No 34 
>PHA02902 putative IMV membrane protein; Provisional
Probab=34.08  E-value=18  Score=26.50  Aligned_cols=19  Identities=26%  Similarity=0.712  Sum_probs=13.7

Q ss_pred             HHHHhhhcceeeeeecccC
Q psy14481         44 ALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        44 ~i~~~~~~~~~~~yrr~~~   62 (179)
                      ++|.++|-+-|.+|||+.-
T Consensus        11 v~v~Ivclliya~YrR~kc   29 (70)
T PHA02902         11 VIVIIFCLLIYAAYKRYKC   29 (70)
T ss_pred             HHHHHHHHHHHHHHHHhcC
Confidence            4556667778889999744


No 35 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=31.92  E-value=20  Score=31.75  Aligned_cols=35  Identities=31%  Similarity=0.471  Sum_probs=27.8

Q ss_pred             eeeee-chhhhHHHHHHhhhcceeeeeec--ccCCCCc
Q psy14481         32 VAMIS-GISFGALALVTLFGAVSFIMYRR--YRWNKPQ   66 (179)
Q Consensus        32 ~~~i~-g~~~~~~~i~~~~~~~~~~~yrr--~~~n~pq   66 (179)
                      |.+|+ +|+||++.|+.++|.+-..+.||  .|...|+
T Consensus       229 VVlIslAiALG~v~ll~l~Gii~~~~~r~~~~~~~~p~  266 (281)
T PF12768_consen  229 VVLISLAIALGTVFLLVLIGIILAYIRRRRQGYVPAPT  266 (281)
T ss_pred             EEEEehHHHHHHHHHHHHHHHHHHHHHhhhccCcCCCc
Confidence            66776 89999999999999998888777  3444455


No 36 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=31.83  E-value=15  Score=34.52  Aligned_cols=26  Identities=19%  Similarity=0.544  Sum_probs=0.0

Q ss_pred             eeechhhhHHHHHHhhhcceeeeeec
Q psy14481         34 MISGISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      .+.|+.+|+++++.+++++.++..||
T Consensus       353 ~~l~vVlgvavlivVv~viv~vc~~~  378 (439)
T PF02480_consen  353 ALLGVVLGVAVLIVVVGVIVWVCLRC  378 (439)
T ss_dssp             --------------------------
T ss_pred             chHHHHHHHHHHHHHHHHHhheeeee
Confidence            44555555566666655555555433


No 37 
>PF04391 DUF533:  Protein of unknown function (DUF533);  InterPro: IPR007486 Some family members may be secreted or integral membrane proteins.
Probab=31.10  E-value=17  Score=30.60  Aligned_cols=101  Identities=17%  Similarity=0.180  Sum_probs=49.7

Q ss_pred             hHHHHHHhhhcceeeeeecccCCCCcccccCCCCCCCCCCCCCccccccccCCCCCCChhHhhcC------ChHHHHHHh
Q psy14481         41 GALALVTLFGAVSFIMYRRYRWNKPQTLSDKCSNPDSSGYIDDSTLRAVHVQGQEPLTSTMLAAA------QPQEQKQML  114 (179)
Q Consensus        41 ~~~~i~~~~~~~~~~~yrr~~~n~pq~YS~~vRNP~s~~~iqDSt~~~~~~~g~epl~~s~La~a------~peeQKq~L  114 (179)
                      ...+.++++|+++|-.|+++--|++.       ++...+..-+.+.++..-.-...+.-.|++.+      .++|+..  
T Consensus        33 ~~~Gg~AalG~lA~~ayq~~q~~~~~-------~~~~~~~~~~~~~~~~~~~~a~lllrAMIaAAkADG~ID~~Er~~--  103 (188)
T PF04391_consen   33 LKYGGLAALGGLAYKAYQNWQQNQGA-------APAAEPQPFAQPPPAEAEAHARLLLRAMIAAAKADGHIDEEERQR--  103 (188)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhccCCC-------CCCCCCCCCCCCCcccccHHHHHHHHHHHHHHHcCCCCCHHHHHH--
Confidence            34456677888888888776544443       11100000011111111112223444555554      4556665  


Q ss_pred             hhhhhhhHhhh--CCCCCCCceeeccC-CChHHHHHhhcChH
Q psy14481        115 GERLFPLIQRM--YPELAGKITGMLLE-IDNAELLYMLEHNE  153 (179)
Q Consensus       115 GErLyplV~~l--~p~~AgKITGMLLE-md~sELL~mLes~e  153 (179)
                         |+..+.+.  +++...-|.-.|-. +|..+|..-+.||+
T Consensus       104 ---I~~~l~~~g~d~e~~~~l~~eL~~P~d~~~la~~v~~~e  142 (188)
T PF04391_consen  104 ---IEGALQELGLDAEERAWLQAELAAPLDPDALAAAVTDPE  142 (188)
T ss_pred             ---HHHHHHHhCCCHHHHHHHHHHHhCCCCHHHHHHhCCCHH
Confidence               45556553  44444444444444 67777777776665


No 38 
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=30.17  E-value=26  Score=20.74  Aligned_cols=10  Identities=10%  Similarity=0.235  Sum_probs=4.1

Q ss_pred             cceeeeeecc
Q psy14481         51 AVSFIMYRRY   60 (179)
Q Consensus        51 ~~~~~~yrr~   60 (179)
                      +.++++.|||
T Consensus        23 ~~~~~~~~rk   32 (34)
T TIGR01167        23 LGGLLLRKRK   32 (34)
T ss_pred             HHHHHheecc
Confidence            3344444443


No 39 
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=29.17  E-value=21  Score=27.30  Aligned_cols=20  Identities=20%  Similarity=0.443  Sum_probs=16.3

Q ss_pred             HHHHHhhhcceeeeeecccC
Q psy14481         43 LALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        43 ~~i~~~~~~~~~~~yrr~~~   62 (179)
                      ++++.++|-+-|.+|||...
T Consensus        10 ~V~V~IVclliya~YRR~~i   29 (92)
T PHA02681         10 VIVISIVCYIVIMMYRRSCV   29 (92)
T ss_pred             HHHHHHHHHHHHHHHHhccC
Confidence            67777888888999999854


No 40 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=28.95  E-value=19  Score=32.53  Aligned_cols=21  Identities=38%  Similarity=0.610  Sum_probs=0.0

Q ss_pred             HHHHhhhcceeeeeecccCCC
Q psy14481         44 ALVTLFGAVSFIMYRRYRWNK   64 (179)
Q Consensus        44 ~i~~~~~~~~~~~yrr~~~n~   64 (179)
                      +|+.+.|++..+.||||+-.|
T Consensus       157 ~iLLIA~iIa~icyrrkR~GK  177 (290)
T PF05454_consen  157 AILLIAGIIACICYRRKRKGK  177 (290)
T ss_dssp             ---------------------
T ss_pred             HHHHHHHHHHHHhhhhhhccc
Confidence            344457788889999987644


No 41 
>PF14851 FAM176:  FAM176 family
Probab=28.45  E-value=43  Score=27.67  Aligned_cols=18  Identities=28%  Similarity=0.455  Sum_probs=14.9

Q ss_pred             eeechhhhHHHHHHhhhc
Q psy14481         34 MISGISFGALALVTLFGA   51 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~   51 (179)
                      .++|+|+|-+..+|++++
T Consensus        26 Fv~gVC~GLlLtLcllV~   43 (153)
T PF14851_consen   26 FVSGVCAGLLLTLCLLVI   43 (153)
T ss_pred             HHHHHHHHHHHHHHHHHh
Confidence            478999999988888844


No 42 
>PHA02680 ORF090 IMV phosphorylated membrane protein; Provisional
Probab=27.82  E-value=20  Score=27.53  Aligned_cols=26  Identities=35%  Similarity=0.571  Sum_probs=20.2

Q ss_pred             eeeechhhhHHHHHHhhhcceeeeeec
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      =+++++|| .++++.+.|.+.|-+||+
T Consensus        46 RalSii~F-IlG~vl~lGilifs~y~~   71 (91)
T PHA02680         46 RALSVTCF-IVGAVLLLGLFVFSMYRK   71 (91)
T ss_pred             HHHHHHHH-HHHHHHHHHHHHHHHhcc
Confidence            45688887 567788888888888984


No 43 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=27.30  E-value=70  Score=26.69  Aligned_cols=10  Identities=10%  Similarity=0.610  Sum_probs=4.5

Q ss_pred             ceeeeeechh
Q psy14481         30 VNVAMISGIS   39 (179)
Q Consensus        30 ~~~~~i~g~~   39 (179)
                      +-+|.+-|++
T Consensus        50 IVIGvVVGVG   59 (154)
T PF04478_consen   50 IVIGVVVGVG   59 (154)
T ss_pred             EEEEEEeccc
Confidence            3345444443


No 44 
>PF05337 CSF-1:  Macrophage colony stimulating factor-1 (CSF-1);  InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=26.98  E-value=21  Score=32.38  Aligned_cols=32  Identities=25%  Similarity=0.461  Sum_probs=0.0

Q ss_pred             HHHhhhcceeeeeecccCCCCcccccCCCCCC
Q psy14481         45 LVTLFGAVSFIMYRRYRWNKPQTLSDKCSNPD   76 (179)
Q Consensus        45 i~~~~~~~~~~~yrr~~~n~pq~YS~~vRNP~   76 (179)
                      ++..+|++.|.-||||..-.||+-...+.-|.
T Consensus       238 VLLaVGGLLfYr~rrRs~~e~q~~d~~~~~Pe  269 (285)
T PF05337_consen  238 VLLAVGGLLFYRRRRRSHREPQTVDSPMEQPE  269 (285)
T ss_dssp             --------------------------------
T ss_pred             hhhhccceeeecccccccccccccCCcccCCC
Confidence            34566777776666666677887776666553


No 45 
>PF03908 Sec20:  Sec20;  InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=25.57  E-value=27  Score=25.51  Aligned_cols=17  Identities=29%  Similarity=0.635  Sum_probs=12.6

Q ss_pred             HHHHHhhhcceeeeeec
Q psy14481         43 LALVTLFGAVSFIMYRR   59 (179)
Q Consensus        43 ~~i~~~~~~~~~~~yrr   59 (179)
                      ++++..++.+.||++||
T Consensus        75 ~~~~~f~~~v~yI~~rR   91 (92)
T PF03908_consen   75 FAFLFFLLVVLYILWRR   91 (92)
T ss_pred             HHHHHHHHHHHHHhhhc
Confidence            45566667788999987


No 46 
>PF09835 DUF2062:  Uncharacterized protein conserved in bacteria (DUF2062);  InterPro: IPR018639  This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=25.52  E-value=16  Score=28.59  Aligned_cols=27  Identities=30%  Similarity=0.688  Sum_probs=16.5

Q ss_pred             hhhhHHHHHHhhhcceeee-------eecccCCC
Q psy14481         38 ISFGALALVTLFGAVSFIM-------YRRYRWNK   64 (179)
Q Consensus        38 ~~~~~~~i~~~~~~~~~~~-------yrr~~~n~   64 (179)
                      ..+|++++...++.++|++       ||||+++|
T Consensus       120 ~~~G~~i~~~v~~~i~Y~l~~~~~~~~r~~r~~~  153 (154)
T PF09835_consen  120 FLLGSLILGIVLGIISYFLVYFLVRKYRKRRRKR  153 (154)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            3456666666666666665       77776643


No 47 
>PF05795 Plasmodium_Vir:  Plasmodium vivax Vir protein;  InterPro: IPR008780 This family consists of several Vir proteins specific to the genus Plasmodium and Plasmodium vivax in particular. The vir genes are present at about 600-1,000 copies per haploid genome and encode proteins that are immunovariant in natural infections, indicating that they may have a functional role in establishing chronic infection through antigenic variation [].
Probab=25.22  E-value=33  Score=29.16  Aligned_cols=23  Identities=17%  Similarity=0.260  Sum_probs=13.2

Q ss_pred             eechhhhHHHHHHhhhcceeeeee
Q psy14481         35 ISGISFGALALVTLFGAVSFIMYR   58 (179)
Q Consensus        35 i~g~~~~~~~i~~~~~~~~~~~yr   58 (179)
                      ++++. ++++.+..+.+++|+||+
T Consensus       281 ~~~~v-~~~~~~~G~~~~~f~LYK  303 (354)
T PF05795_consen  281 FSTSV-SPVLSVLGIPLIFFLLYK  303 (354)
T ss_pred             cccch-hhhhhhHHHHHHHHHHhc
Confidence            34444 334444455567788888


No 48 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=24.55  E-value=25  Score=33.16  Aligned_cols=45  Identities=11%  Similarity=0.158  Sum_probs=0.0

Q ss_pred             cCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccCC
Q psy14481         19 DLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRWN   63 (179)
Q Consensus        19 ~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n   63 (179)
                      ..++.+...+..-++++.|+..+.++++.+++...+.-.||+..+
T Consensus       342 ~~p~~~~~~~~~~l~vVlgvavlivVv~viv~vc~~~rrrR~~~~  386 (439)
T PF02480_consen  342 APPSPRTSRGAALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQRD  386 (439)
T ss_dssp             ---------------------------------------------
T ss_pred             CCCCCCCCcccchHHHHHHHHHHHHHHHHHhheeeeehhcccccc
Confidence            334445555566577777877777777777776666666655554


No 49 
>PF11897 DUF3417:  Protein of unknown function (DUF3417);  InterPro: IPR024517 This domain of unknown function is found at the N terminus of members of the glycogen phosphorylase family. 
Probab=24.42  E-value=1e+02  Score=24.04  Aligned_cols=43  Identities=12%  Similarity=0.191  Sum_probs=36.0

Q ss_pred             CCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHHHHhH
Q psy14481        130 AGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAHQAKQ  173 (179)
Q Consensus       130 AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~q~~~  173 (179)
                      -..-.-||-+++.+.|-.+.+|++ +..+++++++-++.+...+
T Consensus        48 ~~NPv~~L~~vs~~rL~~la~D~~-fl~~~~~v~~~f~~Ym~~~   90 (118)
T PF11897_consen   48 GHNPVRLLQEVSQERLEELAQDPE-FLARYDRVYARFEEYMSQK   90 (118)
T ss_pred             CCCHHHHHHHCCHHHHHHHhCCHH-HHHHHHHHHHHHHHHHcCC
Confidence            344556889999999999999997 8889999999999987763


No 50 
>PF06809 NPDC1:  Neural proliferation differentiation control-1 protein (NPDC1);  InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=24.19  E-value=1.6e+02  Score=27.55  Aligned_cols=34  Identities=18%  Similarity=0.190  Sum_probs=20.2

Q ss_pred             CCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecc
Q psy14481         24 WSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRY   60 (179)
Q Consensus        24 ~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~   60 (179)
                      .+.++++-+++|++.|+++++   ++...+|.+||-.
T Consensus       192 ~~~~d~l~lv~Iv~~cvaG~a---Aliva~~cW~Rlq  225 (341)
T PF06809_consen  192 GSHGDGLTLVLIVVCCVAGAA---ALIVAGYCWYRLQ  225 (341)
T ss_pred             CCCCCCeeeehhHHHHHHHHH---HHHHhhheEEEec
Confidence            334566678888888886554   2233445556543


No 51 
>PF07235 DUF1427:  Protein of unknown function (DUF1427);  InterPro: IPR009872 This family consists of several bacterial proteins of around 100 residues in length. The function of this family is unknown.
Probab=23.52  E-value=44  Score=25.67  Aligned_cols=27  Identities=33%  Similarity=0.602  Sum_probs=20.9

Q ss_pred             HhhhhhhhhHhhh-----------CCCCCCCceeeccC
Q psy14481        113 MLGERLFPLIQRM-----------YPELAGKITGMLLE  139 (179)
Q Consensus       113 ~LGErLyplV~~l-----------~p~~AgKITGMLLE  139 (179)
                      ++||+++|.+++.           +.+-+.+|+|.|=.
T Consensus        39 ~~Ge~~~p~~~~~~~~~~~~~a~~~~~c~~hv~G~lp~   76 (90)
T PF07235_consen   39 LLGEQAIPLVRRLLRGHPLTAAWFHAECVPHVFGQLPG   76 (90)
T ss_pred             hcccchhHHHHHHhcCCchhhhhhhccccccccCCCCC
Confidence            6899999999876           33567888888754


No 52 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=23.26  E-value=37  Score=22.78  Aligned_cols=24  Identities=8%  Similarity=0.249  Sum_probs=18.0

Q ss_pred             echhhhHHHHHHhhhcceeeeeec
Q psy14481         36 SGISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        36 ~g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      +...+++++.+.++|..+|.+|.-
T Consensus         3 ~a~~~~i~i~~~lv~~Tgy~iYta   26 (43)
T PF02468_consen    3 TATVLAIFISCLLVSITGYAIYTA   26 (43)
T ss_pred             ceeeHHHHHHHHHHHHHhhhhhhe
Confidence            345667777777888888998875


No 53 
>PF02962 CHMI:  5-carboxymethyl-2-hydroxymuconate isomerase;  InterPro: IPR004220 5-carboxymethyl-2-hydroxymuconate isomerase transforms 5-carboxymethyl-2-hydroxy-muconic acid into 5-oxo-pent-3-ene-1,2,5-tricarboxylic acid during the third step of the homoprotocatechuate catabolic pathway []. Homoprotocatechuate (HPC; 3,4-dihydroxyphenylacetate) is catabolized to Krebs cycle intermediates via extradiol (meta-) cleavage and the necessary enzymes are chromosomally encoded in a variety of bacteria []. 5-carboxymethyl-2-hydroxymuconate isomerase is probably a dimer of two identical subunits []. A comparison of the N-terminal half of the isomerase/decarboxylase sequence from the pathway (both encoded by the gene hpcE), with the second half showed significant similarity. This suggests that a duplication may have occurred to produce a bifunctional gene [].; PDB: 3E6Q_H 1OTG_B.
Probab=23.17  E-value=66  Score=25.43  Aligned_cols=42  Identities=29%  Similarity=0.416  Sum_probs=28.4

Q ss_pred             ChhHhhcCChHHHHHHhhhhhhhhHhhhCCC-CCCCceeeccCC
Q psy14481         98 TSTMLAAAQPQEQKQMLGERLFPLIQRMYPE-LAGKITGMLLEI  140 (179)
Q Consensus        98 ~~s~La~a~peeQKq~LGErLyplV~~l~p~-~AgKITGMLLEm  140 (179)
                      +...++.= .++||+.|||+||..+.+..+. .++.-.+.-+|+
T Consensus        64 ~l~il~GR-s~e~k~~l~~~l~~~l~~~~~~~~~~~~~~LsvEi  106 (124)
T PF02962_consen   64 TLRILAGR-SEEQKKALSEALLAVLKAHLAPLFAQRYLQLSVEI  106 (124)
T ss_dssp             EEEEETT---HHHHHHHHHHHHHHHHHHCCCHCCHSEEEEEEEE
T ss_pred             EeeecCCC-CHHHHHHHHHHHHHHHHHHhhHhhcCCeeEEEEEE
Confidence            44444444 4568999999999999887554 566666665654


No 54 
>PRK11638 lipopolysaccharide biosynthesis protein WzzE; Provisional
Probab=22.95  E-value=64  Score=29.39  Aligned_cols=22  Identities=18%  Similarity=0.128  Sum_probs=17.3

Q ss_pred             hhHHHHHHhhhcceeeeeeccc
Q psy14481         40 FGALALVTLFGAVSFIMYRRYR   61 (179)
Q Consensus        40 ~~~~~i~~~~~~~~~~~yrr~~   61 (179)
                      +-..++++++.|+++++.||++
T Consensus       320 lil~~llG~~lg~~~vL~r~~~  341 (342)
T PRK11638        320 MIMWGAVGALVGAGVALTRRRR  341 (342)
T ss_pred             HHHHHHHHHHHHheeeEeecCC
Confidence            4455677788899999999975


No 55 
>PF13396 PLDc_N:  Phospholipase_D-nuclease N-terminal
Probab=22.86  E-value=75  Score=20.05  Aligned_cols=22  Identities=9%  Similarity=0.408  Sum_probs=17.2

Q ss_pred             hhhHHHHHHhhhcceeeeeecc
Q psy14481         39 SFGALALVTLFGAVSFIMYRRY   60 (179)
Q Consensus        39 ~~~~~~i~~~~~~~~~~~yrr~   60 (179)
                      -+..+.++-.+|.+.|++.+||
T Consensus        25 W~~~i~~~P~iG~i~Yl~~gr~   46 (46)
T PF13396_consen   25 WLIVILFFPIIGPILYLIFGRK   46 (46)
T ss_pred             HHHHHHHHHHHHHhheEEEeCC
Confidence            3455666888999999998885


No 56 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=22.80  E-value=31  Score=22.55  Aligned_cols=23  Identities=22%  Similarity=0.418  Sum_probs=16.4

Q ss_pred             hhhHHHHHHhhhcceeeeeeccc
Q psy14481         39 SFGALALVTLFGAVSFIMYRRYR   61 (179)
Q Consensus        39 ~~~~~~i~~~~~~~~~~~yrr~~   61 (179)
                      +++.+.++..|.++.+..||++.
T Consensus        12 ~~~~v~~~~~F~gi~~w~~~~~~   34 (49)
T PF05545_consen   12 SIGTVLFFVFFIGIVIWAYRPRN   34 (49)
T ss_pred             HHHHHHHHHHHHHHHHHHHcccc
Confidence            45667777777777777787764


No 57 
>PF14283 DUF4366:  Domain of unknown function (DUF4366)
Probab=22.77  E-value=77  Score=27.37  Aligned_cols=24  Identities=25%  Similarity=0.346  Sum_probs=13.7

Q ss_pred             HHHHhhhcceeeeeecccCCCCccc
Q psy14481         44 ALVTLFGAVSFIMYRRYRWNKPQTL   68 (179)
Q Consensus        44 ~i~~~~~~~~~~~yrr~~~n~pq~Y   68 (179)
                      ++|.++|+ |.+.|.+-+..|.+..
T Consensus       167 llv~l~gG-Ga~yYfK~~K~K~~~~  190 (218)
T PF14283_consen  167 LLVALIGG-GAYYYFKFYKPKQEEK  190 (218)
T ss_pred             HHHHHhhc-ceEEEEEEeccccccc
Confidence            33334444 6667777666666655


No 58 
>PRK10905 cell division protein DamX; Validated
Probab=22.54  E-value=37  Score=31.38  Aligned_cols=19  Identities=21%  Similarity=0.371  Sum_probs=17.5

Q ss_pred             hhhhHHHHHHhhhcceeee
Q psy14481         38 ISFGALALVTLFGAVSFIM   56 (179)
Q Consensus        38 ~~~~~~~i~~~~~~~~~~~   56 (179)
                      |++|++|||.++-+||+.|
T Consensus         2 iGiGilVLlLLIigIgSAL   20 (328)
T PRK10905          2 MGVGILVLLLLIIGIGSAL   20 (328)
T ss_pred             cchhHHHHHHHHHHHhHhh
Confidence            5789999999999999998


No 59 
>PRK11556 multidrug efflux system subunit MdtA; Provisional
Probab=22.24  E-value=20  Score=32.75  Aligned_cols=17  Identities=6%  Similarity=0.311  Sum_probs=10.9

Q ss_pred             hhhCCCCCCCceeeccC
Q psy14481        123 QRMYPELAGKITGMLLE  139 (179)
Q Consensus       123 ~~l~p~~AgKITGMLLE  139 (179)
                      -.+.+...|+|+-...+
T Consensus        88 v~v~~~vsG~V~~i~v~  104 (415)
T PRK11556         88 VTVRSRVDGQLMALHFQ  104 (415)
T ss_pred             EEEEccccEEEEEEECC
Confidence            34567777888776433


No 60 
>KOG0077|consensus
Probab=22.08  E-value=39  Score=29.11  Aligned_cols=30  Identities=40%  Similarity=0.585  Sum_probs=19.0

Q ss_pred             HHhhhhhhhhHhhh---CCCCCCCceeeccCCChHHHHHhhcChH
Q psy14481        112 QMLGERLFPLIQRM---YPELAGKITGMLLEIDNAELLYMLEHNE  153 (179)
Q Consensus       112 q~LGErLyplV~~l---~p~~AgKITGMLLEmd~sELL~mLes~e  153 (179)
                      |+||  ||-+-..+   .=|+|||-|          |||||.|+.
T Consensus        12 q~Lg--L~kK~gKllFlGLDNAGKTT----------LLHMLKdDr   44 (193)
T KOG0077|consen   12 QFLG--LYKKFGKLLFLGLDNAGKTT----------LLHMLKDDR   44 (193)
T ss_pred             HHHH--HhccCceEEEEeecCCchhh----------HHHHHcccc
Confidence            5666  66554433   345566644          899999864


No 61 
>PHA03286 envelope glycoprotein E; Provisional
Probab=21.94  E-value=23  Score=34.31  Aligned_cols=31  Identities=32%  Similarity=0.551  Sum_probs=19.4

Q ss_pred             eeechhhhHHHHHHhhhcceeeeeecccCCC
Q psy14481         34 MISGISFGALALVTLFGAVSFIMYRRYRWNK   64 (179)
Q Consensus        34 ~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n~   64 (179)
                      +|+-+.+|...+|.+++...-.+||||+-.|
T Consensus       392 l~~s~~~~~~~~~~~~~~~~~~~~~r~~~~r  422 (492)
T PHA03286        392 LVSSMAAGAILVVLLFALCIAGLYRRRRRHR  422 (492)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHhHhhhhhhhh
Confidence            3455555666666666666667788866544


No 62 
>KOG1236|consensus
Probab=21.75  E-value=84  Score=30.93  Aligned_cols=63  Identities=25%  Similarity=0.283  Sum_probs=53.5

Q ss_pred             CChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHH
Q psy14481         97 LTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQ  167 (179)
Q Consensus        97 l~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~  167 (179)
                      +++-+...+++++|++.|+  ||--|.+.+--.||+.+   ||.+..   .-.-|||.+..||++++.-.+
T Consensus       430 lDtGiv~kLs~qn~~N~l~--lF~av~~~~Gy~ag~lm---l~~Sr~---q~~~d~e~F~~kvEklv~~~~  492 (565)
T KOG1236|consen  430 LDTGIVTKLSHQNQMNLLS--LFRAVARFDGYDAGELM---LERSRQ---QQCIDPERFARKVEKLVLKAR  492 (565)
T ss_pred             ecchhhhccChhhHHHHHH--HHHHHHHhcchhHHHHH---Hhhccc---ccCCCHHHHHHHHHHHHHHhh
Confidence            6777889999999999999  99999999888888864   676665   567899999999999987655


No 63 
>PF11710 Git3:  G protein-coupled glucose receptor regulating Gpa2;  InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins. 
Probab=21.66  E-value=30  Score=28.81  Aligned_cols=33  Identities=21%  Similarity=0.343  Sum_probs=21.6

Q ss_pred             ceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481         30 VNVAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus        30 ~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      +.++.|++++++.++.+.++..+.++-|||+.|
T Consensus         3 ~r~~~i~~s~lSlls~~~~l~~~~~~~~r~~~f   35 (201)
T PF11710_consen    3 LRIGSIVISALSLLSSLAVLYLFFFITYRRRSF   35 (201)
T ss_pred             eeeehHHHHHHHHHHHHHHHHHHHhhhhhhhhH
Confidence            346777777777776665555555555777766


No 64 
>KOG4836|consensus
Probab=21.66  E-value=62  Score=28.36  Aligned_cols=23  Identities=22%  Similarity=0.494  Sum_probs=19.4

Q ss_pred             ceeeeeechhhhHHHHHHhhhcc
Q psy14481         30 VNVAMISGISFGALALVTLFGAV   52 (179)
Q Consensus        30 ~~~~~i~g~~~~~~~i~~~~~~~   52 (179)
                      .-+..|+||++.+++|.++++-+
T Consensus        81 ~~~iVI~GiGv~g~~iY~i~~El  103 (215)
T KOG4836|consen   81 YYIIVIAGIGVTGAFIYAIFGEL  103 (215)
T ss_pred             eeeeeeeeccHHHHhHHHHHHHH
Confidence            33788999999999999998854


No 65 
>PRK15031 5-carboxymethyl-2-hydroxymuconate delta-isomerase; Provisional
Probab=21.29  E-value=57  Score=25.91  Aligned_cols=41  Identities=29%  Similarity=0.385  Sum_probs=28.0

Q ss_pred             ChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCc-eeeccC
Q psy14481         98 TSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKI-TGMLLE  139 (179)
Q Consensus        98 ~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKI-TGMLLE  139 (179)
                      +...++.= .++||+.||++||..+.+.-+...++= .+.-+|
T Consensus        65 ~l~i~~GR-s~e~k~~l~~~l~~~l~~~~~~~~~~~~~~LS~E  106 (126)
T PRK15031         65 TLKIGAGR-SLESRQEVGEMLFALIKAHFAALMESRYLALSFE  106 (126)
T ss_pred             EeeecCCC-CHHHHHHHHHHHHHHHHHHhhhhhcccceEEEEE
Confidence            44444444 567999999999999988877665444 454444


No 66 
>PRK11901 hypothetical protein; Reviewed
Probab=21.21  E-value=48  Score=30.66  Aligned_cols=19  Identities=32%  Similarity=0.447  Sum_probs=17.6

Q ss_pred             hhhhHHHHHHhhhcceeee
Q psy14481         38 ISFGALALVTLFGAVSFIM   56 (179)
Q Consensus        38 ~~~~~~~i~~~~~~~~~~~   56 (179)
                      |+.|++|||.++-+||++|
T Consensus        40 iGiGilVLlLLIi~IgSAL   58 (327)
T PRK11901         40 IGIGILVLLLLIIAIGSAL   58 (327)
T ss_pred             HHHHHHHHHHHHHHHhhhc
Confidence            6789999999999999998


No 67 
>PF15065 NCU-G1:  Lysosomal transcription factor, NCU-G1
Probab=21.11  E-value=53  Score=30.34  Aligned_cols=29  Identities=24%  Similarity=0.482  Sum_probs=21.7

Q ss_pred             eeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481         33 AMISGISFGALALVTLFGAVSFIMYRRYR   61 (179)
Q Consensus        33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~   61 (179)
                      -+|.+++||.=+++.++|++-..+.|+|+
T Consensus       320 i~i~~vgLG~P~l~li~Ggl~v~~~r~r~  348 (350)
T PF15065_consen  320 IMIMAVGLGVPLLLLILGGLYVCLRRRRK  348 (350)
T ss_pred             HHHHHHHhhHHHHHHHHhhheEEEecccc
Confidence            35677888888888888888666666554


No 68 
>KOG3540|consensus
Probab=21.08  E-value=46  Score=32.81  Aligned_cols=54  Identities=20%  Similarity=0.268  Sum_probs=33.6

Q ss_pred             CCCCChhHHhhhhccCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481          5 APPDSPRAHQAHAEDLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW   62 (179)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~   62 (179)
                      +++|++-.-+||.--   +++..-|.+.++|.|...++++|. ++..|++++.|+|.+
T Consensus       523 ~e~d~~~e~~r~~~~---~~~ed~~~s~~av~gllv~~~~i~-tvivisl~mlrkr~y  576 (615)
T KOG3540|consen  523 AEFDEGAEFYRHDLL---PQSEDVGRSASAVIGLLVSAVFIA-TVIVISLVMLRKRQY  576 (615)
T ss_pred             CCCCCchhhhhhhhc---cccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHccccc
Confidence            456777777777533   444455667777788555555554 445667777666544


No 69 
>PF00226 DnaJ:  DnaJ domain;  InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation:  +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+   It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=20.87  E-value=1.8e+02  Score=18.91  Aligned_cols=41  Identities=15%  Similarity=0.167  Sum_probs=28.7

Q ss_pred             hhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHHH
Q psy14481        115 GERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAHQ  170 (179)
Q Consensus       115 GErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~q  170 (179)
                      =....-++...||+..+..+               ..-+..-.+|++|.++|..-.
T Consensus        18 k~~y~~l~~~~HPD~~~~~~---------------~~~~~~~~~i~~Ay~~L~~~~   58 (64)
T PF00226_consen   18 KKAYRRLSKQYHPDKNSGDE---------------AEAEEKFARINEAYEILSDPE   58 (64)
T ss_dssp             HHHHHHHHHHTSTTTGTSTH---------------HHHHHHHHHHHHHHHHHHSHH
T ss_pred             HHHHHhhhhccccccchhhh---------------hhhhHHHHHHHHHHHHhCCHH
Confidence            33456777888999866665               333456678999999987543


No 70 
>CHL00020 psbN photosystem II protein N
Probab=20.80  E-value=79  Score=21.26  Aligned_cols=23  Identities=9%  Similarity=0.130  Sum_probs=18.2

Q ss_pred             chhhhHHHHHHhhhcceeeeeec
Q psy14481         37 GISFGALALVTLFGAVSFIMYRR   59 (179)
Q Consensus        37 g~~~~~~~i~~~~~~~~~~~yrr   59 (179)
                      ...++.++.+.++|..+|.+|.-
T Consensus         4 A~~~~i~i~~ll~~~Tgy~iYta   26 (43)
T CHL00020          4 ATLVAIFISGLLVSFTGYALYTA   26 (43)
T ss_pred             hhhHHHHHHHHHHHhhheeeeec
Confidence            45677777788889999999864


No 71 
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=20.44  E-value=61  Score=31.06  Aligned_cols=31  Identities=23%  Similarity=0.368  Sum_probs=19.0

Q ss_pred             hhhhHHH---HHHhhhcceeeeeecccCCCCccc
Q psy14481         38 ISFGALA---LVTLFGAVSFIMYRRYRWNKPQTL   68 (179)
Q Consensus        38 ~~~~~~~---i~~~~~~~~~~~yrr~~~n~pq~Y   68 (179)
                      |.++.++   |+.++...-|||+|||+-|--.+-
T Consensus       388 i~~avl~p~~il~~~~~~~~~~v~rrr~~~~dv~  421 (436)
T PTZ00208        388 IILAVLVPAIILAIIAVAFFIMVKRRRNSSEDVD  421 (436)
T ss_pred             HHHHHHHHHHHHHHHHHHhheeeeeccCCchhcc
Confidence            3445443   444455567888999988654443


No 72 
>cd00580 CHMI 5-carboxymethyl-2-hydroxymuconate isomerase (CHMI) is a trimeric enzyme catalyzing the isomerization of the unsaturated ketone 5-(carboxymethyl)-2-hydroxymuconate to 5-(carboxymethyl)-2-oxo-3-hexene-1,6-dionate. This is one step in the homoprotocatechuate pathway, one of the microbial meta-fission pathways that degrade aromatic carbon sources to citric acid cycle intermediates.  Despite the structural similarity of CHMI with 4-oxalocrotonate tautomerase (4-OT) and macrophage migration inhibitory factor (MIF), there is no significant sequence similarity among these protein families, and therefore, they are not combined in one hierarchy.
Probab=20.27  E-value=75  Score=23.77  Aligned_cols=37  Identities=24%  Similarity=0.347  Sum_probs=28.4

Q ss_pred             ChHHHHHHhhhhhhhhHhhhCCCCCC----CceeeccCCCh
Q psy14481        106 QPQEQKQMLGERLFPLIQRMYPELAG----KITGMLLEIDN  142 (179)
Q Consensus       106 ~peeQKq~LGErLyplV~~l~p~~Ag----KITGMLLEmd~  142 (179)
                      -.+|||+-|++.|+..+.+.-+...+    .|+=-+-|||.
T Consensus        71 Rs~eqK~~l~~~i~~~l~~~~~~~~~~~~~~~svei~e~~~  111 (113)
T cd00580          71 RSEEQKQELSEALLAALRAHLAPVFAKRYLSLSVEIRELDP  111 (113)
T ss_pred             CCHHHHHHHHHHHHHHHHHhhhhhhhccceEEEEEEEecCC
Confidence            47789999999999999888666544    66666666664


Done!