Query psy14481
Match_columns 179
No_of_seqs 138 out of 325
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 20:56:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14481.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14481hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00517 PolyA C-terminal do 100.0 3.8E-31 8.3E-36 187.7 6.6 64 106-169 1-64 (64)
2 PF00658 PABP: Poly-adenylate 100.0 6.3E-30 1.4E-34 184.2 3.7 72 95-166 1-72 (72)
3 TIGR01628 PABP-1234 polyadenyl 100.0 9.2E-29 2E-33 225.5 8.9 80 88-167 483-562 (562)
4 KOG0943|consensus 97.5 4.4E-05 9.4E-10 79.1 2.2 66 108-173 2605-2672(3015)
5 PTZ00382 Variant-specific surf 95.4 0.0092 2E-07 45.1 1.9 33 27-59 60-93 (96)
6 PF03302 VSP: Giardia variant- 93.2 0.044 9.5E-07 50.2 1.6 34 26-59 360-394 (397)
7 PF05808 Podoplanin: Podoplani 91.7 0.05 1.1E-06 45.3 0.0 37 23-59 119-155 (162)
8 PF08693 SKG6: Transmembrane a 87.3 0.68 1.5E-05 30.5 2.8 30 32-61 11-40 (40)
9 PF01034 Syndecan: Syndecan do 84.1 0.33 7.1E-06 35.0 0.1 32 27-61 7-38 (64)
10 PF12301 CD99L2: CD99 antigen 82.5 0.51 1.1E-05 39.3 0.6 35 26-62 108-143 (169)
11 PF01102 Glycophorin_A: Glycop 78.1 0.48 1E-05 37.7 -0.9 16 29-44 60-75 (122)
12 PF13908 Shisa: Wnt and FGF in 74.3 4.4 9.6E-05 32.7 3.7 25 28-52 74-98 (179)
13 PF02009 Rifin_STEVOR: Rifin/s 72.7 0.71 1.5E-05 41.5 -1.3 29 33-61 256-285 (299)
14 PF14828 Amnionless: Amnionles 72.7 4.9 0.00011 37.8 4.1 25 33-57 338-362 (437)
15 PF10873 DUF2668: Protein of u 72.4 3.1 6.7E-05 34.6 2.4 29 28-56 56-84 (155)
16 PF15102 TMEM154: TMEM154 prot 69.1 9.5 0.00021 31.4 4.6 35 28-62 53-87 (146)
17 PF06365 CD34_antigen: CD34/Po 64.7 3.4 7.3E-05 35.5 1.2 36 33-68 101-137 (202)
18 PF12273 RCR: Chitin synthesis 64.4 2 4.4E-05 33.3 -0.2 34 40-73 5-41 (130)
19 PHA03265 envelope glycoprotein 64.0 3.4 7.4E-05 38.7 1.2 39 21-62 339-377 (402)
20 PF01299 Lamp: Lysosome-associ 61.7 4.4 9.5E-05 35.6 1.4 28 32-62 273-300 (306)
21 TIGR01495 ETRAMP Plasmodium ri 60.1 5.1 0.00011 30.0 1.3 32 31-62 50-81 (85)
22 PF15069 FAM163: FAM163 family 56.2 7.7 0.00017 31.9 1.8 24 31-54 4-27 (143)
23 PTZ00046 rifin; Provisional 54.3 3.4 7.4E-05 38.3 -0.6 28 34-61 316-344 (358)
24 TIGR01477 RIFIN variant surfac 53.2 3.6 7.8E-05 38.1 -0.6 28 34-61 311-339 (353)
25 PF12768 Rax2: Cortical protei 47.4 9.8 0.00021 33.7 1.2 30 27-56 221-251 (281)
26 PF02158 Neuregulin: Neureguli 45.5 7 0.00015 36.9 0.0 24 32-56 9-33 (404)
27 PF12191 stn_TNFRSF12A: Tumour 45.2 6.7 0.00014 31.8 -0.1 34 32-65 75-109 (129)
28 PF05283 MGC-24: Multi-glycosy 44.8 7.3 0.00016 33.1 0.0 27 34-60 158-184 (186)
29 PHA03048 IMV membrane protein; 42.9 6.2 0.00013 30.4 -0.6 29 33-62 44-72 (93)
30 PF14575 EphA2_TM: Ephrin type 41.5 9.3 0.0002 27.6 0.1 31 36-67 4-34 (75)
31 PHA02898 virion envelope prote 40.7 6.7 0.00014 30.1 -0.7 32 33-65 45-77 (92)
32 PF07213 DAP10: DAP10 membrane 39.8 18 0.0004 27.1 1.5 43 17-59 18-60 (79)
33 KOG0123|consensus 36.2 19 0.00041 33.0 1.3 85 86-174 270-354 (369)
34 PHA02902 putative IMV membrane 34.1 18 0.00039 26.5 0.6 19 44-62 11-29 (70)
35 PF12768 Rax2: Cortical protei 31.9 20 0.00044 31.8 0.8 35 32-66 229-266 (281)
36 PF02480 Herpes_gE: Alphaherpe 31.8 15 0.00033 34.5 0.0 26 34-59 353-378 (439)
37 PF04391 DUF533: Protein of un 31.1 17 0.00038 30.6 0.2 101 41-153 33-142 (188)
38 TIGR01167 LPXTG_anchor LPXTG-m 30.2 26 0.00056 20.7 0.8 10 51-60 23-32 (34)
39 PHA02681 ORF089 virion membran 29.2 21 0.00046 27.3 0.4 20 43-62 10-29 (92)
40 PF05454 DAG1: Dystroglycan (D 28.9 19 0.0004 32.5 0.0 21 44-64 157-177 (290)
41 PF14851 FAM176: FAM176 family 28.5 43 0.00093 27.7 2.0 18 34-51 26-43 (153)
42 PHA02680 ORF090 IMV phosphoryl 27.8 20 0.00043 27.5 0.0 26 33-59 46-71 (91)
43 PF04478 Mid2: Mid2 like cell 27.3 70 0.0015 26.7 3.1 10 30-39 50-59 (154)
44 PF05337 CSF-1: Macrophage col 27.0 21 0.00046 32.4 0.0 32 45-76 238-269 (285)
45 PF03908 Sec20: Sec20; InterP 25.6 27 0.00058 25.5 0.3 17 43-59 75-91 (92)
46 PF09835 DUF2062: Uncharacteri 25.5 16 0.00035 28.6 -0.9 27 38-64 120-153 (154)
47 PF05795 Plasmodium_Vir: Plasm 25.2 33 0.00071 29.2 0.8 23 35-58 281-303 (354)
48 PF02480 Herpes_gE: Alphaherpe 24.5 25 0.00054 33.2 0.0 45 19-63 342-386 (439)
49 PF11897 DUF3417: Protein of u 24.4 1E+02 0.0022 24.0 3.4 43 130-173 48-90 (118)
50 PF06809 NPDC1: Neural prolife 24.2 1.6E+02 0.0034 27.5 5.0 34 24-60 192-225 (341)
51 PF07235 DUF1427: Protein of u 23.5 44 0.00095 25.7 1.2 27 113-139 39-76 (90)
52 PF02468 PsbN: Photosystem II 23.3 37 0.0008 22.8 0.6 24 36-59 3-26 (43)
53 PF02962 CHMI: 5-carboxymethyl 23.2 66 0.0014 25.4 2.1 42 98-140 64-106 (124)
54 PRK11638 lipopolysaccharide bi 22.9 64 0.0014 29.4 2.3 22 40-61 320-341 (342)
55 PF13396 PLDc_N: Phospholipase 22.9 75 0.0016 20.1 2.0 22 39-60 25-46 (46)
56 PF05545 FixQ: Cbb3-type cytoc 22.8 31 0.00067 22.6 0.2 23 39-61 12-34 (49)
57 PF14283 DUF4366: Domain of un 22.8 77 0.0017 27.4 2.6 24 44-68 167-190 (218)
58 PRK10905 cell division protein 22.5 37 0.00081 31.4 0.7 19 38-56 2-20 (328)
59 PRK11556 multidrug efflux syst 22.2 20 0.00043 32.7 -1.1 17 123-139 88-104 (415)
60 KOG0077|consensus 22.1 39 0.00084 29.1 0.7 30 112-153 12-44 (193)
61 PHA03286 envelope glycoprotein 21.9 23 0.00049 34.3 -0.8 31 34-64 392-422 (492)
62 KOG1236|consensus 21.8 84 0.0018 30.9 2.9 63 97-167 430-492 (565)
63 PF11710 Git3: G protein-coupl 21.7 30 0.00066 28.8 -0.0 33 30-62 3-35 (201)
64 KOG4836|consensus 21.7 62 0.0013 28.4 1.8 23 30-52 81-103 (215)
65 PRK15031 5-carboxymethyl-2-hyd 21.3 57 0.0012 25.9 1.4 41 98-139 65-106 (126)
66 PRK11901 hypothetical protein; 21.2 48 0.001 30.7 1.1 19 38-56 40-58 (327)
67 PF15065 NCU-G1: Lysosomal tra 21.1 53 0.0012 30.3 1.4 29 33-61 320-348 (350)
68 KOG3540|consensus 21.1 46 0.00099 32.8 1.0 54 5-62 523-576 (615)
69 PF00226 DnaJ: DnaJ domain; I 20.9 1.8E+02 0.0039 18.9 3.7 41 115-170 18-58 (64)
70 CHL00020 psbN photosystem II p 20.8 79 0.0017 21.3 1.8 23 37-59 4-26 (43)
71 PTZ00208 65 kDa invariant surf 20.4 61 0.0013 31.1 1.7 31 38-68 388-421 (436)
72 cd00580 CHMI 5-carboxymethyl-2 20.3 75 0.0016 23.8 1.9 37 106-142 71-111 (113)
No 1
>smart00517 PolyA C-terminal domain of Poly(A)-binding protein. Present also in Drosophila hyperplastics discs protein. Involved in homodimerisation (either directly or indirectly)
Probab=99.97 E-value=3.8e-31 Score=187.71 Aligned_cols=64 Identities=67% Similarity=0.994 Sum_probs=62.3
Q ss_pred ChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHH
Q psy14481 106 QPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAH 169 (179)
Q Consensus 106 ~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~ 169 (179)
+|++|||+|||+|||+|++++|++||||||||||||++|+++||+|+++|++||+||++||++|
T Consensus 1 ~p~~qkq~LGE~Lyp~V~~~~p~~A~KITGMLLEmd~~ell~lle~~~~L~~kv~EA~~vl~~~ 64 (64)
T smart00517 1 PPQEQKQALGERLYPKVQALEPELAGKITGMLLEMDNSELLHLLESPELLRSKVDEALEVLKSH 64 (64)
T ss_pred CHHHHHHHHhHHHhHHHHhhCcccCCcCeeeeeCCCHHHHHHHhcCHHHHHHHHHHHHHHHHhC
Confidence 5899999999999999999999999999999999999999999999999999999999999874
No 2
>PF00658 PABP: Poly-adenylate binding protein, unique domain; InterPro: IPR002004 The polyadenylate-binding protein (PABP) has a conserved C-terminal domain (PABC), which is also found in the hyperplastic discs protein (HYD) family of ubiquitin ligases that contain HECT domains (IPR000569 from INTERPRO) []. PABP recognises the 3' mRNA poly(A) tail and plays an essential role in eukaryotic translation initiation and mRNA stabilisation/degradation. PABC domains of PABP are peptide-binding domains that mediate PABP homo-oligomerisation and protein-protein interactions. In mammals, the PABC domain of PABP functions to recruit several different translation factors to the mRNA poly(A) tail [].; GO: 0003723 RNA binding; PDB: 3KUR_E 1JH4_A 2RQH_B 3KUI_A 3KUS_A 3KUJ_A 3KTR_A 2X04_A 3PTH_A 1JGN_A ....
Probab=99.96 E-value=6.3e-30 Score=184.24 Aligned_cols=72 Identities=67% Similarity=1.054 Sum_probs=70.6
Q ss_pred CCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHH
Q psy14481 95 EPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVL 166 (179)
Q Consensus 95 epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL 166 (179)
+|+++++|+++++++||++|||+|||+|+.++|++||||||||||||++||++||+|++.|+.||+||++||
T Consensus 1 ~~~~~~~la~~~~~~qk~~LGe~Ly~~V~~~~p~~A~KITGMLLe~~~~ell~ll~~~~~L~~kv~eA~~vL 72 (72)
T PF00658_consen 1 EPLTASALASASPEQQKQILGERLYPLVQAIYPELAGKITGMLLEMDNSELLHLLEDPELLREKVQEAIEVL 72 (72)
T ss_dssp SSTSHHHHHTSCHHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHTTSCHHHHHHHHHTHHHHHHHHHHHHHHH
T ss_pred CcchHHHHhcCCHHHHHHHHhccccHHHHHhCcchhHHHHHHHhcCCHHHHHHHhCCHHHHHHHHHHHHHhC
Confidence 578999999999999999999999999999999999999999999999999999999999999999999987
No 3
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.95 E-value=9.2e-29 Score=225.54 Aligned_cols=80 Identities=58% Similarity=0.813 Sum_probs=77.0
Q ss_pred ccccCCCCCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHH
Q psy14481 88 AVHVQGQEPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQ 167 (179)
Q Consensus 88 ~~~~~g~epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~ 167 (179)
+++.+|++++++++|++++|++||+||||+|||+|++++|++||||||||||||++|||+||||+|+|++||+||++||+
T Consensus 483 ~~~~~~~~~~~~~~la~~~p~~q~~~lg~~~~~~~~~~~~~~~~~~tgm~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (562)
T TIGR01628 483 TASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEAIEPALAAKITGMLLEMDNSELLHLLESPELLKSKVDEALEVLK 562 (562)
T ss_pred CccccccchhHHHHHhhCCHHHHHHHHHHHhHHHHHhhChhhcCcceEEEecCCHHHHHHHhcCHHHHHHHHHHHHHHhC
Confidence 46677889999999999999999999999999999999999999999999999999999999999999999999999984
No 4
>KOG0943|consensus
Probab=97.51 E-value=4.4e-05 Score=79.06 Aligned_cols=66 Identities=30% Similarity=0.459 Sum_probs=61.2
Q ss_pred HHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHH--HHHHHHHHhH
Q psy14481 108 QEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAV--AVLQAHQAKQ 173 (179)
Q Consensus 108 eeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~--~vL~~~q~~~ 173 (179)
.-.+|.|||+||+.++.+.|..|.||||||||.....++..|..++.|+..|++|+ +++..|.++-
T Consensus 2605 sAe~Q~la~l~wgqhe~~l~a~~s~~sGfL~dek~~g~llgi~ce~p~eafVdaa~ndd~i~~H~qei 2672 (3015)
T KOG0943|consen 2605 SAERQALAELLWGQHEAMLPAFASKISGFLLDEKPAGLLLGIACEDPLEAFVDAAMNDDLIIAHGQEI 2672 (3015)
T ss_pred hhhhccchhhcchhhhhHHHHHHHhhhhhhhccchHHHHHHHhhcCcHHHHHHHhcchHHHHHHHHHh
Confidence 34688999999999999999999999999999999999999999999999999999 8888887653
No 5
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=95.43 E-value=0.0092 Score=45.12 Aligned_cols=33 Identities=27% Similarity=0.460 Sum_probs=23.0
Q ss_pred CCCceeeeeechhhhHHHHHHhh-hcceeeeeec
Q psy14481 27 EGGVNVAMISGISFGALALVTLF-GAVSFIMYRR 59 (179)
Q Consensus 27 ~~~~~~~~i~g~~~~~~~i~~~~-~~~~~~~yrr 59 (179)
.++++.|+|+||..|+++++..+ |.+-+++++|
T Consensus 60 ~~~ls~gaiagi~vg~~~~v~~lv~~l~w~f~~r 93 (96)
T PTZ00382 60 RSGLSTGAIAGISVAVVAVVGGLVGFLCWWFVCR 93 (96)
T ss_pred CCCcccccEEEEEeehhhHHHHHHHHHhheeEEe
Confidence 36788999999999988777554 4444444434
No 6
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=93.18 E-value=0.044 Score=50.21 Aligned_cols=34 Identities=26% Similarity=0.503 Sum_probs=26.9
Q ss_pred CCCCceeeeeechhhhHHHHHH-hhhcceeeeeec
Q psy14481 26 GEGGVNVAMISGISFGALALVT-LFGAVSFIMYRR 59 (179)
Q Consensus 26 ~~~~~~~~~i~g~~~~~~~i~~-~~~~~~~~~yrr 59 (179)
..+||..|+|+||+.|++|+|. ++|.|-|+|.-|
T Consensus 360 n~s~LstgaIaGIsvavvvvVgglvGfLcWwf~cr 394 (397)
T PF03302_consen 360 NKSGLSTGAIAGISVAVVVVVGGLVGFLCWWFICR 394 (397)
T ss_pred ccccccccceeeeeehhHHHHHHHHHHHhhheeec
Confidence 4578999999999999887666 777777776554
No 7
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=91.75 E-value=0.05 Score=45.32 Aligned_cols=37 Identities=19% Similarity=0.445 Sum_probs=0.0
Q ss_pred CCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeec
Q psy14481 23 PWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 23 ~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
++...+|+..+.+.||..|.++-++++|++-|++.||
T Consensus 119 tt~ek~GL~T~tLVGIIVGVLlaIG~igGIIivvvRK 155 (162)
T PF05808_consen 119 TTVEKDGLSTVTLVGIIVGVLLAIGFIGGIIIVVVRK 155 (162)
T ss_dssp -------------------------------------
T ss_pred cccccCCcceeeeeeehhhHHHHHHHHhheeeEEeeh
Confidence 3456789999999999999999999999999999998
No 8
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=87.34 E-value=0.68 Score=30.54 Aligned_cols=30 Identities=30% Similarity=0.561 Sum_probs=23.8
Q ss_pred eeeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481 32 VAMISGISFGALALVTLFGAVSFIMYRRYR 61 (179)
Q Consensus 32 ~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~ 61 (179)
++.--|......+|+..++.+.|++|||+.
T Consensus 11 vaIa~~VvVPV~vI~~vl~~~l~~~~rR~k 40 (40)
T PF08693_consen 11 VAIAVGVVVPVGVIIIVLGAFLFFWYRRKK 40 (40)
T ss_pred EEEEEEEEechHHHHHHHHHHhheEEeccC
Confidence 344457788888999999999999999863
No 9
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=84.10 E-value=0.33 Score=34.99 Aligned_cols=32 Identities=25% Similarity=0.366 Sum_probs=0.4
Q ss_pred CCCceeeeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481 27 EGGVNVAMISGISFGALALVTLFGAVSFIMYRRYR 61 (179)
Q Consensus 27 ~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~ 61 (179)
..++-+|+|+|+..|. ++++..+.|++||-|.
T Consensus 7 ~~~vlaavIaG~Vvgl---l~ailLIlf~iyR~rk 38 (64)
T PF01034_consen 7 RSEVLAAVIAGGVVGL---LFAILLILFLIYRMRK 38 (64)
T ss_dssp ---------------------------------S-
T ss_pred cchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHh
Confidence 3444488888876544 4444556788888543
No 10
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=82.52 E-value=0.51 Score=39.34 Aligned_cols=35 Identities=34% Similarity=0.648 Sum_probs=23.4
Q ss_pred CCCCceeeeeechhhhHHHHHHhhhcc-eeeeeecccC
Q psy14481 26 GEGGVNVAMISGISFGALALVTLFGAV-SFIMYRRYRW 62 (179)
Q Consensus 26 ~~~~~~~~~i~g~~~~~~~i~~~~~~~-~~~~yrr~~~ 62 (179)
..+....+.|+||. +.|.++++|++ |||-|.||.|
T Consensus 108 ~~~~~~~g~IaGIv--sav~valvGAvsSyiaYqkKKl 143 (169)
T PF12301_consen 108 QDGEAEAGTIAGIV--SAVVVALVGAVSSYIAYQKKKL 143 (169)
T ss_pred cccCcccchhhhHH--HHHHHHHHHHHHHHHHHHhhcc
Confidence 34556678889987 44455566654 5666988877
No 11
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=78.10 E-value=0.48 Score=37.67 Aligned_cols=16 Identities=38% Similarity=0.576 Sum_probs=11.4
Q ss_pred CceeeeeechhhhHHH
Q psy14481 29 GVNVAMISGISFGALA 44 (179)
Q Consensus 29 ~~~~~~i~g~~~~~~~ 44 (179)
++..++|+||+||++.
T Consensus 60 ~fs~~~i~~Ii~gv~a 75 (122)
T PF01102_consen 60 RFSEPAIIGIIFGVMA 75 (122)
T ss_dssp SSS-TCHHHHHHHHHH
T ss_pred CccccceeehhHHHHH
Confidence 4556789999998653
No 12
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=74.26 E-value=4.4 Score=32.74 Aligned_cols=25 Identities=24% Similarity=0.299 Sum_probs=17.1
Q ss_pred CCceeeeeechhhhHHHHHHhhhcc
Q psy14481 28 GGVNVAMISGISFGALALVTLFGAV 52 (179)
Q Consensus 28 ~~~~~~~i~g~~~~~~~i~~~~~~~ 52 (179)
.+..+++|.||.+++++|+.+++.+
T Consensus 74 ~~~~~~iivgvi~~Vi~Iv~~Iv~~ 98 (179)
T PF13908_consen 74 IYFITGIIVGVICGVIAIVVLIVCF 98 (179)
T ss_pred ccceeeeeeehhhHHHHHHHhHhhh
Confidence 4455777888888877776666544
No 13
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.72 E-value=0.71 Score=41.49 Aligned_cols=29 Identities=24% Similarity=0.414 Sum_probs=24.5
Q ss_pred eeeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481 33 AMISGISFGALALVTLFGAVSFIM-YRRYR 61 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~-yrr~~ 61 (179)
-+|.+|..+.++||..+.++-+|| |||+.
T Consensus 256 t~I~aSiiaIliIVLIMvIIYLILRYRRKK 285 (299)
T PF02009_consen 256 TAIIASIIAILIIVLIMVIIYLILRYRRKK 285 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 457888999999999999999988 88843
No 14
>PF14828 Amnionless: Amnionless
Probab=72.72 E-value=4.9 Score=37.76 Aligned_cols=25 Identities=32% Similarity=0.497 Sum_probs=20.1
Q ss_pred eeeechhhhHHHHHHhhhcceeeee
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMY 57 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~y 57 (179)
+.|.|+.+++++++++++++.|..|
T Consensus 338 ~~v~~~vl~~Lllv~ll~~~~ll~~ 362 (437)
T PF14828_consen 338 GTVVGIVLGCLLLVALLFGVILLYR 362 (437)
T ss_pred ceeeeehHHHHHHHHHHHHhheEEe
Confidence 3678999999999999988776554
No 15
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=72.43 E-value=3.1 Score=34.58 Aligned_cols=29 Identities=17% Similarity=0.403 Sum_probs=23.4
Q ss_pred CCceeeeeechhhhHHHHHHhhhcceeee
Q psy14481 28 GGVNVAMISGISFGALALVTLFGAVSFIM 56 (179)
Q Consensus 28 ~~~~~~~i~g~~~~~~~i~~~~~~~~~~~ 56 (179)
..+..++|+||.||+|.|+..++++.+-+
T Consensus 56 ~~lsgtAIaGIVfgiVfimgvva~i~icv 84 (155)
T PF10873_consen 56 DVLSGTAIAGIVFGIVFIMGVVAGIAICV 84 (155)
T ss_pred cccccceeeeeehhhHHHHHHHHHHHHHH
Confidence 45567789999999999999888776554
No 16
>PF15102 TMEM154: TMEM154 protein family
Probab=69.15 E-value=9.5 Score=31.43 Aligned_cols=35 Identities=23% Similarity=0.241 Sum_probs=26.5
Q ss_pred CCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 28 GGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 28 ~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
...++.++-+|-+..++|+.+++.+.++.|||++-
T Consensus 53 ~q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~ 87 (146)
T PF15102_consen 53 SQLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRT 87 (146)
T ss_pred CCcceEEEEeHHHHHHHHHHHHHHHheeEEeeccc
Confidence 45566778888888888888888777777877665
No 17
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=64.66 E-value=3.4 Score=35.53 Aligned_cols=36 Identities=22% Similarity=0.468 Sum_probs=29.2
Q ss_pred eeeechhhhHHHHHHhhhcceeeeeecccCCC-Cccc
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMYRRYRWNK-PQTL 68 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n~-pq~Y 68 (179)
..|+=.++|++.++.+++..+|++..||.|++ .|++
T Consensus 101 ~lI~lv~~g~~lLla~~~~~~Y~~~~Rrs~~~~~~rl 137 (202)
T PF06365_consen 101 TLIALVTSGSFLLLAILLGAGYCCHQRRSWSKKGQRL 137 (202)
T ss_pred EEEehHHhhHHHHHHHHHHHHHHhhhhccCCcchhhh
Confidence 56777888888899999999999999999954 4443
No 18
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=64.35 E-value=2 Score=33.28 Aligned_cols=34 Identities=18% Similarity=0.128 Sum_probs=15.6
Q ss_pred hhHHHHHHhhhcceeeeeecccC---CCCcccccCCC
Q psy14481 40 FGALALVTLFGAVSFIMYRRYRW---NKPQTLSDKCS 73 (179)
Q Consensus 40 ~~~~~i~~~~~~~~~~~yrr~~~---n~pq~YS~~vR 73 (179)
|++++++.++..+++++.+||+. .+|..+|+=+.
T Consensus 5 ~~iii~~i~l~~~~~~~~~rRR~r~G~~P~~gt~w~~ 41 (130)
T PF12273_consen 5 FAIIIVAILLFLFLFYCHNRRRRRRGLQPIYGTRWMA 41 (130)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCceecC
Confidence 44444444444444444222222 35766665444
No 19
>PHA03265 envelope glycoprotein D; Provisional
Probab=63.98 E-value=3.4 Score=38.70 Aligned_cols=39 Identities=26% Similarity=0.430 Sum_probs=28.6
Q ss_pred CCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 21 SNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 21 ~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
-.+.|++.+-++|+|-|.+.+++|+| |+|.|++.|||.-
T Consensus 339 ~~~~s~~~~~~~g~~ig~~i~glv~v---g~il~~~~rr~k~ 377 (402)
T PHA03265 339 PPPTSKSNSTFVGISVGLGIAGLVLV---GVILYVCLRRKKE 377 (402)
T ss_pred CCCCCCCCCcccceEEccchhhhhhh---hHHHHHHhhhhhh
Confidence 34556666677888888887777764 6788999888654
No 20
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=61.67 E-value=4.4 Score=35.56 Aligned_cols=28 Identities=25% Similarity=0.547 Sum_probs=20.3
Q ss_pred eeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 32 VAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 32 ~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
+-.|-|++|++++|+.+ ++|+++|||.-
T Consensus 273 vPIaVG~~La~lvlivL---iaYli~Rrr~~ 300 (306)
T PF01299_consen 273 VPIAVGAALAGLVLIVL---IAYLIGRRRSR 300 (306)
T ss_pred HHHHHHHHHHHHHHHHH---HhheeEecccc
Confidence 55567888887777654 57999888754
No 21
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=60.05 E-value=5.1 Score=30.00 Aligned_cols=32 Identities=16% Similarity=0.169 Sum_probs=26.8
Q ss_pred eeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 31 NVAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 31 ~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
....||++..|...++++..|+++..||+..-
T Consensus 50 K~i~iS~ias~la~lv~t~~G~g~y~~~kk~~ 81 (85)
T TIGR01495 50 KIILYSSIASGLALLVGAGVGLGYYYKKKNKD 81 (85)
T ss_pred ceeehHHHHHHHHHHHHHHHHHhhhhhccccc
Confidence 36789999999999998888999988887543
No 22
>PF15069 FAM163: FAM163 family
Probab=56.20 E-value=7.7 Score=31.93 Aligned_cols=24 Identities=13% Similarity=0.400 Sum_probs=19.5
Q ss_pred eeeeeechhhhHHHHHHhhhccee
Q psy14481 31 NVAMISGISFGALALVTLFGAVSF 54 (179)
Q Consensus 31 ~~~~i~g~~~~~~~i~~~~~~~~~ 54 (179)
..++|||..||.|.|++++..+-|
T Consensus 4 GTvVItGgILAtVILLcIIaVLCY 27 (143)
T PF15069_consen 4 GTVVITGGILATVILLCIIAVLCY 27 (143)
T ss_pred eeEEEechHHHHHHHHHHHHHHHH
Confidence 367899999999999988765544
No 23
>PTZ00046 rifin; Provisional
Probab=54.25 E-value=3.4 Score=38.34 Aligned_cols=28 Identities=21% Similarity=0.395 Sum_probs=23.1
Q ss_pred eeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481 34 MISGISFGALALVTLFGAVSFIM-YRRYR 61 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~~~~~~-yrr~~ 61 (179)
+|..+..|.+|||..+.++=.|| |||+.
T Consensus 316 aIiaSiiAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 316 AIIASIVAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 46677889999999999888888 88853
No 24
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=53.15 E-value=3.6 Score=38.11 Aligned_cols=28 Identities=25% Similarity=0.401 Sum_probs=23.1
Q ss_pred eeechhhhHHHHHHhhhcceeee-eeccc
Q psy14481 34 MISGISFGALALVTLFGAVSFIM-YRRYR 61 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~~~~~~-yrr~~ 61 (179)
+|..+..|.+|||..+.++=.|| |||+.
T Consensus 311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 311 PIIASIIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 46677889999999999888888 88854
No 25
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=47.45 E-value=9.8 Score=33.73 Aligned_cols=30 Identities=20% Similarity=0.253 Sum_probs=17.5
Q ss_pred CCCceeeeeechhhhH-HHHHHhhhcceeee
Q psy14481 27 EGGVNVAMISGISFGA-LALVTLFGAVSFIM 56 (179)
Q Consensus 27 ~~~~~~~~i~g~~~~~-~~i~~~~~~~~~~~ 56 (179)
..-++.|.+-+|+||+ ++++.++|.+++++
T Consensus 221 ~~~l~~G~VVlIslAiALG~v~ll~l~Gii~ 251 (281)
T PF12768_consen 221 GKKLSRGFVVLISLAIALGTVFLLVLIGIIL 251 (281)
T ss_pred cccccceEEEEEehHHHHHHHHHHHHHHHHH
Confidence 4666667777766664 34455555555555
No 26
>PF02158 Neuregulin: Neuregulin family; InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers. The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission. The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=45.48 E-value=7 Score=36.89 Aligned_cols=24 Identities=29% Similarity=0.441 Sum_probs=0.0
Q ss_pred eeeeechhhhHHHHHHhhhcc-eeee
Q psy14481 32 VAMISGISFGALALVTLFGAV-SFIM 56 (179)
Q Consensus 32 ~~~i~g~~~~~~~i~~~~~~~-~~~~ 56 (179)
|-.|||||+|.+| |+++|.+ .|.=
T Consensus 9 VLTITgIcvaLlV-VGi~Cvv~aYCK 33 (404)
T PF02158_consen 9 VLTITGICVALLV-VGIVCVVDAYCK 33 (404)
T ss_dssp --------------------------
T ss_pred hhhhhhhhHHHHH-HHHHHHHHHHHH
Confidence 4568999998764 6777777 6655
No 27
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=45.24 E-value=6.7 Score=31.81 Aligned_cols=34 Identities=24% Similarity=0.426 Sum_probs=1.6
Q ss_pred eeeeechhhhHHHHHHhhhcc-eeeeeecccCCCC
Q psy14481 32 VAMISGISFGALALVTLFGAV-SFIMYRRYRWNKP 65 (179)
Q Consensus 32 ~~~i~g~~~~~~~i~~~~~~~-~~~~yrr~~~n~p 65 (179)
+..+..|.+++++++...+.+ +|+.|||.|-.++
T Consensus 75 ~~l~~pi~~sal~v~lVl~llsg~lv~rrcrrr~~ 109 (129)
T PF12191_consen 75 FPLLWPILGSALSVVLVLALLSGFLVWRRCRRREK 109 (129)
T ss_dssp SSSS-------------------------------
T ss_pred cceehhhhhhHHHHHHHHHHHHHHHHHhhhhcccc
Confidence 334455555544444444444 5555666554333
No 28
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=44.77 E-value=7.3 Score=33.05 Aligned_cols=27 Identities=19% Similarity=0.262 Sum_probs=21.9
Q ss_pred eeechhhhHHHHHHhhhcceeeeeecc
Q psy14481 34 MISGISFGALALVTLFGAVSFIMYRRY 60 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~~~~~~yrr~ 60 (179)
.=+|+.+|+.||+..+=+|.||+||-+
T Consensus 158 FD~~SFiGGIVL~LGv~aI~ff~~KF~ 184 (186)
T PF05283_consen 158 FDAASFIGGIVLTLGVLAIIFFLYKFC 184 (186)
T ss_pred CchhhhhhHHHHHHHHHHHHHHHhhhc
Confidence 346788899999988889999999743
No 29
>PHA03048 IMV membrane protein; Provisional
Probab=42.87 E-value=6.2 Score=30.39 Aligned_cols=29 Identities=34% Similarity=0.641 Sum_probs=22.2
Q ss_pred eeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
=+++++|| .++++.++|.+-|-+|+|+--
T Consensus 44 Ralsii~F-Ilgivl~lG~~ifsmy~r~C~ 72 (93)
T PHA03048 44 RALSGIAF-VLGIVMTIGMLIYSMWGRYCT 72 (93)
T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHhcccC
Confidence 35678887 577888889888989987543
No 30
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=41.47 E-value=9.3 Score=27.58 Aligned_cols=31 Identities=26% Similarity=0.562 Sum_probs=10.8
Q ss_pred echhhhHHHHHHhhhcceeeeeecccCCCCcc
Q psy14481 36 SGISFGALALVTLFGAVSFIMYRRYRWNKPQT 67 (179)
Q Consensus 36 ~g~~~~~~~i~~~~~~~~~~~yrr~~~n~pq~ 67 (179)
.+++.|.++|+.++. +.++..||+.+.|...
T Consensus 4 ~~~~~g~~~ll~~v~-~~~~~~rr~~~~~~~~ 34 (75)
T PF14575_consen 4 ASIIVGVLLLLVLVI-IVIVCFRRCKYSKAKS 34 (75)
T ss_dssp HHHHHHHHHHHHHHH-HHHCCCTT--------
T ss_pred ehHHHHHHHHHHhhe-eEEEEEeeEcCCCCcc
Confidence 344445444444433 3444555555545433
No 31
>PHA02898 virion envelope protein; Provisional
Probab=40.66 E-value=6.7 Score=30.15 Aligned_cols=32 Identities=34% Similarity=0.466 Sum_probs=23.0
Q ss_pred eeeechhhhHHHHHHhhhcceeeeeecccC-CCC
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMYRRYRW-NKP 65 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~-n~p 65 (179)
=+++++|| .++++.++|.+-|-+|+|+-- |+|
T Consensus 45 RalSii~F-Ilgivl~lG~~ifs~y~r~C~~~~~ 77 (92)
T PHA02898 45 RSISIISF-ILAIILILGIIFFKGYNMFCGGNTT 77 (92)
T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHhhhcCCCcc
Confidence 35678887 567788888888888887543 444
No 32
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=39.80 E-value=18 Score=27.07 Aligned_cols=43 Identities=21% Similarity=0.220 Sum_probs=34.5
Q ss_pred hccCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeec
Q psy14481 17 AEDLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
+.+.+.+-++=+.++.+..+||.+|=.++-.++...-|..-|.
T Consensus 18 aq~~~gscs~C~~ls~g~LaGiV~~D~vlTLLIv~~vy~car~ 60 (79)
T PF07213_consen 18 AQTQPGSCSGCYPLSPGLLAGIVAADAVLTLLIVLVVYYCARP 60 (79)
T ss_pred hcCCCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 4566678888899999999999999888877777777777443
No 33
>KOG0123|consensus
Probab=36.23 E-value=19 Score=33.04 Aligned_cols=85 Identities=15% Similarity=0.004 Sum_probs=73.1
Q ss_pred ccccccCCCCCCChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHH
Q psy14481 86 LRAVHVQGQEPLTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAV 165 (179)
Q Consensus 86 ~~~~~~~g~epl~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~v 165 (179)
.+.+++++ ...++......++.+.+||....+|...+...+.....+..+.+.+....|.+-.+.+..+....+++
T Consensus 270 ~~nl~vkn----ld~~~~~e~L~~~f~~~GeI~s~kv~~~~~g~skG~gfV~fs~~eeA~~A~~~~n~~~i~~k~l~vav 345 (369)
T KOG0123|consen 270 GANLYVKN----LDETLSDEKLRKIFSSFGEITSAKVMVDENGKSKGFGFVEFSSPEEAKKAMTEMNGRLIGGKPLYVAV 345 (369)
T ss_pred cccccccc----CccccchhHHHHHHhcccceeeEEEEeccCCCccceEEEEcCCHHHHHHHHHhhChhhhcCCchhhhH
Confidence 36677777 44566777888999999999999999999999999999999999999999999999899999999999
Q ss_pred HHHHHHhHH
Q psy14481 166 LQAHQAKQA 174 (179)
Q Consensus 166 L~~~q~~~~ 174 (179)
.+.++....
T Consensus 346 ~qr~~~r~~ 354 (369)
T KOG0123|consen 346 AQRKEDRRA 354 (369)
T ss_pred Hhhhccchh
Confidence 987766544
No 34
>PHA02902 putative IMV membrane protein; Provisional
Probab=34.08 E-value=18 Score=26.50 Aligned_cols=19 Identities=26% Similarity=0.712 Sum_probs=13.7
Q ss_pred HHHHhhhcceeeeeecccC
Q psy14481 44 ALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 44 ~i~~~~~~~~~~~yrr~~~ 62 (179)
++|.++|-+-|.+|||+.-
T Consensus 11 v~v~Ivclliya~YrR~kc 29 (70)
T PHA02902 11 VIVIIFCLLIYAAYKRYKC 29 (70)
T ss_pred HHHHHHHHHHHHHHHHhcC
Confidence 4556667778889999744
No 35
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=31.92 E-value=20 Score=31.75 Aligned_cols=35 Identities=31% Similarity=0.471 Sum_probs=27.8
Q ss_pred eeeee-chhhhHHHHHHhhhcceeeeeec--ccCCCCc
Q psy14481 32 VAMIS-GISFGALALVTLFGAVSFIMYRR--YRWNKPQ 66 (179)
Q Consensus 32 ~~~i~-g~~~~~~~i~~~~~~~~~~~yrr--~~~n~pq 66 (179)
|.+|+ +|+||++.|+.++|.+-..+.|| .|...|+
T Consensus 229 VVlIslAiALG~v~ll~l~Gii~~~~~r~~~~~~~~p~ 266 (281)
T PF12768_consen 229 VVLISLAIALGTVFLLVLIGIILAYIRRRRQGYVPAPT 266 (281)
T ss_pred EEEEehHHHHHHHHHHHHHHHHHHHHHhhhccCcCCCc
Confidence 66776 89999999999999998888777 3444455
No 36
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=31.83 E-value=15 Score=34.52 Aligned_cols=26 Identities=19% Similarity=0.544 Sum_probs=0.0
Q ss_pred eeechhhhHHHHHHhhhcceeeeeec
Q psy14481 34 MISGISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
.+.|+.+|+++++.+++++.++..||
T Consensus 353 ~~l~vVlgvavlivVv~viv~vc~~~ 378 (439)
T PF02480_consen 353 ALLGVVLGVAVLIVVVGVIVWVCLRC 378 (439)
T ss_dssp --------------------------
T ss_pred chHHHHHHHHHHHHHHHHHhheeeee
Confidence 44555555566666655555555433
No 37
>PF04391 DUF533: Protein of unknown function (DUF533); InterPro: IPR007486 Some family members may be secreted or integral membrane proteins.
Probab=31.10 E-value=17 Score=30.60 Aligned_cols=101 Identities=17% Similarity=0.180 Sum_probs=49.7
Q ss_pred hHHHHHHhhhcceeeeeecccCCCCcccccCCCCCCCCCCCCCccccccccCCCCCCChhHhhcC------ChHHHHHHh
Q psy14481 41 GALALVTLFGAVSFIMYRRYRWNKPQTLSDKCSNPDSSGYIDDSTLRAVHVQGQEPLTSTMLAAA------QPQEQKQML 114 (179)
Q Consensus 41 ~~~~i~~~~~~~~~~~yrr~~~n~pq~YS~~vRNP~s~~~iqDSt~~~~~~~g~epl~~s~La~a------~peeQKq~L 114 (179)
...+.++++|+++|-.|+++--|++. ++...+..-+.+.++..-.-...+.-.|++.+ .++|+..
T Consensus 33 ~~~Gg~AalG~lA~~ayq~~q~~~~~-------~~~~~~~~~~~~~~~~~~~~a~lllrAMIaAAkADG~ID~~Er~~-- 103 (188)
T PF04391_consen 33 LKYGGLAALGGLAYKAYQNWQQNQGA-------APAAEPQPFAQPPPAEAEAHARLLLRAMIAAAKADGHIDEEERQR-- 103 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCC-------CCCCCCCCCCCCCcccccHHHHHHHHHHHHHHHcCCCCCHHHHHH--
Confidence 34456677888888888776544443 11100000011111111112223444555554 4556665
Q ss_pred hhhhhhhHhhh--CCCCCCCceeeccC-CChHHHHHhhcChH
Q psy14481 115 GERLFPLIQRM--YPELAGKITGMLLE-IDNAELLYMLEHNE 153 (179)
Q Consensus 115 GErLyplV~~l--~p~~AgKITGMLLE-md~sELL~mLes~e 153 (179)
|+..+.+. +++...-|.-.|-. +|..+|..-+.||+
T Consensus 104 ---I~~~l~~~g~d~e~~~~l~~eL~~P~d~~~la~~v~~~e 142 (188)
T PF04391_consen 104 ---IEGALQELGLDAEERAWLQAELAAPLDPDALAAAVTDPE 142 (188)
T ss_pred ---HHHHHHHhCCCHHHHHHHHHHHhCCCCHHHHHHhCCCHH
Confidence 45556553 44444444444444 67777777776665
No 38
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=30.17 E-value=26 Score=20.74 Aligned_cols=10 Identities=10% Similarity=0.235 Sum_probs=4.1
Q ss_pred cceeeeeecc
Q psy14481 51 AVSFIMYRRY 60 (179)
Q Consensus 51 ~~~~~~yrr~ 60 (179)
+.++++.|||
T Consensus 23 ~~~~~~~~rk 32 (34)
T TIGR01167 23 LGGLLLRKRK 32 (34)
T ss_pred HHHHHheecc
Confidence 3344444443
No 39
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=29.17 E-value=21 Score=27.30 Aligned_cols=20 Identities=20% Similarity=0.443 Sum_probs=16.3
Q ss_pred HHHHHhhhcceeeeeecccC
Q psy14481 43 LALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 43 ~~i~~~~~~~~~~~yrr~~~ 62 (179)
++++.++|-+-|.+|||...
T Consensus 10 ~V~V~IVclliya~YRR~~i 29 (92)
T PHA02681 10 VIVISIVCYIVIMMYRRSCV 29 (92)
T ss_pred HHHHHHHHHHHHHHHHhccC
Confidence 67777888888999999854
No 40
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=28.95 E-value=19 Score=32.53 Aligned_cols=21 Identities=38% Similarity=0.610 Sum_probs=0.0
Q ss_pred HHHHhhhcceeeeeecccCCC
Q psy14481 44 ALVTLFGAVSFIMYRRYRWNK 64 (179)
Q Consensus 44 ~i~~~~~~~~~~~yrr~~~n~ 64 (179)
+|+.+.|++..+.||||+-.|
T Consensus 157 ~iLLIA~iIa~icyrrkR~GK 177 (290)
T PF05454_consen 157 AILLIAGIIACICYRRKRKGK 177 (290)
T ss_dssp ---------------------
T ss_pred HHHHHHHHHHHHhhhhhhccc
Confidence 344457788889999987644
No 41
>PF14851 FAM176: FAM176 family
Probab=28.45 E-value=43 Score=27.67 Aligned_cols=18 Identities=28% Similarity=0.455 Sum_probs=14.9
Q ss_pred eeechhhhHHHHHHhhhc
Q psy14481 34 MISGISFGALALVTLFGA 51 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~ 51 (179)
.++|+|+|-+..+|++++
T Consensus 26 Fv~gVC~GLlLtLcllV~ 43 (153)
T PF14851_consen 26 FVSGVCAGLLLTLCLLVI 43 (153)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 478999999988888844
No 42
>PHA02680 ORF090 IMV phosphorylated membrane protein; Provisional
Probab=27.82 E-value=20 Score=27.53 Aligned_cols=26 Identities=35% Similarity=0.571 Sum_probs=20.2
Q ss_pred eeeechhhhHHHHHHhhhcceeeeeec
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
=+++++|| .++++.+.|.+.|-+||+
T Consensus 46 RalSii~F-IlG~vl~lGilifs~y~~ 71 (91)
T PHA02680 46 RALSVTCF-IVGAVLLLGLFVFSMYRK 71 (91)
T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHhcc
Confidence 45688887 567788888888888984
No 43
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=27.30 E-value=70 Score=26.69 Aligned_cols=10 Identities=10% Similarity=0.610 Sum_probs=4.5
Q ss_pred ceeeeeechh
Q psy14481 30 VNVAMISGIS 39 (179)
Q Consensus 30 ~~~~~i~g~~ 39 (179)
+-+|.+-|++
T Consensus 50 IVIGvVVGVG 59 (154)
T PF04478_consen 50 IVIGVVVGVG 59 (154)
T ss_pred EEEEEEeccc
Confidence 3345444443
No 44
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=26.98 E-value=21 Score=32.38 Aligned_cols=32 Identities=25% Similarity=0.461 Sum_probs=0.0
Q ss_pred HHHhhhcceeeeeecccCCCCcccccCCCCCC
Q psy14481 45 LVTLFGAVSFIMYRRYRWNKPQTLSDKCSNPD 76 (179)
Q Consensus 45 i~~~~~~~~~~~yrr~~~n~pq~YS~~vRNP~ 76 (179)
++..+|++.|.-||||..-.||+-...+.-|.
T Consensus 238 VLLaVGGLLfYr~rrRs~~e~q~~d~~~~~Pe 269 (285)
T PF05337_consen 238 VLLAVGGLLFYRRRRRSHREPQTVDSPMEQPE 269 (285)
T ss_dssp --------------------------------
T ss_pred hhhhccceeeecccccccccccccCCcccCCC
Confidence 34566777776666666677887776666553
No 45
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=25.57 E-value=27 Score=25.51 Aligned_cols=17 Identities=29% Similarity=0.635 Sum_probs=12.6
Q ss_pred HHHHHhhhcceeeeeec
Q psy14481 43 LALVTLFGAVSFIMYRR 59 (179)
Q Consensus 43 ~~i~~~~~~~~~~~yrr 59 (179)
++++..++.+.||++||
T Consensus 75 ~~~~~f~~~v~yI~~rR 91 (92)
T PF03908_consen 75 FAFLFFLLVVLYILWRR 91 (92)
T ss_pred HHHHHHHHHHHHHhhhc
Confidence 45566667788999987
No 46
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=25.52 E-value=16 Score=28.59 Aligned_cols=27 Identities=30% Similarity=0.688 Sum_probs=16.5
Q ss_pred hhhhHHHHHHhhhcceeee-------eecccCCC
Q psy14481 38 ISFGALALVTLFGAVSFIM-------YRRYRWNK 64 (179)
Q Consensus 38 ~~~~~~~i~~~~~~~~~~~-------yrr~~~n~ 64 (179)
..+|++++...++.++|++ ||||+++|
T Consensus 120 ~~~G~~i~~~v~~~i~Y~l~~~~~~~~r~~r~~~ 153 (154)
T PF09835_consen 120 FLLGSLILGIVLGIISYFLVYFLVRKYRKRRRKR 153 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 3456666666666666665 77776643
No 47
>PF05795 Plasmodium_Vir: Plasmodium vivax Vir protein; InterPro: IPR008780 This family consists of several Vir proteins specific to the genus Plasmodium and Plasmodium vivax in particular. The vir genes are present at about 600-1,000 copies per haploid genome and encode proteins that are immunovariant in natural infections, indicating that they may have a functional role in establishing chronic infection through antigenic variation [].
Probab=25.22 E-value=33 Score=29.16 Aligned_cols=23 Identities=17% Similarity=0.260 Sum_probs=13.2
Q ss_pred eechhhhHHHHHHhhhcceeeeee
Q psy14481 35 ISGISFGALALVTLFGAVSFIMYR 58 (179)
Q Consensus 35 i~g~~~~~~~i~~~~~~~~~~~yr 58 (179)
++++. ++++.+..+.+++|+||+
T Consensus 281 ~~~~v-~~~~~~~G~~~~~f~LYK 303 (354)
T PF05795_consen 281 FSTSV-SPVLSVLGIPLIFFLLYK 303 (354)
T ss_pred cccch-hhhhhhHHHHHHHHHHhc
Confidence 34444 334444455567788888
No 48
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=24.55 E-value=25 Score=33.16 Aligned_cols=45 Identities=11% Similarity=0.158 Sum_probs=0.0
Q ss_pred cCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccCC
Q psy14481 19 DLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRWN 63 (179)
Q Consensus 19 ~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n 63 (179)
..++.+...+..-++++.|+..+.++++.+++...+.-.||+..+
T Consensus 342 ~~p~~~~~~~~~~l~vVlgvavlivVv~viv~vc~~~rrrR~~~~ 386 (439)
T PF02480_consen 342 APPSPRTSRGAALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQRD 386 (439)
T ss_dssp ---------------------------------------------
T ss_pred CCCCCCCCcccchHHHHHHHHHHHHHHHHHhheeeeehhcccccc
Confidence 334445555566577777877777777777776666666655554
No 49
>PF11897 DUF3417: Protein of unknown function (DUF3417); InterPro: IPR024517 This domain of unknown function is found at the N terminus of members of the glycogen phosphorylase family.
Probab=24.42 E-value=1e+02 Score=24.04 Aligned_cols=43 Identities=12% Similarity=0.191 Sum_probs=36.0
Q ss_pred CCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHHHHhH
Q psy14481 130 AGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAHQAKQ 173 (179)
Q Consensus 130 AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~q~~~ 173 (179)
-..-.-||-+++.+.|-.+.+|++ +..+++++++-++.+...+
T Consensus 48 ~~NPv~~L~~vs~~rL~~la~D~~-fl~~~~~v~~~f~~Ym~~~ 90 (118)
T PF11897_consen 48 GHNPVRLLQEVSQERLEELAQDPE-FLARYDRVYARFEEYMSQK 90 (118)
T ss_pred CCCHHHHHHHCCHHHHHHHhCCHH-HHHHHHHHHHHHHHHHcCC
Confidence 344556889999999999999997 8889999999999987763
No 50
>PF06809 NPDC1: Neural proliferation differentiation control-1 protein (NPDC1); InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=24.19 E-value=1.6e+02 Score=27.55 Aligned_cols=34 Identities=18% Similarity=0.190 Sum_probs=20.2
Q ss_pred CCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecc
Q psy14481 24 WSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRY 60 (179)
Q Consensus 24 ~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~ 60 (179)
.+.++++-+++|++.|+++++ ++...+|.+||-.
T Consensus 192 ~~~~d~l~lv~Iv~~cvaG~a---Aliva~~cW~Rlq 225 (341)
T PF06809_consen 192 GSHGDGLTLVLIVVCCVAGAA---ALIVAGYCWYRLQ 225 (341)
T ss_pred CCCCCCeeeehhHHHHHHHHH---HHHHhhheEEEec
Confidence 334566678888888886554 2233445556543
No 51
>PF07235 DUF1427: Protein of unknown function (DUF1427); InterPro: IPR009872 This family consists of several bacterial proteins of around 100 residues in length. The function of this family is unknown.
Probab=23.52 E-value=44 Score=25.67 Aligned_cols=27 Identities=33% Similarity=0.602 Sum_probs=20.9
Q ss_pred HhhhhhhhhHhhh-----------CCCCCCCceeeccC
Q psy14481 113 MLGERLFPLIQRM-----------YPELAGKITGMLLE 139 (179)
Q Consensus 113 ~LGErLyplV~~l-----------~p~~AgKITGMLLE 139 (179)
++||+++|.+++. +.+-+.+|+|.|=.
T Consensus 39 ~~Ge~~~p~~~~~~~~~~~~~a~~~~~c~~hv~G~lp~ 76 (90)
T PF07235_consen 39 LLGEQAIPLVRRLLRGHPLTAAWFHAECVPHVFGQLPG 76 (90)
T ss_pred hcccchhHHHHHHhcCCchhhhhhhccccccccCCCCC
Confidence 6899999999876 33567888888754
No 52
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=23.26 E-value=37 Score=22.78 Aligned_cols=24 Identities=8% Similarity=0.249 Sum_probs=18.0
Q ss_pred echhhhHHHHHHhhhcceeeeeec
Q psy14481 36 SGISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 36 ~g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
+...+++++.+.++|..+|.+|.-
T Consensus 3 ~a~~~~i~i~~~lv~~Tgy~iYta 26 (43)
T PF02468_consen 3 TATVLAIFISCLLVSITGYAIYTA 26 (43)
T ss_pred ceeeHHHHHHHHHHHHHhhhhhhe
Confidence 345667777777888888998875
No 53
>PF02962 CHMI: 5-carboxymethyl-2-hydroxymuconate isomerase; InterPro: IPR004220 5-carboxymethyl-2-hydroxymuconate isomerase transforms 5-carboxymethyl-2-hydroxy-muconic acid into 5-oxo-pent-3-ene-1,2,5-tricarboxylic acid during the third step of the homoprotocatechuate catabolic pathway []. Homoprotocatechuate (HPC; 3,4-dihydroxyphenylacetate) is catabolized to Krebs cycle intermediates via extradiol (meta-) cleavage and the necessary enzymes are chromosomally encoded in a variety of bacteria []. 5-carboxymethyl-2-hydroxymuconate isomerase is probably a dimer of two identical subunits []. A comparison of the N-terminal half of the isomerase/decarboxylase sequence from the pathway (both encoded by the gene hpcE), with the second half showed significant similarity. This suggests that a duplication may have occurred to produce a bifunctional gene [].; PDB: 3E6Q_H 1OTG_B.
Probab=23.17 E-value=66 Score=25.43 Aligned_cols=42 Identities=29% Similarity=0.416 Sum_probs=28.4
Q ss_pred ChhHhhcCChHHHHHHhhhhhhhhHhhhCCC-CCCCceeeccCC
Q psy14481 98 TSTMLAAAQPQEQKQMLGERLFPLIQRMYPE-LAGKITGMLLEI 140 (179)
Q Consensus 98 ~~s~La~a~peeQKq~LGErLyplV~~l~p~-~AgKITGMLLEm 140 (179)
+...++.= .++||+.|||+||..+.+..+. .++.-.+.-+|+
T Consensus 64 ~l~il~GR-s~e~k~~l~~~l~~~l~~~~~~~~~~~~~~LsvEi 106 (124)
T PF02962_consen 64 TLRILAGR-SEEQKKALSEALLAVLKAHLAPLFAQRYLQLSVEI 106 (124)
T ss_dssp EEEEETT---HHHHHHHHHHHHHHHHHHCCCHCCHSEEEEEEEE
T ss_pred EeeecCCC-CHHHHHHHHHHHHHHHHHHhhHhhcCCeeEEEEEE
Confidence 44444444 4568999999999999887554 566666665654
No 54
>PRK11638 lipopolysaccharide biosynthesis protein WzzE; Provisional
Probab=22.95 E-value=64 Score=29.39 Aligned_cols=22 Identities=18% Similarity=0.128 Sum_probs=17.3
Q ss_pred hhHHHHHHhhhcceeeeeeccc
Q psy14481 40 FGALALVTLFGAVSFIMYRRYR 61 (179)
Q Consensus 40 ~~~~~i~~~~~~~~~~~yrr~~ 61 (179)
+-..++++++.|+++++.||++
T Consensus 320 lil~~llG~~lg~~~vL~r~~~ 341 (342)
T PRK11638 320 MIMWGAVGALVGAGVALTRRRR 341 (342)
T ss_pred HHHHHHHHHHHHheeeEeecCC
Confidence 4455677788899999999975
No 55
>PF13396 PLDc_N: Phospholipase_D-nuclease N-terminal
Probab=22.86 E-value=75 Score=20.05 Aligned_cols=22 Identities=9% Similarity=0.408 Sum_probs=17.2
Q ss_pred hhhHHHHHHhhhcceeeeeecc
Q psy14481 39 SFGALALVTLFGAVSFIMYRRY 60 (179)
Q Consensus 39 ~~~~~~i~~~~~~~~~~~yrr~ 60 (179)
-+..+.++-.+|.+.|++.+||
T Consensus 25 W~~~i~~~P~iG~i~Yl~~gr~ 46 (46)
T PF13396_consen 25 WLIVILFFPIIGPILYLIFGRK 46 (46)
T ss_pred HHHHHHHHHHHHHhheEEEeCC
Confidence 3455666888999999998885
No 56
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=22.80 E-value=31 Score=22.55 Aligned_cols=23 Identities=22% Similarity=0.418 Sum_probs=16.4
Q ss_pred hhhHHHHHHhhhcceeeeeeccc
Q psy14481 39 SFGALALVTLFGAVSFIMYRRYR 61 (179)
Q Consensus 39 ~~~~~~i~~~~~~~~~~~yrr~~ 61 (179)
+++.+.++..|.++.+..||++.
T Consensus 12 ~~~~v~~~~~F~gi~~w~~~~~~ 34 (49)
T PF05545_consen 12 SIGTVLFFVFFIGIVIWAYRPRN 34 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHcccc
Confidence 45667777777777777787764
No 57
>PF14283 DUF4366: Domain of unknown function (DUF4366)
Probab=22.77 E-value=77 Score=27.37 Aligned_cols=24 Identities=25% Similarity=0.346 Sum_probs=13.7
Q ss_pred HHHHhhhcceeeeeecccCCCCccc
Q psy14481 44 ALVTLFGAVSFIMYRRYRWNKPQTL 68 (179)
Q Consensus 44 ~i~~~~~~~~~~~yrr~~~n~pq~Y 68 (179)
++|.++|+ |.+.|.+-+..|.+..
T Consensus 167 llv~l~gG-Ga~yYfK~~K~K~~~~ 190 (218)
T PF14283_consen 167 LLVALIGG-GAYYYFKFYKPKQEEK 190 (218)
T ss_pred HHHHHhhc-ceEEEEEEeccccccc
Confidence 33334444 6667777666666655
No 58
>PRK10905 cell division protein DamX; Validated
Probab=22.54 E-value=37 Score=31.38 Aligned_cols=19 Identities=21% Similarity=0.371 Sum_probs=17.5
Q ss_pred hhhhHHHHHHhhhcceeee
Q psy14481 38 ISFGALALVTLFGAVSFIM 56 (179)
Q Consensus 38 ~~~~~~~i~~~~~~~~~~~ 56 (179)
|++|++|||.++-+||+.|
T Consensus 2 iGiGilVLlLLIigIgSAL 20 (328)
T PRK10905 2 MGVGILVLLLLIIGIGSAL 20 (328)
T ss_pred cchhHHHHHHHHHHHhHhh
Confidence 5789999999999999998
No 59
>PRK11556 multidrug efflux system subunit MdtA; Provisional
Probab=22.24 E-value=20 Score=32.75 Aligned_cols=17 Identities=6% Similarity=0.311 Sum_probs=10.9
Q ss_pred hhhCCCCCCCceeeccC
Q psy14481 123 QRMYPELAGKITGMLLE 139 (179)
Q Consensus 123 ~~l~p~~AgKITGMLLE 139 (179)
-.+.+...|+|+-...+
T Consensus 88 v~v~~~vsG~V~~i~v~ 104 (415)
T PRK11556 88 VTVRSRVDGQLMALHFQ 104 (415)
T ss_pred EEEEccccEEEEEEECC
Confidence 34567777888776433
No 60
>KOG0077|consensus
Probab=22.08 E-value=39 Score=29.11 Aligned_cols=30 Identities=40% Similarity=0.585 Sum_probs=19.0
Q ss_pred HHhhhhhhhhHhhh---CCCCCCCceeeccCCChHHHHHhhcChH
Q psy14481 112 QMLGERLFPLIQRM---YPELAGKITGMLLEIDNAELLYMLEHNE 153 (179)
Q Consensus 112 q~LGErLyplV~~l---~p~~AgKITGMLLEmd~sELL~mLes~e 153 (179)
|+|| ||-+-..+ .=|+|||-| |||||.|+.
T Consensus 12 q~Lg--L~kK~gKllFlGLDNAGKTT----------LLHMLKdDr 44 (193)
T KOG0077|consen 12 QFLG--LYKKFGKLLFLGLDNAGKTT----------LLHMLKDDR 44 (193)
T ss_pred HHHH--HhccCceEEEEeecCCchhh----------HHHHHcccc
Confidence 5666 66554433 345566644 899999864
No 61
>PHA03286 envelope glycoprotein E; Provisional
Probab=21.94 E-value=23 Score=34.31 Aligned_cols=31 Identities=32% Similarity=0.551 Sum_probs=19.4
Q ss_pred eeechhhhHHHHHHhhhcceeeeeecccCCC
Q psy14481 34 MISGISFGALALVTLFGAVSFIMYRRYRWNK 64 (179)
Q Consensus 34 ~i~g~~~~~~~i~~~~~~~~~~~yrr~~~n~ 64 (179)
+|+-+.+|...+|.+++...-.+||||+-.|
T Consensus 392 l~~s~~~~~~~~~~~~~~~~~~~~~r~~~~r 422 (492)
T PHA03286 392 LVSSMAAGAILVVLLFALCIAGLYRRRRRHR 422 (492)
T ss_pred HHHHHHHHHHHHHHHHHHHhHhHhhhhhhhh
Confidence 3455555666666666666667788866544
No 62
>KOG1236|consensus
Probab=21.75 E-value=84 Score=30.93 Aligned_cols=63 Identities=25% Similarity=0.283 Sum_probs=53.5
Q ss_pred CChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHH
Q psy14481 97 LTSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQ 167 (179)
Q Consensus 97 l~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~ 167 (179)
+++-+...+++++|++.|+ ||--|.+.+--.||+.+ ||.+.. .-.-|||.+..||++++.-.+
T Consensus 430 lDtGiv~kLs~qn~~N~l~--lF~av~~~~Gy~ag~lm---l~~Sr~---q~~~d~e~F~~kvEklv~~~~ 492 (565)
T KOG1236|consen 430 LDTGIVTKLSHQNQMNLLS--LFRAVARFDGYDAGELM---LERSRQ---QQCIDPERFARKVEKLVLKAR 492 (565)
T ss_pred ecchhhhccChhhHHHHHH--HHHHHHHhcchhHHHHH---Hhhccc---ccCCCHHHHHHHHHHHHHHhh
Confidence 6777889999999999999 99999999888888864 676665 567899999999999987655
No 63
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=21.66 E-value=30 Score=28.81 Aligned_cols=33 Identities=21% Similarity=0.343 Sum_probs=21.6
Q ss_pred ceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 30 VNVAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 30 ~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
+.++.|++++++.++.+.++..+.++-|||+.|
T Consensus 3 ~r~~~i~~s~lSlls~~~~l~~~~~~~~r~~~f 35 (201)
T PF11710_consen 3 LRIGSIVISALSLLSSLAVLYLFFFITYRRRSF 35 (201)
T ss_pred eeeehHHHHHHHHHHHHHHHHHHHhhhhhhhhH
Confidence 346777777777776665555555555777766
No 64
>KOG4836|consensus
Probab=21.66 E-value=62 Score=28.36 Aligned_cols=23 Identities=22% Similarity=0.494 Sum_probs=19.4
Q ss_pred ceeeeeechhhhHHHHHHhhhcc
Q psy14481 30 VNVAMISGISFGALALVTLFGAV 52 (179)
Q Consensus 30 ~~~~~i~g~~~~~~~i~~~~~~~ 52 (179)
.-+..|+||++.+++|.++++-+
T Consensus 81 ~~~iVI~GiGv~g~~iY~i~~El 103 (215)
T KOG4836|consen 81 YYIIVIAGIGVTGAFIYAIFGEL 103 (215)
T ss_pred eeeeeeeeccHHHHhHHHHHHHH
Confidence 33788999999999999998854
No 65
>PRK15031 5-carboxymethyl-2-hydroxymuconate delta-isomerase; Provisional
Probab=21.29 E-value=57 Score=25.91 Aligned_cols=41 Identities=29% Similarity=0.385 Sum_probs=28.0
Q ss_pred ChhHhhcCChHHHHHHhhhhhhhhHhhhCCCCCCCc-eeeccC
Q psy14481 98 TSTMLAAAQPQEQKQMLGERLFPLIQRMYPELAGKI-TGMLLE 139 (179)
Q Consensus 98 ~~s~La~a~peeQKq~LGErLyplV~~l~p~~AgKI-TGMLLE 139 (179)
+...++.= .++||+.||++||..+.+.-+...++= .+.-+|
T Consensus 65 ~l~i~~GR-s~e~k~~l~~~l~~~l~~~~~~~~~~~~~~LS~E 106 (126)
T PRK15031 65 TLKIGAGR-SLESRQEVGEMLFALIKAHFAALMESRYLALSFE 106 (126)
T ss_pred EeeecCCC-CHHHHHHHHHHHHHHHHHHhhhhhcccceEEEEE
Confidence 44444444 567999999999999988877665444 454444
No 66
>PRK11901 hypothetical protein; Reviewed
Probab=21.21 E-value=48 Score=30.66 Aligned_cols=19 Identities=32% Similarity=0.447 Sum_probs=17.6
Q ss_pred hhhhHHHHHHhhhcceeee
Q psy14481 38 ISFGALALVTLFGAVSFIM 56 (179)
Q Consensus 38 ~~~~~~~i~~~~~~~~~~~ 56 (179)
|+.|++|||.++-+||++|
T Consensus 40 iGiGilVLlLLIi~IgSAL 58 (327)
T PRK11901 40 IGIGILVLLLLIIAIGSAL 58 (327)
T ss_pred HHHHHHHHHHHHHHHhhhc
Confidence 6789999999999999998
No 67
>PF15065 NCU-G1: Lysosomal transcription factor, NCU-G1
Probab=21.11 E-value=53 Score=30.34 Aligned_cols=29 Identities=24% Similarity=0.482 Sum_probs=21.7
Q ss_pred eeeechhhhHHHHHHhhhcceeeeeeccc
Q psy14481 33 AMISGISFGALALVTLFGAVSFIMYRRYR 61 (179)
Q Consensus 33 ~~i~g~~~~~~~i~~~~~~~~~~~yrr~~ 61 (179)
-+|.+++||.=+++.++|++-..+.|+|+
T Consensus 320 i~i~~vgLG~P~l~li~Ggl~v~~~r~r~ 348 (350)
T PF15065_consen 320 IMIMAVGLGVPLLLLILGGLYVCLRRRRK 348 (350)
T ss_pred HHHHHHHhhHHHHHHHHhhheEEEecccc
Confidence 35677888888888888888666666554
No 68
>KOG3540|consensus
Probab=21.08 E-value=46 Score=32.81 Aligned_cols=54 Identities=20% Similarity=0.268 Sum_probs=33.6
Q ss_pred CCCCChhHHhhhhccCCCCCCCCCCceeeeeechhhhHHHHHHhhhcceeeeeecccC
Q psy14481 5 APPDSPRAHQAHAEDLSNPWSGEGGVNVAMISGISFGALALVTLFGAVSFIMYRRYRW 62 (179)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~g~~~~~~~i~~~~~~~~~~~yrr~~~ 62 (179)
+++|++-.-+||.-- +++..-|.+.++|.|...++++|. ++..|++++.|+|.+
T Consensus 523 ~e~d~~~e~~r~~~~---~~~ed~~~s~~av~gllv~~~~i~-tvivisl~mlrkr~y 576 (615)
T KOG3540|consen 523 AEFDEGAEFYRHDLL---PQSEDVGRSASAVIGLLVSAVFIA-TVIVISLVMLRKRQY 576 (615)
T ss_pred CCCCCchhhhhhhhc---cccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHccccc
Confidence 456777777777533 444455667777788555555554 445667777666544
No 69
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=20.87 E-value=1.8e+02 Score=18.91 Aligned_cols=41 Identities=15% Similarity=0.167 Sum_probs=28.7
Q ss_pred hhhhhhhHhhhCCCCCCCceeeccCCChHHHHHhhcChHHHHHHHHHHHHHHHHHH
Q psy14481 115 GERLFPLIQRMYPELAGKITGMLLEIDNAELLYMLEHNESLKSKVEEAVAVLQAHQ 170 (179)
Q Consensus 115 GErLyplV~~l~p~~AgKITGMLLEmd~sELL~mLes~e~L~~KV~EA~~vL~~~q 170 (179)
=....-++...||+..+..+ ..-+..-.+|++|.++|..-.
T Consensus 18 k~~y~~l~~~~HPD~~~~~~---------------~~~~~~~~~i~~Ay~~L~~~~ 58 (64)
T PF00226_consen 18 KKAYRRLSKQYHPDKNSGDE---------------AEAEEKFARINEAYEILSDPE 58 (64)
T ss_dssp HHHHHHHHHHTSTTTGTSTH---------------HHHHHHHHHHHHHHHHHHSHH
T ss_pred HHHHHhhhhccccccchhhh---------------hhhhHHHHHHHHHHHHhCCHH
Confidence 33456777888999866665 333456678999999987543
No 70
>CHL00020 psbN photosystem II protein N
Probab=20.80 E-value=79 Score=21.26 Aligned_cols=23 Identities=9% Similarity=0.130 Sum_probs=18.2
Q ss_pred chhhhHHHHHHhhhcceeeeeec
Q psy14481 37 GISFGALALVTLFGAVSFIMYRR 59 (179)
Q Consensus 37 g~~~~~~~i~~~~~~~~~~~yrr 59 (179)
...++.++.+.++|..+|.+|.-
T Consensus 4 A~~~~i~i~~ll~~~Tgy~iYta 26 (43)
T CHL00020 4 ATLVAIFISGLLVSFTGYALYTA 26 (43)
T ss_pred hhhHHHHHHHHHHHhhheeeeec
Confidence 45677777788889999999864
No 71
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=20.44 E-value=61 Score=31.06 Aligned_cols=31 Identities=23% Similarity=0.368 Sum_probs=19.0
Q ss_pred hhhhHHH---HHHhhhcceeeeeecccCCCCccc
Q psy14481 38 ISFGALA---LVTLFGAVSFIMYRRYRWNKPQTL 68 (179)
Q Consensus 38 ~~~~~~~---i~~~~~~~~~~~yrr~~~n~pq~Y 68 (179)
|.++.++ |+.++...-|||+|||+-|--.+-
T Consensus 388 i~~avl~p~~il~~~~~~~~~~v~rrr~~~~dv~ 421 (436)
T PTZ00208 388 IILAVLVPAIILAIIAVAFFIMVKRRRNSSEDVD 421 (436)
T ss_pred HHHHHHHHHHHHHHHHHHhheeeeeccCCchhcc
Confidence 3445443 444455567888999988654443
No 72
>cd00580 CHMI 5-carboxymethyl-2-hydroxymuconate isomerase (CHMI) is a trimeric enzyme catalyzing the isomerization of the unsaturated ketone 5-(carboxymethyl)-2-hydroxymuconate to 5-(carboxymethyl)-2-oxo-3-hexene-1,6-dionate. This is one step in the homoprotocatechuate pathway, one of the microbial meta-fission pathways that degrade aromatic carbon sources to citric acid cycle intermediates. Despite the structural similarity of CHMI with 4-oxalocrotonate tautomerase (4-OT) and macrophage migration inhibitory factor (MIF), there is no significant sequence similarity among these protein families, and therefore, they are not combined in one hierarchy.
Probab=20.27 E-value=75 Score=23.77 Aligned_cols=37 Identities=24% Similarity=0.347 Sum_probs=28.4
Q ss_pred ChHHHHHHhhhhhhhhHhhhCCCCCC----CceeeccCCCh
Q psy14481 106 QPQEQKQMLGERLFPLIQRMYPELAG----KITGMLLEIDN 142 (179)
Q Consensus 106 ~peeQKq~LGErLyplV~~l~p~~Ag----KITGMLLEmd~ 142 (179)
-.+|||+-|++.|+..+.+.-+...+ .|+=-+-|||.
T Consensus 71 Rs~eqK~~l~~~i~~~l~~~~~~~~~~~~~~~svei~e~~~ 111 (113)
T cd00580 71 RSEEQKQELSEALLAALRAHLAPVFAKRYLSLSVEIRELDP 111 (113)
T ss_pred CCHHHHHHHHHHHHHHHHHhhhhhhhccceEEEEEEEecCC
Confidence 47789999999999999888666544 66666666664
Done!