Query psy17334
Match_columns 188
No_of_seqs 202 out of 1669
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 19:37:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17334.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17334hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.5 2.1E-14 4.4E-19 122.6 4.1 86 49-136 27-112 (423)
2 KOG4220|consensus 99.3 1E-12 2.3E-17 112.6 1.6 75 58-134 31-105 (503)
3 PHA03234 DNA packaging protein 99.2 1.2E-12 2.7E-17 111.7 0.3 77 56-136 31-109 (338)
4 PHA02834 chemokine receptor-li 98.9 2.5E-10 5.3E-15 96.7 1.4 76 56-136 27-102 (323)
5 PHA02638 CC chemokine receptor 98.9 5.1E-10 1.1E-14 98.2 0.6 76 56-136 97-172 (417)
6 PHA03235 DNA packaging protein 98.8 1.8E-09 3.8E-14 94.6 2.2 77 56-136 31-109 (409)
7 PHA03087 G protein-coupled che 98.8 7.1E-09 1.5E-13 87.7 5.2 76 56-135 39-114 (335)
8 PF00001 7tm_1: 7 transmembran 98.2 5.5E-08 1.2E-12 76.9 -4.8 61 74-136 1-61 (257)
9 PF10320 7TM_GPCR_Srsx: Serpen 97.5 9.3E-06 2E-10 66.8 -1.9 47 69-115 2-48 (257)
10 KOG2087|consensus 96.4 0.0022 4.8E-08 54.9 2.8 59 59-117 26-84 (363)
11 PF10324 7TM_GPCR_Srw: Serpent 93.8 0.015 3.2E-07 48.8 -0.4 48 67-115 6-54 (318)
12 PF11710 Git3: G protein-coupl 93.0 0.015 3.2E-07 46.3 -1.7 49 86-136 30-78 (201)
13 PF05462 Dicty_CAR: Slime mold 92.6 0.04 8.7E-07 46.6 0.4 52 59-110 8-59 (303)
14 PF05296 TAS2R: Mammalian tast 92.1 0.11 2.4E-06 43.8 2.5 55 56-110 5-62 (303)
15 PF10328 7TM_GPCR_Srx: Serpent 91.7 0.05 1.1E-06 44.8 -0.1 44 67-110 3-46 (274)
16 PF10317 7TM_GPCR_Srd: Serpent 87.8 0.094 2E-06 43.8 -1.4 49 63-111 4-53 (292)
17 KOG2927|consensus 85.0 0.25 5.4E-06 42.4 -0.3 18 113-130 243-262 (372)
18 PF03344 Daxx: Daxx Family; I 84.0 0.32 7E-06 45.8 0.0 10 91-100 344-353 (713)
19 PF10321 7TM_GPCR_Srt: Serpent 82.6 0.79 1.7E-05 39.0 1.8 54 56-109 31-84 (313)
20 PF04931 DNA_pol_phi: DNA poly 78.5 0.15 3.2E-06 48.6 -4.4 12 101-112 590-601 (784)
21 KOG1832|consensus 73.1 0.62 1.3E-05 44.8 -1.7 6 131-136 1391-1396(1516)
22 PF00002 7tm_2: 7 transmembran 71.8 2.6 5.6E-05 33.7 1.8 47 63-109 6-52 (242)
23 PF14851 FAM176: FAM176 family 67.2 1.1 2.3E-05 34.2 -1.2 30 72-102 4-33 (153)
24 PF09882 DUF2109: Predicted me 64.8 1.8 4E-05 29.0 -0.3 45 70-114 6-50 (78)
25 PF01102 Glycophorin_A: Glycop 64.7 1.7 3.6E-05 31.9 -0.6 9 79-87 85-93 (122)
26 KOG4156|consensus 60.8 2.8 6.1E-05 39.1 0.0 6 110-115 572-577 (1329)
27 PF10327 7TM_GPCR_Sri: Serpent 59.8 16 0.00035 30.7 4.4 57 58-114 9-69 (303)
28 PF02532 PsbI: Photosystem II 58.5 14 0.00029 21.0 2.5 18 58-75 9-26 (36)
29 KOG1834|consensus 57.8 1.2 2.6E-05 41.4 -2.8 7 101-107 872-878 (952)
30 PF08229 SHR3_chaperone: ER me 56.5 13 0.00027 29.6 3.0 14 96-109 130-143 (196)
31 KOG1189|consensus 53.4 2.3 5.1E-05 40.2 -1.7 6 122-127 865-870 (960)
32 PF03839 Sec62: Translocation 53.1 5.2 0.00011 32.5 0.3 10 122-131 175-184 (224)
33 PF03402 V1R: Vomeronasal orga 52.9 6.2 0.00013 32.8 0.8 45 87-135 6-51 (265)
34 CHL00024 psbI photosystem II p 52.6 11 0.00023 21.5 1.4 18 58-75 9-26 (36)
35 PRK02655 psbI photosystem II r 49.9 12 0.00026 21.4 1.4 18 58-75 9-26 (38)
36 PF12606 RELT: Tumour necrosis 47.0 5.5 0.00012 24.5 -0.3 6 68-73 11-16 (50)
37 PF11446 DUF2897: Protein of u 42.6 13 0.00027 23.4 0.9 8 72-79 16-23 (55)
38 PF10856 DUF2678: Protein of u 42.3 97 0.0021 22.5 5.4 16 56-71 30-45 (118)
39 PF10873 DUF2668: Protein of u 42.2 4.4 9.6E-05 30.5 -1.5 34 58-91 62-95 (155)
40 TIGR01477 RIFIN variant surfac 41.9 7.1 0.00015 33.8 -0.5 29 61-89 311-339 (353)
41 PF02009 Rifin_STEVOR: Rifin/s 41.9 5.8 0.00013 33.6 -1.0 27 62-88 258-284 (299)
42 PTZ00046 rifin; Provisional 39.4 8.3 0.00018 33.4 -0.5 28 62-89 317-344 (358)
43 TIGR00869 sec62 protein transl 39.0 12 0.00025 30.6 0.4 17 115-131 174-190 (232)
44 PF11861 DUF3381: Domain of un 37.7 30 0.00066 26.4 2.5 7 101-107 69-75 (159)
45 PF07423 DUF1510: Protein of u 36.2 12 0.00026 30.2 0.0 9 94-102 13-21 (217)
46 PF12191 stn_TNFRSF12A: Tumour 35.6 21 0.00045 26.3 1.2 13 62-74 84-96 (129)
47 PF15039 DUF4530: Domain of un 35.3 13 0.00028 26.1 0.0 6 70-75 55-60 (111)
48 PF06024 DUF912: Nucleopolyhed 33.9 1.6E+02 0.0036 20.5 5.7 30 59-89 64-93 (101)
49 PF10319 7TM_GPCR_Srj: Serpent 33.2 96 0.0021 26.5 5.0 56 60-115 7-65 (310)
50 PF05808 Podoplanin: Podoplani 33.0 14 0.00031 28.3 0.0 21 57-77 129-149 (162)
51 COG5137 Histone chaperone invo 32.2 15 0.00033 29.7 0.0 18 12-29 33-50 (279)
52 PRK13183 psbN photosystem II r 31.2 70 0.0015 19.3 2.7 33 59-91 8-40 (46)
53 CHL00020 psbN photosystem II p 30.9 67 0.0014 19.1 2.6 33 59-91 5-37 (43)
54 PF15330 SIT: SHP2-interacting 30.9 20 0.00044 25.6 0.5 29 59-87 2-30 (107)
55 PF04712 Radial_spoke: Radial 30.7 16 0.00036 33.0 -0.0 11 127-137 344-354 (491)
56 PF02724 CDC45: CDC45-like pro 30.0 3.8 8.2E-05 38.2 -4.3 37 93-129 54-91 (622)
57 PF02724 CDC45: CDC45-like pro 28.9 3.8 8.3E-05 38.1 -4.5 10 77-86 76-85 (622)
58 COG5137 Histone chaperone invo 28.1 24 0.00052 28.6 0.5 10 77-86 93-102 (279)
59 PHA03055 Hypothetical protein; 27.9 1.2E+02 0.0025 20.5 3.7 51 66-116 16-66 (79)
60 CHL00204 ycf1 Ycf1; Provisiona 27.0 67 0.0015 33.7 3.4 12 78-89 184-195 (1832)
61 PF14654 Epiglycanin_C: Mucin, 26.7 90 0.002 22.0 3.1 22 56-77 18-39 (106)
62 PF01299 Lamp: Lysosome-associ 25.8 61 0.0013 27.2 2.6 12 77-88 289-300 (306)
63 KOG4564|consensus 25.7 36 0.00078 30.7 1.2 50 62-111 149-198 (473)
64 PF02468 PsbN: Photosystem II 25.1 76 0.0016 18.9 2.1 16 60-75 6-21 (43)
65 PF06679 DUF1180: Protein of u 24.7 1.5E+02 0.0033 22.8 4.3 8 62-69 97-104 (163)
66 PHA03099 epidermal growth fact 23.3 74 0.0016 23.6 2.3 28 60-87 103-130 (139)
67 PRK14762 membrane protein; Pro 22.9 94 0.002 16.2 1.9 7 65-71 9-15 (27)
68 KOG1172|consensus 22.7 1.3E+02 0.0029 29.2 4.4 43 74-116 648-692 (876)
69 PF08229 SHR3_chaperone: ER me 21.8 71 0.0015 25.4 2.1 12 64-75 65-76 (196)
70 PF10329 DUF2417: Region of un 21.7 46 0.00099 27.2 1.0 37 73-109 85-121 (232)
71 PF10316 7TM_GPCR_Srbc: Serpen 20.9 50 0.0011 27.5 1.1 53 60-112 8-60 (273)
72 PHA03062 putative IMV membrane 20.8 1.6E+02 0.0035 19.8 3.3 51 66-116 17-67 (78)
73 PF04713 Pox_I5: Poxvirus prot 20.7 1.8E+02 0.0039 19.5 3.5 49 68-116 18-66 (77)
74 PHA03283 envelope glycoprotein 20.3 1.1E+02 0.0023 28.1 3.1 36 57-92 397-432 (542)
75 PF07423 DUF1510: Protein of u 20.2 13 0.00029 30.0 -2.4 15 86-100 10-24 (217)
76 PF15050 SCIMP: SCIMP protein 20.2 21 0.00047 26.1 -1.0 8 77-84 25-32 (133)
77 PF05961 Chordopox_A13L: Chord 20.1 46 0.00099 21.8 0.6 18 68-87 10-27 (68)
No 1
>KOG4219|consensus
Probab=99.48 E-value=2.1e-14 Score=122.64 Aligned_cols=86 Identities=26% Similarity=0.402 Sum_probs=77.1
Q ss_pred CCCCCccchhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCccccc
Q psy17334 49 GPQFPAYIRTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMG 128 (188)
Q Consensus 49 ~~~~~~~~~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg 128 (188)
....|.+.+.++.++|+++++++++||++|||++..+|++|+.+|+|++|||+||+++++++.++...+. ....|.||
T Consensus 27 ~f~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~ya--l~~~W~~G 104 (423)
T KOG4219|consen 27 LFVLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYA--LHQEWYFG 104 (423)
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHH--HHhccccc
Confidence 3456677888899999999999999999999999999999999999999999999999999999877644 44889999
Q ss_pred cccccccc
Q psy17334 129 EHISNTQE 136 (188)
Q Consensus 129 ~~~Cki~~ 136 (188)
.++|++..
T Consensus 105 ~f~C~f~n 112 (423)
T KOG4219|consen 105 SFYCRFVN 112 (423)
T ss_pred cceeeecc
Confidence 99999864
No 2
>KOG4220|consensus
Probab=99.27 E-value=1e-12 Score=112.59 Aligned_cols=75 Identities=19% Similarity=0.306 Sum_probs=68.0
Q ss_pred hhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCccccccccccc
Q psy17334 58 TPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNT 134 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki 134 (188)
++++++..++.++.++||++|++.+...|+|++..|||+++||+|||+++.+.+|++.++.+. ++|++|..+|.+
T Consensus 31 v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lm--g~W~LG~~~Cdl 105 (503)
T KOG4220|consen 31 VFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLM--GYWPLGPLVCDL 105 (503)
T ss_pred EeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHH--cccccchHHHHH
Confidence 346677788899999999999999999999999999999999999999999999999886544 899999999975
No 3
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.24 E-value=1.2e-12 Score=111.65 Aligned_cols=77 Identities=14% Similarity=0.261 Sum_probs=62.9
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEe--eecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCcccccccccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVI--LKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISN 133 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i--~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Ck 133 (188)
.+.++.++|.+++++|++||++|++++ .+++++|+++|+|++|||++|++++++ +|+.++. ....|+||..+||
T Consensus 31 ~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~-lp~~~~~---~~~~w~fG~~lCk 106 (338)
T PHA03234 31 AQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLC-VFFLTLN---REALFNFNQAFCQ 106 (338)
T ss_pred HHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHH-HHHHHHH---HhCCccCchhHHH
Confidence 567788999999999999999999854 455677899999999999999999875 5554432 2357999999999
Q ss_pred ccc
Q psy17334 134 TQE 136 (188)
Q Consensus 134 i~~ 136 (188)
+..
T Consensus 107 ~~~ 109 (338)
T PHA03234 107 CVL 109 (338)
T ss_pred HHH
Confidence 853
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=98.94 E-value=2.5e-10 Score=96.66 Aligned_cols=76 Identities=21% Similarity=0.500 Sum_probs=61.3
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCcccccccccccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNTQ 135 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki~ 135 (188)
.+.++.+++.+++++|++||+++++++.+++++ +++|+|++|||++|++. ++++|+.+... ...|.+|...|++.
T Consensus 27 ~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~~-~~~n~~i~nLAiaDll~-~~~lP~~i~~~---~~~w~~g~~~C~~~ 101 (323)
T PHA02834 27 VNYFVIVFYILLFIFGLIGNVLVIAVLIVKRFM-FVVDVYLFNIAMSDLML-VFSFPFIIHND---LNEWIFGEFMCKLV 101 (323)
T ss_pred hhhhHHHHHHHHHHHHHhhHHHHHHHHHhcccc-chhhhhhHHHHHHHHHH-HHHHHHHHHHH---cCCcCCcchHHHhH
Confidence 455788999999999999999999888776654 56899999999999986 55688765422 24699999999975
Q ss_pred c
Q psy17334 136 E 136 (188)
Q Consensus 136 ~ 136 (188)
.
T Consensus 102 ~ 102 (323)
T PHA02834 102 L 102 (323)
T ss_pred H
Confidence 3
No 5
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=98.86 E-value=5.1e-10 Score=98.17 Aligned_cols=76 Identities=22% Similarity=0.536 Sum_probs=62.3
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCcccccccccccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNTQ 135 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki~ 135 (188)
...++.+++.+++++|++||+++++++. +|++|++++++++|||++|++.++ .+|+.+... .+.|.||...|++.
T Consensus 97 ~~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~l-~lPf~i~~~---~~~W~fg~~~Ck~~ 171 (417)
T PHA02638 97 ISEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFVI-DFPFIIYNE---FDQWIFGDFMCKVI 171 (417)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHHH-HHHHHHHHH---hccccccccchhhH
Confidence 4456778899999999999999997665 488899999999999999998754 588776422 36799999999985
Q ss_pred c
Q psy17334 136 E 136 (188)
Q Consensus 136 ~ 136 (188)
.
T Consensus 172 ~ 172 (417)
T PHA02638 172 S 172 (417)
T ss_pred H
Confidence 3
No 6
>PHA03235 DNA packaging protein UL33; Provisional
Probab=98.81 E-value=1.8e-09 Score=94.59 Aligned_cols=77 Identities=17% Similarity=0.236 Sum_probs=56.9
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEeee-cCc-CCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCcccccccccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVILK-SKD-MRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISN 133 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~~-~~~-l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Ck 133 (188)
.+.+..+++.+++++|++||++|++++.+ +|+ .++..++|++|||++|++. ++++|+.+. .....|.+|...|+
T Consensus 31 ~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~---~~~~~~~~g~~~Ck 106 (409)
T PHA03235 31 ARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIML---SNQGLLSGSVAGCK 106 (409)
T ss_pred hHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHH---hcCccccCCCCeeh
Confidence 56688899999999999999999987543 332 2356689999999999986 456887653 12233445678999
Q ss_pred ccc
Q psy17334 134 TQE 136 (188)
Q Consensus 134 i~~ 136 (188)
+..
T Consensus 107 ~~~ 109 (409)
T PHA03235 107 FAS 109 (409)
T ss_pred hHH
Confidence 853
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=98.78 E-value=7.1e-09 Score=87.66 Aligned_cols=76 Identities=28% Similarity=0.514 Sum_probs=63.1
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCcccccccccccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNTQ 135 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki~ 135 (188)
.+.+..+++.+++++|++||+++++++.++ ++|++.++|+.|||++|++.++. .|..+. ......|.+|...|++.
T Consensus 39 ~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~~-~~~~~~--~~~~~~~~~~~~~C~~~ 114 (335)
T PHA03087 39 NSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVMT-LPFQIY--YYILFQWSFGEFACKIV 114 (335)
T ss_pred hhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHHh-HHHHHH--HHhCCCCCCCcHHHHHH
Confidence 455788888999999999999999998887 88999999999999999987764 665542 33447899999999864
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=98.17 E-value=5.5e-08 Score=76.86 Aligned_cols=61 Identities=38% Similarity=0.659 Sum_probs=51.9
Q ss_pred cccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCccccccccccccc
Q psy17334 74 GNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNTQE 136 (188)
Q Consensus 74 GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki~~ 136 (188)
||+++++++.++|++|++.++|++|||++|++.++...|..+..... ..|.++...|++..
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~--~~~~~~~~~C~~~~ 61 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLF--DDWIFSSFLCRIFG 61 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHH--SSCTSHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhcccccccccccccc--cccccccccccccc
Confidence 89999999999999999999999999999999999888876654332 68999999998753
No 9
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=97.53 E-value=9.3e-06 Score=66.80 Aligned_cols=47 Identities=34% Similarity=0.618 Sum_probs=43.2
Q ss_pred HHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceee
Q psy17334 69 CIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVL 115 (188)
Q Consensus 69 l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i 115 (188)
++|++||+++++++.++|++|+|+++++..+|++|++.++..+|...
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~ 48 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFML 48 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHH
Confidence 57999999999999999999999999999999999999887777665
No 10
>KOG2087|consensus
Probab=96.44 E-value=0.0022 Score=54.94 Aligned_cols=59 Identities=22% Similarity=0.280 Sum_probs=47.2
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeEE
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLVE 117 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~ 117 (188)
++-+..-++.+++++||++|++.+...+...+...+++.|||++|+++++....+..++
T Consensus 26 ~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD 84 (363)
T KOG2087|consen 26 ILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVD 84 (363)
T ss_pred eeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhh
Confidence 34455566778899999999999888888788899999999999999988755554443
No 11
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=93.82 E-value=0.015 Score=48.79 Aligned_cols=48 Identities=33% Similarity=0.475 Sum_probs=38.9
Q ss_pred HHHHHhhcccceeeEeeecCcCCC-cchhHhHHhHHHHHhhhhhccceee
Q psy17334 67 ILCIGVLGNIMVPCVILKSKDMRN-STNIFLMNLSIADLMVLLVCTPTVL 115 (188)
Q Consensus 67 i~l~gl~GN~lvi~~i~~~~~l~~-~~~~fi~nLa~aDLl~~l~~~P~~i 115 (188)
+.++|+++|++.++++.| |.+|+ ++|.+++.+|++|++..+..++..+
T Consensus 6 ~~~~g~~~N~~h~~VLtr-k~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~ 54 (318)
T PF10324_consen 6 LSIFGLFINIFHLIVLTR-KSMRSSSINILLIGIAICDLLYMLSILIWEL 54 (318)
T ss_pred EeHHHHHHHHHHhhhcCC-hhhhcCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457899999888887765 55564 8999999999999999887766554
No 12
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=92.97 E-value=0.015 Score=46.33 Aligned_cols=49 Identities=14% Similarity=0.100 Sum_probs=32.7
Q ss_pred CcCCCcchhHhHHhHHHHHhhhhhccceeeEEeccCCCccccccccccccc
Q psy17334 86 KDMRNSTNIFLMNLSIADLMVLLVCTPTVLVEVNSKPETWQMGEHISNTQE 136 (188)
Q Consensus 86 ~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~~~~~~~~~W~fg~~~Cki~~ 136 (188)
++++...+.+++||.++|++.++..+...+..+ ..+.-.-+...|.+++
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l--~~~~i~~~s~~C~aqG 78 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWL--ARGGIIAPSPFCQAQG 78 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hcCCeeCCCCchhhhH
Confidence 455667788999999999999987554222211 2233445667888865
No 13
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=92.57 E-value=0.04 Score=46.60 Aligned_cols=52 Identities=19% Similarity=0.296 Sum_probs=43.6
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhc
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVC 110 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~ 110 (188)
.+.++..+..+++++|-++++..+++.|++|++.+-++.-++++|++..+..
T Consensus 8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~ 59 (303)
T PF05462_consen 8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVAS 59 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHH
Confidence 4556666778888999999999999999999999999999999999866543
No 14
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=92.06 E-value=0.11 Score=43.76 Aligned_cols=55 Identities=18% Similarity=0.299 Sum_probs=41.6
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEee---ecCcCCCcchhHhHHhHHHHHhhhhhc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVIL---KSKDMRNSTNIFLMNLSIADLMVLLVC 110 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~---~~~~l~~~~~~fi~nLa~aDLl~~l~~ 110 (188)
...+..++.++.+++|++||+.++.+.. ++++.-.|.+..+.+||++.++.....
T Consensus 5 ~~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~ 62 (303)
T PF05296_consen 5 LEIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVI 62 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHH
Confidence 4556778888889999999999876654 223334588999999999988766543
No 15
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=91.69 E-value=0.05 Score=44.83 Aligned_cols=44 Identities=34% Similarity=0.514 Sum_probs=39.5
Q ss_pred HHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhc
Q psy17334 67 ILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVC 110 (188)
Q Consensus 67 i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~ 110 (188)
+.++|++.|.++++.+.+.+++|++.+++-.+.|++|.+.+++.
T Consensus 3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~ 46 (274)
T PF10328_consen 3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLIF 46 (274)
T ss_pred eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHHH
Confidence 45789999999999999999999999999999999999988753
No 16
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=87.84 E-value=0.094 Score=43.75 Aligned_cols=49 Identities=22% Similarity=0.404 Sum_probs=37.6
Q ss_pred HHHHHHHHHhhcccceeeEeeec-CcCCCcchhHhHHhHHHHHhhhhhcc
Q psy17334 63 LCSIILCIGVLGNIMVPCVILKS-KDMRNSTNIFLMNLSIADLMVLLVCT 111 (188)
Q Consensus 63 ~~~~i~l~gl~GN~lvi~~i~~~-~~l~~~~~~fi~nLa~aDLl~~l~~~ 111 (188)
++.+++.+|++.|+++++++.++ ++.-+...+++.|-|+.|++.++...
T Consensus 4 ~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~~~ 53 (292)
T PF10317_consen 4 YHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIISAF 53 (292)
T ss_pred eHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677889999999999887655 33334568999999999999876543
No 17
>KOG2927|consensus
Probab=84.96 E-value=0.25 Score=42.39 Aligned_cols=18 Identities=6% Similarity=0.021 Sum_probs=10.3
Q ss_pred eeeEEeccC--CCccccccc
Q psy17334 113 TVLVEVNSK--PETWQMGEH 130 (188)
Q Consensus 113 ~~i~~~~~~--~~~W~fg~~ 130 (188)
+.++++++. .+-|.|.+.
T Consensus 243 F~I~~il~~g~~g~W~FPNL 262 (372)
T KOG2927|consen 243 FGITWILTGGKHGFWLFPNL 262 (372)
T ss_pred HHHHHHHhCCCCceEeccch
Confidence 333434445 578887765
No 18
>PF03344 Daxx: Daxx Family; InterPro: IPR005012 Daxx is a ubiquitously expressed protein that functions, in part, as a transcriptional co-repressor through its interaction with a growing number of nuclear, DNA-associated proteins. Human Daxx contains four structural domains commonly found in transcriptional regulatory proteins: two predicted paired amphipathic helices, an acid-rich domain and a Ser/Pro/Thr (SPT)-rich domain. The post-translational modification status of the SPT-domain of hDaxx regulates its association with transcription factors such as Pax3 and ETS-1, effectively bringing hDaxx to sites of active transcription. Through its presence at the site of active transcription, hDaxx could then be able to associate with acetylated histones present in the nucleosomes and Dek that is associated with chromatin. Through its association with the SPT-domain of hDaxx, histone deacetylases may also be brought to the site of active transcription. As a consequence, nucleosomes in the vicinity of the site of active transcription will have the histone tails deacetylated, allowing the deactylated tail to bind to DNA, thereby leading to an inactive chromatin structure and transcriptional repression []. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis as a component of nuclear promyelocytic leukemia protein (PML) oncogenic domains (PODS). Daxx associates with PODs through a direct interaction with PML, a critical component of PODs. The interaction is a dynamic, cell cycle regulated event and is dependent on the post-translational modification of PML by the small ubiquitin-related modifier SUMO-1. ; PDB: 2KZS_A 2KZU_A.
Probab=84.04 E-value=0.32 Score=45.77 Aligned_cols=10 Identities=20% Similarity=-0.044 Sum_probs=0.0
Q ss_pred cchhHhHHhH
Q psy17334 91 STNIFLMNLS 100 (188)
Q Consensus 91 ~~~~fi~nLa 100 (188)
..|-|.+||.
T Consensus 344 l~~~~~~~lt 353 (713)
T PF03344_consen 344 LWYNFGCHLT 353 (713)
T ss_dssp ----------
T ss_pred cccccccccc
Confidence 3466667665
No 19
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=82.62 E-value=0.79 Score=38.97 Aligned_cols=54 Identities=22% Similarity=0.172 Sum_probs=43.7
Q ss_pred chhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhh
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLV 109 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~ 109 (188)
.+.++++.+++..++..+-.+.+++++.+.+.+|.+.|-+..-|++.|++...+
T Consensus 31 ~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~ 84 (313)
T PF10321_consen 31 KRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFI 84 (313)
T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHh
Confidence 445677777777777777788888888887777889999999999999997754
No 20
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=78.46 E-value=0.15 Score=48.57 Aligned_cols=12 Identities=17% Similarity=0.678 Sum_probs=5.8
Q ss_pred HHHHhhhhhccc
Q psy17334 101 IADLMVLLVCTP 112 (188)
Q Consensus 101 ~aDLl~~l~~~P 112 (188)
+.|++.+++.-|
T Consensus 590 lveiLLslls~~ 601 (784)
T PF04931_consen 590 LVEILLSLLSQP 601 (784)
T ss_pred HHHHHHHHHhCc
Confidence 345555554444
No 21
>KOG1832|consensus
Probab=73.11 E-value=0.62 Score=44.79 Aligned_cols=6 Identities=17% Similarity=0.052 Sum_probs=2.8
Q ss_pred cccccc
Q psy17334 131 ISNTQE 136 (188)
Q Consensus 131 ~Cki~~ 136 (188)
.||+..
T Consensus 1391 ~~r~yE 1396 (1516)
T KOG1832|consen 1391 SARMYE 1396 (1516)
T ss_pred hhhhhh
Confidence 455543
No 22
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=71.77 E-value=2.6 Score=33.66 Aligned_cols=47 Identities=15% Similarity=0.211 Sum_probs=3.0
Q ss_pred HHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhh
Q psy17334 63 LCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLV 109 (188)
Q Consensus 63 ~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~ 109 (188)
+..+-+.+++++-++++......|++|+..+....||+++.++..+.
T Consensus 6 i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~ 52 (242)
T PF00002_consen 6 ISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLS 52 (242)
T ss_dssp HHHHHHH----------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHH
Confidence 33444455556655555555556778887788889999987775543
No 23
>PF14851 FAM176: FAM176 family
Probab=67.20 E-value=1.1 Score=34.22 Aligned_cols=30 Identities=10% Similarity=0.214 Sum_probs=15.4
Q ss_pred hhcccceeeEeeecCcCCCcchhHhHHhHHH
Q psy17334 72 VLGNIMVPCVILKSKDMRNSTNIFLMNLSIA 102 (188)
Q Consensus 72 l~GN~lvi~~i~~~~~l~~~~~~fi~nLa~a 102 (188)
++.|.+..|+.++... -+.--||++..|+-
T Consensus 4 llSnsLaaya~I~~~P-E~~aLYFv~gVC~G 33 (153)
T PF14851_consen 4 LLSNSLAAYAHIRDNP-ERFALYFVSGVCAG 33 (153)
T ss_pred HHHHHHHHHHHHHhCh-HHHHHHHHHHHHHH
Confidence 4566666665544222 22345666665554
No 24
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=64.83 E-value=1.8 Score=28.97 Aligned_cols=45 Identities=24% Similarity=0.429 Sum_probs=30.6
Q ss_pred HHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhcccee
Q psy17334 70 IGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTV 114 (188)
Q Consensus 70 ~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~ 114 (188)
+|+++=..++-++..+.+.++-.|..++|.+++.++....-.|+-
T Consensus 6 ~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g 50 (78)
T PF09882_consen 6 IGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMG 50 (78)
T ss_pred HHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHH
Confidence 333333344444555566677789999999999998887766653
No 25
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=64.75 E-value=1.7 Score=31.92 Aligned_cols=9 Identities=11% Similarity=0.206 Sum_probs=3.3
Q ss_pred eeEeeecCc
Q psy17334 79 PCVILKSKD 87 (188)
Q Consensus 79 i~~i~~~~~ 87 (188)
+|+++|.|+
T Consensus 85 ~y~irR~~K 93 (122)
T PF01102_consen 85 SYCIRRLRK 93 (122)
T ss_dssp HHHHHHHS-
T ss_pred HHHHHHHhc
Confidence 344443333
No 26
>KOG4156|consensus
Probab=60.83 E-value=2.8 Score=39.09 Aligned_cols=6 Identities=33% Similarity=0.645 Sum_probs=2.7
Q ss_pred ccceee
Q psy17334 110 CTPTVL 115 (188)
Q Consensus 110 ~~P~~i 115 (188)
.+|..+
T Consensus 572 vVP~tl 577 (1329)
T KOG4156|consen 572 VVPVTL 577 (1329)
T ss_pred ccceec
Confidence 345444
No 27
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=59.75 E-value=16 Score=30.72 Aligned_cols=57 Identities=18% Similarity=0.159 Sum_probs=39.5
Q ss_pred hhHHHHHHHHHHHHhhcccceeeEeee-cCcCCCcchhHhHH---hHHHHHhhhhhcccee
Q psy17334 58 TPSMVLCSIILCIGVLGNIMVPCVILK-SKDMRNSTNIFLMN---LSIADLMVLLVCTPTV 114 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN~lvi~~i~~-~~~l~~~~~~fi~n---La~aDLl~~l~~~P~~ 114 (188)
.++...+-++.+++++-|.+.++++.. .+++.+..|+++.. ..++|+-.++..-|..
T Consensus 9 ~~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpip 69 (303)
T PF10327_consen 9 QWLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIP 69 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchh
Confidence 467778888899999999999877664 56667666655543 3356777776654443
No 28
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=58.49 E-value=14 Score=21.02 Aligned_cols=18 Identities=17% Similarity=0.370 Sum_probs=13.2
Q ss_pred hhHHHHHHHHHHHHhhcc
Q psy17334 58 TPSMVLCSIILCIGVLGN 75 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN 75 (188)
+..+++++.++++|++.|
T Consensus 9 y~vV~ffv~LFifGflsn 26 (36)
T PF02532_consen 9 YTVVIFFVSLFIFGFLSN 26 (36)
T ss_dssp HHHHHHHHHHHHHHHHTT
T ss_pred hhhHHHHHHHHhccccCC
Confidence 345666777888888887
No 29
>KOG1834|consensus
Probab=57.78 E-value=1.2 Score=41.41 Aligned_cols=7 Identities=14% Similarity=0.017 Sum_probs=2.6
Q ss_pred HHHHhhh
Q psy17334 101 IADLMVL 107 (188)
Q Consensus 101 ~aDLl~~ 107 (188)
+-|..+.
T Consensus 872 WDDSalt 878 (952)
T KOG1834|consen 872 WDDSALT 878 (952)
T ss_pred cccccce
Confidence 3344333
No 30
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=56.47 E-value=13 Score=29.61 Aligned_cols=14 Identities=29% Similarity=0.434 Sum_probs=7.9
Q ss_pred hHHhHHHHHhhhhh
Q psy17334 96 LMNLSIADLMVLLV 109 (188)
Q Consensus 96 i~nLa~aDLl~~l~ 109 (188)
+--||.++.+++++
T Consensus 130 l~VlAASn~Ii~~~ 143 (196)
T PF08229_consen 130 LRVLAASNTIIALV 143 (196)
T ss_pred HHHHHHhhHHHHHH
Confidence 33456666665554
No 31
>KOG1189|consensus
Probab=53.42 E-value=2.3 Score=40.16 Aligned_cols=6 Identities=17% Similarity=0.745 Sum_probs=3.3
Q ss_pred CCcccc
Q psy17334 122 PETWQM 127 (188)
Q Consensus 122 ~~~W~f 127 (188)
.+-|.|
T Consensus 865 ~GgW~f 870 (960)
T KOG1189|consen 865 DGGWSF 870 (960)
T ss_pred cCCeee
Confidence 355665
No 32
>PF03839 Sec62: Translocation protein Sec62; InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=53.06 E-value=5.2 Score=32.47 Aligned_cols=10 Identities=10% Similarity=0.285 Sum_probs=6.9
Q ss_pred CCcccccccc
Q psy17334 122 PETWQMGEHI 131 (188)
Q Consensus 122 ~~~W~fg~~~ 131 (188)
.+-|.|.+..
T Consensus 175 ~~fWlfPNLf 184 (224)
T PF03839_consen 175 HGFWLFPNLF 184 (224)
T ss_pred CCEEeCCccc
Confidence 5788887653
No 33
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=52.91 E-value=6.2 Score=32.82 Aligned_cols=45 Identities=16% Similarity=0.250 Sum_probs=31.0
Q ss_pred cCCCcchhHhHHhHHHHHhhhhh-ccceeeEEeccCCCcccccccccccc
Q psy17334 87 DMRNSTNIFLMNLSIADLMVLLV-CTPTVLVEVNSKPETWQMGEHISNTQ 135 (188)
Q Consensus 87 ~l~~~~~~fi~nLa~aDLl~~l~-~~P~~i~~~~~~~~~W~fg~~~Cki~ 135 (188)
+..+|++..+.|||++++++.+. ++|.... + .+.+ .|++..||+.
T Consensus 6 ~r~kp~dlIl~hLa~aN~lvLl~rGip~~~~-~-~~~~--~~~d~gCK~v 51 (265)
T PF03402_consen 6 HRLKPIDLILIHLALANILVLLSRGIPQTMA-F-FGWK--FFDDIGCKIV 51 (265)
T ss_pred CCCCcHHHHHHHHHHHHHHHHHHhhHHHHHH-H-hhcc--cCCCceeeee
Confidence 34568999999999999998875 4564332 1 1212 3688999974
No 34
>CHL00024 psbI photosystem II protein I
Probab=52.55 E-value=11 Score=21.47 Aligned_cols=18 Identities=17% Similarity=0.370 Sum_probs=13.4
Q ss_pred hhHHHHHHHHHHHHhhcc
Q psy17334 58 TPSMVLCSIILCIGVLGN 75 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN 75 (188)
+..+++++.++++|++.|
T Consensus 9 y~vV~ffvsLFifGFlsn 26 (36)
T CHL00024 9 YTVVIFFVSLFIFGFLSN 26 (36)
T ss_pred hhHHHHHHHHHHccccCC
Confidence 356667778888888877
No 35
>PRK02655 psbI photosystem II reaction center I protein I; Provisional
Probab=49.94 E-value=12 Score=21.43 Aligned_cols=18 Identities=11% Similarity=0.181 Sum_probs=13.2
Q ss_pred hhHHHHHHHHHHHHhhcc
Q psy17334 58 TPSMVLCSIILCIGVLGN 75 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN 75 (188)
+..+++++.++++|++.|
T Consensus 9 y~vV~ffvsLFiFGflsn 26 (38)
T PRK02655 9 YIVVFFFVGLFVFGFLSS 26 (38)
T ss_pred hhhHHHHHHHHHcccCCC
Confidence 346667777888888877
No 36
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=47.03 E-value=5.5 Score=24.52 Aligned_cols=6 Identities=33% Similarity=0.982 Sum_probs=2.2
Q ss_pred HHHHhh
Q psy17334 68 LCIGVL 73 (188)
Q Consensus 68 ~l~gl~ 73 (188)
+++|++
T Consensus 11 iv~~lL 16 (50)
T PF12606_consen 11 IVMGLL 16 (50)
T ss_pred HHHHHH
Confidence 333333
No 37
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=42.61 E-value=13 Score=23.37 Aligned_cols=8 Identities=38% Similarity=0.725 Sum_probs=4.4
Q ss_pred hhccccee
Q psy17334 72 VLGNIMVP 79 (188)
Q Consensus 72 l~GN~lvi 79 (188)
++||+.++
T Consensus 16 IigNia~L 23 (55)
T PF11446_consen 16 IIGNIAAL 23 (55)
T ss_pred HHhHHHHH
Confidence 45665544
No 38
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=42.29 E-value=97 Score=22.54 Aligned_cols=16 Identities=13% Similarity=0.102 Sum_probs=7.1
Q ss_pred chhhHHHHHHHHHHHH
Q psy17334 56 IRTPSMVLCSIILCIG 71 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~g 71 (188)
...+..++..+..+++
T Consensus 30 inliiG~vT~l~VLvt 45 (118)
T PF10856_consen 30 INLIIGAVTSLFVLVT 45 (118)
T ss_pred EEeehHHHHHHHHHHH
Confidence 4444444444444443
No 39
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=42.20 E-value=4.4 Score=30.52 Aligned_cols=34 Identities=12% Similarity=0.253 Sum_probs=24.1
Q ss_pred hhHHHHHHHHHHHHhhcccceeeEeeecCcCCCc
Q psy17334 58 TPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNS 91 (188)
Q Consensus 58 ~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~ 91 (188)
.+..++|.++|++|+++-+++++..+.+++.+++
T Consensus 62 AIaGIVfgiVfimgvva~i~icvCmc~kn~rgsR 95 (155)
T PF10873_consen 62 AIAGIVFGIVFIMGVVAGIAICVCMCMKNSRGSR 95 (155)
T ss_pred eeeeeehhhHHHHHHHHHHHHHHhhhhhcCCCcc
Confidence 3566788899999999887777666655554443
No 40
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=41.95 E-value=7.1 Score=33.76 Aligned_cols=29 Identities=17% Similarity=0.304 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHhhcccceeeEeeecCcCC
Q psy17334 61 MVLCSIILCIGVLGNIMVPCVILKSKDMR 89 (188)
Q Consensus 61 ~~~~~~i~l~gl~GN~lvi~~i~~~~~l~ 89 (188)
+++++++.++.++--.++||+|+|+||.+
T Consensus 311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 311 PIIASIIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 34445555555555567888888887754
No 41
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=41.89 E-value=5.8 Score=33.58 Aligned_cols=27 Identities=19% Similarity=0.322 Sum_probs=15.5
Q ss_pred HHHHHHHHHHhhcccceeeEeeecCcC
Q psy17334 62 VLCSIILCIGVLGNIMVPCVILKSKDM 88 (188)
Q Consensus 62 ~~~~~i~l~gl~GN~lvi~~i~~~~~l 88 (188)
++.+++.++.++--+++||+|+|+||.
T Consensus 258 I~aSiiaIliIVLIMvIIYLILRYRRK 284 (299)
T PF02009_consen 258 IIASIIAILIIVLIMVIIYLILRYRRK 284 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444445777888877763
No 42
>PTZ00046 rifin; Provisional
Probab=39.43 E-value=8.3 Score=33.44 Aligned_cols=28 Identities=14% Similarity=0.350 Sum_probs=17.8
Q ss_pred HHHHHHHHHHhhcccceeeEeeecCcCC
Q psy17334 62 VLCSIILCIGVLGNIMVPCVILKSKDMR 89 (188)
Q Consensus 62 ~~~~~i~l~gl~GN~lvi~~i~~~~~l~ 89 (188)
++++++.++.++--.++||+|+|+||.+
T Consensus 317 IiaSiiAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 317 IIASIVAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 4445555555555567788888887754
No 43
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=38.99 E-value=12 Score=30.59 Aligned_cols=17 Identities=24% Similarity=0.448 Sum_probs=10.0
Q ss_pred eEEeccCCCcccccccc
Q psy17334 115 LVEVNSKPETWQMGEHI 131 (188)
Q Consensus 115 i~~~~~~~~~W~fg~~~ 131 (188)
++++..+.+-|.|.+..
T Consensus 174 i~~~~~g~~fWlfPNLf 190 (232)
T TIGR00869 174 LTLIVVKPGIWIFPNLF 190 (232)
T ss_pred HHHHHhCCCeeeecchh
Confidence 33334456888887653
No 44
>PF11861 DUF3381: Domain of unknown function (DUF3381); InterPro: IPR024576 This uncharacterised domain is found in eukaryotic rRNA methyltransferases like yeast Spb1 and mammalian homologue Ftsj3. Spb1 is required for proper assembly of pre-ribosomal particles during the biogenesis of the 60S ribosomal subunit [, ].
Probab=37.73 E-value=30 Score=26.42 Aligned_cols=7 Identities=57% Similarity=0.629 Sum_probs=3.4
Q ss_pred HHHHhhh
Q psy17334 101 IADLMVL 107 (188)
Q Consensus 101 ~aDLl~~ 107 (188)
|.||-++
T Consensus 69 c~DLKVL 75 (159)
T PF11861_consen 69 CKDLKVL 75 (159)
T ss_pred HHHHHhc
Confidence 4455543
No 45
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=36.16 E-value=12 Score=30.19 Aligned_cols=9 Identities=22% Similarity=0.434 Sum_probs=4.5
Q ss_pred hHhHHhHHH
Q psy17334 94 IFLMNLSIA 102 (188)
Q Consensus 94 ~fi~nLa~a 102 (188)
-.|+|++|+
T Consensus 13 N~iLNiaI~ 21 (217)
T PF07423_consen 13 NKILNIAIG 21 (217)
T ss_pred hhhHHHHHH
Confidence 345555544
No 46
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=35.60 E-value=21 Score=26.33 Aligned_cols=13 Identities=23% Similarity=0.468 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhc
Q psy17334 62 VLCSIILCIGVLG 74 (188)
Q Consensus 62 ~~~~~i~l~gl~G 74 (188)
....++++++++.
T Consensus 84 sal~v~lVl~lls 96 (129)
T PF12191_consen 84 SALSVVLVLALLS 96 (129)
T ss_dssp -------------
T ss_pred hHHHHHHHHHHHH
Confidence 3344444455444
No 47
>PF15039 DUF4530: Domain of unknown function (DUF4530)
Probab=35.25 E-value=13 Score=26.08 Aligned_cols=6 Identities=50% Similarity=0.551 Sum_probs=2.9
Q ss_pred HHhhcc
Q psy17334 70 IGVLGN 75 (188)
Q Consensus 70 ~gl~GN 75 (188)
.-=+||
T Consensus 55 weelgn 60 (111)
T PF15039_consen 55 WEELGN 60 (111)
T ss_pred HHHhcc
Confidence 334555
No 48
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=33.89 E-value=1.6e+02 Score=20.47 Aligned_cols=30 Identities=20% Similarity=0.208 Sum_probs=14.0
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCcCC
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKDMR 89 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~ 89 (188)
++++++++++++.++- ++..++|+|.|+.+
T Consensus 64 ili~lls~v~IlVily-~IyYFVILRer~~~ 93 (101)
T PF06024_consen 64 ILISLLSFVCILVILY-AIYYFVILRERQKS 93 (101)
T ss_pred hHHHHHHHHHHHHHHh-hheEEEEEeccccc
Confidence 3444444444443332 24456666666543
No 49
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=33.23 E-value=96 Score=26.46 Aligned_cols=56 Identities=16% Similarity=0.228 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHHhhcccceeeEeeecCcCCCcc-hhHhHHhHHHHHhhhhh--ccceee
Q psy17334 60 SMVLCSIILCIGVLGNIMVPCVILKSKDMRNST-NIFLMNLSIADLMVLLV--CTPTVL 115 (188)
Q Consensus 60 ~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~-~~fi~nLa~aDLl~~l~--~~P~~i 115 (188)
.-.+.-+.++++++-|-+.++++...|+.+--. .++++..|+-|++.+++ .+|.-+
T Consensus 7 h~~~Pk~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~v 65 (310)
T PF10319_consen 7 HHYIPKIFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICV 65 (310)
T ss_pred HHHHHHHHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence 334455667788899999999998888766533 46667788888887764 355443
No 50
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=33.02 E-value=14 Score=28.34 Aligned_cols=21 Identities=29% Similarity=0.594 Sum_probs=0.0
Q ss_pred hhhHHHHHHHHHHHHhhcccc
Q psy17334 57 RTPSMVLCSIILCIGVLGNIM 77 (188)
Q Consensus 57 ~~~~~~~~~~i~l~gl~GN~l 77 (188)
-.+..++..++..+|++|-++
T Consensus 129 ~tLVGIIVGVLlaIG~igGII 149 (162)
T PF05808_consen 129 VTLVGIIVGVLLAIGFIGGII 149 (162)
T ss_dssp ---------------------
T ss_pred eeeeeehhhHHHHHHHHhhee
Confidence 346778888889999999644
No 51
>COG5137 Histone chaperone involved in gene silencing [Transcription / Chromatin structure and dynamics]
Probab=32.23 E-value=15 Score=29.70 Aligned_cols=18 Identities=17% Similarity=0.338 Sum_probs=8.9
Q ss_pred CCCCCCCcccccccCCCC
Q psy17334 12 NLTSPDNWTLAIGAAENA 29 (188)
Q Consensus 12 ~~~~~~~~~~~~~~~~~~ 29 (188)
.+-...-|-....+...+
T Consensus 33 ~lK~dlEwkltYvgs~~S 50 (279)
T COG5137 33 ELKCDLEWKLTYVGSVHS 50 (279)
T ss_pred hhcCCceEEEEEeecccc
Confidence 444455566655444433
No 52
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=31.22 E-value=70 Score=19.26 Aligned_cols=33 Identities=12% Similarity=0.177 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCcCCCc
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKDMRNS 91 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~ 91 (188)
.++++++..+++|+.|-.+-...=-..+.||.|
T Consensus 8 ~~~~i~i~~lL~~~TgyaiYtaFGppSk~LrDP 40 (46)
T PRK13183 8 LSLAITILAILLALTGFGIYTAFGPPSKELDDP 40 (46)
T ss_pred HHHHHHHHHHHHHHhhheeeeccCCcccccCCc
Confidence 455566677778877754322222233445544
No 53
>CHL00020 psbN photosystem II protein N
Probab=30.91 E-value=67 Score=19.09 Aligned_cols=33 Identities=15% Similarity=0.152 Sum_probs=16.7
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCcCCCc
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKDMRNS 91 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~ 91 (188)
.++++++..+++|+.|-.+-...=-..+.||.|
T Consensus 5 ~~~~i~i~~ll~~~Tgy~iYtaFGppSk~LrDP 37 (43)
T CHL00020 5 TLVAIFISGLLVSFTGYALYTAFGQPSKQLRDP 37 (43)
T ss_pred hhHHHHHHHHHHHhhheeeeeccCCchhccCCc
Confidence 345566666777777743322222233445544
No 54
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=30.86 E-value=20 Score=25.59 Aligned_cols=29 Identities=7% Similarity=0.161 Sum_probs=18.2
Q ss_pred hHHHHHHHHHHHHhhcccceeeEeeecCc
Q psy17334 59 PSMVLCSIILCIGVLGNIMVPCVILKSKD 87 (188)
Q Consensus 59 ~~~~~~~~i~l~gl~GN~lvi~~i~~~~~ 87 (188)
++.+++.+++++.++.|++......+.++
T Consensus 2 ~Ll~il~llLll~l~asl~~wr~~~rq~k 30 (107)
T PF15330_consen 2 LLLGILALLLLLSLAASLLAWRMKQRQKK 30 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 45666677777778888766554444443
No 55
>PF04712 Radial_spoke: Radial spokehead-like protein
Probab=30.69 E-value=16 Score=33.04 Aligned_cols=11 Identities=0% Similarity=-0.151 Sum_probs=6.6
Q ss_pred cccccccccch
Q psy17334 127 MGEHISNTQEE 137 (188)
Q Consensus 127 fg~~~Cki~~~ 137 (188)
.....|.....
T Consensus 344 l~qGR~~~~~~ 354 (491)
T PF04712_consen 344 LPQGRCTWYNP 354 (491)
T ss_pred ccCceEeCCCC
Confidence 44556777744
No 56
>PF02724 CDC45: CDC45-like protein; InterPro: IPR003874 CDC45 is an essential gene required for initiation of DNA replication in Saccharomyces cerevisiae (cell division control protein 45), forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human [], mouse and the smut fungus, Melampsora spp., (tsd2 protein) among others.; GO: 0006270 DNA-dependent DNA replication initiation
Probab=30.01 E-value=3.8 Score=38.17 Aligned_cols=37 Identities=22% Similarity=0.383 Sum_probs=14.4
Q ss_pred hhHhHHhH-HHHHhhhhhccceeeEEeccCCCcccccc
Q psy17334 93 NIFLMNLS-IADLMVLLVCTPTVLVEVNSKPETWQMGE 129 (188)
Q Consensus 93 ~~fi~nLa-~aDLl~~l~~~P~~i~~~~~~~~~W~fg~ 129 (188)
+++++|.. ..||.-.+-.-+-..++++-..+.|.+.+
T Consensus 54 ~vilIncGa~~dl~~~l~~~~~~~iyViDshRP~~L~N 91 (622)
T PF02724_consen 54 SVILINCGATVDLEEFLELDEDVTIYVIDSHRPWNLDN 91 (622)
T ss_pred eEEEEecCchhhHHHHhCCCCceEEEEEeCCCCccHhh
Confidence 34455543 33444322211233333333445565433
No 57
>PF02724 CDC45: CDC45-like protein; InterPro: IPR003874 CDC45 is an essential gene required for initiation of DNA replication in Saccharomyces cerevisiae (cell division control protein 45), forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human [], mouse and the smut fungus, Melampsora spp., (tsd2 protein) among others.; GO: 0006270 DNA-dependent DNA replication initiation
Probab=28.85 E-value=3.8 Score=38.14 Aligned_cols=10 Identities=20% Similarity=0.152 Sum_probs=4.3
Q ss_pred ceeeEeeecC
Q psy17334 77 MVPCVILKSK 86 (188)
Q Consensus 77 lvi~~i~~~~ 86 (188)
+.++++=.+|
T Consensus 76 ~~iyViDshR 85 (622)
T PF02724_consen 76 VTIYVIDSHR 85 (622)
T ss_pred eEEEEEeCCC
Confidence 3445444433
No 58
>COG5137 Histone chaperone involved in gene silencing [Transcription / Chromatin structure and dynamics]
Probab=28.12 E-value=24 Score=28.61 Aligned_cols=10 Identities=20% Similarity=0.042 Sum_probs=4.4
Q ss_pred ceeeEeeecC
Q psy17334 77 MVPCVILKSK 86 (188)
Q Consensus 77 lvi~~i~~~~ 86 (188)
++|++.++++
T Consensus 93 TvILlsc~Y~ 102 (279)
T COG5137 93 TVILLSCRYK 102 (279)
T ss_pred eEEEEEEeec
Confidence 3444444444
No 59
>PHA03055 Hypothetical protein; Provisional
Probab=27.95 E-value=1.2e+02 Score=20.49 Aligned_cols=51 Identities=14% Similarity=0.230 Sum_probs=30.7
Q ss_pred HHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeE
Q psy17334 66 IILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLV 116 (188)
Q Consensus 66 ~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~ 116 (188)
++.+..+.|..+++-.+..+|.+-..+--|---+.+.++...++.+|..++
T Consensus 16 L~llMiisG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~ 66 (79)
T PHA03055 16 LMLLMVISGTAMIVKELNPNDIFTMQSLKFNRAVTIFKYIGLFIYIPGTII 66 (79)
T ss_pred HHHHHHHhccHHHhhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHH
Confidence 333444556666655444443333333456666788888888888887654
No 60
>CHL00204 ycf1 Ycf1; Provisional
Probab=26.96 E-value=67 Score=33.68 Aligned_cols=12 Identities=25% Similarity=0.329 Sum_probs=5.3
Q ss_pred eeeEeeecCcCC
Q psy17334 78 VPCVILKSKDMR 89 (188)
Q Consensus 78 vi~~i~~~~~l~ 89 (188)
|++.|..+...|
T Consensus 184 vlvwI~qn~sIr 195 (1832)
T CHL00204 184 VLVWIQQNNSIR 195 (1832)
T ss_pred heeeeecCCccc
Confidence 444444444443
No 61
>PF14654 Epiglycanin_C: Mucin, catalytic, TM and cytoplasmic tail region
Probab=26.72 E-value=90 Score=21.99 Aligned_cols=22 Identities=14% Similarity=0.434 Sum_probs=14.5
Q ss_pred chhhHHHHHHHHHHHHhhcccc
Q psy17334 56 IRTPSMVLCSIILCIGVLGNIM 77 (188)
Q Consensus 56 ~~~~~~~~~~~i~l~gl~GN~l 77 (188)
..++++.+..++..+|++..++
T Consensus 18 WeIfLItLasVvvavGl~aGLf 39 (106)
T PF14654_consen 18 WEIFLITLASVVVAVGLFAGLF 39 (106)
T ss_pred hHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666777777777776543
No 62
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=25.82 E-value=61 Score=27.21 Aligned_cols=12 Identities=8% Similarity=0.310 Sum_probs=7.1
Q ss_pred ceeeEeeecCcC
Q psy17334 77 MVPCVILKSKDM 88 (188)
Q Consensus 77 lvi~~i~~~~~l 88 (188)
++.|+|.|+|+.
T Consensus 289 LiaYli~Rrr~~ 300 (306)
T PF01299_consen 289 LIAYLIGRRRSR 300 (306)
T ss_pred HHhheeEecccc
Confidence 555666665553
No 63
>KOG4564|consensus
Probab=25.66 E-value=36 Score=30.74 Aligned_cols=50 Identities=12% Similarity=0.186 Sum_probs=33.2
Q ss_pred HHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhcc
Q psy17334 62 VLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCT 111 (188)
Q Consensus 62 ~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~ 111 (188)
+++.+=+.++++.=++.+.++..-|++|-..|++=.||-++-++.++..+
T Consensus 149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~ 198 (473)
T KOG4564|consen 149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVL 198 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHH
Confidence 34444455555554444556667789999899999999888776665543
No 64
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=25.10 E-value=76 Score=18.86 Aligned_cols=16 Identities=13% Similarity=0.256 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHhhcc
Q psy17334 60 SMVLCSIILCIGVLGN 75 (188)
Q Consensus 60 ~~~~~~~i~l~gl~GN 75 (188)
.+++++..+++|+.|-
T Consensus 6 ~~~i~i~~~lv~~Tgy 21 (43)
T PF02468_consen 6 VLAIFISCLLVSITGY 21 (43)
T ss_pred eHHHHHHHHHHHHHhh
Confidence 4556666677777773
No 65
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=24.72 E-value=1.5e+02 Score=22.82 Aligned_cols=8 Identities=13% Similarity=0.110 Sum_probs=2.9
Q ss_pred HHHHHHHH
Q psy17334 62 VLCSIILC 69 (188)
Q Consensus 62 ~~~~~i~l 69 (188)
.+|+++.+
T Consensus 97 ~~~Vl~g~ 104 (163)
T PF06679_consen 97 ALYVLVGL 104 (163)
T ss_pred hHHHHHHH
Confidence 33333333
No 66
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=23.29 E-value=74 Score=23.62 Aligned_cols=28 Identities=7% Similarity=0.044 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHhhcccceeeEeeecCc
Q psy17334 60 SMVLCSIILCIGVLGNIMVPCVILKSKD 87 (188)
Q Consensus 60 ~~~~~~~i~l~gl~GN~lvi~~i~~~~~ 87 (188)
+.+++++++++.+.|-+..++...|+++
T Consensus 103 ~~~il~il~~i~is~~~~~~yr~~r~~~ 130 (139)
T PHA03099 103 SPGIVLVLVGIIITCCLLSVYRFTRRTK 130 (139)
T ss_pred hhHHHHHHHHHHHHHHHHhhheeeeccc
Confidence 3344444455555554444444444443
No 67
>PRK14762 membrane protein; Provisional
Probab=22.85 E-value=94 Score=16.22 Aligned_cols=7 Identities=57% Similarity=0.809 Sum_probs=3.0
Q ss_pred HHHHHHH
Q psy17334 65 SIILCIG 71 (188)
Q Consensus 65 ~~i~l~g 71 (188)
+++|++|
T Consensus 9 ~iiflig 15 (27)
T PRK14762 9 LIIFLIG 15 (27)
T ss_pred HHHHHHH
Confidence 3444444
No 68
>KOG1172|consensus
Probab=22.74 E-value=1.3e+02 Score=29.24 Aligned_cols=43 Identities=21% Similarity=0.228 Sum_probs=33.7
Q ss_pred cccceeeEeeecCcCCCcc--hhHhHHhHHHHHhhhhhccceeeE
Q psy17334 74 GNIMVPCVILKSKDMRNST--NIFLMNLSIADLMVLLVCTPTVLV 116 (188)
Q Consensus 74 GN~lvi~~i~~~~~l~~~~--~~fi~nLa~aDLl~~l~~~P~~i~ 116 (188)
-|+...++..+.++|+++. ++=++-|++..++++++++|++..
T Consensus 648 qqIts~iv~rke~kLKKgsgyH~DLlllgil~~icsllGLPw~~~ 692 (876)
T KOG1172|consen 648 QQITAVIVNRKENKLKKGSGYHLDLLLLGILTLICSLLGLPWSNA 692 (876)
T ss_pred chHHHHHhhcccccCCCCcchhHHHHHHHHHHHHHHhcCCCcccc
Confidence 4666677777888888876 566777888899999999998764
No 69
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=21.84 E-value=71 Score=25.40 Aligned_cols=12 Identities=25% Similarity=0.661 Sum_probs=6.1
Q ss_pred HHHHHHHHhhcc
Q psy17334 64 CSIILCIGVLGN 75 (188)
Q Consensus 64 ~~~i~l~gl~GN 75 (188)
+-++..+|++|.
T Consensus 65 Lh~v~~lglig~ 76 (196)
T PF08229_consen 65 LHIVIGLGLIGL 76 (196)
T ss_pred HHHHHHHHHHHH
Confidence 334455556664
No 70
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=21.68 E-value=46 Score=27.19 Aligned_cols=37 Identities=22% Similarity=0.336 Sum_probs=16.9
Q ss_pred hcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhh
Q psy17334 73 LGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLV 109 (188)
Q Consensus 73 ~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~ 109 (188)
+.|++.++.+....+.-...++.++-|.+.|++..+.
T Consensus 85 ~~~~~~L~Ff~vpS~~~r~l~~vl~~Lllvdlilil~ 121 (232)
T PF10329_consen 85 ITNLFNLWFFGVPSKLERILNIVLAGLLLVDLILILA 121 (232)
T ss_pred HHHHHHHHheecCcHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3343333333333333333455555555566665544
No 71
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=20.89 E-value=50 Score=27.53 Aligned_cols=53 Identities=15% Similarity=0.046 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccc
Q psy17334 60 SMVLCSIILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTP 112 (188)
Q Consensus 60 ~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P 112 (188)
..++-++...+..+.|..+++.+.+.|+.+++-..++.-..+.|.+.+....+
T Consensus 8 v~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~ 60 (273)
T PF10316_consen 8 VSIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFI 60 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHH
Confidence 33444555667778898888887766654444444444445677777765443
No 72
>PHA03062 putative IMV membrane protein; Provisional
Probab=20.83 E-value=1.6e+02 Score=19.79 Aligned_cols=51 Identities=14% Similarity=0.216 Sum_probs=30.3
Q ss_pred HHHHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeE
Q psy17334 66 IILCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLV 116 (188)
Q Consensus 66 ~i~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~ 116 (188)
++.+..+.|..+++-.+..+|.+-..+--|---+.+.++...++.+|..++
T Consensus 17 L~llMivsG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~ 67 (78)
T PHA03062 17 LALLMIVSGGALLFKSLAPHRVVMMRSATFNKVVSFLEYVAILIFVPGTIA 67 (78)
T ss_pred HHHHHHHhChHHHHhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHH
Confidence 334444556666655444443333333456666788888888888886654
No 73
>PF04713 Pox_I5: Poxvirus protein I5; InterPro: IPR006803 This entry represents the Poxvirus protein I5.
Probab=20.69 E-value=1.8e+02 Score=19.52 Aligned_cols=49 Identities=14% Similarity=0.275 Sum_probs=28.9
Q ss_pred HHHHhhcccceeeEeeecCcCCCcchhHhHHhHHHHHhhhhhccceeeE
Q psy17334 68 LCIGVLGNIMVPCVILKSKDMRNSTNIFLMNLSIADLMVLLVCTPTVLV 116 (188)
Q Consensus 68 ~l~gl~GN~lvi~~i~~~~~l~~~~~~fi~nLa~aDLl~~l~~~P~~i~ 116 (188)
.+..+.|-.+++-.+..+|.+-...-.+---|.+.+.+..++.+|..++
T Consensus 18 llMvisG~ali~k~~ap~~~v~~rS~~~~rVl~~le~vailifIPGTIi 66 (77)
T PF04713_consen 18 LLMVISGGALIFKRIAPHRSVMMRSATFNRVLTFLEYVAILIFIPGTII 66 (77)
T ss_pred HHHHHhChHHHhhccCCcceehhhhhhhHHHHHHHHHHHHHhhcccHHH
Confidence 3344556666554444444433333445555778888888888886554
No 74
>PHA03283 envelope glycoprotein E; Provisional
Probab=20.30 E-value=1.1e+02 Score=28.11 Aligned_cols=36 Identities=17% Similarity=0.038 Sum_probs=17.1
Q ss_pred hhhHHHHHHHHHHHHhhcccceeeEeeecCcCCCcc
Q psy17334 57 RTPSMVLCSIILCIGVLGNIMVPCVILKSKDMRNST 92 (188)
Q Consensus 57 ~~~~~~~~~~i~l~gl~GN~lvi~~i~~~~~l~~~~ 92 (188)
+..+.++.++++++|++.=++++|+..++++.+..+
T Consensus 397 ~~~l~~~~~~~~~~~~~~~~l~vw~c~~~r~~~~~~ 432 (542)
T PHA03283 397 RHYLAFLLAIICTCAALLVALVVWGCILYRRSNRKP 432 (542)
T ss_pred cccchhHHHHHHHHHHHHHHHhhhheeeehhhcCCc
Confidence 334444444444444443335555555555544433
No 75
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=20.24 E-value=13 Score=29.97 Aligned_cols=15 Identities=20% Similarity=0.149 Sum_probs=9.0
Q ss_pred CcCCCcchhHhHHhH
Q psy17334 86 KDMRNSTNIFLMNLS 100 (188)
Q Consensus 86 ~~l~~~~~~fi~nLa 100 (188)
|+.....|++|.-.+
T Consensus 10 RK~N~iLNiaI~IV~ 24 (217)
T PF07423_consen 10 RKTNKILNIAIGIVS 24 (217)
T ss_pred hhhhhhHHHHHHHHH
Confidence 444456677777544
No 76
>PF15050 SCIMP: SCIMP protein
Probab=20.17 E-value=21 Score=26.07 Aligned_cols=8 Identities=0% Similarity=0.310 Sum_probs=4.2
Q ss_pred ceeeEeee
Q psy17334 77 MVPCVILK 84 (188)
Q Consensus 77 lvi~~i~~ 84 (188)
+++++++|
T Consensus 25 lIlyCvcR 32 (133)
T PF15050_consen 25 LILYCVCR 32 (133)
T ss_pred HHHHHHHH
Confidence 55555544
No 77
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=20.10 E-value=46 Score=21.75 Aligned_cols=18 Identities=22% Similarity=0.713 Sum_probs=8.8
Q ss_pred HHHHhhcccceeeEeeecCc
Q psy17334 68 LCIGVLGNIMVPCVILKSKD 87 (188)
Q Consensus 68 ~l~gl~GN~lvi~~i~~~~~ 87 (188)
.+++++| ++++.+..+++
T Consensus 10 ICVaii~--lIlY~iYnr~~ 27 (68)
T PF05961_consen 10 ICVAIIG--LILYGIYNRKK 27 (68)
T ss_pred HHHHHHH--HHHHHHHhccc
Confidence 3334444 55666655443
Done!