Query psy14918
Match_columns 463
No_of_seqs 267 out of 472
Neff 6.7
Searched_HMMs 46136
Date Sat Aug 17 00:16:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14918.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14918hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01534 Frizzled: Frizzled/Sm 100.0 2.6E-83 5.7E-88 646.4 32.1 313 46-363 1-328 (328)
2 KOG3577|consensus 100.0 2.4E-60 5.3E-65 509.9 11.5 325 33-363 206-543 (556)
3 PF05462 Dicty_CAR: Slime mold 99.9 4.1E-22 8.9E-27 200.1 26.1 258 49-350 2-260 (303)
4 KOG4193|consensus 99.8 5.2E-17 1.1E-21 176.1 23.7 178 59-254 325-505 (610)
5 PF00002 7tm_2: 7 transmembran 99.7 1.3E-17 2.8E-22 161.8 1.9 180 55-250 2-182 (242)
6 KOG4564|consensus 99.3 1.1E-10 2.3E-15 122.8 17.4 180 63-248 154-337 (473)
7 PF11710 Git3: G protein-coupl 99.1 1.8E-09 3.9E-14 102.9 13.6 177 58-245 5-194 (201)
8 KOG4289|consensus 98.9 1.9E-09 4.1E-14 121.3 8.0 157 56-243 2165-2321(2531)
9 PF02101 Ocular_alb: Ocular al 98.9 1.1E-07 2.4E-12 96.7 19.0 148 89-247 68-222 (405)
10 PHA03087 G protein-coupled che 98.3 0.00036 7.8E-09 70.8 24.4 163 41-214 28-193 (335)
11 PHA03235 DNA packaging protein 98.3 0.0012 2.7E-08 69.6 28.8 132 55-196 34-168 (409)
12 PF00001 7tm_1: 7 transmembran 98.1 0.00017 3.7E-09 68.0 17.5 137 70-216 1-143 (257)
13 PHA03234 DNA packaging protein 98.0 0.0012 2.6E-08 67.9 21.5 149 54-212 33-184 (338)
14 PHA02834 chemokine receptor-li 98.0 0.0019 4E-08 65.7 22.2 159 42-211 17-176 (323)
15 PHA02638 CC chemokine receptor 97.9 0.0071 1.5E-07 64.0 25.8 148 36-198 85-234 (417)
16 PF05296 TAS2R: Mammalian tast 96.6 0.41 8.9E-06 48.5 21.3 220 59-289 12-251 (303)
17 PF02076 STE3: Pheromone A rec 96.2 0.99 2.2E-05 45.4 20.9 112 179-294 107-220 (283)
18 KOG2087|consensus 94.4 0.078 1.7E-06 54.4 6.2 191 56-255 27-222 (363)
19 PF10320 7TM_GPCR_Srsx: Serpen 94.3 0.55 1.2E-05 46.3 11.8 134 67-213 4-139 (257)
20 PF10278 Med19: Mediator of RN 93.5 0.03 6.6E-07 52.0 1.0 8 394-401 92-99 (178)
21 KOG4220|consensus 93.5 0.086 1.9E-06 55.1 4.4 143 59-215 36-189 (503)
22 KOG4219|consensus 92.6 4.6 0.0001 42.4 15.4 128 58-200 37-174 (423)
23 PF10321 7TM_GPCR_Srt: Serpent 92.5 14 0.0003 37.8 21.6 39 320-361 267-305 (313)
24 KOG4043|consensus 90.5 0.18 4E-06 45.9 2.3 7 394-400 121-127 (214)
25 PF06454 DUF1084: Protein of u 85.4 43 0.00094 33.6 19.0 62 223-289 166-227 (281)
26 KOG2573|consensus 83.5 0.45 9.7E-06 49.4 0.8 6 198-203 177-182 (498)
27 KOG2138|consensus 82.8 0.43 9.4E-06 52.5 0.4 10 19-28 312-321 (883)
28 PF07771 TSGP1: Tick salivary 81.6 0.35 7.5E-06 42.5 -0.7 16 340-356 39-54 (120)
29 PF02117 7TM_GPCR_Sra: Serpent 80.7 72 0.0016 32.7 21.2 101 55-163 22-130 (328)
30 PF10292 7TM_GPCR_Srab: Serpen 79.0 78 0.0017 32.1 23.7 169 54-237 17-196 (324)
31 KOG4043|consensus 74.4 2 4.3E-05 39.4 1.9 6 379-384 97-102 (214)
32 PF10323 7TM_GPCR_Srv: Serpent 72.9 92 0.002 31.0 13.7 72 136-207 77-151 (283)
33 KOG0526|consensus 71.7 1.4 3.1E-05 47.3 0.4 13 47-59 76-88 (615)
34 PF02118 Srg: Srg family chemo 70.2 1.1E+02 0.0024 29.7 17.4 63 134-196 81-144 (275)
35 PF11970 Git3_C: G protein-cou 66.7 24 0.00052 28.4 6.4 68 264-348 4-72 (76)
36 TIGR00869 sec62 protein transl 65.2 3.3 7.2E-05 40.4 1.4 6 340-345 181-186 (232)
37 PF08496 Peptidase_S49_N: Pept 63.4 5.2 0.00011 36.7 2.3 12 322-333 6-17 (155)
38 PF03839 Sec62: Translocation 61.9 2.8 6.1E-05 40.7 0.2 6 340-345 175-180 (224)
39 KOG2270|consensus 60.8 7.2 0.00016 41.1 2.9 15 380-394 416-430 (520)
40 PF01284 MARVEL: Membrane-asso 59.1 1.2E+02 0.0025 26.2 11.0 23 58-80 43-65 (144)
41 PF10324 7TM_GPCR_Srw: Serpent 57.7 2.1E+02 0.0045 28.5 17.0 56 63-121 6-68 (318)
42 PRK11281 hypothetical protein; 55.9 4.5E+02 0.0097 31.9 17.8 28 164-191 608-636 (1113)
43 KOG1059|consensus 55.6 4.5 9.8E-05 45.3 0.5 11 153-163 407-417 (877)
44 PRK11192 ATP-dependent RNA hel 55.4 3.1 6.7E-05 43.9 -0.8 11 349-359 325-335 (434)
45 PRK14552 C/D box methylation g 54.9 2.3 5.1E-05 45.1 -1.8 12 375-386 316-327 (414)
46 PF10361 DUF2434: Protein of u 53.6 98 0.0021 31.3 9.4 79 80-168 109-196 (296)
47 KOG2573|consensus 52.9 7.3 0.00016 40.8 1.5 8 182-189 194-201 (498)
48 KOG2927|consensus 52.4 47 0.001 34.4 7.1 22 326-347 240-261 (372)
49 PF06658 DUF1168: Protein of u 51.0 13 0.00027 33.7 2.5 7 405-411 25-31 (142)
50 PF11188 DUF2975: Protein of u 50.3 1.2E+02 0.0025 26.1 8.6 54 224-282 22-75 (136)
51 PRK14552 C/D box methylation g 50.3 3.9 8.3E-05 43.4 -1.0 12 181-192 165-176 (414)
52 PF15002 ERK-JNK_inhib: ERK an 50.0 13 0.00028 35.7 2.6 10 348-357 124-133 (207)
53 PRK10929 putative mechanosensi 49.7 5.5E+02 0.012 31.1 18.0 45 147-191 565-615 (1109)
54 PF10317 7TM_GPCR_Srd: Serpent 49.1 2.8E+02 0.0061 27.5 22.6 95 62-164 7-112 (292)
55 PF10856 DUF2678: Protein of u 46.8 9.3 0.0002 33.2 1.0 61 45-108 17-77 (118)
56 KOG4055|consensus 45.3 7.6 0.00016 36.3 0.2 6 405-410 69-74 (213)
57 smart00786 SHR3_chaperone ER m 43.6 4.3 9.4E-05 38.5 -1.7 7 322-328 94-100 (196)
58 PF15333 TAF1D: TATA box-bindi 43.0 29 0.00063 33.1 3.7 6 417-422 43-48 (217)
59 PF11694 DUF3290: Protein of u 42.4 1E+02 0.0023 28.0 7.1 51 55-105 14-64 (149)
60 KOG2270|consensus 41.0 15 0.00032 38.9 1.5 10 181-190 223-232 (520)
61 KOG4055|consensus 36.4 27 0.00059 32.8 2.3 10 412-421 85-94 (213)
62 cd07912 Tweety_N N-terminal do 32.9 2E+02 0.0044 30.7 8.5 27 54-80 36-62 (418)
63 PF10192 GpcrRhopsn4: Rhodopsi 32.7 4.8E+02 0.01 25.5 20.7 17 45-61 18-34 (257)
64 PHA03029 hypothetical protein; 30.8 2.8E+02 0.0061 22.3 6.8 16 280-295 60-75 (92)
65 KOG0345|consensus 30.7 29 0.00063 37.4 1.8 13 67-79 173-185 (567)
66 KOG2357|consensus 30.1 36 0.00077 36.0 2.3 18 63-80 108-125 (440)
67 PRK10263 DNA translocase FtsK; 29.8 9.5E+02 0.021 29.7 13.9 22 136-157 71-92 (1355)
68 PF12794 MscS_TM: Mechanosensi 29.0 6.5E+02 0.014 25.9 19.2 51 137-187 81-137 (340)
69 PF12569 NARP1: NMDA receptor- 28.9 32 0.00069 37.7 1.8 33 327-359 331-364 (517)
70 KOG1277|consensus 27.9 7.7E+02 0.017 26.9 11.5 86 198-294 421-509 (593)
71 PF15002 ERK-JNK_inhib: ERK an 27.4 47 0.001 31.9 2.5 6 352-357 122-127 (207)
72 PRK11778 putative inner membra 27.0 45 0.00097 34.4 2.4 12 322-333 4-15 (330)
73 KOG3088|consensus 26.4 4.9E+02 0.011 26.4 9.3 15 225-239 239-253 (313)
74 KOG4433|consensus 26.1 1E+02 0.0022 33.3 4.8 33 50-82 36-68 (526)
75 PF10582 Connexin_CCC: Gap jun 25.9 2.1E+02 0.0046 22.5 5.4 36 40-75 25-64 (67)
76 PRK14485 putative bifunctional 25.7 1.1E+03 0.023 27.2 15.0 109 83-199 4-116 (712)
77 KOG0112|consensus 25.6 26 0.00057 40.2 0.4 6 417-422 717-722 (975)
78 KOG4812|consensus 24.4 2.1E+02 0.0045 28.3 6.2 26 228-253 227-252 (262)
79 KOG3598|consensus 23.3 42 0.0009 40.8 1.5 10 59-68 1118-1127(2220)
80 PF06813 Nodulin-like: Nodulin 23.1 7.3E+02 0.016 24.4 17.0 90 135-243 131-220 (250)
81 KOG4520|consensus 23.1 36 0.00078 32.2 0.7 26 426-451 168-193 (238)
82 PF03908 Sec20: Sec20; InterP 23.0 85 0.0018 25.8 2.9 26 84-109 64-89 (92)
83 KOG0112|consensus 22.7 30 0.00065 39.8 0.2 9 451-459 778-786 (975)
84 PRK12585 putative monovalent c 22.2 3.7E+02 0.008 25.7 7.2 29 226-254 8-36 (197)
85 KOG3058|consensus 21.8 5.1E+02 0.011 27.0 8.8 81 39-122 96-178 (351)
86 PF05620 DUF788: Protein of un 21.7 44 0.00095 30.9 1.0 11 293-303 64-74 (170)
87 PRK03113 putative disulfide ox 21.3 1.5E+02 0.0033 26.6 4.4 49 178-234 9-57 (139)
88 PLN02967 kinase 21.2 43 0.00093 37.3 1.0 12 449-460 135-146 (581)
89 TIGR00955 3a01204 The Eye Pigm 21.1 1.2E+03 0.025 26.0 14.6 69 178-246 530-610 (617)
90 PRK00611 putative disulfide ox 20.9 1.5E+02 0.0033 26.6 4.3 48 180-235 10-57 (135)
91 PF04929 Herpes_DNAp_acc: Herp 20.5 33 0.00073 35.8 0.0 17 323-339 237-253 (381)
No 1
>PF01534 Frizzled: Frizzled/Smoothened family membrane region; InterPro: IPR000539 The frizzled (fz) locus of Drosophila coordinates the cytoskeletons of epidermal cells, producing a parallel array of cuticular hairs and bristles [, ]. In fz mutants, the orientation of individual hairs with respect both to their neighbours and to the organism as a whole is altered. In the wild-type wing, all hairs point towards the distal tip []. In the developing wing, fz has 2 functions: it is required for the proximal-distal transmission of an intracellular polarity signal; and it is required for cells to respond to the polarity signal. Fz produces an mRNA that encodes an integral membrane protein with 7 putative transmembrane (TM) domains. This protein should contain both extracellular and cytoplasmic domains, which could function in the transmission and interpretation of polarity information []. This signature is usually found downstream of the Fz domain (IPR000024 from INTERPRO); GO: 0007166 cell surface receptor linked signaling pathway, 0016020 membrane
Probab=100.00 E-value=2.6e-83 Score=646.41 Aligned_cols=313 Identities=55% Similarity=1.015 Sum_probs=292.1
Q ss_pred CCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccC-C-C
Q psy14918 46 IFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACED-Q-M 123 (463)
Q Consensus 46 ~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~-~-~ 123 (463)
|+||++|+++++.|+++||++|+++|+||++||++|++|++||||||+|+++||+++|++|+++++.|+|+++|++ + +
T Consensus 1 p~ft~~e~~~~~~wi~~~s~ic~~~t~ftv~Tf~id~~rf~yPerpi~fl~~Cy~~~s~~~l~~~~~g~~~i~C~~~~~~ 80 (328)
T PF01534_consen 1 PFFTQEEKEFARIWIGVWSVICFISTLFTVLTFLIDRSRFRYPERPIIFLSFCYFIVSLGYLIRFILGRESIACRSQDGT 80 (328)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCCccchhhHHHHHHHHHHHHHHHHHhccCCCcCCCcCCCC
Confidence 7999999999999999999999999999999999999999999999999999999999999999999999999997 3 3
Q ss_pred ceeecc-CCCC--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccc
Q psy14918 124 VRYGAS-GPTP--CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAV 200 (463)
Q Consensus 124 ~~~~~~-~~~~--C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v 200 (463)
.+...+ |.++ |+++|+++|||+|||.+||++|+++|||++++||++|+|++++.|||++||++|+++|+++|++++|
T Consensus 81 ~~~~~~~~~~~~~C~~~F~l~Yyf~mAa~~WWviLt~~W~lsa~~kw~~e~i~~~s~yfH~~aW~iP~~~ti~vL~~~~V 160 (328)
T PF01534_consen 81 ERILVQDGLENSSCTVVFLLLYYFGMAASLWWVILTLTWFLSAGLKWGSEAIEKKSSYFHLVAWGIPAVLTIAVLALRKV 160 (328)
T ss_pred ceeEeecCCCCCchhhHHHHHHHHHhHHHHHHHHHHHHHHHHhhcccCcchhhhhcchhhhHHhhhhHHHHHHHHHhccc
Confidence 344444 4344 9999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCcccceeEeeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHHHHHHHHHHHHH
Q psy14918 201 DGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVL 280 (463)
Q Consensus 201 ~~~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~l~~rigif~~l 280 (463)
|||+++|+||||++|..++.+|+++|++++|++|++|++.|+++++|+|+.+++++ ++++|++++|.|+|+|+++
T Consensus 161 dgD~ltGiC~Vg~~~~~~l~~fvl~Pl~i~l~iG~~fL~~G~~~l~rir~~~~~~~-----~~~~Kl~klm~RiGiFsvl 235 (328)
T PF01534_consen 161 DGDELTGICFVGNQNPSALRGFVLAPLFIYLLIGTVFLLAGFVSLFRIRRSMKHDG-----TKTSKLEKLMIRIGIFSVL 235 (328)
T ss_pred ccccccceeEEeCCCHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhhcccc-----cchhhHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999877 7789999999999999999
Q ss_pred HhhhhHHHHhhHhhhhcccccccccc---ccCCCC-------CCCcCCCCchHHHHHHHHHHHHhhcccceeeeechhhh
Q psy14918 281 STLPASIVIGCLIYESYVYNEYVESI---VCPCSE-------KLDKDYFKPLHVVLLLKYFMALAVGLTSGVWIWSGKTF 350 (463)
Q Consensus 281 y~~~~~~~~~~~fYe~~~~~~w~~s~---~C~~~~-------~~~~~~~~P~~~~~~lk~fm~li~GI~~~~w~~s~KT~ 350 (463)
|++|++++++||+||+.|++.|++++ +|.... ....+.++|+++++++|+||+|++||++++|+||+||+
T Consensus 236 y~vp~~~~i~c~~YE~~~~~~W~~~~~~~~c~~~~~p~~~~~~~~~~~~~P~~~vf~Lk~~m~LivGItsg~WIwS~KTl 315 (328)
T PF01534_consen 236 YLVPALIVIACHFYEYSNRPQWEESWRDYNCCRLNVPCPSDSCSCCPPSRPSFAVFMLKYFMSLIVGITSGFWIWSSKTL 315 (328)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHhhhcccCcCCccCCCCCCCCCcccccCCcchHHHHHHHHHHHhhchheeeeeechHHH
Confidence 99999999999999999999999877 332211 11124679999999999999999999999999999999
Q ss_pred hHHHHHHhhhhCC
Q psy14918 351 SSWKKLWNRLFGF 363 (463)
Q Consensus 351 ~~W~~~~~r~~~~ 363 (463)
++||++++|+++|
T Consensus 316 ~sWrr~~~R~~~~ 328 (328)
T PF01534_consen 316 ESWRRFFRRLCGK 328 (328)
T ss_pred HHHHHHHHHHhCc
Confidence 9999999998875
No 2
>KOG3577|consensus
Probab=100.00 E-value=2.4e-60 Score=509.89 Aligned_cols=325 Identities=52% Similarity=0.955 Sum_probs=301.8
Q ss_pred ecCCCCCcCCCCCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhc
Q psy14918 33 VSGINNCAYPCQGIFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSV 112 (463)
Q Consensus 33 ~~~~~~C~~pC~~~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~ 112 (463)
+.+.++|++||++++|+++|+++++.|++.|+.+|+.+|+|+++||++|.+|++||||+|+|+++||++.+++|++++..
T Consensus 206 ~~~~~~C~~~C~~~~f~~~e~~~~~~~i~~~s~~c~~~tlft~ltf~~d~~rfryper~i~~l~~c~~~~s~~~~~~~~~ 285 (556)
T KOG3577|consen 206 SSEISNCAPPCPNPLFSRDENRFARSWIGIWSALCFLSTLFTVLTFLIDRMRFRYPERPIFFLSACYLIVSVGYLSSFVL 285 (556)
T ss_pred ccCcccCCCCCCCcccchhhhhhHHHHhhhhhhccchhhHhHHHhhccchhhccCCCcchhhHHHhhhhhccceeecccC
Confidence 45677999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCcceeccC-CCceeeccCCCC--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHH
Q psy14918 113 GHEEVACED-QMVRYGASGPTP--CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTV 189 (463)
Q Consensus 113 g~~~i~C~~-~~~~~~~~~~~~--C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v 189 (463)
|+++++|+. +...+..++.++ |+++|.++|||+||+.+||++++++|++++++||+.|++++++.|||+++|.+|++
T Consensus 286 g~~~~~c~~~~~~~~~~~~~~~~~C~~vf~l~y~f~~a~~~wwv~l~l~w~l~~~~kw~~e~~e~~~~~fh~~aw~~p~~ 365 (556)
T KOG3577|consen 286 GSDEIACRATPARLLVQTGLENSGCTLVFLLLYFFLMAGSVWWVILTLTWFLAAGLKWGEEAIEAKSSYFHLLAWGLPSV 365 (556)
T ss_pred CCceeccCCCCCCceeecCCCCCCceeeehhHHHHHHHHHHHHHHHHHHHHhhhcCcccccccccccceeeehhcCCchH
Confidence 889999995 333455555555 99999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhhhccccCCcccceeEeeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHH
Q psy14918 190 KSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEK 269 (463)
Q Consensus 190 ~tii~l~~~~v~~~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~ 269 (463)
.|++++++++||||+.+|+|+||+.|..++.+|+++|+++++++|..|++.|+..++++|..+++++ +.+.|+++
T Consensus 366 ~ti~~l~~~~v~~~~~~giC~vG~~~~~~~~~fvl~pl~~~l~ig~~~l~~G~~~l~~~r~~~k~~~-----~~~~kl~~ 440 (556)
T KOG3577|consen 366 QTILILALNKVDGDELSGICFVGNSNPDALRGFVLAPLFLYLIIGTLFLLAGFVSLFRIRSSIKLSG-----TCTDKLEK 440 (556)
T ss_pred HHHHHHHHHHhcccceeEEEecCCCCcchhceeeecchhhhhhHHHHHHHHHHHHHhcccccccCCc-----cchHHHHH
Confidence 9999999999999999999999999999999999999999999999999999999999999999866 88999999
Q ss_pred HHHHHHHHHHHHhhhhHHHHhhHhhhhcccccccccc----------ccCCCCCCCcCCCCchHHHHHHHHHHHHhhccc
Q psy14918 270 LMIRIGIFSVLSTLPASIVIGCLIYESYVYNEYVESI----------VCPCSEKLDKDYFKPLHVVLLLKYFMALAVGLT 339 (463)
Q Consensus 270 l~~rigif~~ly~~~~~~~~~~~fYe~~~~~~w~~s~----------~C~~~~~~~~~~~~P~~~~~~lk~fm~li~GI~ 339 (463)
+|.|+|+|++||.+|+.++++|++||+.|++.|+.+| +|+.... .....+|++.++|++|+|.+++||+
T Consensus 441 ~m~~iGifs~Ly~~p~~~~~~c~~ye~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~~~~p~~~v~m~k~l~~~~vgi~ 519 (556)
T KOG3577|consen 441 LMVRIGIFSALYTVPALIVIACYFYEQANADSWEGSWEQDCCKIVSFPCPGTAC-CVGKNRPEFGVFMLKYLNSLIVGIT 519 (556)
T ss_pred HHHhcCcceEEEeecceEEEeeEeehhhcccchhhhhhhccceeeccCCcchhc-cccccCchHHHHHHHHHhhcccCcC
Confidence 9999999999999999999999999999999999877 2222211 1246799999999999999999999
Q ss_pred ceeeeechhhhhHHHHHHhhhhCC
Q psy14918 340 SGVWIWSGKTFSSWKKLWNRLFGF 363 (463)
Q Consensus 340 ~~~w~~s~KT~~~W~~~~~r~~~~ 363 (463)
+++|+|+.||+.+|+++++|+..+
T Consensus 520 ~~~wv~s~kt~~~w~~~~~~~~~~ 543 (556)
T KOG3577|consen 520 SGFWVWSSKTLDSWQRFIRRGLVS 543 (556)
T ss_pred cceEEecCCCcccchhhhcccccc
Confidence 999999999999999999886543
No 3
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=99.91 E-value=4.1e-22 Score=200.11 Aligned_cols=258 Identities=17% Similarity=0.268 Sum_probs=192.4
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeec
Q psy14918 49 SQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGA 128 (463)
Q Consensus 49 t~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~ 128 (463)
+++|.+..+....+.|++|+++++|.++||+..++.++++.|.++|++++-++.+++.++..... ++ .
T Consensus 2 ~~~~~~~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~~~~-------~~-----~ 69 (303)
T PF05462_consen 2 VKQQIRTLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMTLSP-------SA-----G 69 (303)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHHhcc-------cC-----C
Confidence 46888999999999999999999999999999887778899999999999999888776643321 00 0
Q ss_pred cCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccce
Q psy14918 129 SGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGI 208 (463)
Q Consensus 129 ~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~ 208 (463)
+++..|.++++++|||.+|+.+|.++++++.|+...++. +..++...+||+++||+|++.++++++.+.+ +..+++
T Consensus 70 ~~s~lC~~Qafliq~f~~as~lWt~~iA~nly~~~~~~~--~~~~k~e~~Y~l~~wGiPli~a~i~l~~~~~--g~~G~W 145 (303)
T PF05462_consen 70 ENSFLCQFQAFLIQFFMLASFLWTLCIAFNLYLTFVKRY--DELEKLEKYYHLVCWGIPLIPAVIPLIGNDY--GPAGNW 145 (303)
T ss_pred CCCcchhhHhHHHHHhhHHHHHHHHHHHHHhhheeEEcC--ccHHHHHHHHHHHHHHHHHHHHHHHHHccCC--CCCCCc
Confidence 112229999999999999999999999999999886542 2357888899999999999999999876654 445669
Q ss_pred eEeeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHHHHHHHHHHHHHHhhhhHHH
Q psy14918 209 CYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLSTLPASIV 288 (463)
Q Consensus 209 CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~l~~rigif~~ly~~~~~~~ 288 (463)
|||.++....+.+.+++|+.+++++..++..+.....++.. +...+ +..++..+...|+..|.+.+++..+..
T Consensus 146 CWI~~~~~~~r~~lfY~Pl~ii~~~~~ilv~i~~~~~y~~~---~~~~s----d~~~~~~~~~~kL~~Yp~ifiicw~fa 218 (303)
T PF05462_consen 146 CWIKPEWDVWRFALFYIPLWIIIIISAILVGITMRYIYRVI---RNGVS----DNKDKHLKYQLKLVNYPLIFIICWIFA 218 (303)
T ss_pred eeecCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---ccccc----hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99987655556678899999888887776655554444432 22221 112232334556777888777777777
Q ss_pred HhhHhhhhccccccccccccCCCCCCCcCCCCchHHHHHHHHHHHHhhcccce-eeeechhhh
Q psy14918 289 IGCLIYESYVYNEYVESIVCPCSEKLDKDYFKPLHVVLLLKYFMALAVGLTSG-VWIWSGKTF 350 (463)
Q Consensus 289 ~~~~fYe~~~~~~w~~s~~C~~~~~~~~~~~~P~~~~~~lk~fm~li~GI~~~-~w~~s~KT~ 350 (463)
+.-..|+..+. .| +.+.+|+......-|..-+ .|+.|++-+
T Consensus 219 ~INRI~~~~~~--------------------~~-~~l~~Lh~~~s~lqGf~nsivy~~n~~~~ 260 (303)
T PF05462_consen 219 TINRIYNFIGK--------------------NP-FWLSVLHVGFSPLQGFFNSIVYGYNNSLM 260 (303)
T ss_pred HHHHHHHHhcC--------------------Cc-hHHHHHHHHHHHHHHHHHHHHHHhCCHHH
Confidence 77666665321 23 4677888888878898844 599998753
No 4
>KOG4193|consensus
Probab=99.77 E-value=5.2e-17 Score=176.11 Aligned_cols=178 Identities=16% Similarity=0.324 Sum_probs=134.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCC-CCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCCChhHH
Q psy14918 59 WIFVMSALCCISTLMTLTTFLIDTE-RFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPTPCTLV 137 (463)
Q Consensus 59 ~~~v~s~i~~~~tlftilTF~i~~~-r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~~C~~~ 137 (463)
...+..++|+++.+.+++||++.++ +..+ .++.+.+++|.++..+.|+++...-..++ ..|.++
T Consensus 325 i~~VG~~iSlvcL~lti~ty~~~~~l~~~~-~~i~~~l~~~L~l~~l~fL~~~~~~~~~~--------------~~C~~~ 389 (610)
T KOG4193|consen 325 LSTVGCIISLVCLLLTIATYLLFRKLQNDR-TKIHINLCLCLFLAELLFLLGIDRTSTSV--------------VLCIAA 389 (610)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHhhc-chhHHHHHHHHHHHHHHHhcccccccCcc--------------cccHHH
Confidence 3346667778888888899998765 2223 89999999999999999888654321111 038999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccC--CcccceeEeeeCC
Q psy14918 138 FLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDG--DTIAGICYVGNHN 215 (463)
Q Consensus 138 ~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~--~~~tg~CwIg~~~ 215 (463)
++++|||.||+++||.+++++.|+..+.... ...+++..+|.+++||.|++++.++++.+.... +.-.+.||+..++
T Consensus 390 a~llhff~LaaF~Wm~leg~hl~~~~v~vf~-~~~~~~~l~~~~~gwg~Pavvv~Isa~~~~~~~~~~~~~~~CWl~~~~ 468 (610)
T KOG4193|consen 390 AILLHFFFLAAFFWMLLEGFHLYLLLVEVFR-SRPRRRKLLYSLYGWGVPAVVVGVSALVDPDLEGQYGTPRVCWLDTQN 468 (610)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-cccchHHHHHHHHHhhhhHHHHhheeEEeccCccccccCCceEEecCC
Confidence 9999999999999999999998875544322 223455555668999999999998887766542 2223459998766
Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcc
Q psy14918 216 IHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKN 254 (463)
Q Consensus 216 ~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~ 254 (463)
. ..|.|++|+++++++++++++.+++.|++.+...++
T Consensus 469 ~--~~~~F~GPv~~ii~~Ni~~Fv~t~~~l~~~~~~~~~ 505 (610)
T KOG4193|consen 469 G--FIWSFLGPVTLIILVNIVMFVVTLKKLLRRLSKLQP 505 (610)
T ss_pred c--eEEEEehHHHHHHHHHHHHHHHHHHHHhhcccccCc
Confidence 4 456689999999999999999999999987766554
No 5
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=99.67 E-value=1.3e-17 Score=161.77 Aligned_cols=180 Identities=18% Similarity=0.327 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCC-C
Q psy14918 55 FASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPT-P 133 (463)
Q Consensus 55 ~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~-~ 133 (463)
.++....+.+++|+++.+++++||...++.++.+.+++.++++|.++.++.++++...-.+.. .+..+ -
T Consensus 2 ~l~~i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~~~~~~~~~----------~~~~~~~ 71 (242)
T PF00002_consen 2 ALSIISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLIGISQTFSPI----------STTNHCL 71 (242)
T ss_dssp HHHHHHHHHHH---------------------------------------------------------------------
T ss_pred eeehHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHheehhhccccc----------ccccccc
Confidence 456667788888888889999999987765555678889999999999999987643211100 00111 2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeEeee
Q psy14918 134 CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGN 213 (463)
Q Consensus 134 C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~ 213 (463)
|.+.++++|||.+|+++|+.+++++.|...+.. .+..+++..+|++++||+|++.+++.++.+.. +..+..||+++
T Consensus 72 C~~~a~~~hy~~la~f~Wm~~~~~~l~~~~~~~--~~~~~~~~~~~~~~gwg~P~~iv~i~~~~~~~--~y~~~~CWl~~ 147 (242)
T PF00002_consen 72 CRAIAILLHYFFLASFFWMLVEAFYLYRLLVKV--FNSSRRRFWWYYLIGWGIPALIVVISVAVNSD--GYGNDNCWLSN 147 (242)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred chhhhhHhHHHHHHHHHHHHHHHHHhheeEEEe--ecccchhhheeeeeeecCcceeeeeeeeeccc--ccccccccccC
Confidence 999999999999999999999999999887553 22225566789999999999999999888722 22235799986
Q ss_pred CCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy14918 214 HNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRN 250 (463)
Q Consensus 214 ~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~ 250 (463)
+ .+..+.+.+|+.+.++++.++++.+++.+.+.++
T Consensus 148 ~--~~~~~~f~~P~~~~l~in~vi~~~v~~~l~~~~~ 182 (242)
T PF00002_consen 148 D--WGFIWAFVGPVLIILLINIVIFILVLRVLVRKSR 182 (242)
T ss_dssp -------------------------------------
T ss_pred C--CceEEEEEecccceecccchhheeeEEeehhhhh
Confidence 3 3556778999999999999999999887776555
No 6
>KOG4564|consensus
Probab=99.28 E-value=1.1e-10 Score=122.80 Aligned_cols=180 Identities=19% Similarity=0.300 Sum_probs=127.3
Q ss_pred HHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHh--cCCcceeccCCCc--eeeccCCCChhHHH
Q psy14918 63 MSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVS--VGHEEVACEDQMV--RYGASGPTPCTLVF 138 (463)
Q Consensus 63 ~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~--~g~~~i~C~~~~~--~~~~~~~~~C~~~~ 138 (463)
.-.+|+++-+.+++.|...++..-....+...|-++|++-++.+++-.. .-+..-.+.++.. -.+.++...|-+..
T Consensus 154 GyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~Ck~~~ 233 (473)
T KOG4564|consen 154 GYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIKDLVLVVNGEQDASSDTSLHCLISSNPVGCKLLF 233 (473)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccchhHHHHH
Confidence 3334444444444555554654434468889999999999999988643 1122222322211 01122222399999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeEeeeCCccc
Q psy14918 139 LLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHN 218 (463)
Q Consensus 139 ~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~~~~~ 218 (463)
++++||.+|-++|++++++.+|...+....+| +.+...|-+++||+|++++++=...+.+..|. .||..+.+ ..
T Consensus 234 ~~~~Yf~~aNf~WmlvEGlYLhtLlv~~~~se--r~~~~~y~~iGWG~P~v~v~~Wai~r~~~~d~---~CW~~~~~-~~ 307 (473)
T KOG4564|consen 234 VFFQYFVLANFFWMLVEGLYLHTLLVVAFFSE--RRYFWLYLLIGWGVPAVFVVVWAIVRLYLEDT---GCWDSNDT-GH 307 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhheeeccC--ccHHHHHHHHhhCCcHHHhhhHHhHheeeccc---cccccCCC-cc
Confidence 99999999999999999999887765544444 45566788999999999999888888766664 79998776 23
Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHhhhh
Q psy14918 219 LKYFVVFPLIIYLILGIIFLFGGFISLFRI 248 (463)
Q Consensus 219 ~~~f~~~Pl~i~llig~~fl~~~~~~l~~i 248 (463)
..|..=+|.++.++||.+||+.+++.|.+.
T Consensus 308 ~~WIi~gPi~~ai~vNf~flinIvrILv~K 337 (473)
T KOG4564|consen 308 IWWIIRGPILLAILVNFIFLINIVRILVTK 337 (473)
T ss_pred eEEEEeccHHHHHHHHHHHHHHHHHHHHHH
Confidence 445556999999999999999999988653
No 7
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=99.08 E-value=1.8e-09 Score=102.88 Aligned_cols=177 Identities=18% Similarity=0.290 Sum_probs=121.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc-cCCCCCCCchHHHHHHHHHHHHHHHHHH-HHh-cCCcceeccCCCceeeccCCCC-
Q psy14918 58 MWIFVMSALCCISTLMTLTTFLI-DTERFKYPERPIVFLSGCYFMVSIGYLI-RVS-VGHEEVACEDQMVRYGASGPTP- 133 (463)
Q Consensus 58 ~~~~v~s~i~~~~tlftilTF~i-~~~r~~yp~r~iifls~c~~~~sl~~li-~~~-~g~~~i~C~~~~~~~~~~~~~~- 133 (463)
....+.|.+|+++++.++..|.. ++++..+.++.|++|.++.++.++++++ ... ..++.+ ..+++
T Consensus 5 ~~~i~~s~lSlls~~~~l~~~~~~~~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i-----------~~~s~~ 73 (201)
T PF11710_consen 5 IGSIVISALSLLSSLAVLYLFFFITYRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGI-----------IAPSPF 73 (201)
T ss_pred eehHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCe-----------eCCCCc
Confidence 34456777777777766666654 3455556678999999999999999998 322 111111 12344
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-cch---hh----hhhhHHHHHHHHHHhhHHHHHHhhhh-ccccCCc
Q psy14918 134 CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLK-WGN---EA----IASYSLYFHLAAWLIPTVKSVSVIIM-SAVDGDT 204 (463)
Q Consensus 134 C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k-~~~---e~----i~~~~~yfh~~aW~iP~v~tii~l~~-~~v~~~~ 204 (463)
|.++|++.+.+..++-+|-+.++++=|+...+. |.. .. +.+...+.-.+.+++|++...++... +....++
T Consensus 74 C~aqGf~~q~g~~~sd~~ilaIAihT~l~v~~~~~~~~~~~~~~~gl~~~~~~v~~~~~~~~~~~~~la~i~~~~~~Y~~ 153 (201)
T PF11710_consen 74 CQAQGFFLQVGDEASDLWILAIAIHTFLIVFRPNWKRKRSKNVEGGLYPYRYWVWVIWILVPLLLASLAFIGLGGPGYGP 153 (201)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccccccccccceEEeeeeeehHHHHHHHHHHHHHHhccccCcccc
Confidence 999999999999999999999999988877553 211 00 11111122244457777777666654 3333455
Q ss_pred ccceeEeeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHh
Q psy14918 205 IAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISL 245 (463)
Q Consensus 205 ~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l 245 (463)
.+++||++.+..+.+.++-++|..+..++..+.-+.++..+
T Consensus 154 ~g~WCWi~~~~~~~Rl~l~y~~~~~~~~~~i~iY~~if~~l 194 (201)
T PF11710_consen 154 AGAWCWIPSRYEWYRLWLHYIWRFIIIFAIIIIYIAIFFYL 194 (201)
T ss_pred cCcEEEECCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67789999877778888889999999888887777776543
No 8
>KOG4289|consensus
Probab=98.92 E-value=1.9e-09 Score=121.34 Aligned_cols=157 Identities=12% Similarity=0.241 Sum_probs=116.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCCChh
Q psy14918 56 ASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPTPCT 135 (463)
Q Consensus 56 ~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~~C~ 135 (463)
+...+.+...+++++.+++++..+..+...+.-.++-..++++..+..+.|++++. .+.+. |+
T Consensus 2165 l~~~t~a~~gvslaal~lt~~llls~RsLksn~~~I~~~l~~Al~l~~L~Fv~gi~--~nq~~---------------Ct 2227 (2531)
T KOG4289|consen 2165 LKMTTYAAVGVSLAALLLTFLLLLSLRSLKSNSHGIHFNLAAALGLAQLVFVLGIN--QNQFY---------------CT 2227 (2531)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhHHHHHHhHHHHHhhhhcc--cCchh---------------hH
Confidence 44455566666666666555544444555556678889999999999999888743 23333 69
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeEeeeCC
Q psy14918 136 LVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHN 215 (463)
Q Consensus 136 ~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~~ 215 (463)
++++++||+.++-+.|.++.+++.|-+..-.-. ..-..+||+++|++|++++.++.++. ..||++-++
T Consensus 2228 vvailLhf~~~stFaWlfl~gLhlYRml~e~v~----tg~mrfy~~vGwgvPa~itglavgld--------DfCwlS~h~ 2295 (2531)
T KOG4289|consen 2228 VVAILLHFTYLSTFAWLFLEGLHLYRMLTENVD----TGPMRFYYLVGWGVPALITGLAVGLD--------DFCWLSPHD 2295 (2531)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHhccCC----CCceeEEEEeecchhhheeeeeeccc--------ceeecccch
Confidence 999999999999999999999999977643111 22245678999999999998888665 579998665
Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHH
Q psy14918 216 IHNLKYFVVFPLIIYLILGIIFLFGGFI 243 (463)
Q Consensus 216 ~~~~~~f~~~Pl~i~llig~~fl~~~~~ 243 (463)
.+.|.|.+|++++++++.+..++...
T Consensus 2296 --~LiWsfagPigl~i~ms~~~~~lsa~ 2321 (2531)
T KOG4289|consen 2296 --TLIWSFAGPIGLVILMSFYAAALSAR 2321 (2531)
T ss_pred --heeeeecccceeeehHHHHHHHHHHH
Confidence 57888999999999888766555443
No 9
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=98.90 E-value=1.1e-07 Score=96.71 Aligned_cols=148 Identities=17% Similarity=0.308 Sum_probs=103.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHh--cCCcceeccCCCceeeccCCC--C---hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy14918 89 ERPIVFLSGCYFMVSIGYLIRVS--VGHEEVACEDQMVRYGASGPT--P---CTLVFLLVYFFGMSSYIWWVILSFTWFL 161 (463)
Q Consensus 89 ~r~iifls~c~~~~sl~~li~~~--~g~~~i~C~~~~~~~~~~~~~--~---C~~~~~l~yyf~las~~Ww~il~~~w~l 161 (463)
.|+++.+++|.++.++|-+++.. .|..++.=.. -..++++ + |+.+.+-+|||.+|.+.|.++.++|-|+
T Consensus 68 ~rIl~~la~aDlLaclGVivRS~vWl~~p~~~~s~----s~~~~~d~wp~afCv~ss~WIq~fYsAtfwWtfcYAVDv~L 143 (405)
T PF02101_consen 68 RRILFWLAVADLLACLGVIVRSSVWLGFPNFIDSI----SDVNGTDIWPAAFCVGSSMWIQLFYSATFWWTFCYAVDVYL 143 (405)
T ss_pred chhHHHHHHHHHHhhhhHHHHhhhhhcCCcccccc----cCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 38889999999999999999854 3322221111 1122222 2 9999999999999999999999999999
Q ss_pred HhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeEeeeCCccccchhhhhHHHHHHHHHHHHHHHH
Q psy14918 162 AAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGG 241 (463)
Q Consensus 162 s~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~ 241 (463)
..+++-+ .....+||+++|++|+++++.-+++=.+- +. .-|--+.+....-+...++|+.++++.+=+++...
T Consensus 144 v~~~~ag----~~~~~lYH~~aWgl~~lL~~~Gl~~Ly~P-s~--~~Ce~~l~~alphYvttY~PlllVlvaNPiLy~~a 216 (405)
T PF02101_consen 144 VIRRSAG----RSTIWLYHMMAWGLPALLCAEGLAMLYYP-SI--SRCESGLEHALPHYVTTYIPLLLVLVANPILYIKA 216 (405)
T ss_pred eeeccCC----CcchhHHHHHHHHHHHHHHHhccceeecC-CH--HhhhhhhhhcchhHHHHHHHHHHHHHhccHHHHHH
Confidence 8766422 22246899999999999998766543332 11 13654422222233456899999999999999888
Q ss_pred HHHhhh
Q psy14918 242 FISLFR 247 (463)
Q Consensus 242 ~~~l~~ 247 (463)
..++-+
T Consensus 217 ~~~V~~ 222 (405)
T PF02101_consen 217 VRAVAS 222 (405)
T ss_pred HHHHHH
Confidence 766543
No 10
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=98.27 E-value=0.00036 Score=70.81 Aligned_cols=163 Identities=12% Similarity=0.168 Sum_probs=94.8
Q ss_pred CCCCCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC-chHHHHHHHHHHHHHHHHHHHHhcCCcceec
Q psy14918 41 YPCQGIFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYP-ERPIVFLSGCYFMVSIGYLIRVSVGHEEVAC 119 (463)
Q Consensus 41 ~pC~~~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp-~r~iifls~c~~~~sl~~li~~~~g~~~i~C 119 (463)
.+|+.-...+++...+.....+..+++++++++.++...-. +.+.| .-.+..++++.++.++........ .. .
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~--~~~~~~~~ll~~laisDll~~~~~~~~~~~---~~-~ 101 (335)
T PHA03087 28 ADCDLNIGYDTNSTILIVVYSTIFFFGLVGNIIVIYVLTKT--KIKTPMDIYLLNLAVSDLLFVMTLPFQIYY---YI-L 101 (335)
T ss_pred CCCCccccccchhhHHHHHHHHHHHHHHHhhHhEEeeehhc--cccCchHHHHHHHHHHHHHHHHhHHHHHHH---Hh-C
Confidence 55554444455555666666677777888877766544333 33333 445677888888766543332210 00 0
Q ss_pred cCCCceeeccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhh--hhhhHHHHHHHHHHhhHHHHHHhhhh
Q psy14918 120 EDQMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEA--IASYSLYFHLAAWLIPTVKSVSVIIM 197 (463)
Q Consensus 120 ~~~~~~~~~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~--i~~~~~yfh~~aW~iP~v~tii~l~~ 197 (463)
.+ -.-++..|.+..++.+.+..++..=.+.++++=|++...--..+. -.++.....++.|.++++..+..+..
T Consensus 102 ~~-----~~~~~~~C~~~~~~~~~~~~~S~~~l~~iaidRy~aI~~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~ 176 (335)
T PHA03087 102 FQ-----WSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISIIETTPILFV 176 (335)
T ss_pred CC-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccchhhhHHHHHHHHHHHHHHhccHhhe
Confidence 00 001222299999998888888877777889998887643211111 12333445577899988766655555
Q ss_pred ccccCCcccceeEeeeC
Q psy14918 198 SAVDGDTIAGICYVGNH 214 (463)
Q Consensus 198 ~~v~~~~~tg~CwIg~~ 214 (463)
..+..+..+..|.....
T Consensus 177 ~~~~~~~~~~~C~~~~~ 193 (335)
T PHA03087 177 YTTKKDHETLICCMFYN 193 (335)
T ss_pred eeeeccCCCceEEecCC
Confidence 55544455667877543
No 11
>PHA03235 DNA packaging protein UL33; Provisional
Probab=98.26 E-value=0.0012 Score=69.55 Aligned_cols=132 Identities=11% Similarity=0.091 Sum_probs=82.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcc-CCC-CCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCC
Q psy14918 55 FASMWIFVMSALCCISTLMTLTTFLID-TER-FKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPT 132 (463)
Q Consensus 55 ~~~~~~~v~s~i~~~~tlftilTF~i~-~~r-~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~ 132 (463)
.......++.+++++++++.++++... +.+ .+.+...++.||++++++.+ .+-..+... . .....|+.
T Consensus 34 ~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~l~-~lP~~i~~~------~---~~~~~g~~ 103 (409)
T PHA03235 34 TETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLTVF-VLPFIMLSN------Q---GLLSGSVA 103 (409)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHHHH-HHHHHHHhc------C---ccccCCCC
Confidence 444555567777888888888776643 333 34567788999999998643 222111110 0 00111222
Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhh-hhhHHHHHHHHHHhhHHHHHHhhh
Q psy14918 133 PCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAI-ASYSLYFHLAAWLIPTVKSVSVII 196 (463)
Q Consensus 133 ~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i-~~~~~yfh~~aW~iP~v~tii~l~ 196 (463)
-|.+.+++.+....+++.=.+.++++=|+++..-...... .+......+++|.++++..+..+.
T Consensus 104 ~Ck~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~~~~~~a~~ii~~iWi~sll~s~P~~~ 168 (409)
T PHA03235 104 GCKFASLLYYASCTVGFATVALIAADRYRVIHQRTRARSSAYRSTYKILGLTWFASLICSGPAPV 168 (409)
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccCcccchhhhhHHHHHHHHHHHHHHHHH
Confidence 3999999888888888877888999988887542211111 233444567889999887765443
No 12
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=98.13 E-value=0.00017 Score=67.97 Aligned_cols=137 Identities=16% Similarity=0.178 Sum_probs=89.7
Q ss_pred HHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCC-C-hhHHHHHHHHHHHH
Q psy14918 70 STLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPT-P-CTLVFLLVYFFGMS 147 (463)
Q Consensus 70 ~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~-~-C~~~~~l~yyf~la 147 (463)
++++.++++.-.+++.+-.+..++.++++.++.++......... ... ++ ...+ . |.+...+.+.+..+
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~--~~~-~~-------~~~~~~~C~~~~~~~~~~~~~ 70 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYS--LLF-DD-------WIFSSFLCRIFGFLFYFSSFS 70 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHH--HHH-SS-------CTSHHHHHHHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccc--ccc-cc-------ccccccccccccccccccccc
Confidence 35666777776555444456677889999999888766554321 010 00 0011 2 99999999999999
Q ss_pred HHHHHHHHHHHHHHHhccccchhhh--hhhHHHHHHHHHHhhHHHHHHhhhhccc-c-CCcccceeEeeeCCc
Q psy14918 148 SYIWWVILSFTWFLAAGLKWGNEAI--ASYSLYFHLAAWLIPTVKSVSVIIMSAV-D-GDTIAGICYVGNHNI 216 (463)
Q Consensus 148 s~~Ww~il~~~w~ls~~~k~~~e~i--~~~~~yfh~~aW~iP~v~tii~l~~~~v-~-~~~~tg~CwIg~~~~ 216 (463)
+....+.+++|=|++.......... .+......+++|.+++++.+..+..... + .+.....|++...+.
T Consensus 71 s~~~~~~is~dRy~~i~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~ 143 (257)
T PF00001_consen 71 SIFSLVAISIDRYLAICHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSWVYFVSDGSQSFCFIDFSSS 143 (257)
T ss_dssp HHHHHHHHHHHHHHHHHSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCEEEEESTCCCEEEEESCSSS
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 9999999999988887532111111 2445567788999999988888665332 2 222267899976654
No 13
>PHA03234 DNA packaging protein UL33; Provisional
Probab=97.98 E-value=0.0012 Score=67.90 Aligned_cols=149 Identities=11% Similarity=0.075 Sum_probs=88.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCC-C-CCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCC
Q psy14918 54 EFASMWIFVMSALCCISTLMTLTTFLIDTE-R-FKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGP 131 (463)
Q Consensus 54 ~~~~~~~~v~s~i~~~~tlftilTF~i~~~-r-~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~ 131 (463)
...-..-.+.-+++++++++.+.+..+..+ + .+-.+..++.||++++++.+.......... .+ - .-|+
T Consensus 33 ~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~lp~~~~~~~-------~~--w-~fG~ 102 (338)
T PHA03234 33 ILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLCVFFLTLNRE-------AL--F-NFNQ 102 (338)
T ss_pred HHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CC--c-cCch
Confidence 344444456667778888888877664332 2 344567889999999988764433222110 00 0 1122
Q ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCc-ccceeE
Q psy14918 132 TPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDT-IAGICY 210 (463)
Q Consensus 132 ~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~-~tg~Cw 210 (463)
.-|-+.+++.+....+|++-.+.+++|=|++...........+......++.|.+.++.++..+...++..++ ....|.
T Consensus 103 ~lCk~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p~~~~~~~~~~~~~i~~~Wi~s~l~~~P~l~~~~~~~~~~~~~~C~ 182 (338)
T PHA03234 103 AFCQCVLFIYHASCSYSICMLAIIATIRYKTLHRRKKNDKKNNHIGRNIGILFLASAMCAIPAALFVKTEGKKGNYGKCN 182 (338)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhHeeeechhhhhhhhhhHHHHHHHHHHHHHHHHhhHhHeeeeeecCCCCCcCc
Confidence 3399999999999999999999999999988754321111111112223445888877666555544433222 224697
Q ss_pred ee
Q psy14918 211 VG 212 (463)
Q Consensus 211 Ig 212 (463)
..
T Consensus 183 ~~ 184 (338)
T PHA03234 183 IH 184 (338)
T ss_pred cc
Confidence 64
No 14
>PHA02834 chemokine receptor-like protein; Provisional
Probab=97.95 E-value=0.0019 Score=65.72 Aligned_cols=159 Identities=13% Similarity=0.225 Sum_probs=93.6
Q ss_pred CCCCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccC
Q psy14918 42 PCQGIFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACED 121 (463)
Q Consensus 42 pC~~~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~ 121 (463)
+|+....++++.........++.+++++++++.++.+.-++.+ +..+..++.||+++++..++. .+.. ..... +
T Consensus 17 ~~~~~~~~~~~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~~-~~~n~~i~nLAiaDll~~~~l--P~~i-~~~~~--~ 90 (323)
T PHA02834 17 ECDFEMVNSDVNYFVIVFYILLFIFGLIGNVLVIAVLIVKRFM-FVVDVYLFNIAMSDLMLVFSF--PFII-HNDLN--E 90 (323)
T ss_pred CCcccccccchhhhHHHHHHHHHHHHHhhHHHHHHHHHhcccc-chhhhhhHHHHHHHHHHHHHH--HHHH-HHHcC--C
Confidence 5664455555555566666778888888888887766543333 345677888999998764432 2111 01110 0
Q ss_pred CCceeeccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccch-hhhhhhHHHHHHHHHHhhHHHHHHhhhhccc
Q psy14918 122 QMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGN-EAIASYSLYFHLAAWLIPTVKSVSVIIMSAV 200 (463)
Q Consensus 122 ~~~~~~~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~-e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v 200 (463)
-.-|+..|.+.+.+.+.+..+++.=.+.++++=|++...-... ....+......+++|.++++..+.++.....
T Consensus 91 -----w~~g~~~C~~~~~~~~~~~~~Si~tL~~IsidRY~aI~~P~~~~~~~~~~~~~~i~~iWi~s~l~~~P~~~~~~~ 165 (323)
T PHA02834 91 -----WIFGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSISLSVLLSVAAWVCSVILSMPAMVLYYV 165 (323)
T ss_pred -----cCCcchHHHhHHHHHHHHHHHHHHHHHHHHHHHhhheeCchhccCCccchHHHHHHHHHHHHHHHHhhHHHHHHh
Confidence 0112223999998877777777777788999988876432111 1111222334467899998877655544333
Q ss_pred cCCcccceeEe
Q psy14918 201 DGDTIAGICYV 211 (463)
Q Consensus 201 ~~~~~tg~CwI 211 (463)
+.......|..
T Consensus 166 ~~~~~~~~C~~ 176 (323)
T PHA02834 166 DNTDNLKQCIF 176 (323)
T ss_pred ccCCCceEEec
Confidence 33323346863
No 15
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=97.88 E-value=0.0071 Score=64.00 Aligned_cols=148 Identities=10% Similarity=0.142 Sum_probs=87.9
Q ss_pred CCCCcCCCCCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCc
Q psy14918 36 INNCAYPCQGIFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHE 115 (463)
Q Consensus 36 ~~~C~~pC~~~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~ 115 (463)
.++...||-. +.....+.....+..+++++++++.++... .+++.+..+..++.||++.++..+..-..+..
T Consensus 85 ~~~~~~~c~~----~~~~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~l~lPf~i~~--- 156 (417)
T PHA02638 85 LINIKNKCMY----PSISEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFVIDFPFIIYN--- 156 (417)
T ss_pred ccccCCCCCc----cchhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHHHHHHHHHHH---
Confidence 3446777821 113346666666788888888888775533 33333444677788999998765442211111
Q ss_pred ceeccCCCceeeccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhh--hhhHHHHHHHHHHhhHHHHHH
Q psy14918 116 EVACEDQMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAI--ASYSLYFHLAAWLIPTVKSVS 193 (463)
Q Consensus 116 ~i~C~~~~~~~~~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i--~~~~~yfh~~aW~iP~v~tii 193 (463)
.. .+ -..|+.-|.+...+.+.+..++++-.++++++=|++...-...... .+......+++|.+.++..+.
T Consensus 157 ~~----~~---W~fg~~~Ck~~~~l~~~~~~~Si~~L~~isiDRYlaIv~p~~~~~~~~~~~~~i~~~~iW~~s~l~slP 229 (417)
T PHA02638 157 EF----DQ---WIFGDFMCKVISASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSP 229 (417)
T ss_pred Hh----cc---ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccceecHhhhHhhHhHHHHHHHHHHHH
Confidence 00 00 0123333999888877777777777788999988887542111111 122233457889999888877
Q ss_pred hhhhc
Q psy14918 194 VIIMS 198 (463)
Q Consensus 194 ~l~~~ 198 (463)
.+...
T Consensus 230 ~~~~~ 234 (417)
T PHA02638 230 AYFIF 234 (417)
T ss_pred HHHHh
Confidence 76543
No 16
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.64 E-value=0.41 Score=48.51 Aligned_cols=220 Identities=10% Similarity=0.148 Sum_probs=98.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCC---CCCCCchHHHHHHHHHHHHHHHHHHHHhcC--CcceeccCCCceeeccCCCC
Q psy14918 59 WIFVMSALCCISTLMTLTTFLIDTE---RFKYPERPIVFLSGCYFMVSIGYLIRVSVG--HEEVACEDQMVRYGASGPTP 133 (463)
Q Consensus 59 ~~~v~s~i~~~~tlftilTF~i~~~---r~~yp~r~iifls~c~~~~sl~~li~~~~g--~~~i~C~~~~~~~~~~~~~~ 133 (463)
...+...++.+++.|.++..+.|+. +.+.-|.++..++++=++.....++....- .......+ ..
T Consensus 12 i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~----------~~ 81 (303)
T PF05296_consen 12 ILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSE----------NV 81 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhh----------hH
Confidence 3345567888999999999998753 344457888888887444333222221110 11111111 01
Q ss_pred hhHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcc------ccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccc--c---
Q psy14918 134 CTLVFLLVYFFGMSSYIWWVI-LSFTWFLAAGL------KWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAV--D--- 201 (463)
Q Consensus 134 C~~~~~l~yyf~las~~Ww~i-l~~~w~ls~~~------k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v--~--- 201 (463)
+.+... +.-|...+.+|.+. |++..++-+.. -|-...|.+......++.+.++...++........ +
T Consensus 82 ~~~~~~-~~~f~~~~s~W~tt~LsvfYcvKI~~fs~~~Fl~LK~rI~~~v~~lLl~s~l~s~~~~~~~~~~~~~~~~~~~ 160 (303)
T PF05296_consen 82 YKIIDF-LWMFSNSSSLWFTTWLSVFYCVKIANFSHPFFLWLKRRISKVVPWLLLGSLLISFLNLLSIPLFIDNHINNNN 160 (303)
T ss_pred HHHHHH-HHHHHhHHHHHHHHHHHHHHheeeecCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheeeeecccc
Confidence 222222 23334444555543 44444433211 12233456666666677777777323222222221 1
Q ss_pred -CCcccceeEeeeCCccccchhh--hhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHHHHHHHHHHH
Q psy14918 202 -GDTIAGICYVGNHNIHNLKYFV--VFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFS 278 (463)
Q Consensus 202 -~~~~tg~CwIg~~~~~~~~~f~--~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~l~~rigif~ 278 (463)
....+...-....+......+. .+-..+-+++-++-.+..+.+|.|.-+.+++..+..++...+-..+.+.-+..|.
T Consensus 161 ~~~~~N~t~~~~~~~~~~~~~~~~~~~~~~lPf~i~l~s~~lli~SL~rH~r~M~~n~~g~~~ps~~aH~~a~k~~~sfl 240 (303)
T PF05296_consen 161 TNNSRNSTSNFQESKSSYFYFFILFNLGSFLPFLIFLVSSILLIFSLWRHMRRMQKNATGFRDPSTEAHIRAIKTMISFL 240 (303)
T ss_pred ccCCCcceEEeecchHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhCCCCCCCCCCcHHHHHHHHHHHHHH
Confidence 1111111112111211111111 1222223333334444456777777666665432222223334445555566677
Q ss_pred HHHhhhhHHHH
Q psy14918 279 VLSTLPASIVI 289 (463)
Q Consensus 279 ~ly~~~~~~~~ 289 (463)
++|++.....+
T Consensus 241 ~ly~~~~~~~~ 251 (303)
T PF05296_consen 241 ILYIIYFLSLI 251 (303)
T ss_pred HHHHHHHHHHH
Confidence 77766544333
No 17
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=96.22 E-value=0.99 Score=45.43 Aligned_cols=112 Identities=12% Similarity=0.123 Sum_probs=76.9
Q ss_pred HHHHHHHhhHHHHHHhhhhccccCCcccc-eeEeeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCC
Q psy14918 179 FHLAAWLIPTVKSVSVIIMSAVDGDTIAG-ICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGG 257 (463)
Q Consensus 179 fh~~aW~iP~v~tii~l~~~~v~~~~~tg-~CwIg~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~ 257 (463)
--++++++|++..+.....+....|...+ +|.-...+.+....++++|-.++-+++.++-++.++..+|.|+..++--+
T Consensus 107 d~~i~~g~Pil~m~l~yivQ~~Rf~I~e~~GC~~~~~~s~~~~~l~~iwp~i~~lia~vYa~lt~~~f~~rr~~f~~~l~ 186 (283)
T PF02076_consen 107 DLLICFGIPILQMALHYIVQGHRFDIVEDVGCYPAIYPSWPTIFLFIIWPPIFSLIAAVYAILTLRRFIRRRKQFRDILS 186 (283)
T ss_pred HHHHHHHHHHHHHHHHHHhcccceeEecccCCccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34789999999998888777666665554 49988777766777788999999999999999999999999888764211
Q ss_pred CC-ccchhhhHHHHHHHHHHHHHHHhhhhHHHHhhHhh
Q psy14918 258 IS-NRTKADKLEKLMIRIGIFSVLSTLPASIVIGCLIY 294 (463)
Q Consensus 258 ~~-~~~~~~kl~~l~~rigif~~ly~~~~~~~~~~~fY 294 (463)
.. +.-+..+. .|+..+++..++...-+..+.++
T Consensus 187 ~s~s~l~~~Rf----~RL~~la~~~i~~~~P~~i~~~v 220 (283)
T PF02076_consen 187 SSNSGLNTSRF----FRLLALALVIILIYLPLSIYYLV 220 (283)
T ss_pred cCCCcccHHHH----HHHHHHHHHHHHHHHHHHHHHHH
Confidence 00 00122223 35555555555554444555555
No 18
>KOG2087|consensus
Probab=94.44 E-value=0.078 Score=54.44 Aligned_cols=191 Identities=13% Similarity=0.128 Sum_probs=120.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCC-ceeec--cCCC
Q psy14918 56 ASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQM-VRYGA--SGPT 132 (463)
Q Consensus 56 ~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~-~~~~~--~~~~ 132 (463)
.|..+=+.+.+.++++++.+++...-+.+..-|.-.|..+|++.+++.+--.+-... | +|...+ .+++. |+..
T Consensus 27 lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~v--D--~~~~gey~~~ai~W~tg~ 102 (363)
T KOG2087|consen 27 LRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASV--D--AKTRGEYYKHAIDWQTGL 102 (363)
T ss_pred eeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHh--h--HHHHHHHHHHHHhhhhcC
Confidence 334444677788888888887777655545555677788888887766522211111 1 111110 01222 3334
Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-hhhh-hhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeE
Q psy14918 133 PCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWG-NEAI-ASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICY 210 (463)
Q Consensus 133 ~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~-~e~i-~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~Cw 210 (463)
.|.+.|++.-|....+.+=.+.+++.-+++++.-.. ...+ -+...+--+.+|....+..++++.... +...+++|.
T Consensus 103 gC~~aGflavFASElSv~~LT~itlEr~l~i~~p~~~~~~~~lr~~~~ill~~wl~~~l~A~~Pl~g~s--~Y~~~~vCl 180 (363)
T KOG2087|consen 103 GCPVAGFLAVFASELSVFLLTLITLERWLSITYPFRLDRKAKLRPLVLILLLGWLFAFLMALLPLFGIS--SYGASSVCL 180 (363)
T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHhheeccccCCCcccccHHHHHHHHHHHHHHHHHhccccCCC--CCcccceee
Confidence 499999999999999999999999995555432111 1112 222456678999999999999987633 334458999
Q ss_pred eeeCCccccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhccc
Q psy14918 211 VGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQ 255 (463)
Q Consensus 211 Ig~~~~~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~ 255 (463)
--..+...-.| .-+.+++.+|.+-++++..+-.++.-.+++.
T Consensus 181 PL~~~~~~s~g---~y~~~~l~~N~lafiiia~~Y~~iy~~l~~~ 222 (363)
T KOG2087|consen 181 PLHIEEPLSTG---YYLVALLGLNLLAFIIIAFSYGKIYCSLRKG 222 (363)
T ss_pred ecccCCccchh---HHHHHHHHHHHHHHHHHHHHhhhhheeeecC
Confidence 86554433343 2356777888888888777777776665543
No 19
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=94.30 E-value=0.55 Score=46.26 Aligned_cols=134 Identities=11% Similarity=0.120 Sum_probs=82.2
Q ss_pred HHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCCC-ChhHHHHHHHHHH
Q psy14918 67 CCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPT-PCTLVFLLVYFFG 145 (463)
Q Consensus 67 ~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~~-~C~~~~~l~yyf~ 145 (463)
++++++..+....-+++.++.-.-.+-.+|+++++..++.++..... + ...+.+. .|...-+..+++.
T Consensus 4 g~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~---~--------~~~~i~~~~Cf~~~~~~~f~~ 72 (257)
T PF10320_consen 4 GLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFL---F--------RDHQITRSECFWQIFFYIFFQ 72 (257)
T ss_pred EEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHH---H--------hheeccHHHHHHHHHHHHHHH
Confidence 44566666666665555433334555667788888887777654321 0 0011222 3999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCc-ccceeEeee
Q psy14918 146 MSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDT-IAGICYVGN 213 (463)
Q Consensus 146 las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~-~tg~CwIg~ 213 (463)
.+.....+++++|-+++...--....+... .|..+.-.+|.+.++.....+-...|+ ....|-.-.
T Consensus 73 ~~qs~~~l~i~iDr~iaV~~P~~Y~~~~~~--~y~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~C~pp~ 139 (257)
T PF10320_consen 73 CAQSVIMLAIAIDRLIAVCFPLRYRTISTR--KYLIILLIFPVIYSIFFTVIGFLYRDDETIVICNPPL 139 (257)
T ss_pred HHHHHHHHHHHhhheeeEeehhhhhhcccc--cchhhHhHHHHHHHHHHHhheeEecCCcccccCCCcc
Confidence 999999999999998886432111111111 144555567777777776666554333 456788754
No 20
>PF10278 Med19: Mediator of RNA pol II transcription subunit 19 ; InterPro: IPR019403 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med19 represents a family of conserved proteins which are members of the multi-protein co-activator Mediator complex []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=93.50 E-value=0.03 Score=52.03 Aligned_cols=8 Identities=25% Similarity=0.393 Sum_probs=4.2
Q ss_pred CCCCCCCC
Q psy14918 394 PPPPNIPI 401 (463)
Q Consensus 394 ~~p~~~~~ 401 (463)
++|+|+|-
T Consensus 92 L~pGplPE 99 (178)
T PF10278_consen 92 LHPGPLPE 99 (178)
T ss_pred CCCCCCch
Confidence 44555553
No 21
>KOG4220|consensus
Probab=93.48 E-value=0.086 Score=55.08 Aligned_cols=143 Identities=17% Similarity=0.237 Sum_probs=92.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHH----HHHHHHhcCCcceeccCCCceeeccCCCC-
Q psy14918 59 WIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSI----GYLIRVSVGHEEVACEDQMVRYGASGPTP- 133 (463)
Q Consensus 59 ~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl----~~li~~~~g~~~i~C~~~~~~~~~~~~~~- 133 (463)
..+++|.+..++.++.++.|-+|++..+-.+--++-|+++.+++.+ .|..-.+.|+-. .++
T Consensus 36 v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~--------------LG~~ 101 (503)
T KOG4220|consen 36 VTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWP--------------LGPL 101 (503)
T ss_pred hhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccc--------------cchH
Confidence 3344445555555555667778887655556677888888877544 222333333221 123
Q ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccc-cCCc--ccc
Q psy14918 134 -CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLK--WGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAV-DGDT--IAG 207 (463)
Q Consensus 134 -C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k--~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v-~~~~--~tg 207 (463)
|-.--.+=|-..-|+.+=.++++||=|.+..+- .....-.+.+.....+||+|.+++-+-++..-++ .|.. -.|
T Consensus 102 ~CdlWLalDYvaSNASVmNLLiISFDRYFsVTrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPaIl~WqyivGkrTv~~~ 181 (503)
T KOG4220|consen 102 VCDLWLALDYVASNASVMNLLIISFDRYFSVTRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPAILFWQYIVGKRTVPDG 181 (503)
T ss_pred HHHHHHHHHHHhhhhhhhhhheeeeecceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHHhhHhheeeeecCCC
Confidence 888888999999999999999999988876331 1111124556678899999999999977665554 3332 236
Q ss_pred eeEeeeCC
Q psy14918 208 ICYVGNHN 215 (463)
Q Consensus 208 ~CwIg~~~ 215 (463)
-|+|.-.+
T Consensus 182 eC~iQFls 189 (503)
T KOG4220|consen 182 ECYIQFLS 189 (503)
T ss_pred ceEEEeec
Confidence 79997544
No 22
>KOG4219|consensus
Probab=92.59 E-value=4.6 Score=42.40 Aligned_cols=128 Identities=14% Similarity=0.141 Sum_probs=71.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcc---CCCCCCCchHHHHHHHHHHHHHHHHHHH---HhcCC----cceeccCCCceee
Q psy14918 58 MWIFVMSALCCISTLMTLTTFLID---TERFKYPERPIVFLSGCYFMVSIGYLIR---VSVGH----EEVACEDQMVRYG 127 (463)
Q Consensus 58 ~~~~v~s~i~~~~tlftilTF~i~---~~r~~yp~r~iifls~c~~~~sl~~li~---~~~g~----~~i~C~~~~~~~~ 127 (463)
.|..+.+++.+++-.=++++-+|. ++.+.-.+-.|++|+++.+++++--... ...-. ..+-
T Consensus 37 ~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~--------- 107 (423)
T KOG4219|consen 37 LWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFY--------- 107 (423)
T ss_pred HHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccce---------
Confidence 333333433333333333444443 2223445678899999999888743321 11100 1223
Q ss_pred ccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccc
Q psy14918 128 ASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAV 200 (463)
Q Consensus 128 ~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v 200 (463)
|.++-++---...++.+=++.+++|=|+++..--....-.+.++.-.+++|.+.+++..-.+...++
T Consensus 108 ------C~f~nf~~itav~vSVfTlvAiA~DRy~AIi~Pl~~r~s~r~sk~iIllIW~lA~l~a~P~~l~s~v 174 (423)
T KOG4219|consen 108 ------CRFVNFFPITAVFVSVFTLVAIAIDRYMAIIHPLQPRPSRRSSKIIILLIWALALLLALPQLLYSSV 174 (423)
T ss_pred ------eeeccccchhhhhHhHHHHHHHHHHHHHHHhhhcccCCCCcceeehhHHHHHHHHHHhccceeeeee
Confidence 4444444444455666667888999998873211111223445567789999999877766666665
No 23
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=92.48 E-value=14 Score=37.76 Aligned_cols=39 Identities=10% Similarity=0.218 Sum_probs=28.8
Q ss_pred CchHHHHHHHHHHHHhhcccceeeeechhhhhHHHHHHhhhh
Q psy14918 320 KPLHVVLLLKYFMALAVGLTSGVWIWSGKTFSSWKKLWNRLF 361 (463)
Q Consensus 320 ~P~~~~~~lk~fm~li~GI~~~~w~~s~KT~~~W~~~~~r~~ 361 (463)
.|+..+...++.-.+.-|..+.++..=+||+ |+-+.+..
T Consensus 267 ~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtI---R~~~~k~~ 305 (313)
T PF10321_consen 267 PPPWLIIIGQISWQLSHGCPPIIYLTLNRTI---RNSVLKML 305 (313)
T ss_pred ccHHHHHHHHHHHhccCCccceEEEEECHHH---HHHHHHHH
Confidence 4666677778888999999999888777777 44444433
No 24
>KOG4043|consensus
Probab=90.47 E-value=0.18 Score=45.92 Aligned_cols=7 Identities=0% Similarity=-0.164 Sum_probs=3.4
Q ss_pred CCCCCCC
Q psy14918 394 PPPPNIP 400 (463)
Q Consensus 394 ~~p~~~~ 400 (463)
++|.|.+
T Consensus 121 Lh~GPl~ 127 (214)
T KOG4043|consen 121 LHAGPLD 127 (214)
T ss_pred cCCCCch
Confidence 4455554
No 25
>PF06454 DUF1084: Protein of unknown function (DUF1084); InterPro: IPR009457 This entry consists of several hypothetical plant specific proteins of unknown function.
Probab=85.40 E-value=43 Score=33.65 Aligned_cols=62 Identities=26% Similarity=0.324 Sum_probs=38.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHHHHHHHHHHHHHHhhhhHHHH
Q psy14918 223 VVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLSTLPASIVI 289 (463)
Q Consensus 223 ~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~l~~rigif~~ly~~~~~~~~ 289 (463)
...-.++.+++|+.|++.|..-..++|+.-.... .+.+|+.+......+.++.+++=..+++
T Consensus 166 ~~~~A~isli~a~~Fl~YG~~L~~~Lr~~p~~s~-----~r~kkl~~V~~vt~ic~~cF~ir~i~~~ 227 (281)
T PF06454_consen 166 AIFIAVISLIAALGFLYYGGKLFFKLRRFPIESK-----GRSKKLRKVGFVTIICSVCFLIRCIMVL 227 (281)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCccccc-----cHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence 3455788888999999999887777776422211 3345666655555555555554444433
No 26
>KOG2573|consensus
Probab=83.47 E-value=0.45 Score=49.40 Aligned_cols=6 Identities=33% Similarity=0.579 Sum_probs=2.8
Q ss_pred ccccCC
Q psy14918 198 SAVDGD 203 (463)
Q Consensus 198 ~~v~~~ 203 (463)
.+.|-|
T Consensus 177 DqlDKD 182 (498)
T KOG2573|consen 177 DQLDKD 182 (498)
T ss_pred HHHhhh
Confidence 444444
No 27
>KOG2138|consensus
Probab=82.84 E-value=0.43 Score=52.52 Aligned_cols=10 Identities=40% Similarity=0.979 Sum_probs=4.2
Q ss_pred CCCCCCCCCC
Q psy14918 19 NPIASNSPYN 28 (463)
Q Consensus 19 ~~~~~~~~~~ 28 (463)
+|+.++.+|+
T Consensus 312 k~~~s~k~ya 321 (883)
T KOG2138|consen 312 KPLVSQKPYA 321 (883)
T ss_pred CccccccCCC
Confidence 3444444443
No 28
>PF07771 TSGP1: Tick salivary peptide group 1; InterPro: IPR011694 This entry contains a group of peptides derived from a salivary gland cDNA library of the tick Ixodes scapularis (Black-legged tick) []. Also present are peptides from a related tick species, Ixodes ricinus (Sheep tick). They are characterised by a putative signal peptide, indicative of secretion, and conserved cysteine residues.
Probab=81.60 E-value=0.35 Score=42.47 Aligned_cols=16 Identities=19% Similarity=0.628 Sum_probs=8.5
Q ss_pred ceeeeechhhhhHHHHH
Q psy14918 340 SGVWIWSGKTFSSWKKL 356 (463)
Q Consensus 340 ~~~w~~s~KT~~~W~~~ 356 (463)
+.+|-|+..+ ..|...
T Consensus 39 C~YYC~~~~~-~~W~~g 54 (120)
T PF07771_consen 39 CNYYCWNDDT-NGWDYG 54 (120)
T ss_pred cEEEcCCCCC-CceEEE
Confidence 4566665443 456553
No 29
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=80.67 E-value=72 Score=32.70 Aligned_cols=101 Identities=15% Similarity=0.270 Sum_probs=51.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHH-HHHHHHH-------HHhcCCcceeccCCCcee
Q psy14918 55 FASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFM-VSIGYLI-------RVSVGHEEVACEDQMVRY 126 (463)
Q Consensus 55 ~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~-~sl~~li-------~~~~g~~~i~C~~~~~~~ 126 (463)
.......+++.++++.+.+++-... .++-+....|.+++.++-+.. -.+.|.+ +.+. ..+=.|.
T Consensus 22 ~~~~~~~~~~~~t~~~s~~Ai~~l~-~ksif~~sTr~LL~~~l~~a~~Hq~~y~~~~~~~lyrsf~-~~~dpC~------ 93 (328)
T PF02117_consen 22 INQIFILIIIIITFIFSVFAIKKLW-KKSIFSNSTRILLISNLLFANIHQISYAIIQIWSLYRSFF-YSDDPCS------ 93 (328)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-hCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCee------
Confidence 3334444556666665555543332 234466678999888865532 3333332 2222 1122442
Q ss_pred eccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy14918 127 GASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAA 163 (463)
Q Consensus 127 ~~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~ 163 (463)
+....++|.....+++++....+.==..|++|=.++.
T Consensus 94 i~f~~~eC~~~~~~~~~~~~~~~~~q~aL~idRl~at 130 (328)
T PF02117_consen 94 ILFTSSECFPYYYFYYFTNSGMIFIQFALTIDRLLAT 130 (328)
T ss_pred eEeecCceeeeehHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2233455988877777665444333344566655543
No 30
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=78.95 E-value=78 Score=32.14 Aligned_cols=169 Identities=14% Similarity=0.169 Sum_probs=77.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHH-HHHHHHHHHHHHHHHHhcC-----CcceeccCCCceee
Q psy14918 54 EFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVF-LSGCYFMVSIGYLIRVSVG-----HEEVACEDQMVRYG 127 (463)
Q Consensus 54 ~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iif-ls~c~~~~sl~~li~~~~g-----~~~i~C~~~~~~~~ 127 (463)
+.+.....+.|++++...++.+..+. ....+-...|+++. .+++.++.+++-++.-... ...-.|+--
T Consensus 17 ~~~~~~~~~~s~~~~~~~~~~~~~~~-~~~~~H~N~ril~~~~~~~~l~~~~~r~~~h~~~l~~~~~~~~~Cd~~----- 90 (324)
T PF10292_consen 17 RLSLIFNLLLSIIAFPVIIYALWKIR-NSKLFHFNTRILFIVHCFSFLIHCTGRIILHTYDLYNYFFPDDPCDMI----- 90 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh-hcchhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCcccc-----
Confidence 34444444555555544444332222 12233333455544 3445555555443321100 113345321
Q ss_pred ccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHH----HHHHHHHHhhHHHHHHhhhhccccC-
Q psy14918 128 ASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSL----YFHLAAWLIPTVKSVSVIIMSAVDG- 202 (463)
Q Consensus 128 ~~~~~~C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~----yfh~~aW~iP~v~tii~l~~~~v~~- 202 (463)
.+...|.+.-....++.+....=.+.+++.=+++......- |+..+ .+-++.|.++++.+.. ...+.++
T Consensus 91 -~~~~~C~~lR~~~~~~~~~~~~t~v~l~IER~iAT~~~~~Y---E~~~~~~Gi~l~~~qi~is~~~~~~--~~~~~~~~ 164 (324)
T PF10292_consen 91 -PSTYRCFILRIPYNFGLFLVSFTTVSLVIERTIATFFSKSY---EKSGKWLGILLAFFQILISLLILYF--VYSDEDFP 164 (324)
T ss_pred -cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh---cCCCccHHHHHHHHHHHHHHHHHHH--HHhhhhcc
Confidence 12223988888877777777666666777766555332112 22212 2223334444443332 2344444
Q ss_pred CcccceeEeeeCCccccchhhhhHHHHHHHHHHHH
Q psy14918 203 DTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIF 237 (463)
Q Consensus 203 ~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~~f 237 (463)
++.---|.....+.. ..+..|..+.+++..+-
T Consensus 165 ~~~~~yC~~~~~~~~---~~~~~~~~~~~~~~i~~ 196 (324)
T PF10292_consen 165 NERMYYCQASSSSSP---YRVNIPFIVILVLQILA 196 (324)
T ss_pred CccceeeEEecCCCc---hhhHHHHHHHHHHHHHH
Confidence 444456998754421 22345555555544433
No 31
>KOG4043|consensus
Probab=74.36 E-value=2 Score=39.36 Aligned_cols=6 Identities=0% Similarity=0.307 Sum_probs=2.3
Q ss_pred ccccCC
Q psy14918 379 VLKQHD 384 (463)
Q Consensus 379 ~I~~~~ 384 (463)
+|..++
T Consensus 97 lIEKPP 102 (214)
T KOG4043|consen 97 LIEKPP 102 (214)
T ss_pred HHhcCC
Confidence 344333
No 32
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=72.95 E-value=92 Score=31.03 Aligned_cols=72 Identities=18% Similarity=0.246 Sum_probs=38.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhh-hhh-hHHHHHHHHHHhhHHHHHHhhhhccccCCcccc
Q psy14918 136 LVFLLVYFFGMSSYIWWVILSFTWFLAAGLKW-GNEA-IAS-YSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAG 207 (463)
Q Consensus 136 ~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~-~~e~-i~~-~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg 207 (463)
+.....||+...-..=-++++++=+++...-. .... .++ ......++-|+.|.+..+..+.-.++..+...+
T Consensus 77 ~~~~~~y~~~~~~~~gi~lls~nR~~ai~~P~~~~~~~~~~~~~~~i~~i~wi~p~li~~~~~~~~~~~f~~~~~ 151 (283)
T PF10323_consen 77 IYYNIAYYFLYIQCIGIVLLSLNRYLAICFPTSRHTKFWQPAKIWIIILIQWIPPLLISLPFFFDTDFYFDNEEN 151 (283)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHhhhheEEeecHHHhhhccccchhheeeeeehhhhhheeeeeccCceeeecccc
Confidence 33334444444444334566777666543311 1110 111 122346788999999888877766666555555
No 33
>KOG0526|consensus
Probab=71.67 E-value=1.4 Score=47.33 Aligned_cols=13 Identities=15% Similarity=0.097 Sum_probs=7.6
Q ss_pred CcChHHHHHHHHH
Q psy14918 47 FFSQPEREFASMW 59 (463)
Q Consensus 47 ~ft~~e~~~~~~~ 59 (463)
-|.+++.+.+..+
T Consensus 76 GF~d~d~~~L~~f 88 (615)
T KOG0526|consen 76 GFRDDDLEKLKSF 88 (615)
T ss_pred CcCHHHHHHHHHH
Confidence 4667776655443
No 34
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=70.20 E-value=1.1e+02 Score=29.70 Aligned_cols=63 Identities=17% Similarity=0.391 Sum_probs=35.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhh-hhhHHHHHHHHHHhhHHHHHHhhh
Q psy14918 134 CTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAI-ASYSLYFHLAAWLIPTVKSVSVII 196 (463)
Q Consensus 134 C~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i-~~~~~yfh~~aW~iP~v~tii~l~ 196 (463)
..+...+.||+.-+-.+=-..++++=+.+...--..|.+ .++.....++...+|...+...+.
T Consensus 81 l~~~~~l~~~~~~~Q~~~~~~is~nR~t~v~~p~~~~~~W~~~~~~~i~~i~~~p~~~~~~~~~ 144 (275)
T PF02118_consen 81 LNIFYFLQYYFAYVQYLSTILISLNRFTSVLFPIRYEKFWKRYYWIIIIIIFLLPFSFTWNIFI 144 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhHHHHhhhhhheeeeeehhHHHHHHHHc
Confidence 777777888877777666667777744443221112222 222222445566677766655554
No 35
>PF11970 Git3_C: G protein-coupled glucose receptor regulating Gpa2 C-term; InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This family is the conserved C-terminal domain of the member proteins.
Probab=66.72 E-value=24 Score=28.41 Aligned_cols=68 Identities=9% Similarity=0.085 Sum_probs=50.7
Q ss_pred hhhHHHHHHHHHHHHHHHhhhhHHHHhhHhhhhccccccccccccCCCCCCCcCCCCchHHHHHHHHHHHHhhccc-cee
Q psy14918 264 ADKLEKLMIRIGIFSVLSTLPASIVIGCLIYESYVYNEYVESIVCPCSEKLDKDYFKPLHVVLLLKYFMALAVGLT-SGV 342 (463)
Q Consensus 264 ~~kl~~l~~rigif~~ly~~~~~~~~~~~fYe~~~~~~w~~s~~C~~~~~~~~~~~~P~~~~~~lk~fm~li~GI~-~~~ 342 (463)
.+++++.+-.+.++-+.|++.+++=++.+.+.+.+.. ...|.++++.+-.++.-+-|.. +.+
T Consensus 4 r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~-----------------~~~p~~~l~~i~~~~~~~~G~VD~lv 66 (76)
T PF11970_consen 4 RKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEI-----------------GHGPSFWLFCIAGFMQPSQGFVDCLV 66 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------------CCCCchHHHHHHHHHHHccCHHHhhh
Confidence 4567777777888888888887766666665554321 3479999999999999888888 667
Q ss_pred eeechh
Q psy14918 343 WIWSGK 348 (463)
Q Consensus 343 w~~s~K 348 (463)
+.+++|
T Consensus 67 f~~~er 72 (76)
T PF11970_consen 67 FTLRER 72 (76)
T ss_pred eeeecc
Confidence 777765
No 36
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=65.17 E-value=3.3 Score=40.37 Aligned_cols=6 Identities=50% Similarity=1.769 Sum_probs=4.5
Q ss_pred ceeeee
Q psy14918 340 SGVWIW 345 (463)
Q Consensus 340 ~~~w~~ 345 (463)
.++|.+
T Consensus 181 ~~fWlf 186 (232)
T TIGR00869 181 PGIWIF 186 (232)
T ss_pred CCeeee
Confidence 678876
No 37
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=63.35 E-value=5.2 Score=36.68 Aligned_cols=12 Identities=17% Similarity=0.013 Sum_probs=9.1
Q ss_pred hHHHHHHHHHHH
Q psy14918 322 LHVVLLLKYFMA 333 (463)
Q Consensus 322 ~~~~~~lk~fm~ 333 (463)
+|++|+.+.+..
T Consensus 6 ~yglFlaK~vTv 17 (155)
T PF08496_consen 6 EYGLFLAKIVTV 17 (155)
T ss_pred HHHHHHHHHHHH
Confidence 577888887766
No 38
>PF03839 Sec62: Translocation protein Sec62; InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=61.94 E-value=2.8 Score=40.75 Aligned_cols=6 Identities=33% Similarity=1.503 Sum_probs=4.0
Q ss_pred ceeeee
Q psy14918 340 SGVWIW 345 (463)
Q Consensus 340 ~~~w~~ 345 (463)
.++|++
T Consensus 175 ~~fWlf 180 (224)
T PF03839_consen 175 HGFWLF 180 (224)
T ss_pred CCEEeC
Confidence 557875
No 39
>KOG2270|consensus
Probab=60.78 E-value=7.2 Score=41.11 Aligned_cols=15 Identities=20% Similarity=0.273 Sum_probs=9.7
Q ss_pred cccCCCCCCccCCCC
Q psy14918 380 LKQHDYLPPLAHGMP 394 (463)
Q Consensus 380 I~~~~~~~~~a~~~~ 394 (463)
+.+++.++++..+..
T Consensus 416 ~~~~~~~~~~~t~l~ 430 (520)
T KOG2270|consen 416 LVARNLLYTTITGLD 430 (520)
T ss_pred hhhccchhhhhhccc
Confidence 456777777766554
No 40
>PF01284 MARVEL: Membrane-associating domain; InterPro: IPR021128 This entry represents the ~130-residue MARVEL (MAL and related proteins for vesicle trafficking and membrane link) domain. The MARVEL domain is a module with a four transmembrane-helix architecture that has been identified in proteins of the myelin and lymphocyte (MAL), physins, gyrins and occludin families. All described MARVEL domain-containing proteins are consistent with the M-shaped topology: four transmembrane-helix region architecture with cytoplasmic N- and C-terminal regions. Their function could be related to cholesterol-rich membrane apposition events in a variety of cellular processes, such as biogenesis of vesicular transport carriers or tight junction regulation [].
Probab=59.12 E-value=1.2e+02 Score=26.17 Aligned_cols=23 Identities=22% Similarity=0.388 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy14918 58 MWIFVMSALCCISTLMTLTTFLI 80 (463)
Q Consensus 58 ~~~~v~s~i~~~~tlftilTF~i 80 (463)
.+....++++++.+++.++.+..
T Consensus 43 ~~~~~v~~~~~~~~~~~l~~~~~ 65 (144)
T PF01284_consen 43 GFALFVAVLSFLYTLIFLLLYLF 65 (144)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566778888777777777664
No 41
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=57.69 E-value=2.1e+02 Score=28.52 Aligned_cols=56 Identities=11% Similarity=0.139 Sum_probs=32.5
Q ss_pred HHHHHHHHHHHHHHHHhcc-CCC-CCCC-chHHHHHHHHHHHHHHHHHHHHhcC----CcceeccC
Q psy14918 63 MSALCCISTLMTLTTFLID-TER-FKYP-ERPIVFLSGCYFMVSIGYLIRVSVG----HEEVACED 121 (463)
Q Consensus 63 ~s~i~~~~tlftilTF~i~-~~r-~~yp-~r~iifls~c~~~~sl~~li~~~~g----~~~i~C~~ 121 (463)
++.+++++++|- +.+. ++. +.-| ...++-+|+|.++..+..+...... .+.-.|..
T Consensus 6 ~~~~g~~~N~~h---~~VLtrk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~~~~~~~~C~~ 68 (318)
T PF10324_consen 6 LSIFGLFINIFH---LIVLTRKSMRSSSINILLIGIAICDLLYMLSILIWELFFFIIFYEDDECIP 68 (318)
T ss_pred EeHHHHHHHHHH---hhhcCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCC
Confidence 345555555544 4443 432 2223 5777889999999888777665421 23566744
No 42
>PRK11281 hypothetical protein; Provisional
Probab=55.86 E-value=4.5e+02 Score=31.85 Aligned_cols=28 Identities=7% Similarity=0.064 Sum_probs=17.8
Q ss_pred ccccchhhhhhhHHHHHHHHHH-hhHHHH
Q psy14918 164 GLKWGNEAIASYSLYFHLAAWL-IPTVKS 191 (463)
Q Consensus 164 ~~k~~~e~i~~~~~yfh~~aW~-iP~v~t 191 (463)
..+|..+..+...+.+...+|. +|++++
T Consensus 608 HF~w~~~~~~~~~~~~~~~~~~~~pl~~~ 636 (1113)
T PRK11281 608 HFGMPKEQVSHFRRQIVRLSLALLPLLFW 636 (1113)
T ss_pred hcCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5678887777766655555555 466553
No 43
>KOG1059|consensus
Probab=55.59 E-value=4.5 Score=45.25 Aligned_cols=11 Identities=36% Similarity=1.014 Sum_probs=7.1
Q ss_pred HHHHHHHHHHh
Q psy14918 153 VILSFTWFLAA 163 (463)
Q Consensus 153 ~il~~~w~ls~ 163 (463)
-|--|.||++.
T Consensus 407 ~ItdFEWYlsV 417 (877)
T KOG1059|consen 407 YITDFEWYLSV 417 (877)
T ss_pred hhhhHHHHHHH
Confidence 34456799875
No 44
>PRK11192 ATP-dependent RNA helicase SrmB; Provisional
Probab=55.41 E-value=3.1 Score=43.94 Aligned_cols=11 Identities=9% Similarity=0.151 Sum_probs=4.4
Q ss_pred hhhHHHHHHhh
Q psy14918 349 TFSSWKKLWNR 359 (463)
Q Consensus 349 T~~~W~~~~~r 359 (463)
+...+-..+-|
T Consensus 325 s~~~yiqr~GR 335 (434)
T PRK11192 325 SADTYLHRIGR 335 (434)
T ss_pred CHHHHhhcccc
Confidence 33344444433
No 45
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=54.88 E-value=2.3 Score=45.06 Aligned_cols=12 Identities=33% Similarity=0.296 Sum_probs=8.5
Q ss_pred ccccccccCCCC
Q psy14918 375 NKHRVLKQHDYL 386 (463)
Q Consensus 375 ~~~~~I~~~~~~ 386 (463)
.+|.+|=+|+++
T Consensus 316 pk~G~Iy~~~~V 327 (414)
T PRK14552 316 PKHGVIFQYPAI 327 (414)
T ss_pred CCceEEEcCHHH
Confidence 467777777775
No 46
>PF10361 DUF2434: Protein of unknown function (DUF2434); InterPro: IPR018830 This entry represents a family of proteins conserved in fungi. Their function is not known.
Probab=53.62 E-value=98 Score=31.27 Aligned_cols=79 Identities=22% Similarity=0.386 Sum_probs=44.2
Q ss_pred ccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhc-----C--CcceeccCCCceeeccCCCC--hhHHHHHHHHHHHHHHH
Q psy14918 80 IDTERFKYPERPIVFLSGCYFMVSIGYLIRVSV-----G--HEEVACEDQMVRYGASGPTP--CTLVFLLVYFFGMSSYI 150 (463)
Q Consensus 80 i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~-----g--~~~i~C~~~~~~~~~~~~~~--C~~~~~l~yyf~las~~ 150 (463)
||-+|.--|+.+++..++-+.++..+-+.-.+- | .|+-.++.|...+..++... =...=+++|.|..
T Consensus 109 IDVDR~yl~~~piil~sfF~~l~~~~~lA~vWE~VRhWGSw~ERQ~id~dpf~~~~d~~R~k~Ef~lPl~FYlf~~---- 184 (296)
T PF10361_consen 109 IDVDRYYLQGLPIILQSFFWYLMQPGTLAAVWEAVRHWGSWQERQFIDPDPFALRQDDRRSKVEFYLPLVFYLFAW---- 184 (296)
T ss_pred eeecHHhcccccHHHHHHHHHHHHHHHHHHHHHHHhhhhchhhhccccCCccccCcccceeeeeehHHHHHHHHHH----
Confidence 566776678888888777776666555544331 1 45556655532222111111 2455566776655
Q ss_pred HHHHHHHHHHHHhccccc
Q psy14918 151 WWVILSFTWFLAAGLKWG 168 (463)
Q Consensus 151 Ww~il~~~w~ls~~~k~~ 168 (463)
+++||.+-+.|+
T Consensus 185 ------lnFFl~vpRsW~ 196 (296)
T PF10361_consen 185 ------LNFFLVVPRSWT 196 (296)
T ss_pred ------HHHHeeeccCcc
Confidence 345677777776
No 47
>KOG2573|consensus
Probab=52.94 E-value=7.3 Score=40.76 Aligned_cols=8 Identities=25% Similarity=0.895 Sum_probs=4.1
Q ss_pred HHHHhhHH
Q psy14918 182 AAWLIPTV 189 (463)
Q Consensus 182 ~aW~iP~v 189 (463)
..|-+|=+
T Consensus 194 YswhFPEL 201 (498)
T KOG2573|consen 194 YSWHFPEL 201 (498)
T ss_pred HhhccHHH
Confidence 35555544
No 48
>KOG2927|consensus
Probab=52.37 E-value=47 Score=34.41 Aligned_cols=22 Identities=23% Similarity=0.487 Sum_probs=15.4
Q ss_pred HHHHHHHHHhhcccceeeeech
Q psy14918 326 LLLKYFMALAVGLTSGVWIWSG 347 (463)
Q Consensus 326 ~~lk~fm~li~GI~~~~w~~s~ 347 (463)
++++.++.+++|=-.++|++-+
T Consensus 240 lILF~I~~il~~g~~g~W~FPN 261 (372)
T KOG2927|consen 240 LILFGITWILTGGKHGFWLFPN 261 (372)
T ss_pred HHHHHHHHHHhCCCCceEeccc
Confidence 3556667777777788998744
No 49
>PF06658 DUF1168: Protein of unknown function (DUF1168); InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=50.97 E-value=13 Score=33.72 Aligned_cols=7 Identities=43% Similarity=0.539 Sum_probs=3.7
Q ss_pred cCCCCCc
Q psy14918 405 PVMGGGA 411 (463)
Q Consensus 405 p~~~~~~ 411 (463)
+++|+++
T Consensus 25 NV~GSSA 31 (142)
T PF06658_consen 25 NVQGSSA 31 (142)
T ss_pred ccccccc
Confidence 5555543
No 50
>PF11188 DUF2975: Protein of unknown function (DUF2975); InterPro: IPR021354 This family of proteins have no known function. Some members are annotated as membrane proteins however this cannot be confirmed.
Probab=50.33 E-value=1.2e+02 Score=26.11 Aligned_cols=54 Identities=15% Similarity=0.284 Sum_probs=32.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHHHHHHHHHHHHHHh
Q psy14918 224 VFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLST 282 (463)
Q Consensus 224 ~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~l~~rigif~~ly~ 282 (463)
..|..++.++..+.+..+++.+.++-+...+.+ .-+.+.-+.+.++|...+.+.
T Consensus 22 ~~~~~~~~~~~~~~~~~~l~~~~~ll~~i~~~~-----~Fs~~n~~~l~~ig~~~l~~~ 75 (136)
T PF11188_consen 22 YWPFLLIGLLVLLLILIALYQLRRLLRNIQKGK-----PFSPENIRRLRRIGWLLLIIS 75 (136)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----cchHHHHHHHHHHHHHHHHHH
Confidence 466777777777777777777777666655433 233344455556665555443
No 51
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=50.30 E-value=3.9 Score=43.44 Aligned_cols=12 Identities=8% Similarity=0.235 Sum_probs=6.0
Q ss_pred HHHHHhhHHHHH
Q psy14918 181 LAAWLIPTVKSV 192 (463)
Q Consensus 181 ~~aW~iP~v~ti 192 (463)
...|-+|=+-.+
T Consensus 165 wY~~~FPEL~~i 176 (414)
T PRK14552 165 WYSLHFPELDEL 176 (414)
T ss_pred HHHhhCcCHHhh
Confidence 445666644333
No 52
>PF15002 ERK-JNK_inhib: ERK and JNK pathways, inhibitor
Probab=50.04 E-value=13 Score=35.73 Aligned_cols=10 Identities=10% Similarity=0.428 Sum_probs=4.5
Q ss_pred hhhhHHHHHH
Q psy14918 348 KTFSSWKKLW 357 (463)
Q Consensus 348 KT~~~W~~~~ 357 (463)
.++-.|.=-+
T Consensus 124 ~~ll~Wa~~f 133 (207)
T PF15002_consen 124 NDLLRWALNF 133 (207)
T ss_pred HHHHHHHHHH
Confidence 3444454433
No 53
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=49.68 E-value=5.5e+02 Score=31.09 Aligned_cols=45 Identities=22% Similarity=0.465 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHH-----HhccccchhhhhhhHHHHHHHHHH-hhHHHH
Q psy14918 147 SSYIWWVILSFTWFL-----AAGLKWGNEAIASYSLYFHLAAWL-IPTVKS 191 (463)
Q Consensus 147 as~~Ww~il~~~w~l-----s~~~k~~~e~i~~~~~yfh~~aW~-iP~v~t 191 (463)
++..|++.+++.+.. ....+|..+..++..+++....|. +|++++
T Consensus 565 ~~~~w~~~~~~~~~~~~Gl~~~HF~w~~~~v~~~~~~~~~~~~~~~pl~~~ 615 (1109)
T PRK10929 565 VPLLWVFMICATFARPNGLFIAHFGWPRERVARAMRYYLLSIGLIVPLIMA 615 (1109)
T ss_pred HHHHHHHHHHHHHcCCCCeeHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666664433321 235678888877777766666666 466554
No 54
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=49.06 E-value=2.8e+02 Score=27.51 Aligned_cols=95 Identities=18% Similarity=0.289 Sum_probs=48.0
Q ss_pred HHHHHHHHHHHHHHHHHhcc---CC-CCCCCchHHHHHHHHHHHHHHHHHHHHhcCCcceeccCCCceeeccCC-----C
Q psy14918 62 VMSALCCISTLMTLTTFLID---TE-RFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASGP-----T 132 (463)
Q Consensus 62 v~s~i~~~~tlftilTF~i~---~~-r~~yp~r~iifls~c~~~~sl~~li~~~~g~~~i~C~~~~~~~~~~~~-----~ 132 (463)
+++.++++..++.+ |++. |+ ...| ...++.-+++.++.+++-++... .-+..+.. .-++..|. .
T Consensus 7 ~~~~~~~~~n~~Ll--~~i~~~tp~~l~~~-~~~l~~~~~~~~~~~~~~~~~q~---R~i~~~~~-~~~~~~GpC~~~~~ 79 (292)
T PF10317_consen 7 IFFILGIILNILLL--YLIIFKTPKSLRTY-SILLLNTAIFDLISIISAFLTQP---RIIPNGTS-IAVISYGPCRYFGP 79 (292)
T ss_pred HHHHHHHHHHHHHH--HHHHHhChHHHHHH-HHHHHHHHHHHHHHHHHHHHhhh---hccCCCCc-EEEEeeecceecCc
Confidence 45556665555544 3332 22 2334 36777788888887765543211 01111111 11222221 2
Q ss_pred C-hhHHHHHHHHHHHHHHHHHHHHHHH-HHHHhc
Q psy14918 133 P-CTLVFLLVYFFGMSSYIWWVILSFT-WFLAAG 164 (463)
Q Consensus 133 ~-C~~~~~l~yyf~las~~Ww~il~~~-w~ls~~ 164 (463)
. |-....+...+.+.+ .|+.++++. -|...+
T Consensus 80 ~~C~~~y~~~~~~~~~s-~~~i~~sf~yRy~iL~ 112 (292)
T PF10317_consen 80 WFCFFCYHIMLHFSIHS-GWSIILSFYYRYYILR 112 (292)
T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhc
Confidence 2 888888877766655 455556655 455543
No 55
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=46.80 E-value=9.3 Score=33.23 Aligned_cols=61 Identities=13% Similarity=0.332 Sum_probs=32.7
Q ss_pred CCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHH
Q psy14918 45 GIFFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLI 108 (463)
Q Consensus 45 ~~~ft~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li 108 (463)
.|+|.|+..+.- ....+.+.+..+..+++++.=.+.++. .|.-..+|+++|..+++++-++
T Consensus 17 rPLFGE~~~r~r-iinliiG~vT~l~VLvtii~afvf~~~--~p~p~~iffavcI~l~~~s~~l 77 (118)
T PF10856_consen 17 RPLFGETSARDR-IINLIIGAVTSLFVLVTIISAFVFPQD--PPKPLHIFFAVCILLICISAIL 77 (118)
T ss_pred CcccCCCCcccE-EEEeehHHHHHHHHHHHHhheEEecCC--CCCceEEehHHHHHHHHHHHHh
Confidence 578865543221 222234444444455554333333332 2444568899999988886554
No 56
>KOG4055|consensus
Probab=45.33 E-value=7.6 Score=36.33 Aligned_cols=6 Identities=50% Similarity=0.789 Sum_probs=3.4
Q ss_pred cCCCCC
Q psy14918 405 PVMGGG 410 (463)
Q Consensus 405 p~~~~~ 410 (463)
+++|++
T Consensus 69 nv~GSS 74 (213)
T KOG4055|consen 69 NVMGSS 74 (213)
T ss_pred hccccc
Confidence 556654
No 57
>smart00786 SHR3_chaperone ER membrane protein SH3. This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) PUBMED:15623581. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=43.61 E-value=4.3 Score=38.51 Aligned_cols=7 Identities=29% Similarity=0.335 Sum_probs=2.7
Q ss_pred hHHHHHH
Q psy14918 322 LHVVLLL 328 (463)
Q Consensus 322 ~~~~~~l 328 (463)
++.+|+.
T Consensus 94 SL~Ly~~ 100 (196)
T smart00786 94 SLVLYVL 100 (196)
T ss_pred chHHHHH
Confidence 3334433
No 58
>PF15333 TAF1D: TATA box-binding protein-associated factor 1D
Probab=42.96 E-value=29 Score=33.14 Aligned_cols=6 Identities=33% Similarity=0.728 Sum_probs=2.3
Q ss_pred cccCCC
Q psy14918 417 SLKQNP 422 (463)
Q Consensus 417 s~~q~~ 422 (463)
+.+..+
T Consensus 43 S~ep~P 48 (217)
T PF15333_consen 43 SFEPRP 48 (217)
T ss_pred ccCCCC
Confidence 344333
No 59
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=42.36 E-value=1e+02 Score=28.04 Aligned_cols=51 Identities=12% Similarity=0.219 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHH
Q psy14918 55 FASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIG 105 (463)
Q Consensus 55 ~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~ 105 (463)
.-..+..++.++-++..+|.++-|+=++...+|.|..|+.+-+..+++.+-
T Consensus 14 ~~~~~~~~~i~~ll~~l~~~~~~Y~r~r~~tKyRDL~II~~L~ll~l~giq 64 (149)
T PF11694_consen 14 QNDYLRYILIIILLLVLIFFFIKYLRNRLDTKYRDLSIIALLLLLLLIGIQ 64 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhHHHHHHHHHHHHHHHH
Confidence 334444455555566666666666656677889999998777666655443
No 60
>KOG2270|consensus
Probab=40.98 E-value=15 Score=38.91 Aligned_cols=10 Identities=30% Similarity=0.199 Sum_probs=5.0
Q ss_pred HHHHHhhHHH
Q psy14918 181 LAAWLIPTVK 190 (463)
Q Consensus 181 ~~aW~iP~v~ 190 (463)
+-+=|||.--
T Consensus 223 l~~aGIP~Pe 232 (520)
T KOG2270|consen 223 LNNAGIPCPE 232 (520)
T ss_pred HHhcCCCCCC
Confidence 3444577543
No 61
>KOG4055|consensus
Probab=36.45 E-value=27 Score=32.78 Aligned_cols=10 Identities=10% Similarity=-0.047 Sum_probs=4.0
Q ss_pred ccccccccCC
Q psy14918 412 HSMTSSLKQN 421 (463)
Q Consensus 412 ~s~~~s~~q~ 421 (463)
+.+...++|+
T Consensus 85 YR~lRRrEq~ 94 (213)
T KOG4055|consen 85 YRHLRRREQD 94 (213)
T ss_pred HHHHHHHHHH
Confidence 3333344443
No 62
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=32.86 E-value=2e+02 Score=30.69 Aligned_cols=27 Identities=26% Similarity=0.452 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy14918 54 EFASMWIFVMSALCCISTLMTLTTFLI 80 (463)
Q Consensus 54 ~~~~~~~~v~s~i~~~~tlftilTF~i 80 (463)
-..-.+++.+++++++.+++.++.|++
T Consensus 36 ~~Sv~~~a~iaavwlil~ll~ll~~~~ 62 (418)
T cd07912 36 QQSLLILASIPAACLILSLLFLLVYLI 62 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455556777777777777775554
No 63
>PF10192 GpcrRhopsn4: Rhodopsin-like GPCR transmembrane domain; InterPro: IPR019336 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). This region of 270 amino acids is the seven transmembrane alpha-helical domains included within five GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor (GPCR) protein, conserved from nematodes to humans [].
Probab=32.70 E-value=4.8e+02 Score=25.49 Aligned_cols=17 Identities=18% Similarity=0.413 Sum_probs=11.2
Q ss_pred CCCcChHHHHHHHHHHH
Q psy14918 45 GIFFSQPEREFASMWIF 61 (463)
Q Consensus 45 ~~~ft~~e~~~~~~~~~ 61 (463)
+..|+-||.-..+....
T Consensus 18 ~~hfS~de~gi~~~~~~ 34 (257)
T PF10192_consen 18 TSHFSADEQGILEIYLL 34 (257)
T ss_pred ccccChhhcCcHHHHHH
Confidence 35788888766555543
No 64
>PHA03029 hypothetical protein; Provisional
Probab=30.79 E-value=2.8e+02 Score=22.34 Aligned_cols=16 Identities=13% Similarity=0.332 Sum_probs=10.7
Q ss_pred HHhhhhHHHHhhHhhh
Q psy14918 280 LSTLPASIVIGCLIYE 295 (463)
Q Consensus 280 ly~~~~~~~~~~~fYe 295 (463)
+.++|++.+.+++|+.
T Consensus 60 fwllp~al~a~fyffs 75 (92)
T PHA03029 60 FWLLPFALAAAFYFFS 75 (92)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4567777777777664
No 65
>KOG0345|consensus
Probab=30.73 E-value=29 Score=37.42 Aligned_cols=13 Identities=23% Similarity=0.340 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHh
Q psy14918 67 CCISTLMTLTTFL 79 (463)
Q Consensus 67 ~~~~tlftilTF~ 79 (463)
+|-.++=+|+.++
T Consensus 173 gFe~~~n~ILs~L 185 (567)
T KOG0345|consen 173 GFEASVNTILSFL 185 (567)
T ss_pred cHHHHHHHHHHhc
Confidence 3444444444444
No 66
>KOG2357|consensus
Probab=30.14 E-value=36 Score=35.95 Aligned_cols=18 Identities=6% Similarity=0.159 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHhc
Q psy14918 63 MSALCCISTLMTLTTFLI 80 (463)
Q Consensus 63 ~s~i~~~~tlftilTF~i 80 (463)
+-=+++++.||...++.+
T Consensus 108 ~~E~~~v~~Ll~y~~nY~ 125 (440)
T KOG2357|consen 108 TVEIVMVAILLLYAANYF 125 (440)
T ss_pred HhHHHHHHHHHHHHHHHH
Confidence 334555555666555554
No 67
>PRK10263 DNA translocase FtsK; Provisional
Probab=29.83 E-value=9.5e+02 Score=29.72 Aligned_cols=22 Identities=32% Similarity=0.422 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy14918 136 LVFLLVYFFGMSSYIWWVILSF 157 (463)
Q Consensus 136 ~~~~l~yyf~las~~Ww~il~~ 157 (463)
+..++++.|+++++++-+++.+
T Consensus 71 LAD~L~~LFGl~AYLLP~LL~~ 92 (1355)
T PRK10263 71 LADTLFFIFGVMAYTIPVIIVG 92 (1355)
T ss_pred HHHHHHHHHhHHHHHHHHHHHH
Confidence 3445666777777666554433
No 68
>PF12794 MscS_TM: Mechanosensitive ion channel inner membrane domain 1
Probab=28.97 E-value=6.5e+02 Score=25.85 Aligned_cols=51 Identities=16% Similarity=0.298 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH------hccccchhhhhhhHHHHHHHHHHhh
Q psy14918 137 VFLLVYFFGMSSYIWWVILSFTWFLA------AGLKWGNEAIASYSLYFHLAAWLIP 187 (463)
Q Consensus 137 ~~~l~yyf~las~~Ww~il~~~w~ls------~~~k~~~e~i~~~~~yfh~~aW~iP 187 (463)
+..+-.-+...+..||+...+...+. ...+|..+..+...+.+.-..|.+.
T Consensus 81 ~~~l~~~l~~~a~~~~~~~~~~~l~rp~Gl~~~HF~w~~~~~~~~r~~l~~~~~~~~ 137 (340)
T PF12794_consen 81 SVALGAALLAMALFWLVFEFFRRLLRPNGLAERHFGWPKERVQRLRRQLRWLIWVLV 137 (340)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCeEeccCCCCHHHHHHHHHHHHHHHHHHH
Confidence 33444455556666776655554443 2567888777766666655555543
No 69
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=28.89 E-value=32 Score=37.71 Aligned_cols=33 Identities=12% Similarity=0.102 Sum_probs=19.9
Q ss_pred HHHHHHHHhhcccce-eeeechhhhhHHHHHHhh
Q psy14918 327 LLKYFMALAVGLTSG-VWIWSGKTFSSWKKLWNR 359 (463)
Q Consensus 327 ~lk~fm~li~GI~~~-~w~~s~KT~~~W~~~~~r 359 (463)
+.+.|....-+=.=+ .|.+.+=|+.+|-++++-
T Consensus 331 v~k~f~~~~~DQfDFH~Yc~RK~t~r~Y~~~L~~ 364 (517)
T PF12569_consen 331 VLKHFDDFEEDQFDFHSYCLRKMTLRAYVDMLRW 364 (517)
T ss_pred HHHHHHHHhcccccHHHHHHhhccHHHHHHHHHH
Confidence 444455544444433 377777788888887743
No 70
>KOG1277|consensus
Probab=27.88 E-value=7.7e+02 Score=26.90 Aligned_cols=86 Identities=16% Similarity=0.167 Sum_probs=49.8
Q ss_pred ccccCCcccceeEeeeCCc--cccchhhhhHHHHHHHHHHHHHHHHHHHhhhhhhhhcccCCCCccchhhhHHH-HHHHH
Q psy14918 198 SAVDGDTIAGICYVGNHNI--HNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEK-LMIRI 274 (463)
Q Consensus 198 ~~v~~~~~tg~CwIg~~~~--~~~~~f~~~Pl~i~llig~~fl~~~~~~l~~ir~~~k~~~~~~~~~~~~kl~~-l~~ri 274 (463)
++..+++...-|-+..... .-..| +.-|+.+.++-|..-+..+++++|=|-...= ..|+-- .-..+
T Consensus 421 GkN~~~~~~~PCR~~~~pR~Ip~~kW-y~~~~~~~~~gG~LPFgsIfIEmYfIFtSfW----------~ykiYyvYgfm~ 489 (593)
T KOG1277|consen 421 GKNRSGEFDAPCRTKAIPREIPPKKW-YRSPLVIMLMGGFLPFGSIFIEMYFIFTSFW----------GYKIYYVYGFMF 489 (593)
T ss_pred ccccccCCCCCcccccCCCCCCCccc-cccchHHHHhhccCccchhhhhHHHHHHHHh----------hhhHHHHHHHHH
Confidence 3444455555687654331 11334 3578889999999899999999887654321 122221 11123
Q ss_pred HHHHHHHhhhhHHHHhhHhh
Q psy14918 275 GIFSVLSTLPASIVIGCLIY 294 (463)
Q Consensus 275 gif~~ly~~~~~~~~~~~fY 294 (463)
.+|.+|-++-+.++++|-.+
T Consensus 490 lVf~IL~iVtvcvTIv~TYF 509 (593)
T KOG1277|consen 490 LVFVILLIVTVCVTIVLTYF 509 (593)
T ss_pred HHHHHHHHHHHHHHHHHhHh
Confidence 45666666666666766443
No 71
>PF15002 ERK-JNK_inhib: ERK and JNK pathways, inhibitor
Probab=27.37 E-value=47 Score=31.95 Aligned_cols=6 Identities=33% Similarity=1.071 Sum_probs=3.0
Q ss_pred HHHHHH
Q psy14918 352 SWKKLW 357 (463)
Q Consensus 352 ~W~~~~ 357 (463)
-|...+
T Consensus 122 ~w~~ll 127 (207)
T PF15002_consen 122 DWNDLL 127 (207)
T ss_pred hHHHHH
Confidence 455544
No 72
>PRK11778 putative inner membrane peptidase; Provisional
Probab=26.96 E-value=45 Score=34.41 Aligned_cols=12 Identities=17% Similarity=-0.020 Sum_probs=8.4
Q ss_pred hHHHHHHHHHHH
Q psy14918 322 LHVVLLLKYFMA 333 (463)
Q Consensus 322 ~~~~~~lk~fm~ 333 (463)
+|++|+.+.+..
T Consensus 4 ~y~~f~~k~~t~ 15 (330)
T PRK11778 4 EYGLFLAKTVTV 15 (330)
T ss_pred HHHHHHHHHHHH
Confidence 567787777655
No 73
>KOG3088|consensus
Probab=26.44 E-value=4.9e+02 Score=26.44 Aligned_cols=15 Identities=13% Similarity=0.315 Sum_probs=8.5
Q ss_pred hHHHHHHHHHHHHHH
Q psy14918 225 FPLIIYLILGIIFLF 239 (463)
Q Consensus 225 ~Pl~i~llig~~fl~ 239 (463)
++++|+.++|.++++
T Consensus 239 i~v~i~m~i~a~~Ft 253 (313)
T KOG3088|consen 239 IAVGILMLIGAGLFT 253 (313)
T ss_pred hHHHHHHHHHHHHHH
Confidence 556666666555444
No 74
>KOG4433|consensus
Probab=26.10 E-value=1e+02 Score=33.35 Aligned_cols=33 Identities=24% Similarity=0.304 Sum_probs=28.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Q psy14918 50 QPEREFASMWIFVMSALCCISTLMTLTTFLIDT 82 (463)
Q Consensus 50 ~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~ 82 (463)
++|..++-.+++.++..|++.+|+.++.|++.+
T Consensus 36 ~~~Y~QaL~lla~l~aa~l~l~Ll~ll~yli~~ 68 (526)
T KOG4433|consen 36 DSEYQQALLLLAALAAACLGLSLLFLLFYLICR 68 (526)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566778888899999999999999999888753
No 75
>PF10582 Connexin_CCC: Gap junction channel protein cysteine-rich domain; InterPro: IPR019570 The connexins are a family of integral membrane proteins that oligomerise to form intercellular channels that are clustered at gap junctions. These channels are specialised sites of cell-cell contact that allow the passage of ions, intracellular metabolites and messenger molecules (with molecular weight less than 1-2kDa) from the cytoplasm of one cell to its opposing neighbours. They are found in almost all vertebrate cell types, and somewhat similar proteins have been cloned from plant species. Invertebrates utilise a different family of molecules, innexins, that share a similar predicted secondary structure to the vertebrate connexins, but have no sequence identity to them []. Vertebrate gap junction channels are thought to participate in diverse biological functions. For instance, in the heart they permit the rapid cell-cell transfer of action potentials, ensuring coordinated contraction of the cardiomyocytes. They are also responsible for neurotransmission at specialised 'electrical' synapses. In non-excitable tissues, such as the liver, they may allow metabolic cooperation between cells. In the brain, glial cells are extensively-coupled by gap junctions; this allows waves of intracellular Ca2+ to propagate through nervous tissue, and may contribute to their ability to spatially-buffer local changes in extracellular K+ concentration []. The connexin protein family is encoded by at least 13 genes in rodents, with many homologues cloned from other species. They show overlapping tissue expression patterns, most tissues expressing more than one connexin type. Their conductances, permeability to different molecules, phosphorylation and voltage-dependence of their gating, have been found to vary. Possible communication diversity is increased further by the fact that gap junctions may be formed by the association of different connexin isoforms from apposing cells. However, in vitro studies have shown that not all possible combinations of connexins produce active channels [, ]. Hydropathy analysis predicts that all cloned connexins share a common transmembrane (TM) topology. Each connexin is thought to contain 4 TM domains, with two extracellular and three cytoplasmic regions. This model has been validated for several of the family members by in vitro biochemical analysis. Both N- and C-termini are thought to face the cytoplasm, and the third TM domain has an amphipathic character, suggesting that it contributes to the lining of the formed-channel. Amino acid sequence identity between the isoforms is ~50-80%, with the TM domains being well conserved. Both extracellular loops contain characteristically conserved cysteine residues, which likely form intramolecular disulphide bonds. By contrast, the single putative intracellular loop (between TM domains 2 and 3) and the cytoplasmic C terminus are highly variable among the family members. Six connexins are thought to associate to form a hemi-channel, or connexon. Two connexons then interact (likely via the extracellular loops of their connexins) to form the complete gap junction channel. NH2-*** *** *************-COOH ** ** ** ** ** ** ** ** Cytoplasmic ---**----**-----**----**---------------- ** ** ** ** Membrane ** ** ** ** ---**----**-----**----**---------------- ** ** ** ** Extracellular ** ** ** ** ** ** Two sets of nomenclature have been used to identify the connexins. The first, and most commonly used, classifies the connexin molecules according to molecular weight, such as connexin43 (abbreviated to Cx43), indicating a connexin of molecular weight close to 43kDa. However, studies have revealed cases where clear functional homologues exist across species that have quite different molecular masses; therefore, an alternative nomenclature was proposed based on evolutionary considerations, which divides the family into two major subclasses, alpha and beta, each with a number of members []. Due to their ubiquity and overlapping tissue distributions, it has proved difficult to elucidate the functions of individual connexin isoforms. To circumvent this problem, particular connexin-encoding genes have been subjected to targeted-disruption in mice, and the phenotype of the resulting animals investigated. Around half the connexin isoforms have been investigated in this manner []. Further insight into the functional roles of connexins has come from the discovery that a number of human diseases are caused by mutations in connexin genes. For instance, mutations in Cx32 give rise to a form of inherited peripheral neuropathy called X-linked dominant Charcot-Marie-Tooth disease []. Similarly, mutations in Cx26 are responsible for both autosomal recessive and dominant forms of nonsyndromic deafness, a disorder characterised by hearing loss, with no apparent effects on other organ systems. This entry represents the cysteine rich domain of the connexins.; PDB: 2ZW3_F.
Probab=25.92 E-value=2.1e+02 Score=22.48 Aligned_cols=36 Identities=22% Similarity=0.510 Sum_probs=27.0
Q ss_pred cCCCCCC---Cc-ChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy14918 40 AYPCQGI---FF-SQPEREFASMWIFVMSALCCISTLMTL 75 (463)
Q Consensus 40 ~~pC~~~---~f-t~~e~~~~~~~~~v~s~i~~~~tlfti 75 (463)
..|||+. +- .+.||+..-..+.+.|.+|++.++.=+
T Consensus 25 ~~PCp~~VdCfVSRPtEKtIfl~fM~~~s~vsi~L~l~El 64 (67)
T PF10582_consen 25 RSPCPNTVDCFVSRPTEKTIFLIFMFAVSCVSILLNLAEL 64 (67)
T ss_dssp -TTSSSSEEEE-SSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCCCcceEeCCCCchhhhHHHHHHHHHHHHHHHHHHHH
Confidence 3588853 22 578888899999999999998887644
No 76
>PRK14485 putative bifunctional cbb3-type cytochrome c oxidase subunit I/II; Provisional
Probab=25.75 E-value=1.1e+03 Score=27.23 Aligned_cols=109 Identities=12% Similarity=0.133 Sum_probs=52.3
Q ss_pred CCCCCCchHHHHHHHHHHHHHH-HHHHHHhcCCcceeccCCCceeeccCCCChhHHHHHHHHHHHHHHHHHHH---HHHH
Q psy14918 83 ERFKYPERPIVFLSGCYFMVSI-GYLIRVSVGHEEVACEDQMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVI---LSFT 158 (463)
Q Consensus 83 ~r~~yp~r~iifls~c~~~~sl-~~li~~~~g~~~i~C~~~~~~~~~~~~~~C~~~~~l~yyf~las~~Ww~i---l~~~ 158 (463)
+++.|++++..++.++-.+-.+ +-+++.+.+ -+..-|+-. + +.+...-+-+----..+.++||+. ++..
T Consensus 4 ~~~~y~~~~v~~f~~a~vlw~~vg~l~Gl~~a-~~l~~Pd~~--~----~~~~l~Fgr~R~vHtNavif~~~~~a~~g~~ 76 (712)
T PRK14485 4 EQFYYDNKIVRKFLIATIIWGIVGMLVGLLVA-LQLVFPNLN--F----GISWLTFGRLRPLHTNAVIFAFVGNAIFAGV 76 (712)
T ss_pred ceeeccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhCCccc--C----CCCccccchhHHHHHHHHHHHHHHHHHHHHH
Confidence 4678888888776655544333 555555442 122222110 0 000111112222234455666653 3344
Q ss_pred HHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhcc
Q psy14918 159 WFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSA 199 (463)
Q Consensus 159 w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~ 199 (463)
.|+.-+. -..|-......++++..|.+-++..++.+.++-
T Consensus 77 yy~vprl-~~~~l~s~~La~~~~~~~~~~i~~~~i~l~~G~ 116 (712)
T PRK14485 77 YYSTQRL-LKARMFSDLLSKIHFWGWQLIIVSAAITLPLGF 116 (712)
T ss_pred HHHHHHH-hCCCCcchHHHHHHHHHHHHHHHHHHHHHHccc
Confidence 5554322 112222233456788888888877777666643
No 77
>KOG0112|consensus
Probab=25.57 E-value=26 Score=40.24 Aligned_cols=6 Identities=0% Similarity=-0.003 Sum_probs=2.5
Q ss_pred cccCCC
Q psy14918 417 SLKQNP 422 (463)
Q Consensus 417 s~~q~~ 422 (463)
.+...+
T Consensus 717 ~r~~~~ 722 (975)
T KOG0112|consen 717 ERIRDA 722 (975)
T ss_pred ccCccc
Confidence 344444
No 78
>KOG4812|consensus
Probab=24.38 E-value=2.1e+02 Score=28.26 Aligned_cols=26 Identities=31% Similarity=0.846 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhc
Q psy14918 228 IIYLILGIIFLFGGFISLFRIRNVIK 253 (463)
Q Consensus 228 ~i~llig~~fl~~~~~~l~~ir~~~k 253 (463)
.|++++|...++.+++.-.|||+.-+
T Consensus 227 wi~~vlG~ll~lr~~i~YikVrrm~~ 252 (262)
T KOG4812|consen 227 WIFLVLGLLLFLRGFINYIKVRRMEE 252 (262)
T ss_pred HHHHHHHHHHHHHHHHhHHHHhhHHH
Confidence 46778899999999999999988643
No 79
>KOG3598|consensus
Probab=23.30 E-value=42 Score=40.75 Aligned_cols=10 Identities=60% Similarity=1.580 Sum_probs=4.1
Q ss_pred HHHHHHHHHH
Q psy14918 59 WIFVMSALCC 68 (463)
Q Consensus 59 ~~~v~s~i~~ 68 (463)
|+|++-.+||
T Consensus 1118 WlGvlKaLCc 1127 (2220)
T KOG3598|consen 1118 WLGVLKALCC 1127 (2220)
T ss_pred HHHHHHHHHh
Confidence 4444444443
No 80
>PF06813 Nodulin-like: Nodulin-like; InterPro: IPR010658 This entry represents a conserved region within plant nodulin-like proteins and a number of uncharacterised proteins.
Probab=23.12 E-value=7.3e+02 Score=24.41 Aligned_cols=90 Identities=26% Similarity=0.235 Sum_probs=50.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhhhhHHHHHHHHHHhhHHHHHHhhhhccccCCcccceeEeeeC
Q psy14918 135 TLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNH 214 (463)
Q Consensus 135 ~~~~~l~yyf~las~~Ww~il~~~w~ls~~~k~~~e~i~~~~~yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~ 214 (463)
.+++++.=|+++++.++-.+ |-+.-. +....|+.+.+|..+++-.+.....+..+.++... .
T Consensus 131 ~vvgilk~~~GLSaai~t~i-----~~~~f~-------~~~~~fll~la~~~~~v~l~~~~~vr~~~~~~~~~------~ 192 (250)
T PF06813_consen 131 TVVGILKGFFGLSAAIFTQI-----YSAFFG-------DDPSSFLLFLAVLPAVVCLVAMFFVRPVPPSPSRT------R 192 (250)
T ss_pred ceehhhhHHHHhHHHHHHHH-----HHHHcC-------CChHHHHHHHHHHHHHHHHHHhhheeccCCccccc------c
Confidence 78889999999999875543 222111 12345677899988877666666666665441111 1
Q ss_pred CccccchhhhhHHHHHHHHHHHHHHHHHH
Q psy14918 215 NIHNLKYFVVFPLIIYLILGIIFLFGGFI 243 (463)
Q Consensus 215 ~~~~~~~f~~~Pl~i~llig~~fl~~~~~ 243 (463)
+...... |..-..+.+.++...++..+.
T Consensus 193 ~~~~~~~-F~~~~~i~l~la~yLl~~~i~ 220 (250)
T PF06813_consen 193 QPSESKR-FLFFFAIALALAIYLLVISIL 220 (250)
T ss_pred CcchhHH-HHHHHHHHHHHHHHHHHHHHH
Confidence 1111122 234455566666555555544
No 81
>KOG4520|consensus
Probab=23.07 E-value=36 Score=32.19 Aligned_cols=26 Identities=23% Similarity=0.411 Sum_probs=0.0
Q ss_pred ccchhhhhhhhHhhhhhHHHHhhhhh
Q psy14918 426 HVHKNERKKKKKKKKKKKKKKKKKKK 451 (463)
Q Consensus 426 ~~kk~~~~~~~~~~~~~~~~~~~~~~ 451 (463)
+..+++++|+|||+.+.+++++|+.|
T Consensus 168 Ed~~dd~Ek~kkk~R~~~r~~~kr~k 193 (238)
T KOG4520|consen 168 EDGPDDVEKEKKKTRSNHRHGSKRRK 193 (238)
T ss_pred ccCccHHHHHHHhhhhhhcccccccc
No 82
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=23.03 E-value=85 Score=25.81 Aligned_cols=26 Identities=19% Similarity=0.517 Sum_probs=21.6
Q ss_pred CCCCCchHHHHHHHHHHHHHHHHHHH
Q psy14918 84 RFKYPERPIVFLSGCYFMVSIGYLIR 109 (463)
Q Consensus 84 r~~yp~r~iifls~c~~~~sl~~li~ 109 (463)
|....++.++++++.+|+.+++|++.
T Consensus 64 r~~~~D~~li~~~~~~f~~~v~yI~~ 89 (92)
T PF03908_consen 64 RRDKTDRILIFFAFLFFLLVVLYILW 89 (92)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHhh
Confidence 33445899999999999999999873
No 83
>KOG0112|consensus
Probab=22.69 E-value=30 Score=39.83 Aligned_cols=9 Identities=44% Similarity=0.187 Sum_probs=3.4
Q ss_pred hhhhhhccc
Q psy14918 451 KKKKKWSFR 459 (463)
Q Consensus 451 ~~~~~~~~~ 459 (463)
|+||++++|
T Consensus 778 ~~~k~p~~k 786 (975)
T KOG0112|consen 778 KLKKKPHRK 786 (975)
T ss_pred hhhcCCCCc
Confidence 333333333
No 84
>PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed
Probab=22.18 E-value=3.7e+02 Score=25.67 Aligned_cols=29 Identities=17% Similarity=0.338 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhcc
Q psy14918 226 PLIIYLILGIIFLFGGFISLFRIRNVIKN 254 (463)
Q Consensus 226 Pl~i~llig~~fl~~~~~~l~~ir~~~k~ 254 (463)
-..+++++|++|++++.+-++|......+
T Consensus 8 I~~vLLliG~~f~ligaIGLlRfPD~YtR 36 (197)
T PRK12585 8 IISIMILIGGLLSILAAIGVIRLPDVYTR 36 (197)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcHHHH
Confidence 35678899999999999999998666543
No 85
>KOG3058|consensus
Probab=21.81 E-value=5.1e+02 Score=26.99 Aligned_cols=81 Identities=21% Similarity=0.301 Sum_probs=48.0
Q ss_pred CcCCCCCCCcC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchHHHHHHHHHHHHHHHHHHHHhcC-Ccc
Q psy14918 39 CAYPCQGIFFS-QPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVG-HEE 116 (463)
Q Consensus 39 C~~pC~~~~ft-~~e~~~~~~~~~v~s~i~~~~tlftilTF~i~~~r~~yp~r~iifls~c~~~~sl~~li~~~~g-~~~ 116 (463)
=.+|=||.+|+ -.|...+..+..+.+.+|++. +|++++|. +.|.--=.|..+++..+|++-++...+..+.+ ...
T Consensus 96 ~~pPLPDi~f~~vp~i~wa~~~~e~~~~~~~~~-~f~ll~fH--~~r~iv~rR~~f~~gt~y~lR~iTm~vT~LPvP~~h 172 (351)
T KOG3058|consen 96 PYPPLPDIFFDLVPEIPWAFSLCEIIGMILVVL-LFTLLLFH--QHRWIVLRRVFFLLGTLYLLRCITMYVTQLPVPGQH 172 (351)
T ss_pred CCCCCCcHHHHhcccchHHHHHHHHHHHHHHHH-HHHHHHHh--cchhhHHHHHHHHHHHHHHHhhheeEEEecccCCCC
Confidence 34566676663 344455666666777777664 44555664 22221123677788888888888776654433 456
Q ss_pred eeccCC
Q psy14918 117 VACEDQ 122 (463)
Q Consensus 117 i~C~~~ 122 (463)
.-|...
T Consensus 173 ~~C~~k 178 (351)
T KOG3058|consen 173 FRCAPK 178 (351)
T ss_pred cccCCc
Confidence 667543
No 86
>PF05620 DUF788: Protein of unknown function (DUF788); InterPro: IPR008506 This family consists of several eukaryotic proteins of unknown function.
Probab=21.67 E-value=44 Score=30.93 Aligned_cols=11 Identities=18% Similarity=0.244 Sum_probs=5.5
Q ss_pred hhhhccccccc
Q psy14918 293 IYESYVYNEYV 303 (463)
Q Consensus 293 fYe~~~~~~w~ 303 (463)
+-|..-++.+.
T Consensus 64 ~l~~~~rp~~~ 74 (170)
T PF05620_consen 64 FLEKMARPKYD 74 (170)
T ss_pred HHHHhCCCCcC
Confidence 34444555555
No 87
>PRK03113 putative disulfide oxidoreductase; Provisional
Probab=21.33 E-value=1.5e+02 Score=26.63 Aligned_cols=49 Identities=14% Similarity=0.381 Sum_probs=35.8
Q ss_pred HHHHHHHHhhHHHHHHhhhhccccCCcccceeEeeeCCccccchhhhhHHHHHHHHH
Q psy14918 178 YFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILG 234 (463)
Q Consensus 178 yfh~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig 234 (463)
+...++|.+.++.+..++..+.+.+-+-=..||. |+. .+.|+.++.++|
T Consensus 9 ~~~~l~~l~~~~~~~~aly~q~v~gl~PC~LCi~-------QRi-~~~~l~l~~lia 57 (139)
T PRK03113 9 NALLTAWGASFIATLGSLYFSEIMKFEPCVLCWY-------QRI-FMYPFVLWLGIA 57 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHH-------HHH-HHHHHHHHHHHH
Confidence 3367999999999999999999987777778985 443 345555544444
No 88
>PLN02967 kinase
Probab=21.25 E-value=43 Score=37.25 Aligned_cols=12 Identities=25% Similarity=0.349 Sum_probs=4.9
Q ss_pred hhhhhhhhcccc
Q psy14918 449 KKKKKKKWSFRF 460 (463)
Q Consensus 449 ~~~~~~~~~~~~ 460 (463)
||+++++|+||+
T Consensus 135 ~~~~~~~~~~~~ 146 (581)
T PLN02967 135 KKVRKRRKVKKM 146 (581)
T ss_pred hhhhhcccccch
Confidence 333344444443
No 89
>TIGR00955 3a01204 The Eye Pigment Precursor Transporter (EPP) Family protein.
Probab=21.14 E-value=1.2e+03 Score=26.03 Aligned_cols=69 Identities=17% Similarity=0.175 Sum_probs=37.4
Q ss_pred HHHHHHHHhhHHHHHHhhhhccccCCc--------ccceeEeeeCCccccch----hhhhHHHHHHHHHHHHHHHHHHHh
Q psy14918 178 YFHLAAWLIPTVKSVSVIIMSAVDGDT--------IAGICYVGNHNIHNLKY----FVVFPLIIYLILGIIFLFGGFISL 245 (463)
Q Consensus 178 yfh~~aW~iP~v~tii~l~~~~v~~~~--------~tg~CwIg~~~~~~~~~----f~~~Pl~i~llig~~fl~~~~~~l 245 (463)
++.-+.|.-|.--..-++..+++++.. .++.|-+.....-..++ -+|..+++++.+..+|.+++.+.|
T Consensus 530 ~~~W~~~isp~~ya~~al~~nef~~~~~~~c~~~~~~~~c~~~g~~~l~~~g~~~~~~~~~~~il~~~~~~~~~l~~~~L 609 (617)
T TIGR00955 530 YFKWLSYLSWFRYGNEGLLINQWSDVDNIECTSANTTGPCPSSGEVILETLSFRNADLYLDLIGLVILIFFFRLLAYFAL 609 (617)
T ss_pred HHHHHHHcCHHHHHHHHHHHHHhCCCccccccCcCcCCCCCcChHHHHHhcCCCcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555566654432 22446432111101111 247778888888888888887766
Q ss_pred h
Q psy14918 246 F 246 (463)
Q Consensus 246 ~ 246 (463)
-
T Consensus 610 ~ 610 (617)
T TIGR00955 610 R 610 (617)
T ss_pred H
Confidence 3
No 90
>PRK00611 putative disulfide oxidoreductase; Provisional
Probab=20.95 E-value=1.5e+02 Score=26.55 Aligned_cols=48 Identities=25% Similarity=0.464 Sum_probs=35.5
Q ss_pred HHHHHHhhHHHHHHhhhhccccCCcccceeEeeeCCccccchhhhhHHHHHHHHHH
Q psy14918 180 HLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGI 235 (463)
Q Consensus 180 h~~aW~iP~v~tii~l~~~~v~~~~~tg~CwIg~~~~~~~~~f~~~Pl~i~llig~ 235 (463)
..+||.+..+.|..++..+++-+-+-=..||. |+. .+.|+.++..++.
T Consensus 10 l~~aw~va~~a~~~sLy~q~v~gl~PC~LCiy-------QRi-~~~~l~l~~~la~ 57 (135)
T PRK00611 10 LYFAWLISCIGTLMSIYYSYILNVEPCVLCYY-------QRI-CLFPLVVILGIAA 57 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCchHHHH-------HHH-HHHHHHHHHHHHH
Confidence 67999999999999999999987777778985 443 3455555444433
No 91
>PF04929 Herpes_DNAp_acc: Herpes DNA replication accessory factor ; InterPro: IPR007013 Replicative DNA polymerases are capable of polymerising tens of thousands of nucleotides without dissociating from their DNA templates. The high processivity of these polymerases is dependent upon accessory proteins that bind to the catalytic subunit of the polymerase or to the substrate. The Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4) BMRF1 protein is an essential component of the viral DNA polymerase and is absolutely required for lytic virus replication []. BMRF1 is also a transactivator []. This family is predicted to have a UL42-like structure [].; PDB: 3I2M_X 3HSL_X 2Z0L_B.
Probab=20.52 E-value=33 Score=35.79 Aligned_cols=17 Identities=12% Similarity=0.036 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHhhccc
Q psy14918 323 HVVLLLKYFMALAVGLT 339 (463)
Q Consensus 323 ~~~~~lk~fm~li~GI~ 339 (463)
+.-+..-+-+.-|-|+.
T Consensus 237 l~sL~aALglCKIPgv~ 253 (381)
T PF04929_consen 237 LESLIAALGLCKIPGVC 253 (381)
T ss_dssp HHHHHHHHHHT-BTTTE
T ss_pred HHHHHHHhccCCCCcee
Confidence 33333333333444444
Done!