Query psy6890
Match_columns 116
No_of_seqs 118 out of 1114
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 21:05:46 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6890.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6890hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.8 2.8E-21 6.2E-26 132.9 7.3 87 6-92 250-337 (423)
2 PHA03234 DNA packaging protein 99.8 1.6E-20 3.5E-25 129.8 10.2 82 9-90 229-316 (338)
3 PHA03235 DNA packaging protein 99.8 6.9E-19 1.5E-23 124.3 9.7 82 10-91 237-324 (409)
4 KOG4220|consensus 99.8 1.5E-19 3.2E-24 125.0 4.2 87 10-99 410-496 (503)
5 PHA02834 chemokine receptor-li 99.8 5.8E-19 1.2E-23 121.3 5.9 82 10-91 222-310 (323)
6 PHA02638 CC chemokine receptor 99.8 3E-18 6.5E-23 121.3 9.7 86 10-95 306-402 (417)
7 PHA03087 G protein-coupled che 99.7 2.7E-16 5.8E-21 108.4 9.0 84 10-93 237-326 (335)
8 PF00001 7tm_1: 7 transmembran 99.4 2.2E-12 4.7E-17 84.4 9.2 67 6-73 191-257 (257)
9 PF10324 7TM_GPCR_Srw: Serpent 99.0 1.9E-09 4.1E-14 74.0 8.0 82 10-91 235-318 (318)
10 KOG2087|consensus 98.8 2.6E-11 5.6E-16 83.2 -6.8 76 14-91 232-307 (363)
11 PF10320 7TM_GPCR_Srsx: Serpen 98.5 1.6E-07 3.4E-12 63.1 4.4 80 8-88 178-257 (257)
12 PF10323 7TM_GPCR_Srv: Serpent 97.9 0.00022 4.7E-09 48.7 9.3 83 5-90 201-283 (283)
13 PF11970 Git3_C: G protein-cou 97.8 0.00033 7.2E-09 38.5 7.2 70 7-78 2-72 (76)
14 PF05462 Dicty_CAR: Slime mold 97.6 0.0014 3E-08 45.3 9.3 68 18-90 203-270 (303)
15 PF10321 7TM_GPCR_Srt: Serpent 97.3 0.00039 8.4E-09 48.2 4.2 77 5-90 231-307 (313)
16 PF05296 TAS2R: Mammalian tast 96.7 0.02 4.3E-07 39.6 8.3 78 8-91 225-302 (303)
17 PF10317 7TM_GPCR_Srd: Serpent 96.6 0.019 4E-07 39.3 7.8 77 6-86 216-292 (292)
18 PF10328 7TM_GPCR_Srx: Serpent 96.6 0.012 2.7E-07 39.7 6.8 77 4-88 197-273 (274)
19 PF10327 7TM_GPCR_Sri: Serpent 95.8 0.031 6.7E-07 38.6 5.5 73 7-83 230-302 (303)
20 PF10318 7TM_GPCR_Srh: Serpent 95.7 0.14 3E-06 35.1 8.4 80 6-90 222-302 (302)
21 PF02101 Ocular_alb: Ocular al 95.0 0.24 5.2E-06 35.4 7.7 28 13-40 240-267 (405)
22 PF03125 Sre: C. elegans Sre G 93.4 2 4.3E-05 30.5 12.0 43 50-92 281-323 (365)
23 PF03402 V1R: Vomeronasal orga 92.5 0.15 3.2E-06 34.8 2.9 71 9-84 193-263 (265)
24 KOG2575|consensus 89.5 0.53 1.1E-05 34.1 3.4 26 12-37 254-279 (510)
25 KOG4564|consensus 88.4 0.93 2E-05 33.4 4.1 77 5-89 342-418 (473)
26 PF10319 7TM_GPCR_Srj: Serpent 88.2 2.9 6.3E-05 29.2 6.3 70 10-83 239-309 (310)
27 PF02118 Srg: Srg family chemo 87.8 6.1 0.00013 26.5 7.7 75 6-83 200-274 (275)
28 PF09889 DUF2116: Uncharacteri 86.7 1.7 3.7E-05 22.6 3.5 25 7-31 30-54 (59)
29 KOG4193|consensus 85.5 15 0.00032 28.4 9.6 26 67-92 558-583 (610)
30 PF10624 TraS: Plasmid conjuga 83.9 0.98 2.1E-05 27.5 2.0 17 58-74 66-82 (164)
31 PF10292 7TM_GPCR_Srab: Serpen 78.9 19 0.00042 25.1 10.6 81 8-89 224-305 (324)
32 PF10326 7TM_GPCR_Str: Serpent 78.7 0.32 7E-06 33.4 -1.5 57 24-85 251-307 (307)
33 PF06072 Herpes_US9: Alphaherp 78.2 7.8 0.00017 20.2 4.6 26 8-33 24-50 (60)
34 PF03155 Alg6_Alg8: ALG6, ALG8 65.7 11 0.00023 28.1 3.7 26 11-36 219-244 (469)
35 COG5058 LAG1 Protein transport 65.5 32 0.00069 24.5 5.7 66 11-76 229-294 (395)
36 PF04789 DUF621: Protein of un 59.3 26 0.00055 24.4 4.3 77 5-86 218-294 (305)
37 PF05393 Hum_adeno_E3A: Human 58.6 15 0.00033 20.7 2.7 13 19-31 44-56 (94)
38 KOG3827|consensus 58.1 5 0.00011 28.9 0.9 27 16-42 63-89 (400)
39 PF05296 TAS2R: Mammalian tast 50.6 83 0.0018 21.8 7.0 73 10-88 223-295 (303)
40 PF15188 CCDC-167: Coiled-coil 46.2 23 0.00051 19.9 2.2 18 16-33 67-84 (85)
41 PF04505 Dispanin: Interferon- 44.9 54 0.0012 18.1 4.1 21 23-43 23-43 (82)
42 PF11298 DUF3099: Protein of u 43.9 54 0.0012 17.8 4.1 31 5-35 10-40 (73)
43 PF11119 DUF2633: Protein of u 42.7 51 0.0011 17.2 3.8 32 10-41 4-35 (59)
44 PF04238 DUF420: Protein of un 41.9 81 0.0018 19.3 7.4 28 9-36 31-58 (133)
45 PF00375 SDF: Sodium:dicarboxy 40.7 1.4E+02 0.003 21.6 9.0 76 14-89 167-247 (390)
46 COG3821 Predicted membrane pro 39.8 26 0.00056 23.2 1.9 18 20-37 7-24 (234)
47 PHA02849 putative transmembran 39.0 70 0.0015 17.7 3.8 24 18-41 17-40 (82)
48 PHA02657 hypothetical protein; 38.8 74 0.0016 17.9 3.5 24 18-41 27-50 (95)
49 TIGR01912 TatC-Arch Twin argin 37.8 1.1E+02 0.0024 20.6 4.8 31 10-40 8-38 (237)
50 PF06210 DUF1003: Protein of u 37.8 87 0.0019 18.4 6.1 58 18-81 5-62 (108)
51 TIGR00869 sec62 protein transl 36.2 1.1E+02 0.0023 20.7 4.4 26 12-37 117-142 (232)
52 PF04093 MreD: rod shape-deter 32.2 1.2E+02 0.0026 18.4 6.8 23 60-82 137-159 (160)
53 PHA03234 DNA packaging protein 31.7 1.9E+02 0.004 20.4 8.2 88 6-93 229-323 (338)
54 PF11036 YqgB: Virulence promo 31.5 18 0.00039 17.2 0.2 10 69-78 18-27 (43)
55 PF11014 DUF2852: Protein of u 30.5 1.3E+02 0.0027 18.1 3.7 22 18-40 11-32 (115)
56 PF06454 DUF1084: Protein of u 29.4 2E+02 0.0042 20.0 7.8 61 13-86 202-262 (281)
57 PF13800 Sigma_reg_N: Sigma fa 29.3 1.1E+02 0.0024 17.1 4.1 22 10-32 7-29 (96)
58 PF05391 Lsm_interact: Lsm int 29.3 29 0.00063 14.0 0.6 9 76-84 11-19 (21)
59 PF15050 SCIMP: SCIMP protein 27.5 1.1E+02 0.0023 18.5 3.0 33 53-85 8-40 (133)
60 PF06196 DUF997: Protein of un 26.6 1.2E+02 0.0025 16.8 2.9 20 13-32 4-23 (80)
61 COG4920 Predicted membrane pro 25.8 2.1E+02 0.0046 19.2 5.5 30 4-33 84-117 (249)
62 PF04628 Sedlin_N: Sedlin, N-t 25.1 60 0.0013 19.5 1.7 32 58-89 97-131 (132)
63 PF10445 DUF2456: Protein of u 24.5 1.5E+02 0.0033 17.0 3.7 29 11-39 25-53 (94)
64 PF10601 zf-LITAF-like: LITAF- 24.5 73 0.0016 16.9 1.9 8 28-35 44-51 (73)
65 PF13071 DUF3935: Protein of u 24.5 1.2E+02 0.0026 15.9 3.9 27 16-42 4-30 (68)
66 PF03839 Sec62: Translocation 24.1 2.3E+02 0.0051 19.0 4.9 62 13-76 110-171 (224)
67 PF05399 EVI2A: Ectropic viral 24.1 2.2E+02 0.0047 19.1 4.2 23 16-38 131-153 (227)
68 COG4420 Predicted membrane pro 23.5 2.2E+02 0.0049 18.6 6.2 31 49-79 84-114 (191)
69 PF01679 Pmp3: Proteolipid mem 23.2 84 0.0018 15.7 1.8 23 16-38 24-46 (51)
70 KOG1726|consensus 23.2 1.2E+02 0.0027 20.4 3.0 13 22-34 50-62 (225)
71 PF11286 DUF3087: Protein of u 23.1 2.1E+02 0.0046 18.3 8.3 34 11-45 11-44 (165)
72 PF07325 Curto_V2: Curtovirus 22.8 84 0.0018 18.5 1.9 13 79-91 52-64 (126)
73 COG2383 Uncharacterized conser 22.0 1.4E+02 0.0031 17.4 2.8 19 11-29 35-53 (109)
74 PF04612 T2SM: Type II secreti 22.0 30 0.00065 21.3 0.0 23 6-28 9-31 (160)
75 PF11587 Prion_bPrPp: Major pr 21.9 92 0.002 13.7 1.5 14 29-42 8-21 (29)
76 PF10842 DUF2642: Protein of u 21.6 35 0.00075 18.2 0.2 11 64-74 4-14 (66)
77 COG1687 AzlD Predicted branche 21.2 1.8E+02 0.004 17.1 3.1 22 15-36 6-27 (106)
78 COG4758 Predicted membrane pro 21.0 2.8E+02 0.0061 18.8 7.1 25 49-73 48-72 (235)
79 PF02295 z-alpha: Adenosine de 20.8 63 0.0014 17.0 1.1 13 65-77 34-46 (66)
80 TIGR02052 MerP mercuric transp 20.5 94 0.002 16.2 1.9 18 15-32 4-21 (92)
81 PF15086 UPF0542: Uncharacteri 20.3 1.7E+02 0.0036 16.0 3.0 18 23-40 23-40 (74)
82 PF12166 DUF3595: Protein of u 20.3 48 0.001 24.1 0.7 25 19-43 68-92 (422)
No 1
>KOG4219|consensus
Probab=99.85 E-value=2.8e-21 Score=132.91 Aligned_cols=87 Identities=26% Similarity=0.486 Sum_probs=76.4
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCC-CCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPC-ETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~-~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
.++.+.++|+.||+++|+++|.+||+||+++.++.....+. +......++....||+.+|+|+||+||++.|++||.++
T Consensus 250 ~~~~kak~K~vkmliiVV~~FaicWlPyh~y~il~~~~~~i~~~k~i~~vyl~~~WLaMSst~yNPiIY~~lN~Rfr~gf 329 (423)
T KOG4219|consen 250 HEQLKAKKKVVKMLIIVVVIFAICWLPYHIYFILNATNPEINRKKFIQQVYLAIYWLAMSSTCYNPIIYCFLNKRFRGGF 329 (423)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccChhHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhccHhhhhhHHHHHHHH
Confidence 45678899999999999999999999999999998765532 23456678999999999999999999999999999999
Q ss_pred HHhHhhhh
Q psy6890 85 KKVLLCRY 92 (116)
Q Consensus 85 ~~~~~~~~ 92 (116)
++.++|..
T Consensus 330 ~~~fr~cp 337 (423)
T KOG4219|consen 330 RRAFRWCP 337 (423)
T ss_pred hhhhheee
Confidence 99998764
No 2
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.84 E-value=1.6e-20 Score=129.75 Aligned_cols=82 Identities=12% Similarity=0.171 Sum_probs=66.1
Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhC-----CCCC-cchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHH
Q psy6890 9 LLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICS-----PCET-KIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRT 82 (116)
Q Consensus 9 ~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~-----~~~~-~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~ 82 (116)
.++++|++|++++++++|++||+||+++.++..+.. .+.. ......+.++.+++++|||+||+||++.+++||+
T Consensus 229 ~~~~~k~~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~~nsclNPiIY~f~~~~FR~ 308 (338)
T PHA03234 229 EKKHKKTLFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPEIHCFSNPLVYAFTGGDFRL 308 (338)
T ss_pred hhhhhhhhhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHHhhhhhhHHHHHHhhHHHHH
Confidence 356789999999999999999999999987654421 1111 1233457889999999999999999999999999
Q ss_pred HHHHhHhh
Q psy6890 83 AFKKVLLC 90 (116)
Q Consensus 83 ~~~~~~~~ 90 (116)
++++.++.
T Consensus 309 ~~~~~~~~ 316 (338)
T PHA03234 309 RFTACFQD 316 (338)
T ss_pred HHHHHHHH
Confidence 99988754
No 3
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.79 E-value=6.9e-19 Score=124.26 Aligned_cols=82 Identities=16% Similarity=0.417 Sum_probs=65.6
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhC-----CCC-CcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICS-----PCE-TKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~-----~~~-~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
++++|.++++++++++|++||+||+++.++..+.. ++. ......+..++.+++++|||+||+||++++++||++
T Consensus 237 ~~~~k~~~~v~iivv~F~iCWlPy~v~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~La~~ns~lNPiIY~~~~~~FRk~ 316 (409)
T PHA03235 237 KKRSRTLTFVCILLLSFLCLQTPFVAIMIFDSYATLIWPSDCEHINLRDAVSTLSRLVPNLHCLLNPILYAFLGNDFLKR 316 (409)
T ss_pred hcchhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCchhhhHHHHHHHHHHHHHHHHHhHhHHHHHHhhHHHHHH
Confidence 34668889999999999999999999988765422 111 112345677899999999999999999999999999
Q ss_pred HHHhHhhh
Q psy6890 84 FKKVLLCR 91 (116)
Q Consensus 84 ~~~~~~~~ 91 (116)
+++.++..
T Consensus 317 ~~~~l~~~ 324 (409)
T PHA03235 317 FRQCFRGE 324 (409)
T ss_pred HHHHHhhh
Confidence 99988653
No 4
>KOG4220|consensus
Probab=99.78 E-value=1.5e-19 Score=125.00 Aligned_cols=87 Identities=34% Similarity=0.805 Sum_probs=77.9
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHhHh
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKVLL 89 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~ 89 (116)
.+|+|++|++..|+++|+++|+||+++.++..|+++ -.+..+..+..||.|+||-+||+.|++-|..||+-+++++.
T Consensus 410 ~rErKAAkTLsAILlAFIiTWtPYNImVlv~tFC~~---CiP~tlW~~gYwLCYINSTiNP~CYALCNatFrkTfk~lL~ 486 (503)
T KOG4220|consen 410 VRERKAAKTLSAILLAFILTWTPYNIMVLVNTFCKN---CIPETLWTFGYWLCYINSTINPLCYALCNATFRKTFKRLLL 486 (503)
T ss_pred HHHHHHHHHHHHHHHHHHHHcccceeeeehHhhccc---ccchhHhhhhhheeeecccccHHHHHHHhHHHHHHHHHhhe
Confidence 789999999999999999999999999999998763 34566888999999999999999999999999999999998
Q ss_pred hhhcCCcCcc
Q psy6890 90 CRYCGEKNRR 99 (116)
Q Consensus 90 ~~~~~~~~~~ 99 (116)
|++...+..+
T Consensus 487 Cr~~~~~~~~ 496 (503)
T KOG4220|consen 487 CRWKKRRTRR 496 (503)
T ss_pred eeecccchhc
Confidence 8876655433
No 5
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.77 E-value=5.8e-19 Score=121.34 Aligned_cols=82 Identities=24% Similarity=0.405 Sum_probs=65.0
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCC--CC--c---chHHHHHHHHHHHHhhhhhhHHHHHhccHHHHH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPC--ET--K---IHPGIFEVATWLGYASSMVNPIFYTIFNKVFRT 82 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~--~~--~---~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~ 82 (116)
++++|.+|++++++++|++||+||+++.++..+.... +. . .......++.+++++|||+||+||++++++||+
T Consensus 222 ~~~~k~~k~~~~vv~~F~icWlPy~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~iNPiIY~~~~~~fR~ 301 (323)
T PHA02834 222 KNKTRSIKIILTVVTFTVVFWVPFNIVLFINSLQSVGLIDIGCYHFKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKK 301 (323)
T ss_pred cccceEEeehhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCccchHHHHHHHHHHHHHHHHHhhccccHHHHHHhcHHHHH
Confidence 4577889999999999999999999998876653210 10 0 111235678899999999999999999999999
Q ss_pred HHHHhHhhh
Q psy6890 83 AFKKVLLCR 91 (116)
Q Consensus 83 ~~~~~~~~~ 91 (116)
++++++++.
T Consensus 302 ~~~~~~~~~ 310 (323)
T PHA02834 302 VFKNMFCRT 310 (323)
T ss_pred HHHHHHHhh
Confidence 999988543
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.77 E-value=3e-18 Score=121.33 Aligned_cols=86 Identities=17% Similarity=0.336 Sum_probs=68.4
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCC-----------CCCcchHHHHHHHHHHHHhhhhhhHHHHHhccH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSP-----------CETKIHPGIFEVATWLGYASSMVNPIFYTIFNK 78 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~-----------~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 78 (116)
++++|.+|++++++++|++||+||+++.++..+... +..........++..++++|||+||+||++.++
T Consensus 306 ~~k~k~~rli~~ivi~f~lcW~Py~i~~ll~~~~~~~~~~~~~~~~~c~~~~l~~~~~vt~~la~~~sclNPiIY~f~~~ 385 (417)
T PHA02638 306 SKKTKSIIIVSIIIICSLICWIPLNIVILFATMYSFKGFNSIISEHICGFIKLGYAMMLAEAISLTHCCINPLIYTLIGE 385 (417)
T ss_pred cccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHhhhHHHHHHhCH
Confidence 446788999999999999999999999888765321 111123345678899999999999999999999
Q ss_pred HHHHHHHHhHhhhhcCC
Q psy6890 79 VFRTAFKKVLLCRYCGE 95 (116)
Q Consensus 79 ~fr~~~~~~~~~~~~~~ 95 (116)
+||+++++++++.....
T Consensus 386 ~FR~~l~~~~~~~~~~~ 402 (417)
T PHA02638 386 NFRMHLLMIFRNIFTNA 402 (417)
T ss_pred HHHHHHHHHHHHhcccc
Confidence 99999999986555333
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.68 E-value=2.7e-16 Score=108.37 Aligned_cols=84 Identities=24% Similarity=0.453 Sum_probs=66.7
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhC----C-CCC-cchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICS----P-CET-KIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~----~-~~~-~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
++++|++|++++++++|++||+|+++..++..... . +.. .....+..++.+++.+|+++||+||++++++||++
T Consensus 237 ~~~~k~~k~l~~iv~~f~i~w~P~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~~NPiIY~~~~~~fr~~ 316 (335)
T PHA03087 237 KKNKKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSGCKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNKH 316 (335)
T ss_pred hhcchHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHhhhhhHHHHcCHHHHHH
Confidence 56789999999999999999999999877654321 1 111 12334566888999999999999999999999999
Q ss_pred HHHhHhhhhc
Q psy6890 84 FKKVLLCRYC 93 (116)
Q Consensus 84 ~~~~~~~~~~ 93 (116)
+++.++....
T Consensus 317 ~~~~~~~~~~ 326 (335)
T PHA03087 317 KKKSLKLMFT 326 (335)
T ss_pred HHHHHHHHhh
Confidence 9999965543
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.42 E-value=2.2e-12 Score=84.40 Aligned_cols=67 Identities=28% Similarity=0.619 Sum_probs=58.6
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFY 73 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY 73 (116)
+++.++++|.++++++++++|++||+|+.+..++....+..+. .......++.+++++||++||+||
T Consensus 191 ~~~~~~~~~~~~~~~~i~~~f~~~~~P~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~s~~nP~iY 257 (257)
T PF00001_consen 191 RRRSRRERRAARTLLIIVLVFLLCWLPYFILSLLSVFSPSSSL-ISSILFYISYFLAFLNSCLNPIIY 257 (257)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHSSTSTC-SHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccCCceeHHHHHHHHcCccch-hhHHHHHHHHHHHHHHHhhCcEEC
Confidence 4566889999999999999999999999999998888664322 567789999999999999999999
No 9
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=99.02 E-value=1.9e-09 Score=74.04 Aligned_cols=82 Identities=18% Similarity=0.189 Sum_probs=69.2
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCC--CcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHh
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCE--TKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKV 87 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~--~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~ 87 (116)
+++.++++++..+.+.|+++-+|+.+..++..+..+.. ......+..+...+...|+..|++||.+++.+||+.++++
T Consensus 235 ~~~~~tt~li~~~ti~f~i~e~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~h~~ic~~mSsqYR~t~~~~ 314 (318)
T PF10324_consen 235 KKSDRTTKLILFMTISFLISELPQGIIFLLESFFEEDSGLIFIIIQLSIIFNILITINSSIHFFICCFMSSQYRKTVKKL 314 (318)
T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHhhhHHHhhhhhhHHHHHHHHHH
Confidence 46678899999999999999999999999977633211 2344566788999999999999999999999999999999
Q ss_pred Hhhh
Q psy6890 88 LLCR 91 (116)
Q Consensus 88 ~~~~ 91 (116)
++|+
T Consensus 315 f~~k 318 (318)
T PF10324_consen 315 FGCK 318 (318)
T ss_pred hccC
Confidence 9753
No 10
>KOG2087|consensus
Probab=98.82 E-value=2.6e-11 Score=83.25 Aligned_cols=76 Identities=20% Similarity=0.270 Sum_probs=58.1
Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHhHhhh
Q psy6890 14 KATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKVLLCR 91 (116)
Q Consensus 14 k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~~ 91 (116)
+++|-+..++.+-++||.|..++.+...++.+... ......+.....-+|||+||++|+++++.||+.++.+....
T Consensus 232 ~~akr~a~LvfTd~icw~Pi~f~~~~al~~~~li~--~~~sk~llv~flPlns~~NP~LYa~fT~~fk~d~~~l~~k~ 307 (363)
T KOG2087|consen 232 SVAKRMAFLVFTDCICWCPIAFFKFSALIGVELIS--VSYSKWLLVFFLPLNSCLNPFLYAFFTPVFKEDLFLLLSKV 307 (363)
T ss_pred hhhhCeeEEEEccccccCchheeeeHHhcCCcccC--hhhceeEEEEEEEcccccCchhHHHcCHHHHHHHHHHHhhc
Confidence 78899999999999999999999988877542111 11111233445568999999999999999999998887543
No 11
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.51 E-value=1.6e-07 Score=63.08 Aligned_cols=80 Identities=20% Similarity=0.160 Sum_probs=65.4
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHh
Q psy6890 8 ILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKV 87 (116)
Q Consensus 8 ~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~ 87 (116)
..++++|+.|.+.++++.|+.+|.--.+...+....+. +.+....+..-..+++..|.+.|-++|.++|+|||+++++.
T Consensus 178 ~~~~~~kv~ksL~v~v~i~i~~w~~s~~~~~v~~~~~~-~~~~~~~i~~~~~i~v~~~~s~~ffV~~~~S~EYR~af~~~ 256 (257)
T PF10320_consen 178 NSSRSKKVFKSLKVTVIIFIFSWFLSQIINTVSLALGL-DGETIAIIQMYAGIFVNISYSQNFFVYYWRSSEYRKAFREL 256 (257)
T ss_pred cchhHHHHHHHhhhheeeeeHHHHHHHHHHHHHHHhCC-cHHHHHHHHHHHHHHHHHHHHHHheEEEEcCHHHHHHHHHh
Confidence 46678999999999999999999998888776554321 33344556777889999999999999999999999999986
Q ss_pred H
Q psy6890 88 L 88 (116)
Q Consensus 88 ~ 88 (116)
+
T Consensus 257 ~ 257 (257)
T PF10320_consen 257 F 257 (257)
T ss_pred C
Confidence 4
No 12
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=97.88 E-value=0.00022 Score=48.65 Aligned_cols=83 Identities=20% Similarity=0.229 Sum_probs=56.2
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 5 HGRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 5 ~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
..++.++|.+++.+..+.++++++...=+ +......... +.+....+..+...+.-..|.+||+.=.+.|+++|+.+
T Consensus 201 ~s~~~~rE~~L~~~~~i~~~a~~~~~~~~-~~~~~~~~~~--~~~~~~~~r~~y~~~~~~~s~inP~~LLi~n~~lr~~~ 277 (283)
T PF10323_consen 201 SSRSRRREIRLAIQVFILFCAFFVILVYY-IFSNYFAQNF--NTDPIFYLRAFYPILNGLLSFINPWMLLIFNKDLRKQV 277 (283)
T ss_pred hhhhhhHhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcc--cchHHHHHHHHHHHHHHHHHhhhhHHhhhccHHHHHHH
Confidence 44566778888888877776655544433 3333333322 22223334456677777899999999999999999999
Q ss_pred HHhHhh
Q psy6890 85 KKVLLC 90 (116)
Q Consensus 85 ~~~~~~ 90 (116)
++.++|
T Consensus 278 ~~~~~~ 283 (283)
T PF10323_consen 278 RRMLKC 283 (283)
T ss_pred HHHcCC
Confidence 998875
No 13
>PF11970 Git3_C: G protein-coupled glucose receptor regulating Gpa2 C-term; InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This family is the conserved C-terminal domain of the member proteins.
Probab=97.78 E-value=0.00033 Score=38.53 Aligned_cols=70 Identities=11% Similarity=0.205 Sum_probs=53.8
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHhhh-HHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccH
Q psy6890 7 RILLLEQKATKVLGVVFFTFVILWA-PFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNK 78 (116)
Q Consensus 7 ~~~~~~~k~~k~~~~iv~~f~i~~~-P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 78 (116)
+++++-+|.+|.+++==++|+++|+ |+..-.+-..... .......+..++..+..+|..+|-++|++.-+
T Consensus 2 ~~r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~--~~~p~~~l~~i~~~~~~~~G~VD~lvf~~~er 72 (76)
T PF11970_consen 2 KRRKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEI--GHGPSFWLFCIAGFMQPSQGFVDCLVFTLRER 72 (76)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCCCchHHHHHHHHHHHccCHHHhhheeeecc
Confidence 3455667888899999999999999 8776655444222 33455678889999999999999999997653
No 14
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=97.56 E-value=0.0014 Score=45.27 Aligned_cols=68 Identities=18% Similarity=0.161 Sum_probs=51.9
Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHhHhh
Q psy6890 18 VLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKVLLC 90 (116)
Q Consensus 18 ~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~ 90 (116)
-+..=.++|++||+|-.+..+.+.+++ . ...+..+-..++.+.=.+|.++|++.++-.++.+...+.+
T Consensus 203 kL~~Yp~ifiicw~fa~INRI~~~~~~--~---~~~l~~Lh~~~s~lqGf~nsivy~~n~~~~~~~~~~~~~~ 270 (303)
T PF05462_consen 203 KLVNYPLIFIICWIFATINRIYNFIGK--N---PFWLSVLHVGFSPLQGFFNSIVYGYNNSLMWRYLGSKILC 270 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC--C---chHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 355788999999999999999887743 1 2334444455666677899999999999999998887743
No 15
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=97.30 E-value=0.00039 Score=48.16 Aligned_cols=77 Identities=14% Similarity=0.150 Sum_probs=58.1
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 5 HGRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 5 ~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
+++..+.++++.-...++.+...+.++-|...... ..+..+..+....=..++..+|+||..+|+..|+++
T Consensus 231 s~~~~k~~~qI~iQs~iIC~f~~i~a~iyv~m~f~---------~~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtIR~~~ 301 (313)
T PF10321_consen 231 SSQLSKAQRQIFIQSVIICFFHAIAAVIYVYMQFF---------PPPPWLIIIGQISWQLSHGCPPIIYLTLNRTIRNSV 301 (313)
T ss_pred chhhhhhhHHHHHHHHHHHHHHHHHHHHHHeeeee---------cccHHHHHHHHHHHhccCCccceEEEEECHHHHHHH
Confidence 34566777888888888877777776666444322 123456777788888899999999999999999999
Q ss_pred HHhHhh
Q psy6890 85 KKVLLC 90 (116)
Q Consensus 85 ~~~~~~ 90 (116)
+++++.
T Consensus 302 ~k~~~~ 307 (313)
T PF10321_consen 302 LKMLGP 307 (313)
T ss_pred HHHHcc
Confidence 999853
No 16
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.70 E-value=0.02 Score=39.56 Aligned_cols=78 Identities=17% Similarity=0.125 Sum_probs=54.7
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHh
Q psy6890 8 ILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKV 87 (116)
Q Consensus 8 ~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~ 87 (116)
+.++..|++|+++..++.|+++++-..+..+...+ ........+...+..+.+...|+|=.+-|++.|+++++.
T Consensus 225 s~~aH~~a~k~~~sfl~ly~~~~~~~~~~~~~~~~------~~~~~~~~~~~~i~~~yps~hs~iLIlgn~KLr~~~~~i 298 (303)
T PF05296_consen 225 STEAHIRAIKTMISFLILYIIYFLSLILSFLSFFF------PENSIWFWVCEIIIALYPSGHSIILILGNPKLRQALLKI 298 (303)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------ccccHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHH
Confidence 44566777888777777776655443332222211 122335677888899999999999999999999999999
Q ss_pred Hhhh
Q psy6890 88 LLCR 91 (116)
Q Consensus 88 ~~~~ 91 (116)
+.+.
T Consensus 299 l~~~ 302 (303)
T PF05296_consen 299 LWCL 302 (303)
T ss_pred Hhhc
Confidence 8653
No 17
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=96.64 E-value=0.019 Score=39.35 Aligned_cols=77 Identities=16% Similarity=0.263 Sum_probs=57.2
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFK 85 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 85 (116)
++.++..++.+|.+.+=.+.=+++..|-....++..+.. ........+...++.+-+.+||+++.++-+-||++++
T Consensus 216 ~~tk~~h~~lv~~Lt~Q~~lP~~~~~p~~~~~~~~~~~~----~~~~~~e~~~~~~~~~~~~~~P~itl~fv~PYR~~i~ 291 (292)
T PF10317_consen 216 ERTKSMHRQLVKGLTIQALLPLFFYIPGVIIYFLSQFTG----YEHPFLEYLIFMLASLPPLIDPLITLYFVRPYRKAIL 291 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhc----cccHHHHHHHHHHHHHHHHhchHhheeeeHhHHHHhc
Confidence 344555666777777777777888887777766665533 2345566677777888999999999999999999986
Q ss_pred H
Q psy6890 86 K 86 (116)
Q Consensus 86 ~ 86 (116)
|
T Consensus 292 r 292 (292)
T PF10317_consen 292 R 292 (292)
T ss_pred C
Confidence 4
No 18
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=96.63 E-value=0.012 Score=39.73 Aligned_cols=77 Identities=14% Similarity=0.104 Sum_probs=54.2
Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 4 RHGRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 4 ~~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
++++++++|.+..+....--+.|++.++-|+...-+.. .... .+..+...-.+.-++|++|+.+.|+|+|+.
T Consensus 197 ~s~~r~rke~~f~~Qs~~Q~~~~~i~~~~~~~~~~~~~------~~~~--~F~~~t~~w~~~h~~DG~i~l~fN~~~r~~ 268 (274)
T PF10328_consen 197 ESKKRRRKEIRFFIQSFIQDLLYLIDLIFYFFIPPLSS------NRWW--QFFCTTFSWVLVHALDGLIMLIFNSEIRRK 268 (274)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------ccHH--HHHHHHHHHHHHHHhcceeEeEEcHHHHHH
Confidence 44567788999999999999999999988887744421 1111 122222222334457999999999999999
Q ss_pred HHHhH
Q psy6890 84 FKKVL 88 (116)
Q Consensus 84 ~~~~~ 88 (116)
+++..
T Consensus 269 ~~~~~ 273 (274)
T PF10328_consen 269 IRKKK 273 (274)
T ss_pred HHhcc
Confidence 87754
No 19
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=95.82 E-value=0.031 Score=38.62 Aligned_cols=73 Identities=22% Similarity=0.122 Sum_probs=57.8
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 7 RILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 7 ~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
+.-++.+++++.+.+=.++-.+|-+|..+......... .....+-.....+..++|.+|-++-.+.++.||+.
T Consensus 230 ~ty~kHk~av~SLi~Q~~~~~i~~~P~~~~~~~~~~~~----~~~q~i~~~~~~~f~~HS~~n~ivli~t~ppYR~f 302 (303)
T PF10327_consen 230 QTYQKHKEAVRSLIAQFATSSICILPPFIFVVVVIFEF----EDAQVISEICLAIFSSHSSVNMIVLIITTPPYRKF 302 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhheecC----CCcHHHHHHHHHHHHHhhHhhheeeeEcCcchhhc
Confidence 34566778888888888888999999998887666533 23345566777888899999999999999999985
No 20
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=95.74 E-value=0.14 Score=35.09 Aligned_cols=80 Identities=11% Similarity=0.225 Sum_probs=54.3
Q ss_pred hhhhhhhHHHHHHHHHHHHH-HHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFT-FVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~-f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
++..+..+|..+.+.+=+++ +++.-.|.........++. ....+..+...+...+..+.-++..+.++.+|+.+
T Consensus 222 ~~T~k~Qkkfl~~l~iQ~~ip~~~l~~P~~~~~~~~~~~~-----~~q~~~n~~~~~~~~HG~~sti~mi~~~~pYR~~~ 296 (302)
T PF10318_consen 222 KKTRKMQKKFLIALIIQVLIPFIFLFIPLIYFIISIIFGY-----YNQALNNISFIIISLHGIASTIVMILVHKPYRKFL 296 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc-----cccccchHHHHHHHhccHHHHHHHhhccHHHHHHH
Confidence 34445566666555554444 3444557777666554432 12334556677788899999999999999999999
Q ss_pred HHhHhh
Q psy6890 85 KKVLLC 90 (116)
Q Consensus 85 ~~~~~~ 90 (116)
+++++|
T Consensus 297 ~~~~~~ 302 (302)
T PF10318_consen 297 LSLFRC 302 (302)
T ss_pred HHHhcC
Confidence 999975
No 21
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=95.01 E-value=0.24 Score=35.37 Aligned_cols=28 Identities=18% Similarity=0.303 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHH
Q psy6890 13 QKATKVLGVVFFTFVILWAPFFILNLIP 40 (116)
Q Consensus 13 ~k~~k~~~~iv~~f~i~~~P~~v~~l~~ 40 (116)
..+-+-++.|+++|.+||+|-.+--.+.
T Consensus 240 ~~IK~kFf~I~lVF~iCWlpNIINg~LL 267 (405)
T PF02101_consen 240 AQIKIKFFKIMLVFYICWLPNIINGSLL 267 (405)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhHhHHHH
Confidence 3344567889999999999988775543
No 22
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=93.43 E-value=2 Score=30.53 Aligned_cols=43 Identities=7% Similarity=-0.058 Sum_probs=34.4
Q ss_pred chHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHhHhhhh
Q psy6890 50 IHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKVLLCRY 92 (116)
Q Consensus 50 ~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~~~ 92 (116)
.......+......+++.+=|.+-....+..|+++++.+....
T Consensus 281 ~~~~~~~~~e~~i~l~~~~i~~~~i~s~~~wrk~f~~~~~~~~ 323 (365)
T PF03125_consen 281 YRTILNHIFENCIFLNPIFICPVIIFSVPSWRKEFKRSFPKIR 323 (365)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHcCHHHHHHHHHhhhhhh
Confidence 3345567788888888888888888999999999999886543
No 23
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=92.53 E-value=0.15 Score=34.76 Aligned_cols=71 Identities=18% Similarity=0.220 Sum_probs=52.4
Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 9 LLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 9 ~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
...|.|++|++.+.+.+|+.++..-.++.+......+ + ..+..+..+++..-+.+-|++-.-.+++..+.+
T Consensus 193 ~SpE~RAtktILlLVs~FV~fY~l~si~~~~~~~~~~-~----~~~~~~~~~ls~cfptisPfvLI~~d~~i~~~~ 263 (265)
T PF03402_consen 193 SSPETRATKTILLLVSTFVSFYGLSSILFIYLTSFKN-S----PWLLNISVFLSSCFPTISPFVLISSDKRIIKFL 263 (265)
T ss_pred CChhHHHhCeEeeHHHHHHHHHhHHHHHHHHHHHhcC-C----cceeEHHHHHhHHhHhhChHHhhccCchHHHHh
Confidence 3579999999999999999999998887654433221 1 223345567777778899999988888776654
No 24
>KOG2575|consensus
Probab=89.50 E-value=0.53 Score=34.09 Aligned_cols=26 Identities=35% Similarity=0.599 Sum_probs=22.4
Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHH
Q psy6890 12 EQKATKVLGVVFFTFVILWAPFFILN 37 (116)
Q Consensus 12 ~~k~~k~~~~iv~~f~i~~~P~~v~~ 37 (116)
=.+++++.++|+++|+++|.|+....
T Consensus 254 f~ri~~ia~~Vv~TF~iiw~P~~~~~ 279 (510)
T KOG2575|consen 254 FARIIKIALAVVGTFVIIWLPFLLSG 279 (510)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 46788999999999999999998553
No 25
>KOG4564|consensus
Probab=88.36 E-value=0.93 Score=33.44 Aligned_cols=77 Identities=13% Similarity=0.146 Sum_probs=48.0
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 5 HGRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 5 ~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
+.......+|++|-.++.+=+|-+-+..+ .+.++.+ ........+...+..+.-.+=-+||++.|.+.|.++
T Consensus 342 ~~~~~~~y~K~vKaTLvLIPLfGI~~ilf-------~~~P~~~-~~~~v~~~~~~~L~SfQGf~VAvlYCFlN~EVq~el 413 (473)
T KOG4564|consen 342 NASETDQYRKLVKATLVLIPLFGIHYILF-------AFRPDED-TLREVYLYFELFLGSFQGFFVAVLYCFLNGEVQAEL 413 (473)
T ss_pred cccchHHHHHHHHHHHHHHHHcCCeeEEE-------EecCchH-HHHHHHHHHHHHHHhccchheehheeecCHHHHHHH
Confidence 33444557888888887776666553322 2222111 111222334555566666677799999999999999
Q ss_pred HHhHh
Q psy6890 85 KKVLL 89 (116)
Q Consensus 85 ~~~~~ 89 (116)
|+.+.
T Consensus 414 rr~W~ 418 (473)
T KOG4564|consen 414 RRKWS 418 (473)
T ss_pred HHHHH
Confidence 99884
No 26
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=88.22 E-value=2.9 Score=29.23 Aligned_cols=70 Identities=14% Similarity=0.108 Sum_probs=47.3
Q ss_pred hhhHHHHHHHHHH-HHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 10 LLEQKATKVLGVV-FFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 10 ~~~~k~~k~~~~i-v~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
+-++++.|.+++= ++=-++|+.|-.+.+..-.++ .+...+...+......+=+++||+.-.+.-|.||+.
T Consensus 239 ~lq~qL~~AL~vQT~IPi~vsf~Pc~~~wy~pif~----i~~~~~~n~~~~iAls~FPf~DPlAii~~lP~~R~r 309 (310)
T PF10319_consen 239 RLQRQLFKALIVQTVIPICVSFSPCVLSWYGPIFG----IDLGRWNNYFSVIALSAFPFLDPLAIILCLPAFRNR 309 (310)
T ss_pred HHHHHHHHHHHHHHHhHHHHhhccHHHHHhHHHHc----CChhHHHHHHHHHHHHHccccCchHhheecHHhhcc
Confidence 3344444444432 222478899988887655553 344566666677777777899999999999999975
No 27
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=87.82 E-value=6.1 Score=26.50 Aligned_cols=75 Identities=16% Similarity=0.133 Sum_probs=39.9
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTA 83 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 83 (116)
++..+.|++..++.++..+++++.-+...+..+...... .........+.....=+-+...|++-.+++++.|+.
T Consensus 200 ~~~~~~er~L~~is~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~f~sD~ltl~~P~iLl~fs~~vR~~ 274 (275)
T PF02118_consen 200 KRIKSVERNLTIISFIISFVQLLIAIWQIINSFAFFFDF---LPISSFLYYLLPFASDLLTLSQPYILLIFSKNVRRQ 274 (275)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccHHHHHHHHHHHHHHHHHHHHHHHHheECHhhhcc
Confidence 344455677777666666666655555444322222111 111222222222222233556799999999999975
No 28
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=86.74 E-value=1.7 Score=22.65 Aligned_cols=25 Identities=16% Similarity=0.337 Sum_probs=15.1
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHhhh
Q psy6890 7 RILLLEQKATKVLGVVFFTFVILWA 31 (116)
Q Consensus 7 ~~~~~~~k~~k~~~~iv~~f~i~~~ 31 (116)
+++++.+|.-.++..++++|++-|+
T Consensus 30 k~qk~~~~~~~i~~~~~i~~l~v~~ 54 (59)
T PF09889_consen 30 KRQKRMRKTQYIFFGIFILFLAVWI 54 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555666677777776666554
No 29
>KOG4193|consensus
Probab=85.50 E-value=15 Score=28.35 Aligned_cols=26 Identities=27% Similarity=0.443 Sum_probs=20.2
Q ss_pred hhhHHHHHhccHHHHHHHHHhHhhhh
Q psy6890 67 MVNPIFYTIFNKVFRTAFKKVLLCRY 92 (116)
Q Consensus 67 ~~NPiiY~~~~~~fr~~~~~~~~~~~ 92 (116)
+.=-++|+++.++.|++.++.++|..
T Consensus 558 ~fIFi~~cll~~kvr~~~~k~~~~~~ 583 (610)
T KOG4193|consen 558 VFIFIFHCLLRKKVRKEYRKWLCCGR 583 (610)
T ss_pred hHhhHhhhhhhHHHHHHHHHHhcccC
Confidence 34457788899999999999997443
No 30
>PF10624 TraS: Plasmid conjugative transfer entry exclusion protein TraS; InterPro: IPR018898 Entry exclusion (Eex) is a process which prevents redundant transfer of DNA between donor cells. TraS is a protein involved in Eex. It blocks redundant conjugative DNA synthesis and transport between donor cells, and it is suggested that TraS interferes with a signalling pathway that is required to trigger DNA transfer []. TraS on the recipient cell is known to form an interaction with TraG on the donor cell [].
Probab=83.94 E-value=0.98 Score=27.54 Aligned_cols=17 Identities=18% Similarity=0.143 Sum_probs=9.8
Q ss_pred HHHHHHhhhhhhHHHHH
Q psy6890 58 ATWLGYASSMVNPIFYT 74 (116)
Q Consensus 58 ~~~l~~~ns~~NPiiY~ 74 (116)
...++-+.|.+||++..
T Consensus 66 wyi~mpVcsllsPllsL 82 (164)
T PF10624_consen 66 WYILMPVCSLLSPLLSL 82 (164)
T ss_pred EEeeecHHHHHhHHHHH
Confidence 33445566667776544
No 31
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=78.93 E-value=19 Score=25.10 Aligned_cols=81 Identities=14% Similarity=0.053 Sum_probs=54.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHh-hCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHH
Q psy6890 8 ILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSI-CSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKK 86 (116)
Q Consensus 8 ~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~-~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 86 (116)
+..++.|..+++..++.+..+.-+.+.+..+.... .+..+......+.. ........+.+=|++.....++.|+..++
T Consensus 224 Ql~ENl~slr~L~p~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~y~al~e-~~~~~P~Y~ii~~~~~~~~~k~~r~~~~~ 302 (324)
T PF10292_consen 224 QLEENLRSLRLLKPFIILSSIFIFFYIFASIFLRLFNPSMSKPNYFALAE-LNHIFPLYSIISPLILYRKIKKIRKKRKK 302 (324)
T ss_pred hHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCChhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678889999988888888887777777665543 33222222222222 33456667888899888888888888777
Q ss_pred hHh
Q psy6890 87 VLL 89 (116)
Q Consensus 87 ~~~ 89 (116)
-+.
T Consensus 303 ~l~ 305 (324)
T PF10292_consen 303 RLE 305 (324)
T ss_pred HHH
Confidence 664
No 32
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=78.66 E-value=0.32 Score=33.36 Aligned_cols=57 Identities=19% Similarity=0.332 Sum_probs=39.1
Q ss_pred HHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHH
Q psy6890 24 FTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFK 85 (116)
Q Consensus 24 ~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 85 (116)
+=.++.+.|..+..+.-.++-+ .. ........+..+-+++||++-.+.-++||++++
T Consensus 251 iP~i~~~~P~~~~~~~p~~~i~--~~---~~~~~~~~~~~~yP~iDpl~~i~~ik~yR~~i~ 307 (307)
T PF10326_consen 251 IPFIFMYIPVFIVFILPFFGID--LG---FFSNIISILISLYPAIDPLPVIFIIKDYRKAIK 307 (307)
T ss_pred hhheeeecchhheeeeeccCCC--CC---ccccHhhhhEEEEeehhhheeeEeeHHHHHhhC
Confidence 3456678888877766555432 11 122244456667899999999999999999874
No 33
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=78.21 E-value=7.8 Score=20.18 Aligned_cols=26 Identities=4% Similarity=0.002 Sum_probs=14.6
Q ss_pred hhhhhHHHHHHHHHHHHHH-HHhhhHH
Q psy6890 8 ILLLEQKATKVLGVVFFTF-VILWAPF 33 (116)
Q Consensus 8 ~~~~~~k~~k~~~~iv~~f-~i~~~P~ 33 (116)
..++++|...+.+.++.++ ++|-+-.
T Consensus 24 ~~r~RrRrc~~~v~~v~~~~~~c~~S~ 50 (60)
T PF06072_consen 24 ASRRRRRRCRLAVAIVFAVVALCVLSG 50 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455666666555555555 5554443
No 34
>PF03155 Alg6_Alg8: ALG6, ALG8 glycosyltransferase family; InterPro: IPR004856 N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one []. In the human alg6 gene, a C-T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147 []. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0005789 endoplasmic reticulum membrane
Probab=65.74 E-value=11 Score=28.10 Aligned_cols=26 Identities=27% Similarity=0.352 Sum_probs=22.7
Q ss_pred hhHHHHHHHHHHHHHHHHhhhHHHHH
Q psy6890 11 LEQKATKVLGVVFFTFVILWAPFFIL 36 (116)
Q Consensus 11 ~~~k~~k~~~~iv~~f~i~~~P~~v~ 36 (116)
.-.|..+.-++++++|.++|.|+...
T Consensus 219 ~~~~~~~lg~~Vi~~f~~~~~PF~~~ 244 (469)
T PF03155_consen 219 SIKRLIKLGIVVIATFALSFGPFLYS 244 (469)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45889999999999999999999633
No 35
>COG5058 LAG1 Protein transporter of the TRAM (translocating chain-associating membrane) superfamily, longevity assurance factor [Intracellular trafficking and secretion]
Probab=65.50 E-value=32 Score=24.54 Aligned_cols=66 Identities=11% Similarity=0.236 Sum_probs=46.5
Q ss_pred hhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhc
Q psy6890 11 LEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIF 76 (116)
Q Consensus 11 ~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~ 76 (116)
.++...-+++=.+++-++-|+-|+.-.+.....--..-+....+..+++++-++|+.+-+.|++++
T Consensus 229 prkD~~elv~HHIVTllLI~lSY~fhftr~GlAI~itmDvSD~~Ls~sK~lnYl~~~l~~~iF~iF 294 (395)
T COG5058 229 PRKDFKELVFHHIVTLLLIWLSYVFHFTRMGLAIYITMDVSDFFLSLSKTLNYLNSVLATFIFGIF 294 (395)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHhccceEEEEEeccHHHHHHHHHHHHhhchhHHHHHHHH
Confidence 344445567777888889999988776654432110123556788899999999999888888865
No 36
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=59.29 E-value=26 Score=24.45 Aligned_cols=77 Identities=14% Similarity=0.091 Sum_probs=48.9
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHH
Q psy6890 5 HGRILLLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAF 84 (116)
Q Consensus 5 ~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 84 (116)
.....+.|.++.|..+.+.++|- .+..+..+.....- +......+-.+...+-...-++-|..+.+.+++.||-+
T Consensus 218 kssLnK~Ei~ILKQAifIFvlFQ---~ss~VFl~~qt~~~--~~~tAFlIKR~iNT~ei~agaatP~ffffTs~eirk~~ 292 (305)
T PF04789_consen 218 KSSLNKNEITILKQAIFIFVLFQ---ASSCVFLLCQTFKF--NVATAFLIKRIINTMEIFAGAATPCFFFFTSKEIRKLV 292 (305)
T ss_pred hccCCchhHHHHHHHHHHHHHHH---HHHHHHHhheeeee--cchHHHHHHHHHHHHHHHhhccCCeEEEEehHHHHHHH
Confidence 33445667888888766655554 44444444443322 22223344566677777888899999999999998876
Q ss_pred HH
Q psy6890 85 KK 86 (116)
Q Consensus 85 ~~ 86 (116)
-.
T Consensus 293 s~ 294 (305)
T PF04789_consen 293 SS 294 (305)
T ss_pred Hh
Confidence 44
No 37
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=58.58 E-value=15 Score=20.75 Aligned_cols=13 Identities=0% Similarity=0.125 Sum_probs=6.1
Q ss_pred HHHHHHHHHHhhh
Q psy6890 19 LGVVFFTFVILWA 31 (116)
Q Consensus 19 ~~~iv~~f~i~~~ 31 (116)
.+++++.+++||.
T Consensus 44 Fil~VilwfvCC~ 56 (94)
T PF05393_consen 44 FILLVILWFVCCK 56 (94)
T ss_pred HHHHHHHHHHHHH
Confidence 3444444555543
No 38
>KOG3827|consensus
Probab=58.13 E-value=5 Score=28.89 Aligned_cols=27 Identities=19% Similarity=0.319 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHh
Q psy6890 16 TKVLGVVFFTFVILWAPFFILNLIPSI 42 (116)
Q Consensus 16 ~k~~~~iv~~f~i~~~P~~v~~l~~~~ 42 (116)
-.|+++..+.|++.|+-+.+++-+..+
T Consensus 63 R~~lliF~~sf~~SWl~Fg~iwwlIA~ 89 (400)
T KOG3827|consen 63 RWMLLIFSLSFVLSWLFFGVIWWLIAY 89 (400)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457888899999999999987655443
No 39
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=50.56 E-value=83 Score=21.85 Aligned_cols=73 Identities=12% Similarity=0.110 Sum_probs=40.5
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHHhH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKKVL 88 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~ 88 (116)
..+.++.....-+++.|++.++.|++..++... .....-.....++...-.++.|-.-.+.==-=.+.+|+.+
T Consensus 223 ~ps~~aH~~a~k~~~sfl~ly~~~~~~~~~~~~------~~~~~~~~~~~~~~~~i~~~yps~hs~iLIlgn~KLr~~~ 295 (303)
T PF05296_consen 223 DPSTEAHIRAIKTMISFLILYIIYFLSLILSFL------SFFFPENSIWFWVCEIIIALYPSGHSIILILGNPKLRQAL 295 (303)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhccccHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHH
Confidence 346677777788888999999998877665432 1111122344455555555555544433222233444444
No 40
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=46.17 E-value=23 Score=19.89 Aligned_cols=18 Identities=11% Similarity=0.226 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHhhhHH
Q psy6890 16 TKVLGVVFFTFVILWAPF 33 (116)
Q Consensus 16 ~k~~~~iv~~f~i~~~P~ 33 (116)
-|.+.+.+++|++|.+-|
T Consensus 67 rK~~~ls~~l~~v~~LvY 84 (85)
T PF15188_consen 67 RKSMLLSVALFFVCFLVY 84 (85)
T ss_pred hhhHHHHHHHHHHHHHHc
Confidence 357777778888887755
No 41
>PF04505 Dispanin: Interferon-induced transmembrane protein; InterPro: IPR007593 This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression [].; GO: 0009607 response to biotic stimulus, 0016021 integral to membrane
Probab=44.89 E-value=54 Score=18.09 Aligned_cols=21 Identities=19% Similarity=0.195 Sum_probs=13.6
Q ss_pred HHHHHHhhhHHHHHHHHHHhh
Q psy6890 23 FFTFVILWAPFFILNLIPSIC 43 (116)
Q Consensus 23 v~~f~i~~~P~~v~~l~~~~~ 43 (116)
++..++|+.|..++.++....
T Consensus 23 I~s~l~Cc~PlGi~Ai~~s~k 43 (82)
T PF04505_consen 23 IFSTLCCCWPLGIVAIVYSSK 43 (82)
T ss_pred HHHHHHHHhhHHHHHheechh
Confidence 333446666988888876543
No 42
>PF11298 DUF3099: Protein of unknown function (DUF3099); InterPro: IPR021449 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=43.94 E-value=54 Score=17.84 Aligned_cols=31 Identities=10% Similarity=0.101 Sum_probs=24.2
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhhHHHH
Q psy6890 5 HGRILLLEQKATKVLGVVFFTFVILWAPFFI 35 (116)
Q Consensus 5 ~~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v 35 (116)
.++..+++++-..++.+=+.+|++.+.-+..
T Consensus 10 ~~d~~~R~r~Y~i~M~~Ri~~fvlA~~~~~~ 40 (73)
T PF11298_consen 10 SQDQRRRRRRYLIMMGIRIPCFVLAAVVYRL 40 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4556677888888888888888888887754
No 43
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=42.73 E-value=51 Score=17.17 Aligned_cols=32 Identities=6% Similarity=0.073 Sum_probs=18.7
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHHH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIPS 41 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~ 41 (116)
+...+.+|.+++|..++++.=+-|..+.-+..
T Consensus 4 k~~~~mtriVLLISfiIlfgRl~Y~~I~a~~h 35 (59)
T PF11119_consen 4 KKNSRMTRIVLLISFIILFGRLIYSAIGAWVH 35 (59)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 44556667666666665555555665555543
No 44
>PF04238 DUF420: Protein of unknown function (DUF420); InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=41.85 E-value=81 Score=19.25 Aligned_cols=28 Identities=18% Similarity=0.215 Sum_probs=18.4
Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhHHHHH
Q psy6890 9 LLLEQKATKVLGVVFFTFVILWAPFFIL 36 (116)
Q Consensus 9 ~~~~~k~~k~~~~iv~~f~i~~~P~~v~ 36 (116)
.+..++.-...+...++|++++.-++..
T Consensus 31 ~~~Hr~~Ml~a~~ls~lFlv~Yl~~~~~ 58 (133)
T PF04238_consen 31 IKLHRKLMLTAFVLSALFLVSYLYYHFL 58 (133)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444555566677777888888766655
No 45
>PF00375 SDF: Sodium:dicarboxylate symporter family; InterPro: IPR001991 It has been shown [] that integral membrane proteins that mediate the uptake of a wide variety of molecules with the concomitant uptake of sodium ions (sodium symporters) can be grouped, on the basis of sequence and functional similarities into a number of distinct families. One of these families [] is known as the sodium:dicarboxylate symporter family (SDF). Such re-uptake of neurotransmitters from the synapses, is thought to be an important mechanism for terminating their action, by removing these chemicals from the synaptic cleft, and transporting them into presynaptic nerve terminals, and surrounding neuroglia. this removal is also believed to prevent them accumulating to the point of reaching neurotoxic [, ]. The structure of these transporter proteins has been variously reported to contain from 8 to 10 transmembrane (TM) regions, although 10 now seems to be the accepted value. Members of the family include: several mammalian excitatory amino acid transporters, and a number of bacterial transporters. They vary with regars to their dependence on transport of sodium, and other ions.; GO: 0017153 sodium:dicarboxylate symporter activity, 0006835 dicarboxylic acid transport, 0016020 membrane; PDB: 3V8G_B 1XFH_A 3KBC_B 2NWX_B 3V8F_B 2NWL_B 2NWW_A.
Probab=40.73 E-value=1.4e+02 Score=21.56 Aligned_cols=76 Identities=14% Similarity=0.151 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHH--HHHHHHHH--HHhhhhhhHHHHH-hccHHHHHHHHHhH
Q psy6890 14 KATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPG--IFEVATWL--GYASSMVNPIFYT-IFNKVFRTAFKKVL 88 (116)
Q Consensus 14 k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~--~~~~~~~l--~~~ns~~NPiiY~-~~~~~fr~~~~~~~ 88 (116)
........-++-++++..|+.+..+........+.+.... .+.+...+ ...-..+.|+++. +...+..+.++++.
T Consensus 167 ~~~~~~~~~ii~~i~~~~Pigv~~l~a~~~~~~~~~~l~~l~~~v~~~~~~~~i~~~v~~pl~~~~~~~~np~~~~~~~~ 246 (390)
T PF00375_consen 167 ESLNEVIMKIINWIMKLAPIGVFGLIANSIATQGLSILGALGKFVLTVYVALLIHLFVVLPLILFVLTRKNPFKFLKAML 246 (390)
T ss_dssp HHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHSSCCGHHHHHHHHHHHHHHHHHHHHHHTHHHHHH-TTT--HHHHHHHTH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHHHHHHHHHHHHHHHHHhhHHHhhhcCCCHHHHHHHHH
Confidence 3445667778889999999999988866543323322211 12222222 2223568999998 55556667676665
Q ss_pred h
Q psy6890 89 L 89 (116)
Q Consensus 89 ~ 89 (116)
.
T Consensus 247 ~ 247 (390)
T PF00375_consen 247 P 247 (390)
T ss_dssp H
T ss_pred H
Confidence 3
No 46
>COG3821 Predicted membrane protein [Function unknown]
Probab=39.78 E-value=26 Score=23.18 Aligned_cols=18 Identities=11% Similarity=0.335 Sum_probs=12.4
Q ss_pred HHHHHHHHHhhhHHHHHH
Q psy6890 20 GVVFFTFVILWAPFFILN 37 (116)
Q Consensus 20 ~~iv~~f~i~~~P~~v~~ 37 (116)
.+.+..|++||+-|.-+.
T Consensus 7 ~lA~~~F~~~W~~yt~~~ 24 (234)
T COG3821 7 LLALAVFLLCWLLYTPAS 24 (234)
T ss_pred HHHHHHHHHHHHhhhHHh
Confidence 455677888888776543
No 47
>PHA02849 putative transmembrane protein; Provisional
Probab=38.99 E-value=70 Score=17.69 Aligned_cols=24 Identities=33% Similarity=0.541 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHH
Q psy6890 18 VLGVVFFTFVILWAPFFILNLIPS 41 (116)
Q Consensus 18 ~~~~iv~~f~i~~~P~~v~~l~~~ 41 (116)
.+++.+.++.+|.+-|.++.+..+
T Consensus 17 v~vi~v~v~vI~i~~flLlyLvkw 40 (82)
T PHA02849 17 VTVILVFVLVISFLAFMLLYLIKW 40 (82)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455566667777777777666554
No 48
>PHA02657 hypothetical protein; Provisional
Probab=38.79 E-value=74 Score=17.93 Aligned_cols=24 Identities=21% Similarity=0.447 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHH
Q psy6890 18 VLGVVFFTFVILWAPFFILNLIPS 41 (116)
Q Consensus 18 ~~~~iv~~f~i~~~P~~v~~l~~~ 41 (116)
++++.+.++.+|.+-|.++.+..+
T Consensus 27 imVitvfv~vI~il~flLLYLvkW 50 (95)
T PHA02657 27 ILVFTIFIFVVCILIYLLIYLVDW 50 (95)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666677777777777766554
No 49
>TIGR01912 TatC-Arch Twin arginine targeting (Tat) protein translocase TatC, Archaeal clade. This model represents the TatC translocase component of the Sec-independent protein translocation system. This system is responsible for translocation of folded proteins, often with bound cofactors across the periplasmic membrane. A related model (TIGR00945) represents the bacterial clade of this family. TatC is often found (in bacteria) in a gene cluster with the two other components of the system, TatA/E (TIGR01411) and TatB (TIGR01410). A model also exists for the Twin-arginine signal sequence (TIGR01409).
Probab=37.79 E-value=1.1e+02 Score=20.57 Aligned_cols=31 Identities=16% Similarity=0.129 Sum_probs=23.5
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Q psy6890 10 LLEQKATKVLGVVFFTFVILWAPFFILNLIP 40 (116)
Q Consensus 10 ~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~ 40 (116)
+-++|..+.+..+++.+.+||.|++.-.+++
T Consensus 8 ELR~Rli~~~i~~~~~~~~~~~~~~~~~i~~ 38 (237)
T TIGR01912 8 TFRLILLLVALAIVTGSVLGWLTIFTPFIIA 38 (237)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4578899999999999999996665444433
No 50
>PF06210 DUF1003: Protein of unknown function (DUF1003); InterPro: IPR010406 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=37.75 E-value=87 Score=18.40 Aligned_cols=58 Identities=14% Similarity=0.164 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHH
Q psy6890 18 VLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFR 81 (116)
Q Consensus 18 ~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr 81 (116)
..+++.++|++.|.-.++..... . .-.++.+......++...+..=|+|-.--|++=+
T Consensus 5 ~Fi~~~~~~~~~Wi~~N~~~~~~---~---~fDpyPFilLnl~lS~~Aa~~ap~IlmsQNRq~~ 62 (108)
T PF06210_consen 5 TFIIIFTVFLAVWILLNILAPPR---P---AFDPYPFILLNLVLSLEAAYQAPLILMSQNRQAA 62 (108)
T ss_pred HHHHHHHHHHHHHHHHHhhcccc---C---CCCCccHHHHHHHHHHHHHHHHHHHHHHhhHhHH
Confidence 45667778888888777765543 1 1122345556667777777788998776665443
No 51
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=36.24 E-value=1.1e+02 Score=20.73 Aligned_cols=26 Identities=15% Similarity=0.084 Sum_probs=19.3
Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHH
Q psy6890 12 EQKATKVLGVVFFTFVILWAPFFILN 37 (116)
Q Consensus 12 ~~k~~k~~~~iv~~f~i~~~P~~v~~ 37 (116)
....+...+++++++++|..|.+=..
T Consensus 117 ~~~~l~~~~~~~~ila~~lFPlWP~~ 142 (232)
T TIGR00869 117 YMDYLIVILVVSIILALVLFPLWPRF 142 (232)
T ss_pred HHHHHHHHHHHHHHHHHhhcccChHH
Confidence 34456778888999999999966433
No 52
>PF04093 MreD: rod shape-determining protein MreD; InterPro: IPR007227 The MreD (murein formation D) protein is involved in bacterial cell shape determination. Most rod-shaped bacteria depend on MreB and RodA to achieve either a rod shape or some other non-spherical morphology such as coil or stalk formation. MreD is encoded in an operon with MreB, and often with RodA and PBP-2 as well. It is highly hydrophobic (therefore somewhat low-complexity) and highly divergent, and therefore cannot always be identified on the basis of sequence similarity.; GO: 0008360 regulation of cell shape, 0016021 integral to membrane
Probab=32.24 E-value=1.2e+02 Score=18.44 Aligned_cols=23 Identities=17% Similarity=0.193 Sum_probs=15.7
Q ss_pred HHHHhhhhhhHHHHHhccHHHHH
Q psy6890 60 WLGYASSMVNPIFYTIFNKVFRT 82 (116)
Q Consensus 60 ~l~~~ns~~NPiiY~~~~~~fr~ 82 (116)
.-...|+.++.++|-....=+||
T Consensus 137 p~~~~~~~~~~il~~~~~~l~~k 159 (160)
T PF04093_consen 137 PTLLLNALIAIILYPPVFKLLRK 159 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Confidence 33467788888888776665554
No 53
>PHA03234 DNA packaging protein UL33; Provisional
Probab=31.73 E-value=1.9e+02 Score=20.42 Aligned_cols=88 Identities=14% Similarity=0.143 Sum_probs=52.6
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhhhHHHHHHHHH------HhhCCCCC-cchHHHHHHHHHHHHhhhhhhHHHHHhccH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVILWAPFFILNLIP------SICSPCET-KIHPGIFEVATWLGYASSMVNPIFYTIFNK 78 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~------~~~~~~~~-~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 78 (116)
+++.++..|++-.++++.++...-+--..+...+. .-...... ........++..=..+|+.+=.++.-=+.+
T Consensus 229 ~~~~~k~~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~~nsclNPiIY~f~~~~FR~ 308 (338)
T PHA03234 229 EKKHKKTLFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPEIHCFSNPLVYAFTGGDFRL 308 (338)
T ss_pred hhhhhhhhhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHHhhhhhhHHHHHHhhHHHHH
Confidence 34556677887777777766655443333221110 00000011 122334456666677899998899999999
Q ss_pred HHHHHHHHhHhhhhc
Q psy6890 79 VFRTAFKKVLLCRYC 93 (116)
Q Consensus 79 ~fr~~~~~~~~~~~~ 93 (116)
++++.+++..+|..+
T Consensus 309 ~~~~~~~~~~~~~~~ 323 (338)
T PHA03234 309 RFTACFQDFFKCNLC 323 (338)
T ss_pred HHHHHHHHHhhhhhc
Confidence 999999998866643
No 54
>PF11036 YqgB: Virulence promoting factor; InterPro: IPR020196 The yqgB and yqfZ genes are associated with the genomes of bacteria with distinct pathogenic properties and consequently fall into the category of being virulence genes []. However, yqgB and yqfZ genes are not true virulence factors but instead are probably lifestyle determinant genes where the gene products act in concert, enabling the bacteria to cope with its suboptimal physical environment and thus facilitating host colonization [].
Probab=31.45 E-value=18 Score=17.24 Aligned_cols=10 Identities=30% Similarity=0.820 Sum_probs=7.4
Q ss_pred hHHHHHhccH
Q psy6890 69 NPIFYTIFNK 78 (116)
Q Consensus 69 NPiiY~~~~~ 78 (116)
||.+|++.++
T Consensus 18 ~~~vyGLLS~ 27 (43)
T PF11036_consen 18 NPAVYGLLSQ 27 (43)
T ss_pred CchhhhhHhh
Confidence 7788887764
No 55
>PF11014 DUF2852: Protein of unknown function (DUF2852); InterPro: IPR021273 This bacterial family of proteins has no known function.
Probab=30.47 E-value=1.3e+02 Score=18.06 Aligned_cols=22 Identities=18% Similarity=0.544 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHhhhHHHHHHHHH
Q psy6890 18 VLGVVFFTFVILWAPFFILNLIP 40 (116)
Q Consensus 18 ~~~~iv~~f~i~~~P~~v~~l~~ 40 (116)
.++++|+-|+++| |..+..|.+
T Consensus 11 ~Ia~mVlGFi~fW-PlGla~Lay 32 (115)
T PF11014_consen 11 WIAAMVLGFIVFW-PLGLALLAY 32 (115)
T ss_pred HHHHHHHHHHHHH-HHHHHHHHH
Confidence 4566777777755 666555544
No 56
>PF06454 DUF1084: Protein of unknown function (DUF1084); InterPro: IPR009457 This entry consists of several hypothetical plant specific proteins of unknown function.
Probab=29.40 E-value=2e+02 Score=19.96 Aligned_cols=61 Identities=15% Similarity=0.297 Sum_probs=32.5
Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHHH
Q psy6890 13 QKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIFNKVFRTAFKK 86 (116)
Q Consensus 13 ~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 86 (116)
+|-.+-+..+.++-.+|++--.++.++..+.+..+.+. ..+..+| ++|.+..+-....+--
T Consensus 202 ~kkl~~V~~vt~ic~~cF~ir~i~~~~~~~~~~~~~d~------------~~~~i~~-~iyy~i~EivPs~lvL 262 (281)
T PF06454_consen 202 SKKLRKVGFVTIICSVCFLIRCIMVLFSAFDKPANLDV------------LSHPILN-FIYYFITEIVPSALVL 262 (281)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHhhccccccch------------hhhHHHH-HHHHHHHHHHHHHHHH
Confidence 33445566666666677777777766655533211110 0122234 5777777777766633
No 57
>PF13800 Sigma_reg_N: Sigma factor regulator N-terminal
Probab=29.34 E-value=1.1e+02 Score=17.13 Aligned_cols=22 Identities=5% Similarity=0.111 Sum_probs=9.2
Q ss_pred hhhHHH-HHHHHHHHHHHHHhhhH
Q psy6890 10 LLEQKA-TKVLGVVFFTFVILWAP 32 (116)
Q Consensus 10 ~~~~k~-~k~~~~iv~~f~i~~~P 32 (116)
+.++|. .++.++.+++ ++.-+|
T Consensus 7 K~K~k~~l~~~~isi~~-~lvi~~ 29 (96)
T PF13800_consen 7 KAKRKSRLRTVVISIIS-ALVIFI 29 (96)
T ss_pred HHHHHHHHHHHHHHHhh-hhhhHH
Confidence 334444 3444444344 444444
No 58
>PF05391 Lsm_interact: Lsm interaction motif; InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=29.30 E-value=29 Score=14.05 Aligned_cols=9 Identities=44% Similarity=0.619 Sum_probs=5.6
Q ss_pred ccHHHHHHH
Q psy6890 76 FNKVFRTAF 84 (116)
Q Consensus 76 ~~~~fr~~~ 84 (116)
.|.+||+-+
T Consensus 11 SNddFrkmf 19 (21)
T PF05391_consen 11 SNDDFRKMF 19 (21)
T ss_pred chHHHHHHH
Confidence 466777654
No 59
>PF15050 SCIMP: SCIMP protein
Probab=27.51 E-value=1.1e+02 Score=18.53 Aligned_cols=33 Identities=18% Similarity=0.113 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHhhhhhhHHHHHhccHHHHHHHH
Q psy6890 53 GIFEVATWLGYASSMVNPIFYTIFNKVFRTAFK 85 (116)
Q Consensus 53 ~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 85 (116)
++..++..+..++..+--|+|+.....+|++=+
T Consensus 8 FWiiLAVaII~vS~~lglIlyCvcR~~lRqGkk 40 (133)
T PF15050_consen 8 FWIILAVAIILVSVVLGLILYCVCRWQLRQGKK 40 (133)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence 344455557777778888999998888887743
No 60
>PF06196 DUF997: Protein of unknown function (DUF997); InterPro: IPR010398 This is a family of predicted bacterial membrane protein with unknown function.
Probab=26.58 E-value=1.2e+02 Score=16.80 Aligned_cols=20 Identities=5% Similarity=0.172 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHHHHhhhH
Q psy6890 13 QKATKVLGVVFFTFVILWAP 32 (116)
Q Consensus 13 ~k~~k~~~~iv~~f~i~~~P 32 (116)
+|=++..+...++|++.|.-
T Consensus 4 ~rEA~~tl~l~l~yf~~W~~ 23 (80)
T PF06196_consen 4 NREARWTLGLTLIYFAWWYG 23 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444555666677777753
No 61
>COG4920 Predicted membrane protein [Function unknown]
Probab=25.78 E-value=2.1e+02 Score=19.17 Aligned_cols=30 Identities=10% Similarity=0.082 Sum_probs=13.6
Q ss_pred hhhhhhhhhHHHHHHHHHH----HHHHHHhhhHH
Q psy6890 4 RHGRILLLEQKATKVLGVV----FFTFVILWAPF 33 (116)
Q Consensus 4 ~~~~~~~~~~k~~k~~~~i----v~~f~i~~~P~ 33 (116)
+.++-.++..+..+-.... ..-|++.|.-+
T Consensus 84 kDee~l~E~~~~~~~~f~~m~~ml~yfIll~~f~ 117 (249)
T COG4920 84 KDEELLNEYKRFARASFMPMLTMLPYFILLATFL 117 (249)
T ss_pred hhHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHH
Confidence 3334445555555544333 33344555433
No 62
>PF04628 Sedlin_N: Sedlin, N-terminal conserved region; InterPro: IPR006722 Sedlin is a 140 amino-acid protein with a putative role in endoplasmic reticulum-to-Golgi transport. Several missense mutations and deletion mutations in the SEDL gene, which result in protein truncation by frame shift, are responsible for spondyloepiphyseal dysplasia tarda, a progressive skeletal disorder (OMIM:313400). [].; GO: 0006888 ER to Golgi vesicle-mediated transport, 0005622 intracellular; PDB: 3PR6_A 2J3W_A 1H3Q_A.
Probab=25.13 E-value=60 Score=19.52 Aligned_cols=32 Identities=16% Similarity=0.287 Sum_probs=22.4
Q ss_pred HHHHHHhhhhhhHHH---HHhccHHHHHHHHHhHh
Q psy6890 58 ATWLGYASSMVNPIF---YTIFNKVFRTAFKKVLL 89 (116)
Q Consensus 58 ~~~l~~~ns~~NPii---Y~~~~~~fr~~~~~~~~ 89 (116)
...-+++....||+- =-+.++.|.+.++++.+
T Consensus 97 ~vh~~Y~~~~~NPF~~~~~~I~S~~Fd~~v~~l~~ 131 (132)
T PF04628_consen 97 EVHELYVKALCNPFYQPGTPIKSPKFDSRVRALAK 131 (132)
T ss_dssp HHHHHHHHHHTSTTCGCT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHccCCCCCCCCCcCCHHHHHHHHHHhc
Confidence 344567788888885 34667888888887763
No 63
>PF10445 DUF2456: Protein of unknown function (DUF2456); InterPro: IPR018852 This entry represents a family of uncharacterised proteins.
Probab=24.54 E-value=1.5e+02 Score=17.03 Aligned_cols=29 Identities=24% Similarity=0.346 Sum_probs=22.3
Q ss_pred hhHHHHHHHHHHHHHHHHhhhHHHHHHHH
Q psy6890 11 LEQKATKVLGVVFFTFVILWAPFFILNLI 39 (116)
Q Consensus 11 ~~~k~~k~~~~iv~~f~i~~~P~~v~~l~ 39 (116)
--+++.|-++.-++.|++.|-|...+..-
T Consensus 25 lv~~~~rgli~av~~f~~~WP~tigIla~ 53 (94)
T PF10445_consen 25 LVQQAIRGLILAVLIFLLLWPITIGILAA 53 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHhhc
Confidence 34677888899999999999887655443
No 64
>PF10601 zf-LITAF-like: LITAF-like zinc ribbon domain; InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure [].
Probab=24.52 E-value=73 Score=16.90 Aligned_cols=8 Identities=25% Similarity=0.347 Sum_probs=4.1
Q ss_pred HhhhHHHH
Q psy6890 28 ILWAPFFI 35 (116)
Q Consensus 28 i~~~P~~v 35 (116)
++|+|+..
T Consensus 44 ~~~iP~~~ 51 (73)
T PF10601_consen 44 CCCIPFCC 51 (73)
T ss_pred HhhHhhcc
Confidence 35555544
No 65
>PF13071 DUF3935: Protein of unknown function (DUF3935)
Probab=24.47 E-value=1.2e+02 Score=15.91 Aligned_cols=27 Identities=11% Similarity=0.405 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHh
Q psy6890 16 TKVLGVVFFTFVILWAPFFILNLIPSI 42 (116)
Q Consensus 16 ~k~~~~iv~~f~i~~~P~~v~~l~~~~ 42 (116)
+|.+.-+++.|++.|.-..-+.+...+
T Consensus 4 lKqi~Gi~IsfFVFwfsmLGVqmfaeF 30 (68)
T PF13071_consen 4 LKQIFGIIISFFVFWFSMLGVQMFAEF 30 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 467778888999999887665554433
No 66
>PF03839 Sec62: Translocation protein Sec62; InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=24.09 E-value=2.3e+02 Score=19.04 Aligned_cols=62 Identities=8% Similarity=-0.062 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCCCCCcchHHHHHHHHHHHHhhhhhhHHHHHhc
Q psy6890 13 QKATKVLGVVFFTFVILWAPFFILNLIPSICSPCETKIHPGIFEVATWLGYASSMVNPIFYTIF 76 (116)
Q Consensus 13 ~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~ 76 (116)
...+..++++++++++|..|.+=..+-...-. -......+..+...++.+-.++--++|.+.
T Consensus 110 ~~~l~~~~~~~~v~a~~lFPlWP~~~r~gv~Y--lS~~~lgll~~~~~laivRlilf~i~w~~~ 171 (224)
T PF03839_consen 110 MQYLIGALLLVGVIAICLFPLWPRWMRQGVYY--LSVGALGLLGLFFALAIVRLILFLITWFFT 171 (224)
T ss_pred HHHHHHHHHHHHHHHHHhhhcChHHHhheeeh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34556678888888999999764433221100 000011123334444555555555666654
No 67
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=24.08 E-value=2.2e+02 Score=19.11 Aligned_cols=23 Identities=17% Similarity=0.127 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHH
Q psy6890 16 TKVLGVVFFTFVILWAPFFILNL 38 (116)
Q Consensus 16 ~k~~~~iv~~f~i~~~P~~v~~l 38 (116)
..=+++|.++|+||-+-+....+
T Consensus 131 LIClIIIAVLfLICT~LfLSTVV 153 (227)
T PF05399_consen 131 LICLIIIAVLFLICTLLFLSTVV 153 (227)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445556677777666554433
No 68
>COG4420 Predicted membrane protein [Function unknown]
Probab=23.48 E-value=2.2e+02 Score=18.63 Aligned_cols=31 Identities=10% Similarity=0.016 Sum_probs=22.8
Q ss_pred cchHHHHHHHHHHHHhhhhhhHHHHHhccHH
Q psy6890 49 KIHPGIFEVATWLGYASSMVNPIFYTIFNKV 79 (116)
Q Consensus 49 ~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~ 79 (116)
...+.+..+..+++...+..-|+|..--|++
T Consensus 84 wDpyPFi~LnLllS~~AaiqAp~IlmSQNRQ 114 (191)
T COG4420 84 WDPYPFILLNLLLSTLAAIQAPLILMSQNRQ 114 (191)
T ss_pred CCCccHHHHHHHHHHHHHHHHhHHHHHHhHH
Confidence 3456666777778888888999988766654
No 69
>PF01679 Pmp3: Proteolipid membrane potential modulator; InterPro: IPR000612 Proteolipid membrane potential modulator is an evolutionarily conserved proteolipid in the plasma membrane which, in S. pombe, is transcriptionally regulated by the Spc1 stress MAPK (mitogen-activated protein kinases) pathway. It functions to modulate the membrane potential, particularly to resist high cellular cation concentration. In eukaryotic organisms, stress-activated mitogen-activated protein kinases play crucial roles in transmitting environmental signals that will regulate gene expression for allowing the cell to adapt to cellular stress. Pmp3-like proteins are highly conserved in bacteria, yeast, nematode and plants. Proteins in this entry include the PMP3 as well as several other proteins that have been shown [] to be evolutionary related. These are small proteins of from 52 to 140 amino-acid resiudes that contain two transmembrane domains and belong to the UPF0057 (PMP3) protein family.; GO: 0016021 integral to membrane
Probab=23.17 E-value=84 Score=15.74 Aligned_cols=23 Identities=22% Similarity=0.336 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHH
Q psy6890 16 TKVLGVVFFTFVILWAPFFILNL 38 (116)
Q Consensus 16 ~k~~~~iv~~f~i~~~P~~v~~l 38 (116)
.+.+.+-++..++.|.|-.+..+
T Consensus 24 ~~~~~inl~Ltl~g~iPg~ihA~ 46 (51)
T PF01679_consen 24 SKDFWINLLLTLLGWIPGVIHAL 46 (51)
T ss_pred chhhHHHHHHHHHHHHHHHHHee
Confidence 44556677778888998665543
No 70
>KOG1726|consensus
Probab=23.17 E-value=1.2e+02 Score=20.37 Aligned_cols=13 Identities=31% Similarity=1.047 Sum_probs=10.1
Q ss_pred HHHHHHHhhhHHH
Q psy6890 22 VFFTFVILWAPFF 34 (116)
Q Consensus 22 iv~~f~i~~~P~~ 34 (116)
.+.-|+++|+|++
T Consensus 50 ~~~d~~lsw~P~Y 62 (225)
T KOG1726|consen 50 TLTDFLLSWFPFY 62 (225)
T ss_pred HHHHHHHHHhhhH
Confidence 4556789999987
No 71
>PF11286 DUF3087: Protein of unknown function (DUF3087); InterPro: IPR021438 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=23.08 E-value=2.1e+02 Score=18.26 Aligned_cols=34 Identities=18% Similarity=0.182 Sum_probs=17.4
Q ss_pred hhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhCC
Q psy6890 11 LEQKATKVLGVVFFTFVILWAPFFILNLIPSICSP 45 (116)
Q Consensus 11 ~~~k~~k~~~~iv~~f~i~~~P~~v~~l~~~~~~~ 45 (116)
-++++-+.++.+++.|++.-+-+ -..++..|+.+
T Consensus 11 YRk~~n~v~~~~v~~lai~sl~~-s~llI~lFg~~ 44 (165)
T PF11286_consen 11 YRKHLNRVIVACVASLAILSLAF-SQLLIALFGGE 44 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHcCCC
Confidence 34444445555555555554444 44455666653
No 72
>PF07325 Curto_V2: Curtovirus V2 protein; InterPro: IPR009931 This family consists of several Curtovirus V2 proteins. The exact function of V2 is unclear but it is known that the protein is required for a successful host infection process [].
Probab=22.78 E-value=84 Score=18.54 Aligned_cols=13 Identities=31% Similarity=0.457 Sum_probs=10.7
Q ss_pred HHHHHHHHhHhhh
Q psy6890 79 VFRTAFKKVLLCR 91 (116)
Q Consensus 79 ~fr~~~~~~~~~~ 91 (116)
+|+++++++++..
T Consensus 52 qFQKevKKLLk~k 64 (126)
T PF07325_consen 52 QFQKEVKKLLKRK 64 (126)
T ss_pred HHHHHHHHHHHHh
Confidence 6999999998644
No 73
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=22.03 E-value=1.4e+02 Score=17.40 Aligned_cols=19 Identities=21% Similarity=0.261 Sum_probs=15.4
Q ss_pred hhHHHHHHHHHHHHHHHHh
Q psy6890 11 LEQKATKVLGVVFFTFVIL 29 (116)
Q Consensus 11 ~~~k~~k~~~~iv~~f~i~ 29 (116)
.-+|..|.+..++.+|++.
T Consensus 35 Alkk~ikvvl~liGvyils 53 (109)
T COG2383 35 ALKKAIKVVLLLIGVYILS 53 (109)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3578899999999988764
No 74
>PF04612 T2SM: Type II secretion system (T2SS), protein M; InterPro: IPR007690 General secretion pathway (GSP) protein M is a membrane protein involved in the export of proteins in bacteria. It consists of a short cytosolic N-terminal domain, a transmembrane domain, and a C-terminal periplasmic domain. The precise function of this protein is unknown, though in Vibrio cholerae, the EpsM protein interacts with the EpsL protein, and also forms homodimers [],; GO: 0006858 extracellular transport; PDB: 1UV7_A.
Probab=21.95 E-value=30 Score=21.27 Aligned_cols=23 Identities=26% Similarity=0.349 Sum_probs=0.0
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHH
Q psy6890 6 GRILLLEQKATKVLGVVFFTFVI 28 (116)
Q Consensus 6 ~~~~~~~~k~~k~~~~iv~~f~i 28 (116)
.....+|++++...+++++++++
T Consensus 9 ~~ls~REr~ll~~~~~~l~~~l~ 31 (160)
T PF04612_consen 9 QSLSPRERRLLLVLGVVLLLALL 31 (160)
T ss_dssp -----------------------
T ss_pred HhCCHHHHHHHHHHHHHHHHHHH
Confidence 34556777777666555555544
No 75
>PF11587 Prion_bPrPp: Major prion protein bPrPp - N terminal; PDB: 1SKH_A.
Probab=21.89 E-value=92 Score=13.69 Aligned_cols=14 Identities=7% Similarity=0.361 Sum_probs=7.7
Q ss_pred hhhHHHHHHHHHHh
Q psy6890 29 LWAPFFILNLIPSI 42 (116)
Q Consensus 29 ~~~P~~v~~l~~~~ 42 (116)
||+-...+..+...
T Consensus 8 cWilvLfvatwsdv 21 (29)
T PF11587_consen 8 CWILVLFVATWSDV 21 (29)
T ss_dssp THHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhh
Confidence 56655555555544
No 76
>PF10842 DUF2642: Protein of unknown function (DUF2642); InterPro: IPR020139 This entry contains proteins with no known function.
Probab=21.57 E-value=35 Score=18.21 Aligned_cols=11 Identities=36% Similarity=0.718 Sum_probs=8.5
Q ss_pred hhhhhhHHHHH
Q psy6890 64 ASSMVNPIFYT 74 (116)
Q Consensus 64 ~ns~~NPiiY~ 74 (116)
..|++||.+|-
T Consensus 4 ~vs~vdpyvyq 14 (66)
T PF10842_consen 4 MVSLVDPYVYQ 14 (66)
T ss_pred ceeccCHHHHH
Confidence 46788999885
No 77
>COG1687 AzlD Predicted branched-chain amino acid permeases (azaleucine resistance) [Amino acid transport and metabolism]
Probab=21.20 E-value=1.8e+02 Score=17.06 Aligned_cols=22 Identities=23% Similarity=0.407 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHhhhHHHHH
Q psy6890 15 ATKVLGVVFFTFVILWAPFFIL 36 (116)
Q Consensus 15 ~~k~~~~iv~~f~i~~~P~~v~ 36 (116)
.+-+..+++++|+.=++||.++
T Consensus 6 ~l~i~v~~v~t~~~RilPF~if 27 (106)
T COG1687 6 ILTIIVIAVGTFLTRILPFLIF 27 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 4456677888999999999876
No 78
>COG4758 Predicted membrane protein [Function unknown]
Probab=21.01 E-value=2.8e+02 Score=18.84 Aligned_cols=25 Identities=16% Similarity=0.123 Sum_probs=17.5
Q ss_pred cchHHHHHHHHHHHHhhhhhhHHHH
Q psy6890 49 KIHPGIFEVATWLGYASSMVNPIFY 73 (116)
Q Consensus 49 ~~~~~~~~~~~~l~~~ns~~NPiiY 73 (116)
.........+..+.+....+||+.-
T Consensus 48 ~~~~~~~i~a~~lf~~i~~li~F~l 72 (235)
T COG4758 48 SRASVLKIIALFLFFAIVFLIPFTL 72 (235)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455567777788888888888653
No 79
>PF02295 z-alpha: Adenosine deaminase z-alpha domain; InterPro: IPR000607 Double-stranded RNA-specific adenosine deaminase (3.5 from EC) converts multiple adenosines to inosines and creates I/U mismatched base pairs in double-helical RNA substrates without apparent sequence specificity. DRADA has been found to modify adenosines in AU-rich regions more frequently, probably due to the relative ease of melting A/U base pairs compared to G/C base pairs. The protein functions to modify viral RNA genomes, and may be responsible for hypermutation of certain negative-stranded viruses. DRADA edits the mRNAs for the glutamate receptor subunits by site-selective adenosine deamination. The DRADA repeat is also found in viral E3 proteins, which contain a double-stranded RNA-binding domain.; GO: 0003723 RNA binding, 0003726 double-stranded RNA adenosine deaminase activity; PDB: 1OYI_A 3EYI_A 2L4M_A 2HEO_D 1J75_A 1SFU_B 3IRR_B 2ACJ_C 3F22_B 2L54_A ....
Probab=20.85 E-value=63 Score=17.04 Aligned_cols=13 Identities=23% Similarity=0.434 Sum_probs=10.1
Q ss_pred hhhhhHHHHHhcc
Q psy6890 65 SSMVNPIFYTIFN 77 (116)
Q Consensus 65 ns~~NPiiY~~~~ 77 (116)
-.-+|+++|.+-.
T Consensus 34 kk~VN~~LY~L~k 46 (66)
T PF02295_consen 34 KKEVNRVLYRLEK 46 (66)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 5779999998643
No 80
>TIGR02052 MerP mercuric transport protein periplasmic component. This model represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding. MerP is related to a larger family of metal binding proteins (pfam00403).
Probab=20.49 E-value=94 Score=16.15 Aligned_cols=18 Identities=22% Similarity=0.453 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHhhhH
Q psy6890 15 ATKVLGVVFFTFVILWAP 32 (116)
Q Consensus 15 ~~k~~~~iv~~f~i~~~P 32 (116)
++-.+...+.+|...|.|
T Consensus 4 ~~~~~~~~~~~~~~~~~~ 21 (92)
T TIGR02052 4 LATLLALFVLTSLPAWAA 21 (92)
T ss_pred HHHHHHHHHHhcchhhhc
Confidence 333344444444444443
No 81
>PF15086 UPF0542: Uncharacterised protein family UPF0542
Probab=20.33 E-value=1.7e+02 Score=15.99 Aligned_cols=18 Identities=22% Similarity=0.622 Sum_probs=9.4
Q ss_pred HHHHHHhhhHHHHHHHHH
Q psy6890 23 FFTFVILWAPFFILNLIP 40 (116)
Q Consensus 23 v~~f~i~~~P~~v~~l~~ 40 (116)
+..+++|-+|++++..+.
T Consensus 23 l~~vll~LtPlfiisa~l 40 (74)
T PF15086_consen 23 LTTVLLILTPLFIISAVL 40 (74)
T ss_pred HHHHHHHHhHHHHHHHHH
Confidence 344555666666554443
No 82
>PF12166 DUF3595: Protein of unknown function (DUF3595); InterPro: IPR021999 This family of proteins is functionally uncharacterised.This family of proteins is found in eukaryotes. Proteins in this family are typically between 578 and 2525 amino acids in length.
Probab=20.31 E-value=48 Score=24.12 Aligned_cols=25 Identities=12% Similarity=0.422 Sum_probs=20.3
Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHhh
Q psy6890 19 LGVVFFTFVILWAPFFILNLIPSIC 43 (116)
Q Consensus 19 ~~~iv~~f~i~~~P~~v~~l~~~~~ 43 (116)
..+++++.++-|+|..++......+
T Consensus 68 ~~~~~~li~iiw~PLllfS~~n~~~ 92 (422)
T PF12166_consen 68 GLLLLLLIIIIWFPLLLFSSGNPVG 92 (422)
T ss_pred HHHHHHHHHHHHHHHHHHhhcCccc
Confidence 5677788888999999998876654
Done!