Query psy7158
Match_columns 402
No_of_seqs 181 out of 675
Neff 6.6
Searched_HMMs 46136
Date Fri Aug 16 19:39:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7158.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7158hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1162|consensus 100.0 8.2E-69 1.8E-73 557.8 27.2 355 27-393 127-560 (617)
2 PF03124 EXS: EXS family; Int 100.0 9.4E-44 2E-48 356.2 15.9 226 163-391 2-311 (345)
3 COG5409 EXS domain-containing 100.0 4.1E-29 8.9E-34 241.8 8.6 246 131-386 4-328 (384)
4 PF03105 SPX: SPX domain; Int 99.1 1.8E-10 3.8E-15 110.3 5.7 58 20-77 217-275 (275)
5 KOG1161|consensus 99.0 4.5E-10 9.7E-15 109.4 6.9 75 33-108 102-177 (310)
6 COG5408 SPX domain-containing 98.1 3.3E-06 7.1E-11 82.4 5.4 51 25-75 239-290 (296)
7 COG5036 SPX domain-containing 98.1 4.3E-06 9.4E-11 84.0 5.6 75 24-107 93-168 (509)
8 PF06570 DUF1129: Protein of u 86.2 29 0.00063 32.3 17.9 16 237-252 175-190 (206)
9 KOG4031|consensus 79.9 13 0.00029 34.8 8.9 66 2-67 106-176 (216)
10 COG0598 CorA Mg2+ and Co2+ tra 66.9 78 0.0017 31.5 11.7 34 28-61 177-211 (322)
11 PF10669 Phage_Gp23: Protein g 50.6 37 0.00079 28.2 4.9 22 8-30 49-70 (121)
12 COG5522 Predicted integral mem 38.7 3.5E+02 0.0077 25.7 11.3 75 114-197 110-185 (236)
13 PF11143 DUF2919: Protein of u 34.9 1.9E+02 0.0042 25.7 7.5 59 127-188 53-112 (149)
14 PF15038 Jiraiya: Jiraiya 33.3 1E+02 0.0022 28.3 5.5 58 131-191 110-167 (175)
15 PF03904 DUF334: Domain of unk 31.1 4.3E+02 0.0093 25.4 9.4 18 50-67 95-112 (230)
16 TIGR00383 corA magnesium Mg(2+ 29.7 5.4E+02 0.012 25.1 13.3 32 28-59 172-204 (318)
17 PF08367 M16C_assoc: Peptidase 27.8 86 0.0019 29.9 4.4 45 1-45 1-46 (248)
18 KOG4326|consensus 26.6 74 0.0016 24.9 2.9 31 6-36 43-74 (81)
19 PF04644 Motilin_ghrelin: Moti 25.9 56 0.0012 20.8 1.8 16 10-26 8-23 (28)
20 PF09925 DUF2157: Predicted me 25.8 4.4E+02 0.0095 22.8 12.1 7 224-230 107-113 (145)
21 COG4062 MtrB Tetrahydromethano 24.3 1.7E+02 0.0036 24.5 4.7 70 77-148 25-98 (108)
22 PF02268 TFIIA_gamma_N: Transc 24.2 1E+02 0.0022 22.3 3.1 31 33-65 15-45 (49)
23 COG5110 RPN1 26S proteasome re 23.2 50 0.0011 35.7 1.9 61 9-69 28-112 (881)
24 PF01086 Clathrin_lg_ch: Clath 23.0 1.4E+02 0.003 28.4 4.7 62 2-63 112-179 (225)
25 PF07716 bZIP_2: Basic region 22.2 2E+02 0.0043 20.7 4.4 31 10-40 21-51 (54)
26 TIGR02894 DNA_bind_RsfA transc 21.8 1.7E+02 0.0036 26.6 4.7 27 17-43 121-147 (161)
27 PF06160 EzrA: Septation ring 21.1 5.5E+02 0.012 27.8 9.4 91 16-117 103-198 (560)
28 PF10669 Phage_Gp23: Protein g 20.3 4E+02 0.0086 22.2 6.1 10 11-20 45-54 (121)
29 PF04930 FUN14: FUN14 family; 20.1 79 0.0017 26.0 2.2 34 104-138 66-99 (100)
No 1
>KOG1162|consensus
Probab=100.00 E-value=8.2e-69 Score=557.82 Aligned_cols=355 Identities=28% Similarity=0.425 Sum_probs=299.1
Q ss_pred HHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHhhcc
Q psy7158 27 EEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSELEE 105 (402)
Q Consensus 27 ~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~f~~ 105 (402)
+....++.|+|+.|.+||+ ++||.|||+||+|||||+++++. ..| .+.|++++|.++|++++++.+||++|+++|++
T Consensus 127 ~~lk~af~Efy~~L~llk~y~~lN~~~f~KI~KKyDK~~~~~~-~~~-~~~v~~s~f~~~~~i~~l~~~Ve~~f~~~fan 204 (617)
T KOG1162|consen 127 EKLKLAFSEFYLKLRLLKNYQFLNVTAFRKILKKYDKITSRDA-KRY-VKMVDKSYFTSSDEITRLMLEVEETFTKHFAN 204 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccch-HHH-HHHHHHHhcccHHHHHHHHHHHHHHHHHHHhC
Confidence 3445689999999999997 99999999999999999999998 787 88999999999999999999999999999999
Q ss_pred CcHHHHHhhcCCCCCCCCCCC--chhhHHHHHHHHHHHHHHHHHHHhhccCCchh-hhHHHhhhhhHHHHHHHHHHHHHH
Q psy7158 106 GDRQKAMKRLRVPPLGEQQSP--WTTFKVGLFLGCFAILFAIILLRVFFDTSQED-FKMAMRLYRGPLLIIQFIFLMGVN 182 (402)
Q Consensus 106 gdrk~A~~~LR~~~~~~~~~~--~~~F~~GlflG~s~~lli~~li~~~~~~~~~~-~~~~~~i~r~~~l~~l~~~l~g~n 182 (402)
|||++||+.||.+ .++++.+ ...|.+|.++|+++++.+.+-+..+..++++. ...++|+|+ +++++++++++|+|
T Consensus 205 ~nr~~~m~~lr~~-~~e~h~~~~~~~f~~g~~~~l~val~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~v~l~~fl~~~n 282 (617)
T KOG1162|consen 205 GNRRKAMKVLRPK-LKEKHRPTFSTGFFVGCGIGLSVALVALIYLRNILQSEQRFYMETMFPLYG-FGLVVLHKFLYNVN 282 (617)
T ss_pred CChhHhhhhcCCc-ccccCCCchhHHHHHHHHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHH-HHHHHHHHHHhcCc
Confidence 9999999999998 4443322 44555555555555555444444444333222 455777777 99999999999999
Q ss_pred HHHHhhcCCCceEEEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccchhHHHHHHHHHHHHcCCcccch
Q psy7158 183 VYGWRSSGVNHVLIFELDPRNHLSEQHLMEISLIMGNVWALSLLCFFFSHVFSIPPYSQPVALLLIMLAFLLNPSRTLKY 262 (402)
Q Consensus 183 ~~iw~~~~INy~~Ife~d~r~~~~~~~~~~~a~~~~~~~~lsl~~~~~~~~~~~~~~~~Pl~~~~~~~~~l~~P~~i~~~ 262 (402)
+++|+++||||++|||+|+++++.+++++++++..+++|+++++.++.....+.++++.|+++++++++++++|++++|+
T Consensus 283 iy~W~~~rVNy~fIf~~~~~~~l~~~~~l~i~~~~~~~~~l~~l~~l~~~~~~~~~~~~Pl~ll~~~~~~L~~Pf~~fY~ 362 (617)
T KOG1162|consen 283 IYEWSRTRVNYKFIFEFDQRTELGYRDILLIHNTNGILEFLPVLKNLDMSMSGQTTELSPLILLLLFFFLLVCPFNTFYR 362 (617)
T ss_pred hHHHHHhcCCceeeecCCccccccHHHHHHHHHHHHHHHHHHHHHHhccccCCCCcccchHHHHHHHHheeeccchhhhH
Confidence 99999999999999999999999999999999999999999999998876666678899999999999999999999999
Q ss_pred hhhhhHhhhc------cc----------c--hhh---h------hhhhhcC-------------------------C---
Q psy7158 263 DARFWLLRVT------PV----------A--LLL---I------MLAFLLN-------------------------P--- 287 (402)
Q Consensus 263 ~~R~~~lr~l------Pf----------g--L~s---s------~~C~y~~-------------------------P--- 287 (402)
++|+|+++++ |+ | ++| + ++|+|.. |
T Consensus 363 sSRf~ll~~l~rvi~spl~~V~~~DFfl~Dql~S~v~a~~~l~~~~C~y~~~~~~~~~~~~~~~~~~~~~~iva~lP~~~ 442 (617)
T KOG1162|consen 363 SSRFWLLKRLFRVLSSPLYKVLFVDFFLADQLTSLVLALRDLEFFICYYGTGDFQARRTCYCKDDYIEFQSIVAVLPYWF 442 (617)
T ss_pred hhHHHHHHHHHHHHhccceeeccccccHHHHHHHHHHHHHhHHHhheeecccccccccccccccchHHHHHHHHHHHHHH
Confidence 9999999776 55 2 333 2 8888871 1
Q ss_pred ----------------cc--chhhhhHHHHHHHH--hhccCCCCCCchhHHHHHHHHHhhhhhhhhhhhccccCCCCCCC
Q psy7158 288 ----------------SR--TLKYDARFWLLRVT--VKKDQSVMDNPFFYPWLAASVISSIYSYIWDIKMDWGLFDKNAP 347 (402)
Q Consensus 288 ----------------~l--a~KY~~~~~~l~~~--~~~~~~~~~~~~~~~wi~~~~i~S~Ys~~WDv~mDWgL~~~~~~ 347 (402)
++ |+||++++++.++. ++.+.+ .+.++++|++++.+||+|+++||++|||||++++
T Consensus 443 RfLQClRR~~d~~~~~hL~NAlKY~~~i~~v~~~~~y~~~~~--~~~~~~l~~~~s~vaS~y~~~WDvv~DWgLl~~~-- 518 (617)
T KOG1162|consen 443 RFLQCLRRYRDEKAFPHLLNALKYSTTILAVMLTTLYRILPG--SSLWFALWILSSKVASLYTTYWDVVMDWGLLNRK-- 518 (617)
T ss_pred HHHHHHHHHHhccchHhHHHHHHHHHHHHHHHHHHHHHHcCC--cchHHHHHHHHHHHHHHHHHHHHHheeccccccc--
Confidence 23 99999999995444 333321 1589999999999999999999999999999988
Q ss_pred CCCCCCCccccccCCccchhhhhhhhhccccccchhhhhchHHHhh
Q psy7158 348 PDNPFLREETVYSSTEMKALLYQAMEEAPSLETTEPEVISRYFRMF 393 (402)
Q Consensus 348 ~~~~~LR~~l~y~~~~~~~~Yy~Aiv~n~ilR~~W~~~l~~~~~~f 393 (402)
++ |+||++++||+ |++||+||+.|+++|++|++.+...++..
T Consensus 519 ~~-~~lRd~l~~p~---k~vYy~aiv~N~vLR~aW~~~~i~~~~~~ 560 (617)
T KOG1162|consen 519 SK-PWLRDNLLYPQ---KWVYYSAIVLNFVLRFAWFFKTILVFHEE 560 (617)
T ss_pred Cc-hhhHHHhcccc---hheehhHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45 99999999999 89999999999999999988877666543
No 2
>PF03124 EXS: EXS family; InterPro: IPR004342 The EXS domain is named after ERD1/XPR1/SYG1 and proteins containing this motif include the C-terminal of the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisiae, and sequences that are thought to be Murine leukemia virus (MLV) receptors (XPR1. The N-terminal of these proteins often have an SPX domain (IPR004331 from INTERPRO) []. While the N-terminal is thought to be involved in signal transduction, the role of the C-terminal is not known. This region of similarity contains several predicted transmembrane helices. This family also includes the ERD1 (ERD: ER retention defective) S. cerevisiae proteins. ERD1 proteins are involved in the localization of endogenous endoplasmic reticulum (ER) proteins. Erd1 null mutants secrete such proteins even though they possess the C-terminal HDEL ER lumen localization label sequence. In addition, null mutants also exhibit defects in the Golgi-dependent processing of several glycoproteins, which led to the suggestion that the sorting of luminal ER proteins actually occurs in the Golgi, with subsequent return of these proteins to the ER via `salvage' vesicles [].; GO: 0016021 integral to membrane
Probab=100.00 E-value=9.4e-44 Score=356.21 Aligned_cols=226 Identities=33% Similarity=0.553 Sum_probs=192.2
Q ss_pred HhhhhhHHHHHHHHHHHHHHHHHHhhcCCCceEEEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC--C---
Q psy7158 163 MRLYRGPLLIIQFIFLMGVNVYGWRSSGVNHVLIFELDPRNHLSEQHLMEISLIMGNVWALSLLCFFFSHVFSI--P--- 237 (402)
Q Consensus 163 ~~i~r~~~l~~l~~~l~g~n~~iw~~~~INy~~Ife~d~r~~~~~~~~~~~a~~~~~~~~lsl~~~~~~~~~~~--~--- 237 (402)
+|+||+++++++++|+||+|+++|+++||||++|||+||+++++++|+++++++++++|++++.+++.....+. +
T Consensus 2 ~~~~R~~~L~~l~~~l~~~nl~v~~~~~Iny~~If~~~~~~~~~~~~~~~~a~~~~~~~~l~~~~~l~~~~~~~~~~~~~ 81 (345)
T PF03124_consen 2 PPPFRGLLLLILGLWLWGINLYVWSRYRINYVFIFELDPRNSLSYRQLFELASFLTIIWLLCFLIYLASVSPSIISFANW 81 (345)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHcCcCHHHHhcCCcccCCcHHHHHhhhHHHHHHHHHHHHHHHHHHcCCcccccch
Confidence 57899999999999999999999999999999999999999999999999999999999999988886543322 1
Q ss_pred --ccchhHHHHHHHHHHHHcCCcccchhhhhhHhhhc------cc----------c--hhhh---------hhhhhc---
Q psy7158 238 --PYSQPVALLLIMLAFLLNPSRTLKYDARFWLLRVT------PV----------A--LLLI---------MLAFLL--- 285 (402)
Q Consensus 238 --~~~~Pl~~~~~~~~~l~~P~~i~~~~~R~~~lr~l------Pf----------g--L~ss---------~~C~y~--- 285 (402)
.+++|+++++++++++++|++++++++|+|+++++ |+ | |||. ++|+|.
T Consensus 82 ~~~~~~Pll~~~~~~~~l~~P~~~~~~~~R~~~~~~l~ril~~~~~~v~f~d~~laD~LtS~~~~l~D~~~~~c~~~~~~ 161 (345)
T PF03124_consen 82 YFVEYIPLLLLLILLLLLFFPFNIFYRSSRRWFLRTLKRILLAPFYPVRFRDFFLADQLTSLSKVLGDLEFTICYYFSGS 161 (345)
T ss_pred hhHHHhhHHHHHHHHHHhhcccchhhhhHHHHHHHHHHHHHhcccccchHHHHHHHHHHHhHHHHHHHHHHHHHHHhccc
Confidence 13689999999999999999999999999999887 44 1 5552 788776
Q ss_pred --C-------------------C--------------------cc--chhhhhHHHHH-HHHhhcc--CCCCCCchhHHH
Q psy7158 286 --N-------------------P--------------------SR--TLKYDARFWLL-RVTVKKD--QSVMDNPFFYPW 319 (402)
Q Consensus 286 --~-------------------P--------------------~l--a~KY~~~~~~l-~~~~~~~--~~~~~~~~~~~w 319 (402)
+ | |+ |+||++|++++ ..+.... ++.+....+.+|
T Consensus 162 ~~~~~~~c~~~~~~~~~~~~~lP~~iR~~QClrry~~~~~~~~hL~Na~KY~~~~~v~~~~~~~~~~~~~~~~~~~~~~w 241 (345)
T PF03124_consen 162 FTSPSNQCGSSSWYIDPIVASLPYWIRFLQCLRRYRDTGDRFPHLFNALKYSSSIPVIILSALYRFYPSSDSSIWLFILW 241 (345)
T ss_pred ccCCCCcCccccHhHHhHHHHHHHHHHHHHHHHHHHhCCCCchhHHHHHHHHHHHHHHHHHHHHHhcCCccchhhHHHHH
Confidence 1 1 12 99999999994 3333222 111223578999
Q ss_pred HHHHHHhhhhhhhhhhhccccCCCCCCCCCCCCCCccccccCCccchhhhhhhhhccccccchhhhhch-HHH
Q psy7158 320 LAASVISSIYSYIWDIKMDWGLFDKNAPPDNPFLREETVYSSTEMKALLYQAMEEAPSLETTEPEVISR-YFR 391 (402)
Q Consensus 320 i~~~~i~S~Ys~~WDv~mDWgL~~~~~~~~~~~LR~~l~y~~~~~~~~Yy~Aiv~n~ilR~~W~~~l~~-~~~ 391 (402)
++++++||+||++|||+|||||+++++++++++||++++||+ |++||.||+.|+++|++|++++++ +..
T Consensus 242 ~~~~~i~s~Ys~~WDv~~DWgL~~~~~~~~~~~LR~~l~~~~---~~~Yy~ai~~n~ilRf~W~~~~~~~~~~ 311 (345)
T PF03124_consen 242 IIFALINSLYSFYWDVKMDWGLFQPKKKSKNWLLRRRLLYPR---KWFYYFAIILNFILRFAWILTLSPPHFS 311 (345)
T ss_pred HHHHHHHHHHHHHHHHHhcccccCCccccCCCCCccccccCC---cchhhhHHHHHHHHHHHHHHHHHHhhcc
Confidence 999999999999999999999998875578999999999998 899999999999999999999988 443
No 3
>COG5409 EXS domain-containing protein [Signal transduction mechanisms]
Probab=99.95 E-value=4.1e-29 Score=241.79 Aligned_cols=246 Identities=22% Similarity=0.219 Sum_probs=176.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCchh-hhHHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCceEEEeeCCCCC----C
Q psy7158 131 KVGLFLGCFAILFAIILLRVFFDTSQED-FKMAMRLYRGPLLIIQFIFLMGVNVYGWRSSGVNHVLIFELDPRNH----L 205 (402)
Q Consensus 131 ~~GlflG~s~~lli~~li~~~~~~~~~~-~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~~~INy~~Ife~d~r~~----~ 205 (402)
++|+-.|.+..+.+.+.......+.+.. ...++++|.+..+......++++|+++|.+.+|||++||+++.++. .
T Consensus 4 vv~~~~g~~l~l~~~i~~l~~~~~~~~~~~~ill~lw~~~~l~~~~~~lf~v~~~i~~r~~inyr~i~~~e~~~~~a~~~ 83 (384)
T COG5409 4 VVGLEKGVSLSLGLYIQNLLNVGEPQSFIVLILLALWGGWILVFFLAFLFDVSCYILTRTPINYRFIFLFEQLSSTARNF 83 (384)
T ss_pred eeeeeccccccHHHHHHHheeCCchHHHHHHHHHHHHhHHHHHHHHHHHhcCceeeEEeccccchhhhhHhHhhhhhccc
Confidence 3566677777766655544222211111 4568999999999999999999999999999999999998886543 2
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhhcc---CCC-CccchhHHHHHHHHHHHHcCCcccchhhhhhHhhhc---------
Q psy7158 206 SEQHLMEISLIMGNVWALSLLCFFFSHV---FSI-PPYSQPVALLLIMLAFLLNPSRTLKYDARFWLLRVT--------- 272 (402)
Q Consensus 206 ~~~~~~~~a~~~~~~~~lsl~~~~~~~~---~~~-~~~~~Pl~~~~~~~~~l~~P~~i~~~~~R~~~lr~l--------- 272 (402)
+.++-. ++.-...|.+++..|.-... +.+ -...+|+++.+....++++|+++.++.+|+|++.++
T Consensus 84 ~~dfh~--~~i~~~~~~~slfiFl~~v~g~~~~l~~~~~~P~l~v~~vf~~ll~Pf~ii~y~sRr~Li~sliRv~l~~~~ 161 (384)
T COG5409 84 NLDFHR--IIIPFHFFTTSLFIFLNAVEGLKFILLFVYFLPLLQVGTVFWFLLKPFQIIYYWSRRYLIESLIRVFLFGYS 161 (384)
T ss_pred chhhHH--HhhhHHHHHHHHHHHHHHhhccccceehhhhccHHHHHHHHHHhhcccceeeecchhhHHHHHHHHHHhccc
Confidence 222211 33334445566666653322 222 246789999999999999999999999999998665
Q ss_pred cc-------c-----hhhh------hhhhhc----CC-------------------------------------cc--ch
Q psy7158 273 PV-------A-----LLLI------MLAFLL----NP-------------------------------------SR--TL 291 (402)
Q Consensus 273 Pf-------g-----L~ss------~~C~y~----~P-------------------------------------~l--a~ 291 (402)
|+ | +|++ ++|+|+ +| ++ |+
T Consensus 162 ~v~f~dF~f~di~~SlTya~gdi~~FfCv~~~~~~~pLc~sshs~~~g~~~~lP~ivR~lQCLRry~ds~~~fphLlNAL 241 (384)
T COG5409 162 LVRFTDFFFGDILISLTYALGDIYIFFCVYSLLFREPLCKSSHSDLSGLAALLPVIVRFLQCLRRYRDSLHEFPHLLNAL 241 (384)
T ss_pred eeeeeeeehhhHHHHHHHhhhhhhhHHHHHhhhccccccccCCcchHhHHHhhHHHHHHHHHHHHHHhhhhhhHHHHHHH
Confidence 22 2 4543 788887 12 23 99
Q ss_pred hhhhHHHHHHHHhhccCCCCCCchhHHHHHHHHHhhhhhhhhhhhccccCCCCCCCCCCCCCCccccccCCccchhhhhh
Q psy7158 292 KYDARFWLLRVTVKKDQSVMDNPFFYPWLAASVISSIYSYIWDIKMDWGLFDKNAPPDNPFLREETVYSSTEMKALLYQA 371 (402)
Q Consensus 292 KY~~~~~~l~~~~~~~~~~~~~~~~~~wi~~~~i~S~Ys~~WDv~mDWgL~~~~~~~~~~~LR~~l~y~~~~~~~~Yy~A 371 (402)
||++++|+.+.....+..+.+...+++|+.++.+||+||+.|||.||||+..+.. +.++-.|++++++ |+.|
T Consensus 242 KYs~~i~v~~~~~~~r~~~~~~~l~~l~~~~a~lnS~yT~~WDV~mDW~l~~~~~-~~~~~kr~~~~l~-------y~~a 313 (384)
T COG5409 242 KYSLNIPVLFCLWLYRVYEGEERLFHLQIWFALLNSIYTSFWDVFMDWSLDSLTS-LRSWSKRAVTLLK-------YHIA 313 (384)
T ss_pred HHHHhhHHHHHHHHHHhccCcceeehHHHHHHHHHHHHHHhHHhheeeeeccccc-chhccccchhHHH-------HHHH
Confidence 9999999944332211111135678999999999999999999999999976653 4556677776655 8999
Q ss_pred hhhccccccchhhhh
Q psy7158 372 MEEAPSLETTEPEVI 386 (402)
Q Consensus 372 iv~n~ilR~~W~~~l 386 (402)
|+.|+++|++|++..
T Consensus 314 ~~inFllR~~Wi~~~ 328 (384)
T COG5409 314 MIINFLLRFSWIVYY 328 (384)
T ss_pred HHHHHHHHHHHHHhh
Confidence 999999999999984
No 4
>PF03105 SPX: SPX domain; InterPro: IPR004331 The SPX domain is named after SYG1/Pho81/XPR1 proteins. This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N terminus directly binds to the G- protein beta subunit and inhibits transduction of the mating pheromone signal [] suggesting that all the members of this family are involved in G-protein associated signal transduction. The C-terminal of these proteins often have an EXS domain (IPR004342 from INTERPRO) []. The N-termini of several proteins involved in the regulation of phosphate transport, including the putative phosphate level sensors PHO81 from Saccharomyces cerevisiae and NUC-2 from Neurospora crassa, are also members of this family [, ]. NUC-2 contains several ankyrin repeats (IPR002110 from INTERPRO). Several members of this family are the XPR1 proteins: the xenotropic and polytropic retrovirus receptor confers susceptibility to infection with Murine leukemia virus (MLV) []. The similarity between SYG1, phosphate regulators and XPR1 sequences has been previously noted, as has the additional similarity to several predicted proteins, of unknown function, from Drosophila melanogaster, Arabidopsis thaliana, Caenorhabditis elegans, Schizosaccharomyces pombe, and Saccharomyces cerevisiae [, ]. In addition, given the similarities between XPR1 and SYG1 and phosphate regulatory proteins, it has been proposed that XPR1 might be involved in G-protein associated signal transduction [, , ] and may itself function as a phosphate sensor [].
Probab=99.05 E-value=1.8e-10 Score=110.29 Aligned_cols=58 Identities=10% Similarity=0.123 Sum_probs=49.7
Q ss_pred HHhhhhhHHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHhcc
Q psy7158 20 EKKNKKGEEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAEHV 77 (402)
Q Consensus 20 ~~~~~~~~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v 77 (402)
..+.....+..+++.|+|+.|.+|++ ++||+|||+||||||||+++++.++.||.+.|
T Consensus 217 ~~~~~~~~~l~~~~~e~y~~l~~Lk~f~~LN~taf~KIlKK~DK~~~~~~~~~y~~~v~ 275 (275)
T PF03105_consen 217 KSLKKARKQLKKAFIELYRELELLKSFVELNRTAFRKILKKYDKVTGTSLSDDYMEEVV 275 (275)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchHhhhhccC
Confidence 34444555667889999999999996 99999999999999999999999999887654
No 5
>KOG1161|consensus
Probab=99.02 E-value=4.5e-10 Score=109.42 Aligned_cols=75 Identities=5% Similarity=0.051 Sum_probs=61.9
Q ss_pred HHHHHHHHHhhh-hhhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHhhccCcH
Q psy7158 33 SMRLVHSLINVS-NAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSELEEGDR 108 (402)
Q Consensus 33 ~~e~~~~L~~L~-~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~f~~gdr 108 (402)
+.+.-..+.+|. |++||+|||+||||||||.++-... .++...|..+||++.+.++.++.+++.+|....+.++-
T Consensus 102 l~~~~~em~~L~~fs~LN~tGf~KILKK~DKrtg~~l~-~~f~~~l~~~Pf~~~e~~~~Lv~e~~~l~~~l~~~~~~ 177 (310)
T KOG1161|consen 102 LVDFHGEMVLLENFSRLNYTGFAKILKKHDKRTGYRLR-PYFQVRLLHQPFFTTEQLFRLVYEISILLDLLRPSNRN 177 (310)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHhcccccccc-cHHHHHHHhCCCchhhhHHHHHHHHHHHHHHhcccccc
Confidence 444445555566 4999999999999999999996554 55788899999999999999999999999988776544
No 6
>COG5408 SPX domain-containing protein [Signal transduction mechanisms]
Probab=98.12 E-value=3.3e-06 Score=82.39 Aligned_cols=51 Identities=10% Similarity=0.134 Sum_probs=44.7
Q ss_pred hhHHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHh
Q psy7158 25 KGEEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAE 75 (402)
Q Consensus 25 ~~~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~ 75 (402)
+.....+.++++|+.|..++. ++||+|||+||+|||||+++++....||..
T Consensus 239 ~R~~lkk~ii~~y~~l~~lksf~eLN~tGf~Ki~KK~DK~l~~~~~~~~~s~ 290 (296)
T COG5408 239 KRSLLKKRIIELYIQLHQLKSFIELNYTGFSKITKKYDKTLHQNLRHEYMSR 290 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHH
Confidence 445567789999999999995 999999999999999999999988887644
No 7
>COG5036 SPX domain-containing protein involved in vacuolar polyphosphate accumulation [Inorganic ion transport and metabolism]
Probab=98.08 E-value=4.3e-06 Score=84.05 Aligned_cols=75 Identities=12% Similarity=0.092 Sum_probs=58.5
Q ss_pred hhhHHHHHHHHHHHHHHHhhh-hhhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHh
Q psy7158 24 KKGEEEEEISMRLVHSLINVS-NAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSE 102 (402)
Q Consensus 24 ~~~~~~~~~~~e~~~~L~~L~-~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~ 102 (402)
+.+|||+. -+|. |. +..||+|||.||+|||||-++....+- +.-.++..|| .+.+.+.|+-++-.+|.-.
T Consensus 93 ~~LeE~L~----~v~~---l~kF~RLN~tGF~KIvKKHDK~~~y~lkpv-fqvrLk~~p~-~se~yd~Ll~kis~LY~~l 163 (509)
T COG5036 93 KILEEELD----TVHD---LAKFSRLNFTGFKKIVKKHDKHTGYSLKPV-FQVRLKAKPF-FSEQYDPLLYKISSLYNIL 163 (509)
T ss_pred hHHHHHHH----HHHH---HHhhhhhhhHHHHHHHHhhcCCCCceechh-HHHHhccCCc-chhhhcHHHHHHHHHHHHH
Confidence 35566644 3344 44 499999999999999999999877766 4667788888 5678999999999999776
Q ss_pred hccCc
Q psy7158 103 LEEGD 107 (402)
Q Consensus 103 f~~gd 107 (402)
-++++
T Consensus 164 R~~~~ 168 (509)
T COG5036 164 RSSLS 168 (509)
T ss_pred HhcCC
Confidence 66665
No 8
>PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=86.16 E-value=29 Score=32.32 Aligned_cols=16 Identities=19% Similarity=0.276 Sum_probs=7.9
Q ss_pred CccchhHHHHHHHHHH
Q psy7158 237 PPYSQPVALLLIMLAF 252 (402)
Q Consensus 237 ~~~~~Pl~~~~~~~~~ 252 (402)
.|.+-|++.+++.++.
T Consensus 175 np~l~~~~~iiig~i~ 190 (206)
T PF06570_consen 175 NPVLPPWVYIIIGVIA 190 (206)
T ss_pred CcCCCHHHHHHHHHHH
Confidence 3445455555444444
No 9
>KOG4031|consensus
Probab=79.87 E-value=13 Score=34.83 Aligned_cols=66 Identities=18% Similarity=0.091 Sum_probs=38.0
Q ss_pred ccccccHHHHhhhhHHHHHHhhhhhHHH----HHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccc
Q psy7158 2 EPLLKWKEEKKKMEEEEEEKKNKKGEEE----EEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVD 67 (402)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~ 67 (402)
|++-|||||.++.-||+-++--++++|. ...|-++|+..+.-.. ...|-.+=.--++|.++..+..
T Consensus 106 E~IRkWkeeQ~~rl~ekD~~sek~k~ElrekAkKelddwy~~~~ek~~k~~~~nk~eee~~~~e~~~~~~g 176 (216)
T KOG4031|consen 106 EKIRKWKEEQMKRLQEKDEASEKLKEELREKAKKELDDWYDQQNEKLEKTKANNKAEEEALVKENEEFSPG 176 (216)
T ss_pred HHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhcccCCC
Confidence 6788999999887666544333222222 3456677776554322 3344334444567777766554
No 10
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=66.91 E-value=78 Score=31.54 Aligned_cols=34 Identities=26% Similarity=0.017 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhc
Q psy7158 28 EEEEISMRLVHSLINVSN-AAISLTDLGFFASHVK 61 (402)
Q Consensus 28 ~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyD 61 (402)
++++.+.++-+.+.-++. ..-+..++.+..+...
T Consensus 177 ~~l~~l~~l~~~l~~lr~~l~~~~~~l~~l~~~~~ 211 (322)
T COG0598 177 EELERLGELRRSLVYLRRALAPLRDVLLRLARRPL 211 (322)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhcCc
Confidence 777888888888888886 6666778888777764
No 11
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=50.57 E-value=37 Score=28.18 Aligned_cols=22 Identities=41% Similarity=0.684 Sum_probs=10.8
Q ss_pred HHHHhhhhHHHHHHhhhhhHHHH
Q psy7158 8 KEEKKKMEEEEEEKKNKKGEEEE 30 (402)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~ 30 (402)
|||..|.|+|| +||+++-++|.
T Consensus 49 KEER~K~E~~~-q~r~rES~~Er 70 (121)
T PF10669_consen 49 KEERSKKEEKR-QKRNRESKRER 70 (121)
T ss_pred HHHHHHHHHHH-HHHhhhhHHHH
Confidence 55555554443 34455555553
No 12
>COG5522 Predicted integral membrane protein [Function unknown]
Probab=38.66 E-value=3.5e+02 Score=25.71 Aligned_cols=75 Identities=21% Similarity=0.288 Sum_probs=49.1
Q ss_pred hcCCCCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHHHHhh-cCCC
Q psy7158 114 RLRVPPLGEQQSPWTTFKVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVYGWRS-SGVN 192 (402)
Q Consensus 114 ~LR~~~~~~~~~~~~~F~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~-~~IN 192 (402)
.+-.|..+..+.++..+..=+++-+++.+.+...+. ..+..+. =||.-.-++....+|+|++.|++ .|-|
T Consensus 110 AlltPDl~~~~~p~l~~~lffitH~svfls~v~~~v-hfreRpg--------ksgl~~svl~~~~lg~~~lfinrrLGtN 180 (236)
T COG5522 110 ALLTPDLQYLQVPWLEFLLFFITHISVFLSAVILIV-HFRERPG--------KSGLVMSVLVAISLGIMCLFINRRLGTN 180 (236)
T ss_pred HHHcCccccccchHHHHHHHHHHHHHHHHHHHHHHH-HhccCCC--------ccchhHHHHHHHHHHHHHHHHHHHhcCc
Confidence 344555556666777777777777777766544433 3332221 24455677788889999999987 8999
Q ss_pred ceEEE
Q psy7158 193 HVLIF 197 (402)
Q Consensus 193 y~~If 197 (402)
|-++=
T Consensus 181 Ylyls 185 (236)
T COG5522 181 YLYLS 185 (236)
T ss_pred eeEee
Confidence 99864
No 13
>PF11143 DUF2919: Protein of unknown function (DUF2919); InterPro: IPR021318 This bacterial family of proteins has no known function. Some members are annotated as YfeZ however this cannot be confirmed.
Probab=34.91 E-value=1.9e+02 Score=25.74 Aligned_cols=59 Identities=25% Similarity=0.227 Sum_probs=30.4
Q ss_pred chhhHHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHH-HHhh
Q psy7158 127 WTTFKVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVY-GWRS 188 (402)
Q Consensus 127 ~~~F~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~-iw~~ 188 (402)
...|..|+.+|+-.++.. .+....+..+.-...+.+--|.+.+..+ +.-.+..++ .|+.
T Consensus 53 ~~~f~lgL~~g~Pall~~--~l~~~R~~~~~~~~~lwr~~r~lLl~~~-~~~l~~~~~~l~~~ 112 (149)
T PF11143_consen 53 HSDFYLGLAAGLPALLLM--LLSGRRHRSPRWLNRLWRAGRWLLLLSL-LVDLVWQLYHLWLQ 112 (149)
T ss_pred chHHHHHHHHhHHHHHHH--HHHccCCCCchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHc
Confidence 688999999998887765 2222222222223335555554433333 333344444 5443
No 14
>PF15038 Jiraiya: Jiraiya
Probab=33.30 E-value=1e+02 Score=28.33 Aligned_cols=58 Identities=19% Similarity=0.173 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHHHHhhcCC
Q psy7158 131 KVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVYGWRSSGV 191 (402)
Q Consensus 131 ~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~~~I 191 (402)
.+++++|+.+.+....++..+.-...+ ...--+..|.+.+++..++ ..|++.|+|..=
T Consensus 110 ~~~ff~sI~vfl~~l~ly~~l~f~~~~--~~~~s~ilG~g~vfl~~~~-vh~l~~w~r~~~ 167 (175)
T PF15038_consen 110 VGLFFCSISVFLAALILYMLLQFHSEP--GIATSIILGSGAVFLGAAM-VHNLYRWQRETR 167 (175)
T ss_pred hhhHHhhhhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcc
Confidence 467788888888777766543221111 1123345566666665555 899999998653
No 15
>PF03904 DUF334: Domain of unknown function (DUF334); InterPro: IPR005602 This is a family of proteins found in Staphylococcus aureus plasmid with no characterised function.
Probab=31.11 E-value=4.3e+02 Score=25.35 Aligned_cols=18 Identities=11% Similarity=0.021 Sum_probs=10.3
Q ss_pred HHHHHHHHHhhccccccc
Q psy7158 50 LTDLGFFASHVKTLLNVD 67 (402)
Q Consensus 50 ~tgF~KILKKyDK~~~~~ 67 (402)
..+|.++||+.=|.++++
T Consensus 95 ~~dF~~~Lq~~Lk~V~td 112 (230)
T PF03904_consen 95 HNDFQDILQDELKDVDTD 112 (230)
T ss_pred HHHHHHHHHHHHHhhchH
Confidence 456666666665555443
No 16
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=29.65 E-value=5.4e+02 Score=25.09 Aligned_cols=32 Identities=13% Similarity=0.046 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHh
Q psy7158 28 EEEEISMRLVHSLINVSN-AAISLTDLGFFASH 59 (402)
Q Consensus 28 ~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKK 59 (402)
+..+.+.+.-+.+..++. ..-|...+.++.+.
T Consensus 172 ~~l~~l~~l~~~l~~l~~~l~~~~~vl~~l~~~ 204 (318)
T TIGR00383 172 TLMDEILSLRTELLALRRSLWPLRDVLNFLLRK 204 (318)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 344556666677777775 67777777776543
No 17
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=27.77 E-value=86 Score=29.90 Aligned_cols=45 Identities=22% Similarity=0.209 Sum_probs=33.9
Q ss_pred CccccccHHHHhhhhHHHHHHhhhhh-HHHHHHHHHHHHHHHhhhh
Q psy7158 1 MEPLLKWKEEKKKMEEEEEEKKNKKG-EEEEEISMRLVHSLINVSN 45 (402)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~~L~~L~~ 45 (402)
|.|--.+.+++++.|+++.+++.+++ ++|.+.+.+....|...|.
T Consensus 1 m~P~~~~~~~~~~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~Q~ 46 (248)
T PF08367_consen 1 MKPDPGLSEKQEEEEKEKLAAYKASLSEEEKEKIIEQTKELKERQE 46 (248)
T ss_dssp EEEETTHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHHHC
T ss_pred CCCCchHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhC
Confidence 56777788888888877777766665 7788888888888777665
No 18
>KOG4326|consensus
Probab=26.62 E-value=74 Score=24.93 Aligned_cols=31 Identities=19% Similarity=0.278 Sum_probs=17.3
Q ss_pred ccH-HHHhhhhHHHHHHhhhhhHHHHHHHHHH
Q psy7158 6 KWK-EEKKKMEEEEEEKKNKKGEEEEEISMRL 36 (402)
Q Consensus 6 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 36 (402)
.|. +||++...|++++|.+-++.|.+.+.|.
T Consensus 43 ei~a~eKav~da~~a~ekKr~a~~eaR~Lae~ 74 (81)
T KOG4326|consen 43 EIDAHEKAVADAEEAAEKKRWAKDEARYLAEV 74 (81)
T ss_pred HHHHHHHHHHhHHHHHHHHhhHHHHHHHHHHh
Confidence 354 5566655555555544466666666554
No 19
>PF04644 Motilin_ghrelin: Motilin/ghrelin; InterPro: IPR006738 Motilin is a gastrointestinal regulatory polypeptide produced by motilin cells in the duodenal epithelium. It is released into the general circulation at about 100-min intervals during the inter-digestive state and is the most important factor in controlling the inter-digestive migrating contractions. Motilin also stimulates endogenous release of the endocrine pancreas []. This domain is also found in ghrelin, a growth hormone secretagogue synthesised by endocrine cells in the stomach. Ghrelin stimulates growth hormone secretagogue receptors in the pituitary. These receptors are distinct from the growth hormone-releasing hormone receptors, and thus provide a means of controlling pituitary growth hormone release by the gastrointestinal system [].; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1LBJ_A.
Probab=25.92 E-value=56 Score=20.85 Aligned_cols=16 Identities=38% Similarity=0.650 Sum_probs=9.2
Q ss_pred HHhhhhHHHHHHhhhhh
Q psy7158 10 EKKKMEEEEEEKKNKKG 26 (402)
Q Consensus 10 ~~~~~~~~~~~~~~~~~ 26 (402)
|-.|| ||+|++|-.++
T Consensus 8 e~qr~-QekE~nk~~kK 23 (28)
T PF04644_consen 8 EHQRM-QEKERNKGQKK 23 (28)
T ss_dssp HHHHH-HHHHHHHH---
T ss_pred HHHHH-HHHHhccCccc
Confidence 55677 77777775543
No 20
>PF09925 DUF2157: Predicted membrane protein (DUF2157); InterPro: IPR018677 This family of various hypothetical prokaryotic proteins has no known function.
Probab=25.76 E-value=4.4e+02 Score=22.82 Aligned_cols=7 Identities=14% Similarity=0.449 Sum_probs=2.8
Q ss_pred HHHHHHh
Q psy7158 224 SLLCFFF 230 (402)
Q Consensus 224 sl~~~~~ 230 (402)
.+++-.+
T Consensus 107 aligQ~y 113 (145)
T PF09925_consen 107 ALIGQIY 113 (145)
T ss_pred HHHHhHh
Confidence 3344433
No 21
>COG4062 MtrB Tetrahydromethanopterin S-methyltransferase, subunit B [Coenzyme metabolism]
Probab=24.27 E-value=1.7e+02 Score=24.49 Aligned_cols=70 Identities=19% Similarity=0.121 Sum_probs=37.6
Q ss_pred cccCCcccchhHHHHHHHHHHHHHHhhccCcHHHHHhhcCCCCCCCCCCC----chhhHHHHHHHHHHHHHHHHHH
Q psy7158 77 VDVSHFYVNKDINHLISETEATVTSELEEGDRQKAMKRLRVPPLGEQQSP----WTTFKVGLFLGCFAILFAIILL 148 (402)
Q Consensus 77 v~~s~F~~~~~l~~li~~ve~lYa~~f~~gdrk~A~~~LR~~~~~~~~~~----~~~F~~GlflG~s~~lli~~li 148 (402)
-+...|.+-+.+++-++++|.+-.+....=|... --|.+-|.++..+. ...+.-|+.+|..++.++++++
T Consensus 25 ~edvi~ldv~pi~Eqi~kLe~~vddl~~sldPst--p~lnS~PgRegv~~~aG~~tna~yGfviGl~i~aLlAlil 98 (108)
T COG4062 25 FEDVIFLDVDPIEEQIKKLETLVDDLENSLDPST--PPLNSFPGREGVYATAGYLTNAFYGFVIGLGIMALLALIL 98 (108)
T ss_pred cCceEEEeccHHHHHHHHHHHHHHHHHhccCCCC--CCcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455677778888888888876554332222211 11111122233221 4556678888877766665554
No 22
>PF02268 TFIIA_gamma_N: Transcription initiation factor IIA, gamma subunit, helical domain; InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=24.18 E-value=1e+02 Score=22.34 Aligned_cols=31 Identities=10% Similarity=0.036 Sum_probs=21.3
Q ss_pred HHHHHHHHHhhhhhhchHHHHHHHHHhhccccc
Q psy7158 33 SMRLVHSLINVSNAAISLTDLGFFASHVKTLLN 65 (402)
Q Consensus 33 ~~e~~~~L~~L~~~~LN~tgF~KILKKyDK~~~ 65 (402)
+.+-+..+ ++...++-.=-.|||.+|||...
T Consensus 15 L~dtLDel--i~~~~I~p~La~kVL~~FDksi~ 45 (49)
T PF02268_consen 15 LTDTLDEL--IQEGKITPQLAMKVLEQFDKSIN 45 (49)
T ss_dssp HHHHHHHH--HHTTSS-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHH--HHcCCCCHHHHHHHHHHHHHHHH
Confidence 44444443 34478888888999999999864
No 23
>COG5110 RPN1 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=23.22 E-value=50 Score=35.73 Aligned_cols=61 Identities=20% Similarity=0.194 Sum_probs=37.8
Q ss_pred HHHhhhhHHHHHHhhhhhHHHHHHHHHHH---------HHHHhhhh----hhchHHHHHH-----------HHHhhcccc
Q psy7158 9 EEKKKMEEEEEEKKNKKGEEEEEISMRLV---------HSLINVSN----AAISLTDLGF-----------FASHVKTLL 64 (402)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~---------~~L~~L~~----~~LN~tgF~K-----------ILKKyDK~~ 64 (402)
..|||.|+|..-..-++++-|+|.+-|-. .+|++|+- +.-..|+.-| .++-|||.+
T Consensus 28 ~~k~kee~e~lseed~~lk~dLellVeriqd~d~~l~~~sLn~LkeviksStSsmtavpkplkfLrp~y~dl~~iydkw~ 107 (881)
T COG5110 28 KDKKKEEEEQLSEEDAMLKGDLELLVERIQDPDIDLQNNSLNMLKEVIKSSTSSMTAVPKPLKFLRPNYLDLLEIYDKWL 107 (881)
T ss_pred cchhhhhHhhhchhhhhhcccHHHHHHHhhCCChHHHHHHHHHHHHHHhccccccccCCchhhhcCCCcchHHHHHhhcc
Confidence 34566666665556666666777666544 36666652 3444566655 566789988
Q ss_pred ccccc
Q psy7158 65 NVDVG 69 (402)
Q Consensus 65 ~~~~~ 69 (402)
..+..
T Consensus 108 ~~n~K 112 (881)
T COG5110 108 EGNKK 112 (881)
T ss_pred Ccchh
Confidence 77644
No 24
>PF01086 Clathrin_lg_ch: Clathrin light chain; InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=23.00 E-value=1.4e+02 Score=28.43 Aligned_cols=62 Identities=23% Similarity=0.303 Sum_probs=26.1
Q ss_pred ccccccHHHHhhh----hHHHHHHhhhhhHHHHHHHHHHHHHHHhhh--hhhchHHHHHHHHHhhccc
Q psy7158 2 EPLLKWKEEKKKM----EEEEEEKKNKKGEEEEEISMRLVHSLINVS--NAAISLTDLGFFASHVKTL 63 (402)
Q Consensus 2 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~~~~L~~L~--~~~LN~tgF~KILKKyDK~ 63 (402)
|++-+|+|+.++. ++++++||-.-.++=.+.+.++|....... +...|+..=-..+++-|+.
T Consensus 112 e~ireWre~~~~~i~ekD~~e~~kk~e~~~~A~k~lddfY~~~~~k~e~~k~~nr~~ee~fl~~~~~~ 179 (225)
T PF01086_consen 112 EAIREWREERDKRIEEKDAEEEEKKEEIKEKAKKELDDFYENRNEKKEKNKKQNREEEEEFLAKREEF 179 (225)
T ss_dssp THHHHHHHHHTTTSTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccc
Confidence 4677899986543 333444444444444445666665544333 3666665555555555554
No 25
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=22.19 E-value=2e+02 Score=20.70 Aligned_cols=31 Identities=35% Similarity=0.382 Sum_probs=21.5
Q ss_pred HHhhhhHHHHHHhhhhhHHHHHHHHHHHHHH
Q psy7158 10 EKKKMEEEEEEKKNKKGEEEEEISMRLVHSL 40 (402)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~L 40 (402)
++||+..+..+.+...+++++..|..-...|
T Consensus 21 ~rkk~~~~~le~~~~~L~~en~~L~~~i~~L 51 (54)
T PF07716_consen 21 QRKKQREEELEQEVQELEEENEQLRQEIAQL 51 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667777777777777777777776555543
No 26
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=21.79 E-value=1.7e+02 Score=26.58 Aligned_cols=27 Identities=26% Similarity=0.202 Sum_probs=20.5
Q ss_pred HHHHHhhhhhHHHHHHHHHHHHHHHhh
Q psy7158 17 EEEEKKNKKGEEEEEISMRLVHSLINV 43 (402)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~e~~~~L~~L 43 (402)
|+.++++.+++++.+++.|.|++|...
T Consensus 121 e~Le~e~~~L~~~~~~~~eDY~~L~~I 147 (161)
T TIGR02894 121 EELEKELEKLRQRLSTIEEDYQTLIDI 147 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466777777888888888888887553
No 27
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=21.07 E-value=5.5e+02 Score=27.80 Aligned_cols=91 Identities=13% Similarity=0.118 Sum_probs=57.7
Q ss_pred HHHHHHhhhhhHHHHHHHHHHHHHHHhhhhhhchHHHHHHHHHhhccccccccchHHHHhcccc-CCccc-chhHHHHHH
Q psy7158 16 EEEEEKKNKKGEEEEEISMRLVHSLINVSNAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDV-SHFYV-NKDINHLIS 93 (402)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~e~~~~L~~L~~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~-s~F~~-~~~l~~li~ 93 (402)
..+.++.....+++.+.+.+.+..| ++.-+-|......+.++|...-.. ...+ ..|.. .+.+++.++
T Consensus 103 i~~~~~~l~~~e~~i~~i~~~l~~L--~~~e~~nr~~i~~l~~~y~~lrk~---------ll~~~~~~G~a~~~Le~~L~ 171 (560)
T PF06160_consen 103 IKEIEEQLDEIEEDIKEILDELDEL--LESEEKNREEIEELKEKYRELRKE---------LLAHSFSYGPAIEELEKQLE 171 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhhhhchhHHHHHHHHH
Confidence 3444555555666644444444443 234788899999999999654211 1112 22443 477888899
Q ss_pred HHHHHHHHh---hccCcHHHHHhhcCC
Q psy7158 94 ETEATVTSE---LEEGDRQKAMKRLRV 117 (402)
Q Consensus 94 ~ve~lYa~~---f~~gdrk~A~~~LR~ 117 (402)
++|..|..+ -++||.-+|.+.|..
T Consensus 172 ~ie~~F~~f~~lt~~GD~~~A~eil~~ 198 (560)
T PF06160_consen 172 NIEEEFSEFEELTENGDYLEAREILEK 198 (560)
T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 999988864 357999888877764
No 28
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=20.26 E-value=4e+02 Score=22.22 Aligned_cols=10 Identities=30% Similarity=0.594 Sum_probs=5.1
Q ss_pred HhhhhHHHHH
Q psy7158 11 KKKMEEEEEE 20 (402)
Q Consensus 11 ~~~~~~~~~~ 20 (402)
+-.|.|||+.
T Consensus 45 r~~MKEER~K 54 (121)
T PF10669_consen 45 RIRMKEERSK 54 (121)
T ss_pred HHHHHHHHHH
Confidence 4456665443
No 29
>PF04930 FUN14: FUN14 family; InterPro: IPR007014 This is a family of short proteins found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices.
Probab=20.13 E-value=79 Score=25.96 Aligned_cols=34 Identities=21% Similarity=0.220 Sum_probs=17.8
Q ss_pred ccCcHHHHHhhcCCCCCCCCCCCchhhHHHHHHHH
Q psy7158 104 EEGDRQKAMKRLRVPPLGEQQSPWTTFKVGLFLGC 138 (402)
Q Consensus 104 ~~gdrk~A~~~LR~~~~~~~~~~~~~F~~GlflG~ 138 (402)
+..|.+...++++.- ..+..+....|.+|+++|+
T Consensus 66 ~~~~~~~~~~~~~~~-l~~~~~~~~gF~~Gf~lG~ 99 (100)
T PF04930_consen 66 DAPDIKSLWNKLMDF-LKSNLPFSAGFLAGFLLGF 99 (100)
T ss_pred cchhHHHHHHHHHHH-HHcCccHhHHHHHHHHHHc
Confidence 344555445454432 1122334677778877774
Done!