Query psy4941
Match_columns 172
No_of_seqs 94 out of 96
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 21:49:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4941.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4941hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08687 ASD2: Apx/Shroom doma 100.0 7.5E-73 1.6E-77 486.5 18.7 168 1-168 95-264 (264)
2 COG1340 Uncharacterized archae 77.1 57 0.0012 29.5 13.8 106 4-114 53-161 (294)
3 PF10191 COG7: Golgi complex c 76.0 22 0.00048 35.3 9.7 91 2-92 73-178 (766)
4 TIGR02231 conserved hypothetic 68.4 75 0.0016 29.6 10.9 86 83-168 70-168 (525)
5 PF12130 DUF3585: Protein of u 66.9 63 0.0014 25.7 8.9 73 95-167 6-92 (142)
6 PRK04863 mukB cell division pr 57.1 1.4E+02 0.003 32.4 11.5 117 26-168 890-1008(1486)
7 PRK10963 hypothetical protein; 53.4 38 0.00083 28.4 5.6 57 1-57 46-102 (223)
8 KOG3878|consensus 53.0 77 0.0017 29.9 7.9 48 89-136 136-190 (469)
9 PF12718 Tropomyosin_1: Tropom 52.2 1.2E+02 0.0026 24.1 10.1 21 84-104 122-142 (143)
10 PF13949 ALIX_LYPXL_bnd: ALIX 50.0 1.6E+02 0.0034 24.8 13.7 118 3-123 148-277 (296)
11 TIGR00691 spoT_relA (p)ppGpp s 46.9 72 0.0016 31.4 7.1 96 27-140 63-180 (683)
12 KOG3759|consensus 46.1 51 0.0011 32.2 5.8 48 43-90 61-109 (621)
13 PF10805 DUF2730: Protein of u 44.0 1.4E+02 0.003 22.5 7.9 59 64-126 46-104 (106)
14 PF08687 ASD2: Apx/Shroom doma 44.0 2.3E+02 0.0051 25.2 10.4 75 3-101 184-258 (264)
15 KOG2180|consensus 42.5 2.4E+02 0.0052 28.9 9.9 69 43-127 163-244 (793)
16 PF13328 HD_4: HD domain; PDB: 41.5 84 0.0018 24.2 5.5 50 26-75 61-124 (153)
17 TIGR02231 conserved hypothetic 38.7 1.9E+02 0.0042 26.9 8.4 93 3-100 75-168 (525)
18 PF07889 DUF1664: Protein of u 37.2 1.7E+02 0.0038 23.2 6.7 29 46-74 96-124 (126)
19 PF10146 zf-C4H2: Zinc finger- 35.7 2.9E+02 0.0063 23.8 10.3 87 1-112 3-99 (230)
20 TIGR01321 TrpR trp operon repr 35.1 1.8E+02 0.004 22.0 6.3 57 29-91 22-92 (94)
21 PRK11092 bifunctional (p)ppGpp 32.6 3.4E+02 0.0074 27.1 9.3 95 27-139 88-204 (702)
22 PF06037 DUF922: Bacterial pro 32.2 2.7E+02 0.0059 22.5 9.1 74 38-114 72-149 (161)
23 PF15188 CCDC-167: Coiled-coil 31.0 2.3E+02 0.0049 21.2 6.6 47 55-101 7-53 (85)
24 PF05651 Diacid_rec: Putative 30.6 1.4E+02 0.0029 23.6 5.2 42 126-167 92-133 (135)
25 PRK02224 chromosome segregatio 30.4 5.4E+02 0.012 25.3 18.5 38 130-167 550-587 (880)
26 PF04111 APG6: Autophagy prote 29.9 3.1E+02 0.0067 24.4 7.8 38 130-167 98-135 (314)
27 KOG3540|consensus 29.8 1.8E+02 0.004 28.6 6.7 50 106-156 339-402 (615)
28 COG3835 CdaR Sugar diacid util 29.5 57 0.0012 30.5 3.2 28 126-153 94-121 (376)
29 PF03961 DUF342: Protein of un 28.9 4.6E+02 0.0099 24.1 9.0 43 122-164 366-408 (451)
30 PF08203 RNA_polI_A14: Yeast R 28.6 36 0.00077 25.1 1.4 39 43-81 16-73 (76)
31 COG3879 Uncharacterized protei 26.9 1.5E+02 0.0033 26.2 5.3 47 4-55 62-108 (247)
32 PF07851 TMPIT: TMPIT-like pro 26.3 5.1E+02 0.011 23.8 9.4 44 6-49 4-47 (330)
33 KOG0996|consensus 26.1 8.9E+02 0.019 26.5 13.9 87 5-103 791-877 (1293)
34 COG1771 Uncharacterized protei 26.1 1.2E+02 0.0026 29.0 4.7 77 66-142 4-91 (471)
35 PF04340 DUF484: Protein of un 26.0 2.1E+02 0.0046 23.6 5.8 47 2-48 50-96 (225)
36 TIGR01837 PHA_granule_1 poly(h 25.8 1.6E+02 0.0035 22.6 4.7 34 70-103 81-115 (118)
37 PF06419 COG6: Conserved oligo 25.8 6.2E+02 0.014 24.6 10.7 25 107-131 89-114 (618)
38 PF04719 TAFII28: hTAFII28-lik 25.2 43 0.00094 25.1 1.4 22 119-140 2-23 (90)
39 PF08700 Vps51: Vps51/Vps67; 24.9 2.4E+02 0.0052 19.4 7.3 39 4-60 24-62 (87)
40 PF05597 Phasin: Poly(hydroxya 24.5 3.6E+02 0.0078 21.4 11.1 57 46-102 68-127 (132)
41 KOG3091|consensus 24.5 6.8E+02 0.015 24.6 13.6 111 51-164 374-492 (508)
42 PF14584 DUF4446: Protein of u 24.5 82 0.0018 25.5 3.0 78 62-142 18-97 (151)
43 PF04728 LPP: Lipoprotein leuc 24.2 1.3E+02 0.0029 21.0 3.5 40 2-44 6-45 (56)
44 KOG3088|consensus 23.7 1.4E+02 0.003 27.4 4.5 34 86-119 59-92 (313)
45 PF00658 PABP: Poly-adenylate 23.5 2.8E+02 0.0061 19.8 7.0 55 24-105 16-70 (72)
46 PF09277 Erythro-docking: Eryt 23.0 1.7E+02 0.0036 20.6 3.9 37 65-102 8-44 (58)
47 PF10779 XhlA: Haemolysin XhlA 21.8 2.4E+02 0.0052 19.6 4.6 41 1-41 1-41 (71)
48 PF11365 DUF3166: Protein of u 21.6 1.1E+02 0.0024 23.4 3.0 15 43-57 36-50 (96)
49 PF08317 Spc7: Spc7 kinetochor 21.5 5.7E+02 0.012 22.6 12.7 67 37-109 203-269 (325)
50 smart00150 SPEC Spectrin repea 21.4 2.5E+02 0.0055 18.5 7.1 12 60-71 55-66 (101)
51 PF06785 UPF0242: Uncharacteri 21.3 6.9E+02 0.015 23.6 8.6 119 33-164 23-146 (401)
52 PRK01381 Trp operon repressor; 20.9 3.9E+02 0.0085 20.5 6.2 58 29-92 22-93 (99)
53 PF02050 FliJ: Flagellar FliJ 20.5 3E+02 0.0065 19.0 8.7 39 123-164 40-78 (123)
No 1
>PF08687 ASD2: Apx/Shroom domain ASD2; InterPro: IPR014799 Cell shape changes require the coordination of actin and microtubule cytoskeletons. The Shroom family is a small group of related proteins that are defined by sequence similarity and in most cases by some link to the actin cytoskeleton. The Shroom (Shrm) protein family is found only in animals. Proteins of this family are predicted to be utilised in multiple morphogenic and developmental processes across animal phyla to regulate cells shape or intracellular architecture in an actin and myosin-dependent manner []. While the founding member of the Shrm family is Shrm1 (formerly Apx), it appears that this protein is found only in Xenopus []. In mice and humans, the Shrm family of proteins consists of: Shrm2 (formerly Apxl), a protein involved in the morphogenesis, maintenance, and/or function of vascular endothelial cells. Shrm3 (formerly Shroom), a protein necessary for neural tube closure in vertebrate development as deficiency in Shrm results in spina bifida. Shrm3 is also conserved in some invertebrates, as orthologues can be found in sea urchins. Shrm4, a regulator of cyto-skeletal architecture that may play an important role in vertebrate development. It is implicated in X-linked mental retardation in humans. This protein family is based on the conservation of a specific arrangement of an N-terminal PDZ domain, a centrally positioned sequence motif termed ASD1 (Apx/Shrm Domain 1) and a C-terminal motif termed ASD2 [, , ]. Shrm2 and Shrm3 contain all three domains, while Shrm4 contains the PDZ and ASD2 domains, but lacks a discernible ASD1 element. To date, the ASD1 and ASD2 elements have only been found in Shrm-related proteins and do not appear in combination with other conserved domains. ASD1 is required for targeting actin, while ASD2 is capable of eliciting an actomyosin based constriction event [, ]. ASD2 is the most highly conserved sequence element shared by Shrm1, Shrm2, Shrm3, and Shrm4. It possesses a well conserved series of leucine residues that exhibit spacing consistent with that of a leucine zipper motif []. Shroom2 is both necessary and sufficient to govern the localization of pigment granules at the apical surface of epithelial cells. Shroom2 is a central regulator of RPE pigmentation. Despite their diverse biological roles, Shroom family proteins share a common activity. Since the locus encoding human SHROOM2 lies within the critical region for two distinct forms of ocular albinism, it is possible that SHROOM2 mutations may contribute to human visual system disorders [].; GO: 0000902 cell morphogenesis, 0005737 cytoplasm; PDB: 3THF_B.
Probab=100.00 E-value=7.5e-73 Score=486.46 Aligned_cols=168 Identities=49% Similarity=0.755 Sum_probs=159.3
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCC
Q psy4941 1 MSRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLR 80 (172)
Q Consensus 1 ~~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~ 80 (172)
|.+|++||.+|+++|++|++|+++|+++|++|+++|+++|+|||++||+|||||||||||||||||||||||||||.+++
T Consensus 95 i~~l~~kl~~L~~eqe~l~ee~~~n~~lG~~ve~~v~~~c~p~E~~Ky~~fi~Dl~kv~~LLLsLs~RLaRve~aL~~~~ 174 (264)
T PF08687_consen 95 IESLSKKLEVLQEEQEALQEEIQANEALGAEVEALVQEVCKPNEFEKYRMFIGDLEKVVNLLLSLSGRLARVENALSSLD 174 (264)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCc
Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999999997
Q ss_pred C--CHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHHHHHhcCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcH
Q psy4941 81 E--DHCDRKSLEEKRDKLASQLEEAKELKENIDRRSTNVATLLRKYLSSEEFADYNHFINMKAKLLVDAREISDKIQLGE 158 (172)
Q Consensus 81 ~--d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgE 158 (172)
+ +++||.+|.+|+++|++|+||||+||+|||||+++|++||++|||+|||+||+|||+|||+|||+||+||||||+||
T Consensus 175 ~~~~~~Er~~L~~k~~~L~~Q~edAk~LKe~~drRe~~v~~iL~~~L~~eq~~dy~~fv~mKa~Ll~eqreLddkiklge 254 (264)
T PF08687_consen 175 EDADPEERESLLEKRRLLQRQLEDAKELKENLDRRERVVSEILARYLSEEQLADYRHFVKMKAALLIEQRELDDKIKLGE 254 (264)
T ss_dssp -----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhH
Confidence 5 57899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhc
Q psy4941 159 EQLAALQETM 168 (172)
Q Consensus 159 EQL~~L~esL 168 (172)
|||+||++||
T Consensus 255 EQL~~L~esL 264 (264)
T PF08687_consen 255 EQLEALRESL 264 (264)
T ss_dssp HHHHHHCC--
T ss_pred HHHHHHHhcC
Confidence 9999999986
No 2
>COG1340 Uncharacterized archaeal coiled-coil protein [Function unknown]
Probab=77.08 E-value=57 Score=29.51 Aligned_cols=106 Identities=19% Similarity=0.195 Sum_probs=70.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHH---HHHHHHHHhHhhHHHHHHHhhCCC
Q psy4941 4 LDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVG---HITSLLLGLSGRLARAENALIGLR 80 (172)
Q Consensus 4 l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~Dle---kVv~LLLsLs~RLaRvenaL~~l~ 80 (172)
+..+..++++++..|.++++.-..-..++.+.+++ -+..|+-.....+ ..-.-+=+|...+-|.|-...+.+
T Consensus 53 ~~e~~~elr~~rdeineev~elK~kR~ein~kl~e-----L~~~~~~l~e~~~~~~~~~~~~~~ler~i~~Le~~~~T~~ 127 (294)
T COG1340 53 LREKAQELREERDEINEEVQELKEKRDEINAKLQE-----LRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSV 127 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHhhhhhccCCCHHHHHHHHHHHHHHHHhcC
Confidence 34566777888888888888887777888887777 3444444444444 122223456667777777777777
Q ss_pred CCHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhh
Q psy4941 81 EDHCDRKSLEEKRDKLASQLEEAKELKENIDRRS 114 (172)
Q Consensus 81 ~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe 114 (172)
-+|++-.-+-++...|..++++|+.-.+.-....
T Consensus 128 L~~e~E~~lvq~I~~L~k~le~~~k~~e~~~~~~ 161 (294)
T COG1340 128 LTPEEERELVQKIKELRKELEDAKKALEENEKLK 161 (294)
T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7777666777788888888888875444433333
No 3
>PF10191 COG7: Golgi complex component 7 (COG7); InterPro: IPR019335 The conserved oligomeric Golgi (COG) complex is an eight-subunit (Cog1-8) peripheral Golgi protein involved in membrane trafficking and glycoconjugate synthesis []. COG7 is required for normal Golgi morphology and trafficking. Mutation in COG7 causes a congenital disorder of glycosylation [].
Probab=75.96 E-value=22 Score=35.30 Aligned_cols=91 Identities=19% Similarity=0.257 Sum_probs=53.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHhhHH-hhHHHHHHHH----------hhcCchhHHHHHHHHHhHHHHHHH--HHHhHhh
Q psy4941 2 SRLDSKLTLLRSVQVDLNEESRGNDR-LGEALATRLA----------SEARPAETAKYKLHVQEVGHITSL--LLGLSGR 68 (172)
Q Consensus 2 ~~l~~kL~~L~~eqe~l~ee~~~n~~-lG~~ve~~V~----------~~ckp~E~eKy~~Fi~DlekVv~L--LLsLs~R 68 (172)
.+|+.-...|+.....++++++.=+. -|..|..++. ....-.|.+++..+..+||-++.= +-..++|
T Consensus 73 ~~l~~ea~~L~~~~~~v~~~~~~~e~~t~~s~~~L~~ld~vK~rm~~a~~~L~EA~~w~~l~~~v~~~~~~~d~~~~a~~ 152 (766)
T PF10191_consen 73 DRLRQEAASLQEQMASVQEEIKAVEQDTAQSMAQLAELDSVKSRMEAARETLQEADNWSTLSAEVDDLFESGDIAKIADR 152 (766)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcCCHHHHHHH
Confidence 34555556666666666666543222 2222322221 122346777777777777665532 4456999
Q ss_pred HHHHHHHhhCCC--CCHHHHHHHHHH
Q psy4941 69 LARAENALIGLR--EDHCDRKSLEEK 92 (172)
Q Consensus 69 LaRvenaL~~l~--~d~~Er~~L~~k 92 (172)
|+.+.++|..+. ||.++|..+.+.
T Consensus 153 l~~m~~sL~~l~~~pd~~~r~~~le~ 178 (766)
T PF10191_consen 153 LAEMQRSLAVLQDVPDYEERRQQLEA 178 (766)
T ss_pred HHHHHHHHHHHcCCCchhHHHHHHHH
Confidence 999999999765 355677666554
No 4
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=68.43 E-value=75 Score=29.62 Aligned_cols=86 Identities=17% Similarity=0.231 Sum_probs=64.8
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHHHHHh------------cCChHHHhhHHHHHHhH-HHHHHHhhc
Q psy4941 83 HCDRKSLEEKRDKLASQLEEAKELKENIDRRSTNVATLLRK------------YLSSEEFADYNHFINMK-AKLLVDARE 149 (172)
Q Consensus 83 ~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~------------~L~~eql~dY~~fv~mK-a~Ll~eqre 149 (172)
+++-..|..++..|..++.++..=++.++.+..++..+-.. ..+.+++.++-.|+.-. +.+..+.++
T Consensus 70 ~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (525)
T TIGR02231 70 PERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEPDLKEWFQAFDFNGSEIERLLTEDRE 149 (525)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667788888888888888888888888888888776532 33667788887777665 667777788
Q ss_pred hHHHhhhcHHHHHHHHhhc
Q psy4941 150 ISDKIQLGEEQLAALQETM 168 (172)
Q Consensus 150 LddkIklgEEQL~~L~esL 168 (172)
++.+++-.++++..|...|
T Consensus 150 ~~~~~~~~~~~l~~l~~~l 168 (525)
T TIGR02231 150 AERRIRELEKQLSELQNEL 168 (525)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 8888888888877776543
No 5
>PF12130 DUF3585: Protein of unknown function (DUF3585); InterPro: IPR022735 This domain is found in eukaryotes. It is typically between 135 and 149 amino acids in length and is found associated with PF00307 from PFAM.
Probab=66.90 E-value=63 Score=25.68 Aligned_cols=73 Identities=22% Similarity=0.354 Sum_probs=59.3
Q ss_pred HHHhHHHHHHHHHHhhhhhhhHHHHHHHhcC-----ChHHHhhHHHHHHhHHHHH---------HHhhchHHHhhhcHHH
Q psy4941 95 KLASQLEEAKELKENIDRRSTNVATLLRKYL-----SSEEFADYNHFINMKAKLL---------VDAREISDKIQLGEEQ 160 (172)
Q Consensus 95 ~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L-----~~eql~dY~~fv~mKa~Ll---------~eqreLddkIklgEEQ 160 (172)
..++++++--.-...++.|+..+-..|...- .++-+.+|-.+|+-|.+|+ +.+.+|++++...+-+
T Consensus 6 ~Iq~el~~ie~~~~~lE~~g~~lE~~LR~~~~~~~~e~~Ll~ewf~Lv~eKn~LvrreseL~~~~~~~~Le~~~~~L~~E 85 (142)
T PF12130_consen 6 EIQRELEEIEEEQRELEERGVELEKQLRSAEENDEEEEELLQEWFKLVNEKNALVRRESELMYMEKEQELEDRQARLEQE 85 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 4678888888888889999999988887766 3444899999999999876 4467789999999999
Q ss_pred HHHHHhh
Q psy4941 161 LAALQET 167 (172)
Q Consensus 161 L~~L~es 167 (172)
|.++...
T Consensus 86 Lr~l~~~ 92 (142)
T PF12130_consen 86 LRELLAK 92 (142)
T ss_pred HHHHHhc
Confidence 9988753
No 6
>PRK04863 mukB cell division protein MukB; Provisional
Probab=57.08 E-value=1.4e+02 Score=32.44 Aligned_cols=117 Identities=15% Similarity=0.255 Sum_probs=0.0
Q ss_pred HHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHHHHH
Q psy4941 26 DRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEEAKE 105 (172)
Q Consensus 26 ~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~EDAke 105 (172)
+.+++.|+..=.+.-...+...| .....++|+.+|+.|..|..|| +-+..|..+++.|+.
T Consensus 890 ~~~~~~~~~~~~~~~~~~~a~~y-------------~~~~~~~L~qLE~~l~~L~~Dp-------~~~e~lr~e~~~~~~ 949 (1486)
T PRK04863 890 ETLADRVEEIREQLDEAEEAKRF-------------VQQHGNALAQLEPIVSVLQSDP-------EQFEQLKQDYQQAQQ 949 (1486)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHhCCCH-------HHHHHHHHHHHHHHH
Q ss_pred HHHhhhhhhhHHHHHHHh--cCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHHHhhc
Q psy4941 106 LKENIDRRSTNVATLLRK--YLSSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAALQETM 168 (172)
Q Consensus 106 LKe~ldRRe~~v~~iL~~--~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L~esL 168 (172)
--.+++.+=.++.++..+ ||+ |..=+.|-+.=-.-.-.|..+|+--+++-..+++-+
T Consensus 950 ~~~~~~~~~~~l~~~~~~~~~~~------y~~~~~~l~~~~~~~~~Le~~Le~iE~~~~~areql 1008 (1486)
T PRK04863 950 TQRDAKQQAFALTEVVQRRAHFS------YEDAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQL 1008 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcc------HHHHHhHhhcchhhhHHHHHHHHHHHHHHHHHHHHH
No 7
>PRK10963 hypothetical protein; Provisional
Probab=53.37 E-value=38 Score=28.41 Aligned_cols=57 Identities=16% Similarity=0.124 Sum_probs=48.9
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHH
Q psy4941 1 MSRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGH 57 (172)
Q Consensus 1 ~~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~Dlek 57 (172)
|+.++.|+..|+.....|-.-++.|+++...+-.++...-.+..++.+-.-+.++-+
T Consensus 46 ~~~LR~r~~~Le~~l~~Li~~A~~Ne~l~~~~~~l~l~Ll~a~~~~~l~~~L~~~~~ 102 (223)
T PRK10963 46 MARQRNHIHVLEEEMTLLMEQAIANEDLFYRLLPLQSRLAAADSLQDMLMRLHRWAR 102 (223)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Confidence 457889999999999999999999999999999999999988888877665555533
No 8
>KOG3878|consensus
Probab=52.95 E-value=77 Score=29.92 Aligned_cols=48 Identities=25% Similarity=0.455 Sum_probs=27.3
Q ss_pred HHHHHHHHHhHHHHHHHHHHhhhh-------hhhHHHHHHHhcCChHHHhhHHHH
Q psy4941 89 LEEKRDKLASQLEEAKELKENIDR-------RSTNVATLLRKYLSSEEFADYNHF 136 (172)
Q Consensus 89 L~~k~~~L~~Q~EDAkeLKe~ldR-------Re~~v~~iL~~~L~~eql~dY~~f 136 (172)
|.++--.+.++-++|.+|.+|-.| ++..-..-+...|+..-++.|..|
T Consensus 136 lkE~e~r~~ee~~e~~~lQe~~qr~l~ee~~~~E~Qr~Qiq~ALN~QT~~QF~~Y 190 (469)
T KOG3878|consen 136 LKEKELRLMEEKKEARELQENAQRELLEEGYKEELQRRQIQDALNKQTYQQFKLY 190 (469)
T ss_pred HHHHHHHHHHhhhcchhHHHHHHHHHHHhhhhHHHHHHHHHHHHhHHHHHHHHHH
Confidence 335666677777777777776655 333333444455555544444444
No 9
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=52.21 E-value=1.2e+02 Score=24.10 Aligned_cols=21 Identities=38% Similarity=0.534 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHhHHHHHH
Q psy4941 84 CDRKSLEEKRDKLASQLEEAK 104 (172)
Q Consensus 84 ~Er~~L~~k~~~L~~Q~EDAk 104 (172)
.++..+..|+.-+..|+.+||
T Consensus 122 ~~~~~~E~k~eel~~k~~~~k 142 (143)
T PF12718_consen 122 QERDQWEEKYEELEEKYKEAK 142 (143)
T ss_pred hhHHHHHHHHHHHHHHHHHhc
Confidence 366778888888888888876
No 10
>PF13949 ALIX_LYPXL_bnd: ALIX V-shaped domain binding to HIV ; PDB: 2XS1_A 2XS8_A 2R03_A 2R02_A 2OEX_B 2OEV_A 2OJQ_A 2R05_A.
Probab=50.00 E-value=1.6e+02 Score=24.84 Aligned_cols=118 Identities=20% Similarity=0.185 Sum_probs=64.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHh---hHHhhHHHHH------HHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHH
Q psy4941 3 RLDSKLTLLRSVQVDLNEESRG---NDRLGEALAT------RLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAE 73 (172)
Q Consensus 3 ~l~~kL~~L~~eqe~l~ee~~~---n~~lG~~ve~------~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRve 73 (172)
.+-.++..++.++..+..+++. ++....-+.. ..-+.+-..|+.||.-|++.|+..++ .-..-|..+.
T Consensus 148 ~ll~~l~~l~~eR~~~~~~lk~~~~~d~i~~~l~~~~~~~~~~~~~lf~~eL~k~~~~~~~i~~~~~---~Q~~ll~~i~ 224 (296)
T PF13949_consen 148 ELLNKLEELKKEREELLEQLKEKLQNDDISKLLSELNKNGSADFEALFEEELKKFDPLQNRIQQNLS---KQEELLQEIQ 224 (296)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHSSS--HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHhhccCCccHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH
Confidence 3456788889999998888875 3333333331 12235556778888877776665533 3333444444
Q ss_pred HHhh---CCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHHHHHh
Q psy4941 74 NALI---GLREDHCDRKSLEEKRDKLASQLEEAKELKENIDRRSTNVATLLRK 123 (172)
Q Consensus 74 naL~---~l~~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~ 123 (172)
.+-. .......+...-..-...|..-++--.+++.|+..--.+..++...
T Consensus 225 ~~~~~~~~~~~~~~~~~~r~~~~~~l~~a~~~y~el~~~l~eG~~FY~~L~~~ 277 (296)
T PF13949_consen 225 EANEEFAQSRKSDQEQKERESALQRLEAAYDAYKELSSNLEEGLKFYNDLLEI 277 (296)
T ss_dssp HHHHHHHTTS--SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 4333 2221112222222223445566666678888888877777666543
No 11
>TIGR00691 spoT_relA (p)ppGpp synthetase, RelA/SpoT family. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species.
Probab=46.89 E-value=72 Score=31.45 Aligned_cols=96 Identities=20% Similarity=0.364 Sum_probs=57.9
Q ss_pred HhhHHHHHHHHhhcCchhH----------HHHH-HHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHH---HH----
Q psy4941 27 RLGEALATRLASEARPAET----------AKYK-LHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDR---KS---- 88 (172)
Q Consensus 27 ~lG~~ve~~V~~~ckp~E~----------eKy~-~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er---~~---- 88 (172)
.+|.+|..+|..+.++... +.|+ ||++-..-+--+|..|+-||..+. +|..++++...+ ++
T Consensus 63 ~FG~~Va~lV~~vTk~~~~~~~~~~~~q~e~~rkmlla~~~d~rvvlVKLADrlhNmr-tl~~~~~~k~~~iA~Et~~iy 141 (683)
T TIGR00691 63 EFGEEVAELVDGVTKITKLKKKSRQELQAENFRKMILAMAQDIRVIVIKLADRLHNMR-TLDFLPPEKQKRIAKETLEIY 141 (683)
T ss_pred HHCHHHHHHHHHHHHhcccccchhhHHHHHHHHHHHHhhcCCcceEeeeHHHHHhHHH-HHHhhChHHHHHHHHHHHHHH
Confidence 5999999999999887654 4444 444322223346778899999885 577665432211 11
Q ss_pred --HHHHH--HHHHhHHHHHHHHHHhhhhhhhHHHHHHHhcCChHHHhhHHHHHHhH
Q psy4941 89 --LEEKR--DKLASQLEEAKELKENIDRRSTNVATLLRKYLSSEEFADYNHFINMK 140 (172)
Q Consensus 89 --L~~k~--~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L~~eql~dY~~fv~mK 140 (172)
|-.+. ..++.++| ++-.+||.|+++..-..++..+
T Consensus 142 aPlA~rLG~~~ik~eLe-----------------dl~f~~l~p~~y~~i~~~l~~~ 180 (683)
T TIGR00691 142 APLAHRLGMSSIKTELE-----------------DLSFKYLYPKEYENIKSLVNEQ 180 (683)
T ss_pred HHHHHHhChHHHHHHHH-----------------HHHHHhcCHHHHHHHHHHHHHH
Confidence 11111 12333333 4567888888887777666654
No 12
>KOG3759|consensus
Probab=46.13 E-value=51 Score=32.21 Aligned_cols=48 Identities=21% Similarity=0.299 Sum_probs=39.0
Q ss_pred hhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCC-CCCHHHHHHHH
Q psy4941 43 AETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGL-REDHCDRKSLE 90 (172)
Q Consensus 43 ~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l-~~d~~Er~~L~ 90 (172)
|+.-+.+..-.+=++..|-||+||+..|.|+=.+.-+ ..+|.||..|.
T Consensus 61 s~~~rlr~leeEqerL~ssLlaLsSHFAqVQfRirQ~vea~p~er~~LL 109 (621)
T KOG3759|consen 61 SDALRLRQLEEEQERLNSSLLALSSHFAQVQFRIRQMVEADPSERLKLL 109 (621)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 7777899999999999999999999999999888743 44666665543
No 13
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=44.03 E-value=1.4e+02 Score=22.50 Aligned_cols=59 Identities=29% Similarity=0.257 Sum_probs=37.8
Q ss_pred HhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHHHHHhcCC
Q psy4941 64 GLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEEAKELKENIDRRSTNVATLLRKYLS 126 (172)
Q Consensus 64 sLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L~ 126 (172)
...+||..||+.+..+|. .++-..|.- .+.+.-.+-+.+.+.++.=.+.+.-+|.+.|.
T Consensus 46 ~~~~Rl~~lE~~l~~LPt-~~dv~~L~l---~l~el~G~~~~l~~~l~~v~~~~~lLlE~~lk 104 (106)
T PF10805_consen 46 EHDRRLQALETKLEHLPT-RDDVHDLQL---ELAELRGELKELSARLQGVSHQLDLLLENELK 104 (106)
T ss_pred HHHHHHHHHHHHHHhCCC-HHHHHHHHH---HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 459999999999999965 333333322 34444555566777776666666666665554
No 14
>PF08687 ASD2: Apx/Shroom domain ASD2; InterPro: IPR014799 Cell shape changes require the coordination of actin and microtubule cytoskeletons. The Shroom family is a small group of related proteins that are defined by sequence similarity and in most cases by some link to the actin cytoskeleton. The Shroom (Shrm) protein family is found only in animals. Proteins of this family are predicted to be utilised in multiple morphogenic and developmental processes across animal phyla to regulate cells shape or intracellular architecture in an actin and myosin-dependent manner []. While the founding member of the Shrm family is Shrm1 (formerly Apx), it appears that this protein is found only in Xenopus []. In mice and humans, the Shrm family of proteins consists of: Shrm2 (formerly Apxl), a protein involved in the morphogenesis, maintenance, and/or function of vascular endothelial cells. Shrm3 (formerly Shroom), a protein necessary for neural tube closure in vertebrate development as deficiency in Shrm results in spina bifida. Shrm3 is also conserved in some invertebrates, as orthologues can be found in sea urchins. Shrm4, a regulator of cyto-skeletal architecture that may play an important role in vertebrate development. It is implicated in X-linked mental retardation in humans. This protein family is based on the conservation of a specific arrangement of an N-terminal PDZ domain, a centrally positioned sequence motif termed ASD1 (Apx/Shrm Domain 1) and a C-terminal motif termed ASD2 [, , ]. Shrm2 and Shrm3 contain all three domains, while Shrm4 contains the PDZ and ASD2 domains, but lacks a discernible ASD1 element. To date, the ASD1 and ASD2 elements have only been found in Shrm-related proteins and do not appear in combination with other conserved domains. ASD1 is required for targeting actin, while ASD2 is capable of eliciting an actomyosin based constriction event [, ]. ASD2 is the most highly conserved sequence element shared by Shrm1, Shrm2, Shrm3, and Shrm4. It possesses a well conserved series of leucine residues that exhibit spacing consistent with that of a leucine zipper motif []. Shroom2 is both necessary and sufficient to govern the localization of pigment granules at the apical surface of epithelial cells. Shroom2 is a central regulator of RPE pigmentation. Despite their diverse biological roles, Shroom family proteins share a common activity. Since the locus encoding human SHROOM2 lies within the critical region for two distinct forms of ocular albinism, it is possible that SHROOM2 mutations may contribute to human visual system disorders [].; GO: 0000902 cell morphogenesis, 0005737 cytoplasm; PDB: 3THF_B.
Probab=43.97 E-value=2.3e+02 Score=25.16 Aligned_cols=75 Identities=24% Similarity=0.230 Sum_probs=56.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCC
Q psy4941 3 RLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLRED 82 (172)
Q Consensus 3 ~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d 82 (172)
+|..|...|....+.-.+==...++.|..|...+...-.|.++.-|+-||.-=-++
T Consensus 184 ~L~~k~~~L~~Q~edAk~LKe~~drRe~~v~~iL~~~L~~eq~~dy~~fv~mKa~L------------------------ 239 (264)
T PF08687_consen 184 SLLEKRRLLQRQLEDAKELKENLDRRERVVSEILARYLSEEQLADYRHFVKMKAAL------------------------ 239 (264)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHH------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHH------------------------
Confidence 56667777776666555444556788889999999999999999999998654222
Q ss_pred HHHHHHHHHHHHHHHhHHH
Q psy4941 83 HCDRKSLEEKRDKLASQLE 101 (172)
Q Consensus 83 ~~Er~~L~~k~~~L~~Q~E 101 (172)
-.|...|.+|.++-.+|+.
T Consensus 240 l~eqreLddkiklgeEQL~ 258 (264)
T PF08687_consen 240 LIEQRELDDKIKLGEEQLE 258 (264)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHHHhhHHHHH
Confidence 1577888888888888875
No 15
>KOG2180|consensus
Probab=42.55 E-value=2.4e+02 Score=28.92 Aligned_cols=69 Identities=20% Similarity=0.306 Sum_probs=44.5
Q ss_pred hhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhC--------C-C--CCHHHHHHHHHHHHHHHhHHHHHHHHHHhhh
Q psy4941 43 AETAKYKLHVQEVGHITSLLLGLSGRLARAENALIG--------L-R--EDHCDRKSLEEKRDKLASQLEEAKELKENID 111 (172)
Q Consensus 43 ~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~--------l-~--~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ld 111 (172)
|-|.+|+ |+|+|.+| |+++.++..+|.. . . +.+++ ..+..++.||+..=..+|
T Consensus 163 ~~F~~Yk----~v~~I~~L----s~si~~~k~~l~~qi~~df~~~F~~~~~~~~--------~~~l~~l~daC~v~d~le 226 (793)
T KOG2180|consen 163 NHFIAYK----SVDEIANL----SESIDKLKKSLLSQIFQDFKAAFSGGETHEE--------ALLLQKLSDACLVVDALE 226 (793)
T ss_pred HHHHHhc----chHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC--------ccHHHHHHHHHHHHHHhC
Confidence 3566776 78888765 6777777777763 1 0 11122 245677888888887776
Q ss_pred h--hhhHHHHHHHhcCCh
Q psy4941 112 R--RSTNVATLLRKYLSS 127 (172)
Q Consensus 112 R--Re~~v~~iL~~~L~~ 127 (172)
- |+.+|.-++.+.|++
T Consensus 227 psvreelIkwf~~qqL~e 244 (793)
T KOG2180|consen 227 PSVREELIKWFCSQQLEE 244 (793)
T ss_pred CccHHHHHHHHHHHHHHH
Confidence 5 677777777666553
No 16
>PF13328 HD_4: HD domain; PDB: 3NR1_B.
Probab=41.52 E-value=84 Score=24.25 Aligned_cols=50 Identities=18% Similarity=0.316 Sum_probs=32.9
Q ss_pred HHhhHHHHHHHHhhcCchhHHHH------HHHHHhHHH--------HHHHHHHhHhhHHHHHHH
Q psy4941 26 DRLGEALATRLASEARPAETAKY------KLHVQEVGH--------ITSLLLGLSGRLARAENA 75 (172)
Q Consensus 26 ~~lG~~ve~~V~~~ckp~E~eKy------~~Fi~Dlek--------Vv~LLLsLs~RLaRvena 75 (172)
+.+|..|..+|..+.++....++ ..++..+-+ +--+++.|+-||..+.+.
T Consensus 61 ~~fg~~V~~lV~~lt~~~~~~~~~~~~~~~~~~~~~r~ml~~~~~d~~~~lIKlaDrl~nl~~~ 124 (153)
T PF13328_consen 61 ERFGEDVADLVDALTKIKKLSKKPWEERSEEYAERLRRMLLAMSEDVRAVLIKLADRLHNLRTI 124 (153)
T ss_dssp HHHHHHHHHHHHHT---TTS-HH---HHHHHHHHHGGG-----S-H-HHHHHHHHHHHHHHHHH
T ss_pred HccChHHHHHHHHHHhccccccccchhhHHHHHHHhhhhccccCCchHHHHHHHHHHHHhhccH
Confidence 45899999999999976554444 333333333 456789999999998887
No 17
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=38.72 E-value=1.9e+02 Score=26.91 Aligned_cols=93 Identities=14% Similarity=0.111 Sum_probs=44.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHH-HHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCC
Q psy4941 3 RLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETA-KYKLHVQEVGHITSLLLGLSGRLARAENALIGLRE 81 (172)
Q Consensus 3 ~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~e-Ky~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~ 81 (172)
.|..+|..|+.+...++.+...-++.-.-+++.-...-.|-.-+ +.. -.|+..+..++=..+++++.+..++..+
T Consensus 75 ~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 150 (525)
T TIGR02231 75 ELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRN--EPDLKEWFQAFDFNGSEIERLLTEDREA-- 150 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccC--CCCHHHHHHHHHHHHHHHHHHHHHHHHH--
Confidence 45666777777776666666665555544444432111000000 000 1245555556666677776666555432
Q ss_pred CHHHHHHHHHHHHHHHhHH
Q psy4941 82 DHCDRKSLEEKRDKLASQL 100 (172)
Q Consensus 82 d~~Er~~L~~k~~~L~~Q~ 100 (172)
..+...+.+++..|..++
T Consensus 151 -~~~~~~~~~~l~~l~~~l 168 (525)
T TIGR02231 151 -ERRIRELEKQLSELQNEL 168 (525)
T ss_pred -HHHHHHHHHHHHHHHHHH
Confidence 123344444444444444
No 18
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=37.15 E-value=1.7e+02 Score=23.16 Aligned_cols=29 Identities=17% Similarity=0.359 Sum_probs=23.5
Q ss_pred HHHHHHHHhHHHHHHHHHHhHhhHHHHHH
Q psy4941 46 AKYKLHVQEVGHITSLLLGLSGRLARAEN 74 (172)
Q Consensus 46 eKy~~Fi~DlekVv~LLLsLs~RLaRven 74 (172)
+.-..+-+|++.|-...-.|-+++.++|.
T Consensus 96 ~dv~~i~~dv~~v~~~V~~Le~ki~~ie~ 124 (126)
T PF07889_consen 96 EDVSQIGDDVDSVQQMVEGLEGKIDEIEE 124 (126)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34566778999999999999999998874
No 19
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=35.75 E-value=2.9e+02 Score=23.85 Aligned_cols=87 Identities=21% Similarity=0.286 Sum_probs=55.9
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHH------H
Q psy4941 1 MSRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAE------N 74 (172)
Q Consensus 1 ~~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRve------n 74 (172)
|..|+.|...+..-+..|..++.. ++.=..||.|..+-.+.|+ ..|.+.|| .
T Consensus 3 i~~ir~K~~~lek~k~~i~~e~~~--------------------~e~ee~~L~e~~kE~~~L~--~Er~~h~eeLrqI~~ 60 (230)
T PF10146_consen 3 IKEIRNKTLELEKLKNEILQEVES--------------------LENEEKCLEEYRKEMEELL--QERMAHVEELRQINQ 60 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH
Confidence 356788888888888888887765 3444578888888888876 44544444 3
Q ss_pred HhhCCCCC----HHHHHHHHHHHHHHHhHHHHHHHHHHhhhh
Q psy4941 75 ALIGLRED----HCDRKSLEEKRDKLASQLEEAKELKENIDR 112 (172)
Q Consensus 75 aL~~l~~d----~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldR 112 (172)
-++.++.- ..||.... ....+-+++-..||+.||+
T Consensus 61 DIn~lE~iIkqa~~er~~~~---~~i~r~~eey~~Lk~~in~ 99 (230)
T PF10146_consen 61 DINTLENIIKQAESERNKRQ---EKIQRLYEEYKPLKDEINE 99 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence 34444321 12443333 3445666788899999886
No 20
>TIGR01321 TrpR trp operon repressor, proteobacterial. This model represents TrpR, the repressor of the trp operon. It is found so far only in the gamma subdivision of the proteobacteria and in Chlamydia trachomatis. All members belong to species capable of tryptophan biosynthesis.
Probab=35.05 E-value=1.8e+02 Score=22.05 Aligned_cols=57 Identities=18% Similarity=0.185 Sum_probs=36.1
Q ss_pred hHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHH--hHh------------hHHHHHHHhhCCCCCHHHHHHHHH
Q psy4941 29 GEALATRLASEARPAETAKYKLHVQEVGHITSLLLG--LSG------------RLARAENALIGLREDHCDRKSLEE 91 (172)
Q Consensus 29 G~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLs--Ls~------------RLaRvenaL~~l~~d~~Er~~L~~ 91 (172)
.+.+...+...|+|.|.+-..+=+ +|++.|+. +|. -.+|++|.|...| |+.++-|..
T Consensus 22 ~~~l~~~l~~lLTp~E~~~l~~R~----~i~~~Ll~~~~tQrEIa~~lGiS~atIsR~sn~lk~~~--~~~~~~l~~ 92 (94)
T TIGR01321 22 EDDMQLLLELILTRSEREDLGDRI----RIVNELLNGNMSQREIASKLGVSIATITRGSNNLKTMD--PNFKQFLRK 92 (94)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHH----HHHHHHHhCCCCHHHHHHHhCCChhhhhHHHhhcccCC--HHHHHHHHh
Confidence 356788899999999988443322 24554443 232 3679999998763 455555543
No 21
>PRK11092 bifunctional (p)ppGpp synthetase II/ guanosine-3',5'-bis pyrophosphate 3'-pyrophosphohydrolase; Provisional
Probab=32.64 E-value=3.4e+02 Score=27.10 Aligned_cols=95 Identities=19% Similarity=0.316 Sum_probs=56.2
Q ss_pred HhhHHHHHHHHhhcCchh----------HHHHH-HHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHH---H-----
Q psy4941 27 RLGEALATRLASEARPAE----------TAKYK-LHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDR---K----- 87 (172)
Q Consensus 27 ~lG~~ve~~V~~~ckp~E----------~eKy~-~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er---~----- 87 (172)
.+|..|..+|..+.+... .+.|+ ||++=.+-+--+|..|+-||..+. +|..++++...+ +
T Consensus 88 ~FG~~Va~lV~gvTk~~~l~~~~~~~~q~e~~rkmllam~~DiRVvlIKLADRlhNmr-tL~~~~~ek~~~iA~ETl~iy 166 (702)
T PRK11092 88 LFGKSVAELVEGVSKLDKLKFRDKKEAQAENFRKMIMAMVQDIRVILIKLADRTHNMR-TLGSLRPDKRRRIARETLEIY 166 (702)
T ss_pred HHCHHHHHHHHHHHhhccccccchhhHHHHHHHHHHHHhcCCCceEEEEHHHHHhhHH-HHHhcCccHHHHHHHHHHHHH
Confidence 599999999999998643 44444 333322222245778899999985 577776543211 1
Q ss_pred -HHHHHH--HHHHhHHHHHHHHHHhhhhhhhHHHHHHHhcCChHHHhhHHHHHHh
Q psy4941 88 -SLEEKR--DKLASQLEEAKELKENIDRRSTNVATLLRKYLSSEEFADYNHFINM 139 (172)
Q Consensus 88 -~L~~k~--~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L~~eql~dY~~fv~m 139 (172)
.|-.+. ..++.++|| +-.+||.|+++..-..++..
T Consensus 167 aPlA~rlGi~~ik~eLed-----------------L~f~~l~P~~y~~i~~~~~~ 204 (702)
T PRK11092 167 SPLAHRLGIHHIKTELEE-----------------LGFEALYPNRYRVIKEVVKA 204 (702)
T ss_pred HHHHHHhChHHHHHHHHH-----------------HHHHhhCHHHHHHHHHHHHH
Confidence 111221 234455554 45689999987655544443
No 22
>PF06037 DUF922: Bacterial protein of unknown function (DUF922); InterPro: IPR010321 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=32.16 E-value=2.7e+02 Score=22.47 Aligned_cols=74 Identities=16% Similarity=0.237 Sum_probs=42.2
Q ss_pred hhcCchhHHHHHHHHHhHHHHHHHHHHhHhhH-HHHHHHhhCCCCCHH---HHHHHHHHHHHHHhHHHHHHHHHHhhhhh
Q psy4941 38 SEARPAETAKYKLHVQEVGHITSLLLGLSGRL-ARAENALIGLREDHC---DRKSLEEKRDKLASQLEEAKELKENIDRR 113 (172)
Q Consensus 38 ~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RL-aRvenaL~~l~~d~~---Er~~L~~k~~~L~~Q~EDAkeLKe~ldRR 113 (172)
..+.|...-++..|+.+|.+==.-=-..+-.. ..+|.+|.++++++. =+..+...-..+...+..+. ..+||+
T Consensus 72 ~~~~~~~~~~W~~~~a~l~~HE~~H~~ia~~~a~~ie~~l~~L~~~~~C~~l~~~~~~~~~~~l~~~~~~q---~~fDr~ 148 (161)
T PF06037_consen 72 AKAPPELRQRWDRFSAGLRRHEEVHGRIAREMAREIEKALKGLPPDPDCQKLRAEANRRTDAILARHRQRQ---RDFDRV 148 (161)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHHHHHHHHHHH---Hhhchh
Confidence 45667777889999999943211111112222 358999999987763 23445554444444444433 456665
Q ss_pred h
Q psy4941 114 S 114 (172)
Q Consensus 114 e 114 (172)
|
T Consensus 149 E 149 (161)
T PF06037_consen 149 E 149 (161)
T ss_pred h
Confidence 5
No 23
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=31.04 E-value=2.3e+02 Score=21.22 Aligned_cols=47 Identities=26% Similarity=0.282 Sum_probs=31.8
Q ss_pred HHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHH
Q psy4941 55 VGHITSLLLGLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLE 101 (172)
Q Consensus 55 lekVv~LLLsLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~E 101 (172)
+|++=--+...=.||-+|++.|.+-.-+|+.|.++..-...|+.+++
T Consensus 7 Id~lEekl~~cr~~le~ve~rL~~~eLs~e~R~~lE~E~~~l~~~l~ 53 (85)
T PF15188_consen 7 IDGLEEKLAQCRRRLEAVESRLRRRELSPEARRSLEKELNELKEKLE 53 (85)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHcccCCChHHHHHHHHHHHHHHHHhh
Confidence 34444444555678888899998876678888888866655555543
No 24
>PF05651 Diacid_rec: Putative sugar diacid recognition; InterPro: IPR008599 This region is found in several proteins characterised as carbohydrate diacid regulators (e.g. P36047 from SWISSPROT). An HTH DNA-binding motif is found at the C terminus of these proteins suggesting that this region includes the sugar recognition region.
Probab=30.64 E-value=1.4e+02 Score=23.57 Aligned_cols=42 Identities=21% Similarity=0.396 Sum_probs=36.0
Q ss_pred ChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHHHhh
Q psy4941 126 SSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAALQET 167 (172)
Q Consensus 126 ~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L~es 167 (172)
+|++...|-..+++-+-+++++..+.+....-+.+++.+-..
T Consensus 92 ~p~eV~~~~~lvk~~~Elll~e~~~~~~~~~~~~~~~~~l~~ 133 (135)
T PF05651_consen 92 EPEEVRPYAQLVKKMAELLLEEEYLKEQLQWRERARESFLEE 133 (135)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999888777777665443
No 25
>PRK02224 chromosome segregation protein; Provisional
Probab=30.42 E-value=5.4e+02 Score=25.34 Aligned_cols=38 Identities=21% Similarity=0.157 Sum_probs=25.0
Q ss_pred HhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHHHhh
Q psy4941 130 FADYNHFINMKAKLLVDAREISDKIQLGEEQLAALQET 167 (172)
Q Consensus 130 l~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L~es 167 (172)
-++|..|.+....+-.+.-++.++|.-..+++..+...
T Consensus 550 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 587 (880)
T PRK02224 550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKER 587 (880)
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH
Confidence 45556666666667777777777777777777665543
No 26
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=29.92 E-value=3.1e+02 Score=24.44 Aligned_cols=38 Identities=16% Similarity=0.316 Sum_probs=27.3
Q ss_pred HhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHHHhh
Q psy4941 130 FADYNHFINMKAKLLVDAREISDKIQLGEEQLAALQET 167 (172)
Q Consensus 130 l~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L~es 167 (172)
..+|..|-.-...+..+...+..++.....||+.|+.+
T Consensus 98 ~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~~L~kt 135 (314)
T PF04111_consen 98 WREYNELQLELIEFQEERDSLKNQYEYASNQLDRLRKT 135 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHT-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34444444444556688889999999999999998754
No 27
>KOG3540|consensus
Probab=29.83 E-value=1.8e+02 Score=28.62 Aligned_cols=50 Identities=26% Similarity=0.559 Sum_probs=37.9
Q ss_pred HHHhhhhhhhHHHHHHHhcCChHH------HhhHHHHH--------HhHHHHHHHhhchHHHhhh
Q psy4941 106 LKENIDRRSTNVATLLRKYLSSEE------FADYNHFI--------NMKAKLLVDAREISDKIQL 156 (172)
Q Consensus 106 LKe~ldRRe~~v~~iL~~~L~~eq------l~dY~~fv--------~mKa~Ll~eqreLddkIkl 156 (172)
|..+.-|= .-|...|.+|+-+|| +.-|+|.. +||++.+.--|=|++||--
T Consensus 339 Lqa~pprp-~~Vl~aLkrYvRAEqKdr~HTlrhyqHv~~vDpkkAaqmk~qV~thLrvIeeR~Nq 402 (615)
T KOG3540|consen 339 LQADPPRP-HRVLQALKRYVRAEQKDRMHTLRHYQHVLAVDPKKAAQMKSQVMTHLRVIEERINQ 402 (615)
T ss_pred HhcCCCCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHhcc
Confidence 33333333 456677889999998 67788875 6999999999999999864
No 28
>COG3835 CdaR Sugar diacid utilization regulator [Transcription / Signal transduction mechanisms]
Probab=29.53 E-value=57 Score=30.49 Aligned_cols=28 Identities=25% Similarity=0.573 Sum_probs=25.2
Q ss_pred ChHHHhhHHHHHHhHHHHHHHhhchHHH
Q psy4941 126 SSEEFADYNHFINMKAKLLVDAREISDK 153 (172)
Q Consensus 126 ~~eql~dY~~fv~mKa~Ll~eqreLddk 153 (172)
.|++.++|-..|+|-|-|||||+.+-|-
T Consensus 94 eP~~Vr~y~ELVrm~AElliEQ~~~~Eq 121 (376)
T COG3835 94 EPEEVRKYGELVRMTAELLIEQARLLEQ 121 (376)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5899999999999999999999987653
No 29
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=28.88 E-value=4.6e+02 Score=24.11 Aligned_cols=43 Identities=16% Similarity=0.233 Sum_probs=38.3
Q ss_pred HhcCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHH
Q psy4941 122 RKYLSSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAAL 164 (172)
Q Consensus 122 ~~~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L 164 (172)
...++++...-|..+...+..+..+-.++++++.-.++++...
T Consensus 366 ~~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~l~~~l~~~ 408 (451)
T PF03961_consen 366 QGKLPPEKKEQLKKLKEKKKELKEELKELKEELKELKEELERS 408 (451)
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3568899999999999999999999999999999888888776
No 30
>PF08203 RNA_polI_A14: Yeast RNA polymerase I subunit RPA14; InterPro: IPR013239 Saccharomyces cerevisiae RNA polymerase I (Pol I) is a complex consisting of 14 subunits. Subunit RPA14 forms part of a Pol I subcomplex consisting of RPA14 and and RPA43. The RPA14 and RPA43 heterodimer is proposed to play a role in the recruitment of Pol I to the promoter []. ; PDB: 2RF4_F.
Probab=28.57 E-value=36 Score=25.06 Aligned_cols=39 Identities=26% Similarity=0.346 Sum_probs=24.8
Q ss_pred hhHHHH-HHHHHhHHHHHHHH------------------HHhHhhHHHHHHHhhCCCC
Q psy4941 43 AETAKY-KLHVQEVGHITSLL------------------LGLSGRLARAENALIGLRE 81 (172)
Q Consensus 43 ~E~eKy-~~Fi~DlekVv~LL------------------LsLs~RLaRvenaL~~l~~ 81 (172)
.|.++| ..||+.-|.+.+.+ -+.=+.|.||++-|.||||
T Consensus 16 ~ea~~fL~~FI~~~E~~~~~~~~~~~~~~~t~~s~~t~~ss~LSQLKRiQRdlrGLPP 73 (76)
T PF08203_consen 16 DEAEQFLTEFIDEKESIASNLTNSEDNSAGTGLSSDTGLSSVLSQLKRIQRDLRGLPP 73 (76)
T ss_dssp HHHHHHHHHHHHHHHSS-------------------TTHHHHHHHHHHHHHHHHHS--
T ss_pred HHHHHHHHHHHHHHHHHhccccccccccccccccCCCccHHHHHHHHHHHHhhCCCCC
Confidence 455555 56888888884433 3344689999999999986
No 31
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.89 E-value=1.5e+02 Score=26.21 Aligned_cols=47 Identities=19% Similarity=0.149 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhH
Q psy4941 4 LDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEV 55 (172)
Q Consensus 4 l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~Dl 55 (172)
+..|...|+.+.+.++..+..-.+-+..-.+.+. +++++++|.+|-+
T Consensus 62 ~Q~~~~~L~~ev~~~~~~~~s~~~~~~t~~~~ie-----~~l~~l~~~aG~v 108 (247)
T COG3879 62 LQKKVNTLAAEVEDLENKLDSVRRSVLTDDAALE-----DRLEKLRMLAGSV 108 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHH-----HHHHHHHHHhccC
Confidence 3344444444444444444433322223333333 2677777766643
No 32
>PF07851 TMPIT: TMPIT-like protein; InterPro: IPR012926 A number of members of this family are annotated as being transmembrane proteins induced by tumour necrosis factor alpha, but no literature was found to support this. ; GO: 0016021 integral to membrane
Probab=26.25 E-value=5.1e+02 Score=23.77 Aligned_cols=44 Identities=16% Similarity=0.102 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHH
Q psy4941 6 SKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYK 49 (172)
Q Consensus 6 ~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~ 49 (172)
+-...|+++-+.|++..+.-..-=+++..+....|+.-.-.|++
T Consensus 4 eEW~eL~~efq~Lqethr~Y~qKleel~~lQ~~C~ssI~~Qkkr 47 (330)
T PF07851_consen 4 EEWEELQKEFQELQETHRSYKQKLEELSKLQDKCSSSISHQKKR 47 (330)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556677777777766666655566666555555544444544
No 33
>KOG0996|consensus
Probab=26.11 E-value=8.9e+02 Score=26.46 Aligned_cols=87 Identities=22% Similarity=0.214 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHH
Q psy4941 5 DSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLREDHC 84 (172)
Q Consensus 5 ~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~ 84 (172)
..+..+++++...+.+....+...+.+++. .-+++..++.+...+.=-|-++++..|.+....-+|..
T Consensus 791 ~~~~~~~~~~~~~~ee~~~~lr~~~~~l~~------------~l~~~~~~~k~~~~~~~~l~~~i~~~E~~~~k~~~d~~ 858 (1293)
T KOG0996|consen 791 SDKARQHQEQLHELEERVRKLRERIPELEN------------RLEKLTASVKRLAELIEYLESQIAELEAAVLKKVVDKK 858 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHh------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcHH
Confidence 344455555555555555555555555542 24556667777777778888999999998665444554
Q ss_pred HHHHHHHHHHHHHhHHHHH
Q psy4941 85 DRKSLEEKRDKLASQLEEA 103 (172)
Q Consensus 85 Er~~L~~k~~~L~~Q~EDA 103 (172)
...-+.+...-+.++++++
T Consensus 859 ~l~~~~~~ie~l~kE~e~~ 877 (1293)
T KOG0996|consen 859 RLKELEEQIEELKKEVEEL 877 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4444455566667777776
No 34
>COG1771 Uncharacterized protein conserved in archaea [Function unknown]
Probab=26.10 E-value=1.2e+02 Score=28.99 Aligned_cols=77 Identities=26% Similarity=0.304 Sum_probs=50.1
Q ss_pred HhhHHHHHHHhhCCCCCHH---HHHHHHHHHHHHHhHHHHHHHHHHhhhhhh-----hHHHHHH-HhcCChH--HHhhHH
Q psy4941 66 SGRLARAENALIGLREDHC---DRKSLEEKRDKLASQLEEAKELKENIDRRS-----TNVATLL-RKYLSSE--EFADYN 134 (172)
Q Consensus 66 s~RLaRvenaL~~l~~d~~---Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe-----~~v~~iL-~~~L~~e--ql~dY~ 134 (172)
|+=.||.+..|.++..+.+ |=.-+-.=.-.|..|+|.-++|||...+|| +.|.+.= +.+.++. ..+.|-
T Consensus 4 s~LIar~n~fLdsi~i~~d~Lkev~g~~di~~tLr~nLE~LqdLke~mE~r~Fd~PyknVg~l~~~p~i~E~~~~i~sh~ 83 (471)
T COG1771 4 SELIARINKFLDSISIDNDGLKEVVGLPDIFLTLRKNLESLQDLKEEMERREFDSPYKNVGELGPTPIIKEKLSKIPSHL 83 (471)
T ss_pred hHHHHHHHHHHHHhhccccchhhhcCcHHHHHHHHHHHHHHHHHHHHHHhhccCCccccccccCCCccchhhhhcchHHH
Confidence 5556888888886653322 222233345678899999999999999996 4454432 2333332 245588
Q ss_pred HHHHhHHH
Q psy4941 135 HFINMKAK 142 (172)
Q Consensus 135 ~fv~mKa~ 142 (172)
.|+.||++
T Consensus 84 sy~rrkas 91 (471)
T COG1771 84 SYIRRKAS 91 (471)
T ss_pred HHHHHHHH
Confidence 89999975
No 35
>PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=25.97 E-value=2.1e+02 Score=23.61 Aligned_cols=47 Identities=17% Similarity=0.157 Sum_probs=37.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHH
Q psy4941 2 SRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKY 48 (172)
Q Consensus 2 ~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy 48 (172)
..++.|...|+..-+.|-.-++.|+++...+..++...-..+.++-+
T Consensus 50 ~~LR~~~~~L~~~l~~Li~~Ar~Ne~~~~~~~~l~l~LL~a~sl~~l 96 (225)
T PF04340_consen 50 ERLRERNRQLEEQLEELIENARENEAIFQRLHRLVLALLAARSLQEL 96 (225)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--SHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHH
Confidence 46788899999999999999999999999988888888777665543
No 36
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=25.79 E-value=1.6e+02 Score=22.56 Aligned_cols=34 Identities=35% Similarity=0.406 Sum_probs=27.7
Q ss_pred HHHHHHhhCCC-CCHHHHHHHHHHHHHHHhHHHHH
Q psy4941 70 ARAENALIGLR-EDHCDRKSLEEKRDKLASQLEEA 103 (172)
Q Consensus 70 aRvenaL~~l~-~d~~Er~~L~~k~~~L~~Q~EDA 103 (172)
.+|.++|..++ |..+|=..|..|...|..+.++-
T Consensus 81 ~~v~~~L~~lg~~tk~ev~~L~~RI~~Le~~l~~l 115 (118)
T TIGR01837 81 ERVEQALNRLNIPSREEIEALSAKIEQLAVQVEEL 115 (118)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 36778888876 57789999999999998888764
No 37
>PF06419 COG6: Conserved oligomeric complex COG6; InterPro: IPR010490 COG6 is a component of the conserved oligomeric golgi complex, which is composed of eight different subunits and is required for normal golgi morphology and localisation.
Probab=25.79 E-value=6.2e+02 Score=24.58 Aligned_cols=25 Identities=28% Similarity=0.489 Sum_probs=18.2
Q ss_pred HHhhhhhhhHHHHHHHhc-CChHHHh
Q psy4941 107 KENIDRRSTNVATLLRKY-LSSEEFA 131 (172)
Q Consensus 107 Ke~ldRRe~~v~~iL~~~-L~~eql~ 131 (172)
++.+..|..++..++.+| ||+++.+
T Consensus 89 ~~~~~~k~~ll~~f~~~f~Ls~~E~~ 114 (618)
T PF06419_consen 89 KEELELKKKLLDAFLERFTLSEEEED 114 (618)
T ss_pred HHHHHHHHHHHHHHHHhCCCCHHHHH
Confidence 456677888888888885 7777653
No 38
>PF04719 TAFII28: hTAFII28-like protein conserved region; InterPro: IPR006809 The general transcription factor, TFIID, consists of the TATA-binding protein (TBP) associated with a series of TBP-associated factors (TAFs) that together participate in the assembly of the transcription preinitiation complex. The conserved region is found at the C terminus of most member proteins. The crystal structure of hTAFII28 with hTAFII18 shows that this region is involved in the binding of these two subunits. The conserved region contains four alpha helices and three loops arranged as in histone H3 [, ].; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005634 nucleus; PDB: 1BH9_B 1BH8_B.
Probab=25.24 E-value=43 Score=25.11 Aligned_cols=22 Identities=23% Similarity=0.386 Sum_probs=11.3
Q ss_pred HHHHhcCChHHHhhHHHHHHhH
Q psy4941 119 TLLRKYLSSEEFADYNHFINMK 140 (172)
Q Consensus 119 ~iL~~~L~~eql~dY~~fv~mK 140 (172)
.+|-..||+||+.-|..|-..+
T Consensus 2 ~~L~~~f~~eQ~~Rye~fRRs~ 23 (90)
T PF04719_consen 2 QLLLSNFDEEQLDRYEAFRRSS 23 (90)
T ss_dssp --------HHHHHHHHHHHH--
T ss_pred hHHHHcCCHHHHHHHHHHHHcc
Confidence 3577889999999999997654
No 39
>PF08700 Vps51: Vps51/Vps67; InterPro: IPR014812 The VFT tethering complex (also known as GARP complex, Golgi associated retrograde protein complex, Vps53 tethering complex) is a conserved eukaryotic docking complex which is involved in recycling of proteins from endosomes to the late Golgi. Vps51 (also known as Vps67) is a subunit of VFT and interacts with the SNARE Tlg1 [].
Probab=24.85 E-value=2.4e+02 Score=19.43 Aligned_cols=39 Identities=21% Similarity=0.216 Sum_probs=24.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHH
Q psy4941 4 LDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAETAKYKLHVQEVGHITS 60 (172)
Q Consensus 4 l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~ 60 (172)
|.+....|..+......+++. +-++.|+-||+=-+-|.+
T Consensus 24 i~~~~~~L~~~i~~~~~eLr~------------------~V~~nY~~fI~as~~I~~ 62 (87)
T PF08700_consen 24 IRQLENKLRQEIEEKDEELRK------------------LVYENYRDFIEASDEISS 62 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHH------------------HHHhhHHHHHHHHHHHHH
Confidence 334444555555555555555 678999999986666643
No 40
>PF05597 Phasin: Poly(hydroxyalcanoate) granule associated protein (phasin); InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.
Probab=24.54 E-value=3.6e+02 Score=21.40 Aligned_cols=57 Identities=26% Similarity=0.291 Sum_probs=37.7
Q ss_pred HHHHHHHHhH-HHHHHHHHHhHhhHH-HHHHHhhCCC-CCHHHHHHHHHHHHHHHhHHHH
Q psy4941 46 AKYKLHVQEV-GHITSLLLGLSGRLA-RAENALIGLR-EDHCDRKSLEEKRDKLASQLEE 102 (172)
Q Consensus 46 eKy~~Fi~Dl-ekVv~LLLsLs~RLa-RvenaL~~l~-~d~~Er~~L~~k~~~L~~Q~ED 102 (172)
+....+++|+ +++..=.=.+.+-+- ||.++|+.|. |...|=+.|..|...|..+.+.
T Consensus 68 ~~~~~~~~~~~~~~~~~~dklE~~fd~rV~~aL~rLgvPs~~dv~~L~~rId~L~~~v~~ 127 (132)
T PF05597_consen 68 DQVKSRVDDVKERATGQWDKLEQAFDERVARALNRLGVPSRKDVEALSARIDQLTAQVER 127 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 4444555555 333222223333332 5888999776 6788999999999999988875
No 41
>KOG3091|consensus
Probab=24.53 E-value=6.8e+02 Score=24.55 Aligned_cols=111 Identities=21% Similarity=0.273 Sum_probs=75.1
Q ss_pred HHHhHHHHHHHHHHhHhhHHHHHHHhhCCCC-----CHHHHHHHHHHHHHHHhHHHHHHHHHHhhhh---hhhHHHHHHH
Q psy4941 51 HVQEVGHITSLLLGLSGRLARAENALIGLRE-----DHCDRKSLEEKRDKLASQLEEAKELKENIDR---RSTNVATLLR 122 (172)
Q Consensus 51 Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~-----d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldR---Re~~v~~iL~ 122 (172)
++.-+++.-+=+-.||.|.-||=..+..+.. .+ +-+.|..|..-|..|+++=-++|--|+- +.+... =.
T Consensus 374 ~~~KI~~~k~r~~~Ls~RiLRv~ikqeilr~~G~~L~~-~EE~Lr~Kldtll~~ln~Pnq~k~Rl~~L~e~~r~q~--~~ 450 (508)
T KOG3091|consen 374 AVAKIEEAKNRHVELSHRILRVMIKQEILRKRGYALTP-DEEELRAKLDTLLAQLNAPNQLKARLDELYEILRMQN--SQ 450 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCc-cHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHhhc--ch
Confidence 3344566667788999999999887774421 22 3366899999999999999888876654 112211 24
Q ss_pred hcCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHH
Q psy4941 123 KYLSSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAAL 164 (172)
Q Consensus 123 ~~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L 164 (172)
.++.+.-+-|++..+.||.-|-.+|+.|.--+-...-+++.+
T Consensus 451 ~~~~~~~~iD~~~~~e~~e~lt~~~e~l~~Lv~Ilk~d~edi 492 (508)
T KOG3091|consen 451 LKLQESYWIDFDKLIEMKEHLTQEQEALTKLVNILKGDQEDI 492 (508)
T ss_pred hccccceeechhhhHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 466777788999999999999888765554444333344333
No 42
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=24.48 E-value=82 Score=25.48 Aligned_cols=78 Identities=26% Similarity=0.375 Sum_probs=52.1
Q ss_pred HHHhHhhHHHHHHHhhCCC--CCHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHHHHHhcCChHHHhhHHHHHHh
Q psy4941 62 LLGLSGRLARAENALIGLR--EDHCDRKSLEEKRDKLASQLEEAKELKENIDRRSTNVATLLRKYLSSEEFADYNHFINM 139 (172)
Q Consensus 62 LLsLs~RLaRvenaL~~l~--~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~iL~~~L~~eql~dY~~fv~m 139 (172)
++.+..|+.|+.+.-..+= .+. .+|+........+.++.++--+.+..|-..+.+.+...+..=-+-.|..|=.|
T Consensus 18 ~~~~~~kl~kl~r~Y~~lm~g~~~---~~lE~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~kvgvvRYnAF~dm 94 (151)
T PF14584_consen 18 IIILNIKLRKLKRRYDALMRGKDG---KNLEDLLNELFDQIDELKEELEELEKRIEELEEKLRNCVQKVGVVRYNAFEDM 94 (151)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCCc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccceEEEEEccCcccc
Confidence 3456677777776655431 122 25555555666777777777777777778888888888887778888888777
Q ss_pred HHH
Q psy4941 140 KAK 142 (172)
Q Consensus 140 Ka~ 142 (172)
-+.
T Consensus 95 Gg~ 97 (151)
T PF14584_consen 95 GGD 97 (151)
T ss_pred ccc
Confidence 543
No 43
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=24.21 E-value=1.3e+02 Score=20.96 Aligned_cols=40 Identities=20% Similarity=0.250 Sum_probs=26.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcCchh
Q psy4941 2 SRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEARPAE 44 (172)
Q Consensus 2 ~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ckp~E 44 (172)
+.|+.-+..|......|..|+.. ++.+|.+.-.++.+.|+
T Consensus 6 d~Ls~dVq~L~~kvdqLs~dv~~---lr~~v~~ak~EAaRAN~ 45 (56)
T PF04728_consen 6 DQLSSDVQTLNSKVDQLSSDVNA---LRADVQAAKEEAARANQ 45 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH
Confidence 45667777777777777777654 55666666665555543
No 44
>KOG3088|consensus
Probab=23.67 E-value=1.4e+02 Score=27.41 Aligned_cols=34 Identities=29% Similarity=0.473 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHhhhhhhhHHHH
Q psy4941 86 RKSLEEKRDKLASQLEEAKELKENIDRRSTNVAT 119 (172)
Q Consensus 86 r~~L~~k~~~L~~Q~EDAkeLKe~ldRRe~~v~~ 119 (172)
-..+..|..-|.++.||=+.=-+.+||||+.+..
T Consensus 59 a~~~~~kq~eL~~rqeEL~Rke~ELdRREr~~a~ 92 (313)
T KOG3088|consen 59 AKDLAKKQAELLKKQEELRRKEQELDRRERALAR 92 (313)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhh
Confidence 3456667777888888888888999999999987
No 45
>PF00658 PABP: Poly-adenylate binding protein, unique domain; InterPro: IPR002004 The polyadenylate-binding protein (PABP) has a conserved C-terminal domain (PABC), which is also found in the hyperplastic discs protein (HYD) family of ubiquitin ligases that contain HECT domains (IPR000569 from INTERPRO) []. PABP recognises the 3' mRNA poly(A) tail and plays an essential role in eukaryotic translation initiation and mRNA stabilisation/degradation. PABC domains of PABP are peptide-binding domains that mediate PABP homo-oligomerisation and protein-protein interactions. In mammals, the PABC domain of PABP functions to recruit several different translation factors to the mRNA poly(A) tail [].; GO: 0003723 RNA binding; PDB: 3KUR_E 1JH4_A 2RQH_B 3KUI_A 3KUS_A 3KUJ_A 3KTR_A 2X04_A 3PTH_A 1JGN_A ....
Probab=23.53 E-value=2.8e+02 Score=19.75 Aligned_cols=55 Identities=29% Similarity=0.363 Sum_probs=35.7
Q ss_pred hhHHhhHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHHH
Q psy4941 24 GNDRLGEALATRLASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEEA 103 (172)
Q Consensus 24 ~n~~lG~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~EDA 103 (172)
....+|+.+...|....+ ..-+|||+.||-+ ++.|--.+.+--..|..+.++|
T Consensus 16 qk~~LGe~Ly~~V~~~~p-----------~~A~KITGMLLe~----------------~~~ell~ll~~~~~L~~kv~eA 68 (72)
T PF00658_consen 16 QKQILGERLYPLVQAIYP-----------ELAGKITGMLLEM----------------DNSELLHLLEDPELLREKVQEA 68 (72)
T ss_dssp HHHHHHHHHHHHHHHHTH-----------HHHHHHHHHHTTS----------------CHHHHHHHHHTHHHHHHHHHHH
T ss_pred HHHHHhccccHHHHHhCc-----------chhHHHHHHHhcC----------------CHHHHHHHhCCHHHHHHHHHHH
Confidence 467899999999974333 5668999999865 3344444444445566666666
Q ss_pred HH
Q psy4941 104 KE 105 (172)
Q Consensus 104 ke 105 (172)
-+
T Consensus 69 ~~ 70 (72)
T PF00658_consen 69 IE 70 (72)
T ss_dssp HH
T ss_pred HH
Confidence 43
No 46
>PF09277 Erythro-docking: Erythronolide synthase, docking; InterPro: IPR015357 Docking domains are found in prokaryotic erythronolide synthase. They adopt a structure consisting of a bundle of four alpha-helices, and mediate homodimerisation of the protein, stabilising the resulting complex []. ; PDB: 1PZQ_A.
Probab=23.01 E-value=1.7e+02 Score=20.64 Aligned_cols=37 Identities=19% Similarity=0.384 Sum_probs=22.0
Q ss_pred hHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHH
Q psy4941 65 LSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEE 102 (172)
Q Consensus 65 Ls~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~ED 102 (172)
.+.||-|+|.||..|+.... ..-...+..-|.+--.|
T Consensus 8 v~a~lDrLE~aL~aLpd~dG-H~~V~aRLeaLLR~Wq~ 44 (58)
T PF09277_consen 8 VGAELDRLEAALLALPDEDG-HARVTARLEALLRRWQD 44 (58)
T ss_dssp THHHHHHHHHHHHHHTTSS--THHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHhcCCcccc-ccHHHHHHHHHHHHHHh
Confidence 36799999999999975311 22333444444444433
No 47
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=21.83 E-value=2.4e+02 Score=19.56 Aligned_cols=41 Identities=10% Similarity=0.121 Sum_probs=33.6
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHhhHHhhHHHHHHHHhhcC
Q psy4941 1 MSRLDSKLTLLRSVQVDLNEESRGNDRLGEALATRLASEAR 41 (172)
Q Consensus 1 ~~~l~~kL~~L~~eqe~l~ee~~~n~~lG~~ve~~V~~~ck 41 (172)
|+.|..++...+...+.+.+++..-+.-.+..+..+...|+
T Consensus 1 ~~~i~e~l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~ 41 (71)
T PF10779_consen 1 LQDIKEKLNRIETKLDNHEERIDKLEKRDAANEKDIKNLNK 41 (71)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678888888888888888888888888888888777774
No 48
>PF11365 DUF3166: Protein of unknown function (DUF3166); InterPro: IPR021507 This eukaryotic family of proteins has no known function.
Probab=21.60 E-value=1.1e+02 Score=23.41 Aligned_cols=15 Identities=27% Similarity=0.379 Sum_probs=13.2
Q ss_pred hhHHHHHHHHHhHHH
Q psy4941 43 AETAKYKLHVQEVGH 57 (172)
Q Consensus 43 ~E~eKy~~Fi~Dlek 57 (172)
+|+.||+...||+|.
T Consensus 36 ~EL~kyk~~~g~~d~ 50 (96)
T PF11365_consen 36 EELNKYKSKYGDLDS 50 (96)
T ss_pred HHHHHHHHhcCCCcc
Confidence 588999999999986
No 49
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=21.51 E-value=5.7e+02 Score=22.57 Aligned_cols=67 Identities=19% Similarity=0.273 Sum_probs=31.9
Q ss_pred HhhcCchhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHh
Q psy4941 37 ASEARPAETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEEAKELKEN 109 (172)
Q Consensus 37 ~~~ckp~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ 109 (172)
-+.|.|.+++.++-=+.+++--+. .....|+.++.-+..+ .++-+.+.+++..+..+..+|+...+.
T Consensus 203 ~~~~D~~eL~~lr~eL~~~~~~i~---~~k~~l~el~~el~~l---~~~i~~~~~~k~~l~~eI~e~~~~~~~ 269 (325)
T PF08317_consen 203 IESCDQEELEALRQELAEQKEEIE---AKKKELAELQEELEEL---EEKIEELEEQKQELLAEIAEAEKIREE 269 (325)
T ss_pred hhhcCHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 366888888777744333333322 2222222333222222 122344455555566666666655553
No 50
>smart00150 SPEC Spectrin repeats.
Probab=21.43 E-value=2.5e+02 Score=18.49 Aligned_cols=12 Identities=25% Similarity=0.221 Sum_probs=4.9
Q ss_pred HHHHHhHhhHHH
Q psy4941 60 SLLLGLSGRLAR 71 (172)
Q Consensus 60 ~LLLsLs~RLaR 71 (172)
.-+..++.+|..
T Consensus 55 ~~~~~~~~~L~~ 66 (101)
T smart00150 55 EALNELGEQLIE 66 (101)
T ss_pred HHHHHHHHHHHH
Confidence 333444444443
No 51
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=21.33 E-value=6.9e+02 Score=23.63 Aligned_cols=119 Identities=19% Similarity=0.260 Sum_probs=72.4
Q ss_pred HHHHHhhcCc---hhHHHHHHHHHhHHHHHHHHHHhHhhHHHHHHHhhCCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHh
Q psy4941 33 ATRLASEARP---AETAKYKLHVQEVGHITSLLLGLSGRLARAENALIGLREDHCDRKSLEEKRDKLASQLEEAKELKEN 109 (172)
Q Consensus 33 e~~V~~~ckp---~E~eKy~~Fi~DlekVv~LLLsLs~RLaRvenaL~~l~~d~~Er~~L~~k~~~L~~Q~EDAkeLKe~ 109 (172)
.-++--.|-| .---+|.-||=-+=.-++.+.+++-| |+-|.. .--..|+.|.+.+.++-|--+...|.
T Consensus 23 ~~llp~~~~pfls~~qk~y~~f~f~~iss~gwff~i~~r----e~qlk~-----aa~~llq~kirk~~e~~eglr~i~es 93 (401)
T PF06785_consen 23 AFLLPLVCYPFLSDSQKNYGYFVFSIISSLGWFFAIGRR----EKQLKT-----AAGQLLQTKIRKITEKDEGLRKIRES 93 (401)
T ss_pred HHHHHHhHhhhcCHHHHhcceeehHHHHHhHHHHHhhHH----HHHHHH-----HHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 3344445655 22346888887776777777777766 444443 22256888888888888888888888
Q ss_pred hhhhhhHHHHH--HHhcCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHH
Q psy4941 110 IDRRSTNVATL--LRKYLSSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAAL 164 (172)
Q Consensus 110 ldRRe~~v~~i--L~~~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L 164 (172)
+..|..-...+ =...|-++-++.-.-|++||. ++..||.-|+-.+|.=.||
T Consensus 94 ~~e~q~e~~qL~~qnqkL~nqL~~~~~vf~k~k~----~~q~LE~li~~~~EEn~~l 146 (401)
T PF06785_consen 94 VEERQQESEQLQSQNQKLKNQLFHVREVFMKTKG----DIQHLEGLIRHLREENQCL 146 (401)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcc----hHHHHHHHHHHHHHHHHHH
Confidence 88877554433 122333333444557888885 3455555555555544443
No 52
>PRK01381 Trp operon repressor; Provisional
Probab=20.93 E-value=3.9e+02 Score=20.51 Aligned_cols=58 Identities=26% Similarity=0.230 Sum_probs=38.0
Q ss_pred hHHHHHHHHhhcCchhHHHHHHHHHhHHHHHHHHHH--hHhh------------HHHHHHHhhCCCCCHHHHHHHHHH
Q psy4941 29 GEALATRLASEARPAETAKYKLHVQEVGHITSLLLG--LSGR------------LARAENALIGLREDHCDRKSLEEK 92 (172)
Q Consensus 29 G~~ve~~V~~~ckp~E~eKy~~Fi~DlekVv~LLLs--Ls~R------------LaRvenaL~~l~~d~~Er~~L~~k 92 (172)
.+.+...+...|+|.|.+-...=+ +|+..|+- +|.| ..|+=|+|...+ ++.|+-|.+.
T Consensus 22 ~~~~~~~l~~llTp~Er~al~~R~----~I~~~L~~g~~sQREIa~~lGvSiaTITRgsn~Lk~~~--~~~k~~l~~~ 93 (99)
T PRK01381 22 EDLHLPLLTLLLTPDEREALGTRV----RIVEELLRGELSQREIKQELGVGIATITRGSNSLKTAP--PEFKEWLEQQ 93 (99)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHH----HHHHHHHcCCcCHHHHHHHhCCceeeehhhHHHhccCC--HHHHHHHHHH
Confidence 456778889999999987543222 46666665 4444 467777777754 5666666543
No 53
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=20.48 E-value=3e+02 Score=19.00 Aligned_cols=39 Identities=10% Similarity=0.293 Sum_probs=26.0
Q ss_pred hcCChHHHhhHHHHHHhHHHHHHHhhchHHHhhhcHHHHHHH
Q psy4941 123 KYLSSEEFADYNHFINMKAKLLVDAREISDKIQLGEEQLAAL 164 (172)
Q Consensus 123 ~~L~~eql~dY~~fv~mKa~Ll~eqreLddkIklgEEQL~~L 164 (172)
.-.|+.++..|.+|+..-...|..+.. .|...+.++...
T Consensus 40 ~~~s~~~~~~~~~~~~~l~~~i~~~~~---~~~~~~~~~~~~ 78 (123)
T PF02050_consen 40 QGVSVAQLRNYQRYISALEQAIQQQQQ---ELERLEQEVEQA 78 (123)
T ss_dssp SGGGHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
Confidence 357899999999999887777766544 344444444433
Done!