Query psy13237
Match_columns 84
No_of_seqs 107 out of 211
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 23:39:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13237.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13237hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01267 F-actin_cap_A: F-acti 100.0 3E-31 6.5E-36 197.5 9.1 81 3-83 161-243 (271)
2 KOG0836|consensus 100.0 1.7E-30 3.7E-35 193.6 8.6 82 2-83 167-250 (282)
3 PF07661 MORN_2: MORN repeat v 79.9 2.5 5.3E-05 19.2 2.3 14 24-37 5-18 (22)
4 PF09911 DUF2140: Uncharacteri 75.7 18 0.00039 25.7 6.7 35 12-46 79-124 (187)
5 PF09906 DUF2135: Uncharacteri 71.0 2.6 5.7E-05 24.1 1.3 18 17-34 2-19 (50)
6 PF08863 YolD: YolD-like prote 54.3 11 0.00024 22.8 1.9 22 18-39 40-61 (92)
7 cd08678 C2_C21orf25-like C2 do 51.4 14 0.00031 23.5 2.2 18 15-32 107-124 (126)
8 COG3177 Fic family protein [Fu 50.1 5.4 0.00012 30.6 0.0 12 23-34 190-201 (348)
9 KOG0024|consensus 49.3 5.1 0.00011 31.7 -0.2 18 19-36 40-57 (354)
10 COG2989 Uncharacterized protei 49.0 13 0.00028 31.1 2.1 18 24-41 334-351 (561)
11 smart00676 DM10 Domains in hyp 48.3 29 0.00064 22.4 3.3 34 4-39 6-41 (104)
12 PRK08404 V-type ATP synthase s 46.9 32 0.00068 22.2 3.3 21 55-75 2-22 (103)
13 PF13068 DUF3932: Protein of u 45.9 3.1 6.8E-05 26.0 -1.5 17 2-18 15-31 (81)
14 cd08400 C2_Ras_p21A1 C2 domain 44.7 18 0.00038 23.3 1.9 16 15-30 110-125 (126)
15 cd01254 PH_PLD Phospholipase D 43.9 25 0.00054 22.9 2.5 27 1-27 29-66 (121)
16 PF13708 Methyltransf_27: Meth 42.9 23 0.0005 25.1 2.3 20 22-41 158-178 (194)
17 PF13695 zf-3CxxC: Zinc-bindin 37.2 39 0.00084 21.4 2.5 22 18-39 17-38 (98)
18 PF02661 Fic: Fic/DOC family; 33.6 8.1 0.00018 23.0 -1.0 9 24-32 77-85 (97)
19 cd06408 PB1_NoxR The PB1 domai 33.0 27 0.00058 22.3 1.2 11 20-30 2-12 (86)
20 PF14332 DUF4388: Domain of un 32.4 49 0.0011 20.1 2.4 24 16-39 20-49 (103)
21 PF07166 DUF1398: Protein of u 31.6 1.2E+02 0.0026 20.5 4.3 17 24-40 41-58 (125)
22 PF02246 B1: Protein L b1 doma 31.4 52 0.0011 20.1 2.2 18 21-39 10-27 (69)
23 PF03544 TonB_C: Gram-negative 30.3 89 0.0019 17.7 3.2 21 15-35 12-32 (79)
24 TIGR02887 spore_ger_x_C germin 29.5 2E+02 0.0043 21.7 5.7 25 19-44 255-279 (371)
25 KOG3859|consensus 28.4 1.1E+02 0.0023 24.5 4.1 51 20-72 78-134 (406)
26 PF13216 DUF4024: Protein of u 26.1 76 0.0016 16.9 2.0 23 19-41 3-25 (35)
27 PHA02762 hypothetical protein; 25.7 1.6E+02 0.0034 17.6 3.8 40 2-41 9-48 (62)
28 PF12476 DUF3696: Protein of u 25.4 50 0.0011 18.5 1.4 8 22-29 5-12 (52)
29 PF14231 GXWXG: GXWXG protein 25.0 24 0.00051 21.0 -0.0 11 1-11 15-25 (60)
30 PHA02633 hypothetical protein; 24.8 76 0.0016 19.2 2.2 21 14-34 41-61 (63)
31 PF09266 VirDNA-topo-I_N: Vira 24.3 92 0.002 18.6 2.4 31 27-62 2-32 (58)
32 PF14593 PH_3: PH domain; PDB: 24.3 2.1E+02 0.0045 18.5 6.0 57 9-69 42-103 (104)
33 KOG4173|consensus 22.5 22 0.00047 26.7 -0.6 19 52-70 85-103 (253)
34 PF12637 TSCPD: TSCPD domain; 22.2 1E+02 0.0023 19.3 2.6 24 17-40 6-33 (95)
35 PLN03113 DNA ligase 1; Provisi 22.2 1.2E+02 0.0026 26.2 3.6 31 4-41 392-422 (744)
36 PF11080 DUF2622: Protein of u 21.6 55 0.0012 21.3 1.2 10 22-31 10-19 (96)
37 PF07104 DUF1366: Protein of u 21.1 2.3E+02 0.005 18.9 4.1 16 65-80 59-74 (116)
38 cd07900 Adenylation_DNA_ligase 21.0 96 0.0021 22.1 2.5 21 21-41 42-62 (219)
39 PF10106 DUF2345: Uncharacteri 21.0 74 0.0016 21.7 1.8 37 3-42 62-101 (155)
40 PF03714 PUD: Bacterial pullan 20.9 69 0.0015 20.1 1.5 8 22-29 3-10 (103)
41 PF09346 SMI1_KNR4: SMI1 / KNR 20.9 70 0.0015 19.2 1.5 21 59-79 5-25 (130)
42 PF07270 DUF1438: Protein of u 20.8 33 0.00072 24.0 0.0 10 24-33 45-54 (151)
43 PF14977 FAM194: FAM194 protei 20.7 62 0.0013 23.7 1.4 19 24-42 70-88 (208)
44 PF15121 TMEM71: TMEM71 protei 20.3 47 0.001 23.3 0.7 12 29-40 81-92 (149)
45 KOG0967|consensus 20.1 84 0.0018 27.2 2.2 19 22-40 377-395 (714)
No 1
>PF01267 F-actin_cap_A: F-actin capping protein alpha subunit; InterPro: IPR002189 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. The actin filament system, a prominent part of the cytoskeleton in eukaryotic cells, is both a static structure and a dynamic network that can undergo rearrangements: it is thought to be involved in processes such as cell movement and phagocytosis [], as well as muscle contraction. The F-actin capping protein binds in a calcium-independent manner to the fast growing ends of actin filaments (barbed end) thereby blocking the exchange of subunits at these ends. Unlike gelsolin and severin this protein does not sever actin filaments. The F-actin capping protein is a heterodimer composed of two unrelated subunits: alpha and beta (see IPR001698 from INTERPRO). Neither of the subunits shows sequence similarity to other filament-capping proteins []. The alpha subunit is a protein of about 268 to 286 amino acid residues whose sequence is well conserved in eukaryotic species [].; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0008290 F-actin capping protein complex; PDB: 1MQ1_D 1MWN_X 2KBM_Y 2KZ7_A 3AAE_C 3AA6_A 3LK4_G 1IZN_A 3AA7_A 3AAA_A ....
Probab=99.97 E-value=3e-31 Score=197.48 Aligned_cols=81 Identities=44% Similarity=0.677 Sum_probs=73.9
Q ss_pred cccceeEee--CCCceEEEEEEEeeEeeecccEEEEeccceecceeecCchhhHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy13237 3 DRSLADFIE--GNNVEVKGNLKVQVHYYEDGNVQLVSSKDIKETVVVTSEEQTAREIVKLIEESETAYQQAISENYMIIR 80 (84)
Q Consensus 3 ~~w~~~~~~--~~~~~l~G~I~V~vHYyEdGNVqL~s~k~~~~~~~~s~~~~~A~~iv~~I~~~E~~~q~~L~e~y~~Ls 80 (84)
|||||.|.- ..+.+|+|+|+|+|||||||||||+++|++..++.++++.++|++||++|+++|++||++|+++|++||
T Consensus 161 G~WrS~w~~~~~~~~~l~G~I~V~vHYyEdGNVqL~~~k~~~~~~~~~~~~~~a~~iv~~I~~~E~~~q~~L~e~~~~ls 240 (271)
T PF01267_consen 161 GRWRSEWTVDFSSSGTLSGKIKVQVHYYEDGNVQLNSSKEVSETVSVSNDEQFASDIVKAIKEAENKYQTSLNESYNNLS 240 (271)
T ss_dssp EEEEEEEEEEEETTEEEEEEEEEEEEE-TTEEEEEEEEEEEEEEEE--SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ceeeEEEEEecCCCceEEEEEEeeEEEEeecEEEEEEccccceeeccCChhhhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 999999994 356799999999999999999999999999999998888899999999999999999999999999999
Q ss_pred ccC
Q psy13237 81 KTC 83 (84)
Q Consensus 81 e~t 83 (84)
++|
T Consensus 241 ~~t 243 (271)
T PF01267_consen 241 EKT 243 (271)
T ss_dssp HTH
T ss_pred HHH
Confidence 975
No 2
>KOG0836|consensus
Probab=99.97 E-value=1.7e-30 Score=193.64 Aligned_cols=82 Identities=54% Similarity=0.786 Sum_probs=77.5
Q ss_pred CcccceeEeeC-C-CceEEEEEEEeeEeeecccEEEEeccceecceeecCchhhHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy13237 2 FDRSLADFIEG-N-NVEVKGNLKVQVHYYEDGNVQLVSSKDIKETVVVTSEEQTAREIVKLIEESETAYQQAISENYMII 79 (84)
Q Consensus 2 ~~~w~~~~~~~-~-~~~l~G~I~V~vHYyEdGNVqL~s~k~~~~~~~~s~~~~~A~~iv~~I~~~E~~~q~~L~e~y~~L 79 (84)
-|||||.|.-+ . ..+|+|.|+|||||||||||||.++|++.+++.+++..++|++++++|+++||+||++|+++|++|
T Consensus 167 NG~WRS~w~~~v~~~~~l~G~i~vqvHYyEdGNV~l~s~Kd~qds~~vsn~~q~a~e~~~iI~~~Eneyq~ai~e~yq~l 246 (282)
T KOG0836|consen 167 NGRWRSEWNYDVQPTTELKGRIKVQVHYYEDGNVQLVSSKDIQDSLTVSNEVQTAKEFIKIIEEAENEYQAAISENYQTL 246 (282)
T ss_pred CCceeeeeeeccCchheeeeEEEEEEEEEecCcEEEEecCcchhhheeechHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 49999999932 2 589999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hccC
Q psy13237 80 RKTC 83 (84)
Q Consensus 80 se~t 83 (84)
||+|
T Consensus 247 sD~t 250 (282)
T KOG0836|consen 247 SDTT 250 (282)
T ss_pred ChhH
Confidence 9986
No 3
>PF07661 MORN_2: MORN repeat variant; InterPro: IPR011652 This entry represents an apparent variant of the IPR003409 from INTERPRO repeat.
Probab=79.93 E-value=2.5 Score=19.21 Aligned_cols=14 Identities=29% Similarity=0.572 Sum_probs=10.7
Q ss_pred eeEeeecccEEEEe
Q psy13237 24 QVHYYEDGNVQLVS 37 (84)
Q Consensus 24 ~vHYyEdGNVqL~s 37 (84)
...||++|+++-..
T Consensus 5 ~~~yy~nG~l~~~~ 18 (22)
T PF07661_consen 5 WKFYYENGKLKSEG 18 (22)
T ss_pred EEEEeCCCCEEEEE
Confidence 45799999987654
No 4
>PF09911 DUF2140: Uncharacterized protein conserved in bacteria (DUF2140); InterPro: IPR018672 This family of conserved hypothetical proteins has no known function.
Probab=75.71 E-value=18 Score=25.70 Aligned_cols=35 Identities=34% Similarity=0.413 Sum_probs=22.2
Q ss_pred CCCceEEEEEEE---eeEe--------eecccEEEEeccceeccee
Q psy13237 12 GNNVEVKGNLKV---QVHY--------YEDGNVQLVSSKDIKETVV 46 (84)
Q Consensus 12 ~~~~~l~G~I~V---~vHY--------yEdGNVqL~s~k~~~~~~~ 46 (84)
+....+.|++++ +|.| -|+|||+|+...----+++
T Consensus 79 ~~~v~l~g~~~vlg~~v~~~~~f~P~V~~nGnv~L~~~s~svG~L~ 124 (187)
T PF09911_consen 79 DDQVYLYGTIKVLGQDVPLYMYFDPEVLENGNVQLKAKSLSVGTLP 124 (187)
T ss_pred CCEEEEEEEEEECcEEEEEEEEEEEEEecCCcEEEEeceeEecCcc
Confidence 445677777654 3333 3999999998765433333
No 5
>PF09906 DUF2135: Uncharacterized protein conserved in bacteria (DUF2135); InterPro: IPR019220 This entry, found in various hypothetical prokaryotic proteins, has no known function.
Probab=71.04 E-value=2.6 Score=24.06 Aligned_cols=18 Identities=28% Similarity=0.455 Sum_probs=15.9
Q ss_pred EEEEEEEeeEeeecccEE
Q psy13237 17 VKGNLKVQVHYYEDGNVQ 34 (84)
Q Consensus 17 l~G~I~V~vHYyEdGNVq 34 (84)
+.|.=+|.||||-+++-.
T Consensus 2 ~~G~Y~VyVnYyg~~~~~ 19 (50)
T PF09906_consen 2 PPGTYLVYVNYYGNRSQS 19 (50)
T ss_pred CCceEEEEEEeccCCCcC
Confidence 468899999999999877
No 6
>PF08863 YolD: YolD-like protein; InterPro: IPR014962 These proteins are functionally uncharacterised. However it has been predicted that these proteins are functionally equivalent to the UmuD subunit of polymerase V from Gram-negative bacteria [].
Probab=54.27 E-value=11 Score=22.78 Aligned_cols=22 Identities=41% Similarity=0.644 Sum_probs=16.4
Q ss_pred EEEEEEeeEeeecccEEEEecc
Q psy13237 18 KGNLKVQVHYYEDGNVQLVSSK 39 (84)
Q Consensus 18 ~G~I~V~vHYyEdGNVqL~s~k 39 (84)
.....|.+.||++|+.+=.+.+
T Consensus 40 ~~~~~v~ity~~~g~~~~~~G~ 61 (92)
T PF08863_consen 40 QENQPVTITYYEDGYYQSVTGT 61 (92)
T ss_pred cCCCEEEEEEEECCeeEEEEEE
Confidence 3456788999999998765544
No 7
>cd08678 C2_C21orf25-like C2 domain found in the Human chromosome 21 open reading frame 25 (C21orf25) protein. The members in this cd are named after the Human C21orf25 which contains a single C2 domain. Several other members contain a C1 domain downstream of the C2 domain. No other information on this protein is currently known. The C2 domain was first identified in PKC. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a
Probab=51.36 E-value=14 Score=23.55 Aligned_cols=18 Identities=22% Similarity=0.440 Sum_probs=15.8
Q ss_pred ceEEEEEEEeeEeeeccc
Q psy13237 15 VEVKGNLKVQVHYYEDGN 32 (84)
Q Consensus 15 ~~l~G~I~V~vHYyEdGN 32 (84)
....|.|.++++|++..+
T Consensus 107 ~~~~G~l~l~~~~~~~~~ 124 (126)
T cd08678 107 DSVSGSITVEFLFMEPAE 124 (126)
T ss_pred CCcceEEEEEEEEecccc
Confidence 568999999999999874
No 8
>COG3177 Fic family protein [Function unknown]
Probab=50.13 E-value=5.4 Score=30.65 Aligned_cols=12 Identities=42% Similarity=0.678 Sum_probs=10.0
Q ss_pred EeeEeeecccEE
Q psy13237 23 VQVHYYEDGNVQ 34 (84)
Q Consensus 23 V~vHYyEdGNVq 34 (84)
+-+|.|+|||=+
T Consensus 190 e~IHPF~DGNGR 201 (348)
T COG3177 190 EYIHPFEDGNGR 201 (348)
T ss_pred HhccccccCccH
Confidence 458999999965
No 9
>KOG0024|consensus
Probab=49.32 E-value=5.1 Score=31.66 Aligned_cols=18 Identities=33% Similarity=0.584 Sum_probs=13.8
Q ss_pred EEEEEeeEeeecccEEEE
Q psy13237 19 GNLKVQVHYYEDGNVQLV 36 (84)
Q Consensus 19 G~I~V~vHYyEdGNVqL~ 36 (84)
|.---+||||++|+.-.-
T Consensus 40 GICGSDvHy~~~G~ig~~ 57 (354)
T KOG0024|consen 40 GICGSDVHYYTHGRIGDF 57 (354)
T ss_pred EecCccchhhccCCcCcc
Confidence 444568999999998654
No 10
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.97 E-value=13 Score=31.10 Aligned_cols=18 Identities=33% Similarity=0.552 Sum_probs=15.0
Q ss_pred eeEeeecccEEEEeccce
Q psy13237 24 QVHYYEDGNVQLVSSKDI 41 (84)
Q Consensus 24 ~vHYyEdGNVqL~s~k~~ 41 (84)
.++|||||++.|.+.--+
T Consensus 334 ~l~y~~~G~~vl~~rvVV 351 (561)
T COG2989 334 SLEYYENGREVLRSRVVV 351 (561)
T ss_pred eeeeEeCCcEEEEEEEEe
Confidence 579999999999886544
No 11
>smart00676 DM10 Domains in hypothetical proteins in Drosophila, C. elegans and mammals. Occurs singly in some nucleoside diphosphate kinases.
Probab=48.32 E-value=29 Score=22.37 Aligned_cols=34 Identities=18% Similarity=0.276 Sum_probs=23.1
Q ss_pred ccceeEeeCCCceEEEEE-EEeeEee-ecccEEEEecc
Q psy13237 4 RSLADFIEGNNVEVKGNL-KVQVHYY-EDGNVQLVSSK 39 (84)
Q Consensus 4 ~w~~~~~~~~~~~l~G~I-~V~vHYy-EdGNVqL~s~k 39 (84)
|+.+.|.++.... |.+ ++.++|| |||-|.+...+
T Consensus 6 rF~a~~~d~~~~~--~~~R~~~i~yyl~D~Tvei~E~~ 41 (104)
T smart00676 6 RFDAYWEDPVAMF--YLIRRFKIYYYLEDDTIEVFEPD 41 (104)
T ss_pred EEEEEEecCCCCc--CcEEEEEEEEECCCCeEEEEEcc
Confidence 5677888754322 555 6778877 79999998543
No 12
>PRK08404 V-type ATP synthase subunit H; Validated
Probab=46.90 E-value=32 Score=22.19 Aligned_cols=21 Identities=19% Similarity=0.374 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy13237 55 REIVKLIEESETAYQQAISEN 75 (84)
Q Consensus 55 ~~iv~~I~~~E~~~q~~L~e~ 75 (84)
.++++.|+++|.+|+..|.+.
T Consensus 2 ~e~i~~ik~aE~~~e~~L~~A 22 (103)
T PRK08404 2 EDVIKEIVKAEKEAEERIEKA 22 (103)
T ss_pred hHHHHHHHHHHHHHHHHHHHH
Confidence 367999999999999988765
No 13
>PF13068 DUF3932: Protein of unknown function (DUF3932)
Probab=45.94 E-value=3.1 Score=25.96 Aligned_cols=17 Identities=24% Similarity=0.628 Sum_probs=14.0
Q ss_pred CcccceeEeeCCCceEE
Q psy13237 2 FDRSLADFIEGNNVEVK 18 (84)
Q Consensus 2 ~~~w~~~~~~~~~~~l~ 18 (84)
|+||-+.|+++...+|.
T Consensus 15 fdrwvssfvsdhpaql~ 31 (81)
T PF13068_consen 15 FDRWVSSFVSDHPAQLE 31 (81)
T ss_pred HHHHHHHHhcCCccccc
Confidence 78999999987776664
No 14
>cd08400 C2_Ras_p21A1 C2 domain present in RAS p21 protein activator 1 (RasA1). RasA1 is a GAP1 (GTPase activating protein 1), a Ras-specific GAP member, which suppresses Ras function by enhancing the GTPase activity of Ras proteins resulting in the inactive GDP-bound form of Ras. In this way it can control cellular proliferation and differentiation. RasA1 contains a C2 domain, a Ras-GAP domain, a pleckstrin homology (PH)-like domain, a SH3 domain, and 2 SH2 domains. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficki
Probab=44.69 E-value=18 Score=23.29 Aligned_cols=16 Identities=19% Similarity=0.397 Sum_probs=13.8
Q ss_pred ceEEEEEEEeeEeeec
Q psy13237 15 VEVKGNLKVQVHYYED 30 (84)
Q Consensus 15 ~~l~G~I~V~vHYyEd 30 (84)
....|.|.+.++|+||
T Consensus 110 ~~~~G~i~l~l~~~~~ 125 (126)
T cd08400 110 GGEWGSLRIRARYSHE 125 (126)
T ss_pred CCcCcEEEEEEEEEcc
Confidence 3566999999999997
No 15
>cd01254 PH_PLD Phospholipase D (PLD) pleckstrin homology (PH) domain. Phospholipase D (PLD) pleckstrin homology (PH) domain. PLD hydrolyzes phosphatidylcholine to phosphatidic acid (PtdOH), which can bind target proteins. PLD contains a PH domain, a PX domain and four conserved PLD signature domains. The PLD PH domain is specific for bisphosphorylated inositides. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=43.92 E-value=25 Score=22.92 Aligned_cols=27 Identities=15% Similarity=0.257 Sum_probs=20.0
Q ss_pred CCcccceeEee-----------CCCceEEEEEEEeeEe
Q psy13237 1 MFDRSLADFIE-----------GNNVEVKGNLKVQVHY 27 (84)
Q Consensus 1 ~~~~w~~~~~~-----------~~~~~l~G~I~V~vHY 27 (84)
|..+|..+|+. |....+.|.|.++..|
T Consensus 29 ~~~~w~kRWFvlr~s~L~Y~~~~~~~~~~~vil~D~~f 66 (121)
T cd01254 29 MCDRWQKRWFIVKESFLAYMDDPSSAQILDVILFDVDF 66 (121)
T ss_pred cccCCcceeEEEeCCEEEEEcCCCCCceeeEEEEcCCc
Confidence 34689999982 3356899999987665
No 16
>PF13708 Methyltransf_27: Methyltransferase domain
Probab=42.88 E-value=23 Score=25.08 Aligned_cols=20 Identities=20% Similarity=0.554 Sum_probs=16.5
Q ss_pred EEeeEeeecccEEEEe-ccce
Q psy13237 22 KVQVHYYEDGNVQLVS-SKDI 41 (84)
Q Consensus 22 ~V~vHYyEdGNVqL~s-~k~~ 41 (84)
-+.|-||..||++|.. +.++
T Consensus 158 ~f~vR~fkkGt~Hi~fk~~dl 178 (194)
T PF13708_consen 158 YFRVRYFKKGTAHITFKRPDL 178 (194)
T ss_pred cEEEEEecCCcEEEEECCHHH
Confidence 4789999999999998 4444
No 17
>PF13695 zf-3CxxC: Zinc-binding domain
Probab=37.23 E-value=39 Score=21.38 Aligned_cols=22 Identities=18% Similarity=0.345 Sum_probs=18.1
Q ss_pred EEEEEEeeEeeecccEEEEecc
Q psy13237 18 KGNLKVQVHYYEDGNVQLVSSK 39 (84)
Q Consensus 18 ~G~I~V~vHYyEdGNVqL~s~k 39 (84)
++.+.|.+|+|+.|-|....=+
T Consensus 17 S~~v~i~f~~~~~g~v~~rv~~ 38 (98)
T PF13695_consen 17 SAKVWILFHMYRGGQVNMRVFG 38 (98)
T ss_pred cCEEEEEEEEcCCCeEEEEEEC
Confidence 5788999999999988876543
No 18
>PF02661 Fic: Fic/DOC family; InterPro: IPR003812 This entry contains the Fic (filamentation induced by cAMP) protein and doc (death on curing) protein. The Fic protein is involved in cell division and is suggested to be involved in the synthesis of PAB or folate, indicating that the Fic protein and cAMP are involved in a regulatory mechanism of cell division via folate metabolism []. This family contains a central conserved motif HPFXXGNG in most members. The exact molecular function of these proteins is uncertain. P1 lysogens of Escherichia coli carry the prophage as a stable low copy number plasmid. The frequency with which viable cells cured of prophage are produced is about 10(-5) per cell per generation []. A significant part of this remarkable stability can be attributed to a plasmid-encoded mechanism that causes death of cells that have lost P1 []. In other words, the lysogenic cells appear to be addicted to the presence of the prophage. The plasmid withdrawal response depends on a gene named doc (death on curing) that is represented by this family.; PDB: 3SHG_A 3N3V_A 3N3U_A 2F6S_B 2JK8_B 2VZA_D 2VY3_A 3LET_B 3EQX_A 3SE5_D ....
Probab=33.61 E-value=8.1 Score=22.98 Aligned_cols=9 Identities=44% Similarity=0.921 Sum_probs=7.1
Q ss_pred eeEeeeccc
Q psy13237 24 QVHYYEDGN 32 (84)
Q Consensus 24 ~vHYyEdGN 32 (84)
.+|-|.|||
T Consensus 77 ~iHPF~dGN 85 (97)
T PF02661_consen 77 YIHPFRDGN 85 (97)
T ss_dssp HHCSSSSHH
T ss_pred hcCcccCCC
Confidence 468888888
No 19
>cd06408 PB1_NoxR The PB1 domain is present in the Epichloe festucae NoxR protein (NADPH oxidase regulator), a key regulator of NADPH oxidase isoform, NoxA. NoxA is essential for growth control of the fungal endophyte in plant tissue in the process of symbiotic interaction between a fungi and its plant host. The Epichloe festucae p67(phox)-like regulator, NoxR, dispensable in culture but essential in plants for the symbiotic interaction. Plants infected with a noxR deletion mutant show severe stunting and premature senescence, whereas hyphae in the meristematic tissues show increased branching leading to increased fungal colonization of pseudostem and leaf blade tissue. The PB1 domain is a modular domain mediating specific protein-protein interactions which a play role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is
Probab=33.04 E-value=27 Score=22.28 Aligned_cols=11 Identities=36% Similarity=0.730 Sum_probs=8.7
Q ss_pred EEEEeeEeeec
Q psy13237 20 NLKVQVHYYED 30 (84)
Q Consensus 20 ~I~V~vHYyEd 30 (84)
+|+|.|||-+|
T Consensus 2 ~ikVKv~~~~D 12 (86)
T cd06408 2 KIRVKVHAQDD 12 (86)
T ss_pred cEEEEEEecCc
Confidence 68999998555
No 20
>PF14332 DUF4388: Domain of unknown function (DUF4388)
Probab=32.44 E-value=49 Score=20.08 Aligned_cols=24 Identities=38% Similarity=0.547 Sum_probs=16.7
Q ss_pred eEEEEEEEee------EeeecccEEEEecc
Q psy13237 16 EVKGNLKVQV------HYYEDGNVQLVSSK 39 (84)
Q Consensus 16 ~l~G~I~V~v------HYyEdGNVqL~s~k 39 (84)
+-+|.+.|+. =||+||.|---+..
T Consensus 20 ~ktG~L~v~~~~~~~~i~f~~G~iv~A~~~ 49 (103)
T PF14332_consen 20 RKTGVLEVQSGGGEGRIYFRDGRIVHASSG 49 (103)
T ss_pred CCeEEEEEEeCCcEEEEEEECCEEEEEEeC
Confidence 5567766633 49999998766655
No 21
>PF07166 DUF1398: Protein of unknown function (DUF1398); InterPro: IPR009833 This entry is represented by Bacteriophage DE3, Orf: ECD_10033. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Enterobacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Escherichia coli and Salmonella species. The function of this family is unknown.; PDB: 2HH8_A.
Probab=31.61 E-value=1.2e+02 Score=20.50 Aligned_cols=17 Identities=12% Similarity=0.110 Sum_probs=7.4
Q ss_pred eeEee-ecccEEEEeccc
Q psy13237 24 QVHYY-EDGNVQLVSSKD 40 (84)
Q Consensus 24 ~vHYy-EdGNVqL~s~k~ 40 (84)
++||| ++|+.=...++.
T Consensus 41 ~~~y~~~~d~~V~~~~~~ 58 (125)
T PF07166_consen 41 NTTYVGENDYTVSSKGNY 58 (125)
T ss_dssp EEEEE-SSS-EEEEE-ST
T ss_pred cEEEEEcCCCEEEecccc
Confidence 35665 666544443333
No 22
>PF02246 B1: Protein L b1 domain; InterPro: IPR003147 Protein L is a bacterial protein with immunoglobulin (Ig) light chain-binding properties. It contains a number of homologous b1 repeats towards the N terminus. These repeats have been found to be responsible for the interaction of protein L with Ig light chains [].; PDB: 1KH0_A 1K52_A 1JML_A 2PTL_A 1K50_B 1HZ6_C 1HZ5_B 1K53_B 2KAC_A 2JZP_A ....
Probab=31.37 E-value=52 Score=20.07 Aligned_cols=18 Identities=28% Similarity=0.506 Sum_probs=12.2
Q ss_pred EEEeeEeeecccEEEEecc
Q psy13237 21 LKVQVHYYEDGNVQLVSSK 39 (84)
Q Consensus 21 I~V~vHYyEdGNVqL~s~k 39 (84)
|+.++ ||+||-+|--.-|
T Consensus 10 ik~nl-ifadg~tqtaefk 27 (69)
T PF02246_consen 10 IKANL-IFADGSTQTAEFK 27 (69)
T ss_dssp EEEEE-EETTSEEEEEEEE
T ss_pred EEEEE-EEccCcEEEEEee
Confidence 44443 8999999965433
No 23
>PF03544 TonB_C: Gram-negative bacterial TonB protein C-terminal; InterPro: IPR006260 The sequences in this set all contain a conserved C-terminal domain which is characteristic of TonB and is homologs. TonB is an energy-transducer for TonB-dependent receptors of Gram-negative bacteria []. Most members are designated as TonB or TonB-related proteins, but a few represent the paralogous TolA protein. Several bacteria have up to four TonB paralogs. In nearly every case, a proline-rich repetitive region is found N-terminal to this domain; these low-complexity regions are highly divergent and cannot readily be aligned. The region is suggested to span the periplasm. Iron is essential for growth in both bacteria and mammals. Controlling the amount of free iron in solution is often used as a tactic by hosts to limit invasion of pathogenic microbes; binding iron tightly within protein molecules can accomplish this. Some bacteria express surface receptors to capture eukaryotic iron-binding compounds, while others have evolved siderophores to scavenge iron from iron-binding host proteins []. The absence of free iron molecules in the surrounding environment triggers transcription of gene clusters that encode both siderophore-synthesis ezymes, and receptors that recognise iron-bound siderophores []. An example of the latter is Escherichia coli fepA, which resides in the outer envelope and captures iron-bound enterobactin []. To complete transport of bound iron across the inner membrane, a second receptor complex is needed. The major component of this is tonB, a 27kDa protein that facilitates energy transfer from the proton motive force to outer receptors. B-12 and colicin receptors also make use of the tonB system to drive active transport at the outer membrane.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016020 membrane, 0030288 outer membrane-bounded periplasmic space; PDB: 1U07_B 1IHR_A 2GRX_C 2GSK_B 1QXX_A 1XX3_A 2K9K_A.
Probab=30.33 E-value=89 Score=17.74 Aligned_cols=21 Identities=24% Similarity=0.332 Sum_probs=18.6
Q ss_pred ceEEEEEEEeeEeeecccEEE
Q psy13237 15 VEVKGNLKVQVHYYEDGNVQL 35 (84)
Q Consensus 15 ~~l~G~I~V~vHYyEdGNVqL 35 (84)
..++|.+.|.++-=+||+|.-
T Consensus 12 ~~~~G~v~v~~~I~~~G~v~~ 32 (79)
T PF03544_consen 12 RGIEGTVVVEFTIDPDGRVSD 32 (79)
T ss_dssp HTEEEEEEEEEEEETTTEEEE
T ss_pred CCCeEEEEEEEEEeCCCCEEE
Confidence 467999999999999999974
No 24
>TIGR02887 spore_ger_x_C germination protein, Ger(x)C family. Members of this protein family are restricted to endospore-forming members of the Firmicutes lineage of bacteria, including the genera Bacillus, Clostridium, Thermoanaerobacter, Carboxydothermus, etc. Members are nearly all predicted lipoproteins and belong to probable transport operons, some of which have been characterized as crucial to germination in response to alanine. Members typically have been gene symbols gerKC, gerAC, gerYC, etc.
Probab=29.46 E-value=2e+02 Score=21.69 Aligned_cols=25 Identities=8% Similarity=0.243 Sum_probs=15.0
Q ss_pred EEEEEeeEeeecccEEEEeccceecc
Q psy13237 19 GNLKVQVHYYEDGNVQLVSSKDIKET 44 (84)
Q Consensus 19 G~I~V~vHYyEdGNVqL~s~k~~~~~ 44 (84)
++.++.+++ ++|.+++.-.=.++..
T Consensus 255 ~~~~i~~~~-~~~~~~~~i~i~~~~~ 279 (371)
T TIGR02887 255 SKTKIKIKV-KGGKPHFKIHIKLEGR 279 (371)
T ss_pred cccEEEEEE-ECCeEEEEEEEEEEEE
Confidence 456677775 7777666555444433
No 25
>KOG3859|consensus
Probab=28.36 E-value=1.1e+02 Score=24.45 Aligned_cols=51 Identities=31% Similarity=0.432 Sum_probs=33.6
Q ss_pred EEEEeeEeee--cccEEEEeccceecceeec---CchhhHHHHHHHHHH-HHHHHHHHH
Q psy13237 20 NLKVQVHYYE--DGNVQLVSSKDIKETVVVT---SEEQTAREIVKLIEE-SETAYQQAI 72 (84)
Q Consensus 20 ~I~V~vHYyE--dGNVqL~s~k~~~~~~~~s---~~~~~A~~iv~~I~~-~E~~~q~~L 72 (84)
.++++.+-|| .+||||+-+-. +++-+. +.+...+-||..|.+ +|+-.|..|
T Consensus 78 ~V~L~~~TyelqEsnvrlKLtiv--~tvGfGDQinK~~Syk~iVdyidaQFEaYLQEEL 134 (406)
T KOG3859|consen 78 NVKLQANTYELQESNVRLKLTIV--DTVGFGDQINKEDSYKPIVDYIDAQFEAYLQEEL 134 (406)
T ss_pred CceeecchhhhhhcCeeEEEEEE--eecccccccCcccccchHHHHHHHHHHHHHHHHH
Confidence 4677777776 89999986532 233332 446678889998864 566666554
No 26
>PF13216 DUF4024: Protein of unknown function (DUF4024)
Probab=26.08 E-value=76 Score=16.88 Aligned_cols=23 Identities=17% Similarity=0.368 Sum_probs=16.4
Q ss_pred EEEEEeeEeeecccEEEEeccce
Q psy13237 19 GNLKVQVHYYEDGNVQLVSSKDI 41 (84)
Q Consensus 19 G~I~V~vHYyEdGNVqL~s~k~~ 41 (84)
|---...|.|.|-||.|-.--++
T Consensus 3 glsvt~lhlfrde~vnflfcief 25 (35)
T PF13216_consen 3 GLSVTNLHLFRDEKVNFLFCIEF 25 (35)
T ss_pred ceEEEEEEEeecCCccEEEEehh
Confidence 33345689999999998765443
No 27
>PHA02762 hypothetical protein; Provisional
Probab=25.71 E-value=1.6e+02 Score=17.63 Aligned_cols=40 Identities=15% Similarity=0.298 Sum_probs=32.6
Q ss_pred CcccceeEeeCCCceEEEEEEEeeEeeecccEEEEeccce
Q psy13237 2 FDRSLADFIEGNNVEVKGNLKVQVHYYEDGNVQLVSSKDI 41 (84)
Q Consensus 2 ~~~w~~~~~~~~~~~l~G~I~V~vHYyEdGNVqL~s~k~~ 41 (84)
||.-.-.|..+......|...|.+--=|+||+..-+-.+.
T Consensus 9 fgnlii~~~rs~~ks~eg~afvtigide~g~iayisiep~ 48 (62)
T PHA02762 9 FGNLIIEFKRNVEKSFEGEAFVTIGIDENDKISYISIEPL 48 (62)
T ss_pred CccEEEEEecCccccccccEEEEEeECCCCcEEEEEeccc
Confidence 6666777776667788999999999999999998776654
No 28
>PF12476 DUF3696: Protein of unknown function (DUF3696); InterPro: IPR022532 This domain is found in bacteria and archaea, and is approximately 50 amino acids in length.
Probab=25.36 E-value=50 Score=18.46 Aligned_cols=8 Identities=25% Similarity=0.783 Sum_probs=6.5
Q ss_pred EEeeEeee
Q psy13237 22 KVQVHYYE 29 (84)
Q Consensus 22 ~V~vHYyE 29 (84)
+|.||||+
T Consensus 5 ~v~i~ff~ 12 (52)
T PF12476_consen 5 DVSIYFFE 12 (52)
T ss_pred HEEEEEEE
Confidence 48899994
No 29
>PF14231 GXWXG: GXWXG protein
Probab=24.95 E-value=24 Score=21.03 Aligned_cols=11 Identities=18% Similarity=0.184 Sum_probs=8.4
Q ss_pred CCcccceeEee
Q psy13237 1 MFDRSLADFIE 11 (84)
Q Consensus 1 ~~~~w~~~~~~ 11 (84)
|+|||+..-+.
T Consensus 15 m~G~W~G~~f~ 25 (60)
T PF14231_consen 15 MLGRWRGGEFD 25 (60)
T ss_pred heEecccCccC
Confidence 78999877664
No 30
>PHA02633 hypothetical protein; Provisional
Probab=24.79 E-value=76 Score=19.25 Aligned_cols=21 Identities=5% Similarity=0.045 Sum_probs=17.2
Q ss_pred CceEEEEEEEeeEeeecccEE
Q psy13237 14 NVEVKGNLKVQVHYYEDGNVQ 34 (84)
Q Consensus 14 ~~~l~G~I~V~vHYyEdGNVq 34 (84)
+.--.++|.++||=.+++||-
T Consensus 41 t~c~~~si~l~V~~~~~~~ip 61 (63)
T PHA02633 41 TYSDMMKFDLCICLRIESNID 61 (63)
T ss_pred CeeEEEEEEEEEeeccccccc
Confidence 345678999999999999874
No 31
>PF09266 VirDNA-topo-I_N: Viral DNA topoisomerase I, N-terminal; InterPro: IPR015346 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry represents a domain foundpredominantly found in viral DNA topoisomerase I, a type IB enzyme. This domain assumes a beta(2)-alpha-beta-alpha-beta(2) fold, with a left-handed crossover between strands beta2 and beta3 []. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change; PDB: 3IGC_A 2H7G_X 2H7F_X 1VCC_A.
Probab=24.34 E-value=92 Score=18.64 Aligned_cols=31 Identities=29% Similarity=0.434 Sum_probs=16.3
Q ss_pred eeecccEEEEeccceecceeecCchhhHHHHHHHHH
Q psy13237 27 YYEDGNVQLVSSKDIKETVVVTSEEQTAREIVKLIE 62 (84)
Q Consensus 27 YyEdGNVqL~s~k~~~~~~~~s~~~~~A~~iv~~I~ 62 (84)
||-||++ -..|++...++. +.-+-+|++.|+
T Consensus 2 ~yk~gkl--f~dk~~~~~v~~---~n~~yeIlk~~K 32 (58)
T PF09266_consen 2 YYKDGKL--FFDKIYTNPVPD---DNPTYEILKKYK 32 (58)
T ss_dssp EEETTEE--ESSTTS-SB--T---T-THHHHHHH--
T ss_pred ccccceE--EechhhCCCCCC---ccHHHHHHHHcC
Confidence 7888875 455666655554 334677777664
No 32
>PF14593 PH_3: PH domain; PDB: 1W1H_D 1W1D_A 1W1G_A 2VKI_A.
Probab=24.33 E-value=2.1e+02 Score=18.55 Aligned_cols=57 Identities=21% Similarity=0.311 Sum_probs=36.6
Q ss_pred EeeCCCceEEEEEEEeeEeeecccEEEEeccceecc-----eeecCchhhHHHHHHHHHHHHHHHH
Q psy13237 9 FIEGNNVEVKGNLKVQVHYYEDGNVQLVSSKDIKET-----VVVTSEEQTAREIVKLIEESETAYQ 69 (84)
Q Consensus 9 ~~~~~~~~l~G~I~V~vHYyEdGNVqL~s~k~~~~~-----~~~s~~~~~A~~iv~~I~~~E~~~q 69 (84)
+++|....++|.|-+. .+=.|++++.+.+... ..+..++..|..=+++|++.=..|.
T Consensus 42 Yvdp~~~~~KGeI~~~----~~l~v~~k~~~~F~I~tp~RtY~l~d~~~~A~~W~~~I~~~~~~~~ 103 (104)
T PF14593_consen 42 YVDPKKMVLKGEIPWS----KELSVEVKSFKTFFIHTPKRTYYLEDPEGNAQQWVEAIEEVKKQYY 103 (104)
T ss_dssp EEETTTTEEEEEE--S----TT-EEEECSSSEEEEEETTEEEEEE-TTS-HHHHHHHHHHHHHHHH
T ss_pred EEECCCCeECcEEecC----CceEEEEccCCEEEEECCCcEEEEECCCCCHHHHHHHHHHHHHHhc
Confidence 4577788999999887 5556777776665422 2334667789999999988765553
No 33
>KOG4173|consensus
Probab=22.46 E-value=22 Score=26.75 Aligned_cols=19 Identities=11% Similarity=0.335 Sum_probs=13.3
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy13237 52 QTAREIVKLIEESETAYQQ 70 (84)
Q Consensus 52 ~~A~~iv~~I~~~E~~~q~ 70 (84)
.-+-.+.+.|...|.-|.+
T Consensus 85 agc~~~~d~lD~~E~hY~~ 103 (253)
T KOG4173|consen 85 AGCCQVFDALDDYEHHYHT 103 (253)
T ss_pred cchHHHHhhhhhHHHhhhh
Confidence 3456778888888877743
No 34
>PF12637 TSCPD: TSCPD domain; InterPro: IPR024434 The domain is found in isolation in many proteins where it has a conserved C-terminal motif TSCPD, after which the domain is named. Most copies of the domain possess 4 conserved cysteines that may be part of an Iron-sulphur cluster. This domain is found at the C terminus of some ribonucleoside-diphosphate reductase enzymes.
Probab=22.23 E-value=1e+02 Score=19.32 Aligned_cols=24 Identities=38% Similarity=0.451 Sum_probs=17.8
Q ss_pred EEEEEEEeeEeeec-cc---EEEEeccc
Q psy13237 17 VKGNLKVQVHYYED-GN---VQLVSSKD 40 (84)
Q Consensus 17 l~G~I~V~vHYyEd-GN---VqL~s~k~ 40 (84)
..|++.|.|-|+|| |+ |=.+..|.
T Consensus 6 ~~g~~yvtv~~d~d~g~p~Evf~~~~~~ 33 (95)
T PF12637_consen 6 GCGKLYVTVNFDEDNGRPFEVFINVGKA 33 (95)
T ss_pred cccceEEEEEeeCCCCcceEEEEecCcC
Confidence 45789999999999 77 55555444
No 35
>PLN03113 DNA ligase 1; Provisional
Probab=22.20 E-value=1.2e+02 Score=26.24 Aligned_cols=31 Identities=23% Similarity=0.427 Sum_probs=23.3
Q ss_pred ccceeEeeCCCceEEEEEEEeeEeeecccEEEEeccce
Q psy13237 4 RSLADFIEGNNVEVKGNLKVQVHYYEDGNVQLVSSKDI 41 (84)
Q Consensus 4 ~w~~~~~~~~~~~l~G~I~V~vHYyEdGNVqL~s~k~~ 41 (84)
.|..+|+-++ .++|+||-++|.|+|-|..--
T Consensus 392 ~~~~E~KyDG-------eR~QiH~~~~g~v~lfSRn~e 422 (744)
T PLN03113 392 EFTCEYKYDG-------ERAQIHFLEDGSVEIYSRNAE 422 (744)
T ss_pred CEEEEEeecc-------ceEEEEEecCCEEEEEeCCCC
Confidence 5666666543 489999999999999776543
No 36
>PF11080 DUF2622: Protein of unknown function (DUF2622); InterPro: IPR022597 This family is conserved in the Enterobacteriaceae family. The function is not known.
Probab=21.60 E-value=55 Score=21.33 Aligned_cols=10 Identities=40% Similarity=0.607 Sum_probs=7.9
Q ss_pred EEeeEeeecc
Q psy13237 22 KVQVHYYEDG 31 (84)
Q Consensus 22 ~V~vHYyEdG 31 (84)
-|.|||+|+|
T Consensus 10 VVt~~~~e~~ 19 (96)
T PF11080_consen 10 VVTFEYQEAG 19 (96)
T ss_pred EEEEEeccCC
Confidence 4788888877
No 37
>PF07104 DUF1366: Protein of unknown function (DUF1366); InterPro: IPR009796 This entry is represented by Streptococcus phage 7201, Orf40. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Streptococcus thermophilus bacteriophage proteins of around 130 residues in length. One of the sequences in this family, from phage Sfi11 (O80186 from SWISSPROT) is known as Gp149. The function of this family is unknown.
Probab=21.12 E-value=2.3e+02 Score=18.94 Aligned_cols=16 Identities=25% Similarity=0.266 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHhhhh
Q psy13237 65 ETAYQQAISENYMIIR 80 (84)
Q Consensus 65 E~~~q~~L~e~y~~Ls 80 (84)
|+--+.+.+|.|+.+.
T Consensus 59 e~~p~rA~~Ekf~kvd 74 (116)
T PF07104_consen 59 ENFPQRAENEKFQKVD 74 (116)
T ss_pred HhchhHHHHHHHHHHH
Confidence 3444678888888776
No 38
>cd07900 Adenylation_DNA_ligase_I_Euk Adenylation domain of eukaryotic DNA Ligase I. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. ATP-dependent ligases are present in many organisms such as viruses, bacteriophages, eukarya, archaea and bacteria. Some organisms express a variety of different ligases which appear to be targeted to specific functions. There are three classes of ATP-dependent DNA ligases in eukaryotic cells (I, III and IV). DNA ligase I is required for the ligation of Okazaki fragments during lagging-strand DNA synthesis and for base excision repair (BER). DNA ligases have a highly modular architecture consisting of a unique arrangement of two or more discrete domains. The adenylation and C-terminal oligonucleotide/oligo
Probab=21.02 E-value=96 Score=22.12 Aligned_cols=21 Identities=33% Similarity=0.499 Sum_probs=17.2
Q ss_pred EEEeeEeeecccEEEEeccce
Q psy13237 21 LKVQVHYYEDGNVQLVSSKDI 41 (84)
Q Consensus 21 I~V~vHYyEdGNVqL~s~k~~ 41 (84)
+++++|+-.+|.|+|-|..--
T Consensus 42 ~R~~~h~~~~~~v~l~SR~g~ 62 (219)
T cd07900 42 ERAQIHLLEDGKVKIFSRNLE 62 (219)
T ss_pred eEEEEEEcCCCeEEEECCCCc
Confidence 478899987899999887654
No 39
>PF10106 DUF2345: Uncharacterized protein conserved in bacteria (DUF2345); InterPro: IPR018769 This entry represents the C-terminal domain of a subset of the Rhs element Vgr protein family found only in genomes with type VI secretion loci. However, members of this protein family, although recognizably correlated to type VI secretion according the partial phylogenetic profiling algorithm, are often encoded far from the type VI secretion locus.
Probab=20.98 E-value=74 Score=21.73 Aligned_cols=37 Identities=19% Similarity=0.513 Sum_probs=25.2
Q ss_pred cccceeEeeCCCceE---EEEEEEeeEeeecccEEEEecccee
Q psy13237 3 DRSLADFIEGNNVEV---KGNLKVQVHYYEDGNVQLVSSKDIK 42 (84)
Q Consensus 3 ~~w~~~~~~~~~~~l---~G~I~V~vHYyEdGNVqL~s~k~~~ 42 (84)
|+..+-|.......+ .|.|.+|.| +|++.|.+.|++.
T Consensus 62 ~~~islfa~~~Gi~liA~~G~v~iqAq---~~~~~l~A~k~i~ 101 (155)
T PF10106_consen 62 GEGISLFAQKGGIKLIAAQGKVEIQAQ---SDDMELTAKKDIT 101 (155)
T ss_pred hceEEEEEcccceeEEEcCCcEEEEEc---CCchhhhccCcEE
Confidence 344555662222233 689999998 7888888888875
No 40
>PF03714 PUD: Bacterial pullanase-associated domain; InterPro: IPR005323 Domain is found in pullanase - carbohydrate de-branching - proteins. It is found both to the N or the C-termini of of the alpha-amylase active site region. This domain contains several conserved aromatic residues that are suggestive of a carbohydrate binding function.; GO: 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 2J43_A 2J44_A 2YA1_A 2FHC_A 2FHB_A 2FHF_A 2J73_B 2J72_A 2J71_A.
Probab=20.90 E-value=69 Score=20.10 Aligned_cols=8 Identities=50% Similarity=0.849 Sum_probs=5.8
Q ss_pred EEeeEeee
Q psy13237 22 KVQVHYYE 29 (84)
Q Consensus 22 ~V~vHYyE 29 (84)
.|.|||+.
T Consensus 3 ~v~vhY~r 10 (103)
T PF03714_consen 3 TVRVHYHR 10 (103)
T ss_dssp EEEEEEES
T ss_pred EEEEEEEC
Confidence 57788873
No 41
>PF09346 SMI1_KNR4: SMI1 / KNR4 family (SUKH-1); InterPro: IPR018958 Proteins in this family are involved in the regulation of 1,3-beta-glucan synthase activity and cell-wall formation [, ]. Yeast members of this family are involved in the regulation of cell wall assembly. Saccharomyces cerevisiae (Baker's yeast) protein KNR4 (SMI1) has a regulatory role in chitin deposition and in cell wall assembly []. It was originally identified as a regulator of chitin synthase expression (acting as a repressor) [] and of 1,3-beta-glucan synthase levels []. It was shown to localise in patches at presumptive bud sites in unbudded cells and at the incipient bud site during bud emergence []. KNR4 is believed to connect the PKC1-SLT2 MAPK pathway with cell proliferation. It has been shown to interact with BCK2, a gene involved in cell cycle progression in S. cerevisiae (forming a complex) to allow PKC1 to coordinate the cell cycle (cell proliferation) with cell wall integrity [, ]. PKC1 plays an essential role in cell wall integrity and cell proliferation through a bifurcated PKC1/mitogen-activated protein (MAP) kinase pathway. KNR4 also interacts with the tyrosine-tRNA synthetase protein encoded by TYS1 and is involved in sporulation process []. Note: previously reported evidence that KNR4 may interact with nuclear matrix-association region [] may be due to an artefact [].; PDB: 3D5P_A 2ICG_A.
Probab=20.88 E-value=70 Score=19.24 Aligned_cols=21 Identities=19% Similarity=0.294 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy13237 59 KLIEESETAYQQAISENYMII 79 (84)
Q Consensus 59 ~~I~~~E~~~q~~L~e~y~~L 79 (84)
+.|+++|.+++..|-+.|..+
T Consensus 5 ~~I~~~E~~lg~~LP~~yk~f 25 (130)
T PF09346_consen 5 EEIQELEEKLGVRLPDDYKEF 25 (130)
T ss_dssp HHHHHHHHHHTS---HHHHHH
T ss_pred HHHHHHHHHhCCCCcHHHHHH
Confidence 568999999999998888765
No 42
>PF07270 DUF1438: Protein of unknown function (DUF1438); InterPro: IPR009895 This family consists of several hypothetical proteins of around 170 residues in length, which appear to be mouse specific. The function of this family is unknown.
Probab=20.79 E-value=33 Score=24.03 Aligned_cols=10 Identities=40% Similarity=0.561 Sum_probs=8.7
Q ss_pred eeEeeecccE
Q psy13237 24 QVHYYEDGNV 33 (84)
Q Consensus 24 ~vHYyEdGNV 33 (84)
+-||||||=+
T Consensus 45 ~SHc~edgWi 54 (151)
T PF07270_consen 45 PSHCSEDGWI 54 (151)
T ss_pred CcccccCCce
Confidence 6799999976
No 43
>PF14977 FAM194: FAM194 protein
Probab=20.74 E-value=62 Score=23.68 Aligned_cols=19 Identities=26% Similarity=0.363 Sum_probs=15.1
Q ss_pred eeEeeecccEEEEecccee
Q psy13237 24 QVHYYEDGNVQLVSSKDIK 42 (84)
Q Consensus 24 ~vHYyEdGNVqL~s~k~~~ 42 (84)
-.=||+.||++|+-+..-.
T Consensus 70 g~~y~~~g~~~l~l~~~gG 88 (208)
T PF14977_consen 70 GTCYHPNGNIWLVLNQEGG 88 (208)
T ss_pred EEEEcCCCCEEEEEECCCC
Confidence 3459999999999887643
No 44
>PF15121 TMEM71: TMEM71 protein family
Probab=20.26 E-value=47 Score=23.27 Aligned_cols=12 Identities=50% Similarity=0.803 Sum_probs=10.1
Q ss_pred ecccEEEEeccc
Q psy13237 29 EDGNVQLVSSKD 40 (84)
Q Consensus 29 EdGNVqL~s~k~ 40 (84)
|||||.|+-+..
T Consensus 81 ~dGNITLsPsqT 92 (149)
T PF15121_consen 81 EDGNITLSPSQT 92 (149)
T ss_pred CCCCEeecCccc
Confidence 899999987764
No 45
>KOG0967|consensus
Probab=20.07 E-value=84 Score=27.17 Aligned_cols=19 Identities=37% Similarity=0.688 Sum_probs=16.3
Q ss_pred EEeeEeeecccEEEEeccc
Q psy13237 22 KVQVHYYEDGNVQLVSSKD 40 (84)
Q Consensus 22 ~V~vHYyEdGNVqL~s~k~ 40 (84)
+.|||+-|||+|..-+...
T Consensus 377 RAQIH~~edG~v~IfSRN~ 395 (714)
T KOG0967|consen 377 RAQIHKLEDGTVEIFSRNS 395 (714)
T ss_pred eeeeEEccCCcEEEEeccc
Confidence 6799999999999877654
Done!