Query psy2111
Match_columns 63
No_of_seqs 101 out of 189
Neff 4.5
Searched_HMMs 46136
Date Sat Aug 17 00:33:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2111.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2111hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0147|consensus 99.8 7E-20 1.5E-24 141.8 4.3 54 1-54 495-548 (549)
2 TIGR01622 SF-CC1 splicing fact 99.1 4.8E-11 1E-15 86.6 4.0 41 2-42 416-456 (457)
3 TIGR01645 half-pint poly-U bin 98.9 7.9E-10 1.7E-14 87.0 3.2 37 2-38 571-607 (612)
4 KOG2202|consensus 98.8 6E-09 1.3E-13 75.6 3.1 38 2-39 116-153 (260)
5 TIGR01642 U2AF_lg U2 snRNP aux 98.6 2.5E-08 5.4E-13 73.2 3.1 37 2-38 470-506 (509)
6 KOG1548|consensus 98.5 7.9E-08 1.7E-12 72.7 2.6 32 2-33 320-351 (382)
7 KOG0120|consensus 98.4 1.9E-07 4E-12 72.6 3.1 41 1-41 459-499 (500)
8 PF13893 RRM_5: RNA recognitio 98.4 2.8E-07 6.1E-12 50.4 2.9 30 2-31 27-56 (56)
9 smart00361 RRM_1 RNA recogniti 98.2 8.6E-07 1.9E-11 50.9 2.5 28 2-29 43-70 (70)
10 KOG1996|consensus 97.8 1.6E-05 3.5E-10 59.9 2.8 37 3-39 336-372 (378)
11 KOG0124|consensus 97.6 2.8E-05 6.1E-10 60.3 1.6 36 2-37 503-538 (544)
12 TIGR01661 ELAV_HUD_SF ELAV/HuD 97.5 9.3E-05 2E-09 51.9 2.8 32 3-34 318-349 (352)
13 cd00590 RRM RRM (RNA recogniti 96.8 0.0012 2.6E-08 34.8 2.4 28 3-30 47-74 (74)
14 PF14259 RRM_6: RNA recognitio 96.7 0.0015 3.2E-08 36.2 2.5 26 2-27 45-70 (70)
15 smart00360 RRM RNA recognition 96.5 0.002 4.3E-08 33.5 1.9 28 2-29 44-71 (71)
16 TIGR01628 PABP-1234 polyadenyl 96.4 0.0028 6.1E-08 47.8 2.8 32 2-33 332-363 (562)
17 smart00362 RRM_2 RNA recogniti 96.0 0.007 1.5E-07 31.6 2.5 27 3-29 46-72 (72)
18 PLN03134 glycine-rich RNA-bind 95.3 0.021 4.5E-07 37.2 3.1 30 3-32 83-112 (144)
19 TIGR01661 ELAV_HUD_SF ELAV/HuD 95.3 0.014 3E-07 40.9 2.4 31 2-32 51-81 (352)
20 KOG0148|consensus 95.1 0.017 3.7E-07 43.3 2.4 29 3-31 111-139 (321)
21 PF00076 RRM_1: RNA recognitio 94.6 0.03 6.5E-07 30.0 2.0 26 2-27 45-70 (70)
22 TIGR01648 hnRNP-R-Q heterogene 93.3 0.096 2.1E-06 41.6 3.3 32 2-33 275-306 (578)
23 KOG4208|consensus 92.9 0.11 2.4E-06 37.2 3.0 33 2-34 98-130 (214)
24 COG0724 RNA-binding proteins ( 92.4 0.12 2.5E-06 32.5 2.2 32 2-33 163-194 (306)
25 TIGR01622 SF-CC1 splicing fact 91.8 0.19 4.1E-06 36.7 3.1 31 2-32 234-264 (457)
26 TIGR01659 sex-lethal sex-letha 91.7 0.17 3.7E-06 37.4 2.8 30 3-32 156-185 (346)
27 KOG4212|consensus 91.2 0.2 4.3E-06 40.0 2.8 30 1-30 578-607 (608)
28 TIGR01642 U2AF_lg U2 snRNP aux 91.1 0.26 5.6E-06 36.4 3.2 30 3-32 344-373 (509)
29 PF11608 Limkain-b1: Limkain b 90.1 0.23 5E-06 31.5 1.9 33 2-34 45-77 (90)
30 TIGR01649 hnRNP-L_PTB hnRNP-L/ 90.0 0.35 7.6E-06 36.5 3.1 31 2-32 442-478 (481)
31 KOG0107|consensus 89.4 0.3 6.6E-06 34.6 2.3 31 3-33 54-84 (195)
32 TIGR01649 hnRNP-L_PTB hnRNP-L/ 88.9 0.51 1.1E-05 35.7 3.3 32 3-34 320-351 (481)
33 TIGR01628 PABP-1234 polyadenyl 88.7 0.37 8E-06 36.5 2.4 29 2-30 135-163 (562)
34 TIGR01645 half-pint poly-U bin 88.3 0.36 7.7E-06 38.8 2.2 27 3-29 156-182 (612)
35 TIGR01659 sex-lethal sex-letha 85.7 0.85 1.9E-05 33.8 2.9 32 2-33 241-274 (346)
36 KOG0123|consensus 84.1 1.2 2.6E-05 33.4 3.1 30 3-32 122-151 (369)
37 KOG0131|consensus 83.3 1.3 2.9E-05 31.6 2.8 32 3-34 146-177 (203)
38 COG5175 MOT2 Transcriptional r 83.1 1.2 2.5E-05 34.9 2.7 41 3-44 172-212 (480)
39 PLN03120 nucleic acid binding 82.4 1.7 3.7E-05 31.8 3.2 29 3-32 50-78 (260)
40 KOG0108|consensus 82.2 1.4 3E-05 34.1 2.8 38 3-40 67-104 (435)
41 KOG4206|consensus 77.1 1.8 4E-05 31.1 1.9 31 3-33 59-89 (221)
42 PLN02435 probable UDP-N-acetyl 75.9 2.3 5.1E-05 33.4 2.4 38 3-40 427-475 (493)
43 KOG0114|consensus 74.4 5 0.00011 26.7 3.3 36 4-39 65-100 (124)
44 PF04847 Calcipressin: Calcipr 72.9 2.4 5.2E-05 29.2 1.6 31 2-32 37-69 (184)
45 KOG0145|consensus 72.3 4.2 9.1E-05 30.9 2.9 31 4-34 91-121 (360)
46 KOG4207|consensus 71.7 1.7 3.7E-05 31.8 0.7 28 3-30 62-89 (256)
47 KOG0125|consensus 71.4 4.2 9.2E-05 31.4 2.7 31 3-33 143-173 (376)
48 KOG0110|consensus 71.4 4.8 0.0001 33.4 3.2 30 2-31 15-44 (725)
49 cd04917 ACT_AKiii-LysC-EC_2 AC 70.3 3.5 7.5E-05 22.5 1.6 18 5-22 47-64 (64)
50 cd04920 ACT_AKiii-DAPDC_2 ACT 69.6 3.2 7E-05 23.1 1.4 18 5-22 46-63 (63)
51 KOG0122|consensus 68.1 5.3 0.00011 29.6 2.6 31 3-33 238-268 (270)
52 PF03880 DbpA: DbpA RNA bindin 67.8 3.8 8.3E-05 23.7 1.5 25 6-30 49-73 (74)
53 KOG0145|consensus 67.1 7.6 0.00016 29.5 3.3 29 4-32 328-356 (360)
54 PF12078 DUF3557: Domain of un 65.7 5.1 0.00011 25.1 1.9 34 11-44 4-40 (154)
55 PTZ00339 UDP-N-acetylglucosami 64.3 6.5 0.00014 30.8 2.6 37 3-39 418-465 (482)
56 cd04915 ACT_AK-Ectoine_2 ACT d 64.0 5.2 0.00011 22.5 1.5 18 5-22 49-66 (66)
57 PF07576 BRAP2: BRCA1-associat 57.9 7.1 0.00015 24.9 1.5 23 1-23 59-81 (110)
58 KOG0109|consensus 57.6 4.6 0.0001 30.8 0.7 30 3-32 119-148 (346)
59 cd04918 ACT_AK1-AT_2 ACT domai 56.6 8.4 0.00018 21.3 1.5 18 5-22 48-65 (65)
60 TIGR01648 hnRNP-R-Q heterogene 56.0 6.7 0.00015 31.4 1.4 28 2-29 105-133 (578)
61 PLN03213 repressor of silencin 55.6 15 0.00032 30.3 3.2 38 1-39 55-92 (759)
62 KOG0110|consensus 54.9 13 0.00028 31.0 2.8 31 2-32 566-596 (725)
63 KOG0123|consensus 51.0 18 0.00039 27.2 2.9 31 3-33 44-74 (369)
64 KOG0111|consensus 50.5 10 0.00023 28.2 1.5 31 4-34 60-90 (298)
65 PF08255 Leader_Trp: Trp-opero 50.1 3 6.6E-05 18.3 -0.8 7 14-20 5-11 (14)
66 KOG4661|consensus 47.6 17 0.00038 30.4 2.5 31 3-33 454-484 (940)
67 PF10593 Z1: Z1 domain; Inter 45.1 7.9 0.00017 27.3 0.2 21 17-37 172-196 (239)
68 KOG0146|consensus 44.9 12 0.00025 28.7 1.1 34 1-34 65-101 (371)
69 KOG0121|consensus 44.2 22 0.00048 24.4 2.3 26 5-30 87-112 (153)
70 PF14268 YoaP: YoaP-like 40.8 20 0.00044 19.7 1.4 17 20-36 26-42 (44)
71 COG0424 Maf Nucleotide-binding 40.2 35 0.00076 23.9 2.8 27 2-30 83-109 (193)
72 PF01625 PMSR: Peptide methion 40.2 34 0.00074 22.8 2.7 36 7-42 105-145 (155)
73 KOG0127|consensus 39.2 23 0.00051 29.2 2.1 36 3-38 165-201 (678)
74 cd04937 ACT_AKi-DapG-BS_2 ACT 36.8 29 0.00064 18.9 1.7 17 5-21 47-63 (64)
75 PF05172 Nup35_RRM: Nup53/35/4 36.6 30 0.00065 21.7 1.9 30 2-32 60-90 (100)
76 cd00018 AP2 DNA-binding domain 36.6 47 0.001 18.1 2.5 19 3-21 32-50 (61)
77 KOG0226|consensus 36.4 25 0.00053 26.4 1.7 28 2-29 238-265 (290)
78 cd04922 ACT_AKi-HSDH-ThrA_2 AC 35.8 29 0.00064 18.2 1.6 17 6-22 50-66 (66)
79 KOG4660|consensus 35.7 17 0.00037 29.4 0.8 23 5-27 121-143 (549)
80 cd04916 ACT_AKiii-YclM-BS_2 AC 33.8 35 0.00075 17.9 1.6 17 6-22 50-66 (66)
81 PF05409 Peptidase_C30: Corona 33.1 31 0.00067 26.0 1.8 42 5-47 230-277 (293)
82 PRK05528 methionine sulfoxide 32.3 72 0.0016 21.5 3.3 38 4-42 95-138 (156)
83 PF09292 Neil1-DNA_bind: Endon 30.7 18 0.0004 19.7 0.2 8 17-24 24-33 (39)
84 PF14930 Qn_am_d_aII: Quinohem 30.5 13 0.00028 24.1 -0.5 9 15-23 85-93 (108)
85 PF02324 Glyco_hydro_70: Glyco 30.4 64 0.0014 27.4 3.2 34 2-38 642-675 (809)
86 PRK08961 bifunctional aspartat 29.9 45 0.00097 27.5 2.3 36 5-42 444-479 (861)
87 KOG0146|consensus 29.2 32 0.0007 26.3 1.3 27 2-28 333-359 (371)
88 PRK09084 aspartate kinase III; 28.9 39 0.00085 25.7 1.7 19 5-23 430-448 (448)
89 COG5475 Uncharacterized small 28.5 21 0.00045 21.1 0.1 9 18-26 33-41 (60)
90 PF11767 SET_assoc: Histone ly 27.9 49 0.0011 19.3 1.7 24 3-26 40-63 (66)
91 cd04892 ACT_AK-like_2 ACT doma 27.8 51 0.0011 16.5 1.6 16 6-21 49-64 (65)
92 COG4734 ArdA Antirestriction p 27.6 16 0.00034 25.9 -0.5 42 1-42 1-45 (193)
93 PF01691 Adeno_E1B_19K: Adenov 27.3 5.6 0.00012 26.6 -2.7 25 20-44 31-55 (134)
94 PF11202 PRTase_1: Phosphoribo 26.9 26 0.00056 25.9 0.4 42 3-46 23-64 (257)
95 COG1582 FlgEa Uncharacterized 26.5 20 0.00043 21.6 -0.2 7 15-21 6-12 (67)
96 PF04736 Eclosion: Eclosion ho 26.5 26 0.00057 20.8 0.4 12 14-25 22-33 (62)
97 KOG0106|consensus 26.2 42 0.00092 23.9 1.4 42 2-43 139-180 (216)
98 PRK05925 aspartate kinase; Pro 25.9 42 0.00091 25.8 1.4 20 5-24 417-436 (440)
99 smart00333 TUDOR Tudor domain. 24.2 26 0.00056 18.4 0.0 25 16-42 14-38 (57)
100 PF01205 UPF0029: Uncharacteri 24.2 27 0.00059 22.2 0.1 7 19-25 85-91 (110)
101 KOG0144|consensus 24.1 42 0.00091 27.0 1.2 17 1-17 81-97 (510)
102 KOG2416|consensus 23.9 1.1E+02 0.0023 25.8 3.4 35 3-37 488-525 (718)
103 PF07566 DUF1543: Domain of Un 23.9 20 0.00043 19.9 -0.5 16 9-24 16-31 (52)
104 COG3437 Response regulator con 23.7 44 0.00095 25.7 1.2 24 5-28 320-343 (360)
105 cd04712 BAH_DCM_I BAH, or Brom 23.4 21 0.00046 23.0 -0.5 8 14-21 49-56 (130)
106 PF08777 RRM_3: RNA binding mo 23.3 68 0.0015 19.8 1.8 40 2-41 43-90 (105)
107 COG0225 MsrA Peptide methionin 22.9 1.1E+02 0.0024 21.3 3.0 36 7-42 111-152 (174)
108 KOG0533|consensus 22.8 1.7E+02 0.0036 21.2 3.9 39 5-43 133-171 (243)
109 PF07275 ArdA: Antirestriction 22.7 18 0.0004 23.3 -0.9 8 15-22 14-21 (169)
110 KOG2314|consensus 22.7 49 0.0011 27.5 1.3 41 1-42 110-152 (698)
111 cd07996 WGR_MMR_like WGR domai 22.7 90 0.002 17.5 2.1 14 3-16 49-62 (74)
112 PF10708 DUF2510: Protein of u 22.3 26 0.00057 18.5 -0.2 15 18-32 13-27 (36)
113 PF14743 DNA_ligase_OB_2: DNA 22.2 62 0.0014 18.7 1.4 16 20-35 38-53 (66)
114 KOG0124|consensus 22.1 58 0.0012 26.0 1.6 25 5-29 164-188 (544)
115 PLN02474 UTP--glucose-1-phosph 22.1 1.2E+02 0.0025 24.0 3.2 42 2-43 361-413 (469)
116 KOG0148|consensus 22.1 68 0.0015 24.4 1.9 31 1-31 205-235 (321)
117 KOG2163|consensus 22.0 26 0.00057 29.2 -0.3 30 13-42 674-707 (719)
118 PF10272 Tmpp129: Putative tra 21.8 26 0.00057 26.7 -0.3 8 18-25 321-328 (358)
119 PF05470 eIF-3c_N: Eukaryotic 21.7 31 0.00067 27.8 0.0 14 11-24 463-476 (595)
120 PRK13014 methionine sulfoxide 21.5 1.3E+02 0.0028 20.9 3.1 35 7-42 113-154 (186)
121 COG5470 Uncharacterized conser 21.4 59 0.0013 20.8 1.3 27 14-40 44-70 (96)
122 PF13333 rve_2: Integrase core 21.3 90 0.0019 16.6 1.9 35 2-39 16-50 (52)
123 PLN02551 aspartokinase 20.9 64 0.0014 25.4 1.6 20 5-24 492-511 (521)
124 KOG2068|consensus 20.9 32 0.0007 26.1 -0.0 28 3-30 132-159 (327)
125 PF12148 DUF3590: Protein of u 20.8 35 0.00075 21.2 0.1 28 14-41 6-38 (85)
126 PF08060 NOSIC: NOSIC (NUC001) 20.7 47 0.001 18.3 0.6 12 33-44 24-35 (53)
127 KOG3873|consensus 20.3 44 0.00095 26.3 0.6 14 15-28 107-120 (422)
128 cd04508 TUDOR Tudor domains ar 20.1 39 0.00085 17.1 0.2 13 17-29 11-23 (48)
129 PLN03121 nucleic acid binding 20.1 1.2E+02 0.0025 22.2 2.7 35 3-40 51-85 (243)
No 1
>KOG0147|consensus
Probab=99.79 E-value=7e-20 Score=141.79 Aligned_cols=54 Identities=35% Similarity=0.658 Sum_probs=51.6
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcCcccCCcccccCC
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQAKALVAPLKPSN 54 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~a~~a~~pL~ps~ 54 (63)
+||+|+++|..|+++||||||+||+|+|.|+|++.||.+||++..+...|.++.
T Consensus 495 vrc~s~~~A~~a~~alhgrWF~gr~Ita~~~~~~~Y~~~FP~~~~~~~~~~~~~ 548 (549)
T KOG0147|consen 495 VRCPSAEAAGTAVKALHGRWFAGRMITAKYLPLERYHSKFPDSKAAPLLFHTNE 548 (549)
T ss_pred EecCcHHHHHHHHHHHhhhhhccceeEEEEeehhhhhhhCCCcccceeeeeccC
Confidence 489999999999999999999999999999999999999999999999988764
No 2
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.13 E-value=4.8e-11 Score=86.56 Aligned_cols=41 Identities=17% Similarity=0.341 Sum_probs=39.5
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcC
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQ 42 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~ 42 (63)
+|.+.|+|..|++.||||||+||.|+|+|++++.|...||.
T Consensus 416 ~F~~~e~A~~A~~~lnGr~f~gr~i~~~~~~~~~~~~~~~~ 456 (457)
T TIGR01622 416 KFSSVDAALAAFQALNGRYFGGKMITAAFVVNDVYDMSCLP 456 (457)
T ss_pred EECCHHHHHHHHHHhcCcccCCeEEEEEEEcHHHHHhhcCC
Confidence 68899999999999999999999999999999999999985
No 3
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=98.91 E-value=7.9e-10 Score=86.99 Aligned_cols=37 Identities=16% Similarity=0.164 Sum_probs=35.3
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHE 38 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~ 38 (63)
++.++++|.+|++.||||||+||+|.|+|.+.+.|+.
T Consensus 571 ~F~~~~~A~~A~~~LnGR~F~GR~V~a~~yd~~~f~~ 607 (612)
T TIGR01645 571 EFSDSMEVDRAKAALDGRFFGGRTVVAEAYDQILFDH 607 (612)
T ss_pred EECCHHHHHHHHHHhcCCeECCeEEEEEEcCHHHhhc
Confidence 6889999999999999999999999999999999975
No 4
>KOG2202|consensus
Probab=98.75 E-value=6e-09 Score=75.62 Aligned_cols=38 Identities=16% Similarity=0.288 Sum_probs=35.4
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHER 39 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~ 39 (63)
++-++|+|++|++.||||||+|++|.|+++|++++...
T Consensus 116 ~f~~Ee~ae~a~~~lnnRw~~G~pi~ae~~pvT~~rea 153 (260)
T KOG2202|consen 116 KFRSEEDAEAALEDLNNRWYNGRPIHAELSPVTDFREA 153 (260)
T ss_pred hcccHHHHHHHHHHHcCccccCCcceeeecCcCchhhh
Confidence 56789999999999999999999999999999998764
No 5
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=98.62 E-value=2.5e-08 Score=73.19 Aligned_cols=37 Identities=16% Similarity=0.312 Sum_probs=35.1
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHE 38 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~ 38 (63)
.|.++++|.+|+..||||||+||.|.++|++.+.|..
T Consensus 470 ~F~~~e~A~~A~~~lnGr~~~gr~v~~~~~~~~~~~~ 506 (509)
T TIGR01642 470 EYADVRSAEKAMEGMNGRKFNDRVVVAAFYGEDCYKA 506 (509)
T ss_pred EECCHHHHHHHHHHcCCCEECCeEEEEEEeCHHHhhc
Confidence 5789999999999999999999999999999999975
No 6
>KOG1548|consensus
Probab=98.48 E-value=7.9e-08 Score=72.66 Aligned_cols=32 Identities=19% Similarity=0.351 Sum_probs=28.7
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
.+.+.++|..|++.||||||+||+|+|+...-
T Consensus 320 ~f~n~eeA~~ciq~m~GR~fdgRql~A~i~DG 351 (382)
T KOG1548|consen 320 SFRNNEEADQCIQTMDGRWFDGRQLTASIWDG 351 (382)
T ss_pred EeCChHHHHHHHHHhcCeeecceEEEEEEeCC
Confidence 46789999999999999999999999987643
No 7
>KOG0120|consensus
Probab=98.41 E-value=1.9e-07 Score=72.59 Aligned_cols=41 Identities=27% Similarity=0.378 Sum_probs=38.8
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCc
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFP 41 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP 41 (63)
++|.|+++|++|+.+|+||=|+||.|.+.|++++.||.++.
T Consensus 459 Vefas~ed~qrA~~~L~GrKF~nRtVvtsYydeDkY~~r~~ 499 (500)
T KOG0120|consen 459 VEFADTEDSQRAMEELTGRKFANRTVVASYYDEDKYHAREF 499 (500)
T ss_pred EEecChHHHHHHHHHccCceeCCcEEEEEecCHHHhhcccc
Confidence 36899999999999999999999999999999999999875
No 8
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=98.40 E-value=2.8e-07 Score=50.38 Aligned_cols=30 Identities=27% Similarity=0.393 Sum_probs=27.6
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEe
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYL 31 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~ 31 (63)
.+.+.++|..|++.|||++|+||.|.++|.
T Consensus 27 ~f~~~~~A~~a~~~l~~~~~~g~~l~V~~a 56 (56)
T PF13893_consen 27 EFASVEDAQKAIEQLNGRQFNGRPLKVSYA 56 (56)
T ss_dssp EESSHHHHHHHHHHHTTSEETTEEEEEEEE
T ss_pred EECCHHHHHHHHHHhCCCEECCcEEEEEEC
Confidence 357999999999999999999999999984
No 9
>smart00361 RRM_1 RNA recognition motif.
Probab=98.24 E-value=8.6e-07 Score=50.92 Aligned_cols=28 Identities=25% Similarity=0.328 Sum_probs=25.4
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
.+.+.++|.+|++.||||||+||.|++.
T Consensus 43 ~f~~~~dA~~A~~~l~g~~~~gr~l~~~ 70 (70)
T smart00361 43 TFERSEDAARAIVDLNGRYFDGRTVKAE 70 (70)
T ss_pred EECCHHHHHHHHHHhCCCEECCEEEEeC
Confidence 3578999999999999999999999873
No 10
>KOG1996|consensus
Probab=97.79 E-value=1.6e-05 Score=59.91 Aligned_cols=37 Identities=19% Similarity=0.349 Sum_probs=34.4
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhh
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHER 39 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~ 39 (63)
+..+++|.+|+=.||||||+||.+.|-|+.++.|..+
T Consensus 336 F~r~e~aiKA~VdlnGRyFGGr~v~A~Fyn~ekfs~~ 372 (378)
T KOG1996|consen 336 FERVESAIKAVVDLNGRYFGGRVVSACFYNLEKFSNL 372 (378)
T ss_pred eccHHHHHHHHHhcCCceecceeeeheeccHHhhhhh
Confidence 5678999999999999999999999999999999764
No 11
>KOG0124|consensus
Probab=97.59 E-value=2.8e-05 Score=60.30 Aligned_cols=36 Identities=22% Similarity=0.309 Sum_probs=31.4
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHH
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYH 37 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~ 37 (63)
+++...++.+|.++|+||||+||.|.||.+...+|-
T Consensus 503 efS~~~e~~rak~ALdGRfFgGr~VvAE~YDQ~~FD 538 (544)
T KOG0124|consen 503 EFSIASETHRAKQALDGRFFGGRKVVAEVYDQERFD 538 (544)
T ss_pred eechhhHHHHHHHhhccceecCceeehhhhhhhccc
Confidence 456677889999999999999999999999887764
No 12
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=97.48 E-value=9.3e-05 Score=51.86 Aligned_cols=32 Identities=13% Similarity=0.278 Sum_probs=29.4
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
+.+.++|..|++.|||.+|+||.|++.|....
T Consensus 318 F~~~~~A~~Ai~~lnG~~~~gr~i~V~~~~~~ 349 (352)
T TIGR01661 318 MTNYDEAAMAILSLNGYTLGNRVLQVSFKTNK 349 (352)
T ss_pred ECCHHHHHHHHHHhCCCEECCeEEEEEEccCC
Confidence 57899999999999999999999999998654
No 13
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=96.82 E-value=0.0012 Score=34.76 Aligned_cols=28 Identities=18% Similarity=0.261 Sum_probs=25.8
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
+.+.++|..|++.+||.+|+|+.|++.|
T Consensus 47 f~s~~~a~~a~~~~~~~~~~~~~~~v~~ 74 (74)
T cd00590 47 FEDEEDAEKALEALNGKELGGRPLRVEF 74 (74)
T ss_pred ECCHHHHHHHHHHhCCCeECCeEEEEeC
Confidence 5789999999999999999999999875
No 14
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=96.74 E-value=0.0015 Score=36.18 Aligned_cols=26 Identities=19% Similarity=0.206 Sum_probs=23.8
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVT 27 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~ 27 (63)
.+.|+++|..|++.+||.+|+||.|.
T Consensus 45 ~f~~~~~a~~al~~~~~~~~~g~~l~ 70 (70)
T PF14259_consen 45 EFSSEEDAKRALELLNGKEIDGRKLR 70 (70)
T ss_dssp EESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred EeCCHHHHHHHHHHCCCcEECCEEcC
Confidence 46899999999999999999999873
No 15
>smart00360 RRM RNA recognition motif.
Probab=96.51 E-value=0.002 Score=33.53 Aligned_cols=28 Identities=21% Similarity=0.244 Sum_probs=24.9
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
.+.+.++|..|++.|||..|+|+.|.++
T Consensus 44 ~f~~~~~a~~a~~~~~~~~~~~~~~~v~ 71 (71)
T smart00360 44 EFESEEDAEKALEALNGKELDGRPLKVK 71 (71)
T ss_pred EeCCHHHHHHHHHHcCCCeeCCcEEEeC
Confidence 3678999999999999999999998763
No 16
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=96.40 E-value=0.0028 Score=47.83 Aligned_cols=32 Identities=22% Similarity=0.187 Sum_probs=29.0
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
.+.+.++|.+|++.|||++|+||.|.+.+-..
T Consensus 332 ~f~~~~~A~~A~~~~~g~~~~gk~l~V~~a~~ 363 (562)
T TIGR01628 332 CFSNPEEANRAVTEMHGRMLGGKPLYVALAQR 363 (562)
T ss_pred EeCCHHHHHHHHHHhcCCeeCCceeEEEeccC
Confidence 46899999999999999999999999998653
No 17
>smart00362 RRM_2 RNA recognition motif.
Probab=96.04 E-value=0.007 Score=31.57 Aligned_cols=27 Identities=19% Similarity=0.225 Sum_probs=24.3
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEE
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
+.+.++|..|++.|||..|+|+.|+++
T Consensus 46 f~~~~~a~~a~~~~~~~~~~~~~i~v~ 72 (72)
T smart00362 46 FESEEDAEKAIEALNGTKLGGRPLRVE 72 (72)
T ss_pred eCCHHHHHHHHHHhCCcEECCEEEeeC
Confidence 568899999999999999999998863
No 18
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=95.33 E-value=0.021 Score=37.24 Aligned_cols=30 Identities=13% Similarity=0.046 Sum_probs=27.7
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
+.+.|+|..|++.|||..++||.|.+++-.
T Consensus 83 F~~~e~A~~Al~~lng~~i~Gr~l~V~~a~ 112 (144)
T PLN03134 83 FNDEGAATAAISEMDGKELNGRHIRVNPAN 112 (144)
T ss_pred ECCHHHHHHHHHHcCCCEECCEEEEEEeCC
Confidence 578999999999999999999999999864
No 19
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=95.31 E-value=0.014 Score=40.94 Aligned_cols=31 Identities=29% Similarity=0.404 Sum_probs=28.1
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
.+.++++|.+|++.|||.++.||.|.++|-.
T Consensus 51 ~f~~~~~A~~Ai~~l~g~~l~g~~i~v~~a~ 81 (352)
T TIGR01661 51 NYVRPEDAEKAVNSLNGLRLQNKTIKVSYAR 81 (352)
T ss_pred EECcHHHHHHHHhhcccEEECCeeEEEEeec
Confidence 3678999999999999999999999999864
No 20
>KOG0148|consensus
Probab=95.11 E-value=0.017 Score=43.31 Aligned_cols=29 Identities=14% Similarity=0.388 Sum_probs=26.0
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEe
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYL 31 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~ 31 (63)
+..-++|+.||+.|||-|.++|.|....-
T Consensus 111 f~~k~dAEnAI~~MnGqWlG~R~IRTNWA 139 (321)
T KOG0148|consen 111 FPNKEDAENAIQQMNGQWLGRRTIRTNWA 139 (321)
T ss_pred ccchHHHHHHHHHhCCeeeccceeecccc
Confidence 56789999999999999999999998754
No 21
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=94.60 E-value=0.03 Score=30.04 Aligned_cols=26 Identities=19% Similarity=0.202 Sum_probs=23.5
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVT 27 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~ 27 (63)
.+.+.++|.+|++.|||--++|+.|.
T Consensus 45 ~F~~~~~a~~a~~~l~g~~~~~~~ir 70 (70)
T PF00076_consen 45 EFESEEDAEKALEELNGKKINGRKIR 70 (70)
T ss_dssp EESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred EEcCHHHHHHHHHHcCCCEECccCcC
Confidence 46789999999999999999999884
No 22
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=93.26 E-value=0.096 Score=41.60 Aligned_cols=32 Identities=16% Similarity=0.108 Sum_probs=29.5
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
.|.+.|+|.+|++.|||.-|+||.|.+++...
T Consensus 275 eF~s~e~A~kAi~~lnG~~i~Gr~I~V~~Akp 306 (578)
T TIGR01648 275 HFEDREDAVKAMDELNGKELEGSEIEVTLAKP 306 (578)
T ss_pred EeCCHHHHHHHHHHhCCCEECCEEEEEEEccC
Confidence 47899999999999999999999999998854
No 23
>KOG4208|consensus
Probab=92.94 E-value=0.11 Score=37.21 Aligned_cols=33 Identities=15% Similarity=0.044 Sum_probs=31.5
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
-+.|+|.|..|..+||+=.|-|+++.|.|||.+
T Consensus 98 EFEs~eVA~IaAETMNNYLl~e~lL~c~vmppe 130 (214)
T KOG4208|consen 98 EFESEEVAKIAAETMNNYLLMEHLLECHVMPPE 130 (214)
T ss_pred EeccHHHHHHHHHHhhhhhhhhheeeeEEeCch
Confidence 368999999999999999999999999999998
No 24
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=92.43 E-value=0.12 Score=32.50 Aligned_cols=32 Identities=16% Similarity=0.110 Sum_probs=29.1
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
.+.+.++|..|+..|||-+|.||.|.+.+...
T Consensus 163 ~f~~~~~~~~a~~~~~~~~~~~~~~~v~~~~~ 194 (306)
T COG0724 163 EFESEESAEKAIEELNGKELEGRPLRVQKAQP 194 (306)
T ss_pred EecCHHHHHHHHHHcCCCeECCceeEeecccc
Confidence 35789999999999999999999999999764
No 25
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=91.82 E-value=0.19 Score=36.69 Aligned_cols=31 Identities=16% Similarity=0.217 Sum_probs=28.2
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
.+.+.++|.+|++.|||-.+.||.|.+.|..
T Consensus 234 ~f~~~e~A~~A~~~l~g~~i~g~~i~v~~a~ 264 (457)
T TIGR01622 234 QFHDAEEAKEALEVMNGFELAGRPIKVGYAQ 264 (457)
T ss_pred EECCHHHHHHHHHhcCCcEECCEEEEEEEcc
Confidence 3578999999999999999999999999954
No 26
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=91.74 E-value=0.17 Score=37.42 Aligned_cols=30 Identities=27% Similarity=0.303 Sum_probs=27.6
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
+.++|+|.+|+..|||..+.||.|.++|..
T Consensus 156 F~~~e~A~~Ai~~LnG~~l~gr~i~V~~a~ 185 (346)
T TIGR01659 156 FGSEADSQRAIKNLNGITVRNKRLKVSYAR 185 (346)
T ss_pred EccHHHHHHHHHHcCCCccCCceeeeeccc
Confidence 578999999999999999999999998854
No 27
>KOG4212|consensus
Probab=91.20 E-value=0.2 Score=40.02 Aligned_cols=30 Identities=20% Similarity=0.334 Sum_probs=28.3
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
+|+.++|+|+.|+..|+|--.+||.|.+.|
T Consensus 578 VrF~s~edAEra~a~Mngs~l~Gr~I~V~y 607 (608)
T KOG4212|consen 578 VRFFSPEDAERACALMNGSRLDGRNIKVTY 607 (608)
T ss_pred EEecCHHHHHHHHHHhccCcccCceeeeee
Confidence 378899999999999999999999999987
No 28
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=91.05 E-value=0.26 Score=36.41 Aligned_cols=30 Identities=13% Similarity=-0.069 Sum_probs=27.7
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
+.+.++|..|+..|||-.++|+.|.+++..
T Consensus 344 f~~~~~a~~A~~~l~g~~~~~~~l~v~~a~ 373 (509)
T TIGR01642 344 YKDPSVTDVAIAALNGKDTGDNKLHVQRAC 373 (509)
T ss_pred ECCHHHHHHHHHHcCCCEECCeEEEEEECc
Confidence 578999999999999999999999999864
No 29
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=90.11 E-value=0.23 Score=31.49 Aligned_cols=33 Identities=15% Similarity=0.092 Sum_probs=25.3
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
++.+.|.|..|++.|+|=--=|+.|++.|.|..
T Consensus 45 rF~~~~~A~RA~KRmegEdVfG~kI~v~~~~~~ 77 (90)
T PF11608_consen 45 RFPNQEFAERAQKRMEGEDVFGNKISVSFSPKN 77 (90)
T ss_dssp EESSHHHHHHHHHHHTT--SSSS--EEESS--S
T ss_pred EeCCHHHHHHHHHhhcccccccceEEEEEcCCc
Confidence 678999999999999999999999999999865
No 30
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=89.96 E-value=0.35 Score=36.52 Aligned_cols=31 Identities=16% Similarity=0.050 Sum_probs=27.4
Q ss_pred CCCCHHHHHHHHHHHhccccCCce------eEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKL------VTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~------I~Aey~p 32 (63)
.+.+.++|..|+..|||..++|+- |.+.|..
T Consensus 442 eF~~~e~A~~Al~~ln~~~l~~~~~~~~~~lkv~fs~ 478 (481)
T TIGR01649 442 EWESVEDAVEALIALNHHQLNEPNGSAPYHLKVSFST 478 (481)
T ss_pred EcCCHHHHHHHHHHhcCCccCCCCCCccceEEEEecc
Confidence 468999999999999999999995 8888764
No 31
>KOG0107|consensus
Probab=89.42 E-value=0.3 Score=34.60 Aligned_cols=31 Identities=16% Similarity=-0.018 Sum_probs=28.4
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
+.++-+|..|++.|+|+=|.|..|+||+..-
T Consensus 54 Fed~RDA~DAvr~LDG~~~cG~r~rVE~S~G 84 (195)
T KOG0107|consen 54 FEDPRDAEDAVRYLDGKDICGSRIRVELSTG 84 (195)
T ss_pred ccCcccHHHHHhhcCCccccCceEEEEeecC
Confidence 6788899999999999999999999998754
No 32
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=88.94 E-value=0.51 Score=35.67 Aligned_cols=32 Identities=16% Similarity=0.014 Sum_probs=28.7
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
+.+.++|..|++.|||.=+.||.|.+++...+
T Consensus 320 f~~~~~A~~Ai~~lng~~l~g~~l~v~~s~~~ 351 (481)
T TIGR01649 320 MADPYQAQLALTHLNGVKLFGKPLRVCPSKQQ 351 (481)
T ss_pred ECCHHHHHHHHHHhCCCEECCceEEEEEcccc
Confidence 57899999999999999999999999987543
No 33
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=88.67 E-value=0.37 Score=36.52 Aligned_cols=29 Identities=24% Similarity=0.152 Sum_probs=26.1
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
.+.+.|+|.+|++.|||.+++|+.|.+..
T Consensus 135 ~F~~~e~A~~Ai~~lng~~~~~~~i~v~~ 163 (562)
T TIGR01628 135 HFEKEESAKAAIQKVNGMLLNDKEVYVGR 163 (562)
T ss_pred EECCHHHHHHHHHHhcccEecCceEEEec
Confidence 46789999999999999999999998854
No 34
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=88.26 E-value=0.36 Score=38.79 Aligned_cols=27 Identities=15% Similarity=0.118 Sum_probs=25.6
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEE
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
+.+.++|..|++.|||..++||.|.+.
T Consensus 156 F~s~e~A~~Ai~~lnG~~i~GR~IkV~ 182 (612)
T TIGR01645 156 YEVPEAAQLALEQMNGQMLGGRNIKVG 182 (612)
T ss_pred eCcHHHHHHHHHhcCCeEEecceeeec
Confidence 678999999999999999999999997
No 35
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=85.75 E-value=0.85 Score=33.77 Aligned_cols=32 Identities=16% Similarity=0.066 Sum_probs=27.0
Q ss_pred CCCCHHHHHHHHHHHhccccCC--ceeEEEEech
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNS--KLVTVKYLRY 33 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~G--R~I~Aey~p~ 33 (63)
.+.+.|+|.+|++.|||.-+.| +.|++.+...
T Consensus 241 ~F~~~e~A~~Ai~~lng~~~~g~~~~l~V~~a~~ 274 (346)
T TIGR01659 241 RFNKREEAQEAISALNNVIPEGGSQPLTVRLAEE 274 (346)
T ss_pred EECCHHHHHHHHHHhCCCccCCCceeEEEEECCc
Confidence 3678999999999999999977 6888887654
No 36
>KOG0123|consensus
Probab=84.13 E-value=1.2 Score=33.41 Aligned_cols=30 Identities=23% Similarity=0.277 Sum_probs=26.3
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
+.++++|.+|++.|||..+.|+.|-+.--.
T Consensus 122 f~~e~~a~~ai~~~ng~ll~~kki~vg~~~ 151 (369)
T KOG0123|consen 122 FESEESAKKAIEKLNGMLLNGKKIYVGLFE 151 (369)
T ss_pred eCCHHHHHHHHHHhcCcccCCCeeEEeecc
Confidence 578999999999999999999999875443
No 37
>KOG0131|consensus
Probab=83.33 E-value=1.3 Score=31.59 Aligned_cols=32 Identities=19% Similarity=0.351 Sum_probs=28.6
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
.++.|.+.+|+.+|||-.-..|.|+++|--.+
T Consensus 146 ~~sfeasd~ai~s~ngq~l~nr~itv~ya~k~ 177 (203)
T KOG0131|consen 146 YASFEASDAAIGSMNGQYLCNRPITVSYAFKK 177 (203)
T ss_pred chhHHHHHHHHHHhccchhcCCceEEEEEEec
Confidence 46889999999999999999999999996544
No 38
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=83.13 E-value=1.2 Score=34.85 Aligned_cols=41 Identities=12% Similarity=0.283 Sum_probs=33.1
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcCcc
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQAK 44 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~a~ 44 (63)
..+.|+|..|+....|...|||.|.|.|=. +.|...|-+-.
T Consensus 172 y~~kedAarcIa~vDgs~~DGr~lkatYGT-TKYCtsYLRn~ 212 (480)
T COG5175 172 YSTKEDAARCIAEVDGSLLDGRVLKATYGT-TKYCTSYLRNA 212 (480)
T ss_pred ecchHHHHHHHHHhccccccCceEeeecCc-hHHHHHHHcCC
Confidence 457899999999999999999999999854 55666554433
No 39
>PLN03120 nucleic acid binding protein; Provisional
Probab=82.40 E-value=1.7 Score=31.80 Aligned_cols=29 Identities=21% Similarity=0.040 Sum_probs=25.8
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
+.++++|..|+. |||.-+.||.|.++..+
T Consensus 50 F~d~eaAe~All-LnG~~l~gr~V~Vt~a~ 78 (260)
T PLN03120 50 FKDPQGAETALL-LSGATIVDQSVTITPAE 78 (260)
T ss_pred eCcHHHHHHHHH-hcCCeeCCceEEEEecc
Confidence 578999999995 99999999999998744
No 40
>KOG0108|consensus
Probab=82.18 E-value=1.4 Score=34.14 Aligned_cols=38 Identities=34% Similarity=0.406 Sum_probs=31.9
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhC
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERF 40 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~F 40 (63)
+.+.|+|..|++.|||+=|.||.+.++|-....=+.+.
T Consensus 67 ~~~~~~~~~a~~~lNg~~~~gr~l~v~~~~~~~~~~~~ 104 (435)
T KOG0108|consen 67 FTDEETAERAIRNLNGAEFNGRKLRVNYASNRKNAERS 104 (435)
T ss_pred cCchhhHHHHHHhcCCcccCCceEEeecccccchhHHH
Confidence 45789999999999999999999999997765554443
No 41
>KOG4206|consensus
Probab=77.14 E-value=1.8 Score=31.13 Aligned_cols=31 Identities=26% Similarity=0.266 Sum_probs=27.8
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
+.+++.|..|.++|+|.-|=|+.+.+.|--.
T Consensus 59 Fk~~~~As~A~r~l~gfpFygK~mriqyA~s 89 (221)
T KOG4206|consen 59 FKETEAASAALRALQGFPFYGKPMRIQYAKS 89 (221)
T ss_pred ecChhHHHHHHHHhcCCcccCchhheecccC
Confidence 4678999999999999999999999999643
No 42
>PLN02435 probable UDP-N-acetylglucosamine pyrophosphorylase
Probab=75.91 E-value=2.3 Score=33.42 Aligned_cols=38 Identities=18% Similarity=0.119 Sum_probs=30.6
Q ss_pred CCCHHHHHHHHHHHhcccc--CCcee---------EEEEechhhHHhhC
Q psy2111 3 EVGWEKKRRAYRTLHGAWF--NSKLV---------TVKYLRYERYHERF 40 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF--~GR~I---------~Aey~p~~~Y~~~F 40 (63)
-+|++.|..++..+|.||. +|-.| .+|-.|.-.|..-.
T Consensus 427 ~Dsp~tar~~l~~~~~~wl~~aG~~~~~~~~~~~~~vEisP~~sY~gE~ 475 (493)
T PLN02435 427 FDTPESARLLVLRLHTRWVVAAGGFLTHSVPLYATGVEVSPLCSYAGEN 475 (493)
T ss_pred CCCHHHHHHHHHHHHHHHHHHcCCEecCCccccCCcEEeCCceeeCCCC
Confidence 4789999999999999999 67665 46788887776443
No 43
>KOG0114|consensus
Probab=74.35 E-value=5 Score=26.70 Aligned_cols=36 Identities=17% Similarity=0.179 Sum_probs=30.5
Q ss_pred CCHHHHHHHHHHHhccccCCceeEEEEechhhHHhh
Q psy2111 4 VGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHER 39 (63)
Q Consensus 4 ~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~ 39 (63)
.++++|.+|.+.|+|=.+++|-+.+-|...++=..+
T Consensus 65 edi~dAk~A~dhlsg~n~~~ryl~vlyyq~~~~~~~ 100 (124)
T KOG0114|consen 65 EDIFDAKKACDHLSGYNVDNRYLVVLYYQPEDAFKL 100 (124)
T ss_pred hHhhhHHHHHHHhcccccCCceEEEEecCHHHHHHH
Confidence 478999999999999999999999998876654443
No 44
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=72.86 E-value=2.4 Score=29.16 Aligned_cols=31 Identities=32% Similarity=0.300 Sum_probs=24.5
Q ss_pred CCCCHHHHHHHHHHHh--ccccCCceeEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLH--GAWFNSKLVTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~Lh--GRWF~GR~I~Aey~p 32 (63)
-|.+.++|..|...|| |..|.|..+.+-|..
T Consensus 37 ~f~~~~~A~~~r~~l~~~~~~~~g~~l~~yf~~ 69 (184)
T PF04847_consen 37 VFESPESAQRARQLLHWDGTSFNGKRLRVYFGQ 69 (184)
T ss_dssp E-SSTTHHHHHHHTST--TSEETTEE-EEE---
T ss_pred EeCCHHHHHHHHHHhcccccccCCCceEEEEcc
Confidence 3889999999999999 999999999988873
No 45
>KOG0145|consensus
Probab=72.32 E-value=4.2 Score=30.91 Aligned_cols=31 Identities=29% Similarity=0.387 Sum_probs=27.9
Q ss_pred CCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 4 VGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 4 ~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
.+++||++|++.|||=-.-.|.|.+.|-+..
T Consensus 91 v~p~DAe~AintlNGLrLQ~KTIKVSyARPS 121 (360)
T KOG0145|consen 91 VRPKDAEKAINTLNGLRLQNKTIKVSYARPS 121 (360)
T ss_pred cChHHHHHHHhhhcceeeccceEEEEeccCC
Confidence 4689999999999999999999999997653
No 46
>KOG4207|consensus
Probab=71.72 E-value=1.7 Score=31.84 Aligned_cols=28 Identities=14% Similarity=0.173 Sum_probs=23.6
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
+-.-.+|+.|.++|.|+-.+||-|.+++
T Consensus 62 f~~k~daedA~damDG~~ldgRelrVq~ 89 (256)
T KOG4207|consen 62 FHDKRDAEDALDAMDGAVLDGRELRVQM 89 (256)
T ss_pred eeecchHHHHHHhhcceeeccceeeehh
Confidence 3445789999999999999999997654
No 47
>KOG0125|consensus
Probab=71.40 E-value=4.2 Score=31.35 Aligned_cols=31 Identities=19% Similarity=0.160 Sum_probs=27.4
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
+.+++||++|-..|||.=-.||.|.+.--..
T Consensus 143 men~~dadRARa~LHgt~VEGRkIEVn~ATa 173 (376)
T KOG0125|consen 143 MENPADADRARAELHGTVVEGRKIEVNNATA 173 (376)
T ss_pred ecChhhHHHHHHHhhcceeeceEEEEeccch
Confidence 5789999999999999999999999876443
No 48
>KOG0110|consensus
Probab=71.35 E-value=4.8 Score=33.40 Aligned_cols=30 Identities=20% Similarity=0.368 Sum_probs=27.8
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEe
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYL 31 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~ 31 (63)
++.|+|+|.+|++-+|+-+++--.|++++-
T Consensus 15 Gf~~eeeA~~Ai~~fn~s~i~ts~i~Ve~~ 44 (725)
T KOG0110|consen 15 GFKSEEEAQKAIKYFNKSFINTSRITVEFC 44 (725)
T ss_pred eeccHHHHHHHHHHhhccccccceEEEEeh
Confidence 578999999999999999999999999874
No 49
>cd04917 ACT_AKiii-LysC-EC_2 ACT domains located C-terminal to the catalytic domain of the lysine-sensitive aspartokinase isoenzyme AKIII. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the lysine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in bacteria (Escherichia coli (EC) LysC). Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. The E. coli AKIII (LysC) binds two feedback allosteric inhibitor lysine molecules at the dimer interface located between the ACT1 domain of two subunits. The second ACT domain (ACT2), this CD, is not involved in the binding of heterotrophic effectors. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=70.26 E-value=3.5 Score=22.54 Aligned_cols=18 Identities=28% Similarity=0.359 Sum_probs=15.5
Q ss_pred CHHHHHHHHHHHhccccC
Q psy2111 5 GWEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~ 22 (63)
..+++.+|++.||.++|+
T Consensus 47 ~~~~~~~a~~~Lh~~f~~ 64 (64)
T cd04917 47 KEEDKDEVVQRLHSRLFE 64 (64)
T ss_pred eHHHHHHHHHHHHHHHhC
Confidence 467899999999999885
No 50
>cd04920 ACT_AKiii-DAPDC_2 ACT domains of a bifunctional AKIII (LysC)-like aspartokinase/meso-diaminopimelate decarboxylase (DAPDC). This CD includes the second of two ACT domains of a bifunctional AKIII (LysC)-like aspartokinase/meso-diaminopimelate decarboxylase (DAPDC) bacterial protein. Aspartokinase (AK) is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. The lysA gene encodes the enzyme DAPDC, a pyridoxal-5'-phosphate (PLP)-dependent enzyme which catalyzes the final step in the lysine biosynthetic pathway converting meso-diaminopimelic acid (DAP) to l-lysine. Tandem ACT domains are positioned centrally with the AK catalytic domain N-terminal and the DAPDC domains C-terminal. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=69.64 E-value=3.2 Score=23.09 Aligned_cols=18 Identities=11% Similarity=0.032 Sum_probs=15.2
Q ss_pred CHHHHHHHHHHHhccccC
Q psy2111 5 GWEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~ 22 (63)
+.++..+|++.||..+|.
T Consensus 46 ~~~d~~~av~~LH~~f~~ 63 (63)
T cd04920 46 DEDQADGLCARLHFQLIE 63 (63)
T ss_pred eHHHHHHHHHHHHHHHhC
Confidence 467889999999999873
No 51
>KOG0122|consensus
Probab=68.05 E-value=5.3 Score=29.65 Aligned_cols=31 Identities=16% Similarity=0.252 Sum_probs=27.8
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
+.|-|+|.+|+..|||.=|+.=.+.+|+...
T Consensus 238 F~sRddA~rAI~~LnG~gyd~LILrvEwskP 268 (270)
T KOG0122|consen 238 FESRDDAARAIADLNGYGYDNLILRVEWSKP 268 (270)
T ss_pred EecHHHHHHHHHHccCcccceEEEEEEecCC
Confidence 4678999999999999999999999998754
No 52
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=67.76 E-value=3.8 Score=23.65 Aligned_cols=25 Identities=24% Similarity=0.250 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHhccccCCceeEEEE
Q psy2111 6 WEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 6 ~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
.+.|..+++.|+|..+.||.|.+|-
T Consensus 49 ~~~a~~v~~~l~~~~~~gk~v~ve~ 73 (74)
T PF03880_consen 49 EEVAEKVLEALNGKKIKGKKVRVER 73 (74)
T ss_dssp TT-HHHHHHHHTT--SSS----EEE
T ss_pred HHHHHHHHHHhcCCCCCCeeEEEEE
Confidence 4578999999999999999999873
No 53
>KOG0145|consensus
Probab=67.07 E-value=7.6 Score=29.55 Aligned_cols=29 Identities=14% Similarity=0.367 Sum_probs=26.1
Q ss_pred CCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 4 VGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 4 ~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
.+-++|..|+.+|||--.++|.+++.|-.
T Consensus 328 tNYdEAamAi~sLNGy~lg~rvLQVsFKt 356 (360)
T KOG0145|consen 328 TNYDEAAMAIASLNGYRLGDRVLQVSFKT 356 (360)
T ss_pred cchHHHHHHHHHhcCccccceEEEEEEec
Confidence 46789999999999999999999998854
No 54
>PF12078 DUF3557: Domain of unknown function (DUF3557); InterPro: IPR021942 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 150 amino acids in length.
Probab=65.67 E-value=5.1 Score=25.13 Aligned_cols=34 Identities=24% Similarity=0.277 Sum_probs=27.0
Q ss_pred HHHHHHhccccCCce--eEEEEechhh-HHhhCcCcc
Q psy2111 11 RAYRTLHGAWFNSKL--VTVKYLRYER-YHERFPQAK 44 (63)
Q Consensus 11 ~A~~~LhGRWF~GR~--I~Aey~p~~~-Y~~~FP~a~ 44 (63)
.|.+.|..+.|+||. |.++.+.... .-.++|...
T Consensus 4 eA~k~L~~~lfggR~~~i~V~~l~i~~~~~~rlP~~l 40 (154)
T PF12078_consen 4 EAMKYLIKKLFGGRSHPIKVKKLSISNSGILRLPEGL 40 (154)
T ss_pred HHHHHHHHHHcCCCCceEEEEEEEECCCceeecCCCc
Confidence 588999999999999 7898887766 444777544
No 55
>PTZ00339 UDP-N-acetylglucosamine pyrophosphorylase; Provisional
Probab=64.29 E-value=6.5 Score=30.76 Aligned_cols=37 Identities=14% Similarity=0.179 Sum_probs=28.2
Q ss_pred CCCHHHHHHHHHHHhcccc--CCcee---------EEEEechhhHHhh
Q psy2111 3 EVGWEKKRRAYRTLHGAWF--NSKLV---------TVKYLRYERYHER 39 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF--~GR~I---------~Aey~p~~~Y~~~ 39 (63)
-+|++.|..++..||-||. +|-.+ .+|-.|+-.|-.-
T Consensus 418 ~d~p~tar~~l~~~~~~wl~~ag~~~~~~~~~~~~~~Eisp~~sy~ge 465 (482)
T PTZ00339 418 ADTILNAQKLLLSLHTRWLEAALETVAGNPREGLNLCEISPLVSYGGE 465 (482)
T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCeeeccccCCCceEEecCcceeCCC
Confidence 3689999999999999998 66644 3577777666543
No 56
>cd04915 ACT_AK-Ectoine_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and various other halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes' of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinas
Probab=64.05 E-value=5.2 Score=22.50 Aligned_cols=18 Identities=22% Similarity=0.331 Sum_probs=15.1
Q ss_pred CHHHHHHHHHHHhccccC
Q psy2111 5 GWEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~ 22 (63)
+.++..+|++.||.++|.
T Consensus 49 ~~~~~~~av~~Lh~~f~~ 66 (66)
T cd04915 49 DRDDYDNAIKALHAALVE 66 (66)
T ss_pred EHHHHHHHHHHHHHHHhC
Confidence 357889999999999873
No 57
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=57.85 E-value=7.1 Score=24.87 Aligned_cols=23 Identities=26% Similarity=0.284 Sum_probs=21.1
Q ss_pred CCCCCHHHHHHHHHHHhccccCC
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNS 23 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~G 23 (63)
||+.+.++|..=++..||+=|+.
T Consensus 59 ikF~~~~~Ad~Fy~~fNGk~Fns 81 (110)
T PF07576_consen 59 IKFRDQESADEFYEEFNGKPFNS 81 (110)
T ss_pred EEECCHHHHHHHHHHhCCCccCC
Confidence 58899999999999999999975
No 58
>KOG0109|consensus
Probab=57.57 E-value=4.6 Score=30.81 Aligned_cols=30 Identities=23% Similarity=0.114 Sum_probs=25.8
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
++-.|+|..|++.|+||=|+|+.+.+..+.
T Consensus 119 ~d~~eda~~air~l~~~~~~gk~m~vq~st 148 (346)
T KOG0109|consen 119 FDRAEDAVEAIRGLDNTEFQGKRMHVQLST 148 (346)
T ss_pred EeeccchHHHHhcccccccccceeeeeeec
Confidence 445689999999999999999999988653
No 59
>cd04918 ACT_AK1-AT_2 ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1). This CD includes the second of two ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1), which can be synergistically inhibited by S-adenosylmethionine (SAM). This isoenzyme is found in higher plants, Arabidopsis thaliana (AT) and Zea mays, and also in Chlorophyta. In its inactive state, Arabidopsis AK1 binds the effectors lysine and SAM (two molecules each) at the interface of two ACT1 domain subunits. The second ACT domain (ACT2), this CD, does not interact with an effector. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=56.62 E-value=8.4 Score=21.31 Aligned_cols=18 Identities=17% Similarity=0.346 Sum_probs=14.9
Q ss_pred CHHHHHHHHHHHhccccC
Q psy2111 5 GWEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~ 22 (63)
+.+++..|++.||.++|.
T Consensus 48 ~~~~~~~av~~Lh~~f~~ 65 (65)
T cd04918 48 NDSEAEGCVQALHKSFFE 65 (65)
T ss_pred eHHHHHHHHHHHHHHHhC
Confidence 467889999999998873
No 60
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=56.02 E-value=6.7 Score=31.39 Aligned_cols=28 Identities=21% Similarity=0.206 Sum_probs=23.2
Q ss_pred CCCCHHHHHHHHHHHhccccC-CceeEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFN-SKLVTVK 29 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~-GR~I~Ae 29 (63)
.+.+.|+|.+|++.|||.-+. ||.|.+.
T Consensus 105 ~F~~~e~A~~Ai~~lng~~i~~Gr~l~V~ 133 (578)
T TIGR01648 105 TFCGKEEAKEAVKLLNNYEIRPGRLLGVC 133 (578)
T ss_pred EeCCHHHHHHHHHHcCCCeecCCcccccc
Confidence 357899999999999999884 7877554
No 61
>PLN03213 repressor of silencing 3; Provisional
Probab=55.61 E-value=15 Score=30.31 Aligned_cols=38 Identities=13% Similarity=0.274 Sum_probs=29.2
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhh
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHER 39 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~ 39 (63)
|.+++.+++.+|+..|||.=+.||.|.++--. +.|..|
T Consensus 55 MssdddaEeeKAISaLNGAEWKGR~LKVNKAK-P~YLeR 92 (759)
T PLN03213 55 FSPSSTNSLTKLFSTYNGCVWKGGRLRLEKAK-EHYLAR 92 (759)
T ss_pred ecCCcHHHHHHHHHHhcCCeecCceeEEeecc-HHHHHH
Confidence 34556688999999999999999999998653 334333
No 62
>KOG0110|consensus
Probab=54.93 E-value=13 Score=31.00 Aligned_cols=31 Identities=16% Similarity=0.178 Sum_probs=29.2
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p 32 (63)
.+.++|+|..|++.|+|-=.+|..|.++|..
T Consensus 566 EF~~~e~A~~a~k~lqgtvldGH~l~lk~S~ 596 (725)
T KOG0110|consen 566 EFAKPESAQAALKALQGTVLDGHKLELKISE 596 (725)
T ss_pred EecCHHHHHHHHHHhcCceecCceEEEEecc
Confidence 4678999999999999999999999999998
No 63
>KOG0123|consensus
Probab=50.98 E-value=18 Score=27.17 Aligned_cols=31 Identities=13% Similarity=0.053 Sum_probs=27.4
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
+.++++|++|++.||.--+.||+|..-.+..
T Consensus 44 f~~~~da~~A~~~~n~~~~~~~~~rim~s~r 74 (369)
T KOG0123|consen 44 FQQPADAERALDTMNFDVLKGKPIRIMWSQR 74 (369)
T ss_pred cCCHHHHHHHHHHcCCcccCCcEEEeehhcc
Confidence 6789999999999999999999999876543
No 64
>KOG0111|consensus
Probab=50.47 E-value=10 Score=28.20 Aligned_cols=31 Identities=13% Similarity=0.087 Sum_probs=27.2
Q ss_pred CCHHHHHHHHHHHhccccCCceeEEEEechh
Q psy2111 4 VGWEKKRRAYRTLHGAWFNSKLVTVKYLRYE 34 (63)
Q Consensus 4 ~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~ 34 (63)
.-.|+|..|++.||+-=.=||.|++.|-...
T Consensus 60 e~aEDAaaAiDNMnesEL~GrtirVN~AkP~ 90 (298)
T KOG0111|consen 60 EEAEDAAAAIDNMNESELFGRTIRVNLAKPE 90 (298)
T ss_pred eccchhHHHhhcCchhhhcceeEEEeecCCc
Confidence 4568999999999999999999999997554
No 65
>PF08255 Leader_Trp: Trp-operon Leader Peptide; InterPro: IPR013205 The tryptophan operon regulatory region of Citrobacter freundii (leader transcript) encodes a 14-residue peptide containing characteristic tandem tryptophan residues. It is about 10 nucleotides shorter than those of Escherichia coli and Salmonella typhimurium [].
Probab=50.13 E-value=3 Score=18.28 Aligned_cols=7 Identities=57% Similarity=1.213 Sum_probs=5.5
Q ss_pred HHHhccc
Q psy2111 14 RTLHGAW 20 (63)
Q Consensus 14 ~~LhGRW 20 (63)
=+|||+|
T Consensus 5 ~~L~~WW 11 (14)
T PF08255_consen 5 FSLHGWW 11 (14)
T ss_pred EEEeeEE
Confidence 3689998
No 66
>KOG4661|consensus
Probab=47.55 E-value=17 Score=30.43 Aligned_cols=31 Identities=10% Similarity=0.189 Sum_probs=27.0
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEech
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRY 33 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~ 33 (63)
.++.++|.+||..||-.=.-||+|.+|-...
T Consensus 454 MSts~eAtkCI~hLHrTELHGrmISVEkaKN 484 (940)
T KOG4661|consen 454 MSTSAEATKCIEHLHRTELHGRMISVEKAKN 484 (940)
T ss_pred ecchHHHHHHHHHhhhhhhcceeeeeeeccc
Confidence 3678999999999999999999999986543
No 67
>PF10593 Z1: Z1 domain; InterPro: IPR018310 This entry represents the Z1 domain of unknown function that is found in a group of putative endonucleases. This domain is found associated with a helicase domain of superfamily type II [].
Probab=45.09 E-value=7.9 Score=27.29 Aligned_cols=21 Identities=29% Similarity=0.613 Sum_probs=13.2
Q ss_pred hccccCCce----eEEEEechhhHH
Q psy2111 17 HGAWFNSKL----VTVKYLRYERYH 37 (63)
Q Consensus 17 hGRWF~GR~----I~Aey~p~~~Y~ 37 (63)
+||||+=|. +.=-|+|.+.|.
T Consensus 172 mgRwFGYR~gY~dl~Ri~~~~~l~~ 196 (239)
T PF10593_consen 172 MGRWFGYRPGYEDLCRIYMPEELYD 196 (239)
T ss_pred HhhcccCCcccccceEEecCHHHHH
Confidence 799999765 233455555543
No 68
>KOG0146|consensus
Probab=44.87 E-value=12 Score=28.67 Aligned_cols=34 Identities=18% Similarity=0.261 Sum_probs=22.6
Q ss_pred CCCCCHHHHHHHHHHHhccc-cCC--ceeEEEEechh
Q psy2111 1 MGEVGWEKKRRAYRTLHGAW-FNS--KLVTVKYLRYE 34 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRW-F~G--R~I~Aey~p~~ 34 (63)
+|+.+-.+|..||++|||.- ..| --+.++|-..+
T Consensus 65 VKf~s~~eAqaAI~aLHgSqTmpGASSSLVVK~ADTd 101 (371)
T KOG0146|consen 65 VKFSSHAEAQAAINALHGSQTMPGASSSLVVKFADTD 101 (371)
T ss_pred EEeccchHHHHHHHHhcccccCCCCccceEEEeccch
Confidence 37788899999999999953 222 23445554433
No 69
>KOG0121|consensus
Probab=44.21 E-value=22 Score=24.39 Aligned_cols=26 Identities=12% Similarity=0.396 Sum_probs=23.9
Q ss_pred CHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
+.++|..|++-|+|--.+-|+|.+.+
T Consensus 87 ~~~dA~~AlryisgtrLddr~ir~D~ 112 (153)
T KOG0121|consen 87 SRDDAEDALRYISGTRLDDRPIRIDW 112 (153)
T ss_pred cchhHHHHHHHhccCcccccceeeec
Confidence 67899999999999999999999865
No 70
>PF14268 YoaP: YoaP-like
Probab=40.77 E-value=20 Score=19.69 Aligned_cols=17 Identities=24% Similarity=0.630 Sum_probs=14.6
Q ss_pred ccCCceeEEEEechhhH
Q psy2111 20 WFNSKLVTVKYLRYERY 36 (63)
Q Consensus 20 WF~GR~I~Aey~p~~~Y 36 (63)
+|+|+-||.+.+....+
T Consensus 26 FYnGkfiT~eils~~kf 42 (44)
T PF14268_consen 26 FYNGKFITNEILSEKKF 42 (44)
T ss_pred EECCEEEEeeccChhhh
Confidence 79999999999887665
No 71
>COG0424 Maf Nucleotide-binding protein implicated in inhibition of septum formation [Cell division and chromosome partitioning]
Probab=40.25 E-value=35 Score=23.90 Aligned_cols=27 Identities=22% Similarity=0.122 Sum_probs=22.0
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
|=.|.++|..-.+.|.||+ .+.+|+--
T Consensus 83 KP~~~~eA~~~L~~lSG~~--h~v~T~v~ 109 (193)
T COG0424 83 KPKDEEEAREMLRKLSGRT--HQVYTGVA 109 (193)
T ss_pred CCCCHHHHHHHHHHhcCCe--EEEEEEEE
Confidence 4468999999999999999 67777643
No 72
>PF01625 PMSR: Peptide methionine sulfoxide reductase; InterPro: IPR002569 Peptide methionine sulphoxide reductase (Msr) reverses the inactivation of many proteins due to the oxidation of critical methionine residues by reducing methionine sulphoxide, Met(O), to methionine []. It is present in most living organisms, and the cognate structural gene belongs to the so-called minimum gene set [, ]. The domains: MsrA and MsrB, reduce different epimeric forms of methionine sulphoxide. This group represent MsrA, the crystal structure of which has been determined in a number of organisms. In Mycobacterium tuberculosis, the MsrA structure has been determined to 1.5 Angstrom resolution []. In contrast to the three catalytic cysteine residues found in previously characterised MsrA structures, M. tuberculosis MsrA represents a class containing only two functional cysteine residues. The overall structure shows no resemblance to the structures of MsrB (IPR002579 from INTERPRO) from other organisms; though the active sites show approximate mirror symmetry. In each case, conserved amino acid motifs mediate the stereo-specific recognition and reduction of the substrate. In a number of pathogenic bacteria including Neisseria gonorrhoeae, the MsrA and MsrB domains are fused; the MsrA being N-terminal to MsrB. This arrangement is reversed in Treponema pallidum. In N. gonorrhoeae and Neisseria meningitidis a thioredoxin domain is fused to the N terminus. This may function to reduce the active sites of the downstream MsrA and MsrB domains. ; GO: 0016671 oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor, 0019538 protein metabolic process, 0055114 oxidation-reduction process; PDB: 2GT3_A 1FF3_B 2IEM_A 3E0M_D 2J89_A 3PIN_B 3PIM_B 3PIL_B 2L90_A 3BQF_A ....
Probab=40.25 E-value=34 Score=22.84 Aligned_cols=36 Identities=19% Similarity=0.416 Sum_probs=24.7
Q ss_pred HHHHHHHHHHhccccCCceeEE-----EEechhhHHhhCcC
Q psy2111 7 EKKRRAYRTLHGAWFNSKLVTV-----KYLRYERYHERFPQ 42 (63)
Q Consensus 7 e~A~~A~~~LhGRWF~GR~I~A-----ey~p~~~Y~~~FP~ 42 (63)
+.|...+..|.-++++.++++. .|.|-++||++|-.
T Consensus 105 ~~a~~~~~~~~~~~~~~~i~tei~p~~~Fy~AE~yHQ~Yl~ 145 (155)
T PF01625_consen 105 KIAEASIAELQAKRFGRPIVTEIEPLKNFYPAEEYHQKYLE 145 (155)
T ss_dssp HHHHHHHHHHHHHTTSSSBS-EEEECEEEEEHHGGGTTHHH
T ss_pred HHHHHHHHHHHHhcCCCCeEEEEecCCcEEECHHHHHHHHH
Confidence 4566777777778666665553 57788899988743
No 73
>KOG0127|consensus
Probab=39.25 E-value=23 Score=29.21 Aligned_cols=36 Identities=14% Similarity=0.251 Sum_probs=28.7
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEe-chhhHHh
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYL-RYERYHE 38 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~-p~~~Y~~ 38 (63)
+....+|.+|+..+||-=++||.|.|.+- +-..|..
T Consensus 165 fk~~~dA~~Al~~~N~~~i~gR~VAVDWAV~Kd~ye~ 201 (678)
T KOG0127|consen 165 FKEKKDAEKALEFFNGNKIDGRPVAVDWAVDKDTYED 201 (678)
T ss_pred EeeHHHHHHHHHhccCceecCceeEEeeecccccccc
Confidence 34567899999999999999999999984 4444543
No 74
>cd04937 ACT_AKi-DapG-BS_2 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive AK isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The BS AKI is tetrameric consisting of two alpha and two beta subunits; th
Probab=36.81 E-value=29 Score=18.87 Aligned_cols=17 Identities=24% Similarity=0.280 Sum_probs=13.7
Q ss_pred CHHHHHHHHHHHhcccc
Q psy2111 5 GWEKKRRAYRTLHGAWF 21 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF 21 (63)
+.+++.+|+++||..+|
T Consensus 47 ~~~~~~~av~~Lh~~f~ 63 (64)
T cd04937 47 SEDDVKEAVNALHEAFE 63 (64)
T ss_pred cHHHHHHHHHHHHHHhc
Confidence 45778889999998764
No 75
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=36.60 E-value=30 Score=21.65 Aligned_cols=30 Identities=23% Similarity=0.173 Sum_probs=22.5
Q ss_pred CCCCHHHHHHHHHHHhccccCCc-eeEEEEec
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSK-LVTVKYLR 32 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR-~I~Aey~p 32 (63)
++++..+|.+|++ -||+-|+|. +|-+.|..
T Consensus 60 ~Y~~~~~A~rAL~-~NG~i~~g~~mvGV~~~~ 90 (100)
T PF05172_consen 60 TYDNPLSAQRALQ-KNGTIFSGSLMVGVKPCD 90 (100)
T ss_dssp EESSHHHHHHHHT-TTTEEETTCEEEEEEE-H
T ss_pred ECCCHHHHHHHHH-hCCeEEcCcEEEEEEEcH
Confidence 4678889998886 499999986 55577764
No 76
>cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein). In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies.
Probab=36.58 E-value=47 Score=18.13 Aligned_cols=19 Identities=21% Similarity=-0.067 Sum_probs=13.1
Q ss_pred CCCHHHHHHHHHHHhcccc
Q psy2111 3 EVGWEKKRRAYRTLHGAWF 21 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF 21 (63)
++++|+|..|++...=+.+
T Consensus 32 f~t~eeAa~Ayd~a~~~~~ 50 (61)
T cd00018 32 FDTAEEAARAYDRAALKLR 50 (61)
T ss_pred CCCHHHHHHHHHHHHHHhc
Confidence 5788999988876533333
No 77
>KOG0226|consensus
Probab=36.38 E-value=25 Score=26.42 Aligned_cols=28 Identities=18% Similarity=0.271 Sum_probs=24.9
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
.+.+++++..|.+.|+|++-+.|.|.+.
T Consensus 238 Sf~~pad~~rAmrem~gkyVgsrpiklR 265 (290)
T KOG0226|consen 238 SFRDPADYVRAMREMNGKYVGSRPIKLR 265 (290)
T ss_pred eecCHHHHHHHHHhhcccccccchhHhh
Confidence 3678999999999999999999999753
No 78
>cd04922 ACT_AKi-HSDH-ThrA_2 ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). This CD includes the second of two ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). The ACT domains are positioned between the N-terminal catalytic domain of AK and the C-terminal HSDH domain found in bacteria (Escherichia coli (EC) ThrA) and higher plants (Zea mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. HSDH is the first committed reaction in the branch of the pathway that leads to Thr and Met. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathwa
Probab=35.83 E-value=29 Score=18.20 Aligned_cols=17 Identities=24% Similarity=0.446 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHhccccC
Q psy2111 6 WEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 6 ~e~A~~A~~~LhGRWF~ 22 (63)
.++...+++.||.+.|.
T Consensus 50 ~~~~~~~~~~lh~~~~~ 66 (66)
T cd04922 50 EDDATKALRAVHERFFL 66 (66)
T ss_pred HHHHHHHHHHHHHHHhC
Confidence 46677889999988774
No 79
>KOG4660|consensus
Probab=35.73 E-value=17 Score=29.41 Aligned_cols=23 Identities=17% Similarity=0.196 Sum_probs=21.3
Q ss_pred CHHHHHHHHHHHhccccCCceeE
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVT 27 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~ 27 (63)
|+-+|+.|+++|++|=++|+.|.
T Consensus 121 DvR~A~~Alk~l~~~~~~~~~~k 143 (549)
T KOG4660|consen 121 DVRDAERALKALNRREIAGKRIK 143 (549)
T ss_pred ehHhHHHHHHHHHHHHhhhhhhc
Confidence 56789999999999999999987
No 80
>cd04916 ACT_AKiii-YclM-BS_2 ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. B. subtilis YclM is reported to be a single polypeptide of 50 kD. AKIII from B. subtilis strain 168 is induced by lysine and repressed by threonine and it is synergistically inhibited by lysine and threonine. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=33.75 E-value=35 Score=17.92 Aligned_cols=17 Identities=24% Similarity=0.419 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHhccccC
Q psy2111 6 WEKKRRAYRTLHGAWFN 22 (63)
Q Consensus 6 ~e~A~~A~~~LhGRWF~ 22 (63)
.++...+.+.||.+.|+
T Consensus 50 ~~d~~~~~~~lh~~~~~ 66 (66)
T cd04916 50 NEDADKAVKAIYEEFFN 66 (66)
T ss_pred HHHHHHHHHHHHHHHhC
Confidence 46677889999998874
No 81
>PF05409 Peptidase_C30: Coronavirus endopeptidase C30; InterPro: IPR008740 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases correspond to MEROPS peptidase family C30 (clan PA(C)). These peptidases are related to serine endopeptidases of family S1 and are restricted to RNA viruses, where they are involved in viral polyprotein processing during replication [, , ]. This entry represents the CoV M-pro which comprises three domains. Domains I and II are six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain, a globular cluster of five helices, has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad. In contrast to serine proteinases and other cysteine proteinases, which have a catalytic triad, there is no third catalytic residue present [, , , ]. ; GO: 0019082 viral protein processing; PDB: 2VJ1_B 2K7X_A 2Q6D_B 2Q6F_A 1P9U_E 1LVO_B 2AMP_B 1P9S_A 2ZU2_A 3D23_A ....
Probab=33.08 E-value=31 Score=25.95 Aligned_cols=42 Identities=14% Similarity=0.113 Sum_probs=29.0
Q ss_pred CHHHHHHHHHHHhccccCCcee------EEEEechhhHHhhCcCcccCC
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLV------TVKYLRYERYHERFPQAKALV 47 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I------~Aey~p~~~Y~~~FP~a~~a~ 47 (63)
|+|.-..|++.||.- |+||.| .-||.|.+.|+|.+|-.....
T Consensus 230 sV~rlLaaI~~L~~g-~~G~~IlG~~~ledE~Tp~~V~~Q~~gV~LQs~ 277 (293)
T PF05409_consen 230 SVERLLAAIQRLHKG-FGGRTILGSCSLEDEFTPEMVYNQMAGVNLQSG 277 (293)
T ss_dssp -HHHHHHHHHHHHHT-TTTT-BTTBSS--TSS-HHHHHHHHTSSC--S-
T ss_pred cHHHHHHHHHHHhcC-CCCCcccccccccccCCHHHHhhhccceEeecc
Confidence 566777889999876 588887 568999999999998766543
No 82
>PRK05528 methionine sulfoxide reductase A; Provisional
Probab=32.27 E-value=72 Score=21.49 Aligned_cols=38 Identities=11% Similarity=0.066 Sum_probs=21.9
Q ss_pred CCHHHHHHHHHHHhccccCCceeEE------EEechhhHHhhCcC
Q psy2111 4 VGWEKKRRAYRTLHGAWFNSKLVTV------KYLRYERYHERFPQ 42 (63)
Q Consensus 4 ~s~e~A~~A~~~LhGRWF~GR~I~A------ey~p~~~Y~~~FP~ 42 (63)
.+.|....|.+.+..+- .++.|.. .|.+-++|||.|-+
T Consensus 95 ~d~eQ~~~a~~~~~~~~-~~~~i~Tei~~~~~Fy~AE~yHQ~Yl~ 138 (156)
T PRK05528 95 EVDDHLIEARQFIERRE-DADKIAVEVLPLTNYVKSAEEHQDRLE 138 (156)
T ss_pred CCHHHHHHHHHHHHHHh-cCCCeEEEEecCCCeeecHHHHHHHHH
Confidence 35555555555554332 2344544 46778889998744
No 83
>PF09292 Neil1-DNA_bind: Endonuclease VIII-like 1, DNA bind; InterPro: IPR015371 This domain is predominantly found in Endonuclease VIII-like 1 proteins and adopts a glucocorticoid receptor-like fold. Structural analysis reveals a zincless finger motif that is required for glycosylase activity []. ; PDB: 1TDH_A.
Probab=30.66 E-value=18 Score=19.71 Aligned_cols=8 Identities=38% Similarity=1.248 Sum_probs=3.6
Q ss_pred hcc--ccCCc
Q psy2111 17 HGA--WFNSK 24 (63)
Q Consensus 17 hGR--WF~GR 24 (63)
||| ||.|.
T Consensus 24 ~gRTiWFqGd 33 (39)
T PF09292_consen 24 NGRTIWFQGD 33 (39)
T ss_dssp TS-EEEESS-
T ss_pred CCCEEEeeCC
Confidence 455 66664
No 84
>PF14930 Qn_am_d_aII: Quinohemoprotein amine dehydrogenase, alpha subunit domain II; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=30.47 E-value=13 Score=24.11 Aligned_cols=9 Identities=44% Similarity=1.276 Sum_probs=7.4
Q ss_pred HHhccccCC
Q psy2111 15 TLHGAWFNS 23 (63)
Q Consensus 15 ~LhGRWF~G 23 (63)
.|.||||..
T Consensus 85 ~l~GR~F~~ 93 (108)
T PF14930_consen 85 SLSGRWFEA 93 (108)
T ss_dssp EEEEEEEET
T ss_pred ceeEEEEec
Confidence 589999965
No 85
>PF02324 Glyco_hydro_70: Glycosyl hydrolase family 70; InterPro: IPR003318 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glucosyltransferases or sucrose 6-glycosyl transferases (GTF-S) (2.4.1.5 from EC, GH70 from CAZY) catalyse the transfer of D-glucopyramnosyl units from sucrose onto acceptor molecules []. This signature roughly corresponds to the N-terminal catalytic domain of the enzyme. Members of this group also contain the putative cell wall binding repeat (IPR002479 from INTERPRO).; GO: 0009250 glucan biosynthetic process; PDB: 3AIE_G 3AIB_D 3AIC_E 3TTQ_A 3TTO_D 3KLL_A 3KLK_A 3HZ3_A.
Probab=30.35 E-value=64 Score=27.39 Aligned_cols=34 Identities=21% Similarity=0.205 Sum_probs=25.8
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHE 38 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~ 38 (63)
|-.+.|+=..|+++||. +|=++.|..||.+.|.-
T Consensus 642 KYGs~~dL~~AikALH~---~GiqviaDwVpdQiYnL 675 (809)
T PF02324_consen 642 KYGSVEDLRNAIKALHA---AGIQVIADWVPDQIYNL 675 (809)
T ss_dssp TTB-HHHHHHHHHHHHH---TT-EEEEEE-TSEE---
T ss_pred CCCCHHHHHHHHHHHHH---cCcchhhhhchHhhhCC
Confidence 56788999999999998 69999999999999974
No 86
>PRK08961 bifunctional aspartate kinase/diaminopimelate decarboxylase protein; Provisional
Probab=29.93 E-value=45 Score=27.47 Aligned_cols=36 Identities=19% Similarity=0.219 Sum_probs=25.8
Q ss_pred CHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcC
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQ 42 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~ 42 (63)
+.+++.+|++.||.++|.++.+..-+-+ ..++.|-.
T Consensus 444 ~~~d~~~av~~LH~~f~~~~~~~~~~~~--~~~~~~~~ 479 (861)
T PRK08961 444 DESDADGLLPRLHAELIESGAMPVVFGP--RWREIFGS 479 (861)
T ss_pred eHHHHHHHHHHHHHHHhcCCCCceeeCc--cHHHhcCC
Confidence 4688999999999999999766553333 25555543
No 87
>KOG0146|consensus
Probab=29.20 E-value=32 Score=26.32 Aligned_cols=27 Identities=15% Similarity=0.068 Sum_probs=20.4
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEE
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTV 28 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~A 28 (63)
.++++.+|..||++|||=-.+=|-+.+
T Consensus 333 SfDNp~SaQaAIqAMNGFQIGMKRLKV 359 (371)
T KOG0146|consen 333 SFDNPASAQAAIQAMNGFQIGMKRLKV 359 (371)
T ss_pred ecCCchhHHHHHHHhcchhhhhhhhhh
Confidence 358899999999999996555444443
No 88
>PRK09084 aspartate kinase III; Validated
Probab=28.92 E-value=39 Score=25.70 Aligned_cols=19 Identities=16% Similarity=0.242 Sum_probs=16.9
Q ss_pred CHHHHHHHHHHHhccccCC
Q psy2111 5 GWEKKRRAYRTLHGAWFNS 23 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~G 23 (63)
+.+++.+|++.||..+|.|
T Consensus 430 ~~~d~~~al~~LH~~f~~~ 448 (448)
T PRK09084 430 PESDAEQVVQALHQNLFEG 448 (448)
T ss_pred cHHHHHHHHHHHHHHHhcC
Confidence 5678999999999999976
No 89
>COG5475 Uncharacterized small protein [Function unknown]
Probab=28.49 E-value=21 Score=21.14 Aligned_cols=9 Identities=33% Similarity=0.671 Sum_probs=6.8
Q ss_pred ccccCCcee
Q psy2111 18 GAWFNSKLV 26 (63)
Q Consensus 18 GRWF~GR~I 26 (63)
=|||+|--+
T Consensus 33 C~Wf~g~g~ 41 (60)
T COG5475 33 CRWFDGYGV 41 (60)
T ss_pred EEEecCCCc
Confidence 489999663
No 90
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=27.86 E-value=49 Score=19.35 Aligned_cols=24 Identities=17% Similarity=0.217 Sum_probs=19.1
Q ss_pred CCCHHHHHHHHHHHhccccCCcee
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLV 26 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I 26 (63)
+.|.++|.++++..||+-|-+-.+
T Consensus 40 F~~~~Ea~rC~~~~~~~~~f~y~m 63 (66)
T PF11767_consen 40 FNDSKEAERCFRAEDGTLFFTYRM 63 (66)
T ss_pred ECChHHHHHHHHhcCCCEEEEEEE
Confidence 457789999999999997765443
No 91
>cd04892 ACT_AK-like_2 ACT domains C-terminal to the catalytic domain of aspartokinase (AK; 4-L-aspartate-4-phosphotransferase). This CD includes the second of two ACT domains C-terminal to the catalytic domain of aspartokinase (AK; 4-L-aspartate-4-phosphotransferase). The exception in this group, is the inclusion of the first ACT domain of the bifunctional aspartokinase - homoserine dehydrogenase-like enzyme group (ACT_AKi-HSDH-ThrA-like_1) which includes the monofunctional, threonine-sensitive, aspartokinase found in Methanococcus jannaschii and other related archaeal species. AK catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. AK is the first enzyme in the pathway of the biosynthesis of the aspartate family of amino acids (lysine, threonine, methionine, and isoleucine) and the bacterial cell wall component, meso-diaminopimelate. One mechanism for the regulation of this pathway is by the production of several isoenzymes of AK with different repressors an
Probab=27.84 E-value=51 Score=16.54 Aligned_cols=16 Identities=25% Similarity=0.528 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHhcccc
Q psy2111 6 WEKKRRAYRTLHGAWF 21 (63)
Q Consensus 6 ~e~A~~A~~~LhGRWF 21 (63)
.++...+.+.||.++|
T Consensus 49 ~~~~~~~~~~l~~~~~ 64 (65)
T cd04892 49 EDDADKAVKALHEEFF 64 (65)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4566778899999886
No 92
>COG4734 ArdA Antirestriction protein [General function prediction only]
Probab=27.62 E-value=16 Score=25.93 Aligned_cols=42 Identities=12% Similarity=0.025 Sum_probs=30.5
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEec---hhhHHhhCcC
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYLR---YERYHERFPQ 42 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p---~~~Y~~~FP~ 42 (63)
|++-+.-.|.-.-..+||||-.|...|..|.- ...+-.|||.
T Consensus 1 mrf~t~~~A~yv~gv~y~~y~~gsi~t~~fqe~vsaml~~srfPn 45 (193)
T COG4734 1 MRFTTTTPAVYVGGVTYHKYNCGSIFTKWFQETVSAMLFDSRFPN 45 (193)
T ss_pred CceecccchHHhchhhcceeeccccccHHHHHHHHHHHHhhcCcc
Confidence 44455555665566799999999999987743 2478888987
No 93
>PF01691 Adeno_E1B_19K: Adenovirus E1B 19K protein / small t-antigen The Prosite family contains members not in the Pfam family.; InterPro: IPR002924 This family consists of adenovirus E1B 19 kDa protein or small t-antigen. The E1B 19 kDa protein inhibits E1A induced apoptosis and hence prolongs the viability of the host cell []. It can also inhibit apoptosis mediated by tumour necrosis factor alpha and Fas antigen []. E1B 19 kDa blocks apoptosis by interacting with and inhibiting the p53-inducible and death-promoting Bax protein []. The E1B region of adenovirus encodes two proteins E1B 19 kDa the small t-antigen as found in this family and E1B 55 kDa the large t-antigen which is not found in this family; both of these proteins inhibit E1A induced apoptosis [].; GO: 0005521 lamin binding, 0006916 anti-apoptosis
Probab=27.34 E-value=5.6 Score=26.59 Aligned_cols=25 Identities=32% Similarity=0.517 Sum_probs=20.5
Q ss_pred ccCCceeEEEEechhhHHhhCcCcc
Q psy2111 20 WFNSKLVTVKYLRYERYHERFPQAK 44 (63)
Q Consensus 20 WF~GR~I~Aey~p~~~Y~~~FP~a~ 44 (63)
||+|++..+-|--.++|++.|=+..
T Consensus 31 ~Fg~~La~vV~~vk~ey~~~F~~il 55 (134)
T PF01691_consen 31 LFGGRLAKVVYRVKEEYEEEFEEIL 55 (134)
T ss_pred ecCchHHHHHHHHHHHHHHHHHHHc
Confidence 4999999999999999988875543
No 94
>PF11202 PRTase_1: Phosphoribosyl transferase (PRTase); InterPro: IPR011215 This is a family of uncharacterised bacterial proteins.
Probab=26.92 E-value=26 Score=25.90 Aligned_cols=42 Identities=26% Similarity=0.389 Sum_probs=31.1
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcCcccC
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQAKAL 46 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~a~~a 46 (63)
..+.|+-++.+|+ |+===..|+..||.|.+.|.+.|-++...
T Consensus 23 ~~~~eerE~lIQs--G~~HYSEmL~~E~~Ps~~y~~lf~~aL~~ 64 (257)
T PF11202_consen 23 ETPVEERERLIQS--GRRHYSEMLPIEYQPSAEYMDLFHQALAQ 64 (257)
T ss_pred cCCHHHHHHHHHc--CCccccccCCCCCCCCHHHHHHHHHHHHH
Confidence 4567777777774 44222359999999999999999887644
No 95
>COG1582 FlgEa Uncharacterized protein, possibly involved in motility [Cell motility and secretion]
Probab=26.50 E-value=20 Score=21.65 Aligned_cols=7 Identities=43% Similarity=0.653 Sum_probs=5.5
Q ss_pred HHhcccc
Q psy2111 15 TLHGAWF 21 (63)
Q Consensus 15 ~LhGRWF 21 (63)
.||||||
T Consensus 6 rlNG~~~ 12 (67)
T COG1582 6 RLNGREF 12 (67)
T ss_pred EecCcce
Confidence 4788887
No 96
>PF04736 Eclosion: Eclosion hormone; InterPro: IPR006825 Eclosion hormone is an insect neuropeptide that triggers the performance of ecdysis behaviour, which causes shedding of the old cuticle at the end of a molt [, ].; GO: 0008255 ecdysis-triggering hormone activity, 0007218 neuropeptide signaling pathway, 0018990 ecdysis, chitin-based cuticle
Probab=26.47 E-value=26 Score=20.82 Aligned_cols=12 Identities=33% Similarity=0.899 Sum_probs=10.5
Q ss_pred HHHhccccCCce
Q psy2111 14 RTLHGAWFNSKL 25 (63)
Q Consensus 14 ~~LhGRWF~GR~ 25 (63)
+.|-|-||.|++
T Consensus 22 KkM~G~yF~G~~ 33 (62)
T PF04736_consen 22 KKMFGAYFEGQL 33 (62)
T ss_pred HHHHHhHhCcch
Confidence 678999999986
No 97
>KOG0106|consensus
Probab=26.21 E-value=42 Score=23.94 Aligned_cols=42 Identities=21% Similarity=0.213 Sum_probs=34.1
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcCc
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQA 43 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~a 43 (63)
++.+.++|.+|+..|||-=+.|+.|+..+-.-+.-...++..
T Consensus 139 ~Fs~~~da~ra~~~l~~~~~~~~~l~~~~~~~d~~r~~s~~~ 180 (216)
T KOG0106|consen 139 EFSEQEDAKRALEKLDGKKLNGRRISVEKNSRDRSRRRSRGR 180 (216)
T ss_pred eehhhhhhhhcchhccchhhcCceeeecccCcchhhhcCccc
Confidence 678899999999999999999999999666655555555544
No 98
>PRK05925 aspartate kinase; Provisional
Probab=25.94 E-value=42 Score=25.75 Aligned_cols=20 Identities=20% Similarity=0.175 Sum_probs=17.0
Q ss_pred CHHHHHHHHHHHhccccCCc
Q psy2111 5 GWEKKRRAYRTLHGAWFNSK 24 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR 24 (63)
+.+++.+|++.||+++|..+
T Consensus 417 ~~~d~~~av~~LH~~f~~~~ 436 (440)
T PRK05925 417 NEELAVAVTELLHNDYVKQK 436 (440)
T ss_pred ehHHHHHHHHHHHHHHhccc
Confidence 45788999999999998765
No 99
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=24.22 E-value=26 Score=18.41 Aligned_cols=25 Identities=20% Similarity=0.517 Sum_probs=16.0
Q ss_pred HhccccCCceeEEEEechhhHHhhCcC
Q psy2111 16 LHGAWFNSKLVTVKYLRYERYHERFPQ 42 (63)
Q Consensus 16 LhGRWF~GR~I~Aey~p~~~Y~~~FP~ 42 (63)
.+|+||.++.+...= ...|+-.|-+
T Consensus 14 ~d~~wyra~I~~~~~--~~~~~V~f~D 38 (57)
T smart00333 14 EDGEWYRARIIKVDG--EQLYEVFFID 38 (57)
T ss_pred CCCCEEEEEEEEECC--CCEEEEEEEC
Confidence 389999998877652 2445444544
No 100
>PF01205 UPF0029: Uncharacterized protein family UPF0029; InterPro: IPR001498 The Impact protein is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. This entry represents the N-terminal domain of the Impact proteins.; PDB: 1VI7_A 2CVE_A.
Probab=24.18 E-value=27 Score=22.17 Aligned_cols=7 Identities=14% Similarity=0.339 Sum_probs=2.6
Q ss_pred cccCCce
Q psy2111 19 AWFNSKL 25 (63)
Q Consensus 19 RWF~GR~ 25 (63)
|||+|-+
T Consensus 85 RyfGGi~ 91 (110)
T PF01205_consen 85 RYFGGIK 91 (110)
T ss_dssp EE--SS-
T ss_pred EEeCCee
Confidence 6777754
No 101
>KOG0144|consensus
Probab=24.11 E-value=42 Score=26.96 Aligned_cols=17 Identities=18% Similarity=0.173 Sum_probs=0.0
Q ss_pred CCCCCHHHHHHHHHHHh
Q psy2111 1 MGEVGWEKKRRAYRTLH 17 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~Lh 17 (63)
+||.+-++|.+|+.+||
T Consensus 81 v~~~trk~a~~a~~Alh 97 (510)
T KOG0144|consen 81 VKYYTRKEADEAINALH 97 (510)
T ss_pred EEeccHHHHHHHHHHhh
No 102
>KOG2416|consensus
Probab=23.94 E-value=1.1e+02 Score=25.76 Aligned_cols=35 Identities=20% Similarity=0.362 Sum_probs=29.0
Q ss_pred CCCHHHHHHHHHHHhc-ccc--CCceeEEEEechhhHH
Q psy2111 3 EVGWEKKRRAYRTLHG-AWF--NSKLVTVKYLRYERYH 37 (63)
Q Consensus 3 c~s~e~A~~A~~~LhG-RWF--~GR~I~Aey~p~~~Y~ 37 (63)
-.+.++|.....+||| +|= ++|-+.|.|...+.--
T Consensus 488 yss~eEA~atr~AlhnV~WP~sNPK~L~adf~~~deld 525 (718)
T KOG2416|consen 488 YSSVEEAAATREALHNVQWPPSNPKHLIADFVRADELD 525 (718)
T ss_pred cccHHHHHHHHHHHhccccCCCCCceeEeeecchhHHH
Confidence 4789999999999999 785 5677999999877543
No 103
>PF07566 DUF1543: Domain of Unknown Function (DUF1543); InterPro: IPR011440 This domain is found as 1-2 copies in a small family of proteins of unknown function.; PDB: 2QSD_F.
Probab=23.88 E-value=20 Score=19.93 Aligned_cols=16 Identities=25% Similarity=0.445 Sum_probs=11.8
Q ss_pred HHHHHHHHhccccCCc
Q psy2111 9 KRRAYRTLHGAWFNSK 24 (63)
Q Consensus 9 A~~A~~~LhGRWF~GR 24 (63)
+..|...+..+||.+.
T Consensus 16 ~~ea~~~~k~~W~~~~ 31 (52)
T PF07566_consen 16 IEEAKPKAKQRWFGDK 31 (52)
T ss_dssp CHHHHHHHHCC-SSES
T ss_pred HHHHHHHHHHhhhhcc
Confidence 4567789999999874
No 104
>COG3437 Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms]
Probab=23.67 E-value=44 Score=25.73 Aligned_cols=24 Identities=29% Similarity=0.376 Sum_probs=21.3
Q ss_pred CHHHHHHHHHHHhccccCCceeEE
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVTV 28 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~A 28 (63)
+.++|..=++..-||||+=++|.|
T Consensus 320 ~~~eA~~~i~~~~Gr~fdP~lVea 343 (360)
T COG3437 320 STEEALEIIRAQSGRLFDPKLVEA 343 (360)
T ss_pred CHHHHHHHHHHhcCCcCCHHHHHH
Confidence 578999999999999999988765
No 105
>cd04712 BAH_DCM_I BAH, or Bromo Adjacent Homology domain, as present in DNA (Cytosine-5)-methyltransferases (DCM) 1. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the genome. These effects include transcriptional repression via inhibition of transcription factor binding, the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting, and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=23.38 E-value=21 Score=23.02 Aligned_cols=8 Identities=38% Similarity=1.344 Sum_probs=6.8
Q ss_pred HHHhcccc
Q psy2111 14 RTLHGAWF 21 (63)
Q Consensus 14 ~~LhGRWF 21 (63)
+.+|||||
T Consensus 49 ~~~h~~W~ 56 (130)
T cd04712 49 KMFHGRWL 56 (130)
T ss_pred eEEEEEEE
Confidence 57899998
No 106
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=23.29 E-value=68 Score=19.83 Aligned_cols=40 Identities=25% Similarity=0.213 Sum_probs=23.1
Q ss_pred CCCCHHHHHHHHHHHhcc-----ccCCceeEEEEec---hhhHHhhCc
Q psy2111 2 GEVGWEKKRRAYRTLHGA-----WFNSKLVTVKYLR---YERYHERFP 41 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGR-----WF~GR~I~Aey~p---~~~Y~~~FP 41 (63)
+|.++++|..|+..+.-. ..++..++++-+. .+.|-.+..
T Consensus 43 Rf~~~~~A~~a~~~~~~~~~~~~~i~~~~~~~~vLeGeeE~~Yw~ki~ 90 (105)
T PF08777_consen 43 RFKTPEAAQKALEKLKEANDGKLKIKGKEVTLEVLEGEEEEEYWKKII 90 (105)
T ss_dssp EESS---HHHHHHHHHHTTTS-B-TTSSSEEEE---HHHHHHHHHHHH
T ss_pred EECCcchHHHHHHHHHhccCCceEEcCceEEEEECCCHHHHHHHHHHH
Confidence 578899999999988766 6777777777664 234544433
No 107
>COG0225 MsrA Peptide methionine sulfoxide reductase [Posttranslational modification, protein turnover, chaperones]
Probab=22.95 E-value=1.1e+02 Score=21.30 Aligned_cols=36 Identities=19% Similarity=0.452 Sum_probs=23.5
Q ss_pred HHHHHHHHHHhc-cccCCceeEE-----EEechhhHHhhCcC
Q psy2111 7 EKKRRAYRTLHG-AWFNSKLVTV-----KYLRYERYHERFPQ 42 (63)
Q Consensus 7 e~A~~A~~~LhG-RWF~GR~I~A-----ey~p~~~Y~~~FP~ 42 (63)
+.|...+..|.- .+|++.+++- .|.+-++||+.|-+
T Consensus 111 ~~a~~~~~~~q~~~~~~~~IvteI~p~~~Fy~AEeYHQ~Yl~ 152 (174)
T COG0225 111 AIAEASIEELQASGYFKKPIVTEIEPAKNFYPAEEYHQDYLK 152 (174)
T ss_pred HHHHHHHHHHHHhccCCCCeEEEeeccccCcccHHHHHHHHH
Confidence 456666666766 6776665542 46777888987643
No 108
>KOG0533|consensus
Probab=22.82 E-value=1.7e+02 Score=21.21 Aligned_cols=39 Identities=13% Similarity=0.314 Sum_probs=32.4
Q ss_pred CHHHHHHHHHHHhccccCCceeEEEEechhhHHhhCcCc
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERFPQA 43 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~FP~a 43 (63)
--++|..|++.+||==.+|+.+..+-+....--.+-|..
T Consensus 133 r~~DA~~avk~~~gv~ldG~~mk~~~i~~~~~~~r~~~~ 171 (243)
T KOG0533|consen 133 RRDDAERAVKKYNGVALDGRPMKIEIISSPSQSKRLPVG 171 (243)
T ss_pred chHhHHHHHHHhcCcccCCceeeeEEecCcccccccccc
Confidence 357999999999998899999999998887776665544
No 109
>PF07275 ArdA: Antirestriction protein (ArdA); InterPro: IPR009899 This family consists of several bacterial antirestriction (ArdA) proteins. ArdA functions in bacterial conjugation to allow an unmodified plasmid to evade restriction in the recipient bacterium and yet acquire cognate modification [].; PDB: 2W82_C.
Probab=22.69 E-value=18 Score=23.29 Aligned_cols=8 Identities=50% Similarity=1.452 Sum_probs=5.3
Q ss_pred HHhccccC
Q psy2111 15 TLHGAWFN 22 (63)
Q Consensus 15 ~LhGRWF~ 22 (63)
.|||+||+
T Consensus 14 ~l~G~Wl~ 21 (169)
T PF07275_consen 14 ELHGEWLD 21 (169)
T ss_dssp CEEEEEEE
T ss_pred EEeeEEEc
Confidence 47888853
No 110
>KOG2314|consensus
Probab=22.68 E-value=49 Score=27.52 Aligned_cols=41 Identities=15% Similarity=0.345 Sum_probs=29.5
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEe--chhhHHhhCcC
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYL--RYERYHERFPQ 42 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~--p~~~Y~~~FP~ 42 (63)
|-|.+..+|..|++.|||-=++.+---+-++ ..+.|.+ +|+
T Consensus 110 ~E~~~~~~A~~aVK~l~G~~ldknHtf~v~~f~d~eky~s-~~d 152 (698)
T KOG2314|consen 110 VEYASMRDAKKAVKSLNGKRLDKNHTFFVRLFKDFEKYES-ISD 152 (698)
T ss_pred EEecChhhHHHHHHhcccceecccceEEeehhhhHHHhcC-Ccc
Confidence 3589999999999999999998775544433 3445554 444
No 111
>cd07996 WGR_MMR_like WGR domain of molybdate metabolism regulator and related proteins. The WGR domain is found in the putative Escherichia coli molybdate metabolism regulator and related bacterial proteins, as well as in various other bacterial proteins of unknown function. It has been called WGR after the most conserved central motif of the domain. The domain appears to occur in single-domain proteins and in a variety of domain architectures, together with ATP-dependent DNA ligase domains, WD40 repeats, leucine-rich repeats, and other domains. It has been proposed to function as a nucleic acid binding domain.
Probab=22.66 E-value=90 Score=17.46 Aligned_cols=14 Identities=21% Similarity=-0.023 Sum_probs=12.1
Q ss_pred CCCHHHHHHHHHHH
Q psy2111 3 EVGWEKKRRAYRTL 16 (63)
Q Consensus 3 c~s~e~A~~A~~~L 16 (63)
|.|.++|.+++..|
T Consensus 49 ~~s~~~A~~~~~k~ 62 (74)
T cd07996 49 FDSEEEALKAAEKL 62 (74)
T ss_pred CCCHHHHHHHHHHH
Confidence 78999999988876
No 112
>PF10708 DUF2510: Protein of unknown function (DUF2510); InterPro: IPR018929 This domain of unknown function is found in a family of proteins conserved in Actinobacteria. Many members are annotated as putative membrane proteins but this could not be confirmed.
Probab=22.30 E-value=26 Score=18.50 Aligned_cols=15 Identities=7% Similarity=-0.057 Sum_probs=11.3
Q ss_pred ccccCCceeEEEEec
Q psy2111 18 GAWFNSKLVTVKYLR 32 (63)
Q Consensus 18 GRWF~GR~I~Aey~p 32 (63)
=||+||..=|....|
T Consensus 13 ~R~WDG~~WT~~~~p 27 (36)
T PF10708_consen 13 LRYWDGAAWTEHTRP 27 (36)
T ss_pred eeEeCCCcccCCccC
Confidence 399999987776644
No 113
>PF14743 DNA_ligase_OB_2: DNA ligase OB-like domain; PDB: 2Q2U_D 2Q2T_A 1FVI_A 1P8L_A.
Probab=22.17 E-value=62 Score=18.67 Aligned_cols=16 Identities=31% Similarity=0.495 Sum_probs=11.7
Q ss_pred ccCCceeEEEEechhh
Q psy2111 20 WFNSKLVTVKYLRYER 35 (63)
Q Consensus 20 WF~GR~I~Aey~p~~~ 35 (63)
|+=|++|+++|.-++.
T Consensus 38 ~~iG~iit~ky~~~t~ 53 (66)
T PF14743_consen 38 PYIGKIITVKYQGLTK 53 (66)
T ss_dssp HHTT-EEEEEEE-TTS
T ss_pred CCCCCEEEEEEEccCC
Confidence 5889999999987653
No 114
>KOG0124|consensus
Probab=22.13 E-value=58 Score=26.04 Aligned_cols=25 Identities=16% Similarity=0.189 Sum_probs=23.0
Q ss_pred CHHHHHHHHHHHhccccCCceeEEE
Q psy2111 5 GWEKKRRAYRTLHGAWFNSKLVTVK 29 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR~I~Ae 29 (63)
-+|.|.-|+..|||--.+||-|.+.
T Consensus 164 vPEaAqLAlEqMNg~mlGGRNiKVg 188 (544)
T KOG0124|consen 164 VPEAAQLALEQMNGQMLGGRNIKVG 188 (544)
T ss_pred CcHHHHHHHHHhccccccCcccccc
Confidence 3789999999999999999999886
No 115
>PLN02474 UTP--glucose-1-phosphate uridylyltransferase
Probab=22.07 E-value=1.2e+02 Score=24.00 Aligned_cols=42 Identities=10% Similarity=0.139 Sum_probs=32.7
Q ss_pred CCCCHHHHHHHHHHHhccccC---Cc--------eeEEEEechhhHHhhCcCc
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFN---SK--------LVTVKYLRYERYHERFPQA 43 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~---GR--------~I~Aey~p~~~Y~~~FP~a 43 (63)
.|+|...+...+-.||.+|.- +| -|.-+|.-+.+|..|||.-
T Consensus 361 ~~~dll~~rsdly~l~~~~l~~~~~~~~~~~p~IeL~~~f~~v~~f~~rf~~i 413 (469)
T PLN02474 361 ATSDLLLVQSDLYTLVDGFVIRNKARTNPSNPSIELGPEFKKVANFLSRFKSI 413 (469)
T ss_pred CCCCHHHHHHHHHHhccCeEEecCcccCCCCCcEEECcccccHHhHHHhcCCC
Confidence 588999999999999999975 34 2333566789999999943
No 116
>KOG0148|consensus
Probab=22.05 E-value=68 Score=24.42 Aligned_cols=31 Identities=13% Similarity=0.042 Sum_probs=27.7
Q ss_pred CCCCCHHHHHHHHHHHhccccCCceeEEEEe
Q psy2111 1 MGEVGWEKKRRAYRTLHGAWFNSKLVTVKYL 31 (63)
Q Consensus 1 ~kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~ 31 (63)
+|+.+.|.|.+||-.+||-=-+|.+|.+..=
T Consensus 205 VrF~tkEaAahAIv~mNntei~G~~VkCsWG 235 (321)
T KOG0148|consen 205 VRFETKEAAAHAIVQMNNTEIGGQLVRCSWG 235 (321)
T ss_pred EEecchhhHHHHHHHhcCceeCceEEEEecc
Confidence 3688999999999999999999999998653
No 117
>KOG2163|consensus
Probab=22.01 E-value=26 Score=29.15 Aligned_cols=30 Identities=20% Similarity=0.422 Sum_probs=22.0
Q ss_pred HHHHhccccCCc-eeEEEEechhhH---HhhCcC
Q psy2111 13 YRTLHGAWFNSK-LVTVKYLRYERY---HERFPQ 42 (63)
Q Consensus 13 ~~~LhGRWF~GR-~I~Aey~p~~~Y---~~~FP~ 42 (63)
...+.-|||+|+ +.+|.|...+.= +..|-+
T Consensus 674 LmeIt~rW~dgkGplaa~fsrsEVk~lIkALFqD 707 (719)
T KOG2163|consen 674 LMEITHRWFDGKGPLAAHFSRSEVKGLIKALFQD 707 (719)
T ss_pred HhHHHHHHhcCCccHHhhccHHHHHHHHHHHhhc
Confidence 455677999999 999999887653 344544
No 118
>PF10272 Tmpp129: Putative transmembrane protein precursor; InterPro: IPR018801 This entry consists of proteins conserved from worms to humans. They are purported to be transmembrane protein-precursors but their function is unknown.
Probab=21.80 E-value=26 Score=26.71 Aligned_cols=8 Identities=50% Similarity=1.265 Sum_probs=7.6
Q ss_pred ccccCCce
Q psy2111 18 GAWFNSKL 25 (63)
Q Consensus 18 GRWF~GR~ 25 (63)
||||+.|+
T Consensus 321 ~kwFasrQ 328 (358)
T PF10272_consen 321 GKWFASRQ 328 (358)
T ss_pred HHHhhhcC
Confidence 89999998
No 119
>PF05470 eIF-3c_N: Eukaryotic translation initiation factor 3 subunit 8 N-terminus; InterPro: IPR008905 The largest of the mammalian translation initiation factors, eIF3, consists of at least eight subunits ranging in mass from 35 to 170 kDa. eIF3 binds to the 40 S ribosome in an early step of translation initiation and promotes the binding of methionyl-tRNAi and mRNA [].; GO: 0003743 translation initiation factor activity, 0006413 translational initiation, 0005852 eukaryotic translation initiation factor 3 complex
Probab=21.70 E-value=31 Score=27.78 Aligned_cols=14 Identities=14% Similarity=0.425 Sum_probs=11.6
Q ss_pred HHHHHHhccccCCc
Q psy2111 11 RAYRTLHGAWFNSK 24 (63)
Q Consensus 11 ~A~~~LhGRWF~GR 24 (63)
.=+.+|||+||.+|
T Consensus 463 IY~~AL~d~~~~AR 476 (595)
T PF05470_consen 463 IYHHALHDRYYEAR 476 (595)
T ss_pred HHHHHHHhhHHHHH
Confidence 34689999999988
No 120
>PRK13014 methionine sulfoxide reductase A; Provisional
Probab=21.54 E-value=1.3e+02 Score=20.88 Aligned_cols=35 Identities=20% Similarity=0.334 Sum_probs=20.2
Q ss_pred HHHHHHHHHHhcc-ccCCceeEEE------EechhhHHhhCcC
Q psy2111 7 EKKRRAYRTLHGA-WFNSKLVTVK------YLRYERYHERFPQ 42 (63)
Q Consensus 7 e~A~~A~~~LhGR-WF~GR~I~Ae------y~p~~~Y~~~FP~ 42 (63)
+.|..+++.|... .| ++.|..+ |.|-+.||+.|=+
T Consensus 113 ~~a~~~~~~~~~~~~~-~~~i~Tei~p~~~Fy~AEeyHQ~Yl~ 154 (186)
T PRK13014 113 KVAEAYIAQLDEAGIF-KKPIVTPIKPYKNFYPAEDYHQDYLK 154 (186)
T ss_pred HHHHHHHHHHHhcccc-CCCcEEEEecCCCeeeCHHHHHHHHH
Confidence 3455566666443 34 4445444 6667778887643
No 121
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=21.43 E-value=59 Score=20.76 Aligned_cols=27 Identities=22% Similarity=0.525 Sum_probs=20.7
Q ss_pred HHHhccccCCceeEEEEechhhHHhhC
Q psy2111 14 RTLHGAWFNSKLVTVKYLRYERYHERF 40 (63)
Q Consensus 14 ~~LhGRWF~GR~I~Aey~p~~~Y~~~F 40 (63)
..|-|-|+.+|+|..||=..+-=++-|
T Consensus 44 ~~lEG~w~ptr~vviEFps~~~ar~~y 70 (96)
T COG5470 44 ETLEGEWRPTRNVVIEFPSLEAARDCY 70 (96)
T ss_pred eeccCCCCcccEEEEEcCCHHHHHHHh
Confidence 457788999999999997776555444
No 122
>PF13333 rve_2: Integrase core domain
Probab=21.33 E-value=90 Score=16.63 Aligned_cols=35 Identities=14% Similarity=0.165 Sum_probs=25.9
Q ss_pred CCCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhh
Q psy2111 2 GEVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHER 39 (63)
Q Consensus 2 kc~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~ 39 (63)
+|.+.+++..++..-= .||+.+-|. ++++..|+++
T Consensus 16 ~~~t~eel~~~I~~YI-~~yN~~Rl~--~lsP~eyr~~ 50 (52)
T PF13333_consen 16 KFKTREELKQAIDEYI-DYYNNERLK--GLSPVEYRNQ 50 (52)
T ss_pred ccchHHHHHHHHHHHH-HHhccCCCC--CcCHHHHHHh
Confidence 5788899998888653 467777666 7777777764
No 123
>PLN02551 aspartokinase
Probab=20.87 E-value=64 Score=25.40 Aligned_cols=20 Identities=25% Similarity=0.509 Sum_probs=17.1
Q ss_pred CHHHHHHHHHHHhccccCCc
Q psy2111 5 GWEKKRRAYRTLHGAWFNSK 24 (63)
Q Consensus 5 s~e~A~~A~~~LhGRWF~GR 24 (63)
+.+++.+|+++||.++|.+.
T Consensus 492 ~~~d~~~Av~aLH~~Ff~~~ 511 (521)
T PLN02551 492 NDDEAEQCVRALHSAFFEGD 511 (521)
T ss_pred eHHHHHHHHHHHHHHHhcCC
Confidence 46789999999999999863
No 124
>KOG2068|consensus
Probab=20.87 E-value=32 Score=26.14 Aligned_cols=28 Identities=11% Similarity=0.044 Sum_probs=24.5
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEE
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKY 30 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey 30 (63)
....|+|..|++..||=-.+||.+.|.|
T Consensus 132 y~~~eda~rci~~v~g~~~dg~~lka~~ 159 (327)
T KOG2068|consen 132 YEEEEDADRCIDDVDGFVDDGRALKASL 159 (327)
T ss_pred ccchHhhhhHHHHhhhHHhhhhhhHHhh
Confidence 3457999999999999999999987765
No 125
>PF12148 DUF3590: Protein of unknown function (DUF3590); InterPro: IPR021991 This domain is found in eukaryotes, and is typically between 83 and 97 amino acids in length. It is found in association with PF00097 from PFAM, PF02182 from PFAM, PF00628 from PFAM, PF00240 from PFAM. There are two conserved sequence motifs: RAR and NYN. The domain is part of the protein NIRF which has zinc finger and ubiquitinating domains. The function of this domain is likely to be mainly structural, however this has not been confirmed. ; PDB: 3DB4_A 3ASK_A 3DB3_A 2L3R_A.
Probab=20.83 E-value=35 Score=21.25 Aligned_cols=28 Identities=29% Similarity=0.542 Sum_probs=18.1
Q ss_pred HHHhccccCCceeEEEEec-----hhhHHhhCc
Q psy2111 14 RTLHGAWFNSKLVTVKYLR-----YERYHERFP 41 (63)
Q Consensus 14 ~~LhGRWF~GR~I~Aey~p-----~~~Y~~~FP 41 (63)
+.-||-||....+++.=-. .-.||-+|-
T Consensus 6 d~~~gAWfEa~i~~i~~~~~~~~e~viYhIkyd 38 (85)
T PF12148_consen 6 DRNMGAWFEAQIVTITKKCMSDDEDVIYHIKYD 38 (85)
T ss_dssp -TTT-EEEEEEEEEEEES-SSSSTTEEEEEEET
T ss_pred cCCCcceEEEEEEEeeccCCCCCCCEEEEEEec
Confidence 3458999999999886322 346777754
No 126
>PF08060 NOSIC: NOSIC (NUC001) domain; InterPro: IPR012976 This is the central domain in Nop56/SIK1-like proteins [].; PDB: 3PLA_K 3ICX_B 3ID6_A 3ID5_E 3NVM_A 3NMU_B 2NNW_C 3NVI_A 3NVK_A 2OZB_E ....
Probab=20.69 E-value=47 Score=18.28 Aligned_cols=12 Identities=33% Similarity=0.568 Sum_probs=9.0
Q ss_pred hhhHHhhCcCcc
Q psy2111 33 YERYHERFPQAK 44 (63)
Q Consensus 33 ~~~Y~~~FP~a~ 44 (63)
-+-|..+||+=.
T Consensus 24 re~Y~~~FPEL~ 35 (53)
T PF08060_consen 24 REWYSWHFPELE 35 (53)
T ss_dssp HHHHTTTSTTHH
T ss_pred HHHHHccchhHH
Confidence 367889999844
No 127
>KOG3873|consensus
Probab=20.33 E-value=44 Score=26.32 Aligned_cols=14 Identities=36% Similarity=0.698 Sum_probs=10.5
Q ss_pred HHhccccCCceeEE
Q psy2111 15 TLHGAWFNSKLVTV 28 (63)
Q Consensus 15 ~LhGRWF~GR~I~A 28 (63)
--||=||.||-|--
T Consensus 107 i~rGDWf~GK~Vgl 120 (422)
T KOG3873|consen 107 IHRGDWFGGKGVGL 120 (422)
T ss_pred eeeccccccceeEE
Confidence 34788999998743
No 128
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=20.14 E-value=39 Score=17.11 Aligned_cols=13 Identities=23% Similarity=0.917 Sum_probs=10.2
Q ss_pred hccccCCceeEEE
Q psy2111 17 HGAWFNSKLVTVK 29 (63)
Q Consensus 17 hGRWF~GR~I~Ae 29 (63)
.|+||-++++...
T Consensus 11 d~~wyra~V~~~~ 23 (48)
T cd04508 11 DGKWYRAKITSIL 23 (48)
T ss_pred CCeEEEEEEEEEC
Confidence 4899998887664
No 129
>PLN03121 nucleic acid binding protein; Provisional
Probab=20.13 E-value=1.2e+02 Score=22.24 Aligned_cols=35 Identities=20% Similarity=0.204 Sum_probs=27.3
Q ss_pred CCCHHHHHHHHHHHhccccCCceeEEEEechhhHHhhC
Q psy2111 3 EVGWEKKRRAYRTLHGAWFNSKLVTVKYLRYERYHERF 40 (63)
Q Consensus 3 c~s~e~A~~A~~~LhGRWF~GR~I~Aey~p~~~Y~~~F 40 (63)
+.++++|..|+ .|||-=+.++.|.++- .++|...|
T Consensus 51 F~d~~aaetAl-lLnGa~l~d~~I~It~--~~~y~~~~ 85 (243)
T PLN03121 51 FKDAYALETAV-LLSGATIVDQRVCITR--WGQYEDEF 85 (243)
T ss_pred ECCHHHHHHHH-hcCCCeeCCceEEEEe--CcccccCc
Confidence 57889998888 8999999999988764 44465433
Done!