Query 027770
Match_columns 219
No_of_seqs 55 out of 57
Neff 3.4
Searched_HMMs 46136
Date Fri Mar 29 14:55:05 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/027770.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/027770hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF12827 Peroxin-22: Peroxisom 92.7 0.021 4.6E-07 46.3 -1.1 40 179-218 68-111 (117)
2 COG4502 5'(3')-deoxyribonucleo 88.0 2 4.3E-05 37.4 6.7 74 127-212 66-141 (180)
3 cd08556 GDPD Glycerophosphodie 82.8 2.2 4.7E-05 33.8 4.3 64 132-204 49-116 (189)
4 cd08562 GDPD_EcUgpQ_like Glyce 78.8 3 6.5E-05 34.7 4.0 63 136-204 90-157 (229)
5 cd08582 GDPD_like_2 Glyceropho 71.9 6.4 0.00014 33.2 4.3 61 136-202 90-154 (233)
6 COG0144 Sun tRNA and rRNA cyto 69.0 5.5 0.00012 37.0 3.6 97 109-208 239-345 (355)
7 PF06941 NT5C: 5' nucleotidase 68.9 9 0.0002 31.5 4.4 79 126-215 70-152 (191)
8 PF01644 Chitin_synth_1: Chiti 59.5 22 0.00049 30.5 5.2 29 176-204 105-143 (163)
9 cd08579 GDPD_memb_like Glycero 57.6 19 0.00041 30.2 4.4 59 139-204 89-152 (220)
10 cd08563 GDPD_TtGDE_like Glycer 57.2 16 0.00035 30.8 4.0 57 140-203 96-157 (230)
11 cd08567 GDPD_SpGDE_like Glycer 56.3 16 0.00034 31.1 3.8 66 131-204 109-188 (263)
12 TIGR03778 VPDSG_CTERM VPDSG-CT 55.9 16 0.00034 23.1 2.8 21 12-32 2-22 (26)
13 TIGR03510 XapX XapX domain. Th 55.6 15 0.00032 26.1 2.9 23 17-39 4-26 (49)
14 PF04246 RseC_MucC: Positive r 54.9 12 0.00027 29.6 2.8 28 13-40 97-124 (135)
15 PF13419 HAD_2: Haloacid dehal 50.6 69 0.0015 24.0 6.2 79 125-212 73-164 (176)
16 cd08573 GDPD_GDE1 Glycerophosp 50.1 28 0.00062 30.5 4.5 62 137-204 93-156 (258)
17 cd08564 GDPD_GsGDE_like Glycer 48.8 37 0.0008 29.5 5.0 61 132-201 108-171 (265)
18 cd08601 GDPD_SaGlpQ_like Glyce 48.6 34 0.00073 29.4 4.7 44 158-202 127-170 (256)
19 PF01102 Glycophorin_A: Glycop 44.9 20 0.00044 29.5 2.6 19 17-35 71-89 (122)
20 PRK11009 aphA acid phosphatase 43.0 1.5E+02 0.0032 26.4 7.9 110 90-205 48-191 (237)
21 PF01171 ATP_bind_3: PP-loop f 42.8 58 0.0013 26.6 5.0 67 130-205 80-148 (182)
22 cd01028 TOPRIM_TopoIA TOPRIM_T 41.4 1.8E+02 0.004 23.2 7.7 79 105-186 34-129 (142)
23 TIGR02245 HAD_IIID1 HAD-superf 40.9 2.4E+02 0.0053 24.5 9.1 76 103-186 20-95 (195)
24 TIGR03190 benz_CoA_bzdN benzoy 40.5 7 0.00015 36.3 -0.8 70 136-208 276-353 (377)
25 cd01427 HAD_like Haloacid deha 39.6 1E+02 0.0022 21.7 5.3 60 108-171 3-63 (139)
26 cd08561 GDPD_cytoplasmic_ScUgp 39.3 49 0.0011 28.3 4.2 55 146-207 106-162 (249)
27 COG4317 Uncharacterized protei 37.3 38 0.00083 27.0 3.0 20 19-38 8-27 (93)
28 cd08568 GDPD_TmGDE_like Glycer 37.0 59 0.0013 27.5 4.3 50 146-202 92-141 (226)
29 PRK07066 3-hydroxybutyryl-CoA 37.0 97 0.0021 28.7 6.0 71 140-218 79-172 (321)
30 PF10389 CoatB: Bacteriophage 37.0 26 0.00057 24.6 1.8 11 27-37 35-45 (46)
31 cd08565 GDPD_pAtGDE_like Glyce 36.2 68 0.0015 27.6 4.7 59 136-202 85-150 (235)
32 PF03672 UPF0154: Uncharacteri 35.4 45 0.00098 24.9 2.9 19 17-35 9-27 (64)
33 PF12732 YtxH: YtxH-like prote 34.2 29 0.00064 25.1 1.8 14 17-30 7-20 (74)
34 PF00891 Methyltransf_2: O-met 32.7 35 0.00076 28.8 2.3 28 140-168 155-182 (241)
35 TIGR03593 yidC_nterm membrane 32.2 99 0.0021 28.2 5.3 19 22-40 9-27 (366)
36 PF13844 Glyco_transf_41: Glyc 30.7 1.1E+02 0.0024 30.3 5.6 74 126-207 293-369 (468)
37 PF01751 Toprim: Toprim domain 30.1 2.3E+02 0.005 21.0 6.6 75 91-167 8-82 (100)
38 cd04885 ACT_ThrD-I Tandem C-te 29.9 89 0.0019 21.7 3.6 30 142-171 35-64 (68)
39 PRK09260 3-hydroxybutyryl-CoA 29.0 1.4E+02 0.0029 26.2 5.4 42 140-188 77-118 (288)
40 cd08566 GDPD_AtGDE_like Glycer 28.7 1.1E+02 0.0024 26.4 4.7 48 147-202 101-148 (240)
41 PF06295 DUF1043: Protein of u 28.6 53 0.0011 26.5 2.6 14 17-30 4-17 (128)
42 PHA00350 putative assembly pro 28.4 93 0.002 30.1 4.6 24 180-204 371-394 (399)
43 PF13940 Ldr_toxin: Toxin Ldr, 28.1 42 0.00091 22.6 1.6 26 5-32 6-31 (35)
44 cd04906 ACT_ThrD-I_1 First of 27.5 1E+02 0.0022 22.5 3.7 30 143-172 38-68 (85)
45 cd08559 GDPD_periplasmic_GlpQ_ 27.4 1.1E+02 0.0023 27.4 4.5 53 146-202 128-188 (296)
46 PRK01294 lipase chaperone; Pro 27.3 3.7E+02 0.008 25.3 8.2 21 79-99 89-110 (336)
47 PRK11677 hypothetical protein; 26.8 43 0.00094 27.9 1.8 18 19-36 6-23 (134)
48 cd08570 GDPD_YPL206cp_fungi Gl 26.4 1.2E+02 0.0026 25.7 4.5 64 135-201 85-155 (234)
49 PF08370 PDR_assoc: Plant PDR 26.1 96 0.0021 23.0 3.3 26 15-40 31-60 (65)
50 PF06295 DUF1043: Protein of u 26.0 44 0.00095 27.0 1.7 22 19-40 2-23 (128)
51 COG4980 GvpP Gas vesicle prote 25.9 47 0.001 27.3 1.8 14 17-30 13-26 (115)
52 cd06660 Aldo_ket_red Aldo-keto 25.7 84 0.0018 26.8 3.4 86 89-177 57-147 (285)
53 PF00998 RdRP_3: Viral RNA dep 25.6 75 0.0016 31.5 3.5 123 72-198 216-365 (486)
54 PF00248 Aldo_ket_red: Aldo/ke 25.5 54 0.0012 27.9 2.2 84 92-177 48-136 (283)
55 COG3086 RseC Positive regulato 25.1 42 0.00091 28.9 1.5 35 3-37 94-128 (150)
56 COG4218 MtrF Tetrahydromethano 25.0 63 0.0014 24.9 2.2 18 17-34 48-65 (73)
57 TIGR02863 spore_sspJ small, ac 24.8 49 0.0011 23.4 1.5 18 154-171 12-29 (47)
58 PF09527 ATPase_gene1: Putativ 24.1 90 0.0019 21.3 2.7 19 17-35 37-55 (55)
59 PF09575 Spore_SspJ: Small spo 24.0 51 0.0011 23.2 1.5 18 154-171 11-28 (46)
60 PRK14901 16S rRNA methyltransf 23.8 70 0.0015 30.2 2.8 53 121-173 353-407 (434)
61 TIGR03142 cytochro_ccmI cytoch 23.7 61 0.0013 25.5 2.1 8 1-8 1-8 (117)
62 TIGR01261 hisB_Nterm histidino 23.3 2.7E+02 0.0059 22.8 5.9 69 111-185 8-91 (161)
63 PRK07819 3-hydroxybutyryl-CoA 23.3 2.8E+02 0.006 24.8 6.3 70 142-218 82-174 (286)
64 COG1167 ARO8 Transcriptional r 23.1 92 0.002 29.7 3.5 73 128-214 242-320 (459)
65 PF14155 DUF4307: Domain of un 22.9 82 0.0018 25.0 2.6 26 12-37 4-29 (112)
66 PRK06049 rpl30p 50S ribosomal 22.8 1.3E+02 0.0029 25.5 4.0 64 147-215 2-67 (154)
67 PF12263 DUF3611: Protein of u 22.8 1E+02 0.0022 26.9 3.4 21 16-36 62-82 (183)
68 PF10031 DUF2273: Small integr 22.8 99 0.0022 21.8 2.8 18 13-30 8-25 (51)
69 PF12792 CSS-motif: CSS motif 21.8 86 0.0019 24.9 2.6 34 159-192 45-78 (208)
70 TIGR02250 FCP1_euk FCP1-like p 21.7 4.1E+02 0.0088 21.8 6.6 79 126-212 55-141 (156)
71 PRK08942 D,D-heptose 1,7-bisph 21.6 4.1E+02 0.009 21.3 6.6 45 106-150 5-51 (181)
72 cd01657 Ribosomal_L7_archeal_e 21.4 1.3E+02 0.0027 25.5 3.6 63 147-214 1-65 (159)
73 PLN02545 3-hydroxybutyryl-CoA 21.2 2.2E+02 0.0048 25.0 5.2 29 140-168 79-107 (295)
74 PRK01844 hypothetical protein; 21.2 1.1E+02 0.0024 23.4 2.9 19 17-35 16-34 (72)
75 PF05729 NACHT: NACHT domain 21.2 1.3E+02 0.0029 22.6 3.4 41 177-217 118-162 (166)
76 KOG2181 LIM domain binding pro 21.0 2.3E+02 0.0049 27.7 5.6 31 7-38 239-269 (415)
77 smart00550 Zalpha Z-DNA-bindin 20.6 37 0.00081 24.3 0.3 53 154-206 3-66 (68)
78 PF09587 PGA_cap: Bacterial ca 20.5 4.3E+02 0.0094 22.7 6.8 97 97-195 30-134 (250)
79 KOG0121 Nuclear cap-binding pr 20.3 1.3E+02 0.0028 26.0 3.4 55 130-185 22-78 (153)
80 PF13807 GNVR: G-rich domain o 20.0 97 0.0021 22.6 2.4 17 15-31 62-78 (82)
81 PRK11677 hypothetical protein; 20.0 79 0.0017 26.4 2.1 28 1-38 1-28 (134)
No 1
>PF12827 Peroxin-22: Peroxisomal biogenesis protein family; InterPro: IPR024359 Peroxin-22 is an integral peroxisomal membrane protein. The N terminus of peroxin-22 is located in the matrix, while the C terminus is located in the cytosol. Peroxin-22 interacts with the ubiquitin-conjugating enzyme Pex4p, anchoring it at the peroxisomal membrane. Both proteins may act at the same step in peroxisome biogenesis [].; PDB: 2Y9O_B 2Y9M_B 2Y9P_B.
Probab=92.71 E-value=0.021 Score=46.28 Aligned_cols=40 Identities=25% Similarity=0.377 Sum_probs=28.3
Q ss_pred cceEEeeecCcCccceeeecccceeecCCHH----HHHhhhhcc
Q 027770 179 KDKVLFCSTEIGRTSFVRQLEPDWHIDTNPE----IVSQLAVHL 218 (219)
Q Consensus 179 rHKVLFCST~~Gr~sfVRQLeP~lHIDtd~~----vv~~L~rfv 218 (219)
.|||+-|+|.+|+-|-||||.|+.-+=...+ +=++|.||+
T Consensus 68 ~~KiI~Cdt~~G~~~~vK~Lk~d~llv~~ddl~~~ip~Di~rfv 111 (117)
T PF12827_consen 68 NYKIIKCDTMQGYWSCVKHLKPDQLLVCSDDLGISIPEDINRFV 111 (117)
T ss_dssp GGGEEEESSHHHHHHHHHHH--SEEEE-GGGT-SSS-GGGGGT-
T ss_pred ceeEEEecchhhHHHHHHhcCCCceEEehhhccccCHHHHHHHH
Confidence 4999999999999999999999988742222 334677776
No 2
>COG4502 5'(3')-deoxyribonucleotidase [Nucleotide transport and metabolism]
Probab=87.97 E-value=2 Score=37.37 Aligned_cols=74 Identities=23% Similarity=0.364 Sum_probs=57.1
Q ss_pred cceeehhHHHHHHHHhhcCceEEEEEee--chhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770 127 QVTVKSSVLEALLEITKFCDLYLMERVL--DNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 127 ~a~v~esalevLleLsk~~DlYLm~~V~--dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
+..++|-+-+++-||++.+|||+.+.-. -+.=|++--.++|.. .-|...+++|| |+-.+| +.++-|
T Consensus 66 nL~V~p~aq~v~keLt~~y~vYivtaamdhp~s~~dK~eWl~E~F-----PFi~~qn~vfC----gnKniv---kaDilI 133 (180)
T COG4502 66 NLGVQPFAQTVLKELTSIYNVYIVTAAMDHPKSCEDKGEWLKEKF-----PFISYQNIVFC----GNKNIV---KADILI 133 (180)
T ss_pred hcCccccHHHHHHHHHhhheEEEEEeccCCchhHHHHHHHHHHHC-----CCCChhhEEEe----cCCCeE---EeeEEe
Confidence 6788999999999999999999987663 344466777777873 36667999999 444444 568899
Q ss_pred cCCHHHHH
Q 027770 205 DTNPEIVS 212 (219)
Q Consensus 205 Dtd~~vv~ 212 (219)
|.++-.++
T Consensus 134 DDnp~nLE 141 (180)
T COG4502 134 DDNPLNLE 141 (180)
T ss_pred cCCchhhh
Confidence 98887655
No 3
>cd08556 GDPD Glycerophosphodiester phosphodiesterase domain as found in prokaryota and eukaryota, and similar proteins. The typical glycerophosphodiester phosphodiesterase domain (GDPD) consists of a TIM barrel and a small insertion domain named the GDPD-insertion (GDPD-I) domain, which is specific for GDPD proteins. This family corresponds to both typical GDPD domain and GDPD-like domain which lacks the GDPD-I region. Members in this family mainly consist of a large family of prokaryotic and eukaryotic glycerophosphodiester phosphodiesterases (GP-GDEs, EC 3.1.4.46), and a number of uncharacterized homologs. Sphingomyelinases D (SMases D) (sphingomyelin phosphodiesterase D, EC 3.1.4.41) from spider venom, SMases D-like proteins, and phospholipase D (PLD) from several pathogenic bacteria are also included in this family. GDPD plays an essential role in glycerol metabolism and catalyzes the hydrolysis of glycerophosphodiesters to sn-glycerol-3-phosphate (G3P) and the corresponding alcoho
Probab=82.85 E-value=2.2 Score=33.75 Aligned_cols=64 Identities=20% Similarity=0.217 Sum_probs=47.9
Q ss_pred hhHHHHHHHHhhcCceEEEEEeech----hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770 132 SSVLEALLEITKFCDLYLMERVLDN----ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 132 esalevLleLsk~~DlYLm~~V~dD----~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
+.--|+|-.+ +- +++++.-++++ .-.+.+++++++.| ..++|+|||........+|++.|.+.+
T Consensus 49 ~tL~e~l~~~-~~-~~~i~leiK~~~~~~~~~~~l~~~i~~~~-------~~~~v~i~s~~~~~l~~~~~~~p~~~~ 116 (189)
T cd08556 49 PTLEEVLELV-KG-GVGLNIELKEPTRYPGLEAKVAELLREYG-------LEERVVVSSFDHEALRALKELDPEVPT 116 (189)
T ss_pred CCHHHHHHhc-cc-CcEEEEEECCCCCchhHHHHHHHHHHHcC-------CcCCEEEEeCCHHHHHHHHHhCCCCcE
Confidence 3444444333 33 89999999997 35688999999966 237999999988888888888887655
No 4
>cd08562 GDPD_EcUgpQ_like Glycerophosphodiester phosphodiesterase domain in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase UgpQ and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46), UgpQ, and similar proteins. GP-GDE plays an essential role in the metabolic pathway of E. coli. It catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. E. coli possesses two major G3P uptake systems: Glp and Ugp, which contain genes coding for two distinct GP-GDEs. UgpQ gene from the E. coli ugp operon codes for a cytosolic phosphodiesterase GlpQ, which is the prototype of this family. Various glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG)
Probab=78.83 E-value=3 Score=34.69 Aligned_cols=63 Identities=19% Similarity=0.207 Sum_probs=46.7
Q ss_pred HHHHHHhhcCceEEEEEeechhh-----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770 136 EALLEITKFCDLYLMERVLDNES-----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 136 evLleLsk~~DlYLm~~V~dD~e-----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
+-++++.+..+++++.-++++.+ .+.+..++++.|.. .+||+|||-...-...+|++.|.+.+
T Consensus 90 ~evl~~~~~~~~~l~iEiK~~~~~~~~~~~~v~~~l~~~~~~------~~~v~~~Sf~~~~l~~~~~~~p~~~~ 157 (229)
T cd08562 90 ADVLELARELGLGLNLEIKPDPGDEALTARVVAAALRELWPH------ASKLLLSSFSLEALRAARRAAPELPL 157 (229)
T ss_pred HHHHHHHHhcCCEEEEEECCCCCccHHHHHHHHHHHHHhcCC------cCCEEEECCCHHHHHHHHHhCCCCcE
Confidence 33445566567888888887653 35688888886553 48999999988888899998887643
No 5
>cd08582 GDPD_like_2 Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterase and similar proteins. They show high sequence similarity to Escherichia coli glycerophosphodiester phosphodiesterase, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=71.86 E-value=6.4 Score=33.24 Aligned_cols=61 Identities=23% Similarity=0.339 Sum_probs=44.7
Q ss_pred HHHHHHhhcCceEEEEEeech----hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 136 EALLEITKFCDLYLMERVLDN----ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 136 evLleLsk~~DlYLm~~V~dD----~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
+-++++.+..+++|+.-++++ .-.+.+.++++++|. ..+||+|+|-..-....+|++.|.+
T Consensus 90 ~evl~~~~~~~~~l~ieiK~~~~~~~~~~~~~~~~~~~~~------~~~~v~~~Sf~~~~l~~~~~~~p~~ 154 (233)
T cd08582 90 EEYLAIVPKYGKKLFIEIKHPRRGPEAEEELLKLLKESGL------LPEQIVIISFDAEALKRVRELAPTL 154 (233)
T ss_pred HHHHHHHHhcCceEEEEeCCCccCccHHHHHHHHHHHcCC------CCCCEEEEecCHHHHHHHHHHCCCC
Confidence 334455555589999999963 334678888888653 3489999998888888889988873
No 6
>COG0144 Sun tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis]
Probab=69.04 E-value=5.5 Score=36.95 Aligned_cols=97 Identities=24% Similarity=0.214 Sum_probs=68.7
Q ss_pred Eeeccee------eccCChhhhhccceeehhHHHHHHHHhhcC--ceEEEEEeechhhHHHHHHHHHHcCCCCC--CCCc
Q 027770 109 CRLLGVI------LEESCPEELQKQVTVKSSVLEALLEITKFC--DLYLMERVLDNESEKKVLLALETAGVFTS--GGLV 178 (219)
Q Consensus 109 is~~gvv------lee~sp~el~~~a~v~esalevLleLsk~~--DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~--gGL~ 178 (219)
||-.|++ .|+.+++++.+-+.++.+.++.-..+.|-. =||=-|.+.-.|.|++|..+|++...|.- .++.
T Consensus 239 CSg~G~irr~Pd~~~~~~~~~i~~l~~lQ~~iL~~a~~~lk~GG~LVYSTCS~~~eENE~vV~~~L~~~~~~~~~~~~~~ 318 (355)
T COG0144 239 CSGTGVIRRDPDVKWRRTPEDIAELAKLQKEILAAALKLLKPGGVLVYSTCSLTPEENEEVVERFLERHPDFELEPVRLP 318 (355)
T ss_pred CCCCcccccCccccccCCHHHHHHHHHHHHHHHHHHHHhcCCCCEEEEEccCCchhcCHHHHHHHHHhCCCceeeccccc
Confidence 7888887 577788888888888888888888888775 47888999999999999999998655531 1111
Q ss_pred cceEEeeecCcCccceeeecccceeecCCH
Q 027770 179 KDKVLFCSTEIGRTSFVRQLEPDWHIDTNP 208 (219)
Q Consensus 179 rHKVLFCST~~Gr~sfVRQLeP~lHIDtd~ 208 (219)
-.. ++.-.......++++-|..| ++|-
T Consensus 319 ~~~--~~~~~~~~~~~~~r~~p~~~-~~dG 345 (355)
T COG0144 319 WGP--LFEGLGSELGKTRRLYPHVH-GTDG 345 (355)
T ss_pred ccc--cccccccccCCeEEECCCCC-CCCC
Confidence 000 22221123445799999999 5454
No 7
>PF06941 NT5C: 5' nucleotidase, deoxy (Pyrimidine), cytosolic type C protein (NT5C); InterPro: IPR010708 This family consists of several 5' nucleotidase, deoxy (Pyrimidine), and cytosolic type C (NT5C) proteins. 5'(3')-deoxyribonucleotidase is a ubiquitous enzyme in mammalian cells whose physiological function is not known [].; GO: 0016791 phosphatase activity; PDB: 1Z4M_A 1Q92_A 1Q91_A 1Z4J_A 1Z4I_A 1Z4Q_A 1Z4K_A 2JAW_A 1MH9_A 1Z4L_A ....
Probab=68.86 E-value=9 Score=31.54 Aligned_cols=79 Identities=19% Similarity=0.242 Sum_probs=48.0
Q ss_pred ccceeehhHHHHHHHHhhcC-ceEEEEEeech-h--hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccc
Q 027770 126 KQVTVKSSVLEALLEITKFC-DLYLMERVLDN-E--SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPD 201 (219)
Q Consensus 126 ~~a~v~esalevLleLsk~~-DlYLm~~V~dD-~--eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~ 201 (219)
.+....+.|.|+|-+|.+.. .++.|+.-..+ . .++.-.+-|++. |. ++..+++.||.. | + .+..+
T Consensus 70 ~~l~p~~gA~e~l~~L~~~g~~~~~Itar~~~~~~~~~~~k~~Wl~~h--f~--~i~~~~~~~~~~---K-~---~v~~D 138 (191)
T PF06941_consen 70 SNLPPIPGAVEALKKLRDKGHEIVIITARPPEFPDHSAEEKREWLERH--FP--FIPYDNLIFTGD---K-T---LVGGD 138 (191)
T ss_dssp TT--B-TTHHHHHHHHHTSTTEEEEEEE-SSSSGCCCHHHHHHHHHHH--HT--HHHHCCEEEESS---G-G---GC--S
T ss_pred cCCCccHHHHHHHHHHHHcCCcEEEEEecCccccchHHHHHHHHHHHH--cC--CCchheEEEecC---C-C---eEecc
Confidence 35677899999999999996 56655655444 1 233334455553 22 455588999842 2 2 45789
Q ss_pred eeecCCHHHHHhhh
Q 027770 202 WHIDTNPEIVSQLA 215 (219)
Q Consensus 202 lHIDtd~~vv~~L~ 215 (219)
+.||-++..+..+.
T Consensus 139 vlIDD~~~n~~~~~ 152 (191)
T PF06941_consen 139 VLIDDRPHNLEQFA 152 (191)
T ss_dssp EEEESSSHHHSS-S
T ss_pred EEecCChHHHHhcc
Confidence 99999998887653
No 8
>PF01644 Chitin_synth_1: Chitin synthase; InterPro: IPR004834 This region is found commonly in chitin synthases classes I, II and III 2.4.1.16 from EC. Chitin a linear homopolymer of GlcNAc residues, it is an important component of the cell wall of fungi and is synthesised on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases []. ; GO: 0004100 chitin synthase activity, 0006031 chitin biosynthetic process
Probab=59.55 E-value=22 Score=30.55 Aligned_cols=29 Identities=31% Similarity=0.678 Sum_probs=23.2
Q ss_pred CCccceEEeeecCcCc----------cceeeecccceee
Q 027770 176 GLVKDKVLFCSTEIGR----------TSFVRQLEPDWHI 204 (219)
Q Consensus 176 GL~rHKVLFCST~~Gr----------~sfVRQLeP~lHI 204 (219)
|..+=.++||-.++=. -+|+|||+|.+.|
T Consensus 105 ~~~PvQ~ifclKe~N~kKinSHrWfFnaf~~~l~P~vcv 143 (163)
T PF01644_consen 105 NIVPVQIIFCLKEKNAKKINSHRWFFNAFCRQLQPNVCV 143 (163)
T ss_pred CCCCEEEEEEeccccccccchhhHHHHHHHhhcCCcEEE
Confidence 6778889999887643 3799999998764
No 9
>cd08579 GDPD_memb_like Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized bacterial glycerophosphodiester phosphodiesterases. In addition to a C-terminal GDPD domain, most members in this family have an N-terminus that functions as a membrane anchor.
Probab=57.59 E-value=19 Score=30.17 Aligned_cols=59 Identities=14% Similarity=0.087 Sum_probs=41.3
Q ss_pred HHHhhcCceEEEEEeechh-----hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770 139 LEITKFCDLYLMERVLDNE-----SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 139 leLsk~~DlYLm~~V~dD~-----eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
+++.+..++.++.-++++. -.+.+++++++.|.. .+|+|+|-......-+|++.|.+.+
T Consensus 89 l~~~~~~~~~l~iEiK~~~~~~~~~~~~v~~~l~~~~~~-------~~v~v~Sf~~~~l~~~~~~~p~~~~ 152 (220)
T cd08579 89 LALAKGLKQKLLIELKPHGHDSPDLVEKFVKLYKQNLIE-------NQHQVHSLDYRVIEKVKKLDPKIKT 152 (220)
T ss_pred HHHhhccCCeEEEEECCCCCCCHHHHHHHHHHHHHcCCC-------cCeEEEeCCHHHHHHHHHHCCCCeE
Confidence 3444445777878888765 246788888886542 4799999877777778888887654
No 10
>cd08563 GDPD_TtGDE_like Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Thermoanaerobacter tengcongensis glycerophosphodiester phosphodiesterase (TtGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. Despite the fact that most of GDPD family members exist as the monomer, TtGDE can function as a dimeric unit. Its catalytic mechanism is based on the general base-acid catalysis, which is similar to that of phosphoinositide-specific phospholipases C (PI-PLCs, EC 3.1.4.11). A divalent metal cation is required for the enzyme activity of TtGDE.
Probab=57.22 E-value=16 Score=30.78 Aligned_cols=57 Identities=26% Similarity=0.346 Sum_probs=43.0
Q ss_pred HHhhcCceEEEEEeechhh-----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeeccccee
Q 027770 140 EITKFCDLYLMERVLDNES-----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWH 203 (219)
Q Consensus 140 eLsk~~DlYLm~~V~dD~e-----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lH 203 (219)
++.+.-+++++.-++++.. ++.+.++++++|. .+++++.|-..-....+|++.|.+.
T Consensus 96 ~~~~~~~~~l~leiK~~~~~~~~~~~~l~~~l~~~~~-------~~~v~~~Sf~~~~l~~~~~~~p~~~ 157 (230)
T cd08563 96 DLLKDKDLLLNIEIKTDVIHYPGIEKKVLELVKEYNL-------EDRVIFSSFNHESLKRLKKLDPKIK 157 (230)
T ss_pred HHHHhcCcEEEEEECCCCCcChhHHHHHHHHHHHcCC-------CCCEEEEcCCHHHHHHHHHHCCCCc
Confidence 4444457888888887643 4688889988662 3689999988888888999999753
No 11
>cd08567 GDPD_SpGDE_like Glycerophosphodiester phosphodiesterase domain of putative Silicibacter pomeroyi glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46) and similar proteins. The prototype of this CD is a putative GP-GDE from Silicibacter pomeroyi (SpGDE). It shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=56.31 E-value=16 Score=31.09 Aligned_cols=66 Identities=20% Similarity=0.260 Sum_probs=44.2
Q ss_pred ehhHHHHHHHHhhcC---ceEEEEEeechhh-----------HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceee
Q 027770 131 KSSVLEALLEITKFC---DLYLMERVLDNES-----------EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVR 196 (219)
Q Consensus 131 ~esalevLleLsk~~---DlYLm~~V~dD~e-----------E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVR 196 (219)
.+.--|+|- +.+.. +++|..-++++.. .+.++++++++|. .+||+|.|...-....+|
T Consensus 109 iptL~evl~-~~~~~~~~~~~l~iEiK~~~~~~~~~~~~~~~~~~v~~~l~~~~~-------~~~v~~~Sf~~~~l~~~~ 180 (263)
T cd08567 109 IPTLEEVFA-LVEKYGNQKVRFNIETKSDPDRDILHPPPEEFVDAVLAVIRKAGL-------EDRVVLQSFDWRTLQEVR 180 (263)
T ss_pred CCCHHHHHH-HHHHhccCCceEEEEEcCCCCccccCccHHHHHHHHHHHHHHcCC-------CCceEEEeCCHHHHHHHH
Confidence 444444444 44433 5777777775432 2678888888653 268999998877788889
Q ss_pred ecccceee
Q 027770 197 QLEPDWHI 204 (219)
Q Consensus 197 QLeP~lHI 204 (219)
++.|.+.+
T Consensus 181 ~~~p~~~~ 188 (263)
T cd08567 181 RLAPDIPT 188 (263)
T ss_pred HHCCCccE
Confidence 98887643
No 12
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=55.90 E-value=16 Score=23.10 Aligned_cols=21 Identities=38% Similarity=0.376 Sum_probs=18.8
Q ss_pred CCCchHHHHHHHHHHHHHHHH
Q 027770 12 QDSRSVGAIAGLALAIVFTWR 32 (219)
Q Consensus 12 ~~~~s~~~~ag~a~a~v~~w~ 32 (219)
.|++|-.+++|++++.++++|
T Consensus 2 PDsGST~~Ll~~~l~~l~~~r 22 (26)
T TIGR03778 2 PDSGSTLALLGLGLLGLLGLR 22 (26)
T ss_pred CCchhHHHHHHHHHHHHHHHh
Confidence 478899999999999999886
No 13
>TIGR03510 XapX XapX domain. This model describes an uncharacterized small, hydrophobic protein of about 50 amino acids, found between the xapB and xapR genes of the E. coli xanthosine utilization system, and homologous regions in other small proteins, such as the N-terminal region of DUF1427 (Pfam model pfam07235). We name this domain XapX, as it comprises the full length of the protein encoded between the genes for the well-studied XapB and XapR proteins.
Probab=55.65 E-value=15 Score=26.12 Aligned_cols=23 Identities=22% Similarity=0.345 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCC
Q 027770 17 VGAIAGLALAIVFTWRMFRSPGA 39 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~lr~~~~ 39 (219)
++-.+||++.++|+|--+++|-+
T Consensus 4 lsl~~G~~vG~~~~~l~vp~PAP 26 (49)
T TIGR03510 4 LSLGAGLLVGALYSLLKVPSPAP 26 (49)
T ss_pred HHHHHHHHHHHHHHHhCCCCCCC
Confidence 46689999999999999988753
No 14
>PF04246 RseC_MucC: Positive regulator of sigma(E), RseC/MucC; InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=54.87 E-value=12 Score=29.65 Aligned_cols=28 Identities=21% Similarity=0.171 Sum_probs=21.3
Q ss_pred CCchHHHHHHHHHHHHHHHHhhcCCCCC
Q 027770 13 DSRSVGAIAGLALAIVFTWRMFRSPGAP 40 (219)
Q Consensus 13 ~~~s~~~~ag~a~a~v~~w~~lr~~~~~ 40 (219)
...-+++++|+++++++.|++-|.-...
T Consensus 97 ~~~~l~~l~~l~~~~~~~~~~~~~~~~~ 124 (135)
T PF04246_consen 97 LWAILGGLLGLALGFLILRLFDRRLKKK 124 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence 3445788999999999999888865544
No 15
>PF13419 HAD_2: Haloacid dehalogenase-like hydrolase; PDB: 2FI1_A 2I6X_A 3SD7_A 4F71_A 4DFD_B 4F72_B 4DCC_A 3DDH_A 3KZX_A 2B0C_A ....
Probab=50.58 E-value=69 Score=23.95 Aligned_cols=79 Identities=20% Similarity=0.266 Sum_probs=51.8
Q ss_pred hccceeehhHHHHHHHHh-hcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCc--------ccee
Q 027770 125 QKQVTVKSSVLEALLEIT-KFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR--------TSFV 195 (219)
Q Consensus 125 ~~~a~v~esalevLleLs-k~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr--------~sfV 195 (219)
.......+.+.+.|.+|. +...+++++.- +...+...|+..|+.+ + -+ -+||+.+.|. ..++
T Consensus 73 ~~~~~~~~~~~~~L~~l~~~~~~~~i~Sn~----~~~~~~~~l~~~~~~~---~-f~-~i~~~~~~~~~Kp~~~~~~~~~ 143 (176)
T PF13419_consen 73 ESKLQPYPGVRELLERLKAKGIPLVIVSNG----SRERIERVLERLGLDD---Y-FD-EIISSDDVGSRKPDPDAYRRAL 143 (176)
T ss_dssp HGGEEESTTHHHHHHHHHHTTSEEEEEESS----EHHHHHHHHHHTTHGG---G-CS-EEEEGGGSSSSTTSHHHHHHHH
T ss_pred hhccchhhhhhhhhhhcccccceeEEeecC----Cccccccccccccccc---c-cc-cccccchhhhhhhHHHHHHHHH
Confidence 468899999999999998 66788776533 5677778888866542 1 13 3456665553 3344
Q ss_pred eec--cc--ceeecCCHHHHH
Q 027770 196 RQL--EP--DWHIDTNPEIVS 212 (219)
Q Consensus 196 RQL--eP--~lHIDtd~~vv~ 212 (219)
+++ +| .+|||.++.-+.
T Consensus 144 ~~~~~~p~~~~~vgD~~~d~~ 164 (176)
T PF13419_consen 144 EKLGIPPEEILFVGDSPSDVE 164 (176)
T ss_dssp HHHTSSGGGEEEEESSHHHHH
T ss_pred HHcCCCcceEEEEeCCHHHHH
Confidence 433 44 678887765443
No 16
>cd08573 GDPD_GDE1 Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE1 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE1 (also known as MIR16, membrane interacting protein of RGS16) and their metazoan homologs. GDE1 is widely expressed in mammalian tissues, including the heart, brain, liver, and kidney. It shows sequence homology to bacterial glycerophosphodiester phosphodiesterases (GP-GDEs, EC 3.1.4.46), which catalyzes the hydrolysis of various glycerophosphodiesters, and produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. GDE1 has been characterized as GPI-GDE (EC 3.1.4.44) that selectively hydrolyzes extracellular glycerophosphoinositol (GPI) to generate glycerol phosphate and inositol. It functions as an integral membrane-bound glycoprotein interacting with regulator of G protein signaling protein RGS16, and is modulated by G
Probab=50.11 E-value=28 Score=30.47 Aligned_cols=62 Identities=18% Similarity=0.096 Sum_probs=45.1
Q ss_pred HHHHHhhcCceEEEEEeechhh--HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770 137 ALLEITKFCDLYLMERVLDNES--EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 137 vLleLsk~~DlYLm~~V~dD~e--E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
-++++.+..++.++.-++++.+ ++.+.+++++.|.+ .++++++|-......-+|++.|.+.+
T Consensus 93 evl~~~~~~~~~l~iEiK~~~~~~~~~v~~~l~~~~~~------~~~v~v~SF~~~~l~~~~~~~p~~~~ 156 (258)
T cd08573 93 EAVKECLENNLRMIFDVKSNSSKLVDALKNLFKKYPGL------YDKAIVCSFNPIVIYKVRKADPKILT 156 (258)
T ss_pred HHHHHHHhcCCEEEEEeCCCcHHHHHHHHHHHHHCCCc------cCCEEEEECCHHHHHHHHHhCCCceE
Confidence 3335555557777777886543 46788888886532 36899999998888899999998654
No 17
>cd08564 GDPD_GsGDE_like Glycerophosphodiester phosphodiesterase domain of putative Galdieria sulphuraria glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in putative Galdieria sulphuraria glycerophosphodiester phosphodiesterase (GsGDE, EC 3.1.4.46) and its uncharacterized eukaryotic homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=48.77 E-value=37 Score=29.53 Aligned_cols=61 Identities=21% Similarity=0.247 Sum_probs=42.9
Q ss_pred hhHHHHHHHHhhcCceEEEEEeech--hhHHHHHHHHHHcCCCCCCCCccceEEeeecCc-Cccceeeecccc
Q 027770 132 SSVLEALLEITKFCDLYLMERVLDN--ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI-GRTSFVRQLEPD 201 (219)
Q Consensus 132 esalevLleLsk~~DlYLm~~V~dD--~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~-Gr~sfVRQLeP~ 201 (219)
+.-.|+|- +.+. ++.+..-++++ .-++.++++|+++|.. ++|+|||-.- .-...+|++.|.
T Consensus 108 ptL~evl~-~~~~-~~~l~iEiK~~~~~~~~~v~~~l~~~~~~-------~~v~i~SF~~~~~l~~~~~~~~~ 171 (265)
T cd08564 108 PTLEDVLV-TFKD-KLKYNIELKGREVGLGERVLNLVEKYGMI-------LQVHFSSFLHYDRLDLLKALRPN 171 (265)
T ss_pred CCHHHHHH-Hhcc-CcEEEEEeCCCchhHHHHHHHHHHHcCCC-------CCEEEEecCchhHHHHHHHhCcC
Confidence 33344443 3333 67777778866 3458899999996642 6899999888 777788998886
No 18
>cd08601 GDPD_SaGlpQ_like Glycerophosphodiester phosphodiesterase domain of Staphylococcus aureus and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) from Staphylococcus aureus, Bacillus subtilis and similar proteins. Members in this family show very high sequence similarity to Escherichia coli periplasmic phosphodiesterase GlpQ, which catalyzes the Ca2+-dependent degradation of periplasmic glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=48.59 E-value=34 Score=29.40 Aligned_cols=44 Identities=27% Similarity=0.382 Sum_probs=35.1
Q ss_pred hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 158 SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 158 eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
-++.+.++++++|+.+. .+.+++|+|+|-..--...+|++.|.+
T Consensus 127 ~~~~v~~~l~~~~~~~~-~~~~~~v~i~Sf~~~~l~~~~~~~p~~ 170 (256)
T cd08601 127 MEEKLLATLDKYGLLTD-NLKNGQVIIQSFSKESLKKLHQLNPNI 170 (256)
T ss_pred HHHHHHHHHHHcCCCcc-cCCCCCEEEecCCHHHHHHHHHhCCCC
Confidence 35789999999887652 455689999998888888889988864
No 19
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=44.86 E-value=20 Score=29.45 Aligned_cols=19 Identities=26% Similarity=0.369 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHhhc
Q 027770 17 VGAIAGLALAIVFTWRMFR 35 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~lr 35 (219)
+|.+||++..++|.++++|
T Consensus 71 ~gv~aGvIg~Illi~y~ir 89 (122)
T PF01102_consen 71 FGVMAGVIGIILLISYCIR 89 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 6788888887888888874
No 20
>PRK11009 aphA acid phosphatase/phosphotransferase; Provisional
Probab=43.04 E-value=1.5e+02 Score=26.42 Aligned_cols=110 Identities=10% Similarity=0.065 Sum_probs=66.9
Q ss_pred CCchHHHHHHHhcCce--eEEEeecceeeccCCh--------------h----h--------hhccceeehhHHHHHHHH
Q 027770 90 KPTLGQVVRQKLSEGR--KVTCRLLGVILEESCP--------------E----E--------LQKQVTVKSSVLEALLEI 141 (219)
Q Consensus 90 ~~t~~qiv~~kls~~R--kvTis~~gvvlee~sp--------------~----e--------l~~~a~v~esalevLleL 141 (219)
-++..||-.. |.|.+ .|.+-+.++|+.++.. + + .+......+.++|.|-.+
T Consensus 48 ~~~~~~~~~~-~~~~~p~av~~DIDeTvldnsp~~~~~~~~f~~~~~~y~~~~~fw~~y~~~~~~~a~p~~Ga~elL~~L 126 (237)
T PRK11009 48 WVSVAQIEKS-LEGRPPMAVGFDIDDTVLFSSPGFWRGKKTFSPGSEDYLKNQKFWEKMNNGWDEFSIPKEVARQLIDMH 126 (237)
T ss_pred EEEHHHhhhh-ccCCCCcEEEEECcCccccCCchheeeeeccCCCcccccChHHHHHHHHhcccccCcchHHHHHHHHHH
Confidence 3688888754 44433 5777788899875321 0 0 011333445589999988
Q ss_pred -hhcCceEEEEEeechhhHHHHHHHHHHcCC--CCCCCCccceEEeeecC---cCccceeeecccceeec
Q 027770 142 -TKFCDLYLMERVLDNESEKKVLLALETAGV--FTSGGLVKDKVLFCSTE---IGRTSFVRQLEPDWHID 205 (219)
Q Consensus 142 -sk~~DlYLm~~V~dD~eE~~Vl~~Le~~Gl--F~~gGL~rHKVLFCST~---~Gr~sfVRQLeP~lHID 205 (219)
.+-..||+++-=.....+.....+++..|+ -. -..++||.+. ..+...+.+++-.++|.
T Consensus 127 ~~~G~~I~iVTnR~~~k~~~t~~~Llk~~gip~~~-----~f~vil~gd~~~K~~K~~~l~~~~i~I~IG 191 (237)
T PRK11009 127 VKRGDSIYFITGRTATKTETVSKTLADDFHIPADN-----MNPVIFAGDKPGQYTKTQWLKKKNIRIFYG 191 (237)
T ss_pred HHCCCeEEEEeCCCCcccHHHHHHHHHHcCCCccc-----ceeEEEcCCCCCCCCHHHHHHhcCCeEEEc
Confidence 555778887633223445566666766555 22 1357787663 45666777777778876
No 21
>PF01171 ATP_bind_3: PP-loop family; InterPro: IPR011063 This entry represents the PP-loop motif superfamily [,]. The PP-loop motif appears to be a modified version of the P-loop of nucleotide binding domain that is involved in phosphate binding []. Named PP-motif, since it appears to be a part of a previously uncharacterised ATP pyrophophatase domain. ATP sulfurylases, Escherichia coli NtrL, and Bacillus subtilis OutB consist of this domain alone. In other proteins, the pyrophosphatase domain is associated with amidotransferase domains (type I or type II), a putative citrulline-aspartate ligase domain or a nitrilase/amidase domain.; PDB: 3A2K_A 2E89_B 2E21_D 1WY5_B 1NI5_A.
Probab=42.76 E-value=58 Score=26.64 Aligned_cols=67 Identities=19% Similarity=0.299 Sum_probs=42.3
Q ss_pred eehhHHHHHHHHhhc--CceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec
Q 027770 130 VKSSVLEALLEITKF--CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID 205 (219)
Q Consensus 130 v~esalevLleLsk~--~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID 205 (219)
.+.--.+.|.++|+. |+..++.|-.||+-|.-+++++...|+.+-.|+.+.+-. .| ++=+.|.+.+.
T Consensus 80 aR~~Ry~~l~~~a~~~g~~~i~~GHh~dD~~ET~l~~l~rg~~~~~l~~~~~~~~~-----~~----~~~iRPLl~~~ 148 (182)
T PF01171_consen 80 ARELRYQFLREIAKEEGCNKIALGHHLDDQAETFLMNLLRGSGLRGLAGMPPVSPF-----KG----IKLIRPLLYVS 148 (182)
T ss_dssp HHHHHHHHHHHHHHTTT-CEEE---BHHHHHHHHHHHHHHT--CCCCC-S-SEEEE-----TT----CEEE-GGGCS-
T ss_pred HHHHHHHHHHHhhhcccccceeecCcCCccHHHHHHHHHHhccchhhccccccccc-----cC----cccCCcchhCC
Confidence 344555788888876 699999999999999999999999888665576663322 33 55566877664
No 22
>cd01028 TOPRIM_TopoIA TOPRIM_TopoIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IA family of DNA topoisomerases (TopoIA). This subgroup contains proteins similar to the Type I DNA topoisomerases: E. coli topisomerases I and III, eukaryotic topoisomerase III and, ATP-dependent reverse gyrase found in archaea and thermophilic bacteria. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA. These enzymes cleave one strand of the DNA duplex, covalently link to the 5' phosphoryl end of the DNA break and allow the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general b
Probab=41.38 E-value=1.8e+02 Score=23.17 Aligned_cols=79 Identities=25% Similarity=0.234 Sum_probs=50.5
Q ss_pred eeEEEeecceeeccCChhhhhcc--------------cee---ehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHH
Q 027770 105 RKVTCRLLGVILEESCPEELQKQ--------------VTV---KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALE 167 (219)
Q Consensus 105 RkvTis~~gvvlee~sp~el~~~--------------a~v---~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le 167 (219)
.-+.+++.|=|++=..|++-... +.+ ..+.++.|.++++-+|.++++.=.|-|.|.-...+++
T Consensus 34 ~~~v~~~~GHl~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~l~~~~d~iiiAtD~DrEGE~I~~~i~~ 113 (142)
T cd01028 34 GYVVTASVGHLLELPFPEEYVDWDKDWPLELFPFEPKYVVIPDKKKQLKALKKLAKKADEIVLATDPDREGELIAWEILE 113 (142)
T ss_pred CEEEEEEccccccCCCcccccccccCCchhhCCCCceEEeCCcHHHHHHHHHHHHhcCCEEEEcCCCCcchHHHHHHHHH
Confidence 34667778888776555543331 111 3456778889999999888776666666665566777
Q ss_pred HcCCCCCCCCccceEEeee
Q 027770 168 TAGVFTSGGLVKDKVLFCS 186 (219)
Q Consensus 168 ~~GlF~~gGL~rHKVLFCS 186 (219)
-.|. . -.+-.|+.|-|
T Consensus 114 ~~~~-~--~~~v~R~~fss 129 (142)
T cd01028 114 VLKC-D--NKPVKRAWFSE 129 (142)
T ss_pred HhCC-C--CCCeEEEEEcc
Confidence 6554 1 22447888876
No 23
>TIGR02245 HAD_IIID1 HAD-superfamily subfamily IIID hydrolase, TIGR02245. This family of sequences appears to belong to the Haloacid Dehalogenase (HAD) superfamily of enzymes by virtue of the presence of three catalytic domains, in this case: LLVLD(ILV)D(YH)T, I(VMG)IWS, and (DN)(VC)K(PA)Lx{15-17}T(IL)(MH)(FV)DD(IL)(GRS)(RK)N. Since this family has no large "cap" domain between motifs 1 and 2 or between 2 and 3, it is formally a "class III" HAD.
Probab=40.87 E-value=2.4e+02 Score=24.49 Aligned_cols=76 Identities=16% Similarity=0.187 Sum_probs=54.6
Q ss_pred CceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceE
Q 027770 103 EGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKV 182 (219)
Q Consensus 103 ~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKV 182 (219)
|.+.+.+-+.++++.-.++.+ ......|+-+-|-|-.++..+||++.+ ...++.+...++.-|+.++ .+.|+
T Consensus 20 ~kklLVLDLDeTLvh~~~~~~-~~~~~kRP~l~eFL~~~~~~feIvVwT----Aa~~~ya~~~l~~l~~~~~---~~~~i 91 (195)
T TIGR02245 20 GKKLLVLDIDYTLFDHRSPAE-TGEELMRPYLHEFLTSAYEDYDIVIWS----ATSMKWIEIKMTELGVLTN---PNYKI 91 (195)
T ss_pred CCcEEEEeCCCceEcccccCC-CceEEeCCCHHHHHHHHHhCCEEEEEe----cCCHHHHHHHHHHhcccCC---ccceE
Confidence 456777888899886434433 346788999999999999999999876 4456788888888766553 12345
Q ss_pred Eeee
Q 027770 183 LFCS 186 (219)
Q Consensus 183 LFCS 186 (219)
.||=
T Consensus 92 ~~~l 95 (195)
T TIGR02245 92 TFLL 95 (195)
T ss_pred EEEe
Confidence 5543
No 24
>TIGR03190 benz_CoA_bzdN benzoyl-CoA reductase, bzd-type, N subunit. Members of this family are the N subunit of one of two related types of four-subunit ATP-dependent benzoyl-CoA reductase. This enzyme system catalyzes the dearomatization of benzoyl-CoA, a common intermediate in pathways for the degradation for a number of different aromatic compounds, such as phenol and toluene.
Probab=40.48 E-value=7 Score=36.33 Aligned_cols=70 Identities=17% Similarity=0.165 Sum_probs=46.1
Q ss_pred HHHHHHhhcCceEEEEEeec-hhh--HHHHHHHHHHcCCCCCCCCccceEEeeecCcCcccee-eecc----cceeecCC
Q 027770 136 EALLEITKFCDLYLMERVLD-NES--EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFV-RQLE----PDWHIDTN 207 (219)
Q Consensus 136 evLleLsk~~DlYLm~~V~d-D~e--E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfV-RQLe----P~lHIDtd 207 (219)
+++..||+-+--...|.... +.+ -+.+..+.++ ++..|+.-|-.-||-+..+-...+ ++|+ |.++||+|
T Consensus 276 dpl~alA~~yl~~~~C~~~~~p~~~R~~~i~~lv~~---~~~DGVI~~~~kfC~~~~~e~~~lk~~l~e~GIP~L~iE~D 352 (377)
T TIGR03190 276 DVIKAIAERYCDRPACPTKDYPVHTRYDHVLGLAKE---YNVQGAIFLQQKFCDPHEGDYPDLKRHLEANGIPTLFLEFD 352 (377)
T ss_pred cHHHHHHHHhcCCCCCCCcCCCHHHHHHHHHHHHHH---hCCCEEEEecccCCCcchhhhHHHHHHHHHCCCCEEEEecC
Confidence 45666665542123344432 222 2555555565 344588999999999999988665 6776 99999987
Q ss_pred H
Q 027770 208 P 208 (219)
Q Consensus 208 ~ 208 (219)
-
T Consensus 353 ~ 353 (377)
T TIGR03190 353 I 353 (377)
T ss_pred C
Confidence 5
No 25
>cd01427 HAD_like Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
Probab=39.57 E-value=1e+02 Score=21.71 Aligned_cols=60 Identities=28% Similarity=0.305 Sum_probs=43.9
Q ss_pred EEeecceeeccCChhhhhccceeehhHHHHHHHHhhc-CceEEEEEeechhhHHHHHHHHHHcCC
Q 027770 108 TCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKF-CDLYLMERVLDNESEKKVLLALETAGV 171 (219)
Q Consensus 108 Tis~~gvvlee~sp~el~~~a~v~esalevLleLsk~-~DlYLm~~V~dD~eE~~Vl~~Le~~Gl 171 (219)
.+.+.|+++.+.+..+........+.+++.|-+|... ..+|+++ ......+...++..|+
T Consensus 3 vfD~D~tl~~~~~~~~~~~~~~~~~~~~~~l~~l~~~g~~i~ivS----~~~~~~~~~~~~~~~~ 63 (139)
T cd01427 3 LFDLDGTLLDSEPGIAEIEELELYPGVKEALKELKEKGIKLALAT----NKSRREVLELLEELGL 63 (139)
T ss_pred EEccCCceEccCccccccccCCcCcCHHHHHHHHHHCCCeEEEEe----CchHHHHHHHHHHcCC
Confidence 4567888888777666667788999999999999986 5555544 3336677777777665
No 26
>cd08561 GDPD_cytoplasmic_ScUgpQ2_like Glycerophosphodiester phosphodiesterase domain of Streptomyces coelicolor cytoplasmic phosphodiesterases UgpQ2 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized cytoplasmic phosphodiesterases which predominantly exist in bacteria. The prototype of this family is a putative cytoplasmic phosphodiesterase encoded by gene ulpQ2 (SCO1419) in the Streptomyces coelicolor genome. It is distantly related to the Escherichia coli cytoplasmic phosphodiesterases UgpQ that catalyzes the hydrolysis of glycerophosphodiesters at the inner side of the cytoplasmic membrane to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=39.29 E-value=49 Score=28.30 Aligned_cols=55 Identities=22% Similarity=0.221 Sum_probs=43.8
Q ss_pred ceEEEEEeech--hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeecCC
Q 027770 146 DLYLMERVLDN--ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTN 207 (219)
Q Consensus 146 DlYLm~~V~dD--~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHIDtd 207 (219)
++++..-+.+. .-.+.++++++++|+ .++++|.|-...-...++++.|.+.+.-.
T Consensus 106 ~~~~~ieiK~~~~~~~~~~~~~l~~~~~-------~~~~~~~Sf~~~~l~~~~~~~p~~~~~~~ 162 (249)
T cd08561 106 DVRLNIEIKDDGPAAAAALADLIERYGA-------QDRVLVASFSDRVLRRFRRLCPRVATSAG 162 (249)
T ss_pred CCcEEEEECCCchhHHHHHHHHHHHcCC-------CCcEEEEECCHHHHHHHHHHCCCcceecc
Confidence 47788888886 446789999998662 26899999999999999999998766543
No 27
>COG4317 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.31 E-value=38 Score=26.97 Aligned_cols=20 Identities=35% Similarity=0.534 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHhhcCCC
Q 027770 19 AIAGLALAIVFTWRMFRSPG 38 (219)
Q Consensus 19 ~~ag~a~a~v~~w~~lr~~~ 38 (219)
-.||+++.++|+...+|+|-
T Consensus 8 lgAGllVGiiyaLl~vrsPA 27 (93)
T COG4317 8 LGAGLLVGIIYALLKVRSPA 27 (93)
T ss_pred HhhhHHHHHHHHHHhCCCCC
Confidence 34799999999999999875
No 28
>cd08568 GDPD_TmGDE_like Glycerophosphodiester phosphodiesterase domain of Thermotoga maritime and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Thermotoga maritime glycerophosphodiester phosphodiesterase (TmGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. TmGDE exists as a monomer that might be the biologically relevant form.
Probab=37.04 E-value=59 Score=27.46 Aligned_cols=50 Identities=26% Similarity=0.359 Sum_probs=36.6
Q ss_pred ceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 146 DLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 146 DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
+++|..-+++....+.+.++++++|.. .+++|-|-..-....+|++.|.+
T Consensus 92 ~~~l~iEiK~~~~~~~~~~~l~~~~~~-------~~v~i~SF~~~~l~~~~~~~p~~ 141 (226)
T cd08568 92 DAIINVEIKDIDAVEPVLEIVEKFNAL-------DRVIFSSFNHDALRELRKLDPDA 141 (226)
T ss_pred CcEEEEEECCccHHHHHHHHHHHcCCC-------CcEEEEECCHHHHHHHHHhCCCC
Confidence 467888899887778899999987643 46777776665556666666654
No 29
>PRK07066 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=37.03 E-value=97 Score=28.73 Aligned_cols=71 Identities=17% Similarity=0.102 Sum_probs=46.2
Q ss_pred HHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccc--eee-----------------eccc
Q 027770 140 EITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTS--FVR-----------------QLEP 200 (219)
Q Consensus 140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~s--fVR-----------------QLeP 200 (219)
+..+.||+.+-+-.++-+-|..+..-+++ ..+++-||-+||+ |-.. |.- ++-|
T Consensus 79 ~av~~aDlViEavpE~l~vK~~lf~~l~~-------~~~~~aIlaSnTS-~l~~s~la~~~~~p~R~~g~HffnP~~~~p 150 (321)
T PRK07066 79 ACVADADFIQESAPEREALKLELHERISR-------AAKPDAIIASSTS-GLLPTDFYARATHPERCVVGHPFNPVYLLP 150 (321)
T ss_pred HHhcCCCEEEECCcCCHHHHHHHHHHHHH-------hCCCCeEEEECCC-ccCHHHHHHhcCCcccEEEEecCCccccCc
Confidence 55577998776655566667777777776 5666777777777 5321 221 2334
Q ss_pred ceee----cCCHHHHHhhhhcc
Q 027770 201 DWHI----DTNPEIVSQLAVHL 218 (219)
Q Consensus 201 ~lHI----Dtd~~vv~~L~rfv 218 (219)
.+=| +|+++++..+..|+
T Consensus 151 LVEVv~g~~T~~e~~~~~~~f~ 172 (321)
T PRK07066 151 LVEVLGGERTAPEAVDAAMGIY 172 (321)
T ss_pred eEEEeCCCCCCHHHHHHHHHHH
Confidence 4433 79999998887764
No 30
>PF10389 CoatB: Bacteriophage coat protein B ; InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 2IFO_A.
Probab=36.95 E-value=26 Score=24.64 Aligned_cols=11 Identities=9% Similarity=0.241 Sum_probs=8.7
Q ss_pred HHHHHHhhcCC
Q 027770 27 IVFTWRMFRSP 37 (219)
Q Consensus 27 ~v~~w~~lr~~ 37 (219)
.+..||++|.+
T Consensus 35 ~i~v~kwiRra 45 (46)
T PF10389_consen 35 GIAVYKWIRRA 45 (46)
T ss_dssp HHHHHHHHHHT
T ss_pred HHHHHHHHHhh
Confidence 57889999854
No 31
>cd08565 GDPD_pAtGDE_like Glycerophosphodiester phosphodiesterase domain of putative Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in putative Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase (pAtGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=36.21 E-value=68 Score=27.65 Aligned_cols=59 Identities=17% Similarity=0.215 Sum_probs=40.2
Q ss_pred HHHHHHhhcCceEEEEEeechh-------hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 136 EALLEITKFCDLYLMERVLDNE-------SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 136 evLleLsk~~DlYLm~~V~dD~-------eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
+-++++.+..+++|..-++++. -++.+.++++++|+. ++++|.|-......-+|++ |.+
T Consensus 85 ~evl~~~~~~~~~l~iEiK~~~~~~~~~~~~~~v~~~i~~~~~~-------~~v~~~Sf~~~~l~~~~~~-p~~ 150 (235)
T cd08565 85 EEVLALFAPSGLELHVEIKTDADGTPYPGAAALAAATLRRHGLL-------ERSVLTSFDPAVLTEVRKH-PGV 150 (235)
T ss_pred HHHHHHhhccCcEEEEEECCCCCCCccHHHHHHHHHHHHhCCCc-------CCEEEEECCHHHHHHHHhC-CCC
Confidence 3344555557787878887653 246788888886542 5788888877777777777 753
No 32
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=35.42 E-value=45 Score=24.91 Aligned_cols=19 Identities=26% Similarity=0.504 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHhhc
Q 027770 17 VGAIAGLALAIVFTWRMFR 35 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~lr 35 (219)
+|+++||.+|--+-=|+|+
T Consensus 9 ~G~~~Gff~ar~~~~k~l~ 27 (64)
T PF03672_consen 9 VGAVIGFFIARKYMEKQLK 27 (64)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 6777777777777777776
No 33
>PF12732 YtxH: YtxH-like protein; InterPro: IPR024623 This family of uncharacterised proteins is found in bacteria. Proteins in this family are typically between 100 and 143 amino acids in length. The N-terminal region is the most conserved.
Probab=34.17 E-value=29 Score=25.08 Aligned_cols=14 Identities=43% Similarity=0.714 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHH
Q 027770 17 VGAIAGLALAIVFT 30 (219)
Q Consensus 17 ~~~~ag~a~a~v~~ 30 (219)
+|+++|.++++.|+
T Consensus 7 ~Ga~~Ga~~glL~a 20 (74)
T PF12732_consen 7 AGAAAGAAAGLLFA 20 (74)
T ss_pred HHHHHHHHHHHHhC
Confidence 56677777666653
No 34
>PF00891 Methyltransf_2: O-methyltransferase; InterPro: IPR001077 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. This domain includes a range of O-methyltransferases some of which utilise S-adenosyl methionine as substrate []. In prokaryotes, the major role of DNA methylation is to protect host DNA against degradation by restriction enzymes. In eukaryotes, DNA methylation has been implicated in the control of several cellular processes, including differentiation, gene regulation, and embryonic development. O-methyltransferases have a common catalytic domain structure, which might be universal among S-adenosyl-L-methionine (AdoMet)-dependent methyltransferases []. Comparative analysis of the predicted amino acid sequences of a number of plant O-methyltransferase cDNA clones show that they share some 32-71% sequence identity, and can be grouped according to the different compounds they utilise as substrates [].; GO: 0008171 O-methyltransferase activity; PDB: 1FPQ_A 1FP1_D 3P9K_B 3P9I_D 3P9C_A 3I53_A 3I5U_A 3I64_A 3I58_A 1ZG3_A ....
Probab=32.69 E-value=35 Score=28.81 Aligned_cols=28 Identities=25% Similarity=0.470 Sum_probs=22.9
Q ss_pred HHhhcCceEEEEEeechhhHHHHHHHHHH
Q 027770 140 EITKFCDLYLMERVLDNESEKKVLLALET 168 (219)
Q Consensus 140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~ 168 (219)
.+-. +|+|+|.+|..|=.++.+..+|++
T Consensus 155 ~~P~-~D~~~l~~vLh~~~d~~~~~iL~~ 182 (241)
T PF00891_consen 155 PLPV-ADVYLLRHVLHDWSDEDCVKILRN 182 (241)
T ss_dssp CCSS-ESEEEEESSGGGS-HHHHHHHHHH
T ss_pred hhcc-ccceeeehhhhhcchHHHHHHHHH
Confidence 3444 899999999999999999999985
No 35
>TIGR03593 yidC_nterm membrane protein insertase, YidC/Oxa1 family, N-terminal domain. Essentially all bacteria have a member of the YidC family, whose C-terminal domain is modeled by TIGR03592. The two copies are found in endospore-forming bacteria such as Bacillus subtilis appear redundant during vegetative growth, although the member designated spoIIIJ (stage III sporulation protein J) has a distinct role in spore formation. YidC, its mitochondrial homolog Oxa1, and its chloroplast homolog direct insertion into the bacterial/organellar inner (or only) membrane. This model describes an N-terminal sequence region, including a large periplasmic domain lacking in YidC members from Gram-positive species. The multifunctional YidC protein acts both with and independently of the Sec system.
Probab=32.23 E-value=99 Score=28.21 Aligned_cols=19 Identities=26% Similarity=0.712 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHhhcCCCCC
Q 027770 22 GLALAIVFTWRMFRSPGAP 40 (219)
Q Consensus 22 g~a~a~v~~w~~lr~~~~~ 40 (219)
.|.+.+++.|.++..+..+
T Consensus 9 ~ls~~~~~~w~~~~~~~~~ 27 (366)
T TIGR03593 9 ALSFVIFLLWQAWQSDPGP 27 (366)
T ss_pred HHHHHHHHHHHHHHhhccC
Confidence 3566688999999866543
No 36
>PF13844 Glyco_transf_41: Glycosyl transferase family 41; PDB: 3PE4_C 3PE3_D 3TAX_C 2XGO_A 2JLB_B 2XGM_A 2VSY_B 2XGS_B 2VSN_A.
Probab=30.68 E-value=1.1e+02 Score=30.26 Aligned_cols=74 Identities=18% Similarity=0.276 Sum_probs=47.7
Q ss_pred ccceeehhHHHHHHHHhhc---CceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 126 KQVTVKSSVLEALLEITKF---CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 126 ~~a~v~esalevLleLsk~---~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
..+++.++.+++..+|.+- +-+.|+. .....|+.+...+++ .|+..+||+|.-...-..++-|.-.-|+
T Consensus 293 ~~~KI~p~~l~~W~~IL~~vP~S~L~L~~--~~~~~~~~l~~~~~~------~Gv~~~Ri~f~~~~~~~ehl~~~~~~DI 364 (468)
T PF13844_consen 293 NLFKISPETLDLWARILKAVPNSRLWLLR--FPASGEARLRRRFAA------HGVDPDRIIFSPVAPREEHLRRYQLADI 364 (468)
T ss_dssp -GGG--HHHHHHHHHHHHHSTTEEEEEEE--TSTTHHHHHHHHHHH------TTS-GGGEEEEE---HHHHHHHGGG-SE
T ss_pred ccccCCHHHHHHHHHHHHhCCCcEEEEee--CCHHHHHHHHHHHHH------cCCChhhEEEcCCCCHHHHHHHhhhCCE
Confidence 4677889999999998887 3444443 345567888888887 4778899999887665566666677888
Q ss_pred eecCC
Q 027770 203 HIDTN 207 (219)
Q Consensus 203 HIDtd 207 (219)
..|+-
T Consensus 365 ~LDT~ 369 (468)
T PF13844_consen 365 CLDTF 369 (468)
T ss_dssp EE--S
T ss_pred EeeCC
Confidence 88864
No 37
>PF01751 Toprim: Toprim domain; InterPro: IPR006171 This is a conserved region from DNA primase. This corresponds to the Toprim (topoisomerase-primase) domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR/M DNA repair proteins []. Both DnaG motifs IV and V are present in the alignment, the DxD (V) motif may be involved in Mg2+ binding and mutations to the conserved glutamate (IV) completely abolish DnaG type primase activity. DNA primase 2.7.7.6 from EC is a nucleotidyltransferase it synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork; it can also prime the leading stand and has been implicated in cell division []. This family also includes the atypical archaeal A subunit from type II DNA topoisomerases []. Type II DNA topoisomerases catalyse the relaxation of DNA supercoiling by causing transient double strand breaks.; PDB: 2ZJT_A 3IG0_A 3M4I_A 3NUH_B 1GKU_B 1GL9_C 3PWT_A 1CY4_A 1ECL_A 1CY7_A ....
Probab=30.11 E-value=2.3e+02 Score=20.98 Aligned_cols=75 Identities=20% Similarity=0.242 Sum_probs=50.4
Q ss_pred CchHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHH
Q 027770 91 PTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALE 167 (219)
Q Consensus 91 ~t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le 167 (219)
++-..-+++=|++..-..+++.|-+++-..|++-.+ .-..+.++.|.++.+.+|-.+++.=.|.+.|.-...+++
T Consensus 8 ps~a~~i~~~l~~~~~~v~~~~Ghl~~~~~~~~~~~--~~~~~~i~~l~~~~~~~~~iiiatD~D~EGe~Ia~~i~~ 82 (100)
T PF01751_consen 8 PSDAKAIAKALGGEEYIVIATSGHLLELAKPEDYDP--KDKKKQIKNLKKLLKKADEIIIATDPDREGELIAWEIIE 82 (100)
T ss_dssp HHHHHHHHHHSSTTTEEEEEESSSSEESTTSSHHHC--HTTHHHHHHHHHHHHSCSEEEEEC-SSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCCCEEEEEeCCccccccccccccc--ccccccchhhHHHhhhccEeeecCCCChHHHHHHHHHHH
Confidence 334445555555666888999999999888776544 344566788888888788888787666666644444444
No 38
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.94 E-value=89 Score=21.68 Aligned_cols=30 Identities=20% Similarity=0.131 Sum_probs=25.8
Q ss_pred hhcCceEEEEEeechhhHHHHHHHHHHcCC
Q 027770 142 TKFCDLYLMERVLDNESEKKVLLALETAGV 171 (219)
Q Consensus 142 sk~~DlYLm~~V~dD~eE~~Vl~~Le~~Gl 171 (219)
.+.+.+.+...+.+.++-+.+++.|++.|.
T Consensus 35 ~~~~~v~v~ie~~~~~~~~~i~~~L~~~G~ 64 (68)
T cd04885 35 GDEARVLVGIQVPDREDLAELKERLEALGY 64 (68)
T ss_pred CCceEEEEEEEeCCHHHHHHHHHHHHHcCC
Confidence 456888888999998899999999999774
No 39
>PRK09260 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=28.98 E-value=1.4e+02 Score=26.25 Aligned_cols=42 Identities=14% Similarity=0.022 Sum_probs=28.1
Q ss_pred HHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecC
Q 027770 140 EITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTE 188 (219)
Q Consensus 140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~ 188 (219)
+..+.||+.+.|--.+.+-++.+..-+++ .+.++.++...|+
T Consensus 77 ~~~~~aD~Vi~avpe~~~~k~~~~~~l~~-------~~~~~~il~~~tS 118 (288)
T PRK09260 77 AAVADADLVIEAVPEKLELKKAVFETADA-------HAPAECYIATNTS 118 (288)
T ss_pred HhhcCCCEEEEeccCCHHHHHHHHHHHHh-------hCCCCcEEEEcCC
Confidence 55678999998877666667777766655 3455666644343
No 40
>cd08566 GDPD_AtGDE_like Glycerophosphodiester phosphodiesterase domain of Agrobacterium tumefaciens and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase (AtGDE, EC 3.1.4.46) and its uncharacterized eukaryotic homolgoues. Members in this family shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. AtGDE exists as a hexamer that is a trimer of dimers, which is unique among current known GDPD family members. However, it remains unclear if the hexamer plays a physiological role in AtGDE enzymatic function.
Probab=28.70 E-value=1.1e+02 Score=26.41 Aligned_cols=48 Identities=23% Similarity=0.318 Sum_probs=37.8
Q ss_pred eEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 147 LYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 147 lYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
+++..-+++. ..+.+.+++++.|. ..||+|.|-.......+|++.|.+
T Consensus 101 ~~l~iEiK~~-~~~~~~~~~~~~~~-------~~~v~~~sf~~~~l~~~~~~~p~~ 148 (240)
T cd08566 101 ILLNLDLKDA-DLDEVIALVKKHGA-------LDQVIFKSYSEEQAKELRALAPEV 148 (240)
T ss_pred cEEEEEECch-HHHHHHHHHHHcCC-------cccEEEEECCHHHHHHHHHhCCCC
Confidence 7777888865 56778899998664 258999999888888888888875
No 41
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=28.63 E-value=53 Score=26.53 Aligned_cols=14 Identities=21% Similarity=0.655 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHH
Q 027770 17 VGAIAGLALAIVFT 30 (219)
Q Consensus 17 ~~~~ag~a~a~v~~ 30 (219)
||.|+|++|+++..
T Consensus 4 i~lvvG~iiG~~~~ 17 (128)
T PF06295_consen 4 IGLVVGLIIGFLIG 17 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 34455555544443
No 42
>PHA00350 putative assembly protein
Probab=28.36 E-value=93 Score=30.13 Aligned_cols=24 Identities=21% Similarity=0.342 Sum_probs=13.9
Q ss_pred ceEEeeecCcCccceeeecccceee
Q 027770 180 DKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 180 HKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
++.+.|. ..-|.+-.|+=+|..|+
T Consensus 371 ~~~vtC~-~~~~~~~~~~~~~~~~~ 394 (399)
T PHA00350 371 SRVVTCI-PRSRFAQQKAQTPPDHV 394 (399)
T ss_pred eeEEEeC-cccccchhhccCCcccc
Confidence 6778887 33345555555555554
No 43
>PF13940 Ldr_toxin: Toxin Ldr, type I toxin-antitoxin system
Probab=28.13 E-value=42 Score=22.59 Aligned_cols=26 Identities=23% Similarity=0.520 Sum_probs=19.8
Q ss_pred ehhhhhcCCCchHHHHHHHHHHHHHHHH
Q 027770 5 LTLLSWYQDSRSVGAIAGLALAIVFTWR 32 (219)
Q Consensus 5 ~~~~~~~~~~~s~~~~ag~a~a~v~~w~ 32 (219)
+++.|||-=+- -.|||++.+++..|.
T Consensus 6 lg~~~WhDLAA--P~iagIi~s~iv~w~ 31 (35)
T PF13940_consen 6 LGIAFWHDLAA--PIIAGIIASLIVGWL 31 (35)
T ss_pred HHHHHHHHhHh--HHHHHHHHHHHHHHH
Confidence 46788886444 358999999999985
No 44
>cd04906 ACT_ThrD-I_1 First of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes the first of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=27.48 E-value=1e+02 Score=22.55 Aligned_cols=30 Identities=23% Similarity=0.257 Sum_probs=25.5
Q ss_pred hcCceEEEEEeec-hhhHHHHHHHHHHcCCC
Q 027770 143 KFCDLYLMERVLD-NESEKKVLLALETAGVF 172 (219)
Q Consensus 143 k~~DlYLm~~V~d-D~eE~~Vl~~Le~~GlF 172 (219)
..+++.+..++.+ .++-+.+++.|++.|..
T Consensus 38 ~~~~v~i~ie~~~~~~~~~~i~~~L~~~G~~ 68 (85)
T cd04906 38 KDAHIFVGVSVANGAEELAELLEDLKSAGYE 68 (85)
T ss_pred CeeEEEEEEEeCCcHHHHHHHHHHHHHCCCC
Confidence 4678888899998 88899999999997753
No 45
>cd08559 GDPD_periplasmic_GlpQ_like Periplasmic glycerophosphodiester phosphodiesterase domain (GlpQ) and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in bacterial and eukaryotic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) similar to Escherichia coli periplasmic phosphodiesterase GlpQ. GP-GDEs are involved in glycerol metabolism and catalyze the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. In E. coli, there are two major G3P uptake systems: Glp and Ugp, which contain genes coding for two different GP-GDEs. GlpQ gene from the glp operon codes for a periplasmic phosphodiesterase GlpQ. GlpQ is a dimeric enzyme that hydrolyzes periplasmic glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG), glycerophosphoinositol (GPI),
Probab=27.40 E-value=1.1e+02 Score=27.37 Aligned_cols=53 Identities=23% Similarity=0.197 Sum_probs=40.1
Q ss_pred ceEEEEEeechhh--------HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770 146 DLYLMERVLDNES--------EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202 (219)
Q Consensus 146 DlYLm~~V~dD~e--------E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l 202 (219)
.+.|..-++++.. ++.++++++++|+-+ ..++|++.|=.-.-..-+|++.|.+
T Consensus 128 ~~~l~IEiK~~~~~~~~~~~~~~~v~~~l~~~~~~~----~~~~v~i~SF~~~~L~~~r~~~p~~ 188 (296)
T cd08559 128 NVGIYPETKHPTFHKQEGPDIEEKLLEVLKKYGYTG----KNDPVFIQSFEPESLKRLRNETPDI 188 (296)
T ss_pred cceEEEEecChhhhhhcCCCHHHHHHHHHHHcCCCC----CCCCEEEecCCHHHHHHHHHhCCCC
Confidence 6777777876543 478999999876532 2479999998888888889988864
No 46
>PRK01294 lipase chaperone; Provisional
Probab=27.26 E-value=3.7e+02 Score=25.31 Aligned_cols=21 Identities=10% Similarity=0.023 Sum_probs=13.9
Q ss_pred HHhhhhhc-CCCCCchHHHHHH
Q 027770 79 TQIADALF-QPVKPTLGQVVRQ 99 (219)
Q Consensus 79 ~~~~~~~~-~~~~~t~~qiv~~ 99 (219)
++..|-|+ .-+..++.+|..+
T Consensus 89 Rd~FDYfLs~~gE~~l~~i~~~ 110 (336)
T PRK01294 89 RDFFDYFLSALGELDLAAIDAL 110 (336)
T ss_pred HHHHHHHhhccCCCCHHHHHHH
Confidence 45566666 4677788877754
No 47
>PRK11677 hypothetical protein; Provisional
Probab=26.80 E-value=43 Score=27.90 Aligned_cols=18 Identities=28% Similarity=0.401 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHhhcC
Q 027770 19 AIAGLALAIVFTWRMFRS 36 (219)
Q Consensus 19 ~~ag~a~a~v~~w~~lr~ 36 (219)
+++||.|++|.+|...|-
T Consensus 6 a~i~livG~iiG~~~~R~ 23 (134)
T PRK11677 6 ALIGLVVGIIIGAVAMRF 23 (134)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 456677777777777774
No 48
>cd08570 GDPD_YPL206cp_fungi Glycerophosphodiester phosphodiesterase domain of Saccharomyces cerevisiae YPL206cp and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Saccharomyces cerevisiae YPL206cp and uncharacterized hypothetical homologs existing in fungi. The product of S. cerevisiae ORF YPL206c (PGC1), YPL206cp (Pgc1p), displays homology to bacterial and mammalian glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46), which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. S. cerevisiae YPL206cp is an integral membrane protein with a single GDPD domain following by a short hydrophobic C-terminal tail that may function as a membrane anchor. This protein plays an essential role in the regulation of the cardiolipin (CL) biosynthetic pathway in yeast by removing the excess phosphatidylglycerol (PG) content of membranes via a phospholip
Probab=26.38 E-value=1.2e+02 Score=25.74 Aligned_cols=64 Identities=14% Similarity=0.144 Sum_probs=41.8
Q ss_pred HHHHHHHhhcC---ceEEEEEeechhh----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccc
Q 027770 135 LEALLEITKFC---DLYLMERVLDNES----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPD 201 (219)
Q Consensus 135 levLleLsk~~---DlYLm~~V~dD~e----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~ 201 (219)
++-++++.+.. ++++..-++++.. ++.|..++++++..+ + ..++|+|.|-...-..-+|++.|.
T Consensus 85 L~evl~~~~~~~~~~~~l~iEiK~~~~~~~~~~~v~~~i~~~~~~~--~-~~~~v~i~Sf~~~~l~~l~~~~p~ 155 (234)
T cd08570 85 LKDVLEWLVEHELPDVKLMLDIKRDNDPEILFKLIAEMLAVKPDLD--F-WRERIILGLWHLDFLKYGKEVLPG 155 (234)
T ss_pred HHHHHHHHHhcCCCCeEEEEEECCCCCHHHHHHHHHHHHHhcCCcc--c-ccCCEEEEeCCHHHHHHHHHhCCC
Confidence 34444555544 8899999997643 356777777755332 1 347899988666666666777665
No 49
>PF08370 PDR_assoc: Plant PDR ABC transporter associated; InterPro: IPR013581 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This domain is found on the C terminus of ABC-2 type transporter domains (IPR013525 from INTERPRO). It seems to be associated with the plant pleiotropic drug resistance (PDR) protein family of ABC transporters. Like in yeast, plant PDR ABC transporters may also play a role in the transport of antifungal agents [] (see also IPR010929 from INTERPRO). The PDR family is characterised by a configuration in which the ABC domain is nearer the N terminus of the protein than the transmembrane domain [].
Probab=26.15 E-value=96 Score=22.99 Aligned_cols=26 Identities=19% Similarity=0.385 Sum_probs=17.9
Q ss_pred chHHHHHHHHH----HHHHHHHhhcCCCCC
Q 027770 15 RSVGAIAGLAL----AIVFTWRMFRSPGAP 40 (219)
Q Consensus 15 ~s~~~~ag~a~----a~v~~w~~lr~~~~~ 40 (219)
.++||++|+++ ...++..+|..++.+
T Consensus 31 IgvgaL~G~~vlFNil~~laL~yL~p~~k~ 60 (65)
T PF08370_consen 31 IGVGALLGFIVLFNILFTLALTYLNPLGKS 60 (65)
T ss_pred ehHHHHHHHHHHHHHHHHHHHHhcCCcCCC
Confidence 36899999886 345566777766543
No 50
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=25.98 E-value=44 Score=26.99 Aligned_cols=22 Identities=23% Similarity=0.401 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHhhcCCCCC
Q 027770 19 AIAGLALAIVFTWRMFRSPGAP 40 (219)
Q Consensus 19 ~~ag~a~a~v~~w~~lr~~~~~ 40 (219)
+++||.|+++.+|...|.-.+.
T Consensus 2 ~~i~lvvG~iiG~~~~r~~~~~ 23 (128)
T PF06295_consen 2 AIIGLVVGLIIGFLIGRLTSSN 23 (128)
T ss_pred hHHHHHHHHHHHHHHHHHhccc
Confidence 6899999999999999955433
No 51
>COG4980 GvpP Gas vesicle protein [General function prediction only]
Probab=25.86 E-value=47 Score=27.29 Aligned_cols=14 Identities=43% Similarity=0.824 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHH
Q 027770 17 VGAIAGLALAIVFT 30 (219)
Q Consensus 17 ~~~~ag~a~a~v~~ 30 (219)
+|+|+|.+.|+.|+
T Consensus 13 iGgiiGa~aaLL~A 26 (115)
T COG4980 13 IGGIIGAAAALLFA 26 (115)
T ss_pred HHHHHHHHHHHHhC
Confidence 68888888887774
No 52
>cd06660 Aldo_ket_red Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
Probab=25.72 E-value=84 Score=26.76 Aligned_cols=86 Identities=20% Similarity=0.212 Sum_probs=51.7
Q ss_pred CCCchHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHH-hhcCceEEEEEeechhh-HHHHHHHH
Q 027770 89 VKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEI-TKFCDLYLMERVLDNES-EKKVLLAL 166 (219)
Q Consensus 89 ~~~t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleL-sk~~DlYLm~~V~dD~e-E~~Vl~~L 166 (219)
....++++.++.- .-.++.|+++.-...... .....-.++.++.+.|-+| ...-|+|++-....+.. ...+.++|
T Consensus 57 sE~~lG~al~~~~-~R~~~~i~tK~~~~~~~~--~~~~~~~~~~~l~~sL~~L~~~~iDl~~lh~~~~~~~~~~~~~~~l 133 (285)
T cd06660 57 SEELLGEALKERG-PREEVFIATKVGPRPGDG--RDLSPEHIRRAVEESLKRLGTDYIDLYLLHWPDPDTPDIEETLRAL 133 (285)
T ss_pred CHHHHHHHHhccC-CcCcEEEEeeecCCCCCC--CCCCHHHHHHHHHHHHHHhCCCceeEEEecCCCCCCCCHHHHHHHH
Confidence 3455666665532 234788888854322110 1112334677888888888 56789999998876654 45555555
Q ss_pred H---HcCCCCCCCC
Q 027770 167 E---TAGVFTSGGL 177 (219)
Q Consensus 167 e---~~GlF~~gGL 177 (219)
+ +.|.+..-|+
T Consensus 134 ~~l~~~G~ir~iGv 147 (285)
T cd06660 134 EELVKEGKIRAIGV 147 (285)
T ss_pred HHHHHcCCccEEEe
Confidence 5 6666553344
No 53
>PF00998 RdRP_3: Viral RNA dependent RNA polymerase; InterPro: IPR002166 The RNA dependent RNA polymerase is also known as non-structural protein NS5B. NS5B is a 65 kDa protein that resembles other viral RNA polymerases. Hepatitis C virus (HCV) replication is thought to occur in membrane bound replication complexes. These complexes transcribe the positive strand and the resulting minus strand is used as a template for the synthesis of genomic RNA. There are two viral proteins involved in the reaction, NS3 and NS5B [, , ].; GO: 0003723 RNA binding, 0003968 RNA-directed RNA polymerase activity, 0005524 ATP binding, 0006351 transcription, DNA-dependent, 0019079 viral genome replication; PDB: 2XYM_A 3I5K_D 4AEX_A 4E7A_A 2XXD_A 4E78_A 4AEP_A 4E76_A 1S4F_D 1S48_A ....
Probab=25.63 E-value=75 Score=31.53 Aligned_cols=123 Identities=22% Similarity=0.213 Sum_probs=64.2
Q ss_pred CCchhHH--HHhhhhhcCCCCCchHHHHHHHhcCceeEEEeecceeecc-------CChhhhhccceeehhHHHHHHHHh
Q 027770 72 SSSEDLR--TQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEE-------SCPEELQKQVTVKSSVLEALLEIT 142 (219)
Q Consensus 72 ~~~~~~~--~~~~~~~~~~~~~t~~qiv~~kls~~RkvTis~~gvvlee-------~sp~el~~~a~v~esalevLleLs 142 (219)
-+.+|+| +.+....|.+.. .+.++...||--+-.|+.+=--+.+.. .-+..+-|..+.--.+......+-
T Consensus 216 Vs~~~l~~E~~iY~~~~~~~~-~~~~~L~~~l~~~G~~~~~~G~~~y~~~gcR~SGd~~TS~GN~l~~~~~~~a~~~~~g 294 (486)
T PF00998_consen 216 VSEEDLRFEHSIYQCCFLDPE-ELIKLLTERLYNGGPMFNSDGQVRYGYRGCRMSGDMNTSLGNCLTCYLKAYAACRAAG 294 (486)
T ss_dssp --HHHHHHHHHHHTTSBEGHH-HHHHHHHHHTTTEEEEEETTSE-EEEEESS--TTSTTHHHHHHHHHHHHHHHHHHHHT
T ss_pred CchhhhhhhheehhcccCCch-hhhccchhceeeeEEEEecCCCeeeccCCcccCCCCCcccccHHHHHHHHHHHHhhcC
Confidence 4567888 666666664333 888999988865544443321111110 001122222222223334444433
Q ss_pred hcC-------ceEEEEEeec-hhhHHHHHHHHHHcCCCCCCCCcc------ceEEeeecCc----Cccceeeec
Q 027770 143 KFC-------DLYLMERVLD-NESEKKVLLALETAGVFTSGGLVK------DKVLFCSTEI----GRTSFVRQL 198 (219)
Q Consensus 143 k~~-------DlYLm~~V~d-D~eE~~Vl~~Le~~GlF~~gGL~r------HKVLFCST~~----Gr~sfVRQL 198 (219)
+.| |.+|||--.+ |++++....++.++| |. |..+ ++|=|||+-= +.--|+|..
T Consensus 295 ~~~~l~~~GDD~vvi~E~~~~~~~~~~l~~~~~~~G-f~--~~~e~p~y~lE~iefCq~~pv~~~~~~~mvr~~ 365 (486)
T PF00998_consen 295 KDCSLLNCGDDCVVICESAGVDEDEAALTEAFTRYG-FP--MKVEKPVYELELIEFCQSNPVFDGDGWGMVRYP 365 (486)
T ss_dssp EEEEEEEETTEEEEEEEHCHHHHHHHHHHHHHHHTT--B--SSSCEEBSSGGG-EBTTBEEEEEEETTS-EEEE
T ss_pred CCccEeecCCceEEEecccchHHHHHHHHHHHHhcC-CC--CCCCCcccccccccccCceeeeccCCCcccccc
Confidence 333 7888887766 455567888899999 87 3433 6788888742 233466653
No 54
>PF00248 Aldo_ket_red: Aldo/keto reductase family; InterPro: IPR023210 The aldo-keto reductase family includes a number of related monomeric NADPH-dependent oxidoreductases, such as aldehyde reductase, aldose reductase, prostaglandin F synthase, xylose reductase, rho crystallin, and many others []. All possess a similar structure, with a beta-alpha-beta fold characteristic of nucleotide binding proteins []. The fold comprises a parallel beta-8/alpha-8-barrel, which contains a novel NADP-binding motif. The binding site is located in a large, deep, elliptical pocket in the C-terminal end of the beta sheet, the substrate being bound in an extended conformation. The hydrophobic nature of the pocket favours aromatic and apolar substrates over highly polar ones []. Binding of the NADPH coenzyme causes a massive conformational change, reorienting a loop, effectively locking the coenzyme in place. This binding is more similar to FAD- than to NAD(P)-binding oxidoreductases []. Some proteins of this entry contain a K+ ion channel beta chain regulatory domain; these are reported to have oxidoreductase activity []. This entry represents the NADP-dependent oxidoreductase domain found in these proteins.; PDB: 1C9W_A 4F40_B 1VBJ_A 1XGD_A 1X97_A 2ACS_A 1EF3_A 2ACU_A 1PWM_A 2NVD_A ....
Probab=25.48 E-value=54 Score=27.92 Aligned_cols=84 Identities=20% Similarity=0.238 Sum_probs=53.9
Q ss_pred chHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHh-hcCceEEEEEeechhh-HHHHHHHHH--
Q 027770 92 TLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEIT-KFCDLYLMERVLDNES-EKKVLLALE-- 167 (219)
Q Consensus 92 t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLs-k~~DlYLm~~V~dD~e-E~~Vl~~Le-- 167 (219)
.+++..++.-....++.|++.. .....+...-....++++..+.|-+|- ..-|+|++-....+.+ -+.+...|+
T Consensus 48 ~lg~~l~~~~~~r~~~~i~tK~--~~~~~~~~~~~~~~i~~~~~~sL~~L~~d~iDl~~lH~~~~~~~~~~~~~~~l~~l 125 (283)
T PF00248_consen 48 ILGRALRKSRVPRDDIFISTKV--YGDGKPEPDYSPDSIRESLERSLERLGTDYIDLLLLHWPDPSEDALEEVWEALEEL 125 (283)
T ss_dssp HHHHHHHHTSSTGGGSEEEEEE--ESSSSTGGGSSHHHHHHHHHHHHHHHTSSSEEEEEESSSSTTSSHHHHHHHHHHHH
T ss_pred cccccccccccccccccccccc--cccccccccccccccccccccccccccccchhccccccccccccccchhhhhhhhc
Confidence 4677776633445577788875 222233333344567788888888887 4569999988887777 566666555
Q ss_pred -HcCCCCCCCC
Q 027770 168 -TAGVFTSGGL 177 (219)
Q Consensus 168 -~~GlF~~gGL 177 (219)
+.|.+..-|+
T Consensus 126 ~~~G~ir~iGv 136 (283)
T PF00248_consen 126 KKEGKIRHIGV 136 (283)
T ss_dssp HHTTSEEEEEE
T ss_pred ccccccccccc
Confidence 6677664343
No 55
>COG3086 RseC Positive regulator of sigma E activity [Signal transduction mechanisms]
Probab=25.06 E-value=42 Score=28.89 Aligned_cols=35 Identities=26% Similarity=0.270 Sum_probs=28.6
Q ss_pred eeehhhhhcCCCchHHHHHHHHHHHHHHHHhhcCC
Q 027770 3 WVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSP 37 (219)
Q Consensus 3 ~~~~~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~ 37 (219)
|+...||.+....-+|+++|.+++.+++-++-|--
T Consensus 94 ~La~~L~~~e~~~~~~~~lg~~l~fl~~r~ysRkl 128 (150)
T COG3086 94 ILAQYLFFSELIVIFGAFLGLALGFLLARRYSRKL 128 (150)
T ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34456788999999999999999988888887743
No 56
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=24.98 E-value=63 Score=24.90 Aligned_cols=18 Identities=39% Similarity=0.536 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHhh
Q 027770 17 VGAIAGLALAIVFTWRMF 34 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~l 34 (219)
.+.++||||.+||+--++
T Consensus 48 st~i~GlaiGfvfA~vLv 65 (73)
T COG4218 48 STRIAGLAIGFVFAGVLV 65 (73)
T ss_pred hhhhHHHHHHHHHHHHHH
Confidence 467899999999987665
No 57
>TIGR02863 spore_sspJ small, acid-soluble spore protein, SspJ family. RL J Bacteriol. 1998 Dec;180(24):6704-12.
Probab=24.77 E-value=49 Score=23.42 Aligned_cols=18 Identities=28% Similarity=0.320 Sum_probs=15.1
Q ss_pred echhhHHHHHHHHHHcCC
Q 027770 154 LDNESEKKVLLALETAGV 171 (219)
Q Consensus 154 ~dD~eE~~Vl~~Le~~Gl 171 (219)
+++.+.+.|..+||++|-
T Consensus 12 ~seKd~n~v~gaLEdAg~ 29 (47)
T TIGR02863 12 KSEKDKNAIQGALEDAGQ 29 (47)
T ss_pred chhhhHHHHHHHHHHHHH
Confidence 467888999999998774
No 58
>PF09527 ATPase_gene1: Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=24.07 E-value=90 Score=21.26 Aligned_cols=19 Identities=21% Similarity=0.569 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHhhc
Q 027770 17 VGAIAGLALAIVFTWRMFR 35 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~lr 35 (219)
+|.+.|++.++.-.||++|
T Consensus 37 ~g~llG~~~g~~~~~~~~k 55 (55)
T PF09527_consen 37 IGLLLGIAAGFYNVYRLVK 55 (55)
T ss_pred HHHHHHHHHHHHHHHHHhC
Confidence 4555666666667777664
No 59
>PF09575 Spore_SspJ: Small spore protein J (Spore_SspJ); InterPro: IPR014220 This entry represents a group of small acid-soluble proteins (SASP) from Bacillus species, which are present in spores but not in growing cells. The sspJ gene is transcribed in the forespore compartment by RNA polymerase with the forespore-specific sigmaG. Loss of SspJ causes a slight decrease in the rate of spore outgrowth in an otherwise wild-type background [].
Probab=24.00 E-value=51 Score=23.23 Aligned_cols=18 Identities=28% Similarity=0.167 Sum_probs=15.3
Q ss_pred echhhHHHHHHHHHHcCC
Q 027770 154 LDNESEKKVLLALETAGV 171 (219)
Q Consensus 154 ~dD~eE~~Vl~~Le~~Gl 171 (219)
++|.+.++|..+||++|-
T Consensus 11 ~seK~~n~v~gaLedA~~ 28 (46)
T PF09575_consen 11 NSEKDKNVVQGALEDASK 28 (46)
T ss_pred cchhhHHHHHHHHHHHHH
Confidence 578899999999998763
No 60
>PRK14901 16S rRNA methyltransferase B; Provisional
Probab=23.76 E-value=70 Score=30.22 Aligned_cols=53 Identities=21% Similarity=0.287 Sum_probs=35.2
Q ss_pred hhhhhccceeehhHHHHHHHHhhcCc--eEEEEEeechhhHHHHHHHHHHcCCCC
Q 027770 121 PEELQKQVTVKSSVLEALLEITKFCD--LYLMERVLDNESEKKVLLALETAGVFT 173 (219)
Q Consensus 121 p~el~~~a~v~esalevLleLsk~~D--lYLm~~V~dD~eE~~Vl~~Le~~GlF~ 173 (219)
++++..-.......++...++.+--. ||--|.+..++.|++|...|++.+-|.
T Consensus 353 ~~~~~~l~~~Q~~iL~~a~~~lkpgG~lvystcsi~~~Ene~~v~~~l~~~~~~~ 407 (434)
T PRK14901 353 PEKIQELAPLQAELLESLAPLLKPGGTLVYATCTLHPAENEAQIEQFLARHPDWK 407 (434)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCEEEEEeCCCChhhHHHHHHHHHHhCCCcE
Confidence 44444444444555555555555544 566688999999999999999865554
No 61
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=23.74 E-value=61 Score=25.49 Aligned_cols=8 Identities=38% Similarity=1.344 Sum_probs=4.8
Q ss_pred Ceeeehhh
Q 027770 1 MFWVLTLL 8 (219)
Q Consensus 1 ~~~~~~~~ 8 (219)
|||+++.+
T Consensus 1 ~fWl~~a~ 8 (117)
T TIGR03142 1 LFWIVAAL 8 (117)
T ss_pred CHHHHHHH
Confidence 68865543
No 62
>TIGR01261 hisB_Nterm histidinol-phosphatase. This model describes histidinol phosphatase. All known examples in the scope of this model are bifunctional proteins with a histidinol phosphatase domain followed by an imidazoleglycerol-phosphate dehydratase domain. These enzymatic domains catalyze the ninth and seventh steps, respectively, of histidine biosynthesis.
Probab=23.34 E-value=2.7e+02 Score=22.79 Aligned_cols=69 Identities=20% Similarity=0.368 Sum_probs=42.9
Q ss_pred ecceeeccCC---hhhhhccceeehhHHHHHHHHhhc-CceEEEEEeec-----------hhhHHHHHHHHHHcCCCCCC
Q 027770 111 LLGVILEESC---PEELQKQVTVKSSVLEALLEITKF-CDLYLMERVLD-----------NESEKKVLLALETAGVFTSG 175 (219)
Q Consensus 111 ~~gvvlee~s---p~el~~~a~v~esalevLleLsk~-~DlYLm~~V~d-----------D~eE~~Vl~~Le~~GlF~~g 175 (219)
..|++.++.+ +.+.....++-+.+.+.|..|..- ..+++++--.. ...+..+..+|+..|+-
T Consensus 8 ~dg~l~~~~~~~~~~~~~~~~~~~pgv~e~L~~L~~~g~~l~IvSN~~g~~~~~~~~~~~~~~~~~~~~~l~~~gl~--- 84 (161)
T TIGR01261 8 RDGTLIEEPPSDFQVDALEKLRFEKGVIPALLKLKKAGYKFVMVTNQDGLGTPSFPQADFDGPHNLMLQIFRSQGII--- 84 (161)
T ss_pred CCCCccccCCCccccCCHHHeeECCCHHHHHHHHHHCCCeEEEEeCCccccCCcCCHHHHHHHHHHHHHHHHHCCCc---
Confidence 4566655433 233445678889999999999764 56665543210 02456788888886652
Q ss_pred CCccceEEee
Q 027770 176 GLVKDKVLFC 185 (219)
Q Consensus 176 GL~rHKVLFC 185 (219)
=+.++||
T Consensus 85 ---fd~ii~~ 91 (161)
T TIGR01261 85 ---FDDVLIC 91 (161)
T ss_pred ---eeEEEEC
Confidence 2567777
No 63
>PRK07819 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=23.27 E-value=2.8e+02 Score=24.77 Aligned_cols=70 Identities=13% Similarity=0.110 Sum_probs=43.3
Q ss_pred hhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcC--------------c---------cceeeec
Q 027770 142 TKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG--------------R---------TSFVRQL 198 (219)
Q Consensus 142 sk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~G--------------r---------~sfVRQL 198 (219)
.+.||+.+-|-.+|.+-|+.+...|++. ..+...+++|+.-+ | ++..+-+
T Consensus 82 ~~~~d~ViEav~E~~~~K~~l~~~l~~~-------~~~~~~il~snTS~~~~~~la~~~~~~~r~~g~hf~~P~~~~~lv 154 (286)
T PRK07819 82 FADRQLVIEAVVEDEAVKTEIFAELDKV-------VTDPDAVLASNTSSIPIMKLAAATKRPGRVLGLHFFNPVPVLPLV 154 (286)
T ss_pred hCCCCEEEEecccCHHHHHHHHHHHHHh-------hCCCCcEEEECCCCCCHHHHHhhcCCCccEEEEecCCCcccCceE
Confidence 4779998877667777778888877762 21223444444333 2 2223334
Q ss_pred ccceeecCCHHHHHhhhhcc
Q 027770 199 EPDWHIDTNPEIVSQLAVHL 218 (219)
Q Consensus 199 eP~lHIDtd~~vv~~L~rfv 218 (219)
|=..|-.|+++++..+.+|+
T Consensus 155 Elv~~~~T~~~~~~~~~~~~ 174 (286)
T PRK07819 155 ELVPTLVTSEATVARAEEFA 174 (286)
T ss_pred EEeCCCCCCHHHHHHHHHHH
Confidence 44457789999998888773
No 64
>COG1167 ARO8 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]
Probab=23.13 E-value=92 Score=29.70 Aligned_cols=73 Identities=25% Similarity=0.428 Sum_probs=39.5
Q ss_pred ceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCC--CCCCcc-ceEEeeecCcCccceeeecccceee
Q 027770 128 VTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFT--SGGLVK-DKVLFCSTEIGRTSFVRQLEPDWHI 204 (219)
Q Consensus 128 a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~--~gGL~r-HKVLFCST~~Gr~sfVRQLeP~lHI 204 (219)
.++-.+-++-|++||.-.|+|+|+ ||-.-+ |.-.+--. ..++.+ +||+||+| |.+=|=|-+=|
T Consensus 242 ~tms~~rR~~Ll~lA~~~~~~IIE---DD~y~e-----l~~~~~p~~~l~~ld~~~rViy~gS------FSK~l~PglRl 307 (459)
T COG1167 242 VTMSLERRKALLALAEKYDVLIIE---DDYYGE-----LRYDGPPPPPLKALDAPGRVIYLGS------FSKTLAPGLRL 307 (459)
T ss_pred CccCHHHHHHHHHHHHHcCCeEEe---eCcchh-----hhcCCCCCCChHhhCCCCCEEEEee------ehhhcccccce
Confidence 344456677788888888888754 443322 11111111 013333 89999986 66666666544
Q ss_pred c---CCHHHHHhh
Q 027770 205 D---TNPEIVSQL 214 (219)
Q Consensus 205 D---td~~vv~~L 214 (219)
- .++++++.+
T Consensus 308 G~vv~p~~~~~~~ 320 (459)
T COG1167 308 GYVVAPPELIEKL 320 (459)
T ss_pred eeeeCCHHHHHHH
Confidence 3 444444433
No 65
>PF14155 DUF4307: Domain of unknown function (DUF4307)
Probab=22.90 E-value=82 Score=24.98 Aligned_cols=26 Identities=15% Similarity=0.221 Sum_probs=19.1
Q ss_pred CCCchHHHHHHHHHHHHHHHHhhcCC
Q 027770 12 QDSRSVGAIAGLALAIVFTWRMFRSP 37 (219)
Q Consensus 12 ~~~~s~~~~ag~a~a~v~~w~~lr~~ 37 (219)
+...-++++++++.+.+++|..+...
T Consensus 4 r~~~~~~~v~~vv~~~~~~w~~~~~~ 29 (112)
T PF14155_consen 4 RKLVIAGAVLVVVAGAVVAWFGYSQF 29 (112)
T ss_pred ceeEehHHHHHHHHHHHHhHhhhhhc
Confidence 44556778888888888888888743
No 66
>PRK06049 rpl30p 50S ribosomal protein L30P; Reviewed
Probab=22.83 E-value=1.3e+02 Score=25.48 Aligned_cols=64 Identities=6% Similarity=0.169 Sum_probs=51.4
Q ss_pred eEEEEEeechhhH-HHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec-CCHHHHHhhh
Q 027770 147 LYLMERVLDNESE-KKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID-TNPEIVSQLA 215 (219)
Q Consensus 147 lYLm~~V~dD~eE-~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID-td~~vv~~L~ 215 (219)
+|++.+|.+...- ..+...|+..||=.. |+..|-.-.....-|++..+|.+-+. -|.+.++.|-
T Consensus 2 ~~~ViriRg~ig~~~~~r~tL~~LgL~ki-----~~~V~v~~tp~~~GML~kV~~~V~~ge~~~~tv~~Li 67 (154)
T PRK06049 2 MYAVIRIRGRVNVPRDIADTLKMLRLHRV-----NHCVLVPETPSYKGMLQKVKDYVTWGEIDADTLAELL 67 (154)
T ss_pred cEEEEEecCCCCCCHHHHHHHHHcCCCcC-----CCEEEEeCCHHHHHHHHhhhceeEEeeCchHHHHHHH
Confidence 6888999887543 678888888666554 77788777788999999999999988 4888888873
No 67
>PF12263 DUF3611: Protein of unknown function (DUF3611); InterPro: IPR022051 This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 180 and 205 amino acids in length. There are two completely conserved residues (W and G) that may be functionally important.
Probab=22.80 E-value=1e+02 Score=26.89 Aligned_cols=21 Identities=19% Similarity=0.455 Sum_probs=16.2
Q ss_pred hHHHHHHHHHHHHHHHHhhcC
Q 027770 16 SVGAIAGLALAIVFTWRMFRS 36 (219)
Q Consensus 16 s~~~~ag~a~a~v~~w~~lr~ 36 (219)
.+++++-+.+.+.|.|+|.|-
T Consensus 62 a~~gl~~l~~si~~~fry~Rl 82 (183)
T PF12263_consen 62 AICGLVALFFSIFWSFRYTRL 82 (183)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 456677777888999999884
No 68
>PF10031 DUF2273: Small integral membrane protein (DUF2273); InterPro: IPR018730 Members of this family of hypothetical bacterial proteins have no known function.
Probab=22.76 E-value=99 Score=21.76 Aligned_cols=18 Identities=28% Similarity=0.624 Sum_probs=14.2
Q ss_pred CCchHHHHHHHHHHHHHH
Q 027770 13 DSRSVGAIAGLALAIVFT 30 (219)
Q Consensus 13 ~~~s~~~~ag~a~a~v~~ 30 (219)
.-+.+|+++|+.+|+.+-
T Consensus 8 ~~~iiG~~~G~ila~l~l 25 (51)
T PF10031_consen 8 RGKIIGGLIGLILALLIL 25 (51)
T ss_pred cchHHHHHHHHHHHHHHH
Confidence 346799999999988654
No 69
>PF12792 CSS-motif: CSS motif domain associated with EAL ; InterPro: IPR024744 This domain, with its characteristic highly conserved CSS sequence motif, is found N-terminal to the EAL domain (PF00563 from PFAM), found in many putative cyclic diguanylate phosphodiesterases.
Probab=21.82 E-value=86 Score=24.91 Aligned_cols=34 Identities=18% Similarity=0.304 Sum_probs=22.2
Q ss_pred HHHHHHHHHHcCCCCCCCCccceEEeeecCcCcc
Q 027770 159 EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRT 192 (219)
Q Consensus 159 E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~ 192 (219)
-.......-..-.+..-|+.++.-++|||--|..
T Consensus 45 ~~~Lr~~~~~~~~ir~i~~~~~g~i~CSS~~G~~ 78 (208)
T PF12792_consen 45 LNALRQIVARSPYIRDIGLVRNGRIYCSSLWGDL 78 (208)
T ss_pred HHHHHHHHHhchhhhheeEEeCCeEEecCCCCCc
Confidence 3344444334444444477779999999999986
No 70
>TIGR02250 FCP1_euk FCP1-like phosphatase, phosphatase domain. This domain is related to domains found in the human NLI interacting factor-like phosphatases, and together both are detected by the Pfam model pfam03031.
Probab=21.68 E-value=4.1e+02 Score=21.83 Aligned_cols=79 Identities=10% Similarity=0.049 Sum_probs=44.5
Q ss_pred ccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeec-------
Q 027770 126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQL------- 198 (219)
Q Consensus 126 ~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQL------- 198 (219)
.....+|.+.+.|-++++.+++++.+ ...++....+++..+.-. .+-.++|++-+...| ..++-|
T Consensus 55 ~~v~~rPgv~efL~~l~~~yel~I~T----~~~~~yA~~vl~~ldp~~--~~F~~ri~~rd~~~~--~~~KdL~~i~~~d 126 (156)
T TIGR02250 55 YLTKLRPFLHEFLKEASKLYEMHVYT----MGTRAYAQAIAKLIDPDG--KYFGDRIISRDESGS--PHTKSLLRLFPAD 126 (156)
T ss_pred EEEEECCCHHHHHHHHHhhcEEEEEe----CCcHHHHHHHHHHhCcCC--CeeccEEEEeccCCC--CccccHHHHcCCC
Confidence 45678999999999999888888754 233334444444322111 122367776444333 223333
Q ss_pred -ccceeecCCHHHHH
Q 027770 199 -EPDWHIDTNPEIVS 212 (219)
Q Consensus 199 -eP~lHIDtd~~vv~ 212 (219)
.=.+=||.++++-.
T Consensus 127 ~~~vvivDd~~~~~~ 141 (156)
T TIGR02250 127 ESMVVIIDDREDVWP 141 (156)
T ss_pred cccEEEEeCCHHHhh
Confidence 33445888776643
No 71
>PRK08942 D,D-heptose 1,7-bisphosphate phosphatase; Validated
Probab=21.62 E-value=4.1e+02 Score=21.35 Aligned_cols=45 Identities=13% Similarity=0.056 Sum_probs=31.7
Q ss_pred eEEEeecceeeccC-ChhhhhccceeehhHHHHHHHHhhc-CceEEE
Q 027770 106 KVTCRLLGVILEES-CPEELQKQVTVKSSVLEALLEITKF-CDLYLM 150 (219)
Q Consensus 106 kvTis~~gvvlee~-sp~el~~~a~v~esalevLleLsk~-~DlYLm 150 (219)
-+-+-+.|+|.++. ...+..++....+.+.+.|.+|.+. ..++++
T Consensus 5 ~~~~d~~~t~~~~~~~~~~~~~~~~~~pgv~e~L~~Lk~~g~~l~I~ 51 (181)
T PRK08942 5 AIFLDRDGVINVDSDGYVKSPDEWIPIPGSIEAIARLKQAGYRVVVA 51 (181)
T ss_pred EEEEECCCCcccCCccccCCHHHeEECCCHHHHHHHHHHCCCEEEEE
Confidence 34566778877776 3444456888999999999999875 344443
No 72
>cd01657 Ribosomal_L7_archeal_euk Ribosomal protein L7, which is found in archaea and eukaryotes but not in prokaryotes, binds domain II of the 23S rRNA as well as the 5S rRNA and is one of five ribosomal proteins that mediate the interactions 5S rRNA makes with the ribosome. The eukaryotic L7 members have an N-terminal extension not found in the archeal L7 orthologs. L7 is closely related to the ribosomal L30 protein found in eukaryotes and prokaryotes.
Probab=21.37 E-value=1.3e+02 Score=25.48 Aligned_cols=63 Identities=17% Similarity=0.236 Sum_probs=51.6
Q ss_pred eEEEEEeechhhH-HHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec-CCHHHHHhh
Q 027770 147 LYLMERVLDNESE-KKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID-TNPEIVSQL 214 (219)
Q Consensus 147 lYLm~~V~dD~eE-~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID-td~~vv~~L 214 (219)
+|++.++.+..+. ..+...|+..||=.. |...|..-.....-|++.++|.+-+. -+.+.++.|
T Consensus 1 l~~Virirg~~g~~~~~r~tL~~LgL~k~-----~~~v~~~~t~~~~gmL~kV~~~V~~ge~~~~tv~~L 65 (159)
T cd01657 1 LYAVVRIRGIVGVPPKIRKTLQLLRLRRI-----NNAVFVKLTKATIGMLKKVEPYVTWGEPNLETLREL 65 (159)
T ss_pred CEEEEEecCcCCCCHHHHHHHHHcCCCcC-----CCEEEEeCCHHHHHHHHHhhhheEeccCcHHHHHHH
Confidence 5778888887633 678888888776665 88888888899999999999999998 488888877
No 73
>PLN02545 3-hydroxybutyryl-CoA dehydrogenase
Probab=21.21 E-value=2.2e+02 Score=24.97 Aligned_cols=29 Identities=21% Similarity=0.244 Sum_probs=22.3
Q ss_pred HHhhcCceEEEEEeechhhHHHHHHHHHH
Q 027770 140 EITKFCDLYLMERVLDNESEKKVLLALET 168 (219)
Q Consensus 140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~ 168 (219)
+..+.||+.+.|-.++.+.++.+..-+++
T Consensus 79 ~~~~~aD~Vieav~e~~~~k~~v~~~l~~ 107 (295)
T PLN02545 79 EELRDADFIIEAIVESEDLKKKLFSELDR 107 (295)
T ss_pred HHhCCCCEEEEcCccCHHHHHHHHHHHHh
Confidence 34578999998877777778888776665
No 74
>PRK01844 hypothetical protein; Provisional
Probab=21.20 E-value=1.1e+02 Score=23.42 Aligned_cols=19 Identities=16% Similarity=0.340 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHhhc
Q 027770 17 VGAIAGLALAIVFTWRMFR 35 (219)
Q Consensus 17 ~~~~ag~a~a~v~~w~~lr 35 (219)
+|++.||-+|--+-=++|+
T Consensus 16 ~G~~~Gff~ark~~~k~lk 34 (72)
T PRK01844 16 AGVALGFFIARKYMMNYLQ 34 (72)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4555555555555556666
No 75
>PF05729 NACHT: NACHT domain
Probab=21.18 E-value=1.3e+02 Score=22.60 Aligned_cols=41 Identities=15% Similarity=0.150 Sum_probs=30.3
Q ss_pred CccceEEeeecCcCccceeeecccc--eeec--CCHHHHHhhhhc
Q 027770 177 LVKDKVLFCSTEIGRTSFVRQLEPD--WHID--TNPEIVSQLAVH 217 (219)
Q Consensus 177 L~rHKVLFCST~~Gr~sfVRQLeP~--lHID--td~~vv~~L~rf 217 (219)
....|++.++++.+...+.+.++.. ++|+ +++++-+++..|
T Consensus 118 ~~~~~liit~r~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 162 (166)
T PF05729_consen 118 PPGVKLIITSRPRAFPDLRRRLKQAQILELEPFSEEDIKQYLRKY 162 (166)
T ss_pred CCCCeEEEEEcCChHHHHHHhcCCCcEEEECCCCHHHHHHHHHHH
Confidence 4458898888888888888888887 7777 556666666554
No 76
>KOG2181 consensus LIM domain binding protein LDB1/NLI/CLIM [Transcription]
Probab=21.04 E-value=2.3e+02 Score=27.73 Aligned_cols=31 Identities=16% Similarity=0.312 Sum_probs=18.0
Q ss_pred hhhhcCCCchHHHHHHHHHHHHHHHHhhcCCC
Q 027770 7 LLSWYQDSRSVGAIAGLALAIVFTWRMFRSPG 38 (219)
Q Consensus 7 ~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~~ 38 (219)
|.++|| +-+|.-==-+--.++=-|.-+-.|+
T Consensus 239 lMSrhK-ayalsPRdclKttLFQkwQrMvaPp 269 (415)
T KOG2181|consen 239 LMSRHK-AYALSPRDCLKTTLFQKWQRMVAPP 269 (415)
T ss_pred HHHhcc-ccCCCHHHHHHHHHHHHhhhhhcCC
Confidence 456676 5555554445555666776665553
No 77
>smart00550 Zalpha Z-DNA-binding domain in adenosine deaminases. Helix-turn-helix-containing domain. Also known as Zab.
Probab=20.64 E-value=37 Score=24.28 Aligned_cols=53 Identities=28% Similarity=0.425 Sum_probs=38.2
Q ss_pred echhhHHHHHHHHHHcCC--CC------CCCCcc---ceEEeeecCcCccceeeecccceeecC
Q 027770 154 LDNESEKKVLLALETAGV--FT------SGGLVK---DKVLFCSTEIGRTSFVRQLEPDWHIDT 206 (219)
Q Consensus 154 ~dD~eE~~Vl~~Le~~Gl--F~------~gGL~r---HKVLFCST~~Gr~sfVRQLeP~lHIDt 206 (219)
..|.-|++|+.+|++.|- ++ .-|+.+ ||+|+-=-..|.+.+..+--|.|+|..
T Consensus 3 ~~~~~~~~IL~~L~~~g~~~~ta~eLa~~lgl~~~~v~r~L~~L~~~G~V~~~~~~~~~W~i~~ 66 (68)
T smart00550 3 TQDSLEEKILEFLENSGDETSTALQLAKNLGLPKKEVNRVLYSLEKKGKVCKQGGTPPLWKLTD 66 (68)
T ss_pred CchHHHHHHHHHHHHCCCCCcCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEecCCCCCceEeec
Confidence 457778999999998766 54 226666 788887777888777666348888864
No 78
>PF09587 PGA_cap: Bacterial capsule synthesis protein PGA_cap; InterPro: IPR019079 CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein [].
Probab=20.50 E-value=4.3e+02 Score=22.71 Aligned_cols=97 Identities=20% Similarity=0.250 Sum_probs=58.2
Q ss_pred HHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEE--EeechhhH--HHHHHHHHHcCCC
Q 027770 97 VRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLME--RVLDNESE--KKVLLALETAGVF 172 (219)
Q Consensus 97 v~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~--~V~dD~eE--~~Vl~~Le~~GlF 172 (219)
++.-|.+.--+.+.+|+.+..+..+..-...+...++.++.|..+- .|+.=++ |+.|--.+ ..-++.|+++|+-
T Consensus 30 v~~~l~~aD~~~~NlE~~v~~~~~~~~~~~~f~~~~~~~~~L~~~G--~d~vslANNH~~D~G~~gl~~Tl~~L~~~gi~ 107 (250)
T PF09587_consen 30 VKPLLQSADLVVANLETPVTDSGQPASGYPHFNAPPEILDALKDAG--FDVVSLANNHIFDYGEEGLLDTLEALDKAGIP 107 (250)
T ss_pred HHHHHhhCCEEEEEeeecCcCCCCcCCCcceecCCHHHHHHHHHcC--CCEEEecCCCCccccHHHHHHHHHHHHHCCCc
Confidence 4456667778899999999776554442234444556666654321 4655555 66664444 5567788899988
Q ss_pred CCC-CCcc---ceEEeeecCcCcccee
Q 027770 173 TSG-GLVK---DKVLFCSTEIGRTSFV 195 (219)
Q Consensus 173 ~~g-GL~r---HKVLFCST~~Gr~sfV 195 (219)
..| |.+. .+..+-+....|..|+
T Consensus 108 ~~Gag~~~~~a~~p~i~~~~g~kia~l 134 (250)
T PF09587_consen 108 YVGAGRNLEEARRPAIIEVNGVKIAFL 134 (250)
T ss_pred EeECcCChHHhcCeEEEEECCEEEEEE
Confidence 754 4432 4555555544444443
No 79
>KOG0121 consensus Nuclear cap-binding protein complex, subunit CBP20 (RRM superfamily) [RNA processing and modification]
Probab=20.30 E-value=1.3e+02 Score=26.00 Aligned_cols=55 Identities=20% Similarity=0.280 Sum_probs=44.6
Q ss_pred eehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCC--CCccceEEee
Q 027770 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSG--GLVKDKVLFC 185 (219)
Q Consensus 130 v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~g--GL~rHKVLFC 185 (219)
++-+..|-|-.+-+.|.||+ .-+.--..|+||-.+.-..|-...- ||++.|--+|
T Consensus 22 f~gt~~e~~~a~r~S~tvyV-gNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpC 78 (153)
T KOG0121|consen 22 FRGTDEEQLEALRKSCTVYV-GNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPC 78 (153)
T ss_pred hcCchHHHHHHHhhcceEEE-eeeeeeecHHHHHHHHHhccchheeEeccccCCcCcc
Confidence 44466677777888999998 7788888999999999999988843 9999886655
No 80
>PF13807 GNVR: G-rich domain on putative tyrosine kinase
Probab=20.04 E-value=97 Score=22.62 Aligned_cols=17 Identities=29% Similarity=0.532 Sum_probs=12.4
Q ss_pred chHHHHHHHHHHHHHHH
Q 027770 15 RSVGAIAGLALAIVFTW 31 (219)
Q Consensus 15 ~s~~~~ag~a~a~v~~w 31 (219)
..+|+++|+++++.+++
T Consensus 62 l~l~~~~Gl~lgi~~~~ 78 (82)
T PF13807_consen 62 LALGLFLGLILGIGLAF 78 (82)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 35788888888777664
No 81
>PRK11677 hypothetical protein; Provisional
Probab=20.02 E-value=79 Score=26.38 Aligned_cols=28 Identities=25% Similarity=0.459 Sum_probs=17.5
Q ss_pred CeeeehhhhhcCCCchHHHHHHHHHHHHHHHHhhcCCC
Q 027770 1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPG 38 (219)
Q Consensus 1 ~~~~~~~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~~ 38 (219)
|-|+.++ +|-|+|++|+.+.. |++.+..
T Consensus 1 M~W~~a~---------i~livG~iiG~~~~-R~~~~~~ 28 (134)
T PRK11677 1 MTWEYAL---------IGLVVGIIIGAVAM-RFGNRKL 28 (134)
T ss_pred CcHHHHH---------HHHHHHHHHHHHHH-hhccchh
Confidence 6677665 67777777755444 4555543
Done!