Query         027770
Match_columns 219
No_of_seqs    55 out of 57
Neff          3.4 
Searched_HMMs 46136
Date          Fri Mar 29 14:55:05 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/027770.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/027770hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF12827 Peroxin-22:  Peroxisom  92.7   0.021 4.6E-07   46.3  -1.1   40  179-218    68-111 (117)
  2 COG4502 5'(3')-deoxyribonucleo  88.0       2 4.3E-05   37.4   6.7   74  127-212    66-141 (180)
  3 cd08556 GDPD Glycerophosphodie  82.8     2.2 4.7E-05   33.8   4.3   64  132-204    49-116 (189)
  4 cd08562 GDPD_EcUgpQ_like Glyce  78.8       3 6.5E-05   34.7   4.0   63  136-204    90-157 (229)
  5 cd08582 GDPD_like_2 Glyceropho  71.9     6.4 0.00014   33.2   4.3   61  136-202    90-154 (233)
  6 COG0144 Sun tRNA and rRNA cyto  69.0     5.5 0.00012   37.0   3.6   97  109-208   239-345 (355)
  7 PF06941 NT5C:  5' nucleotidase  68.9       9  0.0002   31.5   4.4   79  126-215    70-152 (191)
  8 PF01644 Chitin_synth_1:  Chiti  59.5      22 0.00049   30.5   5.2   29  176-204   105-143 (163)
  9 cd08579 GDPD_memb_like Glycero  57.6      19 0.00041   30.2   4.4   59  139-204    89-152 (220)
 10 cd08563 GDPD_TtGDE_like Glycer  57.2      16 0.00035   30.8   4.0   57  140-203    96-157 (230)
 11 cd08567 GDPD_SpGDE_like Glycer  56.3      16 0.00034   31.1   3.8   66  131-204   109-188 (263)
 12 TIGR03778 VPDSG_CTERM VPDSG-CT  55.9      16 0.00034   23.1   2.8   21   12-32      2-22  (26)
 13 TIGR03510 XapX XapX domain. Th  55.6      15 0.00032   26.1   2.9   23   17-39      4-26  (49)
 14 PF04246 RseC_MucC:  Positive r  54.9      12 0.00027   29.6   2.8   28   13-40     97-124 (135)
 15 PF13419 HAD_2:  Haloacid dehal  50.6      69  0.0015   24.0   6.2   79  125-212    73-164 (176)
 16 cd08573 GDPD_GDE1 Glycerophosp  50.1      28 0.00062   30.5   4.5   62  137-204    93-156 (258)
 17 cd08564 GDPD_GsGDE_like Glycer  48.8      37  0.0008   29.5   5.0   61  132-201   108-171 (265)
 18 cd08601 GDPD_SaGlpQ_like Glyce  48.6      34 0.00073   29.4   4.7   44  158-202   127-170 (256)
 19 PF01102 Glycophorin_A:  Glycop  44.9      20 0.00044   29.5   2.6   19   17-35     71-89  (122)
 20 PRK11009 aphA acid phosphatase  43.0 1.5E+02  0.0032   26.4   7.9  110   90-205    48-191 (237)
 21 PF01171 ATP_bind_3:  PP-loop f  42.8      58  0.0013   26.6   5.0   67  130-205    80-148 (182)
 22 cd01028 TOPRIM_TopoIA TOPRIM_T  41.4 1.8E+02   0.004   23.2   7.7   79  105-186    34-129 (142)
 23 TIGR02245 HAD_IIID1 HAD-superf  40.9 2.4E+02  0.0053   24.5   9.1   76  103-186    20-95  (195)
 24 TIGR03190 benz_CoA_bzdN benzoy  40.5       7 0.00015   36.3  -0.8   70  136-208   276-353 (377)
 25 cd01427 HAD_like Haloacid deha  39.6   1E+02  0.0022   21.7   5.3   60  108-171     3-63  (139)
 26 cd08561 GDPD_cytoplasmic_ScUgp  39.3      49  0.0011   28.3   4.2   55  146-207   106-162 (249)
 27 COG4317 Uncharacterized protei  37.3      38 0.00083   27.0   3.0   20   19-38      8-27  (93)
 28 cd08568 GDPD_TmGDE_like Glycer  37.0      59  0.0013   27.5   4.3   50  146-202    92-141 (226)
 29 PRK07066 3-hydroxybutyryl-CoA   37.0      97  0.0021   28.7   6.0   71  140-218    79-172 (321)
 30 PF10389 CoatB:  Bacteriophage   37.0      26 0.00057   24.6   1.8   11   27-37     35-45  (46)
 31 cd08565 GDPD_pAtGDE_like Glyce  36.2      68  0.0015   27.6   4.7   59  136-202    85-150 (235)
 32 PF03672 UPF0154:  Uncharacteri  35.4      45 0.00098   24.9   2.9   19   17-35      9-27  (64)
 33 PF12732 YtxH:  YtxH-like prote  34.2      29 0.00064   25.1   1.8   14   17-30      7-20  (74)
 34 PF00891 Methyltransf_2:  O-met  32.7      35 0.00076   28.8   2.3   28  140-168   155-182 (241)
 35 TIGR03593 yidC_nterm membrane   32.2      99  0.0021   28.2   5.3   19   22-40      9-27  (366)
 36 PF13844 Glyco_transf_41:  Glyc  30.7 1.1E+02  0.0024   30.3   5.6   74  126-207   293-369 (468)
 37 PF01751 Toprim:  Toprim domain  30.1 2.3E+02   0.005   21.0   6.6   75   91-167     8-82  (100)
 38 cd04885 ACT_ThrD-I Tandem C-te  29.9      89  0.0019   21.7   3.6   30  142-171    35-64  (68)
 39 PRK09260 3-hydroxybutyryl-CoA   29.0 1.4E+02  0.0029   26.2   5.4   42  140-188    77-118 (288)
 40 cd08566 GDPD_AtGDE_like Glycer  28.7 1.1E+02  0.0024   26.4   4.7   48  147-202   101-148 (240)
 41 PF06295 DUF1043:  Protein of u  28.6      53  0.0011   26.5   2.6   14   17-30      4-17  (128)
 42 PHA00350 putative assembly pro  28.4      93   0.002   30.1   4.6   24  180-204   371-394 (399)
 43 PF13940 Ldr_toxin:  Toxin Ldr,  28.1      42 0.00091   22.6   1.6   26    5-32      6-31  (35)
 44 cd04906 ACT_ThrD-I_1 First of   27.5   1E+02  0.0022   22.5   3.7   30  143-172    38-68  (85)
 45 cd08559 GDPD_periplasmic_GlpQ_  27.4 1.1E+02  0.0023   27.4   4.5   53  146-202   128-188 (296)
 46 PRK01294 lipase chaperone; Pro  27.3 3.7E+02   0.008   25.3   8.2   21   79-99     89-110 (336)
 47 PRK11677 hypothetical protein;  26.8      43 0.00094   27.9   1.8   18   19-36      6-23  (134)
 48 cd08570 GDPD_YPL206cp_fungi Gl  26.4 1.2E+02  0.0026   25.7   4.5   64  135-201    85-155 (234)
 49 PF08370 PDR_assoc:  Plant PDR   26.1      96  0.0021   23.0   3.3   26   15-40     31-60  (65)
 50 PF06295 DUF1043:  Protein of u  26.0      44 0.00095   27.0   1.7   22   19-40      2-23  (128)
 51 COG4980 GvpP Gas vesicle prote  25.9      47   0.001   27.3   1.8   14   17-30     13-26  (115)
 52 cd06660 Aldo_ket_red Aldo-keto  25.7      84  0.0018   26.8   3.4   86   89-177    57-147 (285)
 53 PF00998 RdRP_3:  Viral RNA dep  25.6      75  0.0016   31.5   3.5  123   72-198   216-365 (486)
 54 PF00248 Aldo_ket_red:  Aldo/ke  25.5      54  0.0012   27.9   2.2   84   92-177    48-136 (283)
 55 COG3086 RseC Positive regulato  25.1      42 0.00091   28.9   1.5   35    3-37     94-128 (150)
 56 COG4218 MtrF Tetrahydromethano  25.0      63  0.0014   24.9   2.2   18   17-34     48-65  (73)
 57 TIGR02863 spore_sspJ small, ac  24.8      49  0.0011   23.4   1.5   18  154-171    12-29  (47)
 58 PF09527 ATPase_gene1:  Putativ  24.1      90  0.0019   21.3   2.7   19   17-35     37-55  (55)
 59 PF09575 Spore_SspJ:  Small spo  24.0      51  0.0011   23.2   1.5   18  154-171    11-28  (46)
 60 PRK14901 16S rRNA methyltransf  23.8      70  0.0015   30.2   2.8   53  121-173   353-407 (434)
 61 TIGR03142 cytochro_ccmI cytoch  23.7      61  0.0013   25.5   2.1    8    1-8       1-8   (117)
 62 TIGR01261 hisB_Nterm histidino  23.3 2.7E+02  0.0059   22.8   5.9   69  111-185     8-91  (161)
 63 PRK07819 3-hydroxybutyryl-CoA   23.3 2.8E+02   0.006   24.8   6.3   70  142-218    82-174 (286)
 64 COG1167 ARO8 Transcriptional r  23.1      92   0.002   29.7   3.5   73  128-214   242-320 (459)
 65 PF14155 DUF4307:  Domain of un  22.9      82  0.0018   25.0   2.6   26   12-37      4-29  (112)
 66 PRK06049 rpl30p 50S ribosomal   22.8 1.3E+02  0.0029   25.5   4.0   64  147-215     2-67  (154)
 67 PF12263 DUF3611:  Protein of u  22.8   1E+02  0.0022   26.9   3.4   21   16-36     62-82  (183)
 68 PF10031 DUF2273:  Small integr  22.8      99  0.0022   21.8   2.8   18   13-30      8-25  (51)
 69 PF12792 CSS-motif:  CSS motif   21.8      86  0.0019   24.9   2.6   34  159-192    45-78  (208)
 70 TIGR02250 FCP1_euk FCP1-like p  21.7 4.1E+02  0.0088   21.8   6.6   79  126-212    55-141 (156)
 71 PRK08942 D,D-heptose 1,7-bisph  21.6 4.1E+02   0.009   21.3   6.6   45  106-150     5-51  (181)
 72 cd01657 Ribosomal_L7_archeal_e  21.4 1.3E+02  0.0027   25.5   3.6   63  147-214     1-65  (159)
 73 PLN02545 3-hydroxybutyryl-CoA   21.2 2.2E+02  0.0048   25.0   5.2   29  140-168    79-107 (295)
 74 PRK01844 hypothetical protein;  21.2 1.1E+02  0.0024   23.4   2.9   19   17-35     16-34  (72)
 75 PF05729 NACHT:  NACHT domain    21.2 1.3E+02  0.0029   22.6   3.4   41  177-217   118-162 (166)
 76 KOG2181 LIM domain binding pro  21.0 2.3E+02  0.0049   27.7   5.6   31    7-38    239-269 (415)
 77 smart00550 Zalpha Z-DNA-bindin  20.6      37 0.00081   24.3   0.3   53  154-206     3-66  (68)
 78 PF09587 PGA_cap:  Bacterial ca  20.5 4.3E+02  0.0094   22.7   6.8   97   97-195    30-134 (250)
 79 KOG0121 Nuclear cap-binding pr  20.3 1.3E+02  0.0028   26.0   3.4   55  130-185    22-78  (153)
 80 PF13807 GNVR:  G-rich domain o  20.0      97  0.0021   22.6   2.4   17   15-31     62-78  (82)
 81 PRK11677 hypothetical protein;  20.0      79  0.0017   26.4   2.1   28    1-38      1-28  (134)

No 1  
>PF12827 Peroxin-22:  Peroxisomal biogenesis protein family;  InterPro: IPR024359 Peroxin-22 is an integral peroxisomal membrane protein. The N terminus of peroxin-22 is located in the matrix, while the C terminus is located in the cytosol. Peroxin-22 interacts with the ubiquitin-conjugating enzyme Pex4p, anchoring it at the peroxisomal membrane. Both proteins may act at the same step in peroxisome biogenesis [].; PDB: 2Y9O_B 2Y9M_B 2Y9P_B.
Probab=92.71  E-value=0.021  Score=46.28  Aligned_cols=40  Identities=25%  Similarity=0.377  Sum_probs=28.3

Q ss_pred             cceEEeeecCcCccceeeecccceeecCCHH----HHHhhhhcc
Q 027770          179 KDKVLFCSTEIGRTSFVRQLEPDWHIDTNPE----IVSQLAVHL  218 (219)
Q Consensus       179 rHKVLFCST~~Gr~sfVRQLeP~lHIDtd~~----vv~~L~rfv  218 (219)
                      .|||+-|+|.+|+-|-||||.|+.-+=...+    +=++|.||+
T Consensus        68 ~~KiI~Cdt~~G~~~~vK~Lk~d~llv~~ddl~~~ip~Di~rfv  111 (117)
T PF12827_consen   68 NYKIIKCDTMQGYWSCVKHLKPDQLLVCSDDLGISIPEDINRFV  111 (117)
T ss_dssp             GGGEEEESSHHHHHHHHHHH--SEEEE-GGGT-SSS-GGGGGT-
T ss_pred             ceeEEEecchhhHHHHHHhcCCCceEEehhhccccCHHHHHHHH
Confidence            4999999999999999999999988742222    334677776


No 2  
>COG4502 5'(3')-deoxyribonucleotidase [Nucleotide transport and metabolism]
Probab=87.97  E-value=2  Score=37.37  Aligned_cols=74  Identities=23%  Similarity=0.364  Sum_probs=57.1

Q ss_pred             cceeehhHHHHHHHHhhcCceEEEEEee--chhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770          127 QVTVKSSVLEALLEITKFCDLYLMERVL--DNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       127 ~a~v~esalevLleLsk~~DlYLm~~V~--dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      +..++|-+-+++-||++.+|||+.+.-.  -+.=|++--.++|..     .-|...+++||    |+-.+|   +.++-|
T Consensus        66 nL~V~p~aq~v~keLt~~y~vYivtaamdhp~s~~dK~eWl~E~F-----PFi~~qn~vfC----gnKniv---kaDilI  133 (180)
T COG4502          66 NLGVQPFAQTVLKELTSIYNVYIVTAAMDHPKSCEDKGEWLKEKF-----PFISYQNIVFC----GNKNIV---KADILI  133 (180)
T ss_pred             hcCccccHHHHHHHHHhhheEEEEEeccCCchhHHHHHHHHHHHC-----CCCChhhEEEe----cCCCeE---EeeEEe
Confidence            6788999999999999999999987663  344466777777873     36667999999    444444   568899


Q ss_pred             cCCHHHHH
Q 027770          205 DTNPEIVS  212 (219)
Q Consensus       205 Dtd~~vv~  212 (219)
                      |.++-.++
T Consensus       134 DDnp~nLE  141 (180)
T COG4502         134 DDNPLNLE  141 (180)
T ss_pred             cCCchhhh
Confidence            98887655


No 3  
>cd08556 GDPD Glycerophosphodiester phosphodiesterase domain as found in prokaryota and eukaryota, and similar proteins. The typical glycerophosphodiester phosphodiesterase domain (GDPD) consists of a TIM barrel and a small insertion domain named the GDPD-insertion (GDPD-I) domain, which is specific for GDPD proteins. This family corresponds to both typical GDPD domain and GDPD-like domain which lacks the GDPD-I region. Members in this family mainly consist of a large family of prokaryotic and eukaryotic glycerophosphodiester phosphodiesterases (GP-GDEs, EC 3.1.4.46), and a number of uncharacterized homologs. Sphingomyelinases D (SMases D) (sphingomyelin phosphodiesterase D, EC 3.1.4.41) from spider venom, SMases D-like proteins, and phospholipase D (PLD) from several pathogenic bacteria are also included in this family. GDPD plays an essential role in glycerol metabolism and catalyzes the hydrolysis of glycerophosphodiesters to sn-glycerol-3-phosphate (G3P) and the corresponding alcoho
Probab=82.85  E-value=2.2  Score=33.75  Aligned_cols=64  Identities=20%  Similarity=0.217  Sum_probs=47.9

Q ss_pred             hhHHHHHHHHhhcCceEEEEEeech----hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770          132 SSVLEALLEITKFCDLYLMERVLDN----ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       132 esalevLleLsk~~DlYLm~~V~dD----~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      +.--|+|-.+ +- +++++.-++++    .-.+.+++++++.|       ..++|+|||........+|++.|.+.+
T Consensus        49 ~tL~e~l~~~-~~-~~~i~leiK~~~~~~~~~~~l~~~i~~~~-------~~~~v~i~s~~~~~l~~~~~~~p~~~~  116 (189)
T cd08556          49 PTLEEVLELV-KG-GVGLNIELKEPTRYPGLEAKVAELLREYG-------LEERVVVSSFDHEALRALKELDPEVPT  116 (189)
T ss_pred             CCHHHHHHhc-cc-CcEEEEEECCCCCchhHHHHHHHHHHHcC-------CcCCEEEEeCCHHHHHHHHHhCCCCcE
Confidence            3444444333 33 89999999997    35688999999966       237999999988888888888887655


No 4  
>cd08562 GDPD_EcUgpQ_like Glycerophosphodiester phosphodiesterase domain in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase UgpQ and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46), UgpQ, and similar proteins. GP-GDE plays an essential role in the metabolic pathway of E. coli. It catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. E. coli possesses two major G3P uptake systems: Glp and Ugp, which contain genes coding for two distinct GP-GDEs. UgpQ gene from the E. coli ugp operon codes for a cytosolic phosphodiesterase GlpQ, which is the prototype of this family. Various glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG)
Probab=78.83  E-value=3  Score=34.69  Aligned_cols=63  Identities=19%  Similarity=0.207  Sum_probs=46.7

Q ss_pred             HHHHHHhhcCceEEEEEeechhh-----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770          136 EALLEITKFCDLYLMERVLDNES-----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       136 evLleLsk~~DlYLm~~V~dD~e-----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      +-++++.+..+++++.-++++.+     .+.+..++++.|..      .+||+|||-...-...+|++.|.+.+
T Consensus        90 ~evl~~~~~~~~~l~iEiK~~~~~~~~~~~~v~~~l~~~~~~------~~~v~~~Sf~~~~l~~~~~~~p~~~~  157 (229)
T cd08562          90 ADVLELARELGLGLNLEIKPDPGDEALTARVVAAALRELWPH------ASKLLLSSFSLEALRAARRAAPELPL  157 (229)
T ss_pred             HHHHHHHHhcCCEEEEEECCCCCccHHHHHHHHHHHHHhcCC------cCCEEEECCCHHHHHHHHHhCCCCcE
Confidence            33445566567888888887653     35688888886553      48999999988888899998887643


No 5  
>cd08582 GDPD_like_2 Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterase and similar proteins. They show high sequence similarity to Escherichia coli glycerophosphodiester phosphodiesterase, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=71.86  E-value=6.4  Score=33.24  Aligned_cols=61  Identities=23%  Similarity=0.339  Sum_probs=44.7

Q ss_pred             HHHHHHhhcCceEEEEEeech----hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          136 EALLEITKFCDLYLMERVLDN----ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       136 evLleLsk~~DlYLm~~V~dD----~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      +-++++.+..+++|+.-++++    .-.+.+.++++++|.      ..+||+|+|-..-....+|++.|.+
T Consensus        90 ~evl~~~~~~~~~l~ieiK~~~~~~~~~~~~~~~~~~~~~------~~~~v~~~Sf~~~~l~~~~~~~p~~  154 (233)
T cd08582          90 EEYLAIVPKYGKKLFIEIKHPRRGPEAEEELLKLLKESGL------LPEQIVIISFDAEALKRVRELAPTL  154 (233)
T ss_pred             HHHHHHHHhcCceEEEEeCCCccCccHHHHHHHHHHHcCC------CCCCEEEEecCHHHHHHHHHHCCCC
Confidence            334455555589999999963    334678888888653      3489999998888888889988873


No 6  
>COG0144 Sun tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis]
Probab=69.04  E-value=5.5  Score=36.95  Aligned_cols=97  Identities=24%  Similarity=0.214  Sum_probs=68.7

Q ss_pred             Eeeccee------eccCChhhhhccceeehhHHHHHHHHhhcC--ceEEEEEeechhhHHHHHHHHHHcCCCCC--CCCc
Q 027770          109 CRLLGVI------LEESCPEELQKQVTVKSSVLEALLEITKFC--DLYLMERVLDNESEKKVLLALETAGVFTS--GGLV  178 (219)
Q Consensus       109 is~~gvv------lee~sp~el~~~a~v~esalevLleLsk~~--DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~--gGL~  178 (219)
                      ||-.|++      .|+.+++++.+-+.++.+.++.-..+.|-.  =||=-|.+.-.|.|++|..+|++...|.-  .++.
T Consensus       239 CSg~G~irr~Pd~~~~~~~~~i~~l~~lQ~~iL~~a~~~lk~GG~LVYSTCS~~~eENE~vV~~~L~~~~~~~~~~~~~~  318 (355)
T COG0144         239 CSGTGVIRRDPDVKWRRTPEDIAELAKLQKEILAAALKLLKPGGVLVYSTCSLTPEENEEVVERFLERHPDFELEPVRLP  318 (355)
T ss_pred             CCCCcccccCccccccCCHHHHHHHHHHHHHHHHHHHHhcCCCCEEEEEccCCchhcCHHHHHHHHHhCCCceeeccccc
Confidence            7888887      577788888888888888888888888775  47888999999999999999998655531  1111


Q ss_pred             cceEEeeecCcCccceeeecccceeecCCH
Q 027770          179 KDKVLFCSTEIGRTSFVRQLEPDWHIDTNP  208 (219)
Q Consensus       179 rHKVLFCST~~Gr~sfVRQLeP~lHIDtd~  208 (219)
                      -..  ++.-.......++++-|..| ++|-
T Consensus       319 ~~~--~~~~~~~~~~~~~r~~p~~~-~~dG  345 (355)
T COG0144         319 WGP--LFEGLGSELGKTRRLYPHVH-GTDG  345 (355)
T ss_pred             ccc--cccccccccCCeEEECCCCC-CCCC
Confidence            000  22221123445799999999 5454


No 7  
>PF06941 NT5C:  5' nucleotidase, deoxy (Pyrimidine), cytosolic type C protein (NT5C);  InterPro: IPR010708 This family consists of several 5' nucleotidase, deoxy (Pyrimidine), and cytosolic type C (NT5C) proteins. 5'(3')-deoxyribonucleotidase is a ubiquitous enzyme in mammalian cells whose physiological function is not known [].; GO: 0016791 phosphatase activity; PDB: 1Z4M_A 1Q92_A 1Q91_A 1Z4J_A 1Z4I_A 1Z4Q_A 1Z4K_A 2JAW_A 1MH9_A 1Z4L_A ....
Probab=68.86  E-value=9  Score=31.54  Aligned_cols=79  Identities=19%  Similarity=0.242  Sum_probs=48.0

Q ss_pred             ccceeehhHHHHHHHHhhcC-ceEEEEEeech-h--hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccc
Q 027770          126 KQVTVKSSVLEALLEITKFC-DLYLMERVLDN-E--SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPD  201 (219)
Q Consensus       126 ~~a~v~esalevLleLsk~~-DlYLm~~V~dD-~--eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~  201 (219)
                      .+....+.|.|+|-+|.+.. .++.|+.-..+ .  .++.-.+-|++.  |.  ++..+++.||..   | +   .+..+
T Consensus        70 ~~l~p~~gA~e~l~~L~~~g~~~~~Itar~~~~~~~~~~~k~~Wl~~h--f~--~i~~~~~~~~~~---K-~---~v~~D  138 (191)
T PF06941_consen   70 SNLPPIPGAVEALKKLRDKGHEIVIITARPPEFPDHSAEEKREWLERH--FP--FIPYDNLIFTGD---K-T---LVGGD  138 (191)
T ss_dssp             TT--B-TTHHHHHHHHHTSTTEEEEEEE-SSSSGCCCHHHHHHHHHHH--HT--HHHHCCEEEESS---G-G---GC--S
T ss_pred             cCCCccHHHHHHHHHHHHcCCcEEEEEecCccccchHHHHHHHHHHHH--cC--CCchheEEEecC---C-C---eEecc
Confidence            35677899999999999996 56655655444 1  233334455553  22  455588999842   2 2   45789


Q ss_pred             eeecCCHHHHHhhh
Q 027770          202 WHIDTNPEIVSQLA  215 (219)
Q Consensus       202 lHIDtd~~vv~~L~  215 (219)
                      +.||-++..+..+.
T Consensus       139 vlIDD~~~n~~~~~  152 (191)
T PF06941_consen  139 VLIDDRPHNLEQFA  152 (191)
T ss_dssp             EEEESSSHHHSS-S
T ss_pred             EEecCChHHHHhcc
Confidence            99999998887653


No 8  
>PF01644 Chitin_synth_1:  Chitin synthase;  InterPro: IPR004834 This region is found commonly in chitin synthases classes I, II and III 2.4.1.16 from EC. Chitin a linear homopolymer of GlcNAc residues, it is an important component of the cell wall of fungi and is synthesised on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases []. ; GO: 0004100 chitin synthase activity, 0006031 chitin biosynthetic process
Probab=59.55  E-value=22  Score=30.55  Aligned_cols=29  Identities=31%  Similarity=0.678  Sum_probs=23.2

Q ss_pred             CCccceEEeeecCcCc----------cceeeecccceee
Q 027770          176 GLVKDKVLFCSTEIGR----------TSFVRQLEPDWHI  204 (219)
Q Consensus       176 GL~rHKVLFCST~~Gr----------~sfVRQLeP~lHI  204 (219)
                      |..+=.++||-.++=.          -+|+|||+|.+.|
T Consensus       105 ~~~PvQ~ifclKe~N~kKinSHrWfFnaf~~~l~P~vcv  143 (163)
T PF01644_consen  105 NIVPVQIIFCLKEKNAKKINSHRWFFNAFCRQLQPNVCV  143 (163)
T ss_pred             CCCCEEEEEEeccccccccchhhHHHHHHHhhcCCcEEE
Confidence            6778889999887643          3799999998764


No 9  
>cd08579 GDPD_memb_like Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized bacterial glycerophosphodiester phosphodiesterases. In addition to a C-terminal GDPD domain, most members in this family have an N-terminus that functions as a membrane anchor.
Probab=57.59  E-value=19  Score=30.17  Aligned_cols=59  Identities=14%  Similarity=0.087  Sum_probs=41.3

Q ss_pred             HHHhhcCceEEEEEeechh-----hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770          139 LEITKFCDLYLMERVLDNE-----SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       139 leLsk~~DlYLm~~V~dD~-----eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      +++.+..++.++.-++++.     -.+.+++++++.|..       .+|+|+|-......-+|++.|.+.+
T Consensus        89 l~~~~~~~~~l~iEiK~~~~~~~~~~~~v~~~l~~~~~~-------~~v~v~Sf~~~~l~~~~~~~p~~~~  152 (220)
T cd08579          89 LALAKGLKQKLLIELKPHGHDSPDLVEKFVKLYKQNLIE-------NQHQVHSLDYRVIEKVKKLDPKIKT  152 (220)
T ss_pred             HHHhhccCCeEEEEECCCCCCCHHHHHHHHHHHHHcCCC-------cCeEEEeCCHHHHHHHHHHCCCCeE
Confidence            3444445777878888765     246788888886542       4799999877777778888887654


No 10 
>cd08563 GDPD_TtGDE_like Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Thermoanaerobacter tengcongensis glycerophosphodiester phosphodiesterase (TtGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. Despite the fact that most of GDPD family members exist as the monomer, TtGDE can function as a dimeric unit. Its catalytic mechanism is based on the general base-acid catalysis, which is similar to that of phosphoinositide-specific phospholipases C (PI-PLCs, EC 3.1.4.11). A divalent metal cation is required for the enzyme activity of TtGDE.
Probab=57.22  E-value=16  Score=30.78  Aligned_cols=57  Identities=26%  Similarity=0.346  Sum_probs=43.0

Q ss_pred             HHhhcCceEEEEEeechhh-----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeeccccee
Q 027770          140 EITKFCDLYLMERVLDNES-----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWH  203 (219)
Q Consensus       140 eLsk~~DlYLm~~V~dD~e-----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lH  203 (219)
                      ++.+.-+++++.-++++..     ++.+.++++++|.       .+++++.|-..-....+|++.|.+.
T Consensus        96 ~~~~~~~~~l~leiK~~~~~~~~~~~~l~~~l~~~~~-------~~~v~~~Sf~~~~l~~~~~~~p~~~  157 (230)
T cd08563          96 DLLKDKDLLLNIEIKTDVIHYPGIEKKVLELVKEYNL-------EDRVIFSSFNHESLKRLKKLDPKIK  157 (230)
T ss_pred             HHHHhcCcEEEEEECCCCCcChhHHHHHHHHHHHcCC-------CCCEEEEcCCHHHHHHHHHHCCCCc
Confidence            4444457888888887643     4688889988662       3689999988888888999999753


No 11 
>cd08567 GDPD_SpGDE_like Glycerophosphodiester phosphodiesterase domain of putative Silicibacter pomeroyi glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46) and similar proteins. The prototype of this CD is a putative GP-GDE from Silicibacter pomeroyi (SpGDE). It shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=56.31  E-value=16  Score=31.09  Aligned_cols=66  Identities=20%  Similarity=0.260  Sum_probs=44.2

Q ss_pred             ehhHHHHHHHHhhcC---ceEEEEEeechhh-----------HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceee
Q 027770          131 KSSVLEALLEITKFC---DLYLMERVLDNES-----------EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVR  196 (219)
Q Consensus       131 ~esalevLleLsk~~---DlYLm~~V~dD~e-----------E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVR  196 (219)
                      .+.--|+|- +.+..   +++|..-++++..           .+.++++++++|.       .+||+|.|...-....+|
T Consensus       109 iptL~evl~-~~~~~~~~~~~l~iEiK~~~~~~~~~~~~~~~~~~v~~~l~~~~~-------~~~v~~~Sf~~~~l~~~~  180 (263)
T cd08567         109 IPTLEEVFA-LVEKYGNQKVRFNIETKSDPDRDILHPPPEEFVDAVLAVIRKAGL-------EDRVVLQSFDWRTLQEVR  180 (263)
T ss_pred             CCCHHHHHH-HHHHhccCCceEEEEEcCCCCccccCccHHHHHHHHHHHHHHcCC-------CCceEEEeCCHHHHHHHH
Confidence            444444444 44433   5777777775432           2678888888653       268999998877788889


Q ss_pred             ecccceee
Q 027770          197 QLEPDWHI  204 (219)
Q Consensus       197 QLeP~lHI  204 (219)
                      ++.|.+.+
T Consensus       181 ~~~p~~~~  188 (263)
T cd08567         181 RLAPDIPT  188 (263)
T ss_pred             HHCCCccE
Confidence            98887643


No 12 
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=55.90  E-value=16  Score=23.10  Aligned_cols=21  Identities=38%  Similarity=0.376  Sum_probs=18.8

Q ss_pred             CCCchHHHHHHHHHHHHHHHH
Q 027770           12 QDSRSVGAIAGLALAIVFTWR   32 (219)
Q Consensus        12 ~~~~s~~~~ag~a~a~v~~w~   32 (219)
                      .|++|-.+++|++++.++++|
T Consensus         2 PDsGST~~Ll~~~l~~l~~~r   22 (26)
T TIGR03778         2 PDSGSTLALLGLGLLGLLGLR   22 (26)
T ss_pred             CCchhHHHHHHHHHHHHHHHh
Confidence            478899999999999999886


No 13 
>TIGR03510 XapX XapX domain. This model describes an uncharacterized small, hydrophobic protein of about 50 amino acids, found between the xapB and xapR genes of the E. coli xanthosine utilization system, and homologous regions in other small proteins, such as the N-terminal region of DUF1427 (Pfam model pfam07235). We name this domain XapX, as it comprises the full length of the protein encoded between the genes for the well-studied XapB and XapR proteins.
Probab=55.65  E-value=15  Score=26.12  Aligned_cols=23  Identities=22%  Similarity=0.345  Sum_probs=19.5

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCC
Q 027770           17 VGAIAGLALAIVFTWRMFRSPGA   39 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~lr~~~~   39 (219)
                      ++-.+||++.++|+|--+++|-+
T Consensus         4 lsl~~G~~vG~~~~~l~vp~PAP   26 (49)
T TIGR03510         4 LSLGAGLLVGALYSLLKVPSPAP   26 (49)
T ss_pred             HHHHHHHHHHHHHHHhCCCCCCC
Confidence            46689999999999999988753


No 14 
>PF04246 RseC_MucC:  Positive regulator of sigma(E), RseC/MucC;  InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=54.87  E-value=12  Score=29.65  Aligned_cols=28  Identities=21%  Similarity=0.171  Sum_probs=21.3

Q ss_pred             CCchHHHHHHHHHHHHHHHHhhcCCCCC
Q 027770           13 DSRSVGAIAGLALAIVFTWRMFRSPGAP   40 (219)
Q Consensus        13 ~~~s~~~~ag~a~a~v~~w~~lr~~~~~   40 (219)
                      ...-+++++|+++++++.|++-|.-...
T Consensus        97 ~~~~l~~l~~l~~~~~~~~~~~~~~~~~  124 (135)
T PF04246_consen   97 LWAILGGLLGLALGFLILRLFDRRLKKK  124 (135)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence            3445788999999999999888865544


No 15 
>PF13419 HAD_2:  Haloacid dehalogenase-like hydrolase; PDB: 2FI1_A 2I6X_A 3SD7_A 4F71_A 4DFD_B 4F72_B 4DCC_A 3DDH_A 3KZX_A 2B0C_A ....
Probab=50.58  E-value=69  Score=23.95  Aligned_cols=79  Identities=20%  Similarity=0.266  Sum_probs=51.8

Q ss_pred             hccceeehhHHHHHHHHh-hcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCc--------ccee
Q 027770          125 QKQVTVKSSVLEALLEIT-KFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR--------TSFV  195 (219)
Q Consensus       125 ~~~a~v~esalevLleLs-k~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr--------~sfV  195 (219)
                      .......+.+.+.|.+|. +...+++++.-    +...+...|+..|+.+   + -+ -+||+.+.|.        ..++
T Consensus        73 ~~~~~~~~~~~~~L~~l~~~~~~~~i~Sn~----~~~~~~~~l~~~~~~~---~-f~-~i~~~~~~~~~Kp~~~~~~~~~  143 (176)
T PF13419_consen   73 ESKLQPYPGVRELLERLKAKGIPLVIVSNG----SRERIERVLERLGLDD---Y-FD-EIISSDDVGSRKPDPDAYRRAL  143 (176)
T ss_dssp             HGGEEESTTHHHHHHHHHHTTSEEEEEESS----EHHHHHHHHHHTTHGG---G-CS-EEEEGGGSSSSTTSHHHHHHHH
T ss_pred             hhccchhhhhhhhhhhcccccceeEEeecC----Cccccccccccccccc---c-cc-cccccchhhhhhhHHHHHHHHH
Confidence            468899999999999998 66788776533    5677778888866542   1 13 3456665553        3344


Q ss_pred             eec--cc--ceeecCCHHHHH
Q 027770          196 RQL--EP--DWHIDTNPEIVS  212 (219)
Q Consensus       196 RQL--eP--~lHIDtd~~vv~  212 (219)
                      +++  +|  .+|||.++.-+.
T Consensus       144 ~~~~~~p~~~~~vgD~~~d~~  164 (176)
T PF13419_consen  144 EKLGIPPEEILFVGDSPSDVE  164 (176)
T ss_dssp             HHHTSSGGGEEEEESSHHHHH
T ss_pred             HHcCCCcceEEEEeCCHHHHH
Confidence            433  44  678887765443


No 16 
>cd08573 GDPD_GDE1 Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE1 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE1 (also known as MIR16, membrane interacting protein of RGS16) and their metazoan homologs. GDE1 is widely expressed in mammalian tissues, including the heart, brain, liver, and kidney. It shows sequence homology to bacterial glycerophosphodiester phosphodiesterases (GP-GDEs, EC 3.1.4.46), which catalyzes the hydrolysis of various glycerophosphodiesters, and produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. GDE1 has been characterized as GPI-GDE (EC 3.1.4.44) that selectively hydrolyzes extracellular glycerophosphoinositol (GPI) to generate glycerol phosphate and inositol. It functions as an integral membrane-bound glycoprotein interacting with regulator of G protein signaling protein RGS16, and is modulated by G 
Probab=50.11  E-value=28  Score=30.47  Aligned_cols=62  Identities=18%  Similarity=0.096  Sum_probs=45.1

Q ss_pred             HHHHHhhcCceEEEEEeechhh--HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceee
Q 027770          137 ALLEITKFCDLYLMERVLDNES--EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       137 vLleLsk~~DlYLm~~V~dD~e--E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      -++++.+..++.++.-++++.+  ++.+.+++++.|.+      .++++++|-......-+|++.|.+.+
T Consensus        93 evl~~~~~~~~~l~iEiK~~~~~~~~~v~~~l~~~~~~------~~~v~v~SF~~~~l~~~~~~~p~~~~  156 (258)
T cd08573          93 EAVKECLENNLRMIFDVKSNSSKLVDALKNLFKKYPGL------YDKAIVCSFNPIVIYKVRKADPKILT  156 (258)
T ss_pred             HHHHHHHhcCCEEEEEeCCCcHHHHHHHHHHHHHCCCc------cCCEEEEECCHHHHHHHHHhCCCceE
Confidence            3335555557777777886543  46788888886532      36899999998888899999998654


No 17 
>cd08564 GDPD_GsGDE_like Glycerophosphodiester phosphodiesterase domain of putative Galdieria sulphuraria glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in putative Galdieria sulphuraria glycerophosphodiester phosphodiesterase (GsGDE, EC 3.1.4.46) and its uncharacterized eukaryotic homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=48.77  E-value=37  Score=29.53  Aligned_cols=61  Identities=21%  Similarity=0.247  Sum_probs=42.9

Q ss_pred             hhHHHHHHHHhhcCceEEEEEeech--hhHHHHHHHHHHcCCCCCCCCccceEEeeecCc-Cccceeeecccc
Q 027770          132 SSVLEALLEITKFCDLYLMERVLDN--ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI-GRTSFVRQLEPD  201 (219)
Q Consensus       132 esalevLleLsk~~DlYLm~~V~dD--~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~-Gr~sfVRQLeP~  201 (219)
                      +.-.|+|- +.+. ++.+..-++++  .-++.++++|+++|..       ++|+|||-.- .-...+|++.|.
T Consensus       108 ptL~evl~-~~~~-~~~l~iEiK~~~~~~~~~v~~~l~~~~~~-------~~v~i~SF~~~~~l~~~~~~~~~  171 (265)
T cd08564         108 PTLEDVLV-TFKD-KLKYNIELKGREVGLGERVLNLVEKYGMI-------LQVHFSSFLHYDRLDLLKALRPN  171 (265)
T ss_pred             CCHHHHHH-Hhcc-CcEEEEEeCCCchhHHHHHHHHHHHcCCC-------CCEEEEecCchhHHHHHHHhCcC
Confidence            33344443 3333 67777778866  3458899999996642       6899999888 777788998886


No 18 
>cd08601 GDPD_SaGlpQ_like Glycerophosphodiester phosphodiesterase domain of Staphylococcus aureus and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) from Staphylococcus aureus, Bacillus subtilis and similar proteins. Members in this family show very high sequence similarity to Escherichia coli periplasmic phosphodiesterase GlpQ, which catalyzes the Ca2+-dependent degradation of periplasmic glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=48.59  E-value=34  Score=29.40  Aligned_cols=44  Identities=27%  Similarity=0.382  Sum_probs=35.1

Q ss_pred             hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          158 SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       158 eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      -++.+.++++++|+.+. .+.+++|+|+|-..--...+|++.|.+
T Consensus       127 ~~~~v~~~l~~~~~~~~-~~~~~~v~i~Sf~~~~l~~~~~~~p~~  170 (256)
T cd08601         127 MEEKLLATLDKYGLLTD-NLKNGQVIIQSFSKESLKKLHQLNPNI  170 (256)
T ss_pred             HHHHHHHHHHHcCCCcc-cCCCCCEEEecCCHHHHHHHHHhCCCC
Confidence            35789999999887652 455689999998888888889988864


No 19 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=44.86  E-value=20  Score=29.45  Aligned_cols=19  Identities=26%  Similarity=0.369  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHHHHHhhc
Q 027770           17 VGAIAGLALAIVFTWRMFR   35 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~lr   35 (219)
                      +|.+||++..++|.++++|
T Consensus        71 ~gv~aGvIg~Illi~y~ir   89 (122)
T PF01102_consen   71 FGVMAGVIGIILLISYCIR   89 (122)
T ss_dssp             HHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            6788888887888888874


No 20 
>PRK11009 aphA acid phosphatase/phosphotransferase; Provisional
Probab=43.04  E-value=1.5e+02  Score=26.42  Aligned_cols=110  Identities=10%  Similarity=0.065  Sum_probs=66.9

Q ss_pred             CCchHHHHHHHhcCce--eEEEeecceeeccCCh--------------h----h--------hhccceeehhHHHHHHHH
Q 027770           90 KPTLGQVVRQKLSEGR--KVTCRLLGVILEESCP--------------E----E--------LQKQVTVKSSVLEALLEI  141 (219)
Q Consensus        90 ~~t~~qiv~~kls~~R--kvTis~~gvvlee~sp--------------~----e--------l~~~a~v~esalevLleL  141 (219)
                      -++..||-.. |.|.+  .|.+-+.++|+.++..              +    +        .+......+.++|.|-.+
T Consensus        48 ~~~~~~~~~~-~~~~~p~av~~DIDeTvldnsp~~~~~~~~f~~~~~~y~~~~~fw~~y~~~~~~~a~p~~Ga~elL~~L  126 (237)
T PRK11009         48 WVSVAQIEKS-LEGRPPMAVGFDIDDTVLFSSPGFWRGKKTFSPGSEDYLKNQKFWEKMNNGWDEFSIPKEVARQLIDMH  126 (237)
T ss_pred             EEEHHHhhhh-ccCCCCcEEEEECcCccccCCchheeeeeccCCCcccccChHHHHHHHHhcccccCcchHHHHHHHHHH
Confidence            3688888754 44433  5777788899875321              0    0        011333445589999988


Q ss_pred             -hhcCceEEEEEeechhhHHHHHHHHHHcCC--CCCCCCccceEEeeecC---cCccceeeecccceeec
Q 027770          142 -TKFCDLYLMERVLDNESEKKVLLALETAGV--FTSGGLVKDKVLFCSTE---IGRTSFVRQLEPDWHID  205 (219)
Q Consensus       142 -sk~~DlYLm~~V~dD~eE~~Vl~~Le~~Gl--F~~gGL~rHKVLFCST~---~Gr~sfVRQLeP~lHID  205 (219)
                       .+-..||+++-=.....+.....+++..|+  -.     -..++||.+.   ..+...+.+++-.++|.
T Consensus       127 ~~~G~~I~iVTnR~~~k~~~t~~~Llk~~gip~~~-----~f~vil~gd~~~K~~K~~~l~~~~i~I~IG  191 (237)
T PRK11009        127 VKRGDSIYFITGRTATKTETVSKTLADDFHIPADN-----MNPVIFAGDKPGQYTKTQWLKKKNIRIFYG  191 (237)
T ss_pred             HHCCCeEEEEeCCCCcccHHHHHHHHHHcCCCccc-----ceeEEEcCCCCCCCCHHHHHHhcCCeEEEc
Confidence             555778887633223445566666766555  22     1357787663   45666777777778876


No 21 
>PF01171 ATP_bind_3:  PP-loop family;  InterPro: IPR011063 This entry represents the PP-loop motif superfamily [,]. The PP-loop motif appears to be a modified version of the P-loop of nucleotide binding domain that is involved in phosphate binding []. Named PP-motif, since it appears to be a part of a previously uncharacterised ATP pyrophophatase domain. ATP sulfurylases, Escherichia coli NtrL, and Bacillus subtilis OutB consist of this domain alone. In other proteins, the pyrophosphatase domain is associated with amidotransferase domains (type I or type II), a putative citrulline-aspartate ligase domain or a nitrilase/amidase domain.; PDB: 3A2K_A 2E89_B 2E21_D 1WY5_B 1NI5_A.
Probab=42.76  E-value=58  Score=26.64  Aligned_cols=67  Identities=19%  Similarity=0.299  Sum_probs=42.3

Q ss_pred             eehhHHHHHHHHhhc--CceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec
Q 027770          130 VKSSVLEALLEITKF--CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID  205 (219)
Q Consensus       130 v~esalevLleLsk~--~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID  205 (219)
                      .+.--.+.|.++|+.  |+..++.|-.||+-|.-+++++...|+.+-.|+.+.+-.     .|    ++=+.|.+.+.
T Consensus        80 aR~~Ry~~l~~~a~~~g~~~i~~GHh~dD~~ET~l~~l~rg~~~~~l~~~~~~~~~-----~~----~~~iRPLl~~~  148 (182)
T PF01171_consen   80 ARELRYQFLREIAKEEGCNKIALGHHLDDQAETFLMNLLRGSGLRGLAGMPPVSPF-----KG----IKLIRPLLYVS  148 (182)
T ss_dssp             HHHHHHHHHHHHHHTTT-CEEE---BHHHHHHHHHHHHHHT--CCCCC-S-SEEEE-----TT----CEEE-GGGCS-
T ss_pred             HHHHHHHHHHHhhhcccccceeecCcCCccHHHHHHHHHHhccchhhccccccccc-----cC----cccCCcchhCC
Confidence            344555788888876  699999999999999999999999888665576663322     33    55566877664


No 22 
>cd01028 TOPRIM_TopoIA TOPRIM_TopoIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IA family of DNA topoisomerases (TopoIA).  This subgroup contains proteins similar to the Type I DNA topoisomerases: E. coli topisomerases I and III, eukaryotic topoisomerase III and, ATP-dependent reverse gyrase found in archaea and thermophilic bacteria.   Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA. These enzymes cleave one strand of the DNA duplex, covalently link to the 5' phosphoryl end of the DNA break and allow the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP.  The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD).  For topoisomerases the conserved glutamate is believed to act as a general b
Probab=41.38  E-value=1.8e+02  Score=23.17  Aligned_cols=79  Identities=25%  Similarity=0.234  Sum_probs=50.5

Q ss_pred             eeEEEeecceeeccCChhhhhcc--------------cee---ehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHH
Q 027770          105 RKVTCRLLGVILEESCPEELQKQ--------------VTV---KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALE  167 (219)
Q Consensus       105 RkvTis~~gvvlee~sp~el~~~--------------a~v---~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le  167 (219)
                      .-+.+++.|=|++=..|++-...              +.+   ..+.++.|.++++-+|.++++.=.|-|.|.-...+++
T Consensus        34 ~~~v~~~~GHl~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~l~~~~d~iiiAtD~DrEGE~I~~~i~~  113 (142)
T cd01028          34 GYVVTASVGHLLELPFPEEYVDWDKDWPLELFPFEPKYVVIPDKKKQLKALKKLAKKADEIVLATDPDREGELIAWEILE  113 (142)
T ss_pred             CEEEEEEccccccCCCcccccccccCCchhhCCCCceEEeCCcHHHHHHHHHHHHhcCCEEEEcCCCCcchHHHHHHHHH
Confidence            34667778888776555543331              111   3456778889999999888776666666665566777


Q ss_pred             HcCCCCCCCCccceEEeee
Q 027770          168 TAGVFTSGGLVKDKVLFCS  186 (219)
Q Consensus       168 ~~GlF~~gGL~rHKVLFCS  186 (219)
                      -.|. .  -.+-.|+.|-|
T Consensus       114 ~~~~-~--~~~v~R~~fss  129 (142)
T cd01028         114 VLKC-D--NKPVKRAWFSE  129 (142)
T ss_pred             HhCC-C--CCCeEEEEEcc
Confidence            6554 1  22447888876


No 23 
>TIGR02245 HAD_IIID1 HAD-superfamily subfamily IIID hydrolase, TIGR02245. This family of sequences appears to belong to the Haloacid Dehalogenase (HAD) superfamily of enzymes by virtue of the presence of three catalytic domains, in this case: LLVLD(ILV)D(YH)T, I(VMG)IWS, and (DN)(VC)K(PA)Lx{15-17}T(IL)(MH)(FV)DD(IL)(GRS)(RK)N. Since this family has no large "cap" domain between motifs 1 and 2 or between 2 and 3, it is formally a "class III" HAD.
Probab=40.87  E-value=2.4e+02  Score=24.49  Aligned_cols=76  Identities=16%  Similarity=0.187  Sum_probs=54.6

Q ss_pred             CceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceE
Q 027770          103 EGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKV  182 (219)
Q Consensus       103 ~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKV  182 (219)
                      |.+.+.+-+.++++.-.++.+ ......|+-+-|-|-.++..+||++.+    ...++.+...++.-|+.++   .+.|+
T Consensus        20 ~kklLVLDLDeTLvh~~~~~~-~~~~~kRP~l~eFL~~~~~~feIvVwT----Aa~~~ya~~~l~~l~~~~~---~~~~i   91 (195)
T TIGR02245        20 GKKLLVLDIDYTLFDHRSPAE-TGEELMRPYLHEFLTSAYEDYDIVIWS----ATSMKWIEIKMTELGVLTN---PNYKI   91 (195)
T ss_pred             CCcEEEEeCCCceEcccccCC-CceEEeCCCHHHHHHHHHhCCEEEEEe----cCCHHHHHHHHHHhcccCC---ccceE
Confidence            456777888899886434433 346788999999999999999999876    4456788888888766553   12345


Q ss_pred             Eeee
Q 027770          183 LFCS  186 (219)
Q Consensus       183 LFCS  186 (219)
                      .||=
T Consensus        92 ~~~l   95 (195)
T TIGR02245        92 TFLL   95 (195)
T ss_pred             EEEe
Confidence            5543


No 24 
>TIGR03190 benz_CoA_bzdN benzoyl-CoA reductase, bzd-type, N subunit. Members of this family are the N subunit of one of two related types of four-subunit ATP-dependent benzoyl-CoA reductase. This enzyme system catalyzes the dearomatization of benzoyl-CoA, a common intermediate in pathways for the degradation for a number of different aromatic compounds, such as phenol and toluene.
Probab=40.48  E-value=7  Score=36.33  Aligned_cols=70  Identities=17%  Similarity=0.165  Sum_probs=46.1

Q ss_pred             HHHHHHhhcCceEEEEEeec-hhh--HHHHHHHHHHcCCCCCCCCccceEEeeecCcCcccee-eecc----cceeecCC
Q 027770          136 EALLEITKFCDLYLMERVLD-NES--EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFV-RQLE----PDWHIDTN  207 (219)
Q Consensus       136 evLleLsk~~DlYLm~~V~d-D~e--E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfV-RQLe----P~lHIDtd  207 (219)
                      +++..||+-+--...|.... +.+  -+.+..+.++   ++..|+.-|-.-||-+..+-...+ ++|+    |.++||+|
T Consensus       276 dpl~alA~~yl~~~~C~~~~~p~~~R~~~i~~lv~~---~~~DGVI~~~~kfC~~~~~e~~~lk~~l~e~GIP~L~iE~D  352 (377)
T TIGR03190       276 DVIKAIAERYCDRPACPTKDYPVHTRYDHVLGLAKE---YNVQGAIFLQQKFCDPHEGDYPDLKRHLEANGIPTLFLEFD  352 (377)
T ss_pred             cHHHHHHHHhcCCCCCCCcCCCHHHHHHHHHHHHHH---hCCCEEEEecccCCCcchhhhHHHHHHHHHCCCCEEEEecC
Confidence            45666665542123344432 222  2555555565   344588999999999999988665 6776    99999987


Q ss_pred             H
Q 027770          208 P  208 (219)
Q Consensus       208 ~  208 (219)
                      -
T Consensus       353 ~  353 (377)
T TIGR03190       353 I  353 (377)
T ss_pred             C
Confidence            5


No 25 
>cd01427 HAD_like Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
Probab=39.57  E-value=1e+02  Score=21.71  Aligned_cols=60  Identities=28%  Similarity=0.305  Sum_probs=43.9

Q ss_pred             EEeecceeeccCChhhhhccceeehhHHHHHHHHhhc-CceEEEEEeechhhHHHHHHHHHHcCC
Q 027770          108 TCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKF-CDLYLMERVLDNESEKKVLLALETAGV  171 (219)
Q Consensus       108 Tis~~gvvlee~sp~el~~~a~v~esalevLleLsk~-~DlYLm~~V~dD~eE~~Vl~~Le~~Gl  171 (219)
                      .+.+.|+++.+.+..+........+.+++.|-+|... ..+|+++    ......+...++..|+
T Consensus         3 vfD~D~tl~~~~~~~~~~~~~~~~~~~~~~l~~l~~~g~~i~ivS----~~~~~~~~~~~~~~~~   63 (139)
T cd01427           3 LFDLDGTLLDSEPGIAEIEELELYPGVKEALKELKEKGIKLALAT----NKSRREVLELLEELGL   63 (139)
T ss_pred             EEccCCceEccCccccccccCCcCcCHHHHHHHHHHCCCeEEEEe----CchHHHHHHHHHHcCC
Confidence            4567888888777666667788999999999999986 5555544    3336677777777665


No 26 
>cd08561 GDPD_cytoplasmic_ScUgpQ2_like Glycerophosphodiester phosphodiesterase domain of Streptomyces coelicolor cytoplasmic phosphodiesterases UgpQ2 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized cytoplasmic phosphodiesterases which predominantly exist in bacteria. The prototype of this family is a putative cytoplasmic phosphodiesterase encoded by gene ulpQ2 (SCO1419) in the Streptomyces coelicolor genome. It is distantly related to the Escherichia coli cytoplasmic phosphodiesterases UgpQ that catalyzes the hydrolysis of glycerophosphodiesters at the inner side of the cytoplasmic membrane to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=39.29  E-value=49  Score=28.30  Aligned_cols=55  Identities=22%  Similarity=0.221  Sum_probs=43.8

Q ss_pred             ceEEEEEeech--hhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeecCC
Q 027770          146 DLYLMERVLDN--ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTN  207 (219)
Q Consensus       146 DlYLm~~V~dD--~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHIDtd  207 (219)
                      ++++..-+.+.  .-.+.++++++++|+       .++++|.|-...-...++++.|.+.+.-.
T Consensus       106 ~~~~~ieiK~~~~~~~~~~~~~l~~~~~-------~~~~~~~Sf~~~~l~~~~~~~p~~~~~~~  162 (249)
T cd08561         106 DVRLNIEIKDDGPAAAAALADLIERYGA-------QDRVLVASFSDRVLRRFRRLCPRVATSAG  162 (249)
T ss_pred             CCcEEEEECCCchhHHHHHHHHHHHcCC-------CCcEEEEECCHHHHHHHHHHCCCcceecc
Confidence            47788888886  446789999998662       26899999999999999999998766543


No 27 
>COG4317 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.31  E-value=38  Score=26.97  Aligned_cols=20  Identities=35%  Similarity=0.534  Sum_probs=17.4

Q ss_pred             HHHHHHHHHHHHHHhhcCCC
Q 027770           19 AIAGLALAIVFTWRMFRSPG   38 (219)
Q Consensus        19 ~~ag~a~a~v~~w~~lr~~~   38 (219)
                      -.||+++.++|+...+|+|-
T Consensus         8 lgAGllVGiiyaLl~vrsPA   27 (93)
T COG4317           8 LGAGLLVGIIYALLKVRSPA   27 (93)
T ss_pred             HhhhHHHHHHHHHHhCCCCC
Confidence            34799999999999999875


No 28 
>cd08568 GDPD_TmGDE_like Glycerophosphodiester phosphodiesterase domain of Thermotoga maritime and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Thermotoga maritime glycerophosphodiester phosphodiesterase (TmGDE, EC 3.1.4.46) and its uncharacterized  homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. TmGDE exists as a monomer that might be the biologically relevant form.
Probab=37.04  E-value=59  Score=27.46  Aligned_cols=50  Identities=26%  Similarity=0.359  Sum_probs=36.6

Q ss_pred             ceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          146 DLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       146 DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      +++|..-+++....+.+.++++++|..       .+++|-|-..-....+|++.|.+
T Consensus        92 ~~~l~iEiK~~~~~~~~~~~l~~~~~~-------~~v~i~SF~~~~l~~~~~~~p~~  141 (226)
T cd08568          92 DAIINVEIKDIDAVEPVLEIVEKFNAL-------DRVIFSSFNHDALRELRKLDPDA  141 (226)
T ss_pred             CcEEEEEECCccHHHHHHHHHHHcCCC-------CcEEEEECCHHHHHHHHHhCCCC
Confidence            467888899887778899999987643       46777776665556666666654


No 29 
>PRK07066 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=37.03  E-value=97  Score=28.73  Aligned_cols=71  Identities=17%  Similarity=0.102  Sum_probs=46.2

Q ss_pred             HHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccc--eee-----------------eccc
Q 027770          140 EITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTS--FVR-----------------QLEP  200 (219)
Q Consensus       140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~s--fVR-----------------QLeP  200 (219)
                      +..+.||+.+-+-.++-+-|..+..-+++       ..+++-||-+||+ |-..  |.-                 ++-|
T Consensus        79 ~av~~aDlViEavpE~l~vK~~lf~~l~~-------~~~~~aIlaSnTS-~l~~s~la~~~~~p~R~~g~HffnP~~~~p  150 (321)
T PRK07066         79 ACVADADFIQESAPEREALKLELHERISR-------AAKPDAIIASSTS-GLLPTDFYARATHPERCVVGHPFNPVYLLP  150 (321)
T ss_pred             HHhcCCCEEEECCcCCHHHHHHHHHHHHH-------hCCCCeEEEECCC-ccCHHHHHHhcCCcccEEEEecCCccccCc
Confidence            55577998776655566667777777776       5666777777777 5321  221                 2334


Q ss_pred             ceee----cCCHHHHHhhhhcc
Q 027770          201 DWHI----DTNPEIVSQLAVHL  218 (219)
Q Consensus       201 ~lHI----Dtd~~vv~~L~rfv  218 (219)
                      .+=|    +|+++++..+..|+
T Consensus       151 LVEVv~g~~T~~e~~~~~~~f~  172 (321)
T PRK07066        151 LVEVLGGERTAPEAVDAAMGIY  172 (321)
T ss_pred             eEEEeCCCCCCHHHHHHHHHHH
Confidence            4433    79999998887764


No 30 
>PF10389 CoatB:  Bacteriophage coat protein B ;  InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 2IFO_A.
Probab=36.95  E-value=26  Score=24.64  Aligned_cols=11  Identities=9%  Similarity=0.241  Sum_probs=8.7

Q ss_pred             HHHHHHhhcCC
Q 027770           27 IVFTWRMFRSP   37 (219)
Q Consensus        27 ~v~~w~~lr~~   37 (219)
                      .+..||++|.+
T Consensus        35 ~i~v~kwiRra   45 (46)
T PF10389_consen   35 GIAVYKWIRRA   45 (46)
T ss_dssp             HHHHHHHHHHT
T ss_pred             HHHHHHHHHhh
Confidence            57889999854


No 31 
>cd08565 GDPD_pAtGDE_like Glycerophosphodiester phosphodiesterase domain of putative Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in putative Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase (pAtGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=36.21  E-value=68  Score=27.65  Aligned_cols=59  Identities=17%  Similarity=0.215  Sum_probs=40.2

Q ss_pred             HHHHHHhhcCceEEEEEeechh-------hHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          136 EALLEITKFCDLYLMERVLDNE-------SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       136 evLleLsk~~DlYLm~~V~dD~-------eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      +-++++.+..+++|..-++++.       -++.+.++++++|+.       ++++|.|-......-+|++ |.+
T Consensus        85 ~evl~~~~~~~~~l~iEiK~~~~~~~~~~~~~~v~~~i~~~~~~-------~~v~~~Sf~~~~l~~~~~~-p~~  150 (235)
T cd08565          85 EEVLALFAPSGLELHVEIKTDADGTPYPGAAALAAATLRRHGLL-------ERSVLTSFDPAVLTEVRKH-PGV  150 (235)
T ss_pred             HHHHHHhhccCcEEEEEECCCCCCCccHHHHHHHHHHHHhCCCc-------CCEEEEECCHHHHHHHHhC-CCC
Confidence            3344555557787878887653       246788888886542       5788888877777777777 753


No 32 
>PF03672 UPF0154:  Uncharacterised protein family (UPF0154);  InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=35.42  E-value=45  Score=24.91  Aligned_cols=19  Identities=26%  Similarity=0.504  Sum_probs=14.1

Q ss_pred             HHHHHHHHHHHHHHHHhhc
Q 027770           17 VGAIAGLALAIVFTWRMFR   35 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~lr   35 (219)
                      +|+++||.+|--+-=|+|+
T Consensus         9 ~G~~~Gff~ar~~~~k~l~   27 (64)
T PF03672_consen    9 VGAVIGFFIARKYMEKQLK   27 (64)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            6777777777777777776


No 33 
>PF12732 YtxH:  YtxH-like protein;  InterPro: IPR024623 This family of uncharacterised proteins is found in bacteria. Proteins in this family are typically between 100 and 143 amino acids in length. The N-terminal region is the most conserved.
Probab=34.17  E-value=29  Score=25.08  Aligned_cols=14  Identities=43%  Similarity=0.714  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHH
Q 027770           17 VGAIAGLALAIVFT   30 (219)
Q Consensus        17 ~~~~ag~a~a~v~~   30 (219)
                      +|+++|.++++.|+
T Consensus         7 ~Ga~~Ga~~glL~a   20 (74)
T PF12732_consen    7 AGAAAGAAAGLLFA   20 (74)
T ss_pred             HHHHHHHHHHHHhC
Confidence            56677777666653


No 34 
>PF00891 Methyltransf_2:  O-methyltransferase;  InterPro: IPR001077 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. This domain includes a range of O-methyltransferases some of which utilise S-adenosyl methionine as substrate []. In prokaryotes, the major role of DNA methylation is to protect host DNA against degradation by restriction enzymes. In eukaryotes, DNA methylation has been implicated in the control of several cellular processes, including differentiation, gene regulation, and embryonic development. O-methyltransferases have a common catalytic domain structure, which might be universal among S-adenosyl-L-methionine (AdoMet)-dependent methyltransferases [].  Comparative analysis of the predicted amino acid sequences of a number of plant O-methyltransferase cDNA clones show that they share some 32-71% sequence identity, and can be grouped according to the different compounds they utilise as substrates [].; GO: 0008171 O-methyltransferase activity; PDB: 1FPQ_A 1FP1_D 3P9K_B 3P9I_D 3P9C_A 3I53_A 3I5U_A 3I64_A 3I58_A 1ZG3_A ....
Probab=32.69  E-value=35  Score=28.81  Aligned_cols=28  Identities=25%  Similarity=0.470  Sum_probs=22.9

Q ss_pred             HHhhcCceEEEEEeechhhHHHHHHHHHH
Q 027770          140 EITKFCDLYLMERVLDNESEKKVLLALET  168 (219)
Q Consensus       140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~  168 (219)
                      .+-. +|+|+|.+|..|=.++.+..+|++
T Consensus       155 ~~P~-~D~~~l~~vLh~~~d~~~~~iL~~  182 (241)
T PF00891_consen  155 PLPV-ADVYLLRHVLHDWSDEDCVKILRN  182 (241)
T ss_dssp             CCSS-ESEEEEESSGGGS-HHHHHHHHHH
T ss_pred             hhcc-ccceeeehhhhhcchHHHHHHHHH
Confidence            3444 899999999999999999999985


No 35 
>TIGR03593 yidC_nterm membrane protein insertase, YidC/Oxa1 family, N-terminal domain. Essentially all bacteria have a member of the YidC family, whose C-terminal domain is modeled by TIGR03592. The two copies are found in endospore-forming bacteria such as Bacillus subtilis appear redundant during vegetative growth, although the member designated spoIIIJ (stage III sporulation protein J) has a distinct role in spore formation. YidC, its mitochondrial homolog Oxa1, and its chloroplast homolog direct insertion into the bacterial/organellar inner (or only) membrane. This model describes an N-terminal sequence region, including a large periplasmic domain lacking in YidC members from Gram-positive species. The multifunctional YidC protein acts both with and independently of the Sec system.
Probab=32.23  E-value=99  Score=28.21  Aligned_cols=19  Identities=26%  Similarity=0.712  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHhhcCCCCC
Q 027770           22 GLALAIVFTWRMFRSPGAP   40 (219)
Q Consensus        22 g~a~a~v~~w~~lr~~~~~   40 (219)
                      .|.+.+++.|.++..+..+
T Consensus         9 ~ls~~~~~~w~~~~~~~~~   27 (366)
T TIGR03593         9 ALSFVIFLLWQAWQSDPGP   27 (366)
T ss_pred             HHHHHHHHHHHHHHhhccC
Confidence            3566688999999866543


No 36 
>PF13844 Glyco_transf_41:  Glycosyl transferase family 41; PDB: 3PE4_C 3PE3_D 3TAX_C 2XGO_A 2JLB_B 2XGM_A 2VSY_B 2XGS_B 2VSN_A.
Probab=30.68  E-value=1.1e+02  Score=30.26  Aligned_cols=74  Identities=18%  Similarity=0.276  Sum_probs=47.7

Q ss_pred             ccceeehhHHHHHHHHhhc---CceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          126 KQVTVKSSVLEALLEITKF---CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       126 ~~a~v~esalevLleLsk~---~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      ..+++.++.+++..+|.+-   +-+.|+.  .....|+.+...+++      .|+..+||+|.-...-..++-|.-.-|+
T Consensus       293 ~~~KI~p~~l~~W~~IL~~vP~S~L~L~~--~~~~~~~~l~~~~~~------~Gv~~~Ri~f~~~~~~~ehl~~~~~~DI  364 (468)
T PF13844_consen  293 NLFKISPETLDLWARILKAVPNSRLWLLR--FPASGEARLRRRFAA------HGVDPDRIIFSPVAPREEHLRRYQLADI  364 (468)
T ss_dssp             -GGG--HHHHHHHHHHHHHSTTEEEEEEE--TSTTHHHHHHHHHHH------TTS-GGGEEEEE---HHHHHHHGGG-SE
T ss_pred             ccccCCHHHHHHHHHHHHhCCCcEEEEee--CCHHHHHHHHHHHHH------cCCChhhEEEcCCCCHHHHHHHhhhCCE
Confidence            4677889999999998887   3444443  345567888888887      4778899999887665566666677888


Q ss_pred             eecCC
Q 027770          203 HIDTN  207 (219)
Q Consensus       203 HIDtd  207 (219)
                      ..|+-
T Consensus       365 ~LDT~  369 (468)
T PF13844_consen  365 CLDTF  369 (468)
T ss_dssp             EE--S
T ss_pred             EeeCC
Confidence            88864


No 37 
>PF01751 Toprim:  Toprim domain;  InterPro: IPR006171 This is a conserved region from DNA primase. This corresponds to the Toprim (topoisomerase-primase) domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR/M DNA repair proteins []. Both DnaG motifs IV and V are present in the alignment, the DxD (V) motif may be involved in Mg2+ binding and mutations to the conserved glutamate (IV) completely abolish DnaG type primase activity. DNA primase 2.7.7.6 from EC is a nucleotidyltransferase it synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork; it can also prime the leading stand and has been implicated in cell division []. This family also includes the atypical archaeal A subunit from type II DNA topoisomerases []. Type II DNA topoisomerases catalyse the relaxation of DNA supercoiling by causing transient double strand breaks.; PDB: 2ZJT_A 3IG0_A 3M4I_A 3NUH_B 1GKU_B 1GL9_C 3PWT_A 1CY4_A 1ECL_A 1CY7_A ....
Probab=30.11  E-value=2.3e+02  Score=20.98  Aligned_cols=75  Identities=20%  Similarity=0.242  Sum_probs=50.4

Q ss_pred             CchHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHH
Q 027770           91 PTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALE  167 (219)
Q Consensus        91 ~t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le  167 (219)
                      ++-..-+++=|++..-..+++.|-+++-..|++-.+  .-..+.++.|.++.+.+|-.+++.=.|.+.|.-...+++
T Consensus         8 ps~a~~i~~~l~~~~~~v~~~~Ghl~~~~~~~~~~~--~~~~~~i~~l~~~~~~~~~iiiatD~D~EGe~Ia~~i~~   82 (100)
T PF01751_consen    8 PSDAKAIAKALGGEEYIVIATSGHLLELAKPEDYDP--KDKKKQIKNLKKLLKKADEIIIATDPDREGELIAWEIIE   82 (100)
T ss_dssp             HHHHHHHHHHSSTTTEEEEEESSSSEESTTSSHHHC--HTTHHHHHHHHHHHHSCSEEEEEC-SSHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHcCCCCEEEEEeCCccccccccccccc--ccccccchhhHHHhhhccEeeecCCCChHHHHHHHHHHH
Confidence            334445555555666888999999999888776544  344566788888888788888787666666644444444


No 38 
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.94  E-value=89  Score=21.68  Aligned_cols=30  Identities=20%  Similarity=0.131  Sum_probs=25.8

Q ss_pred             hhcCceEEEEEeechhhHHHHHHHHHHcCC
Q 027770          142 TKFCDLYLMERVLDNESEKKVLLALETAGV  171 (219)
Q Consensus       142 sk~~DlYLm~~V~dD~eE~~Vl~~Le~~Gl  171 (219)
                      .+.+.+.+...+.+.++-+.+++.|++.|.
T Consensus        35 ~~~~~v~v~ie~~~~~~~~~i~~~L~~~G~   64 (68)
T cd04885          35 GDEARVLVGIQVPDREDLAELKERLEALGY   64 (68)
T ss_pred             CCceEEEEEEEeCCHHHHHHHHHHHHHcCC
Confidence            456888888999998899999999999774


No 39 
>PRK09260 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=28.98  E-value=1.4e+02  Score=26.25  Aligned_cols=42  Identities=14%  Similarity=0.022  Sum_probs=28.1

Q ss_pred             HHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecC
Q 027770          140 EITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTE  188 (219)
Q Consensus       140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~  188 (219)
                      +..+.||+.+.|--.+.+-++.+..-+++       .+.++.++...|+
T Consensus        77 ~~~~~aD~Vi~avpe~~~~k~~~~~~l~~-------~~~~~~il~~~tS  118 (288)
T PRK09260         77 AAVADADLVIEAVPEKLELKKAVFETADA-------HAPAECYIATNTS  118 (288)
T ss_pred             HhhcCCCEEEEeccCCHHHHHHHHHHHHh-------hCCCCcEEEEcCC
Confidence            55678999998877666667777766655       3455666644343


No 40 
>cd08566 GDPD_AtGDE_like Glycerophosphodiester phosphodiesterase domain of Agrobacterium tumefaciens and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase (AtGDE, EC 3.1.4.46) and its uncharacterized eukaryotic homolgoues. Members in this family shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. AtGDE exists as a hexamer that is a trimer of dimers, which is unique among current known GDPD family members. However, it remains unclear if the hexamer plays a physiological role in AtGDE enzymatic function.
Probab=28.70  E-value=1.1e+02  Score=26.41  Aligned_cols=48  Identities=23%  Similarity=0.318  Sum_probs=37.8

Q ss_pred             eEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          147 LYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       147 lYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      +++..-+++. ..+.+.+++++.|.       ..||+|.|-.......+|++.|.+
T Consensus       101 ~~l~iEiK~~-~~~~~~~~~~~~~~-------~~~v~~~sf~~~~l~~~~~~~p~~  148 (240)
T cd08566         101 ILLNLDLKDA-DLDEVIALVKKHGA-------LDQVIFKSYSEEQAKELRALAPEV  148 (240)
T ss_pred             cEEEEEECch-HHHHHHHHHHHcCC-------cccEEEEECCHHHHHHHHHhCCCC
Confidence            7777888865 56778899998664       258999999888888888888875


No 41 
>PF06295 DUF1043:  Protein of unknown function (DUF1043);  InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=28.63  E-value=53  Score=26.53  Aligned_cols=14  Identities=21%  Similarity=0.655  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHH
Q 027770           17 VGAIAGLALAIVFT   30 (219)
Q Consensus        17 ~~~~ag~a~a~v~~   30 (219)
                      ||.|+|++|+++..
T Consensus         4 i~lvvG~iiG~~~~   17 (128)
T PF06295_consen    4 IGLVVGLIIGFLIG   17 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34455555544443


No 42 
>PHA00350 putative assembly protein
Probab=28.36  E-value=93  Score=30.13  Aligned_cols=24  Identities=21%  Similarity=0.342  Sum_probs=13.9

Q ss_pred             ceEEeeecCcCccceeeecccceee
Q 027770          180 DKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       180 HKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      ++.+.|. ..-|.+-.|+=+|..|+
T Consensus       371 ~~~vtC~-~~~~~~~~~~~~~~~~~  394 (399)
T PHA00350        371 SRVVTCI-PRSRFAQQKAQTPPDHV  394 (399)
T ss_pred             eeEEEeC-cccccchhhccCCcccc
Confidence            6778887 33345555555555554


No 43 
>PF13940 Ldr_toxin:  Toxin Ldr, type I toxin-antitoxin system
Probab=28.13  E-value=42  Score=22.59  Aligned_cols=26  Identities=23%  Similarity=0.520  Sum_probs=19.8

Q ss_pred             ehhhhhcCCCchHHHHHHHHHHHHHHHH
Q 027770            5 LTLLSWYQDSRSVGAIAGLALAIVFTWR   32 (219)
Q Consensus         5 ~~~~~~~~~~~s~~~~ag~a~a~v~~w~   32 (219)
                      +++.|||-=+-  -.|||++.+++..|.
T Consensus         6 lg~~~WhDLAA--P~iagIi~s~iv~w~   31 (35)
T PF13940_consen    6 LGIAFWHDLAA--PIIAGIIASLIVGWL   31 (35)
T ss_pred             HHHHHHHHhHh--HHHHHHHHHHHHHHH
Confidence            46788886444  358999999999985


No 44 
>cd04906 ACT_ThrD-I_1 First of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes the first of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=27.48  E-value=1e+02  Score=22.55  Aligned_cols=30  Identities=23%  Similarity=0.257  Sum_probs=25.5

Q ss_pred             hcCceEEEEEeec-hhhHHHHHHHHHHcCCC
Q 027770          143 KFCDLYLMERVLD-NESEKKVLLALETAGVF  172 (219)
Q Consensus       143 k~~DlYLm~~V~d-D~eE~~Vl~~Le~~GlF  172 (219)
                      ..+++.+..++.+ .++-+.+++.|++.|..
T Consensus        38 ~~~~v~i~ie~~~~~~~~~~i~~~L~~~G~~   68 (85)
T cd04906          38 KDAHIFVGVSVANGAEELAELLEDLKSAGYE   68 (85)
T ss_pred             CeeEEEEEEEeCCcHHHHHHHHHHHHHCCCC
Confidence            4678888899998 88899999999997753


No 45 
>cd08559 GDPD_periplasmic_GlpQ_like Periplasmic glycerophosphodiester phosphodiesterase domain (GlpQ) and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in bacterial and eukaryotic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) similar to Escherichia coli periplasmic phosphodiesterase GlpQ. GP-GDEs are involved in glycerol metabolism and catalyze the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. In E. coli, there are two major G3P uptake systems: Glp and Ugp, which contain genes coding for two different GP-GDEs. GlpQ gene from the glp operon codes for a periplasmic phosphodiesterase GlpQ. GlpQ is a dimeric enzyme that hydrolyzes periplasmic glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG), glycerophosphoinositol (GPI), 
Probab=27.40  E-value=1.1e+02  Score=27.37  Aligned_cols=53  Identities=23%  Similarity=0.197  Sum_probs=40.1

Q ss_pred             ceEEEEEeechhh--------HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccce
Q 027770          146 DLYLMERVLDNES--------EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDW  202 (219)
Q Consensus       146 DlYLm~~V~dD~e--------E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~l  202 (219)
                      .+.|..-++++..        ++.++++++++|+-+    ..++|++.|=.-.-..-+|++.|.+
T Consensus       128 ~~~l~IEiK~~~~~~~~~~~~~~~v~~~l~~~~~~~----~~~~v~i~SF~~~~L~~~r~~~p~~  188 (296)
T cd08559         128 NVGIYPETKHPTFHKQEGPDIEEKLLEVLKKYGYTG----KNDPVFIQSFEPESLKRLRNETPDI  188 (296)
T ss_pred             cceEEEEecChhhhhhcCCCHHHHHHHHHHHcCCCC----CCCCEEEecCCHHHHHHHHHhCCCC
Confidence            6777777876543        478999999876532    2479999998888888889988864


No 46 
>PRK01294 lipase chaperone; Provisional
Probab=27.26  E-value=3.7e+02  Score=25.31  Aligned_cols=21  Identities=10%  Similarity=0.023  Sum_probs=13.9

Q ss_pred             HHhhhhhc-CCCCCchHHHHHH
Q 027770           79 TQIADALF-QPVKPTLGQVVRQ   99 (219)
Q Consensus        79 ~~~~~~~~-~~~~~t~~qiv~~   99 (219)
                      ++..|-|+ .-+..++.+|..+
T Consensus        89 Rd~FDYfLs~~gE~~l~~i~~~  110 (336)
T PRK01294         89 RDFFDYFLSALGELDLAAIDAL  110 (336)
T ss_pred             HHHHHHHhhccCCCCHHHHHHH
Confidence            45566666 4677788877754


No 47 
>PRK11677 hypothetical protein; Provisional
Probab=26.80  E-value=43  Score=27.90  Aligned_cols=18  Identities=28%  Similarity=0.401  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHHHhhcC
Q 027770           19 AIAGLALAIVFTWRMFRS   36 (219)
Q Consensus        19 ~~ag~a~a~v~~w~~lr~   36 (219)
                      +++||.|++|.+|...|-
T Consensus         6 a~i~livG~iiG~~~~R~   23 (134)
T PRK11677          6 ALIGLVVGIIIGAVAMRF   23 (134)
T ss_pred             HHHHHHHHHHHHHHHHhh
Confidence            456677777777777774


No 48 
>cd08570 GDPD_YPL206cp_fungi Glycerophosphodiester phosphodiesterase domain of Saccharomyces cerevisiae YPL206cp and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Saccharomyces cerevisiae YPL206cp and uncharacterized hypothetical homologs existing in fungi. The product of S. cerevisiae ORF YPL206c (PGC1), YPL206cp (Pgc1p), displays homology to bacterial and mammalian glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46), which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. S. cerevisiae YPL206cp is an integral membrane protein with a single GDPD domain following by a short hydrophobic C-terminal tail that may function as a membrane anchor. This protein plays an essential role in the regulation of the cardiolipin (CL) biosynthetic pathway in yeast by removing the excess phosphatidylglycerol (PG) content of membranes via a phospholip
Probab=26.38  E-value=1.2e+02  Score=25.74  Aligned_cols=64  Identities=14%  Similarity=0.144  Sum_probs=41.8

Q ss_pred             HHHHHHHhhcC---ceEEEEEeechhh----HHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccc
Q 027770          135 LEALLEITKFC---DLYLMERVLDNES----EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPD  201 (219)
Q Consensus       135 levLleLsk~~---DlYLm~~V~dD~e----E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~  201 (219)
                      ++-++++.+..   ++++..-++++..    ++.|..++++++..+  + ..++|+|.|-...-..-+|++.|.
T Consensus        85 L~evl~~~~~~~~~~~~l~iEiK~~~~~~~~~~~v~~~i~~~~~~~--~-~~~~v~i~Sf~~~~l~~l~~~~p~  155 (234)
T cd08570          85 LKDVLEWLVEHELPDVKLMLDIKRDNDPEILFKLIAEMLAVKPDLD--F-WRERIILGLWHLDFLKYGKEVLPG  155 (234)
T ss_pred             HHHHHHHHHhcCCCCeEEEEEECCCCCHHHHHHHHHHHHHhcCCcc--c-ccCCEEEEeCCHHHHHHHHHhCCC
Confidence            34444555544   8899999997643    356777777755332  1 347899988666666666777665


No 49 
>PF08370 PDR_assoc:  Plant PDR ABC transporter associated;  InterPro: IPR013581 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain [].  The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This domain is found on the C terminus of ABC-2 type transporter domains (IPR013525 from INTERPRO). It seems to be associated with the plant pleiotropic drug resistance (PDR) protein family of ABC transporters. Like in yeast, plant PDR ABC transporters may also play a role in the transport of antifungal agents [] (see also IPR010929 from INTERPRO). The PDR family is characterised by a configuration in which the ABC domain is nearer the N terminus of the protein than the transmembrane domain []. 
Probab=26.15  E-value=96  Score=22.99  Aligned_cols=26  Identities=19%  Similarity=0.385  Sum_probs=17.9

Q ss_pred             chHHHHHHHHH----HHHHHHHhhcCCCCC
Q 027770           15 RSVGAIAGLAL----AIVFTWRMFRSPGAP   40 (219)
Q Consensus        15 ~s~~~~ag~a~----a~v~~w~~lr~~~~~   40 (219)
                      .++||++|+++    ...++..+|..++.+
T Consensus        31 IgvgaL~G~~vlFNil~~laL~yL~p~~k~   60 (65)
T PF08370_consen   31 IGVGALLGFIVLFNILFTLALTYLNPLGKS   60 (65)
T ss_pred             ehHHHHHHHHHHHHHHHHHHHHhcCCcCCC
Confidence            36899999886    345566777766543


No 50 
>PF06295 DUF1043:  Protein of unknown function (DUF1043);  InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=25.98  E-value=44  Score=26.99  Aligned_cols=22  Identities=23%  Similarity=0.401  Sum_probs=18.4

Q ss_pred             HHHHHHHHHHHHHHhhcCCCCC
Q 027770           19 AIAGLALAIVFTWRMFRSPGAP   40 (219)
Q Consensus        19 ~~ag~a~a~v~~w~~lr~~~~~   40 (219)
                      +++||.|+++.+|...|.-.+.
T Consensus         2 ~~i~lvvG~iiG~~~~r~~~~~   23 (128)
T PF06295_consen    2 AIIGLVVGLIIGFLIGRLTSSN   23 (128)
T ss_pred             hHHHHHHHHHHHHHHHHHhccc
Confidence            6899999999999999955433


No 51 
>COG4980 GvpP Gas vesicle protein [General function prediction only]
Probab=25.86  E-value=47  Score=27.29  Aligned_cols=14  Identities=43%  Similarity=0.824  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHHHHH
Q 027770           17 VGAIAGLALAIVFT   30 (219)
Q Consensus        17 ~~~~ag~a~a~v~~   30 (219)
                      +|+|+|.+.|+.|+
T Consensus        13 iGgiiGa~aaLL~A   26 (115)
T COG4980          13 IGGIIGAAAALLFA   26 (115)
T ss_pred             HHHHHHHHHHHHhC
Confidence            68888888887774


No 52 
>cd06660 Aldo_ket_red Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
Probab=25.72  E-value=84  Score=26.76  Aligned_cols=86  Identities=20%  Similarity=0.212  Sum_probs=51.7

Q ss_pred             CCCchHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHH-hhcCceEEEEEeechhh-HHHHHHHH
Q 027770           89 VKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEI-TKFCDLYLMERVLDNES-EKKVLLAL  166 (219)
Q Consensus        89 ~~~t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleL-sk~~DlYLm~~V~dD~e-E~~Vl~~L  166 (219)
                      ....++++.++.- .-.++.|+++.-......  .....-.++.++.+.|-+| ...-|+|++-....+.. ...+.++|
T Consensus        57 sE~~lG~al~~~~-~R~~~~i~tK~~~~~~~~--~~~~~~~~~~~l~~sL~~L~~~~iDl~~lh~~~~~~~~~~~~~~~l  133 (285)
T cd06660          57 SEELLGEALKERG-PREEVFIATKVGPRPGDG--RDLSPEHIRRAVEESLKRLGTDYIDLYLLHWPDPDTPDIEETLRAL  133 (285)
T ss_pred             CHHHHHHHHhccC-CcCcEEEEeeecCCCCCC--CCCCHHHHHHHHHHHHHHhCCCceeEEEecCCCCCCCCHHHHHHHH
Confidence            3455666665532 234788888854322110  1112334677888888888 56789999998876654 45555555


Q ss_pred             H---HcCCCCCCCC
Q 027770          167 E---TAGVFTSGGL  177 (219)
Q Consensus       167 e---~~GlF~~gGL  177 (219)
                      +   +.|.+..-|+
T Consensus       134 ~~l~~~G~ir~iGv  147 (285)
T cd06660         134 EELVKEGKIRAIGV  147 (285)
T ss_pred             HHHHHcCCccEEEe
Confidence            5   6666553344


No 53 
>PF00998 RdRP_3:  Viral RNA dependent RNA polymerase;  InterPro: IPR002166 The RNA dependent RNA polymerase is also known as non-structural protein NS5B. NS5B is a 65 kDa protein that resembles other viral RNA polymerases. Hepatitis C virus (HCV) replication is thought to occur in membrane bound replication complexes. These complexes transcribe the positive strand and the resulting minus strand is used as a template for the synthesis of genomic RNA. There are two viral proteins involved in the reaction, NS3 and NS5B [, , ].; GO: 0003723 RNA binding, 0003968 RNA-directed RNA polymerase activity, 0005524 ATP binding, 0006351 transcription, DNA-dependent, 0019079 viral genome replication; PDB: 2XYM_A 3I5K_D 4AEX_A 4E7A_A 2XXD_A 4E78_A 4AEP_A 4E76_A 1S4F_D 1S48_A ....
Probab=25.63  E-value=75  Score=31.53  Aligned_cols=123  Identities=22%  Similarity=0.213  Sum_probs=64.2

Q ss_pred             CCchhHH--HHhhhhhcCCCCCchHHHHHHHhcCceeEEEeecceeecc-------CChhhhhccceeehhHHHHHHHHh
Q 027770           72 SSSEDLR--TQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEE-------SCPEELQKQVTVKSSVLEALLEIT  142 (219)
Q Consensus        72 ~~~~~~~--~~~~~~~~~~~~~t~~qiv~~kls~~RkvTis~~gvvlee-------~sp~el~~~a~v~esalevLleLs  142 (219)
                      -+.+|+|  +.+....|.+.. .+.++...||--+-.|+.+=--+.+..       .-+..+-|..+.--.+......+-
T Consensus       216 Vs~~~l~~E~~iY~~~~~~~~-~~~~~L~~~l~~~G~~~~~~G~~~y~~~gcR~SGd~~TS~GN~l~~~~~~~a~~~~~g  294 (486)
T PF00998_consen  216 VSEEDLRFEHSIYQCCFLDPE-ELIKLLTERLYNGGPMFNSDGQVRYGYRGCRMSGDMNTSLGNCLTCYLKAYAACRAAG  294 (486)
T ss_dssp             --HHHHHHHHHHHTTSBEGHH-HHHHHHHHHTTTEEEEEETTSE-EEEEESS--TTSTTHHHHHHHHHHHHHHHHHHHHT
T ss_pred             CchhhhhhhheehhcccCCch-hhhccchhceeeeEEEEecCCCeeeccCCcccCCCCCcccccHHHHHHHHHHHHhhcC
Confidence            4567888  666666664333 888999988865544443321111110       001122222222223334444433


Q ss_pred             hcC-------ceEEEEEeec-hhhHHHHHHHHHHcCCCCCCCCcc------ceEEeeecCc----Cccceeeec
Q 027770          143 KFC-------DLYLMERVLD-NESEKKVLLALETAGVFTSGGLVK------DKVLFCSTEI----GRTSFVRQL  198 (219)
Q Consensus       143 k~~-------DlYLm~~V~d-D~eE~~Vl~~Le~~GlF~~gGL~r------HKVLFCST~~----Gr~sfVRQL  198 (219)
                      +.|       |.+|||--.+ |++++....++.++| |.  |..+      ++|=|||+-=    +.--|+|..
T Consensus       295 ~~~~l~~~GDD~vvi~E~~~~~~~~~~l~~~~~~~G-f~--~~~e~p~y~lE~iefCq~~pv~~~~~~~mvr~~  365 (486)
T PF00998_consen  295 KDCSLLNCGDDCVVICESAGVDEDEAALTEAFTRYG-FP--MKVEKPVYELELIEFCQSNPVFDGDGWGMVRYP  365 (486)
T ss_dssp             EEEEEEEETTEEEEEEEHCHHHHHHHHHHHHHHHTT--B--SSSCEEBSSGGG-EBTTBEEEEEEETTS-EEEE
T ss_pred             CCccEeecCCceEEEecccchHHHHHHHHHHHHhcC-CC--CCCCCcccccccccccCceeeeccCCCcccccc
Confidence            333       7888887766 455567888899999 87  3433      6788888742    233466653


No 54 
>PF00248 Aldo_ket_red:  Aldo/keto reductase family;  InterPro: IPR023210 The aldo-keto reductase family includes a number of related monomeric NADPH-dependent oxidoreductases, such as aldehyde reductase, aldose reductase, prostaglandin F synthase, xylose reductase, rho crystallin, and many others []. All possess a similar structure, with a beta-alpha-beta fold characteristic of nucleotide binding proteins []. The fold comprises a parallel beta-8/alpha-8-barrel, which contains a novel NADP-binding motif. The binding site is located in a large, deep, elliptical pocket in the C-terminal end of the beta sheet, the substrate being bound in an extended conformation. The hydrophobic nature of the pocket favours aromatic and apolar substrates over highly polar ones []. Binding of the NADPH coenzyme causes a massive conformational change, reorienting a loop, effectively locking the coenzyme in place. This binding is more similar to FAD- than to NAD(P)-binding oxidoreductases [].  Some proteins of this entry contain a K+ ion channel beta chain regulatory domain; these are reported to have oxidoreductase activity [].  This entry represents the NADP-dependent oxidoreductase domain found in these proteins.; PDB: 1C9W_A 4F40_B 1VBJ_A 1XGD_A 1X97_A 2ACS_A 1EF3_A 2ACU_A 1PWM_A 2NVD_A ....
Probab=25.48  E-value=54  Score=27.92  Aligned_cols=84  Identities=20%  Similarity=0.238  Sum_probs=53.9

Q ss_pred             chHHHHHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHh-hcCceEEEEEeechhh-HHHHHHHHH--
Q 027770           92 TLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEIT-KFCDLYLMERVLDNES-EKKVLLALE--  167 (219)
Q Consensus        92 t~~qiv~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLs-k~~DlYLm~~V~dD~e-E~~Vl~~Le--  167 (219)
                      .+++..++.-....++.|++..  .....+...-....++++..+.|-+|- ..-|+|++-....+.+ -+.+...|+  
T Consensus        48 ~lg~~l~~~~~~r~~~~i~tK~--~~~~~~~~~~~~~~i~~~~~~sL~~L~~d~iDl~~lH~~~~~~~~~~~~~~~l~~l  125 (283)
T PF00248_consen   48 ILGRALRKSRVPRDDIFISTKV--YGDGKPEPDYSPDSIRESLERSLERLGTDYIDLLLLHWPDPSEDALEEVWEALEEL  125 (283)
T ss_dssp             HHHHHHHHTSSTGGGSEEEEEE--ESSSSTGGGSSHHHHHHHHHHHHHHHTSSSEEEEEESSSSTTSSHHHHHHHHHHHH
T ss_pred             cccccccccccccccccccccc--cccccccccccccccccccccccccccccchhccccccccccccccchhhhhhhhc
Confidence            4677776633445577788875  222233333344567788888888887 4569999988887777 566666555  


Q ss_pred             -HcCCCCCCCC
Q 027770          168 -TAGVFTSGGL  177 (219)
Q Consensus       168 -~~GlF~~gGL  177 (219)
                       +.|.+..-|+
T Consensus       126 ~~~G~ir~iGv  136 (283)
T PF00248_consen  126 KKEGKIRHIGV  136 (283)
T ss_dssp             HHTTSEEEEEE
T ss_pred             ccccccccccc
Confidence             6677664343


No 55 
>COG3086 RseC Positive regulator of sigma E activity [Signal transduction mechanisms]
Probab=25.06  E-value=42  Score=28.89  Aligned_cols=35  Identities=26%  Similarity=0.270  Sum_probs=28.6

Q ss_pred             eeehhhhhcCCCchHHHHHHHHHHHHHHHHhhcCC
Q 027770            3 WVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSP   37 (219)
Q Consensus         3 ~~~~~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~   37 (219)
                      |+...||.+....-+|+++|.+++.+++-++-|--
T Consensus        94 ~La~~L~~~e~~~~~~~~lg~~l~fl~~r~ysRkl  128 (150)
T COG3086          94 ILAQYLFFSELIVIFGAFLGLALGFLLARRYSRKL  128 (150)
T ss_pred             HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            34456788999999999999999988888887743


No 56 
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=24.98  E-value=63  Score=24.90  Aligned_cols=18  Identities=39%  Similarity=0.536  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHHHHHHhh
Q 027770           17 VGAIAGLALAIVFTWRMF   34 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~l   34 (219)
                      .+.++||||.+||+--++
T Consensus        48 st~i~GlaiGfvfA~vLv   65 (73)
T COG4218          48 STRIAGLAIGFVFAGVLV   65 (73)
T ss_pred             hhhhHHHHHHHHHHHHHH
Confidence            467899999999987665


No 57 
>TIGR02863 spore_sspJ small, acid-soluble spore protein, SspJ family. RL J Bacteriol. 1998 Dec;180(24):6704-12.
Probab=24.77  E-value=49  Score=23.42  Aligned_cols=18  Identities=28%  Similarity=0.320  Sum_probs=15.1

Q ss_pred             echhhHHHHHHHHHHcCC
Q 027770          154 LDNESEKKVLLALETAGV  171 (219)
Q Consensus       154 ~dD~eE~~Vl~~Le~~Gl  171 (219)
                      +++.+.+.|..+||++|-
T Consensus        12 ~seKd~n~v~gaLEdAg~   29 (47)
T TIGR02863        12 KSEKDKNAIQGALEDAGQ   29 (47)
T ss_pred             chhhhHHHHHHHHHHHHH
Confidence            467888999999998774


No 58 
>PF09527 ATPase_gene1:  Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=24.07  E-value=90  Score=21.26  Aligned_cols=19  Identities=21%  Similarity=0.569  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHHHHHhhc
Q 027770           17 VGAIAGLALAIVFTWRMFR   35 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~lr   35 (219)
                      +|.+.|++.++.-.||++|
T Consensus        37 ~g~llG~~~g~~~~~~~~k   55 (55)
T PF09527_consen   37 IGLLLGIAAGFYNVYRLVK   55 (55)
T ss_pred             HHHHHHHHHHHHHHHHHhC
Confidence            4555666666667777664


No 59 
>PF09575 Spore_SspJ:  Small spore protein J (Spore_SspJ);  InterPro: IPR014220 This entry represents a group of small acid-soluble proteins (SASP) from Bacillus species, which are present in spores but not in growing cells. The sspJ gene is transcribed in the forespore compartment by RNA polymerase with the forespore-specific sigmaG. Loss of SspJ causes a slight decrease in the rate of spore outgrowth in an otherwise wild-type background [].
Probab=24.00  E-value=51  Score=23.23  Aligned_cols=18  Identities=28%  Similarity=0.167  Sum_probs=15.3

Q ss_pred             echhhHHHHHHHHHHcCC
Q 027770          154 LDNESEKKVLLALETAGV  171 (219)
Q Consensus       154 ~dD~eE~~Vl~~Le~~Gl  171 (219)
                      ++|.+.++|..+||++|-
T Consensus        11 ~seK~~n~v~gaLedA~~   28 (46)
T PF09575_consen   11 NSEKDKNVVQGALEDASK   28 (46)
T ss_pred             cchhhHHHHHHHHHHHHH
Confidence            578899999999998763


No 60 
>PRK14901 16S rRNA methyltransferase B; Provisional
Probab=23.76  E-value=70  Score=30.22  Aligned_cols=53  Identities=21%  Similarity=0.287  Sum_probs=35.2

Q ss_pred             hhhhhccceeehhHHHHHHHHhhcCc--eEEEEEeechhhHHHHHHHHHHcCCCC
Q 027770          121 PEELQKQVTVKSSVLEALLEITKFCD--LYLMERVLDNESEKKVLLALETAGVFT  173 (219)
Q Consensus       121 p~el~~~a~v~esalevLleLsk~~D--lYLm~~V~dD~eE~~Vl~~Le~~GlF~  173 (219)
                      ++++..-.......++...++.+--.  ||--|.+..++.|++|...|++.+-|.
T Consensus       353 ~~~~~~l~~~Q~~iL~~a~~~lkpgG~lvystcsi~~~Ene~~v~~~l~~~~~~~  407 (434)
T PRK14901        353 PEKIQELAPLQAELLESLAPLLKPGGTLVYATCTLHPAENEAQIEQFLARHPDWK  407 (434)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhcCCCCEEEEEeCCCChhhHHHHHHHHHHhCCCcE
Confidence            44444444444555555555555544  566688999999999999999865554


No 61 
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=23.74  E-value=61  Score=25.49  Aligned_cols=8  Identities=38%  Similarity=1.344  Sum_probs=4.8

Q ss_pred             Ceeeehhh
Q 027770            1 MFWVLTLL    8 (219)
Q Consensus         1 ~~~~~~~~    8 (219)
                      |||+++.+
T Consensus         1 ~fWl~~a~    8 (117)
T TIGR03142         1 LFWIVAAL    8 (117)
T ss_pred             CHHHHHHH
Confidence            68865543


No 62 
>TIGR01261 hisB_Nterm histidinol-phosphatase. This model describes histidinol phosphatase. All known examples in the scope of this model are bifunctional proteins with a histidinol phosphatase domain followed by an imidazoleglycerol-phosphate dehydratase domain. These enzymatic domains catalyze the ninth and seventh steps, respectively, of histidine biosynthesis.
Probab=23.34  E-value=2.7e+02  Score=22.79  Aligned_cols=69  Identities=20%  Similarity=0.368  Sum_probs=42.9

Q ss_pred             ecceeeccCC---hhhhhccceeehhHHHHHHHHhhc-CceEEEEEeec-----------hhhHHHHHHHHHHcCCCCCC
Q 027770          111 LLGVILEESC---PEELQKQVTVKSSVLEALLEITKF-CDLYLMERVLD-----------NESEKKVLLALETAGVFTSG  175 (219)
Q Consensus       111 ~~gvvlee~s---p~el~~~a~v~esalevLleLsk~-~DlYLm~~V~d-----------D~eE~~Vl~~Le~~GlF~~g  175 (219)
                      ..|++.++.+   +.+.....++-+.+.+.|..|..- ..+++++--..           ...+..+..+|+..|+-   
T Consensus         8 ~dg~l~~~~~~~~~~~~~~~~~~~pgv~e~L~~L~~~g~~l~IvSN~~g~~~~~~~~~~~~~~~~~~~~~l~~~gl~---   84 (161)
T TIGR01261         8 RDGTLIEEPPSDFQVDALEKLRFEKGVIPALLKLKKAGYKFVMVTNQDGLGTPSFPQADFDGPHNLMLQIFRSQGII---   84 (161)
T ss_pred             CCCCccccCCCccccCCHHHeeECCCHHHHHHHHHHCCCeEEEEeCCccccCCcCCHHHHHHHHHHHHHHHHHCCCc---
Confidence            4566655433   233445678889999999999764 56665543210           02456788888886652   


Q ss_pred             CCccceEEee
Q 027770          176 GLVKDKVLFC  185 (219)
Q Consensus       176 GL~rHKVLFC  185 (219)
                         =+.++||
T Consensus        85 ---fd~ii~~   91 (161)
T TIGR01261        85 ---FDDVLIC   91 (161)
T ss_pred             ---eeEEEEC
Confidence               2567777


No 63 
>PRK07819 3-hydroxybutyryl-CoA dehydrogenase; Validated
Probab=23.27  E-value=2.8e+02  Score=24.77  Aligned_cols=70  Identities=13%  Similarity=0.110  Sum_probs=43.3

Q ss_pred             hhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcC--------------c---------cceeeec
Q 027770          142 TKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG--------------R---------TSFVRQL  198 (219)
Q Consensus       142 sk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~G--------------r---------~sfVRQL  198 (219)
                      .+.||+.+-|-.+|.+-|+.+...|++.       ..+...+++|+.-+              |         ++..+-+
T Consensus        82 ~~~~d~ViEav~E~~~~K~~l~~~l~~~-------~~~~~~il~snTS~~~~~~la~~~~~~~r~~g~hf~~P~~~~~lv  154 (286)
T PRK07819         82 FADRQLVIEAVVEDEAVKTEIFAELDKV-------VTDPDAVLASNTSSIPIMKLAAATKRPGRVLGLHFFNPVPVLPLV  154 (286)
T ss_pred             hCCCCEEEEecccCHHHHHHHHHHHHHh-------hCCCCcEEEECCCCCCHHHHHhhcCCCccEEEEecCCCcccCceE
Confidence            4779998877667777778888877762       21223444444333              2         2223334


Q ss_pred             ccceeecCCHHHHHhhhhcc
Q 027770          199 EPDWHIDTNPEIVSQLAVHL  218 (219)
Q Consensus       199 eP~lHIDtd~~vv~~L~rfv  218 (219)
                      |=..|-.|+++++..+.+|+
T Consensus       155 Elv~~~~T~~~~~~~~~~~~  174 (286)
T PRK07819        155 ELVPTLVTSEATVARAEEFA  174 (286)
T ss_pred             EEeCCCCCCHHHHHHHHHHH
Confidence            44457789999998888773


No 64 
>COG1167 ARO8 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]
Probab=23.13  E-value=92  Score=29.70  Aligned_cols=73  Identities=25%  Similarity=0.428  Sum_probs=39.5

Q ss_pred             ceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCC--CCCCcc-ceEEeeecCcCccceeeecccceee
Q 027770          128 VTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFT--SGGLVK-DKVLFCSTEIGRTSFVRQLEPDWHI  204 (219)
Q Consensus       128 a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~--~gGL~r-HKVLFCST~~Gr~sfVRQLeP~lHI  204 (219)
                      .++-.+-++-|++||.-.|+|+|+   ||-.-+     |.-.+--.  ..++.+ +||+||+|      |.+=|=|-+=|
T Consensus       242 ~tms~~rR~~Ll~lA~~~~~~IIE---DD~y~e-----l~~~~~p~~~l~~ld~~~rViy~gS------FSK~l~PglRl  307 (459)
T COG1167         242 VTMSLERRKALLALAEKYDVLIIE---DDYYGE-----LRYDGPPPPPLKALDAPGRVIYLGS------FSKTLAPGLRL  307 (459)
T ss_pred             CccCHHHHHHHHHHHHHcCCeEEe---eCcchh-----hhcCCCCCCChHhhCCCCCEEEEee------ehhhcccccce
Confidence            344456677788888888888754   443322     11111111  013333 89999986      66666666544


Q ss_pred             c---CCHHHHHhh
Q 027770          205 D---TNPEIVSQL  214 (219)
Q Consensus       205 D---td~~vv~~L  214 (219)
                      -   .++++++.+
T Consensus       308 G~vv~p~~~~~~~  320 (459)
T COG1167         308 GYVVAPPELIEKL  320 (459)
T ss_pred             eeeeCCHHHHHHH
Confidence            3   444444433


No 65 
>PF14155 DUF4307:  Domain of unknown function (DUF4307)
Probab=22.90  E-value=82  Score=24.98  Aligned_cols=26  Identities=15%  Similarity=0.221  Sum_probs=19.1

Q ss_pred             CCCchHHHHHHHHHHHHHHHHhhcCC
Q 027770           12 QDSRSVGAIAGLALAIVFTWRMFRSP   37 (219)
Q Consensus        12 ~~~~s~~~~ag~a~a~v~~w~~lr~~   37 (219)
                      +...-++++++++.+.+++|..+...
T Consensus         4 r~~~~~~~v~~vv~~~~~~w~~~~~~   29 (112)
T PF14155_consen    4 RKLVIAGAVLVVVAGAVVAWFGYSQF   29 (112)
T ss_pred             ceeEehHHHHHHHHHHHHhHhhhhhc
Confidence            44556778888888888888888743


No 66 
>PRK06049 rpl30p 50S ribosomal protein L30P; Reviewed
Probab=22.83  E-value=1.3e+02  Score=25.48  Aligned_cols=64  Identities=6%  Similarity=0.169  Sum_probs=51.4

Q ss_pred             eEEEEEeechhhH-HHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec-CCHHHHHhhh
Q 027770          147 LYLMERVLDNESE-KKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID-TNPEIVSQLA  215 (219)
Q Consensus       147 lYLm~~V~dD~eE-~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID-td~~vv~~L~  215 (219)
                      +|++.+|.+...- ..+...|+..||=..     |+..|-.-.....-|++..+|.+-+. -|.+.++.|-
T Consensus         2 ~~~ViriRg~ig~~~~~r~tL~~LgL~ki-----~~~V~v~~tp~~~GML~kV~~~V~~ge~~~~tv~~Li   67 (154)
T PRK06049          2 MYAVIRIRGRVNVPRDIADTLKMLRLHRV-----NHCVLVPETPSYKGMLQKVKDYVTWGEIDADTLAELL   67 (154)
T ss_pred             cEEEEEecCCCCCCHHHHHHHHHcCCCcC-----CCEEEEeCCHHHHHHHHhhhceeEEeeCchHHHHHHH
Confidence            6888999887543 678888888666554     77788777788999999999999988 4888888873


No 67 
>PF12263 DUF3611:  Protein of unknown function (DUF3611);  InterPro: IPR022051  This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 180 and 205 amino acids in length. There are two completely conserved residues (W and G) that may be functionally important. 
Probab=22.80  E-value=1e+02  Score=26.89  Aligned_cols=21  Identities=19%  Similarity=0.455  Sum_probs=16.2

Q ss_pred             hHHHHHHHHHHHHHHHHhhcC
Q 027770           16 SVGAIAGLALAIVFTWRMFRS   36 (219)
Q Consensus        16 s~~~~ag~a~a~v~~w~~lr~   36 (219)
                      .+++++-+.+.+.|.|+|.|-
T Consensus        62 a~~gl~~l~~si~~~fry~Rl   82 (183)
T PF12263_consen   62 AICGLVALFFSIFWSFRYTRL   82 (183)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            456677777888999999884


No 68 
>PF10031 DUF2273:  Small integral membrane protein (DUF2273);  InterPro: IPR018730  Members of this family of hypothetical bacterial proteins have no known function. 
Probab=22.76  E-value=99  Score=21.76  Aligned_cols=18  Identities=28%  Similarity=0.624  Sum_probs=14.2

Q ss_pred             CCchHHHHHHHHHHHHHH
Q 027770           13 DSRSVGAIAGLALAIVFT   30 (219)
Q Consensus        13 ~~~s~~~~ag~a~a~v~~   30 (219)
                      .-+.+|+++|+.+|+.+-
T Consensus         8 ~~~iiG~~~G~ila~l~l   25 (51)
T PF10031_consen    8 RGKIIGGLIGLILALLIL   25 (51)
T ss_pred             cchHHHHHHHHHHHHHHH
Confidence            346799999999988654


No 69 
>PF12792 CSS-motif:  CSS motif domain associated with EAL ;  InterPro: IPR024744 This domain, with its characteristic highly conserved CSS sequence motif, is found N-terminal to the EAL domain (PF00563 from PFAM), found in many putative cyclic diguanylate phosphodiesterases.
Probab=21.82  E-value=86  Score=24.91  Aligned_cols=34  Identities=18%  Similarity=0.304  Sum_probs=22.2

Q ss_pred             HHHHHHHHHHcCCCCCCCCccceEEeeecCcCcc
Q 027770          159 EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRT  192 (219)
Q Consensus       159 E~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~  192 (219)
                      -.......-..-.+..-|+.++.-++|||--|..
T Consensus        45 ~~~Lr~~~~~~~~ir~i~~~~~g~i~CSS~~G~~   78 (208)
T PF12792_consen   45 LNALRQIVARSPYIRDIGLVRNGRIYCSSLWGDL   78 (208)
T ss_pred             HHHHHHHHHhchhhhheeEEeCCeEEecCCCCCc
Confidence            3344444334444444477779999999999986


No 70 
>TIGR02250 FCP1_euk FCP1-like phosphatase, phosphatase domain. This domain is related to domains found in the human NLI interacting factor-like phosphatases, and together both are detected by the Pfam model pfam03031.
Probab=21.68  E-value=4.1e+02  Score=21.83  Aligned_cols=79  Identities=10%  Similarity=0.049  Sum_probs=44.5

Q ss_pred             ccceeehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeec-------
Q 027770          126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQL-------  198 (219)
Q Consensus       126 ~~a~v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQL-------  198 (219)
                      .....+|.+.+.|-++++.+++++.+    ...++....+++..+.-.  .+-.++|++-+...|  ..++-|       
T Consensus        55 ~~v~~rPgv~efL~~l~~~yel~I~T----~~~~~yA~~vl~~ldp~~--~~F~~ri~~rd~~~~--~~~KdL~~i~~~d  126 (156)
T TIGR02250        55 YLTKLRPFLHEFLKEASKLYEMHVYT----MGTRAYAQAIAKLIDPDG--KYFGDRIISRDESGS--PHTKSLLRLFPAD  126 (156)
T ss_pred             EEEEECCCHHHHHHHHHhhcEEEEEe----CCcHHHHHHHHHHhCcCC--CeeccEEEEeccCCC--CccccHHHHcCCC
Confidence            45678999999999999888888754    233334444444322111  122367776444333  223333       


Q ss_pred             -ccceeecCCHHHHH
Q 027770          199 -EPDWHIDTNPEIVS  212 (219)
Q Consensus       199 -eP~lHIDtd~~vv~  212 (219)
                       .=.+=||.++++-.
T Consensus       127 ~~~vvivDd~~~~~~  141 (156)
T TIGR02250       127 ESMVVIIDDREDVWP  141 (156)
T ss_pred             cccEEEEeCCHHHhh
Confidence             33445888776643


No 71 
>PRK08942 D,D-heptose 1,7-bisphosphate phosphatase; Validated
Probab=21.62  E-value=4.1e+02  Score=21.35  Aligned_cols=45  Identities=13%  Similarity=0.056  Sum_probs=31.7

Q ss_pred             eEEEeecceeeccC-ChhhhhccceeehhHHHHHHHHhhc-CceEEE
Q 027770          106 KVTCRLLGVILEES-CPEELQKQVTVKSSVLEALLEITKF-CDLYLM  150 (219)
Q Consensus       106 kvTis~~gvvlee~-sp~el~~~a~v~esalevLleLsk~-~DlYLm  150 (219)
                      -+-+-+.|+|.++. ...+..++....+.+.+.|.+|.+. ..++++
T Consensus         5 ~~~~d~~~t~~~~~~~~~~~~~~~~~~pgv~e~L~~Lk~~g~~l~I~   51 (181)
T PRK08942          5 AIFLDRDGVINVDSDGYVKSPDEWIPIPGSIEAIARLKQAGYRVVVA   51 (181)
T ss_pred             EEEEECCCCcccCCccccCCHHHeEECCCHHHHHHHHHHCCCEEEEE
Confidence            34566778877776 3444456888999999999999875 344443


No 72 
>cd01657 Ribosomal_L7_archeal_euk Ribosomal protein L7, which is found in archaea and eukaryotes but not in prokaryotes, binds domain II of the 23S rRNA as well as the 5S rRNA and is one of five ribosomal proteins that mediate the interactions 5S rRNA makes with the ribosome.  The eukaryotic L7 members have an N-terminal extension not found in the archeal L7 orthologs.  L7 is closely related to the ribosomal L30 protein found in eukaryotes and prokaryotes.
Probab=21.37  E-value=1.3e+02  Score=25.48  Aligned_cols=63  Identities=17%  Similarity=0.236  Sum_probs=51.6

Q ss_pred             eEEEEEeechhhH-HHHHHHHHHcCCCCCCCCccceEEeeecCcCccceeeecccceeec-CCHHHHHhh
Q 027770          147 LYLMERVLDNESE-KKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHID-TNPEIVSQL  214 (219)
Q Consensus       147 lYLm~~V~dD~eE-~~Vl~~Le~~GlF~~gGL~rHKVLFCST~~Gr~sfVRQLeP~lHID-td~~vv~~L  214 (219)
                      +|++.++.+..+. ..+...|+..||=..     |...|..-.....-|++.++|.+-+. -+.+.++.|
T Consensus         1 l~~Virirg~~g~~~~~r~tL~~LgL~k~-----~~~v~~~~t~~~~gmL~kV~~~V~~ge~~~~tv~~L   65 (159)
T cd01657           1 LYAVVRIRGIVGVPPKIRKTLQLLRLRRI-----NNAVFVKLTKATIGMLKKVEPYVTWGEPNLETLREL   65 (159)
T ss_pred             CEEEEEecCcCCCCHHHHHHHHHcCCCcC-----CCEEEEeCCHHHHHHHHHhhhheEeccCcHHHHHHH
Confidence            5778888887633 678888888776665     88888888899999999999999998 488888877


No 73 
>PLN02545 3-hydroxybutyryl-CoA dehydrogenase
Probab=21.21  E-value=2.2e+02  Score=24.97  Aligned_cols=29  Identities=21%  Similarity=0.244  Sum_probs=22.3

Q ss_pred             HHhhcCceEEEEEeechhhHHHHHHHHHH
Q 027770          140 EITKFCDLYLMERVLDNESEKKVLLALET  168 (219)
Q Consensus       140 eLsk~~DlYLm~~V~dD~eE~~Vl~~Le~  168 (219)
                      +..+.||+.+.|-.++.+.++.+..-+++
T Consensus        79 ~~~~~aD~Vieav~e~~~~k~~v~~~l~~  107 (295)
T PLN02545         79 EELRDADFIIEAIVESEDLKKKLFSELDR  107 (295)
T ss_pred             HHhCCCCEEEEcCccCHHHHHHHHHHHHh
Confidence            34578999998877777778888776665


No 74 
>PRK01844 hypothetical protein; Provisional
Probab=21.20  E-value=1.1e+02  Score=23.42  Aligned_cols=19  Identities=16%  Similarity=0.340  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHHHHHHhhc
Q 027770           17 VGAIAGLALAIVFTWRMFR   35 (219)
Q Consensus        17 ~~~~ag~a~a~v~~w~~lr   35 (219)
                      +|++.||-+|--+-=++|+
T Consensus        16 ~G~~~Gff~ark~~~k~lk   34 (72)
T PRK01844         16 AGVALGFFIARKYMMNYLQ   34 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4555555555555556666


No 75 
>PF05729 NACHT:  NACHT domain
Probab=21.18  E-value=1.3e+02  Score=22.60  Aligned_cols=41  Identities=15%  Similarity=0.150  Sum_probs=30.3

Q ss_pred             CccceEEeeecCcCccceeeecccc--eeec--CCHHHHHhhhhc
Q 027770          177 LVKDKVLFCSTEIGRTSFVRQLEPD--WHID--TNPEIVSQLAVH  217 (219)
Q Consensus       177 L~rHKVLFCST~~Gr~sfVRQLeP~--lHID--td~~vv~~L~rf  217 (219)
                      ....|++.++++.+...+.+.++..  ++|+  +++++-+++..|
T Consensus       118 ~~~~~liit~r~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~  162 (166)
T PF05729_consen  118 PPGVKLIITSRPRAFPDLRRRLKQAQILELEPFSEEDIKQYLRKY  162 (166)
T ss_pred             CCCCeEEEEEcCChHHHHHHhcCCCcEEEECCCCHHHHHHHHHHH
Confidence            4458898888888888888888887  7777  556666666554


No 76 
>KOG2181 consensus LIM domain binding protein LDB1/NLI/CLIM [Transcription]
Probab=21.04  E-value=2.3e+02  Score=27.73  Aligned_cols=31  Identities=16%  Similarity=0.312  Sum_probs=18.0

Q ss_pred             hhhhcCCCchHHHHHHHHHHHHHHHHhhcCCC
Q 027770            7 LLSWYQDSRSVGAIAGLALAIVFTWRMFRSPG   38 (219)
Q Consensus         7 ~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~~   38 (219)
                      |.++|| +-+|.-==-+--.++=-|.-+-.|+
T Consensus       239 lMSrhK-ayalsPRdclKttLFQkwQrMvaPp  269 (415)
T KOG2181|consen  239 LMSRHK-AYALSPRDCLKTTLFQKWQRMVAPP  269 (415)
T ss_pred             HHHhcc-ccCCCHHHHHHHHHHHHhhhhhcCC
Confidence            456676 5555554445555666776665553


No 77 
>smart00550 Zalpha Z-DNA-binding domain in adenosine deaminases. Helix-turn-helix-containing domain. Also known as Zab.
Probab=20.64  E-value=37  Score=24.28  Aligned_cols=53  Identities=28%  Similarity=0.425  Sum_probs=38.2

Q ss_pred             echhhHHHHHHHHHHcCC--CC------CCCCcc---ceEEeeecCcCccceeeecccceeecC
Q 027770          154 LDNESEKKVLLALETAGV--FT------SGGLVK---DKVLFCSTEIGRTSFVRQLEPDWHIDT  206 (219)
Q Consensus       154 ~dD~eE~~Vl~~Le~~Gl--F~------~gGL~r---HKVLFCST~~Gr~sfVRQLeP~lHIDt  206 (219)
                      ..|.-|++|+.+|++.|-  ++      .-|+.+   ||+|+-=-..|.+.+..+--|.|+|..
T Consensus         3 ~~~~~~~~IL~~L~~~g~~~~ta~eLa~~lgl~~~~v~r~L~~L~~~G~V~~~~~~~~~W~i~~   66 (68)
T smart00550        3 TQDSLEEKILEFLENSGDETSTALQLAKNLGLPKKEVNRVLYSLEKKGKVCKQGGTPPLWKLTD   66 (68)
T ss_pred             CchHHHHHHHHHHHHCCCCCcCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEecCCCCCceEeec
Confidence            457778999999998766  54      226666   788887777888777666348888864


No 78 
>PF09587 PGA_cap:  Bacterial capsule synthesis protein PGA_cap;  InterPro: IPR019079  CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein []. 
Probab=20.50  E-value=4.3e+02  Score=22.71  Aligned_cols=97  Identities=20%  Similarity=0.250  Sum_probs=58.2

Q ss_pred             HHHHhcCceeEEEeecceeeccCChhhhhccceeehhHHHHHHHHhhcCceEEEE--EeechhhH--HHHHHHHHHcCCC
Q 027770           97 VRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLME--RVLDNESE--KKVLLALETAGVF  172 (219)
Q Consensus        97 v~~kls~~RkvTis~~gvvlee~sp~el~~~a~v~esalevLleLsk~~DlYLm~--~V~dD~eE--~~Vl~~Le~~GlF  172 (219)
                      ++.-|.+.--+.+.+|+.+..+..+..-...+...++.++.|..+-  .|+.=++  |+.|--.+  ..-++.|+++|+-
T Consensus        30 v~~~l~~aD~~~~NlE~~v~~~~~~~~~~~~f~~~~~~~~~L~~~G--~d~vslANNH~~D~G~~gl~~Tl~~L~~~gi~  107 (250)
T PF09587_consen   30 VKPLLQSADLVVANLETPVTDSGQPASGYPHFNAPPEILDALKDAG--FDVVSLANNHIFDYGEEGLLDTLEALDKAGIP  107 (250)
T ss_pred             HHHHHhhCCEEEEEeeecCcCCCCcCCCcceecCCHHHHHHHHHcC--CCEEEecCCCCccccHHHHHHHHHHHHHCCCc
Confidence            4456667778899999999776554442234444556666654321  4655555  66664444  5567788899988


Q ss_pred             CCC-CCcc---ceEEeeecCcCcccee
Q 027770          173 TSG-GLVK---DKVLFCSTEIGRTSFV  195 (219)
Q Consensus       173 ~~g-GL~r---HKVLFCST~~Gr~sfV  195 (219)
                      ..| |.+.   .+..+-+....|..|+
T Consensus       108 ~~Gag~~~~~a~~p~i~~~~g~kia~l  134 (250)
T PF09587_consen  108 YVGAGRNLEEARRPAIIEVNGVKIAFL  134 (250)
T ss_pred             EeECcCChHHhcCeEEEEECCEEEEEE
Confidence            754 4432   4555555544444443


No 79 
>KOG0121 consensus Nuclear cap-binding protein complex, subunit CBP20 (RRM superfamily) [RNA processing and modification]
Probab=20.30  E-value=1.3e+02  Score=26.00  Aligned_cols=55  Identities=20%  Similarity=0.280  Sum_probs=44.6

Q ss_pred             eehhHHHHHHHHhhcCceEEEEEeechhhHHHHHHHHHHcCCCCCC--CCccceEEee
Q 027770          130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSG--GLVKDKVLFC  185 (219)
Q Consensus       130 v~esalevLleLsk~~DlYLm~~V~dD~eE~~Vl~~Le~~GlF~~g--GL~rHKVLFC  185 (219)
                      ++-+..|-|-.+-+.|.||+ .-+.--..|+||-.+.-..|-...-  ||++.|--+|
T Consensus        22 f~gt~~e~~~a~r~S~tvyV-gNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpC   78 (153)
T KOG0121|consen   22 FRGTDEEQLEALRKSCTVYV-GNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPC   78 (153)
T ss_pred             hcCchHHHHHHHhhcceEEE-eeeeeeecHHHHHHHHHhccchheeEeccccCCcCcc
Confidence            44466677777888999998 7788888999999999999988843  9999886655


No 80 
>PF13807 GNVR:  G-rich domain on putative tyrosine kinase
Probab=20.04  E-value=97  Score=22.62  Aligned_cols=17  Identities=29%  Similarity=0.532  Sum_probs=12.4

Q ss_pred             chHHHHHHHHHHHHHHH
Q 027770           15 RSVGAIAGLALAIVFTW   31 (219)
Q Consensus        15 ~s~~~~ag~a~a~v~~w   31 (219)
                      ..+|+++|+++++.+++
T Consensus        62 l~l~~~~Gl~lgi~~~~   78 (82)
T PF13807_consen   62 LALGLFLGLILGIGLAF   78 (82)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            35788888888777664


No 81 
>PRK11677 hypothetical protein; Provisional
Probab=20.02  E-value=79  Score=26.38  Aligned_cols=28  Identities=25%  Similarity=0.459  Sum_probs=17.5

Q ss_pred             CeeeehhhhhcCCCchHHHHHHHHHHHHHHHHhhcCCC
Q 027770            1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPG   38 (219)
Q Consensus         1 ~~~~~~~~~~~~~~~s~~~~ag~a~a~v~~w~~lr~~~   38 (219)
                      |-|+.++         +|-|+|++|+.+.. |++.+..
T Consensus         1 M~W~~a~---------i~livG~iiG~~~~-R~~~~~~   28 (134)
T PRK11677          1 MTWEYAL---------IGLVVGIIIGAVAM-RFGNRKL   28 (134)
T ss_pred             CcHHHHH---------HHHHHHHHHHHHHH-hhccchh
Confidence            6677665         67777777755444 4555543


Done!