RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14141
         (698 letters)



>gnl|CDD|173796 cd04857, Peptidases_S8_Tripeptidyl_Aminopeptidase_II, Peptidase S8
           family domain in Tripeptidyl aminopeptidases_II.
           Tripeptidyl aminopeptidases II are member of the
           peptidase S8 or Subtilase family. Subtilases, or
           subtilisin-like serine proteases, have an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure (an example of convergent evolution).
           Tripeptidyl aminopeptidase II removes tripeptides from
           the free N terminus of oligopeptides as well as having
           endoproteolytic activity.  Some tripeptidyl
           aminopeptidases have been shown to cleave tripeptides
           and small peptides, e.g. angiotensin II and glucagon,
           while others are believed to be involved in MHC I
           processing.
          Length = 412

 Score =  204 bits (520), Expect = 6e-59
 Identities = 76/107 (71%), Positives = 89/107 (83%)

Query: 369 GLLPKKETGVLNVLSKHPEYDGRGVVIAIFDSGVDPGAAGLQVTSDGKPKVIERYDCGGA 428
           GLLPKKETG L  L K+PEYDGRGV+IAI D+GVDPGA GLQVT+DGKPK+I+  DC G+
Sbjct: 1   GLLPKKETGALRFLQKYPEYDGRGVLIAILDTGVDPGAPGLQVTTDGKPKIIDIIDCTGS 60

Query: 429 GDVDTSTVVKVDDTNHITGLSGRKLKIPTSWKNPTGDFHIGLKNVYE 475
           GDVDTSTVV  DD   I GL+GRKLKIP SWKNP+G +H+G+KN Y+
Sbjct: 61  GDVDTSTVVTPDDGGIIGGLTGRKLKIPASWKNPSGKYHVGIKNAYD 107



 Score =  183 bits (466), Expect = 3e-51
 Identities = 62/114 (54%), Positives = 85/114 (74%), Gaps = 2/114 (1%)

Query: 544 EKKFNCHDLGPAYDVVVFHNGDYWCACVDTTETGDLAACHVLGEYNVTRDFTSLTPADQF 603
           EKK+   D GP YD VVFH+G++W A +DT+ETGDL +C VL  Y   R++ +    D  
Sbjct: 108 EKKYE--DPGPVYDCVVFHDGEHWRAVIDTSETGDLDSCTVLTNYREEREYATFGEQDLL 165

Query: 604 NFSINVYEEGNVLELVGLCSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCI 657
           N+S+N+Y++GN+L +V    +HGTHVA IAAA+FP+EPE+NGVAPGAQI+S+ I
Sbjct: 166 NYSVNIYDDGNLLSIVTDSGAHGTHVAGIAAAHFPEEPERNGVAPGAQIVSIKI 219



 Score = 88.1 bits (219), Expect = 2e-18
 Identities = 27/46 (58%), Positives = 39/46 (84%)

Query: 11  TQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQ 56
           +Q MNGTSMS+P+A G +AL++S L+ +G+ Y+PYS+RRALE TA+
Sbjct: 365 SQLMNGTSMSSPNACGGIALLLSGLKAEGIPYTPYSVRRALENTAK 410



 Score = 81.6 bits (202), Expect = 2e-16
 Identities = 33/54 (61%), Positives = 39/54 (72%)

Query: 645 GVAPGAQIISLCIDGKPKVIERYDCGGAGDVDTSTVVKVDDTNHITGLSGRKLK 698
           GV PGA  + +  DGKPK+I+  DC G+GDVDTSTVV  DD   I GL+GRKLK
Sbjct: 33  GVDPGAPGLQVTTDGKPKIIDIIDCTGSGDVDTSTVVTPDDGGIIGGLTGRKLK 86


>gnl|CDD|215701 pfam00082, Peptidase_S8, Subtilase family.  Subtilases are a family
           of serine proteases. They appear to have independently
           and convergently evolved an Asp/Ser/His catalytic triad,
           like that found in the trypsin serine proteases (see
           pfam00089). Structure is an alpha/beta fold containing a
           7-stranded parallel beta sheet, order 2314567.
          Length = 277

 Score = 65.1 bits (159), Expect = 1e-11
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 4   SSSSNKETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPSVEP 63
                      +GTSM+AP   G  AL++SA      S +P ++R  L  TA  + S+  
Sbjct: 206 PGGDLGGYDSHSGTSMAAPLVAGAAALLLSANP----SLTPETLRALLVTTATDLGSMGL 261

Query: 64  -YAQGFGLLQVEKALE 78
             + G+GLL + KA++
Sbjct: 262 DRSFGYGLLNLGKAVK 277



 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 16/62 (25%), Positives = 24/62 (38%)

Query: 593 DFTSLTPADQFNFSINVYEEGNVLELVGLCSSHGTHVASIAAAYFPDEPEKNGVAPGAQI 652
           D +       +    +  +           + HGTHVA I AA   +     GVAP A++
Sbjct: 15  DLSGRYIGLAYRNGYDFVDNDPDPTPDDDNNGHGTHVAGIIAAGDNNGSGGVGVAPNAKL 74

Query: 653 IS 654
            S
Sbjct: 75  ES 76


>gnl|CDD|173814 cd07489, Peptidases_S8_5, Peptidase S8 family domain,
           uncharacterized subfamily 5.  gap in seq This family is
           a member of the Peptidases S8 or Subtilases serine endo-
           and exo-peptidase clan. They have an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure and are not homologous to trypsin. The
           stability of subtilases may be enhanced by calcium, some
           members have been shown to bind up to 4 ions via binding
           sites with different affinity. Some members of this clan
           contain disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 312

 Score = 52.2 bits (126), Expect = 3e-07
 Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 13/75 (17%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPS---------VEPY 64
           ++GTSM+ P+  G  AL+I A   +    SP  +R  L  TA+ +P          + P 
Sbjct: 227 LSGTSMATPYVAGAAALLIQARHGKL---SPAELRDLLASTAKPLPWSDGTSALPDLAPV 283

Query: 65  AQ-GFGLLQVEKALE 78
           AQ G GL+   KAL 
Sbjct: 284 AQQGAGLVNAYKALY 298



 Score = 39.9 bits (94), Expect = 0.003
 Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 10/56 (17%)

Query: 622 CSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCIDGKPKVIERYDCGGAGDVDT 677
           C  HGTHVA I AA  P+     GVAP A +      G  +V   + C G+   DT
Sbjct: 67  CQGHGTHVAGIIAAN-PNAYGFTGVAPEATL------GAYRV---FGCSGSTTEDT 112


>gnl|CDD|173801 cd07475, Peptidases_S8_C5a_Peptidase, Peptidase S8 family domain in
           Streptococcal C5a peptidases.  Streptococcal C5a
           peptidase (SCP), is a highly specific protease and
           adhesin/invasin.  The subtilisin-like protease domain is
           located at the N-terminus and contains a
           protease-associated domain inserted into a loop.  There
           are three fibronectin type III (Fn) domains at the
           C-terminus. SCP binds to integrins with the help of
           Arg-Gly-Asp motifs which are thought to stabilize
           conformational changes required for substrate binding.
           Peptidases S8 or Subtilases are a serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 346

 Score = 51.1 bits (123), Expect = 1e-06
 Identities = 22/85 (25%), Positives = 38/85 (44%), Gaps = 11/85 (12%)

Query: 4   SSSSNKETQHMNGTSMSAPHATGCVALIISALQQQGLSYSP---YSIRRALEITAQYIPS 60
           S+ ++    +M+GTSM++PH  G  AL+   L+++    S      + + L +     P 
Sbjct: 261 STVNDNTYGYMSGTSMASPHVAGASALVKQRLKEKYPKLSGEELVDLVKNLLMNTATPPL 320

Query: 61  VEPYA--------QGFGLLQVEKAL 77
                        QG GL+ V KA+
Sbjct: 321 DSEDTKTYYSPRRQGAGLIDVAKAI 345



 Score = 43.8 bits (104), Expect = 2e-04
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 5/50 (10%)

Query: 623 SSHGTHVASIAAAYFPDEPEKN----GVAPGAQIISLCIDGKPKVIERYD 668
           SSHG HVA I A    DE +      GVAP AQ++++ +   P+    YD
Sbjct: 82  SSHGMHVAGIVAGN-GDEEDNGEGIKGVAPEAQLLAMKVFSNPEGGSTYD 130



 Score = 35.7 bits (83), Expect = 0.070
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 388 YDGRGVVIAIFDSGVDPGAAGLQVTSDGKPKVIE 421
           Y G G+V+A+ DSGVDP     ++  D K K  E
Sbjct: 8   YKGEGMVVAVIDSGVDPTHDAFRLDDDSKAKYSE 41


>gnl|CDD|173800 cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family
           domain in Vpr-like proteins.  The maturation of the
           peptide antibiotic (lantibiotic) subtilin in Bacillus
           subtilis ATCC 6633 includes posttranslational
           modifications of the propeptide and proteolytic cleavage
           of the leader peptide.  Vpr was identified as one of the
           proteases,  along with WprA, that are capable of
           processing subtilin.    Asp, Ser, His triadPeptidases S8
           or Subtilases are a serine endo- and exo-peptidase clan.
           They have an Asp/His/Ser catalytic triad similar to that
           found in trypsin-like proteases, but do not share their
           three-dimensional structure and are not homologous to
           trypsin. The stability of subtilases may be enhanced by
           calcium, some members have been shown to bind up to 4
           ions via binding sites with different affinity. Some
           members of this clan contain disulfide bonds. These
           enzymes can be intra- and extracellular, some function
           at extreme temperatures and pH values.
          Length = 295

 Score = 45.8 bits (109), Expect = 3e-05
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 9/68 (13%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPSVE-----PYAQGF 68
           M+GTSM+APH  G  AL    L+Q    +SP  I+ AL  TA+ +   +        QG 
Sbjct: 232 MSGTSMAAPHVAGAAAL----LKQAHPDWSPAQIKAALMNTAKPLYDSDGVVYPVSRQGA 287

Query: 69  GLLQVEKA 76
           G +   +A
Sbjct: 288 GRVDALRA 295



 Score = 45.4 bits (108), Expect = 6e-05
 Identities = 18/54 (33%), Positives = 23/54 (42%), Gaps = 7/54 (12%)

Query: 622 CSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCIDGKPKVIERYDCGGAGDV 675
            + HGTHVA I A    +     GVAP A + +       KV+      G  DV
Sbjct: 61  ATGHGTHVAGIIAGNGVNVGTIKGVAPKADLYAY------KVLGPG-GSGTTDV 107



 Score = 34.2 bits (79), Expect = 0.17
 Identities = 17/70 (24%), Positives = 24/70 (34%), Gaps = 9/70 (12%)

Query: 390 GRGVVIAIFDSGVDPGAAGLQVTSDGKPKVIERYDCGG---------AGDVDTSTVVKVD 440
           G+GV +A+ D+G+D     L        KV   YD                        D
Sbjct: 1   GKGVKVAVIDTGIDYTHPDLGGPGFPNDKVKGGYDFVDDDYDPMDTRPYPSPLGDASAGD 60

Query: 441 DTNHITGLSG 450
            T H T ++G
Sbjct: 61  ATGHGTHVAG 70


>gnl|CDD|173803 cd07477, Peptidases_S8_Subtilisin_subset, Peptidase S8 family
           domain in Subtilisin proteins.  This group is composed
           of many different subtilisins: Pro-TK-subtilisin,
           subtilisin Carlsberg, serine protease Pb92 subtilisin,
           and BPN subtilisins just to name a few.
           Pro-TK-subtilisin is a serine protease from the
           hyperthermophilic archaeon Thermococcus kodakaraensis
           and consists of a signal peptide, a propeptide, and a
           mature domain.  TK-subtilisin is matured from
           pro-TK-subtilisin upon autoprocessing and degradation of
           the propeptide. Unlike other subtilisins though, the
           folding of the unprocessed form of pro-TK-subtilisin is
           induced by Ca2+ binding which is almost completed prior
           to autoprocessing. Ca2+ is required for activity unlike
           the bacterial subtilisins. The propeptide is not
           required for folding of the mature domain unlike the
           bacterial subtilases because of the stability produced
           from Ca2+ binding.  Subtilisin Carlsberg is extremely
           similar in structure to subtilisin BPN'/Novo thought it
           has a 30% difference in amino acid sequence.  The
           substrate binding regions are also similar and 2
           possible Ca2+ binding sites have been identified
           recently. Subtilisin Carlsberg possesses the highest
           commercial importance as a proteolytic additive for
           detergents. Serine protease Pb92, the serine protease
           from the alkalophilic Bacillus strain PB92, also
           contains two calcium ions and the overall  folding of
           the polypeptide chain closely resembles that of the
           subtilisins.   Members of the peptidases S8 and S35 clan
           include endopeptidases, exopeptidases and also a
           tripeptidyl-peptidase. The S8 family has an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure and are not homologous to trypsin. The S53
           family contains a catalytic triad Glu/Asp/Ser. The
           stability of these enzymes may be enhanced by calcium,
           some members have been shown to bind up to 4 ions via
           binding sites with different affinity. Some members of
           this clan contain disulfide bonds. These enzymes can be
           intra- and extracellular, some function at extreme
           temperatures and pH values.
          Length = 229

 Score = 45.2 bits (108), Expect = 4e-05
 Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEIT 54
           ++GTSM+ PH  G  AL+ S         +   +R+AL  T
Sbjct: 193 LSGTSMATPHVAGVAALVWSK----RPELTNAQVRQALNKT 229



 Score = 37.5 bits (88), Expect = 0.011
 Identities = 17/60 (28%), Positives = 21/60 (35%), Gaps = 21/60 (35%)

Query: 625 HGTHVASIAAAYFPDEPEKNGVAPGAQI--------------------ISLCIDGKPKVI 664
           HGTHVA I AA   +     GVAP A +                    I   I+    +I
Sbjct: 42  HGTHVAGIIAAL-DNGVGVVGVAPEADLYAVKVLNDDGSGTYSDIIAGIEWAIENGMDII 100


>gnl|CDD|173802 cd07476, Peptidases_S8_thiazoline_oxidase_subtilisin-like_protease,
           Peptidase S8 family domain in Thiazoline
           oxidase/subtilisin-like proteases.  Thiazoline
           oxidase/subtilisin-like protease is produced by the
           symbiotic bacteria Prochloron spp. that inhabit didemnid
           family ascidians.  The cyclic peptides of the
           patellamide class found in didemnid extracts are now
           known to be synthesized by the Prochloron spp.  The
           prepatellamide is heterocyclized to form thiazole and
           oxazoline rings and the peptide is cleaved to form the
           two cyclic patellamides A and C.  Subtilases, or
           subtilisin-like serine proteases, have an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure (an example of convergent evolution).
          Length = 267

 Score = 45.0 bits (107), Expect = 5e-05
 Identities = 17/46 (36%), Positives = 25/46 (54%)

Query: 10  ETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
           E    +GTS +A    G  AL++S   ++G    P ++RRAL  TA
Sbjct: 204 EVVRRSGTSFAAAIVAGIAALLLSLQLRRGAPPDPLAVRRALLETA 249


>gnl|CDD|173807 cd07481, Peptidases_S8_BacillopeptidaseF-like, Peptidase S8 family
           domain in BacillopeptidaseF-like proteins.  Bacillus
           subtilis produces and secretes proteases and other types
           of exoenzymes at the end of the exponential phase of
           growth. The ones that make up this group is known as
           bacillopeptidase F, encoded by bpr,  a serine protease
           with high esterolytic activity which is inhibited by
           PMSF.  Like other members of the peptidases S8 family
           these have a Asp/His/Ser catalytic triad similar to that
           found in trypsin-like proteases, but do not share their
           three-dimensional structure and are not homologous to
           trypsin. The stability of these enzymes may be enhanced
           by calcium, some members have been shown to bind up to 4
           ions via binding sites with different affinity.
          Length = 264

 Score = 44.7 bits (106), Expect = 8e-05
 Identities = 17/53 (32%), Positives = 23/53 (43%), Gaps = 2/53 (3%)

Query: 4   SSSSNKETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQ 56
           S+         +GTSM+APH  G  AL+ SA     L     +    L  TA+
Sbjct: 214 SAVPGGGYGSSSGTSMAAPHVAGVAALLWSA--NPSLIGDVDATEAILTETAR 264



 Score = 30.4 bits (69), Expect = 3.1
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 623 SSHGTHVASIAAAYFPDEPEKNGVAPGAQII 653
           + HGTH          D  +  GVAPGA+ I
Sbjct: 52  NGHGTHTMGTMVGNDGDGQQI-GVAPGARWI 81


>gnl|CDD|173787 cd00306, Peptidases_S8_S53, Peptidase domain in the S8 and S53
           families.  Members of the peptidases S8 (subtilisin and
           kexin) and S53 (sedolisin) family include endopeptidases
           and  exopeptidases. The S8 family has an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure and are not homologous to trypsin. Serine acts
           as a nucleophile, aspartate as an electrophile, and
           histidine as a base. The S53 family contains a catalytic
           triad Glu/Asp/Ser with an additional acidic residue Asp
           in the oxyanion hole, similar to that of subtilisin.
           The serine residue here is the nucleophilic equivalent
           of the serine residue in the S8 family, while glutamic
           acid has the same role here as the histidine base. 
           However, the aspartic acid residue that acts as an
           electrophile is quite different.  In S53, it follows
           glutamic acid, while in S8 it precedes histidine. The
           stability of these enzymes may be enhanced by calcium;
           some members have been shown to bind up to 4 ions via
           binding sites with different affinity.  There is a great
           diversity in the characteristics of their members: some
           contain disulfide bonds, some are intracellular while
           others are extracellular, some function at extreme
           temperatures, and others at high or low pH values.
          Length = 241

 Score = 44.1 bits (104), Expect = 1e-04
 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEIT 54
           ++GTSM+AP   G  AL++SA        +P  ++ AL  T
Sbjct: 205 LSGTSMAAPIVAGVAALLLSANPDL----TPAQVKAALLST 241



 Score = 41.8 bits (98), Expect = 5e-04
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 1/50 (2%)

Query: 606 SINVYEEGNVLELVGLCSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISL 655
             +  +  N        + HGTHVA I AA   +     GVAPGA++I +
Sbjct: 27  GNDDDDNENGPTDPDDGNGHGTHVAGIIAAS-ANNGGGVGVAPGAKLIPV 75



 Score = 30.6 bits (69), Expect = 2.3
 Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 11/60 (18%)

Query: 393 VVIAIFDSGVDPGAAGLQVTSDGKPKVIERYDCGGAGDVDTSTVVKVDDTN----HITGL 448
           V +A+ D+GVDP              +    D G   D + +     DD N    H+ G+
Sbjct: 1   VTVAVIDTGVDP-------DHPDLDGLFGGGDGGNDDDDNENGPTDPDDGNGHGTHVAGI 53


>gnl|CDD|173811 cd07485, Peptidases_S8_Fervidolysin_like, Peptidase S8 family
           domain in Fervidolysin.  Fervidolysin found in
           Fervidobacterium pennivorans is an extracellular
           subtilisin-like keratinase.  It is contains a signal
           peptide, a propeptide, and a catalytic region. The
           tertiary structure of fervidolysin is similar to that of
           subtilisin.  It contains a Asp/His/Ser catalytic triad
           and is a member of the peptidase S8 (subtilisin and
           kexin) family. The catalytic triad is similar to that
           found in trypsin-like proteases, but it does not share
           their three-dimensional structure and are not homologous
           to trypsin. Serine acts as a nucleophile, aspartate as
           an electrophile, and histidine as a base. The S53 family
           contains a catalytic triad Glu/Asp/Ser with an
           additional acidic residue Asp in the oxyanion hole,
           similar to that of subtilisin.  The serine residue here
           is the nucleophilic equivalent of the serine residue in
           the S8 family, while glutamic acid has the same role
           here as the histidine base.   However, the aspartic acid
           residue that acts as an electrophile is quite different.
            In S53, it follows glutamic acid, while in S8 it
           precedes histidine. The stability of these enzymes may
           be enhanced by calcium; some members have been shown to
           bind up to 4 ions via binding sites with different
           affinity.  There is a great diversity in the
           characteristics of their members: some contain disulfide
           bonds, some are intracellular while others are
           extracellular, some function at extreme temperatures,
           and others at high or low pH values.
          Length = 273

 Score = 44.0 bits (104), Expect = 1e-04
 Identities = 16/43 (37%), Positives = 28/43 (65%), Gaps = 3/43 (6%)

Query: 12  QHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEIT 54
           ++++GTSM+APH +G  AL++S        ++P  IR+ LE +
Sbjct: 234 EYLSGTSMAAPHVSGVAALVLSKFPD---VFTPEQIRKLLEES 273



 Score = 29.4 bits (66), Expect = 5.9
 Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 6/41 (14%)

Query: 623 SSHGTHVASIAAAY------FPDEPEKNGVAPGAQIISLCI 657
             HGTHVA   AA               GVAPG +I+S+ I
Sbjct: 61  GGHGTHVAGTIAAVNNNGGGVGGIAGAGGVAPGVKIMSIQI 101


>gnl|CDD|173790 cd04077, Peptidases_S8_PCSK9_ProteinaseK_like, Peptidase S8 family
           domain in ProteinaseK-like proteins.  The peptidase S8
           or Subtilase clan of proteases have a Asp/His/Ser
           catalytic triad that is not homologous to trypsin. This
           CD contains several members of this clan including:
           PCSK9 (Proprotein convertase subtilisin/kexin type 9),
           Proteinase_K, Proteinase_T, and other subtilisin-like
           serine proteases.  PCSK9 posttranslationally regulates
           hepatic low-density lipoprotein receptors (LDLRs) by
           binding to LDLRs on the cell surface, leading to their
           degradation. The binding site of PCSK9 has been
           localized to the epidermal growth factor-like repeat A
           (EGF-A) domain of the LDLR. Characterized Proteinases K
           are secreted endopeptidases with a high degree of
           sequence conservation.  Proteinases K are not
           substrate-specific and function in a wide variety of
           species in different pathways. It can hydrolyze keratin
           and other proteins with subtilisin-like specificity. The
           number of calcium-binding motifs found in these differ.
           Proteinase T is a novel proteinase from the fungus
           Tritirachium album Limber. The amino acid sequence of
           proteinase T as deduced from the nucleotide sequence is
           about 56% identical to that of proteinase K.
          Length = 255

 Score = 43.7 bits (104), Expect = 1e-04
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 4/49 (8%)

Query: 3   SSSSSNKETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRAL 51
           +   S+  T  ++GTSM+APH  G  A ++S     G   SP  ++  L
Sbjct: 205 AWIGSDTATATLSGTSMAAPHVAGLAAYLLS----LGPDLSPAEVKARL 249



 Score = 42.5 bits (101), Expect = 3e-04
 Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 17/61 (27%)

Query: 622 CSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCIDGKPKVIERYDCGGAGDVDTSTVV 681
           C+ HGTHVA           +  GVA  A ++++      KV+   DC G+G    S ++
Sbjct: 62  CNGHGTHVAGTVGG------KTYGVAKKANLVAV------KVL---DCNGSG--TLSGII 104

Query: 682 K 682
            
Sbjct: 105 A 105


>gnl|CDD|173812 cd07487, Peptidases_S8_1, Peptidase S8 family domain,
           uncharacterized subfamily 1.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 264

 Score = 43.7 bits (104), Expect = 1e-04
 Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 12/61 (19%)

Query: 623 SSHGTHVASIAAAYFPD-EPEKNGVAPGAQIISLCIDGKPKVIERYDCGGAGDVDTSTVV 681
           + HGTHVA I A        +  GVAPGA ++ +      KV++     G+G    S ++
Sbjct: 44  NGHGTHVAGIIAGSGRASNGKYKGVAPGANLVGV------KVLDDS---GSGSE--SDII 92

Query: 682 K 682
            
Sbjct: 93  A 93



 Score = 40.3 bits (95), Expect = 0.002
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQ 56
           M+GTSM+ PH +G +AL+   LQ      +P  ++  L  TA 
Sbjct: 226 MSGTSMATPHVSGAIALL---LQANP-ILTPDEVKCILRDTAT 264


>gnl|CDD|173798 cd05562, Peptidases_S53_like, Peptidase domain in the S53 family.
           Members of the peptidase S53 (sedolisin) family include
           endopeptidases and exopeptidases. The S53 family
           contains a catalytic triad Glu/Asp/Ser with an
           additional acidic residue Asp in the oxyanion hole,
           similar to that of Asn in subtilisin. The stability of
           these enzymes may be enhanced by calcium, some members
           have been shown to bind up to 4 ions via binding sites
           with different affinity. Some members of this clan
           contain disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values. Characterized sedolisins include Kumamolisin,
           an extracellular calcium-dependent thermostable
           endopeptidase from Bacillus. The enzyme is synthesized
           with a 188 amino acid N-terminal preprotein region which
           is cleaved after the extraction into the extracellular
           space with low pH. One kumamolysin paralog,
           kumamolisin-As, is believed to be a collagenase. TPP1 is
           a serine protease that functions as a tripeptidyl
           exopeptidase as well as an endopeptidase. Less is known
           about PSCP from Pseudomonas which is thought to be an
           aspartic proteinase.
          Length = 275

 Score = 43.8 bits (104), Expect = 2e-04
 Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 9/66 (13%)

Query: 16  GTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPSVEPY---AQGFGLLQ 72
           GTS +APHA G  AL++SA        +P  IR AL  TA  +   EP    A G GL+ 
Sbjct: 215 GTSAAAPHAAGVAALVLSA----NPGLTPADIRDALRSTALDMG--EPGYDNASGSGLVD 268

Query: 73  VEKALE 78
            ++A+ 
Sbjct: 269 ADRAVA 274


>gnl|CDD|173804 cd07478, Peptidases_S8_CspA-like, Peptidase S8 family domain in
           CspA-like proteins.  GSP (germination-specific protease)
           converts the spore peptidoglycan hydrolase (SleC)
           precursor to an active enzyme during germination of
           Clostridium perfringens S40 spores.  Analysis of an
           enzyme fraction of GSP showed that it was composed of a
           gene cluster containing the processed forms of products
           of cspA, cspB, and cspC which are positioned in a tandem
           array just upstream of the 5' end of sleC. The amino
           acid sequences deduced from the nucleotide sequences of
           the csp genes showed significant similarity and showed a
           high degree of homology with those of the catalytic
           domain and the oxyanion binding region of
           subtilisin-like serine proteases.   Members of the
           peptidases S8 and S35 clan include endopeptidases,
           exopeptidases and also a tripeptidyl-peptidase. The S8
           family has an Asp/His/Ser catalytic triad similar to
           that found in trypsin-like proteases, but do not share
           their three-dimensional structure and are not homologous
           to trypsin. The S53 family contains a catalytic triad
           Glu/Asp/Ser. The stability of these enzymes may be
           enhanced by calcium, some members have been shown to
           bind up to 4 ions via binding sites with different
           affinity. Some members of this clan contain disulfide
           bonds. These enzymes can be intra- and extracellular,
           some function at extreme temperatures and pH values.
          Length = 455

 Score = 43.8 bits (104), Expect = 2e-04
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 623 SSHGTHVASIAAAYFPDEPEKNGVAPGAQII 653
           + HGTHVA IAA    + P+  GVAP A++I
Sbjct: 78  NGHGTHVAGIAAGNGDNNPDFKGVAPEAELI 108



 Score = 40.3 bits (95), Expect = 0.003
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 4   SSSSNKETQHMNGTSMSAPHATGCVALI 31
           ++S        +GTS++A    G  AL+
Sbjct: 386 TASPGGGYTTRSGTSVAAAIVAGACALL 413



 Score = 31.0 bits (71), Expect = 2.2
 Identities = 7/16 (43%), Positives = 12/16 (75%)

Query: 388 YDGRGVVIAIFDSGVD 403
             G+GV++ I D+G+D
Sbjct: 1   LTGKGVLVGIIDTGID 16


>gnl|CDD|173799 cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain
           in Subtilisin-like proteins.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 259

 Score = 42.2 bits (100), Expect = 5e-04
 Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 4   SSSSNKETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRAL 51
           S+S      +M+GTSM+ PH  G  AL++S       + +   I+ A+
Sbjct: 211 STSPGGGYGYMSGTSMATPHVAGAAALLLSLNP----NLTAAQIKDAI 254



 Score = 37.2 bits (87), Expect = 0.021
 Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 23/65 (35%)

Query: 624 SHGTHVASIAAAYFPDEPEKN------GVAPGAQIISLCIDGKPKVIERYDCGGAGDVDT 677
            HGTHVA I  A        N      GVA   +I+ L      K +     G  G   T
Sbjct: 64  GHGTHVAGIIGA------VGNNGIGIAGVAWNVKIMPL------KFL-----GADGSGTT 106

Query: 678 STVVK 682
           S  +K
Sbjct: 107 SDAIK 111


>gnl|CDD|173821 cd07497, Peptidases_S8_14, Peptidase S8 family domain,
           uncharacterized subfamily 14.  This family is a member
           of the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 311

 Score = 41.7 bits (98), Expect = 8e-04
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 16  GTSMSAPHATGCVALIISALQQQ--GLSYSPYSIRRALEITA 55
           GTSM+ P   G  AL+ISAL+++     Y P+ +R  L  TA
Sbjct: 270 GTSMATPMTAGSAALVISALKEKEGVGEYDPFLVRTILMSTA 311



 Score = 32.1 bits (73), Expect = 0.94
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 390 GRGVVIAIFDSGVDPGAAGL 409
           G GVVIAI D+GVD     L
Sbjct: 1   GEGVVIAIVDTGVDYSHPDL 20


>gnl|CDD|173794 cd04848, Peptidases_S8_Autotransporter_serine_protease_like,
           Peptidase S8 family domain in Autotransporter serine
           proteases.  Autotransporter serine proteases belong to
           Peptidase S8 or Subtilase family. Subtilases, or
           subtilisin-like serine proteases, have an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure (an example of convergent evolution).
           Autotransporters are a superfamily of outer
           membrane/secreted proteins of gram-negative bacteria.
           The presence of these subtilisin-like domains in these
           autotransporters are may enable them to be
           auto-catalytic and may also serve to allow them to act
           as a maturation protease cleaving other outer membrane
           proteins at the cell surface.
          Length = 267

 Score = 41.5 bits (98), Expect = 9e-04
 Identities = 18/53 (33%), Positives = 21/53 (39%), Gaps = 1/53 (1%)

Query: 603 FNFSINVYEEGNVLELVGLCSSHGTHVASIAAAYFPDEPEKNGVAPGAQIISL 655
              S  V          G   SHGTHVA + AA        +GVAP A + S 
Sbjct: 26  SEASYYVAVNDAGYASNGDGDSHGTHVAGVIAAARDGGGM-HGVAPDATLYSA 77



 Score = 37.3 bits (87), Expect = 0.020
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 13  HMNGTSMSAPHATGCVALIISA 34
            ++GTS +APH +G  AL+   
Sbjct: 228 RVSGTSFAAPHVSGAAALLAQK 249



 Score = 30.4 bits (69), Expect = 2.9
 Identities = 7/22 (31%), Positives = 10/22 (45%)

Query: 389 DGRGVVIAIFDSGVDPGAAGLQ 410
            G GV + + DSG+D       
Sbjct: 1   TGAGVKVGVIDSGIDLSHPEFA 22


>gnl|CDD|173795 cd04852, Peptidases_S8_3, Peptidase S8 family domain,
           uncharacterized subfamily 3.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 307

 Score = 41.0 bits (97), Expect = 0.001
 Identities = 18/42 (42%), Positives = 26/42 (61%), Gaps = 4/42 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
           ++GTSM++PH  G  AL+ SA       +SP +I+ AL  TA
Sbjct: 269 ISGTSMASPHVAGVAALLKSAHP----DWSPAAIKSALMTTA 306



 Score = 33.3 bits (77), Expect = 0.39
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 8/38 (21%)

Query: 625 HGTHVASIAA-AYFPDEPEKN-------GVAPGAQIIS 654
           HGTH AS AA     +            GVAP A+I  
Sbjct: 110 HGTHTASTAAGNVVVNASVGGFAFGTASGVAPRARIAV 147



 Score = 29.1 bits (66), Expect = 7.0
 Identities = 6/20 (30%), Positives = 11/20 (55%)

Query: 385 HPEYDGRGVVIAIFDSGVDP 404
                G G++I + D+G+ P
Sbjct: 24  GAANAGEGIIIGVLDTGIWP 43


>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
           protease mycosin.  Members of this family are
           subtilisin-related serine proteases, found strictly in
           the Actinobacteria and associated with type VII
           secretion operons. The designation mycosin is used for
           members from Mycobacterium [Protein fate, Protein and
           peptide secretion and trafficking, Protein fate, Protein
           modification and repair].
          Length = 350

 Score = 41.2 bits (97), Expect = 0.001
 Identities = 22/62 (35%), Positives = 29/62 (46%), Gaps = 5/62 (8%)

Query: 622 CSSHGTHVASIAAAYFPDEPEK-NGVAPGAQIISLCIDGKPKVIERYDCGGAGDVDTSTV 680
           C  HGT VA I A   P E +  +GVAP A+I+ +            D G +G  D  T+
Sbjct: 50  CDGHGTLVAGIIAGR-PGEGDGFSGVAPDARILPIRQ---TSAAFEPDEGTSGVGDLGTL 105

Query: 681 VK 682
            K
Sbjct: 106 AK 107



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 15  NGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPSVEP-YAQGFGLLQV 73
           +GTS +AP  +G  AL+ S         +   +RR +E TA +          G+G++  
Sbjct: 223 SGTSFAAPFVSGTAALVRSRFPD----LTAAQVRRRIEATADHPARGGRDDYVGYGVVDP 278

Query: 74  EKALEW 79
             AL  
Sbjct: 279 VAALTG 284


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 10  ETQHMNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
           ET  ++GTS + P  TG  AL++S   QQG +  P ++R AL  +A
Sbjct: 153 ETVRLSGTSFATPVVTGVAALLLSLQLQQGETPDPQAVRTALLNSA 198


>gnl|CDD|173806 cd07480, Peptidases_S8_12, Peptidase S8 family domain,
           uncharacterized subfamily 12.  This family is a member
           of the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 297

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 2/62 (3%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRAL--EITAQYIPSVEPYAQGFGLL 71
           M+GTSM+ PH  G  AL   AL + G       ++  L    T Q+ P ++   +G GL 
Sbjct: 234 MSGTSMATPHVAGVAALWAEALPKAGGRALAALLQARLTAARTTQFAPGLDLPDRGVGLG 293

Query: 72  QV 73
             
Sbjct: 294 LA 295



 Score = 32.0 bits (73), Expect = 0.97
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query: 386 PEYDGRGVVIAIFDSGVDPG 405
             + G GV +A+ D+G+D  
Sbjct: 3   SPFTGAGVRVAVLDTGIDLT 22


>gnl|CDD|173810 cd07484, Peptidases_S8_Thermitase_like, Peptidase S8 family domain
           in Thermitase-like proteins.  Thermitase is a
           non-specific, trypsin-related serine protease with a
           very high specific activity.  It contains a subtilisin
           like domain. The tertiary structure of thermitase is
           similar to that of subtilisin BPN'.  It contains a
           Asp/His/Ser catalytic triad. Members of the peptidases
           S8 (subtilisin and kexin) and S53 (sedolisin) clan
           include endopeptidases and  exopeptidases. The S8 family
           has an Asp/His/Ser catalytic triad similar to that found
           in trypsin-like proteases, but do not share their
           three-dimensional structure and are not homologous to
           trypsin. Serine acts as a nucleophile, aspartate as an
           electrophile, and histidine as a base. The S53 family
           contains a catalytic triad Glu/Asp/Ser with an
           additional acidic residue Asp in the oxyanion hole,
           similar to that of subtilisin.  The serine residue here
           is the nucleophilic equivalent of the serine residue in
           the S8 family, while glutamic acid has the same role
           here as the histidine  base.   However, the aspartic
           acid residue that acts as an electrophile  is quite
           different.  In S53 the it follows glutamic acid, while
           in S8 it precedes histidine. The stability of these
           enzymes may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity.  There is a great diversity in the
           characteristics of their members: some contain disulfide
           bonds, some are intracellular while others are
           extracellular, some function at extreme temperatures,
           and others at high or low pH values.
          Length = 260

 Score = 39.6 bits (93), Expect = 0.003
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 5/45 (11%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYI 58
           M+GTSM+ PH  G  AL+ S    QG   S   +R AL+ TA  I
Sbjct: 220 MSGTSMATPHVAGVAALLYS----QG-PLSASEVRDALKKTADDI 259



 Score = 35.7 bits (83), Expect = 0.063
 Identities = 19/52 (36%), Positives = 26/52 (50%), Gaps = 6/52 (11%)

Query: 624 SHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCIDGKPKVIERYDCGGAGDV 675
            HGTHVA I AA   +     GVAP A+I+ +      KV++    G   D+
Sbjct: 69  GHGTHVAGIIAAATNNGTGVAGVAPKAKIMPV------KVLDANGSGSLADI 114



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 21/62 (33%), Positives = 27/62 (43%), Gaps = 11/62 (17%)

Query: 389 DGRGVVIAIFDSGVDPGAAGLQVTSDGKPKVIERYDCGGAGDVDTSTVVKVDDTNHITGL 448
            G GV +A+ D+GVDP    L      K K +  YD     D D       DD  H T +
Sbjct: 26  GGSGVTVAVVDTGVDPTHPDL-----LKVKFVLGYDFVD-NDSDAM-----DDNGHGTHV 74

Query: 449 SG 450
           +G
Sbjct: 75  AG 76


>gnl|CDD|173815 cd07490, Peptidases_S8_6, Peptidase S8 family domain,
           uncharacterized subfamily 6.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 254

 Score = 39.5 bits (92), Expect = 0.004
 Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 4/42 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
           ++GTSM+APH  G  AL    L       SP  I+ AL  TA
Sbjct: 216 LSGTSMAAPHVAGVAAL----LAAAHPDLSPEQIKDALTETA 253


>gnl|CDD|173791 cd04842, Peptidases_S8_Kp43_protease, Peptidase S8 family domain in
           Kp43 proteases.  Kp43 proteases are members of the
           peptidase S8 or Subtilase clan of proteases. They have
           an Asp/His/Ser catalytic triad similar to that found in
           trypsin-like proteases, but do not share their
           three-dimensional structure (an example of convergent
           evolution). Kp43 is topologically similar to kexin and
           furin both of which are proprotein convertases, but
           differ in amino acids sequence and the position of its
           C-terminal barrel.  Kp43 has 3 Ca2+ binding sites that
           differ from the corresponding sites in the other known
           subtilisin-like proteases.  KP-43 protease is known to
           be an oxidation-resistant protease when compared with
           the other subtilisin-like proteases.
          Length = 293

 Score = 38.1 bits (89), Expect = 0.011
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 3/33 (9%)

Query: 625 HGTHVASIAAAYFPDEPEK---NGVAPGAQIIS 654
           HGTHVA I A    D        GVAP A++  
Sbjct: 56  HGTHVAGIIAGKGNDSSSISLYKGVAPKAKLYF 88



 Score = 37.7 bits (88), Expect = 0.013
 Identities = 9/37 (24%), Positives = 14/37 (37%)

Query: 3   SSSSSNKETQHMNGTSMSAPHATGCVALIISALQQQG 39
              +S+      +GTSM+ P   G  AL+        
Sbjct: 236 IGDTSDSAYTSKSGTSMATPLVAGAAALLRQYFVDGY 272



 Score = 29.6 bits (67), Expect = 5.3
 Identities = 8/51 (15%), Positives = 17/51 (33%), Gaps = 5/51 (9%)

Query: 388 YDGRGVVIAIFDSGVDPG-----AAGLQVTSDGKPKVIERYDCGGAGDVDT 433
             G+G ++ + D+G+D             T+    K++         D   
Sbjct: 4   LTGKGQIVGVADTGLDTNHCFFYDPNFNKTNLFHRKIVRYDSLSDTKDDVD 54


>gnl|CDD|173820 cd07496, Peptidases_S8_13, Peptidase S8 family domain,
           uncharacterized subfamily 13.  This family is a member
           of the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 285

 Score = 37.3 bits (87), Expect = 0.020
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 14  MNGTSMSAPHATGCVALIISA 34
           + GTSM+APH  G  AL+ S 
Sbjct: 249 LQGTSMAAPHVAGVAALMKSV 269



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 21/56 (37%), Positives = 25/56 (44%), Gaps = 20/56 (35%)

Query: 623 SSHGTHVASIAAAYFPDEPEKN------GVAPGAQIISLCIDGKPKVIERYDCGGA 672
           S HGTHVA   AA        N      GVA GA+I+ + + GK        CGG 
Sbjct: 71  SWHGTHVAGTIAA------VTNNGVGVAGVAWGARILPVRVLGK--------CGGT 112


>gnl|CDD|173789 cd04059, Peptidases_S8_Protein_convertases_Kexins_Furin-like,
           Peptidase S8 family domain in Protein convertases.
           Protein convertases, whose members include furins and
           kexins, are members of the peptidase S8 or Subtilase
           clan of proteases. They have an Asp/His/Ser catalytic
           triad that is not homologous to trypsin. Kexins are
           involved in the activation of peptide hormones, growth
           factors, and viral proteins.  Furin cleaves cell surface
           vasoactive peptides and proteins involved in
           cardiovascular tissue remodeling in the TGN, at cell
           surface, or in endosomes but rarely in the ER.  Furin
           also plays a key role in blood pressure regulation
           though the activation of transforming growth factor
           (TGF)-beta. High specificity is seen for cleavage after
           dibasic (Lys-Arg or Arg-Arg) or multiple basic residues
           in protein convertases.  There is also strong sequence
           conservation.
          Length = 297

 Score = 36.4 bits (85), Expect = 0.036
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 11  TQHMNGTSMSAPHATGCVALIISA 34
           T   NGTS +AP A G +AL++ A
Sbjct: 256 TSSHNGTSAAAPLAAGVIALMLEA 279



 Score = 31.0 bits (71), Expect = 1.9
 Identities = 9/26 (34%), Positives = 15/26 (57%), Gaps = 1/26 (3%)

Query: 379 LNVLSKHPE-YDGRGVVIAIFDSGVD 403
           LNV     +   G+GV +A+ D G++
Sbjct: 26  LNVTPAWEQGITGKGVTVAVVDDGLE 51



 Score = 30.6 bits (70), Expect = 2.6
 Identities = 18/61 (29%), Positives = 24/61 (39%), Gaps = 14/61 (22%)

Query: 623 SSHGTHVASIAAAYFPDEPEKN------GVAPGAQIISLCIDGKPK--VIERYDCGGAGD 674
           +SHGT  A   AA        N      GVAPGA++  + +       V+E    G   D
Sbjct: 84  NSHGTRCAGEIAA------VGNNGICGVGVAPGAKLGGIRMLDGDVTDVVEAESLGLNPD 137

Query: 675 V 675
            
Sbjct: 138 Y 138


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 34.2 bits (79), Expect = 0.067
 Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 15/79 (18%)

Query: 471 KNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHA-DAKNLSRENKLL 529
           K   E   + LQE ++++            A  EAQ+  +  + KHA D + L      L
Sbjct: 19  KEDAEEKIQKLQEDLEEQ---------AEIAN-EAQQKYERELVKHAEDIEELQA----L 64

Query: 530 KEELESMVESLNNLEKKFN 548
           +++L  + + +  L+ +  
Sbjct: 65  RKQLNELKKEIAQLKAEAE 83


>gnl|CDD|224322 COG1404, AprE, Subtilisin-like serine proteases [Posttranslational
           modification, protein turnover, chaperones].
          Length = 508

 Score = 35.3 bits (80), Expect = 0.12
 Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 10/140 (7%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA-QYIPSVEPYAQGFGLLQ 72
           ++GTSM+APH +G  AL++SA        +P  +R  +  TA     S      G GL  
Sbjct: 358 LSGTSMAAPHVSGVAALVLSANPN---ELTPAQVRNLIVTTAGLTPLSGVDNLVGGGLAN 414

Query: 73  VEKALEWLEKYHAELESKVRFHVTCAGSSSKNKGIHLRVGDQQVPKEVNVSVEPVFADSD 132
           ++ A              +  + T +   +      L +    +   V  +V      + 
Sbjct: 415 LDAA------ATDVAGGTLPANGTGSSQGAVEAPGTLALLTPALGGTVPAAVAVAGTVTV 468

Query: 133 NIVFDPEIKYNFQMSLSLTC 152
            +     I        S   
Sbjct: 469 GLGTAGTIALAALGGASAAL 488



 Score = 30.7 bits (68), Expect = 3.0
 Identities = 16/32 (50%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 624 SHGTHVAS-IAAAYFPDEPEKNGVAPGAQIIS 654
            HGTHVA  IAA  F +     GVAPGA+++ 
Sbjct: 184 GHGTHVAGTIAAVIFDNGAGVAGVAPGAKLLL 215


>gnl|CDD|202716 pfam03646, FlaG, FlaG protein.  Although important for flagella the
           exact function of this protein is unknown.
          Length = 107

 Score = 33.0 bits (76), Expect = 0.12
 Identities = 12/50 (24%), Positives = 19/50 (38%)

Query: 501 AQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESLNNLEKKFNCH 550
             A  +             +  + E ++ KEELE  VE LN   +  N +
Sbjct: 7   QTAAEKVVSAAQAVAEEAEETAAEEKEVSKEELEEAVEKLNKFLQSLNTN 56


>gnl|CDD|173805 cd07479, Peptidases_S8_SKI-1_like, Peptidase S8 family domain in
           SKI-1-like proteins.  SKI-1 (type I membrane-bound
           subtilisin-kexin-isoenzyme) proteins are secretory
           Ca2+-dependent serine proteinases cleave at nonbasic
           residues: Thr, Leu, and Lys.  SKI-1s play a critical
           role in the regulation of the synthesis and metabolism
           of cholesterol and fatty acid metabolism.   Members of
           the peptidases S8 and S35 clan include endopeptidases,
           exopeptidases and also a tripeptidyl-peptidase. The S8
           family has an Asp/His/Ser catalytic triad similar to
           that found in trypsin-like proteases, but do not share
           their three-dimensional structure and are not homologous
           to trypsin. The S53 family contains a catalytic triad
           Glu/Asp/Ser. The stability of these enzymes may be
           enhanced by calcium, some members have been shown to
           bind up to 4 ions via binding sites with different
           affinity. Some members of this clan contain disulfide
           bonds. These enzymes can be intra- and extracellular,
           some function at extreme temperatures and pH values.
          Length = 255

 Score = 34.4 bits (79), Expect = 0.14
 Identities = 15/47 (31%), Positives = 31/47 (65%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPS 60
           ++GTS+++P   G VAL++S + ++    +P S+++AL  +A  +P 
Sbjct: 209 LSGTSVASPVVAGAVALLLSTVPEKRDLINPASMKQALIESATRLPG 255


>gnl|CDD|173817 cd07492, Peptidases_S8_8, Peptidase S8 family domain,
           uncharacterized subfamily 8.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 222

 Score = 33.5 bits (77), Expect = 0.23
 Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 15  NGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
           +G S +APH TG VAL++S             ++R L+  A
Sbjct: 185 SGNSFAAPHVTGMVALLLSE----KPDIDANDLKRLLQRLA 221


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 33.7 bits (78), Expect = 0.26
 Identities = 13/53 (24%), Positives = 25/53 (47%), Gaps = 6/53 (11%)

Query: 482 QERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELE 534
            ERIQ+E + +       + + E QK L+  I +  + +   +E +  +E  E
Sbjct: 105 IERIQEEDEAE------AQEKREKQKKLREEIDEFNEERIERKEEEKEREREE 151


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 33.0 bits (76), Expect = 0.36
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 20/87 (22%)

Query: 479 KLLQERIQKERKEKLWDPSHRKAQAEA---QKNLQNFIQ-----KHADA---------KN 521
           KL+ E I +E K +L +P   KA+ E    +K L+++ +     K+  A         KN
Sbjct: 48  KLMAE-IAQENK-RLVEPLK-KAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKELKN 104

Query: 522 LSRENKLLKEELESMVESLNNLEKKFN 548
           L  E+++L++  E +    + L  KF 
Sbjct: 105 LKWESEVLEQRFEKVERERDELYDKFE 131


>gnl|CDD|173808 cd07482, Peptidases_S8_Lantibiotic_specific_protease, Peptidase S8
           family domain in Lantiobiotic (lanthionine-containing
           antibiotics) specific proteases.  Lantiobiotic
           (lanthionine-containing antibiotics) specific proteases
           are very similar in structure to serine proteases.
           Lantibiotics are ribosomally synthesised antimicrobial
           agents derived from ribosomally synthesised peptides
           with antimicrobial activities against Gram-positive
           bacteria. The proteases that cleave the N-terminal
           leader peptides from lantiobiotics include:  epiP, nsuP,
           mutP, and nisP.  EpiP, from Staphylococcus, is thought
           to cleave matured epidermin. NsuP, a dehydratase from
           Streptococcus and NisP, a membrane-anchored
           subtilisin-like serine protease from Lactococcus cleave
           nisin.  MutP is highly similar to epiP and nisP and is
           thought to process the prepeptide mutacin III of S.
           mutans. Members of the peptidases S8 (subtilisin and
           kexin) and S53 (sedolisin) clan include endopeptidases
           and  exopeptidases. The S8 family has an Asp/His/Ser
           catalytic triad similar to that found in trypsin-like
           proteases, but do not share their three-dimensional
           structure and are not homologous to trypsin. Serine acts
           as a nucleophile, aspartate as an electrophile, and
           histidine as a base. The S53 family contains a catalytic
           triad Glu/Asp/Ser with an additional acidic residue Asp
           in the oxyanion hole, similar to that of subtilisin.
           The serine residue here is the nucleophilic equivalent
           of the serine residue in the S8 family, while glutamic
           acid has the same role here as the histidine  base. 
           However, the aspartic acid residue that acts as an
           electrophile  is quite different.  In S53 the it follows
           glutamic acid, while in S8 it precedes histidine. The
           stability of these enzymes may be enhanced by calcium,
           some members have been shown to bind up to 4 ions via
           binding sites with different affinity.  There is a great
           diversity in the characteristics of their members: some
           contain disulfide bonds, some are intracellular while
           others are extracellular, some function at extreme
           temperatures, and others at high or low pH values.
          Length = 294

 Score = 33.1 bits (76), Expect = 0.39
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 14  MNGTSMSAPHATGCVALIIS 33
           M GTS++AP  +G +ALII 
Sbjct: 257 MYGTSLAAPKVSGALALIID 276


>gnl|CDD|173809 cd07483, Peptidases_S8_Subtilisin_Novo-like, Peptidase S8 family
           domain in Subtilisin_Novo-like proteins.  Subtilisins
           are a group of alkaline proteinases originating from
           different strains of Bacillus subtilis.  Novo is one of
           the strains that produced enzymes belonging to this
           group.  The enzymes obtained from the Novo and BPN'
           strains are identical.  The Carlsburg and Novo
           subtilisins are thought to have arisen from a common
           ancestral protein.  They have similar peptidase and
           esterase activities, pH profiles, catalyze
           transesterification reactions, and are both inhibited by
           diispropyl fluorophosphate, though they differ in 85
           positions in the amino acid sequence.  Members of the
           peptidases S8 and S35 clan include endopeptidases,
           exopeptidases and also a tripeptidyl-peptidase. The S8
           family has an Asp/His/Ser catalytic triad similar to
           that found in trypsin-like proteases, but do not share
           their three-dimensional structure and are not homologous
           to trypsin. The S53 family contains a catalytic triad
           Glu/Asp/Ser with an additional acidic residue Asp in the
           oxyanion hole, similar to that of subtilisin.. The
           stability of these enzymes may be enhanced by calcium,
           some members have been shown to bind up to 4 ions via
           binding sites with different affinity. Some members of
           this clan contain disulfide bonds. These enzymes can be
           intra- and extracellular, some function at extreme
           temperatures and pH values.
          Length = 291

 Score = 33.1 bits (76), Expect = 0.47
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 624 SHGTHVASIAAAYFPDEPEKNGVAPGAQIISLCI 657
            HGTHVA I AA   +    +GVA   +I+ L I
Sbjct: 86  DHGTHVAGIIAAVRDNGIGIDGVADNVKIMPLRI 119


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 31.4 bits (72), Expect = 0.64
 Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 14/65 (21%)

Query: 482 QERIQKERKEKLWDPSHRKAQAE------------AQKNLQNFIQKHADAKNLSRENKLL 529
           QE ++KE  +++     +  Q E            A++ L+   +   +A+ L+R  +LL
Sbjct: 28  QENLEKE--QEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEALTRRIQLL 85

Query: 530 KEELE 534
           +EELE
Sbjct: 86  EEELE 90


>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 628

 Score = 32.6 bits (75), Expect = 0.83
 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 4/74 (5%)

Query: 288 TLTSPTMYIDSESRSLSLIL-IPVIFWLINSLRLNSSHLSFLTSHNTLTNQ---IQLSSS 343
               P + +    +SL LI+ I    +  + L   SS+       + L+NQ   + +S S
Sbjct: 498 VFNLPPLTLPLILKSLPLIVTIIGAAFSFSFLLSLSSNPINSNFFSFLSNQWFFVNISHS 557

Query: 344 DSTNPTMTSSSSFS 357
              + +  +S   S
Sbjct: 558 IIISFSFFTSLFGS 571



 Score = 29.9 bits (68), Expect = 6.0
 Identities = 17/75 (22%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 291 SPTMYIDSESRSLSLILIPVIFWLINSLRLNSSHLSFLTSHNTLTNQIQLSSSDSTNPTM 350
           +P+M I S + S+ +IL+  IF+  N   +++S+ S   ++N   + I  ++S+ +   +
Sbjct: 4   NPSMIISSINLSIFIILLISIFFSSNLPNISTSNASLGGNNNN-NSIINYNNSNFSISLL 62

Query: 351 TSSSSFSITPPIIEF 365
            + +  S+   +I  
Sbjct: 63  KTLAFLSLINLLIWI 77


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 32.5 bits (74), Expect = 0.90
 Identities = 15/77 (19%), Positives = 31/77 (40%), Gaps = 10/77 (12%)

Query: 479 KLLQERIQKERKEKLWDPSHRKAQAEAQKNLQN-------FIQKHADAKNLSRENKLLKE 531
           K LQ   +   K++  + +  + + E +K L+          ++     NL  E K  +E
Sbjct: 132 KQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEE 191

Query: 532 ELESM---VESLNNLEK 545
           + ES    ++ L    +
Sbjct: 192 QRESKWAILKKLKRRAE 208



 Score = 30.9 bits (70), Expect = 2.6
 Identities = 13/72 (18%), Positives = 31/72 (43%), Gaps = 3/72 (4%)

Query: 475 ELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQN-FIQKHADAKNLSRENKLLKEEL 533
           E   ++L    + E++       +R A+ +    L+   I +  + KN    N+ +K+  
Sbjct: 35  EDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKN--LFNEQIKQFE 92

Query: 534 ESMVESLNNLEK 545
            ++ + +  LE 
Sbjct: 93  LALQDEIAKLEA 104



 Score = 30.2 bits (68), Expect = 3.7
 Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 10/82 (12%)

Query: 475 ELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADA---------KNLSRE 525
           E   K  Q R  K+ KE          Q + QKNL N   K  +          + L   
Sbjct: 50  EANEKRAQYRSAKK-KELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELL 108

Query: 526 NKLLKEELESMVESLNNLEKKF 547
           N    +ELE + + L+ L K+ 
Sbjct: 109 NLEKDKELELLEKELDELSKEL 130


>gnl|CDD|177142 MTH00070, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 346

 Score = 32.2 bits (74), Expect = 1.0
 Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 9/50 (18%)

Query: 327 FLT--SHNTLTNQIQLSSSDSTNPTMTSSSSFSIT-----PPIIEF-PKW 368
           FL   + N+ TN  +LSSS S NPT+TS    ++      PP   F PKW
Sbjct: 216 FLMLMNLNS-TNINKLSSSWSKNPTLTSMMMLTLLSLGGLPPTTGFIPKW 264


>gnl|CDD|225533 COG2986, HutH, Histidine ammonia-lyase [Amino acid transport and
           metabolism].
          Length = 498

 Score = 32.2 bits (74), Expect = 1.1
 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 30  LIISALQQQGLSYSPYSIRRALEIT--AQYIPSVEPY--AQGFGLLQVEKALEWLEKYHA 85
           +  SA Q+  +S + ++ R+ LEI    + I ++E    AQ   L          +  +A
Sbjct: 406 IPTSANQEDHVSMATHAARKLLEIIENLRTILAIELLAAAQAVDLRAPLDLSPGTKAAYA 465

Query: 86  ELESKVRF 93
            +  KV F
Sbjct: 466 AVREKVPF 473


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 30.6 bits (70), Expect = 1.1
 Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 10/73 (13%)

Query: 479 KLLQERIQKERKEKLWDPSHRKAQAEA-QKNLQNFIQK-HADAKNLSR-ENKLLKEELES 535
           K  Q++++KE K+       R+A+ E  +K LQ   +K   DA  LS    +  ++EL+ 
Sbjct: 17  KAAQKQLEKEFKK-------RQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQK 69

Query: 536 MVESLNNLEKKFN 548
            V+     ++K  
Sbjct: 70  KVQEFQRKQQKLQ 82



 Score = 28.3 bits (64), Expect = 7.8
 Identities = 12/45 (26%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 508 NLQNFIQKHADAKN----LSRENKLLKEELESMVESLNNLEKKFN 548
           ++Q  +Q+    K     L +E K  + ELE + + L  L++K  
Sbjct: 5   DVQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQ 49


>gnl|CDD|200086 TIGR01225, hutH, histidine ammonia-lyase.  This enzyme deaminates
           histidine to urocanic acid, the first step in histidine
           degradation. It is closely related to phenylalanine
           ammonia-lyase [Energy metabolism, Amino acids and
           amines].
          Length = 506

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 4/64 (6%)

Query: 33  SALQQQGLSYSPYSIRRALEITA--QYIPSVEPYA--QGFGLLQVEKALEWLEKYHAELE 88
           SA Q+  +S   ++ R+  E+    + + ++E  A  QG       K    LEK +  + 
Sbjct: 407 SANQEDHVSMGAHAARKLREMIENVRRVIAIELLAACQGLEFRDPLKTSAKLEKAYQAVR 466

Query: 89  SKVR 92
           S V 
Sbjct: 467 SVVA 470


>gnl|CDD|173822 cd07498, Peptidases_S8_15, Peptidase S8 family domain,
           uncharacterized subfamily 15.  This family is a member
           of the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 242

 Score = 31.2 bits (71), Expect = 1.5
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 10/58 (17%)

Query: 393 VVIAIFDSGVDPGAAGLQVTSDGKPKVIERYDCGGAGDVDTSTVVKVDDTNHITGLSG 450
           VV+AI D+GVD     L     GKPK++  ++       +  T    D   H T  +G
Sbjct: 1   VVVAIIDTGVDLNHPDLS----GKPKLVPGWN---FVSNNDPT---SDIDGHGTACAG 48


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 17/80 (21%), Positives = 30/80 (37%), Gaps = 15/80 (18%)

Query: 479 KLLQERIQKERKEKLWDPSHRKAQAEAQK----NLQNFIQK-------HADAKNLSRENK 527
           KL+Q   +  +KE      H++AQ E ++      +N+ ++           K L    K
Sbjct: 69  KLIQLENELMQKEL----EHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEK 124

Query: 528 LLKEELESMVESLNNLEKKF 547
             + E     E    L+ K 
Sbjct: 125 KAENEAAEAEEEAKLLKDKL 144



 Score = 31.1 bits (70), Expect = 2.5
 Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 17/105 (16%)

Query: 458 SWKNPTGDFHIGLKNVYELYPK---LLQERIQ-KERKE------KLWDPSHRKAQAEAQK 507
           SWK+   D  + L+   +L  +   L  E +Q KE+        K  + + ++ Q E QK
Sbjct: 323 SWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQK 382

Query: 508 NLQNFIQK-------HADAKNLSRENKLLKEELESMVESLNNLEK 545
            +   ++         A  + L R   L+ +E + +   LN+ +K
Sbjct: 383 AVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDK 427


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 31.9 bits (73), Expect = 1.5
 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 499 RKAQAEAQKNLQNFIQKHADAKN-LSRENKLLKEELESMVESLNNLEKKFNCHDL 552
           R    E  + L+N +++  D    L RENK L++E+  + + L    +  N H+L
Sbjct: 408 RNLSTELFR-LKNELEELKDQVEALRRENKNLQDEIHDLTDQLGEGGR--NVHEL 459



 Score = 31.6 bits (72), Expect = 2.0
 Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 8/63 (12%)

Query: 488 ERKEKLWDPSHRKAQAEAQKNLQNFIQK----HADAKNLSRENKLLKEELESMVESLNNL 543
           E+K+K +D    K  AE ++ +     +      +A+NLS E   LK ELE + + +  L
Sbjct: 376 EKKQKNFD----KILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEAL 431

Query: 544 EKK 546
            ++
Sbjct: 432 RRE 434


>gnl|CDD|173818 cd07493, Peptidases_S8_9, Peptidase S8 family domain,
           uncharacterized subfamily 9.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 261

 Score = 31.1 bits (71), Expect = 1.7
 Identities = 12/42 (28%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITA 55
            NGTS S P   G +A +  A      +++   I+ A+  +A
Sbjct: 223 ANGTSFSCPLIAGLIACLWQAHP----NWTNLQIKEAILKSA 260


>gnl|CDD|237005 PRK11864, PRK11864, 2-ketoisovalerate ferredoxin oxidoreductase
           subunit beta; Provisional.
          Length = 300

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 490 KEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSREN-KLLKEELESMVESLNNLEKKFN 548
           K KL  PS      + +K ++ +++     K+L+ E  K L+EE++ M E +  L KKF 
Sbjct: 241 KFKLNSPSKTLLDKKKRKPVEEYLKLQGRFKHLTEEEIKGLQEEIDEMWEEIKKLAKKFG 300


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 29.5 bits (67), Expect = 1.7
 Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 17/88 (19%)

Query: 478 PKLLQERIQKERKEKLW-------DPSHRKAQAEAQ--KNLQNFIQK--------HADAK 520
           P+ ++E+++K   + L        D   R+ Q E +  +  +N + K          DA+
Sbjct: 11  PEAVKEKLRKRGVDVLDVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAE 70

Query: 521 NLSRENKLLKEELESMVESLNNLEKKFN 548
            L  E K LK+EL+++   L  LE + +
Sbjct: 71  ALIAEVKELKDELKALEAELRELEAELD 98


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.6 bits (72), Expect = 1.7
 Identities = 12/70 (17%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 479 KLLQERIQKERKEKLWDPSH--RKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESM 536
           K++++ +  +  E  +       K      + L+ FI++  + + L +E    ++ELE +
Sbjct: 149 KVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEK---EKELEEV 205

Query: 537 VESLNNLEKK 546
           +  +N +  +
Sbjct: 206 LREINEISSE 215



 Score = 29.3 bits (66), Expect = 8.3
 Identities = 16/77 (20%), Positives = 37/77 (48%), Gaps = 4/77 (5%)

Query: 470 LKNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLL 529
           LK   E Y KL +    +E  ++L +   R ++ E +  +    ++  + +      + L
Sbjct: 288 LKEKAEEYIKLSEFY--EEYLDELREIEKRLSRLEEE--INGIEERIKELEEKEERLEEL 343

Query: 530 KEELESMVESLNNLEKK 546
           K++L+ + + L  LE++
Sbjct: 344 KKKLKELEKRLEELEER 360


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 482 QERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHA-DAKNLSRENKLLKEELESM 536
           QER+++  KE+L     +K QAE         QK A +A   +      K E E+ 
Sbjct: 100 QERLKQLEKERLAAQEQKK-QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAK 154


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 30.5 bits (69), Expect = 2.2
 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 5/79 (6%)

Query: 475 ELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQK----HADAKNLSRENKLLK 530
           EL   L +E+  KE K        ++ +   +K+ Q  I +    +  A  +  ++K   
Sbjct: 94  ELEQALEEEKEMKEEKRLHLQKEEQEQKMAKEKDKQEIIDELETSNLPANVIIAQHKKQS 153

Query: 531 EELESMVESLNNLEK-KFN 548
           ++LES VE L   ++  F+
Sbjct: 154 KQLESQVEKLERKKRVTFS 172


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 31.3 bits (71), Expect = 2.3
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 7/77 (9%)

Query: 472 NVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLKE 531
           N+     +LLQ  ++ E +EK+      + +AE ++ LQ  I+     +    E + L+E
Sbjct: 643 NLQAELEELLQAALE-ELEEKV-----EELEAEIRRELQ-RIENEEQLEEKLEELEQLEE 695

Query: 532 ELESMVESLNNLEKKFN 548
           ELE + E L  L KK  
Sbjct: 696 ELEQLREELEELLKKLG 712


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 11/74 (14%)

Query: 481 LQERIQKERKEKLWDPSHRKAQA--EAQKNLQNFIQKHADAKNLSRENKLLKEELESMVE 538
           LQ  I++  K +L D   R+       ++ L+         K L ++ K  KEE++ +  
Sbjct: 69  LQNTIERL-KTQLED-LERELALLQAKERQLE------KKLKTLEQKLKNEKEEVQRLKN 120

Query: 539 SLNNLEKKFNCHDL 552
            +   + ++N H+L
Sbjct: 121 IIQQRKTQYN-HEL 133


>gnl|CDD|151045 pfam10482, CtIP_N, Tumour-suppressor protein CtIP N-terminal
           domain.  CtIP is predominantly a nuclear protein that
           complexes with both BRCA1 and the BRCA1-associated RING
           domain protein (BARD1). At the protein level, CtIP
           expression varies with cell cycle progression in a
           pattern identical to that of BRCA1. Thus, the
           steady-state levels of CtIP polypeptides, which remain
           low in resting cells and G1 cycling cells, increase
           dramatically as Dividing cells traverse the G1/S
           boundary. CtIP can potentially modulate the functions
           ascribed to BRCA1 in transcriptional regulation, DNA
           repair, and/or cell cycle checkpoint control. This
           N-terminal domain carries a coiled-coil region and is
           essential for homodimerisation of the protein. The
           C-terminal domain is family pfam08573.
          Length = 120

 Score = 29.5 bits (66), Expect = 2.5
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 4/40 (10%)

Query: 499 RKAQAE----AQKNLQNFIQKHADAKNLSRENKLLKEELE 534
           +K Q E     Q+NLQ   +   + K L+ EN+ L EEL+
Sbjct: 81  KKKQQELENNHQQNLQVISELTNEMKTLTEENRRLSEELK 120


>gnl|CDD|240338 PTZ00262, PTZ00262, subtilisin-like protease; Provisional.
          Length = 639

 Score = 31.1 bits (70), Expect = 2.7
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 4/64 (6%)

Query: 14  MNGTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQYIPSVEPYAQGFGLLQV 73
           +NGTSM+APH     +LI+S      LSY    + R L+ +   +PS++   +  G L +
Sbjct: 553 LNGTSMAAPHVAAIASLILSI--NPSLSYE--EVIRILKESIVQLPSLKNKVKWGGYLDI 608

Query: 74  EKAL 77
             A+
Sbjct: 609 HHAV 612


>gnl|CDD|227498 COG5170, CDC55, Serine/threonine protein phosphatase 2A, regulatory
           subunit [Signal transduction mechanisms].
          Length = 460

 Score = 30.8 bits (69), Expect = 2.9
 Identities = 26/111 (23%), Positives = 45/111 (40%), Gaps = 6/111 (5%)

Query: 237 TTKHHFVLVPKEATIAVLKIRSNSLEAQGKFILHCTQHKPK---LYMAVEVHKVTLTSPT 293
           T ++HF+L   + TI + KI   +L+   +  L  + H P    L    E+    L+   
Sbjct: 97  TGRNHFLLSTNDKTIKLWKIYEKNLKVVAENNLSDSFHSPMGGPLTSTKELLLPRLSEHD 156

Query: 294 MYIDSESRSLSLILIPVIFWLINSLRLNSSHLSFLTSHNTLTNQIQLSSSD 344
             I ++   +     P     INS+  NS   + L++ +   N   L   D
Sbjct: 157 EIIAAKPCRVYANAHPYH---INSISFNSDKETLLSADDLRINLWNLEIID 204


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 30.7 bits (69), Expect = 3.1
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 13   HMNGTSMSAPHATGCVALIIS-ALQQQGLSYSPYSIRRALEI-TAQYIPSV 61
             +    + A HA+G V++ IS AL+    ++SP   RRALE   +  IPS 
Sbjct: 1565 RLEQMRVEALHASGRVSIEISDALETLARTHSPEGARRALEYGVSIVIPSA 1615


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 9/59 (15%), Positives = 23/59 (38%), Gaps = 11/59 (18%)

Query: 482 QERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESL 540
           ++  ++  +E        +A+ E ++  +    +      +  E +   EEL + V  L
Sbjct: 82  KKEAEQIAEEIK-----AEAEEELERIKEAAEAE------IEAEKERALEELRAEVAEL 129


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 29.3 bits (66), Expect = 3.5
 Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 14/73 (19%)

Query: 482 QERIQ---KERKEKLWDPSHRKAQAEAQKN-----LQNFIQKHADAKNLSRENKLLKEEL 533
           +E I+   K+RKE       ++             LQN    +   + L +EN+ LK + 
Sbjct: 60  EEAIELAKKQRKEL------KREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQN 113

Query: 534 ESMVESLNNLEKK 546
           ES+ +    LEK+
Sbjct: 114 ESLQKRNEELEKE 126


>gnl|CDD|227439 COG5108, RPO41, Mitochondrial DNA-directed RNA polymerase
           [Transcription].
          Length = 1117

 Score = 30.7 bits (69), Expect = 3.7
 Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 10/80 (12%)

Query: 481 LQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLL---KEELESMV 537
             E+      E++W     + +   +KN     +K+A+A NLS    ++   KEE+  ++
Sbjct: 255 WLEKSAIALAEEMWKAGIERFEKRGEKN----REKNAEASNLSWYRMIIHLHKEEIAGLL 310

Query: 538 ESL---NNLEKKFNCHDLGP 554
           E+L    +L K       GP
Sbjct: 311 EALLAEASLGKAGKRLTYGP 330


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 29.8 bits (67), Expect = 4.0
 Identities = 18/74 (24%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 471 KNVYELYPKLLQERIQKERKEKLWDPSHRKA---QAEAQKNLQNFIQKHADAKNLSRENK 527
           K +  L   L QER+++ + ++       +    + E  ++ ++ ++K  + K   +E +
Sbjct: 131 KEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELE 190

Query: 528 LLKEELESMVESLN 541
            LKEELE ++E L 
Sbjct: 191 ELKEELEELLEELE 204


>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO.  PilO proteins are
           involved in the assembly of pilin. However, the precise
           function of this family of proteins is not known.
          Length = 144

 Score = 29.0 bits (66), Expect = 4.1
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 4/46 (8%)

Query: 500 KAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESLNNLEK 545
           +AQA+     Q F +K   A NL       K++LE + E    L K
Sbjct: 3   RAQAKEATLKQEFEEKARQAANL----PAYKKQLEELEERFGALLK 44


>gnl|CDD|233142 TIGR00826, EIIB_glc, PTS system, glucose-like IIB component.  The
           PTS Glucose-Glucoside (Glc) Family (TC 4.A.1) Bacterial
           PTS transporters transport and concomitantly
           phosphorylate their sugar substrates, and typically
           consist of multiple subunits or protein domains. The Glc
           family includes permeases specific for glucose,
           N-acetylglucosamine and a large variety of a- and
           b-glucosides. However, not all b-glucoside PTS permeases
           are in this class, as the cellobiose (Cel) b-glucoside
           PTS permease is in the Lac family (TC #4.A.3). These
           permeases show limited sequence similarity with members
           of the Fru family (TC #4.A.2). Several of the E. coli
           PTS permeases in the Glc family lack their own IIA
           domains and instead use the glucose IIA protein (IIAglc
           or Crr). Most of these permeases have the B and C
           domains linked together in a single polypeptide chain,
           and a cysteyl residue in the IIB domain is
           phosphorylated by direct phosphoryl transfer from
           IIAglc(his~P). Those permeases which lack a IIA domain
           include the maltose (Mal), arbutin-salicin-cellobiose
           (ASC), trehalose (Tre), putative glucoside (Glv) and
           sucrose (Scr) permeases of E. coli. Most, but not all
           Scr permeases of other bacteria also lack a IIA domain.
           This model is specific for the IIB domain of the Glc
           family PTS transporters [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids, Signal
           transduction, PTS].
          Length = 88

 Score = 28.1 bits (63), Expect = 4.6
 Identities = 11/47 (23%), Positives = 19/47 (40%)

Query: 468 IGLKNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQ 514
            G  NV ++   + + R+      K+ D   +K  A     LQ  +Q
Sbjct: 38  GGTGNVNKVDACITRLRVGVADVSKVDDAVLKKLGAHGVNPLQRAVQ 84


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.6 bits (67), Expect = 4.9
 Identities = 14/72 (19%), Positives = 33/72 (45%), Gaps = 4/72 (5%)

Query: 479 KLLQE-RIQKERKEKLWDPSH---RKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELE 534
             +Q+  ++K+R E          +KA+AE +   +       + ++L  +   L+ E++
Sbjct: 10  LAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQ 69

Query: 535 SMVESLNNLEKK 546
            + E +   E+K
Sbjct: 70  EIRERIKRAEEK 81


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 11/39 (28%), Positives = 19/39 (48%)

Query: 503 AEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESLN 541
            + Q+ LQ   Q+    +   +E +L+ EELE + E   
Sbjct: 8   QQLQQQLQLLAQQIKQLETQLKEIELVLEELELLDEDTK 46


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 29.7 bits (67), Expect = 5.1
 Identities = 19/86 (22%), Positives = 34/86 (39%), Gaps = 19/86 (22%)

Query: 485 IQKERKEKLWDPSHR------KAQAEAQKNLQ---NFIQKHADAKN-----LSRENKLLK 530
           +QK  K  L + +        + + E  +  Q     IQ   +  +     L  EN+ L+
Sbjct: 83  LQKHNK-TLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELR 141

Query: 531 EELESMVESLN----NLEKKFNCHDL 552
           E+L+ ++E       + EK     DL
Sbjct: 142 EKLKELIEQYELREQHFEKLLKTKDL 167



 Score = 29.3 bits (66), Expect = 6.5
 Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 9/71 (12%)

Query: 470 LKNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAK--------N 521
           L+ + +   +LL+E  + E+  K       + Q E +  LQ+   K   A+         
Sbjct: 24  LETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKE-KDKLQSEHSKAILARSKLESLCRE 82

Query: 522 LSRENKLLKEE 532
           L + NK LKEE
Sbjct: 83  LQKHNKTLKEE 93


>gnl|CDD|188690 cd08736, RGS_RhoGEF-like, Regulator of G protein signaling (RGS)
           domain found in the Rho guanine nucleotide exchange
           factor (RhoGEF) protein.  The RGS domain found in the
           Rho guanine nucleotide exchange factor (RhoGEF) protein
           subfamily of the RGS domain containing protein family,
           which is a diverse group of multifunctional proteins
           that regulate cellular signaling events downstream of
           G-protein coupled receptors (GPCRs). RhoGEFs link
           signals from heterotrimeric G-alpha12/13 protein-coupled
           receptors to Rho GTPase activation, leading to various
           cellular responses, such as actin reorganization and
           gene expression. The RGS domain of the RhoGEFs has very
           little sequence similarity with the canonical RGS domain
           of the RGS proteins and therefore is often refered to as
           the RH (RGS Homology) domain. The RGS-GEFs subfamily
           includes the leukemia-associated RhoGEF (LARG),
           p115RhoGEF, and PDZ-RhoGEF. RGS proteins play critical
           regulatory role as GTPase activating proteins (GAPs) of
           the heterotrimeric G-protein G-alpha-subunits. RGS
           proteins play critical regulatory role as GTPase
           activating proteins (GAPs) of the heterotrimeric
           G-protein G-alpha-subunits. RGS proteins regulate many
           aspects of embryonic development such as glial
           differentiation, embryonic axis formation, skeletal and
           muscle development, cell migration during early
           embryogenesis, as well as apoptosis, cell proliferation,
           and modulation of cardiac development.
          Length = 120

 Score = 28.4 bits (64), Expect = 5.2
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 479 KLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQK 515
           K L   I +E   +++  +  +A  E Q+ L++F QK
Sbjct: 72  KRLPNLIDEEDLRRVFQEAQERAMPEIQEQLEDFRQK 108


>gnl|CDD|173797 cd05561, Peptidases_S8_4, Peptidase S8 family domain,
           uncharacterized subfamily 4.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 239

 Score = 29.6 bits (67), Expect = 5.4
 Identities = 15/41 (36%), Positives = 19/41 (46%), Gaps = 5/41 (12%)

Query: 16  GTSMSAPHATGCVALIISALQQQGLSYSPYSIRRALEITAQ 56
           GTS +AP  T  +AL    LQ   L+      R  L  TA+
Sbjct: 190 GTSFAAPFVTAALAL---LLQASPLAPDD--ARARLAATAK 225


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.4 bits (64), Expect = 5.9
 Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 11/60 (18%)

Query: 483 ERIQKERKEKLWDPSHRKAQAEAQKNLQNFI---QKHADAK---NLSRENKLLKEELESM 536
           ER QK   EKL      + Q +  + L   I   QK        +L  E K LK+E+E +
Sbjct: 32  ERQQKRTIEKL-----EQRQTQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKL 86


>gnl|CDD|216704 pfam01793, Glyco_transf_15, Glycolipid 2-alpha-mannosyltransferase.
            This is a family of alpha-1,2 mannosyl-transferases
           involved in N-linked and O-linked glycosylation of
           proteins. Some of the enzymes in this family have been
           shown to be involved in O- and N-linked glycan
           modifications in the Golgi.
          Length = 273

 Score = 29.4 bits (67), Expect = 6.2
 Identities = 7/18 (38%), Positives = 14/18 (77%)

Query: 531 EELESMVESLNNLEKKFN 548
            ELE +++S+ ++E +FN
Sbjct: 12  SELEGLLKSIRSVEDRFN 29


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 29.2 bits (66), Expect = 6.8
 Identities = 18/72 (25%), Positives = 31/72 (43%), Gaps = 18/72 (25%)

Query: 471 KNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLK 530
           +  Y+ + K L E+++ ER+         K  AE ++ L            L  + +LLK
Sbjct: 238 ERSYQEHVKQLIEKMEAERE---------KLLAEQERML---------EHKLQEQEELLK 279

Query: 531 EELESMVESLNN 542
           E  ++  ESL  
Sbjct: 280 EGFKTEAESLQK 291


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 28.8 bits (65), Expect = 7.1
 Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 4/70 (5%)

Query: 481 LQERIQKERKE----KLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESM 536
           LQE++ + ++E    K    +        QK L    Q  A+A  L  EN+ L+EEL  +
Sbjct: 85  LQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAEL 144

Query: 537 VESLNNLEKK 546
            +    LE +
Sbjct: 145 KQENEALEAE 154


>gnl|CDD|227489 COG5160, ULP1, Protease, Ulp1 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 578

 Score = 29.5 bits (66), Expect = 7.1
 Identities = 26/158 (16%), Positives = 49/158 (31%), Gaps = 26/158 (16%)

Query: 407 AGLQVTSDGKPKVIERYDCGGAGDVDTST--VVKVDDTNHITGLSGRKLKIPTSWKNPTG 464
             +  TS     +  + +   +  VD+ T  +       H+  +S           +P  
Sbjct: 82  PAVASTSPNNSIIESKLNLNQSSCVDSKTLDIYSHPTPKHMYFVS--------DHNHPKF 133

Query: 465 DFHIGLKNVYE---------LYPKLLQER---IQKERKEKLWDPSHRKAQAEAQKN-LQN 511
             +I  + +Y              LL++        R               +  N   N
Sbjct: 134 SLYIQQEVIYSPQVNLISDVELIILLKKSSFMAIISRDLFKV--KMILDVNLSSINRNTN 191

Query: 512 FIQKHADAKNLSRENKLLKEELESM-VESLNNLEKKFN 548
            + K   +  LS +N LL   + S+ +E L+  EK   
Sbjct: 192 ILIKSTASSILSIKNFLLMSYINSIKLEILDQSEKHLT 229



 Score = 29.1 bits (65), Expect = 9.0
 Identities = 21/147 (14%), Positives = 41/147 (27%), Gaps = 38/147 (25%)

Query: 298 SESRSLSLILIPVI-----FWLINS---LRLNSSHLSFLTSHNTLTNQIQLSSSD----- 344
            +   +S + + ++     F  I S    ++       L+S N  TN +  S++      
Sbjct: 145 PQVNLISDVELIILLKKSSFMAIISRDLFKVKMILDVNLSSINRNTNILIKSTASSILSI 204

Query: 345 ----------STNPTMTSSSSFSITPPIIEFPKWGLLPKKETGVLNVLSKHPEYDGRGVV 394
                     S    +   S   +T P         L  K+   L   +K+     R   
Sbjct: 205 KNFLLMSYINSIKLEILDQSEKHLTHPSSSPESSPSLNMKQ---LINSTKNAFKSLR--- 258

Query: 395 IAIFDSGVDPGAAGLQVTSDGKPKVIE 421
                         L   S    ++ +
Sbjct: 259 ---------LKKTKLSTNSPHSSQIHK 276


>gnl|CDD|178652 PLN03105, TCP24, transcription factor TCP24 (TEOSINTE BRANCHED1,
           CYCLOIDEA, AND PCF FAMILY 24); Provisional.
          Length = 324

 Score = 29.1 bits (64), Expect = 7.4
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 317 SLRLNSSHLSFLTSHNTLTNQIQLSSSDSTNPTMTSSSS--FSITPPIIE 364
           +L+ NS  L FL ++N +T +  +SSS S++  M S S   F  TPP ++
Sbjct: 243 TLQSNSQSL-FLNNNNNITQRSSISSSSSSSSPMDSQSISFFMATPPPLD 291


>gnl|CDD|216555 pfam01531, Glyco_transf_11, Glycosyl transferase family 11.  This
           family contains several fucosyl transferase enzymes.
          Length = 298

 Score = 29.1 bits (65), Expect = 7.8
 Identities = 29/148 (19%), Positives = 52/148 (35%), Gaps = 32/148 (21%)

Query: 444 HITGLSGRKLKIPTSWKNPTGDFHIGLKN--VYELYPK-LLQERIQKERKEKLWDPSHRK 500
           H+ G   +    P SW      +H GL+   +YE      L+E IQ            R 
Sbjct: 107 HLRGEYVKFTGYPCSWTF----YHHGLRQEILYEFTLHDHLREEIQN---------FLRG 153

Query: 501 AQAEAQKNLQNFIQKHAD----AKNLSRENKLLKEELESMVESLNNLEKKFNCHDLGPAY 556
            Q         F+  H         + +  K +  ++  ++++L+    +++      + 
Sbjct: 154 LQVNLGSRPSTFVGVHIRRGDYVDVMPKVWKGVVADINYLIQALDWFRARYS------SP 207

Query: 557 DVVVFHNGDYWC------ACVDTTETGD 578
             VVF +   WC      +C D    GD
Sbjct: 208 VFVVFSDDMEWCKKNIDTSCGDVYFAGD 235


>gnl|CDD|152704 pfam12269, zf-CpG_bind_C, CpG binding protein zinc finger C
           terminal domain.  This domain family is found in
           eukaryotes, and is approximately 240 amino acids in
           length. This domain is the zinc finger domain of a CpG
           binding DNA methyltransferase protein. It contains a
           CxxC motif which forms the zinc finger and binds to DNA.
          Length = 236

 Score = 29.0 bits (65), Expect = 7.9
 Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 10/68 (14%)

Query: 471 KNVYELYPKLLQERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLK 530
           K +YE+ P+ +QE          W+ +   A+   +K L+   +K   A+   +E +   
Sbjct: 3   KRIYEILPQRIQE----------WNLTPCVAEEIGKKALERIRKKQQVARTRLQELERRH 52

Query: 531 EELESMVE 538
           +ELE+++ 
Sbjct: 53  QELETIIS 60


>gnl|CDD|233089 TIGR00680, kdpA, K+-transporting ATPase, KdpA.  Kdp is a high
           affinity ATP-driven K+ transport system in Escherichia
           coli. It is composed of three membrane-bound subunits,
           KdpA, KdpB and KdpC and one small peptide, KdpF. KdpA is
           the K+-transporting subunit of this complex. During
           assembly of the complex, KdpA and KdpC bind to each
           other. This interaction is thought to stabilize the
           complex [medline:9858692]. Data indicates that KdpC
           might connect the KdpA, the K+-transporting subunit, to
           KdpB, the ATP-hydrolyzing (energy providing) subunit
           [medline:9858692] [Transport and binding proteins,
           Cations and iron carrying compounds].
          Length = 563

 Score = 29.1 bits (65), Expect = 8.5
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 5/33 (15%)

Query: 215 VLGSNPSSKPAVSWD-----QVDFKANTTKHHF 242
            L  NP+  PA SWD      V F  NT    +
Sbjct: 90  SLPLNPTGLPAPSWDLALNTAVSFVTNTNWQSY 122


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.4 bits (64), Expect = 8.9
 Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 8/62 (12%)

Query: 478 PKLLQERI-----QKERKEKLWDPSHR---KAQAEAQKNLQNFIQKHADAKNLSRENKLL 529
            KLL++ +     QK++ +K W        K +AE QK  +  ++K  D K   ++ K  
Sbjct: 141 EKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAK 200

Query: 530 KE 531
           K+
Sbjct: 201 KK 202


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 29.4 bits (66), Expect = 9.0
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 501  AQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESLNNLEKKFN 548
               E +    +F Q+ A+  N  + NK+ KE+  S    L+ LEK   
Sbjct: 1988 EYEELKDLRNSFNQEKAETLNNLKLNKI-KEDFNSYKNLLDELEKSVK 2034


>gnl|CDD|200466 cd11327, AmyAc_Glg_debranch_2, Alpha amylase catalytic domain found
           in glycogen debranching enzymes.  Debranching enzymes
           facilitate the breakdown of glycogen through
           glucosyltransferase and glucosidase activity. These
           activities are performed by a single enzyme in mammals,
           yeast, and some bacteria, but by two distinct enzymes in
           Escherichia coli and other bacteria. Debranching enzymes
           perform two activities, 4-alpha-D-glucanotransferase (EC
           2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33).
           4-alpha-D-glucanotransferase catalyzes the
           endohydrolysis of 1,6-alpha-D-glucoside linkages at
           points of branching in chains of 1,4-linked
           alpha-D-glucose residues. Amylo-alpha-1,6-glucosidase
           catalyzes the endohydrolysis of 1,6-alpha-D-glucoside
           linkages at points of branching in chains of 1,4-linked
           alpha-D-glucose residues. The catalytic triad (DED),
           which is highly conserved in other debranching enzymes,
           is not present in this group. The Alpha-amylase family
           comprises the largest family of glycoside hydrolases
           (GH), with the majority of enzymes acting on starch,
           glycogen, and related oligo- and polysaccharides. These
           proteins catalyze the transformation of alpha-1,4 and
           alpha-1,6 glucosidic linkages with retention of the
           anomeric center. The protein is described as having 3
           domains: A, B, C. A is a (beta/alpha) 8-barrel; B is a
           loop between the beta 3 strand and alpha 3 helix of A; C
           is the C-terminal extension characterized by a Greek
           key. The majority of the enzymes have an active site
           cleft found between domains A and B where a triad of
           catalytic residues (Asp, Glu and Asp) performs
           catalysis. Other members of this family have lost the
           catalytic activity as in the case of the human 4F2hc, or
           only have 2 residues that serve as the catalytic
           nucleophile and the acid/base, such as Thermus A4
           beta-galactosidase with 2 Glu residues (GH42) and human
           alpha-galactosidase with 2 Asp residues (GH31). The
           family members are quite extensive and include: alpha
           amylase, maltosyltransferase, cyclodextrin
           glycotransferase, maltogenic amylase, neopullulanase,
           isoamylase, 1,4-alpha-D-glucan maltotetrahydrolase,
           4-alpha-glucotransferase, oligo-1,6-glucosidase,
           amylosucrase, sucrose phosphorylase, and amylomaltase.
          Length = 478

 Score = 29.1 bits (66), Expect = 9.2
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 4/50 (8%)

Query: 35  LQQQGLSYSPYSIRRALEITAQYIPSVEPYAQGFGLLQVEKALEWLEKYH 84
           LQ+ G S SPYSI   LE+     P   P  +      VE+ ++ LEK  
Sbjct: 58  LQELGESNSPYSIADQLEL----NPDFFPDGKKKTFEDVEELVKKLEKEW 103


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 29.1 bits (66), Expect = 9.3
 Identities = 9/39 (23%), Positives = 22/39 (56%)

Query: 507 KNLQNFIQKHADAKNLSRENKLLKEELESMVESLNNLEK 545
            ++Q  +      K L  E+KL++++LE  +  L+ +++
Sbjct: 39  ADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDR 77


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 29.1 bits (65), Expect = 9.6
 Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 3/39 (7%)

Query: 427 GAGDVDTSTVVKVDDT---NHITGLSGRKLKIPTSWKNP 462
           G GD+D ST V +DD    +       RK+ IP+S  NP
Sbjct: 232 GGGDIDASTDVLMDDALLNDEARQPLSRKVSIPSSRINP 270


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.0 bits (65), Expect = 10.0
 Identities = 14/64 (21%), Positives = 27/64 (42%), Gaps = 7/64 (10%)

Query: 482 QERIQKERKEKLWDPSHRKAQAEAQKNLQNFIQKHADAKNLSRENKLLKEELESMVESLN 541
            E+  +++ E L      K Q + +K L+  +++        RE     E  +   E+L 
Sbjct: 523 LEKELEQKNEHL--EKLLKEQEKLKKELEQEMEEL-----KERERNKKLELEKEAQEALK 575

Query: 542 NLEK 545
            L+K
Sbjct: 576 ALKK 579


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.389 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,757,451
Number of extensions: 3393644
Number of successful extensions: 3869
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3838
Number of HSP's successfully gapped: 152
Length of query: 698
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 594
Effective length of database: 6,324,786
Effective search space: 3756922884
Effective search space used: 3756922884
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.5 bits)