RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4596
         (439 letters)



>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
           signaling molecules, often in tandem arrangements. May
           be responsible for specific protein-protein
           interactions, as most PDZ domains bind C-terminal
           polypeptides, and binding to internal (non-C-terminal)
           polypeptides and even to lipids has been demonstrated.
           In this subfamily of PDZ domains an N-terminal
           beta-strand forms the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in proteases.
          Length = 82

 Score = 84.9 bits (211), Expect = 2e-20
 Identities = 40/91 (43%), Positives = 50/91 (54%), Gaps = 11/91 (12%)

Query: 160 LQDVVLPK--SGPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRIL 217
           ++ V L K   G LGFS+ GG D           GIF+S + PGG A   G LR+GDRIL
Sbjct: 1   VRTVTLRKDPGGGLGFSLRGGKDS--------GGGIFVSRVEPGGPAER-GGLRVGDRIL 51

Query: 218 AVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
            VNG  V  +THEEAV  L     E+ L ++
Sbjct: 52  EVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82



 Score = 81.8 bits (203), Expect = 2e-19
 Identities = 35/90 (38%), Positives = 56/90 (62%), Gaps = 10/90 (11%)

Query: 258 EITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLE 317
            +TL +  G  LG  ++GG           +  G+F++++  GG A+R G L+VG R+LE
Sbjct: 3   TVTLRKDPGGGLGFSLRGG---------KDSGGGIFVSRVEPGGPAER-GGLRVGDRILE 52

Query: 318 VNGMSLLGASHQEAVNTLRYSGHQISLTVC 347
           VNG+S+ G +H+EAV  L+ SG +++LTV 
Sbjct: 53  VNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82



 Score = 47.9 bits (115), Expect = 2e-07
 Identities = 20/37 (54%), Positives = 29/37 (78%)

Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTVC 65
          VG R+LEVNG+S+ G +H+EAV  L+ SG +++LTV 
Sbjct: 46 VGDRILEVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
           called DHR (Dlg homologous region) or GLGF (relatively
           well conserved tetrapeptide in these domains). Some PDZs
           have been shown to bind C-terminal polypeptides; others
           appear to bind internal (non-C-terminal) polypeptides.
           Different PDZs possess different binding specificities.
          Length = 85

 Score = 80.1 bits (198), Expect = 1e-18
 Identities = 36/93 (38%), Positives = 49/93 (52%), Gaps = 10/93 (10%)

Query: 160 LQDVVLPK-SGPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILA 218
            + V L K  G LGFS++GG D           G+ +S +VPG  AA +G LR+GD IL 
Sbjct: 2   PRLVELEKGGGGLGFSLVGGKDEG--------GGVVVSSVVPGSPAAKAG-LRVGDVILE 52

Query: 219 VNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
           VNG  V  +TH EAV  L +   ++ L +    
Sbjct: 53  VNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRGG 85



 Score = 67.4 bits (165), Expect = 4e-14
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 10/88 (11%)

Query: 262 IRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGM 321
           + + G  LG  + GG              GV ++ +  G  A + G L+VG  +LEVNG 
Sbjct: 7   LEKGGGGLGFSLVGG---------KDEGGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGT 56

Query: 322 SLLGASHQEAVNTLRYSGHQISLTVCKG 349
           S+ G +H EAV+ L+ +G +++LTV +G
Sbjct: 57  SVEGLTHLEAVDLLKKAGGKVTLTVLRG 84



 Score = 40.8 bits (96), Expect = 7e-05
 Identities = 19/39 (48%), Positives = 29/39 (74%)

Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTVCKG 67
          VG  +LEVNG S+ G +H EAV+ L+ +G +++LTV +G
Sbjct: 46 VGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRG 84


>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF).  PDZ
           domains are found in diverse signaling proteins.
          Length = 80

 Score = 75.0 bits (185), Expect = 5e-17
 Identities = 42/89 (47%), Positives = 59/89 (66%), Gaps = 11/89 (12%)

Query: 162 DVVLPKS--GPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAV 219
           +V L KS  G LGFS++GG+D          PGIF+S ++PGG AA +G L+ GDRIL++
Sbjct: 1   EVTLEKSGRGGLGFSLVGGSDG--------DPGIFVSEVLPGG-AAEAGGLQEGDRILSI 51

Query: 220 NGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
           NG+D+  ++H+EAVLAL     E+ L I 
Sbjct: 52  NGQDLENLSHDEAVLALKGSGGEVTLTIL 80



 Score = 63.0 bits (154), Expect = 9e-13
 Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 10/89 (11%)

Query: 258 EITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLE 317
           E+TL +     LG  + GG            D G+F++++  GGAA+  G L+ G R+L 
Sbjct: 1   EVTLEKSGRGGLGFSLVGG---------SDGDPGIFVSEVLPGGAAEAGG-LQEGDRILS 50

Query: 318 VNGMSLLGASHQEAVNTLRYSGHQISLTV 346
           +NG  L   SH EAV  L+ SG +++LT+
Sbjct: 51  INGQDLENLSHDEAVLALKGSGGEVTLTI 79



 Score = 31.8 bits (73), Expect = 0.10
 Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTV 64
           G R+L +NG  L   SH EAV  L+ SG +++LT+
Sbjct: 44 EGDRILSINGQDLENLSHDEAVLALKGSGGEVTLTI 79


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
           or GLGF (after a conserved sequence motif). Many PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. Heterodimerization through
           PDZ-PDZ domain interactions adds to the domain's
           versatility, and PDZ domain-mediated interactions may be
           modulated dynamically through target phosphorylation.
           Some PDZ domains play a role in scaffolding
           supramolecular complexes. PDZ domains are found in
           diverse signaling proteins in bacteria, archebacteria,
           and eurkayotes. This CD contains two distinct structural
           subgroups with either a N- or C-terminal beta-strand
           forming the peptide-binding groove base. The circular
           permutation placing the strand on the N-terminus appears
           to be found in Eumetazoa only, while the C-terminal
           variant is found in all three kingdoms of life, and
           seems to co-occur with protease domains. PDZ domains
           have been named after PSD95(post synaptic density
           protein), DlgA (Drosophila disc large tumor suppressor),
           and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 51.2 bits (123), Expect = 1e-08
 Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 11/68 (16%)

Query: 169 GPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVT 228
           G LGFSI GGT+           G+ +  + PG  A  +G L+ GD ILAVNG DV  +T
Sbjct: 1   GGLGFSIRGGTE----------GGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLT 49

Query: 229 HEEAVLAL 236
            E+    L
Sbjct: 50  LEDVAELL 57



 Score = 43.1 bits (102), Expect = 8e-06
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 286 DKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLRYS-GHQISL 344
              + GV +  +  G  A+R G L+ G  +L VNG  +   + ++    L+   G +++L
Sbjct: 9   GGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTL 67

Query: 345 TV 346
           TV
Sbjct: 68  TV 69



 Score = 29.6 bits (67), Expect = 0.47
 Identities = 10/37 (27%), Positives = 19/37 (51%), Gaps = 1/37 (2%)

Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYS-GHQISLTV 64
           G  +L VNG  +   + ++    L+   G +++LTV
Sbjct: 33 AGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTV 69


>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
           tail-specific-, and tricorn proteases, which function in
           posttranslational protein processing, maturation, and
           disassembly or degradation, in Bacteria, Archaea, and
           plant chloroplasts. May be responsible for substrate
           recognition and/or binding, as most PDZ domains bind
           C-terminal polypeptides, and binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 85

 Score = 49.9 bits (120), Expect = 5e-08
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 4/80 (5%)

Query: 186 FGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALL-EPCPEIL 244
                 G+ I+ ++PG  AA +G ++ GD I+A++G  V  ++ E+ V  L  +   ++ 
Sbjct: 8   LKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVR 66

Query: 245 LKIQHDPHPEGFQEITLIRQ 264
           L ++        +E+TL R 
Sbjct: 67  LTLKRG--DGEPREVTLTRL 84



 Score = 32.6 bits (75), Expect = 0.061
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 8/76 (10%)

Query: 285 LDKNDEGVFITKINSGG-AAKRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLR-YSGHQI 342
           L  +D G+ IT +  G  AAK    +K G  ++ ++G  + G S ++ V  LR  +G ++
Sbjct: 8   LKYDDGGLVITSVLPGSPAAKAG--IKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKV 65

Query: 343 SLTVCKGFERGDIERR 358
            LT+    +RGD E R
Sbjct: 66  RLTL----KRGDGEPR 77


>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
           family.  This family consists of a set proteins various
           designated DegP, heat shock protein HtrA, and protease
           DO. The ortholog in Pseudomonas aeruginosa is designated
           MucD and is found in an operon that controls mucoid
           phenotype. This family also includes the DegQ (HhoA)
           paralog in E. coli which can rescue a DegP mutant, but
           not the smaller DegS paralog, which cannot. Members of
           this family are located in the periplasm and have
           separable functions as both protease and chaperone.
           Members have a trypsin domain and two copies of a PDZ
           domain. This protein protects bacteria from thermal and
           other stresses and may be important for the survival of
           bacterial pathogens.// The chaperone function is
           dominant at low temperatures, whereas the proteolytic
           activity is turned on at elevated temperatures [Protein
           fate, Protein folding and stabilization, Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 428

 Score = 53.0 bits (128), Expect = 2e-07
 Identities = 37/148 (25%), Positives = 61/148 (41%), Gaps = 20/148 (13%)

Query: 186 FGQHKP-GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEIL 244
            G  K  G  ++ ++PG  A  +G L+ GD I +VNG+ +S        +  L+P  ++ 
Sbjct: 251 LGLEKQRGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPISSFADLRRAIGTLKPGKKVT 309

Query: 245 LKIQHDPHPEGFQEITL-------IRQEGEKL-----GMHIKGGLRGQRGNPLDKNDEGV 292
           L I         + IT+        +             ++   +R +     D    GV
Sbjct: 310 LGILRKGKE---KTITVTLGASPEEQASSSNPFLGLTVANLSPEIRKELRLKGDVK--GV 364

Query: 293 FITKINSGGAAKRDGRLKVGMRLLEVNG 320
            +TK+ SG  A R G L+ G  +L VN 
Sbjct: 365 VVTKVVSGSPAARAG-LQPGDVILSVNQ 391



 Score = 39.9 bits (94), Expect = 0.002
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHD 250
           G+ ++ +V G  AA +G L+ GD IL+VN + VS V     VLA  +    + L I   
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVALLILRG 420


>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
           proteases, such as DegP/HtrA, which are oligomeric
           proteins involved in heat-shock response, chaperone
           function, and apoptosis. May be responsible for
           substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 90

 Score = 43.4 bits (103), Expect = 9e-06
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
           G+ ++ + PG  AA +G L+ GD ILAVNG+ V  V      LA L+P  ++ L + 
Sbjct: 25  GVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRRALAELKPGDKVTLTVL 80



 Score = 33.4 bits (77), Expect = 0.038
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 13/90 (14%)

Query: 276 GLRGQRGNP------LDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGASH- 328
           G+  Q   P        K+ +GV +  ++ G  A + G LK G  +L VNG  +   +  
Sbjct: 4   GVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADL 62

Query: 329 QEAVNTLRYSGHQISLTVCKGFERGDIERR 358
           + A+  L+  G +++LTV     RG  E  
Sbjct: 63  RRALAELK-PGDKVTLTV----LRGGKELT 87


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
           zinc metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 41.1 bits (97), Expect = 5e-05
 Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 184 IPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLE-PCPE 242
              G       I  +VPG  AA +G L+ GDRILA+NG+ +   + E+ V A+ E P   
Sbjct: 5   FVPGGPPIEPVIGEVVPGSPAAKAG-LKAGDRILAINGQKIK--SWEDLVDAVQENPGKP 61

Query: 243 ILLKIQHD 250
           + L ++ +
Sbjct: 62  LTLTVERN 69


>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
           membrane].
          Length = 406

 Score = 41.2 bits (97), Expect = 0.001
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 9/78 (11%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLAL-LEPCPEILLKIQHD 250
           G+ +   + G  AA +G ++ GD I+ ++G+ V  V+ +EAV  +  +P  ++ L I   
Sbjct: 113 GVKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTI--- 168

Query: 251 PHPEG---FQEITLIRQE 265
               G      +TL R+E
Sbjct: 169 -LRAGGGKPFTVTLTREE 185



 Score = 31.1 bits (71), Expect = 1.1
 Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 19/104 (18%)

Query: 250 DPH-----PEGFQEITL-IRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAA 303
           DPH     PE   E       E   +G+ ++            ++  GV +     G  A
Sbjct: 77  DPHSTYLDPEDAAEFRTDTSGEFGGIGIELQ-----------MEDIGGVKVVSPIDGSPA 125

Query: 304 KRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLR-YSGHQISLTV 346
            + G +K G  +++++G S+ G S  EAV  +R   G +++LT+
Sbjct: 126 AKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTI 168


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
           periplasmic, contain C-terminal PDZ domain
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 347

 Score = 38.3 bits (89), Expect = 0.005
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 191 PGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
            G  +  ++PG  AA +G ++ GD I AVNG+ V+ ++   A +A   P  E+ LK+ 
Sbjct: 270 AGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVALKLL 326



 Score = 31.0 bits (70), Expect = 1.1
 Identities = 14/68 (20%), Positives = 25/68 (36%), Gaps = 3/68 (4%)

Query: 280 QRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGA-SHQEAVNTLRYS 338
                      G  +  +  G  A + G +K G  +  VNG  +        AV + R  
Sbjct: 260 ADIALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNR-P 317

Query: 339 GHQISLTV 346
           G +++L +
Sbjct: 318 GDEVALKL 325


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 36.4 bits (84), Expect = 0.009
 Identities = 16/58 (27%), Positives = 40/58 (68%), Gaps = 2/58 (3%)

Query: 376 DREDLDTDVFKQEEEM--KKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDAN 431
           + E+ D +  ++ E++  ++E+VE E+EE+E+ ++N++L ++ +K+  ++ +  QD N
Sbjct: 53  EEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDN 110



 Score = 28.3 bits (63), Expect = 4.9
 Identities = 16/54 (29%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 372 MSSLDREDL---DTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPE 422
           ++S D+ED+   + DV  +++E  +E  E ++EE EE ++  + +E+ E    E
Sbjct: 28  ITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEE 81


>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain. 
          Length = 81

 Score = 33.4 bits (77), Expect = 0.032
 Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 1/64 (1%)

Query: 187 GQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLK 246
                G+ +  +  G  AA +G L+ GD IL+++G+ V+ +T    V+   +P   + L 
Sbjct: 9   QNEGTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIEVILNGKPGDTVKLT 67

Query: 247 IQHD 250
           +  D
Sbjct: 68  VYRD 71


>gnl|CDD|236873 PRK11186, PRK11186, carboxy-terminal protease; Provisional.
          Length = 667

 Score = 34.1 bits (79), Expect = 0.14
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 195 ISHIVPGGVAALSGKLRMGDRILAV 219
           I+ +V GG AA S KL +GD+I+ V
Sbjct: 259 INSLVAGGPAAKSKKLSVGDKIVGV 283


>gnl|CDD|227169 COG4832, COG4832, Uncharacterized conserved protein [Function
           unknown].
          Length = 207

 Score = 33.3 bits (76), Expect = 0.18
 Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 19/142 (13%)

Query: 310 KVGMRLLEVNGMSLL---------GASHQEAVNTLRYSGHQISLTVCKGFERGDIERRAS 360
           K    +++V  M+ L             Q AV  L    + I ++     E+ D  +   
Sbjct: 18  KKKASIIKVPEMNFLMIDGKGDPNSTEFQNAVQVLYSLAYSIKMSYKMKEEKDDKIKDFV 77

Query: 361 TS--EGSRSVTQSMSSLDREDLD-TDVFKQEEEMKKELVEWEKEEQEERKENLELDEVRE 417
               EG          LD+     T + +Q + + +E+ E  KE  E++K +++L  V+ 
Sbjct: 78  VPPLEGLWWSEPEFDKLDKNKWKWTIMIRQPDFITQEIFEKAKEVVEKKKPSIDLSRVKF 137

Query: 418 KSTPEMPHYFQDANCVCFVNFG 439
           +S        ++   +  ++ G
Sbjct: 138 ES-------LEEGLSIQIMHVG 152


>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc).  A C-terminal peptidase
           with different substrates in different species including
           processing of D1 protein of the photosystem II reaction
           center in higher plants and cleavage of a peptide of 11
           residues from the precursor form of penicillin-binding
           protein in E.coli E.coli and H influenza have the most
           distal branch of the tree and their proteins have an
           N-terminal 200 amino acids that show no homology to
           other proteins in the database [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Protein fate, Protein modification and repair].
          Length = 334

 Score = 33.1 bits (76), Expect = 0.22
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 3/74 (4%)

Query: 193 IFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALL-EPCPEILLKIQHDP 251
           I I     G  A  +G ++ GD+I+ +NG+ V+ ++ ++AV  +  +   ++ L+I    
Sbjct: 64  IVIVSPFEGSPAEKAG-IKPGDKIIKINGKSVAGMSLDDAVALIRGKKGTKVSLEILR-A 121

Query: 252 HPEGFQEITLIRQE 265
                   TL R  
Sbjct: 122 GKSKPLTFTLKRDR 135


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.5 bits (74), Expect = 0.38
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 373 SSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDAN 431
            +LD E LD ++ K+EE +  EL E EKE+ +   E +EL E +E+   E   Y ++ N
Sbjct: 66  EALDSE-LD-ELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYN 122


>gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional.
          Length = 455

 Score = 32.6 bits (74), Expect = 0.40
 Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
           G F+S ++P   +A +G ++ GD I ++NG+ ++      + +A  EP  ++ L +  + 
Sbjct: 291 GAFVSEVLPNSGSAKAG-VKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNG 349

Query: 252 HPEGFQEITL 261
            P    E+TL
Sbjct: 350 KPLEV-EVTL 358


>gnl|CDD|177679 PLN00047, PLN00047, photosystem II biogenesis protein Psb29;
           Provisional.
          Length = 283

 Score = 32.1 bits (73), Expect = 0.44
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 7/51 (13%)

Query: 379 DLDTDVFK-------QEEEMKKELVEWEKEEQEERKENLELDEVREKSTPE 422
           D D DV++       Q +E+ KE VE EK++QEER E+ + +E   K   E
Sbjct: 228 DRDLDVYRGLLSKLVQAKELLKEYVEREKKKQEERAESQKANEAVTKCLGE 278


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 32.5 bits (74), Expect = 0.48
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 393 KELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQD 429
           KEL E ++E+ EE+ E ++ +EV E+   E   ++ +
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYE 313


>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
          Length = 353

 Score = 32.3 bits (74), Expect = 0.50
 Identities = 11/34 (32%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 189 HKPGIFISHIVPGGVAALSGKLRMGDRILAVNGR 222
              GI ++ + P G AA +G +++ D I++VN +
Sbjct: 277 QLQGIVVNEVSPDGPAAKAG-IQVNDLIISVNNK 309


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 30.6 bits (70), Expect = 0.56
 Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 14/59 (23%)

Query: 367 SVTQSMSSLDREDLDTDVFKQ--------------EEEMKKELVEWEKEEQEERKENLE 411
           S+ + +     E LD D   Q              +EE++K L  ++K+ Q +++   E
Sbjct: 32  SLGRYLKQQGEEQLDKDALLQGLQDALAGKELKLTDEEIQKALQAFQKKLQAKQQAKAE 90


>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
           proteases. Most PDZ domains bind C-terminal
           polypeptides, though binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this bacterial subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
            is thought to form the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in Eumetazoan signaling proteins.
          Length = 79

 Score = 29.3 bits (66), Expect = 0.75
 Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 2/35 (5%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSK 226
           G++++ +V G  AA  GKL+ GD I+AV+G+   +
Sbjct: 9   GVYVTSVVEGMPAA--GKLKAGDHIIAVDGKPFKE 41


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 31.6 bits (71), Expect = 0.90
 Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 1/74 (1%)

Query: 365 SRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMP 424
           S   +   S++  +  + DV K+E   K   +  E    EE+ +  E D  + K   EM 
Sbjct: 282 SLDSSTLKSAVCNDPGNIDVSKEELSEKIPELMVECRLVEEKLDTFE-DNNKNKDIMEMV 340

Query: 425 HYFQDANCVCFVNF 438
                +    F  F
Sbjct: 341 SKPCSSLKSLFSGF 354


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVRE 417
           ++EE   K L E EKE  E  K+  E   ++E
Sbjct: 89  EKEERFMKALAEAEKERAELEKKKAEAKLMKE 120


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.5 bits (64), Expect = 1.2
 Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 378 EDLDTDVFKQEEEMKKELV-EWEKEEQEERKENLELDEVREK 418
           + L+ D+  + +  KKEL  E EK E+EE     E  + REK
Sbjct: 12  DMLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53


>gnl|CDD|131391 TIGR02338, gimC_beta, prefoldin, beta subunit, archaeal.
           Chaperonins are cytosolic, ATP-dependent molecular
           chaperones, with a conserved toroidal architecture, that
           assist in the folding of nascent and/or denatured
           polypeptide chains. The group I chaperonin system
           consists of GroEL and GroES, and is found (usually) in
           bacteria and organelles of bacterial origin. The group
           II chaperonin system, called the thermosome in Archaea
           and TRiC or CCT in the Eukaryota, is structurally
           similar but only distantly related. Prefoldin, also
           called GimC, is a complex in Archaea and Eukaryota, that
           works with group II chaperonins. Members of this protein
           family are the archaeal clade of the beta class of
           prefoldin subunit. Closely related, but outside the
           scope of this family are the eukaryotic beta-class
           prefoldin subunits, Gim-1,3,4 and 6. The alpha class
           prefoldin subunits are more distantly related.
          Length = 110

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)

Query: 366 RSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEE-QEERKENLELDEVREKS 419
           +   +++  L+R   DT V+K    +   LV+ +KEE  +E KE  E  E+R K+
Sbjct: 34  KEAEKALEELERLPDDTPVYKSVGNL---LVKTDKEEAIQELKEKKETLELRVKT 85


>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
           Provisional.
          Length = 389

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 199 VPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCP-EILLKIQHDPHPEGFQ 257
            PGG AA +G +R GD ILA++G     ++  EA   L  P    + L ++  P     +
Sbjct: 110 APGGPAARAG-IRPGDVILAIDGTSTEGLSLYEAADRLQGPEGSSVELTLRRGPET---R 165

Query: 258 EITLIRQ 264
            +TL R+
Sbjct: 166 LVTLTRE 172


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 376 DREDLDTDVFKQEEEMKKELVEWEKEEQEERKE 408
           D E L  ++ K  +++KKE  E  K+  E  KE
Sbjct: 105 DSEGLLEELKKHRDKLKKEQKELRKKLDELEKE 137


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 11/84 (13%)

Query: 355 IERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKK---ELVEWEKEEQE----ERK 407
           +E  A+ SE  R   + +   + ++   +V K EEE +K   E  + ++E +E      +
Sbjct: 66  LEEEAAASEEER---ERLE-AEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHER 121

Query: 408 ENLELDEVREKSTPEMPHYFQDAN 431
              EL E     T           
Sbjct: 122 ARQELLEAAAAPTAPPHVAAPVNG 145


>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
           [General function prediction only].
          Length = 558

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 186 FGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNG 221
                    I+ + PGG A  +G L  GD+I+A+NG
Sbjct: 457 VKSEGGHEKITFVFPGGPAYKAG-LSPGDKIVAING 491


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.5 bits (68), Expect = 2.1
 Identities = 17/75 (22%), Positives = 30/75 (40%)

Query: 351  ERGDIERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENL 410
            E+   E     +E ++   +       E    +  K+ EE  K   E  K+E EE K+  
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746

Query: 411  ELDEVREKSTPEMPH 425
            E  +  E+   ++ H
Sbjct: 1747 EEAKKDEEEKKKIAH 1761



 Score = 28.6 bits (63), Expect = 8.0
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 10/47 (21%)

Query: 386  KQEEEMKKELVEWEKEEQEERK----------ENLELDEVREKSTPE 422
            K+ EE KK+  E +K E++E+K          E  + +E+++K   E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 29.1 bits (66), Expect = 2.2
 Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           K E E++K   E E+ E E  +  L+  E ++ 
Sbjct: 46  KAEYELEKLEKELEELEAELARRELK-AEAKKM 77


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.3 bits (68), Expect = 2.3
 Identities = 16/43 (37%), Positives = 27/43 (62%)

Query: 376 DREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           D E+ + +  ++EEE ++E  E E+EE+ E   +LE  E R+K
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904


>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein.  Mei5
           is one of a pair of meiosis-specific proteins which
           facilitate the loading of Dmc1 on to Rad51 on DNA at
           double-strand breaks during recombination. Recombination
           is carried out by a large protein complex based around
           the two RecA homologues, Rad51 and Dmc1. This complex
           may play both a catalytic and a structural role in the
           interaction between homologous chromosomes during
           meiosis. Mei5 is seen to contain a coiled-coli region.
          Length = 212

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 15/62 (24%), Positives = 28/62 (45%)

Query: 356 ERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEV 415
            R    SE    +  S   ++R     D  ++E E +K  +E++ +E+ +    L+L E 
Sbjct: 94  WRTICQSESRSILNSSSPKINRMGGYKDFKRKELEAEKRKLEYQVDEESDDLRRLKLVEK 153

Query: 416 RE 417
            E
Sbjct: 154 YE 155


>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
           proteases 1 [Cell envelope biogenesis, outer membrane].
          Length = 375

 Score = 29.8 bits (67), Expect = 2.8
 Identities = 29/121 (23%), Positives = 41/121 (33%), Gaps = 15/121 (12%)

Query: 194 FISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ----- 248
            +  + P   AAL+G LR GDRI+AV+G  V+       +L         LL I      
Sbjct: 132 VVGEVAPKSAAALAG-LRPGDRIVAVDGEKVASWDDVRRLLVAAAGDVFNLLTILVIRLD 190

Query: 249 ---HDPHPEGFQEITLIRQEGEKL----GMHIKGGLRGQRGNPLDKNDEGVFITKINSGG 301
              H    E  + + L             + +  G  G   N     D G  + K     
Sbjct: 191 GEAHAVAAEIIKSLGLTPVVIPLKPGDKIVAVDVGAIGLSPNGEP--DVGKVLVKYGPLE 248

Query: 302 A 302
           A
Sbjct: 249 A 249


>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
          Length = 473

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 30/130 (23%), Positives = 55/130 (42%), Gaps = 7/130 (5%)

Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
           G F+S ++P   AA +G ++ GD I ++NG+ +S      A +  +    ++ L +  D 
Sbjct: 312 GAFVSQVLPNSSAAKAG-IKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRDG 370

Query: 252 HPEGFQEITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDE--GVFITKINSGGAAKRDGRL 309
            P     + +  Q+  +  +       G  G  L       GV +  +  G  A + G L
Sbjct: 371 KP---VNVNVELQQSSQNQVDSSNIFNGIEGAELSNKGGDKGVVVDNVKPGTPAAQIG-L 426

Query: 310 KVGMRLLEVN 319
           K G  ++  N
Sbjct: 427 KKGDVIIGAN 436


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 29.3 bits (66), Expect = 3.5
 Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 8/73 (10%)

Query: 351 ERGDIERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEM------KKELVEWEKEEQE 404
           +R + + R S ++ S S + S SS D  D D    +    +      KKE  E ++ E+E
Sbjct: 108 KRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEE 167

Query: 405 ERKENLELDEVRE 417
           E+    E ++ RE
Sbjct: 168 EKAA--EEEKARE 178


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
           family is found from fungi to humans, but its exact
           function is not known.
          Length = 88

 Score = 27.6 bits (62), Expect = 3.6
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           K EEE K+++ E EK  +E  +E  E +E +EK
Sbjct: 40  KDEEERKEQMEELEKAREETERERKEREERKEK 72


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.7 bits (67), Expect = 3.6
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           ++ EE+++E+ E E+E +  R    EL+E+ EK
Sbjct: 291 ERLEELEREIEELEEELEGLRALLEELEELLEK 323


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 28.5 bits (64), Expect = 3.6
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 381 DTDVF------KQEEEMKKELVEWEKEEQEERKEN 409
           D + F      KQ EE +K   E ++ E+ E + +
Sbjct: 86  DKEKFVTSAYKKQLEENRKLEEEEKEREELEEEND 120


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 29.2 bits (66), Expect = 3.8
 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)

Query: 365 SRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEE-RKENLELDEVREKSTPE 422
             ++ Q+  +L       +   + E  K E  E E+E   E +KE  ++ E +E+S  E
Sbjct: 189 EEAILQTDQALT----AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQE 243


>gnl|CDD|235426 PRK05352, PRK05352, Na(+)-translocating NADH-quinone reductase
           subunit A; Provisional.
          Length = 448

 Score = 29.4 bits (67), Expect = 4.0
 Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 12/50 (24%)

Query: 266 GEKLGMHIKGGLRGQRGNPL--DKNDEGVFIT--------KINSGGAAKR 305
           G +  M +K G + ++G PL  DK + GV  T         IN G   +R
Sbjct: 39  GLRPKMKVKEGDKVKKGQPLFEDKKNPGVKFTSPASGTVVAINRG--ERR 86


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 29.4 bits (66), Expect = 4.0
 Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 6/54 (11%)

Query: 355 IERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKE 408
            +R+    E +  V Q   + D         K +EE K++L +  KEE+E  KE
Sbjct: 297 EQRKKKKKEMAPKVKQRFEAND------PAQKLQEERKEQLAKLRKEEKEREKE 344


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 28.1 bits (63), Expect = 4.5
 Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 10/58 (17%)

Query: 377 REDLDTDVFK-------QEEE---MKKELVEWEKEEQEERKENLELDEVREKSTPEMP 424
            ++L+ + F+        E E   ++ EL E ++  +E  ++ +E DEV E  TP   
Sbjct: 1   IQELENEKFRLAKELNDLESELNNLQSELEELKERLEELEEQEVEGDEVDEDETPVDD 58


>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68).  Proteins in
           this family are subunits the FACT complex. The FACT
           complex plays a role in transcription initiation and
           promotes binding of TATA-binding protein (TBP) to a TATA
           box in chromatin.
          Length = 152

 Score = 28.4 bits (64), Expect = 4.7
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 366 RSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEER--KENLELDEVREKSTPEM 423
           + +T    S D + L  DVFK  +E++K + + E E +E+    E  +L   R K  P +
Sbjct: 68  KELT--FRSKDSKHLQ-DVFKAIKELQKRVKKRETERKEKADVVEQEKLILNRNKRPPRL 124

Query: 424 PHYF 427
              +
Sbjct: 125 KDVY 128


>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain.  This domain is found in
           chromatin proteins.
          Length = 158

 Score = 28.4 bits (64), Expect = 4.9
 Identities = 11/30 (36%), Positives = 21/30 (70%), Gaps = 5/30 (16%)

Query: 377 REDLDTDVFKQEEEMKKELVEWEKEEQEER 406
           RE++     +Q  E++++L E EK+E+EE+
Sbjct: 122 REEV-----RQRAELQRQLAEKEKQEKEEK 146


>gnl|CDD|132281 TIGR03237, dnd_assoc_2, DNA phosphorothioation-dependent
           restriction protein DptH.  A DNA sulfur modification
           (phosphorothioation) system, dnd (degradation during
           electrophoresis), is sparsely and sporadically
           distributed among the bacteria. This protein is one
           member of a three-gene restriction enzyme cassette that
           depends on DNA phosphorothioation [DNA metabolism,
           Restriction/modification].
          Length = 1256

 Score = 29.5 bits (66), Expect = 4.9
 Identities = 13/38 (34%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 393 KELVEWEKEEQEER-KENLELDEVREKSTPEMPHYFQD 429
           KEL  W+  + +ER  EN EL +  +  T   P    D
Sbjct: 193 KELAGWKGNQIKERLGENRELSDKLDFITYHFPRELHD 230


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.9 bits (65), Expect = 5.0
 Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 388 EEEMKKELVEWEK---EEQEERKENLELDEVREK 418
           +EE KK L EWE    ++Q++++E     + +++
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQ 210


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.1 bits (66), Expect = 5.3
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           ++  E ++EL E E+  +EE +    ++E R++
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEEERQR 335


>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
           [Transcription].
          Length = 507

 Score = 28.9 bits (64), Expect = 5.4
 Identities = 8/39 (20%), Positives = 19/39 (48%)

Query: 380 LDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
           L + + ++  E++ E     KE +E +    E D+  ++
Sbjct: 435 LRSYLLEKLIELQNEHKHHMKEIEEAKNTAKEEDQTAQR 473


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 28.5 bits (64), Expect = 6.0
 Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 1/54 (1%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDANCV-CFVNF 438
           K+EEE +KE  E  ++EQEE ++      V E+ T ++    +    +  F+N+
Sbjct: 55  KKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNELLEDFINY 108


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 28.3 bits (64), Expect = 7.0
 Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 7/41 (17%)

Query: 385 FKQEEEMKKELVEWEKEEQEERKENLE-------LDEVREK 418
           F+ E+E KK + + EK+  +E K+ LE       +D  +EK
Sbjct: 95  FEAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKEK 135


>gnl|CDD|236896 PRK11301, livM, leucine/isoleucine/valine transporter permease
           subunit; Provisional.
          Length = 419

 Score = 28.4 bits (64), Expect = 7.4
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 12  GEQAGVILAAGEVVIHPVGMRLLEVNGMSLLGA 44
           G Q GVILAA  +V+ P  MR      M + G 
Sbjct: 360 GSQFGVILAAILLVVLPELMRDFNEYRMLMFGL 392


>gnl|CDD|237975 cd00001, PTS_IIB_man, PTS_IIB, PTS system, Mannose/sorbose specific
           IIB subunit. The bacterial phosphoenolpyruvate: sugar
           phosphotransferase system (PTS) is a multi-protein
           system involved in the regulation of a variety of
           metabolic and transcriptional processes. This family is
           one of four structurally and functionally distinct group
           IIB PTS system cytoplasmic enzymes, necessary for the
           uptake of carbohydrates across the cytoplasmic membrane
           and their phosphorylation. The active site histidine
           receives a phosphate group from the IIA subunit and
           transfers it to the substrate.
          Length = 151

 Score = 27.6 bits (62), Expect = 7.8
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 289 DEGVFITKINSGGAAKRDGRLKVG 312
           + GV I  IN G  A R G++++ 
Sbjct: 92  EGGVPIKTINVGNMAFRPGKVQIT 115


>gnl|CDD|152077 pfam11641, Antigen_Bd37, Glycosylphosphatidylinositol-anchored
           merozoite surface protein.  This family of proteins
           represents the core region of Bd37, a surface antigen of
           B.divergens which is GPI-anchored at the surface of the
           merozoite. The structure of the protein consists of
           mainly alpha folds and has three sub domains.
          Length = 224

 Score = 28.3 bits (63), Expect = 7.9
 Identities = 18/64 (28%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 367 SVTQSMSSLDREDLDTDVFKQEEEMK-KELVEWEKEEQEERKENLEL--DEVREKSTPEM 423
           +V  S  S D E+L T + K  E++K ++ +  +K+  ++   NL++  DE+ +K T   
Sbjct: 93  TVESSAQSTDSEELKTLLLKFSEDLKAEQELHSDKKGSKDLLSNLKVQRDELVKKFTALT 152

Query: 424 PHYF 427
           P + 
Sbjct: 153 PSFL 156


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.1 bits (62), Expect = 8.0
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 386 KQEEEMKKELVEWEKEEQ--EERKENLELDEVREKSTPE 422
           +QE E  +E +EWE+ E+  +ER++  E +  RE+ T E
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170


>gnl|CDD|217751 pfam03830, PTSIIB_sorb, PTS system sorbose subfamily IIB component.
           
          Length = 151

 Score = 27.4 bits (62), Expect = 8.1
 Identities = 6/24 (25%), Positives = 14/24 (58%)

Query: 289 DEGVFITKINSGGAAKRDGRLKVG 312
           + GV I ++N G    ++G+ ++ 
Sbjct: 93  EGGVPIKELNVGNMHFKEGKKQIT 116


>gnl|CDD|177876 PLN02231, PLN02231, alanine transaminase.
          Length = 534

 Score = 28.4 bits (63), Expect = 8.3
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 357 RRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKEN---LELD 413
           R +STSE S S + S   +  + ++  V K E  ++ E+V   +  QEE K N      D
Sbjct: 39  RFSSTSEMSASDSTSSLPVTLDTINPKVLKCEYAVRGEIVTIAQRLQEELKTNPGSYPFD 98

Query: 414 EV 415
           E+
Sbjct: 99  EI 100


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.0 bits (63), Expect = 8.4
 Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 7/41 (17%)

Query: 385 FKQEEEMKKELVEWEKEEQEERKENLE-------LDEVREK 418
           F++E+E KK + + EK+  +E KE LE       +D  +EK
Sbjct: 94  FEEEKEKKKAMSKEEKKAIKEEKEKLEEPYGYCLVDGHKEK 134


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 27.5 bits (62), Expect = 8.5
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 392 KKELVEWEKEEQEERKENLELDEVREKSTPEMPHYF 427
           KKEL  WEK E++  K   + ++ + K     P  F
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRLF 100


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 28.2 bits (63), Expect = 8.9
 Identities = 10/34 (29%), Positives = 17/34 (50%), Gaps = 2/34 (5%)

Query: 387 QEEEMKKELVEWEKEEQ--EERKENLELDEVREK 418
           +  E K+E  + E+  +   E+K    L+  REK
Sbjct: 139 EIIEKKRENNKNEERLKFENEKKLEESLELEREK 172


>gnl|CDD|235083 PRK02889, tolB, translocation protein TolB; Provisional.
          Length = 427

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 306 DGRLKVGMRLLE-VNGMSLLGASHQEAVNTLRYSGHQIS 343
           DGR  V  RL + V G SL G S     + LR + H+I+
Sbjct: 110 DGRYDVRFRLYDTVKGQSLGGLSLTVTEDQLRLAAHRIA 148


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.5 bits (64), Expect = 9.4
 Identities = 12/63 (19%), Positives = 22/63 (34%)

Query: 356 ERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEV 415
                  E    + + + +L  E  + +   +E E     +E  KEE EE+   L  +  
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375

Query: 416 REK 418
              
Sbjct: 376 ELF 378


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 27.6 bits (62), Expect = 9.6
 Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 3/35 (8%)

Query: 386 KQEEEMKKELVEWEKEEQEERKENL--ELDEVREK 418
           K+ E+  K L +WE E  E+R E +  E DE+ +K
Sbjct: 96  KELEKELKNL-KWESEVLEQRFEKVERERDELYDK 129


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.134    0.385 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,111,145
Number of extensions: 2172780
Number of successful extensions: 3178
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3060
Number of HSP's successfully gapped: 130
Length of query: 439
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 339
Effective length of database: 6,502,202
Effective search space: 2204246478
Effective search space used: 2204246478
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.0 bits)