RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17599
         (180 letters)



>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain.  DnaJ domains (J-domains) are
          associated with hsp70 heat-shock system and it is
          thought that this domain mediates the interaction.
          DnaJ-domain is therefore part of a chaperone (protein
          folding) system. The T-antigens, although not in
          Prosite are confirmed as DnaJ containing domains from
          literature.
          Length = 63

 Score = 79.5 bits (197), Expect = 2e-20
 Identities = 32/63 (50%), Positives = 41/63 (65%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +  D S+++I+ AYRK ALK HPDKNP D  A E F  +++A EVL D   RA
Sbjct: 1  DYYEILGVPRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRA 60

Query: 88 AYD 90
           YD
Sbjct: 61 IYD 63


>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 82.1 bits (204), Expect = 7e-19
 Identities = 31/63 (49%), Positives = 43/63 (68%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ + SE +I+ AYRK A+K HPD+NP D++A E F  + +A EVL D   RA
Sbjct: 5  DYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRA 64

Query: 88 AYD 90
          AYD
Sbjct: 65 AYD 67


>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain.  DnaJ/Hsp40 (heat shock
          protein 40) proteins are highly conserved and play
          crucial roles in protein translation, folding,
          unfolding, translocation, and degradation. They act
          primarily by stimulating the ATPase activity of Hsp70s,
          an important chaperonine family. Hsp40 proteins are
          characterized by the presence of a J domain, which
          mediates the interaction with Hsp70. They may contain
          other domains as well, and the architectures provide a
          means of classification.
          Length = 55

 Score = 71.8 bits (177), Expect = 2e-17
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLD 82
          D Y +L +  D S+++I+ AYRK ALK HPDKNPDD +A E F  +++A EVL D
Sbjct: 1  DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55


>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
          finger domain [Posttranslational modification, protein
          turnover, chaperones].
          Length = 371

 Score = 76.1 bits (188), Expect = 9e-17
 Identities = 32/63 (50%), Positives = 44/63 (69%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ D SE++I+ AYRK A K HPD+NP D++A E F  +++A EVL D   RA
Sbjct: 5  DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRA 64

Query: 88 AYD 90
          AYD
Sbjct: 65 AYD 67


>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ.  This model
          represents bacterial forms of DnaJ, part of the
          DnaK-DnaJ-GrpE chaperone system. The three components
          typically are encoded by consecutive genes. DnaJ
          homologs occur in many genomes, typically not near DnaK
          and GrpE-like genes; most such genes are not included
          by this family. Eukaryotic (mitochondrial and
          chloroplast) forms are not included in the scope of
          this family.
          Length = 354

 Score = 69.5 bits (171), Expect = 2e-14
 Identities = 31/63 (49%), Positives = 41/63 (65%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ D SE++I+ AYRK A K HPD+N D E A E F  +++A EVL D   RA
Sbjct: 1  DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKE-AEEKFKEINEAYEVLSDPEKRA 59

Query: 88 AYD 90
           YD
Sbjct: 60 QYD 62


>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
          Length = 373

 Score = 68.6 bits (168), Expect = 4e-14
 Identities = 30/63 (47%), Positives = 42/63 (66%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ D SE +I+ AYRK AL+ HPD+NPD+ +A + F   ++A EVL D   RA
Sbjct: 5  DYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRA 64

Query: 88 AYD 90
           YD
Sbjct: 65 RYD 67


>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain. 
          Length = 60

 Score = 63.0 bits (154), Expect = 5e-14
 Identities = 29/57 (50%), Positives = 37/57 (64%), Gaps = 1/57 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDD-EKAIETFHLLSKAIEVLLDK 83
          D Y +L +  D S  +I+ AYRK ALK HPDKNP D E+A E F  +++A EVL D 
Sbjct: 2  DYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDP 58


>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
          Length = 366

 Score = 67.1 bits (164), Expect = 1e-13
 Identities = 30/63 (47%), Positives = 42/63 (66%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +T D SE++I+ +YRK A+K HPD+NP D++A E F   ++A EVL D   R 
Sbjct: 5  DYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRG 64

Query: 88 AYD 90
           YD
Sbjct: 65 IYD 67


>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
          Length = 392

 Score = 65.9 bits (161), Expect = 4e-13
 Identities = 28/63 (44%), Positives = 38/63 (60%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y  L ++ D S ++I+ AYRK A + HPD NP D  A E F  +S+A +VL D + R 
Sbjct: 10 DFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRK 69

Query: 88 AYD 90
           YD
Sbjct: 70 EYD 72


>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 66.0 bits (161), Expect = 4e-13
 Identities = 26/63 (41%), Positives = 40/63 (63%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L L    S+ +I+ A+RK A+K HPDKN  +++A E F  +++A +VL D   +A
Sbjct: 5  DYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKA 64

Query: 88 AYD 90
           YD
Sbjct: 65 QYD 67


>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
          Length = 391

 Score = 65.6 bits (160), Expect = 6e-13
 Identities = 29/64 (45%), Positives = 42/64 (65%)

Query: 27 LDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSAR 86
          +D Y +L ++   S ++I+ AYRK A+K HPDKNP D +A + F  +S+A EVL D   R
Sbjct: 1  MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKR 60

Query: 87 AAYD 90
           +YD
Sbjct: 61 ESYD 64


>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 65.5 bits (160), Expect = 6e-13
 Identities = 30/63 (47%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D YGLL ++ + S+ +I+ AYRK A + HPD NP DE+A E F  +S A EVL D   R 
Sbjct: 4  DYYGLLGVSRNASDAEIKRAYRKLARELHPDVNP-DEEAQEKFKEISVAYEVLSDPEKRR 62

Query: 88 AYD 90
            D
Sbjct: 63 IVD 65


>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 64.9 bits (158), Expect = 1e-12
 Identities = 27/63 (42%), Positives = 41/63 (65%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++   +  +I+ AYRKKA++ HPDKNP D++A E F   ++A +VL D   R+
Sbjct: 6  DYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRS 65

Query: 88 AYD 90
           YD
Sbjct: 66 RYD 68


>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
          Length = 397

 Score = 62.5 bits (152), Expect = 6e-12
 Identities = 28/63 (44%), Positives = 41/63 (65%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++    + +I+ AYRK ALK HPDKNPD+++A E F  +++A EVL +   R 
Sbjct: 4  DYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRR 63

Query: 88 AYD 90
           YD
Sbjct: 64 RYD 66


>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 61.8 bits (150), Expect = 1e-11
 Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 1/64 (1%)

Query: 27 LDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSAR 86
          +D Y LL ++   S  +I+SAYRK ALK HPD+N  ++ A E F  +++A  VL D   R
Sbjct: 2  MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNK-EKGAAEKFAQINEAYAVLSDAEKR 60

Query: 87 AAYD 90
          A YD
Sbjct: 61 AHYD 64


>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
          Length = 421

 Score = 61.0 bits (148), Expect = 2e-11
 Identities = 30/62 (48%), Positives = 38/62 (61%), Gaps = 4/62 (6%)

Query: 29 LYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARAA 88
          LY +L L+ DC+  +I+ AYRK A+K HPDK  D EK    F  +S+A EVL D   R  
Sbjct: 30 LYEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGDPEK----FKEISRAYEVLSDPEKRKI 85

Query: 89 YD 90
          YD
Sbjct: 86 YD 87


>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
          Length = 377

 Score = 60.6 bits (147), Expect = 2e-11
 Identities = 31/63 (49%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L L+ D S +DI+ AYRK A+K HPDKN  +  A E F  +S+A  VL D   RA
Sbjct: 6  DYYEILGLSKDASVEDIKKAYRKLAMKYHPDKN-KEPDAEEKFKEISEAYAVLSDAEKRA 64

Query: 88 AYD 90
           YD
Sbjct: 65 QYD 67


>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
          Length = 389

 Score = 60.2 bits (146), Expect = 4e-11
 Identities = 28/63 (44%), Positives = 38/63 (60%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +  D +E +I+ AYRK A + HPD N  D KA E F  +S+A +VL D+  R 
Sbjct: 10 DYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRK 69

Query: 88 AYD 90
           YD
Sbjct: 70 EYD 72


>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 59.4 bits (144), Expect = 6e-11
 Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)

Query: 21 MADIKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVL 80
          MA  KD   Y +L +  + +E++I+ AYR+ A K HPD NP D++A + F  +++A E+L
Sbjct: 1  MAAKKDY--YEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEIL 58

Query: 81 LDKSARAAYD 90
           D   RA YD
Sbjct: 59 SDPQKRAQYD 68


>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 58.6 bits (142), Expect = 1e-10
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKN--PDDEKAIETFHLLSKAIEVLLDKSA 85
          + Y  L ++ D S+ +I+ AYRK + K HPD N  P  E   E +  + +A E L D   
Sbjct: 5  EYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAE---EKYKEVQEAYETLSDPQK 61

Query: 86 RAAYD 90
          RAAYD
Sbjct: 62 RAAYD 66


>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 58.1 bits (140), Expect = 2e-10
 Identities = 25/67 (37%), Positives = 43/67 (64%)

Query: 24 IKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
          + +   Y +L ++   ++++I+SAYRK A+K HPDKN  ++++ E F   ++A E+L D 
Sbjct: 1  MSERSYYDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDP 60

Query: 84 SARAAYD 90
            R AYD
Sbjct: 61 KKRQAYD 67


>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 57.7 bits (139), Expect = 2e-10
 Identities = 28/63 (44%), Positives = 41/63 (65%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L L+   S+ +I+ AYRK A+K HPDKN  +++A   F   ++A EVL+D + RA
Sbjct: 4  DYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRA 63

Query: 88 AYD 90
           YD
Sbjct: 64 QYD 66


>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
          modification, protein turnover, chaperones].
          Length = 237

 Score = 54.8 bits (131), Expect = 2e-09
 Identities = 31/67 (46%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 25 KDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEK-AIETFHLLSKAIEVLLDK 83
            LD Y +L +  + S ++I+ AYRK ALK HPD+NP D K A E F  +++A E+L D 
Sbjct: 4  DLLDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDP 63

Query: 84 SARAAYD 90
            RA YD
Sbjct: 64 ERRAEYD 70


>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
          Length = 374

 Score = 55.4 bits (134), Expect = 2e-09
 Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ D  + +++ AYR+ A K HPD N  +  A + F  +++A EVL D   RA
Sbjct: 4  DYYEILGVSRDADKDELKRAYRRLARKYHPDVN-KEPGAEDRFKEINRAYEVLSDPETRA 62

Query: 88 AYD 90
           YD
Sbjct: 63 RYD 65


>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
          Length = 376

 Score = 55.1 bits (133), Expect = 2e-09
 Identities = 28/63 (44%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++   S+ +I+ AYRK + K HPD N  +E A E F  +S+A EVL D   RA
Sbjct: 5  DYYEVLGVSKSASKDEIKKAYRKLSKKYHPDIN-KEEGADEKFKEISEAYEVLSDDQKRA 63

Query: 88 AYD 90
           YD
Sbjct: 64 QYD 66


>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 54.7 bits (131), Expect = 3e-09
 Identities = 28/65 (43%), Positives = 43/65 (66%)

Query: 26 DLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSA 85
          +L  Y +LE+    +++ I+ +YRK ALK HPD+N  D++A E F L+++A  VL D+  
Sbjct: 2  ELSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKK 61

Query: 86 RAAYD 90
          RA YD
Sbjct: 62 RALYD 66


>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 53.3 bits (128), Expect = 8e-09
 Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +  +  +++I+ AYRK A K HPD + ++E A E F  +S+A  VL D   R 
Sbjct: 6  DYYEVLGVDRNADKKEIKKAYRKLARKYHPDVS-EEEGAEEKFKEISEAYAVLSDDEKRQ 64

Query: 88 AYD 90
           YD
Sbjct: 65 RYD 67


>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 53.0 bits (127), Expect = 1e-08
 Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 24 IKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
          +K  D Y +L ++   SEQ+IR AYRK A + HPD N   + A +    +++A +VLLDK
Sbjct: 1  MKKKDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPD-AHDKMVEINEAADVLLDK 59

Query: 84 SARAAYD 90
            R  YD
Sbjct: 60 DKRKQYD 66


>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 52.2 bits (125), Expect = 2e-08
 Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 1/64 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDE-KAIETFHLLSKAIEVLLDKSAR 86
          D Y +L +  + S++DI+ A+R+ A K HPD +P ++ +A E F  +S+A EVL D   R
Sbjct: 4  DYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR 63

Query: 87 AAYD 90
            YD
Sbjct: 64 RQYD 67


>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 51.3 bits (123), Expect = 4e-08
 Identities = 22/64 (34%), Positives = 41/64 (64%), Gaps = 1/64 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEK-AIETFHLLSKAIEVLLDKSAR 86
          D Y +L ++ + ++++I+ AY++   + HPD++P++ K A + F  + +A EVL D   R
Sbjct: 5  DYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKR 64

Query: 87 AAYD 90
          A YD
Sbjct: 65 AMYD 68


>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
          Length = 382

 Score = 48.6 bits (116), Expect = 4e-07
 Identities = 24/63 (38%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++ + ++++I+ AYR+ A K HPD N + E A E F  +++A +VL D   R 
Sbjct: 4  DYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPE-AEEKFKEINEAYQVLSDPEKRK 62

Query: 88 AYD 90
           YD
Sbjct: 63 LYD 65


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
          TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
          homolog and probable assembly protein of the Mycoplasma
          terminal organelle. The terminal organelle is involved
          in both cytadherence and gliding motility [Cellular
          processes, Chemotaxis and motility].
          Length = 871

 Score = 47.9 bits (113), Expect = 9e-07
 Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +  D  EQ+I+ A+RK A K HPD+N     A   F  +++A +VL +   RA
Sbjct: 3  DYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRN-KAPDAASIFAEINEANDVLSNPKKRA 61

Query: 88 AYD 90
           YD
Sbjct: 62 NYD 64


>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
          Length = 306

 Score = 45.2 bits (107), Expect = 5e-06
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPD--KNPDDEKAIETFHLLSKAIEVLLDKSA 85
          D Y ++ +      + I++AYR+ A K HPD  K PD E     F  +++A EVL D+  
Sbjct: 5  DYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAE---ARFKEVAEAWEVLSDEQR 61

Query: 86 RAAYD 90
          RA YD
Sbjct: 62 RAEYD 66


>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 44.2 bits (104), Expect = 1e-05
 Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L ++   S+ D++ AY K A + HPD   D + A + F  ++ A +VL D+  RA
Sbjct: 4  DYYQILGVSKTASQADLKKAYLKLAKQYHPD-TTDAKDAEKKFKEINAAYDVLKDEQKRA 62

Query: 88 AYD 90
          AYD
Sbjct: 63 AYD 65


>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
          Length = 291

 Score = 44.2 bits (104), Expect = 1e-05
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
          D Y +L +  + S+ +I+ A++K A K HPD N     A E F  +++A  VL D   R 
Sbjct: 5  DYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPG-AEEKFKEINEAYTVLSDPEKRR 63

Query: 88 AYDS 91
           YD+
Sbjct: 64 IYDT 67


>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
           trafficking and secretion].
          Length = 610

 Score = 43.8 bits (103), Expect = 2e-05
 Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 10/83 (12%)

Query: 17  LSSIMADIKDL-----DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNP---DDEKAI- 67
           L S +  +K       D Y +L +  D SE+DI+  YR  ++K HPDK P   ++ ++  
Sbjct: 83  LISNIRTLKIEYRRGFDPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEY 142

Query: 68  -ETFHLLSKAIEVLLDKSARAAY 89
            E +  ++KA  +L DK  R  Y
Sbjct: 143 EEKYKTITKAYGLLTDKKRRENY 165


>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
          Length = 153

 Score = 40.4 bits (95), Expect = 8e-05
 Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 32 LLELTIDC--SEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKA 76
          LL L      +   +R AY +K L+ HPDK  D+EK  E   L  K 
Sbjct: 10 LLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDEEKMKELNTLYKKF 56


>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
          Length = 647

 Score = 40.4 bits (95), Expect = 2e-04
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 29 LYGLLELTIDC--SEQDIRSAYRKKALKCHPDKNPDDEKA 66
          L  LL L +    +   +R AY +K  + HPDK  D+EK 
Sbjct: 13 LMDLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKGGDEEKM 52


>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 40.0 bits (93), Expect = 3e-04
 Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPD--KNPDDEKAIETFHLLSKAIEVLLDKSA 85
          D Y +L +  + S  +++ AYRK A K HPD  K PD E   + F  + +A + L D   
Sbjct: 5  DYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAE---DKFKEVKEAYDTLSDPQK 61

Query: 86 RAAYD 90
          +A YD
Sbjct: 62 KAHYD 66


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 36.9 bits (85), Expect = 0.004
 Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 5/75 (6%)

Query: 23  DIKDLDLYGLLELTI---DCSEQDIRSAYRKKALKCHPDKNPDDEK--AIETFHLLSKAI 77
           + K +DLY LL L+          I  A++KK  K HPDK          E F L+ KA 
Sbjct: 39  NWKKVDLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAR 98

Query: 78  EVLLDKSARAAYDSV 92
           EVL D+  R  YDS 
Sbjct: 99  EVLGDRKLRLQYDSN 113


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 33.4 bits (76), Expect = 0.041
 Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 7/85 (8%)

Query: 57  PDKNPDD-EKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKE 115
           PD N DD E+A+ T    +K + VL++      YD V++ KEE K   +K  A  K  + 
Sbjct: 111 PDLNTDDVEEAMHTIAGTAKNMGVLVE-----GYDDVVKAKEEAKAA-AKAAALAKAKEA 164

Query: 116 DLERREKEAENERYKGYKVNKSEEE 140
            L+  E E +  + K  +VN   EE
Sbjct: 165 SLKSAEAELKASKGKSIEVNVIGEE 189


>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain.  This domain is
           found in proteins involved in a variety of processes
           including transcription regulation (e.g., SNF2, STH1,
           brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
           recombination (e.g. RAD54), and chromatin unwinding
           (e.g. ISWI) as well as a variety of other proteins with
           little functional information (e.g. lodestar, ETL1).
          Length = 301

 Score = 32.3 bits (74), Expect = 0.098
 Identities = 25/128 (19%), Positives = 50/128 (39%), Gaps = 7/128 (5%)

Query: 28  DLYGLLELTIDC----SEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
           +L+ LL   +      S +     +         +KN + EK  E  + L K ++  L +
Sbjct: 169 ELWALLNF-LRPGPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKLLKPFLLR 227

Query: 84  SARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQ 143
             +   +  +  K E  +    L   ++K  + L  + + A +   +G + N     +L 
Sbjct: 228 RTKDDVEKSLPPKTEH-VLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVGIASLL- 285

Query: 144 QEIDRLRK 151
             I +LRK
Sbjct: 286 NLIMQLRK 293


>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
          Length = 267

 Score = 31.7 bits (73), Expect = 0.16
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 28  DLYGLLELTIDCSEQDIRSAYRKKALKCHPDK 59
           D Y +L ++    +Q+I+ AYRK   + HPDK
Sbjct: 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDK 232


>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related.  This family
           represents a region within kinesin-related proteins from
           higher plants. Many family members also contain the
           pfam00225 domain. Kinesins are ATP-driven microtubule
           motor proteins that produce directed force. Some family
           members are associated with the phragmoplast, a
           structure composed mainly of microtubules that executes
           cytokinesis in higher plants.
          Length = 488

 Score = 30.6 bits (69), Expect = 0.43
 Identities = 16/62 (25%), Positives = 31/62 (50%)

Query: 98  EVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPL 157
           E K+  S  +   +  KE +E +  + E E +K +  +  E+E+L +EI  L+ + +  L
Sbjct: 183 EHKLLKSDYENHPEVLKETIELKRIQEELESFKNFYFDMGEKEVLLEEIQDLKSQLQCML 242

Query: 158 KP 159
             
Sbjct: 243 SS 244


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 29.5 bits (67), Expect = 0.57
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 89  YDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKE--AENERYKGYKVNKSEEEI--LQQ 144
              V+      K    +L+   KK + +L+++EKE   E ++ +      SEE     QQ
Sbjct: 23  VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQ 82

Query: 145 EIDRLRKE 152
           E+ + ++E
Sbjct: 83  ELQQKQQE 90


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 30.2 bits (68), Expect = 0.66
 Identities = 21/94 (22%), Positives = 42/94 (44%), Gaps = 2/94 (2%)

Query: 59  KNPDDEKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLE 118
           K+    +  + + +     E+L  K  R A   V   K E+K   ++ +  +++++++ +
Sbjct: 330 KDRGSHEDGDYYEVDINGPEILECK-YRVAVSEVGELKAELKALKARYNELQERYEDEKD 388

Query: 119 RREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
           R   E +N   K   + KS  E  Q+    L KE
Sbjct: 389 RLRNEIQNLAEKLLSLEKSSHE-DQERSGDLEKE 421


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 30.0 bits (69), Expect = 0.71
 Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 73  LSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENER 128
           L  A++ L +   + AY   I  K+E   R + LDA +++  E L   E+E E+E+
Sbjct: 241 LKAAVKELAEAKLKEAYQ--ITDKQE---REAALDAIKEEVLEALAAEEEEEEDEK 291


>gnl|CDD|150256 pfam09520, RE_MjaII, MjaII restriction endonuclease.  This family
           includes the MjaII (recognises GGNCC but the cleavage
           site is unknown) restriction endonuclease.
          Length = 252

 Score = 29.3 bits (66), Expect = 0.88
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 122 KEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKP 159
             AE +  K  K++++    + + ID LRK    P KP
Sbjct: 84  AVAERQYDKQGKIDQAAIAKIDEIIDDLRKGENLPKKP 121


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.4 bits (67), Expect = 1.2
 Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 19/81 (23%)

Query: 88  AYDSVIRRKE----EVKIRNSKLDATRKKFK----------EDLERREKEAENERYKGYK 133
           A + + + +     E++ R ++L    K+            E LE+RE+E E +     +
Sbjct: 62  AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKE---KE 118

Query: 134 VNKSEEEI--LQQEIDRLRKE 152
           + + ++E+   ++E++ L +E
Sbjct: 119 LEQKQQELEKKEEELEELIEE 139



 Score = 28.6 bits (65), Expect = 2.2
 Identities = 17/71 (23%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 94  RRKEEVKIRNSKLDATRKKF---KEDLERREKEAEN---------ERYKGYKVNKSEEEI 141
           R+ E ++ R  +L+   K+    +++LE++E+E E          ER  G    +++E +
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159

Query: 142 LQQEIDRLRKE 152
           L++  +  R E
Sbjct: 160 LEKVEEEARHE 170


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 29.1 bits (66), Expect = 1.5
 Identities = 13/54 (24%), Positives = 33/54 (61%), Gaps = 4/54 (7%)

Query: 99  VKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
           +    ++L+   +K +++++R EK+  NE +    V K+ EE++++E ++L + 
Sbjct: 812 LAAELARLEKELEKLEKEIDRIEKKLSNEGF----VAKAPEEVVEKEKEKLAEY 861


>gnl|CDD|235012 PRK02201, PRK02201, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 357

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 118 ERREKEAE-NERYKGYKVNKSEEEILQQEIDRL-RKEGKYPLKPVLYVF 164
           E + K+A+ + +YK YK +K  ++  QQEI  L +K    P  P + +F
Sbjct: 164 ELQGKKAKIDAKYKDYKKDKQMKQRKQQEIQELYKKHNISPFSPFVQMF 212


>gnl|CDD|218791 pfam05879, RHD3, Root hair defective 3 GTP-binding protein (RHD3). 
           This family consists of several eukaryotic root hair
           defective 3 like GTP-binding proteins. It has been
           speculated that the RHD3 protein is a member of a novel
           class of GTP-binding proteins that is widespread in
           eukaryotes and required for regulated cell enlargement.
           The family also contains the homologous yeast synthetic
           enhancement of YOP1 (SEY1) protein which is involved in
           membrane trafficking.
          Length = 743

 Score = 28.6 bits (64), Expect = 2.2
 Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 73  LSKAIEVLLDKSARAAYDSVIRR-KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKG 131
           LS+ +E+LL+K  +  +D+V++  K EV+   S+    +  F    E  +K  +N + K 
Sbjct: 438 LSEPVELLLNKLNKETWDTVLKLFKREVEDAVSRFTDRKSGFDLSEEENDKMLKNLKRKS 497

Query: 132 YKV--NKSEEEI 141
           + V   K  EE 
Sbjct: 498 WGVLRTKIHEEA 509


>gnl|CDD|193187 pfam12711, Kinesin-relat_1, Kinesin motor.  This family is closely
           related to Kinesin-related, pfam06548.
          Length = 86

 Score = 26.8 bits (60), Expect = 2.4
 Identities = 15/58 (25%), Positives = 29/58 (50%), Gaps = 5/58 (8%)

Query: 96  KEEVKIRNSKLDATRK--KFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRK 151
           KEE+K+   +++   +  +F  +  R  +E    R + +   + E E+L +EI  LR 
Sbjct: 30  KEEIKLLREQVEHNPEVTRFALENIRLREE--LRRLQNFYE-QGEREMLLEEISELRD 84


>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
            Members of this protein family are polyribonucleotide
           nucleotidyltransferase, also called polynucleotide
           phosphorylase. Some members have been shown also to have
           additional functions as guanosine pentaphosphate
           synthetase and as poly(A) polymerase (see model
           TIGR02696 for an exception clade, within this family)
           [Transcription, Degradation of RNA].
          Length = 684

 Score = 28.2 bits (64), Expect = 2.4
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 72  LLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDL---ERREKEAENER 128
           L +K  E+  +   + AY   I  K+E   R + LDA +++  E L   E  E+EA+ E+
Sbjct: 233 LKAKVKELAYEAGLKEAYQ--ITEKQE---RYAALDAIKEEVLEALAEEEEEEEEADREK 287

Query: 129 YKGYKVNKSEEEILQQEI 146
                    E++I+++ I
Sbjct: 288 EIKEAFKDLEKKIVRERI 305


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 28.4 bits (64), Expect = 2.6
 Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 7/42 (16%)

Query: 110 RKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRK 151
           +KK+K+D+E+R +      Y+ Y   K +E  LQ   ++ R 
Sbjct: 86  KKKYKKDVEKRNR-----SYRLYLDKKRKE--LQALSEKQRH 120


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 28.0 bits (63), Expect = 2.6
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 8/58 (13%)

Query: 103 NSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEE--------ILQQEIDRLRKE 152
              L+  ++K +++LER  KEA  E+ K   ++   E         +++ EID L K+
Sbjct: 259 EDTLEELKEKLRKNLERELKEATLEKRKEQLLDALVEANDFDLPESLVEAEIDNLLKQ 316


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 27.9 bits (63), Expect = 3.2
 Identities = 22/98 (22%), Positives = 38/98 (38%), Gaps = 24/98 (24%)

Query: 64  EKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKE 123
           EK  E  H+L   ++ L                  +++    +D   KK K++L     E
Sbjct: 330 EKLEERLHILEGLLKALNK------------IDFVIEVIRGSID--LKKAKKELIEELSE 375

Query: 124 AENE--------RYKGYKVNKSEEEI--LQQEIDRLRK 151
            + +        R    ++ K E+EI  L++EI  L K
Sbjct: 376 IQADYLLDMRLRRLTKEEIEKLEKEIEELEKEIAELEK 413


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 27.7 bits (62), Expect = 3.7
 Identities = 13/67 (19%), Positives = 29/67 (43%)

Query: 86  RAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQE 145
           +          +  +   SK +   +K   +++ ++KE + E  K  ++ + EE I + E
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLE 356

Query: 146 IDRLRKE 152
           +    KE
Sbjct: 357 VQATDKE 363


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 27.0 bits (59), Expect = 3.7
 Identities = 12/56 (21%), Positives = 31/56 (55%)

Query: 97  EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
           ++ KI   ++   R K + + E+R+KE++ ER K       ++ ++ ++ D + ++
Sbjct: 39  DDSKIVRVEMKEERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQ 94


>gnl|CDD|134335 PRK00625, PRK00625, shikimate kinase; Provisional.
          Length = 173

 Score = 27.0 bits (60), Expect = 4.3
 Identities = 18/45 (40%), Positives = 21/45 (46%), Gaps = 5/45 (11%)

Query: 118 ERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKY--PLKPV 160
           +R +K    ER K      S EEIL Q IDR+R    Y   L  V
Sbjct: 111 QRLQKRGLPERLKH---APSLEEILSQRIDRMRSIADYIFSLDHV 152


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 27.7 bits (61), Expect = 4.5
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 91  SVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENER 128
           S + +K E  +  +K +A +K  +E    +EKE E ER
Sbjct: 575 SKLAKKREEAVEKAKREAEQKAREEREREKEKEKERER 612


>gnl|CDD|180530 PRK06328, PRK06328, type III secretion system protein; Validated.
          Length = 223

 Score = 27.1 bits (60), Expect = 4.8
 Identities = 18/74 (24%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 88  AYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYK-GYKVNKSEEEILQQEI 146
           A+ +++  +E ++      +A  ++  E+ E+  +EA+N+ +K G K    +   L++E 
Sbjct: 24  AFSALLDAQELLEKTKEDSEAYTQETHEECEKLREEAKNQGFKEGSKAWSKQLAFLEEET 83

Query: 147 DRLRKEGKYPLKPV 160
            +LR++ K  L P+
Sbjct: 84  QKLREQVKEALVPL 97


>gnl|CDD|148788 pfam07374, DUF1492, Protein of unknown function (DUF1492).  This
           family consists of several hypothetical, highly
           conserved Streptococcal and related phage proteins of
           around 100 residues in length. The function of this
           family is unknown. It appears to be distantly related to
           pfam08281.
          Length = 100

 Score = 26.3 bits (58), Expect = 5.0
 Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 11/66 (16%)

Query: 114 KEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPVLYVFRINLWLISY 173
           KE +E+   EA   + +           L + ID+L       +  ++Y+ ++ +W +  
Sbjct: 31  KEAIEQDTAEAIKRKLE-----------LGRLIDKLENPDSRTILRMVYIDKMTVWQVCD 79

Query: 174 QGRCSR 179
           +  CSR
Sbjct: 80  KLNCSR 85


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 26.8 bits (60), Expect = 5.1
 Identities = 12/59 (20%), Positives = 29/59 (49%)

Query: 96  KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGK 154
           K E++    KL+  +++ ++ +   E + E    +  +  + EE+    EI  L+K+ +
Sbjct: 122 KSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 26.8 bits (60), Expect = 5.2
 Identities = 14/67 (20%), Positives = 30/67 (44%), Gaps = 4/67 (5%)

Query: 90  DSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAEN--ERYKGYKVNKSEEEI--LQQE 145
             +++     K    +L+   KK + +LE+ EKE +   E+ +      SE      ++E
Sbjct: 7   QKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKE 66

Query: 146 IDRLRKE 152
           + +  +E
Sbjct: 67  LQKKVQE 73


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.2 bits (61), Expect = 5.2
 Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 98  EVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEI--LQQEIDRLRKE 152
           E+K R ++L    ++  +  E  +++ E+   K   + K E+E+   ++ +D   +E
Sbjct: 70  ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEE 126


>gnl|CDD|182904 PRK11020, PRK11020, hypothetical protein; Provisional.
          Length = 118

 Score = 26.1 bits (58), Expect = 5.7
 Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 7/55 (12%)

Query: 96  KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLR 150
           K E+K  + +LDA R K      R + E        Y   + E+  L+ EI RL+
Sbjct: 4   KNEIKRLSDRLDAIRHKLAAASLRGDAEK-------YAQFEKEKATLEAEIARLK 51


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 27.3 bits (61), Expect = 6.0
 Identities = 20/109 (18%), Positives = 52/109 (47%), Gaps = 13/109 (11%)

Query: 56  HPDKNPDDEKAI-ETFHL---LSKAIEVLLDKSARAAYDSVIRRKEEVKIR--------N 103
              K  D+ + + ET ++   L KA+E+L  +       + I +K E K+          
Sbjct: 170 LQLKEKDELQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLR 229

Query: 104 SKLDATRKKFKEDLERREKEAE-NERYKGYKVNKSEEEILQQEIDRLRK 151
            +L A +K+   + + +++  +  E+ +  K+ +  ++++++E+ +L  
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTKLSL 278


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 21/99 (21%), Positives = 42/99 (42%), Gaps = 6/99 (6%)

Query: 57  PDKNPDDEKAIETFHLLSKAIEVLLDKSARAAYDSVIRR-KEEVKIRNSKLDATRKKFKE 115
              +P  ++ +  F   +  I+ + + + +   +S+I   KEE+   + KL   + + +E
Sbjct: 132 TASDPVVQELVSIF---NDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEE 188

Query: 116 DLERREKEAENERYKGY--KVNKSEEEILQQEIDRLRKE 152
           +LER  KE   E       ++    E        +LR E
Sbjct: 189 ELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 27.1 bits (61), Expect = 6.7
 Identities = 10/57 (17%), Positives = 25/57 (43%), Gaps = 10/57 (17%)

Query: 97  EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEG 153
           E + +   +L  T ++F E+ + ++K  E E+ +        +E+ +     L  + 
Sbjct: 750 EILGVPPEQLPKTVERFFEEWKEQKK--EIEKLR--------KELAELLASELLSKA 796


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 26.8 bits (60), Expect = 6.8
 Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 4/67 (5%)

Query: 75  KAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKV 134
           K  E  + + AR    + I  KEE  I         KK +E  E  E+E E E       
Sbjct: 473 KTFEKYIPEIARKL--AEILEKEEPDIEEVLAKLIGKKLEE--EEVEEEEEEEAVVVESA 528

Query: 135 NKSEEEI 141
               E+ 
Sbjct: 529 KNYTEKD 535


>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative.  The gene product may
           represent a DNA helicase. Eukaryotic members of this
           family have been characterized as binding certain
           single-stranded G-rich DNA sequences (GGGGT and GGGCT).
           A number of related proteins are characterized as
           helicases [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 637

 Score = 27.1 bits (60), Expect = 6.8
 Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 6/79 (7%)

Query: 80  LLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEE 139
           LL  + + + D +I    +       +   R+K  E +E R K+ +       +   S+ 
Sbjct: 237 LLKSNKQHSLDYLIENHPK----YQIVADIREKIDELIEERNKKLKP--SPQKRRGLSDI 290

Query: 140 EILQQEIDRLRKEGKYPLK 158
           +IL++ + +    G   LK
Sbjct: 291 KILRKALKKREARGIESLK 309


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 26.5 bits (58), Expect = 7.4
 Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 114 KEDLERREKEAEN-ERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPVLYVFRIN 167
           K  ++ RE      E  K  ++ + ++   QQ+   L KE K  +K ++ + R+N
Sbjct: 69  KSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVKQLIEMNRLN 123


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 26.7 bits (59), Expect = 8.3
 Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 6/84 (7%)

Query: 73  LSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGY 132
           ++  IE L         +      E++     KL    ++  E+L+ RE+  + E  K  
Sbjct: 513 INVLIEKLSALEKE--LEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570

Query: 133 --KVNKSEEEILQQEIDRLRKEGK 154
              +   ++E+  + I R  KE K
Sbjct: 571 QEALKALKKEV--ESIIRELKEKK 592


>gnl|CDD|216282 pfam01071, GARS_A, Phosphoribosylglycinamide synthetase, ATP-grasp
           (A) domain.  Phosphoribosylglycinamide synthetase
           catalyzes the second step in the de novo biosynthesis of
           purine. The reaction catalyzed by
           Phosphoribosylglycinamide synthetase is the ATP-
           dependent addition of 5-phosphoribosylamine to glycine
           to form 5'phosphoribosylglycinamide. This domain is
           related to the ATP-grasp domain of biotin
           carboxylase/carbamoyl phosphate synthetase (see
           pfam02786).
          Length = 193

 Score = 26.1 bits (58), Expect = 8.6
 Identities = 13/25 (52%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 139 EEILQQEIDRLRKEGKYPLKPVLYV 163
           E I++  +D LRKEG  P K VLY 
Sbjct: 147 ETIVEPTVDGLRKEG-IPYKGVLYA 170


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 26.4 bits (58), Expect = 8.6
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 10/61 (16%)

Query: 97  EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEI---DRLRKEG 153
           ++V  R   L+A R + +EDL     +A+  ++K  K  + EEE  +Q+I   DR+ +EG
Sbjct: 76  DKVVRRQEALEAARLRMQEDL-----DAKAAKFK-EKQKQLEEEKRRQKIEMWDRM-QEG 128

Query: 154 K 154
           K
Sbjct: 129 K 129


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 26.8 bits (59), Expect = 9.0
 Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 12/88 (13%)

Query: 68  ETFHLLSKAIEVLLDKSARAAYDSVIRR----KEEVKIRNSKLDATRKKFKEDLERREKE 123
           E   LLS+A +V      R  Y   + +    +EE + R   L   ++K  ED++   +E
Sbjct: 536 ECLQLLSRATQVF-----REQY---LLKHDLAREEFQRRVKLLQLQKEKQLEDIQDCREE 587

Query: 124 AENERYKGYKVNKSEEEILQQEIDRLRK 151
            ++   +  K+ +  EE    +   + +
Sbjct: 588 RKSLSERAEKLAEKFEEAKYNQELLVNR 615


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 26.3 bits (59), Expect = 9.0
 Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 15/69 (21%)

Query: 95  RKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGK 154
           R E ++ RNS+L+   ++ +  L  R+ EAENE+              ++E++RL+ E +
Sbjct: 2   RLEALEERNSELEEQIRQLELKL--RDLEAENEKL-------------ERELERLKSELE 46

Query: 155 YPLKPVLYV 163
               P L V
Sbjct: 47  KLKSPPLIV 55


>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
           Provisional.
          Length = 758

 Score = 26.7 bits (59), Expect = 9.2
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 65  KAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKI 101
           K I   HL  KAI+V+ +  ARA    V +RK+ V +
Sbjct: 387 KYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNV 423


>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
           family, C-terminal domain.  This model describes
           full-length from some species, and the C-terminal region
           only from other species, of the YidC/Oxa1 family of
           proteins. This domain appears to be univeral among
           bacteria (although absent from Archaea). The
           well-characterized YidC protein from Escherichia coli
           and its close homologs contain a large N-terminal
           periplasmic domain in addition to the region modeled
           here [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 181

 Score = 25.9 bits (58), Expect = 9.4
 Identities = 10/31 (32%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 105 KLDATRKKFKEDLERREKEAENERYKGYKVN 135
           KL   ++K+K+D ++ ++E   + YK   VN
Sbjct: 37  KLKEIQEKYKDDPQKLQQEM-MKLYKEEGVN 66


>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
          Length = 536

 Score = 26.4 bits (58), Expect = 9.5
 Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 2/91 (2%)

Query: 63  DEKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDL-ERRE 121
           +   ++    L    +  ++       +  ++  EE+   N K    R K+ + L E  E
Sbjct: 376 NNGGVDNLQTLEYEFKHNVNDYFSQQKNVPVKSLEEIIAFNKKDSKRRIKYGQTLIEASE 435

Query: 122 KEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
           K A  +      V  S+E   ++E+DR   E
Sbjct: 436 KSAITKDEFEKVVQTSQE-NAKKELDRYLVE 465


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 24.7 bits (54), Expect = 9.7
 Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 3/51 (5%)

Query: 110 RKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPV 160
           R +F + LE +EK  E E+ +     K EEE   + I +LRKE      P+
Sbjct: 10  RAEFDKKLEEKEKALEAEKEEAEARQKEEEE---EAIKQLRKELVPKANPM 57


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.386 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,322,233
Number of extensions: 881725
Number of successful extensions: 1958
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1833
Number of HSP's successfully gapped: 325
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.5 bits)