RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1769
         (1662 letters)



>gnl|CDD|100107 cd02440, AdoMet_MTases, S-adenosylmethionine-dependent
            methyltransferases (SAM or AdoMet-MTase), class I;
            AdoMet-MTases are enzymes that use
            S-adenosyl-L-methionine (SAM or AdoMet) as a substrate
            for methyltransfer, creating the product
            S-adenosyl-L-homocysteine (AdoHcy). There are at least
            five structurally distinct families of AdoMet-MTases,
            class I being the largest and most diverse. Within this
            class enzymes can be classified by different substrate
            specificities (small molecules, lipids, nucleic acids,
            etc.) and different target atoms for methylation
            (nitrogen, oxygen, carbon, sulfur, etc.).
          Length = 107

 Score = 62.1 bits (151), Expect = 2e-11
 Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 10/107 (9%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCS--VITQLTQEVVEENDCSDVITVICRRMEDI 1487
            VL++GCG G L+L  A      V  VD S   +    +        +D + V+    E++
Sbjct: 2    VLDLGCGTGALALALASGPGARVTGVDISPVALELARK--AAAALLADNVEVLKGDAEEL 59

Query: 1488 DRLPHGIENVDIIVSNWMGHVLYLDSLINAVV-YARDRFLKPHGLIL 1533
               P   E+ D+I+S+   H  +L   +   +  AR   LKP G+++
Sbjct: 60   --PPEADESFDVIISDPPLH--HLVEDLARFLEEARRL-LKPGGVLV 101


>gnl|CDD|222295 pfam13659, Methyltransf_26, Methyltransferase domain.  This family
            contains methyltransferase domains.
          Length = 117

 Score = 62.1 bits (151), Expect = 2e-11
 Identities = 28/116 (24%), Positives = 45/116 (38%), Gaps = 12/116 (10%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAK-HVISVDCS-VITQLTQEVVEENDCSDVITVICRRMEDI 1487
            VL+ G G G   L  A AG    V+ V+       L +  +     +  + V+     ++
Sbjct: 4    VLDPGAGSGAFLLAAARAGPDARVVGVELDPEAAALARRRLALAGLAPRVRVVVGDAREL 63

Query: 1488 DRLPHGIENVDIIVSNW-----MGHVLYLDSLINAVVYARDRFLKPHG---LILPD 1535
              LP G  + D+++ N       G       L +  + A  R LKP G   +I P 
Sbjct: 64   LELPDG--SFDLVLGNPPYGPRAGDPKDNRDLYDRFLAAALRLLKPGGVLVVITPA 117


>gnl|CDD|234786 PRK00517, prmA, ribosomal protein L11 methyltransferase; Reviewed.
          Length = 250

 Score = 55.2 bits (134), Expect = 6e-08
 Identities = 38/117 (32%), Positives = 54/117 (46%), Gaps = 25/117 (21%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVI----TQLTQEVVEENDCSDVI 1477
              +L  K VL+VGCG G+L++  A+ GAK V++VD   I     +  +E  E N     +
Sbjct: 115  KLVLPGKTVLDVGCGSGILAIAAAKLGAKKVLAVD---IDPQAVEAARENAELNGVELNV 171

Query: 1478 TVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHG-LIL 1533
                        LP G    D+IV+N + +   L  L   +     R LKP G LIL
Sbjct: 172  Y-----------LPQGDLKADVIVANILANP--LLELAPDLA----RLLKPGGRLIL 211


>gnl|CDD|219759 pfam08241, Methyltransf_11, Methyltransferase domain.  Members of
            this family are SAM dependent methyltransferases.
          Length = 92

 Score = 50.0 bits (120), Expect = 2e-07
 Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 14/104 (13%)

Query: 1431 LEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDRL 1490
            L+VGCG GLL+   A  G   V  VD      L+ E++           +     D + L
Sbjct: 1    LDVGCGTGLLAEALARRGGARVTGVD------LSPEMLALARKRAPRKFVV---GDAEDL 51

Query: 1491 PHGIENVDIIVSNWM-GHVLYLDSLINAVVYARDRFLKPHGLIL 1533
            P   E+ D++VS+ +  H+   +  +  +     R LKP G ++
Sbjct: 52   PFPDESFDVVVSSLVLHHLPDPERALREIA----RVLKPGGKLV 91


>gnl|CDD|218990 pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA).
            This family consists of several Ribosomal protein L11
            methyltransferase (EC:2.1.1.-) sequences.
          Length = 294

 Score = 54.2 bits (131), Expect = 2e-07
 Identities = 37/157 (23%), Positives = 62/157 (39%), Gaps = 37/157 (23%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS-VITQLTQEVVEENDCSDVITVI 1480
            S +   + VL+VGCG G+L++   + GAK V+ VD   V  +  +E  E N     + V 
Sbjct: 156  SLVKPGETVLDVGCGSGILAIAALKLGAKKVVGVDIDPVAVRAAKENAELNGVEAQLEVY 215

Query: 1481 CRRMEDIDRLPHGI--ENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGL-----IL 1533
                     LP  +     D++V+N +     L  L   +       +KP G      IL
Sbjct: 216  ---------LPGDLPEGKADVVVANILADP--LIELAPDIY----ALVKPGGYLILSGIL 260

Query: 1534 PDRAELYCVAANDTMAATKYSFWHDVYGFDMEPIQRD 1570
             ++A+           A  YS       F++  ++  
Sbjct: 261  EEQAD---------DVAEAYSQG-----FELITVEER 283


>gnl|CDD|223574 COG0500, SmtA, SAM-dependent methyltransferases [Secondary
            metabolites biosynthesis, transport, and catabolism /
            General function prediction only].
          Length = 257

 Score = 53.4 bits (125), Expect = 3e-07
 Identities = 38/284 (13%), Positives = 83/284 (29%), Gaps = 41/284 (14%)

Query: 1370 MDIDDMTSREIFEDVYADIKTHQTTYKDKQLIGCFHDSIINNSHLFKVRKPNSHLLKDKI 1429
              +          ++Y  +      +          D ++    L         L     
Sbjct: 1    DSLLSAELLSRILELYDRLAELLDAFLLLAEE--LLDLLLVLRLL-------RLLPGGLG 51

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDR 1489
            VL++GCG G L+L  A  G +    V   +  ++        + + +  V     + +  
Sbjct: 52   VLDIGCGTGRLALL-ARLGGRGAYVVGVDLSPEMLALARARAEGAGLGLVDFVVADALGG 110

Query: 1490 LPH--GIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLILPDRAELYCVAANDT 1547
            +       + D+++S  + H+L     +  ++    R LKP G ++              
Sbjct: 111  VLPFEDSASFDLVISLLVLHLLPPAKALRELL----RVLKPGGRLV-------------- 152

Query: 1548 MAATKYSFWHDVYGFDMEPIQRDLPNIAKFHPVPGDKVMTDSILIHSIDLNTCSVDDTSF 1607
                         G     +   L             +  +++L   + L+   +     
Sbjct: 153  ------LSDLLRDGLLEGRLAALLGFGDPVLERGDILLELEALLRLEL-LDLEELLGLLG 205

Query: 1608 NLEFAMVAKEGGFVNAFVLYFKVILKLILRVEPHKFQVLAQSRK 1651
             LE       GG          ++  L L +E   F++L   + 
Sbjct: 206  ELELE----RGGLARLLEELRLLLELLALGLEARGFRLLLLLKG 245


>gnl|CDD|225173 COG2264, PrmA, Ribosomal protein L11 methylase [Translation,
            ribosomal structure and biogenesis].
          Length = 300

 Score = 53.5 bits (129), Expect = 4e-07
 Identities = 38/116 (32%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDC-SVITQLTQEVVEENDCSDVITVI 1480
              L K K VL+VGCG G+L++  A+ GAK V+ VD      +  +E    N    ++   
Sbjct: 158  KLLKKGKTVLDVGCGSGILAIAAAKLGAKKVVGVDIDPQAVEAARENARLNGVELLVQAK 217

Query: 1481 CRRMEDIDRLPHGIEN--VDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHG-LIL 1533
                     L    EN   D+IV+N +  V  L  L   +     R LKP G LIL
Sbjct: 218  G------FLLLEVPENGPFDVIVANILAEV--LVELAPDIK----RLLKPGGRLIL 261


>gnl|CDD|222415 pfam13847, Methyltransf_31, Methyltransferase domain.  This family
            appears to be have methyltransferase activity.
          Length = 151

 Score = 49.7 bits (119), Expect = 1e-06
 Identities = 28/112 (25%), Positives = 52/112 (46%), Gaps = 9/112 (8%)

Query: 1426 KDKIVLEVGCGMGLLSLFCAE--AGAKHVISVDCS--VITQLTQEVVEENDCSDVITVIC 1481
                VL++GCG G L+   AE       V+ +D S   I +  +E  ++    + +  I 
Sbjct: 3    SGIKVLDLGCGTGYLTFILAEKLGPGAEVVGIDISEEAI-EKAKENAKKLG-YENVEFIQ 60

Query: 1482 RRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLIL 1533
              +E++ +L     + D+++SN    VL      + V+    R LKP G+++
Sbjct: 61   GDIEELPQLQLEDNSFDVVISN---EVLNHLPDPDKVLEEIIRVLKPGGVLI 109


>gnl|CDD|221804 pfam12847, Methyltransf_18, Methyltransferase domain.  Protein in
            this family function as methyltransferases.
          Length = 104

 Score = 48.1 bits (115), Expect = 1e-06
 Identities = 25/116 (21%), Positives = 40/116 (34%), Gaps = 24/116 (20%)

Query: 1426 KDKIVLEVGCGMGLLSLFCAEA-GAKHVISVD-----CSVITQLTQEVVEENDCSDVITV 1479
                VL++GCG G L++  A       V  VD       +  +  +  +        IT 
Sbjct: 1    PGARVLDIGCGTGSLAIELARLFPGARVTGVDLSPEMLELARENAKLALGPR-----ITF 55

Query: 1480 ICRRMEDIDRLPHGIENVDIIVSNWMGHVL--YLDSLINAVVYARDRFLKPHGLIL 1533
            +     D       +E  D +     G  L   LD+L +         LKP G ++
Sbjct: 56   VQ---GDAPDALDLLEGFDAVFIGGGGGDLLELLDALASL--------LKPGGRLV 100


>gnl|CDD|225139 COG2230, Cfa, Cyclopropane fatty acid synthase and related
            methyltransferases [Cell envelope biogenesis, outer
            membrane].
          Length = 283

 Score = 51.1 bits (123), Expect = 2e-06
 Identities = 30/116 (25%), Positives = 45/116 (38%), Gaps = 16/116 (13%)

Query: 1423 HLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQL--TQEVVEENDCSDVITVI 1480
             L     +L++GCG G L+++ AE     V+ V  S   QL   ++ +      D + V 
Sbjct: 69   GLKPGMTLLDIGCGWGGLAIYAAEEYGVTVVGVTLS-EEQLAYAEKRIAARGLEDNVEVR 127

Query: 1481 CRRMEDIDRLPHGIENVDIIVSNWMGHVL---YLDSLINAVVYARDRFLKPHGLIL 1533
             +   D +      E  D IVS  M   +     D     V       LKP G +L
Sbjct: 128  LQDYRDFE------EPFDRIVSVGMFEHVGKENYDDFFKKV----YALLKPGGRML 173


>gnl|CDD|225443 COG2890, HemK, Methylase of polypeptide chain release factors
            [Translation, ribosomal structure and biogenesis].
          Length = 280

 Score = 48.9 bits (117), Expect = 1e-05
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 8/84 (9%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKH-VISVDCSV-ITQLTQEVVEENDCSDVITV 1479
              L  DK +L++G G G +++  A+ G    VI+VD S     L +E  E N    V+ V
Sbjct: 106  LLLQLDKRILDLGTGSGAIAIALAKEGPDAEVIAVDISPDALALARENAERNGLVRVLVV 165

Query: 1480 ICRRMEDIDRLPHGIENVDIIVSN 1503
                    D         D+IVSN
Sbjct: 166  QS------DLFEPLRGKFDLIVSN 183


>gnl|CDD|232960 TIGR00406, prmA, ribosomal protein L11 methyltransferase.  Ribosomal
            protein L11 methyltransferase is an
            S-adenosyl-L-methionine-dependent methyltransferase
            required for the modification of ribosomal protein L11.
            This protein is found in bacteria and (with a probable
            transit peptide) in Arabidopsis [Protein synthesis,
            Ribosomal proteins: synthesis and modification].
          Length = 288

 Score = 47.5 bits (113), Expect = 3e-05
 Identities = 37/132 (28%), Positives = 58/132 (43%), Gaps = 19/132 (14%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDC-SVITQLTQEVVEENDCSDVITVICRR 1483
            LKDK V++VGCG G+LS+   + GA  V+ +D   +  +  ++  E N  SD + V    
Sbjct: 158  LKDKNVIDVGCGSGILSIAALKLGAAKVVGIDIDPLAVESARKNAELNQVSDRLQVKLIY 217

Query: 1484 MEDIDRLPHGIENVDIIVSNWMGHVLY-LDSLINAVVYARDRFLKPHGL-----ILPDRA 1537
            +E            D+IV+N +  V+  L    +       R +KP G      IL  +A
Sbjct: 218  LEQPIEGKA-----DVIVANILAEVIKELYPQFS-------RLVKPGGWLILSGILETQA 265

Query: 1538 ELYCVAANDTMA 1549
            +  C A      
Sbjct: 266  QSVCDAYEQGFT 277


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 47.4 bits (113), Expect = 6e-05
 Identities = 36/183 (19%), Positives = 67/183 (36%), Gaps = 21/183 (11%)

Query: 938  NNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIA 997
            N +++ I +     ++  N    K        S G  N +      I  RP  E  D+I 
Sbjct: 761  NILLQEIAY-DVIVSLFNNPNAEKILIITEILSDGINNSD------INDRP-QELIDQII 812

Query: 998  KLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDK 1057
            +            ++   K +   ++        L     +Q+N++     +    +E +
Sbjct: 813  E-----------SEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAE 861

Query: 1058 IEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQ 1117
             E +  N N  E  E   D +K      AKD   KL  S+D IK  + ++ +K+ K   +
Sbjct: 862  EEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIK--QDEKTTKKKKKDLE 919

Query: 1118 RRK 1120
            +  
Sbjct: 920  KTD 922


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part of
            a large ribonucleoprotein complex containing the U3
            snoRNA. Depletion of the Utp proteins impedes production
            of the 18S rRNA, indicating that they are part of the
            active pre-rRNA processing complex. This large RNP
            complex has been termed the small subunit (SSU)
            processome.
          Length = 728

 Score = 47.0 bits (112), Expect = 8e-05
 Identities = 24/154 (15%), Positives = 60/154 (38%), Gaps = 9/154 (5%)

Query: 983  KIMQRPV----DENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQK--ITKSTNNNLNKKT 1036
            K MQR      +EN  EI +L   L G+++ D++ +++   K+         N     ++
Sbjct: 377  KFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAES 436

Query: 1037 NKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK---L 1093
             K +        +K   +E++  ++E     ++   K     +    ++ ++   +    
Sbjct: 437  KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPW 496

Query: 1094 SNSADSIKLNKKDEGSKENKPQQQRRKFNNFHNS 1127
              +  S+  + K + SK+    +  +  N    +
Sbjct: 497  LKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKA 530



 Score = 39.3 bits (92), Expect = 0.019
 Identities = 20/122 (16%), Positives = 42/122 (34%), Gaps = 8/122 (6%)

Query: 1005 GDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
             + +  K   +      +K        L  +   +         K+  K + + E+EE  
Sbjct: 432  KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE-- 489

Query: 1065 TNAKENDEKSTDDEK---NKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKF 1121
               +EN    T        K  D  K +++KL  +A+ I         K+ K +++    
Sbjct: 490  -LDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVK--VKKKKKKEKSIDL 546

Query: 1122 NN 1123
            ++
Sbjct: 547  DD 548



 Score = 37.3 bits (87), Expect = 0.088
 Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 19/157 (12%)

Query: 950  QNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKD 1009
              N  K     K  K   N+    K  +E +  ++      + +    KL       +K+
Sbjct: 427  PENGEKEAESKKLKKENKNEFKEKKESDEEE--ELEDEEEAKVEKVANKLLKRSEKAQKE 484

Query: 1010 -------------DKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEED 1056
                                  +K Q   K +++ L+K  NK     +    KK  ++  
Sbjct: 485  EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544

Query: 1057 KIEK----EENNTNAKENDEKSTDDEKNKLDDKAKDT 1089
             ++     EE++     +DE+  DDE+     K KD 
Sbjct: 545  DLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDL 581



 Score = 31.2 bits (71), Expect = 6.7
 Identities = 26/140 (18%), Positives = 46/140 (32%), Gaps = 20/140 (14%)

Query: 1207 NSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKG 1266
               N N  K K++  +     D+++        K     ++ Q EEE ++          
Sbjct: 440  KKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE---------- 489

Query: 1267 PEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKS 1326
                 +   P     T+S     S    +S   SS           K+       K +K 
Sbjct: 490  ----LDEENP-WLKTTSSV--GKSAKKQDSKKKSS---SKLDKAANKISKAAVKVKKKKK 539

Query: 1327 QTKSGEYDNLDEREPSTIQL 1346
            + KS + D+    E  +I+L
Sbjct: 540  KEKSIDLDDDLIDEEDSIKL 559


>gnl|CDD|222287 pfam13649, Methyltransf_25, Methyltransferase domain.  This family
            appears to be a methyltransferase domain.
          Length = 97

 Score = 42.3 bits (100), Expect = 1e-04
 Identities = 27/105 (25%), Positives = 43/105 (40%), Gaps = 12/105 (11%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV--ICRRMEDI 1487
            +L++GCG G +    A AG   V  VD   I++   E+ +E        V  +     D 
Sbjct: 1    ILDLGCGTGRVLRALARAGPSSVTGVD---ISKEALELAKERLRDKGPKVRFVV---ADA 54

Query: 1488 DRLPHGIENVDIIVSNWMGHVLYL--DSLINAVVYARDRFLKPHG 1530
              LP    + D+++      + YL    L   +  A  R L+P G
Sbjct: 55   RDLPFEEGSFDLVIC-AGLSLDYLSPKQLRALLREAA-RLLRPGG 97


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
            Provisional.
          Length = 1164

 Score = 43.4 bits (102), Expect = 0.001
 Identities = 55/308 (17%), Positives = 107/308 (34%), Gaps = 25/308 (8%)

Query: 828  LPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPAESDNKLPSNNNPSEEGE 887
            +  +E+S+ PI +                N+ +++SSS +    + +   S  +      
Sbjct: 92   IDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLS 151

Query: 888  DVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGERQRDRGDTGRHHQFQNNMIK----- 942
            +    K++  V      ++      N   +K    +Q  +     H    + M +     
Sbjct: 152  NSLSSKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVTHTTCYDKMRRSPRNT 211

Query: 943  -TIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPI 1001
             TI  +    N  KN   +KN K   +     K  +E       +  V  +  ++  L  
Sbjct: 212  STIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSE-------KSKVQNSHFDVRILRS 264

Query: 1002 PLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKE 1061
                +KKD+K V   +  +   + K  +  L K +    N Q      K    ++ I + 
Sbjct: 265  YTKENKKDEKNV--VSGIRSSVLLKRKSQCLRKDSYVYSNHQ---KKAKTGDPKNIIHRN 319

Query: 1062 ENNTNAKENDEKSTDDEKNKL-------DDKAKDTATKLSNSADSIKLNKKDEGSKENKP 1114
              ++N+  +D  S++   +             K T TK +N+  + K NK     K N P
Sbjct: 320  NGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTKHTNNTKNNKYNKTKTTQKFNHP 379

Query: 1115 QQQRRKFN 1122
             +     N
Sbjct: 380  LRHHATIN 387



 Score = 35.7 bits (82), Expect = 0.27
 Identities = 56/316 (17%), Positives = 102/316 (32%), Gaps = 42/316 (13%)

Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
              K   KV  +N S    I+   +    + +NKQ   Q            DK+ +   NT
Sbjct: 155  SSKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQ---TNVTHTTCYDKMRRSPRNT 211

Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNNFH 1125
            +  +N+    + EKNK  DK                  K  +G K+ K   ++ K  N H
Sbjct: 212  STIKNNTNDKNKEKNKEKDKNIK---------------KDRDGDKQTKRNSEKSKVQNSH 256

Query: 1126 NSSKYQRHPYHNNGGDFNYVPVYDGNINKFNKFGGGGNNRYPNHRYYDFDFIDPRSCFFP 1185
               +  R     N  D   V      I               +   Y        S    
Sbjct: 257  FDVRILRSYTKENKKDEKNVV---SGIRSSVLLKRKSQCLRKDSYVY--------SNHQK 305

Query: 1186 GSGTPYPQYGGGGPPMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDE-RKKKAT 1244
             + T  P+          N+ + NN ++N +    + S +    +  + +     KK+ T
Sbjct: 306  KAKTGDPK----------NIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTT 355

Query: 1245 QKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASS--ATNSNVTTVSNTADNSNNPSSI 1302
             K   + + +K    K  +     + +  T    SS    +      ++         ++
Sbjct: 356  TKHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEKSEYIKEFTM 415

Query: 1303 LKKSKATEDCKVKTVQ 1318
             + +K T+D  +K V+
Sbjct: 416  EEVAKLTKDTTIKLVE 431



 Score = 32.7 bits (74), Expect = 2.3
 Identities = 35/232 (15%), Positives = 81/232 (34%), Gaps = 9/232 (3%)

Query: 83  SISKKKKLKFVKVDPKELNIVIDDGRGRDTDFRNNKPNHVKSQESRKSSSHHQDKGLGSS 142
           S+S K   K +K +      +  D   R+ +      N    +++  + +   DK   S 
Sbjct: 153 SLSSKHSPKVIKENQSTHVNISSDNSPRNKE----ISNKQLKKQTNVTHTTCYDKMRRSP 208

Query: 143 TRHSNTAGGASSRNMSRGSGKHPPSNNNYKADFEVNEDSTQQHNKKHKRESSKVISVTNN 202
              S      + +N  +   K    + N K D + ++ + +   K   + S   + +  +
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEK----DKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRS 264

Query: 203 QQVNNDKDFKNNLMWDGDSAGAKKSRNQWKNLDVNNITNKTRERLMNKPASSNDSLLKDN 262
               N KD + N++    S+   K ++Q    D    +N  ++     P +        +
Sbjct: 265 YTKENKKD-EKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSS 323

Query: 263 GDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIFNAASTLNETPEHSTR 314
              +  T++++ L S+   N   S       +     N  +      + + +
Sbjct: 324 NSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTKHTNNTKNNKYNKTKTTQK 375


>gnl|CDD|202698 pfam03602, Cons_hypoth95, Conserved hypothetical protein 95. 
          Length = 183

 Score = 40.7 bits (96), Expect = 0.002
 Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 25/126 (19%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLT-QEVVEENDCSDVITVICRR 1483
            +    VL++  G G L L     GA  V+ V+       T +E +E         V+  R
Sbjct: 42   IGGARVLDLFAGSGALGLEALSRGAASVVFVEKDKKAVATLKENLEALGLEGETAVL--R 99

Query: 1484 M---EDIDRLPHGIENVDIIVSNWMGHVLYLD-----SLINAVVYARDRFLKPHGLILPD 1535
                  + RL       D++         +LD      LI   +      L  +G + P+
Sbjct: 100  NDAARALLRLAGKGPPFDLV---------FLDPPYAKGLIEEAL----ELLAENGWLNPN 146

Query: 1536 RAELYC 1541
             A +  
Sbjct: 147  -ALIVV 151


>gnl|CDD|237872 PRK14968, PRK14968, putative methyltransferase; Provisional.
          Length = 188

 Score = 40.7 bits (96), Expect = 0.003
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 8/42 (19%)

Query: 1422 SHLL-------KDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
            S LL       K   VLEVG G G++++  A+ G K V+ VD
Sbjct: 12   SFLLAENAVDKKGDRVLEVGTGSGIVAIVAAKNGKK-VVGVD 52


>gnl|CDD|226608 COG4123, COG4123, Predicted O-methyltransferase [General function
            prediction only].
          Length = 248

 Score = 40.7 bits (96), Expect = 0.004
 Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCS-VITQLTQEVVEENDCSDVITV 1479
            + + K   +L++G G G L L  A+   K  ++ V+      ++ Q  V  N   + I V
Sbjct: 40   APVPKKGRILDLGAGNGALGLLLAQRTEKAKIVGVEIQEEAAEMAQRNVALNPLEERIQV 99

Query: 1480 ICRRMEDIDRL--PHGIENVDIIVSN 1503
            I     DI          + D+I+ N
Sbjct: 100  IE---ADIKEFLKALVFASFDLIICN 122


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this family
            are the chaperone DnaK, of the DnaK-DnaJ-GrpE chaperone
            system. All members of the seed alignment were taken from
            completely sequenced bacterial or archaeal genomes and
            (except for Mycoplasma sequence) found clustered with
            other genes of this systems. This model excludes DnaK
            homologs that are not DnaK itself, such as the heat shock
            cognate protein HscA (TIGR01991). However, it is not
            designed to distinguish among DnaK paralogs in
            eukaryotes. Note that a number of dnaK genes have shadow
            ORFs in the same reverse (relative to dnaK) reading
            frame, a few of which have been assigned glutamate
            dehydrogenase activity. The significance of this
            observation is unclear; lengths of such shadow ORFs are
            highly variable as if the presumptive protein product is
            not conserved [Protein fate, Protein folding and
            stabilization].
          Length = 595

 Score = 41.5 bits (98), Expect = 0.004
 Identities = 27/101 (26%), Positives = 45/101 (44%), Gaps = 19/101 (18%)

Query: 1015 DKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEE----NNTNA--- 1067
            DK   K+Q IT + ++ L++     + I+ +    + N EEDK  KEE    NN ++   
Sbjct: 481  DKGTGKEQSITITASSGLSE-----EEIERMVKEAEANAEEDKKRKEEIEARNNADSLAY 535

Query: 1068 ------KENDEKSTDDEKNKLDDKAKDTATKLSNSA-DSIK 1101
                  KE  +K   +EK K++    +    L     + IK
Sbjct: 536  QAEKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIK 576


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 41.4 bits (97), Expect = 0.004
 Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 20/165 (12%)

Query: 267 SRTTTASILDSSLYQNTYDSK------TRNFVDSGSQIFNAASTLNETPEHSTRDGKLFS 320
             T+T  ++        +         T     S     +  S  ++       +     
Sbjct: 223 GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS-- 280

Query: 321 NEALYAPSSTTNY--PYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAAT 378
            +   +   T      +S+ S P+ ++   +    +N +S   GS       +       
Sbjct: 281 -KHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAK----- 334

Query: 379 THGYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLN 423
            H   + SN   ++  S   +N SV SS+++   S  +++S + +
Sbjct: 335 -HSNRNKSN---SSPKSHSSANGSVPSSSVSDNESKQKRASKSSS 375



 Score = 36.1 bits (83), Expect = 0.21
 Identities = 29/154 (18%), Positives = 47/154 (30%), Gaps = 8/154 (5%)

Query: 327 PSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSIS 386
            + T +      S  +   +T+         S G  S +    S    +  +  G     
Sbjct: 205 ENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHH 264

Query: 387 NTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLY 446
           +   +   S   +N+           S  Q   V  N      S  S+S+T G       
Sbjct: 265 SH-NHQHHSIGINNHH-----SKHADSKLQTIEVIENHSN--KSRPSSSSTNGSKETTSN 316

Query: 447 QQQPSSVSYSSNSYVYPQTSIATSSNVSSTSHTS 480
               ++ S  S S    + S    SN S  SH+S
Sbjct: 317 SSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSS 350


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 41.2 bits (98), Expect = 0.004
 Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 32/103 (31%)

Query: 1034 KKTNKQQNIQI-----LSP---------AKKPNKEEDKIEKEE----NNTNA-------- 1067
            K T K+Q+I I     LS          A+  N EEDK  KE     N  ++        
Sbjct: 484  KGTGKEQSITITASSGLSDEEIERMVKDAEA-NAEEDKKRKELVEARNQADSLIYQTEKT 542

Query: 1068 -KENDEKSTDDEKNKLDDKAKD--TATKLSNSADSIKLNKKDE 1107
             KE  +K   DEK K++   K+   A K     ++IK  K +E
Sbjct: 543  LKELGDKVPADEKEKIEAAIKELKEALK-GEDKEAIK-AKTEE 583


>gnl|CDD|236059 PRK07580, PRK07580, Mg-protoporphyrin IX methyl transferase;
            Validated.
          Length = 230

 Score = 39.8 bits (94), Expect = 0.007
 Identities = 30/104 (28%), Positives = 42/104 (40%), Gaps = 30/104 (28%)

Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV 1479
            P    L    +L+ GCG+G LS+  A  GAK V++ D S       ++VEE         
Sbjct: 57   PADGDLTGLRILDAGCGVGSLSIPLARRGAK-VVASDIS------PQMVEE--------- 100

Query: 1480 ICRRMEDIDRLP--HGIENVDIIVSN---WMG---HVLYLDSLI 1515
              R      R P      N+   V +    +G    V+ LD LI
Sbjct: 101  -ARE-----RAPEAGLAGNITFEVGDLESLLGRFDTVVCLDVLI 138


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 40.1 bits (94), Expect = 0.010
 Identities = 21/104 (20%), Positives = 37/104 (35%)

Query: 1206 YNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFK 1265
            Y  +N      +   T S         ST  D+++ K  + +++SE++ KK   KK    
Sbjct: 26   YIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85

Query: 1266 GPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKAT 1309
              E       P  S  T           D  N  + I + ++ +
Sbjct: 86   EGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129



 Score = 33.5 bits (77), Expect = 0.86
 Identities = 14/82 (17%), Positives = 31/82 (37%), Gaps = 5/82 (6%)

Query: 1202 FNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
               L        ++ + K   ++   DKK+D  +  ++K +  +K+++ ++E K     K
Sbjct: 31   VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETK 90

Query: 1262 IKFKGPEILNETTEPTASSATN 1283
            + FK P                
Sbjct: 91   LGFKTP-----KKSKKTKKKPP 107



 Score = 30.8 bits (70), Expect = 7.5
 Identities = 21/95 (22%), Positives = 33/95 (34%), Gaps = 8/95 (8%)

Query: 1019 SKDQKITKSTNNNLN-KKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDD 1077
            SK Q I+   +N L   K       +     K       K +KE+ N  +K+  EK    
Sbjct: 19   SKLQPISYIYSNVLVLSKEILSTFSEEE--NKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 1078 EKNKLDDKAK-----DTATKLSNSADSIKLNKKDE 1107
            +K K + K++        T   +     K  K   
Sbjct: 77   KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111


>gnl|CDD|217086 pfam02527, GidB, rRNA small subunit methyltransferase G.  This is a
            family of bacterial glucose inhibited division proteins
            these are probably involved in the regulation of cell
            devision. GidB has been shown to be a methyltransferase G
            specific to the rRNA small subunit. Previously identified
            as a glucose-inhibited division protein B that appears to
            be present and in a single copy in all complete
            eubacterial genomes so far sequenced. GidB specifically
            methylates the N7 position of a guanosine in 16S rRNA.
          Length = 184

 Score = 38.8 bits (91), Expect = 0.010
 Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 43/145 (29%)

Query: 1413 HLFKVRKPNS----HLL-----------KDKIVLEVGCGMGL----LSLFCAEAGAKHVI 1453
            +L  + +PN     HLL               V +VG G G     L++   +   K + 
Sbjct: 20   NLTSITEPNELLERHLLDSLVVLEYLDNIRIKVADVGSGAGFPGIPLAIAFPD---KKLT 76

Query: 1454 SVD-----CSVITQLTQEVVEENDCSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHV 1508
             ++      + + +L +E+  EN     +T++  R E+           D+I S  +   
Sbjct: 77   LLESLLKKINFLEELKKELNLEN-----VTIVHARAEEYQHEEQ----YDVITSRAVA-- 125

Query: 1509 LYLDSLINAVVYARDRFLKPHGLIL 1533
                SL     +A    LKP G  L
Sbjct: 126  ----SLNELTEWAL-PLLKPGGYFL 145


>gnl|CDD|225172 COG2263, COG2263, Predicted RNA methylase [Translation, ribosomal
            structure and biogenesis].
          Length = 198

 Score = 38.8 bits (91), Expect = 0.011
 Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD-CSVITQLTQEVVEENDCSDVITVICR 1482
             L+ K VL++G G G+L++  A  GA  V++VD      ++ +   EE      +  +  
Sbjct: 43   DLEGKTVLDLGAGTGILAIGAALLGASRVLAVDIDPEALEIARANAEELL--GDVEFVVA 100

Query: 1483 RMEDIDRLPHGIENVDIIVSN 1503
             + D           D ++ N
Sbjct: 101  DVSDFRGK------FDTVIMN 115


>gnl|CDD|216988 pfam02353, CMAS, Mycolic acid cyclopropane synthetase.  This family
            consist of Cyclopropane-fatty-acyl-phospholipid synthase
            or CFA synthase EC:2.1.1.79 this enzyme catalyze the
            reaction: S-adenosyl-L-methionine + phospholipid olefinic
            fatty acid <=> S-adenosyl-L-homocysteine + phospholipid
            cyclopropane fatty acid.
          Length = 273

 Score = 39.2 bits (92), Expect = 0.012
 Identities = 32/120 (26%), Positives = 44/120 (36%), Gaps = 22/120 (18%)

Query: 1419 KPNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQ-LTQEVVEENDCSDVI 1477
            KP   LL      ++GCG G L    AE    +V+ +  S       ++ V        +
Sbjct: 61   KPGMTLL------DIGCGWGGLMRRAAERYDVNVVGLTLSKNQYKHARQRVAAEGLQRKV 114

Query: 1478 TVICRRMEDIDRLPHGIENVDIIVSNWM----GHVLYLDSLINAVVYARDRFLKPHGLIL 1533
             V+ +   D D      E  D IVS  M    GH  Y        +Y     L P GL+L
Sbjct: 115  EVLLQDYRDFD------EPFDRIVSVGMFEHVGHENYDTFF--KKLY---NLLPPGGLML 163


>gnl|CDD|233687 TIGR02021, BchM-ChlM, magnesium protoporphyrin O-methyltransferase.
            This model represents the S-adenosylmethionine-dependent
            O-methyltransferase responsible for methylation of
            magnesium protoporphyrin IX. This step is essentiasl for
            the biosynthesis of both chlorophyll and
            bacteriochlorophyll. This model encompasses two closely
            related clades, from cyanobacteria (and plants) where it
            is called ChlM and other photosynthetic bacteria where it
            is known as BchM [Biosynthesis of cofactors, prosthetic
            groups, and carriers, Chlorophyll and
            bacteriochlorphyll].
          Length = 219

 Score = 38.2 bits (89), Expect = 0.017
 Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 19/105 (18%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS-VITQLTQEVVEENDCSDVITVICR 1482
             LK K VL+ GCG GLLS+  A+ GA  V +VD S  + Q+ +   +  D +  +     
Sbjct: 53   PLKGKRVLDAGCGTGLLSIELAKRGA-IVKAVDISEQMVQMARNRAQGRDVAGNVEFEVN 111

Query: 1483 RMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLK 1527
             +  +          DI+V         +D LI+   Y      K
Sbjct: 112  DLLSLC------GEFDIVVC--------MDVLIH---YPASDMAK 139


>gnl|CDD|233708 TIGR02072, BioC, biotin biosynthesis protein BioC.  This enzyme,
            which is found in biotin biosynthetic gene clusters in
            proteobacteria, firmicutes, green-sulfur bacteria,
            fusobacterium and bacteroides, is believed to carry out
            an enzymatic step prior to the formation of pimeloyl-CoA
            (although attribution of this annotation is not
            traceable). The enzyme appears related to
            methyltransferases by homology [Biosynthesis of
            cofactors, prosthetic groups, and carriers, Biotin].
          Length = 240

 Score = 38.4 bits (90), Expect = 0.019
 Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 8/85 (9%)

Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVIT 1478
                +     VL++GCG G L+    +   +   I+ D S           +   S+ + 
Sbjct: 28   KEKGIFIPASVLDIGCGTGYLTRALLKRFPQAEFIANDIS----AGMLAQAKTKLSENVQ 83

Query: 1479 VICRRMEDIDRLPHGIENVDIIVSN 1503
             IC    D ++LP    + D+IVSN
Sbjct: 84   FIC---GDAEKLPLEDSSFDLIVSN 105


>gnl|CDD|237852 PRK14896, ksgA, 16S ribosomal RNA methyltransferase KsgA/Dim1 family
            protein; Provisional.
          Length = 258

 Score = 38.0 bits (89), Expect = 0.027
 Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 5/58 (8%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
                  VLE+G G G L+   A+   K + I +D  +   L  + +   +    + +I
Sbjct: 27   DTDGDPVLEIGPGKGALTDELAKRAKKVYAIELDPRLAEFLRDDEIAAGN----VEII 80


>gnl|CDD|237898 PRK15068, PRK15068, tRNA mo(5)U34 methyltransferase; Provisional.
          Length = 322

 Score = 38.3 bits (90), Expect = 0.029
 Identities = 36/104 (34%), Positives = 46/104 (44%), Gaps = 36/104 (34%)

Query: 1423 HL--LKDKIVLEVGCGMGLLSLFC---AEAGAKHVISVDCSVITQLTQ-EVVEENDCSDV 1476
            HL  LK + VL+VGCG G           AGAK V+ +D S +  L Q E V        
Sbjct: 117  HLSPLKGRTVLDVGCGNG---YHMWRMLGAGAKLVVGIDPSQLF-LCQFEAV-------- 164

Query: 1477 ITVICRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY 1510
                 R++   D+    LP GIE +      D + S  MG VLY
Sbjct: 165  -----RKLLGNDQRAHLLPLGIEQLPALKAFDTVFS--MG-VLY 200


>gnl|CDD|222171 pfam13489, Methyltransf_23, Methyltransferase domain.  This family
            appears to be a methyltransferase domain.
          Length = 154

 Score = 35.9 bits (83), Expect = 0.053
 Identities = 28/111 (25%), Positives = 41/111 (36%), Gaps = 23/111 (20%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRR 1483
            L     VL++GCG G+L     E G   V  VD S    L   + +  D           
Sbjct: 20   LKPGGRVLDIGCGTGILLRLLRERG-FDVTGVDPSPAAVLIFSLFDAPD----------- 67

Query: 1484 MEDIDRLPHGIENVDIIVSNWMGHVL-YLDSLINAVVYARDRFLKPHGLIL 1533
                  +  G    D+I +     VL +L      +   R+  LKP G++L
Sbjct: 68   ----PAVLAG--KYDLITAF---EVLEHLPDPPALLQQLREL-LKPGGVLL 108


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 36.7 bits (85), Expect = 0.064
 Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 15/96 (15%)

Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
             +K+ K    +N+ +D         N NK+ +K +   +        +E +K E  E++ 
Sbjct: 1    MEKECKDAKHENMEED-----CCKENENKEEDKGKEEDL------EFEEIEKEEIIEDSE 49

Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIK 1101
             + E   +   DE NKL ++ K    KL N  +++K
Sbjct: 50   ESNEVKIEELKDENNKLKEENK----KLENELEALK 81


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 37.6 bits (87), Expect = 0.065
 Identities = 51/256 (19%), Positives = 83/256 (32%), Gaps = 51/256 (19%)

Query: 232 KNLDVNNITNKTRERLMNKPASSNDSLLKDNGDVDS--------RTTTASILDSSLYQNT 283
           +N   N+I N       +K +    +L        S        R +   I  S    ++
Sbjct: 47  RNGADNDIINSE-----SKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSS 101

Query: 284 YDSKTRNFVDSGSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTY 343
             S   N    G Q   A+ +L+  P  S+   +  S     +P      P   SS+   
Sbjct: 102 TSSLNSN---DGDQFSPASDSLSFNP--SSTQSRKDSGPGDGSPVQKRKNPLLPSSS--- 153

Query: 344 AAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSV 403
                  G    +      ++N G ++      +     S  S ++    P  +Q  +S 
Sbjct: 154 -----THGTHPPIVF----TDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSS 204

Query: 404 YSSTI--------------NSYSSFPQQSSVNLNTY--GHQASSLSNSNTTGYSSQNLYQ 447
            +S                NS  SFP  S+  L     G   +SL +SN T  SS     
Sbjct: 205 DTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSS----- 259

Query: 448 QQPSSVSYSSNSYVYP 463
              +  + S   Y+YP
Sbjct: 260 NSEAMSTSSKRPYIYP 275



 Score = 36.8 bits (85), Expect = 0.14
 Identities = 46/229 (20%), Positives = 84/229 (36%), Gaps = 30/229 (13%)

Query: 295 GSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNAN 354
           G QIF++ S+  +  +         +    ++PS+T++    +S+     +  + S + N
Sbjct: 69  GHQIFSSFSSSPKLFQRRNS-----AGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFN 123

Query: 355 NVASNGGGSNNGG----VYSTNSFM--AATTHGYSSISNTITNTQPSQQQSNNSVYSSTI 408
             ++     +  G    V    + +  +++THG        T+   S   + N+     I
Sbjct: 124 PSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVF-TDNNGSHAGAPNARSRKEI 182

Query: 409 NSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIA 468
            S  S   Q         H     S+S+T+   S    ++     S S  S+ +  T++ 
Sbjct: 183 PSLGSQSMQLPSP-----HFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVL 237

Query: 469 TS--SNVSSTSHTSHKYPPYNVDMKNHLDSKSTLETKSSSEHPPYVDYP 515
               S  S  S  S    P          S S  E  S+S   PY+ YP
Sbjct: 238 LKRHSGSSGASLISSNITP----------SSSNSEAMSTSSKRPYI-YP 275


>gnl|CDD|224110 COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal
            structure and biogenesis].
          Length = 245

 Score = 36.8 bits (86), Expect = 0.066
 Identities = 30/112 (26%), Positives = 48/112 (42%), Gaps = 15/112 (13%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRM 1484
            +K K+VL++G   G  +    + GAKHV +VD     QL  ++   ND      VI    
Sbjct: 78   VKGKVVLDIGSSTGGFTDVLLQRGAKHVYAVDVG-YGQLHWKL--RNDPR----VIVLER 130

Query: 1485 EDIDRL--PHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLILP 1534
             ++  L      E  D+IV +     +   SL   ++ A    LK  G ++ 
Sbjct: 131  TNVRYLTPEDFTEKPDLIVID-----VSFISLKL-ILPALLLLLKDGGDLVL 176


>gnl|CDD|219760 pfam08242, Methyltransf_12, Methyltransferase domain.  Members of
            this family are SAM dependent methyltransferases.
          Length = 98

 Score = 34.3 bits (79), Expect = 0.079
 Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 14/107 (13%)

Query: 1431 LEVGCGMGLLSLFCAEAGA-KHVISVDCS--VITQLTQEVVE-ENDCSDVITVICRRMED 1486
            L++GCG G L     EA        VD S   +    + +       +  + +      D
Sbjct: 1    LDIGCGTGTLLRALLEALPGLEYTGVDISPAALEAAAERLAALGLLDAVRVRLDVLDAID 60

Query: 1487 IDRLPHGIENVDIIV-SNWMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
            +D       + D++V SN + H+    +++  +     R LKP G++
Sbjct: 61   LD-----PGSFDVVVASNVLHHLADPRAVLRNLR----RLLKPGGVL 98


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 36.9 bits (86), Expect = 0.086
 Identities = 20/147 (13%), Positives = 54/147 (36%), Gaps = 6/147 (4%)

Query: 976  KNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNN-NLNK 1034
                 + K      + +  +            K   K   K+     K  +  N  + + 
Sbjct: 57   IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116

Query: 1035 KTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDE----KSTDDEKNKLDDKAKDTA 1090
              N  ++I +L+ A   + ++D  + ++++ +  ++DE       DD+ +  D++ K+  
Sbjct: 117  DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176

Query: 1091 TKLSNSADSIKLNKKDEGSKENKPQQQ 1117
                 S D       ++ S+  +  ++
Sbjct: 177  ELEKLSDDD-DFVWDEDDSEALRQARK 202



 Score = 33.8 bits (78), Expect = 0.78
 Identities = 18/117 (15%), Positives = 48/117 (41%), Gaps = 4/117 (3%)

Query: 988  PVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSP 1047
             +D +K    K  +    D    K +D  N + D       ++  +   +   + +    
Sbjct: 99   ELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE 158

Query: 1048 AKKPNK---EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIK 1101
                +    E+++ ++ +      ++D+   D++ ++   +A+  A KL+ +AD +K
Sbjct: 159  DDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDA-KLTATADPVK 214



 Score = 31.1 bits (71), Expect = 5.5
 Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 2/95 (2%)

Query: 1022 QKITKSTNNNLNKKTNKQQNIQ-ILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKN 1080
            +K+ K    + +K    ++ I+  L   KK  ++ D++    +      +D   +D  K 
Sbjct: 18   KKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKK 77

Query: 1081 KLDDKAKDTATKLSNSAD-SIKLNKKDEGSKENKP 1114
            K    AK  A K         +L+   +  K+N  
Sbjct: 78   KTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112


>gnl|CDD|223813 COG0742, COG0742, N6-adenine-specific methylase [DNA replication,
            recombination, and repair].
          Length = 187

 Score = 35.7 bits (83), Expect = 0.10
 Identities = 22/126 (17%), Positives = 40/126 (31%), Gaps = 24/126 (19%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEA---GAKHVISVDCSVITQLT-QEVVEENDCSDVITVI 1480
            ++   VL++  G G L L   EA   GA  V+ V+         +E ++         V+
Sbjct: 42   IEGARVLDLFAGSGALGL---EALSRGAARVVFVEKDRKAVKILKENLKALGLEGEARVL 98

Query: 1481 CRRMEDIDRLPHGIENVDIIVSNWMGHVLYLD-----SLINAVVYARDRFLKPHGLILPD 1535
                    +     E  D++         +LD      L++  +      L      L  
Sbjct: 99   RNDALRALKQLGTREPFDLV---------FLDPPYAKGLLDKELALL---LLEENGWLKP 146

Query: 1536 RAELYC 1541
             A +  
Sbjct: 147  GALIVV 152


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 36.9 bits (85), Expect = 0.11
 Identities = 19/181 (10%), Positives = 53/181 (29%), Gaps = 18/181 (9%)

Query: 960  NKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVS 1019
            N+   +   +   D+ +      + +++  +     + +        K+++K+   +   
Sbjct: 236  NEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKEN-------KEEEKEKKLQEEE 288

Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT--------NAKEND 1071
                  +           +++ +      K+  KE  K+EKE               +  
Sbjct: 289  LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKEL 348

Query: 1072 EKSTDDEKNKLDDKAKDT---ATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNNFHNSS 1128
            E   + E+ + +   K                KL  +   S     +++    N     +
Sbjct: 349  EIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEA 408

Query: 1129 K 1129
            K
Sbjct: 409  K 409


>gnl|CDD|226413 COG3897, COG3897, Predicted methyltransferase [General function
            prediction only].
          Length = 218

 Score = 35.9 bits (83), Expect = 0.11
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
             ++ K VL++G G GL+++  A AGA  V++ D
Sbjct: 77   TVRGKRVLDLGAGSGLVAIAAARAGAAEVVAAD 109


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 35.6 bits (82), Expect = 0.11
 Identities = 17/94 (18%), Positives = 30/94 (31%), Gaps = 4/94 (4%)

Query: 396 QQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSY 455
           Q     S ++ T  S      +S  N      Q +S   +    Y         PS   Y
Sbjct: 87  QNGQKRSKHTITAESMQMLDSKSD-NPQANAMQDNSFHENFNNNYPGN---YNNPSQDPY 142

Query: 456 SSNSYVYPQTSIATSSNVSSTSHTSHKYPPYNVD 489
            + +  Y Q + A  +  +      +  P  ++D
Sbjct: 143 MNQAQSYNQNAYAKENQQAQPPKYENSVPEIDID 176


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 36.7 bits (85), Expect = 0.12
 Identities = 37/188 (19%), Positives = 75/188 (39%), Gaps = 27/188 (14%)

Query: 1200 PMFNNLYNSNNYN---NNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESK- 1255
             +FN   + ++Y    N++     ++  ++   K D+ +   ++ KA  ++  S   +K 
Sbjct: 125  NLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKP 184

Query: 1256 KTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPS-SILKKSKATEDCKV 1314
             TSNK+     P   N++    AS  T +  ++  +    S++   SIL           
Sbjct: 185  STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILD---------- 234

Query: 1315 KTVQGAPKTEKSQTKSGEYDNLDEREPSTIQLFMGLGLDKPPSQKPLVY---PIRTFNMD 1371
            +  + A KT+K      + D  +       QL         P+Q  L +   P ++F  D
Sbjct: 235  QYSEDAKKTQKDYASQSKKDKTETSNTKNPQL---------PTQDELKHKSKPAQSFEND 285

Query: 1372 IDDMTSRE 1379
            ++   +R 
Sbjct: 286  VNQSNTRS 293



 Score = 32.1 bits (73), Expect = 3.1
 Identities = 34/178 (19%), Positives = 70/178 (39%), Gaps = 5/178 (2%)

Query: 180 DSTQQHNKKHKRESSKVISVTNNQQVNNDKDFKNNLMWDGDSAGAKKSRNQWKNLDVNNI 239
           D  Q  N +     S   S ++ +   + +  K +   D   A +  +     +    N 
Sbjct: 135 DYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKA-DNQKAPSSNNTKPSTSNKQPNS 193

Query: 240 TNKTRERLMNKPASSNDSLLKDNGDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIF 299
              T+    N   +S+D+  + +   D+++ + S LDS L Q + D+K      +     
Sbjct: 194 PKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKK 253

Query: 300 NAASTLNET-PEHSTRDGKLFSNEALYAPSSTTNYPYSSS-STPTYAAMTTASGNANN 355
           +   T N   P+  T+D     +++  A S   +   S++ ST  +    + S N ++
Sbjct: 254 DKTETSNTKNPQLPTQDE--LKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDS 309


>gnl|CDD|233668 TIGR01983, UbiG, ubiquinone biosynthesis O-methyltransferase.  This
            model represents an O-methyltransferase believed to act
            at two points in the ubiquinone biosynthetic pathway in
            bacteria (UbiG) and fungi (COQ3). A separate methylase
            (MenG/UbiE) catalyzes the single C-methylation step. The
            most commonly used names for genes in this family do not
            indicate whether this gene is an O-methyl, or C-methyl
            transferase [Biosynthesis of cofactors, prosthetic
            groups, and carriers, Menaquinone and ubiquinone].
          Length = 224

 Score = 35.7 bits (83), Expect = 0.14
 Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 20/115 (17%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVE------ENDCSDVIT 1478
            L    VL+VGCG GLLS   A  GA  V  +D       ++E +E      + D    I 
Sbjct: 44   LFGLRVLDVGCGGGLLSEPLARLGAN-VTGID------ASEENIEVAKLHAKKDPLLKIE 96

Query: 1479 VICRRMEDIDRLPHGIENVDIIV-SNWMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
              C  +ED+     G ++ D++     + HV    + I A      + LKP G++
Sbjct: 97   YRCTSVEDLA--EKGAKSFDVVTCMEVLEHVPDPQAFIRACA----QLLKPGGIL 145


>gnl|CDD|181382 PRK08317, PRK08317, hypothetical protein; Provisional.
          Length = 241

 Score = 35.7 bits (83), Expect = 0.14
 Identities = 23/109 (21%), Positives = 38/109 (34%), Gaps = 21/109 (19%)

Query: 1430 VLEVGCGMGLLSL-----FCAEAGAKHVISVDCS--VITQLTQEVVEENDCSDVITVICR 1482
            VL+VGCG G  +         E     V+ +D S  ++    +         +V      
Sbjct: 23   VLDVGCGPGNDARELARRVGPEG---RVVGIDRSEAMLALAKERA--AGLGPNV----EF 73

Query: 1483 RMEDIDRLPHGIENVDIIVS-NWMGHVLYLDSLINAVVYARDRFLKPHG 1530
               D D LP    + D + S   + H+      +  +     R L+P G
Sbjct: 74   VRGDADGLPFPDGSFDAVRSDRVLQHLEDPARALAEIA----RVLRPGG 118


>gnl|CDD|225137 COG2227, UbiG,
            2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol
            methylase [Coenzyme metabolism].
          Length = 243

 Score = 35.8 bits (83), Expect = 0.15
 Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 12/91 (13%)

Query: 1373 DDMTSREIFE-DVYADIKTH--QTTYKDKQLIGCFHDSIIN--NSHLFKVRKPNSHLLKD 1427
            DD  + +  E D +  + +       + K L    H   IN       +        L  
Sbjct: 7    DDTQNVDYKELDKFEALASRWWDPEGEFKPL----H--KINPLRLDYIREVARLRFDLPG 60

Query: 1428 KIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
              VL+VGCG G+LS   A  GA  V  +D S
Sbjct: 61   LRVLDVGCGGGILSEPLARLGA-SVTGIDAS 90


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 36.1 bits (83), Expect = 0.20
 Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 15/127 (11%)

Query: 1011 KKVDDKNVSK-----DQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN- 1064
            +K +   +            +   N LN+  + ++       A + + +E +  + E++ 
Sbjct: 3805 EKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDM 3864

Query: 1065 ------TNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKL--NKKDEGSKENKPQQ 1116
                  +   EN+   +++E   LD++  D    LSNS +  KL     +E   E + + 
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNE-KLWDEPNEEDLLETEQKS 3923

Query: 1117 QRRKFNN 1123
              +   N
Sbjct: 3924 NEQSAAN 3930



 Score = 35.4 bits (81), Expect = 0.36
 Identities = 27/125 (21%), Positives = 49/125 (39%), Gaps = 27/125 (21%)

Query: 990  DENKDEIAKLPIPLGGDKKD------------------DKKVDDKNV----SKDQKITKS 1027
            D ++ E  +L   + G  KD                  D++V+D       S ++K+   
Sbjct: 3851 DLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDE 3910

Query: 1028 TNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAK 1087
             N     +T ++ N Q        N E D + KE++N   ++ D +  +DE+   DD   
Sbjct: 3911 PNEEDLLETEQKSNEQ-----SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGI 3965

Query: 1088 DTATK 1092
            D   +
Sbjct: 3966 DDEIQ 3970



 Score = 33.0 bits (75), Expect = 2.1
 Identities = 25/149 (16%), Positives = 60/149 (40%), Gaps = 20/149 (13%)

Query: 970  SYGDKNKNEFKVTKIMQRPVDENKDEIAK-LPIPLGGDKKDDKKVDDKNVSKDQKITKST 1028
            S   +N+N    ++   + +DE  ++I + L   L     D+   +D   ++ +   +S 
Sbjct: 3871 SVVSENENS--DSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSA 3928

Query: 1029 NNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKD 1088
             NN +   +K+ +             EDK  +E+ +     +D    D+ +    D  ++
Sbjct: 3929 ANNESDLVSKEDD---------NKALEDKDRQEKEDEEEMSDDVGIDDEIQ---PDIQEN 3976

Query: 1089 TATKLSNSAD-----SIKLNKKDEGSKEN 1112
             +     + D      +KL++K+    ++
Sbjct: 3977 NSQPPPENEDLDLPEDLKLDEKEGDVSKD 4005


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold many
            proteins. Hsp70 assisted folding involves repeated cycles
            of substrate binding and release. Hsp70 activity is ATP
            dependent. Hsp70 proteins are made up of two regions: the
            amino terminus is the ATPase domain and the carboxyl
            terminus is the substrate binding region.
          Length = 598

 Score = 35.7 bits (83), Expect = 0.21
 Identities = 19/91 (20%), Positives = 38/91 (41%), Gaps = 11/91 (12%)

Query: 1034 KKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKL 1093
            K T K+Q I I + +   + E +++ K+     A++   K   + KN+ ++        L
Sbjct: 485  KGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSL 544

Query: 1094 SNSADSI------KLNK-----KDEGSKENK 1113
                D +      K+ +     K+E   E+K
Sbjct: 545  KEEGDKLPEADKKKVEEAIEWLKEELEGEDK 575


>gnl|CDD|203189 pfam05175, MTS, Methyltransferase small domain.  This domain is found
            in ribosomal RNA small subunit methyltransferase C as
            well as other methyltransferases .
          Length = 170

 Score = 34.1 bits (79), Expect = 0.25
 Identities = 30/131 (22%), Positives = 45/131 (34%), Gaps = 24/131 (18%)

Query: 1422 SHLLKDK--IVLEVGCGMGLLSLFCAEAGAKHVISVDCSVI----TQLTQEVVEENDCSD 1475
            SHL K     VL++GCG G+L    A+      + V    I     +  +  +  N   +
Sbjct: 25   SHLPKPLGGKVLDLGCGYGVLGAALAKRSPD--LEVTMVDINARALESARANLAANGLEN 82

Query: 1476 VITVICRRMEDIDRLPHGIENVDIIVSN---WMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
                       ++         D+I+SN     G     D     +  A    LKP G  
Sbjct: 83   GEVFWSDLYSAVEP-----GKFDLIISNPPFHAGKATDYDVAQRFIAGAARH-LKPGG-- 134

Query: 1533 LPDRAELYCVA 1543
                 EL+ VA
Sbjct: 135  -----ELWIVA 140


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
            This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 34.3 bits (79), Expect = 0.25
 Identities = 9/50 (18%), Positives = 23/50 (46%)

Query: 1212 NNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
              +K K KK D  ++  +K+D    +++ +  T+   ++     +   +K
Sbjct: 94   KKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 33.5 bits (77), Expect = 0.47
 Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 1048 AKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNK----LDDKAKDTATKLSNSADSIKLN 1103
             K+  K + K  K++ + +  + D+K  D  + K     +DK +D     S +  ++   
Sbjct: 80   EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139

Query: 1104 K 1104
            K
Sbjct: 140  K 140



 Score = 30.0 bits (68), Expect = 6.7
 Identities = 12/59 (20%), Positives = 27/59 (45%)

Query: 1037 NKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSN 1095
             KQ+       +KK   ++   + ++ +  +++ DEK  +D+   L     +T + LS 
Sbjct: 80   EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
            family of proteins is found in bacteria and viruses.
            Proteins in this family are typically between 180 and 214
            amino acids in length.
          Length = 125

 Score = 33.4 bits (77), Expect = 0.27
 Identities = 18/88 (20%), Positives = 37/88 (42%), Gaps = 9/88 (10%)

Query: 1006 DKKDDKKVDDKNVSKD-QKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
            + +++K   DK V K   K          +K ++ + +  +S  +K   E +K+EKE   
Sbjct: 1    EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE--- 57

Query: 1065 TNAKENDEKSTDDEKNKLDDKAKDTATK 1092
               +   E      + +L  +AK   ++
Sbjct: 58   -LEELEAEL----ARRELKAEAKKMLSE 80


>gnl|CDD|234708 PRK00274, ksgA, 16S ribosomal RNA methyltransferase KsgA/Dim1 family
            protein; Reviewed.
          Length = 272

 Score = 34.7 bits (81), Expect = 0.31
 Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 6/61 (9%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
            +       VLE+G G+G L+    E  AK   + +D  +   L +   E+N     +T+I
Sbjct: 38   AGPQPGDNVLEIGPGLGALTEPLLERAAKVTAVEIDRDLAPILAETFAEDN-----LTII 92

Query: 1481 C 1481
             
Sbjct: 93   E 93


>gnl|CDD|184931 PRK14967, PRK14967, putative methyltransferase; Provisional.
          Length = 223

 Score = 34.3 bits (79), Expect = 0.33
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
            VL++  G G L++  A AGA  V +VD S
Sbjct: 40   VLDLCTGSGALAVAAAAAGAGSVTAVDIS 68


>gnl|CDD|224017 COG1092, COG1092, Predicted SAM-dependent methyltransferases [General
            function prediction only].
          Length = 393

 Score = 35.0 bits (81), Expect = 0.36
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
            L   K VL +    G  S+  A  GA  V SVD S
Sbjct: 215  LAAGKRVLNLFSYTGGFSVHAALGGASEVTSVDLS 249


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 725

 Score = 35.2 bits (81), Expect = 0.39
 Identities = 24/201 (11%), Positives = 71/201 (35%), Gaps = 7/201 (3%)

Query: 937  QNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEI 996
             N    +   ++ +N +  +   ++  +         +N  EF+  +I    +     ++
Sbjct: 409  DNKNKNSSFINKTENILTNSPLKDELLEKTTE-IINIENPQEFEFGQIGNDIISTEIAQL 467

Query: 997  A--KLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKT-NKQQNIQILSPAKKPNK 1053
               +  I  G    ++   +  N     KI ++ N   +  T +   +    + A+  N 
Sbjct: 468  DENQNLIDTGEFDLENNFSNSFNPENGNKIDENINETFDTSTISANLSENKTNFAQSFNN 527

Query: 1054 EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK---LSNSADSIKLNKKDEGSK 1110
            ++  +   E   +  ++     + +KN   +  KD  +    ++    +IK + +++   
Sbjct: 528  KDTNLINSEIPIDLIKDTITINNSQKNVKKNGNKDYLSVEEVINLIMLAIKFHSQNQVEY 587

Query: 1111 ENKPQQQRRKFNNFHNSSKYQ 1131
            +   Q   +    F    ++ 
Sbjct: 588  KKLVQNWNKNLPLFEYDVEFM 608



 Score = 34.0 bits (78), Expect = 0.84
 Identities = 24/153 (15%), Positives = 47/153 (30%), Gaps = 7/153 (4%)

Query: 965  NFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKI 1024
              +     D+N+ +FK  +       + K++  +    + G        +      ++  
Sbjct: 354  ALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNK 413

Query: 1025 TKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDD 1084
              S  N    +     +       +K  +  +    +E       ND  ST+  +   + 
Sbjct: 414  NSSFINKT--ENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIISTEIAQLDENQ 471

Query: 1085 KAKDTAT-----KLSNSADSIKLNKKDEGSKEN 1112
               DT         SNS +    NK DE   E 
Sbjct: 472  NLIDTGEFDLENNFSNSFNPENGNKIDENINET 504


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 34.5 bits (78), Expect = 0.39
 Identities = 31/150 (20%), Positives = 64/150 (42%), Gaps = 10/150 (6%)

Query: 328 SSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISN 387
           +S+ +   S+ +     A+TT S +A+     G  +N     +T S    TT   S+ + 
Sbjct: 25  TSSGSSTASAGNVTGTTAVTTPSPSAS-----GPSTNQSTTLTTTSAPITTTAILSTNTT 79

Query: 388 TITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQ 447
           T+T+T  +      +  +STIN  +    Q+          ++ ++++ TT  SS     
Sbjct: 80  TVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSAT 139

Query: 448 QQPSSVSYSSNSYVYPQTSIATSSNVSSTS 477
            + ++ +  +     P  S   +SN + T+
Sbjct: 140 TRITNATTLA-----PTLSSKGTSNATKTT 164


>gnl|CDD|129627 TIGR00536, hemK_fam, HemK family putative methylases.  The gene hemK
            from E. coli was found to contribute to heme biosynthesis
            and originally suggested to be protoporphyrinogen oxidase
            (Medline 95189105). Functional analysis of the nearest
            homolog in Saccharomyces cerevisiae, YNL063w, finds it is
            not protoporphyrinogen oxidase and sequence analysis
            suggests that HemK homologs have
            S-adenosyl-methionine-dependent methyltransferase
            activity (Medline 99237242). Homologs are found, usually
            in a single copy, in nearly all completed genomes, but
            varying somewhat in apparent domain architecture. Both E.
            coli and H. influenzae have two members rather than one.
            The members from the Mycoplasmas have an additional
            C-terminal domain [Protein fate, Protein modification and
            repair].
          Length = 284

 Score = 34.3 bits (79), Expect = 0.42
 Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 6/76 (7%)

Query: 1430 VLEVGCGMGLLSLFCAEAGA-KHVISVDCSV-ITQLTQEVVEENDCSDVITVICRRMEDI 1487
            +L++G G G ++L  A       VI+VD S     + +E  E+N     +  I   +   
Sbjct: 118  ILDLGTGSGCIALALAYEFPNAEVIAVDISPDALAVAEENAEKNQLEHRVEFIQSNL--F 175

Query: 1488 DRLPHGIENVDIIVSN 1503
            + L    + +DIIVSN
Sbjct: 176  EPLAG--QKIDIIVSN 189


>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
            recombination, and repair].
          Length = 1041

 Score = 35.1 bits (81), Expect = 0.45
 Identities = 13/68 (19%), Positives = 22/68 (32%), Gaps = 2/68 (2%)

Query: 1050 KPNKEEDKIEK--EENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE 1107
            K  + E K     EE+  N   +D      + NK   +A D    +   ++   LN    
Sbjct: 19   KEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAVDKHSEIESLNVGMR 78

Query: 1108 GSKENKPQ 1115
                   +
Sbjct: 79   LDVRLSSK 86


>gnl|CDD|176193 cd08231, MDR_TM0436_like, Hypothetical enzyme TM0436 resembles the
            zinc-dependent alcohol dehydrogenases (ADH).  This group
            contains the hypothetical TM0436 alcohol dehydrogenase
            from Thermotoga maritima,  proteins annotated as
            5-exo-alcohol dehydrogenase, and other members of the
            medium chain dehydrogenases/reductase
            (MDR)/zinc-dependent alcohol dehydrogenase-like family.
            MDR, which contains the zinc-dependent alcohol
            dehydrogenase (ADH-Zn) and related proteins, is a diverse
            group of proteins related to the first identified member,
            class I mammalian ADH.  MDRs display a broad range of
            activities and are distinguished from the smaller short
            chain dehydrogenases (~ 250 amino acids vs. the ~ 350
            amino acids of the MDR).  The MDR proteins have 2
            domains: a C-terminal NAD(P) binding-Rossmann fold domain
            of a beta-alpha form and an N-terminal catalytic domain
            with distant homology to GroES.  The MDR group contains a
            host of activities, including the founding alcohol
            dehydrogenase (ADH), quinone reductase, sorbitol
            dehydrogenase, formaldehyde dehydrogenase, butanediol DH,
            ketose reductase, cinnamyl reductase, and numerous
            others. The zinc-dependent alcohol dehydrogenases (ADHs)
            catalyze the NAD(P)(H)-dependent interconversion of
            alcohols to aldehydes or ketones. Active site zinc has a
            catalytic role, while structural zinc aids in stability.
          Length = 361

 Score = 34.5 bits (80), Expect = 0.46
 Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 1427 DKIVLEVGCGM-GLLSL-FCAEAGAKHVISVD 1456
               V+  G G  GL ++     AGA+ VI +D
Sbjct: 178  GDTVVVQGAGPLGLYAVAAAKLAGARRVIVID 209


>gnl|CDD|149213 pfam08003, Methyltransf_9, Protein of unknown function (DUF1698).
            This family contains many hypothetical proteins. It also
            includes two putative methyltransferase proteins.
          Length = 315

 Score = 34.3 bits (79), Expect = 0.49
 Identities = 34/110 (30%), Positives = 46/110 (41%), Gaps = 33/110 (30%)

Query: 1423 HL--LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVI 1480
            HL  LK + +L+VGCG G         GA  V+ +D S +     E V            
Sbjct: 110  HLSPLKGRTILDVGCGNGYHMWRMLGEGAALVVGIDPSELFLCQFEAV------------ 157

Query: 1481 CRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY-----LDSLI 1515
             R++   D+    LP GIE +      D + S  MG VLY     LD L+
Sbjct: 158  -RKLLGNDQRAHLLPLGIEQLPALEAFDTVFS--MG-VLYHRRSPLDHLL 203


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 605

 Score = 34.8 bits (80), Expect = 0.50
 Identities = 31/182 (17%), Positives = 64/182 (35%), Gaps = 35/182 (19%)

Query: 935  QFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKD 994
            + + ++ +  +F  +   ++  +P     K        +K         I  + ++   +
Sbjct: 342  RLKASVNQVFEFEIYLYKIINANPDKLENKKIEISIKPEKKT-INNSQIITTKKINI-VE 399

Query: 995  EIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKE 1054
            E  K              V    + K +        N   + NK+Q+I          K+
Sbjct: 400  ESNK------------NSVHFDTLYKTKIFYHKKKIN---QNNKEQDI----------KK 434

Query: 1055 EDKIEKEENNTNAK--ENDE--KSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSK 1110
            E+ +EKE    + K  +NDE   + +  K K  +K  + +  +    +      K+E S 
Sbjct: 435  EELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFN----KFKNEESA 490

Query: 1111 EN 1112
            EN
Sbjct: 491  EN 492


>gnl|CDD|129544 TIGR00452, TIGR00452, tRNA (mo5U34)-methyltransferase.  This model
            describes a set of proteins with homology in its central
            region to a large number of methyltransferases active on
            a variety of substrates. The similarity includes a motif
            resembling ILDVGCG[SG]G. Known examples to date are
            restricted to the proteobacteria [Unknown function,
            Enzymes of unknown specificity].
          Length = 314

 Score = 34.1 bits (78), Expect = 0.55
 Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 26/101 (25%)

Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV 1479
            P+   LK + +L+VGCG G         GAK ++ +D +V+     E V           
Sbjct: 115  PHLSPLKGRTILDVGCGSGYHMWRMLGHGAKSLVGIDPTVLFLCQFEAV----------- 163

Query: 1480 ICRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY 1510
              R++ D D+     P GIE +      D + S  MG VLY
Sbjct: 164  --RKLLDNDKRAILEPLGIEQLHELYAFDTVFS--MG-VLY 199


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 34.3 bits (79), Expect = 0.61
 Identities = 15/86 (17%), Positives = 30/86 (34%), Gaps = 2/86 (2%)

Query: 1028 TNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAK 1087
             N     K  KQQN Q  +  KK  K +  ++ ++       ++ +  ++++N    K K
Sbjct: 4    ENGENEVKQQKQQNKQKGTK-KKNKKSKKDVDDDDAFLAELISENQEAENKQNN-KKKKK 61

Query: 1088 DTATKLSNSADSIKLNKKDEGSKENK 1113
                K   +         D     + 
Sbjct: 62   KKKKKKKKNLGEAYDLAYDLPVVWSS 87


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 34.4 bits (79), Expect = 0.65
 Identities = 24/122 (19%), Positives = 43/122 (35%), Gaps = 12/122 (9%)

Query: 1006 DKKDDKKVDDKN-------VSKDQKI---TKSTNNNLNKKTNKQQNIQILSPAKKPNKEE 1055
                +  +D KN            KI   +   ++ LN   NK+ + ++    KK NK +
Sbjct: 8    SISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTD 67

Query: 1056 DKIE--KEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENK 1113
            D  +  K +     K   +   DD+ +   D   ++       A S+    K +  K  K
Sbjct: 68   DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKK 127

Query: 1114 PQ 1115
              
Sbjct: 128  KI 129



 Score = 32.9 bits (75), Expect = 1.8
 Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 6/131 (4%)

Query: 1200 PMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSN 1259
            P+F      ++Y   + +    DS  N+D  +      ++K K   K     ++SKKT  
Sbjct: 20   PLFIKNLKHSSYKI-RLESDIKDSLLNLDINKKLHEKLDKKNKKFNK-TDDLKDSKKTKL 77

Query: 1260 KKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQG 1319
            K+ K    ++  +        + N++       A +         K K  +  K  TV  
Sbjct: 78   KQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRK----PKPKTEKLKKKITVNK 133

Query: 1320 APKTEKSQTKS 1330
            +   +K +  S
Sbjct: 134  STNKKKKKVLS 144



 Score = 32.5 bits (74), Expect = 2.3
 Identities = 22/113 (19%), Positives = 38/113 (33%), Gaps = 4/113 (3%)

Query: 1001 IPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEK 1060
            I L  D KD     D N    +K+ K  N   NK  + + + +     KK  K + K+  
Sbjct: 33   IRLESDIKDSLLNLDINKKLHEKLDK-KNKKFNKTDDLKDSKKTKLKQKK--KIKKKLHI 89

Query: 1061 EENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENK 1113
            +++  N  ++   S            +    K +          K    K+ K
Sbjct: 90   DDDYDNFFDSKNNSKQFAGPLAISLMRKPKPK-TEKLKKKITVNKSTNKKKKK 141


>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family.  The
            proteins of the Trk family are derived from Gram-negative
            and Gram-positive bacteria, yeast and wheat. The proteins
            of E. coli K12 TrkH and TrkG as well as several yeast
            proteins have been functionally characterized.The E. coli
            TrkH and TrkG proteins are complexed to two peripheral
            membrane proteins, TrkA, an NAD-binding protein, and
            TrkE, an ATP-binding protein. This complex forms the
            potassium uptake system. This family is specific for the
            eukaryotic Trk system [Transport and binding proteins,
            Cations and iron carrying compounds].
          Length = 800

 Score = 34.2 bits (78), Expect = 0.66
 Identities = 29/226 (12%), Positives = 66/226 (29%), Gaps = 8/226 (3%)

Query: 858  VTTTTSSSETKPAESDNKLPSNNNPSEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGH 917
            V        +   +SD +   + +  E  +    R +  V P    +S  M Q  +E+  
Sbjct: 192  VKRDEPDQNSPDVKSDTRADESISDLEFEKFAKRRGSRDVDPEDLYRSIMMLQGIHERIR 251

Query: 918  KSGGERQRDRGDTGRHHQFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKN 977
            +      R    +    Q Q     +        + ++ +  +   + +++KS+    + 
Sbjct: 252  EKSSANSRSDERSSESIQEQVERRPS-------TSDIERNSQSLT-RRYDDKSFDKAVRL 303

Query: 978  EFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTN 1037
                T       D  + + AK    +  D         KN        K    +   K +
Sbjct: 304  RRSKTIDRAEACDLEELDRAKDFEKMTYDNWKAHHRKKKNFRPRGWNLKFRKASRFPKDS 363

Query: 1038 KQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLD 1083
             +      +     +    +     +  N      K  +D ++ L 
Sbjct: 364  DRNYEDNGNHLSASSSFGSEEPSLSSEENLYPTYNKKREDSRHTLS 409


>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose 2'-O-methyltransferase
            Hen1.  Members of this protein family are bacterial Hen1,
            a 3' terminal RNA ribose 2'-O-methyltransferase that acts
            in bacterial RNA repair. All members of the seed
            alignment belong to a cassette with the RNA repair enzyme
            polynucleotide kinase-phosphatase (Pnkp). Chemically
            similar Hen1 in eukaryotes acts instead on small
            regulatory RNAs [Transcription, RNA processing, Protein
            synthesis, tRNA and rRNA base modification].
          Length = 462

 Score = 33.8 bits (78), Expect = 0.73
 Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 1430 VLEVGCGMG-LLSLFCAEAGAKHVISVDCSV 1459
            VL++GCG G LL L  AE   + +  VD S 
Sbjct: 286  VLDLGCGEGKLLRLLLAEKQFERIAGVDVSA 316


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
            Superfamily; Provisional.
          Length = 844

 Score = 34.1 bits (78), Expect = 0.76
 Identities = 22/103 (21%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 1006 DKKDDKKVDDK-NVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
            D+ +D   DD  N   DQ + +S N +  +K  +++  Q  +  K  N   D  E    +
Sbjct: 137  DQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD 196

Query: 1065 TN-AKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKD 1106
               +K +D    +D   + DD   ++A +   ++++     K 
Sbjct: 197  GGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKA 239


>gnl|CDD|128898 smart00650, rADc, Ribosomal RNA adenine dimethylases. 
          Length = 169

 Score = 32.5 bits (75), Expect = 0.92
 Identities = 17/75 (22%), Positives = 30/75 (40%), Gaps = 8/75 (10%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDID 1488
            VLE+G G G L+    E   +   I +D  +  +L ++    ++    +TVI     D  
Sbjct: 17   VLEIGPGKGALTEELLERAKRVTAIEIDPRLAPRLREKFAAADN----LTVI---HGDAL 69

Query: 1489 RLPHGIENVDIIVSN 1503
            +          +V N
Sbjct: 70   KFDLPKLQPYKVVGN 84


>gnl|CDD|235350 PRK05134, PRK05134, bifunctional 3-demethylubiquinone-9
            3-methyltransferase/ 2-octaprenyl-6-hydroxy phenol
            methylase; Provisional.
          Length = 233

 Score = 33.2 bits (77), Expect = 0.97
 Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 1/32 (3%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
            L  K VL+VGCG G+LS   A  GA  V  +D
Sbjct: 47   LFGKRVLDVGCGGGILSESMARLGAD-VTGID 77


>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668).  Members
           in this family of proteins are annotated as Cysteine and
           tyrosine-rich protein 1, however currently no function
           is known.
          Length = 154

 Score = 32.1 bits (73), Expect = 0.97
 Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 564 PGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQAPH 597
           P  PP  T D++M Y  ++P P   ++P PQA  
Sbjct: 110 PMAPPPYTYDHEMEYPTDLPPP---YSPAPQASA 140


>gnl|CDD|129628 TIGR00537, hemK_rel_arch, HemK-related putative methylase.  The gene
            hemK from E. coli was found to contribute to heme
            biosynthesis and originally suggested to be
            protoporphyrinogen oxidase (Medline 95189105). Functional
            analysis of the nearest homolog in Saccharomyces
            cerevisiae, YNL063w, finds it is not protoporphyrinogen
            oxidase and sequence analysis suggests that HemK homologs
            have S-adenosyl-methionine-dependent methyltransferase
            activity (Medline 99237242). Homologs are found, usually
            in a single copy, in nearly all completed genomes, but
            varying somewhat in apparent domain architecture. This
            model represents an archaeal and eukaryotic protein
            family that lacks an N-terminal domain found in HemK and
            its eubacterial homologs. It is found in a single copy in
            the first six completed archaeal and eukaryotic genomes
            [Unknown function, Enzymes of unknown specificity].
          Length = 179

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
            LK   VLE+G G GL+++     G   +++ D +
Sbjct: 18   LKPDDVLEIGAGTGLVAIRLKGKGKC-ILTTDIN 50


>gnl|CDD|182340 PRK10258, PRK10258, biotin biosynthesis protein BioC; Provisional.
          Length = 251

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 19/74 (25%), Positives = 33/74 (44%), Gaps = 9/74 (12%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDR 1489
            VL+ GCG G +S +  E G+  V ++D S    +  +  +++     +        DI+ 
Sbjct: 46   VLDAGCGPGWMSRYWRERGS-QVTALDLS--PPMLAQARQKDAADHYLA------GDIES 96

Query: 1490 LPHGIENVDIIVSN 1503
            LP      D+  SN
Sbjct: 97   LPLATATFDLAWSN 110


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 33.5 bits (77), Expect = 1.2
 Identities = 34/262 (12%), Positives = 76/262 (29%), Gaps = 32/262 (12%)

Query: 823  IPPTVLPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPAESDNKLPSNNNP 882
               T   +S+  +  +         S    +  ++V   +   ++      +  P N   
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKS 1218

Query: 883  SEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGERQRDRGDTGRHHQFQNNMIK 942
            +  G D    +  K  P   +     S+  N        +                    
Sbjct: 1219 NSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD--------------- 1263

Query: 943  TIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIP 1002
                   +    KN P   +   ++       +K     +    +P    K ++ K    
Sbjct: 1264 ----DLSKEGKPKNAPKRVSAVQYSPPPP---SKRPDGESNGGSKPSSPTKKKVKKRLEG 1316

Query: 1003 LGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEE 1062
                 K  KK + K   K     KS         ++   +      +   K+ D   +++
Sbjct: 1317 SLAALKKKKKSEKKTARKK----KSKTRVKQASASQSSRLL----RRPRKKKSDSSSEDD 1368

Query: 1063 NNTNAKENDEKSTDDEKNKLDD 1084
            +++  + +D +  DDE ++ DD
Sbjct: 1369 DDS--EVDDSEDEDDEDDEDDD 1388


>gnl|CDD|177970 PLN02336, PLN02336, phosphoethanolamine N-methyltransferase.
          Length = 475

 Score = 33.2 bits (76), Expect = 1.2
 Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 31/121 (25%)

Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEEND------------ 1472
             + K VLE+G G+G  +   A+  A  VI++D        + V+++N+            
Sbjct: 36   YEGKSVLELGAGIGRFTGELAKK-AGQVIALD------FIESVIKKNESINGHYKNVKFM 88

Query: 1473 CSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYL-DSLINAVVYARDRFLKPHGL 1531
            C+DV +       D++       +VD+I SNW+  ++YL D  +  +     ++LK  G 
Sbjct: 89   CADVTS------PDLNISD---GSVDLIFSNWL--LMYLSDKEVENLAERMVKWLKVGGY 137

Query: 1532 I 1532
            I
Sbjct: 138  I 138


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 33.6 bits (77), Expect = 1.3
 Identities = 21/156 (13%), Positives = 61/156 (39%), Gaps = 11/156 (7%)

Query: 968  NKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQK-ITK 1026
             +   +      ++  +    +   ++E+ +L         ++ +   + + + ++ I +
Sbjct: 248  KERLEELKARLLEIESLELEALKIREEELRELE-----RLLEELEEKIERLEELEREIEE 302

Query: 1027 STNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKA 1086
                    +   ++  ++L   K     E+++EK E      E++ +   +EKN+L    
Sbjct: 303  LEEELEGLRALLEELEELLEKLKS---LEERLEKLEEKLEKLESELEELAEEKNELAKLL 359

Query: 1087 KDTATKLSNSADSI--KLNKKDEGSKENKPQQQRRK 1120
            ++   +L    + +  +L K  E  K+ +   Q  K
Sbjct: 360  EERLKELEERLEELEKELEKALERLKQLEEAIQELK 395


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 33.2 bits (76), Expect = 1.3
 Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 9/84 (10%)

Query: 1008 KDDKKVDDKNVSKDQKITKSTNNNLN---KKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
            K +  +      K Q++ K  N  L    KK  + ++    S   +   +E+K+++EEN 
Sbjct: 357  KSNTDLSGD---KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413

Query: 1065 TNAKENDEKSTDDEKNKLDDKAKD 1088
               K+  E++ +D++ +  D+ K 
Sbjct: 414  ---KKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 32.0 bits (73), Expect = 1.4
 Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 6/96 (6%)

Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
            K +K  +S   +L KK       + L   K   + E K  ++      K  +E +  +  
Sbjct: 65   KYKKEFESAVESLKKKLK----FEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHL 120

Query: 1080 NKLDDKAKDTATKL--SNSADSIKLNKKDEGSKENK 1113
            N+          +     S   +KL  K++  K +K
Sbjct: 121  NEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDK 156


>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 46 to 94 amino acids in length. This domain is
           found associated with pfam01462, pfam00560.
          Length = 122

 Score = 31.5 bits (71), Expect = 1.5
 Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 812 YVNHNKYEPQYIPPTVLPSSEK---SRKPISHVSPPVVTSIITSTATSNVTTTTSSSETK 868
           ++  N    +     V P S K   S KP+  +  P  T+  T+T T+  TTTT  + TK
Sbjct: 11  WIQKNSGIVKDSSGYVDPDSAKCSGSGKPVRSIICPTTTTTTTTTTTTMPTTTTLPTTTK 70


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 33.2 bits (75), Expect = 1.5
 Identities = 48/254 (18%), Positives = 90/254 (35%), Gaps = 39/254 (15%)

Query: 864  SSETKPAESDNKLPSNNNPSEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGER 923
            + E K AE +NK+ +     E  ED    + AK       K  ++ +   +K  +   E+
Sbjct: 1718 AEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777

Query: 924  QR------DRGDTGRHHQFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNN-KSYGDKNK 976
            +       D  D  R  +    +        F N       G +     N+ K   D   
Sbjct: 1778 EAVIEEELDEEDEKRRMEVDKKIKDI-----FDNFANIIEGGKEGNLVINDSKEMEDSAI 1832

Query: 977  NEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLN--- 1033
             E   +K MQ                     ++    +    +K+ +  +  N   +   
Sbjct: 1833 KEVADSKNMQL--------------------EEADAFEKHKFNKNNENGEDGNKEADFNK 1872

Query: 1034 KKTNKQQNIQILSPAKKPNK-EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK 1092
            +K  K+ + + +  A +  K ++D IE+E  N N    +    DD+ +K +   +D    
Sbjct: 1873 EKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRDAE-- 1930

Query: 1093 LSNSADSIKLNKKD 1106
                 + IK++KKD
Sbjct: 1931 -ETREEIIKISKKD 1943


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
            [Transcription].
          Length = 175

 Score = 32.0 bits (73), Expect = 1.7
 Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 2/85 (2%)

Query: 994  DEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNK 1053
            DEI +    +   +K D K  DK V   ++  +   +    K  ++ +       +  ++
Sbjct: 85   DEIDEEIQAM--TEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDE 142

Query: 1054 EEDKIEKEENNTNAKENDEKSTDDE 1078
            +ED  E  E      E DE   DDE
Sbjct: 143  DEDDDEIVEILIEDDEVDEDEDDDE 167


>gnl|CDD|181159 PRK07890, PRK07890, short chain dehydrogenase; Provisional.
          Length = 258

 Score = 32.2 bits (74), Expect = 1.7
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 3/33 (9%)

Query: 1424 LLKDKIVLEVGCGMGL---LSLFCAEAGAKHVI 1453
            LLK K+V+  G G GL   L++  A AGA  V+
Sbjct: 2    LLKGKVVVVSGVGPGLGRTLAVRAARAGADVVL 34


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
            family.  Members of this protein family occur in
            Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
            related Mycoplasmas in small paralogous families that may
            also include truncated forms and/or pseudogenes. Members
            are predicted lipoproteins with a conserved signal
            peptidase II processing and lipid attachment site. Note
            that the name for certain characterized members, p72,
            reflects an anomalous apparent molecular weight, given a
            theoretical MW of about 61 kDa.
          Length = 541

 Score = 32.9 bits (75), Expect = 1.8
 Identities = 15/99 (15%), Positives = 35/99 (35%), Gaps = 1/99 (1%)

Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
             K + K  ++      +   K  + N N K+N +   +  +P+    +++    K +   
Sbjct: 36   KKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDE-ENPSSTNPEKKPDPSKNKEEI 94

Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNK 1104
               +++ K  D +               +N  D   L+K
Sbjct: 95   EKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDK 133


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
            Provisional.
          Length = 481

 Score = 32.5 bits (73), Expect = 1.9
 Identities = 21/81 (25%), Positives = 34/81 (41%)

Query: 1217 KQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEP 1276
            K+    +T   +KK+ +T      K  T+  + +   S  T+  K         N T   
Sbjct: 272  KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTS 331

Query: 1277 TASSATNSNVTTVSNTADNSN 1297
            T S  TN+N  + +NT  N+N
Sbjct: 332  TPSKNTNTNTNSNTNTNSNTN 352



 Score = 31.3 bits (70), Expect = 5.1
 Identities = 29/138 (21%), Positives = 54/138 (39%), Gaps = 5/138 (3%)

Query: 1202 FNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
            +NNL +S+ Y   K   K+T +     K +  T     +K+A    +++   +  T+ KK
Sbjct: 227  WNNLSSSSIYVGQKLAIKQT-ANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKK 285

Query: 1262 ----IKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTV 1317
                 +   P+   E  +P  + +TN+N    +   + + N ++    SK T        
Sbjct: 286  ETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNT 345

Query: 1318 QGAPKTEKSQTKSGEYDN 1335
                 T  +Q  S    N
Sbjct: 346  NTNSNTNANQGSSNNNSN 363


>gnl|CDD|225319 COG2521, COG2521, Predicted archaeal methyltransferase [General
            function prediction only].
          Length = 287

 Score = 32.4 bits (74), Expect = 2.0
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVD 1456
            VL+   G+G  ++   E GA HVI+V+
Sbjct: 138  VLDTCTGLGYTAIEALERGAIHVITVE 164


>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
           secretion].
          Length = 490

 Score = 32.6 bits (74), Expect = 2.1
 Identities = 19/162 (11%), Positives = 42/162 (25%), Gaps = 8/162 (4%)

Query: 321 NEALYAPS-----STTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFM 375
            EA YA       +     Y+++S    +A+T  +       S+G  +    V +     
Sbjct: 136 TEAQYAEMWAQDAAAMYGYYAAASAAA-SALTPFNEPPPTTNSSGLAAQASAVQALGDLA 194

Query: 376 AATTH-GYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSN 434
           +  T     ++S        +     N    +         Q        +G+  S    
Sbjct: 195 SGITLASQVNLSLLELINPATLSGLANGGTGNLGIGALQQAQNLGFGNVGFGNLGSGNPG 254

Query: 435 SNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIATSSNVSST 476
           +                         +   T+I  ++  +  
Sbjct: 255 APGLASQFSATNLGTLLGSLNPYLGNI-GATNIGLAAAGTGN 295



 Score = 32.2 bits (73), Expect = 2.7
 Identities = 23/130 (17%), Positives = 33/130 (25%), Gaps = 10/130 (7%)

Query: 318 LFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAA 377
             S     A S +   P+  +     A     SG  N    N  G   GG     +    
Sbjct: 334 AGSGNLGVANSGSAAAPFGIAGANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGG 393

Query: 378 TTHGYSSISNT----ITNTQPSQQQSNNSVYSS------TINSYSSFPQQSSVNLNTYGH 427
                   +      I N   S   SNN+   +      T             N    G 
Sbjct: 394 IGASGIGNTGYGNSGIANAGLSNAGSNNAGGENAGNANNTGGGNVGLWNAGDFNAGAAGT 453

Query: 428 QASSLSNSNT 437
             ++  + NT
Sbjct: 454 GFTNNGSYNT 463


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
            the C subunit of DNA polymerase delta. It carries the
            essential residues for binding to the Pol1 subunit of
            polymerase alpha, from residues 293-332, which are
            characterized by the motif D--G--VT, referred to as the
            DPIM motif. The first 160 residues of the protein form
            the minimal domain for binding to the B subunit, Cdc1, of
            polymerase delta, the final 10 C-terminal residues,
            362-372, being the DNA sliding clamp, PCNA, binding
            motif.
          Length = 427

 Score = 32.5 bits (74), Expect = 2.3
 Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 28/187 (14%)

Query: 960  NKNFKNFNNKSYGDK-----NKNEFKVTKIMQRPVDENKDEIAKLPIP------------ 1002
            N +  + N+  YG +     N N  + T +   PV        K                
Sbjct: 112  NLHNCSKNSLEYGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSI 171

Query: 1003 LGGDKKDDKKVDDKNVSKDQKI------TKSTNNNLNKKTNKQQNIQILSPAKKPNKEED 1056
            +  + K       ++ SK+         T     +L +   K+ NI + S  KK  KE+ 
Sbjct: 172  MSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNI-MSSFFKKKTKEKK 230

Query: 1057 KIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTAT----KLSNSADSIKLNKKDEGSKEN 1112
            + ++   +T  +E++E+S   +    D+ A+ T             S + +  +E ++E 
Sbjct: 231  EKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEK 290

Query: 1113 KPQQQRR 1119
            + ++++R
Sbjct: 291  EKEKRKR 297


>gnl|CDD|184826 PRK14799, thrS, threonyl-tRNA synthetase; Provisional.
          Length = 545

 Score = 32.5 bits (74), Expect = 2.4
 Identities = 39/194 (20%), Positives = 77/194 (39%), Gaps = 34/194 (17%)

Query: 1227 VDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNV 1286
            +D K D++ T +  KKA  +   +    +   N +I +KG ++     + + S+  N   
Sbjct: 38   IDVKSDTSITLDEVKKAINENVLANVSIE---NNQIVYKGNKVSIIEDKVSISTNLNPKY 94

Query: 1287 TTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQ-----TKSGEYDN--LDER 1339
              + N + +  NP+            +++ V    + +         K+ E D+  + E+
Sbjct: 95   FEILNISTHHPNPNEQY--------VRIRGVAFETEEQLKDYLTWLEKAEETDHRLIGEK 146

Query: 1340 EPSTIQLFM-------GLGLDKPPSQ---KPLVYPIRTFN--MDIDDMTSREIFEDVYAD 1387
                + LF        GL L  P  Q     L+  +R  N  M   ++ +  +F+     
Sbjct: 147  ----LDLFSFHEEAGSGLVLFHPKGQTIRNELIAFMREINDSMGYQEVYTSHVFKTDIWK 202

Query: 1388 IKTHQTTYKDKQLI 1401
            I  H T Y+DK ++
Sbjct: 203  ISGHYTLYRDKLIV 216


>gnl|CDD|234029 TIGR02830, spore_III_AG, stage III sporulation protein AG.  CC A
            comparative genome analysis of all sequenced genomes of
            shows a number of proteins conserved strictly among the
            endospore-forming subset of the Firmicutes. This protein,
            a member of this panel, is found in a spore formation
            operon and is designated stage III sporulation protein AG
            [Cellular processes, Sporulation and germination].
          Length = 186

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 26/123 (21%)

Query: 963  FKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKK-------------- 1008
            F +  +    D   NE    + +Q  V   +DEI+        + K              
Sbjct: 23   FSSSEDIEESDTPNNEKTEPEFVQGEVQ-KEDEISDYEKQYENELKEILEKIEGVGDVTV 81

Query: 1009 ------DDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQ---ILSPAKKPNKEEDKIE 1059
                   ++KV  KN SK Q+ T+ T+    K++ + ++     ++   +  ++E   + 
Sbjct: 82   MVNLDSSEEKVYAKNTSKGQQTTEETDKEGGKRSVEDESDGEEVVII--RNGDQETPVVL 139

Query: 1060 KEE 1062
            K E
Sbjct: 140  KTE 142


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This protein,
            which interacts with both microtubules and TRAF3 (tumour
            necrosis factor receptor-associated factor 3), is
            conserved from worms to humans. The N-terminal region is
            the microtubule binding domain and is well-conserved; the
            C-terminal 100 residues, also well-conserved, constitute
            the coiled-coil region which binds to TRAF3. The central
            region of the protein is rich in lysine and glutamic acid
            and carries KKE motifs which may also be necessary for
            tubulin-binding, but this region is the least
            well-conserved.
          Length = 506

 Score = 32.2 bits (73), Expect = 2.5
 Identities = 17/134 (12%), Positives = 41/134 (30%), Gaps = 4/134 (2%)

Query: 1215 KSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETT 1274
            K K+ +       +KK++      R KK  +K+  ++++      K+ +     +  +  
Sbjct: 150  KEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPE 209

Query: 1275 EPT----ASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQTKS 1330
            EP          +      + T+    + S    +        +K    +P     +T+ 
Sbjct: 210  EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269

Query: 1331 GEYDNLDEREPSTI 1344
                       S  
Sbjct: 270  SSKRTETRPRTSLR 283


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
            unknown].
          Length = 523

 Score = 32.2 bits (73), Expect = 2.6
 Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 15/93 (16%)

Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKEND-EKSTDDE 1078
            K  K TK    N  K T  ++         K      K+   +N  +  EN  E  T+D 
Sbjct: 3    KASKATKKFTKNHLKNTIDRR---------KQLARSKKVYGTKNRNSHTENKMESGTNDN 53

Query: 1079 KNKLDDKAKDTATKLSNSADSIKLNKKDEGSKE 1111
                 +K K+  +KL +  DS    ++++GS+ 
Sbjct: 54   -----NKNKEKLSKLYSDVDSSSSEEEEDGSES 81


>gnl|CDD|220679 pfam10294, Methyltransf_16, Putative methyltransferase. 
          Length = 170

 Score = 31.1 bits (71), Expect = 2.8
 Identities = 26/130 (20%), Positives = 49/130 (37%), Gaps = 7/130 (5%)

Query: 1407 SIINNSHLFKVRKPNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKH--VISVDCSVITQLT 1464
            +++ + +L     P  + L    VLE+G G GL+ +           V   D     +L 
Sbjct: 25   AVVLSKYLEMKITPGGNNLSGLNVLELGSGTGLVGI-AVALLLPGASVTITDLEEAIELM 83

Query: 1465 QEVVEENDCSDVITV-ICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARD 1523
            ++ +E N  S  +T  +    ED+         VD+I+       +Y +     +    +
Sbjct: 84   KKNIELNGLSSKVTAKVLDWGEDLPDDVFDPHPVDLIL---AADCVYNEDSFPLLEKTLN 140

Query: 1524 RFLKPHGLIL 1533
              L    +IL
Sbjct: 141  DLLGKETVIL 150


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 31.9 bits (72), Expect = 2.9
 Identities = 20/97 (20%), Positives = 30/97 (30%), Gaps = 9/97 (9%)

Query: 789 SRHLSYEDKDLAKPPVPHTSGNIYVNHNKYEPQYIPPTVLPSSEKSRKPISHVSPPVVTS 848
            R  S   +          S     +  ++  +       P++   R P S  +PP  T 
Sbjct: 240 QRGPSGSPQHPPSTTSQDQSTT--GDGQEHTQR---RKTPPATSNRRSPHSTATPPPTTK 294

Query: 849 II-TSTATSNVTTTTSSSETKPAES---DNKLPSNNN 881
              T   T   T TT S  + P  S       P+  N
Sbjct: 295 RQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQN 331


>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
          Length = 593

 Score = 32.1 bits (73), Expect = 3.0
 Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 811 IYVNHNKYEPQYIPPTVLPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPA 870
            Y +      +Y+  + LP+   S+ P   V     T   TS+AT++  + +S S +   
Sbjct: 510 KYPDQIDINIKYLKKSQLPAFVLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSN 569

Query: 871 --ESDNKLPSNNNPSEEGEDVSG 891
             ES   + S    +      +G
Sbjct: 570 SNESSPTMSSTELLNVSSTSTTG 592


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
            the highly conserved family of the major tail subunit
            protein.
          Length = 121

 Score = 30.1 bits (67), Expect = 3.4
 Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 1043 QILSPAKKPNKEEDKIEKEENNTNAKENDEK--STDDEKNKLDDKAKDTATKLSNSADSI 1100
            +I+    K  +++ + E+E+ +  +KE  +K  ST +E+ +L DK  D   +   S DS+
Sbjct: 42   KIVRVEMKEERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSL 101


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is a
            family of proteins conserved in fungi. The function is
            not known.
          Length = 436

 Score = 31.7 bits (72), Expect = 3.6
 Identities = 13/106 (12%), Positives = 34/106 (32%), Gaps = 11/106 (10%)

Query: 1201 MFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQD---------STSTDERKKKATQKQQQSE 1251
              ++L +       K  +K   S ++ +  +D          +   +  KK  +  +   
Sbjct: 4    FLDSLPDEKA--PTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGT 61

Query: 1252 EESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSN 1297
              + K S+K  +        +  +P  S+ +  +    S      +
Sbjct: 62   PRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTES 107



 Score = 30.5 bits (69), Expect = 7.4
 Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 10/87 (11%)

Query: 1212 NNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILN 1271
               ++ +K +  T        S     + +K+ +  + S  +SK  S +  + + PE   
Sbjct: 60   GTPRNPKKSSKPTE--SSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPE--- 114

Query: 1272 ETTEPTAS-----SATNSNVTTVSNTA 1293
            ET +P AS     S   S  +T ++TA
Sbjct: 115  ETPDPIASIGGWWSLWGSITSTATSTA 141


>gnl|CDD|221309 pfam11912, DUF3430, Protein of unknown function (DUF3430).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 209 to 265 amino acids in
           length.
          Length = 204

 Score = 30.9 bits (70), Expect = 3.7
 Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 6/115 (5%)

Query: 363 SNNGGVYSTNSF--MAATTHGYSSISNTI-TNTQPSQQQSNNSVYSSTINSYSSFPQQSS 419
           S++G +   N +     +    S+ +  I +    S    +   Y S+       P  S 
Sbjct: 69  SSDGNIIKYNYYEDSNCSGKPISTETFEIGSCVYSSSYSGSYYSYISSCTDSPEIPSNSL 128

Query: 420 VNLNTYGHQASSLSNSNTTGYSSQNL--YQQQPSSVSYSSNSYVYPQTSIATSSN 472
           V    Y     S  N     Y + N         S +Y  NS   P T+   SSN
Sbjct: 129 VT-VYYNGSDCSNDNFLYIEYFTNNTCINNNNGFSFTYYCNSTNQPYTTYCKSSN 182


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 31.7 bits (72), Expect = 3.7
 Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 15/68 (22%)

Query: 1033 NKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK 1092
            +K T K+Q+I I   +  P  E +++ +E                EK   +DK K  A  
Sbjct: 522  DKGTGKKQDITITGASTLPKDEVERMVQEA---------------EKFAKEDKEKRDAVD 566

Query: 1093 LSNSADSI 1100
              N ADS+
Sbjct: 567  TKNQADSV 574


>gnl|CDD|215437 PLN02816, PLN02816, mannosyltransferase.
          Length = 546

 Score = 31.9 bits (72), Expect = 3.7
 Identities = 32/155 (20%), Positives = 56/155 (36%), Gaps = 18/155 (11%)

Query: 537 IGHFEFRNTLDIAPM-IPMDSYMTNYMQPGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQA 595
           +GH EFR  L + P+ +    Y    M+      S+S  K      VP  N     P + 
Sbjct: 335 LGHKEFRFVLPVLPIALIFSGYAFAQMEVSGSSSSSSVTKK---KQVPRQNHTKWSP-KL 390

Query: 596 PHSAMYKYTSHIPYDMPTSQATQKPFMKLNDSTSGTDTSAYISPTHMRVHTHQI----PA 651
             S  +   ++IP  +         +M L       D   Y+S    +     I    P 
Sbjct: 391 RLSVYFLLATNIPMAL---------YMSLFHQRGTEDAMNYLSDEAYKGRVKSILFLMPC 441

Query: 652 HRYPVENTTTNKMPLAYTEPQPPNLQASLNDYQRY 686
           H  P  +T    +P+ + +  P   +  L++  ++
Sbjct: 442 HSTPYYSTLHRNIPMQFLDCTPSAEKGELDESDQF 476


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
            includes the B. subtilis YqfQ protein, also known as
            VrrA, which is functionally uncharacterized. This family
            of proteins is found in bacteria. Proteins in this family
            are typically between 146 and 237 amino acids in length.
            There are two conserved sequence motifs: QYGP and PKLY.
          Length = 155

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 12/83 (14%)

Query: 1193 QYGGGGPPMFNNL------YNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDE--RKKKAT 1244
            QYG    P+  NL      +   + ++++ ++ + +ST+  +++    +  E   KKK  
Sbjct: 73   QYG----PLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128

Query: 1245 QKQQQSEEESKKTSNKKIKFKGP 1267
              + ++E+E  KT  KK K   P
Sbjct: 129  VPKPKTEKEKPKTEPKKPKPSKP 151


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 31.7 bits (71), Expect = 3.8
 Identities = 60/364 (16%), Positives = 115/364 (31%), Gaps = 51/364 (14%)

Query: 326 APSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYS-- 383
           AP +T +  +  +        TTA     ++     G        T+     TT      
Sbjct: 425 APDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPE 484

Query: 384 -----SISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTT 438
                S + + T    S   +  +  +++  +  +    ++ +        ++ + S  T
Sbjct: 485 DTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT 544

Query: 439 GYSSQNLYQQ-QPSSVSYSSNSYVYPQTSIATSSNVSSTSHTSHKYPPYNVDMKNHLDSK 497
           G +S       Q +  S  +N+     TS    S ++S   T                  
Sbjct: 545 GTTSVPNATSPQVTEESPVNNTNTPVVTS--APSVLTSAVTTGQHGT------------- 589

Query: 498 STLETKSSSEHPPYVDYPNYMGTQYQDSLMKDVKTAADPIGHFEFRNTLDIAPMIPMDSY 557
                 S     P   +      +   +    + T+A P G     N  +  P +P  ++
Sbjct: 590 -GSSPTSQQPGIPSSSHST---PRSNSTSTTPLLTSAHPTGG---ENITEETPSVPSTTH 642

Query: 558 MTNYMQPGPPPQSTSDYKMLYWNNV-PMPNMLH----APPPQAPHSAMYKYTSHIPYDMP 612
           ++  + PGP P +TS       ++    P  +H     P P A   +            P
Sbjct: 643 VST-LSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPS-----------AP 690

Query: 613 TSQATQKP---FMKLNDSTSGTDTSAYISPTHMRVHTHQIPAHRYPVENTTTNKMPLAYT 669
           + Q T  P         +++  +TS          HT++  A R    N TT   P   +
Sbjct: 691 SGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNE-GAFRTTPYNATTYLPPSTSS 749

Query: 670 EPQP 673
           + +P
Sbjct: 750 KLRP 753


>gnl|CDD|227002 COG4655, COG4655, Predicted membrane protein [Function unknown].
          Length = 565

 Score = 31.8 bits (72), Expect = 4.1
 Identities = 19/89 (21%), Positives = 31/89 (34%), Gaps = 1/89 (1%)

Query: 347 TTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNS-VYS 405
             A GN  N  S    S + GV   +   +       S  +  TN   S      S +  
Sbjct: 414 EIALGNVANPDSVASFSLDCGVTRADLLDSPLISIGGSAVSIYTNPPASSLTFFQSDIPY 473

Query: 406 STINSYSSFPQQSSVNLNTYGHQASSLSN 434
            TI S S+     ++  +  G+   S++ 
Sbjct: 474 RTIKSVSTRDTSVTLTGSLLGNLLLSINL 502


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.  The
            proteins in this family are poorly characterized, but an
            investigation has indicated that the immediate early
            protein is required the down-regulation of MHC class I
            expression in dendritic cells. Human herpesvirus 6
            immediate early protein is also referred to as U90.
          Length = 993

 Score = 31.6 bits (71), Expect = 4.5
 Identities = 33/128 (25%), Positives = 59/128 (46%), Gaps = 22/128 (17%)

Query: 1202 FNNL-YNSNNYNNNKSKQKKTDS-TNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSN 1259
             NN   N+    + KS  + T S T     K+   ST+ + KK T K++    ++ K+  
Sbjct: 619  MNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTNRKVKKRTAKRKTVGYKTDKS-- 676

Query: 1260 KKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQG 1319
            KKIK             + S  T++NV  +S+ +++  +  +I+KKS+       K ++ 
Sbjct: 677  KKIK-------------SDSLPTDTNVIVISSESEDEEDGFNIIKKSQL-----KKKIKS 718

Query: 1320 APKTEKSQ 1327
              K+E S 
Sbjct: 719  ELKSESSS 726


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
            family of proteins are involved in 60S ribosomal
            biogenesis. They are specifically involved in the
            processing beyond the 27S stage of 25S rRNA maturation.
            This family contains sequences that bear similarity to
            the glioma tumour suppressor candidate region gene 2
            protein (p60). This protein has been found to interact
            with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 31.2 bits (71), Expect = 4.6
 Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 7/85 (8%)

Query: 1048 AKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE 1107
              K  K+  ++E+ E     K   E S  DE ++   +  D   +  +  +S     + E
Sbjct: 206  EVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESE 265

Query: 1108 GSKENKPQQQRRKFNNFHNSSKYQR 1132
                NKP + +RK       +K QR
Sbjct: 266  YEPINKPVRPKRK-------TKAQR 283


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
            transcription initiation/nucleotide excision repair
            factor TFIIH, subunit TFB3 [Cell division and chromosome
            partitioning / Transcription / DNA replication,
            recombination, and repair].
          Length = 314

 Score = 31.1 bits (70), Expect = 4.7
 Identities = 28/155 (18%), Positives = 62/155 (40%), Gaps = 10/155 (6%)

Query: 1007 KKDDKKVDDKNVSKDQKITK-STNNNLNKKTNK----QQNIQILSPAKKPNKEEDKIEKE 1061
              D    ++K V K +++ + S  NNL + T +    +Q   +L    K  K + + + E
Sbjct: 123  LIDVSLTEEK-VKKYEEMNQDSILNNLERPTPEVMPGKQK-NVLQKRMKLKKRQLERQIE 180

Query: 1062 ENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKF 1121
            E     + N E +  +  N+L   ++D    +     S K + ++      +P +  + +
Sbjct: 181  EEE---RMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNNKIY 237

Query: 1122 NNFHNSSKYQRHPYHNNGGDFNYVPVYDGNINKFN 1156
             +          P+   GG +N   ++   I+ + 
Sbjct: 238  LSNVQGFLKDLQPFAPGGGSYNAHTIFSLAISSYG 272


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 31.6 bits (71), Expect = 4.7
 Identities = 31/170 (18%), Positives = 53/170 (31%), Gaps = 38/170 (22%)

Query: 333  YPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAAT-------------- 378
             PY  + T      T      N  +    G+    V S   +M +T              
Sbjct: 1099 LPYFMNQTSALVGNTALPKGLNVFSGYMFGAGT--VASAFLYMNSTPQSPVLALLLAPYI 1156

Query: 379  THGYSSISNTITNTQPSQQQSN----------------NSVYSSTINSYSSFPQQSSVNL 422
            ++ ++++S   + T  +   S                 + +Y   I +Y           
Sbjct: 1157 SYKFNALSLGFSITADAAIFSLFGIPAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGT 1216

Query: 423  NTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIATSSN 472
            NT+  +A+ L  S  +   +       PSS S SS        SI+TS  
Sbjct: 1217 NTFTFKAAQLQLSAASSPPAATTPTPPPSSSSSSSAQ------SISTSPG 1260


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region [General
            function prediction only].
          Length = 303

 Score = 31.2 bits (70), Expect = 5.2
 Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 2/66 (3%)

Query: 1052 NKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKE 1111
            ++  +  E EE  ++  E+DE   +D K K+  +  D A K  +    I +  + E   E
Sbjct: 238  DQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKK--SRKPHIHIEYEQERENE 295

Query: 1112 NKPQQQ 1117
              P  Q
Sbjct: 296  KIPAVQ 301


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family consists
            of the p50 and variable adherence-associated antigen
            (Vaa) adhesins from Mycoplasma hominis. M. hominis is a
            mycoplasma associated with human urogenital diseases,
            pneumonia, and septic arthritis. An adhesin is a cell
            surface molecule that mediates adhesion to other cells or
            to the surrounding surface or substrate. The Vaa antigen
            is a 50-kDa surface lipoprotein that has four tandem
            repetitive DNA sequences encoding a periodic peptide
            structure, and is highly immunogenic in the human host.
            p50 is also a 50-kDa lipoprotein, having three repeats
            A,B and C, that may be a tetramer of 191-kDa in its
            native environment.
          Length = 353

 Score = 31.2 bits (70), Expect = 5.2
 Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 11/98 (11%)

Query: 996  IAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEE 1055
            I+KL   +   K + K +DDKN    +++ +      N K   Q NI+ L   KK N E 
Sbjct: 92   ISKLSAAVENAKNEKKAIDDKNAQIAKELAER-----NAKI--QSNIEEL---KKINNEA 141

Query: 1056 DKIEKEENNTNAK-ENDEKSTDDEKNKLDDKAKDTATK 1092
             ++ K  N T A+ E   K   D K +L++ A D   K
Sbjct: 142  FELSKTVNKTIAEVEKKFKIPKDFKEQLENFADDLLDK 179


>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
           domain.  Spiders use fibroins to make silk strands. This
           family includes tubuliform silk fibroins which are used
           to protect egg cases. This domain is a structural domain
           which is found in repeats of up to 20 in many
           individuals (although this is not necessarily the case).
           RP1 makes up structural domains in the N terminal while
           RP2 makes up structural domains in the C terminal.
          Length = 167

 Score = 30.2 bits (68), Expect = 5.3
 Identities = 25/134 (18%), Positives = 50/134 (37%), Gaps = 15/134 (11%)

Query: 337 SSSTPTYAAMTTASGNANNVASNGGGSNNGG--------VYSTNSFMAATTHGY---SSI 385
           SS+T   AA + A     + A + G SN           V S     +A+ +     ++I
Sbjct: 38  SSATSASAAGSLAYALGQSAARSLGLSNASALASAVAQAVSSVGVGASASAYANAISNAI 97

Query: 386 SNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNL 445
              +         + +S+ SS  ++ S+    S+         +++ + + +   +S   
Sbjct: 98  GQFLAGQGVLNASNASSLASSFASALSASAASSAAQ---AASASAAAAAAQSQAAASAF- 153

Query: 446 YQQQPSSVSYSSNS 459
            Q    S S S+ S
Sbjct: 154 SQAASQSSSQSAAS 167


>gnl|CDD|233115 TIGR00755, ksgA, dimethyladenosine transferase.  In both E. coli and
            Saccharomyces cerevisiae, this protein is responsible for
            the dimethylation of two adjacent adenosine residues in a
            conserved hairpin of 16S rRNA in bacteria, 18S rRNA in
            eukaryotes. This adjacent dimethylation is the only rRNA
            modification shared by bacteria and eukaryotes. A single
            member of this family is present in each of the first 20
            completed microbial genomes. This protein is essential in
            yeast, but not in E. coli, where its deletion leads to
            resistance to the antibiotic kasugamycin. Alternate name:
            S-adenosylmethionine--6-N',N'-adenosyl (rRNA)
            dimethyltransferase [Protein synthesis, tRNA and rRNA
            base modification].
          Length = 253

 Score = 30.7 bits (70), Expect = 5.5
 Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 9/82 (10%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVICR 1482
            +L+  +VLE+G G+G L+    +   K   I +D  +  +L + +    +    + +I  
Sbjct: 27   VLEGDVVLEIGPGLGALTEPLLKRAKKVTAIEIDRRLAERLRKLLSLYEN----LEII-- 80

Query: 1483 RMEDIDRLPHGIEN-VDIIVSN 1503
               D  ++          +VSN
Sbjct: 81   -EGDALKVDLNEFPKQLKVVSN 101


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 30.9 bits (70), Expect = 5.6
 Identities = 21/158 (13%), Positives = 33/158 (20%), Gaps = 8/158 (5%)

Query: 579 WNNVPMPNMLHAPPPQAPHSAMYKYTSHIPYDMPTSQATQKPFMKLNDSTSGTDTS-AYI 637
           W+  P      A  PQ                    Q            +   +    + 
Sbjct: 49  WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNT------PTGSPAANPPATFA 102

Query: 638 SPTHMR-VHTHQIPAHRYPVENTTTNKMPLAYTEPQPPNLQASLNDYQRYQNERHHFTDF 696
            P           P   YPV+         A +    P  Q +L   Q+           
Sbjct: 103 LPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLP 162

Query: 697 GNFQTPPLGNSVQHNYQTNDSFGTNDLSTRIHQQPTAV 734
              Q+    +  Q   Q N              +  A+
Sbjct: 163 SQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEAL 200


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 30.6 bits (69), Expect = 5.7
 Identities = 14/57 (24%), Positives = 19/57 (33%)

Query: 558 MTNYMQPGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQAPHSAMYKYTSHIPYDMPTS 614
           M+  M  G P QS   Y M   ++ P P ML   P       + K  +         
Sbjct: 63  MSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAP 119


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
          Length = 482

 Score = 31.0 bits (71), Expect = 5.9
 Identities = 12/70 (17%), Positives = 29/70 (41%)

Query: 1046 SPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKK 1105
               KK  K  +K EK+      ++  +     +K + +++ K+   +     +     +K
Sbjct: 407  KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 1106 DEGSKENKPQ 1115
            +E  ++ K Q
Sbjct: 467  EEEEEKKKKQ 476


>gnl|CDD|219481 pfam07596, SBP_bac_10, Protein of unknown function (DUF1559).  A
           large family of paralogous proteins apparently unique to
           planctomycetes.
          Length = 255

 Score = 30.5 bits (69), Expect = 5.9
 Identities = 24/138 (17%), Positives = 43/138 (31%), Gaps = 11/138 (7%)

Query: 320 SNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSF--MAA 377
           S+ +    +ST     S  +     A   + G  ++  + GG S+  G++  NS      
Sbjct: 99  SDPSGDTGASTGPSTGSGGAGAGAGAAGGSGGGTDSGTNAGGPSSTNGMFGPNSKTRFRD 158

Query: 378 TTHGYSSISNTI---TNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSN 434
            T G    SNTI             +        +    +   ++   +T G   S+ + 
Sbjct: 159 ITDG---TSNTILVGERAGGLADSGSGGTGGDAADGGGGWGGGTTTTTSTNG---STPAT 212

Query: 435 SNTTGYSSQNLYQQQPSS 452
           S   G            S
Sbjct: 213 SGPPGGGDNGAANNGFGS 230


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded by
            an open reading frame in plasmid borne DNA repeats of
            Borrelia species. This protein is known as ORF-A. The
            function of this putative protein is unknown.
          Length = 285

 Score = 30.8 bits (70), Expect = 6.0
 Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 4/108 (3%)

Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
            K ++  K  N    KK+NK  +++        N   +K  KEE+     +N EK    + 
Sbjct: 104  KIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIKKY 163

Query: 1080 ----NKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNN 1123
                N   +  KD   KL       +         +NK +  +    N
Sbjct: 164  FKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKTLKNILYN 211


>gnl|CDD|234548 TIGR04331, o_ant_LIC12162, putative transferase, LIC12162 family.
            This protein family shows C-terminal sequence similarity
            to various surface carbohydrate biosynthesis enzymes:
            spore coat polysaccharide biosynthesis protein SpsB,
            UDP-N-acetyl-D-glucosamine 2-epimerase, lipid A
            disaccharide synthetase LpxB, etc. It may occur in
            O-antigen biosythesis regions.
          Length = 585

 Score = 31.2 bits (71), Expect = 6.0
 Identities = 18/96 (18%), Positives = 33/96 (34%), Gaps = 15/96 (15%)

Query: 1473 CSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRF--LKPHG 1530
               ++  + +R+  I    H +    I++  W+ H + +       +Y  DR+  LK   
Sbjct: 61   YERLLPSLAKRLNAIHGTQHSLRYWRILLGPWLIHFIPI-------LY--DRYLSLKLAL 111

Query: 1531 LILPDRA----ELYCVAANDTMAATKYSFWHDVYGF 1562
            L +PD      E              +    D Y F
Sbjct: 112  LKVPDVKALVLEEKDFRPRGDFDDFNHLLSDDDYNF 147


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
            consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.1 bits (70), Expect = 6.0
 Identities = 23/112 (20%), Positives = 45/112 (40%), Gaps = 1/112 (0%)

Query: 1007 KKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTN 1066
            ++ DKK  D + ++ +      N +  +    +Q  Q      KP       E ++   N
Sbjct: 220  EELDKKQIDADKAQQKADFAQDNADKQRDE-VRQKQQEAKNLPKPADTSSPKEDKQVAEN 278

Query: 1067 AKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQR 1118
             K   EK+  + K   ++  K    K  +     K ++K+   KE + Q++R
Sbjct: 279  QKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKR 330



 Score = 30.7 bits (69), Expect = 8.4
 Identities = 22/111 (19%), Positives = 45/111 (40%), Gaps = 9/111 (8%)

Query: 984  IMQRPVDENKDEIAKLPIPLG-GDKKDDKKVDD---KNVSKDQKITKSTNNNLNKKTNKQ 1039
              +  V + + E   LP P      K+DK+V +   + + K Q   K  +    K  +  
Sbjct: 245  KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKD-- 302

Query: 1040 QNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTA 1090
                     K+ +K  +K  +++     K+ +  + D +K K   +A+ T+
Sbjct: 303  ---HKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTS 350


>gnl|CDD|185490 PTZ00163, PTZ00163, hypothetical protein; Provisional.
          Length = 230

 Score = 30.5 bits (68), Expect = 6.1
 Identities = 26/90 (28%), Positives = 38/90 (42%), Gaps = 1/90 (1%)

Query: 1007 KKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTN 1066
            KKD +   +K++      ++    N  K   K     I S  K P K E K E+E    N
Sbjct: 27   KKDLQDKFNKSIITINIASRILTENNKKWYKKYIYTSIFSGNKNPQKRERKNEEENQKDN 86

Query: 1067 AK-ENDEKSTDDEKNKLDDKAKDTATKLSN 1095
             K +ND    ++ +N +DD   D    L N
Sbjct: 87   TKVDNDNNMENEMENHIDDSIDDPMDDLMN 116


>gnl|CDD|129570 TIGR00478, tly, hemolysin TlyA family protein.  Hemolysins are
            exotoxins that attack blood cell membranes and cause cell
            rupture, often by forming a pore in the membrane. At
            least two members of this protein family have been
            characterized indirectly as pore-forming hemolysins, one
            from the spirochete Serpula (Treponema) hyodysenteriae
            and one from Mycobacterium tuberculosis. However,
            homology domains in this protein suggest
            methyltransferase activity (pfam01728) and RNA-binding
            activity (pfam01479) [Unknown function, General].
          Length = 228

 Score = 30.5 bits (69), Expect = 6.1
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 1426 KDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
            K+KIVL+VG   G  +    + GAK V  VD  
Sbjct: 75   KNKIVLDVGSSTGGFTDCALQKGAKEVYGVDVG 107


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins in
            this family for which functions are known are flap
            endonucleases that generate the 3' incision next to DNA
            damage as part of nucleotide excision repair. This family
            is related to many other flap endonuclease families
            including the fen1 family. This family is based on the
            phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
            Stanford University) [DNA metabolism, DNA replication,
            recombination, and repair].
          Length = 1034

 Score = 31.0 bits (70), Expect = 6.3
 Identities = 35/166 (21%), Positives = 64/166 (38%), Gaps = 21/166 (12%)

Query: 967  NNKSYGDKNKNEFKVTKIMQRPVDENKDE----IAKLPIPLGGDKKDDKKVDDKNVSKD- 1021
                   K    F  T I    V E+ D           P+  +K++ +  D   +  D 
Sbjct: 607  TTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESES-DGSFIEVDS 665

Query: 1022 -------QKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKS 1074
                   Q  +KS   + +++  + +   I    +K  ++    E EE+N      +EK 
Sbjct: 666  VSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKD 725

Query: 1075 TDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE-GSKENKPQQQRR 1119
             DD KN+  D    +  +L    ++++ N   E  S + + QQQ+R
Sbjct: 726  ADDFKNEWQDI---SLEEL----EALEANLLAEQNSLKAQKQQQKR 764


>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
          Length = 420

 Score = 31.0 bits (70), Expect = 6.4
 Identities = 23/97 (23%), Positives = 34/97 (35%), Gaps = 5/97 (5%)

Query: 328 SSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISN 387
            S       S    TY + TT  G++   +     S     Y + S   ++T G  S S 
Sbjct: 329 RSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTST----YGSRSTFDSSTGG-GSQSG 383

Query: 388 TITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNT 424
             +    S     +S  SS     S F  Q +V  +T
Sbjct: 384 GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQTVGFST 420


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This family
            consists of several bovine specific leukaemia virus
            receptors which are thought to function as transmembrane
            proteins, although their exact function is unknown.
          Length = 561

 Score = 31.2 bits (70), Expect = 6.4
 Identities = 16/95 (16%), Positives = 43/95 (45%), Gaps = 4/95 (4%)

Query: 1235 STDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTAD 1294
                +K K  +K+++ +E  K    +   FK   +  + +  +A+S   ++  +++NT  
Sbjct: 196  EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVS 255

Query: 1295 NSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQTK 1329
             +   S    +    +D + +  + +PK +K + +
Sbjct: 256  GTAPDS----EPDEPKDAEAEETKKSPKHKKKKQR 286


>gnl|CDD|223109 COG0030, KsgA, Dimethyladenosine transferase (rRNA methylation)
            [Translation, ribosomal structure and biogenesis].
          Length = 259

 Score = 30.7 bits (70), Expect = 6.4
 Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 9/84 (10%)

Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
            +++     VLE+G G+G L+    E  A+   I +D  +   L +     ++    +TVI
Sbjct: 26   ANISPGDNVLEIGPGLGALTEPLLERAARVTAIEIDRRLAEVLKERFAPYDN----LTVI 81

Query: 1481 CRRMEDIDRLPHG-IENVDIIVSN 1503
                 D  +     +     +V+N
Sbjct: 82   ---NGDALKFDFPSLAQPYKVVAN 102


>gnl|CDD|219680 pfam07980, SusD, SusD family.  This family includes several
            hypothetical proteins. It also contains RagB, a protein
            involved in signalling and SusD, an outer membrane
            protein involved in nutrient binding.
          Length = 257

 Score = 30.6 bits (69), Expect = 6.5
 Identities = 23/114 (20%), Positives = 30/114 (26%), Gaps = 21/114 (18%)

Query: 1135 YHNNGGDFNYVPVYDGNINKFNKFGGGGNNRYPNHRYYD--------FDFIDPR------ 1180
              N G  F Y     G+       GGG     P     D        ++  DPR      
Sbjct: 14   TGNGGNSFGYGSGPAGSG-----GGGGWGGVTPTQELVDLFLMKENPYENRDPRLYATIL 68

Query: 1181 --SCFFPGSGTPYPQYGGGGPPMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQD 1232
                   G+G       GGG    +   N   Y   K       + N  +   D
Sbjct: 69   YDGDTNKGNGADTGGIYGGGGGESSGGGNPTGYYLRKYVDADDANANGGNSGID 122


>gnl|CDD|181572 PRK08871, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 626

 Score = 31.0 bits (70), Expect = 6.6
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 422 LNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSY 460
           LNT GH   ++SN NT GYS Q++ Q       +  ++Y
Sbjct: 20  LNTTGH---NISNVNTEGYSRQSVIQGTNDPRQFGGSTY 55


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 30.9 bits (70), Expect = 6.8
 Identities = 23/76 (30%), Positives = 35/76 (46%), Gaps = 4/76 (5%)

Query: 1034 KKTNKQQNIQILSPAKKPNKEEDKI--EKEENNTNAKENDEKSTDDEKNKLDDKAKDTAT 1091
            K T K+Q+I I   +  P  E +++  E E+N    KE  EK   D KN+ +        
Sbjct: 486  KGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKI--DLKNQAESLCYQAEK 543

Query: 1092 KLSNSADSIKLNKKDE 1107
            +L    D I   KK++
Sbjct: 544  QLKELKDKISEEKKEK 559


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 30.9 bits (70), Expect = 7.0
 Identities = 38/235 (16%), Positives = 81/235 (34%), Gaps = 34/235 (14%)

Query: 363 SNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSVYSS---TINSYSSFPQQSS 419
             +    +T+  +       +S S       P   ++++S   +    IN+  +    SS
Sbjct: 21  KKSPPSSTTSQELMNGNSTPNSFS-------PIPSKASSSATFTLNLPINNSVNHKITSS 73

Query: 420 VNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNS-----YVYPQT-SIATSSNV 473
            +       + S++ S++T  S   L +   SS + S++S          T     +S+V
Sbjct: 74  SSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSV 133

Query: 474 SSTSHTSHKYPPYNVDMKNHLDSKSTLETKSSSEHPPYVDYPNY--MGTQYQDSLMKDVK 531
           + +S T+      N     + +  ++  T  S          +     T    S +K   
Sbjct: 134 TLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFP 193

Query: 532 --TAADPIGHFE-------------FRNTLDIAP-MIPMDSYMTNYMQPGPPPQS 570
             T+  P  ++                 + +  P + P + + T+ + P P  QS
Sbjct: 194 YLTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQS 248



 Score = 30.5 bits (69), Expect = 9.9
 Identities = 43/182 (23%), Positives = 66/182 (36%), Gaps = 27/182 (14%)

Query: 309 PEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANN-------VASNGG 361
              ST   +    E    PSS+ N   SS +    A  +T  GNAN+        AS+  
Sbjct: 88  ISSSTSGSQSLLMEL---PSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMF 144

Query: 362 GSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSN-NSVYSSTI---------NSY 411
            SN   + + N   +ATT+   S S+ I     S  Q   N V SS             +
Sbjct: 145 NSNKLPLPNPNHSNSATTN--QSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFF 202

Query: 412 SSFPQQSSVNLNTYGHQASSL----SNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSI 467
           +     SS +       + S     + S    +S+ +L    P + S  +N  +   +SI
Sbjct: 203 NYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDL-SPLPDTQSVENNIILNSSSSI 261

Query: 468 AT 469
             
Sbjct: 262 NE 263


>gnl|CDD|225174 COG2265, TrmA, SAM-dependent methyltransferases related to tRNA
            (uracil-5-)-methyltransferase [Translation, ribosomal
            structure and biogenesis].
          Length = 432

 Score = 30.7 bits (70), Expect = 7.1
 Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
            L   + VL++ CG+G   L  A+   K V  V+
Sbjct: 291  LAGGERVLDLYCGVGTFGLPLAK-RVKKVHGVE 322


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
            unknown].
          Length = 869

 Score = 30.8 bits (69), Expect = 7.1
 Identities = 23/94 (24%), Positives = 40/94 (42%), Gaps = 9/94 (9%)

Query: 1021 DQKITKSTNNNLNKKTNKQQNIQ-ILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
            DQ   K+ + N +  T+K+  ++   S +     EE   E E  ++++  +DE S     
Sbjct: 134  DQP-EKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLD 192

Query: 1080 N-------KLDDKAKDTATKLSNSADSIKLNKKD 1106
            N          D+A   +   S+  DSI+  K D
Sbjct: 193  NLRNYIVSLKKDEADAESVLSSDDNDSIEEIKYD 226


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
            family consists of several hypothetical bacterial
            proteins of around 200 residues in length. The function
            of this family is unknown.
          Length = 214

 Score = 30.1 bits (68), Expect = 7.1
 Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 1006 DKKDDKKVDDK--NVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEEN 1063
            D+++ KK DD+     ++ K  +    N   K +K    +    +++ N+EED+   E +
Sbjct: 47   DEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE---ESS 103

Query: 1064 NTNAKENDEKSTDDEK 1079
            + N KE +EK+  + +
Sbjct: 104  DENEKETEEKTESNVE 119


>gnl|CDD|201997 pfam01829, Peptidase_A6, Peptidase A6 family. 
          Length = 358

 Score = 30.6 bits (69), Expect = 7.4
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 322 EALYAPSSTTNYPYSSSSTPTYAAM--TTASGNANNVASNGGGSNNGGVYSTNSFM 375
            A    SSTT   +++ + P + ++  TTA+  A+ V++    S N G+Y T++ M
Sbjct: 79  PAGTFASSTTV-TFNAVNFPGFTSLFGTTATSRADQVSAFRYASMNVGIYPTSNLM 133


>gnl|CDD|222862 PHA02539, 18, tail sheath protein; Provisional.
          Length = 648

 Score = 30.8 bits (70), Expect = 8.0
 Identities = 16/99 (16%), Positives = 35/99 (35%), Gaps = 13/99 (13%)

Query: 323 ALYAPSSTTNYP------YSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMA 376
           A++ PS+           Y +  +     +++ASG      + GG   + G+  TN    
Sbjct: 142 AVFIPSAKIIAKAKSVGVYPTLGSNWTTEISSASGGVAATLTLGGIVTDSGITLTN---- 197

Query: 377 ATTHGYSSISNTITNTQPSQQQSNNSV--YSSTINSYSS 413
                 ++I+        ++    + V  Y   + S + 
Sbjct: 198 -AETAEAAITAVAFQANIAKYGVPSVVALYPGEVGSTTE 235


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
            protein; Provisional.
          Length = 230

 Score = 30.0 bits (67), Expect = 8.2
 Identities = 29/141 (20%), Positives = 51/141 (36%), Gaps = 12/141 (8%)

Query: 989  VDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPA 1048
            VD N+  ++ L            ++  K +S+ Q  T         K  K    ++L   
Sbjct: 88   VDINRKVLSDLAHLQPEVFVQYVELAKKFLSQFQSTTI-------TKPKKVLINEVLQEK 140

Query: 1049 KKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEG 1108
                 EE    + E   + + ++EKS  + + +     K    +   S + I L+K    
Sbjct: 141  TINQNEEKTSLQNEKVLSPELSEEKSDSELETQPQ---KTQLKEKKPSIEHIDLSKML-- 195

Query: 1109 SKENKPQQQRRKFNNFHNSSK 1129
             KE K   +  K  NF+   K
Sbjct: 196  LKELKKLAKEHKIPNFNKLKK 216


>gnl|CDD|225953 COG3419, PilY1, Tfp pilus assembly protein, tip-associated adhesin
           PilY1 [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 1036

 Score = 30.6 bits (69), Expect = 8.6
 Identities = 42/213 (19%), Positives = 64/213 (30%), Gaps = 38/213 (17%)

Query: 295 GSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTP----TYAAMTTAS 350
           G        T      +S  +G  + + A  A          S+S P    TY    T +
Sbjct: 214 GLGATGIGDTNTINDSYSQGNGDAWESPARNAAVYND----WSNSVPTRIGTYTISLTDA 269

Query: 351 GNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSVYSSTINS 410
            N + VA     +NN    ST       +    +                  + +  +  
Sbjct: 270 ANPDTVALRASDANNIQFGSTYVAQLGDSAHTYNARPDTFGDV--------QLTTYWLAV 321

Query: 411 YSSFPQQSSVNLNTY-------GHQASSLSNSNTTGYS-----SQNLYQQQPSSVS---- 454
            S     S V LN Y       G   +   N NT   S     +   Y   P++ S    
Sbjct: 322 VSKPNGTSPVGLNQYYLTTFYGGRCVAPRPNKNTKQPSVASQSTNAEYGADPTASSGAAN 381

Query: 455 -YSSNSYVYPQTSI-----ATSSNVSSTSHTSH 481
            +S+ S      SI     A S   SS++ T+ 
Sbjct: 382 FFSAPSAESMVASIKRIFSAISGYASSSAGTAL 414


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
            structure and biogenesis].
          Length = 877

 Score = 30.7 bits (70), Expect = 8.6
 Identities = 23/141 (16%), Positives = 48/141 (34%), Gaps = 22/141 (15%)

Query: 962  NFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKL----PIPLGGDKKDDKKVDDKN 1017
            N +   N S     K     ++ ++  ++ N+D+I  L     + +     ++  V+   
Sbjct: 736  NLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEVTA 795

Query: 1018 VSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDD 1077
            V    ++       ++                +  KE +K+EKE +    K ++E     
Sbjct: 796  VVGGAELFLPLAGLIDLAAEL----------ARLEKELEKLEKEIDRIEKKLSNEGFVAK 845

Query: 1078 --------EKNKLDDKAKDTA 1090
                    EK KL +     A
Sbjct: 846  APEEVVEKEKEKLAEYQVKLA 866


>gnl|CDD|185753 cd09214, GH64-like, glycosyl hydrolase 64 family.  This family is
           represented by the laminaripentaose-producing,
           beta-1,3-glucanase (LPHase) of Streptomyces matensis and
           related bacterial and ascomycete proteins. LPHase is a
           member of glycoside hydrolase family 64 (GH64), it is an
           inverting enzyme involved in the cleavage of long-chain
           polysaccharide beta-1,3-glucans, into specific
           pentasaccharide oligomers. LPHase is a two-domain
           crescent fold structure: one domain is composed of 10
           beta-strands, eight coming from the N-terminus of the
           protein and two from the C-terminal region, and the
           protein has a second inserted domain; this cd includes
           both domains. This protein has an electronegative,
           substrate-binding cleft, and conserved Glu and Asp
           residues involved in the cleavage of the
           beta-1,3-glucan, laminarin, a plant and fungal cell wall
           component. Among bacteria, many beta-1,3-glucanases are
           implicated in fungal cell wall degradation. Also
           included in this family is GluB , the beta-1,3-glucanase
           B from Lysobacter enzymogenes Strain N4-7. Recombinant
           GluB demonstrated higher relative activity toward the
           branched-chain beta-1,3 glucan substrate zymosan A than
           toward linear beta-1,3 glucan substrates. Sometimes
           these two domains are found associated with other
           domains such as in the Catenulispora acidiphila DSM
           44928 carbohydrate binding family 6 protein in which
           they are positioned N-terminal of a carbohydrate binding
           module, family 6 (CBM_6) domain. In the
           Cellulosimicrobium cellulans, glucan
           endo-1,3-beta-glucosidase, they are positioned
           N-terminal of a RICIN, carbohydrate-binding domain, and
           in the Salinispora tropica CNB-440, coagulation factor
           5/8 C-terminal domain (FA58C) protein, they are
           positioned C-terminal of two FA58C domains which are
           proposed to function as cell surface-attached,
           carbohydrate-binding domain. This FA58C-containing
           protein has an internal peptide deletion (of approx. 44
           residues) in the LPHase domain II.
          Length = 319

 Score = 30.4 bits (69), Expect = 9.1
 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 9/55 (16%)

Query: 394 PSQQQSNNSVY--SSTINSYSSFPQQSSVNLNTYG-------HQASSLSNSNTTG 439
           P+ Q S+ S Y  ++  N Y+ F    S+N   YG        Q+S+LS ++ T 
Sbjct: 259 PAAQWSDASNYYQAAPANYYAQFWHAHSINGLAYGFPYDDVNGQSSTLSTTDPTH 313


>gnl|CDD|226973 COG4625, COG4625, Uncharacterized protein with a C-terminal OMP
           (outer membrane protein) domain [Function unknown].
          Length = 577

 Score = 30.5 bits (69), Expect = 9.1
 Identities = 37/171 (21%), Positives = 60/171 (35%), Gaps = 14/171 (8%)

Query: 261 DNGDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIFNAASTLNETPEHSTRDGKLFS 320
            +G+   +  T S+L S  Y  T        V SG+ +  A S       ++T  G + +
Sbjct: 101 YSGNSIVKGGTLSVLGSLRYVVT--------VTSGATLGGAGSLGGLIAGNTT-GGAVVN 151

Query: 321 NEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSN-NGGVYSTNSFMAATT 379
            E        +     S  +P       A GN + V  NG  +    G            
Sbjct: 152 GEVRLT--YGSTVAVESLQSPVVNPAAIALGNVDTVTQNGAENVEQVGQAYDILQADGGG 209

Query: 380 HGYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQAS 430
            G++ ++N  T+T P+   +  +V SS   S   F   S+        Q S
Sbjct: 210 TGFTKVANAATST-PNLLAAGAAVQSSGSGS-VLFELASTAAAQAAFQQLS 258


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 30.6 bits (69), Expect = 9.5
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 1223 STNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFK 1265
            + +  +KK  +    ERKKK  +K ++ E +  K + K+ K K
Sbjct: 4    TESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
            biogenesis, outer membrane].
          Length = 284

 Score = 30.0 bits (68), Expect = 9.8
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 1239 RKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNN 1298
            +K+ A  +Q   E ES +  NK++K    E+L+     +      + V + S   D  + 
Sbjct: 79   KKELAELEQLLEEVESLEEENKRLK----ELLDFKESSSDYDPIAARVISRS--PDPWSQ 132

Query: 1299 PSSILKKSK 1307
               I K S 
Sbjct: 133  TIVIDKGSN 141


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.126    0.363 

Gapped
Lambda     K      H
   0.267   0.0829    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 79,035,503
Number of extensions: 7461121
Number of successful extensions: 7488
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7038
Number of HSP's successfully gapped: 378
Length of query: 1662
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1552
Effective length of database: 6,058,662
Effective search space: 9403043424
Effective search space used: 9403043424
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 66 (29.0 bits)