RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4582
         (133 letters)



>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 37.5 bits (88), Expect = 9e-04
 Identities = 25/87 (28%), Positives = 44/87 (50%)

Query: 28  EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
           E IK       +++I +  NE + + +E    L++ E  L Q +  L R  E ++K + E
Sbjct: 52  EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111

Query: 88  LERSISFLKEKETELDEILAVLTEKTE 114
           LE+    L++K+ EL++    L E  E
Sbjct: 112 LEKKEKELEQKQQELEKKEEELEELIE 138



 Score = 35.9 bits (84), Expect = 0.003
 Identities = 22/81 (27%), Positives = 45/81 (55%), Gaps = 4/81 (4%)

Query: 34  LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 93
           L++  E ++R +    ++Q  E+   L Q E  L++    L +  E ++K++ ELE+   
Sbjct: 69  LRNEFEKELRER--RNELQKLEKR--LLQKEENLDRKLELLEKREEELEKKEKELEQKQQ 124

Query: 94  FLKEKETELDEILAVLTEKTE 114
            L++KE EL+E++    ++ E
Sbjct: 125 ELEKKEEELEELIEEQLQELE 145


>gnl|CDD|235360 PRK05183, hscA, chaperone protein HscA; Provisional.
          Length = 616

 Score = 34.8 bits (81), Expect = 0.007
 Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 6/46 (13%)

Query: 19  PSQGGTITDEHI----KMSLKSAVEDKIRRKMNEQKVQAQEEIEIL 60
           PS G  +TD+ I    K S+  A ED   R + EQKV+A+  +E L
Sbjct: 500 PSYG--LTDDEIARMLKDSMSHAEEDMQARALAEQKVEAERVLEAL 543


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 34.5 bits (80), Expect = 0.010
 Identities = 19/63 (30%), Positives = 29/63 (46%)

Query: 52  QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 111
             Q+E+  LKQ      + K +   + E   +E   LE   + L+EK+ EL+  L  L E
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205

Query: 112 KTE 114
           K  
Sbjct: 206 KAA 208


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 32.4 bits (74), Expect = 0.050
 Identities = 19/69 (27%), Positives = 39/69 (56%)

Query: 44  RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 103
            ++ E+  + +E+++ L++   EL +   +L    E ++ EK ELE  +  L+E++ EL+
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890

Query: 104 EILAVLTEK 112
           E L  L  +
Sbjct: 891 EELRELESE 899



 Score = 29.3 bits (66), Expect = 0.54
 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 12/99 (12%)

Query: 28  EHIKMSLKSAVEDKIR-----RKMNEQKVQAQEEIEILKQTENELN-------QGKFKLA 75
           + ++  L+   E+  R      ++ E+  +A++EIE LK    EL        +   +L 
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294

Query: 76  RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 114
              E ++ E + L   +  L+ +  EL+E L  L EK E
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIE 333



 Score = 27.8 bits (62), Expect = 2.2
 Identities = 15/70 (21%), Positives = 35/70 (50%)

Query: 45  KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 104
           ++  +K + ++E++ L++ + EL +   +L      + +E  +L   +  L+ K   L+ 
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926

Query: 105 ILAVLTEKTE 114
            L  L E+ E
Sbjct: 927 ELPELEEELE 936



 Score = 27.4 bits (61), Expect = 2.8
 Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 3/72 (4%)

Query: 41  KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
           +   ++  +    +E +E L+    EL +   +L  + E+I+  K ELE   + L+E E 
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEE---RLEELKEKIEALKEELEERETLLEELEQ 351

Query: 101 ELDEILAVLTEK 112
            L E+     E 
Sbjct: 352 LLAELEEAKEEL 363



 Score = 26.6 bits (59), Expect = 4.6
 Identities = 22/72 (30%), Positives = 31/72 (43%)

Query: 43  RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 102
             ++ E+  + QEE+  LK+   EL      L    E ++ E  ELE  +  LKEK   L
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEAL 335

Query: 103 DEILAVLTEKTE 114
            E L       E
Sbjct: 336 KEELEERETLLE 347



 Score = 26.6 bits (59), Expect = 5.0
 Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 4/83 (4%)

Query: 33  SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
            L  A+ +++     E   + + E+E LK+    L +   +L+   E + +E  ELE   
Sbjct: 376 ELFEALREELAELEAELA-EIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE- 433

Query: 93  SFLKEKETELDEILAVLTEKTEV 115
             L+E +TEL+E+   L E  E 
Sbjct: 434 --LEELQTELEELNEELEELEEQ 454



 Score = 25.8 bits (57), Expect = 8.9
 Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 2/89 (2%)

Query: 28  EHIKMSLKSAVED--KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 85
           E     LK  +E+  + R+ + E+  + +EE+E  ++  + L +    L +  ER+++E 
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 86  AELERSISFLKEKETELDEILAVLTEKTE 114
            ELE  I  L+EK  EL+E L  L ++ E
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELE 859


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 32.4 bits (74), Expect = 0.054
 Identities = 21/77 (27%), Positives = 40/77 (51%)

Query: 28  EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
           E    +     ++ + R+ N+Q+ +A+ E E L++ E  L Q + +L    E++D  + +
Sbjct: 47  EEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQ 106

Query: 88  LERSISFLKEKETELDE 104
           LE     L  +E EL+E
Sbjct: 107 LEEREKALSARELELEE 123



 Score = 27.0 bits (60), Expect = 3.3
 Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 2/90 (2%)

Query: 45  KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 104
           K++  + Q +E  + L   E EL + + +L     R+     E  R +  LK  + EL+E
Sbjct: 99  KLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL-LLKLLDAELEE 157

Query: 105 ILAVLTEKTEVDVDEAVTTTA-PIYKQIVN 133
             A   +K E + D      A  I  Q + 
Sbjct: 158 EKAQRVKKIEEEADLEAERKAQNILAQAMQ 187


>gnl|CDD|233673 TIGR01991, HscA, Fe-S protein assembly chaperone HscA.  The Heat
           Shock Cognate proteins HscA and HscB act together as
           chaperones. HscA resembles DnaK but belongs in a
           separate clade. The apparent function is to aid assembly
           of iron-sulfur cluster proteins. Homologs from Buchnera
           and Wolbachia are clearly in the same clade but are
           highly derived and score lower than some examples of
           DnaK [Protein fate, Protein folding and stabilization].
          Length = 599

 Score = 31.9 bits (73), Expect = 0.075
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 30  IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 62
           +K S K A ED   R + EQKV+A+  +E L+ 
Sbjct: 497 LKDSFKHAEEDMYARALAEQKVEAERILEALQA 529


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 31.8 bits (72), Expect = 0.086
 Identities = 22/120 (18%), Positives = 51/120 (42%), Gaps = 14/120 (11%)

Query: 15  GGSSPSQGGTITDEH------IKMSLKSAVEDKIRRKMNEQK----VQAQEEIEILKQTE 64
                 Q  T +D+        +    +A++++ +    + +     Q ++ + +++  E
Sbjct: 82  TVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAE 141

Query: 65  NE---LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT-EKTEVDVDEA 120
                LNQ + +     E+I  EK  L+  I+ L+ + +E D  + +   EK  V++ E 
Sbjct: 142 KNILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEE 201


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.4 bits (72), Expect = 0.12
 Identities = 21/75 (28%), Positives = 36/75 (48%)

Query: 38  VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 97
            ++++ +   E + + +E    L++ E  L Q +  L R  E +DK++  LE+    L  
Sbjct: 56  AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115

Query: 98  KETELDEILAVLTEK 112
           KE  LDE    L E 
Sbjct: 116 KEKNLDEKEEELEEL 130



 Score = 28.7 bits (65), Expect = 0.92
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 4/91 (4%)

Query: 28  EHIKMSLKSAVEDKIRRKMNEQK---VQAQEEIEI-LKQTENELNQGKFKLARMFERIDK 83
           + I    K   E   +  + E K    + + E+E  LK+  NEL + + +L +  E +D+
Sbjct: 35  KRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDR 94

Query: 84  EKAELERSISFLKEKETELDEILAVLTEKTE 114
           +   L++    L++KE EL      L EK E
Sbjct: 95  KMESLDKKEENLEKKEKELSNKEKNLDEKEE 125



 Score = 26.8 bits (60), Expect = 4.1
 Identities = 21/75 (28%), Positives = 38/75 (50%), Gaps = 11/75 (14%)

Query: 39  EDKIRR---KMNEQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELER 90
           E K RR   +  E+++  +EE      E L + E  L + + +L+   + +D+++ ELE 
Sbjct: 70  ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEE 129

Query: 91  SISFLKEKETELDEI 105
            I    E+  EL+ I
Sbjct: 130 LI---AEQREELERI 141


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 30.2 bits (69), Expect = 0.15
 Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 42  IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 101
           +RR   E     ++E+  L+   +E  Q   KL    E + + K E+E     L++ E  
Sbjct: 26  LRRLEGE-LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQR 84

Query: 102 LDEILAVLTEKTE 114
            +  L +L EK+E
Sbjct: 85  YETTLELLGEKSE 97


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 30.9 bits (70), Expect = 0.17
 Identities = 26/96 (27%), Positives = 46/96 (47%), Gaps = 7/96 (7%)

Query: 27  DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM----FERI- 81
           D  ++   +   E K+ R+  + ++   E I  LKQ E ++ Q K   AR      +RI 
Sbjct: 328 DRELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIA 387

Query: 82  --DKEKAELERSISFLKEKETELDEILAVLTEKTEV 115
               EK+E E +  +LK + +E +E    L E+ ++
Sbjct: 388 LAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKL 423


>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
          Length = 438

 Score = 30.7 bits (70), Expect = 0.18
 Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)

Query: 33  SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
            +K ++ +K R+       + +EEIE L+     L+        M+E I   K +LE  I
Sbjct: 79  PIKQSLREKRRKASKNTIKRKREEIEKLEDDIKNLDVDDE----MYEEI---KTDLELKI 131

Query: 93  S------FLKEK---ETELDEILAVLTEKTEV 115
                  FL      +  L+  LA L E  E+
Sbjct: 132 DKLSFQLFLANSNNLKRLLESALARLGENVEI 163


>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein.  This protein has been
           predicted to be related to the FEN-1 endonuclease.
          Length = 426

 Score = 30.8 bits (70), Expect = 0.18
 Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 16/94 (17%)

Query: 31  KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
            + +K ++ +K +  +     +  EEIE L+     L+        M+E I   K +LE 
Sbjct: 76  SIDIKESLREKRKNSLKNTSKRKTEEIENLEDAIKNLDVEDE----MYEEI---KTDLEL 128

Query: 91  SIS------FLKEK---ETELDEILAVLTEKTEV 115
            I       FL  K   +  L+EIL+ L E   +
Sbjct: 129 RIDKLSFHNFLANKSNLKITLEEILSSLKEDVTI 162


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 30.8 bits (70), Expect = 0.20
 Identities = 20/87 (22%), Positives = 41/87 (47%)

Query: 39  EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
            DK   +  ++    +E++E LK+  NEL +   +L    +R+ +E A+L  +I+ ++ K
Sbjct: 376 VDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435

Query: 99  ETELDEILAVLTEKTEVDVDEAVTTTA 125
             EL+E       + +    +     A
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAA 462



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 18/70 (25%), Positives = 39/70 (55%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
           +K++R++NE K +     E L++   EL      +A +  +I++ + E E     +K++E
Sbjct: 395 EKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQE 454

Query: 100 TELDEILAVL 109
            +L+++ A L
Sbjct: 455 WKLEQLAADL 464



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 8/87 (9%)

Query: 41  KIRRKMNEQKVQA--------QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
           K + K  E++++         +EE+E L+    +L      L +  + ++ +  ELER I
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI 905

Query: 93  SFLKEKETELDEILAVLTEKTEVDVDE 119
             L+ +  +  + L+ L  K E   +E
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALEEE 932



 Score = 25.8 bits (57), Expect = 8.2
 Identities = 14/72 (19%), Positives = 37/72 (51%)

Query: 41  KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
            + R + E++ + ++  E L + E E+++   ++  +   I++E+   ++      E + 
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364

Query: 101 ELDEILAVLTEK 112
           EL+++ A L E 
Sbjct: 365 ELEDLRAELEEV 376



 Score = 25.8 bits (57), Expect = 8.2
 Identities = 14/76 (18%), Positives = 38/76 (50%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
            +I  +++E   +  +    + + E E+ Q + +  ++ ER+++ + +L      ++  +
Sbjct: 698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK 757

Query: 100 TELDEILAVLTEKTEV 115
           +EL E+ A + E  E 
Sbjct: 758 SELKELEARIEELEED 773



 Score = 25.8 bits (57), Expect = 8.6
 Identities = 21/84 (25%), Positives = 45/84 (53%)

Query: 31  KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
           +++L+    +K  +++ EQ++  +E+I+ +++    LN  K +L    E ++    +LE 
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882

Query: 91  SISFLKEKETELDEILAVLTEKTE 114
            +  LK++  EL+  L  L  K E
Sbjct: 883 RLGDLKKERDELEAQLRELERKIE 906



 Score = 25.8 bits (57), Expect = 9.9
 Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 33  SLKSAVEDKIRR--KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKA 86
           SL+ ++ +K R      E+  + + EI+ L     + E E+ + + +  ++ E   + K 
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364

Query: 87  ELERSISFLKEKETELDE 104
           ELE   + L+E + E  E
Sbjct: 365 ELEDLRAELEEVDKEFAE 382


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 30.8 bits (70), Expect = 0.21
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 2/88 (2%)

Query: 33  SLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
           SL S V D +R     +       EE+E L+     L +      +    +++   +LE 
Sbjct: 61  SLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEE 120

Query: 91  SISFLKEKETELDEILAVLTEKTEVDVD 118
            +S L E+  +L+++L  L     +D D
Sbjct: 121 RLSELDEELEDLEDLLEELEPLAYLDFD 148


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 30.6 bits (69), Expect = 0.26
 Identities = 19/74 (25%), Positives = 33/74 (44%)

Query: 31  KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
           +  +   + ++ +    E K +    IE L   E EL Q    L ++ +  +K K ELE+
Sbjct: 491 RYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQ 550

Query: 91  SISFLKEKETELDE 104
            +  LKE+E     
Sbjct: 551 EMEELKERERNKKL 564


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 30.4 bits (69), Expect = 0.29
 Identities = 22/73 (30%), Positives = 35/73 (47%)

Query: 43  RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 102
             ++ E+  + Q+E+  L    + L Q K  L      ++++  ELE  +  L+ K  EL
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335

Query: 103 DEILAVLTEKTEV 115
            E LA L EK E 
Sbjct: 336 AEELAELEEKLEE 348



 Score = 26.6 bits (59), Expect = 5.1
 Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 6/85 (7%)

Query: 34  LKSAVEDKIRRKMN--EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 91
           L+  +E          E+  +A++ ++  ++   +L      L R+ E ++         
Sbjct: 452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG----FSEG 507

Query: 92  ISFLKEKETELDEILAVLTEKTEVD 116
           +  L + ++ L  IL VL+E   VD
Sbjct: 508 VKALLKNQSGLSGILGVLSELISVD 532



 Score = 26.2 bits (58), Expect = 6.4
 Identities = 21/66 (31%), Positives = 32/66 (48%)

Query: 47  NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 106
            EQ  +  E+IE L     EL +   +L    E +  E+A LE +++ L+ +  EL E L
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

Query: 107 AVLTEK 112
             L  K
Sbjct: 904 RELESK 909


>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
          Length = 491

 Score = 30.0 bits (68), Expect = 0.36
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 78  FERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
           +E+ID  + E  R+    +EK T ++E+L    EK
Sbjct: 457 WEKIDSAEKEAGRARGKPREKITSIEEMLKAANEK 491


>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
          Length = 364

 Score = 29.6 bits (68), Expect = 0.37
 Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
           +++ +K      +A+ E+E+L++    L +G  K AR  E +  E+ +L +S+  L  K
Sbjct: 145 ERLEKKAKGGDKEAKAELELLEKLLEHLEEG--KPARTLE-LTDEEEKLLKSLQLLTAK 200


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.9 bits (68), Expect = 0.38
 Identities = 19/107 (17%), Positives = 48/107 (44%), Gaps = 7/107 (6%)

Query: 20  SQGGTITDEHIKMSLKSAVE-----DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 74
           S  GT T+  + +      E     ++  +++ E+  + ++ ++ L   +    Q   + 
Sbjct: 315 SPAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLP--KKARGQLPPEK 372

Query: 75  ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 121
               E++ + K +L   +  L+E+  EL E L  L  + ++ V++ +
Sbjct: 373 REQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.0 bits (68), Expect = 0.39
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 31   KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE------NELNQGKFKLARMFERIDKE 84
             M + S  ++K+  K+N +  + ++E+E LK T        +L++ +  L    E  +KE
Sbjct: 1093 SMPIWSLTKEKVE-KLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKE 1151

Query: 85   KAELERSISFLKEKETELDE 104
             A+ +R  S  K K ++L +
Sbjct: 1152 IAKEQRLKSKTKGKASKLRK 1171


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 29.5 bits (67), Expect = 0.49
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 53  AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
            QE+ ++LK+  + L      L ++ ER+ +E  ELE+ +  LK+K     E+  +L+  
Sbjct: 710 LQEQEDLLKEIASLLKVPPELLPKV-ERLLEELKELEKELERLKKKL-AAAELADLLSNA 767

Query: 113 TEVDV 117
              ++
Sbjct: 768 KAEEI 772


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 29.7 bits (67), Expect = 0.49
 Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 1/64 (1%)

Query: 53  AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
             E+ + LK+  + L     +L +  ER  +E   L++ +  LK K  +  +I  +  + 
Sbjct: 703 LNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQ-KIDELKDKA 761

Query: 113 TEVD 116
             ++
Sbjct: 762 ETIN 765


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 29.5 bits (67), Expect = 0.53
 Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 13/96 (13%)

Query: 36  SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQ--------GKFKLARMFERIDKEKAE 87
           +   DK+  +++E++ +   E+E L+   NEL++        G+     +   + + K +
Sbjct: 25  ALDVDKLL-ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83

Query: 88  LERSISFLKEKETELDEILAVL----TEKTEVDVDE 119
           L+   + L E E ELD +L  +     E   V  DE
Sbjct: 84  LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDE 119


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 29.2 bits (66), Expect = 0.59
 Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQ-GKFKLARMFERIDKEKAELERSISFLKEK 98
           + I+ K+ ++    +EEI  L+Q  +ELN     +L +  + +     ++      L+E 
Sbjct: 174 NSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEEL 233

Query: 99  ETELDEILAVLTEKTE 114
           + EL E+   +   T 
Sbjct: 234 QQELQELTIAIEALTN 249


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.0 bits (66), Expect = 0.84
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 48  EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF-LKEKETELDEIL 106
           E + +A+E   +LK+ E    + + K  ++ E  DK   E E+     +KE + E DEI+
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590

Query: 107 AVLTE 111
             L +
Sbjct: 591 KELRQ 595



 Score = 27.1 bits (61), Expect = 3.4
 Identities = 12/81 (14%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 35  KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF 94
           K  ++++  + + E + +AQ+ I+  K+  +E+      +  + +      A ++     
Sbjct: 557 KEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEI------IKELRQLQKGGYASVKAHE-- 608

Query: 95  LKEKETELDEILAVLTEKTEV 115
           L E    L++      +K + 
Sbjct: 609 LIEARKRLNKANEKKEKKKKK 629


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.4 bits (64), Expect = 1.2
 Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 3/79 (3%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
           DK+R  + +     +E+ ++  ++  EL +   +     E+I+KE  ELE  IS L+ + 
Sbjct: 60  DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEE---ELEKIEKEIKELEEEISELENEI 116

Query: 100 TELDEILAVLTEKTEVDVD 118
            EL++ +  L      D+D
Sbjct: 117 KELEQEIERLEPWGNFDLD 135


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 15/81 (18%), Positives = 35/81 (43%), Gaps = 6/81 (7%)

Query: 38  VEDKIRRKMNEQKVQAQEEIEILKQTENELNQ---GKFKLARMFERIDKEKAELERSISF 94
           +E K+ +K+++  +  ++   +       L +    +   + +    ++ KAELE     
Sbjct: 384 IEPKVEKKLDKLGLSIED---VFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKR 440

Query: 95  LKEKETELDEILAVLTEKTEV 115
           L+E+  ++D  L    E  E 
Sbjct: 441 LREEAAKIDPSLEGALEANEA 461


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 28.2 bits (64), Expect = 1.4
 Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 95
           +++ +K      +A+EE+E+L++ +  L +G  K AR  E  D+E   L +S+  L
Sbjct: 141 ERLEKKAKSGDKEAKEELELLEKIKEHLEEG--KPARTLELTDEEIKIL-KSLQLL 193


>gnl|CDD|224307 COG1389, COG1389, DNA topoisomerase VI, subunit B [DNA replication,
           recombination, and repair].
          Length = 538

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 1/66 (1%)

Query: 58  EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDV 117
           EI  +    L +   KL     R  +E  E ++  +  K    E+ + LA + EK E D+
Sbjct: 442 EIENEIRLALMEVARKLKLYLSRKRREMEERKKRKTIEKYLP-EIAKKLAEILEKPEEDI 500

Query: 118 DEAVTT 123
            E +  
Sbjct: 501 REKLAK 506


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 28.1 bits (62), Expect = 1.6
 Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 35  KSAVEDK---IRRKMNEQKVQAQEEIEILKQTENE-------LNQGKFKLARMFERIDKE 84
           K A++DK   I +++ E+  + Q  IE LK+  NE       +N+   ++ + F+     
Sbjct: 106 KKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKIPKDF 165

Query: 85  KAELERSISFLKEKETELDEILAVLTEKTEVDVDE 119
           K +LE     L +K  ++DE   V + +    + E
Sbjct: 166 KEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAE 200


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 54  QEEIEILKQTENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
           +EE+  LKQ E+EL       L R  E++ K   E+   +  L+E E EL E+ + + + 
Sbjct: 185 EEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDL 244

Query: 113 TE 114
           T 
Sbjct: 245 TN 246


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 28.1 bits (63), Expect = 1.7
 Identities = 21/84 (25%), Positives = 38/84 (45%), Gaps = 1/84 (1%)

Query: 30  IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER-IDKEKAEL 88
           ++ +  +A  DK  R++ E + +A+  I  L++   E+ +   +     E  I   +  L
Sbjct: 492 VEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEAL 551

Query: 89  ERSISFLKEKETELDEILAVLTEK 112
           E     +K K  EL E+   L EK
Sbjct: 552 EGEKEEIKAKIEELQEVTQKLAEK 575


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 27.6 bits (61), Expect = 2.1
 Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 15/91 (16%)

Query: 39  EDKIRRKMNEQK-----VQAQ----EEIEILKQTENELNQGKFKLARMFERIDKEKAELE 89
           E+K R+K  E+       QA     E  E LKQ + +L     KL    +  ++E   LE
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560

Query: 90  RSISFLK-----EKETE-LDEILAVLTEKTE 114
           +    L+     EKETE L   L  + +K  
Sbjct: 561 KEAQELRKYQESEKETEVLMSALQAMQDKNL 591


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 27.5 bits (62), Expect = 2.3
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 7/67 (10%)

Query: 58  EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI---LAVLTEKTE 114
           EI K+    L +   KL +   R  KE+   +++ +F K     + EI   LA + EK E
Sbjct: 439 EIEKEIRLALQEVARKLKKYLSRKRKEEEAKKKAKTFEK----YIPEIARKLAEILEKEE 494

Query: 115 VDVDEAV 121
            D++E +
Sbjct: 495 PDIEEVL 501


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 27.6 bits (62), Expect = 2.4
 Identities = 24/87 (27%), Positives = 43/87 (49%), Gaps = 11/87 (12%)

Query: 27  DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 86
           +  +K + + A EDK R++  E K +A+E +      E  L +   KL       + +K 
Sbjct: 507 ERMVKDAEEYAAEDKKRKERIEAKNEAEEYVY---SLEKSLKEEGDKLP------EADKK 557

Query: 87  ELERSISFLKE--KETELDEILAVLTE 111
           ++E +I +LKE  +  + +EI A   E
Sbjct: 558 KVEEAIEWLKEELEGEDKEEIEAKTEE 584


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 27.5 bits (61), Expect = 2.6
 Identities = 11/47 (23%), Positives = 25/47 (53%)

Query: 61  KQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 107
           +  E  L++   +L  + E++  +  E++R    L++KE+ + E L 
Sbjct: 442 RMDEGILDERLGRLQALSEKLRTQHEEIKRCREALQKKESIVMETLE 488


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 27.3 bits (61), Expect = 2.8
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 37  AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 96
           A EDK R++  E +  A     +  Q E  L +   KL        +EK ++E++++ LK
Sbjct: 514 AEEDKKRKEEIEARNNAD---SLAYQAEKTLKEAGDKLP------AEEKEKIEKAVAELK 564

Query: 97  E--KETELDEILAVLTEKTEV 115
           E  K  +++EI A   E  + 
Sbjct: 565 EALKGEDVEEIKAKTEELQQA 585


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 27.3 bits (61), Expect = 2.8
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 8/68 (11%)

Query: 47  NEQKVQAQEEIEILKQTENELN--------QGKFKLARMFERIDKEKAELERSISFLKEK 98
           ++++ +   EIE L+   NEL+        Q K K+  + + + + K EL    + LK  
Sbjct: 36  DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95

Query: 99  ETELDEIL 106
           E EL + L
Sbjct: 96  EAELQDKL 103


>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
           Members of this protein family resemble the widely
           distributed ATP-dependent protease La, also called Lon
           and LonA. It resembles even more closely LonB, which is
           a LonA paralog found in genomes if and only if the
           species is capable of endospore formation (as in
           Bacillus subtilis, Clostridium tetani, and select other
           members of the Firmicutes) and expressed specifically in
           the forespore compartment. Members of this family are
           restricted to a subset of spore-forming species, and are
           very likely to participate in the program of endospore
           formation. We propose the designation LonC [Protein
           fate, Degradation of proteins, peptides, and
           glycopeptides, Cellular processes, Sporulation and
           germination].
          Length = 615

 Score = 27.5 bits (61), Expect = 2.8
 Identities = 20/80 (25%), Positives = 42/80 (52%), Gaps = 5/80 (6%)

Query: 25  ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTENELNQGKFKLARMFERID 82
           I D   + ++++ +E K+  +M E++ +  EEI  ++LK+ +   N    K     E+++
Sbjct: 77  IADILARRTVENRIERKVETRMQERQNKYLEEIRLQVLKEEKGPENSSTLKKLERLEKLE 136

Query: 83  KEKAELERSI-SFLKEKETE 101
           K+K  L +S  S L+ +   
Sbjct: 137 KKK--LHKSAQSLLRPRAFS 154


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 27.0 bits (60), Expect = 3.2
 Identities = 19/94 (20%), Positives = 38/94 (40%), Gaps = 2/94 (2%)

Query: 23  GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 82
           G    E  +  L    E ++     E+++  ++E   L++   EL +   +L      + 
Sbjct: 464 GQELPEEHEKELLELYELELEEL--EEELSREKEEAELREEIEELEKELRELEEELIELL 521

Query: 83  KEKAELERSISFLKEKETELDEILAVLTEKTEVD 116
           + +  L+  +    EK   L E L  L EK ++ 
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 27.0 bits (60), Expect = 3.2
 Identities = 14/62 (22%), Positives = 27/62 (43%)

Query: 41  KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
              +++   + +   E   L    +E    + KLA++ E   K  A+L   +S  ++K  
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLE 227

Query: 101 EL 102
           EL
Sbjct: 228 EL 229


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 27.1 bits (61), Expect = 3.4
 Identities = 14/83 (16%), Positives = 37/83 (44%), Gaps = 12/83 (14%)

Query: 33  SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
           +L S + +  + K    + + ++E + +++              ++ +++ +    E  +
Sbjct: 264 TLVSIINEHRKLKREAAEEKEEKEEKEIRKLY------------LYSKLEGKLPTEEEFL 311

Query: 93  SFLKEKETELDEILAVLTEKTEV 115
            +++EK  EL E    L E+ EV
Sbjct: 312 EYIEEKNPELLETAEELIEEEEV 334


>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
          Length = 238

 Score = 27.0 bits (60), Expect = 3.4
 Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 9/66 (13%)

Query: 47  NEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKETEL 102
             +  Q ++E+E LKQ   ELN    ++A  F+    R  +E+ +L      L+ K   L
Sbjct: 66  AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLR-----LQLKCNTL 120

Query: 103 DEILAV 108
            EIL V
Sbjct: 121 SEILPV 126


>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin.  Hemagglutinin from
           influenza virus causes membrane fusion of the viral
           membrane with the host membrane. Fusion occurs after the
           host cell internalises the virus by endocytosis. The
           drop of pH causes release of a hydrophobic fusion
           peptide and a large conformational change leading to
           membrane fusion.
          Length = 550

 Score = 26.9 bits (60), Expect = 4.2
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 54  QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
           Q+ I+   Q   +LN    K+   FE I+KE +ELER I+ L +K
Sbjct: 370 QKAID---QITGKLNSLIEKMNTQFEAIEKEFSELERRINNLNKK 411


>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24.  All proteins in this
           family for which functions are known are involved in DNA
           damage tolerance (likely cell cycle checkpoints).This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 637

 Score = 26.8 bits (59), Expect = 4.5
 Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 3/84 (3%)

Query: 18  SPSQGGTITDEHIKMSLKS-AVEDKIRRKMNEQKVQAQEEIEILKQTENEL-NQGKFKLA 75
           S S+ G++  +  +MS KS A   K + K         +EI+ L   +  L         
Sbjct: 327 SSSKSGSLPIKK-RMSTKSDAHASKSKIKGKHSSNNENQEIQALGGKDVSLFLFRALGKI 385

Query: 76  RMFERIDKEKAELERSISFLKEKE 99
              +R    + +  R  S L E  
Sbjct: 386 LYCKRATLNELDSPRLPSHLSELS 409


>gnl|CDD|223258 COG0180, TrpS, Tryptophanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 314

 Score = 26.4 bits (59), Expect = 5.0
 Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 2/60 (3%)

Query: 54  QEEIEILKQTEN-ELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
              +EI  +    EL  G+ K   + E I +    ++     L+E    LD+IL    EK
Sbjct: 256 DSILEIEAEYRGGELGCGECK-KELAEAIQEFLKPIQERREELREDPAYLDDILRKGAEK 314


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 26.7 bits (59), Expect = 5.1
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 8/85 (9%)

Query: 20  SQGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEIL-KQTENELNQGKFKLARMF 78
           S GG ++DE I+  +K A E K + +  ++ V A+ E E L    E +L+  K K++   
Sbjct: 537 SSGG-LSDEEIEKMVKEAEEYKEQDEKKKELVDAKNEAETLIYSVEKQLSDLKDKIS--- 592

Query: 79  ERIDKEKAELERSISFLKEKETELD 103
              D +K EL++ I+ L+   +  D
Sbjct: 593 ---DADKDELKQKITKLRSTLSSED 614


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 26.5 bits (59), Expect = 5.1
 Identities = 17/73 (23%), Positives = 34/73 (46%), Gaps = 11/73 (15%)

Query: 30  IKMSLKSAVEDKIRRK-----MNEQKVQAQEEIEILKQTENELNQGKFKL------ARMF 78
           ++  +   VE+K+ +      + EQ    ++E+ I K  ++EL + K KL        + 
Sbjct: 207 LQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVK 266

Query: 79  ERIDKEKAELERS 91
           + I+KE  +L   
Sbjct: 267 KVIEKELTKLSLL 279


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 26.2 bits (59), Expect = 5.7
 Identities = 21/69 (30%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 45  KMNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDKEKAELERSISFLKEKETEL 102
           +++E++ + Q E+E L+   N L++  G+ K     E  +   AE++     +K  E EL
Sbjct: 32  ELDEERRELQTELEELQAERNALSKEIGQAKRKG--EDAEALIAEVKELKEEIKALEAEL 89

Query: 103 DEILAVLTE 111
           DE+ A L E
Sbjct: 90  DELEAELEE 98


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 26.1 bits (58), Expect = 5.7
 Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 7/77 (9%)

Query: 31  KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE---LNQGKFKL--ARMFERIDKEK 85
           K+S      +   ++++E +    EE   LK+   E     + + +     + E   +  
Sbjct: 291 KLSGLKNSGEPSLKEIHEAR--LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 86  AELERSISFLKEKETEL 102
           ++LE     L++ E  L
Sbjct: 349 SKLEEIQKKLEDLEKRL 365


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 26.0 bits (57), Expect = 5.7
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 15/83 (18%)

Query: 40  DKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKEKAEL 88
           DK   ++ E+K+Q  EE+             +KQ+++EL +   K+ARM      E   L
Sbjct: 38  DKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSELERRMGKIARMHG----EYNPL 93

Query: 89  ERSISFLKEKETELDEILAVLTE 111
           + SI+ +++ E  L+E+  +  E
Sbjct: 94  KESINEMRKLELGLEELPQLHEE 116


>gnl|CDD|181856 PRK09436, thrA, bifunctional aspartokinase I/homoserine
           dehydrogenase I; Provisional.
          Length = 819

 Score = 26.3 bits (59), Expect = 5.9
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 8/78 (10%)

Query: 23  GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 82
             +T+  + M  K+A  D    ++ + +    E ++ L       +  + K      ++D
Sbjct: 41  AKVTNHLVAMIEKAAKGDDAYPEILDAERIFHELLDGLAAALPGFDLAQLK-----AKVD 95

Query: 83  KEKAELE---RSISFLKE 97
           +E A+L+     IS L E
Sbjct: 96  QEFAQLKDILHGISLLGE 113


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 26.5 bits (59), Expect = 6.0
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 9/77 (11%)

Query: 34  LKSAVEDKIRRKMNE-----QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 88
           LK    DK  R + E     +K++A+   E+   + +EL     K     + I +  A++
Sbjct: 5   LKKLFGDKSERDLKEIQPIVEKIKAEYP-ELEALSNDELRA---KTDEFKQYIKEAVADI 60

Query: 89  ERSISFLKEKETELDEI 105
           +  I  LK +  E  +I
Sbjct: 61  DAKIEELKAEAIESLDI 77


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 26.2 bits (58), Expect = 6.0
 Identities = 20/61 (32%), Positives = 30/61 (49%)

Query: 52  QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 111
           Q Q E++ L++  N+LN+   KL    E   +  AE +     + EKE E+ E  A L  
Sbjct: 44  QRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDA 103

Query: 112 K 112
           K
Sbjct: 104 K 104


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 26.1 bits (58), Expect = 6.4
 Identities = 16/90 (17%), Positives = 39/90 (43%)

Query: 25  ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 84
           I D+  ++       D    ++ E   +  E+ + L + +N+++  K  L  + ++  K 
Sbjct: 304 IKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKV 363

Query: 85  KAELERSISFLKEKETELDEILAVLTEKTE 114
           KA +E   +   +   EL ++   L +  +
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVK 393


>gnl|CDD|185474 PTZ00141, PTZ00141, elongation factor 1- alpha; Provisional.
          Length = 446

 Score = 26.2 bits (58), Expect = 6.6
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 54 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 93
          +  IE  ++   E+ +G FK A +   +DK KAE ER I+
Sbjct: 36 KRTIEKFEKEAAEMGKGSFKYAWV---LDKLKAERERGIT 72


>gnl|CDD|217337 pfam03050, DDE_Tnp_IS66, Transposase IS66 family.  Transposase
           proteins are necessary for efficient DNA transposition.
           This family includes IS66 from Agrobacterium
           tumefaciens.
          Length = 277

 Score = 26.0 bits (58), Expect = 6.7
 Identities = 11/75 (14%), Positives = 24/75 (32%), Gaps = 3/75 (4%)

Query: 29  HIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 88
            I+   +    ++      E      + +E   + +      K KL +    +      L
Sbjct: 177 AIEREARGLPPEERLALRQEYSRPLLDALEAWLEAQLPGVLPKSKLGKAIRYLLNRWDAL 236

Query: 89  ERSISFLKEKETELD 103
            R   FL++    +D
Sbjct: 237 TR---FLEDGRVPID 248


>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases.  This family
           consists of known phosphorylases, and homologs believed
           to share the function of using inorganic phosphate to
           cleave an alpha 1,4 linkage between the terminal glucose
           residue and the rest of the polymer (maltodextrin,
           glycogen, etc.). The name of the glucose storage polymer
           substrate, and therefore the name of this enzyme,
           depends on the chain lengths and branching patterns. A
           number of the members of this family have been shown to
           operate on small maltodextrins, as may be obtained by
           utilization of exogenous sources. This family represents
           a distinct clade from the related family modeled by
           TIGR02093/pfam00343.
          Length = 601

 Score = 26.1 bits (58), Expect = 6.7
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 25  ITDEHI---KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL--ARMF 78
           I DE +    + LK+ + D IRR++ E+ ++   +  IL  T+  L+     +  AR F
Sbjct: 340 IPDEELWEVHLKLKARLIDYIRRRLRERWLRRGADAAILMATDRFLDPDVLTIGFARRF 398


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 26.1 bits (58), Expect = 7.2
 Identities = 25/90 (27%), Positives = 48/90 (53%), Gaps = 13/90 (14%)

Query: 30  IKMSLKSAVE--DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
           I+M + S  E  D++ R++    +Q + E E LK+ ++E ++ +       E ++KE AE
Sbjct: 396 IRMEIDSKPEELDELDRRI----IQLEIEREALKKEKDEASKERL------EDLEKELAE 445

Query: 88  LERSISFLKEK-ETELDEILAVLTEKTEVD 116
           LE   + L+E+ + E   I  +   K E++
Sbjct: 446 LEEEYADLEEQWKAEKAAIQGIQQIKEEIE 475


>gnl|CDD|177362 PHA02415, PHA02415, DNA primase domain-containing protein.
          Length = 930

 Score = 26.1 bits (57), Expect = 7.2
 Identities = 20/84 (23%), Positives = 34/84 (40%), Gaps = 10/84 (11%)

Query: 8   WTLDQEVGGSSPSQGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENEL 67
           W    E  G  PS+GG   D   +M+   A          E     +  + I++   +E+
Sbjct: 770 WRYLCEFAGMDPSEGGFPHDLLAEMNAHVA----------ETSADREPWVWIMETVLSEI 819

Query: 68  NQGKFKLARMFERIDKEKAELERS 91
           + G +K    F+ +D E   L R+
Sbjct: 820 DGGNYKHPYTFDTVDGEFCLLLRT 843


>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
           alpha subunit.  The enzyme acetyl-CoA carboxylase
           contains a biotin carboxyl carrier protein or domain, a
           biotin carboxylase, and a carboxyl transferase. This
           model represents the alpha chain of the carboxyl
           transferase for cases in which the architecture of the
           protein is as in E. coli, in which the
           carboxyltransferase portion consists of two
           non-identical subnits, alpha and beta [Fatty acid and
           phospholipid metabolism, Biosynthesis].
          Length = 316

 Score = 25.9 bits (57), Expect = 7.4
 Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)

Query: 40  DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
           + +R +  ++ V   EEIE L++   EL +  F     ++R      +L R      ++ 
Sbjct: 20  ESLRARSRDEDVDLSEEIERLEKRSVELTKKIFSNLGAWQR-----LQLAR----HPDRP 70

Query: 100 TELDEILAVLTEKTEVDVDEA 120
             LD I  +  +  E+  D A
Sbjct: 71  YTLDYIELIFDDFFELAGDRA 91


>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1025

 Score = 25.9 bits (57), Expect = 8.0
 Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 30  IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 89
           I+ SL S  +D +R K  E K + +  +E ++Q + +L +         E  +    EL+
Sbjct: 30  IQASLASLSDDALREKGMELKSRVRGALEPIEQKKKDLEKKLDNPDISLEEAESINEELD 89

Query: 90  R-SISFLKEKETELDEIL 106
             +  + +     L+EIL
Sbjct: 90  TLAKEYEEATAAALEEIL 107


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 25.6 bits (56), Expect = 9.0
 Identities = 21/99 (21%), Positives = 48/99 (48%), Gaps = 5/99 (5%)

Query: 24  TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 83
           T+      M LK   E+ ++ K+ E + + +E ++ L++ E E  + + +L    +R++ 
Sbjct: 123 TLQRVPEYMDLKEDYEE-LKEKLEELQKEKEELLKELEELEAEYEEVQERL----KRLEV 177

Query: 84  EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVT 122
           E + LE  +  L  +  +L +    L    E+  +E ++
Sbjct: 178 ENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216


>gnl|CDD|165320 PHA03022, PHA03022, hypothetical protein; Provisional.
          Length = 335

 Score = 25.7 bits (56), Expect = 10.0
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 32 MSLKSAVEDKIRRKMNEQKVQAQEEI 57
          +SLK+++ DKI  K  +  +  +EEI
Sbjct: 70 LSLKNSIADKIILKFKDNGIMTEEEI 95


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.127    0.325 

Gapped
Lambda     K      H
   0.267   0.0761    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,587,981
Number of extensions: 595750
Number of successful extensions: 1603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1443
Number of HSP's successfully gapped: 490
Length of query: 133
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 47
Effective length of database: 7,123,158
Effective search space: 334788426
Effective search space used: 334788426
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.1 bits)