RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15864
         (553 letters)



>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family.  This
           model represent the sugar porter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 481

 Score =  180 bits (458), Expect = 1e-50
 Identities = 94/341 (27%), Positives = 160/341 (46%), Gaps = 15/341 (4%)

Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFG-------TVSQLFLGSFLHWRSAAILNLLF 153
            LV  Y++EI    LRG L++   +   FG          ++ L + L WR    L L+ 
Sbjct: 146 ALVPMYLSEIAPKALRGALTSLYQLAITFGILVAYGFGSGKVSLNNTLGWRIPLGLQLIP 205

Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDK-VQTELSQITKAIEESEL 212
             L    L+F+PESP WL+ +GR++EA  SL  LRG    DK +  EL  I       E 
Sbjct: 206 AGLLFLGLFFLPESPRWLVGKGRVEEARKSLARLRGTSGEDKELLDELELIDIK-RSIEK 264

Query: 213 KRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESI-HAPLD 271
           + +               R       +   L +  QF G+  +  Y+  IFE+   +   
Sbjct: 265 RSVQPSWGSL----FSSTRRIRRRLFLGVVLQWFQQFTGINAIMYYSPTIFENAGVSTDH 320

Query: 272 PYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVY-AQFHLSYGWD 330
            +  ++++G        + + L+   G+RPL LI   G A C  V+ +  A F       
Sbjct: 321 AFLVSIIVGAVNFAFTFVAIFLVDRFGRRPLLLIGAAGMAICLFVLGILGASFVTGSSKS 380

Query: 331 SPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYY 390
           S  V  VF++L      +    +PW+++ E+FP ++R      + +++++  F V  L+ 
Sbjct: 381 SGNVAIVFILLFIAFFAMGWGPVPWVIVSEIFPLSLRPKGISIAVAANWLANFIVGFLFP 440

Query: 391 PMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDI 431
            ML++  + G   F+  ++V+G +++YF +PET+GRTL +I
Sbjct: 441 TMLESIGVGGVFIFFGGLNVLGLIFVYFFLPETKGRTLEEI 481



 Score = 31.9 bits (73), Expect = 0.85
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 483 VVGTLYMYFVMPETEGRTLRDI 504
           V+G +++YF +PET+GRTL +I
Sbjct: 460 VLGLIFVYFFLPETKGRTLEEI 481


>gnl|CDD|215702 pfam00083, Sugar_tr, Sugar (and other) transporter. 
          Length = 449

 Score =  164 bits (416), Expect = 8e-45
 Identities = 91/343 (26%), Positives = 159/343 (46%), Gaps = 21/343 (6%)

Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFGT-VSQLF------LGSFLHWRSAAILNLLF 153
           VLV  Y++EI    LRG L +   +   FG  V+ +         +   WR    L  + 
Sbjct: 120 VLVPMYISEIAPKKLRGALGSLYQLGITFGILVAAIIGLGLNKYSNSDGWRIPLGLQFVP 179

Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
            IL L  L F+PESP WL+ +G+++EA A L  LRG    D+         +  EE +  
Sbjct: 180 AILLLIGLLFLPESPRWLVLKGKLEEARAVLAKLRGVSDVDQ---------EIQEEKDSL 230

Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
               + ++     ++  +T      +   L    Q  G+  +  Y+  IFE++    D  
Sbjct: 231 ERSVEAEKASWLELFRGKTVRQRLLMGVMLQIFQQLTGINAIFYYSPTIFETLG-LSDSL 289

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAV-YAQFHLSYGWDSP 332
             T+++GV       + + L+   G+RPL L+   G A CF+V+ V       S G    
Sbjct: 290 LVTIIVGVVNFVFTFIAIFLVDRFGRRPLLLLGAAGMAICFLVLGVALLGVAKSKGAG-- 347

Query: 333 LVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPM 392
           +V  VF++L      +    +PW+++ E+FP  +R  A   + +++++  F +  L+  +
Sbjct: 348 IVAIVFILLFIAFFALGWGPVPWVIVSELFPLGVRPKAMAIATAANWLANFLIGFLFPII 407

Query: 393 LDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHF 435
                       +A + V+  L+++F +PET+GRTL +I+E F
Sbjct: 408 TGAIG-GYVFLVFAGLLVLFILFVFFFVPETKGRTLEEIDELF 449



 Score = 31.9 bits (73), Expect = 0.94
 Identities = 13/26 (50%), Positives = 21/26 (80%)

Query: 483 VVGTLYMYFVMPETEGRTLRDIEEHF 508
           V+  L+++F +PET+GRTL +I+E F
Sbjct: 424 VLFILFVFFFVPETKGRTLEEIDELF 449



 Score = 29.6 bits (67), Expect = 4.9
 Identities = 31/209 (14%), Positives = 59/209 (28%), Gaps = 54/209 (25%)

Query: 293 LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD------SPLVPTVFLVLAAFLT 346
            +   G    + + +G   + F V  +          D      S L+  V  V+ A L 
Sbjct: 34  ALTSIGACAASTVLSGLIVSIFSVGCLIGSLFAGKLGDRFGRKKSLLIGNVLFVIGALLQ 93

Query: 347 HICIRLLPWMLI---------------------GEVFPNNIRATASGASGSS-------S 378
                   +MLI                      E+ P  +R                 +
Sbjct: 94  GFAKGKSFYMLIVGRVIVGLGVGGISVLVPMYISEIAPKKLRGALGSLYQLGITFGILVA 153

Query: 379 YIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHFADK 438
            I    +NK      D + +   L      +++  + + F+ PE+               
Sbjct: 154 AIIGLGLNKYSNS--DGWRI--PLGLQFVPAILLLIGLLFL-PESP-------------- 194

Query: 439 GKTFVTNIRRAEKKRHAARVEGQEEVKGE 467
            +  V   +  E +   A++ G  +V  E
Sbjct: 195 -RWLVLKGKLEEARAVLAKLRGVSDVDQE 222


>gnl|CDD|182225 PRK10077, xylE, D-xylose transporter XylE; Provisional.
          Length = 479

 Score =  127 bits (321), Expect = 1e-31
 Identities = 95/366 (25%), Positives = 167/366 (45%), Gaps = 63/366 (17%)

Query: 106 YVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLH------------WR----SAAIL 149
           Y+AEI   H+RG L +      IFG +   F+  F+             WR    S AI 
Sbjct: 144 YIAEIAPAHIRGKLVSFNQFAIIFGQLVVYFVNYFIARSGDASWLNTDGWRYMFASEAIP 203

Query: 150 NLLFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEE 209
            LLF +L    LYF+PE+P +L+S+G+ ++A   L  + G          L +I  +++ 
Sbjct: 204 ALLFLML----LYFVPETPRYLMSRGKQEQAEGILRKIMG---NTLATQALQEIKHSLDH 256

Query: 210 SELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAP 269
                           ++ M    ++   ++ S+F   QF G+  +  YA  IF+++ A 
Sbjct: 257 GRKT----------GGKLLMFGVGVIVIGVMLSVF--QQFVGINVVLYYAPEIFKTLGAS 304

Query: 270 LD-PYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSA--ACFIVVAVYAQFHLS 326
            D     T+++GV  L   +L ++ +   G++PL +I   G A     +  A Y Q    
Sbjct: 305 TDIALLQTIIVGVINLTFTVLAIMTVDKFGRKPLQIIGALGMAIGMFSLGTAFYTQA--- 361

Query: 327 YGWDSPLVPTVFLVL--AAFLTH---ICIRLLPWMLIGEVFPNNIRATASGASGSSSYIF 381
                 +V  + ++   AAF      +C     W+L+ E+FPN IR  A   + ++ +I 
Sbjct: 362 ----PGIVALLSMLFYVAAFAMSWGPVC-----WVLLSEIFPNAIRGKALAIAVAAQWIA 412

Query: 382 AFAVNKLYYPMLD-------TFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEH 434
            + V+  + PM+D        FH   + + Y  + V+  L+M+  +PET+G+TL ++E  
Sbjct: 413 NYFVSWTF-PMMDKNSWLVAHFHNGFSYWIYGCMGVLAALFMWKFVPETKGKTLEEMEAL 471

Query: 435 FADKGK 440
           +  + K
Sbjct: 472 WEPETK 477



 Score = 29.3 bits (66), Expect = 5.6
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 483 VVGTLYMYFVMPETEGRTLRDIEEHFADKGK 513
           V+  L+M+  +PET+G+TL ++E  +  + K
Sbjct: 447 VLAALFMWKFVPETKGKTLEEMEALWEPETK 477


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 89.7 bits (223), Expect = 2e-19
 Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 13/191 (6%)

Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIH--APLDPYFATLLLGVAELGGALLC 290
           FLL   ++ +L F     G   L TY     + +   +  +      L G+  + GALL 
Sbjct: 172 FLLRLLLLLALAFFLLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLG 231

Query: 291 VVLIHYTGKRP-LALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHIC 349
            +L    G+R  L LI    +A   +++A+     L        V  + L          
Sbjct: 232 GLLSDRLGRRRLLLLIGLLLAALGLLLLALAPSLALLL------VALLLLGFGLGFAFPA 285

Query: 350 IRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAIS 409
           +      L  E+ P   R TASG   +   +       L   +LDT    G     AA++
Sbjct: 286 L----LTLASELAPPEARGTASGLFNTFGSLGGALGPLLAGLLLDTGGYGGVFLILAALA 341

Query: 410 VVGTLYMYFVM 420
           ++  L +  + 
Sbjct: 342 LLAALLLLLLP 352



 Score = 55.8 bits (135), Expect = 3e-08
 Identities = 35/189 (18%), Positives = 64/189 (33%), Gaps = 21/189 (11%)

Query: 239 IVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVV---LIH 295
           ++  L F         L      + E +   L    A L++    LG AL  ++   L  
Sbjct: 2   LLLFLGFFLSGLDRGLLSPALPLLAEDLG--LSASQAGLIVSAFSLGYALGSLLAGYLSD 59

Query: 296 YTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHI---CIRL 352
             G+R + L+     A   +++A    F  S            L++  FL  +    +  
Sbjct: 60  RFGRRRVLLLGLLLFALGSLLLA----FASSLWL---------LLVGRFLLGLGGGALYP 106

Query: 353 LPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
               LI E FP   R  A G   +   + A     L   + ++          A + ++ 
Sbjct: 107 AAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLL 166

Query: 413 TLYMYFVMP 421
            L + F++ 
Sbjct: 167 ALLLLFLLR 175



 Score = 45.0 bits (107), Expect = 6e-05
 Identities = 45/281 (16%), Positives = 73/281 (25%), Gaps = 55/281 (19%)

Query: 96  MAAPLVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLH----WRSAAILNL 151
             A        +AE   P  RG      S     G +    LG  L     WR   ++  
Sbjct: 101 GGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILA 160

Query: 152 LFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESE 211
           +  +L    L F+      L+       A A      G+     + T L    + +    
Sbjct: 161 ILGLLLALLLLFLL--RLLLLL------ALAFFLLSFGYYG---LLTYLPLYLQEVLGLS 209

Query: 212 LKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGM---------------TTLQ 256
               G                 L  + +   L  +   GG+                 L 
Sbjct: 210 AAEAG---------------LLLSLFGLGGILGAL--LGGLLSDRLGRRRLLLLIGLLLA 252

Query: 257 TYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLI------HYTGKRPLALISTGGS 310
              + +     +      A LLLG          + L          G     L +T GS
Sbjct: 253 ALGLLLLALAPSLALLLVALLLLGFGLGFAFPALLTLASELAPPEARGTA-SGLFNTFGS 311

Query: 311 AACFIVVAVYAQFHLSYGWDSP-LVPTVFLVLAAFLTHICI 350
               +   +      + G+    L+     +LAA L  +  
Sbjct: 312 LGGALGPLLAGLLLDTGGYGGVFLILAALALLAALLLLLLP 352



 Score = 35.0 bits (81), Expect = 0.083
 Identities = 38/197 (19%), Positives = 62/197 (31%), Gaps = 31/197 (15%)

Query: 230 RRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
           RR  LL   ++ +L  +      ++L    VG F              LLG+        
Sbjct: 64  RRVLLLG-LLLFALGSLL-LAFASSLWLLLVGRF--------------LLGLGGGALYPA 107

Query: 290 CVVLI-HYTGK----RPLALISTGGSAACFIVVAVYAQFHLSYGWDSP-LVPTVFLVLAA 343
              LI  +       R L L S G      +   +      S GW    L+  +  +L A
Sbjct: 108 AAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLA 167

Query: 344 FLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLY 403
            L    +RLL  +L    F       + G  G  +Y+  +    L     +   L     
Sbjct: 168 LLLLFLLRLL-LLLALAFF-----LLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLS--- 218

Query: 404 FYAAISVVGTLYMYFVM 420
            +    ++G L    + 
Sbjct: 219 LFGLGGILGALLGGLLS 235


>gnl|CDD|233176 TIGR00898, 2A0119, cation transport protein.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 505

 Score = 76.6 bits (189), Expect = 8e-15
 Identities = 80/332 (24%), Positives = 123/332 (37%), Gaps = 64/332 (19%)

Query: 124 SMTTIFGTVSQLF--LGSFL---------HWRSAAILNLL--FPILALCALY-FIPESPH 169
               I GT+ Q+F  LG  L          WR    L L    P      L  F+PESP 
Sbjct: 212 KQRAIVGTLIQVFFSLGLVLLPLVAYFIPDWR---WLQLAVSLPTFLFFLLSWFVPESPR 268

Query: 170 WLISQGRMQEASASLCWLRGWVTPDKVQ-TELSQITKAIEESELKRLG--KDGQRRPNYR 226
           WLISQGR++EA   L   R      K    E+  ++   + S  K+     D  R PN R
Sbjct: 269 WLISQGRIEEALKIL--QRIAKINGKKLPAEVLSLSLEKDLSSSKKQYSFLDLFRTPNLR 326

Query: 227 MYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG 286
              + T  L     T+ F    +G +  L      +  +I      Y    + G+ EL  
Sbjct: 327 ---KTTLCLMMLWFTTAF--SYYGLVLDLG----NLGGNI------YLDLFISGLVELPA 371

Query: 287 ALLCVVLIHYTGKR-PLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLA-AF 344
            L+ ++LI   G+R  +A        A  +++ V     +   +    + T   VL    
Sbjct: 372 KLITLLLIDRLGRRYTMAASLLLAGVALLLLLFV----PVDLYF----LRTALAVLGKFG 423

Query: 345 LT--HICIRLLPWMLIGEVFPNNIRATA----SGASGSSSYIFAFAVNKLYYPMLDTFHL 398
           +T     +    ++   E++P  +R       S  +   S I  F V      +     L
Sbjct: 424 ITSAFQMV----YLYTAELYPTVVRNLGVGVCSTMARVGSIISPFLVYLGEKWLFLPLVL 479

Query: 399 WGTLYFYAAISVVGTLYMYFVMPETEGRTLRD 430
           +G L   A I  +        +PET+G  L +
Sbjct: 480 FGGLALLAGILTLF-------LPETKGVPLPE 504


>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily. 
          Length = 346

 Score = 66.3 bits (162), Expect = 9e-12
 Identities = 57/295 (19%), Positives = 90/295 (30%), Gaps = 63/295 (21%)

Query: 96  MAAPLVLVLTYVAEITQPHLRGM----LSATASMTTIFGTVSQLFLGSFLHWRSAAILNL 151
             A        +A+   P  RG     LSA   +    G +    L S   WR+A ++  
Sbjct: 99  GGALFPAAAALIADWFPPEERGRALGLLSAGFGLGAALGPLLGGLLASLFGWRAAFLILA 158

Query: 152 LFPILALC---ALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIE 208
           +  +LA      L   P              A     W                      
Sbjct: 159 ILALLAAVLAALLLPRPPPESKRPKPAEEAPAPLVPAWK--------------------- 197

Query: 209 ESELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHA 268
                               + R  +L   +   LF    F  +T L  Y     E +  
Sbjct: 198 -------------------LLLRDPVLWLLLALLLFGFAFFALLTYLPLYQ----EVLG- 233

Query: 269 PLDPYFATLLLGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHL 325
            L    A LLLG+A L GA+  ++   L    G+R   L++        + +A+ +    
Sbjct: 234 -LSALLAGLLLGLAGLLGAIGRLLLGRLSDRLGRRRRLLLALLLLILAALGLALLSLTES 292

Query: 326 SYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYI 380
           S      LV  + L   A L    +      L+ ++ P   R TASG   ++  +
Sbjct: 293 SLWL---LVALLLLGFGAGLVFPAL----NALVSDLAPKEERGTASGLYNTAGSL 340



 Score = 38.6 bits (90), Expect = 0.007
 Identities = 33/212 (15%), Positives = 60/212 (28%), Gaps = 29/212 (13%)

Query: 240 VTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG---GALLCVVLIHY 296
           +    F+   G         + + E +   + P    LLL    LG      L   L   
Sbjct: 1   LFLAAFLAGLGRSLLGPALPLYLAEDLG--ISPTEIGLLLTAFSLGYALAQPLAGRLSDR 58

Query: 297 TGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLL--P 354
            G+R + LI     A   +++   +                 L++   L  +    L   
Sbjct: 59  FGRRRVLLIGLLLFALGLLLLLFASSL-------------WLLLVLRVLQGLGGGALFPA 105

Query: 355 WM-LIGEVFPNNIRATASG---ASGSSSYIFAFAVNKLYYPMLDTFHLW-GTLYFYAAIS 409
              LI + FP   R  A G   A           +      +L +   W       A ++
Sbjct: 106 AAALIADWFPPEERGRALGLLSAGFGLGAALGPLLG----GLLASLFGWRAAFLILAILA 161

Query: 410 VVGTLYMYFVMPETEGRTLRDIEEHFADKGKT 441
           ++  +    ++P     + R      A     
Sbjct: 162 LLAAVLAALLLPRPPPESKRPKPAEEAPAPLV 193


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 58.8 bits (143), Expect = 3e-09
 Identities = 18/49 (36%), Positives = 28/49 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           KK  K  KK   K+A+KKEK+KEK K +K+ +  K   K++  +  +  
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 56.9 bits (138), Expect = 1e-08
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K   +K+  K  KK   K  +K++K+KEK K KK+ +  K 
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 54.2 bits (131), Expect = 1e-07
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
            P+     K  KK   K+ +K +K+++K + KK+ +  K   K++K    +  
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 53.8 bits (130), Expect = 1e-07
 Identities = 17/41 (41%), Positives = 24/41 (58%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           K   +KK  K  KK   K  +KKEK+K+K K KK+ ++  N
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 49.2 bits (118), Expect = 4e-06
 Identities = 19/43 (44%), Positives = 25/43 (58%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           K K   EKK  K  KK   K+ EKK+K+K+K K KK + +  N
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 41.1 bits (97), Expect = 0.001
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E + K K   +K+  K  KK   K+ +KK+K+K+K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEK 409



 Score = 30.7 bits (70), Expect = 2.1
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K +   E + K K   ++K  K  KK   K+ +KK+
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKE 404


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 55.5 bits (134), Expect = 7e-09
 Identities = 23/41 (56%), Positives = 32/41 (78%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ K+K+KKK+ KK KK++K K++K  E K  KKKKKKKKK
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 55.1 bits (133), Expect = 1e-08
 Identities = 23/51 (45%), Positives = 35/51 (68%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           T       + ++ +KKEKKKK++  K+K++KK+KK+K  + K  KKKKKKK
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190



 Score = 54.7 bits (132), Expect = 1e-08
 Identities = 24/41 (58%), Positives = 32/41 (78%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ KK++KKKKE K +KKEKK +K+K  + K  KKKKKKKK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 54.7 bits (132), Expect = 1e-08
 Identities = 24/39 (61%), Positives = 30/39 (76%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K+ KKK++ KKEKK KK +K+K  + K  KKKKKKKKKK
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 52.8 bits (127), Expect = 5e-08
 Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 5/65 (7%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK-----KKKKKKK 56
            S ++      T     + +  ++++K+KK+KK  KKEKK++K KKEK       KKKKK
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188

Query: 57  KKKKK 61
           KKKKK
Sbjct: 189 KKKKK 193



 Score = 50.5 bits (121), Expect = 3e-07
 Identities = 19/44 (43%), Positives = 31/44 (70%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           + +   K +KE + +++EKK++KKKKE KK+KK+KK KK+    
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179



 Score = 47.4 bits (113), Expect = 4e-06
 Identities = 18/43 (41%), Positives = 29/43 (67%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K+   K +++   ++E+KKEKKKK++ KK+KK+KK KK     
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179



 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
                     +  +KE   K +K  + E++++K+KK+KK+ KK+KK+KK 
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172



 Score = 34.3 bits (79), Expect = 0.10
 Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
               +   + +  + +   K +++ + E+++KK+KKKKK+  
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 52.4 bits (126), Expect = 3e-08
 Identities = 27/75 (36%), Positives = 44/75 (58%), Gaps = 4/75 (5%)

Query: 19  DWKKAKKKE--KKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
            WKK  + E  ++K+E+K +K E+K  KK  K++KKK+KKKKKKK K  N       + +
Sbjct: 58  KWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKS 117

Query: 75  NNNNNNNNNNNNEEK 89
           +  +++      E+K
Sbjct: 118 SEESSDEEEEGEEDK 132



 Score = 49.7 bits (119), Expect = 3e-07
 Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++ K+K+++K  KK  K++KKK+KKKK KKK KK  KK++K  + ++  +++       +
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKK-KKKAKKGNKKEEKEGSKSSEESSDEEEEGEED 131

Query: 81  NNNNNNEEKG 90
                 E   
Sbjct: 132 KQEEPVEIME 141



 Score = 48.1 bits (115), Expect = 1e-06
 Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 24  KKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           K++EKK+K E+K  KK  K++KKK++KKKKKK KK  KK     + ++  +++       
Sbjct: 71  KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEE 130

Query: 83  NNNNEE 88
           +   E 
Sbjct: 131 DKQEEP 136



 Score = 46.6 bits (111), Expect = 3e-06
 Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 3/69 (4%)

Query: 25  KKEKKKKEKKAKKKEK-KKEKKKKEKKKKK--KKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
           KKE + +E + K++EK +K+++K  KK+ K  KKK+KKKK       N       + ++ 
Sbjct: 60  KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119

Query: 82  NNNNNEEKG 90
            +++ EE+G
Sbjct: 120 ESSDEEEEG 128



 Score = 46.6 bits (111), Expect = 3e-06
 Identities = 19/66 (28%), Positives = 35/66 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +K ++K  KK+ K+ KKK+KKK+KKK +K  KK++K+  K +  +++       +     
Sbjct: 77  RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEP 136

Query: 81  NNNNNN 86
                 
Sbjct: 137 VEIMEK 142


>gnl|CDD|233175 TIGR00895, 2A0115, benzoate transport.  [Transport and binding
           proteins, Carbohydrates, organic alcohols, and acids].
          Length = 398

 Score = 54.7 bits (132), Expect = 7e-08
 Identities = 57/281 (20%), Positives = 97/281 (34%), Gaps = 44/281 (15%)

Query: 105 TYVAEITQPHLRGMLSATASMTTIF--GTVSQLFLGSFL----HWRSAAILNLLFP-ILA 157
             V+E      RG  +A   M   +  G     FL  +L     WRS   +  + P +L 
Sbjct: 128 ALVSEYAPKRFRG--TAVGLMFCGYPIGAAVGGFLAGWLIPVFGWRSLFYVGGIAPLLLL 185

Query: 158 LCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLGK 217
           L  + F+PES  +L+S+                  P+ V+  ++ I   ++      L +
Sbjct: 186 LLLMRFLPESIDFLVSKR-----------------PETVRRIVNAIAPQMQAEAQSALPE 228

Query: 218 DGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGI---FESIHAPLDPYF 274
               +   R   +  F    A +T L ++  F  M  +  Y +        +        
Sbjct: 229 QKATQGTKRSVFKALFQGKTARITVLLWLLYF--MLLVGVYFLTNWLPKLMVELGFSLSL 286

Query: 275 ATLLLGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDS 331
           A     +   GG +  ++   L    G R  AL+   G+      V V +         S
Sbjct: 287 AATGGALFNFGGVIGSIIFGWLADRLGPRVTALLLLLGAV---FAVLVGSTLF------S 337

Query: 332 PLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASG 372
           P +  +   +A F  +     L   L+   +P  IRAT  G
Sbjct: 338 PTLLLLLGAIAGFFVNGGQSGLY-ALMALFYPTAIRATGVG 377


>gnl|CDD|225371 COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport
           and metabolism].
          Length = 394

 Score = 52.6 bits (127), Expect = 3e-07
 Identities = 58/345 (16%), Positives = 102/345 (29%), Gaps = 88/345 (25%)

Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQL-------FLGSFLHWRSAAILNLLF 153
            +     A +  P  RG   A A + T   T++ +       FLG    WR+       F
Sbjct: 120 SIAAALAARLVPPGKRG--RALALVFTGL-TLATVLGVPLGTFLGQLFGWRAT------F 170

Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
             +A+ AL                     +L  L   + P       S+I+ ++      
Sbjct: 171 LAIAVLAL--------------------LALLLLWKLLPP-------SEISGSLPGPLRT 203

Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
            L             +RR  +L   + T LF  G F   T ++ +   +           
Sbjct: 204 LLRL-----------LRRPGVLLGLLATFLFMTGHFALYTYIRPFLESVAG-----FSVS 247

Query: 274 FATLLL---GVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD 330
             +L+L   G+A   G LL   L    G R   + +    A   + +             
Sbjct: 248 AVSLVLLAFGIAGFIGNLLGGRLADR-GPRRALIAALLLLALALLALTFTGA-------- 298

Query: 331 SPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASG-------ASGSSSYIFAF 383
           SP +    L L  F     ++ L    +  + P+      S             + +   
Sbjct: 299 SPALALALLFLWGFAFSPALQGL-QTRLARLAPDAADLAGSLNVAAFNLGIALGAALGG- 356

Query: 384 AVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTL 428
                   +LD      T +   A  ++  L +  +    + RT 
Sbjct: 357 -------LVLDALGYAATGWV-GAALLLLALLLALLSARKDRRTA 393


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 50.8 bits (122), Expect = 4e-07
 Identities = 22/42 (52%), Positives = 32/42 (76%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           WK+ K+K +KKK ++ KK+E+  +K+K +KK KKKKK KKK 
Sbjct: 162 WKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203



 Score = 41.9 bits (99), Expect = 3e-04
 Identities = 17/39 (43%), Positives = 32/39 (82%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK  K+K+++K K K++ KE+K+K +KKK +++KK+++N
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREEN 183



 Score = 41.1 bits (97), Expect = 5e-04
 Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 11/53 (20%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEK---------KKKEK--KKKKKKKKKKKKN 62
           KKA K+++K+K+K  K+ +++KEK         KK+E+  KK+K  KK KKK 
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKK 197



 Score = 41.1 bits (97), Expect = 5e-04
 Identities = 17/55 (30%), Positives = 34/55 (61%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
          +++ +K++++KA+KK+K+KE KKKE  +K + ++ K + N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          +KA+KK+K+K+ KK +  +K + ++ K ++ K KKK    +N   N
Sbjct: 18 RKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 14/55 (25%), Positives = 31/55 (56%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          ++++ K+ ++K +KK+K+K+ KKK+  +K + ++  N  N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 18/49 (36%), Positives = 29/49 (59%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
          ++ +K  KK+K K+AKKKE  ++ + +E K ++ K KKK     N   N
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 38.8 bits (91), Expect = 0.003
 Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 7/51 (13%)

Query: 18  LDWKKAKKKEKKKKEKKAKKK-EKKKEK------KKKEKKKKKKKKKKKKK 61
           LD  KA + E+K+K  KA  K E  K K      KK  K+K+K+KKK KK+
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKE 161



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 13/55 (23%), Positives = 32/55 (58%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           +++ K+++ K  KK+++K+ KKK++ +K + ++ K ++N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 38.1 bits (89), Expect = 0.005
 Identities = 15/55 (27%), Positives = 29/55 (52%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
          E++ +K+E++K +KK+++K+ KKK+  +K       N  N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 37.7 bits (88), Expect = 0.007
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 16/60 (26%)

Query: 20  WKKAKKK---------EK------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           W KA  K         EK      K+KEK+ KKK KK+ K++KEK +KKK +++KK+  N
Sbjct: 125 WTKALAKAEGVKVKDDEKLLKKALKRKEKQ-KKKSKKEWKERKEKVEKKKAERQKKREEN 183



 Score = 35.4 bits (82), Expect = 0.047
 Identities = 12/55 (21%), Positives = 31/55 (56%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
          E+++++++ +K  KK+++K+ +KK+  +K + ++  N  N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 34.6 bits (80), Expect = 0.073
 Identities = 14/55 (25%), Positives = 28/55 (50%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
          ++++ K++++K  KK+K K+ KKK+  +K +     N  N +        N   N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63



 Score = 34.2 bits (79), Expect = 0.11
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           ++ KK+E+  K++K  KK KKK+K KK+ +  
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 33.4 bits (77), Expect = 0.18
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 2/52 (3%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKK 61
           F +          KK+KKKK+   K+  KK E +KK  E+  + K  + ++K
Sbjct: 71  FADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEK 122



 Score = 33.4 bits (77), Expect = 0.20
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 9/50 (18%)

Query: 21  KKAKKKEKKKK---------EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K  KKK+KKK          E + KK E+  E K  E ++K+K  K   K
Sbjct: 82  KLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAK 131



 Score = 32.3 bits (74), Expect = 0.42
 Identities = 11/34 (32%), Positives = 23/34 (67%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +E   +++ +K+E++K  KK+K+K+ KKK+ 
Sbjct: 1  PSSREALLEQRRRKREQRKARKKQKRKEAKKKED 34



 Score = 31.9 bits (73), Expect = 0.51
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 25/68 (36%)

Query: 19  DWKKA-KKKEKKKKEKKAKKKEKKKEKKKKEK------------------------KKKK 53
           D+K+  KK E +KK+ +   ++K  E ++KEK                        K+K+
Sbjct: 93  DYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKE 152

Query: 54  KKKKKKKK 61
           K+KKK KK
Sbjct: 153 KQKKKSKK 160



 Score = 29.2 bits (66), Expect = 4.2
 Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 42/84 (50%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE-------------------------------- 48
          ++ +K+E++K  KK K+KE KK++  ++                                
Sbjct: 10 QRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKV 69

Query: 49 ----------KKKKKKKKKKKKKN 62
                      K KKKKKKKK +
Sbjct: 70 EFADGEQAKKDLKLKKKKKKKKTD 93


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
          family.  Members of this protein family occur in
          Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
          related Mycoplasmas in small paralogous families that
          may also include truncated forms and/or pseudogenes.
          Members are predicted lipoproteins with a conserved
          signal peptidase II processing and lipid attachment
          site. Note that the name for certain characterized
          members, p72, reflects an anomalous apparent molecular
          weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 52.5 bits (126), Expect = 4e-07
 Identities = 20/59 (33%), Positives = 30/59 (50%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
              +K+ EKK E K  E   K  KK   K+ + NNNN +NN N +  N ++ N  ++ 
Sbjct: 27 TSSNSKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKP 85



 Score = 42.9 bits (101), Expect = 3e-04
 Identities = 14/61 (22%), Positives = 28/61 (45%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
          K+ ++K   K  +   K  K+   K+  +    K+NN N +  N ++ N     + + N+
Sbjct: 32 KQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNK 91

Query: 88 E 88
          E
Sbjct: 92 E 92



 Score = 39.4 bits (92), Expect = 0.004
 Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 17/69 (24%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          +  KK E K  E   K  +K   K+                  + NNNN +NN N +  N
Sbjct: 33 QPEKKPEIKPNENTPKIPKKPDNKEP-----------------SENNNNKSNNENKDEEN 75

Query: 81 NNNNNNEEK 89
           ++ N E+K
Sbjct: 76 PSSTNPEKK 84



 Score = 39.4 bits (92), Expect = 0.005
 Identities = 17/72 (23%), Positives = 35/72 (48%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
            EP+  +  K+  + K ++   +   EKK +  K +++ +K K + KK +     +  NN
Sbjct: 56  KEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEIEKPKDEPKKPDKKPQADQPNN 115

Query: 73  NNNNNNNNNNNN 84
            + +  NNN  +
Sbjct: 116 VHADQPNNNKVD 127



 Score = 33.7 bits (77), Expect = 0.26
 Identities = 13/73 (17%), Positives = 28/73 (38%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           +P   +  +    +   + K  +       +KK +  K K++ +K K      +     +
Sbjct: 52  KPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEIEKPKDEPKKPDKKPQAD 111

Query: 74  NNNNNNNNNNNNN 86
             NN + +  NNN
Sbjct: 112 QPNNVHADQPNNN 124


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 48.4 bits (116), Expect = 6e-07
 Identities = 15/41 (36%), Positives = 28/41 (68%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK K+  KK++     ++E+ KE+ +KEK+ +K ++KK K+
Sbjct: 83  KKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKR 123



 Score = 45.3 bits (108), Expect = 7e-06
 Identities = 10/40 (25%), Positives = 26/40 (65%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           K++   +  ++  K+  +KEK+ ++ ++KK K+++K+K  
Sbjct: 91  KRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 43.4 bits (103), Expect = 3e-05
 Identities = 13/41 (31%), Positives = 28/41 (68%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK +   + ++E+  ++ EK+K  +K  +KK K+++K+K+K
Sbjct: 90  KKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 42.6 bits (101), Expect = 5e-05
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KH      E      KK K+K   KK + A +  +++ K++ EK+K+ +K ++KK
Sbjct: 65  KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120



 Score = 40.7 bits (96), Expect = 3e-04
 Identities = 11/42 (26%), Positives = 27/42 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
              +K++KKK+++  KK+      +++  K++ +K+K+ +KN
Sbjct: 75  LLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 12/40 (30%), Positives = 27/40 (67%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K KKK++K+  KK +   + + ++ KE+ +K+K+ +K ++
Sbjct: 79  KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNRE 118



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 14/48 (29%), Positives = 28/48 (58%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E   L  K+ KKK K+  +K+      ++E+ K+  +K+K+ +K ++K
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREK 119



 Score = 37.3 bits (87), Expect = 0.005
 Identities = 11/45 (24%), Positives = 29/45 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           +K   +EK+KK+K+ +  +K++   +  +++ K++ +K+K+   N
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 49.3 bits (118), Expect = 7e-07
 Identities = 19/42 (45%), Positives = 29/42 (69%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+ ++K+K K +KK  KK+K K+K KK+ KK  K +KK +K 
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117



 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 18/42 (42%), Positives = 28/42 (66%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K K K+KK K+KK K K+KK +KK  + +KK +K+ + K 
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 47.8 bits (114), Expect = 2e-06
 Identities = 18/47 (38%), Positives = 29/47 (61%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           WK  KKK KKKK+K   KK+ KK+ K ++K +K+ + K +    + +
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130



 Score = 47.4 bits (113), Expect = 3e-06
 Identities = 19/40 (47%), Positives = 27/40 (67%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +K K K KKKK KK K K+K K+  KK+ K +KK +K+ +
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 46.6 bits (111), Expect = 7e-06
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K   KK+K KK+K   K +K  +K  K +KK +K+ + K +
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 46.6 bits (111), Expect = 7e-06
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+  K +KKK +KK  K + KK+ KK +K +KK +K+ + K 
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 45.5 bits (108), Expect = 2e-05
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 13  TEPTPLDWKK---AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            E T    KK   A++ EK KKE + K+K K K+KK K+KK K K KK  KK++
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108



 Score = 44.7 bits (106), Expect = 3e-05
 Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKK-KKKKKKKKKKKN 62
            K+K K KK+K  KKK+K K+KK  +K  K +KK +K+ ++
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120



 Score = 43.9 bits (104), Expect = 5e-05
 Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 17  PLDWKKAK-KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            L  +  K KKE ++K+K   KK+K K+KK K+K KK  KK  K +  
Sbjct: 66  ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113



 Score = 43.5 bits (103), Expect = 8e-05
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K + +EK+K + K KK +KKK+K K +K  KK  K +KK 
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114



 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 15/42 (35%), Positives = 22/42 (52%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK  K + KK +KK  K EKK EK+ ++K +   K   +  
Sbjct: 92  KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133



 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK K K+K KK+ K   K +KK++K+ E K +   K   +  
Sbjct: 92  KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 21/49 (42%), Positives = 33/49 (67%)

Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          TP+   +  + +KKKKE   + ++ KKE ++K+K K KKKK KKKK+ +
Sbjct: 50 TPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 17/38 (44%), Positives = 25/38 (65%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          KK+KK   +E +K KK+ E+K+K K KKKK K   + +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98



 Score = 36.6 bits (85), Expect = 0.013
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K K K+  KK+ K++KK++K+ + K E   K   +    
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135



 Score = 34.7 bits (80), Expect = 0.066
 Identities = 13/43 (30%), Positives = 21/43 (48%), Gaps = 1/43 (2%)

Query: 20  WKKAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK K K+KK  +K  K +K+ +KE + K +   K   +    
Sbjct: 93  KKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135



 Score = 32.7 bits (75), Expect = 0.28
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KK  KK+ K ++K  K+ E K E   K   +      + K
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140



 Score = 32.0 bits (73), Expect = 0.44
 Identities = 19/39 (48%), Positives = 24/39 (61%), Gaps = 7/39 (17%)

Query: 35 AKKKEK----KKEKKKKE---KKKKKKKKKKKKKNNNNN 66
          AKKK+K    + EK KKE   K+K K KKKK KK  + +
Sbjct: 60 AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98



 Score = 32.0 bits (73), Expect = 0.45
 Identities = 9/40 (22%), Positives = 19/40 (47%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           D  K KK +KK  + + K +++ ++K +   K   +    
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135



 Score = 32.0 bits (73), Expect = 0.49
 Identities = 9/44 (20%), Positives = 20/44 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K  K ++K +KE + K ++  K   +      + K +K   + +
Sbjct: 106 KDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKD 149



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 9/43 (20%), Positives = 21/43 (48%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           D K  KK +K +K+ + + ++K ++  K   +      + K +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPR 142



 Score = 28.9 bits (65), Expect = 4.8
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K   K +KK EK+A+ K +   K   E      + K +K
Sbjct: 104 DKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 28.5 bits (64), Expect = 6.6
 Identities = 14/58 (24%), Positives = 32/58 (55%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           E    KK+KK+  ++ EK KK+ ++K+K K     +    + + +  ++  ++ +E+K
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 48.1 bits (115), Expect = 8e-07
 Identities = 20/38 (52%), Positives = 28/38 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           +K  K+  K+K K+ K+K+KKK+KKKK+  KK  KKKK
Sbjct: 98  QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 45.1 bits (107), Expect = 1e-05
 Identities = 21/40 (52%), Positives = 28/40 (70%), Gaps = 2/40 (5%)

Query: 25  KKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            +E+K  K+  K K K+KK++KKKK+KKKKK  KK  KK 
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 43.1 bits (102), Expect = 5e-05
 Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 2/40 (5%)

Query: 26  KEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            E++K  K+   +K ++KK+KKKK+KKKKKK  KK  K  
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 33.9 bits (78), Expect = 0.063
 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 2/35 (5%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            +E+K  K+  K+  K K+KK+KKKKKKKKKK  +
Sbjct: 95  VEEQKIVKQVLKE--KAKQKKQKKKKKKKKKKKTS 127


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 50.9 bits (122), Expect = 1e-06
 Identities = 21/64 (32%), Positives = 34/64 (53%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
          KK+KKKK+K +K+     +  KK        ++KK+ N     NN   N + +++  N+N
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68

Query: 85 NNEE 88
           NEE
Sbjct: 69 KNEE 72



 Score = 48.2 bits (115), Expect = 7e-06
 Identities = 18/64 (28%), Positives = 30/64 (46%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          KKK+KKKK+   +     +  KK        ++KK+       NN   N + +++  N+N
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68

Query: 84 NNNE 87
           N E
Sbjct: 69 KNEE 72



 Score = 46.7 bits (111), Expect = 2e-05
 Identities = 18/65 (27%), Positives = 33/65 (50%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
          K KKK+KKK  K++    +  +K        ++KK+  ++   NN   N + +++  N+N
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68

Query: 82 NNNNN 86
           N  N
Sbjct: 69 KNEEN 73



 Score = 46.7 bits (111), Expect = 2e-05
 Identities = 19/64 (29%), Positives = 31/64 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          KK KKK+K  KE     +  KK        ++KK+  ++   NN   N + +++  N+N 
Sbjct: 10 KKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSNK 69

Query: 81 NNNN 84
          N  N
Sbjct: 70 NEEN 73


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 48.7 bits (116), Expect = 1e-06
 Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 1/46 (2%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
              E TP D +  K  EKK K+KK  + +K+++KKKKEKKKKKK+ 
Sbjct: 125 PLPEETPSDSEGLKGHEKKHKKKK-HEDDKERKKKKKEKKKKKKRH 169



 Score = 41.7 bits (98), Expect = 3e-04
 Identities = 22/58 (37%), Positives = 28/58 (48%), Gaps = 12/58 (20%)

Query: 21  KKAKKKEKKKKEKKA------------KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           KK K K KK + +              K  EKK +KKK E  K++KKKKK+KK     
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 50.0 bits (120), Expect = 2e-06
 Identities = 22/36 (61%), Positives = 29/36 (80%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KEK  K  K K++EKK +K+KK+KK+KKK KK+KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 48.4 bits (116), Expect = 6e-06
 Identities = 20/38 (52%), Positives = 29/38 (76%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K  K  K+K+ +KK +K++KKKK KKK KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 48.0 bits (115), Expect = 7e-06
 Identities = 20/38 (52%), Positives = 29/38 (76%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           +K  K  KKK+E+K  +K KKK+K+KK+ KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 47.7 bits (114), Expect = 1e-05
 Identities = 18/39 (46%), Positives = 29/39 (74%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K+   K  KKK ++ K +++KK+KK+K+K KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 46.9 bits (112), Expect = 2e-05
 Identities = 20/43 (46%), Positives = 31/43 (72%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           ++  K+ ++ KEK  K  +KK+E+KK +K+KKKKK+KKK K  
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 46.1 bits (110), Expect = 3e-05
 Identities = 19/39 (48%), Positives = 30/39 (76%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K K  +  KK+++ KK +K+K+KKK++KK KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 45.7 bits (109), Expect = 4e-05
 Identities = 19/38 (50%), Positives = 29/38 (76%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+K  K  +KK ++K+ +K KKKK++KKK KK+KKK +
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 44.6 bits (106), Expect = 9e-05
 Identities = 18/35 (51%), Positives = 27/35 (77%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+K  K  KKK ++K+ +K++KKKK+KKK KK+K 
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 40.7 bits (96), Expect = 0.002
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK 49
           P P   K+ +KK +K+K+KK +KK+ KK KKK  K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 40.0 bits (94), Expect = 0.003
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+E  K+ ++ K+K  K  KKK+E+KK +K+KKKKK+
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKR 401



 Score = 34.9 bits (81), Expect = 0.10
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
           + +P     ++ K +++KKK+K+ KK +K+K+K +K
Sbjct: 379 YPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 30.3 bits (69), Expect = 3.0
 Identities = 12/35 (34%), Positives = 24/35 (68%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ K+E   + +E K++  K  KKK+++KK +K+K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRK 396


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 49.9 bits (120), Expect = 2e-06
 Identities = 19/41 (46%), Positives = 31/41 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA KK KK  EK  KK+E++K++KKK+    KKK++++++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446



 Score = 47.6 bits (114), Expect = 1e-05
 Identities = 18/49 (36%), Positives = 35/49 (71%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            + T    K  +K EKK++E+K +KK+K    KKKE++++++K+KK+++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 46.8 bits (112), Expect = 2e-05
 Identities = 19/57 (33%), Positives = 36/57 (63%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           T+ +I   T       K  K  EK +K+++ +KKEKKK+    +KK+++++++K+KK
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 46.5 bits (111), Expect = 2e-05
 Identities = 17/42 (40%), Positives = 34/42 (80%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K  KK+E++KKEKK K    KK+++++E++K+KK+++K+++ 
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 45.7 bits (109), Expect = 4e-05
 Identities = 15/41 (36%), Positives = 35/41 (85%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +KA+KK +++K++K KK    K+K+++E+++K+KK+++K++
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 45.7 bits (109), Expect = 5e-05
 Identities = 16/43 (37%), Positives = 32/43 (74%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
              K ++K+++EKK KKK+    KKK+E+++++K+KK+++K  
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 44.1 bits (105), Expect = 1e-04
 Identities = 16/39 (41%), Positives = 27/39 (69%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK  KK K+   K ++K++E+KK++KKK    KKK+++ 
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 44.1 bits (105), Expect = 2e-04
 Identities = 15/48 (31%), Positives = 36/48 (75%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E    + KK     KKK+E++ ++KEKK+E+K++E+++ +++K+++++
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 42.6 bits (101), Expect = 5e-04
 Identities = 16/42 (38%), Positives = 37/42 (88%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K +++EKK+K+KKA   +KK+E++++EK+KK+++K+++++ 
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 41.4 bits (98), Expect = 0.001
 Identities = 15/41 (36%), Positives = 35/41 (85%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ +KKEKKKK    KKKE+++E++K++K+++K++++++ +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 40.7 bits (96), Expect = 0.002
 Identities = 15/42 (35%), Positives = 34/42 (80%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK KKK+    +KK +++E++KEKK++EK++++++ +++K+ 
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 39.9 bits (94), Expect = 0.003
 Identities = 14/41 (34%), Positives = 36/41 (87%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ ++K++KKK+  A KK++++E+++KEKK+++K++++++ 
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 39.9 bits (94), Expect = 0.003
 Identities = 13/42 (30%), Positives = 34/42 (80%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK K    KKKE++ +++++KKE++K+E++++ +++K++++ 
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 39.1 bits (92), Expect = 0.005
 Identities = 11/43 (25%), Positives = 34/43 (79%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           +K KK    KK+++ +++EK+K++++KE+++++ +++K+++  
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 38.0 bits (89), Expect = 0.011
 Identities = 12/37 (32%), Positives = 34/37 (91%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           KK +++E++K++K+ +K+E+++E +++++++++KKKK
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 46.8 bits (112), Expect = 4e-06
 Identities = 17/34 (50%), Positives = 22/34 (64%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          KKE K  EK  KK EK K K +K+K KK++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 44.8 bits (107), Expect = 2e-05
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KKE KA +K +KK +K K K +KKK KK++ K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 41.7 bits (99), Expect = 2e-04
 Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 5  TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
           K  +            KA +K +KK EK   K EKKK KK++ K +
Sbjct: 54 LKESLEAALLDK--KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 40.6 bits (96), Expect = 4e-04
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          K + K  +K EKKA+K + K EKKK +K++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 39.4 bits (93), Expect = 0.001
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
          KK  K  +K ++K  K K K ++KK K+++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 38.3 bits (90), Expect = 0.003
 Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 18 LDWKKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKK 52
          LD K+ K  EK  KK +KAK K +KK+ KK+E K +
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 36.4 bits (85), Expect = 0.011
 Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 10/53 (18%)

Query: 20 WKKAKKKE----------KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK KK E          K  KE        KKE K  EK +KK +K K K  
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAE 86



 Score = 35.2 bits (82), Expect = 0.032
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKN 62
          K++ K  +K EKK +K K K +KK 
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKK 89



 Score = 27.9 bits (63), Expect = 10.0
 Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 10/55 (18%)

Query: 23 AKKKEKKKKEKKAKK-----KEKKKEKK-----KKEKKKKKKKKKKKKKNNNNNN 67
          A+KK+ KK E +        K+ K+  +     KKE K  +K +KK +K      
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAE 86


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 49.2 bits (118), Expect = 4e-06
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK +KK ++K EKKA+K+E +K   KK+ +   KK K     
Sbjct: 412 KKLRKK-QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452



 Score = 48.0 bits (115), Expect = 1e-05
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            A++K+ +KK++KA+KK +K+E +K   KKK +   KK K  +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450



 Score = 47.2 bits (113), Expect = 2e-05
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK +K EKK ++++A+K   KK+ +   KK K    + KK +
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVD 457



 Score = 45.3 bits (108), Expect = 7e-05
 Identities = 16/43 (37%), Positives = 24/43 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
              +KK +KK+ K  KK EK++ +K   KKK +   KK K  +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450



 Score = 44.5 bits (106), Expect = 1e-04
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K ++K +KK EK+  +K   K+K +   KK K    + KK
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455



 Score = 42.2 bits (100), Expect = 6e-04
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
             + KK +KK++K EKK ++++ +K   KKK +      
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 41.8 bits (99), Expect = 7e-04
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 3/42 (7%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
             E+KK  KK +K EKK EK   E+ +K   KKK +      
Sbjct: 408 PAERKKLRKKQRKAEKKAEK---EEAEKAAAKKKAEAAAKKA 446



 Score = 33.8 bits (78), Expect = 0.26
 Identities = 12/54 (22%), Positives = 25/54 (46%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
            A+ +E++ +       E+KK +KK+ K +KK +K++ +K              
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 33.0 bits (76), Expect = 0.40
 Identities = 11/56 (19%), Positives = 27/56 (48%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           PL  +  +++ +      A++K+ +K+++K EKK +K++ +K              
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 30.7 bits (70), Expect = 2.6
 Identities = 9/54 (16%), Positives = 23/54 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
               ++E+ +    +  + KK  KK+++ +KK +K++ +K              
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 28.7 bits (65), Expect = 8.2
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K    + +E++ +       ++KK +KK++K +KK +K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK 427


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 47.6 bits (114), Expect = 4e-06
 Identities = 10/46 (21%), Positives = 25/46 (54%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           KAK + ++ +++  K++   +++K   +++K K K  +KK      
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 43.8 bits (104), Expect = 7e-05
 Identities = 13/41 (31%), Positives = 28/41 (68%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++A+++ KK++    K+K  ++E+K K K  +KK+K + +K
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153



 Score = 41.1 bits (97), Expect = 6e-04
 Identities = 10/49 (20%), Positives = 25/49 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           ++AK K + ++ ++  KK +   +K+K  ++++K K K  +        
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 41.1 bits (97), Expect = 6e-04
 Identities = 13/47 (27%), Positives = 31/47 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           ++AKK+E   +++KA ++E+K + K   KK+K + +K + ++   ++
Sbjct: 117 QQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163



 Score = 40.3 bits (95), Expect = 0.001
 Identities = 10/54 (18%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 23  AKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           A+K+ ++ K K +A++ E++ +K++   +K+K  ++++K               
Sbjct: 99  ARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 38.4 bits (90), Expect = 0.005
 Identities = 8/41 (19%), Positives = 24/41 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  ++ +K++   + +K  +++ K K +  +KK+K + +K 
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 37.6 bits (88), Expect = 0.008
 Identities = 10/44 (22%), Positives = 20/44 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +KA ++E+K K K  +KK K + +K + +        +      
Sbjct: 129 EKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172



 Score = 36.4 bits (85), Expect = 0.018
 Identities = 10/49 (20%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 31  KEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           +E KAK + ++ E   KK++   +K+K  ++++K               
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 33.4 bits (77), Expect = 0.22
 Identities = 8/41 (19%), Positives = 22/41 (53%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            +K++  ++E+K K K  +K++K + +K + +       + 
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166



 Score = 29.9 bits (68), Expect = 2.7
 Identities = 9/46 (19%), Positives = 18/46 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           +K K K  +KK K   +K + +     +  +    +  K K   + 
Sbjct: 136 RKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVKAGKSA 181


>gnl|CDD|152107 pfam11671, Apis_Csd, Complementary sex determiner protein.  This
          family of proteins represents the complementary sex
          determiner in the honeybee. In the honeybee, the
          mechanism of sex determination depends on the csd gene
          which produces an SR-type protein. Males are homozygous
          while females are homozygous for the csd gene.
          Heterozygosity generates an active protein which
          initiates female development.
          Length = 146

 Score = 46.3 bits (109), Expect = 5e-06
 Identities = 16/68 (23%), Positives = 44/68 (64%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          + +++++E+K  + +   +E ++  +++ + + ++++ ++ K  ++ +N +NN N NNN 
Sbjct: 6  YSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNY 65

Query: 80 NNNNNNNE 87
          NN NN N+
Sbjct: 66 NNYNNYNK 73



 Score = 45.9 bits (108), Expect = 7e-06
 Identities = 16/67 (23%), Positives = 41/67 (61%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          ++ +E+++K  K +   ++  +  +E+ + + ++++ +++   ++ +N +NN N NNN N
Sbjct: 7  SRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNYN 66

Query: 83 NNNNEEK 89
          N NN  K
Sbjct: 67 NYNNYNK 73



 Score = 44.7 bits (105), Expect = 2e-05
 Identities = 20/66 (30%), Positives = 36/66 (54%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          +K+ K E   +E +   +E+ +++ ++E+ ++ K        +NN N NNN NN NN N 
Sbjct: 14 QKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNYNNYNNYNK 73

Query: 81 NNNNNN 86
          +  N N
Sbjct: 74 HYYNIN 79



 Score = 38.2 bits (88), Expect = 0.003
 Identities = 14/76 (18%), Positives = 40/76 (52%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          +K   + +++++K  K +   +E ++  +++ + + ++++   +   ++ +N +NN N N
Sbjct: 3  RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYN 62

Query: 81 NNNNNNEEKGTFLYKM 96
          NN NN        Y +
Sbjct: 63 NNYNNYNNYNKHYYNI 78



 Score = 32.4 bits (73), Expect = 0.26
 Identities = 12/85 (14%), Positives = 39/85 (45%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           +K+  +++++E+K  K +   ++ ++  +++ ++      +  +   ++ +N +NN N  
Sbjct: 3   RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYN 62

Query: 89  KGTFLYKMAAPLVLVLTYVAEITQP 113
                Y         + Y+ +I  P
Sbjct: 63  NNYNNYNNYNKHYYNINYIEQIPVP 87



 Score = 32.0 bits (72), Expect = 0.38
 Identities = 10/63 (15%), Positives = 35/63 (55%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          ++K+  + +E++ K  + +   ++  +  +++ + + ++  +  +   ++ +N +NN N 
Sbjct: 2  SRKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNY 61

Query: 83 NNN 85
          NNN
Sbjct: 62 NNN 64


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 45.7 bits (109), Expect = 6e-06
 Identities = 16/35 (45%), Positives = 29/35 (82%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E+ K+EKK + KEKK+E+K+++++ K++KKK K +
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 41.5 bits (98), Expect = 2e-04
 Identities = 14/35 (40%), Positives = 28/35 (80%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           +  K+EKKK+ K+ K++EK+++++ K++KKK K +
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 14/34 (41%), Positives = 26/34 (76%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           + K++KKK  K KK+E+K+ K++ +++KKK K +
Sbjct: 97  RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 38.4 bits (90), Expect = 0.002
 Identities = 17/35 (48%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 30  KKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
           K E +  K+EKKK   EKK++EK++K++ K++KKK
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKK 126



 Score = 35.3 bits (82), Expect = 0.018
 Identities = 14/32 (43%), Positives = 24/32 (75%), Gaps = 1/32 (3%)

Query: 21  KKAKKK-EKKKKEKKAKKKEKKKEKKKKEKKK 51
           K+ KKK  K+KKE++ ++K + K++KKK K +
Sbjct: 99  KQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 29.9 bits (68), Expect = 1.3
 Identities = 19/65 (29%), Positives = 30/65 (46%), Gaps = 18/65 (27%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK--------------EKKKKKKKKK 57
           FTE      K + KK+KK   K+ +++  K+ KK                  K++KKK+ 
Sbjct: 51  FTEAV----KVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRS 106

Query: 58  KKKKN 62
           K+KK 
Sbjct: 107 KEKKE 111


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 47.5 bits (113), Expect = 6e-06
 Identities = 22/57 (38%), Positives = 26/57 (45%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
            EPTP   K+  K EKK K+ K K K K K K K + + K KK   K         N
Sbjct: 77  PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPN 133



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 16/67 (23%), Positives = 24/67 (35%), Gaps = 2/67 (2%)

Query: 13  TEPTPLDWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
            +P        +    + K+K K  KK +K K K K + K K K K + K     +    
Sbjct: 67  PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126

Query: 71  NNNNNNN 77
                 N
Sbjct: 127 KAPAAPN 133



 Score = 38.6 bits (90), Expect = 0.004
 Identities = 16/49 (32%), Positives = 22/49 (44%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +P P    +   +    K K+  K EKK +K K + K K K K K K 
Sbjct: 65  EQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113



 Score = 38.2 bits (89), Expect = 0.005
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           EP P      ++ +   + +   +    K K+K + +KK KK K K K
Sbjct: 55  EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102



 Score = 35.9 bits (83), Expect = 0.032
 Identities = 14/48 (29%), Positives = 21/48 (43%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
              P +  K   + +   E    K ++K + +KK KK K K K K K 
Sbjct: 60  PEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKP 107



 Score = 32.8 bits (75), Expect = 0.32
 Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 4/61 (6%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            +       E    + K  K K K     K K K K K K + + KK   K   K     
Sbjct: 77  PEPTPPKPKEKPKPEKKPKKPKPKP----KPKPKPKPKVKPQPKPKKPPSKTAAKAPAAP 132

Query: 65  N 65
           N
Sbjct: 133 N 133



 Score = 32.8 bits (75), Expect = 0.38
 Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 2/55 (3%)

Query: 7  HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
                  PT     + +  E++ K     + E   E    + K+K K +KK KK
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPT--EPETPPEPTPPKPKEKPKPEKKPKK 96


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 46.6 bits (111), Expect = 1e-05
 Identities = 16/44 (36%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 19 DWKKAKKKEKKK-KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           W + + +EK++ K K A K + KK  K K ++K+K K++K++K
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 45.8 bits (109), Expect = 2e-05
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           + K++EK K   KAK K+  K K ++++K K++K++K  +    
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 38.1 bits (89), Expect = 0.007
 Identities = 14/37 (37%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          ++++K+E+KAK   K K KK   K K ++K+K K++ 
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKAL-KAKIEEKEKAKREK 76



 Score = 35.8 bits (83), Expect = 0.037
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
            EK K +K  +  KKKK K K K N    N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 35.4 bits (82), Expect = 0.044
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 40  KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
            +K K +K  K  KKKK K K   N    N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 35.0 bits (81), Expect = 0.060
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           + +EK++EK K   K K KK  K K  
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIE 67



 Score = 34.6 bits (80), Expect = 0.084
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            EK K EK AK  +KKK K K +          
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 34.6 bits (80), Expect = 0.098
 Identities = 13/47 (27%), Positives = 29/47 (61%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            K ++KEK K+EK+ K   + +E   +++  +K + +K ++ ++ NN
Sbjct: 64  AKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQEESDLNN 110



 Score = 34.2 bits (79), Expect = 0.13
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          E+  +K+E+K +   K K KK  K K ++K 
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKE 70



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 10/44 (22%), Positives = 16/44 (36%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
             +K K +K  K  KK+K K K K                +++ 
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDDDF 234



 Score = 33.5 bits (77), Expect = 0.21
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
            EK K EK  K  KKKK K     N    N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 30.0 bits (68), Expect = 3.1
 Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKK---KKEKKKKKKKKKKK 59
                   K  K K ++K++ K +K+EK   +      E +  +K + +K
Sbjct: 52  VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101



 Score = 29.6 bits (67), Expect = 3.6
 Identities = 9/40 (22%), Positives = 25/40 (62%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KAK +EK+K +++ ++K  ++ ++   + +  +K + +K
Sbjct: 62  LKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 45.1 bits (107), Expect = 1e-05
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
            P + +K +K+  K+ E+K   K KK++K+KKEKK  +K  KKK      N  
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143



 Score = 44.8 bits (106), Expect = 2e-05
 Identities = 21/56 (37%), Positives = 34/56 (60%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           I  F E +  + K+  K+ ++K+  KAKK++K+K++KK  +K  KKK  K  KN  
Sbjct: 88  INLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143



 Score = 40.9 bits (96), Expect = 4e-04
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 21  KKAKKKEKKKKEKKA----KKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            KAKK++K+KKEKK      KK+  K  K   KK  KK   KK++  
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158



 Score = 39.0 bits (91), Expect = 0.002
 Identities = 17/41 (41%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K   K KK+K++K +KK  +K  KKK  K  K   KK  K
Sbjct: 108 EKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148



 Score = 38.2 bits (89), Expect = 0.003
 Identities = 17/49 (34%), Positives = 26/49 (53%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E   +  +  +K+  K K++K +KKEKK  +K  +KK  K  K   KK
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKK 145


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 45.6 bits (108), Expect = 1e-05
 Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 5/77 (6%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           P D +   KK  KKK+   +KK+  K+ KKK+K+K++  +   ++ +++  N  N+   +
Sbjct: 40  PEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVD 99

Query: 77  NN-----NNNNNNNNEE 88
                  N     N+E+
Sbjct: 100 YELPKVQNTAAEVNHED 116



 Score = 42.9 bits (101), Expect = 1e-04
 Identities = 19/61 (31%), Positives = 30/61 (49%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           +   KK  KK+K   +KKK  KK KKKKK+K++         +++  N  N+   +    
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103

Query: 89  K 89
           K
Sbjct: 104 K 104



 Score = 42.9 bits (101), Expect = 1e-04
 Identities = 21/78 (26%), Positives = 32/78 (41%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
            F E   +  KK  KK+K    KK   K+ KK+KK+KE+  +   ++      N  N+  
Sbjct: 38  IFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKK 97

Query: 71  NNNNNNNNNNNNNNNNEE 88
            +       N     N E
Sbjct: 98  VDYELPKVQNTAAEVNHE 115



 Score = 40.2 bits (94), Expect = 0.001
 Identities = 17/57 (29%), Positives = 28/57 (49%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K +KKK+    KKK  KK KKKKK+K++         +++  N  N+   +    + 
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKV 105


>gnl|CDD|233167 TIGR00881, 2A0104, phosphoglycerate transporter family protein.
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 379

 Score = 46.6 bits (111), Expect = 2e-05
 Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 15/158 (9%)

Query: 249 FGGMTTLQTYAVGIFESIH-APLDPYFATLLLGVAELGGALLCVVLIH--YTGKR-PLAL 304
            G +     Y     +    +     +A  L  +  L G LL   L    + G+R PLA+
Sbjct: 233 TGILDWSPLYL---TQEKGFSKEKASWAFTLYELGGLVGTLLAGWLSDKLFNGRRGPLAV 289

Query: 305 ISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPN 364
                     I+V++   +       +PL+  + L    FL +    +L  ++  E+ P 
Sbjct: 290 F-----FMALIIVSLLVYWLNPAA--NPLMDLICLFALGFLVYGPQ-MLIGVIASELAPK 341

Query: 365 NIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTL 402
               TA+G  G  +Y+           + D F   G  
Sbjct: 342 KAAGTAAGFVGFFAYLGGILAGLPLGYLADGFGWAGAF 379


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 46.6 bits (111), Expect = 2e-05
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          + + K      KK KK++K  E KKK +KKKKKKK+KK+  +       
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 23/54 (42%), Positives = 31/54 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
            ++KKK +KKK+KK +KKE K E + K   K  KK KK KK       N + +N
Sbjct: 64  NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDN 117



 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 11  TFTEPTPLDWKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
             T  T  D K+ K  E KKK  +KK KKKEKK+ K + E K   K  KK KK       
Sbjct: 49  VATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108

Query: 69  NNNNNNNNNNNNNNNNNNEEKGTFL 93
              N + +N  N      E+   ++
Sbjct: 109 PKPNEDVDNAFNKIAELAEKSNVYI 133



 Score = 43.2 bits (102), Expect = 3e-04
 Identities = 18/66 (27%), Positives = 29/66 (43%)

Query: 2  DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            S    +      T  + +        KK+KK  K  + K+K +K+KKKKK+KK+ K +
Sbjct: 27 IYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSE 86

Query: 62 NNNNNN 67
                
Sbjct: 87 GETKLG 92



 Score = 42.0 bits (99), Expect = 6e-04
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
          +E+ K    + KK+KK++K  + KKK +KKKKKKK+     +       
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 18/52 (34%), Positives = 23/52 (44%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
             E++ K      K+ KKE K  E KKK +KKKKKKK      +       
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 36.2 bits (84), Expect = 0.036
 Identities = 12/54 (22%), Positives = 22/54 (40%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
               +++ +      +K K++ K  + KKK +KKKKKK       +       
Sbjct: 39 LSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 35.1 bits (81), Expect = 0.099
 Identities = 14/53 (26%), Positives = 22/53 (41%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
              +++ K A    KK +K+ K  + KKK +KKKKK        +       
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 32.8 bits (75), Expect = 0.54
 Identities = 12/76 (15%), Positives = 19/76 (25%), Gaps = 7/76 (9%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN-------N 73
           K +             KE      ++E K      KK KK + NN +   +        
Sbjct: 19 SKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78

Query: 74 NNNNNNNNNNNNNEEK 89
                +        K
Sbjct: 79 EKKEPKSEGETKLGFK 94



 Score = 30.1 bits (68), Expect = 3.1
 Identities = 8/77 (10%), Positives = 20/77 (25%), Gaps = 7/77 (9%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN------ 75
          +  K +             K+      +++ K      KK+   + NN +   +      
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 76 -NNNNNNNNNNNEEKGT 91
                  +    + G 
Sbjct: 77 KKEKKEPKSEGETKLGF 93


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 43.0 bits (102), Expect = 4e-05
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K K K +KK+E+K  + EK  +   +EK + + +K +K+
Sbjct: 17 IAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE 57



 Score = 42.6 bits (101), Expect = 6e-05
 Identities = 13/39 (33%), Positives = 25/39 (64%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          AK+K K +K+++ KK E +K  K   ++K + + +K +K
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56



 Score = 41.5 bits (98), Expect = 1e-04
 Identities = 11/49 (22%), Positives = 27/49 (55%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
          W+K ++++K + EK AK   ++K + + EK +K+ ++ + +        
Sbjct: 24 WEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKA 72



 Score = 39.9 bits (94), Expect = 5e-04
 Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKKNN 63
           KA  KEK K EKK ++K+ + EK  K   E+K + + +K +K+  
Sbjct: 14 DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELE 59



 Score = 36.1 bits (84), Expect = 0.009
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K AK   ++K E + +K EK+ E+ + E  +++ K + KK 
Sbjct: 37 KLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKM 77



 Score = 35.3 bits (82), Expect = 0.020
 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 3/39 (7%)

Query: 24 KKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKK 59
           ++EK   +K   KA  KEK K +KK+E+KK + +K  K
Sbjct: 2  PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAK 40



 Score = 34.5 bits (80), Expect = 0.034
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +A+K  K   E+KA+ + +K EK+ +E + +  +++ K +
Sbjct: 31 KKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAE 73



 Score = 34.5 bits (80), Expect = 0.035
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K    KE  K   K K K +KK+++KK + +K  K   ++K
Sbjct: 6  KTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEK 46



 Score = 33.0 bits (76), Expect = 0.11
 Identities = 6/43 (13%), Positives = 21/43 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          +  K  +   +EK   + EK +++ ++ + +  +++ K +   
Sbjct: 34 EAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKK 76



 Score = 31.5 bits (72), Expect = 0.40
 Identities = 9/41 (21%), Positives = 24/41 (58%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          ++    +K+  +  AK+K K ++K++++K + +K  K   +
Sbjct: 4  EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44



 Score = 31.5 bits (72), Expect = 0.42
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          + +E+K    K   K   KEK K EKK+++KK + +K
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37



 Score = 31.1 bits (71), Expect = 0.54
 Identities = 9/37 (24%), Positives = 22/37 (59%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          + E++K     +  +   ++K K +KK+++KK + +K
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37



 Score = 30.3 bits (69), Expect = 1.0
 Identities = 10/41 (24%), Positives = 22/41 (53%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           A++K + + EK  K+ E+ + +  + + K + KK   +K 
Sbjct: 42 SAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82



 Score = 29.9 bits (68), Expect = 1.4
 Identities = 9/41 (21%), Positives = 21/41 (51%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +K E + ++ + + +E + E  ++E K + KK   +K 
Sbjct: 42 SAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82



 Score = 29.1 bits (66), Expect = 2.6
 Identities = 7/43 (16%), Positives = 23/43 (53%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          + +K    ++  K    +K + +K++++K+ + +K  K   ++
Sbjct: 3  EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEE 45



 Score = 28.4 bits (64), Expect = 4.8
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          + ++EK    KE  K   K++ K +KK+++KK + 
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEA 35


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 45.9 bits (109), Expect = 5e-05
 Identities = 22/99 (22%), Positives = 35/99 (35%), Gaps = 8/99 (8%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK------KKKNNNNNNNNNNNNNNNN 77
              E  + E K +K++ K++  KK+ KK KK          +  + N    N  NN    
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 78  NNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLR 116
                           Y +A  L +V +  A     H+R
Sbjct: 61  KKKKKKKKKNLGEA--YDLAYDLPVVWSSAAFQDNSHIR 97



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 9/81 (11%)

Query: 22 KAKKKEKKKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
          + K+K  KKK KK+KK              E ++ E K+  KKKKKKKK     N     
Sbjct: 16 QNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75

Query: 73 NNNNNNNNNNNNNNEEKGTFL 93
          +   +     ++   +  + +
Sbjct: 76 DLAYDLPVVWSSAAFQDNSHI 96



 Score = 32.4 bits (74), Expect = 0.69
 Identities = 18/67 (26%), Positives = 29/67 (43%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
             E ++ E K   K+KKK+KKKK+KK   +            ++    +N++     N  
Sbjct: 44  ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP 103

Query: 85  NNEEKGT 91
             E K T
Sbjct: 104 EQEWKQT 110



 Score = 32.0 bits (73), Expect = 0.83
 Identities = 12/51 (23%), Positives = 23/51 (45%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
              ++ + K+  K KKK+KKK+KKK   +            ++    +N+
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 18/90 (20%)

Query: 21  KKAKKKEKKKKEKK------------------AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+   K+K KK KK                  A+ K+  K+KKKK+KKKKKK   +    
Sbjct: 18  KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDL 77

Query: 63  NNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
             +     ++    +N++     N  +  +
Sbjct: 78  AYDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 45.6 bits (108), Expect = 5e-05
 Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 7/61 (11%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEK-------KKKEKKKKKKKKKKKKKNNNNN 66
           E       K + KEK+  ++K K+KEKK E+       KK+E+ + K + KK  K    N
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185

Query: 67  N 67
            
Sbjct: 186 K 186



 Score = 45.3 bits (107), Expect = 7e-05
 Identities = 18/41 (43%), Positives = 32/41 (78%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ K+E+K+KE+  ++K+KKKEK K+E K +K K++ K+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140



 Score = 44.5 bits (105), Expect = 1e-04
 Identities = 14/54 (25%), Positives = 32/54 (59%)

Query: 8   QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++       P    K  K+ K +  K+ +K++++ +++KK+KK+K K++ K +K
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 44.5 bits (105), Expect = 1e-04
 Identities = 15/41 (36%), Positives = 32/41 (78%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  ++K+KKK++ K + K++K +++ KEK+  K+K+K+K+K
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152



 Score = 44.1 bits (104), Expect = 1e-04
 Identities = 17/40 (42%), Positives = 30/40 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KK KK++ K++ K  K KE+ KEK+  ++K+K+K+KK ++
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156



 Score = 44.1 bits (104), Expect = 2e-04
 Identities = 18/41 (43%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K +KK+KK+K K+  K  K KE+ K+++  K+K+K+K+KK
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 18/41 (43%), Positives = 29/41 (70%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K  K E  K+E+K K++ K+++KKKKEK K++ K +K K+
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 17/42 (40%), Positives = 32/42 (76%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K KKKEK K+E K +K +++ ++K+  K+K+K+K+KK ++ 
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 43.3 bits (102), Expect = 2e-04
 Identities = 14/41 (34%), Positives = 31/41 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ +K++ K+++KK K+K K++ K +K K++ K+K+  K+K
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146



 Score = 42.6 bits (100), Expect = 4e-04
 Identities = 16/43 (37%), Positives = 31/43 (72%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           ++ KKK++K KE+   +K K++ K+K+  K+K+K+K+KK +  
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 42.2 bits (99), Expect = 6e-04
 Identities = 18/41 (43%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ +K++++ KE+K KKKEK KE+ K  K K++ K+K+  K
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPK 144



 Score = 42.2 bits (99), Expect = 6e-04
 Identities = 18/50 (36%), Positives = 31/50 (62%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
                 + +K + KE+KKK+K+  K+E K  K K+E K+K+  K+K+K+ 
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150



 Score = 41.8 bits (98), Expect = 7e-04
 Identities = 15/42 (35%), Positives = 31/42 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK K+K K++ + +  K+E K+++  KEK+K+K+KK ++ ++
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159



 Score = 41.8 bits (98), Expect = 7e-04
 Identities = 15/40 (37%), Positives = 30/40 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K    KE++K++++ K+++KKK++K KE+ K +K K++ K
Sbjct: 99  KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 41.8 bits (98), Expect = 8e-04
 Identities = 18/60 (30%), Positives = 31/60 (51%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           D   K +      P   + +K KK E+ +  ++ KK+E+ + K + +K  KKK   KKK+
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189



 Score = 41.4 bits (97), Expect = 0.001
 Identities = 18/59 (30%), Positives = 35/59 (59%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           D + K      ++      K AK+ + +  +++ K+KE+ KE+KKK+K+K K++ K +K
Sbjct: 74  DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 41.0 bits (96), Expect = 0.001
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           + + K + KK  K K   KKKE  ++EK+++  ++  K K    + N     
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219



 Score = 40.6 bits (95), Expect = 0.002
 Identities = 15/76 (19%), Positives = 35/76 (46%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
            E    +  +AK + KK  +KK   K+K+  +++K+++  ++  K K +  + N      
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220

Query: 73  NNNNNNNNNNNNNNEE 88
            ++  +     +  EE
Sbjct: 221 EDDGKDRETTTSPMEE 236



 Score = 39.9 bits (93), Expect = 0.003
 Identities = 18/46 (39%), Positives = 30/46 (65%), Gaps = 5/46 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKK 61
           +K K+KEKK +E + +++EKK+E+ +     K+  KKK   KKK+ 
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190



 Score = 39.9 bits (93), Expect = 0.003
 Identities = 15/41 (36%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K +  ++++KEK+  K+EKKK+K+K +++ K +K K++ K
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 39.5 bits (92), Expect = 0.004
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ +K   K    K K  ++ K +  KE++K+K++ K++KK
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKK 118



 Score = 38.7 bits (90), Expect = 0.008
 Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 18  LDWKKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L   +A K+ +K   K  A K +  KE K +  K+++K+K++ K+
Sbjct: 71  LSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115



 Score = 38.7 bits (90), Expect = 0.008
 Identities = 15/45 (33%), Positives = 29/45 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           K+  K  K K+E K K+  K+KEK+K++K ++ + ++++KK    
Sbjct: 125 KEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169



 Score = 37.9 bits (88), Expect = 0.013
 Identities = 13/49 (26%), Positives = 31/49 (63%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E      +K K++ K +K K+  K+++  ++K+KEK+KK ++ + +++
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162



 Score = 37.2 bits (86), Expect = 0.019
 Identities = 14/70 (20%), Positives = 30/70 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K   KK  KKK    KK+  ++EK+++  ++  K K ++   N       ++  +     
Sbjct: 172 KSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTT 231

Query: 81  NNNNNNEEKG 90
           +    +E + 
Sbjct: 232 SPMEEDESRQ 241



 Score = 37.2 bits (86), Expect = 0.023
 Identities = 14/68 (20%), Positives = 33/68 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            K++ K+  KK+   KKKE  +E+K+++  ++  K K ++ + N       ++  +    
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230

Query: 81  NNNNNNEE 88
            +    +E
Sbjct: 231 TSPMEEDE 238



 Score = 36.0 bits (83), Expect = 0.051
 Identities = 14/70 (20%), Positives = 32/70 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++ + +E++KK ++ + K + K+  KK+   KKK+  +++K                + N
Sbjct: 155 EEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVN 214

Query: 81  NNNNNNEEKG 90
                 E+ G
Sbjct: 215 EEREKEEDDG 224



 Score = 35.6 bits (82), Expect = 0.063
 Identities = 12/71 (16%), Positives = 30/71 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+ + + K + +K  KKK   K+K+  E++K+++  ++  K      + N       ++ 
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDG 224

Query: 81  NNNNNNEEKGT 91
            +         
Sbjct: 225 KDRETTTSPME 235


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 44.8 bits (106), Expect = 6e-05
 Identities = 18/42 (42%), Positives = 27/42 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KKAK++ KK+ E++AK K   + KKK  + KKK + + K K 
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K   + EKK KE+  K+ E++ + K   + KKK  + KKK 
Sbjct: 135 KAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKA 175



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKKK 61
           K AK+ E+  K+ + K+K  E+ K K+  E K K + + +KK 
Sbjct: 102 KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKA 144



 Score = 43.3 bits (102), Expect = 2e-04
 Identities = 15/40 (37%), Positives = 24/40 (60%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K   + K K E +A+KK K++ KK+ E++ K K   + KK
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166



 Score = 42.9 bits (101), Expect = 3e-04
 Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 21  KKAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           ++AK K   E KKK  +AKKK + + K K E K K K ++ K K 
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKA 198



 Score = 41.0 bits (96), Expect = 0.001
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+A+++ K K   +AKKK  + +KK + + K K + K K K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190



 Score = 40.6 bits (95), Expect = 0.001
 Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 22  KAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +AKK+   E K K     KK+  + KKK E + K K + K K
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 40.6 bits (95), Expect = 0.001
 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
            +AK K + + EKKAK++ KK+   E K K   + KKK  + KK
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173



 Score = 40.6 bits (95), Expect = 0.001
 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            + K K+  E KAK  E + EKK KE+ KK+ +++ K K
Sbjct: 122 EEAKAKQAAEAKAKA-EAEAEKKAKEEAKKQAEEEAKAK 159



 Score = 40.2 bits (94), Expect = 0.002
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KK   + KKK E +AK K + K K K E+ K K +  K K
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 39.0 bits (91), Expect = 0.005
 Identities = 12/41 (29%), Positives = 16/41 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KA+ K K K E+   K E  K K   E   K + +     
Sbjct: 181 AKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAA 221



 Score = 38.7 bits (90), Expect = 0.006
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +A  ++  K+ ++A K+ ++K+K+ +E K K+  + K K 
Sbjct: 97  RAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136



 Score = 38.7 bits (90), Expect = 0.007
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA+ + K K E KAK K ++ + K +  K K   +   K 
Sbjct: 173 KKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKA 213



 Score = 37.9 bits (88), Expect = 0.010
 Identities = 8/39 (20%), Positives = 24/39 (61%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +K +  E+ ++++  ++   +K  K+ E+  K+ ++K+K
Sbjct: 81  EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 37.5 bits (87), Expect = 0.013
 Identities = 13/41 (31%), Positives = 22/41 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K A + +KK  E K K + + K K + + K K ++ K K +
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199



 Score = 37.1 bits (86), Expect = 0.018
 Identities = 11/42 (26%), Positives = 18/42 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            KAK +E K K + AK K   +   K E +       + ++ 
Sbjct: 187 AKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERK 228



 Score = 37.1 bits (86), Expect = 0.019
 Identities = 14/40 (35%), Positives = 26/40 (65%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             + +EK+K+ ++AK K+  + K K E + +KK K++ KK
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150



 Score = 36.0 bits (83), Expect = 0.048
 Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
                  K A KKE+++++K  ++ E+ ++++  E+ ++K+ +++   
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAA 100



 Score = 34.4 bits (79), Expect = 0.12
 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKK-EKKKKKKKKKKKKK 61
            K K + + K + +AK K K +E K K E  K K   +   K
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212



 Score = 32.5 bits (74), Expect = 0.53
 Identities = 8/50 (16%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
                 KK +++ ++ ++++A ++ ++K   ++   EK  K+ ++  K+ 
Sbjct: 65  KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQA 114



 Score = 29.4 bits (66), Expect = 5.2
 Identities = 9/44 (20%), Positives = 18/44 (40%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           +AK K +  K K A +   K E +       + ++K  +    +
Sbjct: 193 EAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGD 236


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 44.3 bits (105), Expect = 7e-05
 Identities = 18/61 (29%), Positives = 30/61 (49%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
             ++K+  +   K  K+KK ++ KK+     KK +N N +   N   NN NN  N   +E
Sbjct: 86  NYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKE 145

Query: 89  K 89
           +
Sbjct: 146 E 146



 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 24/80 (30%), Positives = 32/80 (40%), Gaps = 5/80 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK-----EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
            K    E   K    KK+  +      ++KK E+ KK+     KKK+N N +   N   N
Sbjct: 74  GKNNGTEIYYKLNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECIN 133

Query: 76  NNNNNNNNNNNEEKGTFLYK 95
           N NN  N    EE      K
Sbjct: 134 NKNNVINKKIKEESKKKNSK 153



 Score = 40.4 bits (95), Expect = 0.002
 Identities = 18/69 (26%), Positives = 27/69 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
              KK+  +   K  K+K+ ++ KK+     KKK  K      N   NN NN  N     
Sbjct: 86  NYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKE 145

Query: 81  NNNNNNEEK 89
            +   N + 
Sbjct: 146 ESKKKNSKN 154



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 3/62 (4%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
           K+KK ++ KK+     KK   K  K    +K +  NN NN  N      +   N+ N  +
Sbjct: 101 KEKKIERFKKRVNAYFKK---KSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEK 157

Query: 88  EK 89
            K
Sbjct: 158 LK 159



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 19/59 (32%), Positives = 25/59 (42%), Gaps = 4/59 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           +++KKK  K  EK   KK  KK       KK+ K K  K  K ++N    N    N   
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQK 203



 Score = 30.4 bits (69), Expect = 2.0
 Identities = 20/53 (37%), Positives = 24/53 (45%), Gaps = 5/53 (9%)

Query: 24  KKKEKKKKE-----KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           K  EK K +     +K K K+  K   K +K  KKKKK  KK   N   N  N
Sbjct: 233 KIYEKYKNKPHFIIEKNKYKDLNKIIGKFKKSFKKKKKNSKKNYENIKINIFN 285



 Score = 30.4 bits (69), Expect = 2.2
 Identities = 25/62 (40%), Positives = 28/62 (45%), Gaps = 10/62 (16%)

Query: 21  KKAKKKEKKKKEKKAKKK-----EKKKEK-----KKKEKKKKKKKKKKKKKNNNNNNNNN 70
           KK  +K  +K  +K K K     EK K K       K KK  KKKKK  KKN  N   N 
Sbjct: 224 KKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIGKFKKSFKKKKKNSKKNYENIKINI 283

Query: 71  NN 72
            N
Sbjct: 284 FN 285



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 19/77 (24%), Positives = 28/77 (36%), Gaps = 8/77 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKE--------KKKKKKKKKKKKKNNNNNNNNNNN 72
           +K  ++ KK+     KKK  K    +K            KK K++ KKKN+ N       
Sbjct: 102 EKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIK 161

Query: 73  NNNNNNNNNNNNNNEEK 89
                 N   N   + K
Sbjct: 162 KYFKKCNFKTNLKKDIK 178


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 39.9 bits (94), Expect = 2e-04
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK K+ EK + K+ K++  KK  +KKK+K
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKRK 32



 Score = 38.7 bits (91), Expect = 4e-04
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          KK K+ EK + ++  K+  KK  +KKK K   K+ +K
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40



 Score = 37.2 bits (87), Expect = 0.001
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK+++  K + K+ +K+  +K  +KKK+K   K
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36



 Score = 37.2 bits (87), Expect = 0.001
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK K+ ++ +AK+ +K+  KK   KKK+K   K+ +K
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40



 Score = 35.7 bits (83), Expect = 0.004
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K++K+ EK   K+ KK+  KK  +KKK+K   K+
Sbjct: 2  LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 33.7 bits (78), Expect = 0.023
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 7/50 (14%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKK-------EKKKKEKKKKKKKKKKKKKNN 63
          K+AKK+  KK  +K K+K   K       E +  E+ + + K++ KKK N
Sbjct: 15 KRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKKGN 64



 Score = 27.2 bits (61), Expect = 5.0
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K++K+ +K + K+ KK+  KK  +
Sbjct: 2  LLKKRKRNEKLRAKRAKKRAAKKAAR 27


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 44.2 bits (105), Expect = 2e-04
 Identities = 17/41 (41%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+  ++EK++KE   ++K  +K KK++EKKKK+ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 41.2 bits (97), Expect = 0.002
 Identities = 18/44 (40%), Positives = 32/44 (72%), Gaps = 2/44 (4%)

Query: 20  WKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           WK   K+E +++  EK+A K++K+  K KK+++KKKK+ +K +K
Sbjct: 545 WKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 39.6 bits (93), Expect = 0.004
 Identities = 14/41 (34%), Positives = 29/41 (70%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
             + ++ K++++  KE+K  +K KK+E+KKK++ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 37.3 bits (87), Expect = 0.022
 Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 5/51 (9%)

Query: 10  GTFT----EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           G       +   L  +K +K+  K++ K+ +K +K++EKKKKE +K +K K
Sbjct: 541 GPSVWKLDDKEELQREKEEKEALKEQ-KRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 30.4 bits (69), Expect = 3.0
 Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 35/80 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKK----------------------------------- 45
           K+ K+  K KK+++ KKKE +K +K                                   
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGEEIS 623

Query: 46  KKEKKKKKKKKKKKKKNNNN 65
           KKE+KK  K+  K+ K +  
Sbjct: 624 KKERKKLSKEYDKQAKLHEE 643



 Score = 29.6 bits (67), Expect = 5.6
 Identities = 10/34 (29%), Positives = 24/34 (70%), Gaps = 1/34 (2%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K  +K+  ++EK+ EK+  +++K+ +K KK+++ 
Sbjct: 546 KLDDKEELQREKE-EKEALKEQKRLRKLKKQEEK 578


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 44.0 bits (105), Expect = 2e-04
 Identities = 9/42 (21%), Positives = 15/42 (35%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
              K     +   K+   KK   K  +K K++    +K  K 
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 41.3 bits (98), Expect = 0.001
 Identities = 8/41 (19%), Positives = 16/41 (39%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            +K +       +  A KK   K  +K +++    +K  K 
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 41.3 bits (98), Expect = 0.001
 Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K +   +  AK    KK   K  +K K++    +K 
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 40.9 bits (97), Expect = 0.002
 Identities = 9/44 (20%), Positives = 16/44 (36%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L  K  K +       K    +K   K  ++ K++    +K  K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 39.8 bits (94), Expect = 0.004
 Identities = 8/45 (17%), Positives = 18/45 (40%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           L+    K ++ +       K    K+   K  +K K++    +K+
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 34.0 bits (79), Expect = 0.23
 Identities = 8/41 (19%), Positives = 16/41 (39%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           P  L +      +    +K A K  +K +++    +K  K 
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 43.8 bits (104), Expect = 2e-04
 Identities = 19/81 (23%), Positives = 30/81 (37%)

Query: 1   MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            +   +  I         D        KKK +  AK    K   KKK K +    KK +K
Sbjct: 49  PEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108

Query: 61  KNNNNNNNNNNNNNNNNNNNN 81
           KN  + +++ N   + +  N 
Sbjct: 109 KNALDKDDDLNYVKDIDVLNQ 129



 Score = 38.4 bits (90), Expect = 0.008
 Identities = 15/69 (21%), Positives = 25/69 (36%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
                 E    +KK K   K    K   KKK K +    KK    N  + +++ N   + 
Sbjct: 65  DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124

Query: 81  NNNNNNEEK 89
           +  N  ++ 
Sbjct: 125 DVLNQADDD 133



 Score = 36.5 bits (85), Expect = 0.034
 Identities = 15/55 (27%), Positives = 24/55 (43%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            T+      ++AKKK KK   K   K    KE+ K+  + KKK  ++  +    
Sbjct: 4  ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58



 Score = 36.5 bits (85), Expect = 0.038
 Identities = 14/49 (28%), Positives = 24/49 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
            A+++ KKK +K A K + K    K+E K+  + KKK  +  +     
Sbjct: 10 LAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58



 Score = 36.1 bits (84), Expect = 0.042
 Identities = 17/94 (18%), Positives = 26/94 (27%)

Query: 1   MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
               T  QI           K      +    KK  K   K    K   KKK K +    
Sbjct: 44  SKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSS 103

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
           K     N  + +++ N   + +  N  +      
Sbjct: 104 KKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDD 137



 Score = 35.7 bits (83), Expect = 0.056
 Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 22 KAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
          KA+   +++ +KK KK   K K K    K++ K+  + KKK     +     
Sbjct: 7  KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58



 Score = 35.4 bits (82), Expect = 0.081
 Identities = 16/50 (32%), Positives = 23/50 (46%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          + A ++E KKK KK   K K K    KE+ K+  + KKK     +     
Sbjct: 9  ELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58



 Score = 34.2 bits (79), Expect = 0.21
 Identities = 14/41 (34%), Positives = 20/41 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            A  K +   E++AKKK KK   K K K    K++ K+  
Sbjct: 2  TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEAL 42



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 12/38 (31%), Positives = 17/38 (44%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
               K E  A+++ KKK KK   K K K    K++  
Sbjct: 2  TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIK 39



 Score = 30.7 bits (70), Expect = 2.3
 Identities = 9/60 (15%), Positives = 16/60 (26%)

Query: 5  TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          TK +I    E      ++  +             +  +    K+K K   K    K    
Sbjct: 34 TKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAK 93


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 44.4 bits (104), Expect = 2e-04
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            +  K  E  K E +A   E +  ++K E  +KKK++ KKK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 44.0 bits (103), Expect = 2e-04
 Identities = 18/45 (40%), Positives = 28/45 (62%), Gaps = 4/45 (8%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK----KKKKKKKKK 61
             KA+ +    + + A++K +  EKKK+E KKK    KKK ++KKK
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 43.2 bits (101), Expect = 3e-04
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK E+ KK  +A K E +    + E  ++K +  +KKK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 42.8 bits (100), Expect = 5e-04
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K +E KK  + AK + +    + +  ++K +  +KKK+
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 42.4 bits (99), Expect = 6e-04
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 2/41 (4%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKE--KKKKEKKKKKKKKKKKK 60
            +A K E +    +A+  E+K E  +KKKE+ KKK    KKK
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 41.7 bits (97), Expect = 0.001
 Identities = 12/44 (27%), Positives = 22/44 (50%), Gaps = 3/44 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKK 61
            KKA++ +KK    K K +E KK     K + +    + +  ++K
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365



 Score = 41.7 bits (97), Expect = 0.001
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK---KKK 61
            KKA++ +K  +  KA+ +    E +  E+K +  +KKK   KKK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 40.5 bits (94), Expect = 0.003
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K +E KKK   AKKK ++ +K  +  K + +    + +
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360



 Score = 40.1 bits (93), Expect = 0.004
 Identities = 10/42 (23%), Positives = 19/42 (45%)

Query: 19   DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            + KK     KKK E+  K  E  K + +    + +  ++K +
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 40.1 bits (93), Expect = 0.004
 Identities = 9/40 (22%), Positives = 20/40 (50%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K    KKK ++AKK  +  + + +    + +  ++K +
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 39.7 bits (92), Expect = 0.004
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +AKKK ++ K+K    K+K +E KK  +  K + +    +
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358



 Score = 39.4 bits (91), Expect = 0.007
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 4/43 (9%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKK 59
             +A+  E+K +  + KK+E KK+    KKK E+KKK  + KKK
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 39.0 bits (90), Expect = 0.007
 Identities = 18/41 (43%), Positives = 29/41 (70%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +A +K+K++ +KKA   +KK E+KKK  + KKK ++ KKK
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406



 Score = 39.0 bits (90), Expect = 0.009
 Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 3/43 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
            +KA+  EKKK+E K K    KK   EKKK ++ KKK ++ KKK
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406



 Score = 38.2 bits (88), Expect = 0.014
 Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            KK E+ KK  +AKKK ++ +KK    KKK ++ KK 
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 37.0 bits (85), Expect = 0.029
 Identities = 17/35 (48%), Positives = 23/35 (65%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            KK E+ KK  +AKKK ++ +KK  E KK  + KKK
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 37.0 bits (85), Expect = 0.031
 Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 4/42 (9%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK----KKK 61
            KK E+KKK  +AKKK ++ +KK  E KK    KKK    KKK
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 37.0 bits (85), Expect = 0.033
 Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 3/44 (6%)

Query: 21   KKAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K K++ KKK +   KKA++K+K  E KKK ++ KKK  + KK 
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 36.7 bits (84), Expect = 0.035
 Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKK 60
            KKA++ +KK  E K     KKK  E KKK ++KKK  + KKK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 36.7 bits (84), Expect = 0.037
 Identities = 19/40 (47%), Positives = 29/40 (72%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKA++K+K  + KKA++K+K  E KKK ++ KK  + KKK
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323



 Score = 36.7 bits (84), Expect = 0.039
 Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 7/51 (13%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKK----KKKKKKKKKNNN 64
            KKA   +KK  E K K +EKKK    KKK E+ KK    KKK ++ KK   
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461



 Score = 36.7 bits (84), Expect = 0.039
 Identities = 21/44 (47%), Positives = 29/44 (65%), Gaps = 4/44 (9%)

Query: 22   KAKKKEKKKK----EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +AKK E+KKK    +KKA++ +K  E KKK ++ KKK    KKK
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337



 Score = 36.7 bits (84), Expect = 0.041
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 3/42 (7%)

Query: 22   KAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K K +E KK E   KKA++ +K  E KKK ++ KK  + KKK
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 36.3 bits (83), Expect = 0.049
 Identities = 20/42 (47%), Positives = 30/42 (71%), Gaps = 1/42 (2%)

Query: 21   KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKA++ +KK +E KKA + +KK E+ KK  + KKK ++ KKK
Sbjct: 1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498



 Score = 36.3 bits (83), Expect = 0.053
 Identities = 18/41 (43%), Positives = 29/41 (70%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKA + +K +++KKA + +KK E+ KK  + KKK ++ KKK
Sbjct: 1290 KKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330



 Score = 35.9 bits (82), Expect = 0.064
 Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 3/42 (7%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKK 59
            KKA + +KK +E K K  E KK    KKK ++ KKK ++KKK
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 35.9 bits (82), Expect = 0.069
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            ++AKK E+ KK+ +  KK ++ EKK  E  KK+ ++ KK
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703



 Score = 35.9 bits (82), Expect = 0.071
 Identities = 17/45 (37%), Positives = 24/45 (53%)

Query: 14   EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            E    D  K K +E KK ++  KK E+ K+K    KKK ++ KK 
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 35.5 bits (81), Expect = 0.081
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKK 61
            KKA + +KK +E K   + KKK  E KKK    KKK ++ KK 
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 35.5 bits (81), Expect = 0.096
 Identities = 19/41 (46%), Positives = 29/41 (70%), Gaps = 1/41 (2%)

Query: 21   KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKA + +KK +E KKA++ +KK E+ KK  + KKK ++ KK
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 35.5 bits (81), Expect = 0.096
 Identities = 16/40 (40%), Positives = 28/40 (70%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            ++AKK E+ +K+     K++ +E KK E+ KKK+ ++KKK
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 35.1 bits (80), Expect = 0.12
 Identities = 18/42 (42%), Positives = 27/42 (64%), Gaps = 3/42 (7%)

Query: 22   KAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K K +E KK +   KKA++ +K +E KKK ++ KK  + KKK
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478



 Score = 35.1 bits (80), Expect = 0.13
 Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 4/44 (9%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK----KKKKKKKKK 61
            KA++  K  + KKA++K+K  E KK E+KKK    KKK ++ KK
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316



 Score = 34.7 bits (79), Expect = 0.16
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  K  E KKK ++AKKK  + +K  + KKK  + KK ++ 
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521



 Score = 34.7 bits (79), Expect = 0.16
 Identities = 19/42 (45%), Positives = 28/42 (66%), Gaps = 1/42 (2%)

Query: 21   KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKA + +KK +E KKA + +KK E+ KK+  + KK  + KKK
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 34.7 bits (79), Expect = 0.17
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 3/43 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
            +K K  E KK E+K K  E KK   E KK ++ KKK ++ KKK
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330



 Score = 34.7 bits (79), Expect = 0.17
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
              K +E KKK ++AKK E+  ++KK  +  KK+ ++ KK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEE--DEKKAAEALKKEAEEAKK 1703



 Score = 34.7 bits (79), Expect = 0.17
 Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 4/45 (8%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEK----KKKEKKKKKKKKKKKKK 61
            KKA+++ K K  ++AKK E+ K+K    KK E+ +KK  +  KK+
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697



 Score = 34.3 bits (78), Expect = 0.18
 Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 8/48 (16%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKK----KKKKKKKKK 61
            K K +EKKK ++  KK E+ K+    KKK E+ KK    KKK ++ KK
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 34.3 bits (78), Expect = 0.18
 Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKKNNNN 65
            KKA++ EKK  E   K+ E+ K   E KKKE ++KKK ++ KK    N
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728



 Score = 34.3 bits (78), Expect = 0.19
 Identities = 16/40 (40%), Positives = 24/40 (60%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K +E KKK  +AKK  + K+K  + KK ++ KK  + K
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528



 Score = 34.3 bits (78), Expect = 0.20
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            K A+ K+K  + KKA++ +K  E KK E+ KK  + KK
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541



 Score = 34.3 bits (78), Expect = 0.20
 Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK----KKKKKKKKKK 61
            K K +E KK E+  KK  +  +K+ +E KK    KKK+ ++KKK
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 34.3 bits (78), Expect = 0.21
 Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 4/44 (9%)

Query: 21   KKAKKKEKKKKEKKAKK----KEKKKEKKKKEKKKKKKKKKKKK 60
            +  K  E KKK ++AKK    K+K +E KKK  + KK  + KKK
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 34.3 bits (78), Expect = 0.21
 Identities = 15/41 (36%), Positives = 28/41 (68%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ KK E++ K K A++ +K +E KKK ++ KK ++ +KK 
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690



 Score = 34.3 bits (78), Expect = 0.22
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K  E+ KK ++ KKK ++ +K ++++KK  +  KK+ +
Sbjct: 1660 KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699



 Score = 34.3 bits (78), Expect = 0.22
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            KKA++ +K  + KKA++ +K  E KK E+KKK  + KK
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553



 Score = 34.3 bits (78), Expect = 0.23
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            KKA + +K  + KK   + KK E+ KK  + KK ++ KK
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 34.0 bits (77), Expect = 0.24
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKA + +KK +E K K  E KK  + K+K  + KK ++ KK
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523



 Score = 34.0 bits (77), Expect = 0.24
 Identities = 18/38 (47%), Positives = 28/38 (73%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            KKA++ +K  + KKA++K+K  E KK E+ KK ++KKK
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565



 Score = 34.0 bits (77), Expect = 0.24
 Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKK 58
            KKA++ +KK  E K   + KKK  E KK E+ KK  + KK
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529



 Score = 34.0 bits (77), Expect = 0.26
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 9/52 (17%)

Query: 22   KAKKKEKKKKEKKAKKKEKKK---------EKKKKEKKKKKKKKKKKKKNNN 64
            +AKK E+ KK  +AKK E+KK         E KK E+KKK ++ KK +++ N
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577



 Score = 34.0 bits (77), Expect = 0.27
 Identities = 16/39 (41%), Positives = 29/39 (74%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            +KA + +K +++KKA + +K +EKKK ++ KKK ++ KK
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316



 Score = 34.0 bits (77), Expect = 0.27
 Identities = 20/43 (46%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
            KK     KKK E+K K  E KK   E KKK  + KK    KKK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 33.6 bits (76), Expect = 0.35
 Identities = 17/38 (44%), Positives = 25/38 (65%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            KK + +EKKK E+  K +E+ K K  +E KK ++ KKK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676



 Score = 33.2 bits (75), Expect = 0.43
 Identities = 16/43 (37%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 21   KKAKKKEKKKKE--KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             K  +++KKK E  KKA++ EKK  +  K++ ++ KK ++ KK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 33.2 bits (75), Expect = 0.43
 Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 21   KKAKKKEKKKKEKKAKKKE--KKKEKKKKEKKKKKKKKKKKKK 61
            KK   + KKK E+  KK +  KK    KK+  + KKK ++KKK
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 32.8 bits (74), Expect = 0.54
 Identities = 12/41 (29%), Positives = 27/41 (65%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K A++ +K +++KK  ++ KK E+ +K+  +  KK+ ++ K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702



 Score = 32.8 bits (74), Expect = 0.58
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K K+ E+KKK ++ KK E++ + K  E+ KK ++ KKK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676



 Score = 32.4 bits (73), Expect = 0.84
 Identities = 16/38 (42%), Positives = 25/38 (65%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K +EKKK ++  K +EKKK  + K+K ++ KK  + KK
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322



 Score = 32.4 bits (73), Expect = 0.85
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ KK E+ KK ++  K +  +E KK E+ KKK ++ KK +
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684



 Score = 32.4 bits (73), Expect = 0.88
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 19   DWKKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKK 58
            + KK   + KK  E K K  E KK  E KK ++ KK ++ KK
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 32.0 bits (72), Expect = 1.1
 Identities = 16/38 (42%), Positives = 27/38 (71%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K K+ E+KKK ++ KK E++ + K +E KK+ ++ KKK
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 32.0 bits (72), Expect = 1.2
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            KK + +EKKK E+  K +E+ K K ++ KK+ ++ KKK
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 32.0 bits (72), Expect = 1.2
 Identities = 19/43 (44%), Positives = 31/43 (72%), Gaps = 4/43 (9%)

Query: 21   KKAKKKEKKKKEKKAK----KKEKKKEKKKKEKKKKKKKKKKK 59
            KKA++ +K ++E K K    KKE +++KKK E+ KK +++KKK
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758



 Score = 31.6 bits (71), Expect = 1.4
 Identities = 18/41 (43%), Positives = 30/41 (73%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            K K +E KK E++ KK E+ K+K+ +EKKK ++ KK +++N
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659



 Score = 31.6 bits (71), Expect = 1.4
 Identities = 19/72 (26%), Positives = 34/72 (47%)

Query: 18   LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            L  ++ KK E+ +KEK+A  +E+  E+ +K + +  KK K    N  N        N   
Sbjct: 1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVI 1821

Query: 78   NNNNNNNNNEEK 89
            N++    ++  K
Sbjct: 1822 NDSKEMEDSAIK 1833



 Score = 30.9 bits (69), Expect = 2.1
 Identities = 19/46 (41%), Positives = 29/46 (63%), Gaps = 4/46 (8%)

Query: 19   DWKKAKKKEKKKKEKKAK----KKEKKKEKKKKEKKKKKKKKKKKK 60
            D KKA +  KK+ E+  K    KK++ +EKKK E+ KK +++ K K
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731



 Score = 30.9 bits (69), Expect = 2.2
 Identities = 16/39 (41%), Positives = 28/39 (71%)

Query: 25   KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            K E+ KK ++ KKK ++ +KK+ E+KKK ++ KK ++ N
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659



 Score = 30.9 bits (69), Expect = 2.7
 Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 4/44 (9%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKK 60
            KKA++ +KK+ E+K K +E KK     K K E+ KK+ ++ KKK
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 30.9 bits (69), Expect = 2.7
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             K K +E KK+ ++ KKK ++ +K ++EKKK    KK+++K
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768



 Score = 30.5 bits (68), Expect = 2.8
 Identities = 13/37 (35%), Positives = 25/37 (67%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KK+ ++KK+ +  KK +++ K K  ++ KK ++ KKK
Sbjct: 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676



 Score = 30.5 bits (68), Expect = 3.0
 Identities = 20/44 (45%), Positives = 32/44 (72%), Gaps = 3/44 (6%)

Query: 21   KKAKKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK K +E KK E+   KA++ +K +E+KKK ++ KKK+ ++KKK
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648



 Score = 30.5 bits (68), Expect = 3.1
 Identities = 16/41 (39%), Positives = 30/41 (73%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ KKK ++ K+K+A++K+K +E KK E++ K K  ++ KK
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669



 Score = 30.5 bits (68), Expect = 3.3
 Identities = 16/40 (40%), Positives = 29/40 (72%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K + +E KKK ++AKK E++K+K    KK+++KK ++ +K
Sbjct: 1736 KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775



 Score = 30.5 bits (68), Expect = 3.3
 Identities = 16/41 (39%), Positives = 29/41 (70%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ KK E++KK+ +  KK++ +EKKK E+ KK +++ K K 
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663



 Score = 30.5 bits (68), Expect = 3.4
 Identities = 24/123 (19%), Positives = 46/123 (37%), Gaps = 7/123 (5%)

Query: 429  RDIEEHFADKGKTFVTNIRRAEKKRHAARVEGQEEVKGEDNFGMEGEDGKYISRVVGTLY 488
            R  E     + +      R+AE  + A  V+  EE K +     + E+ +    +     
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259

Query: 489  MYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAKVEGQEEVKGEDNFGMEGE 548
                        ++  E   AD+       +++AE+K+ A + +  EE K  D    + E
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADE-------LKKAEEKKKADEAKKAEEKKKADEAKKKAE 1312

Query: 549  DGK 551
            + K
Sbjct: 1313 EAK 1315



 Score = 30.5 bits (68), Expect = 3.5
 Identities = 17/38 (44%), Positives = 26/38 (68%), Gaps = 2/38 (5%)

Query: 24   KKKEKKKKEKKAKKKE--KKKEKKKKEKKKKKKKKKKK 59
            KK ++ KK ++AKK +  KK E+ KK  + KK ++KKK
Sbjct: 1510 KKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547



 Score = 30.1 bits (67), Expect = 3.7
 Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEK--------KKKKKKKKKKKKNNNNNNNNNNNN 73
            K  ++EKKK     K++EKK E+ +KEK         ++ +K++ +      +  +N  N
Sbjct: 1750 KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFAN 1809

Query: 74   NNNNNNNNNNNNNEEKGTF 92
                    N   N+ K   
Sbjct: 1810 IIEGGKEGNLVINDSKEME 1828



 Score = 30.1 bits (67), Expect = 4.1
 Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 8    QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK----KKK 61
            +I  F E     + + +   K ++ +KA + +K +EKKK ++ KK ++KKK    KKK
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310



 Score = 30.1 bits (67), Expect = 4.4
 Identities = 15/39 (38%), Positives = 29/39 (74%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KA++ +K+ +E K K +E KK++++K+K    KK+++KK
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769



 Score = 29.3 bits (65), Expect = 6.2
 Identities = 16/41 (39%), Positives = 29/41 (70%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K  K E+ KK ++AK K ++ +K ++EKKK ++ KKK+ +
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644



 Score = 29.3 bits (65), Expect = 7.2
 Identities = 17/43 (39%), Positives = 30/43 (69%), Gaps = 2/43 (4%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKK 61
            K+A++ +K ++ KK + +EKKK  E KK E++ K K ++ KK+
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738



 Score = 29.3 bits (65), Expect = 7.5
 Identities = 20/44 (45%), Positives = 30/44 (68%), Gaps = 4/44 (9%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEK----KKKEKKKKKKKKKKKKK 61
            K  ++EKK K ++AKK E+ K K    KK E++KKK ++ KKK+
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642



 Score = 29.0 bits (64), Expect = 8.5
 Identities = 15/37 (40%), Positives = 25/37 (67%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            KKA + +K ++ KKA++K+K +E KK E+ K    +K
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582



 Score = 29.0 bits (64), Expect = 9.2
 Identities = 16/42 (38%), Positives = 28/42 (66%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            +K K +E KK E++ K K  ++ KK +E KKK ++ KK +++
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686



 Score = 29.0 bits (64), Expect = 9.6
 Identities = 16/48 (33%), Positives = 32/48 (66%)

Query: 12   FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            + E   +  ++AKK E+ K + +  KK ++++KK ++ KKK+ ++KKK
Sbjct: 1601 YEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
          short domain found in bacterial type II/III secretory
          system proteins. The architecture of these proteins
          suggest that this family may be functionally analogous
          to pfam03958.
          Length = 95

 Score = 40.0 bits (94), Expect = 2e-04
 Identities = 5/33 (15%), Positives = 24/33 (72%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
          + +N++++++N++N  +++++++ +   GT + 
Sbjct: 29 SGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRIT 61



 Score = 39.2 bits (92), Expect = 4e-04
 Identities = 3/30 (10%), Positives = 22/30 (73%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
          ++ +N++++++N++N  +++++++ +    
Sbjct: 28 SSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57



 Score = 37.7 bits (88), Expect = 0.001
 Identities = 3/25 (12%), Positives = 22/25 (88%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
          +++ +N++++++N++N  +++++++
Sbjct: 27 SSSGSNSSSSSSNSSNGGSSSSSSS 51



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 3/25 (12%), Positives = 21/25 (84%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
           +++ +N++++++N++N  ++++++
Sbjct: 26 VSSSGSNSSSSSSNSSNGGSSSSSS 50


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 13/65 (20%), Positives = 27/65 (41%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           +  P+   K + K K K + K K  +K +E+ K+E K  + +     +N       ++  
Sbjct: 86  KEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTA 145

Query: 74  NNNNN 78
               +
Sbjct: 146 TAAAS 150



 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 14  EPTPLDWKKAK---KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           EP P   K+A     K + K + K K K K  +K +++ K++ K  + +  +   N    
Sbjct: 79  EPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPA 138

Query: 71  NNNNNNNNNNNN 82
              ++      +
Sbjct: 139 RPTSSTATAAAS 150



 Score = 40.4 bits (95), Expect = 0.001
 Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 1/74 (1%)

Query: 14  EPTPL-DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           EP P+ +  K       K E K K K K K K  K+ +++ K++ K  +    +   N  
Sbjct: 77  EPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTA 136

Query: 73  NNNNNNNNNNNNNN 86
                ++      +
Sbjct: 137 PARPTSSTATAAAS 150


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 41.4 bits (97), Expect = 3e-04
 Identities = 16/66 (24%), Positives = 33/66 (50%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
           +E +  E + +  E ++E+++ E+     K  +KK  N+  N+  ++N  N  + N   N
Sbjct: 63  EEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKN 122

Query: 86  NEEKGT 91
            + K T
Sbjct: 123 EKSKKT 128



 Score = 39.5 bits (92), Expect = 0.001
 Identities = 12/65 (18%), Positives = 34/65 (52%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           ++ E  + E++  + E+++E+ +++    K  +KK   +  N+  ++N  N  + N   N
Sbjct: 63  EEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKN 122

Query: 84  NNNEE 88
             +++
Sbjct: 123 EKSKK 127



 Score = 36.8 bits (85), Expect = 0.009
 Identities = 14/73 (19%), Positives = 36/73 (49%), Gaps = 9/73 (12%)

Query: 25  KKEKKKKEKKAKKKEKKK------EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           ++E+++ E++ ++ E  +      E +++E++ ++     K     N N+  N+  ++N 
Sbjct: 52  EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNA 111

Query: 79  NN---NNNNNNEE 88
            N    N   NE+
Sbjct: 112 QNLISKNYKKNEK 124


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
          Provisional.
          Length = 522

 Score = 43.2 bits (102), Expect = 3e-04
 Identities = 16/47 (34%), Positives = 23/47 (48%)

Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAP 99
              ++    +NN ++N NNNNNNNN  N+NN N        K+   
Sbjct: 1  NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNNGNNQASKLPRG 47



 Score = 42.0 bits (99), Expect = 7e-04
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
              ++    +NN ++N NNNNNNNN  N+NN NN 
Sbjct: 1  NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNN 36



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
              ++     NN ++N NNNNNNNN  N+NN NNN  
Sbjct: 1  NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNNGN 38



 Score = 40.9 bits (96), Expect = 0.002
 Identities = 19/34 (55%), Positives = 21/34 (61%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
          +N NNNNNNNN  N+NN NNN NN   K     K
Sbjct: 16 DNYNNNNNNNNQINSNNPNNNGNNQASKLPRGKK 49



 Score = 34.3 bits (79), Expect = 0.15
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
          NNNNN  N+NN NNN NN  +     +K  
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKKQ 51



 Score = 32.8 bits (75), Expect = 0.48
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           NNNN  N+NN NNN NN  +     ++K
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKK 50



 Score = 29.3 bits (66), Expect = 6.3
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
            NNN  N+NN NNN NN  +      +++ 
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKKQE 52


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 42.9 bits (102), Expect = 3e-04
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
          KKE K   K  KKKEK++ K  K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 42.5 bits (101), Expect = 4e-04
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          KKE KA  K +KK++K++ K  K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 39.4 bits (93), Expect = 0.004
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K+ K++ K     K + K   K +KKKEK++ K  K K K 
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 37.1 bits (87), Expect = 0.016
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          KK+ K   K +KKKEK++ +  K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 37.1 bits (87), Expect = 0.016
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKK 46
             KA  K +KKKEK+  K  K K K +
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 37.1 bits (87), Expect = 0.019
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK  K   K +KKK++++ K  K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 36.3 bits (85), Expect = 0.036
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          K+ KE+       KKE K   K +KKK+K++ K     
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87



 Score = 36.0 bits (84), Expect = 0.047
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
          K + K   K +KK +K+E K  K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.8 bits (81), Expect = 0.092
 Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          KE K++ K A   +K+ +   K +KKK+K++ K  K  
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87


>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain.  This domain is
           found in Plasmodium Duffy binding proteins. Plasmodium
           vivax and Plasmodium knowlesi merozoites invade human
           erythrocytes that express Duffy blood group surface
           determinants. The Duffy receptor family is localised in
           micronemes, an organelle found in all organisms of the
           phylum Apicomplexa.
          Length = 276

 Score = 42.3 bits (100), Expect = 4e-04
 Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 12/70 (17%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK------------EKKKKEKKKKKKKKKKKKKNNNNNNN 68
           +K KKK K+KK  K   K+  K            + KKKE  K+KKK KK KK NN  N 
Sbjct: 206 EKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKKRNNKKNY 265

Query: 69  NNNNNNNNNN 78
           ++  +     
Sbjct: 266 SDIKDKYAKE 275



 Score = 41.6 bits (98), Expect = 7e-04
 Identities = 22/77 (28%), Positives = 30/77 (38%), Gaps = 18/77 (23%)

Query: 26  KEKKKKEKKAKKK----EKKKEKKKKEKK------------KKKKK--KKKKKKNNNNNN 67
           +E+KKK +K KKK    +  K   KK  K              KKK   K+KKK      
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258

Query: 68  NNNNNNNNNNNNNNNNN 84
            NN  N ++  +     
Sbjct: 259 RNNKKNYSDIKDKYAKE 275



 Score = 37.3 bits (87), Expect = 0.013
 Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 12/77 (15%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKK------------KKKKKKKKKNNNNNNNNNN 71
           ++++KK ++ K K KEKK  K   +K  K              KKK+  K          
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258

Query: 72  NNNNNNNNNNNNNNNEE 88
            NN  N ++  +   +E
Sbjct: 259 RNNKKNYSDIKDKYAKE 275



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 12/74 (16%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK--KNN----------NNNNNNNNNNNN 75
           K    KK ++  KK  KK K+K K   KK ++   + N             NN       
Sbjct: 106 KDGNTKKLEENLKKIFKKIKDKLKPNYKKLREDWWEANREDIWKAMLCGLKNNAKYFKKY 165

Query: 76  NNNNNNNNNNNEEK 89
           +     N+  +  +
Sbjct: 166 SYKCRCNSCTSLPE 179



 Score = 28.5 bits (64), Expect = 9.7
 Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 13/58 (22%)

Query: 50 KKKKKKKKKKKKNNNNNNNNNNN-------------NNNNNNNNNNNNNNEEKGTFLY 94
          KKKKKK  KK   N N    N               N  N N NN  + ++    F+ 
Sbjct: 2  KKKKKKNGKKWNCNKNKKKKNKGVCVPPRRQQLCLYNLENLNENNIKDEHDLLEAFIK 59



 Score = 28.5 bits (64), Expect = 9.9
 Identities = 16/71 (22%), Positives = 23/71 (32%), Gaps = 13/71 (18%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK-------------KKKKKNNNNNNNNN 70
           K    KK E+  KK  KK + K K   KK ++                  KNN       
Sbjct: 106 KDGNTKKLEENLKKIFKKIKDKLKPNYKKLREDWWEANREDIWKAMLCGLKNNAKYFKKY 165

Query: 71  NNNNNNNNNNN 81
           +     N+  +
Sbjct: 166 SYKCRCNSCTS 176


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 41.1 bits (97), Expect = 4e-04
 Identities = 21/90 (23%), Positives = 40/90 (44%), Gaps = 3/90 (3%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           KKK+   K  K   K KKK +++ + + +++K+KKK K   + + +N  N  +       
Sbjct: 1   KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQ 60

Query: 84  NNNEEKGTFLYKMAAPLVLVLTYVAEITQP 113
             +   G+   K   PL +      +  +P
Sbjct: 61  KKDPRIGS---KKPIPLGVEEKVKPKKKKP 87



 Score = 39.9 bits (94), Expect = 0.001
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          KK K   K+ K     KK+ ++E   + +++K+KKK K  K+ + +N  N  +      
Sbjct: 1  KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQA 59



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 25/65 (38%)

Query: 22 KAKKKEKKKKEK------------KAKKKEKKKEKK------KK-------EKKKKKKKK 56
          +A+++++KKK K               K + + +KK      KK       EK K KKKK
Sbjct: 27 EARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKK 86

Query: 57 KKKKK 61
           K KK
Sbjct: 87 PKSKK 91



 Score = 29.1 bits (66), Expect = 3.6
 Identities = 21/62 (33%), Positives = 26/62 (41%), Gaps = 25/62 (40%)

Query: 22 KAKKKEK--------------KKKEKKAKKKE-----KKK------EKKKKEKKKKKKKK 56
          K KKK K               K + +A+KK+     KK       EK K +KKK K KK
Sbjct: 32 KRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSKK 91

Query: 57 KK 58
           K
Sbjct: 92 PK 93


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 41.9 bits (99), Expect = 5e-04
 Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 12  FTEPTPLDWKKAKKKEKKK--------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
             E  PLD    + K+KK+         E +A+KK K KE+ KK+K K+ K  K  KK  
Sbjct: 217 IPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV 276



 Score = 39.2 bits (92), Expect = 0.004
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           + ++K+K +++ KKK+ K+ K  K  KK   K  KK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           T  D + A+ + +KK++ K + K+KK ++ K  K  KK   K  KK
Sbjct: 237 TEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +A +   +KK K  ++ +KKK K+ K  K  KK   K  K 
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 34.2 bits (79), Expect = 0.12
 Identities = 14/44 (31%), Positives = 25/44 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K   K  K+ K+KK + +E  +  + + +KK+K K++ KKK   
Sbjct: 221 KPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 39.2 bits (92), Expect = 5e-04
 Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 11/52 (21%)

Query: 25 KKEKKKKE-----------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          ++E K KE           ++ ++KE K++  +     KKKKK+K KK  + 
Sbjct: 11 EEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +E+++KE K K  E     KKK+K+K KKK+ +  
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 36.1 bits (84), Expect = 0.005
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          W +  K   +++E+K  K++  +     +KKKK+K KKK+ 
Sbjct: 21 WDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRD 61



 Score = 33.8 bits (78), Expect = 0.042
 Identities = 10/35 (28%), Positives = 22/35 (62%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          +++++K+ K+   +     KKKK++K KKK+ +  
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 33.0 bits (76), Expect = 0.080
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          +++E+K+ ++KA +     +KKKK K KKK+ +  
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 31.5 bits (72), Expect = 0.24
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          ++ ++KE K+K  +     KKK+K+K +KK+ +  
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 27.2 bits (61), Expect = 6.5
 Identities = 9/52 (17%), Positives = 22/52 (42%), Gaps = 10/52 (19%)

Query: 41 KKEKKKKEK----------KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          ++E K KE+          +++++K+ K+K +  NN+             + 
Sbjct: 11 EEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
          glycoprotein C-terminal domain.  The trypanosome
          parasite expresses these proteins to evade the immune
          response.
          Length = 98

 Score = 39.3 bits (92), Expect = 5e-04
 Identities = 17/53 (32%), Positives = 22/53 (41%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           K K KE K K+  A   +    +   EK K KK KK  KK      N   ++
Sbjct: 25 GKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCKDS 77



 Score = 38.2 bits (89), Expect = 0.001
 Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 1/46 (2%)

Query: 19 DWKKAKKKEK-KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           W K +   K K KE KAKK      +    +   +K K KK K +
Sbjct: 17 KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62



 Score = 32.0 bits (73), Expect = 0.18
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K  K + + KEK  K  +K+ + K K K+ K KK    
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40



 Score = 30.9 bits (70), Expect = 0.41
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K K + + K+K  K +KK+ + K K K+ K KK 
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37



 Score = 30.5 bits (69), Expect = 0.65
 Identities = 14/39 (35%), Positives = 18/39 (46%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          KK + K + K+   K  KKE   K K K+ K KK     
Sbjct: 3  KKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPV 41



 Score = 29.7 bits (67), Expect = 0.93
 Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK K K   K++  K  KK++  K K K+ K KKN
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37



 Score = 28.9 bits (65), Expect = 1.9
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K K K + ++K  K +KK++  K + K+ K KK    
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 42.5 bits (100), Expect = 5e-04
 Identities = 13/41 (31%), Positives = 18/41 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA  + KKK   +AK    K   + K   +K    K  +K
Sbjct: 209 KKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEK 249



 Score = 42.1 bits (99), Expect = 5e-04
 Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK--KEKKKKKKKKKKKKK 61
            KA  + KKK E +AKKK   + KKK   E K    K   + K
Sbjct: 193 AKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAK 235



 Score = 41.7 bits (98), Expect = 7e-04
 Identities = 14/41 (34%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA+ + KKK   +AKKK   + K    K   + K   +K 
Sbjct: 201 KKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKA 241



 Score = 39.4 bits (92), Expect = 0.004
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           AKKK + + +KKA  + KKK   + +    K   + K  
Sbjct: 199 AKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237



 Score = 38.2 bits (89), Expect = 0.010
 Identities = 18/41 (43%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA  + KKK E +A  K   + KKK E + KKK   + KK
Sbjct: 177 KKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK 217



 Score = 37.5 bits (87), Expect = 0.015
 Identities = 11/39 (28%), Positives = 15/39 (38%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           KKA  + K    K A + +   EK    K  +K    K 
Sbjct: 217 KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255



 Score = 35.2 bits (81), Expect = 0.075
 Identities = 13/43 (30%), Positives = 17/43 (39%), Gaps = 3/43 (6%)

Query: 21  KKAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            +AKKK   E K    KA  + K   +K    K  +K    K 
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255



 Score = 35.2 bits (81), Expect = 0.094
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 21  KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKK 60
           KKA+ +  KK   +AKKK   +   K   E KKK + + KKK
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210



 Score = 34.4 bits (79), Expect = 0.16
 Identities = 17/41 (41%), Positives = 22/41 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA  + KKK E +A KK   + KKK E +   K   + KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKK 201



 Score = 32.9 bits (75), Expect = 0.40
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K   + E K+    AKK   + +KK + +  KK   + KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKK 186



 Score = 32.5 bits (74), Expect = 0.52
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 5/46 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK-----KKKKKKKKKKK 61
           K+A  K+K+ +E  AK     K K + E K      KK   + KKK
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKK 170



 Score = 32.5 bits (74), Expect = 0.55
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           ++   +E+KK+ ++A K+   K+K+ +E   K     K K
Sbjct: 109 ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAK 148



 Score = 31.7 bits (72), Expect = 1.0
 Identities = 14/41 (34%), Positives = 19/41 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KA    K K E +AK+     +K   E KKK + +  KK 
Sbjct: 139 AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKA 179



 Score = 30.5 bits (69), Expect = 2.5
 Identities = 9/38 (23%), Positives = 27/38 (71%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           KK +++ ++ ++K+A ++E+ K+ +K+    +++KK+ 
Sbjct: 83  KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120



 Score = 30.2 bits (68), Expect = 3.2
 Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKK 61
           ++A  K     + KA+ + K+     KK   + KKK + +  KK
Sbjct: 135 EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178



 Score = 29.8 bits (67), Expect = 3.8
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K      KK   +AKKK + +  KK   + KKK + +   
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAA 193



 Score = 29.8 bits (67), Expect = 4.2
 Identities = 10/41 (24%), Positives = 27/41 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+A ++E+ K+ +K +   ++++K+ +E  K+   K+K+ +
Sbjct: 95  KQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE 135



 Score = 29.4 bits (66), Expect = 4.9
 Identities = 11/41 (26%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ KKKE+++ E+  +K+  ++E+ K+ +K++   +++KK+
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119



 Score = 29.4 bits (66), Expect = 5.7
 Identities = 8/37 (21%), Positives = 22/37 (59%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ +K++    ++K++ +E  K+   K+K+ ++   K
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140



 Score = 29.0 bits (65), Expect = 7.2
 Identities = 11/39 (28%), Positives = 30/39 (76%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K AK+ E+++K+K+ ++ E+ ++K+  E+++ K+ +K++
Sbjct: 72  KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKER 110



 Score = 28.6 bits (64), Expect = 9.6
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           A   +K   E K K + +  +K   E KKK + +   K
Sbjct: 157 AAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 17/63 (26%), Positives = 32/63 (50%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+ KK E+K ++ +A+ ++K++E +K + K   K  KK  K  ++N +      N   N 
Sbjct: 102 KELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNF 161

Query: 81  NNN 83
              
Sbjct: 162 KEK 164



 Score = 42.3 bits (100), Expect = 6e-04
 Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK------KKKKKKNNNNNNNNNNNNNNNN 77
            KKE KK E+K ++ E + EKK++E +K K K      KK  KK ++N +      N   
Sbjct: 100 LKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKK 159

Query: 78  NNNNN 82
           N    
Sbjct: 160 NFKEK 164



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 1/70 (1%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN-NNNNNNNNNNNNNNNNN 79
            + ++ +K+ K+ + K ++ + E +KKE++ +K K K   K         ++N +     
Sbjct: 95  AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKG 154

Query: 80  NNNNNNNEEK 89
            N   N +EK
Sbjct: 155 LNYKKNFKEK 164



 Score = 37.7 bits (88), Expect = 0.017
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            A +KEKK  EK     EK+ ++ + E K  +K+ K+ +K 
Sbjct: 411 DAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451



 Score = 32.3 bits (74), Expect = 0.82
 Identities = 15/69 (21%), Positives = 26/69 (37%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            + K++E +K + K   K  KK  KK +    +  K    K N          +   N +
Sbjct: 118 IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEKLLKELKSVILNAS 177

Query: 81  NNNNNNEEK 89
           +  +  E K
Sbjct: 178 SLLSLEELK 186



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 4/71 (5%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN--NNNNNNN 78
            + +KKE++   +KAK K   K  KK  KK      +  K  N   N          +  
Sbjct: 116 AEIEKKEEEL--EKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEKLLKELKSVI 173

Query: 79  NNNNNNNNEEK 89
            N ++  + E+
Sbjct: 174 LNASSLLSLEE 184



 Score = 29.2 bits (66), Expect = 7.7
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           K     EK+ K+ +A+ K  +KE K+ EK+
Sbjct: 422 KAINSLEKEIKQLEAEIKALEKEIKELEKQ 451


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 38.9 bits (91), Expect = 5e-04
 Identities = 18/64 (28%), Positives = 24/64 (37%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLV 101
           K K KK   KKKKKKKKKK  +             ++  ++    EE             
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70

Query: 102 LVLT 105
           + LT
Sbjct: 71  MNLT 74



 Score = 38.2 bits (89), Expect = 0.001
 Identities = 15/47 (31%), Positives = 25/47 (53%)

Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           L  K  K   KKKK+KK KK + K+E   ++++++K   +   K  
Sbjct: 9  KLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55



 Score = 37.0 bits (86), Expect = 0.002
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          K K KK    KKK+KKK+K K +++   +K++++K +  ++    
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55



 Score = 36.6 bits (85), Expect = 0.003
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          K K K+   K+KK KKK+K K K++   +K++++K   + +    
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55



 Score = 36.2 bits (84), Expect = 0.004
 Identities = 18/61 (29%), Positives = 30/61 (49%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
          K K  K   KKK+KKKK+K K K++   +K+    ++  ++      + +N     EE G
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70

Query: 91 T 91
           
Sbjct: 71 M 71



 Score = 35.5 bits (82), Expect = 0.008
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
           K K K KK + KKK+KKKKKK K K++          ++  ++      + +NE+   
Sbjct: 8  GKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQ 66



 Score = 35.1 bits (81), Expect = 0.010
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            K K   KK   K+KKKK+KKK K K++   +      ++  ++    
Sbjct: 7  GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55



 Score = 34.3 bits (79), Expect = 0.019
 Identities = 17/59 (28%), Positives = 22/59 (37%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
                K K K KK   KKKKKKKKKK K              ++  ++      +E  
Sbjct: 3  SNVVGGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDN 61



 Score = 34.3 bits (79), Expect = 0.019
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
             K K K +K   KK+KKK+KKK + K++   +K++++ ++  ++    
Sbjct: 6  VGGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 41.9 bits (99), Expect = 6e-04
 Identities = 11/38 (28%), Positives = 27/38 (71%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           ++ K+++KK++ +    K   +E++K E+K++KK+ +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 40.7 bits (96), Expect = 0.001
 Identities = 13/44 (29%), Positives = 33/44 (75%), Gaps = 3/44 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKK 61
            + +++E+ +++K+ KKKE+++ K  K   E+++K ++K++KK+
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319



 Score = 38.8 bits (91), Expect = 0.006
 Identities = 13/41 (31%), Positives = 31/41 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K +++E++K  K A+++ +++ ++KKE+KKK++++ K  K
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 37.6 bits (88), Expect = 0.011
 Identities = 13/46 (28%), Positives = 30/46 (65%), Gaps = 7/46 (15%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKK-------KKKKKKKKKKN 62
           K  E++++E+  +KKE+KK+++++ K         +K ++K++KK 
Sbjct: 274 KAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319



 Score = 37.6 bits (88), Expect = 0.014
 Identities = 12/41 (29%), Positives = 29/41 (70%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  +K  K +E++ +K  K  E++++E+ ++KK++KKK++
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE 295



 Score = 32.2 bits (74), Expect = 0.64
 Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 9/49 (18%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKK---------KEKKKKKKKKKKKKKNNN 64
             E  +K  K +++E++K  K          +EKK++KKK++++ K   
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 40.9 bits (96), Expect = 6e-04
 Identities = 14/68 (20%), Positives = 36/68 (52%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             S +            D + A+ +E K++EK+A   E K++K   EK+ ++ +++ +++
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98

Query: 62  NNNNNNNN 69
           +  +++ N
Sbjct: 99  DEESSDEN 106



 Score = 40.9 bits (96), Expect = 7e-04
 Identities = 12/69 (17%), Positives = 33/69 (47%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            S +        E    D ++  + E+ K+E+K     + KE K   +K+ ++ +++ ++
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 62  NNNNNNNNN 70
            +  +++ N
Sbjct: 98  EDEESSDEN 106



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 10/54 (18%), Positives = 29/54 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           K   ++  + +E K ++KE    + K++K   +K+ ++ ++ N   +  +++ N
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDEN 106



 Score = 38.6 bits (90), Expect = 0.004
 Identities = 13/82 (15%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
             +         ++  KK     ++ A+ +E K+E+K+    + K+ K   +K +  +  
Sbjct: 36  FPSSPSDQAAADEQEAKK--SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93

Query: 69  NNNNNNNNNNNNNNNNNNEEKG 90
            N   +  +++ N     E+  
Sbjct: 94  ENEEEDEESSDENEKETEEKTE 115



 Score = 28.2 bits (63), Expect = 9.2
 Identities = 8/42 (19%), Positives = 24/42 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +  + K   +KE +  ++E ++E ++   + +K+ ++K + N
Sbjct: 76  EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 40.0 bits (94), Expect = 6e-04
 Identities = 13/53 (24%), Positives = 34/53 (64%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
          D ++ +++++++KE+K +K+EK+KE  K   +K++++K+ ++     N     
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65



 Score = 34.6 bits (80), Expect = 0.040
 Identities = 12/56 (21%), Positives = 35/56 (62%), Gaps = 7/56 (12%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE-------KKKKKKKKKKKKKNNNNNNNN 69
          +    +EK++++++ K+++++KE+K+KE       K++++K+ ++ +K  N     
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65



 Score = 32.7 bits (75), Expect = 0.20
 Identities = 11/59 (18%), Positives = 34/59 (57%), Gaps = 8/59 (13%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK--------KKKKKKKKKKNNNNNNNNNNN 72
          K+ +    +++++ K++EK+++++K+EK+K        K++++K+ ++     N     
Sbjct: 7  KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65



 Score = 29.2 bits (66), Expect = 3.2
 Identities = 7/35 (20%), Positives = 26/35 (74%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          ++K  +  + ++++++KE++K++K++K++K+    
Sbjct: 5  RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWG 39


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 42.1 bits (99), Expect = 7e-04
 Identities = 16/45 (35%), Positives = 35/45 (77%)

Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          P + ++ +K+EKK++E+K   KE++ ++++++++KKKK KK K+ 
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 41.3 bits (97), Expect = 0.001
 Identities = 16/44 (36%), Positives = 32/44 (72%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          + ++ K+++K+++EK   K+E+  E+++KE+KKKK KK K+   
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82



 Score = 38.6 bits (90), Expect = 0.007
 Identities = 15/41 (36%), Positives = 31/41 (75%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           ++++++KKE++ K  +K++E  ++E+K++KKKK KK K  
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 35.1 bits (81), Expect = 0.088
 Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 8/48 (16%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKK--------EKKKKKKKKKKKKK 61
          K  +KE   +E++ +K+EKK+E++K         E+++K++KKKK KK
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76


>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family.  This family
           includes a range of acyltransferase enzymes. This domain
           is found in many as yet uncharacterized C. elegans
           proteins and it is approximately 300 amino acids long.
          Length = 326

 Score = 41.8 bits (98), Expect = 7e-04
 Identities = 28/202 (13%), Positives = 66/202 (32%), Gaps = 8/202 (3%)

Query: 221 RRPNYRMYMRR---TFLLPYAIVTSLFFI--GQFGGMTTLQTYAVGIFESIHAPLDPYFA 275
           RR +   ++++     L+PY   + L+F+     GG++      + +   +     P   
Sbjct: 65  RRRSLFKFLKKRLLRLLIPYLFWSLLYFLLGLLVGGLSVYGLLLLLLLLGLLFGGGPNGH 124

Query: 276 TLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVP 335
              L  A     LL  +L+    K    L+      A  +++++     L  G    ++ 
Sbjct: 125 LWFLP-ALFVFYLLLPLLLRLLRKLKKLLLL--LLLALLLLLSLLYILILLVGLPPTVLN 181

Query: 336 TVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDT 395
            +  +L  FL    +      +  +     I    + A  +   +  F    +Y      
Sbjct: 182 LLIGLLPFFLLGALLARYRKRIRSKRLLLLIVILLALALLALILLLLFLFGLVYLAPELY 241

Query: 396 FHLWGTLYFYAAISVVGTLYMY 417
            +    L     + ++    + 
Sbjct: 242 GYFSLLLLLLGVLLLLLLALLL 263



 Score = 36.7 bits (85), Expect = 0.025
 Identities = 31/194 (15%), Positives = 54/194 (27%), Gaps = 20/194 (10%)

Query: 225 YRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAEL 284
            R+  +   LL   ++  L  +     +  L      +   +   L P+F          
Sbjct: 142 LRLLRKLKKLLLLLLLALLLLLSLLYILILLVGLPPTVLNLLIGLL-PFFLL-------- 192

Query: 285 GGALLCVVLIHYTGKRPLALISTGGSAA---CFIVVAVYAQFHLSYGWDSPLVPTVFLVL 341
            GALL         KR L LI    + A     +++                   + L+L
Sbjct: 193 -GALLARYRKRIRSKRLLLLIVILLALALLALILLLLFLFGLVYLAPELYGYFSLLLLLL 251

Query: 342 AAFLTHICIRLLPWMLIGEVFPNNIRATASGASGS-SSYIFAFAVNKLYYPMLDTFHLWG 400
              L  +   LL          +  R        S   Y+    +  L   +L      G
Sbjct: 252 GVLLLLLLALLLAN------LRSLKRLLKYLGKYSLGIYLIHPPILLLLTKLLLLLPPLG 305

Query: 401 TLYFYAAISVVGTL 414
            +  +    V+  L
Sbjct: 306 PILLFLLALVLTLL 319


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 40.0 bits (94), Expect = 7e-04
 Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKKN 62
               P   KKAK+K ++ K K  K++ EK+KEK+++  K   + +K++ + 
Sbjct: 58  ESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAEL 108



 Score = 38.9 bits (91), Expect = 0.001
 Identities = 14/44 (31%), Positives = 29/44 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +K  K++ ++ + KAKK+E +KEK+K+E+  K   + +K++   
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAEL 108



 Score = 32.0 bits (73), Expect = 0.37
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 4/46 (8%)

Query: 20  WKKAKKKEKKK-KEKKAKKKEKKKEKKKKEKKKKK---KKKKKKKK 61
             K ++ EK+K KE++  K   + EK++ E +KKK   K  K++KK
Sbjct: 78  KAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123



 Score = 30.8 bits (70), Expect = 0.86
 Identities = 14/38 (36%), Positives = 26/38 (68%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           D  KAKK+E +K+++K ++  K   + +KE+ + +KKK
Sbjct: 75  DKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKK 112


>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region.  This
          family contains sequences that are similar to the
          N-terminal region of Red protein. This and related
          proteins contain a RED repeat which consists of a
          number of RE and RD sequence elements. The region in
          question has several conserved NLS sequences and a
          putative trimeric coiled-coil region, suggesting that
          these proteins are expressed in the nucleus. The
          function of Red protein is unknown, but efficient
          sequestration to nuclear bodies suggests that its
          expression may be tightly regulated of that the protein
          self-aggregates extremely efficiently.
          Length = 238

 Score = 41.0 bits (96), Expect = 7e-04
 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 9/71 (12%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN--------- 78
          KKKK    +K+E+  EK+   K + + ++++K  N + + ++                  
Sbjct: 2  KKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDAA 61

Query: 79 NNNNNNNNEEK 89
           N NN  +E K
Sbjct: 62 ENRNNTIDESK 72



 Score = 33.3 bits (76), Expect = 0.22
 Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 1/69 (1%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          KK K    +K+E+ A+K+   K + + +E++K   K               +  ++ +  
Sbjct: 2  KKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDAA 61

Query: 80 NNNNNNNEE 88
           N NN  +E
Sbjct: 62 ENRNNTIDE 70


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 42.0 bits (99), Expect = 8e-04
 Identities = 14/71 (19%), Positives = 30/71 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++  KK   +++   +  EK+ E KK +K+ K + K+KK+ +      +          N
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVAN 472

Query: 81  NNNNNNEEKGT 91
                +E+   
Sbjct: 473 KLLKRSEKAQK 483



 Score = 38.9 bits (91), Expect = 0.006
 Identities = 15/56 (26%), Positives = 26/56 (46%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
             K+  KKK   K  K   K  K   + KKKKKK+K    +++  +  ++   + +
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVD 562



 Score = 38.1 bits (89), Expect = 0.011
 Identities = 19/75 (25%), Positives = 40/75 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
                K++  K+K + K +K   K  K   K KKKKKK+K  + +++  +  ++   + +
Sbjct: 503 VGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVD 562

Query: 81  NNNNNNEEKGTFLYK 95
           +  + ++E+  FL+K
Sbjct: 563 DEEDEDDEELPFLFK 577



 Score = 36.6 bits (85), Expect = 0.033
 Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 6/51 (11%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK------KKKKKNN 63
            W     K++KKK K+ ++   K E  KKEK+K KK K      K+ KK  
Sbjct: 620 SWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVIINEKRNKKAA 670



 Score = 36.2 bits (84), Expect = 0.046
 Identities = 16/85 (18%), Positives = 29/85 (34%), Gaps = 9/85 (10%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKE---------KKKKEKKKKKKKKKKKKKNNNNN 66
              + K  K   K  K  +  +KE+++E         K      K  KK+  KKK+++  
Sbjct: 461 DEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKL 520

Query: 67  NNNNNNNNNNNNNNNNNNNNEEKGT 91
           +   N  +            E+   
Sbjct: 521 DKAANKISKAAVKVKKKKKKEKSID 545



 Score = 36.2 bits (84), Expect = 0.055
 Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 6/66 (9%)

Query: 32  EKKAKKKEKKKEKKKK------EKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
           E++ ++  KK   ++K      EK+ + KK KK+ KN       ++      +       
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468

Query: 86  NEEKGT 91
                 
Sbjct: 469 KVANKL 474



 Score = 35.0 bits (81), Expect = 0.11
 Identities = 14/75 (18%), Positives = 29/75 (38%), Gaps = 15/75 (20%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE---------------KKKKKKKKKKKKK 61
                + + K +K   K  K+ EK ++++++E                K  KK+  KKK 
Sbjct: 457 EELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKS 516

Query: 62  NNNNNNNNNNNNNNN 76
           ++  +   N  +   
Sbjct: 517 SSKLDKAANKISKAA 531



 Score = 33.9 bits (78), Expect = 0.23
 Identities = 14/72 (19%), Positives = 31/72 (43%), Gaps = 7/72 (9%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKK-------KKKKKKKKKKNNNNNNNNNNNNNNNN 77
           ++E + +E+  +++ ++  KK   ++K       K+ + KK KK N N       ++   
Sbjct: 398 RRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEE 457

Query: 78  NNNNNNNNNEEK 89
              +      EK
Sbjct: 458 ELEDEEEAKVEK 469



 Score = 33.5 bits (77), Expect = 0.29
 Identities = 16/64 (25%), Positives = 32/64 (50%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           T     K+ KK+  KK+  +K  +   +  K   K KKKKKK+K  + +++  +  ++  
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIK 558

Query: 76  NNNN 79
            + +
Sbjct: 559 LDVD 562



 Score = 32.0 bits (73), Expect = 0.97
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            KK++KK+++K++   K +  KK+K+K+    N
Sbjct: 626 IKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 15/48 (31%), Positives = 29/48 (60%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            P + +K  + +K KKE K + KEKK+  +++E + +++ K +K  N 
Sbjct: 426 GPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANK 473



 Score = 30.4 bits (69), Expect = 3.0
 Identities = 15/84 (17%), Positives = 31/84 (36%), Gaps = 15/84 (17%)

Query: 23  AKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKK--------KKKKKNNNNNN 67
           A+KKE+   E +  ++E        ++E ++  KK   ++K        + + K     N
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443

Query: 68  NNNNNNNNNNNNNNNNNNNEEKGT 91
            N       ++      + EE   
Sbjct: 444 KNEFKEKKESDEEEELEDEEEAKV 467



 Score = 30.4 bits (69), Expect = 3.4
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 34  KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           K +KK++K++++   K +  KK+K+K K   N
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation
          of 60S ribosomal subunits.
          Length = 132

 Score = 39.3 bits (92), Expect = 8e-04
 Identities = 13/39 (33%), Positives = 27/39 (69%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          ++KK  K KK+ K++K EKK +++ + +K++  +K + K
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64



 Score = 35.0 bits (81), Expect = 0.027
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          KK  K +KK K +K +KK K++ + +K +  +K + K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64



 Score = 34.3 bits (79), Expect = 0.050
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          + KK  K +KK + +K EKK K++ + +K++
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQ 56



 Score = 33.9 bits (78), Expect = 0.056
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          + K+    KK+ K EK +K+ K++ + +K++   
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59



 Score = 32.7 bits (75), Expect = 0.17
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          KK  K K+K   +K +KK K++   +K++  +K + K 
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKP 65



 Score = 31.2 bits (71), Expect = 0.53
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           + KK  KAKKK K ++ +KK K++ + +K++  +
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
                 K+K    ++ K+  K KKK + +K +KK K++ +
Sbjct: 11 ASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLR 51



 Score = 27.7 bits (62), Expect = 7.6
 Identities = 13/36 (36%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
          L   K K K +K  EKKAK++ + ++++  EK + K
Sbjct: 30 LLKAKKKLKSEK-LEKKAKRQLRAEKRQALEKGRVK 64


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 38.9 bits (91), Expect = 9e-04
 Identities = 14/54 (25%), Positives = 32/54 (59%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           L   + K+KE++K  +  KKK KK         KK   KK+++ ++++++++++
Sbjct: 61  LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114



 Score = 35.9 bits (83), Expect = 0.013
 Identities = 9/51 (17%), Positives = 35/51 (68%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
            ++++K+++K ++  +KK++K +    +  KK   KK+  +++++++++++
Sbjct: 64  PQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114



 Score = 34.7 bits (80), Expect = 0.033
 Identities = 12/53 (22%), Positives = 33/53 (62%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
            + + K+K+++K+ +  KKK KK +    +  KK   KK   +++++++++++
Sbjct: 62  QQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114



 Score = 31.6 bits (72), Expect = 0.39
 Identities = 10/56 (17%), Positives = 34/56 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           ++ ++ ++K+KE++   +  KK++KK      +  KK   K    +++++++++++
Sbjct: 59  QQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 42.0 bits (99), Expect = 0.001
 Identities = 21/67 (31%), Positives = 34/67 (50%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            ++ K K K K  K  K K KKKEKKKK+    K KK     N+   +++     ++  +N
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215

Query: 81   NNNNNNE 87
              +N++ 
Sbjct: 1216 KKSNSSG 1222



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query: 20   WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
             ++ +  ++++ + K K K  K  K K +KK+KKKKK    K+   +   N+   +++ 
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 17/60 (28%), Positives = 34/60 (56%)

Query: 23   AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
              ++++  KE++ K K K K  K ++ K KKK+KKKKK + + +   +   N+   +++ 
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205



 Score = 38.5 bits (90), Expect = 0.011
 Identities = 19/74 (25%), Positives = 38/74 (51%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
            + ++ E+K+  K+ + K K K K  K +K K KKK+KKKK ++ + +   +   N+   +
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVD 1202

Query: 82   NNNNNEEKGTFLYK 95
            ++   +       K
Sbjct: 1203 SDEKRKLDDKPDNK 1216



 Score = 38.1 bits (89), Expect = 0.014
 Identities = 18/91 (19%), Positives = 37/91 (40%)

Query: 1    MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            ++     +       T     K +K + KKKEKK KK    K KK       K+    +K
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206

Query: 61   KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
            +  ++  +N  +N++ ++  ++     +   
Sbjct: 1207 RKLDDKPDNKKSNSSGSDQEDDEEQKTKPKK 1237



 Score = 35.8 bits (83), Expect = 0.080
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 23   AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
             KKK KK+ E      +KKK+ +KK  +KKK K + K+ + +
Sbjct: 1306 TKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347



 Score = 35.4 bits (82), Expect = 0.091
 Identities = 15/67 (22%), Positives = 38/67 (56%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            +  +++E+ ++++ AK++  K + K K  K +K K KKK+K    ++ + +   +   N+
Sbjct: 1139 EALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198

Query: 81   NNNNNNE 87
               +++E
Sbjct: 1199 KRVDSDE 1205



 Score = 35.4 bits (82), Expect = 0.10
 Identities = 18/62 (29%), Positives = 29/62 (46%)

Query: 2    DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             S TK ++    E +    KK KK EKK   KK  K   K+    +  +  ++ +KKK  
Sbjct: 1303 SSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362

Query: 62   NN 63
            ++
Sbjct: 1363 SS 1364



 Score = 35.0 bits (81), Expect = 0.12
 Identities = 16/72 (22%), Positives = 40/72 (55%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            ++A +++++ +EK+  K+++ K K K +  K +K K KKK+     ++ + +   +   N
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197

Query: 81   NNNNNNEEKGTF 92
            +   +++EK   
Sbjct: 1198 SKRVDSDEKRKL 1209



 Score = 33.9 bits (78), Expect = 0.29
 Identities = 18/59 (30%), Positives = 27/59 (45%)

Query: 4    STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            S +    +     P    K K K++ +    A KK+KK EKK   KKK K + K+   +
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 17/72 (23%), Positives = 36/72 (50%)

Query: 18   LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            LD  +   +E+++ E+K   KE++ + K K K  K +K K KKK      ++ + +   +
Sbjct: 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKAS 1193

Query: 78   NNNNNNNNNEEK 89
               N+   + ++
Sbjct: 1194 VVGNSKRVDSDE 1205



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 15/69 (21%), Positives = 29/69 (42%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            KK KKK+K   +K  K       K+    +K+K   K   K +N++ ++  ++       
Sbjct: 1176 KKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKP 1235

Query: 81   NNNNNNEEK 89
              ++    K
Sbjct: 1236 KKSSVKRLK 1244



 Score = 30.8 bits (70), Expect = 2.3
 Identities = 11/50 (22%), Positives = 17/50 (34%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
                K+   +    +K     K+K KK  +      KKKKK+        
Sbjct: 1286 PPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKK 1335



 Score = 30.8 bits (70), Expect = 2.8
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK---KKKKKKKKKKNNNN 65
            K     KKK +K+ +      +KKKK +KK   KKK K + K+ + +
Sbjct: 1301 KPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347



 Score = 29.6 bits (67), Expect = 6.3
 Identities = 14/57 (24%), Positives = 22/57 (38%)

Query: 3    SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
               +   G+          K + +      KK KK EKK  +KKK K + K+    +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQ 1348


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 41.6 bits (98), Expect = 0.001
 Identities = 11/54 (20%), Positives = 26/54 (48%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
             K +++   K + K ++++ ++K ++NN  + N   +  +N        N EE
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREE 638



 Score = 40.0 bits (94), Expect = 0.004
 Identities = 14/108 (12%), Positives = 29/108 (26%), Gaps = 27/108 (25%)

Query: 9   IGTFTEP-TPLDWKKAKKKEKKKKEKKAKKK------------------------EKKKE 43
           + TF  P  P     A+                                      E+ K 
Sbjct: 529 LATFAMPDVPPAPTPAEPAAPVVAAAPKAAAATPPAQPGLLSRFFGALKALFSGGEETKP 588

Query: 44  KKKKEKKKKKKKKKKKKKNNNNNNNN--NNNNNNNNNNNNNNNNNEEK 89
           +++   K + K ++++ +     NN    N   +  +N       E +
Sbjct: 589 QEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENR 636



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 17/94 (18%), Positives = 29/94 (30%), Gaps = 18/94 (19%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEK-----------KK------EKKKKEKKKKKKKK 56
            PTP     A       K   A    +           K       E K +E+   K + 
Sbjct: 540 APTP-AEPAAPVVAAAPKAAAATPPAQPGLLSRFFGALKALFSGGEETKPQEQPAPKAEA 598

Query: 57  KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           K +++ +      NN  + N   +  +N    +G
Sbjct: 599 KPERQQDRRKPRQNNRRDRNERRDTRDNRTRREG 632



 Score = 38.1 bits (89), Expect = 0.014
 Identities = 9/63 (14%), Positives = 29/63 (46%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           E+ K +++   K + K +++++++K ++  ++ +    +  +N        N   N  N 
Sbjct: 584 EETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNR 643

Query: 87  EEK 89
            + 
Sbjct: 644 RQA 646



 Score = 37.7 bits (88), Expect = 0.016
 Identities = 9/56 (16%), Positives = 26/56 (46%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
            ++ K +E+   K + + ++++ ++K ++ N  + N   +  +N        N  E
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREE 638



 Score = 36.6 bits (85), Expect = 0.041
 Identities = 9/65 (13%), Positives = 31/65 (47%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           + K +++   KA+ K ++++ ++K ++  ++ + +++   +N        N   N  N  
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRR 644

Query: 84  NNNEE 88
              ++
Sbjct: 645 QAQQQ 649


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 16/41 (39%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++    E K + K+ KK+E  K K +KEK K + KK K  K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 37.8 bits (88), Expect = 0.004
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            ++ ++E   + K + KEKKK+E  K K +K+K K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 37.0 bits (86), Expect = 0.008
 Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 3/51 (5%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           T  E T    +  ++   + K +  +KK+++  K K EK+K K + KK K 
Sbjct: 101 TEEESTD---ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 36.3 bits (84), Expect = 0.014
 Identities = 18/44 (40%), Positives = 23/44 (52%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           TE       K + KEKKK+E    K EK+K K + +K K  K K
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 35.5 bits (82), Expect = 0.027
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +   + + + KEKK ++  K K +K+K K + KK K  K K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 33.2 bits (76), Expect = 0.15
 Identities = 9/38 (23%), Positives = 21/38 (55%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            E++  ++  ++   + + + KEKKK++  K K +K  
Sbjct: 101 TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138



 Score = 30.5 bits (69), Expect = 1.3
 Identities = 9/44 (20%), Positives = 23/44 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
             +  +E++ +E+   + E++   + K + K+KKK++  K    
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 40.9 bits (96), Expect = 0.001
 Identities = 11/43 (25%), Positives = 32/43 (74%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +W  AK+ +++KK +K  ++++K+E++K+ ++ + +++  K+K
Sbjct: 91  NWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133



 Score = 38.2 bits (89), Expect = 0.008
 Identities = 11/40 (27%), Positives = 32/40 (80%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           W+  K K++++K ++ ++K++KK+++++E+K+K ++  +K
Sbjct: 187 WELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 5/47 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEK-----KKKEKKKKKKKKKKKKKN 62
           +K K++ +K++E+   ++   KEK     ++K ++  K++  K KK 
Sbjct: 110 EKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKE 156



 Score = 35.9 bits (83), Expect = 0.039
 Identities = 11/41 (26%), Positives = 36/41 (87%), Gaps = 1/41 (2%)

Query: 22  KAKKK-EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +AKK+ ++ + +K  ++++K++E+++K++KK+++++++K+K
Sbjct: 179 EAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219



 Score = 35.5 bits (82), Expect = 0.048
 Identities = 9/37 (24%), Positives = 30/37 (81%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           KK K+++ K++E++++++KK+++++++K+K ++    
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 8/38 (21%), Positives = 30/38 (78%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            +K K++++ +++E++K++KK+++++++K+K ++    
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 32.8 bits (75), Expect = 0.34
 Identities = 10/38 (26%), Positives = 31/38 (81%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK K++++K+ +++ K+++K+++E+++K+K ++  +K
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 32.8 bits (75), Expect = 0.40
 Identities = 10/41 (24%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             + K E+   +++AKK+ ++ E KK +++++K++++++K+
Sbjct: 166 SGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQ 206



 Score = 32.4 bits (74), Expect = 0.46
 Identities = 11/49 (22%), Positives = 33/49 (67%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +P     ++  KK  ++ E K  K++++K ++++ K++KK+++++++K
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217



 Score = 31.6 bits (72), Expect = 0.99
 Identities = 10/38 (26%), Positives = 29/38 (76%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K+E+++++++ K++E+++ K+K E+  +K  K  K++
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQR 234


>gnl|CDD|233172 TIGR00891, 2A0112, putative sialic acid transporter.  [Transport
           and binding proteins, Carbohydrates, organic alcohols,
           and acids].
          Length = 405

 Score = 41.4 bits (97), Expect = 0.001
 Identities = 49/283 (17%), Positives = 94/283 (33%), Gaps = 49/283 (17%)

Query: 105 TYVAEITQPHLRGMLS----ATASMTTIFGTVSQLFLGSF--LHWRSAAILNLLFPILAL 158
            YV E    HLR   S    +  ++  +        +       WR+   +++L  I AL
Sbjct: 123 AYVIESWPKHLRNKASGLLISGYAVGAVVAAQVYSLVVPVWGDGWRALFFISILPIIFAL 182

Query: 159 CALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLGKD 218
                IPE+  W             +  L G     ++   +  +  A+ +S        
Sbjct: 183 WLRKNIPEAEDWKEKHAGKALVRTMVDILYG--GEHRIANIVMTLAAAMVQSA------- 233

Query: 219 GQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLL 278
           G+R P +   +         ++     +        L TY           L P+    +
Sbjct: 234 GKRWPTFVYLV---------VLVLFANLYSHPIQDLLPTYL-----KADLGLSPHTVANI 279

Query: 279 LGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVP 335
           +  + +G  +   V   L  + G+R  A + +  +    I+         + G +  +  
Sbjct: 280 VVFSNIGAIVGGCVFGFLGDWLGRR-KAYVCSLLAGQLLIIPV------FAIGANVAV-- 330

Query: 336 TVFLVLAAFLTHICIR----LLPWMLIGEVFPNNIRATASGAS 374
              L L  F   + ++    +LP   +GE FP + RA   G +
Sbjct: 331 ---LGLGLFFQQMLVQGIWGILP-KHLGEYFPTDQRAAGLGFT 369



 Score = 29.8 bits (67), Expect = 4.5
 Identities = 37/182 (20%), Positives = 61/182 (33%), Gaps = 32/182 (17%)

Query: 273 YFATLLLGVAELGGALLCVVL----IHYTGKRPLALISTGGSAACFIVVAVYAQFHL--- 325
            F+   LG          V L    +           ++  SAA          F L   
Sbjct: 13  AFSAAWLGWLLDAFDFFLVALVLAEVAGEFGLTTVDAASLISAALISRWFGALMFGLWGD 72

Query: 326 SYGWDSPLVPTVFL-VLAAFLTHIC--------IRLLPWMLIG-----------EVFPNN 365
            YG   P+V ++ L                    RL+  + +G           E +P +
Sbjct: 73  RYGRRLPMVTSIVLFSAGTLACGFAPGYITMFIARLVIGIGMGGEYGSSAAYVIESWPKH 132

Query: 366 IRATASG--ASG-SSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPE 422
           +R  ASG   SG +   + A  V  L  P+      W  L+F + + ++  L++   +PE
Sbjct: 133 LRNKASGLLISGYAVGAVVAAQVYSLVVPVWGDG--WRALFFISILPIIFALWLRKNIPE 190

Query: 423 TE 424
            E
Sbjct: 191 AE 192


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 39.6 bits (93), Expect = 0.001
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 24  KKKEKKKKEKKAKKKE---KKKEKKKKEKKKKKKKKKKKK 60
           KK E KK   K K  E    K+  K  EKK+KK   K+KK
Sbjct: 129 KKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168



 Score = 36.5 bits (85), Expect = 0.012
 Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK--KKKKKKK 61
            + K  + K +E++  K+ KK+EK+  ++ KK    KK + KK
Sbjct: 93  SRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKK 135



 Score = 36.5 bits (85), Expect = 0.015
 Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 12/56 (21%)

Query: 21  KKAKKKEK------------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K+ KK+EK            KK E K    +KK ++ KK K+  K  +KK+KKN  
Sbjct: 109 KEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAG 164



 Score = 35.7 bits (83), Expect = 0.023
 Identities = 12/49 (24%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 21  KKAKKKEK-KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           K +++KE+ K+  +  K + K  + K +E++  K+ KK++K+       
Sbjct: 76  KDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124



 Score = 35.3 bits (82), Expect = 0.032
 Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 20  WKKAKKKEKKKKEKKA--KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
            KK K  E+K++ K+     K + K  K K+++++  K+ KK++         
Sbjct: 72  LKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124



 Score = 34.9 bits (81), Expect = 0.048
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK----KEKKKKKKKKKKKKKN 62
           K   ++ +  KE K ++KE  KE KK    K+ + KK   KKK   
Sbjct: 99  KNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDE 144



 Score = 34.6 bits (80), Expect = 0.055
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 13/53 (24%)

Query: 22  KAKKKEKKKKEKKAKKKE-------------KKKEKKKKEKKKKKKKKKKKKK 61
              K  +++  K+ KK+E             KK E KK   KKK  + KK K+
Sbjct: 98  LKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQ 150



 Score = 33.4 bits (77), Expect = 0.16
 Identities = 12/56 (21%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 21  KKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           +KA KK K  +EK + K+  +  + + K  K K ++++  K++             
Sbjct: 69  EKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124



 Score = 30.3 bits (69), Expect = 1.4
 Identities = 14/44 (31%), Positives = 26/44 (59%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           D+++ + +E +K  KK K  E+K+E K+  +  K + K  K K+
Sbjct: 59  DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKD 102



 Score = 28.0 bits (63), Expect = 9.8
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
           L  K  + K+ K+ +K  +KK KK   K+K
Sbjct: 138 LKKKFDELKKSKQLDKALEKKRKKNAGKEK 167


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 41.2 bits (96), Expect = 0.001
 Identities = 19/60 (31%), Positives = 26/60 (43%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           +  +A K   K K KK K  E+++EKK K      K+KK  KK   +N        N   
Sbjct: 517 ETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKK 576



 Score = 40.1 bits (93), Expect = 0.003
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 5/80 (6%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-----KKKKK 56
           D   + Q     E   + + +  + +K   + K KK++  +E+++K+ K      K+KK 
Sbjct: 497 DEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKL 556

Query: 57  KKKKKNNNNNNNNNNNNNNN 76
            KK K +N        N   
Sbjct: 557 YKKMKYSNAKKEEQAENLKK 576



 Score = 37.0 bits (85), Expect = 0.023
 Identities = 16/69 (23%), Positives = 27/69 (39%), Gaps = 5/69 (7%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K   K + KK++   +++EKK +      K+KK  KK K      +N        N    
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMK-----YSNAKKEEQAENLKKK 577

Query: 81  NNNNNNEEK 89
                 ++K
Sbjct: 578 KKQIAKQKK 586



 Score = 31.6 bits (71), Expect = 1.2
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 5/66 (7%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN-----NNNNNNNNNNNNN 83
           K  E     K+  K K KK K  +++++KK K    +N           +N        N
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573

Query: 84  NNNEEK 89
              ++K
Sbjct: 574 LKKKKK 579



 Score = 30.4 bits (68), Expect = 2.6
 Identities = 19/40 (47%), Positives = 25/40 (62%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K KK  KK K   AKK+E+ +  KKK+K+  K+KK   KK
Sbjct: 552 KQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591



 Score = 30.4 bits (68), Expect = 2.7
 Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 21/68 (30%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKK-----KEKK----------KKE------KKKKKKKKK 57
           D  K+K K++K  E++ +KK K      K+KK          KKE      KKKKK+  K
Sbjct: 524 DVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAK 583

Query: 58  KKKKNNNN 65
           +KK ++  
Sbjct: 584 QKKLDSKK 591


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 37.6 bits (88), Expect = 0.001
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
          + + K +K+     ++ +KE KK    K KKKK
Sbjct: 1  RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33



 Score = 33.0 bits (76), Expect = 0.053
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 3/34 (8%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          + K EKK  +  +   K +KE KK    K KKKK
Sbjct: 3  RYKIEKKVAEHHR---KLRKEAKKNPTWKSKKKK 33



 Score = 32.6 bits (75), Expect = 0.061
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 19/62 (30%)

Query: 21 KKAKKKEKKKKEKKAKKK-------------------EKKKEKKKKEKKKKKKKKKKKKK 61
          +K +K+ KK    K+KKK                   E+KK K+++EK+++K+ +K ++ 
Sbjct: 15 RKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERA 74

Query: 62 NN 63
            
Sbjct: 75 EA 76



 Score = 31.5 bits (72), Expect = 0.16
 Identities = 15/53 (28%), Positives = 30/53 (56%), Gaps = 14/53 (26%)

Query: 20 WKKAKKKE---------KKK-----KEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          WK  KKK+         K++     +EKK K++E+K+ +K+  K ++ + +K+
Sbjct: 27 WKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEARKR 79



 Score = 31.5 bits (72), Expect = 0.17
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 20/60 (33%)

Query: 22 KAKKKEKKKKEKK-AKKKEKKKEKKKK-------------------EKKKKKKKKKKKKK 61
          + K  E  +K +K AKK    K KKKK                   EKK+K++++K+++K
Sbjct: 7  EKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRK 66


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 37.9 bits (89), Expect = 0.001
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 5/46 (10%)

Query: 20 WKKAKKK-----EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +KKA         +KKKEKK  K E K+   KK   +++K +  +K
Sbjct: 48 YKKAHAAIRADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 29.8 bits (68), Expect = 0.97
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK   A + +  ++KK+K++ K + K+   KK
Sbjct: 49 KKAHAAIRADPSRKKKEKKEVKAESKRYNAKK 80


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 41.3 bits (97), Expect = 0.001
 Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           T  TP      +KK K KK+  + K +  K  KKK K K KK  KK+ 
Sbjct: 742 TPKTPY-----EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 39.8 bits (93), Expect = 0.004
 Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           +KK K  KK    K K  K  KKK K K KK      
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 39.0 bits (91), Expect = 0.006
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K+ K K+K A  K K  +  KK+ K K KK  KK+ 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 38.3 bits (89), Expect = 0.013
 Identities = 17/37 (45%), Positives = 18/37 (48%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           EKK K KK     K K  K  +KK K K KK  KK  
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 36.3 bits (84), Expect = 0.040
 Identities = 16/35 (45%), Positives = 17/35 (48%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K   EKK K K+K    K K  K  KKK K K K
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 38.0 bits (89), Expect = 0.001
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          KE+K++EK  +++E K+ K  K ++ ++K +K KK
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 34.5 bits (80), Expect = 0.024
 Identities = 8/25 (32%), Positives = 19/25 (76%)

Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKK 61
          K+ K++EK ++E++ K+ K  K+++
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREE 25



 Score = 33.8 bits (78), Expect = 0.037
 Identities = 11/33 (33%), Positives = 23/33 (69%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
          +K ++K ++++E K  K  K++E ++K +K KK
Sbjct: 3  RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 32.6 bits (75), Expect = 0.091
 Identities = 8/32 (25%), Positives = 23/32 (71%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K+ K+ +K ++++E K+ +  K+++ ++K +K
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32



 Score = 30.7 bits (70), Expect = 0.54
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          K+ K++EK ++E++ K+ +  K ++ +EK +K KK
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 29.5 bits (67), Expect = 1.1
 Identities = 6/24 (25%), Positives = 18/24 (75%)

Query: 39 EKKKEKKKKEKKKKKKKKKKKKKN 62
          +++KE++K +++++ K+ K  K+ 
Sbjct: 1  KERKEEEKAQREEELKRLKNLKRE 24


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 41.1 bits (96), Expect = 0.001
 Identities = 12/69 (17%), Positives = 29/69 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
             A+     ++ K  + K K++ KK  E  + K+K + +++N    +    N    +   
Sbjct: 185 NLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQ 244

Query: 81  NNNNNNEEK 89
               + +E+
Sbjct: 245 ELLRDEQEE 253



 Score = 36.1 bits (83), Expect = 0.058
 Identities = 13/69 (18%), Positives = 27/69 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++ K +E K KE+  K  E  + K+K E +++        K N    +       +    
Sbjct: 194 EELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEE 253

Query: 81  NNNNNNEEK 89
             ++  E +
Sbjct: 254 IESSKQELE 262



 Score = 36.1 bits (83), Expect = 0.059
 Identities = 8/64 (12%), Positives = 22/64 (34%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           L     + K ++ K K+  KK  +  + K++ + +++          N    +       
Sbjct: 189 LIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLR 248

Query: 78  NNNN 81
           +   
Sbjct: 249 DEQE 252



 Score = 35.7 bits (82), Expect = 0.071
 Identities = 9/71 (12%), Positives = 28/71 (39%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           +   +  K ++ K +++AKK  +  + K+K + +++          N    +       +
Sbjct: 190 IIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRD 249

Query: 78  NNNNNNNNNEE 88
                 ++ +E
Sbjct: 250 EQEEIESSKQE 260



 Score = 35.3 bits (81), Expect = 0.11
 Identities = 15/69 (21%), Positives = 31/69 (44%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++ +K E   KEK+  ++E+K EK K E++K++K K ++++                   
Sbjct: 763 EEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQ 822

Query: 81  NNNNNNEEK 89
                 E+ 
Sbjct: 823 LLIEQEEKI 831



 Score = 34.9 bits (80), Expect = 0.13
 Identities = 11/51 (21%), Positives = 29/51 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           K  K+ +K+K+E +  +KE K+ + K+E +++++++ +K +          
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376



 Score = 33.8 bits (77), Expect = 0.27
 Identities = 11/66 (16%), Positives = 25/66 (37%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
             +E K +E K K++ KK  +  + K+K + +++     +    N    +       +  
Sbjct: 192 DLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQ 251

Query: 84  NNNEEK 89
              E  
Sbjct: 252 EEIESS 257



 Score = 33.8 bits (77), Expect = 0.28
 Identities = 10/52 (19%), Positives = 26/52 (50%)

Query: 24   KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
            +++  K + KK + +E+KKE  ++  ++  ++ K+  +   + N   N    
Sbjct: 988  EERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039



 Score = 33.4 bits (76), Expect = 0.38
 Identities = 13/55 (23%), Positives = 30/55 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           K+ KK EK+ K++K + +E +KE K+ E K++ +++++++               
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376



 Score = 33.4 bits (76), Expect = 0.39
 Identities = 15/82 (18%), Positives = 30/82 (36%), Gaps = 13/82 (15%)

Query: 21  KKAKKKEKKKKEKKAK--------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN----- 67
           K+  KK  ++ E  A+        K ++ K K++ +K  +  + K+K +    N      
Sbjct: 173 KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDY 232

Query: 68  NNNNNNNNNNNNNNNNNNNEEK 89
              N    +       +  EE 
Sbjct: 233 LKLNEERIDLLQELLRDEQEEI 254



 Score = 33.0 bits (75), Expect = 0.44
 Identities = 13/59 (22%), Positives = 27/59 (45%), Gaps = 3/59 (5%)

Query: 25   KKEKKKKEKKAKKKEKKKEKKKKEKKKKKK---KKKKKKKNNNNNNNNNNNNNNNNNNN 80
              E ++KE++  K E KKE+ ++EKK+  +   ++  ++         + N   N    
Sbjct: 981  IAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039



 Score = 32.6 bits (74), Expect = 0.60
 Identities = 14/41 (34%), Positives = 29/41 (70%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K  +KE KK EK+ KK++++ E+ +KE K+ + K++ +++
Sbjct: 317 LKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357



 Score = 32.6 bits (74), Expect = 0.70
 Identities = 12/42 (28%), Positives = 30/42 (71%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            + +K + ++K K+++K+ KK EK+ K++K++ ++ +K+ K 
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKE 347



 Score = 32.2 bits (73), Expect = 0.77
 Identities = 13/42 (30%), Positives = 28/42 (66%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K   +++ K+ EK+ KK EK+ +K+K+E ++ +K+ K+ +  
Sbjct: 310 KVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIK 351



 Score = 32.2 bits (73), Expect = 0.96
 Identities = 10/59 (16%), Positives = 27/59 (45%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
                ++++++  K   KKE+ +E+KK+  ++  ++  ++ K       + N   N    
Sbjct: 981  IAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039



 Score = 32.2 bits (73), Expect = 0.97
 Identities = 8/68 (11%), Positives = 24/68 (35%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +   K++ KK  +  + KEK + +++        K  +++ +       +      ++  
Sbjct: 200 ELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQ 259

Query: 81  NNNNNNEE 88
                 E 
Sbjct: 260 ELEKEEEI 267



 Score = 30.7 bits (69), Expect = 2.6
 Identities = 9/70 (12%), Positives = 28/70 (40%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
                   + +++++   K++ K+++ +E+KK+  ++  ++             + N   N
Sbjct: 976  VNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLN 1035

Query: 81   NNNNNNEEKG 90
                  E  G
Sbjct: 1036 KVFFYLELGG 1045



 Score = 28.8 bits (64), Expect = 8.8
 Identities = 15/38 (39%), Positives = 25/38 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            +  K E++K + + K KE +KE KK EK+ KK+K++ 
Sbjct: 301 SELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEI 338


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 23/49 (46%), Positives = 32/49 (65%), Gaps = 5/49 (10%)

Query: 18  LDW--KKAKKKEKKK---KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L W  KK ++ +KK+   KE +A + +KK    ++EKKKKKKK  KKKK
Sbjct: 146 LAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKK 194



 Score = 35.7 bits (83), Expect = 0.035
 Identities = 19/45 (42%), Positives = 27/45 (60%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           LD K+A+    KKK   A++++KKK+KK  +KKK KK      K 
Sbjct: 161 LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 40.9 bits (96), Expect = 0.002
 Identities = 13/40 (32%), Positives = 30/40 (75%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +++ E K KE+K K++E +K++K++  + K+K++K ++K
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 15/36 (41%), Positives = 29/36 (80%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           +AK KE+K K+++ +KK+K++  + KEK++K ++KK
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 38.2 bits (89), Expect = 0.010
 Identities = 11/41 (26%), Positives = 26/41 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K      ++ + K  ++K K++E +KK+K++  + K+K++K
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429



 Score = 36.7 bits (85), Expect = 0.026
 Identities = 13/39 (33%), Positives = 31/39 (79%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           ++A+ K K++K K+ + ++K+KE+  ++K+K++K ++KK
Sbjct: 396 EEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 36.3 bits (84), Expect = 0.036
 Identities = 13/41 (31%), Positives = 27/41 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +    +E + K K+ K K+++ EKK+KE+  + K+K++K +
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 34.0 bits (78), Expect = 0.20
 Identities = 11/44 (25%), Positives = 27/44 (61%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
              K  + K++    ++A+ K K+++ K++E +KK+K++  + K
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424



 Score = 33.6 bits (77), Expect = 0.28
 Identities = 13/41 (31%), Positives = 27/41 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            + K +    +E +AK KE+K ++++ EKK+K++  + K+K
Sbjct: 386 GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEK 426



 Score = 30.9 bits (70), Expect = 2.0
 Identities = 12/39 (30%), Positives = 26/39 (66%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK  + K E  A ++ + K K++K K+++ +KK+K++ +
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQAD 421


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          K E    KK+KKK+KKK E      K KK    +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAED 50



 Score = 33.5 bits (77), Expect = 0.035
 Identities = 13/40 (32%), Positives = 17/40 (42%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          K E    +KK KKK+KK E      K KK   +   +   
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 32.3 bits (74), Expect = 0.091
 Identities = 12/34 (35%), Positives = 16/34 (47%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          K      KKK+KKKK+K +      K KK    +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAED 50



 Score = 31.6 bits (72), Expect = 0.16
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           KK+KKKK+KK +      K K     +
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAED 50



 Score = 31.6 bits (72), Expect = 0.16
 Identities = 12/37 (32%), Positives = 16/37 (43%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          K      KKKK+KK KK E      K +K   +   +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
          T  D  K KKK+KKKK +      K K+   ++  +
Sbjct: 18 TEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
          Prothymosin alpha and parathymosin are two ubiquitous
          small acidic nuclear proteins that are thought to be
          involved in cell cycle progression, proliferation, and
          cell differentiation.
          Length = 106

 Score = 37.6 bits (87), Expect = 0.002
 Identities = 14/54 (25%), Positives = 24/54 (44%)

Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K +   E   K+ K+KK+  ++K+   N   N N N  N     ++    EE+
Sbjct: 3  TKVDAAAELSAKDLKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEE 56



 Score = 36.9 bits (85), Expect = 0.004
 Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 5/62 (8%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
             K   A +   K  K+KKE  ++K+  K     N   N N N  N     ++     E
Sbjct: 1  SDTKVDAAAELSAKDLKEKKEVVEEKENGK-----NAPANGNENEENGAQEGDDEMEEEE 55

Query: 88 EK 89
          E 
Sbjct: 56 EV 57


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 40.8 bits (95), Expect = 0.002
 Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE-----EKGT 91
           +E+          +   K    NNN NN ++NNNNNNNNNN  NN      E+GT
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINNAGSYIIEQGT 709



 Score = 40.4 bits (94), Expect = 0.003
 Identities = 18/33 (54%), Positives = 24/33 (72%)

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
           K+N  NNN NN ++NNNNNNNNNN     G+++
Sbjct: 672 KDNKENNNKNNKDDNNNNNNNNNNKINNAGSYI 704



 Score = 36.2 bits (83), Expect = 0.053
 Identities = 18/45 (40%), Positives = 24/45 (53%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           +E+          +   K  K+  N NN ++NNNNNNNNNN  NN
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINN 699



 Score = 30.8 bits (69), Expect = 2.5
 Identities = 16/45 (35%), Positives = 22/45 (48%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           +E+ +        +   +  K+   K  K  NNNNNNNNNN  NN
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINN 699


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 38.8 bits (91), Expect = 0.002
 Identities = 11/42 (26%), Positives = 29/42 (69%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KK K++E++++ +  +  E+++  +  EK+  K K++K+++N
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139



 Score = 38.0 bits (89), Expect = 0.003
 Identities = 10/39 (25%), Positives = 27/39 (69%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           ++ + +E  ++E+  +  EK+  K K+EK+++ ++K+K+
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 36.5 bits (85), Expect = 0.010
 Identities = 11/48 (22%), Positives = 31/48 (64%), Gaps = 4/48 (8%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKE----KKKEKKKKEKKKKKKKKKKKKK 61
           LD K+ +++E+++ E +   +E    +  EK+  + K++K+++ ++K+
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143



 Score = 33.8 bits (78), Expect = 0.081
 Identities = 10/41 (24%), Positives = 24/41 (58%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +   +E++  E   K+  K K +K++E ++K+K+  K++  
Sbjct: 112 EELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 33.0 bits (76), Expect = 0.16
 Identities = 10/44 (22%), Positives = 27/44 (61%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
              K+KE++++E+   ++  ++E+  +  +K+  K K++K+  N
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139



 Score = 31.5 bits (72), Expect = 0.53
 Identities = 11/49 (22%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E   LD     ++E+  +  + +  + K+EK+++ ++K+K+  K++ K
Sbjct: 109 VEVEELD-----EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 31.5 bits (72), Expect = 0.58
 Identities = 11/44 (25%), Positives = 28/44 (63%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
             KK++++++E++ + +E  +E++  E  +K+  K K++K   N
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 15/41 (36%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK + K K  KE   KKK+K     K  K    + KKK ++
Sbjct: 86  KKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSER 126



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K  ++E++ K+KK  K +  KE  KK+KKK     K  K  
Sbjct: 75  EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAA 116



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK K+ + K  ++  KKK+KK     K  K    + KKK +
Sbjct: 85  KKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125



 Score = 37.0 bits (86), Expect = 0.013
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +KE +++E+  KKK  K +  K+  KKKKKK     K
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAK 111



 Score = 37.0 bits (86), Expect = 0.014
 Identities = 14/43 (32%), Positives = 20/43 (46%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            K+ K K  K+  KK KKK+    K  K    + KKK ++   
Sbjct: 87  KKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISW 129



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 4/45 (8%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKE----KKKKKKKKKKKKK 61
           ++ ++ +KKK+ K    KE  K+KKKK+    K  K    + KKK
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123



 Score = 35.8 bits (83), Expect = 0.040
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           E+ +KE + +++ KKK++ K +  K+  KKKKKK   
Sbjct: 72  EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108



 Score = 33.5 bits (77), Expect = 0.18
 Identities = 11/39 (28%), Positives = 17/39 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K  K+  KKKK+K     +  K    + KKK ++     
Sbjct: 93  KAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131



 Score = 33.1 bits (76), Expect = 0.30
 Identities = 16/57 (28%), Positives = 26/57 (45%)

Query: 6   KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K        PT LD  + K       + K    E+ KE++ + KK + K +K+K+K 
Sbjct: 122 KKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKK 178



 Score = 31.6 bits (72), Expect = 0.91
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           + K+    + KE++  +KK + K +K+K+KKK+K
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 10/42 (23%), Positives = 16/42 (38%), Gaps = 1/42 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +  KKK KKK    AK  +    + KK+ ++           
Sbjct: 97  EPTKKK-KKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 11/42 (26%), Positives = 18/42 (42%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           K   K+KKKK+  A K  K    + K+K ++         + 
Sbjct: 96  KEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 10/35 (28%), Positives = 22/35 (62%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
              K+   ++ K+ + + KK + K +++K+KKK+K
Sbjct: 147 VQNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 27/68 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKK---------------------------KKEKKKKK 53
           K+ + + KK + K  K+KEKKKEK+                           ++EKKK K
Sbjct: 158 KEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEEKKKAK 217

Query: 54  KKKKKKKK 61
            +  KK++
Sbjct: 218 IQALKKRR 225



 Score = 28.5 bits (64), Expect = 9.2
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
            K    E+ K+ +  +KK + K +K+KEKKK+K
Sbjct: 149 NKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 38.3 bits (90), Expect = 0.002
 Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 4   STKHQIGTFTEPTPLD--WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           STK  I     P   D  +K+ KKK+KKKKE +  +++KKK    +E+K+ +K 
Sbjct: 63  STKVDISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 35.2 bits (82), Expect = 0.029
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           K KKK+KKKK++    +EKKK+    E++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 35.2 bits (82), Expect = 0.031
 Identities = 14/35 (40%), Positives = 25/35 (71%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+KK+KK KKKE +  ++KK+K    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 33.7 bits (78), Expect = 0.100
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K++KKKK+KK + +  +++KKK    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 32.9 bits (76), Expect = 0.17
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K KKKK+KK K+ E  +EKKKK    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 31.4 bits (72), Expect = 0.62
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           K+K+KKKK+KK  +  ++K+KK    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
          modification].
          Length = 429

 Score = 40.4 bits (94), Expect = 0.002
 Identities = 11/58 (18%), Positives = 21/58 (36%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          K++K+    + K A     K E       ++ ++KKK    N+    N   +      
Sbjct: 5  KSRKRSGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKE 62



 Score = 33.4 bits (76), Expect = 0.31
 Identities = 9/55 (16%), Positives = 16/55 (29%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
          K++   + K       K E       +  ++KKK      K   N   +      
Sbjct: 8  KRSGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKE 62


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 40.2 bits (93), Expect = 0.002
 Identities = 23/85 (27%), Positives = 33/85 (38%), Gaps = 7/85 (8%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---- 69
           E T  +    +KKE   +++ A K   +  K            K     N N NN     
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTST 332

Query: 70  ---NNNNNNNNNNNNNNNNNEEKGT 91
              N N N N+N N N+N N  +G+
Sbjct: 333 PSKNTNTNTNSNTNTNSNTNANQGS 357



 Score = 34.4 bits (78), Expect = 0.14
 Identities = 20/89 (22%), Positives = 31/89 (34%), Gaps = 4/89 (4%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           ++ K +  T  +  P    KA  +  K     +      K                  KN
Sbjct: 281 TTEKKETTTQQQTAP----KAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKN 336

Query: 63  NNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
            N N N+N N N+N N N  ++NN    +
Sbjct: 337 TNTNTNSNTNTNSNTNANQGSSNNNSNSS 365



 Score = 30.5 bits (68), Expect = 2.3
 Identities = 17/66 (25%), Positives = 26/66 (39%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
           KE         +K++   +++   K   +  K     + N N N  N N N N NN N +
Sbjct: 272 KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTS 331

Query: 86  NEEKGT 91
              K T
Sbjct: 332 TPSKNT 337



 Score = 30.2 bits (67), Expect = 3.2
 Identities = 19/69 (27%), Positives = 27/69 (39%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           P   K+A    K+         EKK+   +++   K   +  K     + N N N  N N
Sbjct: 261 PAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTN 320

Query: 77  NNNNNNNNN 85
            N N NN N
Sbjct: 321 TNTNTNNTN 329


>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 169

 Score = 38.3 bits (89), Expect = 0.003
 Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 9/93 (9%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
            + K+ ++   KA KK ++K +++ + + + +K++KK+K   + + ++  N N+ N   N
Sbjct: 2   KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61

Query: 83  NNNNEEKGTFLYKMAAPLVLVLTYVAE-ITQPH 114
              +   G+   K   PL      V E +T+ H
Sbjct: 62  QKKDPRIGS---KKPIPLG-----VTEKVTKQH 86



 Score = 32.9 bits (75), Expect = 0.23
 Identities = 15/60 (25%), Positives = 32/60 (53%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
          K  KK ++ + K  KK  +K  ++   + + +K++KK+K   + + ++  N N+ N   N
Sbjct: 2  KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61



 Score = 32.5 bits (74), Expect = 0.33
 Identities = 14/58 (24%), Positives = 32/58 (55%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           K  ++ + K +KKA++K +++   +   +K++KK+K     + ++  N N+ N   N
Sbjct: 4  SKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61



 Score = 31.7 bits (72), Expect = 0.59
 Identities = 15/60 (25%), Positives = 33/60 (55%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          K++KK  + + +   K + K +E+   E + +K++KK+K   + + ++  N N+ N   N
Sbjct: 2  KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 39.5 bits (92), Expect = 0.003
 Identities = 21/43 (48%), Positives = 21/43 (48%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKAKK  KK  KK  KA KK  KK  K   K  K   K  K K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292



 Score = 38.3 bits (89), Expect = 0.007
 Identities = 20/47 (42%), Positives = 21/47 (44%), Gaps = 4/47 (8%)

Query: 21  KKAKK--KEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKKNN 63
           K A K  K+ KK  KKA KK  K  KK  KK  K   K  K   K  
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289



 Score = 37.9 bits (88), Expect = 0.009
 Identities = 19/47 (40%), Positives = 20/47 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            KA KK  KK  K A K  K   K  K K K KKK  KK    +   
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310



 Score = 37.9 bits (88), Expect = 0.010
 Identities = 20/50 (40%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 21  KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           KKA K  KK  KK  KA  K  K   K  + K K KKK  KK    +   
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310



 Score = 37.6 bits (87), Expect = 0.012
 Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKK 61
                K+  K   KA KK KK  KK  KK  K  KK  KK  K
Sbjct: 234 TAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 37.2 bits (86), Expect = 0.016
 Identities = 17/43 (39%), Positives = 20/43 (46%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKKK 61
           KKA  K K   +K +KKK  +     KK  K   K  KK KK 
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255



 Score = 37.2 bits (86), Expect = 0.018
 Identities = 18/50 (36%), Positives = 21/50 (42%), Gaps = 2/50 (4%)

Query: 13  TEPTPLDWKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            +      K +KKK  K     KKA K   K  KK K+  KK  KK  K 
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKA 266



 Score = 36.8 bits (85), Expect = 0.025
 Identities = 19/41 (46%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA KK  K  +K AKK  K   K  K   K  K K K KK
Sbjct: 257 KKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKK 297



 Score = 36.8 bits (85), Expect = 0.025
 Identities = 19/43 (44%), Positives = 20/43 (46%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K AKK  KK  K  KKA KK  K   K  +   K  K K K K
Sbjct: 254 KTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296



 Score = 36.4 bits (84), Expect = 0.029
 Identities = 15/41 (36%), Positives = 16/41 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA K    K   K    + K   KK  KKK  K     KK
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKK 240



 Score = 36.4 bits (84), Expect = 0.035
 Identities = 16/40 (40%), Positives = 16/40 (40%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
             A K   KK   KAK   KK  KKK  K     KK  K 
Sbjct: 205 TPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKT 244



 Score = 36.4 bits (84), Expect = 0.035
 Identities = 16/41 (39%), Positives = 17/41 (41%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK  KK  KK  K  KK  KK  K   +  K   K  K K 
Sbjct: 253 KKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKA 293



 Score = 36.0 bits (83), Expect = 0.040
 Identities = 17/40 (42%), Positives = 18/40 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K AK     KK  K   K  KK KK  +K  KK  K  KK
Sbjct: 230 KAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269



 Score = 35.6 bits (82), Expect = 0.049
 Identities = 14/47 (29%), Positives = 16/47 (34%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +       KK  K     K   K   K  KK +K  KK  KK  K 
Sbjct: 220 KSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKA 266



 Score = 35.6 bits (82), Expect = 0.052
 Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 2/41 (4%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKK 61
                 KK  K A K  KK +K  K+  KK  K  KK  KK
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK 273



 Score = 35.6 bits (82), Expect = 0.052
 Identities = 17/43 (39%), Positives = 19/43 (44%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            KAK   KK  +KKA K     +K  K   K  KK KK  K  
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259



 Score = 35.2 bits (81), Expect = 0.067
 Identities = 16/46 (34%), Positives = 17/46 (36%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           TP     AKK   K K    K  +KK  K     KK  K   K  K
Sbjct: 205 TPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAK 250



 Score = 35.2 bits (81), Expect = 0.075
 Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 21  KKAKKKEKKKKEKKAKKKEKK--KEKKKKEKKKKKKKKKKKKK 61
            K   K   K  KKAKK  KK  K+  K  KK  KK  K   K
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280



 Score = 34.9 bits (80), Expect = 0.087
 Identities = 16/50 (32%), Positives = 18/50 (36%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           K   K  KK  KKA K   K  K   +  K K K KKK        +   
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310



 Score = 34.5 bits (79), Expect = 0.12
 Identities = 15/41 (36%), Positives = 18/41 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA K   K  +  AK  + K + KKK  KK     K K  
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKAT 312



 Score = 32.9 bits (75), Expect = 0.38
 Identities = 18/40 (45%), Positives = 21/40 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KAKKK  KK    +K K   K  K+  K KK KK  KK+
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKR 332



 Score = 32.6 bits (74), Expect = 0.49
 Identities = 14/41 (34%), Positives = 14/41 (34%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K    K   KK     K   KK  KKK  K     KK  K 
Sbjct: 204 KTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKT 244



 Score = 32.6 bits (74), Expect = 0.55
 Identities = 18/41 (43%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KAK K+K  K+  A  K K   K  K   K KK KK  KK
Sbjct: 291 GKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331



 Score = 32.6 bits (74), Expect = 0.57
 Identities = 14/38 (36%), Positives = 15/38 (39%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           AKK  K    K   KK   K K   +K  KKK  K   
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV 236



 Score = 30.6 bits (69), Expect = 2.0
 Identities = 14/40 (35%), Positives = 16/40 (40%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KA  K  K   K  K K K K+K  K+     K K   K
Sbjct: 275 AKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314



 Score = 29.5 bits (66), Expect = 4.5
 Identities = 17/42 (40%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 21  KKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K A K  K   K  K K K KKK  KK     K K   K  K
Sbjct: 276 KAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPK 317



 Score = 29.1 bits (65), Expect = 5.6
 Identities = 14/41 (34%), Positives = 18/41 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KA K   K  + KAK K+K  +K     K K   K  K+ 
Sbjct: 279 AKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRG 319



 Score = 29.1 bits (65), Expect = 5.6
 Identities = 12/40 (30%), Positives = 14/40 (35%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            A  K+  K        +K   K K   KK  KKK  K  
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTA 235


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K  K   K K E+  K++  +  +KKKEKK K +++  + 
Sbjct: 356 KPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 37.8 bits (88), Expect = 0.014
 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 10/61 (16%)

Query: 13  TEPTPLDWK----------KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            EP  +  +          K K  +   K K  + K+++  + +++KK+KK K +++   
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394

Query: 63  N 63
           N
Sbjct: 395 N 395


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
          is an essential protein in yeast that is involved in
          pre-rRNA processing and ribosome assembly. It is
          speculated to be required for correct assembly of rpS27
          into the pre-ribosomal particle.
          Length = 131

 Score = 37.6 bits (88), Expect = 0.003
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          W    +  +K+K   ++ K  ++ +K KEKKKKKKK+ +
Sbjct: 52 WTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90



 Score = 37.2 bits (87), Expect = 0.004
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 13/53 (24%)

Query: 22  KAKKKEKKKKEKKAKK-------KEKKKE------KKKKEKKKKKKKKKKKKK 61
           + K KEKKKK+KK  +       +EKKKE      KK +E KK+ ++ K  +K
Sbjct: 75  RRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKRIEQLKAARK 127



 Score = 31.0 bits (71), Expect = 0.58
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +  +K K    + +  ++ +K K+KKKKKKK+
Sbjct: 57 RGGRKRKAGASRNKAAEERRKLKEKKKKKKKE 88



 Score = 27.6 bits (62), Expect = 9.0
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  +K++    + K  ++++K K+KKKKK
Sbjct: 56 TRGGRKRKAGASRNKAAEERRKLKEKKKKK 85


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 39.7 bits (93), Expect = 0.004
 Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 15/85 (17%)

Query: 21  KKAKKKEKKKKEKKAKK---------------KEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            + K++E KKK K                      +  K KK KKKKKKKKK++K  + +
Sbjct: 236 AEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDED 295

Query: 66  NNNNNNNNNNNNNNNNNNNNNEEKG 90
                     ++++ +  +  EE  
Sbjct: 296 ELEPEAEGLGSSDSGSRKDVEEENA 320



 Score = 37.0 bits (86), Expect = 0.026
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 17/68 (25%)

Query: 21  KKAKKK-EKKKKEKKAK----------------KKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            KA++K E+  K    K                KK KK++KKK+ ++KK     +K+K  
Sbjct: 63  AKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKER 122

Query: 64  NNNNNNNN 71
                + +
Sbjct: 123 AAEYTSED 130



 Score = 32.0 bits (73), Expect = 0.85
 Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 9/51 (17%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEK---------KKKKKKKKKKKKNNN 64
             +K++++ +KK K      +    E          +  K KK KKKK   
Sbjct: 235 EAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKK 285



 Score = 31.7 bits (72), Expect = 1.2
 Identities = 19/107 (17%), Positives = 36/107 (33%), Gaps = 38/107 (35%)

Query: 2   DSSTKHQIGTFTEPTPLDWKK------AKKKEKKKKEKKAKKKEKKKEKKK--------- 46
            S+ + +     E     WKK      AK+K ++ +EK AK +EK++   K         
Sbjct: 28  GSTKESRDAAAYEN----WKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGE 83

Query: 47  -------------------KEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
                              K+K+ ++KK     +          + +
Sbjct: 84  DDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSED 130


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 37.7 bits (88), Expect = 0.004
 Identities = 12/38 (31%), Positives = 24/38 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            +  ++EK +  K+A K  +++E +K +K+ KK KK +
Sbjct: 88  VELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 36.2 bits (84), Expect = 0.012
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
              E  ++EK   +KE  K  +++E +K +K+ KK KK     
Sbjct: 86  HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 8/51 (15%), Positives = 23/51 (45%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
                 P++ +       +  E++  +  K+  K  ++++ +K +K+ KK 
Sbjct: 71  DIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKS 121



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 11/45 (24%), Positives = 20/45 (44%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK 49
           T +     TE    + +K   K  +++E +  +K  KK KK +  
Sbjct: 83  TVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 39.9 bits (93), Expect = 0.004
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
             K A + ++K KEK    KE K E    E + ++   K++++   +     N
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILN 842


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 39.7 bits (92), Expect = 0.004
 Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K  +K +    EKK+KK  KKKEKK+KEK++ K KKK+ + 
Sbjct: 185 KSPEKGDVPAVEKKSKKP-KKKEKKEKEKERDKDKKKEVEG 224



 Score = 39.3 bits (91), Expect = 0.006
 Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
              P   KK+KK   KKKEKK K+KE+ K+KKK+ +  K
Sbjct: 190 GDVPAVEKKSKKP--KKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 38.9 bits (90), Expect = 0.006
 Identities = 16/31 (51%), Positives = 25/31 (80%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +KK KK KKKEKK+++K+++K KKK+ +  K
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 38.1 bits (88), Expect = 0.011
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           ++S  + +      +  D  K  + E+ KK  K KKK+++KEK++K+KKKK    +
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 38.1 bits (88), Expect = 0.012
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           P   D    +KK KK K+K+ K+KEK+++K KK++ +  K
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 37.4 bits (86), Expect = 0.018
 Identities = 17/39 (43%), Positives = 25/39 (64%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K  EK       KK +K K+K+KKEK+K++ K KKK+ 
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222



 Score = 37.0 bits (85), Expect = 0.028
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           T +   P       K  + ++ KK+ K +KKK++K+KE+KKKKKK    +
Sbjct: 253 TVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 36.2 bits (83), Expect = 0.047
 Identities = 9/40 (22%), Positives = 28/40 (70%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           P+  +  K +E+++  ++ +K +++K+K++KEK+ +++  
Sbjct: 76  PMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 35.8 bits (82), Expect = 0.058
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  K  +K      +KK KK +KK+K++K+K++ K KKK+
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221



 Score = 34.7 bits (79), Expect = 0.15
 Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKK-KEKKKKEKKKKKKKKKKKKK 61
           +E  P+   +  +  K  ++      EKK K+ KKKEKK+K+K++ K KK
Sbjct: 170 SEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKK 219



 Score = 34.3 bits (78), Expect = 0.16
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           S      GT  +  P + K A+ +E KK  K       KK+K++KEK++KKKKKK    
Sbjct: 249 SLANTVSGTAPDSEPDEPKDAEAEETKKSPKH------KKKKQRKEKEEKKKKKKHHHH 301



 Score = 33.5 bits (76), Expect = 0.33
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K +E+++ +++ EK K++KKK++K+K
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEK 108



 Score = 33.1 bits (75), Expect = 0.45
 Identities = 10/34 (29%), Positives = 27/34 (79%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K ++ ++  ++ EK K+EKKK++K+K+ ++++++
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116



 Score = 32.3 bits (73), Expect = 0.78
 Identities = 7/34 (20%), Positives = 27/34 (79%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           K E++ + +++ ++ K+++KK++K+K+ +++ ++
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116



 Score = 32.0 bits (72), Expect = 0.96
 Identities = 7/33 (21%), Positives = 26/33 (78%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K ++E++ +++ +K +++KK+++K+K+ +++  
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 31.6 bits (71), Expect = 1.3
 Identities = 14/42 (33%), Positives = 26/42 (61%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K +  E  K  +K      +K+ KK +KK+KK+K+K++ K+
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKD 217



 Score = 31.2 bits (70), Expect = 1.8
 Identities = 8/34 (23%), Positives = 26/34 (76%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K +E++  ++  +K+K++K+K++K+K+ +++  +
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116



 Score = 30.8 bits (69), Expect = 2.1
 Identities = 10/34 (29%), Positives = 26/34 (76%), Gaps = 1/34 (2%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           K E++++ ++  +K+K++ KKK+EK+K+ +++  
Sbjct: 83  KLEEERRHRQRLEKDKRE-KKKREKEKRGRRRHH 115



 Score = 30.4 bits (68), Expect = 2.8
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
           +  ++ EK K+EKK ++KEK+  ++   
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRHHS 116



 Score = 29.3 bits (65), Expect = 6.7
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK--KKKKKKN 62
                 +  + K A+ +E KK  K K+KK++K+K  KKKKKK+
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKH 298


>gnl|CDD|129965 TIGR00887, 2A0109, phosphate:H+ symporter.  This model represents
           the phosphate uptake symporter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Anions].
          Length = 502

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 41/159 (25%), Positives = 63/159 (39%), Gaps = 48/159 (30%)

Query: 286 GALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
             L  V+   Y        +ST G  A +    V AQF  ++G ++    T F+V     
Sbjct: 376 TVLFFVLGFAYNH------LSTHGFLAIY----VLAQFFANFGPNA----TTFIV----- 416

Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSYI------FAF-------AVNKLYYPM 392
                   P    GEVFP   R+TA G S +S         F F          K Y   
Sbjct: 417 --------P----GEVFPTRYRSTAHGISAASGKAGAIIGQFGFLYLAQHGDPTKGYPTG 464

Query: 393 LDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDI 431
           +   H+   L  +A    +G L+   ++PET+G++L ++
Sbjct: 465 IWMGHV---LEIFALFMFLGILFT-LLIPETKGKSLEEL 499


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.6 bits (92), Expect = 0.004
 Identities = 19/63 (30%), Positives = 29/63 (46%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           DSS         E        AK + ++ ++ +  ++EK +E+    KK KK KK K KK
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348

Query: 62  NNN 64
           N  
Sbjct: 349 NGL 351



 Score = 37.6 bits (87), Expect = 0.017
 Identities = 16/49 (32%), Positives = 24/49 (48%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            L  +   K E ++ E   + +E+K E++    KK KK KK K K N  
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351



 Score = 36.5 bits (84), Expect = 0.040
 Identities = 14/77 (18%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKK-KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
            P   + ++ +  E+ ++EK  ++    KK KK K+ K KK    K   ++ +++++++ 
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDI 367

Query: 73  NNNNNNNNNNNNNNEEK 89
           +  ++ +       +E 
Sbjct: 368 DGEDSVSLVTAKKQKEP 384



 Score = 35.3 bits (81), Expect = 0.075
 Identities = 15/75 (20%), Positives = 40/75 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K + +   K + ++ +  E+ +E+K +E+    KK KK KK     N  + +++++ +++
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDS 362

Query: 81  NNNNNNEEKGTFLYK 95
           ++++ + E    L  
Sbjct: 363 DDSDIDGEDSVSLVT 377



 Score = 32.6 bits (74), Expect = 0.58
 Identities = 14/50 (28%), Positives = 22/50 (44%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           D    +   K + E+    +E ++EK ++E    KK KK KK     N  
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351



 Score = 31.5 bits (71), Expect = 1.4
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
           D  + K K+KKKK  K KKK    +K K+ 
Sbjct: 235 DGDEEKSKKKKKKLAKNKKKLDDDKKGKRG 264



 Score = 31.1 bits (70), Expect = 1.6
 Identities = 17/77 (22%), Positives = 29/77 (37%), Gaps = 24/77 (31%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK------------------------KKKK 56
            + +K E++    K  KK KK + KK    K                         KK+K
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQK 382

Query: 57  KKKKKNNNNNNNNNNNN 73
           + KK+   ++N ++  N
Sbjct: 383 EPKKEEPVDSNPSSPGN 399



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           D     + +K  ++   +K +KKK+K  K KKK    KK K+
Sbjct: 222 DEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 28.8 bits (64), Expect = 9.1
 Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 12/75 (16%)

Query: 1   MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKE-----------KKKEKKKKEK 49
           M  + K+    F +      +  K    + K+ K K  E           K  E   +EK
Sbjct: 182 MMKAAKNGPAAFGDEDEET-EGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEK 240

Query: 50  KKKKKKKKKKKKNNN 64
            KKKKKK  K K   
Sbjct: 241 SKKKKKKLAKNKKKL 255



 Score = 28.8 bits (64), Expect = 9.8
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKK 61
            +EK K+KKKK  K KKK    KK 
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKG 261


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 38.0 bits (89), Expect = 0.004
 Identities = 18/59 (30%), Positives = 35/59 (59%), Gaps = 17/59 (28%)

Query: 19 DWKKAKK--KEKKKKEKKAKK-------------KEKKKEKKKKEKK--KKKKKKKKKK 60
          D KKAKK  KEK+K+ K+A+K             +E K EK +++++  ++++ + ++K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71



 Score = 33.7 bits (78), Expect = 0.11
 Identities = 11/53 (20%), Positives = 31/53 (58%), Gaps = 12/53 (22%)

Query: 21 KKAKKKEKKKKEKKAKK------------KEKKKEKKKKEKKKKKKKKKKKKK 61
          KKAKK+++K++++  K             +E K EK +++++  ++++ + ++
Sbjct: 18 KKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQ 70



 Score = 33.7 bits (78), Expect = 0.12
 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 14/48 (29%)

Query: 29 KKKEKKAKKKEKKKEKKKK--------------EKKKKKKKKKKKKKN 62
          KKK KKAKK+++K+ K+ +              E+ K +K ++ ++ N
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELN 61



 Score = 29.5 bits (67), Expect = 3.3
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKK 58
            KK+ KK KK+K K++K+ +K  
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGA 35



 Score = 28.7 bits (65), Expect = 6.2
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 43 EKKKKEKKKKKKKKKKKKKNN 63
          +KK K+ KK+K+K++K+ +  
Sbjct: 14 KKKAKKAKKEKRKQRKQARKG 34


>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
           (DUF912).  This family consists of several
           Nucleopolyhedrovirus proteins of unknown function.
          Length = 101

 Score = 36.5 bits (85), Expect = 0.004
 Identities = 11/50 (22%), Positives = 24/50 (48%)

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEIT 111
           N++N++NN   N NN  +    N+   + +   K  A  ++++  +A   
Sbjct: 24  NDDNDDNNAPINANNTQSVYPKNDITNEDSNASKANAGNIILIGLLAFFC 73


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 38.2 bits (89), Expect = 0.004
 Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 4/43 (9%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK----EKKKKEKKKKKKKKKKK 59
           K+ K   KK K  KAKK  KK     EKK  E + +   +KK 
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152



 Score = 34.4 bits (79), Expect = 0.075
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           ++ KK K  + KK  KKA  + +KK  + + +   +KK 
Sbjct: 114 VNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 20/55 (36%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           G FTE       +A+ K +  KE+K  K   KK+K  K KK  KK   + +K  N
Sbjct: 92  GAFTE------AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVN 140


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel
          repeat in Pneumocystis carinii Major surface
          glycoprotein (MSG) some members of the alignment have
          up to nine repeats of this family, the repeats
          containing several conserved cysteines. The MSG of P.
          carinii is an important protein in host-pathogen
          interactions. Surface glycoprotein A from Pneumocystis
          carinii is a main target for the host immune system,
          this protein is implicated in the attachment of
          Pneumocystis carinii to the host alveolar epithelial
          cells, alveolar macrophages, host surfactant and
          possibly accounts in part for the hypoxia seen in
          Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 35.9 bits (83), Expect = 0.004
 Identities = 18/65 (27%), Positives = 27/65 (41%)

Query: 7  HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
            +           +K K+K  K K+    ++   KE K + KKKKK KK  K+K     
Sbjct: 17 EFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLK 76

Query: 67 NNNNN 71
            +N 
Sbjct: 77 KESNE 81



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 9/61 (14%)

Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK---------KKKNNNNN 66
            L+ K  + +E  +  ++A    K+K +K KEK  K KK  K         K +     
Sbjct: 3  FKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKK 62

Query: 67 N 67
           
Sbjct: 63 K 63


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 39.7 bits (93), Expect = 0.005
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  KK + K  +K A+K +K   K +K     K K+  KKK
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKK 810



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K  K +  KK+ +  KK   K +K     K K+  KKK   
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 36.3 bits (84), Expect = 0.049
 Identities = 14/50 (28%), Positives = 20/50 (40%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           EP      + K K K  K+   K K+   + +K     K K+  KKK   
Sbjct: 764 EPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 35.5 bits (82), Expect = 0.089
 Identities = 16/47 (34%), Positives = 21/47 (44%), Gaps = 7/47 (14%)

Query: 21  KKAKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKKKKKK 60
           KK    +K  K ++  KK         + EKK K K  KK  +K KK
Sbjct: 743 KKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789



 Score = 34.7 bits (80), Expect = 0.16
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 9/50 (18%)

Query: 22  KAKKKEKKKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKKN 62
           KAKK +  KK  K ++  KK         + +KK + K  KK  +K KK 
Sbjct: 741 KAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790



 Score = 33.2 bits (76), Expect = 0.37
 Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 2/48 (4%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
             F        K AKK  +K   KK   K +K     K K+  KKK  
Sbjct: 767 SAFRGEKKAKPKAAKKDARKA--KKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 30.9 bits (70), Expect = 2.0
 Identities = 16/55 (29%), Positives = 22/55 (40%), Gaps = 9/55 (16%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKK---------KKEKKKKKKKKKKKKKNNNNN 66
           K A++K KK    K   K ++  KK         + EKK K K  KK  +     
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
          region.  This highly conserved region is found towards
          the C-terminus of the transmembrane domain. The
          function is unclear.
          Length = 151

 Score = 37.7 bits (87), Expect = 0.005
 Identities = 16/39 (41%), Positives = 25/39 (64%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          K++++  K + +K +KKKKEK  +KK K KKK+   N  
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50



 Score = 37.3 bits (86), Expect = 0.005
 Identities = 17/45 (37%), Positives = 29/45 (64%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          +++A  K++K+  K   +K  KK+K+K  +KK K KKK++K N  
Sbjct: 6  YEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50



 Score = 29.2 bits (65), Expect = 3.3
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 3/39 (7%)

Query: 33 KKAKKKEKKKEKKKKEKKKKK---KKKKKKKKNNNNNNN 68
          K+ K+  K + +K  +KKK+K   KK K KKK    N  
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
          structure and biogenesis].
          Length = 107

 Score = 36.4 bits (84), Expect = 0.005
 Identities = 16/36 (44%), Positives = 20/36 (55%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             K +  K KK EK K    K+KKK  KKKKK++ 
Sbjct: 1  MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEA 36



 Score = 34.1 bits (78), Expect = 0.041
 Identities = 14/37 (37%), Positives = 21/37 (56%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           A K +  K++K  K K    + KKK  KKKKK++ +
Sbjct: 1  MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37



 Score = 33.0 bits (75), Expect = 0.081
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
             K +  K+K+ +  K    K+KKK  KKKKK++ +
Sbjct: 1  MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37



 Score = 29.9 bits (67), Expect = 1.0
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K +  KEKKA+K +    K KK+  KKKKK++ +
Sbjct: 2  APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37



 Score = 29.9 bits (67), Expect = 1.2
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K  K K    K+KK   K+KKKE+ ++     ++   K +K
Sbjct: 12 KAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRK 52



 Score = 28.7 bits (64), Expect = 2.7
 Identities = 14/41 (34%), Positives = 20/41 (48%), Gaps = 2/41 (4%)

Query: 22 KAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K KK EK K    K  KK  KKK+K++  +     ++   K
Sbjct: 9  KEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDK 49


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 13/59 (22%)

Query: 19 DWKKAKKKEKKKKE-------------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          D ++ ++KE+K+KE             K  + K  +KE    +  +K+KKKKKKK+  N
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKN 89


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 36.3 bits (84), Expect = 0.006
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K+ KKK++  K  + K  +K ++K KK+ +  +  ++ K++
Sbjct: 21 IKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63



 Score = 35.1 bits (81), Expect = 0.012
 Identities = 16/46 (34%), Positives = 24/46 (52%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          D     KK KKKK++ AK    K  KK ++K KKK +  +  + + 
Sbjct: 16 DIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61



 Score = 34.8 bits (80), Expect = 0.018
 Identities = 17/51 (33%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 21 KKAKKKEKKK-----KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          KK +K +K+         K +KK+KK+  K    K  KK +KK KK +   
Sbjct: 4  KKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFP 54



 Score = 31.7 bits (72), Expect = 0.22
 Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 7/48 (14%)

Query: 26 KEKKKKEKKAKKK-------EKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          KE+KK EK  K+         KK++KKKK   K  + K  KK    + 
Sbjct: 1  KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDK 48



 Score = 30.9 bits (70), Expect = 0.40
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
             KK++KKKK     K  + K  KK +KK KKK +  +    + 
Sbjct: 19 SNIKKRKKKKKRTA--KTARPKATKKGQKKDKKKDEFPEFPEESK 61



 Score = 30.1 bits (68), Expect = 0.80
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
             +K+KK+KK   K  + +  KK +KK KKK +  +    + 
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 38.9 bits (91), Expect = 0.006
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
           K K K ++ KEK+ K+ E++ +++K+ KKK
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 13/35 (37%), Positives = 24/35 (68%), Gaps = 1/35 (2%)

Query: 22  KAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKKKKK 55
           K  + ++K K ++ K K+ K+ E++ KE+K+ KKK
Sbjct: 271 KPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 38.1 bits (89), Expect = 0.009
 Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKK-KEKKKKEKKKKKKKKK 57
              K    K+K K  + KEK+ KE +++ K++K+ KKK
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 37.8 bits (88), Expect = 0.011
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            +   K  + K+  K ++ KEK++KE +++ K++K+ KK    
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308



 Score = 37.0 bits (86), Expect = 0.021
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 9/56 (16%)

Query: 21  KKAKKKEKKKKEKKAKK---------KEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           K+ K+ E++ KE+K  K         KE  KE  +KEK + +KK+++K++      
Sbjct: 287 KRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342



 Score = 33.5 bits (77), Expect = 0.28
 Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 10/49 (20%)

Query: 20  WKKAKKKE----------KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            +K  KK+           K+  +K K + +KKE++K+  +KKK K++K
Sbjct: 298 EEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346



 Score = 33.1 bits (76), Expect = 0.37
 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 6/44 (13%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKK------KKKKKKKKKNNN 64
           E +   K  + K K K ++ KEK++K      K++K+ KKK   
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308



 Score = 33.1 bits (76), Expect = 0.37
 Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 16/58 (27%)

Query: 21  KKAKKKEKKKKEKKAKKKEKK----------------KEKKKKEKKKKKKKKKKKKKN 62
           K+ ++KE +++ K+ K+ +KK                KEK +  KK+++K++ +KKK 
Sbjct: 285 KEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342



 Score = 32.8 bits (75), Expect = 0.44
 Identities = 8/43 (18%), Positives = 22/43 (51%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +       E  + E +   K  + ++K K ++ K+K++K+ ++
Sbjct: 252 ESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELER 294



 Score = 29.7 bits (67), Expect = 4.7
 Identities = 17/51 (33%), Positives = 22/51 (43%)

Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          F   T  D    KKK +KK +K  K  E  + + K      KK+  KKK  
Sbjct: 43 FVVDTTGDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSK 93



 Score = 28.9 bits (65), Expect = 8.5
 Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 6/50 (12%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKE------KKKKEKKKKKKKKKKKKK 61
          +D    +   KKK+ KK KK  K  E      K      KK+  KKK KK
Sbjct: 45 VDTTGDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSKK 94


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 39.3 bits (91), Expect = 0.006
 Identities = 14/70 (20%), Positives = 36/70 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K++KK  ++ + K A      +E+K ++K +K++K KK  ++  +  + +  ++   +  
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167

Query: 81  NNNNNNEEKG 90
             +  NE + 
Sbjct: 168 VTSLENESEE 177



 Score = 32.7 bits (74), Expect = 0.61
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           E   LD +  K   + +++  A   + ++EK +K+ +K++K KK  
Sbjct: 102 EDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMD 147



 Score = 29.2 bits (65), Expect = 7.2
 Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 11/43 (25%)

Query: 32  EKKAKKKEKKKEKK-----------KKEKKKKKKKKKKKKKNN 63
           +K++KK  ++  +K           K EKK +K++K KK   +
Sbjct: 107 DKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDED 149



 Score = 28.9 bits (64), Expect = 9.5
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           A  KE KK  ++ ++K        +E+K +KK +K++K
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRK 142


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 39.1 bits (91), Expect = 0.006
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 10 GTFTEPTPLDWKKAKKKE---KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          G+ TE    + K   ++E   KKKKE+KAK+KE KK K  +++ K K + ++     N  
Sbjct: 1  GSRTESEA-EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVP 59

Query: 67 NNNNNNNNNNNNNNNN 82
            +   +   +  + N
Sbjct: 60 KKSEKKSRKRDVEDEN 75



 Score = 36.8 bits (85), Expect = 0.038
 Identities = 18/69 (26%), Positives = 37/69 (53%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +A+KK   ++E + KKK+++K K+K+ KK K  +K+ K K      ++  N    +   
Sbjct: 6  SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 81 NNNNNNEEK 89
          +   + E++
Sbjct: 66 SRKRDVEDE 74


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 39.5 bits (92), Expect = 0.006
 Identities = 21/84 (25%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 18   LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKNNNNNNNNNNNNNN 75
            LD + AK +E ++K ++A  + +    KK  ++  KK   KK  KK + +        ++
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSS 1227

Query: 76   NNNNNNNNNNNEEKGTFLYKMAAP 99
                 N     + KG    K  AP
Sbjct: 1228 AMETENVAEVVKPKGRAGAKKKAP 1251


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 39.3 bits (92), Expect = 0.006
 Identities = 15/93 (16%), Positives = 36/93 (38%), Gaps = 2/93 (2%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
             +      ++  K  + ++E+   + E+   +   E++K  K++ K + +         
Sbjct: 807 LIDQIIESEEERLKALRIQREEMLMRPEE--LELINEEQKNLKQEIKLELSEIQEAEEEI 864

Query: 72  NNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVL 104
            N N N N      N+ K      +A  +++ L
Sbjct: 865 QNINENKNEFVEFKNDPKKLNKLIIAKDVLIKL 897



 Score = 37.0 bits (86), Expect = 0.027
 Identities = 9/85 (10%), Positives = 30/85 (35%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +     +     P +      + ++++ K  + + ++   + +E +   +++K  K+   
Sbjct: 792 SDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIK 851

Query: 65  NNNNNNNNNNNNNNNNNNNNNNEEK 89
              +          N N N N   +
Sbjct: 852 LELSEIQEAEEEIQNINENKNEFVE 876



 Score = 35.4 bits (82), Expect = 0.088
 Identities = 12/92 (13%), Positives = 35/92 (38%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +  +++ K  + ++ +   + +E +   +++K  K++ K + +          N N N N
Sbjct: 813 ESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKN 872

Query: 81  NNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQ 112
                  +       + A  VL+   ++    
Sbjct: 873 EFVEFKNDPKKLNKLIIAKDVLIKLVISSDEI 904


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 35.4 bits (82), Expect = 0.006
 Identities = 15/35 (42%), Positives = 28/35 (80%), Gaps = 2/35 (5%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
          KAKKKE K +E   K++E+++ +K++E+K+++K K
Sbjct: 23 KAKKKELKAEE--EKREEEEEARKREERKEREKNK 55



 Score = 32.7 bits (75), Expect = 0.059
 Identities = 13/37 (35%), Positives = 27/37 (72%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           AK K KKK+ K  ++K +++E+ +K +++K+++K K
Sbjct: 19 LAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55



 Score = 32.3 bits (74), Expect = 0.066
 Identities = 13/33 (39%), Positives = 27/33 (81%), Gaps = 1/33 (3%)

Query: 34 KAKKKE-KKKEKKKKEKKKKKKKKKKKKKNNNN 65
          KAKKKE K +E+K++E+++ +K++++K++  N 
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKEREKNK 55



 Score = 30.4 bits (69), Expect = 0.33
 Identities = 12/41 (29%), Positives = 26/41 (63%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +    K+    +  AK K KKKE K +E+K++++++ +K++
Sbjct: 6  EPVTLKDMLNADLLAKLKAKKKELKAEEEKREEEEEARKRE 46



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 7/24 (29%), Positives = 19/24 (79%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK 44
          ++ K++E+++  K+ ++KE++K K
Sbjct: 32 EEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 38.1 bits (89), Expect = 0.006
 Identities = 13/62 (20%), Positives = 24/62 (38%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           PT  + +  K + ++++ KK KKK+KKK KK        +           ++       
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223

Query: 75  NN 76
             
Sbjct: 224 AK 225



 Score = 37.3 bits (87), Expect = 0.011
 Identities = 11/64 (17%), Positives = 22/64 (34%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           +++ KKKK+KK KK +K        +         +  +               +   +N
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDN 237

Query: 84  NNNE 87
             +E
Sbjct: 238 EKSE 241



 Score = 33.5 bits (77), Expect = 0.25
 Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 2/54 (3%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            + + +E++AKKK+KKK+KK K         +           ++         
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 12/55 (21%), Positives = 18/55 (32%), Gaps = 3/55 (5%)

Query: 21  KKAKKKEKK---KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           KK KKK KK                  +      +  + KK KK  +   +N  +
Sbjct: 186 KKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKS 240



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 11/57 (19%), Positives = 20/57 (35%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
               K + ++E+  KKK+KKK+K KK        +           ++         
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 38.8 bits (91), Expect = 0.007
 Identities = 18/48 (37%), Positives = 22/48 (45%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           FT       K   K + KKK+KKA      K +  +E K   KKK KK
Sbjct: 346 FTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 36.9 bits (86), Expect = 0.026
 Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 7/47 (14%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKE-------KKKKEKKKKKKKKKKKKK 61
            A KK  KK   KAK K+KKK+       K +  ++ K   KKK KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 36.5 bits (85), Expect = 0.034
 Identities = 16/50 (32%), Positives = 23/50 (46%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +F   T    KK+  K K KK+KK        + +  ++ K   KKK KK
Sbjct: 344 SFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 31.1 bits (71), Expect = 1.6
 Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 6/58 (10%)

Query: 21  KKAKKKEKKKKEK------KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           ++ KK + KK +        A KK  KK   K + KKKKKK        +  +    +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385



 Score = 30.4 bits (69), Expect = 2.6
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 6/58 (10%)

Query: 25  KKEKKKKEKKAKK------KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           ++ KK K KK +          KK  KK   K K KKKKKK   +    +  +    +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385



 Score = 29.2 bits (66), Expect = 6.5
 Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 8/58 (13%)

Query: 30  KKEKKAKKKEK--------KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
           ++ KKAK K+            KK  +K   K K KKKKK    +    +  +    +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 39.0 bits (91), Expect = 0.007
 Identities = 16/75 (21%), Positives = 29/75 (38%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            + + K KK +E      + K++K + E   + K   K  +      +N N N++  N  
Sbjct: 362 DENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKT 421

Query: 81  NNNNNNEEKGTFLYK 95
            N   N      L +
Sbjct: 422 ENILTNSPLKDELLE 436



 Score = 36.3 bits (84), Expect = 0.046
 Identities = 19/79 (24%), Positives = 34/79 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK ++      + K KK E + + + K   K  ++  + K N N N++  N   N   N+
Sbjct: 369 KKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNS 428

Query: 81  NNNNNNEEKGTFLYKMAAP 99
              +   EK T +  +  P
Sbjct: 429 PLKDELLEKTTEIINIENP 447



 Score = 33.6 bits (77), Expect = 0.30
 Identities = 11/37 (29%), Positives = 14/37 (37%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
             +AK   K     K     K  EK KK +K    K+
Sbjct: 674 INEAKIYWKAINNTKKSVVIKPLEKPKKYEKTSVSKE 710



 Score = 31.7 bits (72), Expect = 1.0
 Identities = 11/62 (17%), Positives = 23/62 (37%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
               ++        KEKK + +   + K   +  ++  + K  K  N++  N   N   N
Sbjct: 368 FKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTN 427

Query: 77  NN 78
           + 
Sbjct: 428 SP 429



 Score = 29.8 bits (67), Expect = 4.3
 Identities = 16/81 (19%), Positives = 24/81 (29%), Gaps = 8/81 (9%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
               +        K+KK +     E K + K  E+  + K  K K  +  N   N   N+
Sbjct: 369 KKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNS 428

Query: 75  NN--------NNNNNNNNNNE 87
                         N  N  E
Sbjct: 429 PLKDELLEKTTEIINIENPQE 449


>gnl|CDD|205936 pfam13763, DUF4167, Domain of unknown function (DUF4167). 
          Length = 80

 Score = 35.4 bits (82), Expect = 0.007
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
             NNNN NNN +  N  N   ++N
Sbjct: 1  RGRNNNNRNNNRSGGNPRNRVFDSN 25



 Score = 35.0 bits (81), Expect = 0.011
 Identities = 10/26 (38%), Positives = 12/26 (46%)

Query: 65 NNNNNNNNNNNNNNNNNNNNNNEEKG 90
             NNNN NNN +  N  N   +  G
Sbjct: 1  RGRNNNNRNNNRSGGNPRNRVFDSNG 26



 Score = 33.8 bits (78), Expect = 0.027
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
          + NNN NNN +  N  N   ++N  + + +GT
Sbjct: 3  RNNNNRNNNRSGGNPRNRVFDSNGPDVKVRGT 34



 Score = 30.7 bits (70), Expect = 0.29
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 59 KKKNNNNNNNNNNNNNNNNNNNN 81
          +  NN NNN +  N  N   ++N
Sbjct: 3  RNNNNRNNNRSGGNPRNRVFDSN 25



 Score = 28.8 bits (65), Expect = 1.8
 Identities = 7/23 (30%), Positives = 11/23 (47%)

Query: 58 KKKKNNNNNNNNNNNNNNNNNNN 80
          +   N NNN +  N  N   ++N
Sbjct: 3  RNNNNRNNNRSGGNPRNRVFDSN 25



 Score = 28.0 bits (63), Expect = 3.2
 Identities = 9/22 (40%), Positives = 10/22 (45%)

Query: 71 NNNNNNNNNNNNNNNNEEKGTF 92
             NNNN NNN +  N     F
Sbjct: 1  RGRNNNNRNNNRSGGNPRNRVF 22


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 37.9 bits (88), Expect = 0.007
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            E       + K  EK K   KA+  E+ K+   +EKK++KKK KKKK
Sbjct: 29 NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 32.8 bits (75), Expect = 0.28
 Identities = 14/47 (29%), Positives = 25/47 (53%)

Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            P  + K   ++K  ++ K   K +  E+ KK   ++KK++KKK K
Sbjct: 28 GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74



 Score = 32.8 bits (75), Expect = 0.31
 Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 14/57 (24%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--------------KKKKKKKKKKK 61
           +K  K K   +   K   ++K  EK K   K              KK++KKK KKK
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          +A + EK K  K     E   +   ++K  +K K   K
Sbjct: 14 RALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK 51



 Score = 30.5 bits (69), Expect = 1.6
 Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 18/62 (29%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK------------------KKKKKK 59
          L+ +K K  + K   +   K   + +  +K K   K                  KKK KK
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75

Query: 60 KK 61
          KK
Sbjct: 76 KK 77



 Score = 29.0 bits (65), Expect = 5.2
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKK 59
           K  K   KK +K KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 28.2 bits (63), Expect = 9.3
 Identities = 13/21 (61%), Positives = 13/21 (61%)

Query: 42  KEKKKKEKKKKKKKKKKKKKN 62
           K  K   KK KK KKKKKKK 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKR 216



 Score = 28.2 bits (63), Expect = 9.6
 Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 19 DWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          D+ +AK+  +  + ++ KA K +   E   K   + K  +K K     
Sbjct: 5  DFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKA 52


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 38.2 bits (89), Expect = 0.008
 Identities = 13/41 (31%), Positives = 18/41 (43%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KAK+   ++ +K AK         +K  K K   KK  KK 
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKM 212



 Score = 35.9 bits (83), Expect = 0.043
 Identities = 15/82 (18%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 21  KKAKKKEKKKKEKKAKK---KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           K  +  E+ KK KK      + K KE   K K++KK ++      ++ ++  N++  +  
Sbjct: 109 KAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTP 168

Query: 78  NNNNNNNNNEEKGTFLYKMAAP 99
           +          +     K+++ 
Sbjct: 169 HLLKAKEGPSRRAKKAAKLSST 190



 Score = 35.1 bits (81), Expect = 0.072
 Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKK-----KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           KK  K  K  +  +  KK KK      E+K KE   K+K++KK +++  +  ++ ++  N
Sbjct: 102 KKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPN 161

Query: 76  NNNNNNNNNNNEEKGT 91
           ++  +  +    ++G 
Sbjct: 162 SSRPSTPHLLKAKEGP 177



 Score = 33.6 bits (77), Expect = 0.19
 Identities = 15/76 (19%), Positives = 27/76 (35%), Gaps = 5/76 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK-----KKKKKKKKNNNNNNNNNNNNNN 75
           KKA K          K  + K   KK  KK +K      +      + +  + N+ N + 
Sbjct: 182 KKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDENADA 241

Query: 76  NNNNNNNNNNNEEKGT 91
             +    +  +  +GT
Sbjct: 242 PEDVEEVDQESWGRGT 257



 Score = 33.6 bits (77), Expect = 0.22
 Identities = 11/34 (32%), Positives = 14/34 (41%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            KK  K     +   EK  + K   KK  KK +K
Sbjct: 181 AKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 33.6 bits (77), Expect = 0.22
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  KK  K +       EK  K K   KK  KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 31.3 bits (71), Expect = 1.2
 Identities = 8/37 (21%), Positives = 16/37 (43%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            ++ KK  +  +      ++  K +   KK  KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +++ +K  K         +K  K K   KK  KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 38.0 bits (88), Expect = 0.008
 Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEK-KKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           T    +    KKA K +K K  K   KK   K EK +  KK+   KK  KK+  +     
Sbjct: 35  TAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFE 94

Query: 70  NNNNNNNN 77
            +N    N
Sbjct: 95  ASNKLFKN 102



 Score = 37.2 bits (86), Expect = 0.017
 Identities = 19/51 (37%), Positives = 21/51 (41%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          T  E  P   K    KE  K +K AK    K  KK  + KK K  K   KK
Sbjct: 12 TTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62



 Score = 35.7 bits (82), Expect = 0.047
 Identities = 20/61 (32%), Positives = 25/61 (40%), Gaps = 2/61 (3%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           K AKK  K KK K  K   KK   + +K E  KK+   KK  K    +      +N    
Sbjct: 42  KAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFK 101

Query: 79  N 79
           N
Sbjct: 102 N 102



 Score = 34.2 bits (78), Expect = 0.15
 Identities = 16/51 (31%), Positives = 18/51 (35%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          T         ++ K   KK    K   K KK  K    K  KK  K KK K
Sbjct: 4  TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTK 54



 Score = 33.4 bits (76), Expect = 0.26
 Identities = 15/40 (37%), Positives = 18/40 (45%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K KK  K    K AKK  K K+ K  +   KK   K +K 
Sbjct: 31 KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKT 70



 Score = 33.4 bits (76), Expect = 0.27
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          K AK    K  +K AK K+ K  K   +K   K +K +  K  +
Sbjct: 34 KTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKES 77



 Score = 33.4 bits (76), Expect = 0.27
 Identities = 19/51 (37%), Positives = 21/51 (41%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          T  + T  + K A KK    KE    KK  K    K  KK  K KK K  K
Sbjct: 7  TTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVK 57



 Score = 31.8 bits (72), Expect = 0.70
 Identities = 16/48 (33%), Positives = 19/48 (39%)

Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          T       KK  K    K  KKA K +K K  K   KK   K +K + 
Sbjct: 25 TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTES 72



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 4/53 (7%)

Query: 13 TEPTPLDWKKAKKKEKKKKEKKAK-KKEKKKEKK---KKEKKKKKKKKKKKKK 61
          T  T    K    K  KK  K  K K  K   KK   K EK +  KK+   KK
Sbjct: 29 TAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKK 81



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 15/37 (40%), Positives = 17/37 (45%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E KK  KK   +EKK   KK    K+  K KK  K
Sbjct: 1  MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAK 37



 Score = 29.1 bits (65), Expect = 5.9
 Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 3/71 (4%)

Query: 4   STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           + K       +      K    K      KK   K +K E  KKE   KK  KK+     
Sbjct: 35  TAKTTSTKAAKKAAKVKKTKSVK---TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAE 91

Query: 64  NNNNNNNNNNN 74
               +N    N
Sbjct: 92  VFEASNKLFKN 102


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 38.5 bits (90), Expect = 0.008
 Identities = 16/55 (29%), Positives = 32/55 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           +K K K ++  EK   + ++KK++KKKE+KKKK+ ++ +++         +   N
Sbjct: 311 RKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEEN 365



 Score = 37.3 bits (87), Expect = 0.016
 Identities = 16/44 (36%), Positives = 28/44 (63%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +   +     K+K + K ++  +K + + KEKKK+KKK++KKKK
Sbjct: 300 ILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343



 Score = 37.3 bits (87), Expect = 0.018
 Identities = 17/44 (38%), Positives = 29/44 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
              K+K K K E+  +K + + ++KKKEKKK++KKKK+ ++   
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350



 Score = 36.9 bits (86), Expect = 0.022
 Identities = 16/44 (36%), Positives = 28/44 (63%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L ++     ++K K K  +  EK   + K++KK+KKK++KKKK+
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344



 Score = 35.4 bits (82), Expect = 0.063
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 5/50 (10%)

Query: 21  KKAKKKEKK-----KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           K   K+ KK     +     K+K K K ++  EK   + K+KKK+K    
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339



 Score = 31.9 bits (73), Expect = 0.79
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K KKKEKKK+EKK K+ E+ +E+ +K + +   K++ K
Sbjct: 328 VKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366



 Score = 30.8 bits (70), Expect = 2.1
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K ++K K  K +  + K+     +     K+K K K ++ N  
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEK 323


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 38.5 bits (90), Expect = 0.009
 Identities = 17/38 (44%), Positives = 19/38 (50%)

Query: 59  KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
           K K  N N    NN +N NNN N NN  E+KG  L   
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNE 249



 Score = 38.2 bits (89), Expect = 0.012
 Identities = 13/36 (36%), Positives = 16/36 (44%)

Query: 57  KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
           K K+ N N    NN +N NNN N NN    +     
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLS 247



 Score = 37.0 bits (86), Expect = 0.026
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           K K+        NN +N NNN N NN      N+   
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSN 248



 Score = 36.6 bits (85), Expect = 0.038
 Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 50  KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K K+    K++KNN +N NNN N NN      N+ +NE +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE 251



 Score = 35.5 bits (82), Expect = 0.081
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           K K+    K++K N +N NNN N NN      N+ +N  E
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE 251



 Score = 34.7 bits (80), Expect = 0.15
 Identities = 13/46 (28%), Positives = 18/46 (39%)

Query: 45  KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           K KE    K++K      NNN N NN      N+ +N   +     
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGP 257



 Score = 33.9 bits (78), Expect = 0.28
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 44  KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
           K K+    K++K      NNN N NN      N+ +N   + + 
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISL 255



 Score = 33.2 bits (76), Expect = 0.40
 Identities = 14/49 (28%), Positives = 21/49 (42%)

Query: 46  KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
           K ++    K++K    N NNN N NN      N+ +N   +   G   Y
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEY 260



 Score = 32.4 bits (74), Expect = 0.79
 Identities = 10/48 (20%), Positives = 18/48 (37%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K K+    K++K        N N NN      N+ +N   + +    +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259



 Score = 30.8 bits (70), Expect = 2.1
 Identities = 10/28 (35%), Positives = 12/28 (42%)

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
             +   N N    NN +N NNN N   K
Sbjct: 211 IKSKEININKEEKNNGSNVNNNGNKNNK 238



 Score = 30.5 bits (69), Expect = 2.6
 Identities = 9/43 (20%), Positives = 16/43 (37%)

Query: 40  KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           K KE    +++K          N NN      N+ +N   + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDIS 254



 Score = 30.5 bits (69), Expect = 3.3
 Identities = 9/48 (18%), Positives = 17/48 (35%)

Query: 41  KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           K ++    K++K          N NN      N+ +N   + +    E
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259



 Score = 30.1 bits (68), Expect = 4.1
 Identities = 10/43 (23%), Positives = 16/43 (37%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K K+    K+EK           KNN      N+ +N   + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDIS 254


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 38.0 bits (88), Expect = 0.010
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
              +AK K +KK E  A++K   ++KK   K K  K     K
Sbjct: 209 AEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250



 Score = 37.6 bits (87), Expect = 0.014
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KA ++ K K E  A KK+ + E K   +K K + + K K
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAK 216



 Score = 37.2 bits (86), Expect = 0.017
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KA+ + K K EKKA+   ++K   +K+K   K K  K   
Sbjct: 208 KAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247



 Score = 37.2 bits (86), Expect = 0.018
 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 21  KKAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KA+    KKK   E KA  ++ K E + K K +KK +   ++K
Sbjct: 185 AKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228



 Score = 36.9 bits (85), Expect = 0.026
 Identities = 14/42 (33%), Positives = 21/42 (50%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KA+KK +   E+KA  ++KK   K K  K     K  ++K  
Sbjct: 216 KAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAA 257



 Score = 36.1 bits (83), Expect = 0.043
 Identities = 15/39 (38%), Positives = 20/39 (51%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
               EK K E +AK K +KK +   E+K   +KKK   K
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAK 239



 Score = 35.3 bits (81), Expect = 0.071
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           + KK  ++  K  E+   K E    KKK E + K   +K K 
Sbjct: 168 EAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKA 209



 Score = 35.3 bits (81), Expect = 0.074
 Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 4/45 (8%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK----EKKKKEKKKKKKKKKKKKK 61
           K A +K K + E KAK ++K +    EK   EKKK   K K  K 
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKA 245



 Score = 33.8 bits (77), Expect = 0.24
 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
           KK  + E K   +KAK + + K   EKK +   ++K   +KKK 
Sbjct: 193 KKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236



 Score = 33.0 bits (75), Expect = 0.36
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K K +E  K  ++AK K +    KKK + + K   +K K 
Sbjct: 170 KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKA 209



 Score = 32.2 bits (73), Expect = 0.62
 Identities = 13/71 (18%), Positives = 26/71 (36%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            +A  +EK   EKK    + K +K     K  ++K      ++     ++  N       
Sbjct: 221 AEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTGGG 280

Query: 81  NNNNNNEEKGT 91
             N +++  G 
Sbjct: 281 AKNASSQGSGA 291



 Score = 32.2 bits (73), Expect = 0.63
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KKA+  + K   + AK K   + KKK E+  K  ++ K K
Sbjct: 147 KKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186



 Score = 31.8 bits (72), Expect = 0.87
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L+ K+ +++ +K   ++ KK E  K K   E  K K   + KKK
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             K E    +KKA+ + K   +K K + + K K +KK +
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222



 Score = 31.5 bits (71), Expect = 1.3
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E+KKK + AK K   +  K K   + KKK ++  K
Sbjct: 144 EQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAK 178



 Score = 31.5 bits (71), Expect = 1.4
 Identities = 15/41 (36%), Positives = 21/41 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K A +  K K   +AKKK ++  K  +E K K +    KKK
Sbjct: 155 KAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKK 195



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 13/44 (29%), Positives = 21/44 (47%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            + KK  +  K K   +A K +   E KKK ++  K  ++ K K
Sbjct: 143 AEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K A +++KK +  KAK   +  + K   + KKK ++  K  
Sbjct: 140 KAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAA 180



 Score = 30.3 bits (68), Expect = 2.5
 Identities = 12/41 (29%), Positives = 19/41 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K   + KK+ +   K  ++ K K E    KKK + + K
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAK 201



 Score = 30.3 bits (68), Expect = 2.8
 Identities = 13/71 (18%), Positives = 26/71 (36%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KKA+   ++K   + KK   K +  K     K  ++K      ++     ++  N     
Sbjct: 219 KKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTG 278

Query: 81  NNNNNNEEKGT 91
               N   +G+
Sbjct: 279 GGAKNASSQGS 289



 Score = 29.9 bits (67), Expect = 3.9
 Identities = 12/43 (27%), Positives = 33/43 (76%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           L+ ++ K +E++K+ ++A+K+ + ++K+++E+ +K   ++KKK
Sbjct: 106 LEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK 148



 Score = 29.1 bits (65), Expect = 6.2
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K     E KKK ++A K  ++ + K +    KKK + + K 
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA 202


>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
          Length = 386

 Score = 38.2 bits (89), Expect = 0.010
 Identities = 35/240 (14%), Positives = 61/240 (25%), Gaps = 41/240 (17%)

Query: 226 RMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG 285
           R       LL   +  + FF+             +            Y+ T       L 
Sbjct: 173 RRVRFALALLLLLLALASFFL-------------LLALGPFATGAALYYGTATRLWEFLL 219

Query: 286 GALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
           GAL  +          L   +        +++AV      S  +    +  V  + AA L
Sbjct: 220 GALAALWRPKRRPWPALLAAALALLGLALLLLAVAL-IPGSSPFFPGDLLLVPSLGAALL 278

Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSY--------IFAFAVNKLYYPMLDTFH 397
                 L+       +  +          G  SY        +  F +  L         
Sbjct: 279 VLAVAALVGG---SRLLSSRP----LVFLGDISYSLYLWHWPVLVF-LLYLVGLSGLPIS 330

Query: 398 LWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAR 457
               +    A+S++     Y             +E+     G+  V +  RA      AR
Sbjct: 331 ALLLILAALALSILLAALSYRF-----------VEKPLRRLGRRRVWSRIRAVLAAGRAR 379


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 37.7 bits (88), Expect = 0.010
 Identities = 16/39 (41%), Positives = 25/39 (64%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             KK+KKKK+K   KK K K+   K++K + K KK +++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292



 Score = 37.0 bits (86), Expect = 0.018
 Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 6/48 (12%)

Query: 20  WKKAKKK-----EKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKK 61
           +KKA         K KK+KK KKK   KK K KK   K++K + K KK
Sbjct: 241 YKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288



 Score = 34.3 bits (79), Expect = 0.15
 Identities = 17/40 (42%), Positives = 26/40 (65%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K KKK+KKKK+   KK + KK   K+ K + K KK ++++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRE 293



 Score = 34.3 bits (79), Expect = 0.16
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K KKK+KKK   K  K +K   K++K + K KK +++++
Sbjct: 255 VKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K  KKK+KKKK    K K KK   K+++ + K KK +++++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK KKK   KK K  K   K+++ + K KK +++++ +KK 
Sbjct: 259 KKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKI 299



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 16/47 (34%), Positives = 29/47 (61%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           P+ +  KK KKK+   K+ K KK   K+ K + + KK +++++ +KK
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298



 Score = 32.3 bits (74), Expect = 0.54
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           K   KK K KK    ++K + K KK + +++ +KK K
Sbjct: 264 KVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300


>gnl|CDD|233166 TIGR00880, 2_A_01_02, Multidrug resistance protein. 
          Length = 141

 Score = 36.5 bits (85), Expect = 0.010
 Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 5/76 (6%)

Query: 97  AAPLVLVLTYVAEITQPHLR----GMLSATASMTTIFGTVSQLFLGSFLHWRSAAILNLL 152
           A  LV     +A+I  P  R    G++SA  ++  + G      L  FL WR+  +   +
Sbjct: 66  AFALVAGAALIADIYPPEERGVALGLMSAGIALGPLLGPPLGGVLAQFLGWRAPFLFLAI 125

Query: 153 FPILALCALYFI-PES 167
             + A   L F+ PE+
Sbjct: 126 LALAAFILLAFLLPET 141


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
          unknown].
          Length = 523

 Score = 37.9 bits (88), Expect = 0.011
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 22 KAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          KA KK  K   K    ++K+  + KK    K +    + K ++  N+NN N    +   +
Sbjct: 6  KATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYS 65

Query: 80 NNNNNNNEEK 89
          + +++++EE+
Sbjct: 66 DVDSSSSEEE 75



 Score = 37.9 bits (88), Expect = 0.014
 Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKK------KKKKKKKKKKKNNNNNNNNNNNNNN 75
          KA K  KK  +   K    ++++  + KK      +    + K +   N+NN N    + 
Sbjct: 3  KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62

Query: 76 NNNNNNNNNNNEEK 89
            ++ +++++ EE+
Sbjct: 63 LYSDVDSSSSEEEE 76



 Score = 34.9 bits (80), Expect = 0.12
 Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
          K  KA KK  K   K    ++K+  + KK     N N++  N   +  N+NN N  +EK 
Sbjct: 3  KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKN--KEKL 60

Query: 91 TFLY 94
          + LY
Sbjct: 61 SKLY 64


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
          Provisional.
          Length = 2722

 Score = 38.3 bits (89), Expect = 0.011
 Identities = 14/55 (25%), Positives = 23/55 (41%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           +K  +    K     +  +       N  N   ++NNNNNN NN++ N   + K
Sbjct: 19 IEKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNK 73



 Score = 37.9 bits (88), Expect = 0.019
 Identities = 16/56 (28%), Positives = 26/56 (46%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          +K  +    K     E  +        +K ++K +NNNNNN NN++ N   +N N 
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75



 Score = 37.5 bits (87), Expect = 0.024
 Identities = 17/62 (27%), Positives = 27/62 (43%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
           EK  +    +     +  +    N  N  N   ++NNNNNN NN++ N +K    Y   +
Sbjct: 20  EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNYMKRS 79

Query: 99  PL 100
            L
Sbjct: 80  LL 81



 Score = 35.6 bits (82), Expect = 0.077
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
          +K  +    K     +  +        +K ++K++NNNNNN NN++ N   +N N 
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75



 Score = 35.6 bits (82), Expect = 0.077
 Identities = 16/56 (28%), Positives = 25/56 (44%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          EK  +    K     +  +        +K ++K  NNNNNN NN++ N   +N N 
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75



 Score = 34.4 bits (79), Expect = 0.19
 Identities = 12/53 (22%), Positives = 23/53 (43%)

Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
          ++  +    K     +  +       N  N   ++NNNNNN NN++ N ++  
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDN 72



 Score = 31.7 bits (72), Expect = 1.3
 Identities = 13/62 (20%), Positives = 23/62 (37%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           +K  +    K     E  +        +K ++K     NNN NN++ N   +N N    
Sbjct: 19 IEKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNYMKR 78

Query: 83 NN 84
          + 
Sbjct: 79 SL 80



 Score = 31.3 bits (71), Expect = 1.8
 Identities = 11/76 (14%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 23   AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN----NNNNNNNN 78
             K++E+ + E +    ++++  ++++++ +K++++K       N   +     N N+N  
Sbjct: 2578 RKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTDDTNKNHNTG 2637

Query: 79   NNNNNNNNEEKGTFLY 94
            ++ +N          Y
Sbjct: 2638 HDESNYGRYSNKRNDY 2653



 Score = 30.6 bits (69), Expect = 3.1
 Identities = 18/68 (26%), Positives = 29/68 (42%)

Query: 22   KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
              K K  K KE+    +EK +   KK  + K K  + K K++ +  N +   N    + N
Sbjct: 1067 IKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYN 1126

Query: 82   NNNNNEEK 89
                + EK
Sbjct: 1127 KKKKSLEK 1134



 Score = 30.6 bits (69), Expect = 3.3
 Identities = 24/108 (22%), Positives = 38/108 (35%), Gaps = 14/108 (12%)

Query: 5    TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK-------KKKK 57
                +  FT     D    K KE      +      +      EK K +        ++ 
Sbjct: 2442 RISNVDAFTNKILQDIDNEKNKENNNMNAEKIDDLIENVTSHNEKIKSELLIINDALRRV 2501

Query: 58   KKKKNNNNN-------NNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
            K+KK+  N        NNNNNNN+  N  +N+     E  + + K+  
Sbjct: 2502 KEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNE 2549



 Score = 30.2 bits (68), Expect = 4.4
 Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 2/79 (2%)

Query: 6    KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            K ++ +F     +D KK K  + K++ K  ++K +   KK  E K K  + K K   +  
Sbjct: 1055 KTKLSSFH--FNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVV 1112

Query: 66   NNNNNNNNNNNNNNNNNNN 84
            N +   N    + N    +
Sbjct: 1113 NADKEKNKQTEHYNKKKKS 1131



 Score = 29.4 bits (66), Expect = 6.8
 Identities = 19/75 (25%), Positives = 29/75 (38%)

Query: 19   DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
            D    K K  +   +      K KEK    K++  K  K  K+NN   NNNN  +  +N 
Sbjct: 2343 DENYGKDKNIELNNENNSYIIKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNI 2402

Query: 79   NNNNNNNNEEKGTFL 93
              +     +   + L
Sbjct: 2403 GKSVETIKQRFSSNL 2417



 Score = 29.0 bits (65), Expect = 8.3
 Identities = 13/71 (18%), Positives = 21/71 (29%)

Query: 19   DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
            D    + K+ K   +       +      +    + +   K KN   NN NN+       
Sbjct: 2308 DTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELNNENNSYIIKTKE 2367

Query: 79   NNNNNNNNEEK 89
              NN      K
Sbjct: 2368 KINNLKEEFSK 2378



 Score = 29.0 bits (65), Expect = 9.3
 Identities = 17/63 (26%), Positives = 29/63 (46%)

Query: 28   KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
            KK K  K K++ K  E+K +   KK  + K K     N ++ +  N +   N    + N+
Sbjct: 1068 KKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYNK 1127

Query: 88   EKG 90
            +K 
Sbjct: 1128 KKK 1130


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 36.4 bits (84), Expect = 0.011
 Identities = 18/40 (45%), Positives = 30/40 (75%), Gaps = 2/40 (5%)

Query: 21  KKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           K+ +K EKKKK  EKK   K++K+E+++KE  K++K+ +K
Sbjct: 63  KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102



 Score = 36.0 bits (83), Expect = 0.014
 Identities = 19/52 (36%), Positives = 36/52 (69%), Gaps = 2/52 (3%)

Query: 12  FTEPTPLDWKKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           F +   LD KK   K++K  ++EK+  K++K+ EK +  KKK+K++++++KK
Sbjct: 68  FEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKK 119



 Score = 33.3 bits (76), Expect = 0.15
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KE +K EKK K  EKK+  K++++++++K+  K++K
Sbjct: 61 SSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQK 98



 Score = 32.2 bits (73), Expect = 0.33
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 21  KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           KK K  EKK   K++K +++EK+  K++KE +K +  KKK+K+       
Sbjct: 70  KKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKK 119



 Score = 30.6 bits (69), Expect = 1.0
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++KE  K++K+ +K E  K+K+K+ ++++KK  KK K 
Sbjct: 89  REKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126



 Score = 29.5 bits (66), Expect = 2.3
 Identities = 14/42 (33%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKKKN 62
          K   +++ K  K+ +K EKKK+  +KKE  K++K+++++K+ 
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKEL 93



 Score = 29.5 bits (66), Expect = 2.6
 Identities = 13/40 (32%), Positives = 30/40 (75%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          + A+K+ K  KE +  +K+KK ++KK+  K++K+++++K+
Sbjct: 53 ESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKE 92



 Score = 27.9 bits (62), Expect = 8.2
 Identities = 12/35 (34%), Positives = 27/35 (77%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           + AK++++ +K + +KKK+K++E+++K+  KK K 
Sbjct: 92  ELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126


>gnl|CDD|222703 pfam14351, DUF4401, Domain of unknown function (DUF4401).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 357 and 735 amino
           acids in length. The family is found in association with
           pfam09925. There is a single completely conserved
           residue K that may be functionally important.
          Length = 321

 Score = 37.6 bits (88), Expect = 0.012
 Identities = 23/108 (21%), Positives = 34/108 (31%), Gaps = 18/108 (16%)

Query: 244 FFIGQFG------GMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYT 297
            F+ Q        G   L   A G+F+     LD      LL +A +   L    L+   
Sbjct: 57  LFLRQLALALSLAGQLLL---AFGLFDL----LDESVLAALLLLALILALLY--FLMPD- 106

Query: 298 GKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
             R L  +S        + +              PL P + + L  FL
Sbjct: 107 --RLLRFLSAAAGVLALLGLLAELLLSGLAYLLLPLAPLLLVALFVFL 152


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 38.2 bits (88), Expect = 0.012
 Identities = 12/46 (26%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 22   KAKKKEKKKKEKKA-KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
            K + K++++KE+    ++ K++E  KKEK+++++ +K    N    
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEM 1068



 Score = 36.3 bits (83), Expect = 0.055
 Identities = 11/42 (26%), Positives = 27/42 (64%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            K  ++KE+ +  ++AK++E  K++K++E++ +K      K+ 
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEM 1068



 Score = 35.5 bits (81), Expect = 0.082
 Identities = 11/41 (26%), Positives = 25/41 (60%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K+  +  ++ KE++  KKEK++E++ ++      K+  KK+
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072



 Score = 32.8 bits (74), Expect = 0.62
 Identities = 13/39 (33%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 21   KKAKKKE--KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            ++AK++E  KK+KE++ + ++   +  K+  KK+ KKK+
Sbjct: 1039 QRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 31.6 bits (71), Expect = 1.3
 Identities = 12/42 (28%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 21   KKAKKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             ++ ++ K+++  KK K++E++  K   +  K+  KK+ KKK
Sbjct: 1035 MESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKK 1076



 Score = 29.3 bits (65), Expect = 6.5
 Identities = 10/43 (23%), Positives = 25/43 (58%)

Query: 19   DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
               + K + K +  K+  K +++KE+ +  ++ K+++  KK+K
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 37.2 bits (87), Expect = 0.013
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 13/56 (23%)

Query: 19  DWKKAKKKEKKKKEKKAKK-------------KEKKKEKKKKEKKKKKKKKKKKKK 61
           D++K  KKE     K  +K             KEKKK   K+EKK  K++K K ++
Sbjct: 67  DFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEE 122



 Score = 34.9 bits (81), Expect = 0.058
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK 46
           +K KKK   K+EKKA K+EK K ++ 
Sbjct: 98  EKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 34.9 bits (81), Expect = 0.065
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKK 50
            +KEKKK   K +KK  K+EK K E+ 
Sbjct: 97  AEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 33.3 bits (77), Expect = 0.18
 Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 1/28 (3%)

Query: 27  EKKKKEKKAKKK-EKKKEKKKKEKKKKK 53
           E +K++KKA  K EKK  K++K+K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 36.9 bits (86), Expect = 0.013
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 13/56 (23%)

Query: 19  DWKKAKKKEKKKKEKKAKK-------------KEKKKEKKKKEKKKKKKKKKKKKK 61
           D++K   KE+K   KK  K             KEKKK   K+EKK  K++K+K ++
Sbjct: 66  DFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121



 Score = 36.5 bits (85), Expect = 0.022
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 1   MDSSTKHQIGTFTEP--TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           +    KH I   ++   TP+     ++KEKKK   K +KK  K+EK+K E+ 
Sbjct: 71  LTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein.  This family is part
           of the major facilitator superfamily of membrane
           transport proteins.
          Length = 425

 Score = 38.0 bits (89), Expect = 0.013
 Identities = 46/263 (17%), Positives = 86/263 (32%), Gaps = 51/263 (19%)

Query: 105 TYVAEITQ-PHLRGMLSATASM-TTIFGTVSQLFLGSFLHWRSAAILNLLFPILAL--CA 160
           + +  +T+ P  R  L++   +  TI G +  +     +           + + AL    
Sbjct: 123 SLIPVLTRDPKERTSLASYRRIGATIGGLLVAVIAPPLVALLGGGNGAKGYFLAALIIAI 182

Query: 161 LYFIPESPHWLISQGRMQEASASLCWL--RGWVTPDKVQTELSQITKAIEESELKRLGKD 218
           L  +                   +C+   +  V+    Q E   +         K L K+
Sbjct: 183 LAVV----------------GFLICFAGTKERVSVPPEQKEKVSLRD-----MFKALFKN 221

Query: 219 GQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLL 278
                     +    LL   ++ +L    + G +    TY +G     +A L      + 
Sbjct: 222 RP--------LLILLLL--YLLNALAMAVRNGLLLYYFTYVLG-----NAGLFSVLLLIG 266

Query: 279 LGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVF 338
              A LG A L   L    GK+   L+    +A   +++             S  +  V 
Sbjct: 267 TIAAILG-APLWPWLAKRFGKKRTFLLGMLLAAIGLVLLFFL-------PPGSLWLFLVL 318

Query: 339 LVLAAFLTHICIRLLPWMLIGEV 361
           +VLA     +   LLPW ++ +V
Sbjct: 319 VVLAGIGLGLAT-LLPWAMLADV 340


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
          family of fungal proteins is conserved towards the
          C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 33.7 bits (77), Expect = 0.014
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           ++ KKK  +   ++ K  + + + KKK KKK  K+ 
Sbjct: 1  NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKED 37



 Score = 33.7 bits (77), Expect = 0.016
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          ++ KKK ++   ++ K  + +   KKK KKK  K+  K+K  
Sbjct: 2  RERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43



 Score = 32.9 bits (75), Expect = 0.025
 Identities = 14/39 (35%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           +E+KKK ++A   E+ K+   + + KKK KKK  K+++
Sbjct: 1  NRERKKKWREANS-ERNKDNDLRSRVKKKAKKKFGKEDS 38



 Score = 30.2 bits (68), Expect = 0.26
 Identities = 12/38 (31%), Positives = 20/38 (52%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           +E+K K +E   E+ K    + + KKK KKK    ++
Sbjct: 1  NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDS 38



 Score = 29.5 bits (66), Expect = 0.40
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          W++A  +  K  + +++ K+K K+K  KE  K+K  
Sbjct: 8  WREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 35.3 bits (82), Expect = 0.016
 Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 4/59 (6%)

Query: 4  STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          STK  I     P  ++     +  K++K+KK KK E +   +KKEKK+  +++K  +K 
Sbjct: 18 STKVDISGVKVPEKIN----DEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 37.5 bits (87), Expect = 0.016
 Identities = 15/60 (25%), Positives = 31/60 (51%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
               P   KK +   K  + ++ +K+ K++++KK+ +++KKKKK+   K       N+  
Sbjct: 261 RRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPA 320



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          T  +P  +  ++ KK+EKK   +   ++E  K +KK+E+ + KKK K  
Sbjct: 4  TVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52



 Score = 33.6 bits (77), Expect = 0.26
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           T P P    K +KK+++KK  +  K E++  K +K++++ + KKK K  
Sbjct: 3  ITVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 37.9 bits (88), Expect = 0.016
 Identities = 21/64 (32%), Positives = 34/64 (53%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           LD  K   ++  KK KK  K +  K+ KK + K+KKK KKK   +++ +N  ++ NN+  
Sbjct: 46  LDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQ 105

Query: 78  NNNN 81
               
Sbjct: 106 FAGP 109



 Score = 37.9 bits (88), Expect = 0.017
 Identities = 22/86 (25%), Positives = 29/86 (33%), Gaps = 25/86 (29%)

Query: 18  LDWKKAKKKEKKKKEKKAKK-------------------------KEKKKEKKKKEKKKK 52
            D KK K K+KKK +KK                              K K K +K KKK 
Sbjct: 70  KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129

Query: 53  KKKKKKKKKNNNNNNNNNNNNNNNNN 78
              K   KK     ++ +     +NN
Sbjct: 130 TVNKSTNKKKKKVLSSKDELIKYDNN 155



 Score = 34.8 bits (80), Expect = 0.12
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           KK  +K +KK KK  K  + K  +K K K+KKK KKK + +++ +N  ++ NN+      
Sbjct: 50  KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109



 Score = 34.0 bits (78), Expect = 0.26
 Identities = 9/39 (23%), Positives = 14/39 (35%)

Query: 46  KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
             EK+ K K  K K+ N  +       +     N  +N 
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540



 Score = 33.7 bits (77), Expect = 0.28
 Identities = 22/90 (24%), Positives = 29/90 (32%), Gaps = 21/90 (23%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK-----------KEKKKKK----------KKKKKK 59
           KK  K +  K  KK K K+KKK KKK             K   K           +K K 
Sbjct: 61  KKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKP 120

Query: 60  KKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K          N + N       ++ +E  
Sbjct: 121 KTEKLKKKITVNKSTNKKKKKVLSSKDELI 150



 Score = 33.3 bits (76), Expect = 0.35
 Identities = 10/39 (25%), Positives = 14/39 (35%)

Query: 41  KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
             EK+ K K  K K+  KK        +     N  +N 
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540



 Score = 32.9 bits (75), Expect = 0.52
 Identities = 9/39 (23%), Positives = 14/39 (35%)

Query: 44  KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
             +KE K K  K K+  K +       +     N  +N 
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540



 Score = 32.5 bits (74), Expect = 0.60
 Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
            +K+ K K  K K NN  +       +     N  +N ++
Sbjct: 503 SEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQ 542



 Score = 32.1 bits (73), Expect = 0.81
 Identities = 9/41 (21%), Positives = 16/41 (39%)

Query: 47  KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
             +K+ K K  K K+NN  +       +     N  +N  +
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQ 542



 Score = 31.3 bits (71), Expect = 1.4
 Identities = 9/47 (19%), Positives = 18/47 (38%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
             +K+ K K  K   NN  +       +     N  +N+++   + K
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQINLIIK 548



 Score = 30.6 bits (69), Expect = 2.6
 Identities = 11/38 (28%), Positives = 15/38 (39%)

Query: 40  KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            +KE K K  K K+  KK   K    +     N  +N 
Sbjct: 503 SEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540



 Score = 29.8 bits (67), Expect = 4.2
 Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKK-KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
           KK  +K  KK KK  K    K  KK K K+KKK     + +++ +N  ++ NN+      
Sbjct: 50  KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109

Query: 88  EKGTFLYKM 96
              + + K 
Sbjct: 110 LAISLMRKP 118



 Score = 29.8 bits (67), Expect = 5.2
 Identities = 19/61 (31%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
             +K+    F  P  +   + K K K +K KK K    K   KKK+K    K +  K  N
Sbjct: 97  FDSKNNSKQFAGPLAISLMR-KPKPKTEKLKK-KITVNKSTNKKKKKVLSSKDELIKYDN 154

Query: 63  N 63
           N
Sbjct: 155 N 155


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 37.0 bits (86), Expect = 0.016
 Identities = 12/65 (18%), Positives = 23/65 (35%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            S   +     +      +K   K    KE+ +     K+ +    + K+  KKKK    
Sbjct: 56  DSLAEEPRVLRKEKYNITRKTTGKNDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTI 115

Query: 63  NNNNN 67
           + N+ 
Sbjct: 116 SKNSP 120


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
          7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
          (Rrp7A), and similar proteins.  This CD corresponds to
          the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
          YCL031C gene from Saccharomyces cerevisiae. It is an
          essential yeast protein involved in pre-rRNA processing
          and ribosome assembly, and is speculated to be required
          for correct assembly of rpS27 into the pre-ribosomal
          particle. Rrp7A, also termed gastric cancer antigen
          Zg14, is the Rrp7p homolog mainly found in Metazoans.
          The cellular function of Rrp7A remains unclear
          currently. Both Rrp7p and Rrp7A harbor an N-terminal
          RNA recognition motif (RRM), also termed RBD (RNA
          binding domain) or RNP (ribonucleoprotein domain), and
          a C-terminal RRP7 domain.
          Length = 118

 Score = 35.3 bits (82), Expect = 0.017
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          +  +K K   ++  + K ++K+KKKKKKK+ ++
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90



 Score = 33.8 bits (78), Expect = 0.051
 Identities = 11/32 (34%), Positives = 22/32 (68%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          +  ++ K  ++E  + K K+++KKKKKKK+ +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 33.8 bits (78), Expect = 0.060
 Identities = 15/59 (25%), Positives = 28/59 (47%), Gaps = 13/59 (22%)

Query: 19 DWKKAKKKEKKKKE-------------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            ++ +K+E K+                  K K  ++E  + + K+K+KKKKKKK+  +
Sbjct: 32 KREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90



 Score = 30.7 bits (70), Expect = 0.76
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 9/53 (16%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKE-----KKKEKKKKE----KKKKKKKKKK 58
               +  +AK KEK+KK+KK K+ E     + +EKKK+E    +KK ++ KK+
Sbjct: 65  TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKKFEEDKKR 117


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 36.5 bits (85), Expect = 0.019
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 10/53 (18%)

Query: 19  DWKKAKKKEKKKKEKKAKK----------KEKKKEKKKKEKKKKKKKKKKKKK 61
           D+KK   KE+K   K   K          K +K+EKK   K++KK  K +K+K
Sbjct: 66  DFKKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118



 Score = 33.9 bits (78), Expect = 0.14
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 1   MDSSTKHQIGTFTEP--TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           M    K  I  F++   T +      +KE+KK   K +KK  K EK+K E++  
Sbjct: 71  MTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 37.1 bits (86), Expect = 0.020
 Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 6/52 (11%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN------NNNNNNNNN 74
            + ++    +E K+E +KK+K KKK +K+K    +NN       N       
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKGFK 390



 Score = 35.2 bits (81), Expect = 0.075
 Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 6/62 (9%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN------NNNNNNN 75
                     ++  + ++ ++ K++ EKK+K KKK +K+K N  +NN       N     
Sbjct: 329 SEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKG 388

Query: 76  NN 77
             
Sbjct: 389 FK 390



 Score = 34.0 bits (78), Expect = 0.18
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 6/68 (8%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN------N 71
           LD K   +            +++  E+ K+E +KK+K KKK +K   N  +NN       
Sbjct: 323 LDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLR 382

Query: 72  NNNNNNNN 79
           N       
Sbjct: 383 NKRKKGFK 390



 Score = 29.4 bits (66), Expect = 5.9
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 11/55 (20%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKK-----------KEKKKKKKKKKKK 59
              D +    +E K++ +K +K +KK  K+K           K + K+KK  K +
Sbjct: 338 IDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKGFKGR 392


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 37.7 bits (87), Expect = 0.020
 Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 9/70 (12%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---------NNNNNNN 76
           K     + K K KEK K  KK     K+ K+  +K    N++ +          N  +  
Sbjct: 215 KNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEK 274

Query: 77  NNNNNNNNNN 86
           N  +   ++ 
Sbjct: 275 NVVSGIRSSV 284



 Score = 36.9 bits (85), Expect = 0.031
 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 8/76 (10%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN--------NNNNNNNNNNNNN 79
            K KEK  +K +  K+ +  +K+ K+  +K K +N++ +          N  +  N  + 
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSG 279

Query: 80  NNNNNNNEEKGTFLYK 95
             ++   + K   L K
Sbjct: 280 IRSSVLLKRKSQCLRK 295



 Score = 35.0 bits (80), Expect = 0.14
 Identities = 14/59 (23%), Positives = 29/59 (49%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           K+K +   K        +KK K    K    +   ++N+NN++ +++N+  +N  +N N
Sbjct: 287 KRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRN 345



 Score = 34.6 bits (79), Expect = 0.15
 Identities = 13/79 (16%), Positives = 32/79 (40%), Gaps = 13/79 (16%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN-------- 72
           +++ +     K     K ++K ++K K  KK +   K+ K+N+  +   N++        
Sbjct: 205 RRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRS 264

Query: 73  ---NNNNNNNN--NNNNNN 86
               N  +  N  +   ++
Sbjct: 265 YTKENKKDEKNVVSGIRSS 283



 Score = 33.8 bits (77), Expect = 0.27
 Identities = 17/112 (15%), Positives = 36/112 (32%), Gaps = 41/112 (36%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK-------KKKKKKKKKNNNNNNNNNNNN 73
            K K+K K+K +   K ++  K+ K+  +K K        +  +   K N  +  N  + 
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSG 279

Query: 74  NNNNN----------------------------------NNNNNNNNEEKGT 91
             ++                                   NN ++N+N +  +
Sbjct: 280 IRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTS 331



 Score = 33.4 bits (76), Expect = 0.35
 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 15/85 (17%)

Query: 26  KEKKKKEKKAKKKEKKKE--KKKKE-------------KKKKKKKKKKKKKNNNNNNNNN 70
           KE KK EK      +     K+K +             KK K    K     NN ++N+N
Sbjct: 267 KENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSN 326

Query: 71  NNNNNNNNNNNNNNNNEEKGTFLYK 95
           N++ +++N+  +N  +    +  YK
Sbjct: 327 NDDTSSSNHLGSNRISNRNPSSPYK 351



 Score = 32.7 bits (74), Expect = 0.56
 Identities = 18/119 (15%), Positives = 43/119 (36%), Gaps = 48/119 (40%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK-------------KEKKKKKKKKKKKKKNNNNNN 67
           +K K+K+K  K+ +   K+ K+  +K             +   K+ KK +K   +   ++
Sbjct: 224 EKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSS 283

Query: 68  -----------------------------------NNNNNNNNNNNNNNNNNNNEEKGT 91
                                              NN ++N+NN++ +++N+    + +
Sbjct: 284 VLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRIS 342



 Score = 31.9 bits (72), Expect = 1.2
 Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 8/80 (10%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN-- 77
           + K ++  +     K    +K KEK K++ K  KK +   K+   N+  +   N++ +  
Sbjct: 201 YDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVR 260

Query: 78  ------NNNNNNNNNEEKGT 91
                   N  +  N   G 
Sbjct: 261 ILRSYTKENKKDEKNVVSGI 280


>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
          family.  YhcN and YlaJ are predicted lipoproteins that
          have been detected as spore proteins but not vegetative
          proteins in Bacillus subtilis. Both appear to be
          expressed under control of the RNA polymerase sigma-G
          factor. The YlaJ-like members of this family have a
          low-complexity, strongly acidic 40-residue C-terminal
          domain that is not included in the seed alignment for
          this model. A portion of the low-complexity region
          between the lipoprotein signal sequence and the main
          conserved region of the protein family was also excised
          from the seed alignment [Cellular processes,
          Sporulation and germination].
          Length = 158

 Score = 35.8 bits (83), Expect = 0.021
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           +KK   NNN    N  + NNN  N  N+ N+ + 
Sbjct: 18 AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDGDL 52



 Score = 35.0 bits (81), Expect = 0.032
 Identities = 15/52 (28%), Positives = 21/52 (40%)

Query: 59  KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEI 110
            +K N  NNN    N  + NNN  N  N+     LY +A  +      V  +
Sbjct: 18  AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDGDLYDVADEIASEAAKVKGV 69



 Score = 27.7 bits (62), Expect = 9.7
 Identities = 9/33 (27%), Positives = 13/33 (39%)

Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           +KK          N  + NNN  N  N+ N+ 
Sbjct: 18 AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDG 50


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 36.3 bits (84), Expect = 0.022
 Identities = 22/77 (28%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           P+P   KK +    +K     K K+K KEK  +E+KK +    K ++N N+ +NN  + +
Sbjct: 46  PSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGD 105

Query: 75  ---NNNNNNNNNNNNEE 88
              + + N+ ++ +NE+
Sbjct: 106 SQESASANSLSDIDNED 122



 Score = 35.1 bits (81), Expect = 0.049
 Identities = 17/55 (30%), Positives = 28/55 (50%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           + K+K+    +K    KK KKK K+K         ++++    N N+ +NNEE G
Sbjct: 50  QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESG 104



 Score = 34.4 bits (79), Expect = 0.090
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           P D +    K+ KKK     +K    +K KK+ K+K  +++KK +++++    N N+ +N
Sbjct: 40  PEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDN 99

Query: 77  N 77
           N
Sbjct: 100 N 100



 Score = 32.4 bits (74), Expect = 0.33
 Identities = 14/63 (22%), Positives = 32/63 (50%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
            K+ KK +    +K    K+ KKK K+K  +++    ++++    N N+ +NN  + + +
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQ 107

Query: 89  KGT 91
           +  
Sbjct: 108 ESA 110



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 13/61 (21%), Positives = 34/61 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK+KKK+K+K  ++ KK E   +K ++ +      ++      + + N+ ++ +N ++ +
Sbjct: 66  KKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMD 125

Query: 81  N 81
           +
Sbjct: 126 D 126



 Score = 30.5 bits (69), Expect = 1.6
 Identities = 13/68 (19%), Positives = 38/68 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K A  K+ KKK+K+   +E+KK +   +K ++ +      + + ++  + + N+ ++ +N
Sbjct: 61  KPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDN 120

Query: 81  NNNNNNEE 88
            ++ ++ +
Sbjct: 121 EDDMDDSD 128



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 15/68 (22%), Positives = 35/68 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
               +K    K+ K K KEK  E++KK +    K ++ +   +NN  + ++  + + N+ 
Sbjct: 56  PTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSL 115

Query: 81  NNNNNNEE 88
           ++ +N ++
Sbjct: 116 SDIDNEDD 123


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 36.2 bits (84), Expect = 0.023
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           WK AK  E+   E+       K+E+  + +KK+ KKKK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           A+ K+ K  E+ A ++ +     K+E+  + +KK+ KKK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 35.0 bits (81), Expect = 0.025
 Identities = 13/70 (18%), Positives = 43/70 (61%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +K  ++E K+KE++ + +E+K+ +++++++ +K+ K++K+        N++  +    + 
Sbjct: 35  RKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDT 94

Query: 81  NNNNNNEEKG 90
            +  ++E +G
Sbjct: 95  EDVEDDEWEG 104



 Score = 34.2 bits (79), Expect = 0.056
 Identities = 14/77 (18%), Positives = 43/77 (55%), Gaps = 8/77 (10%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--------KKNNNNNNNNNNNNNN 75
           K+K++++K+ + + KEK++E++ +E+K+ ++++K++        K+        N++  +
Sbjct: 29  KRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEED 88

Query: 76  NNNNNNNNNNNEEKGTF 92
               +  +  ++E   F
Sbjct: 89  AETEDTEDVEDDEWEGF 105


>gnl|CDD|235760 PRK06265, PRK06265, cobalt transport protein CbiM; Validated.
          Length = 199

 Score = 36.0 bits (84), Expect = 0.025
 Identities = 30/135 (22%), Positives = 45/135 (33%), Gaps = 32/135 (23%)

Query: 227 MYMRRTFLLPYAIVTSLFFIG---QFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAE 283
           +     F    AI+ +L        FGG+T L     G+             TL + +  
Sbjct: 72  VLGWAAFP---AILVALLLQALLFGFGGLTVL-----GV------------NTLNMALPA 111

Query: 284 LGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
           +    L   L+  T           G+ A F+   + A   L+     P VP   L+   
Sbjct: 112 VLAGYLFRKLLRRTPPPRGLAAFLAGALAVFLAALLCA-LELALSGGFPFVPAAKLL--- 167

Query: 344 FLTHICIRLLPWMLI 358
            L H     LP M+I
Sbjct: 168 GLAH-----LPLMVI 177


>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1).  This
           family consists of the yeast mitochondrial ribosomal
           proteins VAR1. Mitochondria possess their own ribosomes
           responsible for the synthesis of a small number of
           proteins encoded by the mitochondrial genome. In yeast
           the two ribosomal RNAs and a single ribosomal protein,
           VAR1, are products of mitochondrial genes, and the
           remaining approximately 80 ribosomal proteins are
           encoded in the nucleus. VAR1 along with 15S rRNA are
           necessary for the formation of mature 37S subunits.
          Length = 350

 Score = 36.8 bits (85), Expect = 0.025
 Identities = 16/35 (45%), Positives = 18/35 (51%)

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
           K NN  NN+NNN N NN  NN N N       + K
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKISMDLLMNK 274



 Score = 36.5 bits (84), Expect = 0.032
 Identities = 16/31 (51%), Positives = 17/31 (54%)

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
           NN NNNNN   NN  NN+NNN N N      
Sbjct: 231 NNINNNNNIKYNNMINNSNNNLNINNIYNNL 261



 Score = 34.9 bits (80), Expect = 0.095
 Identities = 15/27 (55%), Positives = 16/27 (59%)

Query: 60  KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           K NN  NN+NNN N NN  NN N N  
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKI 266



 Score = 33.4 bits (76), Expect = 0.26
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 59  KKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           K  N  NN+NNN N NN  NN N N  + +
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKISMD 269



 Score = 33.0 bits (75), Expect = 0.37
 Identities = 16/28 (57%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 61  KNNNNNNNNNNNNNNNNN--NNNNNNNN 86
             N  NN NNNNN   NN  NN+NNN N
Sbjct: 226 SMNYINNINNNNNIKYNNMINNSNNNLN 253



 Score = 31.8 bits (72), Expect = 1.0
 Identities = 15/27 (55%), Positives = 16/27 (59%)

Query: 60  KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
             N  NN NNNNN   NN  NN+NNN 
Sbjct: 226 SMNYINNINNNNNIKYNNMINNSNNNL 252



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 19/37 (51%), Positives = 19/37 (51%), Gaps = 13/37 (35%)

Query: 63  NNNNNNNNNNNNNN-------------NNNNNNNNNN 86
            NNN NN NNNNNN             N NNNNN NN
Sbjct: 123 YNNNINNINNNNNNYYMNMINKLMNILNINNNNNMNN 159



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 17/40 (42%), Positives = 20/40 (50%)

Query: 57  KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
           K K K+  N  N N    NNN NN NNNNN      + K+
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNNYYMNMINKL 145



 Score = 30.7 bits (69), Expect = 1.9
 Identities = 15/33 (45%), Positives = 17/33 (51%)

Query: 55  KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
             K   KNN N N   N   NN+N  NNNN N+
Sbjct: 305 SNKLYLKNNINKNYKLNYIPNNHNIYNNNNVNK 337



 Score = 29.9 bits (67), Expect = 3.5
 Identities = 19/38 (50%), Positives = 20/38 (52%), Gaps = 13/38 (34%)

Query: 62  NNNNNNNNNNNN-------------NNNNNNNNNNNNN 86
           NN NN NNNNNN             N NNNNN NN +N
Sbjct: 125 NNINNINNNNNNYYMNMINKLMNILNINNNNNMNNISN 162



 Score = 29.9 bits (67), Expect = 3.6
 Identities = 17/31 (54%), Positives = 17/31 (54%)

Query: 56  KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           K K K   N  N N    NNN NN NNNNNN
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNN 136



 Score = 29.9 bits (67), Expect = 3.7
 Identities = 18/38 (47%), Positives = 18/38 (47%), Gaps = 13/38 (34%)

Query: 62  NNNNNNNNNNNNNNNN-------------NNNNNNNNN 86
              NNN NN NNNNNN             N NNNNN N
Sbjct: 121 YYYNNNINNINNNNNNYYMNMINKLMNILNINNNNNMN 158



 Score = 29.1 bits (65), Expect = 5.7
 Identities = 17/43 (39%), Positives = 18/43 (41%)

Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
           K   K    K NNN  NN + N  NNN N   N    KG  L
Sbjct: 1  MKLLLKNYLLKMNNNRMNNMDINIKNNNMNKYLNEYNNKGNKL 43



 Score = 29.1 bits (65), Expect = 6.0
 Identities = 20/71 (28%), Positives = 26/71 (36%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
          K    +        K     K   +   K  K +  NN NN NN   N N NN  N   N
Sbjct: 11 KMNNNRMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNNVINTWIN 70

Query: 86 NEEKGTFLYKM 96
          ++     LYK+
Sbjct: 71 DKLVNKLLYKL 81



 Score = 28.8 bits (64), Expect = 8.1
 Identities = 17/31 (54%), Positives = 17/31 (54%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           K K K   N  N N    NNN NN NNNNNN
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNN 136



 Score = 28.8 bits (64), Expect = 8.6
 Identities = 21/52 (40%), Positives = 21/52 (40%), Gaps = 13/52 (25%)

Query: 51  KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN-------------NNNNNEEK 89
           K K K    K N N    NNN NN NNNNNN             N NNN   
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNNYYMNMINKLMNILNINNNNNM 157



 Score = 28.8 bits (64), Expect = 9.1
 Identities = 14/32 (43%), Positives = 15/32 (46%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
             K   K N N N   N   NN+N  NNNN N
Sbjct: 305 SNKLYLKNNINKNYKLNYIPNNHNIYNNNNVN 336


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 36.9 bits (85), Expect = 0.025
 Identities = 13/50 (26%), Positives = 21/50 (42%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           KK K  +  K E  +  K +E++KKK+K++  KK                
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591



 Score = 36.9 bits (85), Expect = 0.031
 Identities = 14/48 (29%), Positives = 26/48 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
            KA +K K + ++  K +E++K+K+K+   KK   K KK++       
Sbjct: 544 MKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591



 Score = 36.1 bits (83), Expect = 0.047
 Identities = 16/39 (41%), Positives = 21/39 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           ++ KKK K++  KKA  K KK+ K  KEK     KK   
Sbjct: 562 ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600



 Score = 32.7 bits (74), Expect = 0.54
 Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           TE    D K  ++++KK+K + AKK   K +K++K  K+K     KK  + 
Sbjct: 552 TELDRTD-KNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601



 Score = 31.9 bits (72), Expect = 1.1
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K +++ KKK++++A KK   K KK+++  K+K     KK ++ 
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601



 Score = 31.5 bits (71), Expect = 1.5
 Identities = 11/50 (22%), Positives = 21/50 (42%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           KK +  +K K    +  K  +++K+K+K++  KK   K            
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591



 Score = 29.2 bits (65), Expect = 6.1
 Identities = 11/50 (22%), Positives = 19/50 (38%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           KK K  EK K +  +  K ++++KKK+K+                     
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 37.2 bits (87), Expect = 0.026
 Identities = 10/44 (22%), Positives = 17/44 (38%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           L + +    E  K  +K     K +   KK ++  +KKK     
Sbjct: 323 LSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 34.9 bits (80), Expect = 0.026
 Identities = 15/23 (65%), Positives = 16/23 (69%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKK 61
           EK K+KKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 34.5 bits (79), Expect = 0.033
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 7   HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           H   +  EP+  D    +  EK KK+KK K K+ KK KK  +K 
Sbjct: 78  HLTTSSGEPSHPDDHPPEPTEKPKKKKK-KSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.23
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKK 59
           K KKK+KK K+ KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.6 bits (69), Expect = 0.77
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNN 63
            +  EK KK+KKK KK KK KK + 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSK 118



 Score = 29.5 bits (66), Expect = 2.1
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 45  KKKEKKKKKKKKKKKKKNNN 64
           K K+KKKK KK KK KK++ 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSK 118


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 33.8 bits (78), Expect = 0.026
 Identities = 13/35 (37%), Positives = 25/35 (71%), Gaps = 2/35 (5%)

Query: 24 KKKEKKKKEKKAKKKEKKKE--KKKKEKKKKKKKK 56
           ++ ++KK+KK K K+K KE   +KKE+ +++ K+
Sbjct: 36 SQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70



 Score = 31.9 bits (73), Expect = 0.13
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K +   ++ ++ KKK+KK +KK KE   +KK++ +++
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67



 Score = 29.6 bits (67), Expect = 0.85
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           A K  +  ++ +++KKKKKK KKK K
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 39 EKKKEKKKKEKKKKKKKKKKKKKN 62
           K +   ++ ++KKKKKKK KKK+
Sbjct: 30 SKVRRISQRNRRKKKKKKKLKKKS 53



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 27 EKKKKEKKAKKKEKKKEKKK--KEKKKKKKKKKKKK 60
            ++  +K KKK+K K+K K    +KK++ +++ K+
Sbjct: 35 ISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70



 Score = 27.6 bits (62), Expect = 4.0
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 4/34 (11%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKK----KKK 60
             K ++  ++  +KKK+KKK KKK K    +KK
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 27.6 bits (62), Expect = 5.0
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 4/49 (8%)

Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK----KKK 59
          P  L  +  +  E+    K  +  ++ + KKKK+KK KKK K    +KK
Sbjct: 13 PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 27.3 bits (61), Expect = 6.5
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K ++  ++ ++K+KKKK+ KKK K+   +KK
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 26.9 bits (60), Expect = 7.6
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
          ++ ++K+KKKK+ K K KE    KK++ +++ K+
Sbjct: 37 QRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 36.9 bits (86), Expect = 0.027
 Identities = 9/39 (23%), Positives = 16/39 (41%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +K++K+K+ K   +  K    K  EK K    +      
Sbjct: 766 EKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804



 Score = 36.5 bits (85), Expect = 0.039
 Identities = 8/45 (17%), Positives = 14/45 (31%), Gaps = 2/45 (4%)

Query: 20  WKKAKKKEKKKKEKK--AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            K+ + K + +  K   AK  EK K+   +               
Sbjct: 769 KKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAA 813



 Score = 33.1 bits (76), Expect = 0.48
 Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +K KK+++ K E +A K    K  +K +    +        + +
Sbjct: 766 EKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPD 809



 Score = 32.7 bits (75), Expect = 0.66
 Identities = 5/41 (12%), Positives = 7/41 (17%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  K    K  EK      +               K     
Sbjct: 779 EALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVS 819



 Score = 31.1 bits (71), Expect = 1.6
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            EK+ K+KE K E +  +    K  +K K  
Sbjct: 765 TEKQKKEKESKSELEALKGVGAKTAEKLKDA 795



 Score = 30.4 bits (69), Expect = 3.3
 Identities = 4/36 (11%), Positives = 9/36 (25%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           K A+K +    E          +    +       +
Sbjct: 787 KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 10/54 (18%), Positives = 18/54 (33%), Gaps = 12/54 (22%)

Query: 19  DWKKAKKKEKKKKE------------KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
                + K     E            +K KK+++ K + +  K    K  +K K
Sbjct: 740 GLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLK 793



 Score = 30.0 bits (68), Expect = 4.6
 Identities = 7/43 (16%), Positives = 10/43 (23%), Gaps = 1/43 (2%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           L    AK  E K K+   +               K       +
Sbjct: 781 LKGVGAKTAE-KLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 35.8 bits (83), Expect = 0.029
 Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 20  WKKA---KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           WK A    K E+K+KE+ A   +KK + +K  K+ +KK   K  K    
Sbjct: 146 WKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVA 194



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           + A   +KK K +KA K+ +KK   K  K      KK
Sbjct: 162 RAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 36.2 bits (84), Expect = 0.030
 Identities = 17/37 (45%), Positives = 17/37 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
              KK  KK    KAKK   KK   KK  K KK  KK
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 35.1 bits (81), Expect = 0.078
 Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
              KK  +K    K KK   KK   KK  K KK  KK
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 16/51 (31%), Positives = 22/51 (43%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           + T    L+    ++ +    +K AKK    K KK   KK   KK  K KK
Sbjct: 242 SITLEEALELLAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 15/40 (37%), Positives = 18/40 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K    +K  K+  A K +K   KK   KK  K KK  KK
Sbjct: 257 AKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 17/41 (41%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++AK    KK  KKA   + KK   KK   KK  K KK  K
Sbjct: 255 RRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 33.3 bits (77), Expect = 0.030
 Identities = 15/51 (29%), Positives = 29/51 (56%), Gaps = 8/51 (15%)

Query: 22 KAKKKEKKKKEK-----KAKKK---EKKKEKKKKEKKKKKKKKKKKKKNNN 64
          KA ++ K+K EK     + +++   EK  EK+K++K   +K+++KK     
Sbjct: 14 KALRRFKRKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|177978 PLN02344, PLN02344, chorismate mutase.
          Length = 284

 Score = 36.1 bits (84), Expect = 0.032
 Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 446 IRRAEKKRHAARVEGQEEVKGEDNFGMEGEDGKYISRVVGTLYMYFVMPET 496
            +R EKK   AR  GQE V             K    +V  LY  ++MP T
Sbjct: 225 KKRVEKK---ARTFGQE-VTLSGEEDGADPKYKVDPSLVARLYGEWIMPLT 271


>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
          Length = 54

 Score = 32.7 bits (75), Expect = 0.034
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNN 85
            NN  + N +N+++N +++NNNN
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 32.7 bits (75), Expect = 0.034
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 63 NNNNNNNNNNNNNNNNNNNNNNNN 86
            NN  + N +N+++N +++NNNN
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 32.7 bits (75), Expect = 0.037
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 61 KNNNNNNNNNNNNNNNNNNNNNN 83
           NN  + N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51



 Score = 30.8 bits (70), Expect = 0.16
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 60 KKNNNNNNNNNNNNNNNNNNNNN 82
            N  + N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51



 Score = 30.8 bits (70), Expect = 0.19
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 64 NNNNNNNNNNNNNNNNNNNNNNNE 87
            NN  + N +N+++N +++NNN 
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 29.7 bits (67), Expect = 0.43
 Identities = 7/23 (30%), Positives = 15/23 (65%)

Query: 58 KKKKNNNNNNNNNNNNNNNNNNN 80
               + N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51


>gnl|CDD|223475 COG0398, COG0398, Uncharacterized conserved protein [Function
           unknown].
          Length = 223

 Score = 35.8 bits (83), Expect = 0.036
 Identities = 33/201 (16%), Positives = 48/201 (23%), Gaps = 41/201 (20%)

Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVV 292
           F+L Y + T           + L      +F        P+   L   +    G+ L  +
Sbjct: 54  FILLYLVATLPII-----PGSILTLAGGLLF-------GPFLGFLYSLIGATAGSTLAFL 101

Query: 293 LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD-------SPLVPTVFLVLAAFL 345
           L  Y G R   L   GG       V          G+         P+ P   +  AA L
Sbjct: 102 LARYLG-RDWVLKFVGGKEK----VQRIDAGLERNGFWAILLLRLIPIFPFDLVNYAAGL 156

Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFY 405
           T   I    + +                 G       +    L    L        L   
Sbjct: 157 TG--ISFRDFAIA-------------TLLGKLPGTIVYTY--LGSAFLGIGSPLLLLVLA 199

Query: 406 AAISVVGTLYMYFVMPETEGR 426
             I  V  +           R
Sbjct: 200 LLILGVFAIVAKRKELRRHKR 220


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 36.2 bits (83), Expect = 0.037
 Identities = 15/40 (37%), Positives = 20/40 (50%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K K + K K E +A+        KKK K+  KK  K +KK
Sbjct: 263 KGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKK 302



 Score = 35.4 bits (81), Expect = 0.063
 Identities = 16/43 (37%), Positives = 21/43 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            KA+ K K + E +A       +KK KE  KK  K +KK   N
Sbjct: 264 GKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKN 306



 Score = 31.2 bits (70), Expect = 1.3
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           A  K K + + KA+ + +        KKK K+  KK  K  
Sbjct: 260 AALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKME 300


>gnl|CDD|222436 pfam13886, DUF4203, Domain of unknown function (DUF4203).  This is
           the N-terminal region of 7tm proteins. The function is
           not known.
          Length = 210

 Score = 35.7 bits (83), Expect = 0.038
 Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 8/101 (7%)

Query: 232 TFLLPYAIVT-SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLC 290
                + IV+  LF +G   G       A  +       L  Y +  +L V  L GA+L 
Sbjct: 70  LLWWVFLIVSVGLFCVGLLLGFLL----ASALLFLPLGNLYIYHSDWVLLVTLLCGAILF 125

Query: 291 VVLIHYTGKRPLALISTG--GSAACFIVVAVYAQFHLSYGW 329
            +L     ++ L ++ST   G+AA  + V  +A   L Y W
Sbjct: 126 ALLTLKW-QKYLLILSTSVVGAAAVVLGVDYFAGASLKYFW 165



 Score = 29.5 bits (67), Expect = 3.5
 Identities = 27/164 (16%), Positives = 45/164 (27%), Gaps = 34/164 (20%)

Query: 244 FFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLA 303
           F  G              +   I   +        L    +GGA+L ++   +       
Sbjct: 31  FLFGSLIVF--------LLITKITLLVTNARLGASLVAGVVGGAILGLLWWVFL--IVSV 80

Query: 304 LISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLA-------AFLTHICIRLLPWM 356
            +   G    F++ A    F             V LV         A LT    + L   
Sbjct: 81  GLFCVGLLLGFLL-ASALLFLPLGNLYIYHSDWVLLVTLLCGAILFALLTLKWQKYL--- 136

Query: 357 LIGEVFPNNIRATASGASGSSSYIFA---FAVNKLYYPMLDTFH 397
           LI           ++   G+++ +     FA   L Y  L+   
Sbjct: 137 LI----------LSTSVVGAAAVVLGVDYFAGASLKYFWLNILK 170


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 36.4 bits (84), Expect = 0.041
 Identities = 16/69 (23%), Positives = 41/69 (59%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+  K   +K   K K+K K  E+  K + KK++ ++K    + +++++++++++N  + 
Sbjct: 734 KQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESR 793

Query: 81  NNNNNNEEK 89
            ++ N EE+
Sbjct: 794 KSSRNKEEE 802



 Score = 34.5 bits (79), Expect = 0.18
 Identities = 17/72 (23%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 22  KAKKKEKKKKEKKAKKKEKKK-----EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
             KK+  K   +K+  K K+K     E  K E KK++ ++K    + +++++++++++N 
Sbjct: 731 APKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNK 790

Query: 77  NNNNNNNNNNEE 88
            +  ++ N  EE
Sbjct: 791 ESRKSSRNKEEE 802



 Score = 29.9 bits (67), Expect = 4.9
 Identities = 13/78 (16%), Positives = 38/78 (48%), Gaps = 13/78 (16%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK-------------KKKKKNNNNNNNNNN 71
           K  +K      KK+  K   +K   K K+K K             + ++K+++ + ++++
Sbjct: 722 KPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSS 781

Query: 72  NNNNNNNNNNNNNNNEEK 89
           +++++++N  +  ++  K
Sbjct: 782 SHHHSSSNKESRKSSRNK 799


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 36.6 bits (85), Expect = 0.041
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query: 14   EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            EP      ++ K++K   E     +EK  E+   E   KK+K KK+ K+N
Sbjct: 1529 EPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578



 Score = 35.8 bits (83), Expect = 0.072
 Identities = 16/40 (40%), Positives = 24/40 (60%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
            G   E    D  + K K+KKKKEKK +++ K++EK + E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 33.9 bits (78), Expect = 0.28
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 4/43 (9%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +W   K  E K  +      EK K+KKKKEKKK+++ K+++K 
Sbjct: 727 NWNG-KDAEFKISDSVE---EKTKKKKKKEKKKEEEYKREEKA 765



 Score = 32.8 bits (75), Expect = 0.61
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query: 25   KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
             KEKK   +     ++K  ++   +   KK+K KK+  +N
Sbjct: 1539 DKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578



 Score = 32.4 bits (74), Expect = 0.79
 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 11/56 (19%)

Query: 21   KKAKKKEKKKKEKKAKKKEKK--------KEKKKKEK---KKKKKKKKKKKKNNNN 65
            +  ++KE   + +    KEKK         ++K  E+   +   KK+K KK+  +N
Sbjct: 1523 RNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578



 Score = 30.1 bits (68), Expect = 3.7
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K  E K     +EK KKKKKK+KKK+   
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEY 759



 Score = 29.7 bits (67), Expect = 5.4
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K  E K     ++K KKKK+K+KKK+++ K+++ 
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 29.7 bits (67), Expect = 6.0
 Identities = 9/34 (26%), Positives = 16/34 (47%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
                 +EK  +E  A+   KK++ KK+ K   + 
Sbjct: 1548 SVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 29.3 bits (66), Expect = 7.3
 Identities = 13/26 (50%), Positives = 22/26 (84%), Gaps = 1/26 (3%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKK 51
           +EK KK+KK K+K+K++E K++EK +
Sbjct: 742 EEKTKKKKK-KEKKKEEEYKREEKAR 766


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 36.5 bits (84), Expect = 0.041
 Identities = 21/61 (34%), Positives = 35/61 (57%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           K  EK+ KE K    E+ +E +++  +++ K ++     + N     NNNN+N+NNNNNN
Sbjct: 458 KLPEKEAKESKEAAGEEAREGRRRGPRRESKSQEPSGGESPNPELPANNNNSNSNNNNNN 517

Query: 84  N 84
            
Sbjct: 518 G 518



 Score = 31.5 bits (71), Expect = 1.4
 Identities = 17/62 (27%), Positives = 35/62 (56%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           K  EK+AK+ ++   ++ +E +++  +++ K +  +   + N     NNNN+N+NNNN  
Sbjct: 458 KLPEKEAKESKEAAGEEAREGRRRGPRRESKSQEPSGGESPNPELPANNNNSNSNNNNNN 517

Query: 89  KG 90
             
Sbjct: 518 GA 519


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
          70kDa MW N terminal.  This domain is found in
          eukaryotes. This domain is about 90 amino acids in
          length. This domain is found associated with pfam00076.
          This domain is part of U1 snRNP, which is the pre-mRNA
          binding protein of the penta-snRNP spliceosome complex.
          It extends over a distance of 180 A from its RNA
          binding domain, wraps around the core domain of U1
          snRNP consisting of the seven Sm proteins and finally
          contacts U1-C, which is crucial for 5'-splice-site
          recognition.
          Length = 94

 Score = 33.8 bits (78), Expect = 0.041
 Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 19 DWKKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           +    K  K +      +   +K+E++K+EKK+K +KK +++    + N + 
Sbjct: 42 QYLSEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKEWDPNEDP 94


>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390).  This
          family consists of several Paramecium bursaria
          chlorella virus 1 (PBCV-1) proteins of around 250
          residues in length. The function of this family is
          unknown.
          Length = 229

 Score = 35.4 bits (82), Expect = 0.044
 Identities = 17/58 (29%), Positives = 22/58 (37%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
          KK     + K + KE   +E  + +K        N    NNN NN N   N  N   N
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITIN 83



 Score = 35.1 bits (81), Expect = 0.065
 Identities = 16/56 (28%), Positives = 24/56 (42%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
          K KK     + K + K+   ++  + +K     +N N    NNN NN N   N  N
Sbjct: 24 KHKKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKN 79



 Score = 33.1 bits (76), Expect = 0.27
 Identities = 14/53 (26%), Positives = 21/53 (39%)

Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          KK     + K E K+   ++  + +      +N N    NNN NN N     K
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTK 78



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 17/62 (27%), Positives = 23/62 (37%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
          K KK       K E K+   +++ + +K        N    NNN NN N   N  N   N
Sbjct: 24 KHKKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITIN 83

Query: 86 NE 87
            
Sbjct: 84 LT 85



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 18/60 (30%), Positives = 22/60 (36%), Gaps = 1/60 (1%)

Query: 25 KKEKKKKEKKAKKKEK-KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          KK     E K++ KE   +E  + EK             NNN NN N   N  N   N  
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITINLT 85


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 36.2 bits (84), Expect = 0.045
 Identities = 12/88 (13%), Positives = 24/88 (27%), Gaps = 10/88 (11%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN---------- 70
           K+  +K   K + K   K        +EK   K  +K KK N+                 
Sbjct: 445 KEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILE 504

Query: 71  NNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
             ++ +      +   +       +   
Sbjct: 505 KVDSKSEKLEKISAKIDNIKELFDESKM 532



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 15/82 (18%), Positives = 27/82 (32%), Gaps = 7/82 (8%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKE-------KKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           K   +   K+  +K   K+K K            ++K   K  +K KKNN+       + 
Sbjct: 437 KGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDA 496

Query: 74  NNNNNNNNNNNNNEEKGTFLYK 95
                     ++  EK   +  
Sbjct: 497 LKLQEILEKVDSKSEKLEKISA 518


>gnl|CDD|222131 pfam13440, Polysacc_synt_3, Polysaccharide biosynthesis protein. 
          Length = 251

 Score = 35.8 bits (83), Expect = 0.045
 Identities = 31/209 (14%), Positives = 64/209 (30%), Gaps = 29/209 (13%)

Query: 242 SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRP 301
               + +F G+     +++ +  SI         +LLL ++  G     V      G   
Sbjct: 7   LAILLARFLGLEDFGAFSLAL--SI--------ISLLLILSSFGLRPSLVRD--SGGDPR 54

Query: 302 -------LALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLP 354
                   +L++    +    ++               L+  + L+LA       + LL 
Sbjct: 55  ELRQAVRFSLLAALLLSLAVALLVSLIALFFGDEETGLLLLLLGLILA-------LLLLL 107

Query: 355 WMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTL 414
            +L   +F    R  A+  S     +    +  L   +     L   L  +AA ++V  L
Sbjct: 108 DILRSILFARG-RFRAAAVSDLLRLVVVLLL--LVLFLYLGATLASLLLAWAASALVAAL 164

Query: 415 YMYFVMPETEGRTLRDIEEHFADKGKTFV 443
             +++      R  +         G    
Sbjct: 165 IAFYLSRRRPARAPKFALLRHLKYGLKLG 193


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 36.4 bits (85), Expect = 0.046
 Identities = 14/47 (29%), Positives = 24/47 (51%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
            K++ EK   +KE++ EK  KE  +  KK + +KK  + N +     
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 35.6 bits (83), Expect = 0.072
 Identities = 12/46 (26%), Positives = 26/46 (56%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           K++ +K + +K ++ EK  ++  +  KK + +KKK+ KN +     
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 34.4 bits (80), Expect = 0.18
 Identities = 13/44 (29%), Positives = 22/44 (50%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +  K + +K++E +   KE  +  KK E +KKK+ K    K   
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 34.0 bits (79), Expect = 0.24
 Identities = 11/47 (23%), Positives = 24/47 (51%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
             ++ EK + +K+ + ++  KE  +  KK + +KKK+ K  +     
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 32.5 bits (75), Expect = 0.74
 Identities = 13/55 (23%), Positives = 30/55 (54%)

Query: 8   QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +I    + +  + +K + +++++ EK AK+  +  +K + EKKK+ K    K+  
Sbjct: 160 KIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 31.7 bits (73), Expect = 1.1
 Identities = 11/46 (23%), Positives = 24/46 (52%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K+  +K + +KE++ +K  K+  +  K+ + +KKK+ K        
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 31.7 bits (73), Expect = 1.3
 Identities = 10/59 (16%), Positives = 27/59 (45%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            G       +D  K + ++ + ++++  +K  K+  +  +K + +KKK+ K  +     
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 31.3 bits (72), Expect = 1.5
 Identities = 11/62 (17%), Positives = 26/62 (41%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           E    +++ +K E +KE++ ++  K+  +  KK +      + N +            + 
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223

Query: 87  EE 88
           EE
Sbjct: 224 EE 225



 Score = 31.3 bits (72), Expect = 1.7
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K++E +K  K+A +  KK E +KK++ K    K+   
Sbjct: 179 KEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215



 Score = 30.6 bits (70), Expect = 2.8
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           +K ++ EK  KE     K+ + EKKK+ K    K+   
Sbjct: 178 QKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215


>gnl|CDD|224874 COG1963, COG1963, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 153

 Score = 34.7 bits (80), Expect = 0.047
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 7/75 (9%)

Query: 277 LLLGVAELGGALLCVVLIHY--TGKRPL-ALISTGG--SAACFIVVAVYAQFHLSYGWDS 331
           LL  +  +  A +  VLI    T K  +  L STGG  S+   +V A+     L+ G DS
Sbjct: 13  LLSALVAILLAQVIKVLIELIRTRKLNVTLLFSTGGMPSSHSALVTALATSIALTEGLDS 72

Query: 332 PL--VPTVFLVLAAF 344
           PL  +  VF ++  +
Sbjct: 73  PLFAIAAVFAIIVMY 87


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
          identified as the binding site of the SRP72 protein to
          SRP RNA.
          Length = 57

 Score = 32.4 bits (74), Expect = 0.049
 Identities = 17/36 (47%), Positives = 22/36 (61%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          + KK KK  KKK   +++KKK  KKK+K K   N N
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKLPKNFN 36



 Score = 32.0 bits (73), Expect = 0.060
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
          E KK +K AKKK   ++KKKK  KKK+K K
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 27.8 bits (62), Expect = 2.4
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
          +AKK +K  K+K   +++KKK  KKK K K
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 27.0 bits (60), Expect = 4.0
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK 47
          KK KK  KKK   + KKK+  K+K+K 
Sbjct: 3  KKPKKIAKKKPGTEQKKKKSAKKKRKP 29


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 35.0 bits (81), Expect = 0.050
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
             ++K ++KK K   + E+++ K +KEK K +    K
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLK 70



 Score = 33.5 bits (77), Expect = 0.14
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K +K +EKK K + + + ++ K +K+K K +    K
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLK 70



 Score = 31.9 bits (73), Expect = 0.47
 Identities = 9/40 (22%), Positives = 24/40 (60%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
            + A   +++K ++K +K E + E+++ + +K+K K + 
Sbjct: 27 SPECALALKREKAQEKKRKAEAQAERRELKARKEKLKTRS 66



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          E     K+ K +EKK++ + + ++++ K +K+K K 
Sbjct: 29 ECALALKREKAQEKKRKAEAQAERRELKARKEKLKT 64


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 35.0 bits (81), Expect = 0.053
 Identities = 8/41 (19%), Positives = 28/41 (68%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           + +++E++K++++ +++  +  K + E+ + K+K K+ +K 
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161



 Score = 34.2 bits (79), Expect = 0.093
 Identities = 8/42 (19%), Positives = 24/42 (57%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           E+ ++E++ K+ E+++E+  +  K + ++ + K+K       
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 7/41 (17%), Positives = 25/41 (60%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           + E++++EK+ +++ ++  +  K + ++ + K+K K+    
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161



 Score = 31.5 bits (72), Expect = 0.70
 Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           + +K  ++E+++  + AK + ++   K+K K+ +K++ ++ +    N 
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRAANA 173



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 7/41 (17%), Positives = 27/41 (65%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +  +++E+K+ E++ ++  +  + + ++ + K+K K+ +K+
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 9/49 (18%), Positives = 26/49 (53%), Gaps = 7/49 (14%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK-------KKKKKKNNNNN 66
           +K+ +  E+  +++E+K++++++E+  +  K        K+K K     
Sbjct: 113 RKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161


>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
          (Spore_YhcN_YlaJ).  This entry contains YhcN and YlaJ,
          which are predicted lipoproteins that have been
          detected as spore proteins but not vegetative proteins
          in Bacillus subtilis. Both appear to be expressed under
          control of the RNA polymerase sigma-G factor. The
          YlaJ-like members of this family have a low-complexity,
          strongly acidic, 40-residue C-terminal domain.
          Length = 169

 Score = 34.6 bits (80), Expect = 0.057
 Identities = 11/38 (28%), Positives = 14/38 (36%)

Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
          +NNNN       NN    N  NN    E    + +  A
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIERGEVADKIARRVA 76



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 13/29 (44%), Positives = 14/29 (48%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
          NN NNNN       NN    N  NN E+G
Sbjct: 37 NNRNNNNGYGRLTKNNVTGENGYNNIERG 65



 Score = 31.9 bits (73), Expect = 0.49
 Identities = 12/36 (33%), Positives = 14/36 (38%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
           NN NN NNNN       NN    N        ++A
Sbjct: 33 VNNRNNRNNNNGYGRLTKNNVTGENGYNNIERGEVA 68



 Score = 31.9 bits (73), Expect = 0.49
 Identities = 11/28 (39%), Positives = 11/28 (39%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          N NN NN NNNN       NN       
Sbjct: 32 NVNNRNNRNNNNGYGRLTKNNVTGENGY 59



 Score = 31.5 bits (72), Expect = 0.56
 Identities = 13/36 (36%), Positives = 16/36 (44%)

Query: 64 NNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAP 99
          +N+ N NN NN NN NNNN      K     +    
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGRLTKNNVTGENGYN 60



 Score = 31.5 bits (72), Expect = 0.58
 Identities = 13/27 (48%), Positives = 13/27 (48%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEE 88
          NN NN NN NNNN       NN   E 
Sbjct: 31 NNVNNRNNRNNNNGYGRLTKNNVTGEN 57



 Score = 31.5 bits (72), Expect = 0.64
 Identities = 14/32 (43%), Positives = 16/32 (50%)

Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
                N  N  +N+ N NN NN NN NNNN 
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNG 44



 Score = 31.2 bits (71), Expect = 0.77
 Identities = 14/31 (45%), Positives = 14/31 (45%)

Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
            NN NN NN NNNN       NN   E G 
Sbjct: 29 NTNNVNNRNNRNNNNGYGRLTKNNVTGENGY 59



 Score = 30.8 bits (70), Expect = 0.94
 Identities = 9/28 (32%), Positives = 10/28 (35%)

Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
          + NNN       NN    N  NN    E
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIERGE 66



 Score = 30.8 bits (70), Expect = 0.97
 Identities = 13/25 (52%), Positives = 13/25 (52%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
           N NN NN NN NNNN       NN
Sbjct: 28 GNTNNVNNRNNRNNNNGYGRLTKNN 52



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
             +++N  N  +N+ N NN NN NN N   G 
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNGY 45



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 12/27 (44%), Positives = 13/27 (48%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEE 88
          N+ N NN NN NN NNNN         
Sbjct: 26 NDGNTNNVNNRNNRNNNNGYGRLTKNN 52



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
          N  +++N  N  +N+ N NN NN N       Y
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNGY 45



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNN 84
          +N+ N NN NN NN NNNN    
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGR 47



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 63 NNNNNNNNNNNNNNNNNNNNNNN 85
          +N+ N NN NN NN NNNN    
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGR 47



 Score = 28.8 bits (65), Expect = 5.4
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
          NN  +++N  N  +N+ N NN NN   +  
Sbjct: 12 NNAADDDNRQNVTDNDGNTNNVNNRNNRNN 41



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 7/26 (26%), Positives = 8/26 (30%)

Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNN 84
          +  NN       NN    N  NN   
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIER 64


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 35.8 bits (83), Expect = 0.058
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            AK   +K + KKA ++ +++  ++   K KKKK+KKK+ 
Sbjct: 398 SAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEW 437



 Score = 35.4 bits (82), Expect = 0.085
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K A +K + KK  +  ++E  +E   K KKKK+KKK+  +K
Sbjct: 400 KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 32.7 bits (75), Expect = 0.50
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
              K   +K +  K  E+ +E+  +E   K KKKK+KKK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 33.2 bits (76), Expect = 0.060
 Identities = 19/54 (35%), Positives = 26/54 (48%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
          K A  K + K +K A KK+K   K KK  K KK+ K+++ K  N   N      
Sbjct: 28 KPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81



 Score = 32.1 bits (73), Expect = 0.16
 Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 30 KKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          K E K KK   KK+K   K KK  K KK+ K++     N   N      
Sbjct: 33 KPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81



 Score = 27.9 bits (62), Expect = 4.9
 Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 3/37 (8%)

Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
          EP P   KKA  K+KK   K  K  + KKE K++E K
Sbjct: 35 EPKP---KKAAAKKKKPAVKGKKGAKGKKETKQEEAK 68



 Score = 27.5 bits (61), Expect = 6.1
 Identities = 15/49 (30%), Positives = 17/49 (34%)

Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          K + K KK   KKK+   K KK  K KK          N   N      
Sbjct: 33 KPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81



 Score = 27.1 bits (60), Expect = 9.3
 Identities = 17/53 (32%), Positives = 19/53 (35%), Gaps = 13/53 (24%)

Query: 28 KKKKEKKAKKKEKKKEKK-------------KKEKKKKKKKKKKKKKNNNNNN 67
          K+K E K   K  K+E K             K E K KK   KKKK       
Sbjct: 2  KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKK 54


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 34.2 bits (79), Expect = 0.064
 Identities = 17/39 (43%), Positives = 28/39 (71%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           AKK +K++ EK + +K+ KK K+++  KKK++K K K K
Sbjct: 97  AKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNK 135



 Score = 33.4 bits (77), Expect = 0.094
 Identities = 17/40 (42%), Positives = 27/40 (67%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KK KK++ +K   + K K+ K+E+  K+K++K K K KKK
Sbjct: 98  KKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 32.6 bits (75), Expect = 0.21
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K  K++++K   +K  KK K+++  KK+++K K K KKK
Sbjct: 98  KKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 32.2 bits (74), Expect = 0.27
 Identities = 15/38 (39%), Positives = 26/38 (68%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           L  ++ +K   +KK KK K+++  K+K++K+K K KKK
Sbjct: 100 LKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 28.4 bits (64), Expect = 4.7
 Identities = 17/38 (44%), Positives = 24/38 (63%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            +K KKE+  K   +KK KK KE++  KKK++K K  N
Sbjct: 97  AKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKN 134



 Score = 27.6 bits (62), Expect = 8.4
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+   KK +K+  +K   ++K KK K+++  KKK++K 
Sbjct: 93  KEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKD 130


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 35.6 bits (82), Expect = 0.065
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +   KK K KK+ K  KK++ KE   K+ +++ + + + K 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183



 Score = 33.3 bits (76), Expect = 0.32
 Identities = 17/55 (30%), Positives = 22/55 (40%), Gaps = 11/55 (20%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK-----------KKKKKKKKKN 62
           D K  +  E      + K    K+ +KKK KK             KKK K+KK N
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKAN 413



 Score = 32.9 bits (75), Expect = 0.43
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            K +K K KKK +K KKK+ K+   K++  
Sbjct: 144 TKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 32.5 bits (74), Expect = 0.55
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           +   KK K K+K KK KKK+ K+   K    
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 32.5 bits (74), Expect = 0.57
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           E KAKK + KK+ KK +KK+ K+   K  + 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 32.5 bits (74), Expect = 0.61
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           + + KK K KK+ KK KKK+ K+  + ++  
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 31.3 bits (71), Expect = 1.4
 Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 8/49 (16%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEK--------KKKEKKKKKKKKKKKKK 61
           +  +K ++ K+ +K K K+   +K        KKK K+KK   K + KK
Sbjct: 372 RTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKFQGKK 420



 Score = 30.6 bits (69), Expect = 2.6
 Identities = 14/43 (32%), Positives = 20/43 (46%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
              E   K    + K KK + KKK KK KKK+ K+    ++  
Sbjct: 131 SIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 30.2 bits (68), Expect = 3.4
 Identities = 12/31 (38%), Positives = 21/31 (67%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           E K +K K +KK KK KKK+ K++++ ++  
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 29.8 bits (67), Expect = 4.3
 Identities = 10/45 (22%), Positives = 27/45 (60%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           K   KK+ KK +KK  K+   K+ +++ + + + K ++  +++++
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSD 192



 Score = 29.4 bits (66), Expect = 5.9
 Identities = 9/43 (20%), Positives = 23/43 (53%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K++EK++KE++ ++ E +  + K+E    K  +  +   +   
Sbjct: 332 KEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQ 374



 Score = 29.0 bits (65), Expect = 6.9
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           + K KK K K+K K+ KKKE K+   K  +++  
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176



 Score = 29.0 bits (65), Expect = 7.0
 Identities = 9/41 (21%), Positives = 24/41 (58%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           + K K+ K K+K  K K+K+ ++   +  +++ + + + K+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183



 Score = 28.6 bits (64), Expect = 8.5
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          KKAK  E++K  ++ K  EKK +  K  + +++ +  K 
Sbjct: 14 KKAKGFERQKLIRRIKNAEKKSKNDKLARIEEEIEALKT 52



 Score = 28.6 bits (64), Expect = 8.7
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           G  T+      KK  KK KKK+ K++  K+ ++E + +++ K 
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 34.7 bits (80), Expect = 0.070
 Identities = 10/40 (25%), Positives = 33/40 (82%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +K  +++++ + K+ ++ E+++EKKK+E+++K+++++ +K
Sbjct: 31 RKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70



 Score = 32.4 bits (74), Expect = 0.34
 Identities = 12/41 (29%), Positives = 32/41 (78%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K   KK  K +EK+A++++++ E++++E++KK ++K++ ++
Sbjct: 2  KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGER 42



 Score = 32.4 bits (74), Expect = 0.37
 Identities = 11/42 (26%), Positives = 35/42 (83%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          K  +K+E ++KE++  ++E++K+K+++E+K+++++ +K+++ 
Sbjct: 33 KLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74



 Score = 31.2 bits (71), Expect = 0.96
 Identities = 13/50 (26%), Positives = 35/50 (70%), Gaps = 8/50 (16%)

Query: 21 KKAKKKEKKKKEKKA--------KKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K ++K+ ++++++A        KK E+K+E ++KE+++ +++++KKK+ 
Sbjct: 9  AKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 12/41 (29%), Positives = 33/41 (80%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +  +K+E++ +E++ KKKE+++ K+++E+ +K++++ +K K
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79



 Score = 28.9 bits (65), Expect = 5.1
 Identities = 8/47 (17%), Positives = 31/47 (65%), Gaps = 4/47 (8%)

Query: 21 KKAKKKEKKKKEKKAKKKE----KKKEKKKKEKKKKKKKKKKKKKNN 63
          KK  K E+K+  ++ ++ E    ++++K +++++ ++K++++ ++  
Sbjct: 6  KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEER 52



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 8/44 (18%), Positives = 36/44 (81%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          L+ K+  +++++++ ++ ++K+K++E++K+ +++ +K++++ +K
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77



 Score = 28.1 bits (63), Expect = 8.5
 Identities = 9/41 (21%), Positives = 32/41 (78%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          ++ +++E +++ +K K++E++KE++++ +K++++ +K K  
Sbjct: 41 ERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSS 81


>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI).  YihI activates the
           GTPase activity of Der, a 50S ribosomal subunit
           stability factor. The stimulation is specific to Der as
           YihI does not stimulate the GTPase activity of Era or
           ObgE. The interaction of YihI with Der requires only the
           C-terminal 78 amino acids of YihI. A yihI deletion
           mutant is viable and shows a shorter lag period, but the
           same post-lag growth rate as a wild-type strain. yihI is
           expressed during the lag period. Overexpression of yihI
           inhibits cell growth and biogenesis of the 50S ribosomal
           subunit. YihI is an unusual, highly hydrophilic protein
           with an uneven distribution of charged residues,
           resulting in an N-terminal region with high pI and a
           C-terminal region with low pI.
          Length = 169

 Score = 34.2 bits (79), Expect = 0.073
 Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           +K  K   K A K  KKK + + +++ +++K+KKK+K   + + +N  + +         
Sbjct: 1   RKSGKNGPKLAPKG-KKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAAQK 59

Query: 85  NNEEKGTFLYKMAAPLVLVLTYVAEITQP 113
            +   G+   K   PL++      E   P
Sbjct: 60  KDPRIGS---KKPVPLIVEFKVKPEKQAP 85



 Score = 32.7 bits (75), Expect = 0.22
 Identities = 14/57 (24%), Positives = 28/57 (49%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          +K+ K   K   K  KK   + +++ +E+K+KKK+K  K  + +N  + +       
Sbjct: 1  RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAA 57



 Score = 31.9 bits (73), Expect = 0.50
 Identities = 12/56 (21%), Positives = 27/56 (48%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          K  +   K     KK+ + E  ++ +++K+KKK+K  K+ + +N  + +       
Sbjct: 2  KSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAA 57


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
          is a family of proteins that seem to be involved,
          directly or indirectly, in the salt sensitivity of some
          cellular functions in yeast. These proteins also
          interact with the splicing factor Msl1p.
          Length = 189

 Score = 34.5 bits (79), Expect = 0.073
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNN 76
          K KKKKKK+ K  + ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 34.1 bits (78), Expect = 0.086
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 48 EKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           K KKKKKK+ K  + ++N ++ + NN 
Sbjct: 31 SKSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 33.4 bits (76), Expect = 0.17
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          K KKKKKK+ K  + ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 31.4 bits (71), Expect = 0.77
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 47 KEKKKKKKKKKKKKKNNNNNNNNNNNN 73
          K KKKKKK+ K    ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 31.4 bits (71), Expect = 0.81
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNN 79
          K KKKKKK++   + ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 31.1 bits (70), Expect = 0.91
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNN 78
          K KKKKKK+    + ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 30.7 bits (69), Expect = 1.5
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNN 80
          K KKKKKK +   + ++N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 30.3 bits (68), Expect = 1.6
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNN 72
          K +KKKKK+ K     +N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 30.3 bits (68), Expect = 2.0
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNN 71
          K K+KKKK+ K      N ++ + NN 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58



 Score = 29.1 bits (65), Expect = 4.2
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           K KK+KKK+ K        ++ + NN 
Sbjct: 31 SKSKKKKKKRSKATSPSHNASDQSTNNV 58


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 35.6 bits (82), Expect = 0.073
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           P           KKK K+ K+K++  E   KE+ +++  K+  
Sbjct: 211 PPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253



 Score = 33.3 bits (76), Expect = 0.35
 Identities = 18/55 (32%), Positives = 23/55 (41%), Gaps = 13/55 (23%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-------------KKKKKKKKKKN 62
           K AKK +   KE   +K E K   K    K+             KKK K+KK+K 
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKK 233



 Score = 33.3 bits (76), Expect = 0.37
 Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 1/63 (1%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKK 60
            S+ K Q  +    T     K   K    K    KK        KKK K+KK+KK+  + 
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238

Query: 61  KNN 63
              
Sbjct: 239 TVK 241



 Score = 32.5 bits (74), Expect = 0.68
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +   L     KK        K K KEKK++K+  E   K++ +++  K
Sbjct: 203 KAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGK 250



 Score = 32.1 bits (73), Expect = 0.81
 Identities = 11/53 (20%), Positives = 21/53 (39%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
             + +  +   K  KK ++   +   +K E K   K    K+   KK N  ++
Sbjct: 168 PKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSS 220



 Score = 31.7 bits (72), Expect = 1.2
 Identities = 11/41 (26%), Positives = 23/41 (56%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
             P    + + +++ K   +    E++ E+K+KEK+K+ KK
Sbjct: 260 AEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300



 Score = 31.3 bits (71), Expect = 1.2
 Identities = 11/64 (17%), Positives = 21/64 (32%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
                   +K  +     +   K  KK +   KE   +K + K   K  +   N    +N
Sbjct: 157 TANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSN 216

Query: 75  NNNN 78
             ++
Sbjct: 217 IMSS 220



 Score = 31.3 bits (71), Expect = 1.3
 Identities = 13/54 (24%), Positives = 24/54 (44%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            P  +   + K K  KK +  +K+   +K + K   K    K+   KK+N  ++
Sbjct: 167 PPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSS 220



 Score = 30.6 bits (69), Expect = 2.2
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
             +++ + E K   +    E++ +EK+K+K+K+ KK
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300



 Score = 30.2 bits (68), Expect = 3.4
 Identities = 11/50 (22%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKK 61
               L+ + A+     + E + + K   +    +E  ++K+K+K+K+ KK
Sbjct: 251 RDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300



 Score = 29.8 bits (67), Expect = 4.5
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 7/48 (14%)

Query: 21  KKAKKKEKKKKEKK------AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            KA   ++   +K        KKK K+K K+KKE  +   K++ ++++
Sbjct: 202 VKAASLKRNPPKKSNIMSSFFKKKTKEK-KEKKEASESTVKEESEEES 248



 Score = 29.4 bits (66), Expect = 6.2
 Identities = 10/31 (32%), Positives = 13/31 (41%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
              K K K          KK K  KKKK+++
Sbjct: 389 PPTKPKPKVSTPAVPAAAKKPKAPKKKKQSS 419



 Score = 29.0 bits (65), Expect = 6.5
 Identities = 9/41 (21%), Positives = 23/41 (56%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           KK ++KK++K+A +   K+E +++  K+    + +  +   
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTG 264


>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter.  [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 399

 Score = 35.4 bits (82), Expect = 0.074
 Identities = 58/339 (17%), Positives = 95/339 (28%), Gaps = 70/339 (20%)

Query: 106 YVAEITQPHLRG-----MLSATASMTTIFGTVSQLFLGSFLHWRSAAIL-NLLFPILALC 159
            VA       R        SA      I G +    L  F  W+ A I+  +L  I  + 
Sbjct: 106 IVASWFPASERATAVSIFNSAQGLGGIIGGPLVGWILIHF-SWQWAFIIEGVLGIIWGVL 164

Query: 160 ALYFI---PESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLG 216
            L FI   P+   WL  + +       L   +G         +  QI +      LK   
Sbjct: 165 WLKFIPDPPQKAKWLTEEEKYIVVGGLLAEQQGKG---PSTPKKYQIKEL-----LK--- 213

Query: 217 KDGQRRPNYRMYMRRTFLLPYAIVTSLFFIG--------QFGGMTTLQTYAVGIFESIHA 268
                  + R++     L  + +   L F          Q  G++ L+            
Sbjct: 214 -------DRRVWG--LALGQFLVNIGLGFFLTWFPTYLVQERGLSILEAG---------- 254

Query: 269 PLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALIS------TGGSAACFIVVAVYAQ 322
                F   L G+    G +L   L     +R  +L+         G     ++ A    
Sbjct: 255 -----FMASLPGIVGFIGMILGGRLSDLLLRRGKSLVFARKTAIIAGLVLSLLMFA---- 305

Query: 323 FHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFA 382
            +     + P      + L  F        + W LI +  P NI     G   S   +  
Sbjct: 306 TNYV---NIPYAALALVALGFFGLGAGA--IGWALISDNAPGNIAGLTGGLINSLGNLGG 360

Query: 383 FAVNKLYYPMLDTFHLWGTLYFY--AAISVVGTLYMYFV 419
                +   +  T   +        A   +    Y+  V
Sbjct: 361 IVGPIVIGAIAATTGSFAGALMVVAALALIGALSYLLLV 399


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
          eukaryotic family of proteins has no known function.
          Length = 155

 Score = 34.2 bits (79), Expect = 0.075
 Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 21 KKAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          K  K   KKKK    K  K K+  +   + +K +KKK ++ +K++ 
Sbjct: 1  KVTKNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAEKRSL 46


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 35.0 bits (81), Expect = 0.075
 Identities = 15/45 (33%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           +K K+ EK++ EK+ +  EKK E+ +  KK +K+ +K++ K +  
Sbjct: 152 RKLKEAEKQE-EKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 13/50 (26%), Positives = 25/50 (50%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           E K  E+K K+ EK++EK+++  +KK ++ +  KK            +  
Sbjct: 146 ESKSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 12/48 (25%), Positives = 27/48 (56%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           K  ++K K+A+K+E+K+E+  ++K ++ +  KK +K         +  
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 35.4 bits (82), Expect = 0.078
 Identities = 11/38 (28%), Positives = 18/38 (47%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          K A KK  K K+ + +K  K++ K        K + +K
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47



 Score = 34.6 bits (80), Expect = 0.16
 Identities = 8/37 (21%), Positives = 17/37 (45%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K   KK+ K  + +E+K  K++ +        K + +
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSR 46



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 2/38 (5%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          K   KK  KAK+ E+  EK  KE+ K        K  +
Sbjct: 10 KNALKKRLKAKQAEE--EKAAKEEAKAAAAAAAAKGRS 45


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 34.6 bits (80), Expect = 0.079
 Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 14/55 (25%)

Query: 21  KKAKKKEKKKKEK--KAKKKEKKKEKKKK------------EKKKKKKKKKKKKK 61
           KKAKKK+KK+K+    AKK  K K  + K             KK  +  K+K KK
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDLRALKRKAKK 212



 Score = 34.2 bits (79), Expect = 0.094
 Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 17/58 (29%)

Query: 21  KKAKKK-------------EKKKKEKKAKKKEKKKEKKKKEK----KKKKKKKKKKKK 61
           +KAKKK             EK K+ KK  KK KKK+KK+K+     KK  K K  + K
Sbjct: 129 EKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPK 186



 Score = 31.9 bits (73), Expect = 0.62
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 16/55 (29%)

Query: 23  AKKKEKKKKEK---KAKKK-------------EKKKEKKKKEKKKKKKKKKKKKK 61
           AK ++K +  K   KAKKK             EK K+ KK  KK KKKKKK+KK 
Sbjct: 116 AKARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKT 170


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 33.4 bits (77), Expect = 0.081
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 15/57 (26%)

Query: 21  KKAKKKEKKKKEKKAKKKEKK-------KEKKKKE----KKK----KKKKKKKKKKN 62
           K A++++K+K++KK KKKE +       +EKKK+E     KK    K++ +K K+K 
Sbjct: 72  KAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEKR 128



 Score = 31.1 bits (71), Expect = 0.61
 Identities = 17/44 (38%), Positives = 28/44 (63%), Gaps = 6/44 (13%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKK------KKEKKKKKKKKKKKK 60
           A K  +++K++K KKK+KKKE +       +EKKK+++    KK
Sbjct: 70  AGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKK 113



 Score = 27.6 bits (62), Expect = 8.5
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +K     ++  K  +E+KK+K+KKKKKKK
Sbjct: 61 RKGPAAGEEAGKAAEEEKKEKEKKKKKKK 89


>gnl|CDD|187760 cd09320, TDT_like_2, The Tellurite-resistance/Dicarboxylate
           Transporter (TDT) family.  The
           Tellurite-resistance/Dicarboxylate Transporter (TDT)
           family includes members from all three kingdoms, but
           only three members of the family have been functionally
           characterized: the TehA protein of E. coli functioning
           as a tellurite-resistance uptake permease, the Mae1
           protein of S. pombe functioning in the uptake of malate
           and other dicarboxylates, and the sulfite efflux pump
           (SSU1) of Saccharomyces cerevisiae. In plants, the
           plasma membrane protein SLAC1 (Slow Anion
           Channel-Associated 1), which is preferentially expressed
           in guard cells, encodes a distant homolog of fungal and
           bacterial dicarboxylate/malic acid transport proteins.
           SLAC1 is essential  in mediating stomatal responses to
           physiological and stress stimuli. Members of the TDT
           family exhibit 10 putative transmembrane alpha-helical
           spanners (TMSs).
          Length = 327

 Score = 35.0 bits (81), Expect = 0.081
 Identities = 25/120 (20%), Positives = 34/120 (28%), Gaps = 25/120 (20%)

Query: 238 AIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGV-----AELGGALLCVV 292
           A+ T L+ IG   G+ T       +         P     L  V     A  G  L+   
Sbjct: 100 AVDTVLWVIGTVLGLVTAVVIPRRLIR--RHRGTPTPVWGLPVVPPMVSATTGAQLV--- 154

Query: 293 LIHY--TGKRPLALISTGGSAACF---------IVVAVYAQFHLSYGWDSPLVPTVFLVL 341
             H      R   L+      ACF         I   VY +            PT ++ L
Sbjct: 155 -PHLPAGQPRLTLLL---ACYACFGLSLLAAAIIFALVYRRLWRGGPLPLAAAPTAWIPL 210


>gnl|CDD|151365 pfam10918, DUF2718, Protein of unknown function (DUF2718).  This
           viral family of proteins has no known function.
          Length = 140

 Score = 33.8 bits (77), Expect = 0.082
 Identities = 18/98 (18%), Positives = 28/98 (28%), Gaps = 15/98 (15%)

Query: 12  FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN-------- 63
           F +  P   KK    +        +      EK  K K++ +K    ++ NN        
Sbjct: 40  FHDVLPDPHKK--DDDASVPIMPPEPSPIPNEKPLKLKEQAEKPPISERNNNGVFDFMKI 97

Query: 64  -----NNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
                          N N  N       + +  FL KM
Sbjct: 98  PNPFKRYYEYCYPYPNKNAKNQPKRVEEKSEKGFLEKM 135


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 34.9 bits (81), Expect = 0.085
 Identities = 8/41 (19%), Positives = 32/41 (78%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K+ +K+ ++++E K  + +++K ++++E++ +++++K++K
Sbjct: 140 RKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180



 Score = 31.8 bits (73), Expect = 0.82
 Identities = 7/39 (17%), Positives = 29/39 (74%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          K  +K+E+++ ++  +++  K   +++E+++K+K+++++
Sbjct: 33 KAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERRE 71



 Score = 31.8 bits (73), Expect = 0.93
 Identities = 6/42 (14%), Positives = 31/42 (73%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++ +++++ + ++K +K++K +E+  +  +++ ++K+++K+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKE 146



 Score = 31.0 bits (71), Expect = 1.7
 Identities = 11/42 (26%), Positives = 31/42 (73%), Gaps = 1/42 (2%)

Query: 21 KKAKKKEKKKKEKK-AKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK  K E+K++E++  +  E+++ K   E++++++K+K++++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERR 70



 Score = 30.6 bits (70), Expect = 1.8
 Identities = 8/43 (18%), Positives = 32/43 (74%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           +KA+++E+++ E++ +K+EK++E  +   ++++ + ++++ + 
Sbjct: 161 EKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDE 203



 Score = 29.5 bits (67), Expect = 4.4
 Identities = 10/46 (21%), Positives = 33/46 (71%), Gaps = 6/46 (13%)

Query: 22  KAKKKEKKKKEKKAKKKEKKK------EKKKKEKKKKKKKKKKKKK 61
             ++ E+K++EK+ +++E+ K      EK ++E++++ +++++K++
Sbjct: 134 NEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEE 179



 Score = 29.1 bits (66), Expect = 6.6
 Identities = 7/39 (17%), Positives = 31/39 (79%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            ++K ++++E++A+++E+K+EK+++  + + ++++ + +
Sbjct: 159 QREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDE 197



 Score = 29.1 bits (66), Expect = 6.7
 Identities = 10/46 (21%), Positives = 31/46 (67%), Gaps = 6/46 (13%)

Query: 23 AKKKEKKKKEKKAKKKEKK------KEKKKKEKKKKKKKKKKKKKN 62
          A+ +EKK+ + + K++E++      +E+ K   +++++++K+K++ 
Sbjct: 24 AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEER 69



 Score = 28.3 bits (64), Expect = 9.7
 Identities = 12/42 (28%), Positives = 32/42 (76%), Gaps = 5/42 (11%)

Query: 22  KAKKKEKKKKEKKAKK-----KEKKKEKKKKEKKKKKKKKKK 58
           +A+ +EK++K+KK ++      E++ E+K++EK+++++++ K
Sbjct: 113 EAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELK 154


>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
          Length = 536

 Score = 35.1 bits (80), Expect = 0.086
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 56  KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           +++   N++  N NNNN N +NNN N +NNN
Sbjct: 84  QQQPNANDSYPNGNNNNPNGDNNNPNGSNNN 114


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 33.3 bits (76), Expect = 0.088
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 9/46 (19%)

Query: 26  KEKKKKEKKAKKKE---------KKKEKKKKEKKKKKKKKKKKKKN 62
            +   + KK  + +         +++ KK K+K++KKKKKKK  KN
Sbjct: 77  NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKN 122



 Score = 32.1 bits (73), Expect = 0.21
 Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 21  KKAKKKEKKKKEKK-------AKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KA  + KK    +         +++ KK KKK+EKKKKKKK  K   
Sbjct: 77  NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 29.8 bits (67), Expect = 1.4
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          K  ++ K+KK+K+  + E+  EK + + K   + KK  +  +  
Sbjct: 50 KPAEEVKEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQ 93



 Score = 27.5 bits (61), Expect = 8.2
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
           W   ++ +K KK+++ KKK+KK  K   
Sbjct: 97  WMNQRQAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7
          plays a role in protein translation. Deletions of the
          TMA7 gene results in altered protein synthesis rates.
          Length = 63

 Score = 31.7 bits (72), Expect = 0.094
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
          PL   K +KKE  +++   K+K+K++ K  KE   K K K
Sbjct: 10 PLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGK 49



 Score = 31.4 bits (71), Expect = 0.13
 Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 9/42 (21%)

Query: 29 KKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKK 61
          KKK  KA KK+KK         K+K+K+E K  K+   K K 
Sbjct: 7  KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKG 48



 Score = 30.2 bits (68), Expect = 0.34
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K+K+K+E KA K+   K K K        KK  KK
Sbjct: 29 KQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KKK  K  KK++K+ +++    K+K+K++ K  K
Sbjct: 7  KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALK 40


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 35.4 bits (81), Expect = 0.099
 Identities = 16/52 (30%), Positives = 35/52 (67%), Gaps = 5/52 (9%)

Query: 16  TPLDWKKAKKK-----EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           TPL   K  KK     EK K+E + K +E+++ +K+KEK+++++++++ ++ 
Sbjct: 570 TPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERA 621


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 35.2 bits (80), Expect = 0.10
 Identities = 23/78 (29%), Positives = 31/78 (39%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           TE  P +     + E  ++      KE +K  K K+  K       K K     N NNNN
Sbjct: 42  TEEQPSEINTGPRYETAREVSSRDIKELEKSNKVKDANKADLIAMLKAKAEKGPNINNNN 101

Query: 73  NNNNNNNNNNNNNNEEKG 90
           NN +  + N   N E  G
Sbjct: 102 NNYSEQSENAAINEEASG 119



 Score = 31.0 bits (69), Expect = 2.2
 Identities = 17/73 (23%), Positives = 32/73 (43%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           +W++ K +E+  +     + E  +E   ++ K+ +K  K K  N  +             
Sbjct: 36  EWEEEKTEEQPSEINTGPRYETAREVSSRDIKELEKSNKVKDANKADLIAMLKAKAEKGP 95

Query: 79  NNNNNNNNEEKGT 91
           N NNNNNN  + +
Sbjct: 96  NINNNNNNYSEQS 108


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 27/68 (39%)

Query: 19  DWKKAK-KKEKKKKEKKAKK--------------------------KEKKKEKKKKEKKK 51
           DW+  K K+E+K +  K +K                          ++K+KE+KKK K+ 
Sbjct: 139 DWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEA 198

Query: 52  KKKKKKKK 59
           K+++K++K
Sbjct: 199 KRREKEEK 206



 Score = 31.3 bits (71), Expect = 0.87
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           ++K+K+EKK  K+ K++EK++K        
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAALVAA 214



 Score = 30.2 bits (68), Expect = 2.0
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
           K+KE+KKK K+AK++EK++++       
Sbjct: 187 KEKEEKKKVKEAKRREKEEKRMAALVAA 214



 Score = 28.6 bits (64), Expect = 6.7
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 26/68 (38%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEK--------------------------KKKEKKKKK 53
                + +K+K+E+KA+  +++KE+                          K+KE+KKK 
Sbjct: 136 MIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV 195

Query: 54  KKKKKKKK 61
           K+ K+++K
Sbjct: 196 KEAKRREK 203



 Score = 28.2 bits (63), Expect = 8.3
 Identities = 9/54 (16%), Positives = 26/54 (48%), Gaps = 6/54 (11%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK------EKKKKKKKKKKKKKN 62
           P+     +  ++++K+KE + + +E +  K            + +K+K+++K  
Sbjct: 99  PSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKAR 152


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          KK+K ++    +  K+ K K+K  KKK+K   K+  +
Sbjct: 5  KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          KK K ++E   +  ++ K KKK  KKK+K   K+
Sbjct: 5  KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 32.3 bits (74), Expect = 0.45
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          K++K +++ A +  K+ + KKK  KKK+K   K+
Sbjct: 5  KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 32.3 bits (74), Expect = 0.54
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            KK++ +++      K+ K +KK  +KK+K   K+
Sbjct: 3  LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 31.9 bits (73), Expect = 0.56
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          E   K++KA+++   +  K+ + KKK  KKK+K  
Sbjct: 1  ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIY 35



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
          KK K +++   +   + K KKK  KKK K   K+
Sbjct: 5  KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          E   KK++ ++E   +  K+ K KKK  KK   
Sbjct: 1  ELLLKKRKARQELAVQVAKQAKAKKKANKKKRK 33



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 21 KKAKKKEKKKKEKKAKKKEKK--KEKKKKEKKKKKKKKKKKKKN 62
           KAKKK  KKK K   K+ +   KE +K E++  + K+  KK  
Sbjct: 21 AKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPG 64



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KKA KK++K   K+A+   K+  K ++E  + K+  KK  K
Sbjct: 25 KKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 33.5 bits (77), Expect = 0.13
 Identities = 11/49 (22%), Positives = 20/49 (40%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  P   ++  + +K     K K K    E+KK      +  +K +KK
Sbjct: 81  PDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
          coiled-coil proteins that are involved in pre-rRNA
          processing.
          Length = 105

 Score = 32.4 bits (74), Expect = 0.13
 Identities = 12/43 (27%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKK 60
          W+K  +K  +++  KA++KE K EK+  ++ + +  K+++  K
Sbjct: 30 WEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAK 72



 Score = 30.8 bits (70), Expect = 0.54
 Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 5/49 (10%)

Query: 19  DWKKAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKK---KKKKKKKN 62
           D K+A+++ + +  KE++A K+EK++ +K   K   KK    K+++K+N
Sbjct: 52  DEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVERLKRREKRN 100



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 2/37 (5%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKK 55
           ++A K+EK++ EK A K   KK    K++EK+ K  K
Sbjct: 68  RRAAKEEKERYEKMAAKMHAKKVERLKRREKRNKLLK 104


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 34.5 bits (79), Expect = 0.13
 Identities = 13/75 (17%), Positives = 22/75 (29%)

Query: 1   MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +            +      +    +         + KE KK KK +  K  K  K KK 
Sbjct: 108 IHHLQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKP 167

Query: 61  KNNNNNNNNNNNNNN 75
           K   + +N +     
Sbjct: 168 KKKGSVSNRSVKMPG 182



 Score = 31.4 bits (71), Expect = 1.1
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+ KK +K +  K  K  + KK KKK     +  K       
Sbjct: 145 KEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 11/44 (25%), Positives = 16/44 (36%), Gaps = 2/44 (4%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKN 62
            K  +  K  K  K KK +KK     +  K      + K   K+
Sbjct: 150 PKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKS 193



 Score = 29.1 bits (65), Expect = 5.3
 Identities = 10/39 (25%), Positives = 16/39 (41%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
            K+ +K KK +  K  K  K KK     + +N +     
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPG 182



 Score = 28.7 bits (64), Expect = 9.1
 Identities = 15/57 (26%), Positives = 19/57 (33%)

Query: 7   HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           H +      T     K  KK KK +  K  K  K K+ KKK     +  K       
Sbjct: 130 HHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor
          involved in ribosome biogenesis.
          Length = 112

 Score = 32.5 bits (74), Expect = 0.14
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 9/50 (18%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK---------KKKEKKKKKKKKKKKKK 61
          K+ KKK K+   K+ K+++K  E+         KK+ K   + K  KK+K
Sbjct: 50 KQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSKKRAKSIGRAKYVKKRK 99



 Score = 31.7 bits (72), Expect = 0.30
 Identities = 16/52 (30%), Positives = 28/52 (53%), Gaps = 8/52 (15%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEK--------KKKKKKKKKKKKNNNN 65
           +KKK++KKK K+  +K++K++KK  E+         KK+ K   + K    
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSKKRAKSIGRAKYVKK 97



 Score = 29.0 bits (65), Expect = 2.1
 Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 2/52 (3%)

Query: 21 KKAKKKEK--KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          K+  +K        K     +KK+ KKK ++  +K++K++KK         +
Sbjct: 28 KRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTD 79



 Score = 28.2 bits (63), Expect = 4.9
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +AK+ ++K+    A  K +   KKK+ KKK K+  +K++K
Sbjct: 26 RAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRK 65



 Score = 27.8 bits (62), Expect = 5.6
 Identities = 14/66 (21%), Positives = 30/66 (45%), Gaps = 12/66 (18%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKE------------KKKEKKKKEKKKKKKKKKKKKKNNNN 65
          +D  K+      + ++  +K+             KKK++KKK K+  +K++K++KK    
Sbjct: 14 IDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLER 73

Query: 66 NNNNNN 71
               +
Sbjct: 74 AEIVTD 79


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
          Members of this family are associated with type VII
          secretion of WXG100 family targets in the Firmicutes,
          but not in the Actinobacteria. This highly divergent
          protein family consists largely of a central region of
          highly polar low-complexity sequence containing
          occasional LF motifs in weak repeats about 17 residues
          in length, flanked by hydrophobic N- and C-terminal
          regions [Protein fate, Protein and peptide secretion
          and trafficking].
          Length = 150

 Score = 33.1 bits (76), Expect = 0.15
 Identities = 11/59 (18%), Positives = 30/59 (50%)

Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
          E   ++      +E+ + +K+    E++K+   ++ K+K ++++ K +  + N   NN 
Sbjct: 32 EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90



 Score = 32.7 bits (75), Expect = 0.19
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 6/57 (10%)

Query: 24 KKKEKKKKEKKA-----KKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
          K  E      +      K+    +E+KK   +K K+K ++++ KN   + N   NN 
Sbjct: 34 KDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 12/53 (22%), Positives = 26/53 (49%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
             +  ++E +  K+    E++KK   +K K+K ++++  N   + N   NN 
Sbjct: 38 INTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90



 Score = 28.5 bits (64), Expect = 5.3
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          TP + KK   ++ K+K ++ + K +   +   E    K  KK+
Sbjct: 55 TPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQ 97



 Score = 28.5 bits (64), Expect = 5.9
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          K  +      +E+    KE    +++K+   +K K+K +++   N   + N   NN 
Sbjct: 34 KDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90



 Score = 28.1 bits (63), Expect = 7.0
 Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 6/61 (9%)

Query: 14  EPTPLDWKKAKKKEKKK-KEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKNNNNNN 67
           E T LD +    +E+KK   +K K+K +++E K         +    K  KK+  ++   
Sbjct: 45  EETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYE 104

Query: 68  N 68
            
Sbjct: 105 Q 105


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 34.5 bits (78), Expect = 0.15
 Identities = 17/69 (24%), Positives = 32/69 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+  +KE  +      K E++K+K ++EK+K +++K+K     N N            N 
Sbjct: 162 KQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENE 221

Query: 81  NNNNNNEEK 89
             +   E+K
Sbjct: 222 KQDLIKEQK 230



 Score = 31.0 bits (69), Expect = 1.5
 Identities = 18/94 (19%), Positives = 45/94 (47%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           D + K  I    E    + +K K +++  +   ++ K +++++K +++K+K +++K+K  
Sbjct: 142 DRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTS 201

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
           N  N N            N   +  +E+  F+ +
Sbjct: 202 NIANKNAIELEQEKQKTENEKQDLIKEQKDFIKE 235



 Score = 29.9 bits (66), Expect = 4.3
 Identities = 13/61 (21%), Positives = 28/61 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K  ++K+K ++EK+       K   + E++K+K + +K+       +       N   N+
Sbjct: 185 KTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQENH 244

Query: 81  N 81
           N
Sbjct: 245 N 245



 Score = 29.1 bits (64), Expect = 6.3
 Identities = 13/64 (20%), Positives = 31/64 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           +K K +++K+K ++ K+K      K   + +++K+K + +K +      +       N  
Sbjct: 182 EKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQ 241

Query: 81  NNNN 84
            N+N
Sbjct: 242 ENHN 245



 Score = 29.1 bits (64), Expect = 6.4
 Identities = 15/69 (21%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN--NNNN 78
           K  ++K+K ++EK+  ++EK+K      K   + +++K+K  N   +      +      
Sbjct: 178 KAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAE 237

Query: 79  NNNNNNNNE 87
            N   N+N+
Sbjct: 238 QNCQENHNQ 246


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 33.9 bits (78), Expect = 0.15
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
             ++E  KK K  KKK  K +  K   K KK++K+
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 18/41 (43%), Positives = 28/41 (68%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           KE+K++EKK KK E++ E +++  KK K KKKK  K+ +  
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGK 228



 Score = 32.7 bits (75), Expect = 0.39
 Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 7/54 (12%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKK-------KKEKKKKKKKKKKKKKNNNNNN 67
           K AKKK K  KE K +K+ +KK KK       ++E  KK K KKKK   + +  
Sbjct: 175 KSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGK 228



 Score = 32.3 bits (74), Expect = 0.47
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E +++  K  K +KKK  K K+ K   K KK++K+
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 31.6 bits (72), Expect = 0.76
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK +++ + ++E   K K KKK+  K +  K   K KK++K
Sbjct: 198 KKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERK 238



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 16/41 (39%), Positives = 23/41 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK KK E++ + ++   K+ K +KKK  K K  K   K KK
Sbjct: 195 KKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 6/48 (12%)

Query: 21  KKAKKKEKK------KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+ K++EKK      + E + +  +K K KKKK  K K  K   K K 
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 17/41 (41%), Positives = 24/41 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
               +K     EK+ KK  KKK K  KE K++K+++KK KK
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKK 199



 Score = 29.3 bits (66), Expect = 4.6
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           + + + +  K+ K KKK+  K+K  K   K KK++K+
Sbjct: 203 RLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 28.9 bits (65), Expect = 7.0
 Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 17/68 (25%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKK----KEKKKKEKK-------------KKK 53
               P     +K    ++K+K+K AKKK K     KE+K++EKK               K
Sbjct: 153 RENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMK 212

Query: 54  KKKKKKKK 61
           K K KKKK
Sbjct: 213 KGKGKKKK 220



 Score = 28.5 bits (64), Expect = 8.8
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            +  + +K     +K+KKK  KKK K  K+ K++K+++ 
Sbjct: 157 PRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREK 195


>gnl|CDD|219511 pfam07681, DoxX, DoxX.  These proteins appear to have some sequence
           similarity with pfam04173 but their function is unknown.
          Length = 84

 Score = 31.8 bits (73), Expect = 0.17
 Identities = 28/95 (29%), Positives = 40/95 (42%), Gaps = 15/95 (15%)

Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQ---TYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
            LL   ++  L FI    G+  L        G F S+  P  P     L  + EL G LL
Sbjct: 1   GLLLLRLLLGLLFI--AHGLQKLFGGFAGTAGYFASLGLPP-PALLAYLAALLELVGGLL 57

Query: 290 CVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFH 324
            ++L  +T  R  AL       A F++VA++   H
Sbjct: 58  -LLLGLFT--RLAALG-----LAVFMLVAIFL-VH 83


>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
           I [DNA replication, recombination, and repair].
          Length = 140

 Score = 32.9 bits (75), Expect = 0.18
 Identities = 17/47 (36%), Positives = 23/47 (48%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
            +K K KEKK  +  ++K  +KKEKKK    +   K    K  N  N
Sbjct: 94  TEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140



 Score = 29.1 bits (65), Expect = 3.6
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           +  +K K KEKK  K    K  +KK+KKK      +N
Sbjct: 92  RFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSN 128



 Score = 27.9 bits (62), Expect = 8.7
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           +  K ++KK  K   +K  +K++KKK    +   K    K  N  N
Sbjct: 95  EKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 34.3 bits (79), Expect = 0.18
 Identities = 13/41 (31%), Positives = 16/41 (39%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           ++ K    KK   KA      K +K   KK   KK   KK 
Sbjct: 816 REPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKT 856



 Score = 34.3 bits (79), Expect = 0.20
 Identities = 15/45 (33%), Positives = 17/45 (37%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           EP     KK   K       KA+K   KK   KK   KK   +K 
Sbjct: 817 EPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861



 Score = 32.3 bits (74), Expect = 0.82
 Identities = 14/40 (35%), Positives = 16/40 (40%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            A KK   K    A  K +K   KK   KK   KK   +K
Sbjct: 821 AAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARK 860



 Score = 31.9 bits (73), Expect = 1.0
 Identities = 11/36 (30%), Positives = 13/36 (36%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            ++ K    KK   K       K EK   KK   KK
Sbjct: 815 PREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKK 850



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 17/48 (35%), Positives = 18/48 (37%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           EP       AKK   K     A K EK   KK   KK   KK   +K 
Sbjct: 814 EPREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
          short proteins that are rich in aspartate, glutamate,
          lysine and arginine. Although the function of these
          proteins is unknown, they are found to be ubiquitously
          expressed.
          Length = 38

 Score = 30.3 bits (69), Expect = 0.18
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          + +E+ AKK  KKK+K  K + K   K  
Sbjct: 9  QARERNAKKAAKKKKKGAKSQLKAAAKAL 37



 Score = 27.6 bits (62), Expect = 1.6
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          ++E+  ++  KK  KKKK+  K + K   K   
Sbjct: 6  QREQARERNAKKAAKKKKKGAKSQLKAAAKALE 38


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 32.0 bits (73), Expect = 0.18
 Identities = 16/35 (45%), Positives = 19/35 (54%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          KK++KKA K        K +KKK  K K K K NN
Sbjct: 4  KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38



 Score = 31.6 bits (72), Expect = 0.30
 Identities = 15/35 (42%), Positives = 17/35 (48%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          KK++K   K        K KKKK  K K K K NN
Sbjct: 4  KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38



 Score = 30.4 bits (69), Expect = 0.61
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          KK+++K AK        K K+KK  K K K K  N
Sbjct: 4  KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 15/34 (44%), Positives = 17/34 (50%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
            KK++KK  K A      K KKKK  K K K K
Sbjct: 2  PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKK 59
            KK++KK  K        K KKKK
Sbjct: 2  PPKKQQKKAAKAAAASAGGKAKKKK 26


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 33.2 bits (76), Expect = 0.18
 Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 1/58 (1%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           D    ++     +  P    +    E K   +++   E  KE K   K K KK  K+K
Sbjct: 102 DLLQDYKKSLEEDTIPNHLNEEVSNETKLT-QESSSDESPKEVKLATKNKTKKHDKEK 158



 Score = 30.9 bits (70), Expect = 0.93
 Identities = 12/67 (17%), Positives = 29/67 (43%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
             KKK K ++    K   + + K  ++  +  KK  ++    N+ N   +N       ++
Sbjct: 76  SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135

Query: 82  NNNNNEE 88
           ++ + +E
Sbjct: 136 SDESPKE 142



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 6/63 (9%), Positives = 21/63 (33%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           E + K  +   ++ KK  ++        ++   +      ++++ +         N    
Sbjct: 94  ENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKK 153

Query: 87  EEK 89
            +K
Sbjct: 154 HDK 156



 Score = 29.7 bits (67), Expect = 2.4
 Identities = 12/64 (18%), Positives = 24/64 (37%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
             E  KK+ K ++ +  K   + E K  +   +  KK+   +   N+ N   +N      
Sbjct: 73  AVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQ 132

Query: 85  NNEE 88
            +  
Sbjct: 133 ESSS 136



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 2/69 (2%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK  K E+    K   + E K    +   +  KK  ++    N+ N   +N       ++
Sbjct: 78  KKKLKFEELDDLKITAENEIKS--IQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135

Query: 81  NNNNNNEEK 89
           ++ +  E K
Sbjct: 136 SDESPKEVK 144


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K K  ++  +  +   + ++  K+KK+KKKKKKKK++ ++
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELRE 104



 Score = 28.4 bits (64), Expect = 9.1
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
            +  ++  + +E   +KK+KKK+KKKKE+ ++ 
Sbjct: 73  GELYERADRAEELLKEKKKKKKKKKKKEELREW 105


>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
           metabolism].
          Length = 466

 Score = 34.0 bits (78), Expect = 0.19
 Identities = 21/144 (14%), Positives = 47/144 (32%), Gaps = 18/144 (12%)

Query: 227 MYMRRTFLLPYAIVTS-LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG 285
            Y +R        +   L+ +     +  +   A      +           LL +  + 
Sbjct: 80  AYAKRALGPRLGFLAGWLYLLAYVIALAAIAIGAASYLSYLFPGPGLLSIGPLLIIL-IA 138

Query: 286 GALLCVV-LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYG---------WDSPLVP 335
            AL+ ++ L++  G +  A I++  +    I++ ++    L              +P   
Sbjct: 139 LALIALLTLLNLRGIKASAKINSIITILKIIILLIFIILGLFAFGFSNGNLFAPFNPGGG 198

Query: 336 TVFLVLAAFLTHICIRLLPWMLIG 359
           +   +LAA L      L  +   G
Sbjct: 199 SFGGILAAIL------LAFFAFTG 216


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 33.6 bits (77), Expect = 0.21
 Identities = 15/47 (31%), Positives = 26/47 (55%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            E          ++ KKK+E +AKKK++ +E K  +KK  +  K+K+
Sbjct: 76  DEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122



 Score = 30.9 bits (70), Expect = 1.4
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
               + ++ KK ++ E KK+K+ +E K  +KK  +  K   
Sbjct: 82  ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122



 Score = 28.6 bits (64), Expect = 8.6
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           +  +P       A   E ++ +KK + + KKK++ ++ K  +KK
Sbjct: 70  SDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113


>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
           Members of this family are absolutely restricted to the
           Mollicutes (Mycoplasma and Ureaplasma). All have a
           signal peptide, usually of the lipoprotein type,
           suggesting surface expression. Most members have lengths
           of about 280 residues but some members have a nearly
           full-length duplication. The mostly nearly invariant
           residue, a Trp,is part of a strongly conserved 9-residue
           motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
           is hydrophobic. Because the hydrophobic six-residue core
           of this motif almost always contains three to four
           aromatic residues, we name this family aromatic cluster
           surface protein. Multiple paralogs may occur in a given
           Mycoplasma, usually clustered on the genome.
          Length = 297

 Score = 33.5 bits (77), Expect = 0.21
 Identities = 15/84 (17%), Positives = 31/84 (36%), Gaps = 8/84 (9%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKE--------KKKKEKKAKKKEKKKEKKKKEKKKKKK 54
              K++     +     WK     +          K ++  K K  K +K    +   K 
Sbjct: 34  KLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLKL 93

Query: 55  KKKKKKKNNNNNNNNNNNNNNNNN 78
            K   +  NN NN+++N+++  + 
Sbjct: 94  LKLSLEYLNNINNSSSNDSDFFSY 117



 Score = 32.0 bits (73), Expect = 0.78
 Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 17/92 (18%)

Query: 21  KKAKKKEKKKKEKKAK----------------KKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
              KK +K KK K                   + EK K  K ++    +   K  K +  
Sbjct: 40  FDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLKLLKLSLE 99

Query: 65  -NNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
             NN NN+++N+++  +     +++K   L+ 
Sbjct: 100 YLNNINNSSSNDSDFFSYLYEKSKKKLNELFS 131



 Score = 29.7 bits (67), Expect = 3.5
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 17/92 (18%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--------------KKKKKKKKKKKNNNNN 66
           +   KK  K K    K K+ KK K     K               +K K  K +KN N+ 
Sbjct: 29  ENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSE 88

Query: 67  NNNNNNNNNN---NNNNNNNNNNEEKGTFLYK 95
                   +    NN NN+++N+ +  ++LY+
Sbjct: 89  LYLKLLKLSLEYLNNINNSSSNDSDFFSYLYE 120



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 19/70 (27%), Positives = 26/70 (37%), Gaps = 7/70 (10%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN-------NNNNNNNNNNNNNNNNN 85
          +     +K  + K   KK KK KK K   NN   NN       N+ +  N    +  N N
Sbjct: 27 QSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNIN 86

Query: 86 NEEKGTFLYK 95
          +E     L  
Sbjct: 87 SELYLKLLKL 96



 Score = 28.5 bits (64), Expect = 8.5
 Identities = 14/51 (27%), Positives = 21/51 (41%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           K+ E+     K       K  KKK K+N   +N  +   N   +  +NN N
Sbjct: 157 KQNEQFNIDAKHNSLFLGKFLKKKPKSNKFIDNYIDKLKNGEESAESNNYN 207


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 33.5 bits (77), Expect = 0.21
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 13/57 (22%)

Query: 23  AKKKEKKKKEKKAKKKEKKK----------EKKKKEKKK---KKKKKKKKKKNNNNN 66
           A+   KK  E+  K++E KK          +++ KEKK    K KK KKK+K   + 
Sbjct: 126 AEAAAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDE 182



 Score = 33.1 bits (76), Expect = 0.32
 Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 4/43 (9%)

Query: 22  KAKK-KEKKKKEKK---AKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +A+K +E KK  K+    K +E+ KEKK   +K KK KKK+K 
Sbjct: 136 EARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKG 178



 Score = 30.8 bits (70), Expect = 1.8
 Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 20/66 (30%)

Query: 19  DWKKAKK------------KEKKKKEKKAKKKEKKKEKKKKEKKKKKKK--------KKK 58
             KK +K             +   K    KK  K    + K   K+K K        KK+
Sbjct: 171 KLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFGFGGKKR 230

Query: 59  KKKNNN 64
             K+N 
Sbjct: 231 GSKSNT 236


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
          pigmentation in the mouse hair follicle producing a
          black hair with a subapical yellow band. A highly
          homologous protein agouti signal protein (ASIP) is
          present in humans and is expressed at highest levels in
          adipose tissue where it may play a role in energy
          homeostasis and possibly human pigmentation.
          Length = 124

 Score = 32.2 bits (73), Expect = 0.22
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            KK KK   ++ EKK  +KK+KK   N
Sbjct: 50 LNKKPKKISAEEAEKKLLQKKEKKALTN 77


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 33.3 bits (76), Expect = 0.22
 Identities = 18/66 (27%), Positives = 32/66 (48%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           ++ E+KK  +  KK+ +      +E K+KKKKK KKKK+  +   N      +  ++  +
Sbjct: 64  RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVH 123

Query: 84  NNNEEK 89
               E 
Sbjct: 124 EQKSET 129



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 15/36 (41%), Positives = 21/36 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           KK  +  KK+ E      ++ K+KKKK+ KKKK  K
Sbjct: 69  KKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family
          consists of several ATP synthase E chain sequences
          which are components of the CF(0) subunit.
          Length = 83

 Score = 31.3 bits (71), Expect = 0.22
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
          KKKE+K +E +A++K  +K K    KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67



 Score = 30.1 bits (68), Expect = 0.64
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K+K++K ++ + +EK  EK K    KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67


>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain.  Sof1 is essential for cell
          growth and is a component of the nucleolar rRNA
          processing machinery.
          Length = 88

 Score = 31.5 bits (72), Expect = 0.23
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK +K K+E K  KK K++ ++K  K      K ++KK
Sbjct: 48 KKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKK 85



 Score = 29.9 bits (68), Expect = 0.67
 Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 24 KKKEKKKKEKKAKK-KEKKKEKKKKEKKKKKKKKKKKK 60
          K ++ K++ K+AKK KE+ + K  K      K ++KK 
Sbjct: 49 KAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKH 86


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 34.0 bits (77), Expect = 0.24
 Identities = 13/41 (31%), Positives = 26/41 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K +++++ +KE++ KK +   +   K+KK+   KK KK K
Sbjct: 439 RKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGK 479



 Score = 32.4 bits (73), Expect = 0.70
 Identities = 12/43 (27%), Positives = 20/43 (46%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           K   K  +++E +KE++ K+ K       K KK + N  N   
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKG 478



 Score = 32.1 bits (72), Expect = 0.90
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K   K ++ ++ +K+ + K+ K       K KK+   K N   
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKG 478



 Score = 29.8 bits (66), Expect = 4.5
 Identities = 16/66 (24%), Positives = 31/66 (46%), Gaps = 6/66 (9%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK------KKKKKKNNN 64
           TFT   P   +     + +   +K  +K +++++ +KE++ KK K       K KK++ N
Sbjct: 413 TFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESIN 472

Query: 65  NNNNNN 70
             N   
Sbjct: 473 KKNKKG 478



 Score = 28.6 bits (63), Expect = 9.3
 Identities = 13/52 (25%), Positives = 27/52 (51%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           +PT  ++KK    +K   EK+ + K   ++ +  + + K KK+K +K +   
Sbjct: 557 DPTHPEFKKTGGMKKIMDEKRKRLKNNIEQTQDGKPELKIKKRKAEKGDQRQ 608


>gnl|CDD|223553 COG0477, ProP, Permeases of the major facilitator superfamily
           [Carbohydrate transport and metabolism / Amino acid
           transport and metabolism / Inorganic ion transport and
           metabolism / General function prediction only].
          Length = 338

 Score = 33.5 bits (75), Expect = 0.24
 Identities = 32/236 (13%), Positives = 59/236 (25%), Gaps = 9/236 (3%)

Query: 96  MAAPLVLVLTYVAEITQPHLR--GMLSATASMTTIFGTVSQLFLGSFLHWRSAAILNLLF 153
               L +    ++E           +          G      L   L           F
Sbjct: 108 GGGLLPVASALLSEWFPEATERGLAVGLVTLGAGALGLALGPLLAGLLLGALLWGWRAAF 167

Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
            + AL  L  +      L+    +  A+  L              +L  +   +    L 
Sbjct: 168 LLAALLGLLLL-----ILVLLLDLLLAAPRL--PLAVGLALIELGKLLLLLLILLLGGLV 220

Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
            L                   L  A++           +  +   A G   ++       
Sbjct: 221 LLLLGLYLSAAGFGLRALLLGLGLALLALFVGALILLVLLPILLGAAGGLSALVGAAGGL 280

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGW 329
            A +LLG+A L G L+   L+     R L L +   + A  + + +          
Sbjct: 281 LALILLGLAGLLGTLVGGSLLVDRLGRSLGLRAILLAGALLLAILLLVLLLALVLT 336


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 31.8 bits (73), Expect = 0.24
 Identities = 11/39 (28%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 24  KKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKKK 61
            +++ K +EK+ +  E+++E K+ + K    K +KK++K
Sbjct: 66  AERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRK 104



 Score = 27.2 bits (61), Expect = 9.3
 Identities = 12/44 (27%), Positives = 28/44 (63%), Gaps = 4/44 (9%)

Query: 21  KKAKKKEKKKK----EKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           ++AK +EK+++    E++ K+ + K +  K EKK++K  + + +
Sbjct: 68  RQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAE 111


>gnl|CDD|220909 pfam10935, DUF2637, Protein of unknown function (DUF2637).  This
           family of proteins has no known function.
          Length = 139

 Score = 32.4 bits (74), Expect = 0.25
 Identities = 16/124 (12%), Positives = 27/124 (21%), Gaps = 28/124 (22%)

Query: 303 ALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVF 362
           A ++    A  F  +   A      G  + L P V   L    T   +  L         
Sbjct: 8   AAVAGAAFALSFTALRDLAIAAGGPGSLAWLFPLVIDGLILAAT-AALLALAL------- 59

Query: 363 PNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPE 422
                     A+      +                       +AA+S+   +        
Sbjct: 60  ------AGRSAALYRLLAWLLVA------------ASTVAPVFAAMSLAANV--LHAYLL 99

Query: 423 TEGR 426
              R
Sbjct: 100 AAAR 103


>gnl|CDD|216669 pfam01732, DUF31, Putative peptidase (DUF31).  This domain has no
           known function. It is found in various hypothetical
           proteins and putative lipoproteins from mycoplasmas. It
           appears to be related to the superfamily of trypsin
           peptidases and so may have a peptidase function.
          Length = 307

 Score = 33.6 bits (77), Expect = 0.25
 Identities = 17/93 (18%), Positives = 24/93 (25%), Gaps = 10/93 (10%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK----------KKKKKKKKKK 60
             T      +       KK K  K K   K  +    +            +         
Sbjct: 188 NITNDAINAYAALDNSLKKFKFLKLKSLLKDTKYSDLDTPLLYALGYPLSENDYFSNSLW 247

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
            N +N  N N N+ N N    +N      G  L
Sbjct: 248 INYDNKLNKNLNSPNTNKFKKDNIEGGGLGYLL 280


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 33.6 bits (78), Expect = 0.26
 Identities = 18/105 (17%), Positives = 39/105 (37%), Gaps = 6/105 (5%)

Query: 21  KKAKKKEKKKKEKKAK------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           KK    E K++  K +       +E++ E +K EK+  +K++   +K             
Sbjct: 55  KKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEK 114

Query: 75  NNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLRGML 119
                       E+K   L ++    +  L  ++ +T    + +L
Sbjct: 115 KEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159



 Score = 33.6 bits (78), Expect = 0.26
 Identities = 15/45 (33%), Positives = 29/45 (64%), Gaps = 2/45 (4%)

Query: 21 KKAKK-KEKKKKEKKAKKKEKKKEKKKKEKKKKKK-KKKKKKKNN 63
          ++AK+  E+ KKE +A KKE   E K++  K + + +K+ +++ N
Sbjct: 38 EEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82



 Score = 32.1 bits (74), Expect = 0.96
 Identities = 15/71 (21%), Positives = 29/71 (40%), Gaps = 3/71 (4%)

Query: 22  KAKKKEKK---KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           +A KKE     K+E    + E +KE +++  + +K +K+  +K  N +            
Sbjct: 52  EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111

Query: 79  NNNNNNNNEEK 89
                   E+K
Sbjct: 112 LEKKEKELEQK 122



 Score = 29.7 bits (68), Expect = 4.1
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK  E K K+ E++ ++  +E KK+ +  KK+  
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEAL 59



 Score = 29.0 bits (66), Expect = 7.0
 Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 4/47 (8%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK----KKKKKKKKN 62
           KK  + + K+ E++AK+  ++ +K+ +  KK+     K++  K +N
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRN 71


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 33.4 bits (77), Expect = 0.27
 Identities = 7/48 (14%), Positives = 25/48 (52%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
                       +KKK++KK+++K+ K+ + +++++ + +++      
Sbjct: 170 GLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217



 Score = 31.1 bits (71), Expect = 1.6
 Identities = 8/47 (17%), Positives = 26/47 (55%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
                 +KKK++KK+++K++     +++++ + +++        EE+
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEER 222



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 6/42 (14%), Positives = 25/42 (59%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
               + +KK+++KK+++K+ K+   +++++ + +++      
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217



 Score = 28.4 bits (64), Expect = 9.5
 Identities = 6/41 (14%), Positives = 23/41 (56%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
                ++KK++KK+++K+ K    +++++ + +++      
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
          family consists of several repeats of around 30
          residues in length which are found specifically in
          mature-parasite-infected erythrocyte surface antigen
          proteins from Plasmodium falciparum. This family often
          found in conjunction with pfam00226.
          Length = 215

 Score = 32.9 bits (74), Expect = 0.27
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
          D +  K+K   KK+K    ++ +KE K+K +K+ K+K KKK K     N+   N+
Sbjct: 7  DKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGND 61


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 32.9 bits (75), Expect = 0.28
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKK 45
           +P     +K KK + KK + K KK E KK+ +
Sbjct: 83  KPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 32.9 bits (75), Expect = 0.30
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           ++K    A +K+KK + KK + K KK + KKK +   NN
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 31.3 bits (71), Expect = 0.76
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           T + W++       +K+KK K K+ K + KK E KKK +
Sbjct: 76  TGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 31.0 bits (70), Expect = 1.1
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           EKKK+ K K+ K K KK + KKK     NN
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 16/39 (41%), Positives = 21/39 (53%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           ++K      ++KKK K KK K K KK + KKK     NN
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 29.4 bits (66), Expect = 4.0
 Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           EKKKK K KK K K KK ++        NN
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           ++K      +KK+K K KK K K KK+ +        NN
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 29.0 bits (65), Expect = 4.8
 Identities = 15/39 (38%), Positives = 20/39 (51%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           + K A    +KK+K K +K K K KK + KK      NN
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 28.6 bits (64), Expect = 7.0
 Identities = 12/30 (40%), Positives = 14/30 (46%)

Query: 48  EKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           EKKKK K KK K K   + +        NN
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119


>gnl|CDD|182964 PRK11102, PRK11102, bicyclomycin/multidrug efflux system;
           Provisional.
          Length = 377

 Score = 33.3 bits (77), Expect = 0.28
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 7/44 (15%)

Query: 377 SSYIFAFAVNKLYY-PMLDTFH-----LWGTLYFYAAISVVGTL 414
           S+YI  FA+ +L+Y PM D+F      L GTL F A  +V   L
Sbjct: 33  SAYILGFAIGQLFYGPMADSFGRKPVILGGTLVF-ALAAVACAL 75


>gnl|CDD|238227 cd00386, Heme_Cu_Oxidase_III_like, Heme-copper oxidase subunit III.
            Heme-copper oxidases are transmembrane protein
           complexes in the respiratory chains of prokaryotes and
           mitochondria which couple the reduction of molecular
           oxygen to water to, proton pumping across the membrane.
           The heme-copper oxidase superfamily is diverse in terms
           of electron donors, subunit composition, and heme types.
            This superfamily includes cytochrome c and ubiquinol
           oxidases.  Bacterial oxidases typically contain 3 or 4
           subunits in contrast to the 13 subunit bovine cytochrome
           c oxidase (CcO). Subunits I, II, and III of mammalian
           CcO are encoded within the mitochondrial genome and the
           remaining 10 subunits are encoded within the nuclear
           genome. Subunits I, II and III of ubiquinol oxidase are
           homologous to the corresponding subunits in CcO.  This
           group additionally contains proteins which are fusions
           between subunits I and III, such as Sulfolobus
           acidocaldarius SoxM, a subunit of the SoxM terminal
           oxidase complex. It also includes NorE which has been
           speculated to be a subunit of nitric oxide reductase.
           Some archaebacterial cytochrome oxidases lack subunit
           III.   Although not required for catalytic activity,
           subunit III is believed to play a role in assembly of
           the multimer complex. Rhodobacter CcO subunit III
           stabilizes the integrity of the binuclear center in
           subunit I.  It has been proposed that archaea acquired
           heme-copper oxidases through gene transfer from
           gram-positive bacteria.
          Length = 183

 Score = 32.6 bits (75), Expect = 0.28
 Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 10/123 (8%)

Query: 243 LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCV---VLIHYTGK 299
           + F   F      +      F +   PLD         +  L G+ +      L    G 
Sbjct: 22  MLFGSFFWAYFHSRLSPPVEFGAGLDPLDLPLLNTNTLL--LSGSSVTWAHASLAARRGN 79

Query: 300 RPLA---LISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAF-LTHICIRLLPW 355
           R  A   L+ T      F+ +  Y   HL +     +  + F +L  F   H+ I L+ +
Sbjct: 80  RKKARLWLLLTILLGLAFLGLQAYEYSHLIFTISDSVFGSTFFLLTGFHGLHVIIGLI-F 138

Query: 356 MLI 358
           +L+
Sbjct: 139 LLV 141


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 32.8 bits (75), Expect = 0.29
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K+E+     +  ++ K   K   +K+KKK+ K+KKK
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212



 Score = 30.5 bits (69), Expect = 1.7
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           K+ +     +  ++AK   K   KK+K+K+ K+KKK
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|218754 pfam05795, Plasmodium_Vir, Plasmodium vivax Vir protein.  This
           family consists of several Vir proteins specific to
           Plasmodium vivax. The vir genes are present at about
           600-1,000 copies per haploid genome and encode proteins
           that are immunovariant in natural infections, indicating
           that they may have a functional role in establishing
           chronic infection through antigenic variation.
          Length = 348

 Score = 33.3 bits (76), Expect = 0.30
 Identities = 11/37 (29%), Positives = 16/37 (43%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           +KK  +KKK  NNNN            + N + N+  
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSR 340



 Score = 32.9 bits (75), Expect = 0.37
 Identities = 11/37 (29%), Positives = 17/37 (45%)

Query: 51  KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
           +KK  +KKK  NNNN            + N + N++ 
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSR 340



 Score = 31.8 bits (72), Expect = 1.0
 Identities = 16/82 (19%), Positives = 22/82 (26%), Gaps = 20/82 (24%)

Query: 28  KKKKEKKAKKKEKKK----------EKKKKEKKKKKKKKKK----------KKKNNNNNN 67
           K     K K   K +          E KK  K K K                K N  N+ 
Sbjct: 48  KYISNIKNKNDRKDRCSYLNYWLYDEIKKILKSKSKNINIIPFFNKLFDIWNKINKKNSK 107

Query: 68  NNNNNNNNNNNNNNNNNNNEEK 89
             N    + N  +  +   + K
Sbjct: 108 KKNKCKPDFNYIDILDEWKKRK 129



 Score = 31.4 bits (71), Expect = 1.2
 Identities = 10/36 (27%), Positives = 16/36 (44%)

Query: 50  KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
           +KK  +KKK   NNN            + N + N++
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339



 Score = 29.9 bits (67), Expect = 3.5
 Identities = 9/36 (25%), Positives = 15/36 (41%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           +KK  +KKK    NN            + N + N++
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339



 Score = 29.9 bits (67), Expect = 4.0
 Identities = 9/36 (25%), Positives = 13/36 (36%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           +KK  +KK   NNNN            + N +    
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339



 Score = 29.5 bits (66), Expect = 4.6
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           KK  +KKK     N            + N + N++  N
Sbjct: 305 KKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSRYN 342



 Score = 29.5 bits (66), Expect = 5.1
 Identities = 8/38 (21%), Positives = 13/38 (34%)

Query: 55  KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
           +KK  +K    NNNN            + N +     +
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSRY 341



 Score = 29.1 bits (65), Expect = 5.9
 Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 7/73 (9%)

Query: 19  DWKKAK-------KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           +WKK K         +K KK+ K+K K   K  K  +  K   KK K+K  ++N+NN   
Sbjct: 124 EWKKRKDLYDYFENYDKIKKKIKSKNKNCNKYCKYLKYIKSLYKKYKEKCCSSNDNNYCP 183

Query: 72  NNNNNNNNNNNNN 84
                +   N  N
Sbjct: 184 EYFKCDKKYNPKN 196


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 33.7 bits (77), Expect = 0.31
 Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 4/76 (5%)

Query: 20  WKKAKKKEKKKKEKKAK--KKEKKKEKK--KKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
            K+ +KK  K K+ K K  K    K KK       +  K KK  K   NN  N  +    
Sbjct: 459 SKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGEC 518

Query: 76  NNNNNNNNNNNEEKGT 91
           ++  NN   N+++  T
Sbjct: 519 SSPPNNKEKNDKQTST 534



 Score = 32.2 bits (73), Expect = 0.76
 Identities = 17/53 (32%), Positives = 21/53 (39%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
              L  K  K K+ K K  K    + KK+      +  K KK  K K NN  N
Sbjct: 459 SKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKAN 511


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 33.4 bits (76), Expect = 0.32
 Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 17  PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN-----N 71
           P D +     E KK E K   + +    K  + ++ +  K K+    N +N+ +     +
Sbjct: 413 PEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNRHNSEDRYTRPH 472

Query: 72  NNNNNNNNNNNNNNNEEKGTF 92
             NN N N NN N++  K   
Sbjct: 473 GRNNENRNYNNKNSDIPKHPE 493



 Score = 31.9 bits (72), Expect = 1.1
 Identities = 10/54 (18%), Positives = 24/54 (44%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           ++ +  K K+     +   + +  + +  NN N N NN N++   +   +  E+
Sbjct: 446 ERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDIPKHPERSEHEQ 499



 Score = 30.7 bits (69), Expect = 2.0
 Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 11/76 (14%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
           P PLD            E+    K K+     +   + +  +   + N N N NN N++ 
Sbjct: 440 PKPLD-----------NERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDI 488

Query: 75  NNNNNNNNNNNNEEKG 90
             +   + +   E+K 
Sbjct: 489 PKHPERSEHEQPEDKK 504


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
          proteins is associated with U3 snoRNA. U3 snoRNA is
          required for nucleolar processing of pre-18S ribosomal
          RNA.
          Length = 87

 Score = 30.8 bits (70), Expect = 0.33
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +  ++  KK+KKK K +  KK +K+ K  KK
Sbjct: 21 PESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 27.8 bits (62), Expect = 4.0
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 38 KEKKKEKKKKEKKKKKKKKKKKKK 61
          K++KK+ K +  KK +K+ K  KK
Sbjct: 29 KKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 27.4 bits (61), Expect = 5.3
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
          P P  +++  KK+KKK + +  KK +K+ K  K
Sbjct: 19 PPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51



 Score = 26.6 bits (59), Expect = 9.8
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
          T P     ++ KK++KK K +  KK EK+ +  KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
          protein sorting and transport from the endosome to the
          vacuole/lysosome in eukaryotic cells.
          Vacuoles/lysosomes play an important role in the
          degradation of both lipids and cellular proteins. In
          order to perform this degradative function,
          vacuoles/lysosomes contain numerous hydrolases which
          have been transported in the form of inactive
          precursors via the biosynthetic pathway and are
          proteolytically activated upon delivery to the
          vacuole/lysosome. The delivery of transmembrane
          proteins, such as activated cell surface receptors to
          the lumen of the vacuole/lysosome, either for
          degradation/downregulation, or in the case of
          hydrolases, for proper localisation, requires the
          formation of multivesicular bodies (MVBs). These late
          endosomal structures are formed by invaginating and
          budding of the limiting membrane into the lumen of the
          compartment. During this process, a subset of the
          endosomal membrane proteins is sorted into the forming
          vesicles. Mature MVBs fuse with the vacuole/lysosome,
          thereby releasing cargo containing vesicles into its
          hydrolytic lumen for degradation. Endosomal proteins
          that are not sorted into the intralumenal MVB vesicles
          are either recycled back to the plasma membrane or
          Golgi complex, or remain in the limiting membrane of
          the MVB and are thereby transported to the limiting
          membrane of the vacuole/lysosome as a consequence of
          fusion. Therefore, the MVB sorting pathway plays a
          critical role in the decision between recycling and
          degradation of membrane proteins. A few archaeal
          sequences are also present within this family.
          Length = 169

 Score = 32.2 bits (74), Expect = 0.34
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +K  ++ +KK +  EKK +K + E KK  KK  K  
Sbjct: 7  RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDA 42



 Score = 28.4 bits (64), Expect = 6.1
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 29 KKKEKKAKKKEKKKEKK-KKEKKKKKKKKKKKKK 61
          +K  ++  KK++  EKK KK + + KK  KK  K
Sbjct: 7  RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNK 40


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 32.4 bits (74), Expect = 0.34
 Identities = 15/65 (23%), Positives = 29/65 (44%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+ +K +K+ K K+AK ++  K +   ++ K + + KK+K  N  N        + N   
Sbjct: 58  KELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQEYEKDLNRRE 117

Query: 81  NNNNN 85
                
Sbjct: 118 AEEEQ 122



 Score = 28.1 bits (63), Expect = 8.1
 Identities = 8/45 (17%), Positives = 23/45 (51%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            KK     + + +K+ K+ +K +++ K ++ K +   K +  ++
Sbjct: 41 QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSD 85


>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease.  This
           model represents one subfamily of endonucleases
           containing the endo/excinuclease amino terminal domain,
           pfam01541 at its amino end. A distinct subfamily
           includes excinuclease abc subunit c (uvrC). Members of
           pfam01541 are often termed GIY-YIG endonucleases after
           conserved motifs near the amino end. This subfamily in
           This model is found in open reading frames of group I
           introns in both phage and mitochondria. The closely
           related endonucleases of phage T4: segA, segB, segC,
           segD and segE, score below the trusted cutoff for the
           family.
          Length = 214

 Score = 32.7 bits (75), Expect = 0.34
 Identities = 16/49 (32%), Positives = 21/49 (42%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           E K K  K K  E      KK  ++ KKK  +K+K     N  + NN  
Sbjct: 127 ETKAKISKNKLGENNPFFGKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 1/49 (2%)

Query: 22  KAKKKE-KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           + K K  K K  +      KK  ++ K+K  +K+K  KK    + NN  
Sbjct: 127 ETKAKISKNKLGENNPFFGKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175


>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain.  PfEMP1 (Plasmodium falciparum
           erythrocyte membrane protein) has been identified as the
           rosetting ligand of the malaria parasite P. falciparum.
           Rosetting is the adhesion of infected erythrocytes with
           uninfected erythrocytes in the vasculature of the
           infected organ, and is associated with severe malaria.
           PfEMP1 interacts with Complement Receptor One on
           uninfected erythrocytes to form rosettes.
          Length = 150

 Score = 31.9 bits (73), Expect = 0.35
 Identities = 24/117 (20%), Positives = 37/117 (31%), Gaps = 39/117 (33%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKK---------EKKKKEKKKKKKKKKKKKKNNNNNNNN 69
            W+K  K      + K  K   KK         EKKKKE +K K+   K+ K ++  +  
Sbjct: 15  KWRKKLKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDEY 74

Query: 70  NNNNN------------------------------NNNNNNNNNNNNEEKGTFLYKM 96
           N   +                              N  NN+ N+  N E    +  +
Sbjct: 75  NVLESFLEDLLFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSAKNGENKDAIDCL 131



 Score = 29.6 bits (67), Expect = 2.1
 Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 3/81 (3%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK--KKKKKKKKNNNNNNNNNNNNNNN 76
          D  K +KK K       K K  K   KKK +  +K  +KKKK+ +    + N     ++ 
Sbjct: 12 DSIKWRKKLKHCI-NNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDG 70

Query: 77 NNNNNNNNNNEEKGTFLYKMA 97
           +  N   +  E   F   + 
Sbjct: 71 RDEYNVLESFLEDLLFQIDIK 91



 Score = 28.1 bits (63), Expect = 8.7
 Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 7/52 (13%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKK-------KKEKKKKKKKKKKKKKNN 63
           + KK K+  K      +    K  E K          +KK KK KKK  +  
Sbjct: 99  ELKKIKELLKCNGCNNSANSAKNGENKDAIDCLLDHLQKKAKKCKKKHSEEE 150


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.36
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKKK  +  +KE K  +KK   K+K K   KKK
Sbjct: 99  IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 29.6 bits (67), Expect = 1.8
 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 23  AKKKEKKKKEKKAK--KKEKKKEKKKKEKKKKKKKKKKKKK 61
            KK E K +  +    KK+K   K +KE K ++KK + K+K
Sbjct: 84  LKKFEPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEK 124



 Score = 28.1 bits (63), Expect = 5.7
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
            KKK  +K  K+ K + KK   K+K K   KKK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 28.1 bits (63), Expect = 6.5
 Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 21  KKAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKK 52
           KK K   K +KE K + KK + KEK K   KKK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 27.7 bits (62), Expect = 9.6
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
            KKK+  +K +K  K  +KK + K++ K   KKK
Sbjct: 99  IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132


>gnl|CDD|221826 pfam12878, SICA_beta, SICA extracellular beta domain.  The SICA
          (schizont-infected cell agglutination) proteins of P.
          knowlesi, one of the variant antigen gene families, are
          associated with parasitic virulence. These proteins are
          comprised of multiple domains, with the extracellular
          domains occurring at different frequencies. There can
          be between 1 and 10 copies of this cysteine-rich
          domain.
          Length = 169

 Score = 32.1 bits (74), Expect = 0.38
 Identities = 14/54 (25%), Positives = 21/54 (38%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
          A KK  K K  +    +   ++ K E KK  K+     K N      N+  + N
Sbjct: 9  AAKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGN 62



 Score = 31.4 bits (72), Expect = 0.64
 Identities = 14/59 (23%), Positives = 21/59 (35%), Gaps = 4/59 (6%)

Query: 35 AKKKEKKKEKKKKEKKK--KKKKKKKKK--KNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          AKK  K K  +         + K + KK  K  +     N      N+  + N  + EK
Sbjct: 10 AKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGNATSAEK 68



 Score = 31.4 bits (72), Expect = 0.77
 Identities = 14/41 (34%), Positives = 20/41 (48%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           KK K+K K K    +  KK   K  +  NN ++ N   NN+
Sbjct: 112 KKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152



 Score = 30.6 bits (70), Expect = 1.1
 Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 7/45 (15%)

Query: 42  KEKKKKEKKKKKKK-------KKKKKKNNNNNNNNNNNNNNNNNN 79
               KK K+K K K       KK   K N+  NN ++ N   NN+
Sbjct: 108 NAYAKKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152



 Score = 30.6 bits (70), Expect = 1.1
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 35  AKK-KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           AKK KEK K K   ++  KK   K     NN ++ N   NN+
Sbjct: 111 AKKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152



 Score = 28.3 bits (64), Expect = 7.1
 Identities = 11/53 (20%), Positives = 17/53 (32%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK  K    +    +    E K + KK  K+       N      N+  + N
Sbjct: 10 AKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGN 62


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 33.3 bits (77), Expect = 0.39
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K      K  +  +  K+  K  ++K+KKKKK+K+K++ 
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636



 Score = 33.3 bits (77), Expect = 0.41
 Identities = 11/40 (27%), Positives = 27/40 (67%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
             ++E ++K ++A+   K+ EK K+E ++KK+K ++++  
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDK 566



 Score = 32.5 bits (75), Expect = 0.68
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K      K  E    +K   K  +KKEKKKKK+K+K+++
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636



 Score = 31.3 bits (72), Expect = 1.4
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K      K  +  +A+K+  K  +KK++KKKK+K+K+++ K
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 31.0 bits (71), Expect = 1.9
 Identities = 11/41 (26%), Positives = 28/41 (68%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +KA++ E   KE +  K+E +++K+K ++++ K  ++ +K+
Sbjct: 534 QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE 574



 Score = 30.2 bits (69), Expect = 3.5
 Identities = 12/44 (27%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 19  DWKKAKKKEK---KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
             ++A+   K   K KE+  +KKEK +E++ K  ++ +K+ ++ 
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQA 578



 Score = 29.8 bits (68), Expect = 4.9
 Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
            E++ ++K ++ E   KE +K K++ ++KK+    
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562



 Score = 29.0 bits (66), Expect = 8.2
 Identities = 12/41 (29%), Positives = 29/41 (70%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+A+K +++ +EKK K +E++ +  ++ +K+ ++  K+ KK
Sbjct: 544 KEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK 584


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 30.3 bits (69), Expect = 0.41
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
          A +KE+KKKE K  K E++  ++ K  KK
Sbjct: 9  AYRKEQKKKELKKNKAERQARREAKLAKK 37



 Score = 29.5 bits (67), Expect = 0.75
 Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)

Query: 35 AKKKEKKKE-KKKKEKKKKKKKKKKKKKN 62
           +K++KKKE KK K +++ +++ K  KKN
Sbjct: 10 YRKEQKKKELKKNKAERQARREAKLAKKN 38



 Score = 29.5 bits (67), Expect = 0.90
 Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K+ KKKE K  K K E++ +++ K  KK
Sbjct: 12 KEQKKKELK--KNKAERQARREAKLAKK 37



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 13/53 (24%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 21 KKAKKKE--KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
          K+ KKKE  K K E++A+++ K  +K     +++  K K ++++         
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQ 64



 Score = 26.5 bits (59), Expect = 8.5
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           +K+++KK  KK K + + ++E K  KK
Sbjct: 10 YRKEQKKKELKKNKAERQARREAKLAKK 37


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 32.0 bits (73), Expect = 0.42
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          +KK+K +    K  +K+ K+K +KKK N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 30.1 bits (68), Expect = 2.1
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
          +K+KK      +  +K+ K+K +KK  N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 29.7 bits (67), Expect = 2.5
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          KKKK + +  K  +K  K+K++KKK   
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINI 30



 Score = 29.7 bits (67), Expect = 2.6
 Identities = 7/32 (21%), Positives = 13/32 (40%)

Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
          +KK+K +    K  +K+    +     N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 7/33 (21%), Positives = 13/33 (39%)

Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           +KKK+ +    K  +K+    +     N   N
Sbjct: 1  VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           +KK+K +    K  +K+ K+K      N   N
Sbjct: 1  VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.9 bits (65), Expect = 5.2
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
          +K+KK +    K  +K+ K+K+     N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.6 bits (64), Expect = 5.5
 Identities = 7/32 (21%), Positives = 12/32 (37%)

Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          +K+KK +    K  +K     +     N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.6 bits (64), Expect = 6.9
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          KK K +    +  +K  K+K +KKK     N
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.2 bits (63), Expect = 7.5
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          ++KK  +    K  +K+ K+K +KKK     N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.2 bits (63), Expect = 7.9
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +K KK      K  +++ K+K +KKK     N
Sbjct: 1  VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.2 bits (63), Expect = 8.8
 Identities = 8/32 (25%), Positives = 14/32 (43%)

Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
          ++KKK      K  +K+ K+ +     N   N
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 33.1 bits (75), Expect = 0.43
 Identities = 11/69 (15%), Positives = 35/69 (50%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
            +K  +++K   A  +E+   +++++   KKK+  +K K    +  N++   ++ + + +
Sbjct: 408 LRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDED 467

Query: 83  NNNNEEKGT 91
            +  ++ G 
Sbjct: 468 RDLADDGGL 476


>gnl|CDD|216716 pfam01810, LysE, LysE type translocator.  This family consists of
           various hypothetical proteins and an l-lysine exporter
           LysE from Corynebacterium glutamicum which is proposed
           to be the first of a novel family of translocators. LysE
           exports l-lysine from the cell into the surrounding
           medium and is predicted to span the membrane six times.
           The physiological function of the exporter is to excrete
           excess l-Lysine as a result of natural flux imbalances
           or peptide hydrolysis; and also after artificial
           deregulation of l-Lysine biosynthesis as used by the
           biotechnology. industry for the production of l-lysine.
          Length = 191

 Score = 32.3 bits (74), Expect = 0.44
 Identities = 16/120 (13%), Positives = 28/120 (23%), Gaps = 9/120 (7%)

Query: 201 SQITKAIEESELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAV 260
             +  A           +   + ++   +  +   P  I+  L     F           
Sbjct: 76  LTLRAAWSPKNESLSSSEASLKKSFLRGLLVSLSNPKVILFWLSVGSAFLDKQQYGDAGR 135

Query: 261 GIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVY 320
             F +        + +LL        A L   L           I    + A  I   VY
Sbjct: 136 IAFAAGLVVASLIWFSLL--------AFLASRLARRL-STKKVRIINVIAGALLIGFGVY 186


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
          includes the E. faecalis PcfJ protein, which is
          functionally uncharacterized. It is found in bacteria
          and viruses, and is typically between 159 and 170 amino
          acids in length. There is a conserved HCV sequence
          motif.
          Length = 169

 Score = 31.9 bits (73), Expect = 0.44
 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 5/44 (11%)

Query: 17 PLDWKKA-----KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          P + K A      K + KK+++KA+K  K+  + +K  ++ K K
Sbjct: 24 PKNLKAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
          synthetase class II domain (pfam00587) and represents
          the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 31.0 bits (71), Expect = 0.45
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +A++ E  K+  KAKKK++  E    E K+ K + K  + 
Sbjct: 49 QAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEA 88



 Score = 28.3 bits (64), Expect = 3.8
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           KAKKK++  +   A+ KE K E K  E + ++ + +
Sbjct: 60 GKAKKKKEDAEALIAEVKELKDELKALEAELRELEAE 96



 Score = 27.9 bits (63), Expect = 5.4
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 20 WKKAKKK-EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           ++ + + E+ + E+    KE  K KKKKE  +    + K+ K+
Sbjct: 38 RRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKD 81



 Score = 27.2 bits (61), Expect = 8.5
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K+  K +KKK++ +A   E K+ K + +  + + ++ + +
Sbjct: 57 KEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEAE 96


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 32.1 bits (73), Expect = 0.46
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
          KE+   EK   ++E   E  KKE K+  K++ ++K+N   +
Sbjct: 3  KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 9/38 (23%), Positives = 23/38 (60%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K   +++   +A KKE K+  K+++++K+   +  +K+
Sbjct: 10 KDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKE 47



 Score = 29.8 bits (67), Expect = 2.9
 Identities = 13/50 (26%), Positives = 21/50 (42%)

Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
                  ++    E  KKE K   KE+ +EK+   +  +K+    K KN
Sbjct: 6  FPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKN 55



 Score = 29.8 bits (67), Expect = 3.0
 Identities = 10/41 (24%), Positives = 21/41 (51%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
          KE+   +K+  +E+   E  KK+ K+  K+++    N   +
Sbjct: 3  KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43



 Score = 28.3 bits (63), Expect = 7.8
 Identities = 9/41 (21%), Positives = 20/41 (48%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          K+E  ++K   ++E   +  KK+ K+  K++     N   +
Sbjct: 3  KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family consists
           of several bacterial histone H1-like Hc1 proteins. In
           Chlamydia, Hc1 is expressed in the late stages of the
           life cycle, concomitant with the reorganisation of
           chlamydial reticulate bodies into elementary bodies.
           This suggests that Hc1 protein plays a role in the
           condensation of chromatin during intracellular
           differentiation.
          Length = 123

 Score = 31.3 bits (70), Expect = 0.47
 Identities = 18/40 (45%), Positives = 18/40 (45%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KA KK  K   KKA     KK K  K K   KK   KK K
Sbjct: 74  KAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVK 113



 Score = 29.7 bits (66), Expect = 1.4
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KKA K  K  K+      +K      K+ K  K K   KK
Sbjct: 67  KKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKK 106



 Score = 29.7 bits (66), Expect = 1.5
 Identities = 15/40 (37%), Positives = 18/40 (45%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K  +KE  K EK    K+K  +K  K  K  KK  K   K
Sbjct: 46 KLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAK 85



 Score = 29.3 bits (65), Expect = 1.9
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KA+K    KK+   K  +  K  KK  K   KK      K
Sbjct: 54 KAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAK 93



 Score = 28.2 bits (62), Expect = 5.6
 Identities = 17/41 (41%), Positives = 19/41 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K AK   KK     AKK +  K K   +K   KK KK  KK
Sbjct: 78  KAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKMSKK 118



 Score = 27.4 bits (60), Expect = 8.9
 Identities = 13/41 (31%), Positives = 17/41 (41%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K++ K EK    KK   K+  K  K  +K  K   KK    
Sbjct: 50 KESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAA 90


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
          family of proteins has no known function.
          Length = 44

 Score = 29.2 bits (66), Expect = 0.47
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEK 49
          KKK  +K +KKAKK   K     K K
Sbjct: 4  KKKINQKLKKKAKKANAKLHPSNKPK 29



 Score = 26.5 bits (59), Expect = 4.9
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          ++KK  +K KKK KK   K     K K
Sbjct: 3  RKKKINQKLKKKAKKANAKLHPSNKPK 29


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 33.0 bits (75), Expect = 0.48
 Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 40  KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           K+  KKKK KK K +K+ K+K     +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
            K+  KKK+ KK + +K+ K+K KK K+    N  N   +   +
Sbjct: 68  GKRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 11/43 (25%), Positives = 19/43 (44%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
           ++  K+KK KK K +K+ K  N    +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
           ++  +KKK +K K +K+ K+K K   +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 31.1 bits (70), Expect = 1.6
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 41  KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           K+  KKK+ KK K +K+ K+KN    +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 31.1 bits (70), Expect = 1.9
 Identities = 12/43 (27%), Positives = 19/43 (44%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           K   KK+K KK K +K+ K+ N    +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 30.7 bits (69), Expect = 2.3
 Identities = 11/43 (25%), Positives = 18/43 (41%)

Query: 44  KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           K+  +KKK KK K +K+    N    +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 30.7 bits (69), Expect = 2.5
 Identities = 11/47 (23%), Positives = 15/47 (31%)

Query: 50  KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
           K+  KKKK KK          N    +    N  N   ++   L   
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPSLLSY 115



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K   K+KK K+ K +K+ K+K KK  +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 29.9 bits (67), Expect = 4.0
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
           K+  KK+K KK K +K+ K+K KK  +    N  N   +   +
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111



 Score = 29.6 bits (66), Expect = 5.6
 Identities = 13/48 (27%), Positives = 19/48 (39%)

Query: 46  KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
           K+  KKKK KK K +K     N    +    N  N   +      ++L
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPSLLSYL 116


>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681).  This
          family of proteins is found in bacteria. Proteins in
          this family are typically between 81 and 117 amino
          acids in length.
          Length = 87

 Score = 30.4 bits (69), Expect = 0.49
 Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK--KKKKNNNNNNNNNN 71
          KK +++ +K + + E+   EK   + + K  K ++ N  N    +
Sbjct: 26 KKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70



 Score = 29.3 bits (66), Expect = 1.3
 Identities = 11/49 (22%), Positives = 26/49 (53%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
          + K KK +++ ++ +A+ ++   EK   E + K  K ++K + N    +
Sbjct: 22 YWKLKKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 10/47 (21%), Positives = 22/47 (46%)

Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
          K KKA+++ +K + + ++   +K   + + KN      N  N    +
Sbjct: 24 KLKKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70


>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031).  This
           protein is expressed in Plasmodium; its function is
           unknown. It may be the product of gene family pyst-b.
          Length = 228

 Score = 32.3 bits (74), Expect = 0.49
 Identities = 23/96 (23%), Positives = 35/96 (36%), Gaps = 16/96 (16%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEK-KKEKKKKE------KKKKKKKKKKKKKNN----- 63
           T  D     KK KK   +  K+ E+ KKE   K       +  + K+  KK +N+     
Sbjct: 99  TLPDLNNVDKKTKKLINELRKELEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEH 158

Query: 64  -NNN---NNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
            +     N  N      N   ++NN  + K     K
Sbjct: 159 EDFKQLENEENFLETEYNEITSSNNYKKLKINRKLK 194



 Score = 29.6 bits (67), Expect = 3.2
 Identities = 18/84 (21%), Positives = 31/84 (36%), Gaps = 14/84 (16%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK---------KKKKNNNN-----NNNN 69
           K+ E+ KKE   K+  +   +  ++K+  KK +          K+ +N  N      N  
Sbjct: 119 KELEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEHEDFKQLENEENFLETEYNEI 178

Query: 70  NNNNNNNNNNNNNNNNNEEKGTFL 93
            ++NN      N      EK    
Sbjct: 179 TSSNNYKKLKINRKLKKAEKKLIK 202


>gnl|CDD|218514 pfam05231, MASE1, MASE1.  Predicted integral membrane sensory
           domain found in histidine kinases, diguanylate cyclases
           and other bacterial signaling proteins. This entry also
           includes members of the 8 transmembrane UhpB type
           (8TMR-UT) domain family.
          Length = 298

 Score = 32.4 bits (74), Expect = 0.49
 Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 11/124 (8%)

Query: 303 ALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVF 362
            L+      A    V       L     + +     L LAA L     R  P +L+G+V 
Sbjct: 1   LLLLLLLLYALAAAVLWSLALALVSSGIAAIWLPTGLRLAALLLF-GRRYWPGILLGDVL 59

Query: 363 ----------PNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
                        + A       +   + A A+ +   P      L   L F  A +++ 
Sbjct: 60  ASLFLGLLSGLGLLLALLILLVNALEALLAVALLRRLLPGRHRLQLLFWLRFLLAAAIIA 119

Query: 413 TLYM 416
            L +
Sbjct: 120 ALLL 123


>gnl|CDD|222446 pfam13903, Claudin_2, PMP-22/EMP/MP20/Claudin tight junction.
           Members of this family are claudins, that form tight
           junctions between cells.
          Length = 170

 Score = 31.9 bits (73), Expect = 0.50
 Identities = 25/121 (20%), Positives = 34/121 (28%), Gaps = 28/121 (23%)

Query: 253 TTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGG--- 309
                  V  F     P       L L +  +   L CV+ +    K+  AL+   G   
Sbjct: 47  CFTSEVDVNNFVLTSLPDLTIVVLLFLTLGLVLSLLACVLGLCSALKKSGALLLLTGVLL 106

Query: 310 ---SAACFIVVAVYAQF--------------------HLSYGWDSPLVP--TVFLVLAAF 344
                   I V +Y Q                       S GW   L    TV ++LA  
Sbjct: 107 LLAGILGLIAVVLYVQKVQLEIKNVLLTWMSQWQSNGLASLGWSFYLAVVGTVLVLLAGL 166

Query: 345 L 345
           L
Sbjct: 167 L 167


>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14
          and HMG 17. 
          Length = 88

 Score = 30.3 bits (68), Expect = 0.50
 Identities = 15/44 (34%), Positives = 19/44 (43%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
          K E K KK   KK+ +    K KK  K K+ +      NN   N
Sbjct: 31 KPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAEN 74



 Score = 29.5 bits (66), Expect = 1.00
 Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 5/53 (9%)

Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          P P      K + K KK    KK E    K KK  K K+ ++  K++NN   N
Sbjct: 27 PAP-----PKPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAEN 74


>gnl|CDD|221630 pfam12542, CWC25, Pre-mRNA splicing factor.  This domain family
          is found in eukaryotes, and is approximately 100 amino
          acids in length. The family is found in association
          with pfam10197. There is a single completely conserved
          residue Y that may be functionally important. Cwc25 has
          been identified to associate with pre-mRNA splicing
          factor Cef1/Ntc85, a component of the Prp19-associated
          complex (NTC) involved in spliceosome activation. Cwc25
          is neither tightly associated with NTC nor required for
          spliceosome activation, but is required for the first
          catalytic reaction.
          Length = 95

 Score = 30.5 bits (69), Expect = 0.50
 Identities = 15/74 (20%), Positives = 31/74 (41%), Gaps = 14/74 (18%)

Query: 28 KKKKEK---------KAKKKEKKKE-----KKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
          KKK E+               +++E     KK+ +K  ++K  ++ K+ ++        +
Sbjct: 1  KKKLERLEWMYEGPPAESGFAEEREEYLLGKKRVDKLLEEKDAEEAKQKSSGEALIEGAS 60

Query: 74 NNNNNNNNNNNNNE 87
          N N+ N+  N   E
Sbjct: 61 NANSANDTWNKLRE 74


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 32.3 bits (74), Expect = 0.51
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 9/49 (18%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK---------KKKKKN 62
           AKKK++ K+E+ A  KE +++++K   +K KK K         K +KKN
Sbjct: 233 AKKKKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKN 281



 Score = 30.8 bits (70), Expect = 1.8
 Identities = 9/43 (20%), Positives = 26/43 (60%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           D +  ++K++  K+ K    E+  E+ ++E  ++++ + +K+K
Sbjct: 131 DSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 22  KAKKKE----KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           KA+KK       KK  + K+K   ++K+K  +    ++KK  
Sbjct: 276 KARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317



 Score = 29.6 bits (67), Expect = 4.4
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 6/48 (12%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK------KKKKKKKKN 62
           K+ ++  +K   +K KK ++ K    KEK +KK       KK+ + K 
Sbjct: 248 KEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQ 295



 Score = 29.2 bits (66), Expect = 5.7
 Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 17/59 (28%)

Query: 20  WKKAKK---------KEKKKKE--------KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +K KK         KEK +K+        K+ + K+K+  + K++  +    ++KK  
Sbjct: 259 SRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 33.0 bits (75), Expect = 0.52
 Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 6/84 (7%)

Query: 140 FLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASL-----CWLRGWVTPD 194
           F  WR   I+     + A+ AL F+PESP + +  G+  EA   L       +R    P+
Sbjct: 330 FHSWRVFVIVCAFPCVFAIGALTFMPESPRFFLENGKHDEAWMILKLIHDTNMRAKGHPE 389

Query: 195 KVQTELSQITKAIEESELKRLGKD 218
           KV + ++ I    +E EL  +  D
Sbjct: 390 KVFS-VNHIKTIHQEDELIEIESD 412


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
          protein 2.  THO and TREX form a eukaryotic complex
          which functions in messenger ribonucleoprotein
          metabolism and plays a role in preventing the
          transcription-associated genetic instability. Tho2,
          along with four other subunits forms THO.
          Length = 296

 Score = 32.3 bits (74), Expect = 0.52
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           KE ++ EK+ K+ +       K+KK+KK+ K   KK       +  +N   
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKT 81



 Score = 28.4 bits (64), Expect = 8.9
 Identities = 14/49 (28%), Positives = 26/49 (53%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           KK K+K++ K   K  ++E KK  +  EK KK+  ++K     + + + 
Sbjct: 52  KKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSSWFPSKDPDE 100


>gnl|CDD|221947 pfam13160, DUF3995, Protein of unknown function (DUF3995).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 138 and
           149 amino acids in length. There are two completely
           conserved residues (W and P) that may be functionally
           important.
          Length = 125

 Score = 31.1 bits (71), Expect = 0.53
 Identities = 13/48 (27%), Positives = 17/48 (35%)

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYA 321
           FATLL+    L  A L +      G  P   +   GS     V  +  
Sbjct: 39  FATLLVAALLLLAAALVLAQAGGVGPFPPPGLVRAGSWVLAAVFLLRG 86


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 14/38 (36%), Positives = 29/38 (76%), Gaps = 3/38 (7%)

Query: 20  WKKAKKKE---KKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           WKK +K+E   +K+ EK+A ++ KK+E+ ++ K++++K
Sbjct: 78  WKKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115


>gnl|CDD|234397 TIGR03920, T7SS_EccD, type VII secretion integral membrane protein
           EccD.  Members of this family are EccD, a component of
           actinobacterial type VII secretion systems (T7SS) with
           ten to eleven predicted transmembrane helix regions
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 431

 Score = 32.7 bits (75), Expect = 0.56
 Identities = 23/116 (19%), Positives = 41/116 (35%), Gaps = 19/116 (16%)

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPL 333
            A  ++G+  L  A++   L      R +A +S  G+A   + VA       + G    L
Sbjct: 146 IAAGVIGLLVLAAAVVARTL----RDRAVAALSLLGAALVLVAVAGALAVPGALGVPHVL 201

Query: 334 VP-TVFLVLAAFL--------------THICIRLLPWMLIGEVFPNNIRATASGAS 374
           +  TV   LA  L                + + L     +G ++   +   A+G  
Sbjct: 202 LAGTVLAALAVLLLRLTGRGLAVFTAVVTVAVILALAAAVGMLWQTPVPGIAAGLI 257


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 32.4 bits (74), Expect = 0.57
 Identities = 12/48 (25%), Positives = 19/48 (39%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +E  P    K K    K+   K +K  +K  +     K+     +KKK
Sbjct: 241 SEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288



 Score = 32.0 bits (73), Expect = 0.81
 Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 3/43 (6%)

Query: 22  KAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K K K K K    ++   K EK  EK  +     K+     +K
Sbjct: 244 KPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRK 286



 Score = 29.3 bits (66), Expect = 4.4
 Identities = 11/40 (27%), Positives = 17/40 (42%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           AK K K K +    K+   K +K  EK  +     K+  +
Sbjct: 243 AKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTS 282



 Score = 28.5 bits (64), Expect = 9.9
 Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           +SS++ +     +P     K+   K +K  EK  +     K      +KKK
Sbjct: 238 NSSSEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288


>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 320

 Score = 32.4 bits (74), Expect = 0.58
 Identities = 20/82 (24%), Positives = 29/82 (35%), Gaps = 9/82 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN-- 78
           K   K EK  + K AKKK+  K       +      K   K  ++N    +N   N +  
Sbjct: 126 KILDKIEKFDEIKGAKKKDFFK--SINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDK 183

Query: 79  -----NNNNNNNNEEKGTFLYK 95
                N   +N   EK +   K
Sbjct: 184 LSNIFNEKLSNLKNEKISINEK 205



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 2/74 (2%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN--NN 81
           K+K+K        K +  KE   K+ KK K     +  +N   + +     N      NN
Sbjct: 46  KEKKKIYISISDNKADIDKENLNKKSKKIKISIIIELSSNEIKSFSIKFYLNYYLEIINN 105

Query: 82  NNNNNEEKGTFLYK 95
           +  +  E   F   
Sbjct: 106 DVEDEIELLLFSKN 119



 Score = 29.7 bits (67), Expect = 3.8
 Identities = 15/66 (22%), Positives = 26/66 (39%), Gaps = 1/66 (1%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK-KKKKNNNNNNNNNNNNNNNNNNNNNN 83
           K  +   +K   K EK  E K  +KK   K     +  N  +   +   ++N    +N  
Sbjct: 118 KNLEYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLE 177

Query: 84  NNNEEK 89
            N ++K
Sbjct: 178 KNIDDK 183


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
          family.  This model family represents the major capsid
          protein component of the heads (capsids) of
          bacteriophage HK97, phi-105, P27, and related phage.
          This model represents one of several analogous families
          lacking detectable sequence similarity. The gene
          encoding this component is typically located in an
          operon encoding the small and large terminase subunits,
          the portal protein and the prohead or maturation
          protease [Mobile and extrachromosomal element
          functions, Prophage functions].
          Length = 384

 Score = 32.3 bits (74), Expect = 0.59
 Identities = 10/87 (11%), Positives = 25/87 (28%), Gaps = 11/87 (12%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN-----------NNNNNNN 71
           + +E   +  K  +K  + EK  + +++K +    K++ +             +     
Sbjct: 2  KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 72 NNNNNNNNNNNNNNNEEKGTFLYKMAA 98
             +           E K       A 
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRAY 88


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 31.2 bits (71), Expect = 0.61
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          K   KK KKK KK  +  KK  K KK+K   + 
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33



 Score = 30.8 bits (70), Expect = 0.91
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          K   KKAKKK KK  K  K+  K KK+K + 
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
          K   KK +KKAKK  K  +K  K KK+K + 
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          +  AKK +KK +K  K  KK  K KK+K + + 
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 7/48 (14%)

Query: 21 KKAKKKEKKKKEKKAK--KKEKKKEKKKKE-----KKKKKKKKKKKKK 61
          K   KK KKK +K AK  KK  K +K+K        + K  +  KK K
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRPKTLRLPKKPK 48


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.0 bits (73), Expect = 0.62
 Identities = 10/27 (37%), Positives = 20/27 (74%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKK 53
           EK KKE+  +K+ +++EK  +E+K ++
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKARE 178



 Score = 30.1 bits (68), Expect = 2.5
 Identities = 8/26 (30%), Positives = 21/26 (80%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKN 62
           KKE+ +EK+++E++K  +++K +++ 
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREEE 180



 Score = 29.3 bits (66), Expect = 4.4
 Identities = 7/29 (24%), Positives = 21/29 (72%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            K K+++ E+K++E+++K  +++K ++  
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 32.3 bits (73), Expect = 0.63
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K A     K+ ++ A+ ++++ EK + E KK  ++  K K
Sbjct: 262 KPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAK 301



 Score = 31.1 bits (70), Expect = 1.6
 Identities = 11/41 (26%), Positives = 22/41 (53%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K++E K   K A     K++K+  E +K++ +K + +   N
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           ++AK   K       K+ ++  E +K+E +K + + KK 
Sbjct: 255 QEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293



 Score = 30.4 bits (68), Expect = 2.6
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K  E+  K K  K  + K+E K  EK+ + K+ + +KK
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329



 Score = 30.0 bits (67), Expect = 3.5
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +KA+ + KK  E+  K K+ K    K+E K  +K+ + K
Sbjct: 284 EKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDK 322



 Score = 30.0 bits (67), Expect = 4.1
 Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 21  KKAKKKEKKKKEKKAK--KKEKKKEKKKKEKKKKKKKKKKK 59
           KK  ++  K K+ KA   K+E K  +K+ E K+ + +KK++
Sbjct: 291 KKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331



 Score = 29.2 bits (65), Expect = 7.0
 Identities = 7/43 (16%), Positives = 23/43 (53%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            D ++ + ++K+++ K   K       K+ ++  + +K++ +K
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEK 285



 Score = 29.2 bits (65), Expect = 7.3
 Identities = 9/38 (23%), Positives = 22/38 (57%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ +  K ++KA   +   +K++ E ++K+++ K   K
Sbjct: 225 KQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262



 Score = 28.8 bits (64), Expect = 9.6
 Identities = 8/37 (21%), Positives = 20/37 (54%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           AK+ ++ K+E   K+ +  K ++K +  +    K++ 
Sbjct: 212 AKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 30.7 bits (68), Expect = 0.64
 Identities = 14/52 (26%), Positives = 30/52 (57%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           K + +++KK EK   + EKK EK + E +KK +K + + +    +++ ++  
Sbjct: 60  KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111



 Score = 30.0 bits (66), Expect = 1.2
 Identities = 13/52 (25%), Positives = 31/52 (59%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           K++ + EKK +K E + EKK ++ + + +KK +K +N +     +++ ++  
Sbjct: 60  KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111



 Score = 28.4 bits (62), Expect = 4.6
 Identities = 13/55 (23%), Positives = 29/55 (52%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
           E  KKE + +KK +K E + ++K +K + + +KK     N +     +++ ++  
Sbjct: 57  EHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111



 Score = 27.3 bits (59), Expect = 9.2
 Identities = 16/62 (25%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEK--KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           D  K   +  KK+ +  KK EK   +++KK EK + + +KK +K  N +     +++ ++
Sbjct: 50  DEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASD 109

Query: 77  NN 78
             
Sbjct: 110 KK 111


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 32.5 bits (74), Expect = 0.65
 Identities = 31/141 (21%), Positives = 57/141 (40%), Gaps = 20/141 (14%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK+  + K++K ++ K++ KK  K+KK+     K KK K       + + +   +   ++
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTD----RDVSTSTAASQKKSS 239

Query: 81  NNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSF 140
           +  +  E +   L+ +   L         ++   LR ML A    T+          GS 
Sbjct: 240 DLESKLEAQSKELWSLKDDL------KKHVSTAELREMLEANGQDTS----------GSE 283

Query: 141 LHWRSAAILNLLFPILALCAL 161
           L  R      ++F  L  C L
Sbjct: 284 LDLRDRCADGMMFGALGPCPL 304


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 32.5 bits (75), Expect = 0.65
 Identities = 16/51 (31%), Positives = 25/51 (49%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           E++ K+ KA+      E+K+K+K+KKK KKKK K                +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432



 Score = 29.5 bits (67), Expect = 6.0
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           ++E K+ + +      ++++KKKEKKK KKKK K  +             +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 32.3 bits (72), Expect = 0.66
 Identities = 16/42 (38%), Positives = 28/42 (66%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           WK  +++ +K++EKK    +K + +KKKEK  K +K +K +K
Sbjct: 122 WKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEK 163



 Score = 32.3 bits (72), Expect = 0.75
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K++EKK+ + K  + EKKKEK  K +K +K +K KK
Sbjct: 131 KEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKK 166



 Score = 32.0 bits (71), Expect = 0.88
 Identities = 14/41 (34%), Positives = 31/41 (75%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKA +KE ++ E +AKK+  ++E+ K+ K ++++ +K+++K
Sbjct: 95  KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEK 135



 Score = 31.2 bits (69), Expect = 1.7
 Identities = 16/41 (39%), Positives = 29/41 (70%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KK   +E++ K+ KA+++  +KE++KKE   KK + +KKK+
Sbjct: 110 KKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKE 150



 Score = 29.3 bits (64), Expect = 6.3
 Identities = 14/41 (34%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+A ++E+ K+ K  +++ +K+++KK+   KK + +KKK+K
Sbjct: 111 KRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEK 151


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 30.4 bits (69), Expect = 0.69
 Identities = 8/34 (23%), Positives = 28/34 (82%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +E+++  + A+ +++++E+ +KE++++K+ +K+K
Sbjct: 73  EEERRSARLARLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
          only].
          Length = 467

 Score = 32.4 bits (73), Expect = 0.70
 Identities = 4/37 (10%), Positives = 13/37 (35%)

Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
            K   K+ +N    ++  N  ++ +   +      +
Sbjct: 19 TPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRS 55



 Score = 30.0 bits (67), Expect = 3.2
 Identities = 4/34 (11%), Positives = 11/34 (32%)

Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
            K   K  +N    ++  N  ++ +   +    
Sbjct: 19 TPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRH 52



 Score = 29.7 bits (66), Expect = 4.5
 Identities = 9/87 (10%), Positives = 27/87 (31%), Gaps = 3/87 (3%)

Query: 57  KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPL---VLVLTYVAEITQP 113
              + ++  N  ++ +   +   +  ++  E+     Y          L L+        
Sbjct: 29  NAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88

Query: 114 HLRGMLSATASMTTIFGTVSQLFLGSF 140
           +   + S +  ++    + S L   S 
Sbjct: 89  NPSDLNSKSLPLSNSKASSSSLSSSSS 115



 Score = 28.9 bits (64), Expect = 8.0
 Identities = 5/41 (12%), Positives = 15/41 (36%)

Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K   K      + ++  N  ++ +   +   +  ++  EK
Sbjct: 20 PKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEK 60


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 31.7 bits (72), Expect = 0.74
 Identities = 7/41 (17%), Positives = 19/41 (46%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           + + ++ E++AKK+E+     +K   ++     + K     
Sbjct: 109 RVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPR 149



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 8/43 (18%), Positives = 22/43 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           ++A+++ KK++E  A  ++   E+     + K K  +++    
Sbjct: 113 QRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEVHTP 155



 Score = 29.1 bits (65), Expect = 4.6
 Identities = 6/39 (15%), Positives = 19/39 (48%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           A++ E++ K+++      +K   ++     + K K  ++
Sbjct: 112 AQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPRE 150



 Score = 28.7 bits (64), Expect = 7.3
 Identities = 7/43 (16%), Positives = 20/43 (46%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           KA+ + ++ +++  K++E     +K   ++     + K K   
Sbjct: 107 KARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPR 149


>gnl|CDD|197700 smart00398, HMG, high mobility group. 
          Length = 70

 Score = 29.6 bits (67), Expect = 0.74
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           WK   ++EK   E+KAKK +++ E++  E K
Sbjct: 39 RWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +       E  KK  +  K   ++EK   E+K KK K++ ++
Sbjct: 22 AENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEE 64



 Score = 26.9 bits (60), Expect = 6.0
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +KK  ++ K    ++K   +EK KK+K++ +++  + K
Sbjct: 33 SKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70


>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
           This family of ABC substrate-binding proteins is
           observed primarily in close proximity with proteins
           localized to the cell wall and bearing the NEAT (NEAr
           Transporter, pfam05031) heme-binding domain. IsdE has
           been shown to bind heme and is involved in the process
           of scavenging heme for the purpose of obtaining iron.
          Length = 289

 Score = 31.8 bits (73), Expect = 0.74
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           EK  +E++A+K  K+  +K+ E KKK K KKK K
Sbjct: 136 EKYGREEQAEKLVKEINEKEAEVKKKVKGKKKPK 169


>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family.  This is a family
          of sequences found in both bacteria and bacteriophages.
          This region is approximately 130 residues long and in
          some cases is found as part of the PVL
          (Panton-Valentine leukocidin) group of genes, which
          encode a member of the leukocidin group of bacterial
          toxins that kill leukocytes by creation of pores in the
          cell membrane. PVL appears to be a virulence factor
          associated with a number of human diseases.
          Length = 118

 Score = 30.6 bits (69), Expect = 0.75
 Identities = 10/34 (29%), Positives = 25/34 (73%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
          ++++ KK E  ++E+K ++ E+K+ K+ + ++KK
Sbjct: 53 EYREIKKSENIEQERKERELERKRRKEAELRRKK 86


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 31.5 bits (72), Expect = 0.75
 Identities = 14/44 (31%), Positives = 22/44 (50%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           EP P++  K K  EK K + K ++K +     K E K   ++K 
Sbjct: 100 EPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143


>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
          Length = 102

 Score = 30.4 bits (68), Expect = 0.75
 Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          AKK  K KKE+KA   E+K +  KK  KK   K   K
Sbjct: 61 AKKHHKNKKEQKA--PEQKAQAAKKHAKKHSHKTAAK 95



 Score = 30.0 bits (67), Expect = 0.85
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK  KK  ++K + AKK  K K+++K  ++K +  KK  KK
Sbjct: 47 KKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKK 87



 Score = 28.1 bits (62), Expect = 4.4
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +KA+  +K  K KK +K  ++K +  K+  KK   K   K
Sbjct: 56 QKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95



 Score = 27.7 bits (61), Expect = 5.2
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKK 61
          T   P       A  K     +K+ KK  E+K +  KK  K KK++K  ++K
Sbjct: 26 TTAAPAATTTTAAPAKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQK 77


>gnl|CDD|188873 cd09213, Luminal_IRE1_like, The Luminal domain, a dimerization
           domain, of Inositol-requiring protein 1-like proteins.
           The Luminal domain is a dimerization domain present in
           Inositol-requiring protein 1 (IRE1), eukaryotic
           translation Initiation Factor 2-Alpha Kinase 3
           (EIF2AK3), and similar proteins. IRE1 and EIF2AK3 are
           serine/threonine protein kinases (STKs) and are type I
           transmembrane proteins that are localized in the
           endoplasmic reticulum (ER). They are kinase receptors
           that are activated through the release of BiP, a
           chaperone bound to their luminal domains under
           unstressed conditions. This results in dimerization
           through their luminal domains, allowing
           trans-autophosphorylation of their kinase domains and
           activation. They play roles in the signaling of the
           unfolded protein response (UPR), which is activated when
           protein misfolding is detected in the ER in order to
           decrease the synthesis of new proteins and increase the
           capacity of the ER to cope with the stress. IRE1, also
           called Endoplasmic reticulum (ER)-to-nucleus signaling
           protein (or ERN), contains an endoribonuclease domain in
           its cytoplasmic side and acts as an ER stress sensor. It
           is the oldest and most conserved component of the UPR in
           eukaryotes. Its activation results in the cleavage of
           its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
           promoting a splicing event that enables translation into
           a transcription factor which activates the UPR. EIF2AK3,
           also called PKR-like Endoplasmic Reticulum Kinase
           (PERK), phosphorylates the alpha subunit of eIF-2,
           resulting in the downregulation of protein synthesis. It
           functions as the central regulator of translational
           control during the UPR pathway. In addition to the eIF-2
           alpha subunit, EIF2AK3 also phosphorylates Nrf2, a
           leucine zipper transcription factor which regulates
           cellular redox status and promotes cell survival during
           the UPR.
          Length = 312

 Score = 32.1 bits (73), Expect = 0.77
 Identities = 4/32 (12%), Positives = 10/32 (31%)

Query: 55  KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
              K +   +  +N     N + +     + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 31.7 bits (72), Expect = 0.87
 Identities = 3/32 (9%), Positives = 10/32 (31%)

Query: 56  KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
              K ++  +  +N     N + +     +  
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 31.7 bits (72), Expect = 0.88
 Identities = 5/32 (15%), Positives = 11/32 (34%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
              K + K +  +N     N + +     + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 5/32 (15%), Positives = 12/32 (37%)

Query: 53  KKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
              K + K++  +N     N + +     + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 5/32 (15%), Positives = 11/32 (34%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
              K + K+   +N     N + +     + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 29.0 bits (65), Expect = 6.4
 Identities = 5/32 (15%), Positives = 11/32 (34%)

Query: 51  KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
              K + K+   +N     N + +     + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312



 Score = 29.0 bits (65), Expect = 7.0
 Identities = 3/32 (9%), Positives = 10/32 (31%)

Query: 60  KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
             + + +  +  +N     N + +      G 
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 32.0 bits (73), Expect = 0.77
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           A KK +K      +K   K  +K K ++ +KK KK +        
Sbjct: 331 AAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 31.6 bits (72), Expect = 0.99
 Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KK +K     +EK   K  +K + ++  KK KK +  K  K  
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 30.8 bits (70), Expect = 1.7
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           K+ +K    A++K   K  +K + ++ +KK KK +        
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 30.8 bits (70), Expect = 2.0
 Identities = 13/37 (35%), Positives = 17/37 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           K   K  +K K ++A+KK KK    K  K   K  K 
Sbjct: 345 KATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381



 Score = 30.4 bits (69), Expect = 2.8
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           P     +K     ++K   K  +K K +  +KK KK +  K  K
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373



 Score = 30.0 bits (68), Expect = 3.3
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           KK ++     +EK   K  ++ K ++ +KK KK          
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 28.9 bits (65), Expect = 8.1
 Identities = 12/43 (27%), Positives = 18/43 (41%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           KK  K A   ++K   K  +K K ++ +KK KK          
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 28.9 bits (65), Expect = 8.4
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
             A++K   K  +K K +  +K+ KK    K  K   K  K 
Sbjct: 340 VLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 32.5 bits (74), Expect = 0.79
 Identities = 25/87 (28%), Positives = 36/87 (41%), Gaps = 17/87 (19%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKK--------------EKKAKKKEKKKEKKKK 47
            SS K      + P   +  K  KK+K K               E+ A KK KK +KK +
Sbjct: 398 GSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGR 457

Query: 48  EKKKKKKKKKK---KKKNNNNNNNNNN 71
           +K  K     K   KK++  +  +NNN
Sbjct: 458 DKSSKVPSDSKAGGKKESVKSQEDNNN 484


>gnl|CDD|213640 TIGR01601, PYST-C1, Plasmodium yoelii subtelomeric domain
          PYST-C1.  This model represents the N-terminal domain
          of a paralogous family of Plasmodium yoelii genes
          preferentially located in the subtelomeric regions of
          the chromosomes. There are no obvious homologs to these
          genes in any other organism. The C-terminal portions of
          the genes which contain this domain are divergent and
          some contain other yoelii-specific paralogous domains
          such as PYST-C2 (TIGR01604).
          Length = 82

 Score = 29.6 bits (66), Expect = 0.81
 Identities = 8/34 (23%), Positives = 16/34 (47%)

Query: 58 KKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
          K+   +N  N+    +    NNNN  ++ ++K  
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDN 73



 Score = 28.8 bits (64), Expect = 1.6
 Identities = 10/42 (23%), Positives = 23/42 (54%)

Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          KE  +  +K   + K + + NNNN  ++ ++ +N   ++ +N
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDN 81



 Score = 28.4 bits (63), Expect = 2.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 47 KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          KE  +  +K   + K+    NNNN  ++ ++ +N   ++ + K
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDNK 82



 Score = 28.0 bits (62), Expect = 3.3
 Identities = 9/43 (20%), Positives = 23/43 (53%)

Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
          K+  +  +K   + K +   NNNN  ++ ++ +N   ++ +N+
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDNK 82



 Score = 27.7 bits (61), Expect = 3.8
 Identities = 7/36 (19%), Positives = 17/36 (47%)

Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
          K+  ++N  N+    +    NNNN  ++ +++    
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQ 75



 Score = 27.3 bits (60), Expect = 6.0
 Identities = 9/42 (21%), Positives = 22/42 (52%)

Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
          K+  +  EK   + K + +  NNN  ++ ++ +N   ++ +N
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDN 81


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 31.1 bits (71), Expect = 0.86
 Identities = 12/55 (21%), Positives = 27/55 (49%), Gaps = 11/55 (20%)

Query: 19  DWK-----KAKKKEKKK------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +W      K   K+ +K      K     K++   +K++++K++  K +K++ KN
Sbjct: 108 EWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKN 162



 Score = 30.7 bits (70), Expect = 1.0
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            K  +  K++  AKK+ +KKE+  K +K++ K KK
Sbjct: 130 VKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164



 Score = 29.6 bits (67), Expect = 2.9
 Identities = 17/50 (34%), Positives = 22/50 (44%), Gaps = 7/50 (14%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKK----EKKKKKKKK---KKKKKNNNNN 66
           K K   K EK AKKK  KK KK K    E   +   +   K+  K+   +
Sbjct: 76  KPKPPTKWEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKD 125



 Score = 28.4 bits (64), Expect = 6.6
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
               KE    +K+ +KKE+  + +K+E K KK
Sbjct: 133 NDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164



 Score = 28.0 bits (63), Expect = 8.0
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 35/80 (43%)

Query: 20  WKK-AKKK--EKKKKEK----------------KAKKKEKKKE----------------K 44
           W+K AKKK  +K+KK K                K   K+ +K+                 
Sbjct: 83  WEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFA 142

Query: 45  KKKEKKKKKKKKKKKKKNNN 64
           KK+ +KK++  K +K++  N
Sbjct: 143 KKRREKKERVAKNEKRELKN 162


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.4 bits (72), Expect = 0.88
 Identities = 12/58 (20%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKNNNNNNNNNNNNNNN 76
           +KA + E+ K E + + K+ ++EK++ EK+  + + K +  +K            + +
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 32.1 bits (73), Expect = 0.88
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K A++     +  + K K++K  KK+K+  K++    K 
Sbjct: 338 KTARRMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIKA 376



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           E  + + K +K  KK+++  KE+    K   K 
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKAAGKA 380


>gnl|CDD|225100 COG2189, COG2189, Adenine specific DNA methylase Mod [DNA
           replication, recombination, and repair].
          Length = 590

 Score = 32.1 bits (73), Expect = 0.88
 Identities = 14/67 (20%), Positives = 21/67 (31%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           IG     +   W   KK   K  +      +    +      + KK+K  KK  N   + 
Sbjct: 300 IGKPFNGSGRIWANFKKYLIKGLKADQNYNDNGYPRYYPFNLRYKKEKLDKKSKNRVIDP 359

Query: 69  NNNNNNN 75
            N N   
Sbjct: 360 INGNGTT 366


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 32.1 bits (73), Expect = 0.90
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 4/38 (10%)

Query: 29  KKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKKKN 62
            K EK+ KKK  K+     + K+ K++K+KK KK+   
Sbjct: 124 TKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161



 Score = 31.3 bits (71), Expect = 1.5
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KE KKK  K  +   + +++K+ K+KK KK+     
Sbjct: 128 KEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 30.6 bits (69), Expect = 2.4
 Identities = 11/39 (28%), Positives = 19/39 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K  K+ +KK  ++       K+ K++K KK KK+     
Sbjct: 125 KFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 29.8 bits (67), Expect = 4.0
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  K EK+ K+K  K+ +     K+++++K+KK KK+   
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161



 Score = 29.8 bits (67), Expect = 4.5
 Identities = 11/37 (29%), Positives = 22/37 (59%), Gaps = 5/37 (13%)

Query: 32  EKKAKKKEKKKEKKKKEK-----KKKKKKKKKKKKNN 63
             +  K EK+ +KK  ++     + K++K++K+KKN 
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNK 156


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
          family of proteins with unknown function appears to be
          restricted to Gammaproteobacteria.
          Length = 103

 Score = 30.0 bits (68), Expect = 0.91
 Identities = 10/44 (22%), Positives = 22/44 (50%)

Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          KK++K K  +  K +K++ K K     N++  +   +   + +N
Sbjct: 40 KKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSN 83



 Score = 29.6 bits (67), Expect = 1.4
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 30 KKEKKAKKKEKKKEKKKKEKKKKK 53
          KK++KAK +E  K +K++ K K++
Sbjct: 40 KKQQKAKAREADKARKQQLKAKQR 63



 Score = 27.3 bits (61), Expect = 7.0
 Identities = 10/46 (21%), Positives = 23/46 (50%)

Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
          E KK++K K ++  K +K++ K       N++  +   +   + +N
Sbjct: 38 EYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSN 83


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 31.3 bits (72), Expect = 0.92
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            E  +K    K+ E+  +K  ++ +K KKKKK++K 
Sbjct: 175 NEIIEKGLNVKQTEELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|149453 pfam08397, IMD, IRSp53/MIM homology domain.  The N-terminal
           predicted helical stretch of the insulin receptor
           tyrosine kinase substrate p53 (IRSp53) is an
           evolutionary conserved F-actin bundling domain involved
           in filopodium formation. The domain has been named IMD
           after the IRSp53 and missing in metastasis (MIM)
           proteins in which it occurs. Filopodium-inducing IMD
           activity is regulated by Cdc42 and Rac1 and is
           SH3-independent.
          Length = 218

 Score = 31.3 bits (71), Expect = 0.92
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
             KK      K  + + KKK  + EKK+ + KK ++K +K K
Sbjct: 95  LDKKYANALDKDYQTEYKKKRDELEKKQSDLKKLQRKSQKGK 136


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 31.2 bits (71), Expect = 0.93
 Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 3/41 (7%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K AK+KE+   E++   KE      KK    +K + KKKK
Sbjct: 159 RKDAKQKEEFAAEER---KEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 29.9 bits (68), Expect = 0.94
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           ++ K+  EK+ K  EK+ EK +KE ++ K++  K     
Sbjct: 68  EERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQQ 106



 Score = 27.6 bits (62), Expect = 6.1
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           + +K+  +K+ K  +K+ +K EK+ +E K++  K   +
Sbjct: 68  EERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQ 105


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
           homology Domain (IMD), a dimerization module that binds
           and bends membranes.  Inverse (I)-BAR (or IMD) is a
           member of the Bin/Amphiphysin/Rvs (BAR) domain family.
           It is a dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions in the
           opposite direction compared to classical BAR and F-BAR
           domains, which produce membrane invaginations. IMD
           domains are found in Insulin Receptor tyrosine kinase
           Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
           Brain-specific Angiogenesis Inhibitor 1-Associated
           Protein 2-like (BAIAP2L) proteins. These are
           multi-domain proteins that act as scaffolding proteins
           and transducers of a variety of signaling pathways that
           link membrane dynamics and the underlying actin
           cytoskeleton. Most members contain an N-terminal IMD, an
           SH3 domain, and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus, exccept for MIM which does not
           carry an SH3 domain. Some members contain additional
           domains and motifs. The IMD domain binds and bundles
           actin filaments, binds membranes and produces membrane
           protrusions, and interacts with the small GTPase Rac.
          Length = 223

 Score = 31.2 bits (71), Expect = 0.96
 Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 18  LDWKKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           LD K   K EK  KKE K K+++  K + + +K +KK +K    K 
Sbjct: 105 LDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKSGTGKY 150


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
           permease components [Defense mechanisms].
          Length = 567

 Score = 32.0 bits (73), Expect = 0.96
 Identities = 36/174 (20%), Positives = 54/174 (31%), Gaps = 54/174 (31%)

Query: 125 MTTIFGTVSQLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASL 184
            T+I   +  L L   L WR A IL L+ P+LAL                 + ++ S   
Sbjct: 134 FTSILLLIGSLVLLFSLSWRLALILLLILPLLALVLSLL----------ARKSRKLSR-- 181

Query: 185 CWLRGWVTPDKVQTELSQITKAIEES---------------ELKRLGKDGQ--RRPNYRM 227
                     +V+  L ++   + ES               ELKR  +  +  RR N R 
Sbjct: 182 ----------RVREALGELNARLLESLSGIRVIKAFGAEDRELKRFEEANEELRRANLRA 231

Query: 228 YMRRTFLLP-------------YAIVTSLFFIGQ--FGGMTTLQTYAVGIFESI 266
                 L P              A+   L   G    G +     Y + +   I
Sbjct: 232 SRLEALLAPLMLLLSSLGTVLVLALGGFLVLSGSLTVGALAAFILYLLRLLTPI 285


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 32.1 bits (73), Expect = 1.00
 Identities = 7/44 (15%), Positives = 25/44 (56%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           +++ ++K  ++  +KA K   K+       + +++ ++K+K+ +
Sbjct: 694 YQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTS 737


>gnl|CDD|152754 pfam12319, TryThrA_C, Tryptophan-Threonine-rich plasmodium antigen
           C terminal.  This protein is found in eukaryotes.
           Proteins in this family are typically between 254 to 536
           amino acids in length. This family is the C terminal of
           a surface antigen of malarial Plasmodium species. It is
           currently being targeted for use as part of a subunit
           vaccine against Plasmodium falciparum, the main species
           involved in causing human malaria.
          Length = 217

 Score = 31.1 bits (71), Expect = 1.00
 Identities = 18/51 (35%), Positives = 21/51 (41%), Gaps = 2/51 (3%)

Query: 20  WKKAKKKEKKKKEKKAKKKEK--KKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           WK  K  E    + K ++ E   K EKKK  K    KKKK   K     N 
Sbjct: 111 WKNKKILEWLMSDWKREEDEYWEKWEKKKWSKPLNLKKKKNWLKWKERINR 161


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 30.9 bits (70), Expect = 1.0
 Identities = 18/47 (38%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 21  KKAKK-KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           KK K+  EK+KKE K KK ++K+ ++ +E K K+K  ++++K   N 
Sbjct: 55  KKLKEISEKEKKELK-KKLDRKRLERIQEAKTKEKHAQEREKTEINR 100


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
          [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 9/36 (25%), Positives = 16/36 (44%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K +            K  +KK++  +KK++ KK K
Sbjct: 63 VKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 30.2 bits (68), Expect = 1.1
 Identities = 11/34 (32%), Positives = 27/34 (79%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           KE+++  + A+ +++++  +++E++K+KKKKK K
Sbjct: 83  KEERRSARMARLEDERELAREEEEEKRKKKKKNK 116



 Score = 28.7 bits (64), Expect = 2.9
 Identities = 13/45 (28%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 19  DWKKAKKKEKKKKEKKAKK--KEKKKEKKKKEKKKKKKKKKKKKK 61
           D  +A   E +K+E+++ +  + + + +  +E++++K+KKKKK K
Sbjct: 72  DNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116


>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region.  Remorins are
          plant-specific plasma membrane-associated proteins. In
          tobacco remorin co-purifies with lipid rafts. Most
          remorins have a variable, proline-rich C-half and a
          more conserved N-half that is predicted to form coiled
          coils. Consistent with this, circular dichroism studies
          have demonstrated that much of the protein is
          alpha-helical. Remorins exist in plasma membrane
          preparations as oligomeric structures and form
          filaments in vitro. The proteins can bind polyanions
          including the extracellular matrix component
          oligogalacturonic acid (OGA). In vitro, remorin in
          plasma membrane preparations is phosphorylated
          (principally on threonine residues) in the presence of
          OGA and thus co-purifies with a protein kinases(s). The
          biological functions of remorins are unknown but roles
          as components of the membrane/cytoskeleton are
          possible.
          Length = 112

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 16/35 (45%), Positives = 20/35 (57%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K+K K E + KK E+K EKKK E  +K K K   
Sbjct: 37 NKKKAKAEAELKKIERKLEKKKAEAAEKLKNKLAA 71



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 16/58 (27%)

Query: 20 WKKAKKKEKKKKEKKAKKKE--------KKKEK-----KKKEKKKKKKKKKKKKKNNN 64
          W++A   EK K   K +++E        KKK K     KK E+K +KKK +  +K  N
Sbjct: 13 WEEA---EKAKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKLKN 67



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KA+ + KK + K  KKK +  EK K +     KK ++K+ 
Sbjct: 42 KAEAELKKIERKLEKKKAEAAEKLKNKLAAAHKKAEEKRA 81


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
            KK + K     E+K KK+EKK++ K++EK+
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 12/81 (14%), Positives = 36/81 (44%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
           K KE  + E++ ++ + + E   + K  K++++   +++ +  +    + +   + +   
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284

Query: 84  NNNEEKGTFLYKMAAPLVLVL 104
              E+K   L+    P   +L
Sbjct: 285 ILKEKKDEELFWFEKPWPTLL 305



 Score = 29.6 bits (67), Expect = 5.9
 Identities = 10/36 (27%), Positives = 25/36 (69%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
           ++   +K+K+ K     +++ KKE+KK++ K+++K+
Sbjct: 643 FRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 16/35 (45%), Positives = 23/35 (65%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           E++ KE +A+K+  + EKK  EKK KK  KK K +
Sbjct: 75  EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDE 109



 Score = 29.9 bits (68), Expect = 3.9
 Identities = 12/44 (27%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 19  DWKKAK---KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +++  K   + EKK  EKKAKK  KK + ++   ++  + + ++
Sbjct: 80  EYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123



 Score = 28.8 bits (65), Expect = 7.4
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            + +E+ ++E K  + EK+  + +K+  +KK KK  KK 
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG 106


>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
           and toxic compound extrusion (MATE)-like proteins.
           Escherichia coli MurJ (MviN) has been identified as
           essential for murein biosynthesis. It has been suggested
           that MurJ functions as the peptidoglycan lipid II
           flippase which is involved in translocation of
           lipid-anchored peptidoglycan precursors across the
           cytoplasmic membrane, though results obtained in
           Bacillus subtilis seem to indicate that its MurJ
           homologs are not essential for growth. Some MviN family
           members (e.g. in Mycobacterium tuberculosis) possess an
           extended C-terminal region that contains an
           intracellular pseudo-kinase domain and an extracellular
           domain resembling carbohydrate-binding proteins.
           Proteins from the MATE family are involved in exporting
           metabolites across the cell membrane and are often
           responsible for multidrug resistance (MDR).
          Length = 420

 Score = 31.7 bits (73), Expect = 1.1
 Identities = 26/109 (23%), Positives = 42/109 (38%), Gaps = 23/109 (21%)

Query: 233 FLLPYAIVTSLFFIGQFG----GMTT--LQTYAVGI------------FESIHAPLDP-Y 273
            +L   IV  LF  G F      MT   L  YA+G+            F ++     P  
Sbjct: 314 IVLAEPIVRLLFERGAFTAEDTLMTASALAAYALGLPAFALVKLLSRAFYALKDTKTPVK 373

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQ 322
            A + + V      LL ++LI   G   LAL ++  +    +++ V  +
Sbjct: 374 IAVIAVAV----NILLNLLLIKPLGHVGLALATSLSAWVNALLLLVLLR 418


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 11/49 (22%), Positives = 21/49 (42%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           ++E+     K   KE+ K+ + K K+      K+  K     N++ N  
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572



 Score = 31.2 bits (71), Expect = 1.7
 Identities = 10/50 (20%), Positives = 23/50 (46%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           A++++     K+  K+  KK + K ++      K+  K+    N++ N  
Sbjct: 523 AREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572



 Score = 28.9 bits (65), Expect = 8.7
 Identities = 10/49 (20%), Positives = 19/49 (38%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           ++++     K   KE+ KK + K K+      K         N++ N  
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 137 to
          196 amino acids in length.
          Length = 149

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K+ ++K  K   K    K K+K+KK K K+   
Sbjct: 13 KRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45



 Score = 29.2 bits (66), Expect = 3.0
 Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           A  +++   E K  +K+  K  +K    K K K+KK K   
Sbjct: 1  MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42



 Score = 29.2 bits (66), Expect = 3.3
 Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +K    E K+  KK  K  +K    K + K+KK K K+   
Sbjct: 5  RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45



 Score = 28.1 bits (63), Expect = 6.5
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            + +K    + K+++KK  K  +K    K K K+KK K K 
Sbjct: 1  MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42



 Score = 27.7 bits (62), Expect = 9.4
 Identities = 9/41 (21%), Positives = 18/41 (43%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
              K+   +  + ++K  +  +K    K K K+KK K+  
Sbjct: 2  ASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42


>gnl|CDD|222225 pfam13564, DoxX_2, DoxX-like family.  This family of
           uncharacterized proteins are related to DoxX pfam07681.
          Length = 102

 Score = 29.8 bits (68), Expect = 1.2
 Identities = 32/109 (29%), Positives = 42/109 (38%), Gaps = 11/109 (10%)

Query: 238 AIVTSLFFIGQFGGMTTLQT-YAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHY 296
             + +LFF+   G M   +    V  F  +  P   YF  LL G+ EL GAL   +L+  
Sbjct: 3   TGLLALFFLF-SGVMKLFRPEEVVAAFAWLGYP--LYFVRLL-GILELLGALG--LLLPL 56

Query: 297 TGKRPLALISTGGSAACFIVVAVYAQFHLSYG-WDSPLVPTVFLVLAAF 344
            G   L  ++  G A   I        HL  G       P V L L A 
Sbjct: 57  VGAPRLKELAAAGLALLMIGAIA---AHLRVGDPFHVAFPAVLLALLAV 102


>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
          members of the HMG-box superfamily of DNA-binding
          proteins. These proteins bind the minor groove of DNA
          in a non-sequence specific fashion and contain two or
          more tandem HMG boxes. Class II members include
          non-histone chromosomal proteins, HMG1 and HMG2, which
          bind to bent or distorted DNA such as four-way DNA
          junctions, synthetic DNA cruciforms, kinked
          cisplatin-modified DNA, DNA bulges, cross-overs in
          supercoiled DNA, and can cause looping of linear DNA.
          Class III members include nucleolar and mitochondrial
          transcription factors, UBF and mtTF1, which bind
          four-way DNA junctions.
          Length = 66

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 13/28 (46%), Positives = 22/28 (78%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
          WK+  ++EKKK E+KA+K +++ EK+ K
Sbjct: 39 WKELSEEEKKKYEEKAEKDKERYEKEMK 66


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
            KKK+ KK+E+  K K+K +    K + +KK  K
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KK  KKE++ +K KK+ +    K + +KKN+
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154



 Score = 30.3 bits (69), Expect = 1.9
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
            KKK KK+E+ KK KKK +    K +    N+
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           KKKK KK ++ +K K+K +    K + +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 35  AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            KKK+ KKE++ K+ KKK +    K +    N+
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154



 Score = 29.2 bits (66), Expect = 4.7
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           K+KK KK+E+ K+ KKK +    K + +KK +
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154



 Score = 29.2 bits (66), Expect = 4.9
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           +D  K KK +K+++ KKAKKK +    K + +KK  K
Sbjct: 119 IDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 28.8 bits (65), Expect = 5.7
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           KK+K KKE++ KK +KK E    + + +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 28.8 bits (65), Expect = 6.1
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           K+KK K+++  KK KKK +    K + +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 28.0 bits (63), Expect = 9.9
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 29  KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KKK+ K +++ KK +KK +    K + +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
          family is found from fungi to humans, but its exact
          function is not known.
          Length = 88

 Score = 29.2 bits (66), Expect = 1.2
 Identities = 10/36 (27%), Positives = 30/36 (83%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          +K +++ E++ K++E++KEK+K+  ++++KK ++++
Sbjct: 53 EKAREETERERKEREERKEKRKRAIEERRKKIEERR 88



 Score = 26.9 bits (60), Expect = 7.7
 Identities = 8/32 (25%), Positives = 25/32 (78%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
          ++ +++ K+++E+K K+K   +E++KK ++++
Sbjct: 57 EETERERKEREERKEKRKRAIEERRKKIEERR 88


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361). 
          This is a region of 120 amino acids that is conserved
          in a family of proteins found from plants to fungi. The
          function is not known.
          Length = 114

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          E+KK  ++ K+ EKK E+   EK+ K  K++  K
Sbjct: 60 ERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSK 93



 Score = 28.7 bits (65), Expect = 3.1
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 6/47 (12%)

Query: 21 KKAKKKEKKKK------EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KK KK  KK         KKA ++ K+ EKK +E   +K+ K  K++
Sbjct: 44 KKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDEKEIKDLKEQ 90


>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain.  The
          sequences featured in this family are found repeated in
          a number of plant calmodulin-binding proteins, and are
          thought to constitute the calmodulin-binding domains.
          Binding of the proteins to calmodulin depends on the
          presence of calcium ions. These proteins are thought to
          be involved in various processes, such as plant defence
          responses and stolonisation or tuberization.
          Length = 115

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 12/55 (21%), Positives = 20/55 (36%)

Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
              KK   K +K+K K     +K    K+  K  +K +K   +  N      + 
Sbjct: 7  ERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGPNFLPLEPDP 61


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 8/63 (12%), Positives = 29/63 (46%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           KK   ++++  E++  ++ +++      + + ++   K KK    + + N +  + N  +
Sbjct: 347 KKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEES 406

Query: 81  NNN 83
             +
Sbjct: 407 KGS 409



 Score = 30.2 bits (68), Expect = 2.7
 Identities = 9/47 (19%), Positives = 24/47 (51%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           W+  +   +  + KK +  EK + K +K++  +K++  +++ N    
Sbjct: 320 WRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREAR 366



 Score = 30.2 bits (68), Expect = 3.5
 Identities = 14/73 (19%), Positives = 35/73 (47%), Gaps = 3/73 (4%)

Query: 22  KAKKKEKKKKEKKAKK---KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           KA+KKE  +K + A++   +E ++E+     + + ++   K K     + + N +  + N
Sbjct: 344 KARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSEN 403

Query: 79  NNNNNNNNEEKGT 91
             +  +  + + T
Sbjct: 404 EESKGSPPQVEAT 416


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 31.5 bits (72), Expect = 1.3
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           K+++K+E  KK K  K+K K +   + NN     N 
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237



 Score = 31.2 bits (71), Expect = 1.4
 Identities = 15/41 (36%), Positives = 21/41 (51%)

Query: 37  KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           K  K KE +K+E  KK K  K+K K+ +  + NN     N 
Sbjct: 197 KYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 14/41 (34%), Positives = 20/41 (48%)

Query: 33  KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
           K  K KE +KE+  K+ K  K+K K +   + NN     N 
Sbjct: 197 KYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237



 Score = 31.2 bits (71), Expect = 1.8
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           L  K+ +K+E  KK K  K+K K +       KK+++ K K
Sbjct: 199 LKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPK 239


>gnl|CDD|220569 pfam10101, DUF2339, Predicted membrane protein (DUF2339).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 738

 Score = 31.6 bits (72), Expect = 1.3
 Identities = 24/129 (18%), Positives = 35/129 (27%), Gaps = 13/129 (10%)

Query: 229 MRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGAL 288
           + R F L   ++  L  +           +  G+            A   L  A L  A 
Sbjct: 319 LLRAFGLLLQLLALLSLLALLLLGYLNSAFLTGLL-------LWALAAAALAAAWLLLAE 371

Query: 289 LCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHI 348
                    G   L  +    + A   +        LS  W +  +    LVLAA     
Sbjct: 372 RLSRRRLRQGLAALFAL---AALAALALALTME---LSDAWLTLALAAQVLVLAALARRY 425

Query: 349 CIRLLPWML 357
               L W L
Sbjct: 426 VAPRLRWAL 434


>gnl|CDD|224208 COG1289, COG1289, Predicted membrane protein [Function unknown].
          Length = 674

 Score = 31.6 bits (72), Expect = 1.3
 Identities = 26/151 (17%), Positives = 43/151 (28%), Gaps = 28/151 (18%)

Query: 268 APLDPYFATLLLGVAELGGALLCVVL--IHYTG----KRPLALISTGGSAACFIVVAVYA 321
           A         LLG+ +   A+  V +     +G    K    LI   G+   F V  +  
Sbjct: 20  AACLALALAFLLGLPQPSWAVSTVAIVSAPDSGAVLSKGLKRLI---GTLIGFAVALLLV 76

Query: 322 QFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIF 381
                  W   L+ T++L L   +                     R  AS A   + Y  
Sbjct: 77  ALLAQEPWLFLLLLTLWLGLCTAIGS---LY--------------RTIASYAFVLAGYTA 119

Query: 382 AFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
                    P  +   L+    +     ++G
Sbjct: 120 LIIGPAPAIPEPEL--LFDGAVWRVVEILLG 148


>gnl|CDD|237034 PRK12278, PRK12278, 50S ribosomal protein L21/unknown domain fusion
           protein; Provisional.
          Length = 221

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 8/40 (20%), Positives = 9/40 (22%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
             +  K      K     E    K KKE   K        
Sbjct: 107 DASGVKAATGAGKVEVAAEAAPAKAKKEAAPKAAPAPAAA 146


>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
           Mitochondrial-encoded NADH-quinone
           oxidoreductase/respiratory complex I, the second domain
           of the Nad11/75-kDa subunit of some protists.
           NADH-quinone oxidoreductase is the first
           energy-transducting complex in the respiratory chain and
           functions as a redox pump that uses the redox energy to
           translocate H+ ions across the membrane, resulting in a
           significant contribution to energy production. The nad11
           gene codes for the largest (75-kDa) subunit of the
           mitochondrial NADH-quinone oxidoreductase, it
           constitutes the electron input part of the enzyme, or
           the so-called NADH dehydrogenase fragment. The Nad11
           subunit is made of two domains: the first contains three
           binding sites for FeS clusters (the fer2 domain), the
           second domain (this CD), is of unknown function or, as
           postulated, has lost an ancestral formate dehydrogenase
           activity that became redundant during the evolution of
           the complex I enzyme. Although only vestigial sequence
           evidence remains of a molybdopterin binding site, this
           protein domain belongs to the molybdopterin_binding
           (MopB) superfamily of proteins.
          Length = 366

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           KK   K    N N+NN   NNN  N +  N   NN
Sbjct: 107 KKLLNKLGSLNTNSNNFLENNNYFNLDLENYLFNN 141



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 2/46 (4%)

Query: 50  KKKKKKKKKKKKNNNNNNNNNNNNNNNNNN--NNNNNNNEEKGTFL 93
           KK   K      N+NN   NNN  N +  N   NN+  N +K    
Sbjct: 107 KKLLNKLGSLNTNSNNFLENNNYFNLDLENYLFNNSLKNLDKSDLC 152


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
          family shows weak but suggestive similarity to
          translation initiation factor SUI1 and its prokaryotic
          homologs.
          Length = 173

 Score = 30.6 bits (69), Expect = 1.4
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             + +E+K +K     +K ++ ++  KKKKK  +K
Sbjct: 55 TIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQK 90


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 31.5 bits (72), Expect = 1.4
 Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 6/46 (13%)

Query: 23  AKKKEKKKKEKKAKKKEKKKE------KKKKEKKKKKKKKKKKKKN 62
            K K + KKE+K   +E +K+      +K KEKK+K  +  KK K 
Sbjct: 752 NKLKAQLKKERKGAMRELRKDNRFIAREKLKEKKEKDAEYHKKMKR 797



 Score = 28.8 bits (65), Expect = 8.4
 Identities = 11/61 (18%), Positives = 27/61 (44%), Gaps = 9/61 (14%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKK------EKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
                P P    + K  E  ++      +++A+  ++ K   E  K+E ++ KK + ++ 
Sbjct: 220 TVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERL 279

Query: 61  K 61
           +
Sbjct: 280 R 280


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 31.7 bits (72), Expect = 1.4
 Identities = 14/65 (21%), Positives = 29/65 (44%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           K +K KK+KK   KE+   +K+  K  +    ++  KN  +         +   N + ++
Sbjct: 82  KLKKFKKDKKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDS 141

Query: 85  NNEEK 89
            +E+ 
Sbjct: 142 EDEQN 146


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
          binding.  This is the middle region of a family of TATA
          element modulatory factor 1 proteins conserved in
          eukaryotes that contains at its N-terminal section a
          number of leucine zippers that could potentially form
          coiled coil structures. The whole proteins bind to the
          TATA element of some RNA polymerase II promoters and
          repress their activity. by competing with the binding
          of TATA binding protein. TMFs are evolutionarily
          conserved golgins that bind Rab6, a ubiquitous ras-like
          GTP-binding Golgi protein, and contribute to Golgi
          organisation in animal and plant cells.
          Length = 74

 Score = 28.8 bits (65), Expect = 1.4
 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 21 KKAKKKEKKKKE--KKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
          +K  KKE K     KK + K K+ EK+  E KKK +K +K+ +N
Sbjct: 22 EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELEN 65


>gnl|CDD|180892 PRK07220, PRK07220, DNA topoisomerase I; Validated.
          Length = 740

 Score = 31.3 bits (71), Expect = 1.4
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +KK+KE+KA++ +K+K K  K+ K   KK  +K + 
Sbjct: 669 QKKQKEEKAQQPKKEKPKSIKDIKGIGKKTAEKLEE 704


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 31.5 bits (71), Expect = 1.5
 Identities = 17/40 (42%), Positives = 21/40 (52%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKA  K+   K+K AKK   KK  +K   KK  KK  +K 
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKA 1047



 Score = 31.1 bits (70), Expect = 1.8
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKA  K+K  K+  AKK  +K   KK  KK  +K   +K 
Sbjct: 1013 KKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052



 Score = 31.1 bits (70), Expect = 2.2
 Identities = 17/41 (41%), Positives = 21/41 (51%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKA  K+   K+  AKK   KK+  KK   KK  +K   KK
Sbjct: 998  KKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKK 1038



 Score = 30.7 bits (69), Expect = 2.6
 Identities = 18/40 (45%), Positives = 21/40 (52%)

Query: 21   KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            KKA  K+   K+  AKKK  KK   KK  +K   KK  KK
Sbjct: 1003 KKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKK 1042



 Score = 30.0 bits (67), Expect = 4.7
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 2/41 (4%)

Query: 23   AKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            AKK   KKK  +K   KK  +K   KK  KK  +K   +K 
Sbjct: 1012 AKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052



 Score = 30.0 bits (67), Expect = 4.8
 Identities = 18/48 (37%), Positives = 20/48 (41%)

Query: 14   EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +      KKA       K+K AKK   KK   KK   KK   KKK  K
Sbjct: 976  KTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAK 1023


>gnl|CDD|129141 TIGR00030, S21p, ribosomal protein S21.  This model describes
          bacterial ribosomal protein S21 and most mitochondrial
          and chloroplast equivalents [Protein synthesis,
          Ribosomal proteins: synthesis and modification].
          Length = 58

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 3/45 (6%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEK---KKKEKKKKKKKKKKKKKNN 63
           A ++ K+K EK+   +E KK +   K  E++++K+K   K+   
Sbjct: 13 SALRRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRK 57



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 4/43 (9%)

Query: 21 KKAKKK-EKKKKEKKAKKK---EKKKEKKKKEKKKKKKKKKKK 59
          ++ K+K EK+   ++ KK+   EK  E++++++K   K+ +KK
Sbjct: 16 RRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRKK 58


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.8 bits (70), Expect = 1.5
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKK---AKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           T     + ++   ++ K+  K+   AKK  ++ EK K+ K +  + KKK ++ + 
Sbjct: 125 TSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHE 179



 Score = 29.6 bits (67), Expect = 3.9
 Identities = 13/43 (30%), Positives = 27/43 (62%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
            + ++ EKK K  +AK+K  K+ +K++E K++ ++  +K K  
Sbjct: 235 NELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRG 277


>gnl|CDD|224099 COG1178, ThiP, ABC-type Fe3+ transport system, permease component
           [Inorganic ion transport and metabolism].
          Length = 540

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 30/171 (17%), Positives = 56/171 (32%), Gaps = 47/171 (27%)

Query: 234 LLPYAIVTSLFFIGQFGGMTTL----QTYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
           +   A++  +  +  FG    L     T    I+++I +  D   A LL  +  L  A L
Sbjct: 197 IAAGALLVFMLCLADFGTPLILGGGYTTLETAIYQAIMSSFDLARAALLALLLLLITAAL 256

Query: 290 CVVLIHYTGKRPLALISTGGS-------------------AACFIVVA-----VYAQFHL 325
            ++       +  + +S  GS                      F+++      V   F L
Sbjct: 257 LLLERRLRRAKQKSTLSGKGSRPKQRLGRLRRLLGAALIVLLLFLILLPLLAVVLPSFVL 316

Query: 326 SYGWD-----------SPLVPTVFL-VLAAFLTHIC-------IRLLPWML 357
           ++G               L+ ++ L + AA L  +        +R L   L
Sbjct: 317 NWGRPLSLVLASPQFWQALLNSLLLALAAALLAVVLALLLAYAVRRLRSRL 367


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of
          this family are the protein YabG, demonstrated for
          Bacillus subtilis to be an endopeptidase able to
          release N-terminal peptides from a number of
          sporulation proteins, including CotT, CotF, and SpoIVA.
          It appears to be expressed under control of sigma-K
          [Cellular processes, Sporulation and germination].
          Length = 283

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 12/52 (23%), Positives = 27/52 (51%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
          D +   + E+KK++K+ K++ +   +  ++  K  K+KK+ +      NN  
Sbjct: 48 DLELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQL 99


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.2 bits (71), Expect = 1.6
 Identities = 13/46 (28%), Positives = 27/46 (58%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K+ ++KE++ +E K K KE +K  ++ E++ +  ++ K KK     
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376



 Score = 29.6 bits (67), Expect = 4.7
 Identities = 12/42 (28%), Positives = 24/42 (57%), Gaps = 3/42 (7%)

Query: 23  AKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
             +KE +  E   +K E+K    E++ +E KK+ ++ ++K K
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283



 Score = 29.3 bits (66), Expect = 6.8
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
              +K E+K +E + + +E KKE ++ E+K K+ K+ K+K 
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA 292



 Score = 29.3 bits (66), Expect = 6.9
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
            + K+ EK+ KE + K+++ +KE ++ EK  KK+ +
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESE 494



 Score = 28.9 bits (65), Expect = 9.1
 Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 4/46 (8%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK----KKKKKKKKN 62
           ++ KK  +K KE+  ++++ KKE +K EK  ++    ++K KK K 
Sbjct: 690 EEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKA 735



 Score = 28.9 bits (65), Expect = 9.9
 Identities = 9/42 (21%), Positives = 25/42 (59%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            + ++KE KK E++  K  ++  + +K  ++ +K+ ++ +K 
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655


>gnl|CDD|217091 pfam02537, CRCB, CrcB-like protein.  CRCB is a putative integral
           membrane protein possibly involved in chromosome
           condensation. Over expression in E. coli also leads to
           camphor resistance.
          Length = 117

 Score = 29.5 bits (67), Expect = 1.7
 Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 2/54 (3%)

Query: 243 LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHY 296
           L   G  GG TT  T+A+   + +      Y   LL  +  L   LL V L   
Sbjct: 66  LLGTGFCGGFTTFSTFALETVQLLRQ--GKYLRALLYLLLSLLLGLLAVALGFA 117


>gnl|CDD|177120 MTH00045, ND6, NADH dehydrogenase subunit 6; Validated.
          Length = 162

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 7/39 (17%), Positives = 18/39 (46%)

Query: 69  NNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYV 107
           +N    +NNN +  + ++    + LY      +L+  ++
Sbjct: 101 DNFLELSNNNWSLVSLSDLVGSSELYNSGGLYLLLGGFI 139


>gnl|CDD|226646 COG4178, COG4178, ABC-type uncharacterized transport system,
           permease and ATPase components [General function
           prediction only].
          Length = 604

 Score = 31.1 bits (71), Expect = 1.7
 Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 21/80 (26%)

Query: 208 EESELKRLGKD-GQRRPNYRMYMRR--------------TFLLPYAIVTSLFFIGQ--FG 250
           E+ E +RL         N+R  +R               + +LP  I    +F GQ  FG
Sbjct: 264 EKVERRRLDDRFDAVLGNWRRLVRAQIRLTWFQLGYGWLSVVLPILIAAPRYFSGQITFG 323

Query: 251 GMTTLQTYAVGIFESIHAPL 270
           G+      AVG F  +H+ L
Sbjct: 324 GLMQ----AVGAFGQVHSSL 339


>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP).  NAP proteins
           are involved in moving histones into the nucleus,
           nucleosome assembly and chromatin fluidity. They affect
           the transcription of many genes.
          Length = 236

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 11/39 (28%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 13  TEPTPLDWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEK 49
           +E TP++WK+ K    +  KK+++ KK  + +   K   
Sbjct: 144 SEGTPIEWKEGKNLTVKTVKKKQRNKKTGQTRTITKTVP 182


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 30.6 bits (69), Expect = 1.8
 Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 3/48 (6%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK---KKKKNNN 64
           WK+   +EK+   K+A    ++ +++K+E  KK   K       +N  
Sbjct: 109 WKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEP 156



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 20/62 (32%), Positives = 24/62 (38%), Gaps = 8/62 (12%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKK----KEKKKKKKKKKKKKKNNN 64
           IG F E  P    K + K +     KA  +E K   K      E KKKK   K KK    
Sbjct: 148 IGPFIENEP----KIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEE 203

Query: 65  NN 66
            +
Sbjct: 204 YD 205



 Score = 28.7 bits (64), Expect = 6.0
 Identities = 10/46 (21%), Positives = 19/46 (41%)

Query: 3   SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
            S    +   T+     W +  + +KKK   K KK +++ +    E
Sbjct: 165 PSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210


>gnl|CDD|147641 pfam05582, Peptidase_U57, YabG peptidase U57.  YabG is a protease
           involved in the proteolysis and maturation of SpoIVA and
           YrbA proteins, conserved with the cortex and/or coat
           assembly by Bacillus subtilis.
          Length = 287

 Score = 30.6 bits (70), Expect = 1.8
 Identities = 10/52 (19%), Positives = 25/52 (48%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           D +K    E  + +K+ ++K  +  K  ++ +K  ++K++ +     N N  
Sbjct: 49  DLEKVDASEVTRYKKRFEEKINESLKLIRQDRKLMREKQEYRATRGYNENEL 100



 Score = 29.5 bits (67), Expect = 4.2
 Identities = 10/49 (20%), Positives = 20/49 (40%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           K    +    K + +EK  E  K  ++ +K  ++K++       N N  
Sbjct: 52  KVDASEVTRYKKRFEEKINESLKLIRQDRKLMREKQEYRATRGYNENEL 100


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 10/41 (24%), Positives = 30/41 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           + A+  E ++ E+  +++ ++KE++KK+ K++K+++K+  +
Sbjct: 168 RNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAE 208



 Score = 28.9 bits (65), Expect = 7.1
 Identities = 10/30 (33%), Positives = 25/30 (83%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
           + E++++E+K ++K++ KE+K++EK+  +K
Sbjct: 180 EAERRRREEKERRKKQDKERKQREKETAEK 209


>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein.  This protein family was
           first noted as a paralogous set in Porphyromonas
           gingivalis, but it is more widely distributed among the
           Bacteroidetes. The protein family is now renamed GLPGLI
           after its best-conserved motif.
          Length = 227

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 10/35 (28%), Positives = 14/35 (40%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            K +          + KK KK   +K KK K+K  
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227



 Score = 29.6 bits (67), Expect = 3.8
 Identities = 14/37 (37%), Positives = 17/37 (45%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           KA K E          K KK +K  +EK KK K+K  
Sbjct: 191 KAIKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
          spores are protected by a protein shell consisting of
          over 50 different polypeptides, known as the coat. This
          family of proteins has an important morphogenetic role
          in coat assembly, it is involved in the assembly of at
          least 5 different coat proteins including CotB, CotG,
          CotS, CotSA and CotW. It is likely to act at a late
          stage of coat assembly.
          Length = 185

 Score = 30.2 bits (68), Expect = 1.8
 Identities = 12/39 (30%), Positives = 25/39 (64%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           KKA +KE++K+      K K++E+K + ++ +K+K+  
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 30.6 bits (69), Expect = 1.8
 Identities = 11/40 (27%), Positives = 29/40 (72%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KK + + +++  K+KEK++ +  ++ K+ ++KK+++K+ N
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 10/49 (20%), Positives = 28/49 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           +  K+KEK++ E   K KE +++K++++++  + K+      ++   + 
Sbjct: 162 RSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSG 210



 Score = 29.8 bits (67), Expect = 2.8
 Identities = 9/50 (18%), Positives = 31/50 (62%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           K++ K+++K++ + ++K ++ + KK++EK++  + K+      ++   + 
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSG 210


>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
           PB1.  Two GTP binding sites exist in this protein.
          Length = 740

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           LD  + K  + K  +++   KE  K  K+K+  K+K +  +
Sbjct: 175 LDKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDRITR 215


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 10/38 (26%), Positives = 24/38 (63%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           +  + EK   EK  ++KEKKKE ++ +++ ++  ++ +
Sbjct: 167 EELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204


>gnl|CDD|145700 pfam02681, DUF212, Divergent PAP2 family.  This family is related
           to the pfam01569 family (personal obs: C Yeats).
          Length = 141

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 18/61 (29%), Positives = 23/61 (37%), Gaps = 15/61 (24%)

Query: 292 VLIHY--TGK-RPLALISTGG-----SAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
           V+I      +     L  TGG     SA    V A+     L  G+DS L    F + A 
Sbjct: 22  VIIELFKERRWDFRVLFETGGMPSSHSAL---VTALATGVGLQEGFDSSL----FAIAAV 74

Query: 344 F 344
           F
Sbjct: 75  F 75


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 30.7 bits (70), Expect = 1.9
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           K++E+K+KEKKAKK+ K++ +   + K
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNK 417


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 28.4 bits (64), Expect = 2.0
 Identities = 16/25 (64%), Positives = 17/25 (68%)

Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKK 51
           KKKK KK KK +   EKKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.0 bits (63), Expect = 2.9
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKK 60
           KK+K K+ KK +   +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 27.2 bits (61), Expect = 5.9
 Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKK 56
           KK K K+ KK     EKKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 27.2 bits (61), Expect = 6.5
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKK 59
          AKKK+ KK KK  +  +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This
          family includes outer membrane proteins such as OmpH
          among others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 140

 Score = 29.5 bits (67), Expect = 2.0
 Identities = 13/47 (27%), Positives = 27/47 (57%), Gaps = 5/47 (10%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKN 62
           K ++ E +K EK+ +K ++K +K         ++KK+K+ +KK + 
Sbjct: 27 FKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQE 73



 Score = 29.5 bits (67), Expect = 2.4
 Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKKNNNNNNNNNNNNN 74
          K+ +K ++K ++  A   E  +EKK+KE  KK ++ ++K++K   +          
Sbjct: 39 KELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQ 94



 Score = 29.1 bits (66), Expect = 3.0
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
          K  +K+ EK+ KK++ + EK +KE +K K+K +K     +
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLS 56



 Score = 27.5 bits (62), Expect = 9.5
 Identities = 11/38 (28%), Positives = 24/38 (63%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
            K A+K+ +K+ +K+  + EK +++ +K K+K +K  
Sbjct: 15 AGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDA 52


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 11/66 (16%), Positives = 23/66 (34%), Gaps = 3/66 (4%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---NNNNNNNNNN 79
            K     +  K+ K   +K       K     K+    K    N +N    +++  N  +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKS 216

Query: 80  NNNNNN 85
           ++ +N 
Sbjct: 217 SSKDNQ 222



 Score = 29.8 bits (67), Expect = 4.2
 Identities = 9/65 (13%), Positives = 24/65 (36%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           K +   +  K  K + +K       K     K+         N +N+   +++  N  ++
Sbjct: 158 KNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSS 217

Query: 85  NNEEK 89
           + + +
Sbjct: 218 SKDNQ 222


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 6   KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK-----KKKKKK 60
           K Q+           +   +++++++E K + KEKK EKK KE ++  +      KK+ +
Sbjct: 79  KLQVEKRALEVWGSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGR 138

Query: 61  K 61
            
Sbjct: 139 V 139


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 30.9 bits (69), Expect = 2.0
 Identities = 22/67 (32%), Positives = 30/67 (44%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K   K   KK +K AKK E++ +  K+E   +  K     K    NN   N+   N  N 
Sbjct: 757 KHVAKSHLKKPKKAAKKLEQRSKANKEELANENNKLMNILKEYFGNNEQINSITYNFENI 816

Query: 81  NNNNNNE 87
           N N +NE
Sbjct: 817 NLNEDNE 823



 Score = 30.5 bits (68), Expect = 3.2
 Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 6/88 (6%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKK-----KKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
           KK KKA KK +++ K  KE+   +  K     K+   NN   N+   N  N N N +N N
Sbjct: 765 KKPKKAAKKLEQRSKANKEELANENNKLMNILKEYFGNNEQINSITYNFENINLNEDNEN 824

Query: 85  NNEEKGTFLYKMAAPLV-LVLTYVAEIT 111
            +++     +K    +   +++Y+ +I+
Sbjct: 825 GSKKILDLNHKDQKEIFEEIISYIVDIS 852



 Score = 29.4 bits (65), Expect = 6.9
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           E K  E +A K  + + KK+ EK KKKK K++
Sbjct: 310 ELKSLEHRAAKAAEAEMKKRAEKPKKKKSKRR 341


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 9/42 (21%), Positives = 22/42 (52%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
              ++   E+K      ++++KKKE+ K K++K++  +    
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKR 378



 Score = 29.7 bits (67), Expect = 4.2
 Identities = 13/49 (26%), Positives = 28/49 (57%)

Query: 11  TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
            F+ P   +    +K +    ++K KKKE+ K K++K++  + +K+ K+
Sbjct: 333 RFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381



 Score = 29.7 bits (67), Expect = 4.5
 Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 24  KKKEKKKKEKKAKKKEKKKEK-KKKEKKKKKKKKKKKKK 61
              E+K      ++KEKKKE+ K KE+K++  + +K+ K
Sbjct: 342 NDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFK 380


>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2.  This family
          contains the bacterial histone H1-like nucleoprotein
          HC2 (approximately 200 residues long), which seems to
          be found mostly in Chlamydia. HC2 functions in DNA
          condensation, although it has been suggested that it
          also has other roles.
          Length = 187

 Score = 30.1 bits (67), Expect = 2.0
 Identities = 18/41 (43%), Positives = 19/41 (46%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K A +K   KK   AKK   KK  KK   KK   KK   KK
Sbjct: 42 KPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKK 82



 Score = 29.0 bits (64), Expect = 4.4
 Identities = 17/41 (41%), Positives = 18/41 (43%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K   KK    K+  AKK  KK   KK   KK   KK   KK
Sbjct: 47 KTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKK 87



 Score = 29.0 bits (64), Expect = 4.9
 Identities = 16/41 (39%), Positives = 17/41 (41%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K A KK   +K    K    KK   KK  KK   KK   KK
Sbjct: 37 KVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKK 77


>gnl|CDD|225168 COG2259, COG2259, Predicted membrane protein [Function unknown].
          Length = 142

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 10/88 (11%)

Query: 242 SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRP 301
            +F +   G +       +  FES+  P  P    +L  + EL G LL  +L+     R 
Sbjct: 25  LIFIVSGLGKLFGGLAGTIQYFESLGLPP-PTLLAILAAILELVGGLL--LLLGL-FTRL 80

Query: 302 LALISTGGSAACFIVVAVYAQFHLSYGW 329
            AL+      A F++VA++A  H  +G+
Sbjct: 81  AALV-----LAVFMLVAIFA-VHAFWGF 102


>gnl|CDD|240414 PTZ00425, PTZ00425, asparagine-tRNA ligase; Provisional.
          Length = 586

 Score = 30.8 bits (69), Expect = 2.2
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
            KK KK +K+ +  N  N NNNN N++++N  ++      +L
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSNAVSSPAYPDQYL 314



 Score = 30.4 bits (68), Expect = 3.2
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 51  KKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
            KK KK +K+++  N  N NNNN N++++N
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSN 302



 Score = 30.0 bits (67), Expect = 3.6
 Identities = 15/46 (32%), Positives = 26/46 (56%)

Query: 50  KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
            KK KK +K++   N  N NNNN N++++N  ++    ++    YK
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSNAVSSPAYPDQYLIDYK 318



 Score = 29.6 bits (66), Expect = 5.3
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 49  KKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
            KK KK +K++   N  N NNNN N++++N
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSN 302


>gnl|CDD|205184 pfam13003, MRL1, Ribosomal protein L1.  This family includes
           putative ribosomal L1 and L10 proteins and fragments.
          Length = 105

 Score = 29.2 bits (64), Expect = 2.2
 Identities = 20/88 (22%), Positives = 32/88 (36%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
            KKK K   + K   K  K +KKK +  +K K     +        N     NN   + N
Sbjct: 12  YKKKVKTNDKTKKGTKTVKNDKKKVDNIEKIKSYSYMENEKRLYPRNIYEIENNVYLDLN 71

Query: 83  NNNNEEKGTFLYKMAAPLVLVLTYVAEI 110
            +    K   +   A+ ++     + EI
Sbjct: 72  LDMALGKKKKVEPFASKILGPKPLLPEI 99


>gnl|CDD|227497 COG5169, HSF1, Heat shock transcription factor [Transcription].
          Length = 282

 Score = 30.5 bits (69), Expect = 2.2
 Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 7/76 (9%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
            + K   K  ++        E   +  +    +  KK K+KK  +N  ++NN + +   N
Sbjct: 72  GFHKVSHKSGQRSYYNENVWEFGNKNFQLGMIELLKKIKRKKAPSNRVDSNNESKDAMMN 131

Query: 79  ---NN----NNNNNNE 87
               N     +   N 
Sbjct: 132 IEVENIILPQSELYNS 147


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 30.8 bits (69), Expect = 2.3
 Identities = 9/44 (20%), Positives = 22/44 (50%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K+ KKK  ++  +  + K+K  E++  ++     +KK+     +
Sbjct: 645 KRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEKKRHGGVPD 688


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 30.6 bits (69), Expect = 2.3
 Identities = 11/42 (26%), Positives = 31/42 (73%), Gaps = 1/42 (2%)

Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          +D++  + +EK ++E++ ++K++ K K ++E+K K++  K++
Sbjct: 1  MDFELRRAREKLEREQR-ERKQRAKLKLERERKAKEEAAKQR 41


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
          includes archaeal L12p, the protein that is
          functionally equivalent to L7/L12 in bacteria and the
          P1 and P2 proteins in eukaryotes. L12p is homologous to
          P1 and P2 but is not homologous to bacterial L7/L12. It
          is located in the L12 stalk, with proteins L10, L11,
          and 23S rRNA. L12p is the only protein in the ribosome
          to occur as multimers, always appearing as sets of
          dimers. Recent data indicate that most archaeal species
          contain six copies of L12p (three homodimers), while
          eukaryotes have four copies (two heterodimers), and
          bacteria may have four or six copies (two or three
          homodimers), depending on the species. The organization
          of proteins within the stalk has been characterized
          primarily in bacteria, where L7/L12 forms either two or
          three homodimers and each homodimer binds to the
          extended C-terminal helix of L10. L7/L12 is attached to
          the ribosome through L10 and is the only ribosomal
          protein that does not directly interact with rRNA.
          Archaeal L12p is believed to function in a similar
          fashion. However, hybrid ribosomes containing the large
          subunit from E. coli with an archaeal stalk are able to
          bind archaeal and eukaryotic elongation factors but not
          bacterial elongation factors. In several mesophilic and
          thermophilic archaeal species, the binding of 23S rRNA
          to protein L11 and to the L10/L12p pentameric complex
          was found to be temperature-dependent and cooperative.
          Length = 106

 Score = 29.0 bits (65), Expect = 2.3
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKE 48
          A  +EK +++++ KKKE++KE++++E
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 28.6 bits (64), Expect = 2.6
 Identities = 9/21 (42%), Positives = 17/21 (80%)

Query: 32 EKKAKKKEKKKEKKKKEKKKK 52
            + K +EK++EKKK+E+K++
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEE 93



 Score = 27.8 bits (62), Expect = 6.3
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 35 AKKKEKKKEKKKKEKKKKKKKKK 57
          A   E+K E+K++EKKK+++K++
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEE 93


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
          structure and biogenesis].
          Length = 67

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 21 KKAKKKEKKKKEKKAK---KKEKKKEKKKKEKKKKKKKKKKKKKN 62
          K+  +KE   +E K +   +K  +K K+KK   +K+K K+ +K+ 
Sbjct: 20 KRKVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQ 64


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 30.6 bits (70), Expect = 2.4
 Identities = 5/78 (6%), Positives = 38/78 (48%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
              T      A++ E+++++++ ++ +++  + + E+ ++ +++++ +  ++ +  +   
Sbjct: 151 QPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRRE 210

Query: 73  NNNNNNNNNNNNNNEEKG 90
             +        +  + +G
Sbjct: 211 QGDRREERGRRDGGDRRG 228



 Score = 29.5 bits (67), Expect = 5.8
 Identities = 1/30 (3%), Positives = 8/30 (26%)

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           + +        +  +        +  + +G
Sbjct: 211 QGDRREERGRRDGGDRRGRRRRRDRRDARG 240



 Score = 29.5 bits (67), Expect = 6.0
 Identities = 0/35 (0%), Positives = 9/35 (25%)

Query: 56  KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           + +   + +  +     +        +  +   + 
Sbjct: 196 RGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRR 230



 Score = 29.5 bits (67), Expect = 6.1
 Identities = 3/70 (4%), Positives = 33/70 (47%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           ++ +++E+ ++ ++  +++++ E ++ E+ +++++ +     +  +     +        
Sbjct: 162 ERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRR 221

Query: 81  NNNNNNEEKG 90
           +  +    + 
Sbjct: 222 DGGDRRGRRR 231



 Score = 29.1 bits (66), Expect = 6.5
 Identities = 0/28 (0%), Positives = 6/28 (21%)

Query: 61  KNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           +       +  +        +  +   +
Sbjct: 214 RREERGRRDGGDRRGRRRRRDRRDARGD 241



 Score = 29.1 bits (66), Expect = 8.3
 Identities = 1/46 (2%), Positives = 18/46 (39%)

Query: 42  KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
           +E++ ++   + ++ ++++ +        +  +        +  + 
Sbjct: 193 REERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDA 238



 Score = 28.7 bits (65), Expect = 9.9
 Identities = 2/44 (4%), Positives = 14/44 (31%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           E+ ++E++ +    + ++      +        +  +       
Sbjct: 189 ERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRR 232


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 30.2 bits (68), Expect = 2.5
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K  K+  K  EK+  + +K++ ++ K +  KKKKK KK+K
Sbjct: 38 KTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEK 77



 Score = 30.2 bits (68), Expect = 2.9
 Identities = 16/42 (38%), Positives = 27/42 (64%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          D K  K+  K  ++++ + K+++ E+ K E  KKKKK KK+K
Sbjct: 36 DGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEK 77


>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
          Length = 175

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           K  K KEKK +++   ++ K K  + K K KK
Sbjct: 144 KNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175



 Score = 29.7 bits (67), Expect = 2.7
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           K EK K++K   + E ++ K K  + K+K KK
Sbjct: 144 KNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175



 Score = 28.5 bits (64), Expect = 6.4
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K K  K KEKK + + + ++ K K  + K+K
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRK 172



 Score = 28.1 bits (63), Expect = 8.4
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K K +K K+ K +++ + ++ K K  + K+K KK
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175



 Score = 28.1 bits (63), Expect = 9.1
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 28  KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K K EK  +KK + + + ++ K K  + K+K KK
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175


>gnl|CDD|216256 pfam01032, FecCD, FecCD transport family.  This is a sub-family of
           bacterial binding protein-dependent transport systems
           family. This Pfam entry contains the inner components of
           this multicomponent transport system.
          Length = 311

 Score = 30.2 bits (69), Expect = 2.5
 Identities = 33/155 (21%), Positives = 51/155 (32%), Gaps = 29/155 (18%)

Query: 278 LLGVAELGGALLCVVLIHYTGKRPLALISTGG--SAACFIVVAVYAQFHLSYGWDSPLVP 335
            L V+   GALL  V      + PLA  S  G  S A    V     F     +  PL  
Sbjct: 55  ALAVS---GALLQTVT-----RNPLADPSILGVSSGAALGAVLALLLFGGLSLFGLPLAA 106

Query: 336 TVFLVLAAFLTHICIRLL----PWMLI------GEVFPNNIRATASGASGSSSYIFAFAV 385
            +  +LAA L     R      P  LI        +F       ++  S         A+
Sbjct: 107 FIGALLAALLVLFLARRRGGLSPLRLILAGIAVSALF-------SALTSLLLLLANPEAL 159

Query: 386 NKLYYPMLDTFHL--WGTLYFYAAISVVGTLYMYF 418
             + + +L +     W  +     + ++G L +  
Sbjct: 160 QSILFWLLGSLSGASWSDVLILLPLLLIGLLLLLL 194


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 30.4 bits (69), Expect = 2.5
 Identities = 12/57 (21%), Positives = 23/57 (40%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
           DW + K K   +  K+ +K+++    KK+    K+K+   K  N       +     
Sbjct: 341 DWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQA 397


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
          consist of three concentric proteinaceous capsid
          layers. The innermost capsid (core) is made of VP2. The
          genomic RNA and the two minor proteins VP1 and VP3 are
          encapsidated within this layer. The N-terminus of
          rotavirus VP2 is necessary for the encapsidation of VP1
          and VP3.
          Length = 887

 Score = 30.7 bits (69), Expect = 2.5
 Identities = 14/76 (18%), Positives = 30/76 (39%), Gaps = 3/76 (3%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN--- 77
           + +         + K  EK+ +K + E K+K   KK++   +N ++     ++  N   
Sbjct: 5  NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64

Query: 78 NNNNNNNNNEEKGTFL 93
           +    +  EE    L
Sbjct: 65 ADEVKKSTKEESKQLL 80



 Score = 29.1 bits (65), Expect = 7.5
 Identities = 12/66 (18%), Positives = 29/66 (43%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          + + +       + +EK  EK+ ++ + + K+K   KK     +N ++     ++  N  
Sbjct: 4  RNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLK 63

Query: 84 NNNEEK 89
            +E K
Sbjct: 64 IADEVK 69


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
          region.  Members of this family have very varied
          localisations within the eukaryotic cell. pinin is
          known to localise at the desmosomes and is implicated
          in anchoring intermediate filaments to the desmosomal
          plaque. SDK2/3 is a dynamically localised nuclear
          protein thought to be involved in modulation of
          alternative pre-mRNA splicing. memA is a tumour marker
          preferentially expressed in human melanoma cell lines.
          A common feature of the members of this family is that
          they may all participate in regulating protein-protein
          interactions.
          Length = 131

 Score = 29.4 bits (66), Expect = 2.6
 Identities = 9/43 (20%), Positives = 31/43 (72%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          +  +   KEK++ E + K +E++K+++++ +K+K++  +++++
Sbjct: 26 EESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRR 68



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 9/39 (23%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 22 KAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKK 58
          K  ++E +   K+ ++   E+K E+++K+++++ +K+K+
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKR 60


>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
          Length = 620

 Score = 30.7 bits (70), Expect = 2.7
 Identities = 14/37 (37%), Positives = 17/37 (45%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
            K     KK K+K  KK           KK+KKK+KK
Sbjct: 584 PKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 29.9 bits (68), Expect = 4.2
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
              K     K+ K K  +K           KK+KKK+KK
Sbjct: 582 NGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
          (DUF2040).  This entry is a conserved domain of
          approximately 130 residues of proteins conserved from
          fungi to humans. The proteins do contain a coiled-coil
          domain, but the function is unknown.
          Length = 128

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 13/50 (26%), Positives = 28/50 (56%), Gaps = 7/50 (14%)

Query: 21 KKAKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKKKKKKKNN 63
           K +KK+ K  E+K  +K K         E++K+E++  +++K +K++  
Sbjct: 30 AKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREK 79



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 6/38 (15%), Positives = 23/38 (60%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K  +++KK+ K  ++++ ++ K      +  +++K+++
Sbjct: 28 KAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKRER 65


>gnl|CDD|200352 TIGR04101, CCGSCS, CCGSCS motif protein.  This protein family,
          with average protein length about 58 residues, occurs
          in several marine bacteria, such as Shewanella benthica
          KT99, Marinobacter sp. ELB17, and Photobacterium
          profundum 3TCK. The striking feature is a C-terminal
          motif CCGSCS, which (perhaps coincidentally) resembles
          conserved core motif [LC]CGSC shared by two
          methanobactin precursors (see TIGR04071). There is no
          detectable conserved gene region for these proteins.
          Length = 59

 Score = 27.6 bits (61), Expect = 2.7
 Identities = 9/36 (25%), Positives = 22/36 (61%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
          KK+  K+ K + +  ++ ++ +E+   + ++KKK K
Sbjct: 12 KKSAAKQVKLELEAIQQAQQSEEQVAADIEEKKKPK 47


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 30.0 bits (67), Expect = 2.7
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 13  TEPTPLDWKKAKKKEK--KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           TE      K+AK+K+K   K+  + +   K  E   K  K  K K K K+K
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEK 223



 Score = 28.5 bits (63), Expect = 9.8
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +A+K+  +KK +KA+  E +K   + EK   K K+ K+K
Sbjct: 148 EARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQK 186


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 30.6 bits (69), Expect = 2.7
 Identities = 13/53 (24%), Positives = 27/53 (50%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
           + +    K +    +KK  +  K+++ KK+E  +K+  KK +K   N+   +N
Sbjct: 406 VHFDTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDN 458



 Score = 29.0 bits (65), Expect = 7.1
 Identities = 15/68 (22%), Positives = 25/68 (36%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
              K E KK E   K ++K     +    KK    ++  KN+ + +         +    
Sbjct: 364 NPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKI 423

Query: 82  NNNNNEEK 89
           N NN E+ 
Sbjct: 424 NQNNKEQD 431


>gnl|CDD|224194 COG1275, TehA, Tellurite resistance protein and related permeases
           [Inorganic ion transport and metabolism].
          Length = 329

 Score = 30.4 bits (69), Expect = 2.8
 Identities = 25/164 (15%), Positives = 40/164 (24%), Gaps = 34/164 (20%)

Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSY-----G 328
           +     G       L  +VL        L   +       FI +A      +        
Sbjct: 169 YVFFGAGFF-SYLVLFPIVLYRLFTAGLLPPAARPTI---FIQLAPIGLVGVGLLLIVNS 224

Query: 329 WDSPLVPTVFLVLAAFLTHICIRLLPWMLIGE---------VFPNNIRATASGASGSSSY 379
             S     +            + LL  +L+            FP  I AT++   G S  
Sbjct: 225 GPSLTFVLILWGFGLLFLFFALLLLLRVLLRLPFSPSWWAFTFPLVILATSALELGKSI- 283

Query: 380 IFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTL---YMYFVM 420
                        +  FH  G +     I +   L   Y    +
Sbjct: 284 ------------GIGVFHYLGLILGTFLIFIWIVLLVAYTIRAI 315


>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
           box-containing transcription factors [Transcription].
          Length = 412

 Score = 30.4 bits (68), Expect = 2.8
 Identities = 8/27 (29%), Positives = 12/27 (44%)

Query: 59  KKKNNNNNNNNNNNNNNNNNNNNNNNN 85
             KN  N + N +  NN +  N  + N
Sbjct: 308 PPKNLPNGSPNQSPLNNLSRGNPASPN 334



 Score = 30.0 bits (67), Expect = 3.6
 Identities = 7/27 (25%), Positives = 11/27 (40%)

Query: 60  KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
              N  N + N +  NN +  N  + N
Sbjct: 308 PPKNLPNGSPNQSPLNNLSRGNPASPN 334


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 29.1 bits (66), Expect = 2.8
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
           KK + K++ K EKKA++ +   E    E  KK+
Sbjct: 79  KKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111



 Score = 28.7 bits (65), Expect = 3.3
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +KK + K+  K EKK E+ K   +    +  KK+
Sbjct: 78  QKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 30.2 bits (68), Expect = 2.8
 Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 2/41 (4%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
           G F        KKA+   K+K ++K    + KK  KK + +
Sbjct: 343 GAFDVDA--YPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381



 Score = 29.8 bits (67), Expect = 3.7
 Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
               KK E   K+K  +K    K KK  KK + +
Sbjct: 348 DAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381



 Score = 28.7 bits (64), Expect = 9.3
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KK E  +++K   K    K KK  KKN 
Sbjct: 352 KKAEAARKRKGDRKGVSHKAKKGGKKNQ 379



 Score = 28.7 bits (64), Expect = 9.4
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
                KK +   K+K ++K    K KK  K N 
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQ 379


>gnl|CDD|223912 COG0842, COG0842, ABC-type multidrug transport system, permease
           component [Defense mechanisms].
          Length = 286

 Score = 30.3 bits (68), Expect = 2.8
 Identities = 36/179 (20%), Positives = 61/179 (34%), Gaps = 28/179 (15%)

Query: 232 TFLLPYAIVTSLFFIGQFGGMTTLQTYAV-GIFESIH-APLDPYFATLL-----LGVAEL 284
            F++P  I+ S+ F G F   + L      G  E +  +P+   F  L      L VA L
Sbjct: 88  AFIVPGVILMSVLFSGIFSFSSALFREREFGTLERLLVSPVSRLFILLGKIVPYLVVASL 147

Query: 285 GGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYA------QFHLSYGWDSPLVPTVF 338
              L+ +V+    G   L  +         +++A  A       F  S    +  V  + 
Sbjct: 148 IAGLVLLVIAFLLGVPFLGSLLLLLLLLLLLLLATVALGLLLSTFAKSQLQCASAVGNLL 207

Query: 339 LVLAAFLTHIC-----IRLLPWML--IGEVFPNN-----IRATASGASGSSSYIFAFAV 385
           ++   FL         + LLP  L  I  + P       +R    G   +     +  +
Sbjct: 208 ILPLGFL---SGVFFPLELLPAWLQGISYINPLTYAIDALRYVYLGGWRNDGIWISLLI 263


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 29.7 bits (66), Expect = 2.9
 Identities = 12/47 (25%), Positives = 34/47 (72%)

Query: 14  EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           E    + +  +++E++ +EK+ K K K +E++K+E++++++ +K+K+
Sbjct: 67  ERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKE 113


>gnl|CDD|214945 smart00970, s48_45, Sexual stage antigen s48/45 domain.  This
          family contains sexual stage s48/45 antigens from
          Plasmodium (approximately 450 residues long). These are
          surface proteins expressed by Plasmodium male and
          female gametes that have been shown to play a conserved
          and important role in fertilisation.
          Length = 116

 Score = 29.0 bits (65), Expect = 2.9
 Identities = 6/36 (16%), Positives = 14/36 (38%)

Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
           +   NN + +  +N++N  N      +   +   F
Sbjct: 9  NEPSTNNLSYSFLSNDDNVENIECTIKSKPNDIIGF 44


>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
          Length = 447

 Score = 30.4 bits (68), Expect = 2.9
 Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 5/60 (8%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN-----NNNNNNNNNNNNN 84
           + E + K +E+ +E+  KEK+K+         NNN  + N++         +  NNN   
Sbjct: 203 RSELRDKARERARERTAKEKEKEDHNHAAHHNNNNPISQNSSFTELLTGGIDPVNNNRQW 262


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score = 30.5 bits (69), Expect = 2.9
 Identities = 15/128 (11%), Positives = 33/128 (25%), Gaps = 3/128 (2%)

Query: 418 FVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAARVEGQEEVKGEDNFGMEGEDG 477
           FV  E E + L+ IE+    K  + V       +     +       +  D      +  
Sbjct: 376 FVTEEEEVKKLKRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLK 435

Query: 478 KYISRVVGTLYMYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAKVEGQEEV 537
                ++  L + F + +      ++I           +           A  +E   + 
Sbjct: 436 SSKKALLRGLGVRFTLSKLLANLGKEI---PGAGDAVTIDPELERRSPNSADDIEYILKG 492

Query: 538 KGEDNFGM 545
                   
Sbjct: 493 LSYRAEER 500


>gnl|CDD|192333 pfam09690, PYST-C1, Plasmodium yoelii subtelomeric region
          (PYST-C1).  This group of sequences are defined by the
          N-terminal domain of a paralogous family of Plasmodium
          yoelii genes preferentially located in the subtelomeric
          regions of the chromosomes. There are no obvious
          homologues to these genes in any other organism. The
          C-terminal portions of the genes that contain this
          domain are divergent and some contain other
          yoelii-specific paralogous domains such as PYST-C2
          (IPR006491).
          Length = 57

 Score = 27.4 bits (61), Expect = 3.0
 Identities = 11/44 (25%), Positives = 24/44 (54%)

Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          KK  +  +K   + K+    NNNNN +  ++ +++ + +N + K
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKNK 57



 Score = 27.0 bits (60), Expect = 3.6
 Identities = 11/43 (25%), Positives = 24/43 (55%)

Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
          KK  +  EK   + K++ +  NNNN +  ++ +++ + +N  N
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKN 56



 Score = 27.0 bits (60), Expect = 4.3
 Identities = 10/43 (23%), Positives = 25/43 (58%)

Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
          K+  +  +K   + K++ + NNNNN +  ++ +++ + +N  N
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKN 56


>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
          (DUF1713).  This domain is found at the C terminal end
          of mitochondrial proteins of unknown function.
          Length = 34

 Score = 26.5 bits (59), Expect = 3.0
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
           +K ++K KK K KK +K+ +  ++K
Sbjct: 6  LRKRRRKMKKHKYKKLRKRTRALRRK 31


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This
          is a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 27.2 bits (61), Expect = 3.0
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
          AK K  +  ++K KK ++  E+  K   
Sbjct: 27 AKMKFPQFDKQKKKKAKRNAEQDAKNSA 54


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
              KK+++KK++ K +KK+   EK   E+   K  K
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPK 195



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 7/46 (15%)

Query: 20  WKKAKKKEK-------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           W+ A+K           K E   KK+++KKE+ K +KK+   +K  
Sbjct: 140 WEAAEKGGDVHLQANPTKLELLRKKEKEKKEQLKIQKKQSLLEKYG 185



 Score = 29.3 bits (66), Expect = 5.0
 Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 27  EKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKKNN 63
           E  +K++K KK++ K +KK+   EK   ++   K  K  
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKEL 197


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
          archaebacterial protein family has no known function.
          Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 29.5 bits (67), Expect = 3.1
 Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 10/68 (14%)

Query: 6  KHQIGTFTEPTPLDWKKAKKKEKKK---------KEKKAKKKEKKKEKKKKE-KKKKKKK 55
          +  +    +   +D K  K ++++K         ++   K   KK EK+++E  + +K+ 
Sbjct: 26 RGLLSELLQWLLIDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEM 85

Query: 56 KKKKKKNN 63
               K N
Sbjct: 86 MMDMMKPN 93


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 28.8 bits (65), Expect = 3.1
 Identities = 11/38 (28%), Positives = 28/38 (73%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           K+ E+ +++ +   +E+KK +K  E++KKK++K++ ++
Sbjct: 73  KQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRR 110



 Score = 27.6 bits (62), Expect = 9.0
 Identities = 10/43 (23%), Positives = 30/43 (69%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           L+  + + ++ +++ ++A ++ KK EK  + +KKK++K++ ++
Sbjct: 68  LEQAEKQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRR 110


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           +E+K++E+    +E  +  KKKE +KKKK+K
Sbjct: 149 EEQKEEEEAKLAEEALEALKKKEAEKKKKEK 179



 Score = 28.8 bits (65), Expect = 5.4
 Identities = 12/44 (27%), Positives = 30/44 (68%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           + + ++++ ++ +E+K +++ K  E+  +  KKK+ +KKKK+K 
Sbjct: 137 EDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180


>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit.  This
          family represents the eukaryotic vacuolar (H+)-ATPase
          (V-ATPase) G subunit. V-ATPases generate an acidic
          environment in several intracellular compartments.
          Correspondingly, they are found as membrane-attached
          proteins in several organelles. They are also found in
          the plasma membranes of some specialised cells.
          V-ATPases consist of peripheral (V1) and membrane
          integral (V0) heteromultimeric complexes. The G subunit
          is part of the V1 subunit, but is also thought to be
          strongly attached to the V0 complex. It may be involved
          in the coupling of ATP degradation to H+ translocation.
          Length = 105

 Score = 28.3 bits (64), Expect = 3.2
 Identities = 9/37 (24%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 25 KKEKKKKEKKAK---KKEKKKEKKKKEKKKKKKKKKK 58
          +K + K+ K+AK   +KE ++ + ++E + K+ + + 
Sbjct: 24 RKRRAKRLKQAKEEAEKEIEEYRAQREAEFKEFEAEH 60


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 30.3 bits (69), Expect = 3.2
 Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 8/47 (17%)

Query: 22  KAKKKEKKKKE--------KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +A ++EKKK E        ++A+ + +K  ++ + KK  + +  K K
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDK 485



 Score = 29.1 bits (66), Expect = 6.8
 Identities = 9/49 (18%), Positives = 27/49 (55%), Gaps = 7/49 (14%)

Query: 22  KAKKKEKKKKEKKAKK-------KEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KA+ +  ++++KKA++       ++ + E++K  ++ + KK  + +   
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483


>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
           (FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
           module that binds and bends membranes.  F-BAR domains
           are dimerization modules that bind and bend membranes
           and are found in proteins involved in membrane dynamics
           and actin reorganization. F-BAR domain containing
           proteins, also known as Pombe Cdc15 homology (PCH)
           family proteins, include Fes and Fer tyrosine kinases,
           PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
           srGAPs. Many members also contain an SH3 domain and play
           roles in endocytosis. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           These tubules have diameters larger than those observed
           with N-BARs. The F-BAR domains of some members such as
           NOSTRIN and Rgd1 are important for the subcellular
           localization of the protein.
          Length = 191

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 9/42 (21%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 25  KKEKKKKEKKAKK--KEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K++K++  KK     ++ KK+ ++   K  KK  ++ ++   
Sbjct: 92  KEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVE 133



 Score = 28.1 bits (63), Expect = 8.9
 Identities = 10/45 (22%), Positives = 26/45 (57%)

Query: 18  LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
            + K+  +K++  + +K KKK ++   K  +K  ++ +++ +K N
Sbjct: 92  KEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLN 136


>gnl|CDD|222932 PHA02809, PHA02809, hypothetical protein; Provisional.
          Length = 111

 Score = 28.5 bits (63), Expect = 3.4
 Identities = 10/61 (16%), Positives = 19/61 (31%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
           K  K K         +   K  K    +   +        +N     ++ ++  NN+EK 
Sbjct: 42  KPNKEKNNFLDNGYDESFIKPNKVSYDETNIDKGQKIEQTDNEKQLLSDFSDILNNDEKN 101

Query: 91  T 91
            
Sbjct: 102 H 102


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 29.9 bits (67), Expect = 3.5
 Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
           ++A K++  ++E K K +E+K+E  K  E++ KK+ + KK+      N ++ + ++ N  
Sbjct: 55  REAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEE 114


>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
          structure and biogenesis].
          Length = 148

 Score = 29.1 bits (66), Expect = 3.5
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            A K   K  E +  K EKK  ++  E +  K+K + K 
Sbjct: 37 VPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEGKT 76


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 30.2 bits (68), Expect = 3.5
 Identities = 13/64 (20%), Positives = 23/64 (35%)

Query: 27   EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
              +  EK     E + +  K   +     K K K +  +N +      +++ N      N
Sbjct: 1006 GIEAPEKTPPPPEHEPQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGETPKGN 1065

Query: 87   EEKG 90
            EEK 
Sbjct: 1066 EEKK 1069


>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
           component [Carbohydrate transport and metabolism].
          Length = 472

 Score = 30.0 bits (68), Expect = 3.6
 Identities = 10/42 (23%), Positives = 17/42 (40%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           K  K + E   +    K ++ K E K +        KN+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377



 Score = 29.6 bits (67), Expect = 5.2
 Identities = 7/42 (16%), Positives = 17/42 (40%)

Query: 30  KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
           K  K   + + +    K ++ K + K +      +  N+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377



 Score = 29.2 bits (66), Expect = 6.4
 Identities = 8/42 (19%), Positives = 18/42 (42%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
           K  K + ++  +    K K+ K + K +      +  N+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377


>gnl|CDD|227379 COG5046, MAF1, Protein involved in Mod5 protein sorting
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 282

 Score = 29.6 bits (66), Expect = 3.6
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
          ++K++    + N+NNN NN+N N++NNNN +  N     TF
Sbjct: 15 EQKRRISFSEYNSNNNTNNSNGNSSNNNNYSGPNGSSPATF 55


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 29.7 bits (67), Expect = 3.6
 Identities = 10/46 (21%), Positives = 22/46 (47%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
           +     +K K  +K  K+    KK+K+K+      N  ++N +++ 
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFNMVSDNTDDSQ 264



 Score = 29.3 bits (66), Expect = 5.3
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
               +K+K  K   K+    KK+K+K+ 
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 28.5 bits (64), Expect = 7.9
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 40  KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           K+K  KK  KK    KK+K+K++     N  ++N +++ 
Sbjct: 226 KRKAPKKVAKKVAAAKKRKQKRSPPPPFNMVSDNTDDSQ 264


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 29.8 bits (67), Expect = 3.6
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 34 KAKKKEKKKEKKKKEKKKKKKKKK 57
          +  K E K  K K+ KK+KK+++K
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEK 45


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 28.7 bits (65), Expect = 3.7
 Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 6/41 (14%)

Query: 27  EKKKKEKKAKKKEKKKEKKKK---EKKKKKKK---KKKKKK 61
           E+   E+K  KK+K+ E  KK   + + K KK   KK +K 
Sbjct: 64  ERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKS 104


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This
          is a family of proteins conserved in fungi. The
          function is not known.
          Length = 436

 Score = 29.8 bits (67), Expect = 3.7
 Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 9/55 (16%)

Query: 23 AKKKEKKKKEKKAKKKEKKKEKK---------KKEKKKKKKKKKKKKKNNNNNNN 68
           K   KK K+  A K   + ++          + EK K  KK K+  +     N 
Sbjct: 11 EKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65


>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
          Length = 382

 Score = 29.7 bits (67), Expect = 3.7
 Identities = 20/69 (28%), Positives = 27/69 (39%), Gaps = 4/69 (5%)

Query: 5   TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKK----KEKKKEKKKKEKKKKKKKKKKKK 60
           TK QI +      +D KK K K+K KK ++            E  KK   K KK  +K  
Sbjct: 88  TKKQISSEENTMKMDKKKYKLKKKIKKVREINDTIRFAHYDYETIKKGYIKFKKMIRKDI 147

Query: 61  KNNNNNNNN 69
             +  N   
Sbjct: 148 ATSIPNICQ 156


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 29.6 bits (67), Expect = 3.8
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 24  KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K +K+++ KK  K E++     K   K K+ KKK  
Sbjct: 268 GKIDKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304


>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462).  This
          protein is highly conserved, but its function is
          unknown. It can be isolated from HeLa cell nucleoli and
          is found to be homologous with Leydig cell tumour
          protein whose function is unknown.
          Length = 82

 Score = 27.8 bits (62), Expect = 3.8
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K     +  KK++K  KK   +  K K+ +  + +K KK 
Sbjct: 11 KGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 16/47 (34%), Positives = 21/47 (44%), Gaps = 5/47 (10%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKN 62
             K K K     KA KK++K  KK      K KK +  + +K KK 
Sbjct: 4  GALKVKAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50


>gnl|CDD|225180 COG2271, UhpC, Sugar phosphate permease [Carbohydrate transport and
           metabolism].
          Length = 448

 Score = 29.9 bits (68), Expect = 3.8
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 5/48 (10%)

Query: 394 DTFHLWGTLYFYAAISVVGTLYMYFVM---PETEGRTLRDIEEHFADK 438
                    YF   I+++  L + F++   P++EG  L  IEE+  D 
Sbjct: 180 FHGGWRAAFYFPGIIAIIVALILLFLLRDRPQSEG--LPPIEEYRGDP 225


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 27.9 bits (63), Expect = 3.9
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKK 61
          K+E++K+KKK  KK + ++ KKK  
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93



 Score = 27.6 bits (62), Expect = 5.3
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKK 47
          K+E++K +KK  KK + +E KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKK 91



 Score = 27.6 bits (62), Expect = 5.7
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK 45
          K+ ++K+KKK  KKA+ +E KK+  
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
          helical domain is found in a set of bacterial plasmid
          replication proteins. The domain is found to the
          C-terminus of the primase/polymerase domain. Mutants of
          this domain are defective in template binding,
          dinucleotide formation and conformation change prior to
          DNA extension.
          Length = 135

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 17/52 (32%), Positives = 28/52 (53%)

Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
          E +  D++K K++  K    K K  E  +E+  K+ KK  ++ K+KKK   N
Sbjct: 7  EDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILN 58


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
           protein; Provisional.
          Length = 230

 Score = 29.6 bits (66), Expect = 4.0
 Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 2/74 (2%)

Query: 7   HQIGTFTEPTPLDWK--KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            Q   F +   L  K     +     K KK    E  +EK   + ++K   + +K  +  
Sbjct: 101 LQPEVFVQYVELAKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPE 160

Query: 65  NNNNNNNNNNNNNN 78
            +   +++      
Sbjct: 161 LSEEKSDSELETQP 174



 Score = 28.4 bits (63), Expect = 8.2
 Identities = 14/82 (17%), Positives = 23/82 (28%)

Query: 38  KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
           K+   + +     K KK    +       N N    +  N    +   +EEK     +  
Sbjct: 114 KKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQ 173

Query: 98  APLVLVLTYVAEITQPHLRGML 119
                +      I    L  ML
Sbjct: 174 PQKTQLKEKKPSIEHIDLSKML 195


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
          several eukaryotic synaphin 1 and 2 proteins.
          Synaphin/complexin is a cytosolic protein that
          preferentially binds to syntaxin within the SNARE
          complex. Synaphin promotes SNAREs to form precomplexes
          that oligomerise into higher order structures. A
          peptide from the central, syntaxin binding domain of
          synaphin competitively inhibits these two proteins from
          interacting and prevents SNARE complexes from
          oligomerising. It is thought that oligomerisation of
          SNARE complexes into a higher order structure creates a
          SNARE scaffold for efficient, regulated fusion of
          synaptic vesicles. Synaphin promotes neuronal
          exocytosis by promoting interaction between the
          complementary syntaxin and synaptobrevin transmembrane
          regions that reside in opposing membranes prior to
          fusion.
          Length = 139

 Score = 28.7 bits (64), Expect = 4.0
 Identities = 8/36 (22%), Positives = 24/36 (66%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          KE +  E  A++++++ ++  +E ++++K K +K +
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERKAKHRKME 58


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
          contains a C-terminal presumed domain in Sas10 which
          hash been identified as a regulator of chromatin
          silencing.
          Length = 76

 Score = 27.6 bits (62), Expect = 4.0
 Identities = 7/36 (19%), Positives = 21/36 (58%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
          +KK  +  + K++KK +K  ++ K +++  +++   
Sbjct: 23 RKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGG 58



 Score = 26.8 bits (60), Expect = 6.4
 Identities = 12/43 (27%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKK 61
          KK KK EK  K  K++++  ++E      E    K    +  K
Sbjct: 33 KKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKTNVVRSVK 75


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 29.6 bits (67), Expect = 4.0
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 13  TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
             P  L        E+       +K+E  K++KKK+ KK K
Sbjct: 334 PPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 29.6 bits (67), Expect = 4.3
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNN 66
            K+K+E  KK+KKKK KK     N
Sbjct: 355 PKRKREGDKKQKKKKSKKLKLTFN 378


>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
          variety of eukaryotic chromosomal proteins and
          transcription factors. HMGs bind to the minor groove of
          DNA and have been classified by DNA binding
          preferences. Two phylogenically distinct groups of
          Class I proteins bind DNA in a sequence specific
          fashion and contain a single HMG box. One group
          (SOX-TCF) includes transcription factors, TCF-1, -3,
          -4; and also SRY and LEF-1, which bind four-way DNA
          junctions and duplex DNA targets. The second group
          (MATA) includes fungal mating type gene products MC,
          MATA1 and Ste11. Class II and III proteins (HMGB-UBF)
          bind DNA in a non-sequence specific fashion and contain
          two or more tandem HMG boxes. Class II members include
          non-histone chromosomal proteins, HMG1 and HMG2, which
          bind to bent or distorted DNA such as four-way DNA
          junctions, synthetic DNA cruciforms, kinked
          cisplatin-modified DNA, DNA bulges, cross-overs in
          supercoiled DNA, and can cause looping of linear DNA.
          Class III members include nucleolar and mitochondrial
          transcription factors, UBF and mtTF1, which bind
          four-way DNA junctions.
          Length = 66

 Score = 27.2 bits (61), Expect = 4.0
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
          WK   ++EKKK E+KA+K +++ EK+  
Sbjct: 39 WKSLSEEEKKKYEEKAEKDKERYEKEMP 66


>gnl|CDD|233674 TIGR01992, PTS-IIBC-Tre, PTS system, trehalose-specific IIBC
           component.  This model represents the fused enzyme II B
           and C components of the trehalose-specific PTS sugar
           transporter system. Trehalose is converted to
           trehalose-6-phosphate in the process of translocation
           into the cell. These transporters lack their own IIA
           domains and instead use the glucose IIA protein (IIAglc
           or Crr). The exceptions to this rule are Staphylococci
           and Streptococci which contain their own A domain as a
           C-terminal fusion. This family is closely related to the
           sucrose transporting PTS IIBC enzymes and the B and C
           domains of each are described by subfamily-domain level
           TIGRFAMs models (TIGR00826 and TIGR00852, respectively).
           In E. coli, B. subtilis and P. fluorescens the presence
           of this gene is associated with the presence of
           trehalase which degrades T6P to glucose and glucose-6-P.
           Trehalose may also be transported (in Salmonella) via
           the mannose PTS or galactose permease systems, or (in
           Sinorhizobium, Thermococcus and Sulfolobus, for
           instance) by ABC transporters.
          Length = 462

 Score = 29.7 bits (67), Expect = 4.0
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 5/62 (8%)

Query: 334 VPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPML 393
           VP V L++  FL H  I  +  + IG    + + A  + A+     IF      LY P++
Sbjct: 263 VPPVSLLVTGFLAHAIIGPIGRL-IGNGITSGVTALFTSAAWLGGAIFGL----LYAPLV 317

Query: 394 DT 395
            T
Sbjct: 318 IT 319


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 29.6 bits (67), Expect = 4.2
 Identities = 10/39 (25%), Positives = 25/39 (64%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
           K A++  KK++E++ ++K+  KE  + E+   ++K++  
Sbjct: 332 KAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNP 370



 Score = 29.2 bits (66), Expect = 5.4
 Identities = 10/46 (21%), Positives = 25/46 (54%), Gaps = 5/46 (10%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           + KA ++  KK+E+     E+++ KK  ++  + ++   ++K  N 
Sbjct: 330 FSKAAEENIKKEEE-----EEERRKKLVKETTEYEQSSSRQKERNP 370


>gnl|CDD|236610 PRK09669, PRK09669, putative symporter YagG; Provisional.
          Length = 444

 Score = 29.7 bits (67), Expect = 4.2
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 112 QPHLRGMLSATASMTTIFGTV-SQLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHW 170
           +P L  +   T  +  +FG + S+  LG F   R+     + F IL+   ++FIP S  W
Sbjct: 262 RPDLATLFLVTGMIAGLFGALLSERLLGKFDRVRAFKWTIVAFVILSA-LIFFIPPSNVW 320

Query: 171 LI 172
           LI
Sbjct: 321 LI 322


>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1)
          is a melanoma-associated antigen (MAA).  MUM-1 belongs
          to the mutated or aberrantly expressed type of MAAs,
          along with antigens such as CDK4, beta-catenin,
          gp100-in4, p15, and N-acetylglucosaminyltransferase V. 
          It is highly expressed in several types of human
          cancers.  The PWWP domain, named for a conserved
          Pro-Trp-Trp-Pro motif, is a small domain consisting of
          100-150 amino acids. The PWWP domain is found in
          numerous proteins that are involved in cell division,
          growth and differentiation. Most PWWP-domain proteins
          seem to be nuclear, often DNA-binding, proteins that
          function as transcription factors regulating a variety
          of developmental processes.
          Length = 80

 Score = 27.4 bits (61), Expect = 4.2
 Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
          +KK  +  K+  K  +  +K+K   K K+ 
Sbjct: 43 EKKGIRVVKRWLKHFDCTEKQKLTNKAKES 72


>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
          Length = 128

 Score = 28.6 bits (65), Expect = 4.2
 Identities = 10/14 (71%), Positives = 10/14 (71%)

Query: 49 KKKKKKKKKKKKKN 62
           KKKK  KKK KKN
Sbjct: 1  AKKKKTVKKKVKKN 14


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 29.3 bits (66), Expect = 4.4
 Identities = 5/36 (13%), Positives = 20/36 (55%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
            +K K++E+++ ++++E   +  ++ + K+      
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQ 154



 Score = 28.9 bits (65), Expect = 6.1
 Identities = 9/35 (25%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 26  KEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKK 59
           K+K+++E++ +++ E   E  ++ + K+K K+++K
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQK 155


>gnl|CDD|226703 COG4252, COG4252, Predicted transmembrane sensor domain [Signal
           transduction mechanisms].
          Length = 400

 Score = 29.7 bits (67), Expect = 4.4
 Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 20/89 (22%)

Query: 266 IHAPLD---------PYFATLLLGVAELGGALLCVVLIHYTGKRPLA--LISTGGSAACF 314
           + A LD              L +    L G LL   L     +  LA  L   G     +
Sbjct: 307 LSALLDGRPLLPVWPDGAELLWIFAWSLLGGLLAWRLRS-PLRLLLAVGLALAGLLLISY 365

Query: 315 IVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
           ++           GW  PL+P +  ++ +
Sbjct: 366 LL--------FLAGWWIPLIPPLLALVGS 386


>gnl|CDD|226625 COG4145, PanF, Na+/panthothenate symporter [Coenzyme metabolism].
          Length = 473

 Score = 29.7 bits (67), Expect = 4.4
 Identities = 21/100 (21%), Positives = 39/100 (39%), Gaps = 10/100 (10%)

Query: 252 MTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG---ALLCVVLIHYTGKRPLALISTG 308
           ++ + T   G    I A   P     L   A  GG     L  +++    ++  A    G
Sbjct: 366 ISMIATLIFGALLLIAALNPPEMIIWLNLFA-FGGLEAVFLWPLVLGLYWEKANA---AG 421

Query: 309 GSAACFIVVAVY---AQFHLSYGWDSPLVPTVFLVLAAFL 345
             A+  + + +Y   A   +      P+VP++ + + AFL
Sbjct: 422 ALASMIVGLVLYVLLAFLGIKILGFHPIVPSLLVSIIAFL 461


>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675).  The
           members of this family are sequences derived from
           hypothetical plant proteins of unknown function. One
           member of this family is annotated as a putative
           RNA-binding protein, but no evidence was found to
           support this.
          Length = 283

 Score = 29.5 bits (66), Expect = 4.5
 Identities = 10/44 (22%), Positives = 26/44 (59%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
            ++E +K K+ +  ++ + K+K+ +K  N  +N ++ ++ N  N
Sbjct: 68  TEEEWRKRKEMQSLRRLEAKRKRSEKEYNGVSNGDDMDSINAAN 111


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 29.6 bits (67), Expect = 4.5
 Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 7/69 (10%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEK-------KKKEKKKKKK 54
           D    ++I    + T    KK KK  +K       KKE  K K       K  EK  K  
Sbjct: 79  DYLPTNKITYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDI 138

Query: 55  KKKKKKKNN 63
           +  K  +N 
Sbjct: 139 QDLKPGENV 147


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
          family is found in eukaryotes. It is a coiled-coil
          domain of unknwon function.
          Length = 126

 Score = 28.3 bits (64), Expect = 4.5
 Identities = 12/34 (35%), Positives = 26/34 (76%)

Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
          KE + K ++A+KK ++++K +KEK+++ K+ K +
Sbjct: 56 KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAE 89



 Score = 27.6 bits (62), Expect = 8.2
 Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-----KKKKKKKKKKNNNN 65
           +AK++  +KK ++ KK  K+KE++ KE K      K + +K ++K    
Sbjct: 59  EAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEY 107


>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein.  This family consists of
          several P30 proteins which seem to be specific to
          Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
          antigen and is known to be a transmembrane protein.
          Length = 266

 Score = 29.4 bits (65), Expect = 4.5
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN--NNNNNNNNNNNNNNEEKGTF 92
          K  E  KKEK  K K +  K  +N+  N+N+N      N +N+ N++N+  +  F
Sbjct: 25 KCSEDDKKEKVTKPKNEPTKPVDNSKTNDNSNEMVGETNLSNSINSSNSSTQNHF 79



 Score = 28.6 bits (63), Expect = 7.3
 Identities = 16/53 (30%), Positives = 27/53 (50%)

Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
          ++ KKEK  K K +  K     K N+N+N      N +N+ N++N++     G
Sbjct: 28 EDDKKEKVTKPKNEPTKPVDNSKTNDNSNEMVGETNLSNSINSSNSSTQNHFG 80


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
          protein. 
          Length = 314

 Score = 29.5 bits (66), Expect = 4.6
 Identities = 16/77 (20%), Positives = 25/77 (32%), Gaps = 3/77 (3%)

Query: 8  QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          Q+  +T  +     K  K EK  K           ++      KKKK  KK   N  +++
Sbjct: 26 QLDHYTSVSSS---KNAKSEKNLKRHSIIIPALSWKRLVASGSKKKKTSKKSTANLASSS 82

Query: 68 NNNNNNNNNNNNNNNNN 84
                  N  N   + 
Sbjct: 83 YQKIVIQLNRENLKKSV 99


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
          consists of several eukaryotic survival motor neuron
          (SMN) proteins. The Survival of Motor Neurons (SMN)
          protein, the product of the spinal muscular
          atrophy-determining gene, is part of a large
          macromolecular complex (SMN complex) that functions in
          the assembly of spliceosomal small nuclear
          ribonucleoproteins (snRNPs). The SMN complex functions
          as a specificity factor essential for the efficient
          assembly of Sm proteins on U snRNAs and likely protects
          cells from illicit, and potentially deleterious,
          non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 29.2 bits (65), Expect = 4.7
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
          K A K E     ++ +++   KK+K  KKN +    N
Sbjct: 26 KNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKKCN 62


>gnl|CDD|223265 COG0187, GyrB, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), B subunit [DNA replication,
           recombination, and repair].
          Length = 635

 Score = 29.8 bits (68), Expect = 4.8
 Identities = 9/41 (21%), Positives = 19/41 (46%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           + +   +E KK  +KA    K +E  +K ++  ++K     
Sbjct: 361 FLEENPQEAKKIVEKAIAAAKAREAARKARELTRRKSALDI 401


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 29.0 bits (65), Expect = 4.9
 Identities = 8/47 (17%), Positives = 17/47 (36%)

Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
          EK+ K+ K +  ++   K+N N   +     +            E+ 
Sbjct: 2  EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48


>gnl|CDD|165332 PHA03035, PHA03035, hypothetical protein; Provisional.
          Length = 158

 Score = 28.6 bits (63), Expect = 4.9
 Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 51  KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
           K KK  KK KKNN  NN N + N     N     N+ +
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTD 141



 Score = 27.9 bits (61), Expect = 9.4
 Identities = 14/40 (35%), Positives = 16/40 (40%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
           K KK  KK K NN  NN N + N     N     N    +
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTDDS 143



 Score = 27.9 bits (61), Expect = 10.0
 Identities = 13/45 (28%), Positives = 22/45 (48%)

Query: 34  KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           K+KK  KK +K  K          K+KKN     N+ +++++  N
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTDDSDHKQN 148


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           KK +KK  K +      +EK K+ +KK+K   K  + +++   
Sbjct: 97  KKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQE 139



 Score = 28.2 bits (63), Expect = 7.9
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 16  TPLD-WKKAKKKEKKKKEKKAKKKEKKKEK-KKKEKKKKKKKKKKKKKNN 63
            PL  + K   KE KK+ KK  K     +  ++K KK +KKKK   K   
Sbjct: 83  EPLKEYLKKDLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAE 132


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 29.1 bits (65), Expect = 5.2
 Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 11  TFTEPTPLDWKKAKKKEKKK------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            F      D K  ++ +         K    ++KEKKK   K+EK+ KKK++ K++
Sbjct: 64  VFVNSFEKDHKFIRRCKLSDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.8 bits (67), Expect = 5.2
 Identities = 13/40 (32%), Positives = 29/40 (72%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           A +KE ++K +  +K  K++EK KKE +++ ++ K++++N
Sbjct: 522 ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERN 561


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 29.3 bits (65), Expect = 5.2
 Identities = 14/55 (25%), Positives = 26/55 (47%)

Query: 6   KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           K +  +     P++ KK    +K  K  K+K+K++K   K K K+K  +     +
Sbjct: 61  KRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEKITDKFKVKRKVDRFNGVSE 115


>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein.  This protein has been
           predicted to be related to the FEN-1 endonuclease.
          Length = 426

 Score = 29.6 bits (67), Expect = 5.3
 Identities = 14/48 (29%), Positives = 20/48 (41%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
           E KKE   KE +K  K   K K+      N+  +  +      NN +N
Sbjct: 351 EPKKEISVKEIRKLCKIIDKNKRYKKVTINDIKSVISIFGYEINNTSN 398


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.4 bits (67), Expect = 5.3
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 5/40 (12%)

Query: 9   IGTFTE-----PTPLDWKKAKKKEKKKKEKKAKKKEKKKE 43
           +GT++      P  LD +K + K+KKK++K  K + +K E
Sbjct: 716 VGTYSSIFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEKIE 755


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 28.3 bits (64), Expect = 5.4
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +K E+++ EKK  K+E ++ K   E+  K      K K
Sbjct: 78  AERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDK 117


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 28.4 bits (64), Expect = 5.4
 Identities = 6/25 (24%), Positives = 8/25 (32%)

Query: 65 NNNNNNNNNNNNNNNNNNNNNNEEK 89
                 N   N  +N   N  EE+
Sbjct: 3  FLLKKMYNKKKNRKSNIKENKWEER 27


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 29.7 bits (67), Expect = 5.4
 Identities = 22/71 (30%), Positives = 28/71 (39%), Gaps = 17/71 (23%)

Query: 7   HQIGTFTEPTPL-----DWKKAKKKEK---------KKKEKKAKKKEKKKEKKKKEKKKK 52
           H+IGT   P PL     D +    +EK          + E    KK  K+ KKK      
Sbjct: 560 HKIGT---PEPLFKELKDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGG 616

Query: 53  KKKKKKKKKNN 63
           KKK+ KK    
Sbjct: 617 KKKQGKKAGGG 627



 Score = 29.0 bits (65), Expect = 8.9
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +A+  E KK  K+ KKK      KKK+ KK     K K 
Sbjct: 594 RAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKA 632


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 18/77 (23%)

Query: 5   TKHQIGTF--TEPTPLDWKKAKKKEKKK----------------KEKKAKKKEKKKEKKK 46
           T  QI  F  +E   L  ++  K+E +                  E  A +KE +++   
Sbjct: 50  TDEQIEIFRHSEIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAA 109

Query: 47  KEKKKKKKKKKKKKKNN 63
           +++K  + K++K  K+ 
Sbjct: 110 QKRKSCRDKERKSAKDP 126


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 29.5 bits (67), Expect = 5.5
 Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +KA++  ++    + ++KE  KE++   + K    K KK K+
Sbjct: 241 QKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKS 282


>gnl|CDD|232939 TIGR00360, ComEC_N-term, ComEC/Rec2-related protein.  The related
           model ComEC_Rec2 (TIGR00361) describes a set of proteins
           of ~ 700-800 residues, one each from a number of
           different species, of which most can become competent
           for natural transformation with exogenous DNA. The
           best-studied examples are ComEC from Bacillus subtilis
           and Rec-2 from Haemophilus influenzae, where the protein
           appears to form part of the DNA import structure. This
           model represents a region found in full-length
           ComEC/Rec2 and shorter homologs of unknown function from
           large number of additional bacterial species, most of
           which are not known to become competent for
           transformation (an exception is Helicobacter pylori)
           [Unknown function, General].
          Length = 171

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 12/71 (16%), Positives = 22/71 (30%), Gaps = 7/71 (9%)

Query: 134 QLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTP 193
             F  S         L L   ++ L        +P  L+S G      A+   +  +   
Sbjct: 65  LAFKLSLRKLNLIGALLLSAIVILLM-------NPVALLSFGFQLSFLATFGLVVMFPNF 117

Query: 194 DKVQTELSQIT 204
            ++   LS + 
Sbjct: 118 QQLLRPLSSLI 128


>gnl|CDD|235847 PRK06655, flgD, flagellar basal body rod modification protein;
          Reviewed.
          Length = 225

 Score = 29.1 bits (66), Expect = 5.5
 Identities = 3/32 (9%), Positives = 16/32 (50%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
          N   +++ +  +  ++++++     + +  FL
Sbjct: 5  NTVGSSSTSGLSTTSSSSSSKKGAADLQDDFL 36


>gnl|CDD|214998 smart01054, CaM_binding, Plant calmodulin-binding domain.  The
          sequences featured in this family are found repeated in
          a number of plant calmodulin-binding proteins, and are
          thought to constitute the calmodulin-binding domains..
          Binding of the proteins to calmodulin depends on the
          presence of calcium ions.. These proteins are thought
          to be involved in various processes, such as plant
          defence responses.and stolonisation or tuberization.
          Length = 115

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 9/46 (19%), Positives = 20/46 (43%)

Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            K    K + ++ +K K  +K    K+  K  + K++   ++ N 
Sbjct: 9  RGKIEDPKPEGERPRKWKFLKKVILLKRFVKSLEDKRRLNPREPNV 54


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 28.9 bits (65), Expect = 5.6
 Identities = 11/51 (21%), Positives = 23/51 (45%)

Query: 32  EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
           +KK  +++K+ +K  +EK+   + K +      N      N+ N+     N
Sbjct: 146 DKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKN 196



 Score = 28.5 bits (64), Expect = 8.7
 Identities = 8/47 (17%), Positives = 19/47 (40%)

Query: 43  EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           E++K++KK  ++K+   +          N      N+ N+    +  
Sbjct: 151 EQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNA 197


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
          CCCH-type Zn-finger protein [General function
          prediction only].
          Length = 299

 Score = 29.3 bits (65), Expect = 5.8
 Identities = 9/40 (22%), Positives = 20/40 (50%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K AKK+++  K+     +++ ++K    K K +  K +  
Sbjct: 5  KMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAI 44



 Score = 28.9 bits (64), Expect = 6.2
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
          K    K +   K+ E    K++ EKK+K + ++K++
Sbjct: 35 KNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRR 70


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 29.4 bits (66), Expect = 5.9
 Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 11/88 (12%)

Query: 21  KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN---------NN 69
           +KAK+      +  +KA  K +K+ +K + +  K+    + + N    N         + 
Sbjct: 252 RKAKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNEPRARMLMGDK 311

Query: 70  NNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
           + +N   +NNN +    EE+    ++MA
Sbjct: 312 HWHNAWLHNNNLDFAKLEERAALAWQMA 339


>gnl|CDD|150787 pfam10154, DUF2362, Uncharacterized conserved protein (DUF2362).
           This is a family of proteins conserved from nematodes to
           humans. The function is not known.
          Length = 501

 Score = 29.4 bits (66), Expect = 5.9
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           +K  E ++ E  K+ ++  K   + + NN    +N N N
Sbjct: 132 KKLNELQQSEMNKQMQELGKSLTDQDVNNLAAQHNENQN 170



 Score = 29.4 bits (66), Expect = 6.4
 Identities = 7/27 (25%), Positives = 14/27 (51%)

Query: 54  KKKKKKKKNNNNNNNNNNNNNNNNNNN 80
           K+ ++  K+  + + NN    +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170



 Score = 29.4 bits (66), Expect = 6.5
 Identities = 7/27 (25%), Positives = 13/27 (48%)

Query: 52  KKKKKKKKKKNNNNNNNNNNNNNNNNN 78
           K+ ++  K   + + NN    +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170



 Score = 29.0 bits (65), Expect = 6.6
 Identities = 7/27 (25%), Positives = 13/27 (48%)

Query: 53  KKKKKKKKKNNNNNNNNNNNNNNNNNN 79
           K+ ++  K   + + NN    +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170


>gnl|CDD|238558 cd01138, FeuA, Periplasmic binding protein FeuA.  These proteins
           have predicted to function as initial receptors in ABC
           transport of metal ions in some eubacterial species.
           They belong to the TroA superfamily of periplasmic metal
           binding proteins that share a distinct fold and ligand
           binding mechanism. A typical TroA protein is comprised
           of two globular subdomains connected by a single helix
           and can bind their ligands in the cleft between these
           domains.
          Length = 248

 Score = 28.8 bits (65), Expect = 6.0
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 31  KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           KE +A+K     ++K KE K+K KKK    K
Sbjct: 112 KEDEAEKWLADYKQKAKEAKEKIKKKLGNDK 142


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 28.7 bits (64), Expect = 6.0
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKK 46
           +K KKK + K+EKKA K+EK KE + 
Sbjct: 94  EKEKKKSRTKEEKKALKEEKDKEAEP 119


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 29.0 bits (65), Expect = 6.1
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
           +WKK   +  K   K+ KK  ++ +KK  +  + +KK +K K
Sbjct: 108 EWKKVANQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGK 149


>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
           protein. 
          Length = 342

 Score = 29.0 bits (65), Expect = 6.1
 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 12/56 (21%)

Query: 19  DWKKA------KKKEKKKKEKKAKKKEKK---KEKKKKEKKK---KKKKKKKKKKN 62
           DW+K       KK  K    K A  +EK     +++++E +      K+    K+N
Sbjct: 267 DWQKKVPPPVPKKPAKGPIVKPAISREKSLDSSDRQRQEARSRLAAAKRAASFKQN 322


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 28.9 bits (65), Expect = 6.3
 Identities = 13/49 (26%), Positives = 24/49 (48%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
           DW+  + K KKK    A++ EK +++ +  K+   K+K   K   +   
Sbjct: 120 DWRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEV 168


>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1.  All proteins in this family for
           which functions are known are cyclins that are
           components of TFIIH, a complex that is involved in
           nucleotide excision repair and transcription initiation.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 305

 Score = 29.0 bits (65), Expect = 6.4
 Identities = 11/62 (17%), Positives = 25/62 (40%), Gaps = 4/62 (6%)

Query: 13  TEPTPLDWKKAKKKEKKKKE----KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
             P  +D  +  +   KK E    +K    +KK ++    ++     K +K+K   ++  
Sbjct: 240 ELPQLIDIMRELRILVKKYEEPRSEKVAALKKKLDECHSIEENLSSVKIRKRKGYEDDTY 299

Query: 69  NN 70
            +
Sbjct: 300 IS 301


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 29.2 bits (66), Expect = 6.6
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 20  WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
             +A  + K  K+KK        E   K+KK K   + KK+K
Sbjct: 571 KVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRK 612


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain. 
          This model describes a conserved domain found in
          surface proteins of a number of Firmutes. Many members
          have LPXTG C-terminal anchoring motifs and a
          substantial number have the KxYKxGKxW putative sorting
          signal at the N-terminus. The tetracycline resistance
          plasmid pCF10 in Enterococcus faecalis promotes
          conjugal plasmid transfer in response to sex
          pheromones, but PgrA/Sec10 encoded by that plasmid, a
          member of this family, specifically inhibits the
          ability of cells to receive homologous plasmids. The
          phenomenon is called surface exclusion.
          Length = 356

 Score = 28.9 bits (65), Expect = 6.6
 Identities = 12/79 (15%), Positives = 20/79 (25%), Gaps = 11/79 (13%)

Query: 19 DWKKAKKKEKKKKEKKA-----------KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
          +   A+ K  K                     K   +          K +  K      +
Sbjct: 8  ELNNAQAKVDKANALAGVNTITLPAGYNIDALKNIYESGDFDNSDAAKAEAFKAAAEAKS 67

Query: 68 NNNNNNNNNNNNNNNNNNN 86
           NN  +N  + N   + NN
Sbjct: 68 LNNYKSNAADKNRTVDINN 86


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.3 bits (64), Expect = 6.6
 Identities = 7/48 (14%), Positives = 21/48 (43%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
             L+    + + + K  +  K+K + + + K+ + K K+ + +     
Sbjct: 61  LELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHAL 108


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
          (DMAP1).  DNA methylation can contribute to
          transcriptional silencing through several
          transcriptionally repressive complexes, which include
          methyl-CpG binding domain proteins (MBDs) and histone
          deacetylases (HDACs). The chief enzyme that maintains
          mammalian DNA methylation, DNMT1, can also establish a
          repressive transcription complex. The non-catalytic
          amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
          DNMT1 associated protein), and can mediate
          transcriptional repression. DMAP1 has intrinsic
          transcription repressive activity, and binds to the
          transcriptional co-repressor TSG101. DMAP1 is targeted
          to replication foci through interaction with the far N
          terminus of DNMT1 throughout S phase, whereas HDAC2
          joins DNMT1 and DMAP1 only during late S phase,
          providing a platform for how histones may become
          deacetylated in heterochromatin following replication.
          Length = 175

 Score = 28.3 bits (63), Expect = 6.7
 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 15/53 (28%)

Query: 24 KKKEKKKKEKKAKKKEKKK---------------EKKKKEKKKKKKKKKKKKK 61
          +K E +KKE++ K ++ +K               E +K+EKK  KKK  +K  
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPG 62


>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1.  RNA
           polymerases catalyze the DNA dependent polymerisation of
           RNA. Prokaryotes contain a single RNA polymerase
           compared to three in eukaryotes (not including
           mitochondrial. and chloroplast polymerases). This
           domain, domain 1, represents the clamp domain, which a
           mobile domain involved in positioning the DNA,
           maintenance of the transcription bubble and positioning
           of the nascent RNA strand.
          Length = 330

 Score = 28.8 bits (65), Expect = 7.0
 Identities = 13/54 (24%), Positives = 16/54 (29%), Gaps = 2/54 (3%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKK--KKKKKKKKKKKNNNNNNNNNNNNNNNN 77
           +  K    K     K K  KK+ KK     KKK    K   +N           
Sbjct: 107 ESVKYFFLKVVIDPKGKNSKKRLKKINNLCKKKSICSKCGEDNGGLKAFEGCGK 160


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
          family consists of several hypothetical eukaryotic
          proteins of unknown function.
          Length = 122

 Score = 27.8 bits (62), Expect = 7.0
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
          +   ++  +KE++K E  ++K KK   ++     N N N N 
Sbjct: 2  RAQIEERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNR 43


>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
           superfamily) [Inorganic ion transport and metabolism].
          Length = 554

 Score = 29.2 bits (66), Expect = 7.1
 Identities = 30/169 (17%), Positives = 53/169 (31%), Gaps = 51/169 (30%)

Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVT-------SLFFIGQFGGMTTLQTYAVGIFESI 266
           RLG+           +R    +P  ++         L  + Q   +  L +   G +  +
Sbjct: 118 RLGR----------LIR---FIPRPVLIGFTAGIAILIILTQLPVLLGLASKVSGFWAKV 164

Query: 267 HA----PLDPYFATLLLGVAELGGALLCVVLIHYTGKRP-------LALISTGGSAACFI 315
            A     L    ATLLLG       LL + ++ +  +         +AL+         I
Sbjct: 165 SALFTVLLTINLATLLLG-------LLTLAILLFLPRLTPRIPSPLIALV-----LGTLI 212

Query: 316 VVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPN 364
           V             DS     +   L + L H  +  +   L+  + P 
Sbjct: 213 VWIF--------PLDSLRYGEIPGSLPSGLPHFRLPNVSLSLLLALLPY 253


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.1 bits (63), Expect = 7.1
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 26  KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           KE  K EKK  K+E +K  + +E++KK+ K +K
Sbjct: 69  KESLKIEKKELKQEVEKLPRFQEQEKKRMKAEK 101


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 29.1 bits (66), Expect = 7.2
 Identities = 11/49 (22%), Positives = 24/49 (48%)

Query: 15  PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
           PT L  ++  +K +K  EK     +        +KK+K++  ++ K+ +
Sbjct: 131 PTRLLVEQVVEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGD 179


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 28.0 bits (63), Expect = 7.2
 Identities = 7/26 (26%), Positives = 12/26 (46%)

Query: 36  KKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K+    +  K E KKK +    K++
Sbjct: 106 LKRVAANDAIKAEAKKKGELPSTKRQ 131


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 29.0 bits (66), Expect = 7.2
 Identities = 7/33 (21%), Positives = 19/33 (57%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           + + +++    K++ +    K+ E+K+KK +K 
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 25.7 bits (57), Expect = 7.2
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
           K +EK  + ++K  E++KK ++ +K+ ++
Sbjct: 7  MKNQEKVWEAEQKALEEQKKIEELRKEIEE 36


>gnl|CDD|224756 COG1843, FlgD, Flagellar hook capping protein [Cell motility and
          secretion].
          Length = 222

 Score = 28.5 bits (64), Expect = 7.3
 Identities = 6/32 (18%), Positives = 15/32 (46%)

Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
          N  +  + +    N+  ++ ++N +  K  FL
Sbjct: 4  NMTSPTSASTITTNSKTDSTSSNASLGKDDFL 35


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 14/58 (24%), Positives = 30/58 (51%)

Query: 7   HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
            + G             + KE  +K K+  +K++ K   ++ +KK+KKKKK+++ ++ 
Sbjct: 53  VEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDP 110



 Score = 27.8 bits (62), Expect = 8.1
 Identities = 18/51 (35%), Positives = 30/51 (58%)

Query: 16  TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
                 KA K++K+   K  +  EKK+ K   E+ +KK+KKKKK++ N++ 
Sbjct: 60  PAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDP 110


>gnl|CDD|235848 PRK06664, fliD, flagellar hook-associated protein FliD; Validated.
          Length = 661

 Score = 29.0 bits (65), Expect = 7.3
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 8/60 (13%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK--------KKKKKKKNNNNNNNNNNNN 73
           K K  +++  +   K +E +K+ + KE+K K K        KK K++ N   N N +  N
Sbjct: 601 KVKGLDERIADNNKKIEEYEKKLESKERKLKGKYLTMDQTVKKMKEQSNYLKNFNKSGRN 660


>gnl|CDD|219630 pfam07901, DUF1672, Protein of unknown function (DUF1672).  This
           family is composed of hypothetical bacterial proteins of
           unknown function.
          Length = 277

 Score = 28.6 bits (64), Expect = 7.4
 Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 13/87 (14%)

Query: 19  DWKKAKKKEKKKKEKKAK----------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
           ++KK  K+ KK+ + KA           K    K+    +     ++ KKKK    N   
Sbjct: 188 EFKKGMKQAKKQLDYKANTEVVTTLFSTKDNFTKDNTVDDVIDLSEELKKKKNIPENTTI 247

Query: 69  N---NNNNNNNNNNNNNNNNNEEKGTF 92
               + N  N    N ++NN  E G F
Sbjct: 248 TLQLSKNKINTKRPNYSDNNPIEYGVF 274


>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated
          2).  NSA2 appears to be a protein required for the
          maturation of 27S pre-rRNA in yeast; it has been
          characterized in mammalian cells as a nucleolar protein
          that might play a role in the regulation of the cell
          cycle and in cell proliferation.
          Length = 257

 Score = 28.7 bits (65), Expect = 7.4
 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 21 KKAKK--KEKKKKEKKAKK----------KEKKKEK---KKKEKKKKKKKKKKKKKNNNN 65
          +K KK  +E  K+ KKA+K          K++ KEK   KK  K  +++  K+K  +   
Sbjct: 20 RKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKKTIKMHEERNVKQKVDDKVP 79

Query: 66 NN 67
            
Sbjct: 80 EG 81


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 28.1 bits (64), Expect = 7.5
 Identities = 4/31 (12%), Positives = 14/31 (45%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
          K   K+ + ++ +   K  ++  + +   +K
Sbjct: 41 KGNLKQLEARRAELEAKAAEELAEAEALAEK 71


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 29.2 bits (66), Expect = 7.5
 Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 2/57 (3%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--KKKKKKKKKKKNNNNNNNNNNNNNN 75
           K+     K   E   K +   +E K+ EK+  + KKK    +  +  +N        
Sbjct: 718 KEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEEIGG 774


>gnl|CDD|215856 pfam00318, Ribosomal_S2, Ribosomal protein S2. 
          Length = 205

 Score = 28.3 bits (64), Expect = 7.7
 Identities = 14/47 (29%), Positives = 20/47 (42%)

Query: 10  GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           GT T    +     + +E  KKE    K+E +K +K     K  KK 
Sbjct: 91  GTLTNWKTIKKSIKRLEELSKKEALKLKRELEKLEKYLGGIKNMKKL 137


>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 205

 Score = 28.5 bits (64), Expect = 7.7
 Identities = 8/38 (21%), Positives = 17/38 (44%)

Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          ++ E++ ++    K+ K       E K   +K  K+K 
Sbjct: 58 QRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEKG 95


>gnl|CDD|193106 pfam12628, Inhibitor_I71, Falstatin, cysteine peptidase inhibitor. 
           This family of peptidase inhibitors is expressed from
           plasmodial protozoal species. Falstatin is found to be a
           potent reversible inhibitor of the P. falciparum
           cysteine proteases falcipain-2 and falcipain-3, as well
           as other parasite- and non-parasite-derived cysteine
           proteases, but is only a relatively weak inhibitor of
           the P. falciparum cysteine proteases falcipain-1 and
           dipeptidyl aminopeptidase 1. Thus, P. falciparum
           requires expression of falstatin to limit proteolysis by
           certain host or parasite cysteine proteases during
           erythrocyte invasion.
          Length = 173

 Score = 28.3 bits (63), Expect = 7.7
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 57  KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
           K  K N N+++ N  NNN +++ N  ++N   K
Sbjct: 91  KPIKINENDSSKNVINNNQSSSQNETDSNKPPK 123


>gnl|CDD|227520 COG5193, LHP1, La protein, small RNA-binding pol III transcript
           stabilizing protein and related La-motif-containing
           proteins involved in translation [Posttranslational
           modification, protein turnover, chaperones /
           Translation, ribosomal structure and biogenesis].
          Length = 438

 Score = 28.9 bits (64), Expect = 7.7
 Identities = 15/78 (19%), Positives = 23/78 (29%), Gaps = 3/78 (3%)

Query: 16  TPLDWKKAKKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
            P       K   K+ E     +K  +    K K  K    K KK+ K          N 
Sbjct: 64  NPSKLTSNTKWTLKQVEFYFSGSKDTDSNFPKDKFLKTTAPKNKKRDKWVPIKTIATFNR 123

Query: 73  NNNNNNNNNNNNNNEEKG 90
             N+ +  +  +    K 
Sbjct: 124 MKNSGSPVSAVSGALRKS 141


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 29.1 bits (65), Expect = 7.7
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 25  KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
           ++E++ +E+  + K +K  + K E K+K  K 
Sbjct: 440 QEERRLREEAIRAKAEKTARMKAEMKEKTLKM 471


>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
           and repair].
          Length = 570

 Score = 29.1 bits (66), Expect = 7.8
 Identities = 12/61 (19%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 2   DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            +  +   G F+        + + K++ + + K + +E   + K K  K    +KK KK+
Sbjct: 199 KAIFEKGGGEFS----ARLTEIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKR 254

Query: 62  N 62
           +
Sbjct: 255 S 255


>gnl|CDD|234582 PRK00028, infC, translation initiation factor IF-3; Reviewed.
          Length = 177

 Score = 28.2 bits (64), Expect = 7.8
 Identities = 14/24 (58%), Positives = 18/24 (75%), Gaps = 3/24 (12%)

Query: 25 KKEKKKKEKKAKKKEKK---KEKK 45
          K E+KKK+K+AKKK+K    KE K
Sbjct: 72 KYEQKKKQKEAKKKQKVIEVKEIK 95


>gnl|CDD|181784 PRK09334, PRK09334, 30S ribosomal protein S25e; Provisional.
          Length = 86

 Score = 26.9 bits (60), Expect = 8.0
 Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 5/38 (13%)

Query: 21 KKAKKKEKKKKEKKAKKK-----EKKKEKKKKEKKKKK 53
          KK KKKE  K EK+ K       E+  ++  KE KK+K
Sbjct: 3  KKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEVKKEK 40


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 28.9 bits (65), Expect = 8.1
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            K K  ++   E  A+ KE KKE  +KE + ++ K     K
Sbjct: 64  AKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDAK 104


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 28.8 bits (65), Expect = 8.1
 Identities = 13/49 (26%), Positives = 28/49 (57%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
           +K ++KEK+ +E +    E++KE   K ++ +KK +  +K+     +N 
Sbjct: 324 EKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSNK 372


>gnl|CDD|235614 PRK05807, PRK05807, hypothetical protein; Provisional.
          Length = 136

 Score = 27.8 bits (62), Expect = 8.2
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 15  PTPLDWKKAKKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           P  +DW+K K K      E +  K  K  E++ ++ KK +  K +  K +N 
Sbjct: 84  PAEIDWQKEKNKNNNGNFEDRLSKFLKDSEERFQDLKKHQDSKGRGYKKSNG 135


>gnl|CDD|132949 cd06618, PKc_MKK7, Catalytic domain of the dual-specificity Protein
           Kinase, MAP kinase kinase 7.  Protein kinases (PKs), MAP
           kinase kinase 7 (MKK7) subfamily, catalytic (c) domain.
           PKs catalyze the transfer of the gamma-phosphoryl group
           from ATP to serine/threonine or tyrosine residues on
           protein substrates. The MKK7 subfamily is part of a
           larger superfamily that includes the catalytic domains
           of other protein serine/threonine kinases, protein
           tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. The mitogen-activated protein (MAP) kinase
           signaling pathways are important mediators of cellular
           responses to extracellular signals. The pathways involve
           a triple kinase core cascade comprising the MAP kinase
           (MAPK), which is phosphorylated and activated by a MAPK
           kinase (MAPKK or MKK), which itself is phosphorylated
           and activated by a MAPK kinase kinase (MAPKKK or MKKK).
           MKK7 is a dual-specificity PK that phosphorylates and
           activates its downstream target, c-Jun N-terminal kinase
           (JNK), on specific threonine and tyrosine residues.
           Although MKK7 is capable of dual phosphorylation, it
           prefers to phosphorylate the threonine residue of JNK.
           Thus, optimal activation of JNK requires both MKK4 (not
           included in this subfamily) and MKK7. MKK7 is primarily
           activated by cytokines. MKK7 is essential for liver
           formation during embryogenesis. It plays roles in G2/M
           cell cycle arrest and cell growth. In addition, it is
           involved in the control of programmed cell death, which
           is crucial in oncogenesis, cancer chemoresistance, and
           antagonism to TNFalpha-induced killing, through its
           inhibition by Gadd45beta and the subsequent suppression
           of the JNK cascade.
          Length = 296

 Score = 28.5 bits (64), Expect = 8.4
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 215 LGKDGQRRPNYRMYMRRTFLLPY 237
           L KD ++RP YR  ++  F+  Y
Sbjct: 257 LTKDHRKRPKYRELLQHPFIRRY 279


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 28.8 bits (64), Expect = 8.4
 Identities = 9/14 (64%), Positives = 13/14 (92%)

Query: 61  KNNNNNNNNNNNNN 74
           K NN+NNNN++N+N
Sbjct: 195 KINNSNNNNHSNSN 208


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 28.9 bits (65), Expect = 8.4
 Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 23  AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           A  +  K  E+K+ ++        +E  K+ K+ + K K
Sbjct: 903 ALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941


>gnl|CDD|218071 pfam04409, DUF530, Protein of unknown function (DUF530).  Family of
           hypothetical archaeal proteins.
          Length = 512

 Score = 28.7 bits (64), Expect = 8.4
 Identities = 19/58 (32%), Positives = 23/58 (39%), Gaps = 4/58 (6%)

Query: 7   HQIGTFTEPTPLDWKKAKKKEKKKKEKK----AKKKEKKKEKKKKEKKKKKKKKKKKK 60
           + I  + E    +      +E  KK  K         KK EKKK EK  K K  K KK
Sbjct: 448 YGISDWDEVEVAEILGIDIEELLKKMAKIEPSEILLFKKVEKKKFEKFGKPKSDKAKK 505


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 28.5 bits (64), Expect = 8.5
 Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
           D +K K+    +K  K  +KE ++ ++  + KK KK ++K+ +
Sbjct: 70  DQEKMKEL---QKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109


>gnl|CDD|222863 PHA02540, 61, DNA primase; Provisional.
          Length = 337

 Score = 28.8 bits (65), Expect = 8.5
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 19  DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
           ++   + KE+   + +   K K + KK+K+  +K
Sbjct: 81  EYIMERFKERGTGKGRPVPKPKFEFKKEKKVIEK 114


>gnl|CDD|221195 pfam11737, DUF3300, Protein of unknown function (DUF3300).  This
           hypothetical bacterial gene product has a long
           hydrophobic segment and is thus likely to be a membrane
           protein.
          Length = 230

 Score = 28.4 bits (64), Expect = 8.6
 Identities = 7/25 (28%), Positives = 13/25 (52%)

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNN 86
           +N+NN  N NN+   ++      +N
Sbjct: 206 HNHNNRINRNNDRVVSSGGQRWQHN 230



 Score = 28.4 bits (64), Expect = 9.7
 Identities = 7/25 (28%), Positives = 13/25 (52%)

Query: 62  NNNNNNNNNNNNNNNNNNNNNNNNN 86
           N+N+NN  N NN+   ++      +
Sbjct: 205 NHNHNNRINRNNDRVVSSGGQRWQH 229


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 28.2 bits (63), Expect = 8.7
 Identities = 14/40 (35%), Positives = 15/40 (37%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
            K  +    KK KK     K    K     KK  KKKK K
Sbjct: 169 PKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKKKTK 208


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 28.3 bits (63), Expect = 8.8
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 39  EKKKEKKKKEKKKKKKKKKKKKKNNN 64
           EK K   KK+ K KK KKK+ K+  N
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREAN 232


>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G.  This
          model describes the vacuolar ATP synthase G subunit in
          eukaryotes and includes members from diverse groups
          e.g., fungi, plants, parasites etc. V-ATPases are
          multi-subunit enzymes composed of two functional
          domains: A transmembrane Vo domain and a peripheral
          catalytic domain V1. The G subunit is one of the
          subunits of the catalytic domain. V-ATPases are
          responsible for the acidification of endosomes and
          lysosomes, which are part of the central vacuolar
          system [Energy metabolism, ATP-proton motive force
          interconversion].
          Length = 113

 Score = 27.5 bits (61), Expect = 8.8
 Identities = 13/41 (31%), Positives = 30/41 (73%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
          K+A +K  + +++K K+ ++ KE+ +KE +K K++++K+ K
Sbjct: 16 KRAAEKVSEARKRKTKRLKQAKEEAQKEVEKYKQQREKEFK 56


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 28.1 bits (63), Expect = 8.9
 Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 21  KKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
               K+EK  KKEK++ +K +++ K  K+  +K + KK KKK
Sbjct: 76  ANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK 117


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 27.8 bits (62), Expect = 9.0
 Identities = 10/41 (24%), Positives = 23/41 (56%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
           +    + +   ++ KK   K +K++KE +KK  + +K++K 
Sbjct: 100 EEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKK 140



 Score = 27.8 bits (62), Expect = 9.3
 Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 5/46 (10%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK-----KKKKKK 61
           KK  ++     E   ++ +K ++K KKE+K+ +KK     K++KKK
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 28.8 bits (65), Expect = 9.0
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
           E+K ++++ +  +K+  +   E  +K  K  KK K   +
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQD 500


>gnl|CDD|224097 COG1176, PotB, ABC-type spermidine/putrescine transport system,
           permease component I [Amino acid transport and
           metabolism].
          Length = 287

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 32/132 (24%), Positives = 50/132 (37%), Gaps = 35/132 (26%)

Query: 233 FLLPYAIV--TSLFFIGQF----GGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG 286
           FLLP  IV   S            G+ TL  YA  + +S++  L     +L +       
Sbjct: 24  FLLPLLIVLVPSFLLPLLSFFDYSGVFTLGNYARLLTDSLY--LTVLLRSLWIAAL---V 78

Query: 287 ALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLT 346
            L+C+ L+ Y    P+A          F+          S      L+  + LV+  F T
Sbjct: 79  TLICL-LLGY----PVAY---------FLA--------RSPPRWRNLL--LLLVILPFWT 114

Query: 347 HICIRLLPWMLI 358
            + IR   W++I
Sbjct: 115 SLLIRTYAWIVI 126


>gnl|CDD|217483 pfam03305, Lipoprotein_X, Mycoplasma MG185/MG260 protein.  Most of
           the aligned regions in this family are found towards the
           middle of the member proteins.
          Length = 237

 Score = 28.5 bits (64), Expect = 9.0
 Identities = 13/56 (23%), Positives = 20/56 (35%), Gaps = 5/56 (8%)

Query: 9   IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           +G FT       K +    + K  K+A K E  +   K  +  K K    K    +
Sbjct: 180 LGKFTLD-----KNSLTSPEVKAFKEADKSEDPQTDSKTIELYKDKIPTDKNAGVD 230


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.8 bits (64), Expect = 9.1
 Identities = 10/35 (28%), Positives = 28/35 (80%)

Query: 27  EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
            KKK+E++ K  E++++++K+E+  +K +++++K+
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKR 253


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 28.7 bits (64), Expect = 9.1
 Identities = 10/44 (22%), Positives = 21/44 (47%)

Query: 21  KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
           K++  K+ +KK K     + K ++++KE+K       +     N
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKN 167


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 28.8 bits (65), Expect = 9.1
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 22  KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
           + KKK KK  EK+ +      +KK+KE +   +K++ 
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120


>gnl|CDD|173592 PTZ00401, PTZ00401, aspartyl-tRNA synthetase; Provisional.
          Length = 550

 Score = 28.8 bits (64), Expect = 9.2
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
            A   EKK+ +K+A+K  +  E+K +  +K
Sbjct: 8  AGAPAVEKKQSDKEARKAARLAEEKARAAEK 38


>gnl|CDD|216790 pfam01925, TauE, Sulfite exporter TauE/SafE.  This is a family of
           integral membrane proteins where the alignment appears
           to contain two duplicated modules of three transmembrane
           helices. The proteins are involved in the transport of
           anions across the cytoplasmic membrane during taurine
           metabolism as an exporter of sulfoacetate. This family
           used to be known as DUF81.
          Length = 236

 Score = 28.3 bits (64), Expect = 9.3
 Identities = 8/50 (16%), Positives = 17/50 (34%), Gaps = 1/50 (2%)

Query: 100 LVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLHWRSAAIL 149
           LV  L Y+  +          A   ++++   +     G  + W    +L
Sbjct: 150 LVPALLYLLGLPLKKAVATSLAVFLVSSLAALLGYALAG-LVDWPLLLLL 198


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 11/59 (18%)

Query: 14  EPTPL-DWKKAKKKE-KKKKEKKAKKKEKKKEKKKKE---------KKKKKKKKKKKKK 61
           EP  +  W++ +    +++ E   KKKE+  EK +KE         +KK+K K + +K+
Sbjct: 110 EPESIRKWRERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKE 168


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
          contains an extraordinarily long coiled-coil domain in
          its C-terminal half and is ubiquitously expressed. It
          is a novel binding partner of several syntaxin family
          members and is possibly involved in Ca2+-dependent
          exocytosis in neuroendocrine cells. Gamma-taxilin,
          described as leucine zipper protein Factor Inhibiting
          ATF4-mediated Transcription (FIAT), localises to the
          nucleus in osteoblasts and dimerises with ATF4 to form
          inactive dimers, thus inhibiting ATF4-mediated
          transcription.
          Length = 309

 Score = 28.5 bits (64), Expect = 10.0
 Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
           TP +  K +   K+  E   + K  +K+ KK +K+ ++ +K+K K  + ++
Sbjct: 18 DTPEE--KLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDKLQSEHS 67


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
          includes outer membrane proteins such as OmpH among
          others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 157

 Score = 27.6 bits (62), Expect = 10.0
 Identities = 14/47 (29%), Positives = 29/47 (61%), Gaps = 3/47 (6%)

Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKKNNN 64
           K  + E +KKEK+ +K+E+K +K+     E+ +K K+++ ++K   
Sbjct: 44 FKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQE 90


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.398 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,686,579
Number of extensions: 2943830
Number of successful extensions: 30390
Number of sequences better than 10.0: 1
Number of HSP's gapped: 21246
Number of HSP's successfully gapped: 2657
Length of query: 553
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 451
Effective length of database: 6,413,494
Effective search space: 2892485794
Effective search space used: 2892485794
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.8 bits)