RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy114
         (609 letters)



>gnl|CDD|218624 pfam05529, Bap31, B-cell receptor-associated protein 31-like.
           Bap31 is a polytopic integral protein of the endoplasmic
           reticulum membrane and a substrate of caspase-8. Bap31
           is cleaved within its cytosolic domain, generating
           pro-apoptotic p20 Bap31.
          Length = 149

 Score =  150 bits (381), Expect = 9e-43
 Identities = 57/143 (39%), Positives = 79/143 (55%), Gaps = 5/143 (3%)

Query: 1   MSLQWTLIATVLYFEMAFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILS 60
           MSL WTL+  +LY EMA  LLL+LP+ S  R  KI KS     +  Q    F  +L+ L 
Sbjct: 1   MSLYWTLVFGLLYAEMAVFLLLVLPLPSPIR-RKIFKSVSKLQLSQQFQTGFKILLIFLL 59

Query: 61  LFFLDAIREMRKYAS---PEVKEEAHGHLDAEMQNNMKLFRAQRNFYISGFSLFLWLVIR 117
           + F+D++R + KY+            G   +E Q   +LF AQRN Y+SGF+LFL LVIR
Sbjct: 60  VLFIDSVRRVYKYSGELQGSGNASLGGGERSEYQTLARLFYAQRNLYLSGFTLFLTLVIR 119

Query: 118 QIIQLIAQQANLLAQNEASMNQA 140
           +++ L+ +   L    EA   QA
Sbjct: 120 RVVTLVEELVKLEEL-EALKKQA 141


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 56.4 bits (136), Expect = 4e-09
 Identities = 40/175 (22%), Positives = 71/175 (40%), Gaps = 24/175 (13%)

Query: 1   MSLQWTLIATVLYFEMAFMLLLILPILSTQR---LHKILKSKFVQGVKTQAGWYFGCILV 57
           M + +TL+ ++L  EM    +L+LP+    R   +     SK  +G K           +
Sbjct: 1   MGIYYTLVFSLLVVEMVMFFILVLPLPKRLRRSLMKLYSTSKVYRGFKH----ILKITFI 56

Query: 58  ILSLFFLDAIREM------RKYASPEVKEEAHGHLDAEMQNNMKLFRAQRNFYISGFSLF 111
            + L F+D+ + +          S  +   A   +          F AQRN Y+SG +LF
Sbjct: 57  FILLLFIDSWKRVYRVSKEANLYSASINNYAVTRIAVLA----SRFYAQRNMYLSGSALF 112

Query: 112 LWLVI-RQ---IIQLIAQQANLLAQ---NEASMNQARQAAVAAQALLDGPRGKWF 159
           L +V+ R    + +++ + A    +    EA     +     AQ LL+G +    
Sbjct: 113 LSIVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQE 167



 Score = 32.5 bits (74), Expect = 0.45
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 13/80 (16%)

Query: 529 VQKVVTEQGDK--KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
           V++++ E   K  K DK +   +D    LK +L K +  L+  + N   L        K 
Sbjct: 124 VEEMLEENAKKGGKIDKMEADSTD----LKARLRKAQILLEGLQKNQEELF-------KL 172

Query: 587 YDRLLKEHEKVQKVVTEQGD 606
            D+  +  E+VQK  +++ +
Sbjct: 173 LDKYNELREQVQKESSKKKE 192


>gnl|CDD|216504 pfam01437, PSI, Plexin repeat.  A cysteine rich repeat found in
           several different extracellular receptors. The function
           of the repeat is unknown. Three copies of the repeat are
           found Plexin. Two copies of the repeat are found in
           mahogany protein. A related C. elegans protein contains
           four copies of the repeat. The Met receptor contains a
           single copy of the repeat. The Pfam alignment shows 6
           conserved cysteine residues that may form three
           conserved disulphide bridges, whereas shows 8 conserved
           cysteines. The pattern of conservation suggests that
           cysteines 5 and 7 (that are not absolutely conserved)
           form a disulphide bridge (Personal observation. A
           Bateman).
          Length = 48

 Score = 48.7 bits (116), Expect = 7e-08
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 3/50 (6%)

Query: 278 DCNTYSSCTQCVSS-DFPCDWCVDGHRCTHDTAENCRNDILITGVSRVGP 326
           +C+ Y+SC+ C+++ D  C WC    RCT      C      +  S   P
Sbjct: 1   NCSQYTSCSSCLAARDPYCGWCSSEGRCTR--GTECPRSEGPSWSSSKCP 48



 Score = 41.0 bits (96), Expect = 4e-05
 Identities = 13/41 (31%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 445 NCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIK 485
           +C  CLA  +   CGWC S  RC    +C +         K
Sbjct: 7   SCSSCLAARD-PYCGWCSSEGRCTRGTECPRSEGPSWSSSK 46


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 52.6 bits (127), Expect = 7e-08
 Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 9/125 (7%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
           IK LKE++     E+KK   N    +     + +E  RL++  +K ++ V E   K  D 
Sbjct: 29  IKSLKEEI----AEMKK---NEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKD- 80

Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
           Y+K D   +K LK +L + E+ELK  K  S  L+ + + V +E D L  + E   + V +
Sbjct: 81  YEK-DKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQ 139

Query: 604 QGDKK 608
           +   K
Sbjct: 140 KTGLK 144



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 29/114 (25%), Positives = 56/114 (49%), Gaps = 4/114 (3%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           D   +K LK +L + E+ELK  K  S  L+ + + V +E D L  + E   + V ++   
Sbjct: 84  DKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGL 143

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
           K+   ++    +++ L E+L K E +L +    +        +V ++ D +L+E
Sbjct: 144 KNLLLEQ----KLEALNEELEKKEAQLNEVLAAANLDPAALQAVSEKLDDVLEE 193


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 53.1 bits (128), Expect = 3e-07
 Identities = 33/133 (24%), Positives = 72/133 (54%), Gaps = 8/133 (6%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG----KEYDRLLKEHEKVQ 530
           K L+ + ++++ LK+KL++ E++L + +   A L  + + +G    +E +  LKE E   
Sbjct: 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFY 601

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
               E  D + +  ++    E+K+L+E+L K  EEL + +     L+ + + + K+Y   
Sbjct: 602 NEYLELKDAEKELERE--EKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE- 658

Query: 591 LKEHEKVQKVVTE 603
            +E+E++++   E
Sbjct: 659 -EEYEELREEYLE 670



 Score = 46.6 bits (111), Expect = 3e-05
 Identities = 33/125 (26%), Positives = 71/125 (56%), Gaps = 4/125 (3%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           K LE +  E+K+L+E+L K  EEL + +     L+ + + + K+Y    +E+E++++   
Sbjct: 612 KELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE--EEYEELREEYL 669

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
           E    ++    + + +E+++ +E++ KT E+LK+E       K + + + K  +R+ +  
Sbjct: 670 EL--SRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELR 727

Query: 595 EKVQK 599
           EKV+K
Sbjct: 728 EKVKK 732



 Score = 43.9 bits (104), Expect = 2e-04
 Identities = 30/142 (21%), Positives = 64/142 (45%), Gaps = 13/142 (9%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKE------KLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E+ E   + +E   + +K L+       +L   E+EL++E+     L+ + D   +E   
Sbjct: 578 ELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAE 637

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
             K  E+++K + E   K  ++  +   +E   L  +L+    EL+        L+ + +
Sbjct: 638 TEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELE-------ELEKRRE 690

Query: 582 SVGKEYDRLLKEHEKVQKVVTE 603
            + K  ++L +E E+ +K   E
Sbjct: 691 EIKKTLEKLKEELEEREKAKKE 712



 Score = 43.9 bits (104), Expect = 3e-04
 Identities = 35/139 (25%), Positives = 66/139 (47%), Gaps = 13/139 (9%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E  K  E  E   + LE    E++   E+L K  EEL+++      LK +A    +EY +
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA----EEYIK 297

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
           L + +E+    + E  +K+  +      +EI  ++E++ + EE  K+E+     LK +  
Sbjct: 298 LSEFYEEYLDELRE-IEKRLSRL----EEEINGIEERIKELEE--KEERLEE--LKKKLK 348

Query: 582 SVGKEYDRLLKEHEKVQKV 600
            + K  + L + HE  ++ 
Sbjct: 349 ELEKRLEELEERHELYEEA 367



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 27/131 (20%), Positives = 60/131 (45%), Gaps = 8/131 (6%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E  KR    E+   G+E    E++  +E+L + +++LK+ +     L+ +     + Y+ 
Sbjct: 311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH----ELYEE 366

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
              + E+++++         +K +K    E++ L++   + EEE+ K       LK +  
Sbjct: 367 AKAKKEELERLKKRLTGLTPEKLEK----ELEELEKAKEEIEEEISKITARIGELKKEIK 422

Query: 582 SVGKEYDRLLK 592
            + K  + L K
Sbjct: 423 ELKKAIEELKK 433



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 30/124 (24%), Positives = 56/124 (45%), Gaps = 14/124 (11%)

Query: 476 GLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
           G E   +  K L E+ +   + ++KE       + +     +E +++LK+  ++ K    
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK---- 497

Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTE-EELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
                     K  ++++K L+EKL K   EEL+K+      LK +   +  E   L KE 
Sbjct: 498 ---------LKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKEL 548

Query: 595 EKVQ 598
           EK++
Sbjct: 549 EKLE 552



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 27/124 (21%), Positives = 62/124 (50%), Gaps = 6/124 (4%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           K L    +EI  ++E++ + EE  K+E+     LK +   + K  + L + HE  ++   
Sbjct: 314 KRLSRLEEEINGIEERIKELEE--KEERLEE--LKKKLKELEKRLEELEERHELYEEAKA 369

Query: 535 EQGDKKDDKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
           ++ + +  K +      +++++  E+L K +EE+++E +   A   +     KE  + ++
Sbjct: 370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIE 429

Query: 593 EHEK 596
           E +K
Sbjct: 430 ELKK 433



 Score = 39.7 bits (93), Expect = 0.004
 Identities = 31/138 (22%), Positives = 76/138 (55%), Gaps = 7/138 (5%)

Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL 522
           SS+  E+ E+ +K LE +  E++ LKE++ + E+EL+  + +   L+ +   + +  + L
Sbjct: 213 SSELPELREELEK-LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271

Query: 523 LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
            KE E++++ V      K+ K  K  ++E  +L E   +  +EL++ +   + L+ + + 
Sbjct: 272 KKEIEELEEKV------KELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING 325

Query: 583 VGKEYDRLLKEHEKVQKV 600
           + +    L ++ E+++++
Sbjct: 326 IEERIKELEEKEERLEEL 343



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 23/128 (17%), Positives = 60/128 (46%), Gaps = 15/128 (11%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           + IK  +++L +   E+ +  +    L+ + + + KE   L +  E++     E+ +K+ 
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEI-----EELEKEL 247

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV------GKEYDRLLKEHE 595
           +  +       ++L+EK+ + EE +++ K     L+ +   +       +EY +L + +E
Sbjct: 248 ESLEG----SKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303

Query: 596 KVQKVVTE 603
           +    + E
Sbjct: 304 EYLDELRE 311



 Score = 39.3 bits (92), Expect = 0.007
 Identities = 35/149 (23%), Positives = 65/149 (43%), Gaps = 27/149 (18%)

Query: 482 DEIKRLKEKLS--------KTEEELKKEKTNSAALKSQADSVGKEYDRL------LKE-H 526
           +++K L+EKL         K  EE +K K     LK +  S+ KE ++L      L E  
Sbjct: 503 EQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE 562

Query: 527 EKVQKVVTEQGDKKDDKYQKGDSD------EIKRLKE------KLSKTEEELKKEKTNSA 574
           +K+ ++  E  +   +  + G          +K L+       +L   E+EL++E+    
Sbjct: 563 KKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK 622

Query: 575 ALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
            L+ + D   +E     K  E+++K + E
Sbjct: 623 KLEEELDKAFEELAETEKRLEELRKELEE 651



 Score = 38.1 bits (89), Expect = 0.013
 Identities = 35/128 (27%), Positives = 67/128 (52%), Gaps = 9/128 (7%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ--KVVTEQGDKK 540
           E+KR++++L + EE+ +K +        + + V K+   L+K  E  +  K + E+  K 
Sbjct: 460 ELKRIEKELKEIEEKERKLRKE----LRELEKVLKKESELIKLKELAEQLKELEEKLKKY 515

Query: 541 DDKYQKGDSDEIKRLKEKLSKTE---EELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
           + +  +  ++E ++LKEKL K +   + LKKE      LK +   + K+ D L +E  ++
Sbjct: 516 NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575

Query: 598 QKVVTEQG 605
            K + E G
Sbjct: 576 LKELEELG 583


>gnl|CDD|214655 smart00423, PSI, domain found in Plexins, Semaphorins and
           Integrins. 
          Length = 47

 Score = 46.4 bits (110), Expect = 5e-07
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 278 DCNTYSSCTQCVSSDFP-CDWCVDGHRCTHDTAENCRNDI 316
            C+ Y+SC++C+ +  P C WC    RCT    E C +  
Sbjct: 1   RCSKYTSCSECLLARDPYCAWCSSQGRCTS--GERCDSRR 38



 Score = 37.9 bits (88), Expect = 4e-04
 Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 2/40 (5%)

Query: 439 CRDLANNCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLE 478
           C     +C  CL   + Y C WC S  RC   E+CD   +
Sbjct: 2   CS-KYTSCSECLLARDPY-CAWCSSQGRCTSGERCDSRRQ 39


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 49.7 bits (119), Expect = 4e-06
 Identities = 33/158 (20%), Positives = 80/158 (50%), Gaps = 10/158 (6%)

Query: 462 ESSKRCEIFEQCDKGLEGDSD----EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK 517
           E  +  E  ++    L+ + +    EI  L+E+L + E EL++ +     LK + +++ +
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337

Query: 518 EYDR---LLKEHEKVQKVVTEQGDKKDDK---YQKGDSDEIKRLKEKLSKTEEELKKEKT 571
           E +    LL+E E++   + E  ++ ++K     +   +  + L+E+L++ E EL + + 
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN 397

Query: 572 NSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
               LK + +S+ +  +RL +  E +++ + E   + +
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435



 Score = 48.9 bits (117), Expect = 6e-06
 Identities = 32/133 (24%), Positives = 72/133 (54%), Gaps = 5/133 (3%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-QGDKK 540
            ++K L+++L + EEEL + +     L+ + +   KE + L  E E++++ + E Q +  
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291

Query: 541 DDKYQKGD-SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR---LLKEHEK 596
           + K +  +   EI  L+E+L + E EL++ +     LK + +++ +E +    LL+E E+
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ 351

Query: 597 VQKVVTEQGDKKD 609
           +   + E  ++ +
Sbjct: 352 LLAELEEAKEELE 364



 Score = 48.9 bits (117), Expect = 7e-06
 Identities = 24/121 (19%), Positives = 64/121 (52%), Gaps = 1/121 (0%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
           + + KE+  + E +L++ + N   L+   + + K+ ++L ++ EK ++    + + ++ +
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELE 226

Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
                  ++K L+++L + EEEL + +     L+ + +   KE + L  E E++++ + E
Sbjct: 227 LALL-LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285

Query: 604 Q 604
            
Sbjct: 286 L 286



 Score = 47.8 bits (114), Expect = 2e-05
 Identities = 28/144 (19%), Positives = 73/144 (50%), Gaps = 14/144 (9%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV- 529
           E+ ++ L    +E++ L+E+L + E+E+++ K+    L+ + + + +E   L +E E++ 
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301

Query: 530 -----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
                 +   E+ + + ++ ++     ++ LKEK+   +EEL++ +T    L+       
Sbjct: 302 GEISLLRERLEELENELEELEE----RLEELKEKIEALKEELEERETLLEELEQLL---- 353

Query: 585 KEYDRLLKEHEKVQKVVTEQGDKK 608
            E +   +E E+    + E+ ++ 
Sbjct: 354 AELEEAKEELEEKLSALLEELEEL 377



 Score = 46.2 bits (110), Expect = 5e-05
 Identities = 29/120 (24%), Positives = 67/120 (55%), Gaps = 5/120 (4%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
            DE+K L+E+  + EEEL++ ++  A LK + + + +  + L  + E+++  + E  ++ 
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEEL 935

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKNQADSVGKEYDRLLKEHEKVQK 599
           +++Y+     E++R  E+L   EEE++     N  A+  + + V + Y+ L  + E +++
Sbjct: 936 EEEYEDTLETELEREIERL---EEEIEALGPVNLRAI-EEYEEVEERYEELKSQREDLEE 991



 Score = 45.5 bits (108), Expect = 9e-05
 Identities = 28/124 (22%), Positives = 63/124 (50%), Gaps = 9/124 (7%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           + ++ L+E+L   EE L K K     L+ +  ++ +E + L +E E+ ++ +     + +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELE 817

Query: 542 DKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
              Q+ +    EI+ L+E++ + EE+L         L+ + + + KE + L +E E+++ 
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKL-------DELEEELEELEKELEELKEELEELEA 870

Query: 600 VVTE 603
              E
Sbjct: 871 EKEE 874



 Score = 43.5 bits (103), Expect = 4e-04
 Identities = 28/133 (21%), Positives = 66/133 (49%), Gaps = 9/133 (6%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL---LKEHEKVQKVV 533
           LE   +  + L+E+L++ E EL + +     LK + +S+ +  +RL   L++ ++  K +
Sbjct: 371 LEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKEL 430

Query: 534 TEQGDKKDDKYQKGDS------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
             + ++   + ++ +       ++++ L+++L + E EL + +     L+ +  S+    
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARL 490

Query: 588 DRLLKEHEKVQKV 600
           DRL  E    Q V
Sbjct: 491 DRLEAEQRASQGV 503



 Score = 42.8 bits (101), Expect = 6e-04
 Identities = 29/131 (22%), Positives = 75/131 (57%), Gaps = 2/131 (1%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           E+  + LE    +++ LK +L+  EEEL++ ++    L+ + + + +E + L +  E+++
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764

Query: 531 KVVTEQGDK--KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
           + +    +   K  +  +   ++ + L+E+L + EEEL++ +    AL+ + +S+ +  +
Sbjct: 765 EELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRE 824

Query: 589 RLLKEHEKVQK 599
           RL +E E++++
Sbjct: 825 RLEQEIEELEE 835



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 32/142 (22%), Positives = 75/142 (52%), Gaps = 9/142 (6%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-------KSQADSVGKEYDRLLKEHEKV 529
           L+   +E+  L+ +L K EEELK  K    +L       + Q + + ++ + L +E   +
Sbjct: 669 LKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAAL 728

Query: 530 QKVVTE-QGDKKD-DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
           ++ + + Q   ++ ++  +   +E++ L+E+L + EEEL+  +   A LK + + + ++ 
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788

Query: 588 DRLLKEHEKVQKVVTEQGDKKD 609
             L +E E++++ + E   + D
Sbjct: 789 QALQEELEELEEELEEAERRLD 810



 Score = 39.7 bits (93), Expect = 0.005
 Identities = 25/110 (22%), Positives = 56/110 (50%), Gaps = 11/110 (10%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSA-------ALKSQADSVGKEYDRL---LKEH 526
            E    EI+ L+E++ + EE+L + +            LK + + +  E + L   LKE 
Sbjct: 823 RERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKEL 882

Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
           E+ ++ + E+  + + +  +   +EI++L+E+L + E +L++ +     L
Sbjct: 883 EEEKEELEEELRELESELAE-LKEEIEKLRERLEELEAKLERLEVELPEL 931



 Score = 37.4 bits (87), Expect = 0.027
 Identities = 34/141 (24%), Positives = 78/141 (55%), Gaps = 11/141 (7%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
           E+ E+ D  LE + +E+++  E+L +  EEL+ EK        + +   +E +  L+E E
Sbjct: 839 ELEEKLD-ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE 897

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQADSVGK 585
                + E+ +K  ++ ++ ++ +++RL+ +L + EEEL++E   T    L+ + + + +
Sbjct: 898 SELAELKEEIEKLRERLEELEA-KLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956

Query: 586 EYDRL-------LKEHEKVQK 599
           E + L       ++E+E+V++
Sbjct: 957 EIEALGPVNLRAIEEYEEVEE 977



 Score = 35.5 bits (82), Expect = 0.10
 Identities = 24/117 (20%), Positives = 56/117 (47%), Gaps = 13/117 (11%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKS-QADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
           +E+++  EKL +  E+ ++ +   A L+  +   +  +   L KE E++++ +       
Sbjct: 196 EELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEEL------- 248

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
                + + +E++   E+  K  EELK E      L+ + + + +E   L +E E++
Sbjct: 249 --SRLEEELEELQEELEEAEKEIEELKSELEE---LREELEELQEELLELKEEIEEL 300



 Score = 30.5 bits (69), Expect = 3.4
 Identities = 27/137 (19%), Positives = 52/137 (37%), Gaps = 30/137 (21%)

Query: 458  CGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKSQADSVG 516
                E        E  ++  +    E++R  E+L   EEE++     N  A++ + + V 
Sbjct: 921  LERLEVELPELEEELEEEYEDTLETELEREIERL---EEEIEALGPVNLRAIE-EYEEVE 976

Query: 517  KEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
            + Y+ L  + E +++                     ++L E +    EEL KEK      
Sbjct: 977  ERYEELKSQREDLEE-------------------AKEKLLEVI----EELDKEKRE--RF 1011

Query: 577  KNQADSVGKEYDRLLKE 593
            K   D + + +  + KE
Sbjct: 1012 KETFDKINENFSEIFKE 1028



 Score = 30.1 bits (68), Expect = 4.0
 Identities = 10/58 (17%), Positives = 33/58 (56%)

Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
           + + KE+  + E +L++ + N   L++  + + K+ ++L ++ EK ++    + + ++
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224



 Score = 28.9 bits (65), Expect = 9.5
 Identities = 16/63 (25%), Positives = 32/63 (50%)

Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
            E+ ++ LE   ++++ L+++L + E EL + +     L+ +  S+    DRL  E    
Sbjct: 441 LEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRAS 500

Query: 530 QKV 532
           Q V
Sbjct: 501 QGV 503


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 46.7 bits (110), Expect = 4e-05
 Identities = 41/153 (26%), Positives = 75/153 (49%), Gaps = 12/153 (7%)

Query: 462  ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGK 517
            E  K+ E  ++ ++  +  + E  +  E+  K  EE KK    EK  + ALK +A+   K
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 518  EYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
              +   KE E+ +K    +  KK ++  K  ++E K+  E+  K  EE KK++      K
Sbjct: 1704 AEELKKKEAEEKKKA---EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE----K 1756

Query: 578  NQADSVGKEYDRLLKEHEKVQKVVTEQG-DKKD 609
             +   + KE ++  +E  K ++ V E+  D++D
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEED 1789



 Score = 46.7 bits (110), Expect = 4e-05
 Identities = 34/139 (24%), Positives = 58/139 (41%), Gaps = 14/139 (10%)

Query: 481  SDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
            +DE K+  E+  K  +  KK   E   +A           +     +E  +  +   E+ 
Sbjct: 1317 ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376

Query: 538  DKKDDKYQKG-----DSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDR 589
             KK D  +K       +DE K+  E+  K  +ELKK    K  +   K +A+   K+ D 
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK-KKADE 1435

Query: 590  LLKEHEKVQKVVTEQGDKK 608
              K+ E+ +K   ++  KK
Sbjct: 1436 AKKKAEEAKK--ADEAKKK 1452



 Score = 45.5 bits (107), Expect = 8e-05
 Identities = 34/131 (25%), Positives = 63/131 (48%), Gaps = 12/131 (9%)

Query: 481  SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
            + E  + + + +  E E  +EK  +A  K +     K+ D   K+ E+ +K   ++  KK
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK--KKADAAKKKAEEKKK--ADEAKKK 1399

Query: 541  DDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDRLLKEHEKV 597
             ++ +K  +DE+K+      K  +E KK   EK  +   K +A+   K+ D   K+ E+ 
Sbjct: 1400 AEEDKK-KADELKK-AAAAKKKADEAKKKAEEKKKADEAKKKAEE-AKKADEAKKKAEEA 1456

Query: 598  QKVVTEQGDKK 608
            +K   E+  KK
Sbjct: 1457 KK--AEEAKKK 1465



 Score = 44.0 bits (103), Expect = 3e-04
 Identities = 37/136 (27%), Positives = 63/136 (46%), Gaps = 13/136 (9%)

Query: 481  SDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
            +DE K+  E+  K EE  KK  E   +   K +A+   K+ D   K+ E+ +K   E   
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE-AKKADEAKKKAEEAKKKADEAKK 1504

Query: 539  KKDDKYQKGDSDEIKRLKEK-----LSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
              + K +   +DE K+ +E        K EE  K ++   A  K +AD + K  +  LK+
Sbjct: 1505 AAEAKKK---ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKK 1559

Query: 594  HEKVQKVVTEQGDKKD 609
             E+ +K    +  ++D
Sbjct: 1560 AEEKKKAEEAKKAEED 1575



 Score = 42.8 bits (100), Expect = 6e-04
 Identities = 29/128 (22%), Positives = 57/128 (44%), Gaps = 6/128 (4%)

Query: 481  SDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
            +DE K+ +EK  K  +E KK   E   +   K +A+   K+ D   K+ E+ +K      
Sbjct: 1292 ADEAKKAEEK--KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349

Query: 538  DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-KNQADSVGKEYDRLLKEHEK 596
             + +    + ++ E K    +  K E + K +     A  K +AD   K+ +   K+ ++
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409

Query: 597  VQKVVTEQ 604
            ++K    +
Sbjct: 1410 LKKAAAAK 1417



 Score = 42.4 bits (99), Expect = 7e-04
 Identities = 34/133 (25%), Positives = 63/133 (47%), Gaps = 9/133 (6%)

Query: 478  EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
            E  +DE +  +EK    E++ ++ K  + A K +A+   K+ D   K+ E+ +K   E  
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK-KKADEAKKKAEEDKKKADELK 1411

Query: 538  DKKDDKYQKGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKNQADSVGKEYDRLLKEHE 595
                 K +   +DE K+  E+  K +E  KK  E   +   K +A+   K+ +   K+ E
Sbjct: 1412 KAAAAKKK---ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE-AKKAEEAKKKAE 1467

Query: 596  KVQKVVTEQGDKK 608
            + +K   ++  KK
Sbjct: 1468 EAKK--ADEAKKK 1478



 Score = 42.4 bits (99), Expect = 7e-04
 Identities = 39/154 (25%), Positives = 71/154 (46%), Gaps = 10/154 (6%)

Query: 462  ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKE---KTNSAALKSQADSVGKE 518
            E ++  E+ +  ++  +  ++E K+ +E   K EE  K E   K      K +A+   K 
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649

Query: 519  YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAA 575
             +  LK+ E+  K+   +  KK ++  K  ++E K+ +E   K  E LKK   E   +  
Sbjct: 1650 EE--LKKAEEENKIKAAEEAKKAEE-DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEE 1706

Query: 576  LKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
            LK +     K+ + L K  E+  K+  E+  K+ 
Sbjct: 1707 LKKKEAEEKKKAEELKKA-EEENKIKAEEAKKEA 1739



 Score = 41.3 bits (96), Expect = 0.002
 Identities = 32/127 (25%), Positives = 63/127 (49%), Gaps = 11/127 (8%)

Query: 482  DEIKRLKEKLSKTEEELKKE----KTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
            +++K+ + +  K  EELKK     K  +A    +A+   K+ +   K  E  +K   E  
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK-AAEAL 1694

Query: 538  DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE----KTNSAALKNQADSVGKEYDRLLKE 593
             K+ ++ +K  ++E+K+ + +  K  EELKK     K  +   K +A+   K+ +   K+
Sbjct: 1695 KKEAEEAKK--AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKD 1752

Query: 594  HEKVQKV 600
             E+ +K+
Sbjct: 1753 EEEKKKI 1759



 Score = 40.9 bits (95), Expect = 0.002
 Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 5/142 (3%)

Query: 471  EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
            E+  K  E    E K+  ++L K EE  K E+   A    +A+       R  +E +K +
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590

Query: 531  KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE---KTNSAALKNQADSVGKEY 587
            +   E+  K  ++ +K  ++E K+ +E   K EE  K E   K      K +A+   K  
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 588  DRLLKEHEKVQKVVTEQGDKKD 609
            +  LK+ E+  K+   +  KK 
Sbjct: 1651 E--LKKAEEENKIKAAEEAKKA 1670



 Score = 39.0 bits (90), Expect = 0.009
 Identities = 37/138 (26%), Positives = 61/138 (44%), Gaps = 15/138 (10%)

Query: 480  DSDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
            ++ E K+ + K  K  +  KK   EK  +   K +A+   K+ D L K     +K   ++
Sbjct: 1367 EAAEKKKEEAK--KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK--ADE 1422

Query: 537  GDKKDDKYQKGD-----SDEIKRLKEKLSKTEEELKKEK-TNSAALKNQADSVGKEYDRL 590
              KK ++ +K D     ++E K+  E   K EE  K E+    A    +AD   K+ +  
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482

Query: 591  LKEHEKVQKVVTEQGDKK 608
             K  E  +K   E+  KK
Sbjct: 1483 KKADEAKKK--AEEAKKK 1498



 Score = 38.6 bits (89), Expect = 0.011
 Identities = 32/121 (26%), Positives = 57/121 (47%), Gaps = 12/121 (9%)

Query: 481  SDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
            +DE K+  E+  K +E  KK  E   +   K +A+   K+ +   K+ E+ +K   ++  
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE-AKKAEEAKKKAEEAKK--ADEAK 1476

Query: 539  KKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDRLLKEHE 595
            KK ++ +K  +DE K+  E+  K  +E KK    K  +   K   ++  K+ D   K  E
Sbjct: 1477 KKAEEAKK--ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA--KKADEAKKAEE 1532

Query: 596  K 596
             
Sbjct: 1533 A 1533



 Score = 37.0 bits (85), Expect = 0.032
 Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 15/138 (10%)

Query: 481  SDEIKRLKEK-----LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
            +DE K+ +E        K EE  K ++   A  K +AD + K  +  LK+ E+ +K    
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKKAEEKKKAEEA 1569

Query: 536  QGDKKDDKYQKGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
            +  ++D       ++E K+ +E     + K  EE KK K   A    +A    +E    L
Sbjct: 1570 KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEE----L 1625

Query: 592  KEHEKVQKVVTEQGDKKD 609
            K+ E+ +K V +   K+ 
Sbjct: 1626 KKAEEEKKKVEQLKKKEA 1643



 Score = 36.3 bits (83), Expect = 0.057
 Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 21/166 (12%)

Query: 462  ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEE-----ELKK---EKTNSAALKSQAD 513
            E+ K+ E  ++ D+  +  ++E K+ +E   K EE     E KK   E   +   K +A+
Sbjct: 1435 EAKKKAEEAKKADE-AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493

Query: 514  SVGKEYDRLLKEHE---KVQKVVTEQGDKKDDKYQKGD----SDEIKRLKEK-----LSK 561
               K+ D   K  E   K  +    +  KK D+ +K +    +DE K+ +EK     L K
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553

Query: 562  TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
             EE  K E+   A    +A+       R  +E +K ++   E+  K
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599



 Score = 35.9 bits (82), Expect = 0.068
 Identities = 30/128 (23%), Positives = 53/128 (41%), Gaps = 4/128 (3%)

Query: 482  DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            +E K+  E+  K EEE   E+         A    ++     +E  K  ++   +  KK 
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292

Query: 542  DKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
            D+ +K  ++E K+  E   K EE  K +  K  +   K +AD+  K+ +   K  E  + 
Sbjct: 1293 DEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350

Query: 600  VVTEQGDK 607
                  D+
Sbjct: 1351 EAEAAADE 1358



 Score = 35.9 bits (82), Expect = 0.089
 Identities = 28/115 (24%), Positives = 47/115 (40%), Gaps = 21/115 (18%)

Query: 493  KTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD---- 548
            K EE  K ++   A  K +AD          K+ E+ +K   ++  KK ++ +K D    
Sbjct: 1273 KAEEARKADELKKAEEKKKADE--------AKKAEEKKKA--DEAKKKAEEAKKADEAKK 1322

Query: 549  -SDEIKRLKEKLSKTEE------ELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
             ++E K+  +   K  E      E  K +  +AA + +A     E     KE  K
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377



 Score = 35.5 bits (81), Expect = 0.10
 Identities = 32/142 (22%), Positives = 63/142 (44%), Gaps = 7/142 (4%)

Query: 462  ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
            E  K+ E  ++ ++  +  ++E K+  E+  K  EE KK++     +        K+ + 
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772

Query: 522  LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
            + KE E V   + E+ D++D+K +     +IK + +  +   E  K+        K   D
Sbjct: 1773 IRKEKEAV---IEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMED 1829

Query: 582  SVGKEY----DRLLKEHEKVQK 599
            S  KE     +  L+E +  +K
Sbjct: 1830 SAIKEVADSKNMQLEEADAFEK 1851



 Score = 35.1 bits (80), Expect = 0.15
 Identities = 34/146 (23%), Positives = 70/146 (47%), Gaps = 16/146 (10%)

Query: 465  KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTE-EELKKEKTNSAALKSQADSVGKEYDRLL 523
            ++ E  ++ +   E   +E+ +L E+  K + EE KK +      K +A+ + K  +   
Sbjct: 1581 RKAEEAKKAE---EARIEEVMKLYEEEKKMKAEEAKKAEEA----KIKAEELKKA-EEEK 1632

Query: 524  KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
            K+ E+++K   E+  K ++  +  + ++IK    + +K  EE   +K  +   K   +  
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIK--AAEEAKKAEE---DKKKAEEAKKAEEDE 1687

Query: 584  GKEYDRLLKEHEKVQKVVTEQGDKKD 609
             K  + L KE E+ +K   E+  KK+
Sbjct: 1688 KKAAEALKKEAEEAKKA--EELKKKE 1711



 Score = 32.8 bits (74), Expect = 0.71
 Identities = 30/142 (21%), Positives = 58/142 (40%), Gaps = 6/142 (4%)

Query: 470  FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EK 528
             E+  K  E    E  +  E + K EE  K  +    A + + +   ++++     H  +
Sbjct: 1208 AEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFAR 1267

Query: 529  VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK-KEKTNSAALKNQADSVGKEY 587
             Q  +  +  +K D+ +K    E K+  ++  K EE+ K  E    A    +AD   K+ 
Sbjct: 1268 RQAAIKAEEARKADELKKA---EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKA 1324

Query: 588  DRLLKEHEKVQKVVTEQGDKKD 609
            +   K+ +  +K   E+  K  
Sbjct: 1325 EEAKKKADAAKK-KAEEAKKAA 1345


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.8 bits (109), Expect = 6e-05
 Identities = 26/135 (19%), Positives = 66/135 (48%), Gaps = 12/135 (8%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK-------EYDRLLKEHEKVQKVV 533
            +E  R+KEK+ + E E+   + + A  + + +   +       E D+LL E E++++ +
Sbjct: 286 EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
            E+  ++D       ++E   LKE+L     EL++     A  +++     ++ ++L +E
Sbjct: 346 EEERKRRDKL-----TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400

Query: 594 HEKVQKVVTEQGDKK 608
             ++++ +    ++ 
Sbjct: 401 INELKRELDRLQEEL 415



 Score = 44.7 bits (106), Expect = 1e-04
 Identities = 23/132 (17%), Positives = 62/132 (46%), Gaps = 9/132 (6%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
           ++E   LKE+L     EL++     A  + +     ++ ++L +E  ++++ +    ++ 
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEEL 415

Query: 541 DDKYQKGD--SDEIKRLKEKLSKTEE-------ELKKEKTNSAALKNQADSVGKEYDRLL 591
               ++    +  I  ++ K+++ EE       E+KK++     L        +E   L 
Sbjct: 416 QRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLK 475

Query: 592 KEHEKVQKVVTE 603
           +E+++V+K +++
Sbjct: 476 EEYDRVEKELSK 487



 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 25/126 (19%), Positives = 62/126 (49%), Gaps = 12/126 (9%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
              ++ +     E++  +E+L+K E E+ K       L ++ + + +E +   K  +K+ 
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDK-------LLAEIEELEREIEEERKRRDKLT 356

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
           +   E  ++ +D        E++ + ++ ++T +ELK  +     LK + + + +E DRL
Sbjct: 357 EEYAELKEELEDLRA-----ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411

Query: 591 LKEHEK 596
            +E ++
Sbjct: 412 QEELQR 417



 Score = 42.0 bits (99), Expect = 0.001
 Identities = 28/140 (20%), Positives = 63/140 (45%), Gaps = 21/140 (15%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALK-----SQADSVGKEYDRLLKE 525
              ++ +E    E+K L+ ++ + EE+L K +     L+     S+   +  E  +L +E
Sbjct: 747 SSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEE 806

Query: 526 HEKVQKVV--TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
             +++  +   EQ              ++ RL  +    E+E+++ +     LK Q  S+
Sbjct: 807 VSRIEARLREIEQ--------------KLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI 852

Query: 584 GKEYDRLLKEHEKVQKVVTE 603
            KE + L  + E++++ + E
Sbjct: 853 EKEIENLNGKKEELEEELEE 872



 Score = 40.4 bits (95), Expect = 0.003
 Identities = 28/134 (20%), Positives = 64/134 (47%), Gaps = 25/134 (18%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           + + ++ +L+  EEEL+K       L  +   + K  + + +  E++ K + + G     
Sbjct: 238 QKEAIERQLASLEEELEK-------LTEEISELEKRLEEIEQLLEELNKKIKDLG----- 285

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKNQADSVGK------EYDRLLKEHE 595
                  +E  R+KEK+ + E E+   E++ +   +   D+  +      E D+LL E E
Sbjct: 286 ------EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE 339

Query: 596 KVQKVVTEQGDKKD 609
           ++++ + E+  ++D
Sbjct: 340 ELEREIEEERKRRD 353



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 23/138 (16%), Positives = 61/138 (44%), Gaps = 7/138 (5%)

Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN-SAALKSQADSVGKEYDR 521
           +S   E+ E+  + +      ++ +++ L +  +++K         +K +   +  E   
Sbjct: 247 ASLEEEL-EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIAS 305

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
           L    E+       + +  +++  K ++ EI +L  ++ + E E+++E+     L  +  
Sbjct: 306 L----ERSIAEKERELEDAEERLAKLEA-EIDKLLAEIEELEREIEEERKRRDKLTEEYA 360

Query: 582 SVGKEYDRLLKEHEKVQK 599
            + +E + L  E E+V K
Sbjct: 361 ELKEELEDLRAELEEVDK 378



 Score = 38.1 bits (89), Expect = 0.015
 Identities = 33/136 (24%), Positives = 62/136 (45%), Gaps = 2/136 (1%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           E  ++ L     EI +L  ++ + E E+++E+     L  +   + +E + L  E E+V 
Sbjct: 318 EDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD 377

Query: 531 KVVTEQGDKKDDKYQKGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
           K   E  D+  D  +K +    EI  LK +L + +EEL++     A L      +  + +
Sbjct: 378 KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKIN 437

Query: 589 RLLKEHEKVQKVVTEQ 604
            L +E E     + +Q
Sbjct: 438 ELEEEKEDKALEIKKQ 453



 Score = 37.7 bits (88), Expect = 0.021
 Identities = 24/86 (27%), Positives = 41/86 (47%), Gaps = 5/86 (5%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            EI+ L  K  + EEEL++ +     L+S+   + KE D L  +  ++++ + E   + +
Sbjct: 854 KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELK 567
            K        +  LK KL   EEEL 
Sbjct: 914 KK-----RKRLSELKAKLEALEEELS 934



 Score = 35.4 bits (82), Expect = 0.097
 Identities = 23/118 (19%), Positives = 49/118 (41%), Gaps = 1/118 (0%)

Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD 548
            K  K  EEL++ + N   L    D   ++ +RL +E EK ++       +K +      
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALL-KEKREYEGYEL 228

Query: 549 SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGD 606
             E + L+ +    E +L   +     L  +   + K  + + +  E++ K + + G+
Sbjct: 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE 286



 Score = 34.3 bits (79), Expect = 0.25
 Identities = 23/130 (17%), Positives = 44/130 (33%), Gaps = 9/130 (6%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           E     ++    ++++L   LSK E+EL   K     ++ +   + +E      +    +
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ---AR 500

Query: 531 KVVTEQGDKKDDKYQKGDSDE-IKRLKEKLSKTEEELKK--EKTNSAALKN---QADSVG 584
                    +  +     S + +     +L    E      E      L N   + D+V 
Sbjct: 501 ASEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA 560

Query: 585 KEYDRLLKEH 594
           KE   LLK  
Sbjct: 561 KEAIELLKRR 570



 Score = 32.3 bits (74), Expect = 0.79
 Identities = 26/131 (19%), Positives = 63/131 (48%), Gaps = 10/131 (7%)

Query: 471  EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
               +  L     E   L+ +L + E ++++       L++Q +   K    L  + E ++
Sbjct: 878  RDLESRLGDLKKERDELEAQLRELERKIEE-------LEAQIEKKRKRLSELKAKLEALE 930

Query: 531  KVVTEQGD-KKDDKYQKGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKNQADSVGKEYD 588
            + ++E  D K +D+    +   ++ ++ +L + EEE++  E  N  A++ + + V K  D
Sbjct: 931  EELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ-EYEEVLKRLD 989

Query: 589  RLLKEHEKVQK 599
             L ++  K+++
Sbjct: 990  ELKEKRAKLEE 1000



 Score = 32.3 bits (74), Expect = 0.85
 Identities = 18/103 (17%), Positives = 48/103 (46%), Gaps = 3/103 (2%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           +G    S       ++L +  E LK+E ++  +   + ++   E  + L +  +  K+  
Sbjct: 663 RGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR--KIGE 720

Query: 535 EQGDKKD-DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
            + + +  ++ ++   + ++ L+E LS  E+E++  K+    L
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 45.5 bits (108), Expect = 7e-05
 Identities = 24/122 (19%), Positives = 57/122 (46%), Gaps = 7/122 (5%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +  + L+ +L K   + + E+   AAL+   + V +    + +E ++++         + 
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENE-------EQ 682

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
            + +  + ++++   E+L +  EEL K+      L  + +S   E + L KE EK++K +
Sbjct: 683 LEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKAL 742

Query: 602 TE 603
             
Sbjct: 743 EL 744



 Score = 44.8 bits (106), Expect = 1e-04
 Identities = 29/135 (21%), Positives = 69/135 (51%), Gaps = 4/135 (2%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           + S+  E  E   + LE +  E+K+L+E   + EEE  +++    AL+ +   + +E +R
Sbjct: 193 QLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE--ALEERLAELEEEKER 250

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
           L +   ++ ++ + + +    K ++ +  E++RL E+L +  E L++ +     L+ + +
Sbjct: 251 LEELKARLLEIESLELEA--LKIREEELRELERLLEELEEKIERLEELEREIEELEEELE 308

Query: 582 SVGKEYDRLLKEHEK 596
            +    + L +  EK
Sbjct: 309 GLRALLEELEELLEK 323



 Score = 40.9 bits (96), Expect = 0.002
 Identities = 29/122 (23%), Positives = 64/122 (52%), Gaps = 8/122 (6%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +E++ L EKL   EE L+K +     L+S+     +E      E  K+ +   ++ +++ 
Sbjct: 315 EELEELLEKLKSLEERLEKLEEKLEKLESE----LEELAEEKNELAKLLEERLKELEERL 370

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
           ++ +K    E+++  E+L + EE +++ K   A L    + + +E + L KE E++++ +
Sbjct: 371 EELEK----ELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELEREL 426

Query: 602 TE 603
            E
Sbjct: 427 EE 428



 Score = 39.0 bits (91), Expect = 0.008
 Identities = 32/142 (22%), Positives = 64/142 (45%), Gaps = 24/142 (16%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTN--------SAALKSQADSVGKEYDRLLKE--- 525
           LE + + ++    +L + +E L++ K            ALK + + + +E +RLL+E   
Sbjct: 230 LEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL-RELERLLEELEE 288

Query: 526 ---------HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNS 573
                     E  +     +G +   +  +   +++K L+E+L K EE+L+K   E    
Sbjct: 289 KIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEEL 348

Query: 574 AALKNQADSVGKEYDRLLKEHE 595
           A  KN+   + +E  + L+E  
Sbjct: 349 AEEKNELAKLLEERLKELEERL 370



 Score = 37.0 bits (86), Expect = 0.031
 Identities = 23/134 (17%), Positives = 63/134 (47%), Gaps = 13/134 (9%)

Query: 468 EIF--EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
           E+F  E+ +K  E   + IK  K K+ + E +L +   +               + L +E
Sbjct: 162 ELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIE----------DLLEALEEE 211

Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
            ++++K+  E  ++++++  + + + ++    +L + +E L++ K     +++      K
Sbjct: 212 LKELKKL-EEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALK 270

Query: 586 EYDRLLKEHEKVQK 599
             +  L+E E++ +
Sbjct: 271 IREEELRELERLLE 284



 Score = 36.7 bits (85), Expect = 0.045
 Identities = 29/130 (22%), Positives = 62/130 (47%), Gaps = 9/130 (6%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           LE    E++ L+E+LS+ +EE  + +     L+ +   + +E   LL+  E +++ + E+
Sbjct: 476 LELYELELEELEEELSREKEE-AELREEIEELEKELRELEEELIELLELEEALKEELEEK 534

Query: 537 GDK-------KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
            +K        ++  +K    ++K    +L    +ELK+       L+ + + + +E   
Sbjct: 535 LEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL-EELRE 593

Query: 590 LLKEHEKVQK 599
            LKE +K  K
Sbjct: 594 RLKELKKKLK 603



 Score = 34.4 bits (79), Expect = 0.20
 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 4/74 (5%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           +  +  E  E+  + LE   +E+++L+E+L    EEL K+      L  + +S   E + 
Sbjct: 675 QRIENEEQLEEKLEELEQLEEELEQLREEL----EELLKKLGEIEQLIEELESRKAELEE 730

Query: 522 LLKEHEKVQKVVTE 535
           L KE EK++K +  
Sbjct: 731 LKKELEKLEKALEL 744



 Score = 33.2 bits (76), Expect = 0.42
 Identities = 20/108 (18%), Positives = 47/108 (43%), Gaps = 5/108 (4%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAAL--KSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
             + LKE+L +  E+L+        L  K Q   + +E  +L      +Q++     + +
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR---LQELKELLEELR 579

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
             + +K + +E++   ++L K  +EL++  +    L    +    E +
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627



 Score = 31.3 bits (71), Expect = 1.7
 Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 13/128 (10%)

Query: 485 KRLKEKLSKTEEELKKEKTN--SAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           ++LKE+L + E+ L++ K       L        +E    LKE +K  K + E+  + ++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEE 614

Query: 543 KYQKGDSD----EIKRLKEKLSKTEEELKKEKTNSAA-------LKNQADSVGKEYDRLL 591
             Q  +      E++  +E+L    E+L  +             L+ + + +  E  R L
Sbjct: 615 LLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRREL 674

Query: 592 KEHEKVQK 599
           +  E  ++
Sbjct: 675 QRIENEEQ 682


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 45.4 bits (107), Expect = 7e-05
 Identities = 32/144 (22%), Positives = 68/144 (47%), Gaps = 1/144 (0%)

Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
            R ++ +  ++  + D+   ++LKE+L K + +  K +  +   +  AD    E  +  +
Sbjct: 196 FRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255

Query: 525 EHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA-ALKNQADSV 583
           E + + K       K+D +  +    EI++ + ++ K +EE  K K + A  LK ++ + 
Sbjct: 256 EAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKAS 315

Query: 584 GKEYDRLLKEHEKVQKVVTEQGDK 607
            KE +    E +K ++ V E   K
Sbjct: 316 EKEAEDKELEAQKKREPVAEDLQK 339


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 44.2 bits (105), Expect = 1e-04
 Identities = 25/112 (22%), Positives = 50/112 (44%), Gaps = 6/112 (5%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           D  E+ R KEKL K  +E+  +      L+ +   +  + + L  +  ++   + E  +K
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA-EK 260

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
           K ++ +     EI++LKE+L   +       T     K   +++   YDR +
Sbjct: 261 KLEQCRGFTFKEIEKLKEQLKLLQ-----SLTGWKITKLSGNTLSMTYDREI 307



 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 12/100 (12%)

Query: 484 IKRLKEKLSKTEEELKKEKTN----SAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           ++ LKE L +  E LK++          L S    +    D L +E  +++++  E  D 
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
                   D  E+ R KEKL K  +E+  +      L+ +
Sbjct: 202 --------DPTELDRAKEKLKKLLQEIMIKVKKLEELEEE 233


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 42.7 bits (101), Expect = 4e-04
 Identities = 13/86 (15%), Positives = 42/86 (48%), Gaps = 1/86 (1%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           D  E+++ +++L     ++ +++     L+ +   +    + L  +  ++ + + E  +K
Sbjct: 205 DPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEA-EK 263

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEE 565
             ++ +   + EI +LK K+S  ++ 
Sbjct: 264 IREECRGWSAKEISKLKAKVSLLQKL 289



 Score = 41.6 bits (98), Expect = 8e-04
 Identities = 15/104 (14%), Positives = 45/104 (43%), Gaps = 12/104 (11%)

Query: 484 IKRLKEKLSKTEEELKKEKT----NSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           ++ LKE L +  E +K+++     +   + S    + K+   L +E   ++++  E    
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLC 204

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
                   D  E+++ +++L     ++ +++     L+ +   +
Sbjct: 205 --------DPLELEKARQELRSLSVKISEKRKQLEELQQELQEL 240


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 42.6 bits (100), Expect = 6e-04
 Identities = 28/133 (21%), Positives = 58/133 (43%), Gaps = 6/133 (4%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK----VVTE 535
           D +++K  +++L K E+ELKKEK     L+ +   +     R  +E E+ Q        E
Sbjct: 313 DEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEI--KREAEEEEEEQLEKLQEKLE 370

Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
           Q +++    +K +S+ +    +   +  E   +E+  +  L   ++          KE  
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEEL 430

Query: 596 KVQKVVTEQGDKK 608
           K+ + + E  + K
Sbjct: 431 KIVEELEESLETK 443



 Score = 33.0 bits (75), Expect = 0.55
 Identities = 16/139 (11%), Positives = 52/139 (37%), Gaps = 10/139 (7%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
             ++ +  ++  K ++  ++E    A  + +  S   + +R   + E+  K   ++  K 
Sbjct: 268 LAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKL 327

Query: 541 DDKYQKGDSDEIKRLKEKLS----------KTEEELKKEKTNSAALKNQADSVGKEYDRL 590
           + + +K   +  +  KE             + E+  K ++      +        E +RL
Sbjct: 328 EKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERL 387

Query: 591 LKEHEKVQKVVTEQGDKKD 609
               +  ++ +  + +++ 
Sbjct: 388 SSAAKLKEEELELKNEEEK 406



 Score = 30.7 bits (69), Expect = 2.6
 Identities = 26/138 (18%), Positives = 53/138 (38%), Gaps = 15/138 (10%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKE---YDRLLKEHEKVQKVVTEQGD 538
           +E    +EK  K +E LKK    +  L      + +      +L ++ +K  +    +  
Sbjct: 161 EEAAGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEK 219

Query: 539 KKD-------DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
            +          Y K + + I  L+E L   +EE++  K        + + +  +  +  
Sbjct: 220 LELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSK----QELEKEEEILAQVLKEN 275

Query: 592 KEHEKVQKVVTEQGDKKD 609
           KE EK +K+  E+     
Sbjct: 276 KEEEKEKKLQEEELKLLA 293


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 41.4 bits (97), Expect = 0.001
 Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 6/130 (4%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           E+ D  L     E    K+ LSK E +  +   N  AL S+ D + KE +RLL E E+++
Sbjct: 32  ERLDSELRDAEKERDTYKQYLSKLESQNVEIS-NYEALDSELDELKKEEERLLDELEELE 90

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
           K    + D  D +  +   ++ +   E+L     E      N+  L++   S+  +Y+  
Sbjct: 91  K----EDDDLDGELVELQEEKEQLENEELQY-LREYNLFDRNNLQLEDNLQSLELQYEYS 145

Query: 591 LKEHEKVQKV 600
           L + +K++K 
Sbjct: 146 LNQLDKLRKT 155


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 41.7 bits (99), Expect = 0.001
 Identities = 36/147 (24%), Positives = 65/147 (44%), Gaps = 31/147 (21%)

Query: 462 ESSKRC----EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK 517
           E +KR      I E+  K +  D +++  L   L + E EL+++   + AL  +A+ + +
Sbjct: 492 EIAKRLGLPENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKE 551

Query: 518 EY-----------DRLLKE-HEKVQKVVTE------QGDKKDDKYQKGD---------SD 550
           E            D+LL+E  ++ Q+ + E      +  K+  + QKG           +
Sbjct: 552 ELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIE 611

Query: 551 EIKRLKEKLSKTEEELKKEKTNSAALK 577
             KRL +   K E++ KK+K     LK
Sbjct: 612 ARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 29/141 (20%), Positives = 68/141 (48%), Gaps = 7/141 (4%)

Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHE 527
            E  +K LE    E++ L+ ++S+ E E+++  E+   A  K  A    +E   L  E +
Sbjct: 40  LEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQ 99

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
             +    E+ +  +D+  +   +EI++L++++   +E L++ + N A  + + +    E 
Sbjct: 100 IAK----ERINSLEDELAELM-EEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI 154

Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
               +E    ++ + E+ D +
Sbjct: 155 REEGQELSSKREELKEKLDPE 175



 Score = 34.3 bits (79), Expect = 0.14
 Identities = 27/131 (20%), Positives = 56/131 (42%), Gaps = 28/131 (21%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            E  RL+ ++ +  + LKK        K++ +++ K  + L  E E ++  V        
Sbjct: 17  LEKDRLEPRIKEIRKALKK-------AKAELEALNKALEALEIELEDLENQV-------- 61

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA---------ALKNQADSVGKEYDRLLK 592
                    EI+ ++E++ + EE+L   K               K + +S+  E   L++
Sbjct: 62  ----SQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELME 117

Query: 593 EHEKVQKVVTE 603
           E EK++K + +
Sbjct: 118 EIEKLEKEIED 128


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 13/97 (13%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            E+    EK+ K  EE +K K      K + D             +   K   ++ DK +
Sbjct: 65  KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKD-------------KDKDKKDDKKDDKSE 111

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
            K +K   D+++ L +  S+T   L + K    AL  
Sbjct: 112 KKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148



 Score = 35.1 bits (81), Expect = 0.050
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 12/92 (13%)

Query: 493 KTEEELKKEKTNSAALKSQADSVGKEY---DRLLKEHEKVQKVVTEQGDKKDDKYQKGDS 549
           +  E  KK+K     L  + + V KEY    +   + +K +K   +  DK     +K D 
Sbjct: 56  EYTEAKKKKK----ELAEEIEKVKKEYEEKQKWKWKKKKSKK--KKDKDKDKKDDKKDDK 109

Query: 550 DEIKRLKE---KLSKTEEELKKEKTNSAALKN 578
            E K  KE   KL    +   +  +  + LK 
Sbjct: 110 SEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 25/130 (19%), Positives = 62/130 (47%), Gaps = 6/130 (4%)

Query: 471 EQCDKGLEGD----SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
           E+  + LE         +  L+++L + EEEL++ +     L  Q  ++ K+  RL  E 
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742

Query: 527 EKVQKVVTEQGD--KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
           E++++ + +      + +   +   + ++  +E+L++ E E+++ +     LK +  ++ 
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802

Query: 585 KEYDRLLKEH 594
           +  D L  E 
Sbjct: 803 EALDELRAEL 812



 Score = 40.4 bits (95), Expect = 0.003
 Identities = 27/132 (20%), Positives = 63/132 (47%), Gaps = 9/132 (6%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +E++ L+E+L + EEEL++       L+ + + +  E   L +E E++QK +    ++  
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298

Query: 542 DKYQ---------KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
              Q              +++ L+ +L + E +L +     A L+ + + + +E + L  
Sbjct: 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA 358

Query: 593 EHEKVQKVVTEQ 604
           E E+++  + E 
Sbjct: 359 ELEELEAELEEL 370



 Score = 39.7 bits (93), Expect = 0.005
 Identities = 20/114 (17%), Positives = 53/114 (46%), Gaps = 5/114 (4%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           L     EI+ L+ ++ + +EELK  +     L+++   + +E   L +  E +++ +   
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAAT 836

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
             + +D       ++I+ L E +     E+++ +     L+++ +++  E   L
Sbjct: 837 ERRLEDL-----EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 23/134 (17%), Positives = 55/134 (41%), Gaps = 8/134 (5%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
           +I  +    LE   +E++   E+L    +EL +E    A L+ + + + +E + L  E E
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEEL---AELEEKLEELKEELESLEAELE 361

Query: 528 KVQKVVTEQGDKKDDK-----YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
           +++  + E   + ++        +    +++     L+   E L+         + +   
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421

Query: 583 VGKEYDRLLKEHEK 596
             +E  + L+E E 
Sbjct: 422 EIEELLKKLEEAEL 435



 Score = 37.3 bits (87), Expect = 0.023
 Identities = 25/131 (19%), Positives = 65/131 (49%), Gaps = 6/131 (4%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKE----KTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
             L  + +E++   E+L K  EEL ++    + + A L+++ + + +   +L KE  +++
Sbjct: 701 AELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760

Query: 531 KVVTEQGDKKDDKYQ--KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
             + E  ++ ++  +       EI+ L+ ++ + +EELK  +     L+ +   + +E  
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820

Query: 589 RLLKEHEKVQK 599
            L +  E +++
Sbjct: 821 NLRERLESLER 831



 Score = 35.8 bits (83), Expect = 0.077
 Identities = 27/122 (22%), Positives = 62/122 (50%), Gaps = 3/122 (2%)

Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQ 545
            L E+L++ EE+L++ K    +L+++ + +  E + L    E++++ +     K      
Sbjct: 334 ELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLEL 393

Query: 546 KGDS--DEIKRLKEKLSKTEEELKKEKTN-SAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
           +  S  +EI+RL+ +L + E+  ++ +      LK   ++  KE    L+E E+  + + 
Sbjct: 394 QIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQ 453

Query: 603 EQ 604
           E+
Sbjct: 454 EE 455



 Score = 35.4 bits (82), Expect = 0.11
 Identities = 22/123 (17%), Positives = 60/123 (48%), Gaps = 5/123 (4%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV--VTEQGDKKD 541
           I + KE+  +TE +L++ + N   L+   + + ++   L ++ EK ++   +  +  + +
Sbjct: 167 ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELE 226

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
                     ++ L+E+L + +EELK+ +     L  +   + ++ + L  E  ++++ +
Sbjct: 227 LALLVLR---LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEI 283

Query: 602 TEQ 604
            E 
Sbjct: 284 EEL 286



 Score = 35.0 bits (81), Expect = 0.14
 Identities = 27/133 (20%), Positives = 61/133 (45%), Gaps = 8/133 (6%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           + LE   +E++    +L +  EEL+KE    A   S+ +   +     L   E+  + + 
Sbjct: 263 QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
            Q ++ + K        +  L E+L++ EE+L++ K    +L+ + + +  E + L    
Sbjct: 323 AQLEELESK--------LDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374

Query: 595 EKVQKVVTEQGDK 607
           E++++ +     K
Sbjct: 375 EELEEQLETLRSK 387



 Score = 33.1 bits (76), Expect = 0.46
 Identities = 15/120 (12%), Positives = 53/120 (44%), Gaps = 12/120 (10%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEE-------ELKKEKTNSAALKSQADSVGKEYDRLL 523
           E  ++ +      ++ L+E++ +  E       E+++ +     L+S+ +++  E     
Sbjct: 827 ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA--- 883

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
              E+   ++  + ++  ++  +    +   L+ +L +  E+L + +     L+ + D++
Sbjct: 884 -SLEEALALLRSELEELSEE-LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL 941


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 22/94 (23%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 483 EIKRLKEKLSKTEEELK-KEKTNSAALKSQADSVGKEYDRLLKE-HEKVQKVVTEQGDKK 540
           EIK LK +L   +   K  +K      + +       Y+ +L E  ++++K + E     
Sbjct: 47  EIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDS 106

Query: 541 DDKYQ--KGDSDEIKRLKEKLSKTEEELKKEKTN 572
           +   +  K   D++K+ +++L K  +EL+KE+  
Sbjct: 107 EGLLEELKKHRDKLKKEQKELRKKLDELEKEEKK 140


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 40.8 bits (95), Expect = 0.002
 Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 28/156 (17%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNS-----------AALKS 510
           E SK      +    LE    E+++++EK  K E+ELK+ + N               +S
Sbjct: 501 EVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRS 560

Query: 511 QADSVG--KEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
           +ADS G   E DRLL   EK ++       ++DD        E  RL ++L K +  L K
Sbjct: 561 RADSAGARNEVDRLLDRLEKAEQ-------ERDDT-----EMEAGRLAKELEKAQRHLTK 608

Query: 569 EKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
           ++  + A + + +    E   LL+E E+++K   E+
Sbjct: 609 QQEKTEATRIEFERKSAE---LLEEAERLEKSEAEE 641



 Score = 36.9 bits (85), Expect = 0.034
 Identities = 33/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQ--ADSVGKEYDRLLKEHEK 528
           E+  + L+   +EIK+L E L       K E+ +S  L     A+      + LL++ EK
Sbjct: 145 ERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLGRIREAEDQQSHLEVLLEQKEK 204

Query: 529 VQKVVTEQGDKK-----DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
              ++ E+  +K     DD   +     I     K+ + E+ L+K +     L+++ D+ 
Sbjct: 205 EHMMLREEIHQKLQMERDDAKTEASQKLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTS 264

Query: 584 GKEYDRLLKEHE 595
             + +RL KE E
Sbjct: 265 EADRNRLDKEVE 276



 Score = 35.8 bits (82), Expect = 0.087
 Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 11/101 (10%)

Query: 508 LKSQADSVGKEYDRLLKEHE----KVQKVVTE------QGDKKDDKYQKGDSDEIKRLKE 557
           L++Q ++  +E + L KE E    K+   +           K++   +K ++  I   KE
Sbjct: 1   LQAQLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKE 60

Query: 558 KLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           +L  T+EE +  +    AL+ +  S   E +RL ++ EK  
Sbjct: 61  QLRVTQEENQHAQLTVQALQEELKSQR-ELNRLQQKLEKSG 100



 Score = 35.8 bits (82), Expect = 0.087
 Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 9/94 (9%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-----QG 537
           E  RL ++L K +  L K++  + A + + +    E   LL+E E+++K   E     Q 
Sbjct: 591 EAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAE---LLEEAERLEKSEAEEETLRQS 647

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
            +         +     + +KL    ++L+ E+ 
Sbjct: 648 TQIGHAQAAAHNHIEHHV-QKLESDLKQLRAERE 680



 Score = 32.3 bits (73), Expect = 0.81
 Identities = 35/158 (22%), Positives = 66/158 (41%), Gaps = 16/158 (10%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-------KSQADS 514
           +     E FE   K  E   +E++ L+ KLS+ E +L+  K   + L       +S  + 
Sbjct: 459 DERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLER 518

Query: 515 VGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSD-----EIKRLKEKLSKTEEELKKE 569
              E +++ ++HEK++K    +  +     +  D          R +   +    E+ + 
Sbjct: 519 AHIELEKIREKHEKLEK----ELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL 574

Query: 570 KTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
                  + + D    E  RL KE EK Q+ +T+Q +K
Sbjct: 575 LDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEK 612



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 25/107 (23%), Positives = 55/107 (51%), Gaps = 6/107 (5%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELK--KEKTNSAALKSQADSVGKEY 519
           E     + +E+ ++ L     +I+ L+E   + E  LK  KE+  S    +  D+  ++ 
Sbjct: 379 EIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKL 438

Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
           ++ L E E++ + + EQ D+ D++Y+    +E +  K++    +EE+
Sbjct: 439 EKALAEKERIIERLKEQRDR-DERYE---QEEFETYKKEFEDLKEEV 481



 Score = 30.4 bits (68), Expect = 3.2
 Identities = 22/128 (17%), Positives = 59/128 (46%), Gaps = 5/128 (3%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
            + DK +E +   +  +K K  +  +EL ++KT    L+++ +++  +   + +  +K++
Sbjct: 269 NRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLK 328

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
           + +T    +K          E+  L+ +L +    L K+  +  A + +  +   E + +
Sbjct: 329 EDLTRAEQEKAIL-----QTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDM 383

Query: 591 LKEHEKVQ 598
              +EK +
Sbjct: 384 RDRYEKTE 391



 Score = 29.6 bits (66), Expect = 5.9
 Identities = 26/130 (20%), Positives = 56/130 (43%), Gaps = 5/130 (3%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           +K LE   +EI RL+ +   +E +  +      A +S   ++  + DR  +E  + +  +
Sbjct: 244 EKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTEL 303

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
                + +      DSD    +++ L K +E+L + +   A L+ + D++  E +R    
Sbjct: 304 LGLQTELETL-ANQDSD----MRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNT 358

Query: 594 HEKVQKVVTE 603
             K    +  
Sbjct: 359 LTKKTASLQA 368



 Score = 29.2 bits (65), Expect = 7.5
 Identities = 21/139 (15%), Positives = 48/139 (34%), Gaps = 1/139 (0%)

Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEEL-KKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
            E+    L    + IK       K E  L K E       K Q     +E        + 
Sbjct: 19  LERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLRVTQEENQHAQLTVQA 78

Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
           +Q+ +  Q +    + +   S       ++L K+E +  + +        +   +    +
Sbjct: 79  LQEELKSQRELNRLQQKLEKSGNETGELQQLEKSEGQFSRIQAERERQAMELFLLRDTLE 138

Query: 589 RLLKEHEKVQKVVTEQGDK 607
           +L    E++Q+ +  + ++
Sbjct: 139 KLQLRMERLQQTLDARDEE 157


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 40.3 bits (95), Expect = 0.002
 Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 32/141 (22%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
           E+ ++     E   + I +  + L+  E+ ++ E+  + A +++ +        LL+E +
Sbjct: 176 EVLQEFLNSKEAVEEAILQTDQALTAKEKAIEAERAKAEAAEAEQE--------LLREKQ 227

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
           K ++ + E    ++  YQ    + +K+L EK+    E+L  E+                 
Sbjct: 228 KEEEQMMEA---QERSYQ----EHVKQLIEKMEAEREKLLAEQ----------------- 263

Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
           +R+L+   + Q+ + ++G K 
Sbjct: 264 ERMLEHKLQEQEELLKEGFKT 284



 Score = 29.9 bits (68), Expect = 3.7
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 22/76 (28%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG-DKK 540
           + +K+L EK+    E+L  E+                 +R+L+   + Q+ + ++G   +
Sbjct: 243 EHVKQLIEKMEAEREKLLAEQ-----------------ERMLEHKLQEQEELLKEGFKTE 285

Query: 541 DDKYQKGDSDEIKRLK 556
            +  QK    EI+ LK
Sbjct: 286 AESLQK----EIQDLK 297


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 40.5 bits (96), Expect = 0.003
 Identities = 21/93 (22%), Positives = 50/93 (53%), Gaps = 5/93 (5%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +E+++L+++L + EE L ++       + + +   KE ++  +E EK ++ + E  +++ 
Sbjct: 82  NELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQL 141

Query: 542 DKYQK--G-DSDEIKRLKEKLSKTEEELKKEKT 571
            + ++  G  ++E K  +  L K EEE + E  
Sbjct: 142 QELERISGLTAEEAK--EILLEKVEEEARHEAA 172



 Score = 39.0 bits (92), Expect = 0.008
 Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 26/111 (23%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK--VQKVVTEQGDKKDDKYQ 545
           KE L + +EE+ K       L+++ +   +E    L++ EK  +QK   E  D+K     
Sbjct: 56  KEALLEAKEEIHK-------LRNEFEKELRERRNELQKLEKRLLQK--EENLDRK----- 101

Query: 546 KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
                      E L K EEEL+K++      + + +   +E + L++E  +
Sbjct: 102 ----------LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142



 Score = 38.2 bits (90), Expect = 0.011
 Identities = 28/129 (21%), Positives = 63/129 (48%), Gaps = 24/129 (18%)

Query: 485 KRLKEKLSKTEEE----LKKEKTNSAALKSQADSVGK-EYDRLLKEHEKVQKVVTEQGDK 539
           K  + K+ + EEE    L++ K  + A+K +A    K E  +L  E EK         + 
Sbjct: 27  KIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK---------EL 77

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE----HE 595
           ++ +      +E+++L+++L + EE L ++       + + +   KE ++  +E     E
Sbjct: 78  RERR------NELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEE 131

Query: 596 KVQKVVTEQ 604
           ++++++ EQ
Sbjct: 132 ELEELIEEQ 140


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 38.8 bits (91), Expect = 0.004
 Identities = 24/143 (16%), Positives = 56/143 (39%), Gaps = 9/143 (6%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           ++ L+   D  K L++ +   E EL K +   A   ++   + ++ +      EK+++  
Sbjct: 16  NELLDKAEDPEKMLEQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEEKA 75

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
                  ++   +   +E + L++     E EL++ +     LK Q  ++ ++   L  +
Sbjct: 76  ELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAK 135

Query: 594 HE---------KVQKVVTEQGDK 607
            E         K Q+ V      
Sbjct: 136 KEALKARKAAAKAQEKVNRSLGG 158


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.8 bits (93), Expect = 0.004
 Identities = 33/144 (22%), Positives = 61/144 (42%), Gaps = 18/144 (12%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
            I EQ         +EI  L EKLS  E+EL+++  +               ++LLKE E
Sbjct: 497 FIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHL--------------EKLLKEQE 542

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
           K++K + ++ ++  ++ +    +  K  +E L   ++E++         K       K  
Sbjct: 543 KLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSI 602

Query: 588 DRLLK----EHEKVQKVVTEQGDK 607
           + L+K    + +  QK    Q DK
Sbjct: 603 EDLVKLKETKQKIPQKPTNFQADK 626


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 39.3 bits (92), Expect = 0.005
 Identities = 22/99 (22%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
           AA  S A    K+  ++ KE   ++K + EQ D++  K +K    ++K L+ +++  E +
Sbjct: 28  AAAFSAAAD-DKQLKQIQKEIAALEKKIREQQDQRA-KLEK----QLKSLETEIASLEAQ 81

Query: 566 LKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
           L +   +   L+ Q   +    + L  +  + ++ + EQ
Sbjct: 82  LIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ 120



 Score = 35.5 bits (82), Expect = 0.088
 Identities = 19/140 (13%), Positives = 48/140 (34%), Gaps = 18/140 (12%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            ++K L+ +++  E +L +   +   L+ Q   +    + L  +  + ++ + EQ     
Sbjct: 66  KQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAA-- 123

Query: 542 DKYQKG------------DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
              + G            D+    RL            +      ALK     +      
Sbjct: 124 -LQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERI---DALKATLKQLAAVRAE 179

Query: 590 LLKEHEKVQKVVTEQGDKKD 609
           +  E  ++  +++EQ  ++ 
Sbjct: 180 IAAEQAELTTLLSEQRAQQA 199



 Score = 30.8 bits (70), Expect = 2.3
 Identities = 21/99 (21%), Positives = 40/99 (40%), Gaps = 1/99 (1%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           D +K   ++L+    E+  E+     L S+  +   +  +LL+E +K            D
Sbjct: 164 DALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLA-QLNSELSAD 222

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
            K  +       RLK +++  E    K +  +AA +  A
Sbjct: 223 QKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAA 261


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 16/105 (15%)

Query: 465 KRCEIFEQCDKG-LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL 523
           KR  I  Q +     G+   I  L++ L     E++   T++              +R  
Sbjct: 25  KRQAIETQIEYAKAHGNQHRIAGLEKALK----EVQAHCTDAGLRA----------ERQA 70

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
           K  EK Q+V   + + K+ +  KGD+D+I++ + KL++ + EL++
Sbjct: 71  KVEEKEQEVAEREQELKEAQ-AKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|227599 COG5274, CYB5, Cytochrome b involved in lipid metabolism [Energy
           production and conversion / Lipid metabolism].
          Length = 164

 Score = 37.5 bits (87), Expect = 0.007
 Identities = 25/105 (23%), Positives = 39/105 (37%), Gaps = 12/105 (11%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYD--RLLKEHEKVQKVVTEQGDKK 540
           +I   +     T EE+ K    S       +  GK YD  + L EH   + ++ +   K 
Sbjct: 42  DILNDESPKPITAEEVAKHN-KSEDCWIVIN--GKVYDVSQFLDEHPGGEDIIKDTAGKD 98

Query: 541 D-DKYQKGD-SDEIKRLKEK-----LSKTEEELKKEKTNSAALKN 578
               +     S +I  L +      + + EEEL   KT    LK 
Sbjct: 99  ATKAFNFLHHSHQIGNLLKDVYVDQVHRPEEELPLFKTAWYPLKV 143


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 38.6 bits (90), Expect = 0.010
 Identities = 30/108 (27%), Positives = 53/108 (49%), Gaps = 6/108 (5%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE-----HEKVQKVVT 534
             DE++R+ ++  K   E  KEK     LK+QA+S+  + ++ LKE      E+ ++ + 
Sbjct: 503 PKDEVERMVKEAEKNAAE-DKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIE 561

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
               K     Q  + + IK L E+L K   E+ KE  +S +  + A +
Sbjct: 562 NLIKKLRQALQNDNYESIKSLLEELQKALMEIGKEVYSSTSTTDPASN 609



 Score = 30.5 bits (69), Expect = 2.9
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 10/65 (15%)

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE---------HEKVQ 598
             DE++R+ ++  K   E  KEK     LKNQA+S+  + ++ LKE          EK++
Sbjct: 503 PKDEVERMVKEAEKNAAE-DKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIE 561

Query: 599 KVVTE 603
            ++ +
Sbjct: 562 NLIKK 566



 Score = 30.1 bits (68), Expect = 4.3
 Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 11/124 (8%)

Query: 491 LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDS- 549
           LS T ++    K  S  ++  +     E +R++KE EK      E+ +K D K Q  +S 
Sbjct: 479 LSVTAKDKGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKIDLKNQ-AESL 537

Query: 550 -----DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
                 ++K LK+K+S+ ++E  K +     L+    +    Y+ +    E++QK + E 
Sbjct: 538 CYQAEKQLKELKDKISEEKKE--KIENLIKKLRQALQN--DNYESIKSLLEELQKALMEI 593

Query: 605 GDKK 608
           G + 
Sbjct: 594 GKEV 597


>gnl|CDD|191748 pfam07464, ApoLp-III, Apolipophorin-III precursor (apoLp-III).
           This family consists of several insect
           apolipoprotein-III sequences. Exchangeable
           apolipoproteins constitute a functionally important
           family of proteins that play critical roles in lipid
           transport and lipoprotein metabolism. Apolipophorin III
           (apoLp-III) is a prototypical exchangeable
           apolipoprotein found in many insect species that
           functions in transport of diacylglycerol (DAG) from the
           fat body lipid storage depot to flight muscles in the
           adult life stage.
          Length = 155

 Score = 36.8 bits (85), Expect = 0.010
 Identities = 18/87 (20%), Positives = 41/87 (47%), Gaps = 5/87 (5%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQ-ADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           +++L++ + +T EEL+K          +  D +       ++E +K+ K V    +  ++
Sbjct: 65  LEQLRQNIEETAEELRKAHPEVEQQAQELRDKLQAAVQSTVQESQKLAKEVASNVEGTNE 124

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
           K       +IK+  +   K  EE++K+
Sbjct: 125 KLA----PKIKQAYDDFVKNAEEVQKK 147


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 38.6 bits (90), Expect = 0.010
 Identities = 32/138 (23%), Positives = 60/138 (43%), Gaps = 18/138 (13%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE----KVQKVVTEQ 536
             EIK L  K+ +  EEL   +  +  L+ +     +  + LL + E    K+Q +V   
Sbjct: 290 LSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVE-LLPDAENNVAKLQALVVAS 348

Query: 537 GDKKD------DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV------- 583
            ++        + ++    DE + LKEK    E+E +++      L+N+ + +       
Sbjct: 349 SERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTK 408

Query: 584 GKEYDRLLKEHEKVQKVV 601
            + Y +LL E+E   K V
Sbjct: 409 EQLYKQLLDEYENAPKSV 426


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 38.4 bits (89), Expect = 0.012
 Identities = 32/148 (21%), Positives = 70/148 (47%), Gaps = 15/148 (10%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           L+ + +     +E     E+ +    T+ A LK+Q D++ ++    +K  +K++ +  + 
Sbjct: 245 LKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKW 304

Query: 537 GDKKDD--KYQ------KGDSDE----IKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
              K D  KY+      K  S E    +++LK ++   EEE+K  ++N   L  Q    G
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364

Query: 585 ---KEYDRLLKEHEKVQKVVTEQGDKKD 609
              ++++ + +E EK+ + + +   + D
Sbjct: 365 ISTEQFELMNQEREKLTRELDKINIQSD 392



 Score = 33.4 bits (76), Expect = 0.43
 Identities = 25/129 (19%), Positives = 54/129 (41%), Gaps = 9/129 (6%)

Query: 470 FEQCDKGLEGDSDE----IKRLKEKLSKTEEELKKEKTNSAALKSQADSVG---KEYDRL 522
           +E     ++  S E    +++LK ++   EEE+K  ++N   L  Q    G   ++++ +
Sbjct: 314 YENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELM 373

Query: 523 LKEHEKVQKVVTEQGDKKDD--KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
            +E EK+ + + +   + D   K  K    E + + + L KT  +      N    ++Q 
Sbjct: 374 NQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQI 433

Query: 581 DSVGKEYDR 589
                +   
Sbjct: 434 GHNVNDSSL 442


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 37.4 bits (87), Expect = 0.014
 Identities = 31/147 (21%), Positives = 70/147 (47%), Gaps = 12/147 (8%)

Query: 464 SKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL 523
            +R EI E+  +  E   +   +L+E   +  E  +K    +   +S   S+ +E +RL 
Sbjct: 62  EERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSI-KSLEREIERLE 120

Query: 524 -----------KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
                      +E E VQK+   + + +D K    +++++K LK ++ + +++ ++    
Sbjct: 121 KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180

Query: 573 SAALKNQADSVGKEYDRLLKEHEKVQK 599
              L N+A    +E  +L +E ++++K
Sbjct: 181 IQELANEAQEYHEEMIKLFEEADELRK 207



 Score = 37.0 bits (86), Expect = 0.020
 Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 17/132 (12%)

Query: 483 EIKRLKEKLSKTEEELK-KEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +IK L+++L   ++ L+  EK     LK++ D + K+   +   HEK+Q++  E  +  +
Sbjct: 139 KIKELRKELEDAKKALEENEKL--KELKAEIDELKKKAREI---HEKIQELANEAQEYHE 193

Query: 542 D---KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE---YDRLLKEHE 595
           +    +++ D      L+++  +  EE  +       L  +  ++  E    ++ +K   
Sbjct: 194 EMIKLFEEAD-----ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALR 248

Query: 596 KVQKVVTEQGDK 607
             +K    +  +
Sbjct: 249 AKEKAAKRREKR 260



 Score = 32.3 bits (74), Expect = 0.56
 Identities = 27/123 (21%), Positives = 52/123 (42%), Gaps = 22/123 (17%)

Query: 462 ESSKRCEIFEQCD---KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKE 518
           E+ K  E+  + D   K      ++I+ L  +  +  EE+ K       L  +AD + KE
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIK-------LFEEADELRKE 208

Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
            D L +E  ++ K + E              +E + L+ +L + E+++K  +    A K 
Sbjct: 209 ADELHEEFVELSKKIDEL------------HEEFRNLQNELRELEKKIKALRAKEKAAKR 256

Query: 579 QAD 581
           +  
Sbjct: 257 REK 259



 Score = 31.2 bits (71), Expect = 1.4
 Identities = 28/142 (19%), Positives = 61/142 (42%), Gaps = 19/142 (13%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAA-----------LKSQADSVGKEYDRLLKE 525
           LE    ++K   E+L +  +EL+KE +  A            L+ +A  + +E D + +E
Sbjct: 11  LELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEE 70

Query: 526 HEKVQKVVTEQGDKKDDKYQK--------GDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
            +++++   E   K  +  ++         + +   R  + L +  E L+K++  S    
Sbjct: 71  VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTP 130

Query: 578 NQADSVGKEYDRLLKEHEKVQK 599
            +   + ++   L KE E  +K
Sbjct: 131 EEERELVQKIKELRKELEDAKK 152


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 37.9 bits (88), Expect = 0.015
 Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
             DE++R+ ++  K  +E  KEK ++   K+QADSV  + ++ LKE     KV  +  +K
Sbjct: 540 PKDEVERMVQEAEKFAKE-DKEKRDAVDTKNQADSVVYQTEKQLKELG--DKVPADVKEK 596

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
            + K        +K LK+ ++      +K K   AAL  +   +G+
Sbjct: 597 VEAK--------LKELKDAIAS--GSTQKMKDAMAALNQEVMQIGQ 632



 Score = 34.8 bits (80), Expect = 0.13
 Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE-HEKVQKVVTEQGD 606
             DE++R+ ++  K  +E  KEK ++   KNQADSV  + ++ LKE  +KV   V E+ +
Sbjct: 540 PKDEVERMVQEAEKFAKE-DKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVE 598

Query: 607 KK 608
            K
Sbjct: 599 AK 600


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 35.7 bits (83), Expect = 0.015
 Identities = 27/118 (22%), Positives = 46/118 (38%), Gaps = 27/118 (22%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            E+KRL EK    EE+++K + +       A+   ++Y+R L +H               
Sbjct: 10  AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKH--------------- 54

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
                          E+L    ++L + K   A LK +A+S   E     +E  + QK
Sbjct: 55  -----------AEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSE-AEESWEEQK 100


>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H).  This family
           consists of several eukaryotic centromere protein H
           (CENP-H) sequences. Macromolecular
           centromere-kinetochore complex plays a critical role in
           sister chromatid separation, but its complete protein
           composition as well as its precise dynamic function
           during mitosis has not yet been clearly determined.
           CENP-H contains a coiled-coil structure and a nuclear
           localisation signal. CENP-H is specifically and
           constitutively localised in kinetochores throughout the
           cell cycle. CENP-H may play a role in kinetochore
           organisation and function throughout the cell cycle.
           This the C-terminus of the region, which is conserved
           from fungi to humans.
          Length = 106

 Score = 35.4 bits (82), Expect = 0.016
 Identities = 24/90 (26%), Positives = 34/90 (37%), Gaps = 20/90 (22%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
           S EI  L++   +T EEL   +     LK              K  E   +++     K+
Sbjct: 2   SLEILNLQQASRETREELSDLEKERLQLK-------------QKNVELALELLELTKKKE 48

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
                     E   LKE+L K E +LKK K
Sbjct: 49  SW-------REDMELKEQLEKLEADLKKSK 71


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 36.3 bits (84), Expect = 0.020
 Identities = 21/121 (17%), Positives = 29/121 (23%), Gaps = 14/121 (11%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           D        L K   E  K+           +    E   L +E E +          +D
Sbjct: 66  DAAADQLYALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGIL----RSKQAED 121

Query: 542 DK----YQKGDSDEIKRLKEKLSKTEEELK-----KEKTNSAALKNQADSVGKEYDRLLK 592
            K    Y K   D    + E L   EE        K +     L             L+ 
Sbjct: 122 GKLVKIYSKMKPDAAAAILENL-PDEEAAAILMKLKPRKLGLILAKMDPEKAATLTELIA 180

Query: 593 E 593
            
Sbjct: 181 S 181


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 35.7 bits (83), Expect = 0.033
 Identities = 25/83 (30%), Positives = 49/83 (59%), Gaps = 7/83 (8%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADS--VGKEY----DRLLKEHEKVQKVVT 534
           S  + +LK +L K ++EL++ K   A L++Q +    G+E       LL+E ++++K + 
Sbjct: 61  SQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120

Query: 535 EQGDKKDDKYQKGDSDEIKRLKE 557
           +    + +KY+K D + I++LKE
Sbjct: 121 KL-KAELEKYEKNDPERIEKLKE 142



 Score = 32.2 bits (74), Expect = 0.49
 Identities = 15/71 (21%), Positives = 34/71 (47%), Gaps = 7/71 (9%)

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT------NSAALKNQADSVGKEYD 588
           +  +K   + +K    E++ LK+++++ + +++K K           L  +   + KE  
Sbjct: 62  QALNKLKTRLEKLKK-ELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120

Query: 589 RLLKEHEKVQK 599
           +L  E EK +K
Sbjct: 121 KLKAELEKYEK 131


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 36.9 bits (86), Expect = 0.035
 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 11/89 (12%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           DE+ R   +L    E LKKEK    A K + + + KE    L E E+    + EQ     
Sbjct: 408 DELDRRIIQLEIEREALKKEK--DEASKERLEDLEKE----LAELEEEYADLEEQWKA-- 459

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
              +K     I+++KE++ +   EL++ +
Sbjct: 460 ---EKAAIQGIQQIKEEIEQVRLELEQAE 485


>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
          Length = 643

 Score = 36.7 bits (85), Expect = 0.036
 Identities = 38/138 (27%), Positives = 56/138 (40%), Gaps = 27/138 (19%)

Query: 472 QCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYD---RLLKEHEK 528
           Q D   E  S +   + E    T  EL++     AA +    + G++ D   RL    E 
Sbjct: 61  QDDGDDEPVSPKRIAIDEVKGMTVRELRE-----AASERGLATTGRKKDLLERLCAALES 115

Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
             KV +  G  +D+K + GD              EE  K+EK  +A  K +A       D
Sbjct: 116 DVKVGSANGTGEDEKEKGGD--------------EEREKEEKIVTATKKGRA-----VLD 156

Query: 589 RLLKEHEKVQKVVTEQGD 606
           + L +H K    V E+GD
Sbjct: 157 QWLPDHIKSNYHVLEEGD 174


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 36.4 bits (85), Expect = 0.037
 Identities = 26/96 (27%), Positives = 45/96 (46%), Gaps = 22/96 (22%)

Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-------QGDKKD 541
           E L +  EEL+   ++   +         EY +L KE+ +++ +V +       Q D +D
Sbjct: 10  ESLLERYEELEALLSDPEVISDP-----DEYRKLSKEYAELEPIVEKYREYKKAQEDLED 64

Query: 542 DK---YQKGDSD-------EIKRLKEKLSKTEEELK 567
            K    ++ D +       EIK L+ K+ + EEELK
Sbjct: 65  AKEMLAEEKDPEMREMAEEEIKELEAKIEELEEELK 100


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 35.6 bits (83), Expect = 0.038
 Identities = 31/127 (24%), Positives = 58/127 (45%), Gaps = 34/127 (26%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAA-----LKSQADSVGKEYDRLLKEHEK--VQKVVTEQG 537
           K++ E+  K  E LKKE    A      L+++A+   KE    L+  EK  +QK   E  
Sbjct: 37  KQIIEEAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQK--EETL 94

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
           D+KD+         +++ +E L + E+EL   +                  +L ++ E++
Sbjct: 95  DRKDES--------LEKKEESLEEKEKELAARQ-----------------QQLEEKEEEL 129

Query: 598 QKVVTEQ 604
           ++++ EQ
Sbjct: 130 EELIEEQ 136


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 35.8 bits (83), Expect = 0.039
 Identities = 28/144 (19%), Positives = 57/144 (39%), Gaps = 28/144 (19%)

Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
           KR ++ ++  + +E   +E K  K+ + K E++  K+K              K+  +++K
Sbjct: 80  KREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK-------------AKQLVKVMK 126

Query: 525 EHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
           E  K         DK    Y+K     +   KE        LK +      L  +  +V 
Sbjct: 127 ERYKSY-------DKLYKAYKKA----LNLEKELYEY----LKDKDLTLKELDEKIKAVN 171

Query: 585 KEYDRLLKEHEKVQKVVTEQGDKK 608
           + Y+++ K  +K  +   +   +K
Sbjct: 172 QSYEKIQKAKKKFNEYTKKYNKEK 195



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 27/129 (20%), Positives = 54/129 (41%), Gaps = 6/129 (4%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
                + + E+L K  E+ K  +     L+       + YD++++  +   + + +  D+
Sbjct: 14  GKKPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQ 73

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKK-----EKTNSAALKNQADSVGKEYDRLLKEH 594
                 K +   +K+ KE + K+EEE K      EK     LK +A  + K      K +
Sbjct: 74  ALANVDKRE-KLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSY 132

Query: 595 EKVQKVVTE 603
           +K+ K   +
Sbjct: 133 DKLYKAYKK 141


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.4 bits (85), Expect = 0.047
 Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 21/89 (23%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
              E K+L++K  K E++ +KE+   AA K +A           +   K  K    +  K
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA-----------EAAAKKAKGPDGETKK 455

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
            D             L EKL++TE+ L++
Sbjct: 456 VDP----------DPLGEKLARTEDPLEE 474



 Score = 31.1 bits (71), Expect = 2.0
 Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 11/62 (17%)

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
              E K+L++K  K E++ +KE+   AA K +A           +   K  K    +  K
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA-----------EAAAKKAKGPDGETKK 455

Query: 608 KD 609
            D
Sbjct: 456 VD 457


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 36.4 bits (84), Expect = 0.047
 Identities = 32/139 (23%), Positives = 66/139 (47%), Gaps = 13/139 (9%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
             S + +  E+ D+  E   + I R+K  LS+ +   +   T    L+S+ + + ++ + 
Sbjct: 146 AESLKLQN-EKEDQLKEA-KESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEE 203

Query: 522 LLKE----HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
             KE     +K+Q + +EQ    D+      S +IK L+E+L + E++ +  K+    L 
Sbjct: 204 CQKELAEAEKKLQSLTSEQASSADN------SVKIKHLEEELKRYEQDAEVVKSMKEQLL 257

Query: 578 NQADSVGKEYDRLLKEHEK 596
              + + +E   L +E+ K
Sbjct: 258 QIPE-LERELAALREENRK 275



 Score = 32.6 bits (74), Expect = 0.79
 Identities = 34/138 (24%), Positives = 59/138 (42%), Gaps = 21/138 (15%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV---TEQGDK 539
           EIK LKE+L   E  L   K  + AL+ + +++ +E DRL +E   ++  +     QGD 
Sbjct: 483 EIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGDY 542

Query: 540 KDD-------------KYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADS- 582
                           + ++     I+ L+ +  K +E L+     K+    L+    S 
Sbjct: 543 NASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQALEEGKSQPGDLEKAVGSH 602

Query: 583 -VGKEYDRLLKEHEKVQK 599
              KE  +L K+ E  +K
Sbjct: 603 ISSKEIAQLKKQVESAEK 620



 Score = 31.1 bits (70), Expect = 2.0
 Identities = 25/100 (25%), Positives = 49/100 (49%), Gaps = 8/100 (8%)

Query: 483 EIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
           E+ R K KL + E EL +     K     L+ +A ++ + Y+R L  + +++ V  +  +
Sbjct: 62  ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELE-VRLKALE 120

Query: 539 KKDDKYQKGDS---DEIKRLKEKLSKTEEELKKEKTNSAA 575
           + + K +   +   +E K LK+KL     +L+ EK +   
Sbjct: 121 ELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLK 160



 Score = 30.3 bits (68), Expect = 3.9
 Identities = 22/109 (20%), Positives = 48/109 (44%), Gaps = 10/109 (9%)

Query: 460 WCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEY 519
             E+ K+ +           +S +IK L+E+L + E++ +  K+     K Q   + +  
Sbjct: 208 LAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSM----KEQLLQIPELE 263

Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
             L    E+ +K+ + + D +  K +  D      L+ +L + E+  +K
Sbjct: 264 RELAALREENRKLRSMKEDNELLKEELED------LQSRLERFEKMREK 306


>gnl|CDD|200533 cd11272, Sema_plexin_A2, The Sema domain, a protein interacting
           module, of Plexin A2.  Plexin A2 serves as a receptor
           for class 6 semaphorins. Interactions between Plexin A2,
           A4 and semaphorins 6A and 6B control the
           lamina-restricted projection of hippocampal mossy
           fibers. Sema6B also repels the growth of mossy fibers in
           a Plexin A4 dependent manner. Plexin A2 does not
           suppress Sema6B function. In addition, studies have
           shown that Plexin A2 may be related to anxiety and other
           psychiatric disorders. The Sema domain is located at the
           N-terminus and contains four disulfide bonds formed by
           eight conserved cysteine residues. It serves as a
           ligand-recognition and -binding module.
          Length = 515

 Score = 36.1 bits (83), Expect = 0.053
 Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 1/28 (3%)

Query: 279 CNTYSSCTQCVSSDFP-CDWCVDGHRCT 305
           C  Y++C +C+SS  P C WC   + C+
Sbjct: 474 CEQYTTCGECLSSGDPHCGWCALHNMCS 501


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 36.2 bits (84), Expect = 0.056
 Identities = 28/126 (22%), Positives = 54/126 (42%), Gaps = 16/126 (12%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           LE + +    ++E L    + L+K K N+  LK + + V + Y   L E E         
Sbjct: 297 LEREVEAKNVVEENLPILPDYLEKAKENNEHLKEEIERVKESY--RLAETELG------- 347

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
                    +    E+K L+  L +  E ++ ++   + L++  + + K    +  E EK
Sbjct: 348 -------SVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEK 400

Query: 597 VQKVVT 602
           VQ+ +T
Sbjct: 401 VQEHLT 406



 Score = 34.3 bits (79), Expect = 0.21
 Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 15/123 (12%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQ--ADSVGKEYDRLLKEHE-KVQKVVTEQGDK 539
            ++RLKE+L +  E L + + + A  +     + +   YD L  E E + + VV E    
Sbjct: 256 RLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDLL--EREVEAKNVVEENLPI 313

Query: 540 KDDKYQKGDS------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
             D  +K         +EI+R+KE     E EL   +      + +   +    D +L+ 
Sbjct: 314 LPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVR----KFEKELKELESVLDEILEN 369

Query: 594 HEK 596
            E 
Sbjct: 370 IEA 372


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 35.9 bits (83), Expect = 0.056
 Identities = 21/116 (18%), Positives = 54/116 (46%), Gaps = 6/116 (5%)

Query: 483 EIKRLKEKLSKTEEELKKE-----KTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
           E+ +L+E+L   ++E K       K    AL+ +   + +  + L  E +K  +++ ++ 
Sbjct: 65  ELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKL-EALELLNLEKDKELELLEKEL 123

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
           D+   + QK   +  + +++K    + E + +  N   L+   +   ++++  L E
Sbjct: 124 DELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHE 179


>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
           unknown].
          Length = 543

 Score = 35.9 bits (83), Expect = 0.057
 Identities = 33/140 (23%), Positives = 51/140 (36%), Gaps = 31/140 (22%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------SVGKEYDRLLKEHEKVQKVVT 534
           + +K L E++S  EE LKK      AL               KE       + K+     
Sbjct: 410 ERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNTKITL--- 466

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKNQADS-----VGKEYD 588
                          + I+++K +L   +EEL+      S ++KNQ  S      G  Y 
Sbjct: 467 --------------RNSIEKIKAELEGLQEELEVVGIEGSVSVKNQVYSGVKIYFGDAYQ 512

Query: 589 RLLKEHEKVQKVVTEQGDKK 608
             +KEH  V +     G+ K
Sbjct: 513 ETIKEHGNV-EFYLRDGEIK 531


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 35.8 bits (83), Expect = 0.062
 Identities = 30/155 (19%), Positives = 56/155 (36%), Gaps = 27/155 (17%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEY--------------------D 520
           +DE+  L   +      L K  T +A +KS+ +   K                      D
Sbjct: 240 TDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPD 299

Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKN 578
           R+ K  +K+ K +    +K D    + +   DE     +KL + + ++   K +   L +
Sbjct: 300 RITKIKDKL-KELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVD 358

Query: 579 QADSVGKEYDRLLKEH----EKVQKVVTEQGDKKD 609
           +A  V    + L  E     E++ K+  E      
Sbjct: 359 KAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVK 393



 Score = 31.1 bits (71), Expect = 2.0
 Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 32/112 (28%)

Query: 472 QCDKGLEGDSDEIKRLKEK------LSKTEEELK----KEKTNSAALKSQADSVGKEYDR 521
           +    LE     I  L+E        SK   ELK      K +   L  +A  V    + 
Sbjct: 310 ELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEE 369

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL---KEKLSKTEEELKKEK 570
           L             Q +  D+       +E+ +L    +K+ KT+ EL KEK
Sbjct: 370 L-------------QAEFVDNA------EELAKLQDELDKIVKTKSELVKEK 402


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 36.1 bits (84), Expect = 0.063
 Identities = 27/120 (22%), Positives = 53/120 (44%), Gaps = 6/120 (5%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
           KR+ E+  K  E LKKE    A  K +   +  E +R LKE     + +  +  ++++  
Sbjct: 35  KRIIEEAKKEAETLKKEALLEA--KEEVHKLRAELERELKERRNELQRLERRLLQREETL 92

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
            +      K+  E L K E+EL  ++ N    + + +   +      +E E++  +  E+
Sbjct: 93  DRKMESLDKKE-ENLEKKEKELSNKEKNLDEKEEELE---ELIAEQREELERISGLTQEE 148



 Score = 30.7 bits (70), Expect = 2.9
 Identities = 29/126 (23%), Positives = 58/126 (46%), Gaps = 21/126 (16%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           ++   +E   +  EE KKE          A+++ KE   LL+  E+V K+  E      +
Sbjct: 26  KLGSAEELAKRIIEEAKKE----------AETLKKE--ALLEAKEEVHKLRAEL-----E 68

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE----HEKVQ 598
           +  K   +E++RL+ +L + EE L ++  +    +   +   KE     K      E+++
Sbjct: 69  RELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELE 128

Query: 599 KVVTEQ 604
           +++ EQ
Sbjct: 129 ELIAEQ 134


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 35.8 bits (84), Expect = 0.063
 Identities = 22/93 (23%), Positives = 46/93 (49%), Gaps = 16/93 (17%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           +G   D DE+  L       +EE ++ +T    L+++ +++ KE  +  ++ E  + ++ 
Sbjct: 21  RGFPLDVDELLEL-------DEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIA 73

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
           E  + K+         EIK L+ +L + E EL+
Sbjct: 74  EVKELKE---------EIKALEAELDELEAELE 97



 Score = 28.9 bits (66), Expect = 9.3
 Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 4/55 (7%)

Query: 550 DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
           +E++  +  LSK   + K++  ++ AL  +   + +E    +K  E     +  +
Sbjct: 45  EELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEE----IKALEAELDELEAE 95


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 35.8 bits (83), Expect = 0.066
 Identities = 19/93 (20%), Positives = 42/93 (45%), Gaps = 2/93 (2%)

Query: 518 EYDR-LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
           E +R L K   K++K+     D          S E   L+E++   E E+K+ + N  +L
Sbjct: 35  EVERKLRKLESKIKKLGIPLKDTGGKP-DVPPSKEFLDLEEEILDLEAEIKEVEENLESL 93

Query: 577 KNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
           + + + + +  + L +E   + + + E  +  +
Sbjct: 94  EKEINELEEWLNVLDEEKSFLDENLEELSELSN 126



 Score = 32.4 bits (74), Expect = 0.84
 Identities = 15/78 (19%), Positives = 36/78 (46%), Gaps = 2/78 (2%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
            S E   L+E++   E E+K+ + N  +L+ + + + +  + L +E   + + + E  + 
Sbjct: 65  PSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSEL 124

Query: 540 K--DDKYQKGDSDEIKRL 555
              D  ++     E  +L
Sbjct: 125 SNLDIDFKYLRGAEGLKL 142


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 35.3 bits (81), Expect = 0.076
 Identities = 32/121 (26%), Positives = 53/121 (43%), Gaps = 31/121 (25%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           ++K   E+L +  EEL+KEK                   LLKE E+++            
Sbjct: 132 DLKEDYEELKEKLEELQKEKE-----------------ELLKELEELEA----------- 163

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
           +Y++   + +KRL+ + S+ EE LKK       LK + D +    +  L E E +  +V 
Sbjct: 164 EYEEVQ-ERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE--LPEEELISDLVK 220

Query: 603 E 603
           E
Sbjct: 221 E 221



 Score = 29.5 bits (66), Expect = 4.7
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           E+ +   E   + +KRL+ + S+ EE LKK       LK + D +    +  L E E + 
Sbjct: 159 EELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE--LPEEELIS 216

Query: 531 KVVTE 535
            +V E
Sbjct: 217 DLVKE 221


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 35.8 bits (82), Expect = 0.077
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 18/131 (13%)

Query: 482  DEIKRLKEKLSKTEEELKKEKTNSA----ALKSQADSVGKEYDRLL-----KEHEKVQKV 532
            D+IK L +K+    + L++ K  S      +K+Q + +    D+ +     +E EK  + 
Sbjct: 1118 DDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIEN 1177

Query: 533  VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
            +  + DKK + Y     DEIK+L  ++++ E    K+KT+   +K    S GK   +L  
Sbjct: 1178 IVTKIDKKKNIY-----DEIKKLLNEIAEIE----KDKTSLEEVKGINLSYGKNLGKLFL 1228

Query: 593  EHEKVQKVVTE 603
            E    +K  +E
Sbjct: 1229 EKIDEEKKKSE 1239



 Score = 35.0 bits (80), Expect = 0.14
 Identities = 38/155 (24%), Positives = 73/155 (47%), Gaps = 27/155 (17%)

Query: 468  EIFEQCDK--GLEGDSDEIKRLKEKLSKTEEELKKE------KTNSAALKSQADSVGKEY 519
            E+ E  DK  G + ++D+  +  EK  +  E+ KK+      K ++ A+K++     K+ 
Sbjct: 1487 ELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDS 1546

Query: 520  DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
            + ++KE +   K    + +K + K +     EIK  KEK    ++  K +K+N AA+  Q
Sbjct: 1547 EIIIKEIKDAHKKFILEAEKSEQKIK-----EIK--KEKFRIEDDAAKNDKSNKAAIDIQ 1599

Query: 580  AD------------SVGKEYDRLLKEHEKVQKVVT 602
                           + K+ +  LKE E ++K ++
Sbjct: 1600 LSLENFENKFLKISDIKKKINDCLKETESIEKKIS 1634



 Score = 33.1 bits (75), Expect = 0.49
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 10/117 (8%)

Query: 482  DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            DEIK+L  ++++ E    K+KT+   +K    S GK   +L  E    +K  +E   K  
Sbjct: 1190 DEIKKLLNEIAEIE----KDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKKSEHMIKAM 1245

Query: 542  DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK--NQADSVGKEYDRLLKEHEK 596
            + Y + D DEI   KEK  + E E+  E    A ++  N +    K++  + K+H++
Sbjct: 1246 EAYIE-DLDEI---KEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKHDE 1298


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 34.7 bits (80), Expect = 0.091
 Identities = 17/85 (20%), Positives = 44/85 (51%), Gaps = 9/85 (10%)

Query: 499 KKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK 558
            ++   S   K + + +  E + L  + E++Q  + E   K+ D+ +     EIK+L+++
Sbjct: 35  NQDSKLSELQKEKKN-IQNEIESLDNQIEEIQSKIDEL-QKEIDQSKA----EIKKLQKE 88

Query: 559 LSKTEEELKKEKTNSAALKNQADSV 583
           +++ +E + + +     LK +A ++
Sbjct: 89  IAELKENIVERQE---LLKKRARAM 110



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 17/86 (19%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 498 LKKEKTNSAALKSQADSVGKEYDRL---LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKR 554
           L+++K +  +L+ +  ++  + + L     E E     +  Q  +K+       + E   
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209

Query: 555 LKEKLSKTEEELKKEKTNSAALKNQA 580
           L EK +  E++   E   + A K +A
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEA 235


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 35.4 bits (82), Expect = 0.095
 Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 5/103 (4%)

Query: 475 KGLEGDSD-EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           K   G SD EI+R+ E         KK +    A +++A+S+    ++ LKE  KV +  
Sbjct: 478 KASSGLSDEEIERMVEDAEANAALDKKFRELVEA-RNEAESLIYSLEKALKEIVKVSEEE 536

Query: 534 TEQ-GDKKDD--KYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
            E+  +   D  +  +G+ +EIK   E+L +  ++L ++K   
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEKKYQQ 579


>gnl|CDD|218701 pfam05698, Trigger_C, Bacterial trigger factor protein (TF)
           C-terminus.  In the E. coli cytosol, a fraction of the
           newly synthesised proteins requires the activity of
           molecular chaperones for folding to the native state.
           The major chaperones implicated in this folding process
           are the ribosome-associated Trigger Factor (TF), and the
           DnaK and GroEL chaperones with their respective
           co-chaperones. Trigger Factor is an ATP-independent
           chaperone and displays chaperone and
           peptidyl-prolyl-cis-trans-isomerase (PPIase) activities
           in vitro. It is composed of at least three domains, an
           N-terminal domain which mediates association with the
           large ribosomal subunit, a central substrate binding and
           PPIase domain with homology to FKBP proteins, and a
           C-terminal domain of unknown function. The positioning
           of TF at the peptide exit channel, together with its
           ability to interact with nascent chains as short as 57
           residues renders TF a prime candidate for being the
           first chaperone that binds to the nascent polypeptide
           chains. This family represents the C-terminal region of
           the protein.
          Length = 162

 Score = 33.8 bits (78), Expect = 0.097
 Identities = 39/156 (25%), Positives = 65/156 (41%), Gaps = 38/156 (24%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSA-------ALKSQAD------SVGKEYDRLLKE----- 525
           ++ LKEKL K  EE KKE T           L   A+       V +E DRLL++     
Sbjct: 2   LEELKEKLRKQLEEEKKEATAEELKEAILDKLVENAEIELPESLVEEEIDRLLRQFLQQL 61

Query: 526 ---HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLK-----------EKLSKTEEELKKEKT 571
                 +++ ++  G  +++  ++   +  KR+K           EK+  T+EE+K+E  
Sbjct: 62  QGQGLDLEEYLSLSGSSEEELREEFKEEAEKRVKLGLILEEIAKEEKIEVTDEEIKEEIE 121

Query: 572 NSA-ALKNQADSVGKEYD-----RLLKEHEKVQKVV 601
             A     + + V + Y        LKE    +K V
Sbjct: 122 ELASQYGMEPEEVKEFYKKNEQLSALKEDLLEEKAV 157


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 35.3 bits (82), Expect = 0.11
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 13/110 (11%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
            + K  +I      GL    DEI+R+ +   +   E KK K    A K++A+     Y  
Sbjct: 486 GTGKEQKITITASSGL--SDDEIERMVKDAEEYAAEDKKRKERIEA-KNEAEEY--VYS- 539

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
                  ++K + E+GDK  +  +K   + I+ LKE+L   ++E  + KT
Sbjct: 540 -------LEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKT 582


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 34.1 bits (78), Expect = 0.13
 Identities = 26/118 (22%), Positives = 52/118 (44%), Gaps = 10/118 (8%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK-VQKVVTEQGDKK 540
            EI+   ++L K ++E+K E+    + +       KE + LL+E  + V+++     D  
Sbjct: 4   KEIRNKTDELEKLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADIN 63

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY---DRLLKEHE 595
           D +       EIK+ K +L +   ++ +       LK   + + K     + L + HE
Sbjct: 64  DME------TEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEELPQLHE 115


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 33.4 bits (77), Expect = 0.14
 Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 10/100 (10%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSA--ALKSQADSVGKEYD--RLLKEHEKVQKVVTEQG 537
            EI+ L++ L KT++  +KE+      ++KS+  ++  +     +LKEH+K +K + ++G
Sbjct: 63  KEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEG 122

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
            K    Y K  S+     K  L K  +ELKK K    AL+
Sbjct: 123 KKP--YYLK-KSE---IKKLVLKKKFDELKKSKQLDKALE 156


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 27/109 (24%), Positives = 42/109 (38%), Gaps = 30/109 (27%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           EQ DK +     +I+  K KL K  EE         +LK Q           LK  +K  
Sbjct: 2   EQIDKIMAEYGKDIQNPKSKLDKANEERD-------SLKKQ-----------LKNRDK-- 41

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
                    +D K +  D++E   L++K+ K +++ K  K    A    
Sbjct: 42  -------QIEDLKKKVKDNEE---LQKKIEKLKQQNKTAKEEYEAKLAD 80


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 33.9 bits (78), Expect = 0.19
 Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 8/111 (7%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E  K   +  Q    L+   ++++ L ++L++  +E+K+ + +   L+ + + + KE D+
Sbjct: 2   EVKKAARLLNQSLSKLDTPEEKLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDK 61

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKE---KLSKTEEELKKE 569
           L  EH K           K +   +      K LKE     S  EEE +KE
Sbjct: 62  LQSEHSK-----AILARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKE 107



 Score = 32.7 bits (75), Expect = 0.49
 Identities = 31/139 (22%), Positives = 59/139 (42%), Gaps = 14/139 (10%)

Query: 483 EIKRLKE---KLSKTEEELKKEKTN--SAALK---SQADSVGKEYDRLLKEHEKVQ---K 531
             K LKE     S  EEE +KE T    + LK    Q +       +L +E+E+++   K
Sbjct: 86  HNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLK 145

Query: 532 VVTEQGDKKD---DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
            + EQ + ++   +K  K    E +    KL + +E+ ++E+      K        +  
Sbjct: 146 ELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVS 205

Query: 589 RLLKEHEKVQKVVTEQGDK 607
            L +  ++++  +    DK
Sbjct: 206 TLKETEKELRAQLNLYSDK 224



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 15/129 (11%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E +    KL + +E+ ++E+      K        +   L +  ++++  +    DK  +
Sbjct: 168 EQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKE 227

Query: 543 KYQK-GDSDEI----KRLKEKLSKTEEELKKEKTNSAALKNQADSVGK-------EYDRL 590
                  S+E+    K+  EK+SK  ++L+KE   + A K + +   K       E  + 
Sbjct: 228 FQDTLNKSNEVFLTFKQEMEKMSKKIKKLEKE---NLAWKRKWEKSNKALLEMAEERQKR 284

Query: 591 LKEHEKVQK 599
            KE EK++K
Sbjct: 285 DKEIEKLKK 293


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 34.5 bits (79), Expect = 0.20
 Identities = 36/161 (22%), Positives = 70/161 (43%), Gaps = 22/161 (13%)

Query: 449 CLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAA 507
           C++  E+      E         +C   LE   ++++  ++KL + + EL   +++NS A
Sbjct: 589 CISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSLA 648

Query: 508 LKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
            ++Q   + + Y   L+   K  +                   E K L+EK+S  E+EL+
Sbjct: 649 -ETQLKCMVESY-ESLELRAKDLEA------------------EHKSLQEKISSLEDELE 688

Query: 568 KEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
           KE+ N   L  +   + +E     ++ E +QK+  E+   K
Sbjct: 689 KERQNHQELIAKCREL-EEKIERAEQEENMQKLDEEEQKIK 728



 Score = 31.1 bits (70), Expect = 2.0
 Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 8/97 (8%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
           E +E  +   +    E K L+EK+S  E+EL+KE+ N   L ++   + +E     ++ E
Sbjct: 657 ESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCREL-EEKIERAEQEE 715

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
            +QK+  E+   K +K       EI    EKL++ +E
Sbjct: 716 NMQKLDEEEQKIKQEK-------EIAAAAEKLAECQE 745


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 34.2 bits (79), Expect = 0.20
 Identities = 28/146 (19%), Positives = 57/146 (39%), Gaps = 10/146 (6%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQAD----SVGKEYDRLL 523
            I E+       + + ++ L EKL +  +EL++E      L  + +    +       LL
Sbjct: 490 PIIEEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLL 549

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK---LSKTE-EELKKEKTNSAALKNQ 579
           +   ++  +         ++ +    DE+K LK+K   L     E   K+K N    + +
Sbjct: 550 ELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAK 609

Query: 580 ADS--VGKEYDRLLKEHEKVQKVVTE 603
                +G E + +  E   V K++  
Sbjct: 610 KRKLKLGDEVEVITGEPGAVVKIIAG 635


>gnl|CDD|191973 pfam08232, Striatin, Striatin family.  Striatin is an intracellular
           protein which has a caveolin-binding motif, a
           coiled-coil structure, a calmodulin-binding site, and a
           WD (pfam00400) repeat domain. It acts as a scaffold
           protein and is involved in signalling pathways.
          Length = 133

 Score = 32.7 bits (75), Expect = 0.21
 Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 11/94 (11%)

Query: 477 LEGDSDEIKRLKEKLSKT----EEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV 532
           LEG+   ++ LKE L +     E  LK+E+     LK   DS  K      +E       
Sbjct: 37  LEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEEL----- 91

Query: 533 VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
             E+ D ++   +K D  E+K+ +  L K+ +E+
Sbjct: 92  --EEDDDEESNSEKVDESELKKSRLYLRKSLQEI 123


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 34.2 bits (78), Expect = 0.22
 Identities = 32/128 (25%), Positives = 64/128 (50%), Gaps = 10/128 (7%)

Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-----QGDKK 540
           +  E++   E EL   +  +  LK + +   KE  ++LKE E  ++ V +     +  +K
Sbjct: 26  QRSERVLVKETELMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEK 85

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA-----ALKNQADSVGKEYDRLLKEHE 595
           +++  K DS+  K   E+L +  +EL+ E+  +A     ++K +   + +EYD L++E +
Sbjct: 86  EEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERD 145

Query: 596 KVQKVVTE 603
              K   E
Sbjct: 146 AALKRAEE 153



 Score = 32.7 bits (74), Expect = 0.56
 Identities = 32/151 (21%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E+    ++ E+  + L  +   +K   E+      E ++E+  +A  K Q     + Y++
Sbjct: 153 EAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDR---ETYEK 209

Query: 522 LLKEHEKVQKVVTEQGD-KKDDKYQKGDSDEIKRLKEKLSKTE-------------EELK 567
            LKE EK  + + +  D +KD +     S E++ L++++S                  L+
Sbjct: 210 ELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLE 269

Query: 568 KEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           K      +L+N  +S+ +E +   KE E+++
Sbjct: 270 KAAEEEKSLRNLVESLKQELEEEKKELEELR 300


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 34.4 bits (79), Expect = 0.22
 Identities = 23/127 (18%), Positives = 55/127 (43%), Gaps = 8/127 (6%)

Query: 475  KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
            K ++ + +++  +K K  +      KEK        Q +   K+   L K +++++K + 
Sbjct: 1091 KKIDENKNKLIEIKNKSHEHVVNADKEKNK------QTEHYNKKKKSLEKIYKQMEKTLK 1144

Query: 535  EQGDKKDDKYQKGDSDEIKRLKEKLSKTE--EELKKEKTNSAALKNQADSVGKEYDRLLK 592
            E  +   +     + +EI+   E++      E++  E   S  +  + +S  K+ D++ K
Sbjct: 1145 ELENMNLEDITLNEVNEIEIEYERILIDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKK 1204

Query: 593  EHEKVQK 599
               K + 
Sbjct: 1205 NMSKERN 1211



 Score = 29.8 bits (67), Expect = 5.1
 Identities = 14/72 (19%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 519  YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
            YD+    +E ++++ TE    K +  +  + DE+K +K ++      L++    +  ++N
Sbjct: 1223 YDKATASYENIEELTTEAKGLKGEANRSTNVDELKEIKLQV---FSYLQQVIKENNKMEN 1279

Query: 579  QADSVGKEYDRL 590
                +   Y+ L
Sbjct: 1280 ALHEIKNMYEFL 1291


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 33.8 bits (78), Expect = 0.23
 Identities = 21/103 (20%), Positives = 43/103 (41%), Gaps = 12/103 (11%)

Query: 484 IKRLKEKLSKT------EEELKKEK-TNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           ++ LKE            E+L   K +   +LK   ++   E +R LK+    +    E+
Sbjct: 272 LQELKETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEK 331

Query: 537 G-DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
             ++ +    +    E K L  KL + +++L+  +     LK+
Sbjct: 332 RLEELEQNLIE----ERKELNSKLEEIQKKLEDLEKRLEKLKS 370



 Score = 28.8 bits (65), Expect = 9.5
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKS 510
           E K L  KL + +++L+  +     LKS
Sbjct: 343 ERKELNSKLEEIQKKLEDLEKRLEKLKS 370


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 33.9 bits (78), Expect = 0.23
 Identities = 25/108 (23%), Positives = 55/108 (50%), Gaps = 8/108 (7%)

Query: 472 QCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK 531
           + ++ LEG ++E  +L   L K E   KKE+ N   ++++   + +E ++  +  E++++
Sbjct: 42  KVEEALEGATNEDGKLAADLLKLEVARKKERLN--QIRARISQLKEEIEQKRERIEELKR 99

Query: 532 VV-TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
            +   + D     YQ       KR   +L K ++E+K+ ++   AL +
Sbjct: 100 ALAQRRSDLSSASYQL-----EKRRASQLEKLQDEIKRTRSKLNALHS 142


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 33.9 bits (78), Expect = 0.26
 Identities = 27/140 (19%), Positives = 52/140 (37%), Gaps = 11/140 (7%)

Query: 454 EKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQA- 512
           +K   G  +S   C       +  +     +K+L++ L++   +L K+       KS   
Sbjct: 231 DKLKVGEIDSDVLCRDLGTLQELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKL 290

Query: 513 -DSVGKEYDRLLKE-------HEKVQKVVTEQGDKKDD--KYQKGDSDEIKRLKEKLSKT 562
             S  KE++ L +          +++K+      KK D   Y      ++    E   K+
Sbjct: 291 TPSRKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKS 350

Query: 563 EEELKKEKTNSAALKNQADS 582
             E    K+     K+QA +
Sbjct: 351 LYENTPTKSGFITFKSQASA 370



 Score = 30.1 bits (68), Expect = 3.8
 Identities = 22/117 (18%), Positives = 44/117 (37%), Gaps = 10/117 (8%)

Query: 497 ELKKEKTNSAALKSQADSVGK---EYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIK 553
            L  E  +   LK   D +     + D L ++   +Q++      K+ DK  K     + 
Sbjct: 215 GLPSELRSDEELKELFDKLKVGEIDSDVLCRDLGTLQELY-----KERDKILKKLEKALN 269

Query: 554 RLKEKLSK--TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
            L  KL K    +  KK    + + K + + + +      +   +++K+      KK
Sbjct: 270 ELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKK 326


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 32.8 bits (75), Expect = 0.26
 Identities = 21/108 (19%), Positives = 53/108 (49%), Gaps = 8/108 (7%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEE-------LKKEKTNSAALKSQADSVGKEYDRLL 523
           E+ +K    +   ++R+ + LS+ +EE       + K    +   K   +     Y+ ++
Sbjct: 49  EELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMV 108

Query: 524 KE-HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
           ++  ++V+  V E+      +  +   D++K+ +++L K  +EL+KE+
Sbjct: 109 EDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEE 156


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 32.6 bits (75), Expect = 0.27
 Identities = 9/42 (21%), Positives = 22/42 (52%)

Query: 553 KRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
             L+ K+++ ++E  + ++ +  L+ +A     E D  L +H
Sbjct: 70  AELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQH 111



 Score = 31.8 bits (73), Expect = 0.51
 Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
             L+ K+++ ++E  + ++ +  L+++A     E D  L +H
Sbjct: 70  AELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQH 111


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 33.9 bits (77), Expect = 0.28
 Identities = 24/106 (22%), Positives = 41/106 (38%), Gaps = 12/106 (11%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
             K  K +EEL +E+ +SA  K + D    E D    E           G+  D      
Sbjct: 227 DVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPNAE---------GVGEASDKNLSSF 277

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
           +  +I  + E++ + E+EL   K        +  +  +  + LL E
Sbjct: 278 EKQQI-EMDEQIEELEKELVAPK--EWKYAGEVSAKKRPKNSLLAE 320


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 33.2 bits (76), Expect = 0.31
 Identities = 29/130 (22%), Positives = 57/130 (43%), Gaps = 9/130 (6%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEE---LKKE----KTNSAALKSQADSVGKEYD 520
           E   +  K LE   +    L+EKL + EEE   L+K+    +  +  L+ +A +  +E +
Sbjct: 19  EDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERE 78

Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
           RL  E ++    V +  ++++ K  +      E++  +E   +  +EL +      A  +
Sbjct: 79  RLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAAAPTAPPH 138

Query: 579 QADSVGKEYD 588
            A  V  E  
Sbjct: 139 VAAPVNGEQL 148


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 33.6 bits (77), Expect = 0.31
 Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 472 QCDKGLEGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
           + +  L GD   + +K   +KL    ++L + K  + A + +A++  KE     +E+EK 
Sbjct: 357 KSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASE-EAEAKAKEEKLKQEENEKK 415

Query: 530 QKVVTEQGDKKDDKYQK 546
           QK   EQ D+  +K QK
Sbjct: 416 QK---EQADEDKEKRQK 429



 Score = 29.4 bits (66), Expect = 5.7
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 12/87 (13%)

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
           L K +E+V+      GDK+         + +K   +KL    ++L + K  + A + +A+
Sbjct: 349 LTKLYEEVKSNTDLSGDKR--------QELLKEYNKKLQDYTKKLGEVKDETDASE-EAE 399

Query: 582 SVGKEYDRLLKEHEKVQKVVTEQGDKK 608
           +  KE     +E+EK QK   EQ D+ 
Sbjct: 400 AKAKEEKLKQEENEKKQK---EQADED 423


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 32.3 bits (74), Expect = 0.32
 Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 13/85 (15%)

Query: 483 EIKRLKEKLSKTEEEL----KKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
            I+RLK +L   E EL     KE+     LK+    +  E + + +    +Q+  T+   
Sbjct: 72  TIERLKTQLEDLERELALLQAKERQLEKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNH 131

Query: 539 --KKDDKYQKGDSDEIKRLKEKLSK 561
             KK D+       EI++LKE+L +
Sbjct: 132 ELKKRDR-------EIEKLKERLLQ 149


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 33.7 bits (78), Expect = 0.32
 Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 17/91 (18%)

Query: 485 KRLKEKLSKTEEE---LKKEKTNSAALKSQADSVGKEYDRLLKEHEKV-QKVVTEQGDKK 540
           +RL++ L   E+E   L+ EK     +K Q +   +EY   L+E  K  QK   E G+  
Sbjct: 192 ERLEKLLDLLEKEIDLLQLEKRIRKKVKEQMEKSQREY--YLREQLKAIQK---ELGEDD 246

Query: 541 DDKYQKGDSDEIKRLKEKLSKTE--EELKKE 569
           DDK      DE++ L+EK+ K +  +E K++
Sbjct: 247 DDK------DEVEELREKIEKLKLPKEAKEK 271


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 33.5 bits (78), Expect = 0.33
 Identities = 26/112 (23%), Positives = 51/112 (45%), Gaps = 6/112 (5%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EKV----QKVVTEQ 536
           +EI+R+ +      EE KK K      ++QADS+  + ++ LKE  +KV    ++ +   
Sbjct: 503 EEIERMVKDAEANAEEDKKRK-ELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
             +  +  +  D + IK   E+L++  ++L +     A     A     + D
Sbjct: 562 IKELKEALKGEDKEAIKAKTEELTQASQKLGEAMYQQAQAAQGAAGAAAKDD 613


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 33.2 bits (76), Expect = 0.33
 Identities = 34/144 (23%), Positives = 68/144 (47%), Gaps = 26/144 (18%)

Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
           R ++FE  D  +E + D  KR+ +  +K EE+         +L+        EY+  L+E
Sbjct: 59  RVQLFE--DPTVEKEVDIRKRVLKIYNKREEDF-------PSLR--------EYNDYLEE 101

Query: 526 HEKV-----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
            E +       +  E   KK + YQK + D I++ KEK ++ +EEL++    +   + + 
Sbjct: 102 VEDIVYNLTNNIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEE----ALEFEKEE 157

Query: 581 DSVGKEYDRLLKEHEKVQKVVTEQ 604
           +   +   +  +E +++ K   +Q
Sbjct: 158 EEQRRLLLQKEEEEQQMNKRKNKQ 181


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 33.5 bits (77), Expect = 0.34
 Identities = 29/103 (28%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 496 EELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL 555
           EEL++EK    ALK Q         RL K  ++ +K   E   +K +K +   ++  KR 
Sbjct: 551 EELQREKEEKEALKEQ--------KRLRKLKKQEEKKKKEL--EKLEKAKIPPAEFFKRQ 600

Query: 556 KEKLSKTEEEL--KKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
           ++K S  +E      +       K +   + KEYD+  K HE+
Sbjct: 601 EDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQAKLHEE 643


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 33.4 bits (77), Expect = 0.35
 Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 13/108 (12%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
             +EI+R+ ++     EE KK K      ++ ADS+  + ++ LK          E GDK
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEE-IEARNNADSLAYQAEKTLK----------EAGDK 547

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
              + ++     +  LKE L   + E  K KT    L+     + +  
Sbjct: 548 LPAEEKEKIEKAVAELKEALKGEDVEEIKAKT--EELQQALQKLAEAM 593



 Score = 31.5 bits (72), Expect = 1.4
 Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 10/65 (15%)

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH---------EKVQ 598
             +EI+R+ ++     EE KK K      +N ADS+  + ++ LKE          EK++
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEE-IEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIE 557

Query: 599 KVVTE 603
           K V E
Sbjct: 558 KAVAE 562


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 32.3 bits (74), Expect = 0.37
 Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 11/108 (10%)

Query: 496 EELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL 555
           EEL K       L+ +   + ++   L  E + +   +T       ++ Q+    EI+ L
Sbjct: 72  EELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLT------TEELQE----EIQEL 121

Query: 556 KEKLSKTEEELKK-EKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
           K+++ + EE+L+  E+        + + V KEY  L KE +K +K+  
Sbjct: 122 KKEVREIEEKLESLEEGWKPVTPEEMEKVKKEYKDLHKEWKKRKKLCN 169


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 0.38
 Identities = 18/126 (14%), Positives = 60/126 (47%), Gaps = 11/126 (8%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
               + L+++  +  + L++ +  + A   +A+S+ ++ D L +  E+++    E+  + 
Sbjct: 313 EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELR----EEAAEL 368

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR---LLKEHEKV 597
           + + ++   + ++  +E++ + EEE+   +       +    +G   D    L +E +++
Sbjct: 369 ESELEE-AREAVEDRREEIEELEEEI---EELRERFGDAPVDLGNAEDFLEELREERDEL 424

Query: 598 QKVVTE 603
           ++   E
Sbjct: 425 REREAE 430



 Score = 30.4 bits (69), Expect = 3.4
 Identities = 21/108 (19%), Positives = 49/108 (45%), Gaps = 12/108 (11%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           EI+ L+E +++TE E ++       L+ + + + +E D LL E            D   +
Sbjct: 259 EIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAE--------AGLDDADAE 310

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
                     + L+++  +  + L++ +  + A   +A+S+ ++ D L
Sbjct: 311 AV----EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL 354



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 22/112 (19%), Positives = 54/112 (48%), Gaps = 8/112 (7%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           D I+RL+E+    EE + + +      + +A+ + +    L  E E+ ++   E  ++ +
Sbjct: 509 DRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAE 568

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEE---LKKEKTNSAALKNQADSVGKEYDRL 590
           +       +E+  L  KL++ +E    L++ +T  AA+ +  D + +  ++ 
Sbjct: 569 EA-----REEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKR 615



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 26/141 (18%), Positives = 56/141 (39%), Gaps = 33/141 (23%)

Query: 473 CDKGLEG--DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL--LKEHEK 528
           C + +EG    + I+  +E++ + E EL         L+ + + V +  +R   L E E 
Sbjct: 457 CGQPVEGSPHVETIEEDRERVEELEAEL-------EDLEEEVEEVEERLERAEDLVEAE- 508

Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
                                D I+RL+E+    EE + + +      + +A+ + +   
Sbjct: 509 ---------------------DRIERLEERREDLEELIAERRETIEEKRERAEELRERAA 547

Query: 589 RLLKEHEKVQKVVTEQGDKKD 609
            L  E E+ ++   E  ++ +
Sbjct: 548 ELEAEAEEKREAAAEAEEEAE 568


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 33.5 bits (76), Expect = 0.41
 Identities = 28/140 (20%), Positives = 59/140 (42%), Gaps = 8/140 (5%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           K L+ +  ++ R   KL +  E+L    T    ++        + +++ K   +    +T
Sbjct: 504 KSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELT 563

Query: 535 EQGDKKDDKYQKGD-----SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
                  +K Q  D     S EI + +++L+K  +EL   + N   + N+ +S  ++   
Sbjct: 564 SLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQ--- 620

Query: 590 LLKEHEKVQKVVTEQGDKKD 609
           L    +K+  V   Q ++ D
Sbjct: 621 LSSYEDKLFDVCGSQDEESD 640



 Score = 31.6 bits (71), Expect = 1.4
 Identities = 22/121 (18%), Positives = 50/121 (41%), Gaps = 10/121 (8%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE------ 535
           DE K L   +   +E L+K++     +  +   +    DR+L+  ++++K   E      
Sbjct: 433 DEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEK 492

Query: 536 ----QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
               +  KK+ K  + +  ++ R   KL +  E+L    T    ++        + +++ 
Sbjct: 493 NSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIR 552

Query: 592 K 592
           K
Sbjct: 553 K 553



 Score = 29.2 bits (65), Expect = 8.7
 Identities = 26/156 (16%), Positives = 55/156 (35%), Gaps = 30/156 (19%)

Query: 481 SDEIKRLKEKLSKTEEELK----KEKTNSAAL---------KSQADSVGKEYDRLLKEHE 527
           + +I+RLK  + + E  L     +E++    L         + +   V ++  +   + +
Sbjct: 757 NRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQ 816

Query: 528 KV--QKVVTEQGDKKDDKYQKGD------------SDEIKRLKEKLSKTEEELKKEKTNS 573
                + V +   +K +K  + D              + +   + L     ELK EK   
Sbjct: 817 GSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQI 876

Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
                +     ++   L  E   VQ ++ E  D K+
Sbjct: 877 GTNLQRRQQFEEQLVELSTE---VQSLIREIKDAKE 909


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 32.9 bits (74), Expect = 0.42
 Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 10/119 (8%)

Query: 458 CGWCESSKRCEIFEQCDKGLEGDSDEIKR----LKEKLSKTEEELKKEKTNSAALKSQAD 513
           C   +  K+ E+  Q  K  E   D   +    L+++  KTE+E  K+KT    ++    
Sbjct: 119 CFADDQDKKIEL-AQAKKEAENARDRANKSGIELEQEEQKTEQE--KQKTEKEGIELANS 175

Query: 514 SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
            +  E ++   E EK QK  TEQ  +K       ++ E+++ K+K    +++L KE+ +
Sbjct: 176 QIKAEQEKQKTEQEK-QK--TEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKD 231


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 33.0 bits (76), Expect = 0.45
 Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 14/98 (14%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE-------KVQKVVTEQ 536
           +K ++E      E+LKK +   A    +   + +E  +LL+E E       ++ K +   
Sbjct: 4   LKLIRENPDAVREKLKK-RGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRA 62

Query: 537 GDKKDDKYQ------KGDSDEIKRLKEKLSKTEEELKK 568
             + +D  +      K   +++K L+  L + E EL  
Sbjct: 63  LKRGEDDAEELIAEVKELKEKLKELEAALDELEAELDT 100


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 31.9 bits (73), Expect = 0.46
 Identities = 19/101 (18%), Positives = 52/101 (51%), Gaps = 8/101 (7%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE-HEKVQKVV---TEQGDKK 540
           K + + L + + ++  +   +  LK +A ++  EY++ L+E  E+  +++    ++ ++ 
Sbjct: 29  KPILKALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQI 88

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKNQA 580
            ++ +    +E++R+KE     E E++ EK      L+ + 
Sbjct: 89  AEEIKAEAEEELERIKE---AAEAEIEAEKERALEELRAEV 126


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 33.2 bits (76), Expect = 0.46
 Identities = 19/101 (18%), Positives = 43/101 (42%), Gaps = 8/101 (7%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV-VTEQGD 538
            S+ +  +KE++ +T++E       SA +   A+   +E +    E        ++++  
Sbjct: 221 SSEIVNEIKEEVCETDDE-------SAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETS 273

Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
              +  +  D +EI+   E+  K  E L K +   + L+  
Sbjct: 274 IAKESKELPDGEEIEFGNEERFKAPEILFKPELPISGLEEA 314


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 32.1 bits (73), Expect = 0.48
 Identities = 23/105 (21%), Positives = 48/105 (45%), Gaps = 14/105 (13%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGKEYDRLLKE------H 526
            E  + ++K+LKE   K ++ELKK    ++          +   +E  R   E       
Sbjct: 47  REKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQE--REKTEINRSHIQ 104

Query: 527 EKVQ--KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
           E VQ  K + E   ++ +K ++  ++ +++++E+  K + E   E
Sbjct: 105 EVVQSIKRLEEAQKRRQEKLEEKQAEILQQIQEEEPKLQAEATAE 149


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 33.2 bits (76), Expect = 0.50
 Identities = 13/100 (13%), Positives = 31/100 (31%), Gaps = 4/100 (4%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
             I+RL E +++   +          L   A    +      +E    +++        D
Sbjct: 855 KLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAF----YDD 910

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
                G   E      +    E+ L+++  +    K + +
Sbjct: 911 LALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAKEEVE 950


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 31.0 bits (71), Expect = 0.50
 Identities = 25/99 (25%), Positives = 50/99 (50%), Gaps = 21/99 (21%)

Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK------------VV 533
           +L+++L    +++K+ +T       + + V +E + LL E  KV K             V
Sbjct: 9   QLQQQLQLLAQQIKQLETQ----LKEIELVLEELE-LLDEDTKVYKLIGDVLVKQDKEEV 63

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
            E+ +++ +  +K    EIK L+++L K E+EL++ K  
Sbjct: 64  KEELEERKETLEK----EIKTLEKQLEKLEKELEELKEE 98


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 31.4 bits (72), Expect = 0.50
 Identities = 36/144 (25%), Positives = 69/144 (47%), Gaps = 25/144 (17%)

Query: 466 RCEIFEQCDKGLEGDS----DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           R E  E+  K LE ++     EI  L++K  + EEE++K +      K + +    E ++
Sbjct: 15  RAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLE----ESEK 70

Query: 522 LLKEHEKVQKVVT---EQGDKKDDKYQKGDSDEIKRLKEKLSKTEE-ELKKEKT--NSAA 575
           L    E + + +    E+ ++ +           KRLKE   K  E + K E++     A
Sbjct: 71  LATNAEALTRRIQLLEEELEESE-----------KRLKETTEKLREADKKAEESERKVKA 119

Query: 576 LKNQADSVGKEYDRLLKEHEKVQK 599
           L+N+ D   ++Y+ L K++++ +K
Sbjct: 120 LENERDEWEEKYEELEKKYKEAKK 143


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 32.5 bits (74), Expect = 0.52
 Identities = 27/105 (25%), Positives = 50/105 (47%), Gaps = 12/105 (11%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
             C K L   +D+I  L E+L+K  E+L++++     L SQ   + K+      E+E++Q
Sbjct: 203 NDCVKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQ 262

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEK-------LSKTEEELKK 568
           + +    D +          E++ L++K       L + +EELK 
Sbjct: 263 QHLAAAKDAQRQL-----QAELQELQDKYAECMEMLHEAQEELKN 302



 Score = 31.3 bits (71), Expect = 1.2
 Identities = 34/148 (22%), Positives = 68/148 (45%), Gaps = 15/148 (10%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           ESS+   +  Q          +++ L+EKL   EEE       +  L+S+A  +  E   
Sbjct: 141 ESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEE-------NEHLRSEASHLKTE--- 190

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
            +   EK Q++V +   +  +      +D+I  L E+L+K  E+L++++     L +Q  
Sbjct: 191 TVTYEEKEQQLVNDCVKQLREA-----NDQIASLSEELAKKTEDLERQQEEITHLLSQIV 245

Query: 582 SVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
            + K+      E+E++Q+ +    D + 
Sbjct: 246 DLQKKCKSYALENEELQQHLAAAKDAQR 273


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 32.2 bits (74), Expect = 0.53
 Identities = 12/42 (28%), Positives = 27/42 (64%)

Query: 532 VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
             T   +K+D K +    ++I+ L+++++K +++LKKEK  +
Sbjct: 164 SNTGVYEKEDLKERVERLEQIEELEKEIAKLKKKLKKEKQFN 205


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 33.1 bits (76), Expect = 0.54
 Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 12/104 (11%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
              E+  K L+   ++I++L+ ++ K EEEL+K K     L      + K+YD  L E  
Sbjct: 95  AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNK--FLDKAWKKLAKKYDSNLSEAL 152

Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLS--KTEEELKKE 569
           K        G      +++    E+K +    S   + EELK +
Sbjct: 153 K--------GLNYKKNFKEKLLKELKSVILNASSLLSLEELKAK 188



 Score = 30.0 bits (68), Expect = 4.1
 Identities = 28/137 (20%), Positives = 53/137 (38%), Gaps = 31/137 (22%)

Query: 480 DSDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYD----------RLLKEH 526
           D +E+K L E L +  E+  +   EK    +   + +S+    +           L++EH
Sbjct: 321 DVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREH 380

Query: 527 -EKVQKVVTEQGDKKD--------------DKYQKGDSD---EIKRLKEKLSKTEEELKK 568
            EK+  +  E+   K               D YQK        I  L++++ + E E+K 
Sbjct: 381 NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKA 440

Query: 569 EKTNSAALKNQADSVGK 585
            +     L+ Q  ++  
Sbjct: 441 LEKEIKELEKQLTNIEP 457


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 32.8 bits (74), Expect = 0.54
 Identities = 24/87 (27%), Positives = 43/87 (49%), Gaps = 11/87 (12%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           EI RLKE+L  +   L++ +      + Q   + +EY   L++ E+  +    Q ++KD 
Sbjct: 371 EIARLKERLRVSVRRLEEYERRLLGQEQQMQKLLQEYQARLEDSEERLR---RQQEEKD- 426

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
                   ++K +  +L   EEELKK+
Sbjct: 427 -------SQMKSIISRLMAVEEELKKD 446


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 33.0 bits (76), Expect = 0.55
 Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 476 GLEGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADS--VGKEYDRLLKEHEKVQK 531
           GL   + E+ RL+++L K E+E+ +  +K ++    ++A    V KE ++L +   K+ K
Sbjct: 808 GLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867

Query: 532 V 532
           +
Sbjct: 868 L 868



 Score = 32.6 bits (75), Expect = 0.81
 Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 551 EIKRLKEKLSKTEEELKK--EKTNSAALKNQADS--VGKEYDRLLKEHEKVQKV 600
           E+ RL+++L K E+E+ +  +K ++     +A    V KE ++L +   K+ K+
Sbjct: 815 ELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKL 868


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 32.8 bits (75), Expect = 0.59
 Identities = 28/140 (20%), Positives = 48/140 (34%), Gaps = 26/140 (18%)

Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD 548
            K  K  +EL++ +       +   S   E    L E          + +  D+   +G 
Sbjct: 207 VKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDD----GEEESDDESAWEGF 262

Query: 549 SDE--------------------IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
             E                     KR KE L +  +E K+ K   A L  +   + KE  
Sbjct: 263 ESEYEPINKPVRPKRKTKAQRNKEKRRKE-LEREAKEEKQLKKKLAQL-ARLKEIAKEVA 320

Query: 589 RLLKEHEKVQKVVTEQGDKK 608
           +  K   + ++   E+G+KK
Sbjct: 321 QKEKARARKKEQRKERGEKK 340


>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
          Length = 653

 Score = 32.8 bits (75), Expect = 0.61
 Identities = 21/62 (33%), Positives = 39/62 (62%), Gaps = 2/62 (3%)

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH-EKVQKVVTEQGD 606
            S+EI+R++++  K  EE ++ K     LKNQADS+   Y+  LKE+ E + + + ++ +
Sbjct: 503 SSNEIERMRQEAEKYAEEDRRRK-QLIELKNQADSLLYSYESTLKENGELISEELKQRAE 561

Query: 607 KK 608
           +K
Sbjct: 562 QK 563



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 26/90 (28%), Positives = 53/90 (58%), Gaps = 3/90 (3%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EKVQKVVTEQGD 538
            S+EI+R++++  K  EE ++ K     LK+QADS+   Y+  LKE+ E + + + ++ +
Sbjct: 503 SSNEIERMRQEAEKYAEEDRRRK-QLIELKNQADSLLYSYESTLKENGELISEELKQRAE 561

Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
           +K ++ +   +D    L E+L +  EE ++
Sbjct: 562 QKVEQLEAALTDPNISL-EELKQQLEEFQQ 590


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 32.7 bits (74), Expect = 0.64
 Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 15/125 (12%)

Query: 474 DKGLEGDSDEIKRLK----EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
           D  LE D  + + +K     K  + +EE  +++ +        +   +     +K  E V
Sbjct: 272 DAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDE-----EEPPEAAMDKVKLDEPV 326

Query: 530 QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
            + V  +  K+   ++K  +    +LK+++ + E+E    K  S  LK +  +  +  + 
Sbjct: 327 LEGVDLESPKELSSFEKRQA----KLKQQIEQLEKENLAPK--SWTLKGEVTAKKRPQNS 380

Query: 590 LLKEH 594
           LL+E 
Sbjct: 381 LLEED 385


>gnl|CDD|184358 PRK13874, PRK13874, conjugal transfer protein TrbJ; Provisional.
          Length = 230

 Score = 31.8 bits (73), Expect = 0.67
 Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 4/34 (11%)

Query: 117 RQIIQLIAQQANLLAQNEA-SMNQARQAAVAAQA 149
           +Q+  L A  A   AQ  A ++  ARQAA   Q 
Sbjct: 190 QQLADLQALMA---AQGRAQALEAARQAAAEEQG 220


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 30.8 bits (70), Expect = 0.67
 Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 14/106 (13%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNS---AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
           D IK  KE+  + +   +KE        +L   A+   ++ +R + E  K+QK   ++GD
Sbjct: 25  DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER--RKREREIAEERKLQKEREKEGD 82

Query: 539 KKDDK-------YQKGDSDEIKRLKEKLSKTEE-ELKKEKTNSAAL 576
           +  DK       Y+K   +E ++L+E+  + EE E + + T    L
Sbjct: 83  EFADKEKFVTSAYKK-QLEENRKLEEEEKEREELEEENDVTKGKDL 127


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 32.6 bits (75), Expect = 0.69
 Identities = 25/111 (22%), Positives = 54/111 (48%), Gaps = 20/111 (18%)

Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVT-------EQGDKKDDKYQKGDSDEIKRLKEK 558
           A +++Q D++ K+     K  E   K+V           DK D   QK ++++   LK++
Sbjct: 39  ADVQAQLDALNKQ-----KLLEAEDKLVQQDLEQTLALLDKIDR--QKEETEQ---LKQQ 88

Query: 559 LSKTEEELKKEKTNSAALKNQADSVGKE-YDRL-LKEHE-KVQKVVTEQGD 606
           L++   +L++ +    ALK+  D   +E    L L++ E ++ + + +  +
Sbjct: 89  LAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQN 139


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.9 bits (73), Expect = 0.72
 Identities = 18/112 (16%), Positives = 50/112 (44%), Gaps = 2/112 (1%)

Query: 498 LKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKE 557
           L +  +++ + + +   + +E   L +E  ++Q+ + E   +  +  Q+  + E +   E
Sbjct: 54  LSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAE--LE 111

Query: 558 KLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
           +L K    +K+   N+  L  +   + +E   L +E+E ++       + + 
Sbjct: 112 RLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
           of Class IA Phosphoinositide 3-kinase Regulatory
           subunits.  PI3Ks catalyze the transfer of the
           gamma-phosphoryl group from ATP to the 3-hydroxyl of the
           inositol ring of D-myo-phosphatidylinositol (PtdIns) or
           its derivatives. They play an important role in a
           variety of fundamental cellular processes, including
           cell motility, the Ras pathway, vesicle trafficking and
           secretion, immune cell activation, and apoptosis. They
           are classified according to their substrate specificity,
           regulation, and domain structure. Class IA PI3Ks are
           heterodimers of a p110 catalytic (C) subunit and a
           p85-related regulatory (R) subunit. The R subunit
           down-regulates PI3K basal activity, stabilizes the C
           subunit, and plays a role in the activation downstream
           of tyrosine kinases. All R subunits contain two SH2
           domains that flank an intervening helical domain (iSH2),
           which binds to the N-terminal adaptor-binding domain
           (ABD) of the catalytic subunit. In vertebrates, there
           are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
           for different Class IA PI3K R subunits.
          Length = 152

 Score = 31.0 bits (71), Expect = 0.77
 Identities = 28/143 (19%), Positives = 63/143 (44%), Gaps = 18/143 (12%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           +K     S E   L EK +K  +E++        LK QA    +E  ++ +E  + Q   
Sbjct: 14  NKEYLDKSREYDELYEKYNKLSQEIQ--------LKRQALEAFEEAVKMFEEQLRTQ--- 62

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLS-------KTEEELKKEKTNSAALKNQADSVGKE 586
            +   +     ++   +  + LK +L        + EE+L+K+   +  L+ + +S+  E
Sbjct: 63  EKFQKEAQPHEKQRLMENNELLKSRLKELEESKEQLEEDLRKQVAYNRELEREMNSLKPE 122

Query: 587 YDRLLKEHEKVQKVVTEQGDKKD 609
             +L K+ ++  + +  +G  ++
Sbjct: 123 LMQLRKQKDQYLRWLKRKGVSQE 145


>gnl|CDD|202143 pfam02183, HALZ, Homeobox associated leucine zipper. 
          Length = 45

 Score = 28.6 bits (65), Expect = 0.77
 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 19/56 (33%)

Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSK 561
             LK   D++  E D L KE+EK++                    E+  LKEKL +
Sbjct: 8   EVLKRCYDALKAENDSLQKENEKLRA-------------------EVLSLKEKLQE 44



 Score = 27.1 bits (61), Expect = 3.5
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
             LK   D++  E D L KE+EK++  V    +K
Sbjct: 8   EVLKRCYDALKAENDSLQKENEKLRAEVLSLKEK 41


>gnl|CDD|149223 pfam08019, DUF1705, Domain of unknown function (DUF1705).  Some
           members of this family are putative bacterial membrane
           proteins. This domain is found immediately N terminal to
           the sulfatase domain in many sulfatases.
          Length = 156

 Score = 31.3 bits (72), Expect = 0.78
 Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 17/83 (20%)

Query: 3   LQWTLIATVLYFEMAFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILSLF 62
           L   LI  +L       LL +LP L   R+ KI    +++ +  +       +LVIL + 
Sbjct: 64  LSPKLILYLL-------LLGVLPALLLLRV-KIRYRPWLRELLQRLLSILASLLVILLIA 115

Query: 63  FLD-----AI----REMRKYASP 76
           FL      +     +E+RK  +P
Sbjct: 116 FLFYQDYASFFRNNKELRKLINP 138


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 32.4 bits (74), Expect = 0.79
 Identities = 31/131 (23%), Positives = 57/131 (43%), Gaps = 19/131 (14%)

Query: 478 EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
           E  S E+K++  +L      LK E      LK+      +E ++ LK  ++  K   +  
Sbjct: 135 EQKSSELKKVDSQLPDPRALLKGED-----LKTV--ETAEEIEKSLKAMQQKWKKRKKDL 187

Query: 538 DKKDD--KYQK----------GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
             K D  +Y+K           +  E++++KE+  K ++E K +K    + KN   +  K
Sbjct: 188 PNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKK 247

Query: 586 EYDRLLKEHEK 596
           +    L E +K
Sbjct: 248 QLKADLAELKK 258


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 32.3 bits (73), Expect = 0.85
 Identities = 27/117 (23%), Positives = 57/117 (48%), Gaps = 14/117 (11%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE- 535
            E D D I++L  +L K EE L++E++    L+ +   +G E D    +  + ++ ++E 
Sbjct: 366 AEADKDRIQKLSAELLKLEEWLQEERSQREKLEVE---LGIEKDCNRVQLSENRRELSEL 422

Query: 536 -------QGDKKDDKYQKGD-SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
                  Q +K+  + +K +  D I+ L+ +L K  +E  K   ++A  ++    + 
Sbjct: 423 RSALRVLQKEKEQLQEEKQELLDYIRVLELRLDKEADE--KWAEDAATCEDAKSYLH 477


>gnl|CDD|237785 PRK14671, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 621

 Score = 32.4 bits (74), Expect = 0.88
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 11/75 (14%)

Query: 468 EIFEQCDKGLEGDSDEIKR-LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
           E+ E+  + L+G +  + R L E++ +   ELK E+  +A LK Q +S        LK +
Sbjct: 200 EMIEEIVRLLKGKTSALIRSLTEEMQRAAAELKFEE--AAELKDQIES--------LKRY 249

Query: 527 EKVQKVVTEQGDKKD 541
            + QKVV++ G  +D
Sbjct: 250 AERQKVVSQDGADRD 264



 Score = 31.2 bits (71), Expect = 1.7
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 10/61 (16%)

Query: 549 SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
           S  I+ L E++ +   ELK E+  +A LK+Q +S        LK + + QKVV++ G  +
Sbjct: 214 SALIRSLTEEMQRAAAELKFEE--AAELKDQIES--------LKRYAERQKVVSQDGADR 263

Query: 609 D 609
           D
Sbjct: 264 D 264


>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
           Fer (Fes related) tyrosine kinases.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Fes (feline sarcoma), also called
           Fps (Fujinami poultry sarcoma), and Fer (Fes related)
           are cytoplasmic (or nonreceptor) tyrosine kinases that
           play roles in haematopoiesis, inflammation and immunity,
           growth factor signaling, cytoskeletal regulation, cell
           migration and adhesion, and the regulation of cell-cell
           interactions. Although Fes and Fer show redundancy in
           their biological functions, they show differences in
           their expression patterns. Fer is ubiquitously expressed
           while Fes is expressed predominantly in myeloid and
           endothelial cells. Fes and Fer contain an N-terminal
           F-BAR domain, an SH2 domain, and a C-terminal catalytic
           kinase domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules. The
           F-BAR domain of Fes is critical in its role in
           microtubule nucleation and bundling.
          Length = 237

 Score = 31.6 bits (72), Expect = 0.88
 Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
           KE +  T++  K  K ++ AL+S       +   L+K+  K +K   E+  + D++Y+K 
Sbjct: 68  KEIMDSTDQLSKLIKQHAEALES---GTLDKLTLLIKDKRKAKKAYQEERQQIDEQYKK- 123

Query: 548 DSDEIKRLKEKLSKTEEELKKEKT 571
            +DE+++LK +  K  E+ K  K+
Sbjct: 124 LTDEVEKLKSEYQKLLEDYKAAKS 147


>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202).  This
           family consists of several hypothetical bacterial
           proteins of around 335 residues in length. Members of
           this family are found exclusively in Escherichia coli
           and Salmonella species and are often referred to as YggM
           proteins. The function of this family is unknown.
          Length = 308

 Score = 31.8 bits (72), Expect = 0.89
 Identities = 22/130 (16%), Positives = 51/130 (39%), Gaps = 11/130 (8%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSV-GKEYDRLLKEHEKVQKVVTEQGDKK 540
             + R K  +   E+++K         + +AD+  GK  D      E   K + ++   +
Sbjct: 145 SALNRQKNSIPALEKDVKALDKQIVVAQKEADAYWGKGADGKQMTREDAFKKIHKE---R 201

Query: 541 DDKYQKGDSD--EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           DD  ++ DS+   +K  KE         +K+       +     + ++ D  + E  +  
Sbjct: 202 DDFNKQNDSEAFAVKYDKEVYQPAIAACRKQS-----EECYEVPIQQKRDFDINEQRRQT 256

Query: 599 KVVTEQGDKK 608
            + +++  +K
Sbjct: 257 FLKSQELSRK 266



 Score = 30.3 bits (68), Expect = 2.6
 Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 12/80 (15%)

Query: 539 KKDDKYQKGDS-----------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV-GKE 586
           K ++KY   +S             + R K  +   E+++K         + +AD+  GK 
Sbjct: 123 KTNNKYLIVNSEDFNYRFSQIKSALNRQKNSIPALEKDVKALDKQIVVAQKEADAYWGKG 182

Query: 587 YDRLLKEHEKVQKVVTEQGD 606
            D      E   K + ++ D
Sbjct: 183 ADGKQMTREDAFKKIHKERD 202


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 32.1 bits (74), Expect = 0.90
 Identities = 15/87 (17%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
            EI  ++  L   EE++++       L    +   +E ++L   + +++K +        
Sbjct: 105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLAN----- 159

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKK 568
            ++  G +  +  L+++L   EEE  +
Sbjct: 160 -RFSFGPA--LDELEKQLENLEEEFSQ 183



 Score = 31.7 bits (73), Expect = 1.1
 Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 22/131 (16%)

Query: 483 EIKRLKEKLSKTEE-----ELKKEKTNSAALKSQADSVGKEYDRLLKEHE-KVQKVVTEQ 536
           EI+ LKE++ +        +L + +  +  ++ + D +   YD L  E E K +K V + 
Sbjct: 257 EIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQL---YDIL--EREVKARKYVEKN 311

Query: 537 GDKKDDKYQKGD------SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
            D   D  +          +EI R+K+  +  E EL+  +     L+ Q +S+ K+YD +
Sbjct: 312 SDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVR----QLEKQLESLEKQYDEI 367

Query: 591 LKEHEKVQKVV 601
            +   + Q++ 
Sbjct: 368 TERIAE-QEIA 377


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
           describes peptide chain release factor 1 (PrfA, RF-1),
           and excludes the related peptide chain release factor 2
           (PrfB, RF-2). RF-1 helps recognize and terminate
           translation at UAA and UAG stop codons. The
           mitochondrial release factors are prfA-like, although
           not included above the trusted cutoff for this model.
           RF-1 does not have a translational frameshift [Protein
           synthesis, Translation factors].
          Length = 360

 Score = 32.0 bits (73), Expect = 0.92
 Identities = 22/95 (23%), Positives = 50/95 (52%), Gaps = 11/95 (11%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL------KEHEKVQKVVTE 535
           ++++ L E+  + E  L   +  S   K +   + KEY +L       +E+++ Q+ + E
Sbjct: 7   EKLESLLERYEELEALLSDPEVISDQDKLRK--LSKEYSQLEEIVDCYREYQQAQEDIKE 64

Query: 536 QG---DKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
                ++ D + ++   +E++ L+EK+ + EE+LK
Sbjct: 65  AKEILEESDPEMREMAKEELEELEEKIEELEEQLK 99


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 32.3 bits (73), Expect = 0.94
 Identities = 22/128 (17%), Positives = 50/128 (39%), Gaps = 16/128 (12%)

Query: 483  EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK--EHEKVQKVVTEQGDKK 540
            E +            L   +    ALK+            L       + K++    + +
Sbjct: 2605 ETEIPDALEKLVSGPLFVHEKALNALKA-------CGSLFLWVLARYLLAKLMLSISNME 2657

Query: 541  DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKV 600
                    +DEI  L   L K+ +E+++E++ S  ++++ D++  EY+  +K  E ++  
Sbjct: 2658 Q-------TDEIAVLLHNLKKSRKEIEEEESESMEIEDRIDALKSEYNASVKRLESIRVE 2710

Query: 601  VTEQGDKK 608
            +    +K 
Sbjct: 2711 IAMFDEKA 2718


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.7 bits (70), Expect = 0.98
 Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 17/103 (16%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG-KEYDRLLKEHEKVQKVVTEQGDKK 540
           D  K +K K  +T  E+K        LK     +G K++ +LLK  +KV+K++    DKK
Sbjct: 50  DAWKAIK-KHKQTTPEIK---ECCKDLKV----LGKKDFKKLLKWRKKVRKLLGL--DKK 99

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEE------ELKKEKTNSAALK 577
           + + ++ +  E++ L E+    E       +LK+EK      K
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERK 142


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 22/121 (18%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E +R +E L + EEEL+K                KE   L +   K  K + E   K+  
Sbjct: 22  EFERREELLKQREEELEK----------------KEE-ELQESLIKFDKFLKENEAKRRR 64

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
             +K + +E K  KEK    EEE+K+ K     LK + + + ++ +      E ++ VV 
Sbjct: 65  AEKKAE-EEKKLRKEK----EEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVE 119

Query: 603 E 603
            
Sbjct: 120 R 120


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.8 bits (72), Expect = 1.0
 Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 3/128 (2%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK-K 540
            E +RLK+   +  +  +++K    A K       ++ ++  K   + +K       K  
Sbjct: 98  AEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAA 157

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKV 600
            +  +   + E K+  E+ +K  EE K  K  +AA K +A++  K      K   +  K 
Sbjct: 158 AEAAKLKAAAEAKKKAEEAAKAAEEAKA-KAEAAAAKKKAEAEAKAAAEKAKAEAEA-KA 215

Query: 601 VTEQGDKK 608
             E+  + 
Sbjct: 216 KAEKKAEA 223



 Score = 30.3 bits (68), Expect = 3.0
 Identities = 24/98 (24%), Positives = 41/98 (41%), Gaps = 8/98 (8%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK----EYDRLLKEHEKVQKVVTEQGD 538
           E K+  E+ +K  EE K  K  +AA K +A++  K    +     +   K +K      +
Sbjct: 168 EAKKKAEEAAKAAEEAKA-KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAE 226

Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
           +K    +K  + + K  K   +    E    K  +AAL
Sbjct: 227 EKAAAEKKKAAAKAKADKAAAAAKAAE---RKAAAAAL 261


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 32.2 bits (73), Expect = 1.0
 Identities = 21/190 (11%), Positives = 45/190 (23%), Gaps = 22/190 (11%)

Query: 433 HVNIYKCRDLANNCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLS 492
           +       +            + Y            ++ Q     +    +   LKE++ 
Sbjct: 514 NPARQDIDNPGPLTRRMQRGEQTYAQLETSEED---VYHQLTSERK----QRASLKEQMQ 566

Query: 493 KTEEELKKEKTNSAALKSQADSVGKEYDRLLK------EHEK----VQKVVTEQGDKKDD 542
           + ++            K    ++     RL        E E      Q  +  +   + D
Sbjct: 567 EIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQD 626

Query: 543 KYQKGDSD-----EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
                        E+      L   +  L +E+    AL  +         R L   +  
Sbjct: 627 LQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQ 686

Query: 598 QKVVTEQGDK 607
            +       K
Sbjct: 687 SEKEQLTYWK 696


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 19/64 (29%), Positives = 36/64 (56%), Gaps = 5/64 (7%)

Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL----KNQADS 582
           +K++ V+  +  + D+K +K D D+ K+ +E   K E ELK  K     +    K++A++
Sbjct: 28  DKIKAVIDSRQSEIDNKIEKADEDQ-KKARELKLKNERELKNAKEEGKKIVEEYKSKAEN 86

Query: 583 VGKE 586
           V +E
Sbjct: 87  VYEE 90


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 31.9 bits (72), Expect = 1.0
 Identities = 25/147 (17%), Positives = 47/147 (31%), Gaps = 18/147 (12%)

Query: 468 EIFEQCDKGLEGDSD-EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
           E+ +   K     +D   K  +E + +   E +     +AA +SQ            +  
Sbjct: 442 ELNKALAKAKILQADKAAKAYQEDILQ--REAQSRGKTAAAERSQE-----------QMT 488

Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKNQADS 582
             ++ ++  Q    D    K  + + K L  K     +  +EE K+ +        +   
Sbjct: 489 AALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKK 548

Query: 583 VGKEYDRLLKEHEKVQKVVTEQGDKKD 609
              EY   L E    Q    E   +  
Sbjct: 549 ETLEYTSQLAELLDQQADRFELSAQAA 575


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 31.7 bits (72), Expect = 1.0
 Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 11/124 (8%)

Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEYDR 521
           S+KR E  EQ  K  +  ++E   L++K +  +E LK+ EK   AA + +     K+ + 
Sbjct: 73  SAKRAE--EQRKKKEQQQAEE---LQQKQAAEQERLKQLEKERLAAQEQK-----KQAEE 122

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
             K+    QK   E   K     +     E KR      K   E KK+    AA K  A+
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182

Query: 582 SVGK 585
           +  K
Sbjct: 183 AKKK 186


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 18/99 (18%), Positives = 47/99 (47%), Gaps = 8/99 (8%)

Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
            E  ++ +     E++ LKE L    E LK  +    +L+            L KE  ++
Sbjct: 95  LEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEE-------SIKELAKELREL 147

Query: 530 QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
           ++ + E+ ++  ++ ++   + ++RL+E + + +  L++
Sbjct: 148 RQDLREEVEELREELER-LQENLQRLQEAIQELQSLLEQ 185


>gnl|CDD|197711 smart00414, H2A, Histone 2A. 
          Length = 106

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 19/54 (35%), Positives = 25/54 (46%), Gaps = 8/54 (14%)

Query: 166 YLTPNLLVTRSYSTRVARSFSAQNGISDLKPRTLDLAIENLPELPGQLLCAFTI 219
           YLT  +L        +A + +  N    + PR L LAI N  EL  +LL   TI
Sbjct: 42  YLTAEVL-------ELAGNAARDNKKRRITPRHLQLAIRNDEEL-NKLLKGVTI 87


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 19/127 (14%), Positives = 48/127 (37%), Gaps = 11/127 (8%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           ++   K ++++ EE+ ++E +       + +     Y +L  E  + +  +    + +  
Sbjct: 276 DVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQLQIELAEAEAEIASL-EARVA 334

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE--KVQKV 600
           +        I+RL+  L      + + +     L    +     Y++LL   E  +V K 
Sbjct: 335 ELTA----RIERLESLL----RTIPEVEAELTQLNRDYEVNKSNYEQLLTRRESAEVSKQ 386

Query: 601 VTEQGDK 607
           +  Q   
Sbjct: 387 MEVQDKA 393


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 28/124 (22%), Positives = 57/124 (45%), Gaps = 9/124 (7%)

Query: 478 EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
           E +  EI   ++++ K EE +++ +  ++ LK        E + L +E EK++  +    
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKR-------ELEELKREIEKLESELERFR 463

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
            +  DK +K    EI+    ++ + E+EL+++K     L+ +   + K     L      
Sbjct: 464 REVRDKVRK--DREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTP 521

Query: 598 QKVV 601
            KVV
Sbjct: 522 VKVV 525



 Score = 30.1 bits (68), Expect = 4.4
 Identities = 25/130 (19%), Positives = 53/130 (40%), Gaps = 31/130 (23%)

Query: 476 GLEGDSDEIK-------RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
           G+  D + IK        L E LSK +EE +             +  G E     +    
Sbjct: 372 GIWKDVERIKALVIRGYPLAEALSKVKEEERPR-----------EKEGTE--EEERREIT 418

Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
           V +   ++ ++  ++ ++    E   LK +L    EELK+E      L+++ +   +E  
Sbjct: 419 VYEKRIKKLEETVERLEE----ENSELKREL----EELKREIEK---LESELERFRREVR 467

Query: 589 RLLKEHEKVQ 598
             +++  +++
Sbjct: 468 DKVRKDREIR 477



 Score = 29.3 bits (66), Expect = 7.9
 Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 15/102 (14%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           ++ ++ +E   +E   LK +L    EELK+E      L+S+ +   +E    +++  ++ 
Sbjct: 425 KKLEETVERLEEENSELKREL----EELKREIEK---LESELERFRREVRDKVRKDREI- 476

Query: 531 KVVTEQGDKKDDKYQKGDSDE---IKRLKEKLSKTEEELKKE 569
                  D++ ++ +K   ++   ++ L+ KL++  +  K E
Sbjct: 477 ----RARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLE 514


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 31.6 bits (73), Expect = 1.2
 Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 23/101 (22%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-------Q 536
           + +L E L +  EEL+   ++   +  Q     K + +L KE+ +++ +V         Q
Sbjct: 5   LDKL-EALEERYEELEALLSDPEVISDQ-----KRFRKLSKEYAELEPIVEAYREYKQAQ 58

Query: 537 GDKKDDK---YQKGDSD-------EIKRLKEKLSKTEEELK 567
            D ++ K    ++ D +       E+K L+E+L + EEELK
Sbjct: 59  EDLEEAKEMLEEESDPEMREMAKEELKELEERLEELEEELK 99


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 31.7 bits (73), Expect = 1.2
 Identities = 26/140 (18%), Positives = 58/140 (41%), Gaps = 21/140 (15%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           ++       E++ L +   + EEE + + + +A      + V +++  L K+++K++K  
Sbjct: 179 EEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQ 238

Query: 534 TEQGD---KKDDKYQKGDSDEIKRLKEKL------SKTEEEL--------KKEKTNSAAL 576
            ++ +    +  KY K      ++LKE+L      SK  +EL        K+ +     L
Sbjct: 239 EKKVEGRLAQHKKYAK----LREKLKEELKSLRLTSKQIDELVEQLRDINKRVRGQEREL 294

Query: 577 KNQADSVGKEYDRLLKEHEK 596
               + +       LK  + 
Sbjct: 295 LRLVERLKMPRKDFLKLFQG 314


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 22/104 (21%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 476 GLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
           G+ G  +E     +      EEL +   N  ALK   + + +E D L  + +KV +   E
Sbjct: 187 GIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKNDGDKVAE---E 243

Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
           +  + D+++++ + D+ + L++  ++ +EE  K       +++ 
Sbjct: 244 EEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHD 287


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 31.5 bits (72), Expect = 1.3
 Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 14/92 (15%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +E+K L+E+L + EEEL+K K     L               K   ++     EQ +K  
Sbjct: 334 EELKELEEELKELEEELEKIKKLLKKLPK-------------KARGQLPPEKREQLEKLL 380

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
           +  +K  S+E++ L+E+L + +EEL+   +  
Sbjct: 381 ETKEKL-SEELEELEEELKELKEELESLYSEG 411



 Score = 31.5 bits (72), Expect = 1.4
 Identities = 23/96 (23%), Positives = 47/96 (48%), Gaps = 3/96 (3%)

Query: 492 SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDE 551
           +KTE E+     +   LK +   + +E   L +E EK++K++ +   K   +      ++
Sbjct: 319 TKTEIEVG---VDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQ 375

Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
           +++L E   K  EEL++ +     LK + +S+  E 
Sbjct: 376 LEKLLETKEKLSEELEELEEELKELKEELESLYSEG 411


>gnl|CDD|215878 pfam00362, Integrin_beta, Integrin, beta chain.  Integrins have
           been found in animals and their homologues have also
           been found in cyanobacteria, probably due to horizontal
           gene transfer. The sequences repeats have been trimmed
           due to an overlap with EGF.
          Length = 424

 Score = 31.4 bits (72), Expect = 1.3
 Identities = 11/36 (30%), Positives = 15/36 (41%), Gaps = 10/36 (27%)

Query: 284 SCTQCVSSDFPCDWCVD--------GHRCTHDTAEN 311
           +C +C+ S   C WC            RC  DT E+
Sbjct: 1   TCGECIRSGPSCAWCTQENFTGEFNSPRC--DTLES 34


>gnl|CDD|239398 cd03124, alpha_CA_prokaryotic_like, Carbonic anhydrase alpha,
           prokaryotic-like subfamily. Carbonic anhydrases (CAs)
           are zinc-containing enzymes that catalyze the reversible
           hydration of carbon dioxide in a two-step mechanism: a
           nucleophilic attack of a zinc-bound hydroxide ion on
           carbon dioxide, followed by the regeneration of the
           active site by ionization of the zinc-bound water
           molecule and removal of a proton from the active site.
           They are ubiquitous enzymes involved in fundamental
           processes like photosynthesis, respiration, pH
           homeostasis and ion transport. Most alpha CAs are
           monomeric enzymes. The zinc ion is complexed by three
           histidines. This sub-family includes bacterial carbonic
           anhydrase alpha, as well as plant enzymes such as
           tobacco nectarin III and yam dioscorin and, carbonic
           anhydrases from molluscs, such as nacrein, which are
           part of the organic matrix layer in shells. Other
           members of this family may be involved in maintaining pH
           balance, in facilitating transport of carbon dioxide or
           carbonic acid, or in sensing carbon dioxide levels in
           the environment.  Dioscorin is the major storage protein
           of yam tubers and may play a role as an antioxidant.
           Tobacco Nectarin may play a role in the maintenace of pH
           and oxidative balance in nectar. Mollusc nacrein may
           participate in calcium carbonate crystal formation of
           the nacreous layer.  This subfamily also includes three
           alpha carbonic anhydrases from Chlamydomonas reinhardtii
           (CAH 1-3).  CAHs1-2 are localized in the periplasmic
           space. CAH1 faciliates the movement of carbon dioxide
           across the plasma membrane when the medium is alkaline.
           CAH3 is localized to the thylakoid lumen and provides
           CO2 to Rubisco.
          Length = 216

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 25/105 (23%), Positives = 35/105 (33%), Gaps = 24/105 (22%)

Query: 53  GCILVILSLF-------FLDAI-REMRKYASPEVKEEAHGHLDAEMQNNMKLFRAQRNFY 104
           G + V+  LF       FL  I   M K    EV   A    +        L    R++Y
Sbjct: 108 GQLAVVAVLFEEGKENPFLKKILDNMPKKEGTEVNLPAILDPNE-------LLPESRSYY 160

Query: 105 ISGFSL--------FLWLVIRQIIQLIAQQANLLAQNEASMNQAR 141
               SL          W+V++Q I +  +Q           N AR
Sbjct: 161 RYEGSLTTPPCSEGVRWIVLKQPITISKEQLAKFRAAVYPNN-AR 204


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 31.8 bits (72), Expect = 1.3
 Identities = 20/141 (14%), Positives = 47/141 (33%), Gaps = 13/141 (9%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
             L    D   R + ++   E +L  E   +   K     + + + ++   ++ V K   
Sbjct: 242 NELSSLEDMKNRYESEIKTAESDLSMELEKNNYYK----ELEERHMKI--INDPVYKNRN 295

Query: 535 EQGDKKDDKYQKGDSDEI-KRLKEKLSKTEEELKK----EKTNSAALKNQA--DSVGKEY 587
              D    K    +  +I   +  +++K    +KK    +K  +  +K ++  D +  + 
Sbjct: 296 YINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQI 355

Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
             L          +      K
Sbjct: 356 LELEGYEMDYNSYLKSIESLK 376



 Score = 31.8 bits (72), Expect = 1.5
 Identities = 22/99 (22%), Positives = 41/99 (41%), Gaps = 8/99 (8%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E K L EKL    +  KK+    A + S    + +   R+    + ++K      D K +
Sbjct: 640 ENKILIEKLRGKIDNYKKQ---IAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKAN 696

Query: 543 KYQKGDSDEIKR-----LKEKLSKTEEELKKEKTNSAAL 576
           + +   + EI R     L ++++   E L+  K    A+
Sbjct: 697 RARLESTIEILRTRINELSDRINDINETLESMKKIKKAI 735



 Score = 31.4 bits (71), Expect = 1.5
 Identities = 29/157 (18%), Positives = 63/157 (40%), Gaps = 18/157 (11%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKE-----------KTNSAALKSQADSVGKEY 519
           E+  K +E  S  I  + +      + +KKE            +  ++L  +  ++ +  
Sbjct: 380 EEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENL 439

Query: 520 DRL------LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
           D L      L          T  G++K +      +++  RL+EK+ + E E+K      
Sbjct: 440 DELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKI 499

Query: 574 AALKNQADSV-GKEYDRLLKEHEKVQKVVTEQGDKKD 609
             LK + + +  +E ++ + E+ K++    +  D K 
Sbjct: 500 VDLKKRKEYLESEEINKSINEYNKIESARADLEDIKI 536



 Score = 30.3 bits (68), Expect = 3.6
 Identities = 26/134 (19%), Positives = 45/134 (33%), Gaps = 9/134 (6%)

Query: 482 DEIKRLKEKLSK---TEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
           D I  L+ ++S     EE+LK        +K Q     K +   LKE E++        D
Sbjct: 173 DVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD 232

Query: 539 KKDD-----KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK-EYDRLLK 592
             ++            D   R + ++   E +L  E   +   K   +   K   D + K
Sbjct: 233 DYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYK 292

Query: 593 EHEKVQKVVTEQGD 606
               +      + D
Sbjct: 293 NRNYINDYFKYKND 306


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 16/101 (15%)

Query: 491 LSKTEE-----ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQ 545
           LSK +E     EL+ +     A ++QA       +R   E E+ +K + EQ  ++  + Q
Sbjct: 56  LSKMQEKTRQAELEAKIKEYEAQQAQAKL-----ERARVEAEERRKTLQEQTQQEQQRAQ 110

Query: 546 KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
             D    KR +++L       ++ + N   LK Q +SV ++
Sbjct: 111 YQDELARKRYQKEL------EQQRRQNEELLKMQEESVLRQ 145


>gnl|CDD|226703 COG4252, COG4252, Predicted transmembrane sensor domain [Signal
           transduction mechanisms].
          Length = 400

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 3/71 (4%)

Query: 140 ARQAAVAAQALLDGPRGKWFGSHVWAYLTPNLLVTRSYSTRVARSFSAQNGISDLKPRTL 199
            RQA +    L+    GK     +    TP      S + ++A  + A  GIS   P   
Sbjct: 148 PRQAQIGFSDLILDSDGKVRRLLL--AATPGPEPKYSLALKLALQYLASLGISPKYPDPE 205

Query: 200 DLAIENLPELP 210
            L +     LP
Sbjct: 206 RLRL-GKTTLP 215


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 22/102 (21%), Positives = 40/102 (39%), Gaps = 14/102 (13%)

Query: 62  FFLDAIREMRKYASPEVKEE------AHGHLDAEMQNNMKL---FRAQRNFYI-SGFSLF 111
           F L  + ++R+    + K E          L+ ++Q  +K    +  Q    + +G S  
Sbjct: 2   FRLQKLLDLREKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSAL 61

Query: 112 LWL----VIRQIIQLIAQQANLLAQNEASMNQARQAAVAAQA 149
                   IRQ+ Q I QQ   LA  +  +   R+  + A+ 
Sbjct: 62  ELSNYQRFIRQLDQRIQQQQQELALLQQEVEAKRERLLEARR 103


>gnl|CDD|221959 pfam13177, DNA_pol3_delta2, DNA polymerase III, delta subunit.  DNA
           polymerase III, delta subunit (EC 2.7.7.7) is required
           for, along with delta' subunit, the assembly of the
           processivity factor beta(2) onto primed DNA in the DNA
           polymerase III holoenzyme-catalyzed reaction. The delta
           subunit is also known as HolA.
          Length = 161

 Score = 30.2 bits (69), Expect = 1.5
 Identities = 22/77 (28%), Positives = 31/77 (40%), Gaps = 14/77 (18%)

Query: 448 LCLALPEKYGCGWCESSKRCEIFEQCD-KGLEGDS-----DEIKRLKEKLSKTEEELKK- 500
            C    +   CG C+S KR E     D   +E +      D+I+ L E+ SK   E KK 
Sbjct: 44  FCENPQDDEPCGECKSCKRIENGNHPDVIIIEPEGKSIKIDQIRELIEEFSKKPFEGKKK 103

Query: 501 -------EKTNSAALKS 510
                  EK   +A  +
Sbjct: 104 VYIIEDAEKMTESAANA 120


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 31.3 bits (72), Expect = 1.5
 Identities = 34/156 (21%), Positives = 68/156 (43%), Gaps = 40/156 (25%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQA---------DSVGKEYDRLLKEHEKVQK---- 531
           + LK+ + +  EELKKE    AA +S+          +   ++ +   +E E+  K    
Sbjct: 87  RELKKDMEELVEELKKEI--PAAFESEEYEARKEEIEEEFQEKREEAFEELEEEAKEKGF 144

Query: 532 --VVTEQG-------DKK---DDKYQKGDSDEIKRLKEKLSKTEEEL--------KKEKT 571
             V T  G       D +   +++++    +E + L+EK+ + EEEL        + E+ 
Sbjct: 145 ALVRTPGGFVFAPLKDGEPLTEEEFEALPEEEREELEEKIDELEEELQEILRQLRELERE 204

Query: 572 NSAALK--NQ---ADSVGKEYDRLLKEHEKVQKVVT 602
               L+  ++     +VG   D L +++  + KV+ 
Sbjct: 205 AREKLRELDREVALFAVGPLLDELKEKYADLPKVLA 240


>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain.  Mitosin or
           centromere-associated protein-F (Cenp-F) is found bound
           across the centromere as one of the proteins of the
           outer layer of the kinetochore. Most of the
           kinetochore/centromere functions appear to depend upon
           binding of the C-terminal par to f the molecule, whereas
           the N-terminal part, here, may be a cytoplasmic player
           in controlling the function of microtubules and dynein.
          Length = 288

 Score = 31.3 bits (70), Expect = 1.5
 Identities = 31/114 (27%), Positives = 58/114 (50%), Gaps = 9/114 (7%)

Query: 477 LEGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK--EH-EKV 529
           LEG  D++K+ K++    L   E  L+K+K      K++  ++ +E  RL++  EH EK 
Sbjct: 23  LEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKT 82

Query: 530 QKVVTEQGDKKDDK--YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
           ++ ++ +   K+ +  +Q+G  +  K+  EKL +  +  K E   S      AD
Sbjct: 83  KQKISHELQVKESQVNFQEGQLNSCKKQIEKLEQELKRCKSELERSQQAAQSAD 136


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 35/144 (24%), Positives = 70/144 (48%), Gaps = 26/144 (18%)

Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
           R +IFE  D  +E + D  KR+ +  +K EE+                S+ +EY+  L+E
Sbjct: 7   RVQIFE--DPTIEKEVDIRKRILKIFNKREEDF--------------PSL-REYNDYLEE 49

Query: 526 HEKV-----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
            E +       +  E+ ++K D+Y+K + D I R K +L++ +EEL++       +K + 
Sbjct: 50  VETIVFNLTNGIDVEETEEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEK 109

Query: 581 DSVGKEYDRLLKEHEKVQKVVTEQ 604
               + + +  ++ +K+ K   +Q
Sbjct: 110 ----RLHLQKEEQEQKMAKEKDKQ 129


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 30.2 bits (69), Expect = 1.6
 Identities = 27/110 (24%), Positives = 60/110 (54%), Gaps = 11/110 (10%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKSQADSVGKEYD 520
           ES       +Q +K  +    E+++L+++L K +E+L+K+  T S A + + +   KE  
Sbjct: 12  ESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKE---KELQ 68

Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
           + ++E ++ Q+       K     QK   +E++++ +K++K  +E+ K+K
Sbjct: 69  KKVQEFQRKQQ-------KLQQDLQKRQQEELQKILDKINKAIKEVAKKK 111



 Score = 29.1 bits (66), Expect = 3.5
 Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 9/81 (11%)

Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKN 578
            ++L+E    +    +Q +K+  K Q     E+++L+++L K +E+L+K+  T S A + 
Sbjct: 7   QKILQESPAGKAAQ-KQLEKEFKKRQA----ELEKLEKELQKLKEKLQKDAATLSEAARE 61

Query: 579 QADSVGKEYDRLLKEHEKVQK 599
           + +   KE  + ++E ++ Q+
Sbjct: 62  KKE---KELQKKVQEFQRKQQ 79


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 31.2 bits (71), Expect = 1.6
 Identities = 18/71 (25%), Positives = 33/71 (46%)

Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           KK     +  SD  ++LK K  +  E+L  E       K + +   K+ +RL +  EK++
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLE 356

Query: 599 KVVTEQGDKKD 609
              T++ + K 
Sbjct: 357 VQATDKEENKT 367



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 25/104 (24%), Positives = 54/104 (51%), Gaps = 10/104 (9%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           +K +E   ++IK LK +L + ++ +   +  S   +       ++ ++L  E ++ +K  
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEK 335

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK-KEKTNSAAL 576
                KK++K +K    +I+RL+E++ K E +   KE+  + AL
Sbjct: 336 -----KKEEKKKK----QIERLEERIEKLEVQATDKEENKTVAL 370


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 31.1 bits (70), Expect = 1.6
 Identities = 28/156 (17%), Positives = 60/156 (38%), Gaps = 14/156 (8%)

Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
            R E+             E +  + +L K  +E +  +   AA +       +E  RL K
Sbjct: 86  LRTEL-GTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144

Query: 525 EHEKVQ---KVVTEQGDKKDDKYQ---------KGDSDEIKRLKEKLSKTEEELKKEKTN 572
           + + +Q   K + EQ  + + + Q         +  + ++K     L     ++++E  N
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQN 204

Query: 573 SAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
            A   N A +  +E  R     ++  + + +Q D +
Sbjct: 205 LATRANAAQARTEELARRAAAAQQTAQAI-QQRDAQ 239



 Score = 31.1 bits (70), Expect = 2.0
 Identities = 20/113 (17%), Positives = 49/113 (43%), Gaps = 5/113 (4%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           K L+  + ++K     L     ++++E  N A   + A +  +E  R     ++  + + 
Sbjct: 175 KQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQ 234

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
           ++  +   K Q     +I    E++ + E +L++ +T  A L+ +   +   Y
Sbjct: 235 QRDAQISQKAQ-----QIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYY 282


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 17/89 (19%), Positives = 44/89 (49%), Gaps = 4/89 (4%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E + LKE++ + EE+L++ K     L+       +  + +    +K  +++ ++ + K +
Sbjct: 40  ENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEAEAKAE 99

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
           +     ++E K+L  +     +ELK++  
Sbjct: 100 RIVNDANEEAKKLATEY----DELKRQSR 124



 Score = 27.6 bits (62), Expect = 8.5
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 26/66 (39%)

Query: 513 DSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
           D V K+Y+ L KE                           + LKE++ + EE+L++ K  
Sbjct: 28  DQVIKDYEALYKE--------------------------NEELKEEIERLEEKLEEYKEL 61

Query: 573 SAALKN 578
              L+ 
Sbjct: 62  EETLQK 67


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.8 bits (70), Expect = 1.7
 Identities = 22/129 (17%), Positives = 53/129 (41%), Gaps = 15/129 (11%)

Query: 484 IKRLKEKLSKTEEELKKEKT-NSAALKSQADSVGKEYDRLLKE------------HEKVQ 530
           I + +EK   ++E++K  KT +   ++++  +  K+ ++L +E            H    
Sbjct: 69  ILKREEKEVLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELHLTDSGSAGPKHIVFV 128

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
               EQ      +Y     + + R + +      +L+K        K ++    ++  + 
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRP--RISQLEKTSLVDEKQKKKSAKKKRKLYKE 186

Query: 591 LKEHEKVQK 599
           LKE ++ +K
Sbjct: 187 LKERKEREK 195


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 22/124 (17%), Positives = 51/124 (41%), Gaps = 6/124 (4%)

Query: 482 DEIKRLKEKL-----SKTEEELKKEKTNSAALK-SQADSVGKEYDRLLKEHEKVQKVVTE 535
            EI+ L+ +L     S  EE  +  K N    K    +   +   + LK+  K +    +
Sbjct: 392 AEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKK 451

Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
           + D++++   + ++   K   + L ++E+  K+E+      +N          +  K+ +
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511

Query: 596 KVQK 599
             +K
Sbjct: 512 SKKK 515


>gnl|CDD|187794 cd09663, Csm4_III-A, CRISPR/Cas system-associated RAMP superfamily
           protein Csm4.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           RAMP superfamily protein.
          Length = 301

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 11/79 (13%)

Query: 474 DKGLEGDSDEIKR--------LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
              LE    ++K+        L+  LS   +E K+ K  +   K + +  G    +  K 
Sbjct: 77  LNLLEKKKKKLKKISYIPIDELESWLSGEADEKKQLKNIT---KIKTEGNGDNKGKPHKV 133

Query: 526 HEKVQKVVTEQGDKKDDKY 544
             +  KV  ++ + K + Y
Sbjct: 134 LIEEPKVSIDRIEIKSNLY 152


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 483 EIKRLK-EKLSKTEEELKKEKT---NSAALKSQADSVGKEYDRLLKEHEKV-QKVVTEQG 537
           E+KRLK  K  + EE+L+K K       A  S+ D    ++D   ++ ++   K+  +  
Sbjct: 14  ELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLADGDFDP--EKWDEEMAKLFGDDY 71

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
            +++D+ +K   D+ + + + +   +EE
Sbjct: 72  YEEEDEEEKPTWDDDEDIGDLVPDEDEE 99


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 10/112 (8%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E++   ++L    E L + +   A +K      G   + +L+  EK++    E+ D+ DD
Sbjct: 288 ELQNYLDELEFDPERLNEIEERLAQIKRLKRKYGASVEEVLEYAEKIK----EELDQLDD 343

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ-ADSVGKEYDRLLKE 593
             +  ++     L+E++ K EEEL K     + ++ + A+ + K  ++ LK 
Sbjct: 344 SDESLEA-----LEEEVDKLEEELDKAAVALSLIRRKAAERLAKRVEQELKA 390


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 8/79 (10%)

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL--------KKEKTNSAA 575
            E ++ ++   ++G KK +K  K D D+      +L    +E         KK+K     
Sbjct: 8   NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67

Query: 576 LKNQADSVGKEYDRLLKEH 594
            KN  ++    YD  +   
Sbjct: 68  KKNLGEAYDLAYDLPVVWS 86


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 21/114 (18%)

Query: 501 EKTNSAALKSQAD--------SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDS--- 549
              ++A LK+  D        S+G+   +  +E      ++  QG +  D     +    
Sbjct: 11  AAADAAKLKTDKDKLSYAIGASLGRYLKQQGEEQLDKDALL--QGLQ--DALAGKELKLT 66

Query: 550 -DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
            +EI++  +   K  +  ++ K    A +N+A   G+ +   L E+ K + V T
Sbjct: 67  DEEIQKALQAFQKKLQAKQQAKAEKKAEENKAA--GEAF---LAENAKKEGVKT 115


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.0 bits (70), Expect = 2.0
 Identities = 30/130 (23%), Positives = 57/130 (43%), Gaps = 3/130 (2%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKS--QADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           +  K+L+++  + E++   E+     L+    A+   K+ ++  K+ E+ QK   E   K
Sbjct: 68  ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127

Query: 540 -KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
              +   K +++  K+ KE+  K  EE  K K  + A K  A++  K       + E   
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA 187

Query: 599 KVVTEQGDKK 608
           K   E+   K
Sbjct: 188 KAKAEEAKAK 197



 Score = 29.4 bits (66), Expect = 6.0
 Identities = 25/101 (24%), Positives = 39/101 (38%), Gaps = 15/101 (14%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
           K+  E  +K E E +K+       K QA     E +   K   + +        KK  + 
Sbjct: 127 KQAAEAKAKAEAEAEKKAK--EEAKKQA-----EEEAKAKAAAEAK--------KKAAEA 171

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
           +K    E K   E  +K + E  K K  +A  K  A++  K
Sbjct: 172 KKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 29.9 bits (68), Expect = 2.2
 Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
           KE  K  K + E+        ++  +D+IK+L+++   +E+E+KK +     L+   D  
Sbjct: 107 KELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAE---EDLQKLTDEY 163

Query: 584 GKEYDRLLKEHEK 596
            K+ D LLK  EK
Sbjct: 164 IKKIDELLKSKEK 176


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 30.9 bits (70), Expect = 2.2
 Identities = 21/105 (20%), Positives = 39/105 (37%), Gaps = 19/105 (18%)

Query: 493 KTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-QKGDSDE 551
           K E+  ++++ +    K + D+ G +            K      D        +  S +
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDD----------KSKKAKTDRDVSTSTAASQKKSSD 240

Query: 552 IKRLKEKLSKTEEELK---KEKTNSAALK-----NQADSVGKEYD 588
           ++   E  SK    LK   K+  ++A L+     N  D+ G E D
Sbjct: 241 LESKLEAQSKELWSLKDDLKKHVSTAELREMLEANGQDTSGSELD 285


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 21/144 (14%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG------KEYDRLLK-EH 526
              +E    EI  LK K    E++          L+SQA+         KE  +L+K E 
Sbjct: 38  QNPVEDLEKEIADLKAKNKDLEDKY---------LRSQAEIQNMQNRYAKERAQLIKYES 88

Query: 527 EKVQKVVTEQGDKKDDKYQ-KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
           + + K V    D  +     K D +  K+LK+ +  T + L K   +    + +AD  G+
Sbjct: 89  QSLAKDVLPAMDNLERALAVKADDEAAKQLKKGVQMTLDHLVKALKDHGVTEIKAD--GE 146

Query: 586 EYDRLLKEHEKVQKVVTEQGDKKD 609
           ++D  L  H+ VQ V  E  D+KD
Sbjct: 147 KFDPTL--HQAVQTVAAENDDQKD 168



 Score = 29.8 bits (67), Expect = 3.1
 Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 4/91 (4%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE---QGDKKDDKY 544
           KE+    ++  ++E T+ A  K   ++  +E        E ++K + +   +    +DKY
Sbjct: 3   KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDKY 62

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAA 575
            +  + EI+ ++ + +K   +L K ++ S A
Sbjct: 63  LRSQA-EIQNMQNRYAKERAQLIKYESQSLA 92


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 30.6 bits (70), Expect = 2.3
 Identities = 19/94 (20%), Positives = 35/94 (37%), Gaps = 29/94 (30%)

Query: 481 SDEIKRLKEKLSKTEEELKKEK-TNSAALKSQADSV---------GKEYDRLLKEHEKVQ 530
            DEI  L        +E K++        + QAD++           E ++L KE ++++
Sbjct: 368 IDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELE 427

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
           K                   EI+ L++ L+  E 
Sbjct: 428 K-------------------EIEDLEKILASEEW 442



 Score = 29.1 bits (66), Expect = 8.8
 Identities = 14/61 (22%), Positives = 26/61 (42%), Gaps = 10/61 (16%)

Query: 549 SDEIKRLKEKLSKTEEELKKEK-TNSAALKNQADSV---------GKEYDRLLKEHEKVQ 598
            DEI  L        +E K++        + QAD++           E ++L KE ++++
Sbjct: 368 IDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELE 427

Query: 599 K 599
           K
Sbjct: 428 K 428


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 17/72 (23%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
           +  +  D   QK +++ +K   E L K  EEL+KE      L+ +  ++ ++Y  L+   
Sbjct: 93  KTTNPSDQALQK-ENERLKNQNESLQKRNEELEKEL---EKLRQRLSTIEEDYQTLIDIM 148

Query: 595 EKVQKVVTEQGD 606
           ++ +K+   + D
Sbjct: 149 DRARKLAVVEDD 160



 Score = 29.7 bits (67), Expect = 2.6
 Identities = 16/65 (24%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
           D+ L+ +++ +K   E L K  EEL+KE      L+ +  ++ ++Y  L+   ++ +K+ 
Sbjct: 99  DQALQKENERLKNQNESLQKRNEELEKEL---EKLRQRLSTIEEDYQTLIDIMDRARKLA 155

Query: 534 TEQGD 538
             + D
Sbjct: 156 VVEDD 160


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 31.0 bits (70), Expect = 2.3
 Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 9/116 (7%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQAD------SVGKEYDRLLKEHEKVQKVVTEQ 536
           E ++ K K  K EE++KK         S            KE+D   +  E ++    E 
Sbjct: 386 EPRKGKSKKEKYEEKVKK-VVKKQDYLSIQFINDLLEKYIKEFDSKDRLFENIRDYFAEI 444

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
              +++ +Q  D+  I  +K      E  L+KE+ +    +     +    D L  
Sbjct: 445 SLMRNEDFQTNDN--IANIKTAYQTFERILQKERKDLKQDEEVVKLIKSLLDALKN 498


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 28.0 bits (63), Expect = 2.4
 Identities = 12/63 (19%), Positives = 30/63 (47%), Gaps = 4/63 (6%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
              E  ++ LE    +I++  E ++  + E+++ K  +  L  + + + +E  +L +E  
Sbjct: 1   MSLELLEE-LES---KIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERN 56

Query: 528 KVQ 530
             Q
Sbjct: 57  AWQ 59



 Score = 27.3 bits (61), Expect = 5.9
 Identities = 15/64 (23%), Positives = 30/64 (46%), Gaps = 8/64 (12%)

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
           E  ++ + K Q+   + I  L+ ++    EELK+E      L  + + + +E  +L +E 
Sbjct: 4   ELLEELESKIQQA-VETIALLQMEI----EELKEENE---QLSEENEELEEENKKLKEER 55

Query: 595 EKVQ 598
              Q
Sbjct: 56  NAWQ 59


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 30.7 bits (70), Expect = 2.4
 Identities = 23/114 (20%), Positives = 44/114 (38%), Gaps = 15/114 (13%)

Query: 469 IFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
           IF+          +   RL   L  TEEE++          S+  +   +      E ++
Sbjct: 377 IFKHNP-------ELAARLAAFLELTEEEIEF------LTGSKKATKKIKKIVEKAEKKR 423

Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQA 580
            ++   ++      K ++ + +E K  KE+  + EEE  +E  +      K QA
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477


>gnl|CDD|225121 COG2211, MelB, Na+/melibiose symporter and related transporters
           [Carbohydrate transport and metabolism].
          Length = 467

 Score = 30.7 bits (70), Expect = 2.5
 Identities = 32/167 (19%), Positives = 57/167 (34%), Gaps = 15/167 (8%)

Query: 17  AFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILSLFFLDAIREMRKYASP 76
             +L+ +L     +      K+   QG         G I VIL LF    ++E       
Sbjct: 161 GGLLVAVLFPPLVKLFGGGDKALGYQGT----ALVLGVIGVILLLFCFFNVKER--VVET 214

Query: 77  EVKEEAHGHLDAEMQNNMKLFRAQRNFYI-SGFSLFLWL--VIRQIIQLIAQQANL-LAQ 132
           +  +        +++++  L    R   +    +L L++   IR  I +      L   +
Sbjct: 215 QPTKTGVKL---KLKDSFLLIFKNRPLLLLLLMNLLLFIAFNIRGSIMVYYVTYVLGDPE 271

Query: 133 NEASM-NQARQAAVAAQALLDGPRGKWFGSHVWAYLTPNLLVTRSYS 178
             A +   A  A +    +L     K FG     +L   LL+   Y 
Sbjct: 272 LFAYLLLLASGAGLLIGLILWPRLVKKFGKK-KLFLIGLLLLAVGYL 317


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.8 bits (70), Expect = 2.6
 Identities = 21/131 (16%), Positives = 53/131 (40%), Gaps = 7/131 (5%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV--QKV 532
           + L+ D +++  L+ +L     EL  ++   A  + +         + L+  E    +  
Sbjct: 249 EKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEAR 308

Query: 533 VTEQGDKKDDKYQ-KGDSDEIKRLKEKLSKTE----EELKKEKTNSAALKNQADSVGKEY 587
                +      +   D  E++ L+++    E    E+L+ +     +++++ + V    
Sbjct: 309 DELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARL 368

Query: 588 DRLLKEHEKVQ 598
           D L  +H+ VQ
Sbjct: 369 DALTGKHQDVQ 379



 Score = 28.9 bits (65), Expect = 9.9
 Identities = 20/110 (18%), Positives = 47/110 (42%), Gaps = 3/110 (2%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
            +RL E+  + +E   + +     L+ Q      E ++ L            + +  +D+
Sbjct: 276 EERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQ 335

Query: 544 YQKGDSDEIKRLK---EKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
               +  +I++L+   ++L     EL++ +    AL  +   V ++Y+RL
Sbjct: 336 KGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERL 385


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 30.7 bits (70), Expect = 2.7
 Identities = 38/159 (23%), Positives = 69/159 (43%), Gaps = 28/159 (17%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------------- 513
           E+ +  ++ LE    EIK L+E++S+ E E+K+ +     L+   +              
Sbjct: 86  ELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYV 145

Query: 514 -----SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-----QKGDSDEIKRLKEKLSKTE 563
                +V ++    LK  E   + V      K   Y      K  SDE++   +KL    
Sbjct: 146 SVFVGTVPEDKLEELKL-ESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFER 204

Query: 564 EELKKEKTNSAAL---KNQADSVGKEYDRLLKEHEKVQK 599
            EL++E T S  +   K + + + KE + LL+E +++ K
Sbjct: 205 LELEEEGTPSELIREIKEELEEIEKERESLLEELKELAK 243


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 30.3 bits (68), Expect = 2.7
 Identities = 32/125 (25%), Positives = 54/125 (43%), Gaps = 13/125 (10%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E++K   + E+  +  E    E  ++ E+  K E ELKK  T   A     D+   E + 
Sbjct: 56  EAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDD-ENEE 114

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSK----TEEE-LKKEKTNSAAL 576
              E  K++    E    K D+ +    +E++R K ++ K    TEEE   + + N    
Sbjct: 115 EEYEAWKLR----ELKRIKRDREE---REEMEREKAEIEKMRNMTEEERRAELRKNPKVS 167

Query: 577 KNQAD 581
            N+A 
Sbjct: 168 TNKAK 172


>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element. 
          Length = 370

 Score = 30.4 bits (68), Expect = 2.8
 Identities = 24/87 (27%), Positives = 38/87 (43%), Gaps = 3/87 (3%)

Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQK 546
            K  L   EE  KK+  +S  L+   ++ GKE    LKE E+      E   +  +K  K
Sbjct: 41  EKSYLIMMEEGFKKDNYSS--LREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLK 98

Query: 547 GDSDEIKRLKEKLSKTEEELKKEKTNS 573
              +  K+ +E L +    +KKE   +
Sbjct: 99  ELMELKKKARE-LKEECRSIKKEHMET 124


>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
           [Transcription].
          Length = 217

 Score = 30.1 bits (68), Expect = 2.8
 Identities = 23/111 (20%), Positives = 58/111 (52%), Gaps = 5/111 (4%)

Query: 495 EEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKR 554
           E++L+ E++  + L    + + +   RLL+ + +++++   Q +      QK  ++ + R
Sbjct: 100 EDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTE-----LQKQLNELMDR 154

Query: 555 LKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQG 605
           +KE++   + ++ +       LKN A  V +   R + + E+V +++ E+G
Sbjct: 155 IKEEIEDKDGDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVLEILAEKG 205


>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1025

 Score = 30.5 bits (69), Expect = 2.9
 Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 474 DKGLEGDSDEIK-RLKEKLSKTEE---ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
           D  L     E+K R++  L   E+   +L+K+  N      +A+S+ +E D L KE+E+ 
Sbjct: 39  DDALREKGMELKSRVRGALEPIEQKKKDLEKKLDNPDISLEEAESINEELDTLAKEYEEA 98

Query: 530 QKVVTEQ 536
                E+
Sbjct: 99  TAAALEE 105



 Score = 29.7 bits (67), Expect = 5.8
 Identities = 31/125 (24%), Positives = 53/125 (42%), Gaps = 34/125 (27%)

Query: 489 EKL--SKTEEELKK-----EKTNS--AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           EK+  SK E+++KK     ++ N   A+L S +D   +E    LK   +V+  +     K
Sbjct: 6   EKIFGSKHEKDIKKIQPIIDRINEIQASLASLSDDALREKGMELKS--RVRGALEPIEQK 63

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
           K D  +K D+ +I                          +A+S+ +E D L KE+E+   
Sbjct: 64  KKDLEKKLDNPDI-----------------------SLEEAESINEELDTLAKEYEEATA 100

Query: 600 VVTEQ 604
              E+
Sbjct: 101 AALEE 105


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 25/96 (26%), Positives = 38/96 (39%), Gaps = 16/96 (16%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
           KR +E  +K + E  +EK   A  K + +S      +L        K + E  D  DD  
Sbjct: 44  KRQEEAEAKRKREELREKIAKAREKRERNS------KLGG-----IKTLGEDDDDDDD-- 90

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
                  +K+ K++  K E E KK        K +A
Sbjct: 91  ---TKAWLKKSKKRQKKKEAERKKALLLDEKEKERA 123


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.4 bits (69), Expect = 3.1
 Identities = 26/113 (23%), Positives = 46/113 (40%), Gaps = 16/113 (14%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E+K+L+ KL K E+EL++ +  +  L+ + + +      +    E+  K V        D
Sbjct: 294 ELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLI----EEGLKSVR-----LAD 344

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
            Y    ++EIK   +K     E  ++       LK          DR L E +
Sbjct: 345 FYG---NEEIKIELDKSKTPSENAQRYFKKYKKLKGAK----VNLDRQLSELK 390


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 16/90 (17%)

Query: 482 DEIKRLKEKLSKTEE--ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
           +E K  KE+  + EE   +++E+              +E ++ LKE ++  K++ E+ D 
Sbjct: 40  EEAKE-KEREERIEERKRIREERK-------------QELEKQLKERKEALKLLEEENDD 85

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
           ++D   +   D      E   +      +E
Sbjct: 86  EEDAETEDTEDVEDDEWEGFPEPTVTDYEE 115


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 29.8 bits (67), Expect = 3.3
 Identities = 30/121 (24%), Positives = 53/121 (43%), Gaps = 15/121 (12%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
           K K  K E+E   E     A +  +DS   E +    ++E  +   T      +D     
Sbjct: 66  KTKKKKKEKEEVPEL----AAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDVI--- 118

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKN-----QADSVGKEYDRLLKEHEKVQKVVT 602
           D  ++K   E + K   +LKK  +  +A+       QA S+ ++Y  L+KE +K++ +V 
Sbjct: 119 DLSDLKLATENIIK---DLKKINSRVSAVSTVLEDVQAASISRQYTSLIKEIDKLKDLVN 175

Query: 603 E 603
            
Sbjct: 176 L 176


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 30.1 bits (68), Expect = 3.4
 Identities = 11/45 (24%), Positives = 29/45 (64%), Gaps = 3/45 (6%)

Query: 554 RLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           R +++ ++ EEEL+K+    ++L+ + +S+  +  +L   +EK++
Sbjct: 90  RFRKRNTELEEELRKQNQTISSLRRELESLKADNIKL---YEKIR 131



 Score = 29.3 bits (66), Expect = 4.7
 Identities = 11/45 (24%), Positives = 29/45 (64%), Gaps = 3/45 (6%)

Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           R +++ ++ EEEL+K+    ++L+ + +S+  +  +L   +EK++
Sbjct: 90  RFRKRNTELEEELRKQNQTISSLRRELESLKADNIKL---YEKIR 131


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 28.4 bits (64), Expect = 3.4
 Identities = 29/106 (27%), Positives = 43/106 (40%), Gaps = 26/106 (24%)

Query: 497 ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-----QKGDSD- 550
           E   ++ +S ALK  A+    E        +K  ++  E+  K D KY      K   D 
Sbjct: 9   EKAFKELSSLALKKVAELTILE--------QKKLRLTAEK-AKADQKYFAAMRSKDALDA 59

Query: 551 EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
           E K+L  +L+K+ E           L  Q   +  E  R +KE EK
Sbjct: 60  ENKKLNTQLNKSSE-----------LIEQLKDLENELRRRIKELEK 94


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 29.7 bits (67), Expect = 3.5
 Identities = 21/144 (14%), Positives = 57/144 (39%), Gaps = 27/144 (18%)

Query: 479 GDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQA-----------DSVGKEYDRLLKEHE 527
            D + ++ L +K    E EL   +    AL               + + +  + L +  E
Sbjct: 30  DDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWE 89

Query: 528 KVQKVVTEQGDKKDDKYQK-GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
           +++++  E+  + ++         +   L++ L + E  L             ++ +GK+
Sbjct: 90  ELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAAL------------ASEDLGKD 137

Query: 587 YD---RLLKEHEKVQKVVTEQGDK 607
            +    LLK+H+++++ +     +
Sbjct: 138 LESVEELLKKHKELEEELEAHEPR 161


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 30.3 bits (69), Expect = 3.5
 Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 19/101 (18%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSV---------GKEYDRLL----KEHEKV 529
           E+++L++KL K E+EL++ + N+   +   + +         G +   L      E   +
Sbjct: 304 ELEKLEKKLEKLEKELEEAE-NAENYRLYGELLTANLYAIKKGMKEVELPNYYTGEEITI 362

Query: 530 ----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
                K  +E   K   KY+K     ++ ++E++ +T+EE+
Sbjct: 363 PLDPAKSPSENAQKYYKKYKKLKR-AVEAVEEQIEETKEEI 402


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.2 bits (68), Expect = 3.5
 Identities = 36/171 (21%), Positives = 64/171 (37%), Gaps = 40/171 (23%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKT--NSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           LE    +++ L  KLS  EE  K +K       L+ + +++ KE +RLL E + +++   
Sbjct: 365 LETYKRQVQELHAKLS--EESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNE 422

Query: 535 E---------QGDKKDDKYQKGD---SDEI--KRLKEKLSKTEEELK--------KEKTN 572
           E         Q  + D          + E+     +EKL + + E K         E   
Sbjct: 423 ELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSENER 482

Query: 573 SAALKNQADSVGKEYDRLLKEH--------------EKVQKVVTEQGDKKD 609
              L+   +   +  + L  +               E +QK + EQG K +
Sbjct: 483 ITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSE 533


>gnl|CDD|215019 smart01090, Copper-fist, Copper fist is an N-terminal domain
           involved in copper-dependent DNA binding.  The domain is
           named for its resemblance to a fist. It can be found in
           some fungal transcription factors. These proteins
           activate the transcription of the metallothionein gene
           in response to copper. Metallothionein maintains copper
           levels in yeast. The copper fist domain is similar in
           structure to metallothionein itself, and on copper
           binding undergoes a large conformational change, which
           allows DNA binding.
          Length = 38

 Score = 26.9 bits (60), Expect = 3.6
 Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 10/34 (29%)

Query: 295 CDWCVDGHR---CTHDTAENCRNDILITGVSRVG 325
           C+ C+ GHR   CTH       ND  +  V + G
Sbjct: 9   CEKCIRGHRSSTCTH-------NDRPLVKVRKRG 35


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.0 bits (68), Expect = 3.8
 Identities = 16/119 (13%), Positives = 33/119 (27%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
            +   K    +E    +K        + D +    D  +           +  D  DD  
Sbjct: 88  AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI 147

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
              D DE     +     ++E +++K      K   D      +   +   + +K    
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206


>gnl|CDD|221120 pfam11488, Lge1, Transcriptional regulatory protein LGE1.  This
           family of proteins is conserved from fungi to human. In
           yeasts it is involved in the ubiquitination of histones
           H2A and H2B. This ubiquitination step is a vital one in
           the regulation of the transcriptional activity of RNA
           polymerase II. In S. cerevisiae, Rad6 and Bre1 are
           present in a complex, also containing Lge1, that is
           required for H2B ubiquitination. Bre1 is the H2B
           ubiquitin ligase that interacts with acidic activators,
           such as Gal4, and recruits Rad6 and its binding partner
           Lge1 to target promoters. In S. pombe the equivalent
           protein to Lge1 appears to be Shf1.
          Length = 75

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 10/39 (25%), Positives = 14/39 (35%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK 500
                 E      K LE    +++ LK +  K E EL  
Sbjct: 12  SDETTQEKLRSRYKELESLDQKLEELKLQKLKLELELAL 50


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 29.4 bits (66), Expect = 3.9
 Identities = 17/138 (12%), Positives = 53/138 (38%), Gaps = 15/138 (10%)

Query: 467 CEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALK-----SQADSVGKEYDR 521
            E+FEQ  +  +  +  +K   + L   + E +   ++S +       +    + ++  +
Sbjct: 20  SELFEQYRQLAD-RTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAK 78

Query: 522 LLKEHEKVQK---------VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
           L +E  ++ K         ++     ++  +  +     I  L+ +++  E E++  +  
Sbjct: 79  LREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREE 138

Query: 573 SAALKNQADSVGKEYDRL 590
               +   +++  E   L
Sbjct: 139 LQEKEKDNETLQDELISL 156


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 29.8 bits (67), Expect = 3.9
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 21/88 (23%)

Query: 491 LSKTEEELKKEKTNSAALKSQADSV------GKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
           +S  EEEL+K ++    LK + +        GK   RL K +EK +  + +Q        
Sbjct: 7   VSAWEEELRKAQS-LEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQ-------- 57

Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTN 572
                 + ++LK +LSK +++ KKEK +
Sbjct: 58  ------QFEQLKAELSKKKKKFKKEKVD 79


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 29.5 bits (67), Expect = 3.9
 Identities = 16/73 (21%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQ-----ADSVGKEYDRL 522
           E  E+  K L+   + + +L+++L K + E  K +  +  L+ Q       ++  +YD+ 
Sbjct: 91  EELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGGLLHVPALLYDYDKT 150

Query: 523 LKEHEKVQKVVTE 535
           +++ E+ +K V +
Sbjct: 151 VEKVEEKRKSVEK 163


>gnl|CDD|197563 smart00187, INB, Integrin beta subunits (N-terminal portion of
           extracellular region).  Portion of beta integrins that
           lies N-terminal to their EGF-like repeats. Integrins are
           cell adhesion molecules that mediate cell-extracellular
           matrix and cell-cell interactions. They contain both
           alpha and beta subunits. Beta integrins are proposed to
           have a von Willebrand factor type-A "insert" or "I"
           -like domain (although this remains to be confirmed).
          Length = 423

 Score = 30.0 bits (68), Expect = 3.9
 Identities = 13/35 (37%), Positives = 15/35 (42%), Gaps = 9/35 (25%)

Query: 284 SCTQCVSSDFPCDWCVD-------GHRCTHDTAEN 311
           SC +C+ S   C WC D         RC  DT  N
Sbjct: 1   SCEECIQSGPNCAWCTDENFTSGGSARC--DTRAN 33


>gnl|CDD|147381 pfam05167, DUF711, Uncharacterized ACR (DUF711).  The proteins in
           this family are functionally uncharacterized. The
           proteins are around 450 amino acids long. It is likely
           that this family represents a group of
           glycerol-3-phosphate dehydrogenases.
          Length = 390

 Score = 29.9 bits (68), Expect = 4.2
 Identities = 19/101 (18%), Positives = 33/101 (32%), Gaps = 17/101 (16%)

Query: 478 EGDSDEIKRLKEKLSKTEEELKKEK---------------TNSAALKSQADSVGKEYDRL 522
           E   D+I    E L +  EE++ E                  + A      S+ K  D+ 
Sbjct: 20  EKIYDKITSKAENLVEVAEEIEAEYGIPIVTKRISVTPISLVAEASDDDYVSLAKALDKA 79

Query: 523 LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
            K+       +          + +GD   I  + + L+ TE
Sbjct: 80  AKDVG--VDFIGGFSALVHKGFTRGDRALIDSIPDALASTE 118


>gnl|CDD|226684 COG4231, COG4231, Indolepyruvate ferredoxin oxidoreductase, alpha
           and beta subunits [Energy production and conversion].
          Length = 640

 Score = 30.0 bits (68), Expect = 4.3
 Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 119 IIQLIAQQANLLAQ--NEASMNQARQAAVAAQALLDGPRGKWFGSHV 163
           +I+ +A+   +L     E S+N+      AA A   G R      HV
Sbjct: 45  LIETLAKAKKILGDVYFEWSLNEKVALETAAGASYAGVRALVTMKHV 91


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 28.9 bits (65), Expect = 4.4
 Identities = 24/105 (22%), Positives = 42/105 (40%), Gaps = 8/105 (7%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
           +E  + K++     E LKK K     L     +   E   +    +  +K + E  D   
Sbjct: 61  EEALKYKKEFESAVESLKK-KLKFEELDDLKITAENEIKSIQDLLQDYKKSLEE--DTIP 117

Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
           +   +  S+E K  +E  S ++E  K+ K  +   KN+     KE
Sbjct: 118 NHLNEEVSNETKLTQE--SSSDESPKEVKLAT---KNKTKKHDKE 157


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 29.1 bits (66), Expect = 4.4
 Identities = 13/81 (16%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
             ++L E    +    +      +K  K    E+++ +++L K E++L+K+   +   + 
Sbjct: 23  VQKVLSESPAGKAAQKQL-----EKEFKKLQAELQKKEKELQKEEQKLQKQA--ATLSEE 75

Query: 579 QADSVGKEYDRLLKEHEKVQK 599
              +  +E  +  +E ++ Q+
Sbjct: 76  ARKAKQQELQQKQQELQQKQQ 96



 Score = 28.0 bits (63), Expect = 9.4
 Identities = 22/109 (20%), Positives = 53/109 (48%), Gaps = 9/109 (8%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           ES       +Q +K  +    E+++ +++L K E++L+K+    AA  S+          
Sbjct: 29  ESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQ----AATLSEE-----ARKA 79

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
             +E ++ Q+ + ++      + Q+   + ++ + +K+ K  +E+ KEK
Sbjct: 80  KQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKEVAKEK 128


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 28.3 bits (64), Expect = 4.6
 Identities = 23/116 (19%), Positives = 49/116 (42%), Gaps = 24/116 (20%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
           IK ++E     +E+L+K   +   +           D LL+  E+ +++  E        
Sbjct: 4   IKLIRENPEAVKEKLRKRGVDVLDV-----------DELLELDEERRELQVE-------- 44

Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
                 +E++  + +LSK   + KK+K ++ AL  +   +  E   L  E  +++ 
Sbjct: 45  -----LEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEA 95


>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3.  Aip3p/Bud6p is a
           regulator of cell and cytoskeletal polarity in
           Saccharomyces cerevisiae that was previously identified
           as an actin-interacting protein. Actin-interacting
           protein 3 (Aip3p) localizes at the cell cortex where
           cytoskeleton assembly must be achieved to execute
           polarized cell growth, and deletion of AIP3 causes gross
           defects in cell and cytoskeletal polarity. Aip3p
           localization is mediated by the secretory pathway,
           mutations in early- or late-acting components of the
           secretory apparatus lead to Aip3p mislocalization.
          Length = 426

 Score = 29.6 bits (67), Expect = 4.6
 Identities = 22/144 (15%), Positives = 59/144 (40%), Gaps = 12/144 (8%)

Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN-SAALKSQADSVGKEYDRLLKEH 526
              ++  + LE   + I+RLKE+   +   + +   + S  L S + ++    +      
Sbjct: 92  NELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLASSSSAISLANNPDKLNK 151

Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA------ 580
           E+  ++ + Q +    +     +     +KE +    E++ K K++S +    +      
Sbjct: 152 EQRAELKSLQRELAVLRQTH--NSFFTEIKESIKDILEKIDKFKSSSLSASGSSNRAYVE 209

Query: 581 ---DSVGKEYDRLLKEHEKVQKVV 601
                + ++ D LL + + +Q ++
Sbjct: 210 SSKKKLSEDSDSLLTKVDDLQDII 233


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 28.9 bits (65), Expect = 4.6
 Identities = 19/119 (15%), Positives = 56/119 (47%), Gaps = 7/119 (5%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           ES +  ++    +   +    E+++++++L    +  + +  +   +++ +D   K    
Sbjct: 38  ESPQAKKVSADLESEFKKRQKELQKMQKEL----KAKEAKLQDDGKMEALSD-RAKAEAE 92

Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
           + KE   V     +Q + + D  ++ +++E ++L EK+ +  E + ++   S  L + A
Sbjct: 93  IKKEK-LVNAFNKKQQEYEKDLNRR-EAEEEQKLLEKIQRAIESVAEKGGYSLVLDSNA 149


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 29.4 bits (66), Expect = 4.6
 Identities = 20/95 (21%), Positives = 45/95 (47%), Gaps = 1/95 (1%)

Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKS-QADSVGKEYDRLLKEHEKVQKVV 533
           K ++    EI  + E     +E LK +K     L     + + +   +  ++ E+++++ 
Sbjct: 104 KKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELE 163

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
            E  + +D+K       E K  K++L + +EEL++
Sbjct: 164 KELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198



 Score = 29.1 bits (65), Expect = 6.1
 Identities = 18/87 (20%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
            + +K  KE L       ++        + +     +E ++ L+E E  + ++ +  ++K
Sbjct: 124 DELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEK 183

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELK 567
           + K       E++ LKE+L +  EEL+
Sbjct: 184 EKK------KELEELKEELEELLEELE 204


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 28.4 bits (64), Expect = 4.7
 Identities = 12/34 (35%), Positives = 16/34 (47%)

Query: 116 IRQIIQLIAQQANLLAQNEASMNQARQAAVAAQA 149
           I  + + IAQQ   L Q E  + QAR+    A  
Sbjct: 54  ISALDEAIAQQQQELEQAEKQVEQAREQWQEANQ 87


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 29.3 bits (66), Expect = 4.7
 Identities = 28/141 (19%), Positives = 46/141 (32%), Gaps = 36/141 (25%)

Query: 474 DKGLEGDSDEIKR-LKEKLSKTEEELKKEKTNSAAL----KSQADSVGKEYDRLLKEHEK 528
             G   + +  KR L+ +L + E   K +K N  A+    KS       E D    E ++
Sbjct: 45  QPGQGTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDE 104

Query: 529 VQKVVTEQG-------DKKDDKYQKGDSDE-------------------IK--RLKEKLS 560
           +++   E+        D           D+                   IK  R +EK  
Sbjct: 105 IKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164

Query: 561 KTEE---ELKKEKTNSAALKN 578
           + EE   E +K +       N
Sbjct: 165 EEEEKAAEEEKAREEEILTGN 185


>gnl|CDD|214017 cd12924, iSH2_PIK3R1, Inter-Src homology 2 (iSH2) helical domain of
           Class IA Phosphoinositide 3-kinase Regulatory subunit 1,
           PIK3R1, also called p85alpha.  PI3Ks catalyze the
           transfer of the gamma-phosphoryl group from ATP to the
           3-hydroxyl of the inositol ring of
           D-myo-phosphatidylinositol (PtdIns) or its derivatives.
           They play an important role in a variety of fundamental
           cellular processes, including cell motility, the Ras
           pathway, vesicle trafficking and secretion, immune cell
           activation and apoptosis. They are classified according
           to their substrate specificity, regulation, and domain
           structure. Class IA PI3Ks are heterodimers of a p110
           catalytic (C) subunit and a p85-related regulatory (R)
           subunit. The R subunit down-regulates PI3K basal
           activity, stabilizes the C subunit, and plays a role in
           the activation downstream of tyrosine kinases. All R
           subunits contain two SH2 domains that flank an
           intervening helical domain (iSH2), which binds to the
           N-terminal adaptor-binding domain (ABD) of the catalytic
           subunit. In addition, p85alpha, also called PIK3R1,
           contains N-terminal SH3 and GAP domains. p85alpha carry
           functions independent of its PI3K regulatory role. It
           can independently stimulate signaling pathways involved
           in cytoskeletal rearrangements. Insulin-sensitive
           tissues express splice variants of the PIK3R1 gene,
           p50alpha and p55alpha, which may play important roles in
           insulin signaling during lipid and glucose metabolism.
           Mice deficient with PIK3R1 die perinatally, indicating
           its importance in development.
          Length = 161

 Score = 28.9 bits (64), Expect = 4.7
 Identities = 30/143 (20%), Positives = 70/143 (48%), Gaps = 24/143 (16%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
            +  S E  RL E+ ++T +E++ ++T   A+++  +++ K ++   +  E+  K   E+
Sbjct: 17  FQEKSREYDRLYEEYTRTSQEIQMKRT---AIEAFNETI-KIFEEQCQTQERYSKEYIEK 72

Query: 537 GDKKDDKYQKGDSDEIKR-------LKEKLS-------KTEEELKKEKTNSAALKNQADS 582
             ++      G+  EI+R       LK ++S       + EE+LKK+      +  + +S
Sbjct: 73  FKRE------GNEKEIQRIMHNYEKLKSRISEIVDSRRRLEEDLKKQAAEYREIDKRMNS 126

Query: 583 VGKEYDRLLKEHEKVQKVVTEQG 605
           +  +  +L K  ++    +T++G
Sbjct: 127 IKPDLIQLRKTRDQYLMWLTQKG 149


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 28.4 bits (64), Expect = 4.9
 Identities = 20/97 (20%), Positives = 41/97 (42%), Gaps = 15/97 (15%)

Query: 492 SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDE 551
              EE+   +K    A+  +     K+ +    E EK+ K+  E+  +           E
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY----------E 50

Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
           +++L+++L + E EL + +     LK +A  +  E  
Sbjct: 51  LEKLEKELEELEAELARRE-----LKAEAKKMLSEKG 82


>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor.  the 380
           kDa motor unit of dynein belongs to the AAA class of
           chaperone-like ATPases. The core of the 380 kDa motor
           unit contains a concatenated chain of six AAA modules,
           of which four correspond to the ATP binding sites with
           P-loop signatures described previously, and two are
           modules in which the P loop has been lost in evolution.
           This family is the region between D4 and D5 and is the
           two predicted alpha-helical coiled coil segments that
           form the stalk supporting the ATP-sensitive microtubule
           binding component.
          Length = 344

 Score = 29.6 bits (66), Expect = 5.0
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           L+  + ++  LK KL+  E ELK++  ++  L      VG E +++ KE     K + +Q
Sbjct: 10  LQSTASQVDDLKAKLAIQEVELKQKNEDADKL---IQVVGVETEKVSKE-----KAIADQ 61

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
            + K     K  +++ K  +E L+K E  L
Sbjct: 62  EEVKVAVINKEVTEKQKDCEEDLAKAEPAL 91


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 28.7 bits (65), Expect = 5.2
 Identities = 26/141 (18%), Positives = 55/141 (39%), Gaps = 32/141 (22%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK-VVTEQGDKKDDKYQK 546
           KE+  + E+E +  +     L+ + + +     RLL E E  +K    E+ + K    +K
Sbjct: 3   KEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEK 62

Query: 547 -------------------GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA----DSV 583
                               +++++K L E +    ++L K       L+       +  
Sbjct: 63  FAKDLLPVLDNLERALAAAKENEDVKSLVEGVEMILKQLLK------VLEKHGVEKIEPE 116

Query: 584 GKEYDRLLKEHEKVQKVVTEQ 604
           G+ +D     HE V +V +++
Sbjct: 117 GEPFDP--NLHEAVSQVPSDE 135


>gnl|CDD|214019 cd12926, iSH2_PIK3R2, Inter-Src homology 2 (iSH2) helical domain of
           Class IA Phosphoinositide 3-kinase Regulatory subunit 2,
           PIK3R2, also called p85beta.  PI3Ks catalyze the
           transfer of the gamma-phosphoryl group from ATP to the
           3-hydroxyl of the inositol ring of
           D-myo-phosphatidylinositol (PtdIns) or its derivatives.
           They play an important role in a variety of fundamental
           cellular processes, including cell motility, the Ras
           pathway, vesicle trafficking and secretion, immune cell
           activation, and apoptosis. They are classified according
           to their substrate specificity, regulation, and domain
           structure. Class IA PI3Ks are heterodimers of a p110
           catalytic (C) subunit and a p85-related regulatory (R)
           subunit. The R subunit down-regulates PI3K basal
           activity, stabilizes the C subunit, and plays a role in
           the activation downstream of tyrosine kinases. All R
           subunits contain two SH2 domains that flank an
           intervening helical domain (iSH2), which binds to the
           N-terminal adaptor-binding domain (ABD) of the catalytic
           subunit. p85beta, also called PIK3R2, contains
           N-terminal SH3 and GAP domains. It is expressed
           ubiquitously but at lower levels than p85alpha. Its
           expression is increased in breast and colon cancer,
           correlates with tumor progression, and enhanced
           invasion. During viral infection, the viral
           nonstructural (NS1) protein binds p85beta specifically,
           which leads to PI3K activation and the promotion of
           viral replication. Mice deficient with PIK3R2 develop
           normally and exhibit moderate metabolic and
           immunological defects.
          Length = 161

 Score = 28.9 bits (64), Expect = 5.3
 Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 9/92 (9%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK- 540
           +E  ++ E+  +T+E+  KE       + + +   KE  R+L   E+++  + E  + + 
Sbjct: 49  NETIKIFEEQGQTQEKCSKEYLE----RFRREGNEKEMQRILLNSERLKSRIAEIHESRT 104

Query: 541 ----DDKYQKGDSDEIKRLKEKLSKTEEELKK 568
               D + Q  D+ EI +    L     +L+K
Sbjct: 105 KLEQDLRAQASDNREIDKRMNSLKPDLMQLRK 136


>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
          Length = 619

 Score = 29.6 bits (65), Expect = 5.7
 Identities = 13/43 (30%), Positives = 26/43 (60%)

Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
           HE+ ++ +     ++D+ ++KG S  +K++KEK  K +  L K
Sbjct: 11  HEQAEEPIRIHHPEEDEHHEKGASKVLKKVKEKAKKIKNSLTK 53


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 29.6 bits (66), Expect = 5.8
 Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 464 SKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKE-KTNSAALKSQADSVGKEYDRL 522
            +R E   Q  K ++  +  ++RL+E++     + KKE     + + ++ D +  EY   
Sbjct: 772 IERIEHRRQLQKRIDAVNARLRRLREEIIGRMSDAKKEDTAALSEVGAELDDIP-EYLAR 830

Query: 523 LKE 525
           L+ 
Sbjct: 831 LQT 833


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 29.6 bits (66), Expect = 5.8
 Identities = 24/101 (23%), Positives = 49/101 (48%), Gaps = 6/101 (5%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           E +RL E   + E EL++ + + A+  +Q ++         +E   +++ +T+Q +    
Sbjct: 515 ERQRLAEVAQQLERELQESQESLASAGAQLEAARAGQQESTEEAASLRQELTQQQEVYGQ 574

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
             Q+  ++   RL+E+LS TE  L      + A +  A +V
Sbjct: 575 ALQEKVAEVETRLREQLSDTERRL------NEARREHAKAV 609


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 19/112 (16%)

Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
             K    EE+  KE  N      + +  GKE D   +E EK + +  E  ++ ++   + 
Sbjct: 7   DAKHENMEEDCCKENEN------KEEDKGKEEDLEFEEIEKEEII--EDSEESNEVKIEE 58

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD----RLLKEHE 595
             DE  +LKE+  K E EL+       ALK++      EYD    R  KE E
Sbjct: 59  LKDENNKLKEENKKLENELE-------ALKDRLLRTVAEYDNYRKRTAKEKE 103


>gnl|CDD|145573 pfam02507, PSI_PsaF, Photosystem I reaction centre subunit III.
           Photosystem I (PSI) is an integral membrane protein
           complex that uses light energy to mediate electron
           transfer from plastocyanin to ferredoxin. Subunit III
           (or PSI-F) is one of at least 14 different subunits that
           compose the PSI complex.
          Length = 159

 Score = 28.6 bits (64), Expect = 6.0
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 485 KRLKEKLSKTEEELKKEKTNSA---ALKSQADSVGKEYDR 521
           KRLK  + K E  LKK   NSA   AL++Q +   K ++R
Sbjct: 21  KRLKNSVKKLESRLKKYDPNSAPALALQAQIEKTKKRFER 60



 Score = 28.2 bits (63), Expect = 7.8
 Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 3/40 (7%)

Query: 553 KRLKEKLSKTEEELKKEKTNSA---ALKNQADSVGKEYDR 589
           KRLK  + K E  LKK   NSA   AL+ Q +   K ++R
Sbjct: 21  KRLKNSVKKLESRLKKYDPNSAPALALQAQIEKTKKRFER 60


>gnl|CDD|176511 cd08568, GDPD_TmGDE_like, Glycerophosphodiester phosphodiesterase
           domain of Thermotoga maritime and similar proteins.
           This subfamily corresponds to the glycerophosphodiester
           phosphodiesterase domain (GDPD) present in Thermotoga
           maritime glycerophosphodiester phosphodiesterase (TmGDE,
           EC 3.1.4.46) and its uncharacterized  homologs. Members
           in this family show high sequence similarity to
           Escherichia coli GP-GDE, which catalyzes the degradation
           of glycerophosphodiesters to produce
           sn-glycerol-3-phosphate (G3P) and the corresponding
           alcohols. TmGDE exists as a monomer that might be the
           biologically relevant form.
          Length = 226

 Score = 29.2 bits (66), Expect = 6.0
 Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 10/68 (14%)

Query: 57  VILSLFFLDAIREMRKYASPEVK------EEAHGHLDAEMQNNMKLFRAQ---RNFYISG 107
           VI S F  DA+RE+RK   P+ K      EE  G    E+   +KL+            G
Sbjct: 121 VIFSSFNHDALRELRKL-DPDAKVGLLIGEEEEGFSIPELHEKLKLYSLHVPIDAIGYIG 179

Query: 108 FSLFLWLV 115
           F  F+ L+
Sbjct: 180 FEKFVELL 187


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 29.6 bits (67), Expect = 6.1
 Identities = 14/81 (17%), Positives = 35/81 (43%), Gaps = 3/81 (3%)

Query: 507 ALKSQADSVGKEYDRLLKEHEKVQKV---VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
            L+ +   +  E + L  E+E++QK    + +Q  +      +  + EI++LK +  + +
Sbjct: 70  ELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQ 129

Query: 564 EELKKEKTNSAALKNQADSVG 584
             + + +   A +       G
Sbjct: 130 GLIDQLQRRLAGVLTGPSGGG 150


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 28.0 bits (62), Expect = 6.1
 Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 3/72 (4%)

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
           D K D   +  + ++K  KE + + E         +   +N A     E +   +E E  
Sbjct: 2   DTKVDAAAELSAKDLKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEME---EEEEVD 58

Query: 598 QKVVTEQGDKKD 609
           +    E+G+ ++
Sbjct: 59  EDDEEEEGEGEE 70


>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
           unknown].
          Length = 397

 Score = 29.3 bits (65), Expect = 6.2
 Identities = 19/81 (23%), Positives = 32/81 (39%), Gaps = 7/81 (8%)

Query: 480 DSDEIKRLK-EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
            S   KR   + L  T E +    +     ++Q+  +G   DR LK         +    
Sbjct: 91  MSLLRKRKNDQSLQPTREPMDSRDSGQDFTEAQSGELGDTGDRQLK-----APAASRARR 145

Query: 539 KKDDKYQKGDSDEIKRLKEKL 559
           K+D   Q  D +   RLK+++
Sbjct: 146 KEDLLEQTVD-EISLRLKKRM 165


>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
           and chromosome partitioning].
          Length = 212

 Score = 28.9 bits (65), Expect = 6.2
 Identities = 24/102 (23%), Positives = 41/102 (40%), Gaps = 26/102 (25%)

Query: 513 DSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
           D V  +Y++LL E+E ++                   DEI  LKE+L   E    ++   
Sbjct: 33  DDVIDDYEQLLDENEDLE-------------------DEIDELKEELK--EAADAEDSQA 71

Query: 573 SAALKNQADSV----GKEYDRLLKE-HEKVQKVVTEQGDKKD 609
               + +A+ +      E D +LK    + Q+V  +   K D
Sbjct: 72  IQQAETEAEELKQAAEAEADDILKRASAQAQRVFGKAQYKAD 113


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 27.7 bits (62), Expect = 6.4
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
            +K        S   K  K+K SK +    K+K N+A L ++A      YD+LLKE    
Sbjct: 6   QQKKAAKAAAASAGGKAKKKKWSKGKV---KDKLNNAVLFDKA-----TYDKLLKEVPT- 56

Query: 598 QKVVT 602
            K++T
Sbjct: 57  YKLIT 61


>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
           II-B/NMENI.  Members of this family of CRISPR-associated
           (cas) protein are found, so far, in CRISPR/cas loci in
           Wolinella succinogenes DSM 1740, Legionella pneumophila
           str. Paris, and Francisella tularensis, where the last
           probably is an example of a degenerate CRISPR locus,
           having neither repeats nor a functional Cas1. The
           characteristic repeat length is 37 base pairs and period
           is about 72. One region of this large protein shows
           sequence similarity to pfam01844, HNH endonuclease
           [Mobile and extrachromosomal element functions, Other].
          Length = 802

 Score = 29.6 bits (66), Expect = 6.5
 Identities = 22/96 (22%), Positives = 33/96 (34%), Gaps = 4/96 (4%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV--VTEQG 537
           D+ EI  LKE L      + ++      L+    S   +Y     E  +   V    E+ 
Sbjct: 111 DATEINLLKELLPFKLRSISEDNFIDWFLQKIQSSA--DYLANYSEEFRKILVSKTEEKK 168

Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
           D K+ K    D   IK      +  +E L     N 
Sbjct: 169 DDKELKNAVKDKYNIKEFLTGATINDEILGTFDKNE 204


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 27.6 bits (61), Expect = 6.5
 Identities = 22/85 (25%), Positives = 36/85 (42%), Gaps = 7/85 (8%)

Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQK 546
           L  KL+  E  L+ EK +   L  Q D +      L  +  + ++ + E   K D+  + 
Sbjct: 14  LANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRLRELQAKIDELLKN 73

Query: 547 GDSDEIKRLKEKLSKTEEELKKEKT 571
            +        EKL+K E E +  KT
Sbjct: 74  LE-------LEKLAKMELESRLSKT 91


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 29.2 bits (66), Expect = 6.8
 Identities = 33/122 (27%), Positives = 49/122 (40%), Gaps = 33/122 (27%)

Query: 479 GDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADS--------------VGKEYDRLLK 524
           G+ D ++ LKEKL K  E   KE T     K Q                 V  E D LLK
Sbjct: 257 GEEDTLEELKEKLRKNLERELKEATL-EKRKEQLLDALVEANDFDLPESLVEAEIDNLLK 315

Query: 525 E------HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLK-----------EKLSKTEEELK 567
           +       + +   +   G+ +++  ++   +  KR+K           EKL  TEEE+K
Sbjct: 316 QALQQLQQQGID-SLEASGESEEELREEFKEEAEKRVKLGLLLEEIAKEEKLEVTEEEIK 374

Query: 568 KE 569
            E
Sbjct: 375 AE 376


>gnl|CDD|219445 pfam07517, SecA_DEAD, SecA DEAD-like domain.  SecA protein binds to
           the plasma membrane where it interacts with proOmpA to
           support translocation of proOmpA through the membrane.
           SecA protein achieves this translocation, in association
           with SecY protein, in an ATP dependent manner. This
           domain represents the N-terminal ATP-dependent helicase
           domain, which is related to the pfam00270.
          Length = 381

 Score = 29.0 bits (66), Expect = 6.8
 Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 5/45 (11%)

Query: 550 DEIKRLKEKLSK-TEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
           D+I  L+E+    ++EEL+ +   +  L+ +  + G+  D +L E
Sbjct: 21  DQINALEEEYKALSDEELRAK---TQELRKRLAN-GETLDDILPE 61


>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
          Length = 156

 Score = 28.4 bits (64), Expect = 6.9
 Identities = 23/115 (20%), Positives = 63/115 (54%), Gaps = 11/115 (9%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR-LLKEHEKVQKVVTEQGDKKDDK 543
           K L + L + +E ++K  T ++ + ++A+ + K+Y++ L K  ++ Q  +T    +   +
Sbjct: 45  KPLLKVLDERKEYIRKNLTKASEILAKANELTKQYEQELSKARKEAQLEIT----QSQKE 100

Query: 544 YQKGDSDEIKRLKEK----LSKTEEELKKEK-TNSAALKNQADSVGKE-YDRLLK 592
            ++   +E+K+ ++     L++  ++L+ +K     +L+ Q D++  +  ++LL 
Sbjct: 101 AKEIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKLLI 155


>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
           transferase subunit alpha; Provisional.
          Length = 762

 Score = 29.4 bits (66), Expect = 7.1
 Identities = 31/136 (22%), Positives = 55/136 (40%), Gaps = 23/136 (16%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E S+   + E+  K  +  ++  K+ KE + + E + K E   +    S A S  +  D 
Sbjct: 542 EFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEIKEKMEALKAEVASSGASSGDELDDD 601

Query: 522 LLKEHEKVQK----------------VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
           L ++ EK++K                V+      KD   Q    +    L+EK+    EE
Sbjct: 602 LKEKVEKMKKEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPPPN----LQEKIESLNEE 657

Query: 566 LKKE---KTNSAALKN 578
           + K+      S+ LK+
Sbjct: 658 INKKIERVIRSSDLKS 673


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.1 bits (66), Expect = 7.2
 Identities = 22/125 (17%), Positives = 55/125 (44%), Gaps = 15/125 (12%)

Query: 480 DSDEIKRLKEKL-----SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
           +S+E++ L EKL     +K  +   +EK    A + + +   +  D ++ E E+++ +  
Sbjct: 2   NSEELRELNEKLRAAKVNKERDAQIEEKKRIKAEEKEEE---RRIDEMM-EEERLKALAE 57

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
           E+  ++  K ++      +  +  L +  EE +K +      + Q      E    ++E 
Sbjct: 58  EEERERKRKEER------REGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEE 111

Query: 595 EKVQK 599
           ++ + 
Sbjct: 112 DEAEA 116


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 28.0 bits (63), Expect = 7.3
 Identities = 20/74 (27%), Positives = 36/74 (48%), Gaps = 12/74 (16%)

Query: 506 AALKSQADSVGKEYDRLLKEHEKVQK-----------VVTEQGDKKDDKYQKGDSDEIKR 554
            AL+ + D+  ++YD+  +E ++  K            VT  G K+     +  + E +R
Sbjct: 17  EALQEEVDTYMEKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERR 76

Query: 555 -LKEKLSKTEEELK 567
            LKEK  K ++EL+
Sbjct: 77  KLKEKKKKKKKELE 90


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 29.5 bits (67), Expect = 7.5
 Identities = 24/135 (17%), Positives = 57/135 (42%)

Query: 470  FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
            +E   + L  +SD  ++L+++L + E+E  + +      ++Q     +    L   ++  
Sbjct: 973  YEDAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAK 1032

Query: 530  QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
            ++++ E   +  D     DS   +R + +  +    L   ++    L+ Q      E D 
Sbjct: 1033 RQMLQELKQELQDLGVPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDN 1092

Query: 590  LLKEHEKVQKVVTEQ 604
            L K+  K+++   E 
Sbjct: 1093 LTKKLRKLERDYHEM 1107


>gnl|CDD|240345 PTZ00287, PTZ00287, 6-phosphofructokinase; Provisional.
          Length = 1419

 Score = 29.2 bits (65), Expect = 7.6
 Identities = 27/91 (29%), Positives = 42/91 (46%), Gaps = 6/91 (6%)

Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVT--EQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
            A K  AD + K    LL+++E    +    +Q  K      K DS+ I  L +KL+  +
Sbjct: 10  EAKKPSADGLVKTMSVLLRDNECAFNIEENEDQNSKDQLPCDKKDSEMIDCLMQKLTSKK 69

Query: 564 EELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
             ++KE  NS  L +  +   K+    LKEH
Sbjct: 70  FLVEKETKNSFYLVDNENMKIKK----LKEH 96


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 29.1 bits (65), Expect = 8.0
 Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 17/110 (15%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------SVGKEYDRLLKEHEKVQKVVT 534
             I+RL++ + K + EL++ + N + L++Q         S+  +  +L KE++ +Q  + 
Sbjct: 406 SAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLN 465

Query: 535 EQGDKKDDKYQKGDSDEIKRLK---------EKLSKTEEELKKEKTNSAA 575
                K    Q   S E KRLK         EK    E++ KKE+  +AA
Sbjct: 466 SMVSAKQKDKQSMQSME-KRLKSEADSRVNAEKQLAEEKKRKKEEEETAA 514


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 28.8 bits (65), Expect = 8.1
 Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 28/127 (22%)

Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH------------EKVQKVVT 534
           L +K+ +  + L K K  S    +    +  +   LL+              +  ++++ 
Sbjct: 154 LLKKVERLVKYLPKSKEESLVRSALELDIPDDILNLLRLRYACDLLCSYLPPDLYKELLK 213

Query: 535 EQG---DKKDDKYQKGDSDE-----------IKRLKEKLSKTEEELKKEKTNSAALKN-- 578
                  K  DKY K    +             R ++K    EE  KK+   S  +K   
Sbjct: 214 SLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273

Query: 579 QADSVGK 585
           +  + G 
Sbjct: 274 KVVAKGM 280


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 29.3 bits (66), Expect = 8.1
 Identities = 23/105 (21%), Positives = 43/105 (40%), Gaps = 14/105 (13%)

Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV-------- 533
            +  +    + K +  +K E  N    ++   +  K  + L+K++    K+V        
Sbjct: 469 SDELKFLPIIKKAQ--VKDEILNPVVKRALLQAR-KVVNELVKKYGPPDKIVIEMARENG 525

Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
           T +G K   K QK + D I   KE  S  E++LK+   +  +   
Sbjct: 526 TNKGTKNRKKRQKKNEDNI---KEAASALEQKLKELPLDIKSKNI 567


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
            partitioning [Cell division and chromosome partitioning].
          Length = 1480

 Score = 29.1 bits (65), Expect = 8.3
 Identities = 22/117 (18%), Positives = 53/117 (45%), Gaps = 4/117 (3%)

Query: 487  LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE----HEKVQKVVTEQGDKKDD 542
            L EKL +  E+ + E+T +     Q  +   +Y+++L      ++  ++++ E   +  D
Sbjct: 985  LNEKLRQRLEQAEAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQD 1044

Query: 543  KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
               + DS   +R + +  +   +L   ++    L+ Q      E D L ++  K+++
Sbjct: 1045 IGVRADSGAEERARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMDNLTRKLRKLER 1101


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 28.5 bits (64), Expect = 8.3
 Identities = 29/106 (27%), Positives = 44/106 (41%), Gaps = 28/106 (26%)

Query: 467 CEIF-EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
            E F   CD+       +I + K++L  T+EE  K   +S A   +   + +E  +LL E
Sbjct: 88  LERFVHDCDR-------KIDKAKQRLELTQEEQTKIAADSKAE--ELAELDEEIGKLLAE 138

Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
            E + +              +G  DE      KL K  EELK +K 
Sbjct: 139 AEALGE--------------EGKVDE----AMKLMKEVEELKAKKK 166


>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
           Provisional.
          Length = 593

 Score = 29.2 bits (65), Expect = 8.3
 Identities = 22/125 (17%), Positives = 40/125 (32%), Gaps = 7/125 (5%)

Query: 491 LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSD 550
           LS+ E  L   +    + K     V KE+   L E ++   +      K D      D+ 
Sbjct: 101 LSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA 160

Query: 551 EIK------RLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
           E K      +L+  L   +    + +        +A    +  D+      K       +
Sbjct: 161 EKKLTQAQNKLQS-LDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAK 219

Query: 605 GDKKD 609
            +K D
Sbjct: 220 AEKAD 224


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 28.9 bits (65), Expect = 8.3
 Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 12/115 (10%)

Query: 490 KLSKTEEELKKEKTNSAALKSQADSVGKEYDRL--LKEHEKVQKVVTEQGDKKDDKYQKG 547
           KL +   +L KE      LK++  SV  + ++L  L +  K      E+   K ++  K 
Sbjct: 5   KLLRNNPDLVKES-----LKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQ 59

Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE-----HEKV 597
                 + K+K+ + ++ELK+ K     L     ++  E    L       HE V
Sbjct: 60  IGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESV 114


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 29.1 bits (66), Expect = 8.5
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 6/97 (6%)

Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
              E++ L +++ + E+ELK+ K     +  +     KE     K+  K +K    +G K
Sbjct: 367 YDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKE-----KKKNKKKKYKVPRG-K 420

Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
                ++    +++ LK    +  E LK ++ +  AL
Sbjct: 421 IYKVLKEATRQDLELLKNVWGEILESLKAQRKSLRAL 457


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 28.1 bits (63), Expect = 8.6
 Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 12/114 (10%)

Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
           LE D + ++ L++     E E ++E  N   +    +S       +L+E+E +Q  + E 
Sbjct: 57  LEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLES------EVLEENELLQDSLLEL 110

Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
            ++           E+  L ++L+K  E L   + N   L    +++ + Y  L
Sbjct: 111 SERNFS---PNLDPELLDLLKQLNKHLESL---QGNLEQLAGLVEALERLYAAL 158


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 29.0 bits (66), Expect = 8.6
 Identities = 26/120 (21%), Positives = 47/120 (39%), Gaps = 37/120 (30%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRL---KEKLSKTEEELKKEKTNSAALKSQADSV--- 515
           +  +R  I E   K L    D +  +      L K ++EL +E +     + QAD +   
Sbjct: 331 KLEERLHILEGLLKALN-KIDFVIEVIRGSIDLKKAKKELIEELS-----EIQADYLLDM 384

Query: 516 ------GKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
                  +E ++L KE E+++K                   EI  L++ L+  ++   KE
Sbjct: 385 RLRRLTKEEIEKLEKEIEELEK-------------------EIAELEKILASEKKLWIKE 425


>gnl|CDD|145972 pfam03112, DUF244, Uncharacterized protein family (ORF7) DUF.
           Several members of this family are Borrelia burgdorferi
           plasmid proteins of uncharacterized function.
          Length = 158

 Score = 28.2 bits (63), Expect = 8.6
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEY 519
           EQ    LE    EI  LK +L+K + E KK EK     LK + D +  +Y
Sbjct: 66  EQSALNLECSLFEIDNLKTELNKIQNENKKEEKPIKDLLKIKIDKIMIKY 115


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 28.9 bits (65), Expect = 8.8
 Identities = 9/87 (10%), Positives = 37/87 (42%), Gaps = 8/87 (9%)

Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
            ++   +++ E  +   + +   ++ + ++V  +Y  L   H+      ++     ++K 
Sbjct: 861 HQVDNMVTEYEGNIGDTQNS---IRGEENTVKGQYSELQNHHKTEAL--SQNNKYNEEKS 915

Query: 545 QKG---DSDEIKRLKEKLSKTEEELKK 568
            +     +D  + L ++  + +++ K 
Sbjct: 916 AQERMPGADSPEELMKRAKEYQDKHKG 942


>gnl|CDD|241543 cd13392, PH_AKAP13, A-kinase anchoring protein 13 Pleckstrin
           homology (PH) domain.  The Rho-specific GEF activity of
           AKAP13 (also called Brx-1, AKAP-Lbc, and proto-Lbc)
           mediates signaling downstream of G-protein coupled
           receptors and Toll-like receptor 2. It plays a role in
           cell growth, cell development and actin fiber formation.
           Protein kinase A (PKA) binds and phosphorylates AKAP13,
           regulating its Rho-GEF activity. Alternative splicing of
           this gene in humans has at least 3 transcript variants
           encoding different isoforms (i.e. proto-/onco-Lymphoid
           blast crisis, Lbc and breast cancer nuclear
           receptor-binding auxiliary protein, Brx) containing a
           dbl oncogene homology (DH) domain and PH domain which
           are required for full transforming activity. The DH
           domain is associated with guanine nucleotide exchange
           activation while the PH domain has multiple functions
           including determine protein sub-cellular localisation
           via phosphoinositide interactions, while others bind
           protein partners. Other ligands include protein kinase C
           which is bound by the PH domain of AKAP13, serving to
           activate protein kinase D and mobilize a cardiac
           hypertrophy signaling pathway. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 157

 Score = 28.0 bits (62), Expect = 8.9
 Identities = 24/82 (29%), Positives = 35/82 (42%), Gaps = 8/82 (9%)

Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK---TNSAALKNQADSVGKEYDRLL 591
           E+  +  + Y + DS  I R+K       E+LK+ K       +LKN A  +      LL
Sbjct: 19  EKKKRLKEIYGRTDSKSIMRMKSGQMFAREDLKRRKLVRDGPLSLKNTAGRLKDVQAVLL 78

Query: 592 KE-----HEKVQKVVTEQGDKK 608
            +      EK QK V    D+K
Sbjct: 79  SDILVFLQEKDQKYVFASLDQK 100


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 29.0 bits (66), Expect = 8.9
 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 22/94 (23%)

Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           EQ  K +E   +E K  K+++ K  +EL +   +   L S+A+ VG              
Sbjct: 757 EQLPKTVERFFEEWKEQKKEIEKLRKELAELLAS--ELLSKAEEVGG------------V 802

Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
           KVV E+ D         D DE++++  +L+K  +
Sbjct: 803 KVVVEEVD--------ADEDELRKIANELAKEGK 828


>gnl|CDD|206755 cd08774, 14-3-3, 14-3-3 domain.  14-3-3 domain is an essential part
           of 14-3-3 proteins, a ubiquitous class of regulatory,
           phosphoserine/threonine-binding proteins found in all
           eukaryotic cells, including yeast, protozoa and
           mammalian cells. 14-3-3 proteins play important roles in
           many biological processes that are regulated by
           phosphorylation, including cell cycle regulation, cell
           proliferation, protein trafficking, metabolic regulation
           and apoptosis.  More than 300 binding partners of the
           14-3-3 domain have been identified in all subcellular
           compartments and include transcription factors,
           signaling molecules, tumor suppressors, biosynthetic
           enzymes, cytoskeletal proteins and apoptosis factors.
           14-3-3 binding can alter the conformation, localization,
           stability, phosphorylation state, activity as well as
           molecular interactions of a target protein. They
           function only as dimers, some preferring strictly
           homodimeric interaction, while others form heterodimers.
           Binding of the 14-3-3 domain to its target occurs in a
           phosphospecific manner where it binds to one of two
           consensus sequences of their target proteins; RSXpSXP
           (mode-1) and RXXXpSXP (mode-2). In some instances,
           14-3-3 domain containing proteins are involved in
           regulation and signaling of a number of cellular
           processes in phosphorylation-independent manner. Many
           organisms express multiple isoforms: there are seven
           mammalian 14-3-3 family members (beta, gamma, eta,
           theta, epsilon, sigma, zeta), each encoded by a distinct
           gene, while plants contain up to 13 isoforms. The
           flexible C-terminal segment of 14-3-3 isoforms shows the
           highest sequence variability and may significantly
           contribute to individual isoform uniqueness by playing
           an important regulatory role by occupying the ligand
           binding groove and blocking the binding of inappropriate
           ligands in a distinct manner. Elevated amounts of 14-3-3
           proteins are found in the cerebrospinal fluid of
           patients with Creutzfeldt-Jakob disease. In protozoa,
           like Plasmodium or Cryptosporidium parvum 14-3-3
           proteins play an important role in key steps of parasite
           development.
          Length = 225

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 21/105 (20%)

Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN--SAALKSQADSVGKEYDRLL 523
             ++ EQ         D++ +  +++++   EL KE+ N  S A K+   S    + R+L
Sbjct: 6   LAKLAEQA-----ERYDDMVKYMKQVAELNGELTKEERNLLSVAYKNVVGSRRASW-RIL 59

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
              E+ +               KG+ +++K +KE   K E+ELK 
Sbjct: 60  SSIEQKES-------------SKGNEEKLKLIKEYKEKIEKELKD 91


>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
           4 and similar proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. This subfamily is composed of
           Cdc42-Interacting Protein 4 (CIP4), Formin Binding
           Protein 17 (FBP17), FormiN Binding Protein 1-Like
           (FNBP1L), and similar proteins. CIP4 and FNBP1L are
           Cdc42 effectors that bind Wiskott-Aldrich syndrome
           protein (WASP) and function in endocytosis. CIP4 and
           FBP17 bind to the Fas ligand and may be implicated in
           the inflammatory response. CIP4 may also play a role in
           phagocytosis. Members of this subfamily typically
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. In addition, some members such as FNBP1L contain
           a central Cdc42-binding HR1 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 251

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 26/100 (26%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQA---DSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
           I  L+++  K   E  K       L+S     +   K Y++  KE EK +          
Sbjct: 100 ISELRQERKKHLSEGSKL---QQKLESSIKQLEKSKKAYEKAFKEAEKAK---------- 146

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
             KY+K D+D        ++ T+ +++K K  +A LK QA
Sbjct: 147 -QKYEKADAD--------MNLTKADVEKAK-ANANLKTQA 176


>gnl|CDD|185746 cd08915, V_Alix_like, Protein-interacting V-domain of mammalian
           Alix and related domains.  This superfamily contains the
           V-shaped (V) domain of mammalian Alix (apoptosis-linked
           gene-2 interacting protein X), His-Domain type N23
           protein tyrosine phosphatase (HD-PTP, also known as
           PTPN23), Bro1 and Rim20 (also known as PalA) from
           Saccharomyces cerevisiae, and related domains. Alix,
           HD-PTP, Bro1, and Rim20 all interact with the ESCRT
           (Endosomal Sorting Complexes Required for Transport)
           system. Alix, also known as apoptosis-linked gene-2
           interacting protein 1 (AIP1), participates in membrane
           remodeling processes during the budding of enveloped
           viruses, vesicle budding inside late endosomal
           multivesicular bodies (MVBs), and the abscission
           reactions of mammalian cell division. It also functions
           in apoptosis. HD-PTP functions in cell migration and
           endosomal trafficking, Bro1 in endosomal trafficking,
           and Rim20 in the response to the external pH via the
           Rim101 pathway. The Alix V-domain contains a binding
           site, partially conserved in this superfamily, for the
           retroviral late assembly (L) domain YPXnL motif. The
           Alix V-domain is also a dimerization domain. Members of
           this superfamily have an N-terminal Bro1-like domain,
           which binds components of the ESCRT-III complex. The
           Bro1-like domains of Alix and HD-PTP can also bind human
           immunodeficiency virus type 1 (HIV-1) nucleocapsid. Many
           members, including Alix, HD-PTP, and Bro1, also have a
           proline-rich region (PRR), which binds multiple partners
           in Alix, including Tsg101 (tumor susceptibility gene
           101, a component of ESCRT-1) and the apoptotic protein
           ALG-2. The C-terminal portion (V-domain and PRR) of Bro1
           interacts with Doa4, a ubiquitin thiolesterase needed to
           remove ubiquitin from MVB cargoes; it interacts with a
           YPxL motif in Doa4s catalytic domain to stimulate its
           deubiquitination activity. Rim20 may bind the ESCRT-III
           subunit Snf7, bringing the protease Rim13 (a
           YPxL-containing transcription factor) into proximity
           with Rim101, and promoting the proteolytic activation of
           Rim101. HD-PTP is encoded by the PTPN23 gene, a tumor
           suppressor gene candidate often absent in human kidney,
           breast, lung, and cervical tumors. HD-PTP has a
           C-terminal catalytically inactive tyrosine phosphatase
           domain.
          Length = 342

 Score = 28.8 bits (65), Expect = 9.4
 Identities = 22/133 (16%), Positives = 51/133 (38%), Gaps = 21/133 (15%)

Query: 467 CEIFEQCDKGLEGDSDEIKRLKEK-------LSKTEEELKKEKTNSAALKS---QADSVG 516
            E+ ++C++ LE ++ E  +L+ K          ++E  K+       L+    QA +  
Sbjct: 90  EELLQECEELLEEEAAEDDQLRAKFGTLRWRRPSSDEAAKELYEKVTKLRGYLEQASNSD 149

Query: 517 KEYDRLLKEHEKVQKVVTEQGDKK---------DDKYQKGDSDEIKRLKEKLSKTEEELK 567
            E  +  +  +    +V   G  K                 S+ +  L+  L++  E  K
Sbjct: 150 NEVLQCYESIDP--NLVLLCGGYKELKAFIPSPYPALDPEVSEVVSSLRPLLNEVSELEK 207

Query: 568 KEKTNSAALKNQA 580
           + +   + L+ ++
Sbjct: 208 ERERFISELEIKS 220


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 28.3 bits (63), Expect = 9.6
 Identities = 13/88 (14%), Positives = 29/88 (32%), Gaps = 9/88 (10%)

Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK--EYDRLLKEHEKVQKVVTEQGDKK 540
           ++ R   +  K       E       K             ++  EH  + K       + 
Sbjct: 90  QLVRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDK-------EL 142

Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKK 568
            ++  KG+  + + L+E + K E  ++ 
Sbjct: 143 LEEKLKGEVIKGEPLEEVIDKLEYPVES 170


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 28.7 bits (64), Expect = 9.6
 Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 484 IKRLKEKLSKTEEELKKEKTNSAA-LKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
           ++RL++ + + E+ + KE       L+ + + +GKE + L K    +++ + E  ++ ++
Sbjct: 363 LERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKR-VALEEGLKEL-EQDEE 420

Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
            + K  S E K L EKL     E++K 
Sbjct: 421 NFLKELSKEEKELLEKLKMEASEVEKL 447


>gnl|CDD|148313 pfam06631, DUF1154, Protein of unknown function (DUF1154).  This
           family represents a small conserved region of unknown
           function within eukaryotic phospholipase C (EC:3.1.4.3).
           All members also contain pfam00387 and pfam00388.
          Length = 47

 Score = 25.8 bits (57), Expect = 9.7
 Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 562 TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
           T E L+++K     LK Q     KE D L K+H K +
Sbjct: 10  TLESLRQDKAYQKLLKKQQ----KELDSLKKKHAKER 42



 Score = 25.8 bits (57), Expect = 9.9
 Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 494 TEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
           T E L+++K     LK Q     KE D L K+H K +
Sbjct: 10  TLESLRQDKAYQKLLKKQQ----KELDSLKKKHAKER 42


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 28.3 bits (63), Expect = 9.7
 Identities = 22/107 (20%), Positives = 45/107 (42%), Gaps = 11/107 (10%)

Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEYDRLL 523
           K+  + E+ +K  +  +  ++ LKEK+ +   + KK +      L  +     +  +  L
Sbjct: 1   KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPER---YEQTKENSL 57

Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
            E     +    + +K         S+++K   +  S +EEE KK K
Sbjct: 58  IEKVDSIEEEISEKEKV-------MSEKLKEPAQMSSTSEEEEKKAK 97


>gnl|CDD|227680 COG5391, COG5391, Phox homology (PX) domain protein [Intracellular
           trafficking and secretion / General function prediction
           only].
          Length = 524

 Score = 29.0 bits (65), Expect = 9.8
 Identities = 26/147 (17%), Positives = 43/147 (29%), Gaps = 19/147 (12%)

Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
           E   + E      K L     +  + +E L  T+  L+K          +         R
Sbjct: 355 EGVVQAETLRSSLKELLTQLQDEIKSRESLILTDSNLEKLTD----QNLEDVEELSRSLR 410

Query: 522 LLKEHEKVQ--------KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
                  V                  +D   +K  S  I+ L++   K EE+L       
Sbjct: 411 KNSSQRAVVSQQPEGLTSFSKLSYKLRDFVQEKSRSKSIESLQQDKEKLEEQL------- 463

Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKV 600
           A  +  A  + +E    LK    V+  
Sbjct: 464 AIAEKDAQEINEELKNELKFFFSVRNS 490


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 27.8 bits (63), Expect = 9.8
 Identities = 14/47 (29%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 550 DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
           D++K+L++    +E+E+K+ +     ++   D   K+ D LLK+ EK
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKE---IQKLTDKYIKKIDELLKKKEK 162


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.397 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,909,620
Number of extensions: 2893741
Number of successful extensions: 6081
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5350
Number of HSP's successfully gapped: 588
Length of query: 609
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 506
Effective length of database: 6,369,140
Effective search space: 3222784840
Effective search space used: 3222784840
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.7 bits)