RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10054
         (910 letters)



>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score =  104 bits (261), Expect = 4e-25
 Identities = 67/285 (23%), Positives = 112/285 (39%), Gaps = 73/285 (25%)

Query: 217 ILDEVKYRADYIRYQEAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPP 276
           IL+    RA++ ++Q+ + +++E   E++R+A+A IDW DFVVVET+++   +    PPP
Sbjct: 19  ILERAYKRAEWEKHQKKEEQKKEEEEEEERIAFASIDWQDFVVVETIEFNDIDVLELPPP 78

Query: 277 TTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRG 336
            T +E+  R L Q+               E +      E+++M+                
Sbjct: 79  LTLNELIYRSLEQKSKA----------LEEPDAAAAPEEEEEME---------------- 112

Query: 337 RNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEE 396
                     +E+I+E+              + EE+ K                      
Sbjct: 113 --------EEEEDIDEE--------------KKEESAKKSTPPIK--------------- 135

Query: 397 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
            E   P      LK  +        PS   P       K++         E IP  +  E
Sbjct: 136 -EMKIPAAGESRLKIRK-----DYVPSGNKPRAEKPKKKMIKCP---ITGELIPEDEFDE 186

Query: 457 HMRIGLLDPRWVEQRDKHLNEKINQETVYAPGQAIEASLKQLAER 501
           HMRI LLDPR+ EQ+D++   K  Q         +  +LK+LA++
Sbjct: 187 HMRILLLDPRYKEQKDQYEA-KNQQYASNLTTDDVYRNLKRLAKK 230



 Score = 75.2 bits (185), Expect = 5e-15
 Identities = 48/171 (28%), Positives = 77/171 (45%), Gaps = 31/171 (18%)

Query: 644 GNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSE 703
              PPP T +E+  R L        E+  +  EE +       PE+++   + EED   E
Sbjct: 73  LELPPPLTLNELIYRSL--------EQKSKALEEPDAAAA---PEEEEEMEEEEEDIDEE 121

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD-KVVVKKGYDPKQARPPVVK 762
           +++E                     S P I+   +P A + ++ ++K Y P   +P   K
Sbjct: 122 KKEE-----------------SAKKSTPPIKEMKIPAAGESRLKIRKDYVPSGNKPRAEK 164

Query: 763 PSPTDEFLISPITGEKIPASKVQEHMRIGLLDPRWVEQRDKHLNEKINQET 813
           P    + +  PITGE IP  +  EHMRI LLDPR+ EQ+D++  +     +
Sbjct: 165 PK--KKMIKCPITGELIPEDEFDEHMRILLLDPRYKEQKDQYEAKNQQYAS 213


>gnl|CDD|197818 smart00648, SWAP, Suppressor-of-White-APricot splicing regulator.
           domain present in regulators which are responsible for
          pre-mRNA splicing processes.
          Length = 54

 Score = 90.0 bits (224), Expect = 4e-22
 Identities = 30/54 (55%), Positives = 43/54 (79%)

Query: 31 RNIVDKTANFVARNGPEFESRIRQNEIGNPKFNFLSPGDPYHAYYQHRVKDIRE 84
           +I+DKTA FVARNGPEFE+++ + E  NP+F+FL P DPYHAYY+ ++ + R+
Sbjct: 1  LDIIDKTAQFVARNGPEFEAKLMERERNNPQFDFLKPNDPYHAYYRKKLAEYRQ 54



 Score = 84.6 bits (210), Expect = 3e-20
 Identities = 27/54 (50%), Positives = 35/54 (64%)

Query: 142 LDIVKLTAQFVARNGRGFLTSLMQKEQRNYQFDFLRPQHSLFQYFTKLLEQYTK 195
           LDI+  TAQFVARNG  F   LM++E+ N QFDFL+P      Y+ K L +Y +
Sbjct: 1   LDIIDKTAQFVARNGPEFEAKLMERERNNPQFDFLKPNDPYHAYYRKKLAEYRQ 54


>gnl|CDD|216714 pfam01805, Surp, Surp module.  This domain is also known as the
          SWAP domain. SWAP stands for
          Suppressor-of-White-APricot. It has been suggested that
          these domains may be RNA binding.
          Length = 54

 Score = 82.3 bits (204), Expect = 2e-19
 Identities = 28/54 (51%), Positives = 38/54 (70%), Gaps = 1/54 (1%)

Query: 31 RNIVDKTANFVARNGPEFESRIRQNEIGNPKFNFL-SPGDPYHAYYQHRVKDIR 83
           +I+DKTA FVARNGPEFE+ + + E  NP+FNFL    DPYH YY+ ++ +  
Sbjct: 1  LDIIDKTAQFVARNGPEFEALLMEREQDNPQFNFLKPTNDPYHVYYRWKLAEYS 54



 Score = 77.3 bits (191), Expect = 1e-17
 Identities = 26/53 (49%), Positives = 33/53 (62%), Gaps = 1/53 (1%)

Query: 142 LDIVKLTAQFVARNGRGFLTSLMQKEQRNYQFDFLRPQ-HSLFQYFTKLLEQY 193
           LDI+  TAQFVARNG  F   LM++EQ N QF+FL+P       Y+   L +Y
Sbjct: 1   LDIIDKTAQFVARNGPEFEALLMEREQDNPQFNFLKPTNDPYHVYYRWKLAEY 53


>gnl|CDD|176395 cd01800, SF3a120_C, Ubiquitin-like domain of Mammalian splicing
           factor SF3a_120.  SF3a120_C  Mammalian splicing factor
           SF3a consists of three subunits of 60, 66, and 120 kDa
           and functions early during pre-mRNA splicing by
           converting the U2 snRNP to its active form.  The 120kDa
           subunit (SF3a120) has a carboxy-terminal ubiquitin-like
           domain and two SWAP (suppressor-of-white-apricot)
           domains, referred to collectively as the SURP module, at
           its amino-terminus.
          Length = 76

 Score = 64.4 bits (157), Expect = 6e-13
 Identities = 37/81 (45%), Positives = 45/81 (55%), Gaps = 18/81 (22%)

Query: 806 NEKINQETVYAPEFLLDPRWVEQRDKHLNEKINQET------------GLFFKDSNSLAY 853
             K+N + +     L DP  V      L  KI++ET            G+F KDSNSLAY
Sbjct: 2   EWKLNGQMLNFTLQLSDPVSV------LKVKIHEETGMPAGKQKLQYEGIFIKDSNSLAY 55

Query: 854 YNITPGTKVHLQLKERGGRKK 874
           YN+  GT +HLQLKERGGRKK
Sbjct: 56  YNLANGTIIHLQLKERGGRKK 76


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 42.2 bits (99), Expect = 3e-04
 Identities = 35/110 (31%), Positives = 46/110 (41%), Gaps = 21/110 (19%)

Query: 635 MPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQI 694
           MPP    K     PP T DE       +E  D+  E++EEDEE  E+       +D +  
Sbjct: 1   MPPKGNAKKLKVRPPPTKDE-------EEDWDSQAEEVEEDEEEMED------WEDSLD- 46

Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
             EED  +EE +E+       PS           SQ +I  PP PPA   
Sbjct: 47  --EEDEEAEEVEEETAASSKAPSSS-----SKSSSQETISIPPTPPARRP 89



 Score = 33.0 bits (75), Expect = 0.37
 Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 359 TTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 418
              EE+ D++ EE  + +E  ++ +    EED  +EE +E+       PS          
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSS-----SKS 72

Query: 419 PSQPSIQPPPLPPAPDK 435
            SQ +I  PP PPA   
Sbjct: 73  SSQETISIPPTPPARRP 89


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 27/139 (19%), Positives = 41/139 (29%), Gaps = 3/139 (2%)

Query: 669 EEDLEEDEEMEEEYGQRRPEDDDM-QIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 727
                 DEE E+        D+     D +  S+S  +     P  P       +P+ P 
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208

Query: 728 PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEH 787
            S     P            +   +P     P   P P  + +    T +  P S     
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268

Query: 788 MRIG--LLDPRWVEQRDKH 804
             IG   LD   + +  KH
Sbjct: 269 APIGGITLDDDAIAKAQKH 287



 Score = 34.3 bits (79), Expect = 0.24
 Identities = 19/129 (14%), Positives = 30/129 (23%), Gaps = 2/129 (1%)

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
           E+ +        +   E  +    +     +                   +   PP P  
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSS 218

Query: 408 SLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQEHMRIG--LLDP 465
                 P  P        PP     P   V +       +  P S       IG   LD 
Sbjct: 219 FQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278

Query: 466 RWVEQRDKH 474
             + +  KH
Sbjct: 279 DAIAKAQKH 287


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 40.0 bits (93), Expect = 0.004
 Identities = 22/100 (22%), Positives = 36/100 (36%), Gaps = 13/100 (13%)

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
              + D+      +  E+++  +E+ +S  + +     +      S       P PPPPP
Sbjct: 121 PPPDVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFPPGPPPPP 180

Query: 408 --------SLKKVEPA-----PPLPSQPSIQPPPLPPAPD 434
                         P      PP P  P + PPP P +PD
Sbjct: 181 PGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPMSPD 220



 Score = 29.2 bits (65), Expect = 9.6
 Identities = 19/76 (25%), Positives = 28/76 (36%)

Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPP 417
           G  +EEE +  +         +D  +  ++ E  SS +E +         S       PP
Sbjct: 106 GYGNEEEQNLADLLTPPPDVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPP 165

Query: 418 LPSQPSIQPPPLPPAP 433
            P  P   P P PP P
Sbjct: 166 SPWNPRFPPGPPPPPP 181


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 40.8 bits (95), Expect = 0.005
 Identities = 35/148 (23%), Positives = 62/148 (41%), Gaps = 36/148 (24%)

Query: 289  QERMDNGEEDLEEDEEMEEEYGQ-RRPEDD--DMQIGSDEENDYYNARMRGRNANRGRGR 345
            +E +   EE  E+  + +EE  Q   PED   D     D ++ + +A     N ++G   
Sbjct: 4068 EENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVG 4127

Query: 346  MQEEINEDD--------------MQEGTTSEEESDNEN--------------EEARKSKE 377
              EE+ E+D              +QE T++ +E+ +E               ++A +  E
Sbjct: 4128 ENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHE 4187

Query: 378  AKDNTQVQDMEEDSSS-----EEEQEDI 400
             +D T+ Q    D S      E+E+ED+
Sbjct: 4188 WEDLTESQSQAFDDSEFMHVKEDEEEDL 4215



 Score = 40.4 bits (94), Expect = 0.005
 Identities = 23/123 (18%), Positives = 50/123 (40%), Gaps = 19/123 (15%)

Query: 292  MDNGEEDLEEDE-EMEEEYGQRRPEDDDMQ--IGSDEENDYYNARMRGRNANRGRGRMQE 348
            +++ +   +EDE EM ++ G     DD++Q  I  +               +      + 
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGI----DDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 349  EINED------DMQEGTTSEEESDNEN------EEARKSKEAKDNTQVQDMEEDSSSEEE 396
            ++++D      DM+    ++EE+D E       E+  +     D    QD   D + ++E
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDE 4060

Query: 397  QED 399
            + +
Sbjct: 4061 KMN 4063



 Score = 39.2 bits (91), Expect = 0.014
 Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 5/134 (3%)

Query: 288  MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRM- 346
            + E  D  E   +  ++ E  + +   E+D   +G+ E+ D   +  R  +AN+    M 
Sbjct: 4185 IHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEK-DQIKSIDRDESANQNPDSMN 4243

Query: 347  QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQ--VQDMEEDSSSEEEQEDIPPPP 404
               I ED+  E    + +   +  + +++ E    T+    +  E      E EDI    
Sbjct: 4244 STNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDEL 4303

Query: 405  PPPSLKKVEPAPPL 418
            P  ++ K+ PA P+
Sbjct: 4304 PDYNV-KITPAMPI 4316



 Score = 38.8 bits (90), Expect = 0.017
 Identities = 25/128 (19%), Positives = 44/128 (34%), Gaps = 33/128 (25%)

Query: 294  NGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINED 353
               E+   D E E +       D D ++    E+           +N    ++ +E NE+
Sbjct: 3872 VVSENENSDSEEENQ-------DLDEEVNDIPEDL----------SNSLNEKLWDEPNEE 3914

Query: 354  DMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME----------------EDSSSEEEQ 397
            D+ E      E    N E+    +  DN  ++D +                +D    + Q
Sbjct: 3915 DLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQ 3974

Query: 398  EDIPPPPP 405
            E+   PPP
Sbjct: 3975 ENNSQPPP 3982



 Score = 33.8 bits (77), Expect = 0.56
 Identities = 27/133 (20%), Positives = 51/133 (38%), Gaps = 30/133 (22%)

Query: 294  NGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINED 353
            N  E +EE++E ++EYG++    D      D  +  Y+        +      +EE  E 
Sbjct: 2924 NVSEQIEEEDEKDDEYGEQEKVSDKNDYLKDFID--YDDDGSISTESE----PEEEKREC 2977

Query: 354  DMQ-EGTTSEEESDNENEEARKSKEAKDNTQ-----------------------VQDMEE 389
            +   +G    EE ++++E+  K  E + N +                           E 
Sbjct: 2978 ESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEKIGSEGSLESGGEELVCSNLSEG 3037

Query: 390  DSSSEEEQEDIPP 402
            + SSE E++D+  
Sbjct: 3038 NESSEIEEDDVES 3050



 Score = 32.3 bits (73), Expect = 1.8
 Identities = 24/104 (23%), Positives = 41/104 (39%), Gaps = 18/104 (17%)

Query: 296  EEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDM 355
            E+ LEE+  ++E+  Q    DD   +  D+E             N        + NE+  
Sbjct: 4034 EDPLEENNTLDEDIQQ----DDFSDLAEDDEKM-----------NEDGFEENVQENEEST 4078

Query: 356  QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
            ++G  S+EE +          +A DN    D +   +S E  E+
Sbjct: 4079 EDGVKSDEELEQGEVP---EDQAIDNHPKMDAKSTFASAEADEE 4119



 Score = 30.4 bits (68), Expect = 6.0
 Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 7/88 (7%)

Query: 320  QIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDN---ENEEARKSK 376
             +  ++ N        G  +  G   +   +NEDD  E   +EE++ N    +E   +  
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEAREL 3860

Query: 377  EAKDNTQVQDM----EEDSSSEEEQEDI 400
            E+  N   +D      E+S SEEE +D+
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDL 3888



 Score = 30.4 bits (68), Expect = 6.2
 Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 8/50 (16%)

Query: 665  MDNGEEDLEEDE-EMEEEYGQRRPEDDDMQIDMEEDSSS---EEEQEDIP 710
            +++ +   +EDE EM ++ G     DD++Q D++E++S    E E  D+P
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGI----DDEIQPDIQENNSQPPPENEDLDLP 3990



 Score = 30.0 bits (67), Expect = 7.3
 Identities = 32/172 (18%), Positives = 58/172 (33%), Gaps = 14/172 (8%)

Query: 555  DEETAIGKKIGEEDTRKDDKATWDGHTSSVEAATRAARANITLEDQIHQIHKVKGLLPDE 614
            +  T +G  +G ED         D    + E  T A ++++   +       + G+  D 
Sbjct: 3813 ESGTGLGSGVGAEDITNTLNEDDDLEELANEEDT-ANQSDLDESEARELESDMNGVTKDS 3871

Query: 615  EKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEE 674
               +     S   N  L   +  + +    N       DE     L++    + E+    
Sbjct: 3872 VVSENENSDSEEENQDLDEEVNDIPEDLS-NSLNEKLWDEPNEEDLLETEQKSNEQSAAN 3930

Query: 675  DEE--MEEEYGQRRPEDDDMQIDMEE----------DSSSEEEQEDIPPPPP 714
            +E   + +E   +  ED D Q   +E          D    + QE+   PPP
Sbjct: 3931 NESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPP 3982



 Score = 30.0 bits (67), Expect = 8.7
 Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 346  MQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
            ++E  NE+D     +  +ES+    E+  +   KD+   ++   DS  E +  D
Sbjct: 3837 LEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLD 3889


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 39.8 bits (93), Expect = 0.008
 Identities = 22/94 (23%), Positives = 31/94 (32%), Gaps = 5/94 (5%)

Query: 341 RGRGRMQEEINEDD--MQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
              GR++ E+ E    + EG         E   A ++ EA     +    E  +S E   
Sbjct: 23  SRIGRIERELRELRSLVAEGAAGPVAKAAEQMAAPEAAEAAP---LPAAAESIASPEVPP 79

Query: 399 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
            +PP P           P     PS  P P  P 
Sbjct: 80  PVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPV 113



 Score = 36.3 bits (84), Expect = 0.075
 Identities = 12/47 (25%), Positives = 14/47 (29%)

Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 741
              E  +S E    +PP P           P     PS  P P  P 
Sbjct: 67  AAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPV 113


>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family.  The
           proteins of the Trk family are derived from
           Gram-negative and Gram-positive bacteria, yeast and
           wheat. The proteins of E. coli K12 TrkH and TrkG as well
           as several yeast proteins have been functionally
           characterized.The E. coli TrkH and TrkG proteins are
           complexed to two peripheral membrane proteins, TrkA, an
           NAD-binding protein, and TrkE, an ATP-binding protein.
           This complex forms the potassium uptake system. This
           family is specific for the eukaryotic Trk system
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 800

 Score = 39.6 bits (92), Expect = 0.008
 Identities = 22/159 (13%), Positives = 51/159 (32%), Gaps = 11/159 (6%)

Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG 344
            +L +E  D  E  +     +     +RR   D +  G   + D  +       ++    
Sbjct: 152 TLLQRELEDRPETGVAGRVTVPHGSAKRRDFQDKLFSGEFVKRDEPDQNSPDVKSDT--- 208

Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKE--------AKDNTQVQDMEEDSSSEEE 396
           R  E I++ + ++        D + E+  +S           ++ +      ++ SSE  
Sbjct: 209 RADESISDLEFEKFAKRRGSRDVDPEDLYRSIMMLQGIHERIREKSSANSRSDERSSESI 268

Query: 397 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 435
           QE +   P    +++   +             +     K
Sbjct: 269 QEQVERRPSTSDIERNSQSLTRRYDDKSFDKAVRLRRSK 307



 Score = 35.7 bits (82), Expect = 0.13
 Identities = 16/135 (11%), Positives = 32/135 (23%), Gaps = 10/135 (7%)

Query: 612 PDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKP--GNFPPPTTPDEVGARVLMQERMDNGE 669
             ++K   G              +    +      +          G+R +  E +    
Sbjct: 181 DFQDKLFSGEFVKRDEPDQNSPDVKSDTRADESISDLEFEKFAKRRGSRDVDPEDLYRSI 240

Query: 670 EDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 729
             L+   E   E        D+         SSE  QE +   P    +++   +     
Sbjct: 241 MMLQGIHERIREKSSANSRSDE--------RSSESIQEQVERRPSTSDIERNSQSLTRRY 292

Query: 730 QPSIQPPPLPPAPDK 744
                   +     K
Sbjct: 293 DDKSFDKAVRLRRSK 307


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 38.7 bits (90), Expect = 0.009
 Identities = 19/54 (35%), Positives = 22/54 (40%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
           PPPP   +      APP P  P+  P P P  P+        DP    PP V P
Sbjct: 45  PPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDP 98



 Score = 34.5 bits (79), Expect = 0.23
 Identities = 17/36 (47%), Positives = 17/36 (47%)

Query: 399 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
           D  PPPPPPS     PAP  P  P     P  P PD
Sbjct: 40  DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPD 75



 Score = 32.6 bits (74), Expect = 0.78
 Identities = 16/58 (27%), Positives = 17/58 (29%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTD 767
           P P  PP        P          PP PPA          DP    PP  +P   D
Sbjct: 55  PAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRID 112



 Score = 30.6 bits (69), Expect = 3.1
 Identities = 10/33 (30%), Positives = 10/33 (30%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
             P P        PA P   QP       PP P
Sbjct: 53  AAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85



 Score = 30.2 bits (68), Expect = 4.3
 Identities = 14/51 (27%), Positives = 14/51 (27%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPA 451
           P P  PP        P          PP PPA          DP     PA
Sbjct: 55  PAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 36.6 bits (85), Expect = 0.016
 Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 8/95 (8%)

Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 442
            V              +  PPPPPP+  +    PP  +       P P AP +  V    
Sbjct: 52  DVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTVAK 111

Query: 443 DPKQEK--------IPASKVQEHMRIGLLDPRWVE 469
           DP+  K        +PA  V+  M+   LDP  ++
Sbjct: 112 DPRYAKYFKMLKLGVPAQAVKNKMQSEGLDPNLLD 146



 Score = 32.0 bits (73), Expect = 0.67
 Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 3/72 (4%)

Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQ-PSIQPPPLPPAPDKV 745
           P  +D+ +       +       PPPPPP   +   P PP     P  +P P  PA + V
Sbjct: 48  PGLEDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTV 107

Query: 746 VVKKGYDPKQAR 757
            V K  DP+ A+
Sbjct: 108 TVAK--DPRYAK 117


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 38.4 bits (89), Expect = 0.021
 Identities = 28/134 (20%), Positives = 54/134 (40%), Gaps = 1/134 (0%)

Query: 278 TPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGR 337
           T  E       +E  D GE    ED EM+ + G     + + +  + E++++        
Sbjct: 764 TEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQA 823

Query: 338 NANRGRGRMQE-EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEE 396
           +    +    E E+N ++  E    E+  D          E ++  + ++ EE+   EEE
Sbjct: 824 DDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883

Query: 397 QEDIPPPPPPPSLK 410
           +E+      P SL+
Sbjct: 884 EEEEEENEEPLSLE 897



 Score = 33.0 bits (75), Expect = 0.88
 Identities = 20/109 (18%), Positives = 40/109 (36%), Gaps = 5/109 (4%)

Query: 295 GEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDD 354
            + + E + E+E +    + E +  ++  + E +       G       G   E+  E +
Sbjct: 689 RKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748

Query: 355 MQEGTTSEEESDN-----ENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
            +     E E D      E E   + KE +D  ++Q  E+     +E  
Sbjct: 749 AEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGA 797



 Score = 31.9 bits (72), Expect = 2.0
 Identities = 27/137 (19%), Positives = 50/137 (36%), Gaps = 26/137 (18%)

Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYY-------NARMRGRNANR 341
           +E  D GE + E   E+E E  ++  E +       +E++         +  M+G     
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798

Query: 342 GRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQ---------------- 385
           G+   + E    +  E     E   ++ E   ++ E + N + Q                
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858

Query: 386 ---DMEEDSSSEEEQED 399
              D EE+   EEE+E+
Sbjct: 859 DGGDSEEEEEEEEEEEE 875



 Score = 30.0 bits (67), Expect = 6.7
 Identities = 21/116 (18%), Positives = 35/116 (30%), Gaps = 17/116 (14%)

Query: 293 DNGEEDLEEDEEMEE----------EYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG 342
           D GE +  E+ E  E             +   +  + +   + E +       G      
Sbjct: 728 DEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787

Query: 343 RGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
            G M+ +       EG   + E + E E   K +    +    D  E      EQE
Sbjct: 788 DGEMKGD-------EGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQE 836


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 37.6 bits (87), Expect = 0.029
 Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 9/79 (11%)

Query: 702 SEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
            E EQ    PP P  +     PA    P+  +  P    P  +K    K   P++ R P 
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK---PRRERKP- 67

Query: 761 VKPSPT---DEFLISPITG 776
            KP+     ++F++ P  G
Sbjct: 68  -KPASLWKLEDFVVEPQEG 85



 Score = 32.6 bits (74), Expect = 1.0
 Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 393 SEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIP 450
            E EQ    PP P  +     PA    P+  +  P    P  +K    K   P++E+ P
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK---PRRERKP 67


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 37.3 bits (87), Expect = 0.031
 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 5/83 (6%)

Query: 380 DNTQVQDMEEDSSSE---EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
           +   V+     S S+       D  P PP  S        P P+  S  PP     P  +
Sbjct: 294 EANGVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLAL 353

Query: 437 --VVKKGYDPKQEKIPASKVQEH 457
               K+  D KQ+K  + K++  
Sbjct: 354 SPKRKREGDKKQKKKKSKKLKLT 376



 Score = 37.3 bits (87), Expect = 0.035
 Identities = 16/67 (23%), Positives = 20/67 (29%), Gaps = 2/67 (2%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV--VVKKGYDPKQAR 757
                   D  P PP  S        P P+  S  PP     P  +    K+  D KQ +
Sbjct: 308 DRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKK 367

Query: 758 PPVVKPS 764
               K  
Sbjct: 368 KKSKKLK 374


>gnl|CDD|215813 pfam00240, ubiquitin, Ubiquitin family.  This family contains a
           number of ubiquitin-like proteins: SUMO (smt3
           homologue), Nedd8, Elongin B, Rub1, and Parkin. A number
           of them are thought to carry a distinctive five-residue
           motif termed the proteasome-interacting motif (PIM),
           which may have a biologically significant role in
           protein delivery to proteasomes and recruitment of
           proteasomes to transcription sites.
          Length = 69

 Score = 33.7 bits (78), Expect = 0.036
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 12/51 (23%)

Query: 831 KHLNEKINQETG-------LFFK-----DSNSLAYYNITPGTKVHLQLKER 869
             L EKI  + G       L F      D  +L+ Y I  G+ +HL L+ R
Sbjct: 19  SELKEKIEDKEGIPVDQQRLIFSGKVLEDDTTLSEYGIQDGSTLHLVLRLR 69


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 37.6 bits (87), Expect = 0.037
 Identities = 16/33 (48%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 401  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
            P  PPPP  +   P PP P QP   PPP P  P
Sbjct: 2911 PQAPPPPQPQPQPPPPPQP-QPPPPPPPRPQPP 2942



 Score = 37.6 bits (87), Expect = 0.038
 Identities = 42/158 (26%), Positives = 57/158 (36%), Gaps = 17/158 (10%)

Query: 621  PKRSHAPNPPLPRSMPPMNKPKPGN---FPPPTTPDEVGA--RVLMQERMDNGEEDLEED 675
            P  +HAP+PP P   P  N+P P      PPP  P +  A  RV    R        +  
Sbjct: 2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAS 2677

Query: 676  EEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLP-----SQ 730
               +     RR         +   S ++       P P P +L    P PP P     + 
Sbjct: 2678 SPPQ---RPRRRAARPTVGSLT--SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732

Query: 731  PSIQPPPLPPAPDKVVVKKGYDPKQARPPVVK--PSPT 766
            P++   P PPA        G   + ARPP     P+P 
Sbjct: 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770



 Score = 36.8 bits (85), Expect = 0.072
 Identities = 17/61 (27%), Positives = 21/61 (34%), Gaps = 1/61 (1%)

Query: 706  QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA-PDKVVVKKGYDPKQARPPVVKPS 764
             +    PP P +    +P P  P  P  QPPP PP  P   +           P    P 
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961

Query: 765  P 765
            P
Sbjct: 2962 P 2962



 Score = 36.1 bits (83), Expect = 0.11
 Identities = 13/36 (36%), Positives = 16/36 (44%)

Query: 397  QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
             +    PP P +    +P P  P  P  QPPP PP 
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 31/163 (19%), Positives = 45/163 (27%), Gaps = 23/163 (14%)

Query: 621  PKRSHAPNPPLPRSMPPMNKPKPGN-----------FPPPTTPDEVGARVLMQERMDNGE 669
            P     P  P P S+P      PG               P  P     R L +  +    
Sbjct: 2836 PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST 2895

Query: 670  EDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 729
            E      +  E     +P+         +     + Q    PPPPPP   +   AP    
Sbjct: 2896 ESFALPPDQPER--PPQPQAPPPPQPQPQPPPPPQPQ----PPPPPPPRPQPPLAPTTDP 2949

Query: 730  QPSIQPPPLPPAPDKVVVKKGYDP------KQARPPVVKPSPT 766
              + +P    P P    +  G          Q  P    P+ +
Sbjct: 2950 AGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASS 2992



 Score = 34.9 bits (80), Expect = 0.27
 Identities = 29/158 (18%), Positives = 37/158 (23%), Gaps = 3/158 (1%)

Query: 612  PDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEED 671
            PD   +   P+             PP   P   + P P  P    A              
Sbjct: 2589 PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP 2648

Query: 672  LEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
             E   +        RP          + SS  +        P   SL  +   PP P  P
Sbjct: 2649 PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708

Query: 732  SIQPPPLPPA---PDKVVVKKGYDPKQARPPVVKPSPT 766
               P  L  A   P      +   P     P     P 
Sbjct: 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746



 Score = 33.4 bits (76), Expect = 0.75
 Identities = 11/55 (20%), Positives = 13/55 (23%)

Query: 401  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
                   S       P  P QP   PPP P        +    P     P   + 
Sbjct: 2890 AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA 2944



 Score = 33.0 bits (75), Expect = 1.0
 Identities = 18/67 (26%), Positives = 20/67 (29%), Gaps = 10/67 (14%)

Query: 701  SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
            S   E   +PP  P    +   P PP P      PP   P P          P   RP  
Sbjct: 2892 SRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP----------PPPPRPQP 2941

Query: 761  VKPSPTD 767
                 TD
Sbjct: 2942 PLAPTTD 2948



 Score = 32.2 bits (73), Expect = 1.6
 Identities = 17/53 (32%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 382  TQVQDMEEDSSSEEEQEDIPPPP--PPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
            T+   +  D      Q   PPPP   P      +P PP P  P  QPP  P  
Sbjct: 2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947



 Score = 31.1 bits (70), Expect = 3.6
 Identities = 18/65 (27%), Positives = 23/65 (35%), Gaps = 1/65 (1%)

Query: 710  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLP-PAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
            P   PP  L +   A    S+ S+  P  P   P  V+      P  A P    P PT  
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834

Query: 769  FLISP 773
               +P
Sbjct: 2835 QPTAP 2839



 Score = 31.1 bits (70), Expect = 4.2
 Identities = 28/126 (22%), Positives = 39/126 (30%), Gaps = 14/126 (11%)

Query: 629  PPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEE----------- 677
            PP P + P  ++  P   P     + V  R+L   R   G E+L  D+            
Sbjct: 2503 PPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIR---GLEELASDDAGDPPPPLPPAA 2559

Query: 678  MEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPP 737
                  +  P         E   +S   + D PP    P     +   P    P    PP
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619

Query: 738  LPPAPD 743
               APD
Sbjct: 2620 DTHAPD 2625



 Score = 30.3 bits (68), Expect = 6.5
 Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 4/72 (5%)

Query: 710  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
            P  P  P+       P  P   +  P P PPA        G   +  RP V   S + E 
Sbjct: 2742 PAVPAGPATPGGPARPARPPTTAGPPAPAPPA----APAAGPPRRLTRPAVASLSESRES 2797

Query: 770  LISPITGEKIPA 781
            L SP      PA
Sbjct: 2798 LPSPWDPADPPA 2809



 Score = 30.3 bits (68), Expect = 7.3
 Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 401  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
             P P P +L    P PP P+      P LP AP
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739



 Score = 30.3 bits (68), Expect = 7.4
 Identities = 19/53 (35%), Positives = 23/53 (43%), Gaps = 5/53 (9%)

Query: 710  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP--DKVVVKKGYDPKQARPPV 760
            PPPP PP+     PA P  S P  +P P P  P       +    P+ ARP  
Sbjct: 2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRA 2600



 Score = 29.9 bits (67), Expect = 7.9
 Identities = 11/40 (27%), Positives = 14/40 (35%)

Query: 394  EEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
                    PPPPPP   +   AP      + +P    P P
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962



 Score = 29.9 bits (67), Expect = 8.6
 Identities = 19/83 (22%), Positives = 26/83 (31%), Gaps = 7/83 (8%)

Query: 710  PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKG-------YDPKQARPPVVK 762
            P  P  P      PAP  P+ P+  PP     P    + +        +DP      V+ 
Sbjct: 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA 2813

Query: 763  PSPTDEFLISPITGEKIPASKVQ 785
            P+       SP      P S   
Sbjct: 2814 PAAALPPAASPAGPLPPPTSAQP 2836


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
           Provisional.
          Length = 418

 Score = 36.8 bits (85), Expect = 0.050
 Identities = 28/164 (17%), Positives = 49/164 (29%), Gaps = 11/164 (6%)

Query: 272 NFPPPTT-PDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYY 330
           N   PT+ P     R            DL +D+  E         D +    S       
Sbjct: 265 NSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHS-------DPEPLPASLPPPPVR 317

Query: 331 NARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEED 390
             R++   A +            +  +  TS          + ++K++        +   
Sbjct: 318 RPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSK---GSSSAQNKDSGSTGPGSSLAAA 374

Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
           SS  E+ +   PP    +  +  P+P + S P     PL    D
Sbjct: 375 SSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418



 Score = 33.7 bits (77), Expect = 0.43
 Identities = 24/134 (17%), Positives = 38/134 (28%), Gaps = 21/134 (15%)

Query: 620 GPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEME 679
                H+   PLP S+PP    +P    P    +E            +G  + E  E  +
Sbjct: 296 DDGSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEE-----------PDGARNAEAKEPAQ 344

Query: 680 EEYGQRRPEDDDMQIDMEEDSSS----------EEEQEDIPPPPPPPSLKKVEPAPPLPS 729
                        Q      +             E+ +   PP    +  +  P+P + S
Sbjct: 345 PATSTSSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTS 404

Query: 730 QPSIQPPPLPPAPD 743
            P     PL    D
Sbjct: 405 APEPPSIPLTYLSD 418


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 36.9 bits (86), Expect = 0.057
 Identities = 26/131 (19%), Positives = 55/131 (41%), Gaps = 12/131 (9%)

Query: 289 QERMDNGEEDLEED--EEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRM 346
            ER+    + LE +    M  E      E+D  +   D ++++        + N G G+ 
Sbjct: 268 AERL----KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDD----DDNFGLGQG 319

Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPP 406
           +E+  E++       EE+ D++ EE  +  +   + + +D E++ S +E+ E+       
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEE--DVDLSDEEEDEEDEDSDDEDDEEEEEEEKE 377

Query: 407 PSLKKVEPAPP 417
              KK   +  
Sbjct: 378 KKKKKSAESTR 388



 Score = 35.0 bits (81), Expect = 0.20
 Identities = 32/138 (23%), Positives = 55/138 (39%), Gaps = 17/138 (12%)

Query: 285 RVLMQERMD--NGEEDLEEDEEMEEEYGQ----RRPEDDDMQIGSDEENDYYNARMRGRN 338
           + L  ER+    GEE+ +E+EE  +E           DDD   G  +  +       G +
Sbjct: 272 KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD 331

Query: 339 ANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
                    EE ++DD++E     + SD E +E  +  + +D     D EE+   +E+++
Sbjct: 332 DE------DEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDED-----DEEEEEEEKEKKK 380

Query: 399 DIPPPPPPPSLKKVEPAP 416
                     L    P P
Sbjct: 381 KKSAESTRSELPFTFPCP 398



 Score = 35.0 bits (81), Expect = 0.22
 Identities = 22/119 (18%), Positives = 46/119 (38%), Gaps = 31/119 (26%)

Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEE 349
           +    G+ + +E+EE +    +   +DDD     +E+ D                     
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVD-------------------LS 352

Query: 350 INEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPS 408
             E+D ++  + +E+ + E EE ++ K+ K            S+E  + ++P   P P 
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKK------------SAESTRSELPFTFPCPK 399



 Score = 31.1 bits (71), Expect = 3.7
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 666 DNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
           D  EED ++D E EEE      E++D + +  +D   EEE+E+
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEE 374



 Score = 30.7 bits (70), Expect = 4.8
 Identities = 13/47 (27%), Positives = 24/47 (51%)

Query: 663 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDI 709
           +    G+ + +E+EE +    +   +DDD   + EED    +E+ED 
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDE 358



 Score = 30.4 bits (69), Expect = 5.7
 Identities = 28/147 (19%), Positives = 50/147 (34%), Gaps = 44/147 (29%)

Query: 298 DLEEDEEMEEE---YGQ---RRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEIN 351
           +LE+D+E E      GQ      +DDD++   D ++D                      +
Sbjct: 105 NLEDDDEDEFVLTHLGQSLSEIDKDDDVRDDDDFDDD--------------DLGDLASDD 150

Query: 352 EDDMQEGTTSEEESDNENEEARKSKE-----------------AKDNTQVQDMEEDSSSE 394
                 G   +EE + E  E +KSK+                  K   + +D+ E+   +
Sbjct: 151 RAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREE--LD 208

Query: 395 EEQEDI-----PPPPPPPSLKKVEPAP 416
           ++ +D+        PPP      E   
Sbjct: 209 DDFKDLMSLLRTVKPPPKPPMTPEEKD 235


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 36.7 bits (85), Expect = 0.058
 Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 5/129 (3%)

Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG--RGRMQ 347
           E +D+  + L E E  E   G    E  D Q  +DE    +  + + R   R   R    
Sbjct: 244 ESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRA 303

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDN-TQVQDM-EEDSSSEEEQEDIPPPPP 405
           +  +E  + E    EE     +E++              D  E+D  S+EE E       
Sbjct: 304 KPSDEPSLPESDIHEEI-PKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKK 362

Query: 406 PPSLKKVEP 414
            P  +KV P
Sbjct: 363 KPRKRKVNP 371


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 35.8 bits (83), Expect = 0.062
 Identities = 14/68 (20%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 351 NEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLK 410
            E D++     +  S++      +SK+    T   + ++ SSSE    D+ PP      +
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE-E 168

Query: 411 KVEPAPPL 418
             E    +
Sbjct: 169 IAEEEEEV 176



 Score = 30.8 bits (70), Expect = 2.6
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 9/68 (13%)

Query: 668 GEEDLE--EDEEMEEEYGQRRPEDD------DMQIDMEEDSSSEEEQEDIPPPPPPPSLK 719
           GE DLE   DE+   E      E+        ++++ ++ SSSE    D+ PP      +
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE-E 168

Query: 720 KVEPAPPL 727
             E    +
Sbjct: 169 IAEEEEEV 176


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 36.2 bits (84), Expect = 0.068
 Identities = 24/103 (23%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 296 EEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDM 355
           +   EE ++ E E G    +DDD +   + E D           +   G   E I+ +  
Sbjct: 77  KWKEEERKKKEAEQGLESDDDDDEEEEWEVEED---------EDSDDEG---EWIDVESD 124

Query: 356 QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
           +E  +S+ E + E +EA K  +   + ++ + +E+ ++EEE+ 
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167



 Score = 31.5 bits (72), Expect = 1.8
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 666 DNGEEDL--EEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPP 711
            + E +    EDEE ++E  ++  ED D ++  E++  + EE+E    
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 36.7 bits (84), Expect = 0.069
 Identities = 34/172 (19%), Positives = 68/172 (39%), Gaps = 9/172 (5%)

Query: 232  EAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTTPDEVGARVLMQER 291
            E  +K EEA+++ + +  A+    +   VE +     E           E   ++   E 
Sbjct: 1610 EEAKKAEEAKIKAEELKKAE---EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666

Query: 292  MDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG----RMQ 347
                EED ++ EE ++     +   +   +  + E       ++ + A   +     +  
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAE--ALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724

Query: 348  EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
            EE N+   +E     EE   + EEA+K +E K        EE+  +EE +++
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776



 Score = 30.9 bits (69), Expect = 4.8
 Identities = 35/185 (18%), Positives = 77/185 (41%), Gaps = 9/185 (4%)

Query: 219  DEVKYRADYIRYQEAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTT 278
            DE+K +A+ ++  E ++K EEA+  ++    A     +    E                 
Sbjct: 1549 DELK-KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607

Query: 279  PDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDM---QIGSDEENDYYNARMR 335
              E  A+   + ++   E    E+E+ + E  +++  ++     ++   EE +   A   
Sbjct: 1608 KAE-EAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666

Query: 336  GRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAK--DNTQVQDMEEDSSS 393
             + A   + + +E    +  ++   + E    E EEA+K++E K  +  + +  EE   +
Sbjct: 1667 AKKAEEDKKKAEEAKKAE--EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724

Query: 394  EEEQE 398
            EEE +
Sbjct: 1725 EEENK 1729


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 36.2 bits (84), Expect = 0.077
 Identities = 12/54 (22%), Positives = 19/54 (35%)

Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPS 423
                +  AK      +  + ++   E      P PP + K  EPAP     P+
Sbjct: 127 AAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPT 180



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 10/46 (21%), Positives = 17/46 (36%)

Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           + + ++ E+ +   P P PP+  K  P               PP P
Sbjct: 136 KAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTP 181



 Score = 31.6 bits (72), Expect = 2.3
 Identities = 12/60 (20%), Positives = 17/60 (28%), Gaps = 4/60 (6%)

Query: 708 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTD 767
             PP   P +    +     P +P    P   P            P  A+PP   P+   
Sbjct: 122 GAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPT----PPAAAKPPEPAPAAKP 177



 Score = 30.4 bits (69), Expect = 4.2
 Identities = 11/62 (17%), Positives = 13/62 (20%), Gaps = 1/62 (1%)

Query: 702 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV-KKGYDPKQARPPV 760
                        P   K   P P  P+     PP     P+     K    P     P 
Sbjct: 129 PAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPR 188

Query: 761 VK 762
             
Sbjct: 189 ET 190



 Score = 30.0 bits (68), Expect = 6.4
 Identities = 16/79 (20%), Positives = 25/79 (31%), Gaps = 8/79 (10%)

Query: 654 EVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP 713
           EVGA +     +D G                 +   +       + ++   E      P 
Sbjct: 110 EVGAPLS---EIDTGGAPPAAAPAAAAAAKAEKTTPE-----KPKAAAPTPEPPAASKPT 161

Query: 714 PPPSLKKVEPAPPLPSQPS 732
           PP + K  EPAP     P+
Sbjct: 162 PPAAAKPPEPAPAAKPPPT 180


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 36.6 bits (85), Expect = 0.080
 Identities = 28/161 (17%), Positives = 46/161 (28%), Gaps = 23/161 (14%)

Query: 310 GQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNEN 369
            +RR   D+       EN   N R R +   +     + +  E   +  T  E++     
Sbjct: 618 NERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRR 677

Query: 370 EEARKSKEAKDNTQ----VQDMEEDSSSEEEQEDIPPPPPP------------------- 406
           E  R+  + K   Q      ++EE S  E EQE+      P                   
Sbjct: 678 ERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAE 737

Query: 407 PSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQE 447
            ++  V            + P       KV +        E
Sbjct: 738 EAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPE 778


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 35.9 bits (83), Expect = 0.11
 Identities = 16/71 (22%), Positives = 20/71 (28%), Gaps = 4/71 (5%)

Query: 701 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
             +E   +   PPP P      P  P         P L  A   V  K  Y P       
Sbjct: 398 PPKEPVRETATPPPVPP----RPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEE 453

Query: 761 VKPSPTDEFLI 771
            K    D  ++
Sbjct: 454 EKALIADGDVL 464



 Score = 34.4 bits (79), Expect = 0.27
 Identities = 14/53 (26%), Positives = 16/53 (30%), Gaps = 4/53 (7%)

Query: 392 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP 444
             +E   +   PPP P      P  P         P L  A   V  K  Y P
Sbjct: 398 PPKEPVRETATPPPVPP----RPVAPPVPHTPESAPKLTRAAIPVDEKPKYTP 446



 Score = 33.2 bits (76), Expect = 0.75
 Identities = 14/72 (19%), Positives = 16/72 (22%), Gaps = 16/72 (22%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
           P P   P        PP         PP  P                 PPV     +   
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP------------VAPPVPHTPESAP- 429

Query: 770 LISPITGEKIPA 781
               +T   IP 
Sbjct: 430 ---KLTRAAIPV 438



 Score = 30.9 bits (70), Expect = 3.7
 Identities = 9/55 (16%), Positives = 12/55 (21%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
           P P   P        PP         PP  P                K+  + + 
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIP 437



 Score = 30.2 bits (68), Expect = 6.0
 Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 1/61 (1%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV-VVKKGYDPKQARPPVVKPSPTD 767
                 PP     E A P P  P    PP+P  P+    + +   P   +P    P+P  
Sbjct: 392 AAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPK 451

Query: 768 E 768
           E
Sbjct: 452 E 452


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 35.2 bits (81), Expect = 0.12
 Identities = 33/163 (20%), Positives = 46/163 (28%), Gaps = 15/163 (9%)

Query: 601 IHQIHKVKGLLPDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVL 660
               H  K  +P    +  G      PN P         K           P E  A   
Sbjct: 34  YFITHHKKAPIPFLSNQGTGSL---LPNKPEEVWSYI--KALEDRQIGVPQPTEPAAVKD 88

Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKK 720
            +       + LE+ E  ++    +  E  +            EEQ              
Sbjct: 89  AERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQA--------RIEEQPRTQSQKAQSQATT 140

Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
           V+  P  P     +P P+ PAP    V+K   PK    P  KP
Sbjct: 141 VQTQPVKPKPRPEKPQPVAPAPAPEPVEK--APKAEAAPPPKP 181



 Score = 32.5 bits (74), Expect = 0.85
 Identities = 26/143 (18%), Positives = 39/143 (27%), Gaps = 9/143 (6%)

Query: 605 HKVKGLLPDEEKEK---IGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLM 661
                LLP++ +E    I          P P       +P         TP++      M
Sbjct: 50  QGTGSLLPNKPEEVWSYIKALEDRQIGVPQP------TEPAAVKDAERLTPEQRQLLEQM 103

Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
           +         L E  E      Q R +    Q       +   + +  P  P P +    
Sbjct: 104 EVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPAPA 163

Query: 722 EPAPPLPSQPSIQPPPLPPAPDK 744
                   +    PPP P A D 
Sbjct: 164 PEPVEKAPKAEAAPPPKPKAEDA 186



 Score = 30.2 bits (68), Expect = 5.0
 Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 5/92 (5%)

Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQ--EDIPPPPPPPSLKKVEPAPPLPSQPSI--- 424
            E R+  E  +  Q     +     E+   E+ P      +  +       P +P     
Sbjct: 94  PEQRQLLEQMEVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPE 153

Query: 425 QPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
           +P P+ PAP    V+K    +    P  K ++
Sbjct: 154 KPQPVAPAPAPEPVEKAPKAEAAPPPKPKAED 185


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 36.0 bits (84), Expect = 0.12
 Identities = 19/101 (18%), Positives = 31/101 (30%), Gaps = 5/101 (4%)

Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
            D         ++           P  +     PA P P+ P+   P  PP P       
Sbjct: 34  ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93

Query: 750 G--YDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEHM 788
                P  A       +   E  ++P+ G    A+ V ++M
Sbjct: 94  AAPAAPPAAAAAAAPAAAAVEDEVTPLRG---AAAAVAKNM 131


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 35.6 bits (83), Expect = 0.13
 Identities = 23/111 (20%), Positives = 37/111 (33%), Gaps = 1/111 (0%)

Query: 289 QERMDNGEE-DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQ 347
            +R D   E +  E    EE        D   +    +  +    R  G    R R R +
Sbjct: 176 GDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDR 235

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
            +   DD +E     +  D E    R+ +  +D  +      D  +E E E
Sbjct: 236 RDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPE 286



 Score = 32.6 bits (75), Expect = 1.3
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQE 348
           + R    +      ++  E+ G R  +D + + G          R RGR    G    + 
Sbjct: 227 RGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDR-RGRRGGDGGNEREP 285

Query: 349 EINEDD 354
           E+ EDD
Sbjct: 286 ELREDD 291



 Score = 30.6 bits (70), Expect = 4.5
 Identities = 25/132 (18%), Positives = 41/132 (31%), Gaps = 12/132 (9%)

Query: 268 GEPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEEN 327
           GE           E G +   + R D  E   EE+ +     G R     + + G     
Sbjct: 134 GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGE---- 189

Query: 328 DYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDM 387
                  RGR   RGR     +   D  ++G   EE    +  + R  +  +D    +  
Sbjct: 190 -------RGRREERGRDGDDRD-RRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGD 241

Query: 388 EEDSSSEEEQED 399
           +      +   D
Sbjct: 242 DNREDRGDRDGD 253


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.0 bits (78), Expect = 0.14
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)

Query: 365 SDNENEEARKSKEAKDNTQVQDMEE--DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 422
           SD+E EE    +E+ D T+ +D  E    S E+++ ++P P       K EP  P PS+P
Sbjct: 94  SDDEEEE--TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151



 Score = 29.7 bits (67), Expect = 3.3
 Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 26/102 (25%)

Query: 630 PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPED 689
           PL R++P M K                   + +E   + +E+ E +EE  +E  Q  P +
Sbjct: 76  PLVRNLPAMWK-------------------IFRELSSSDDEEEETEEESTDETEQEDPPE 116

Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
                   +  S E+++ ++P P       K EP  P PS+P
Sbjct: 117 -------TKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 34.8 bits (80), Expect = 0.14
 Identities = 19/70 (27%), Positives = 26/70 (37%)

Query: 387 MEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
           +         +E  P P PP    K    P  P +P+   P   P P+K   K    PK 
Sbjct: 44  LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKP 103

Query: 447 EKIPASKVQE 456
           +  P  KV+ 
Sbjct: 104 KPKPKPKVKP 113



 Score = 32.1 bits (73), Expect = 1.2
 Identities = 19/90 (21%), Positives = 27/90 (30%), Gaps = 2/90 (2%)

Query: 696 MEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 755
           +         +E  P P PP    K    P  P +P+   P   P P+K   K    PK 
Sbjct: 44  LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKP--KPKP 101

Query: 756 ARPPVVKPSPTDEFLISPITGEKIPASKVQ 785
              P  KP    +        +    +   
Sbjct: 102 KPKPKPKPKVKPQPKPKKPPSKTAAKAPAA 131



 Score = 30.1 bits (68), Expect = 5.0
 Identities = 15/69 (21%), Positives = 21/69 (30%), Gaps = 1/69 (1%)

Query: 381 NTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ-PPPLPPAPDKVVVK 439
             Q +    +   +   E   PP P P   K +P P    +     P P P    KV  +
Sbjct: 55  EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQ 114

Query: 440 KGYDPKQEK 448
                   K
Sbjct: 115 PKPKKPPSK 123


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 35.2 bits (82), Expect = 0.14
 Identities = 21/80 (26%), Positives = 31/80 (38%), Gaps = 13/80 (16%)

Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
             +  + EE+    EE+E    P   P+  +  PAP    Q   Q PP    P       
Sbjct: 221 AILLGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPK---QEQQQAPPQRQEP------- 270

Query: 750 GYDPKQARPPVVKPSPTDEF 769
               K+A+P  V+P+    F
Sbjct: 271 ---EKEAQPVNVQPAQFQSF 287



 Score = 34.1 bits (79), Expect = 0.29
 Identities = 13/53 (24%), Positives = 20/53 (37%)

Query: 385 QDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 437
            + EE+    EE+E    P   P+  +  PAP    Q +      P    + V
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPV 277


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 33.8 bits (78), Expect = 0.15
 Identities = 30/137 (21%), Positives = 56/137 (40%), Gaps = 22/137 (16%)

Query: 279 PDEVGARVLMQERMDNGEEDLEEDEE--------MEEEYGQRRPEDDDMQIGSDEENDY- 329
            DE      +++ +D+ +E+ EE+E+        ME +Y      D   + G D+E+DY 
Sbjct: 1   EDEPDEEEELEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYE 60

Query: 330 ---------YNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKD 380
                      AR+  R+    R RM     +DD  +    + + D   +  R+ ++  +
Sbjct: 61  ELSAAARRAAEARLNRRDRLLRRRRMPAAFLDDDDDD----DGDFDLTAQPRRRRRQYDE 116

Query: 381 NTQVQDMEEDSSSEEEQ 397
           +    D + D   EE  
Sbjct: 117 DRDADDEDIDPLEEELS 133


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 35.4 bits (82), Expect = 0.17
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)

Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 438
           +  +  +E+ PPPPPPP     E     P     Q      A +++V+
Sbjct: 260 APPQPPEEEPPPPPPPP-----EDDDDPPEDEEEQDDAEDRALEEIVL 302



 Score = 35.4 bits (82), Expect = 0.17
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
           +  +  +E+ PPPPPPP     E     P     Q      A +++V+
Sbjct: 260 APPQPPEEEPPPPPPPP-----EDDDDPPEDEEEQDDAEDRALEEIVL 302


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 34.4 bits (79), Expect = 0.23
 Identities = 22/68 (32%), Positives = 26/68 (38%), Gaps = 5/68 (7%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
            PPPP        PA P  +QP+   P   P P          PK + PP+  P     F
Sbjct: 154 QPPPPHAMP----PASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSPM-AGTF 208

Query: 770 LISPITGE 777
             SP  GE
Sbjct: 209 YRSPAPGE 216



 Score = 31.0 bits (70), Expect = 2.6
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 402 PPPPPPSLKKVEPAP--PLPSQPSIQPPPLPPAP 433
            PPPP ++    P    P PS P+  PPP P +P
Sbjct: 154 QPPPPHAMPPASPPAAQPAPSAPASSPPPTPASP 187



 Score = 30.2 bits (68), Expect = 5.0
 Identities = 11/33 (33%), Positives = 13/33 (39%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           P   PP S    +PAP  P+      P  PP  
Sbjct: 158 PHAMPPASPPAAQPAPSAPASSPPPTPASPPPA 190



 Score = 29.8 bits (67), Expect = 5.6
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 4/33 (12%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
            PPPP        PA P  +QP+   P   P P
Sbjct: 154 QPPPPHAMP----PASPPAAQPAPSAPASSPPP 182


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 34.6 bits (80), Expect = 0.25
 Identities = 14/63 (22%), Positives = 22/63 (34%)

Query: 371 EARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 430
                  A + +      + S+ + +Q    P PPP       P P  P++PS  PP   
Sbjct: 373 APANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432

Query: 431 PAP 433
              
Sbjct: 433 ANA 435



 Score = 33.8 bits (78), Expect = 0.47
 Identities = 14/57 (24%), Positives = 19/57 (33%)

Query: 377 EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
            A  +         + +       PP P  PS  K + A   PS P  +  P  P P
Sbjct: 362 SAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVP 418



 Score = 33.0 bits (76), Expect = 0.91
 Identities = 14/89 (15%), Positives = 20/89 (22%), Gaps = 8/89 (8%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
           I       +     PAP     P+   P  P                + PP     P   
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQA-------ATTPSPPPAKASPPIPV 417

Query: 769 FLISPITGEKIPASKVQEHMRIGLLDPRW 797
                      PA+       +  L+  W
Sbjct: 418 PAEPTEPSPTPPANAANAPPSLN-LEELW 445



 Score = 32.6 bits (75), Expect = 1.0
 Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
            + S+ + +Q    P PPP       P P  P++PS  PP      
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANA 435



 Score = 32.2 bits (74), Expect = 1.5
 Identities = 17/68 (25%), Positives = 21/68 (30%), Gaps = 5/68 (7%)

Query: 703 EEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVK 762
                  PP P  PS  K + A   PS P  +    PP P      +   P    P    
Sbjct: 379 PAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAS--PPIPVPAEPTE---PSPTPPANAA 433

Query: 763 PSPTDEFL 770
            +P    L
Sbjct: 434 NAPPSLNL 441



 Score = 31.1 bits (71), Expect = 2.9
 Identities = 17/69 (24%), Positives = 23/69 (33%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPP 759
           S++ +        PPPP+     P P   + P   PPP P A           P  + PP
Sbjct: 518 SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPP 577

Query: 760 VVKPSPTDE 768
              P     
Sbjct: 578 PPIPEEPTP 586



 Score = 31.1 bits (71), Expect = 3.6
 Identities = 20/78 (25%), Positives = 28/78 (35%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
             PPPP  S     P PPLP   +  PPP PP P     +   +     P    P P   
Sbjct: 522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581

Query: 769 FLISPITGEKIPASKVQE 786
              +P   +     ++ +
Sbjct: 582 EEPTPSPTKDSSPEEIDK 599



 Score = 29.9 bits (68), Expect = 6.7
 Identities = 13/75 (17%), Positives = 21/75 (28%)

Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIP 450
            +    Q+  PPP P P L +     P P+ P   P     + +              IP
Sbjct: 522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581

Query: 451 ASKVQEHMRIGLLDP 465
                   +    + 
Sbjct: 582 EEPTPSPTKDSSPEE 596



 Score = 29.5 bits (67), Expect = 8.8
 Identities = 14/72 (19%), Positives = 19/72 (26%), Gaps = 4/72 (5%)

Query: 367 NENEEARKSKEAKDNTQVQDMEEDSSSEE---EQEDIPPPPPPPSLKKVEPAP-PLPSQP 422
             +E A  S  A            +  +    + +     P PP  K   P P P     
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423

Query: 423 SIQPPPLPPAPD 434
               PP   A  
Sbjct: 424 PSPTPPANAANA 435


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 34.6 bits (80), Expect = 0.27
 Identities = 14/66 (21%), Positives = 28/66 (42%), Gaps = 12/66 (18%)

Query: 659 VLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSL 718
           V +Q  +D    D++++EE      +   ++DDM  DM++D   E +       P     
Sbjct: 216 VEIQNELD----DIDDEEEERVLADEDDDDEDDM-FDMDDDDEEESD-------PEVERT 263

Query: 719 KKVEPA 724
             ++  
Sbjct: 264 STIKEV 269


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 34.6 bits (80), Expect = 0.28
 Identities = 21/116 (18%), Positives = 46/116 (39%), Gaps = 6/116 (5%)

Query: 288 MQERMDNGEEDLE----EDEEMEEEYGQRRPEDDDMQIGSDE-ENDYYNARMRGRNANRG 342
           ++E++   E+ L     E EE+EEE  Q R E +++         D              
Sbjct: 689 LEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEER 748

Query: 343 RGRMQEEINEDDMQ-EGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
             ++ +E+ E + + E      E   E     +++  +   Q++ ++E+  +  E 
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA 804


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 33.8 bits (78), Expect = 0.31
 Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 3/57 (5%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
           + PP  P    + EPAP  P +P     P P    +  V+    PK    PVV+   
Sbjct: 90  VAPPNTPV---EPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143



 Score = 32.3 bits (74), Expect = 0.74
 Identities = 15/56 (26%), Positives = 20/56 (35%)

Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
            PP  P   +  P  P   +P  +P P P    KV       P+       K +PT
Sbjct: 91  APPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146



 Score = 31.9 bits (73), Expect = 1.2
 Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 1/65 (1%)

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
             +         PSL     APP  +    +P P+ P   K V K    PK  +     P
Sbjct: 71  AAEAVRAGDAAAPSLDPATVAPP-NTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPP 129

Query: 764 SPTDE 768
           +P  E
Sbjct: 130 APKPE 134



 Score = 30.7 bits (70), Expect = 2.9
 Identities = 14/74 (18%), Positives = 25/74 (33%), Gaps = 3/74 (4%)

Query: 698 EDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP---PLPPAPDKVVVKKGYDPK 754
           E ++      D   P   P+       P  P    ++PP   P+     K   ++  +  
Sbjct: 69  EGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAP 128

Query: 755 QARPPVVKPSPTDE 768
            A  P  KP   ++
Sbjct: 129 PAPKPEPKPVVEEK 142



 Score = 30.0 bits (68), Expect = 5.2
 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 3/56 (5%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
           + PP  P    + EPAP  P +P     P P    +  V+    PK E  P  + +
Sbjct: 90  VAPPNTPV---EPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEK 142



 Score = 29.2 bits (66), Expect = 8.7
 Identities = 12/48 (25%), Positives = 16/48 (33%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
           + P P P    K +P      +P  Q     P   K   K   + K A
Sbjct: 97  VEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAA 144


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 34.4 bits (79), Expect = 0.34
 Identities = 15/97 (15%), Positives = 22/97 (22%)

Query: 668 GEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 727
                  DE  E    +    + D  +     +S    ++D    PP  +      A   
Sbjct: 11  RWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGGVATST 70

Query: 728 PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPS 764
                  P       DK             P  V P 
Sbjct: 71  IYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPG 107



 Score = 32.4 bits (74), Expect = 1.1
 Identities = 23/119 (19%), Positives = 39/119 (32%), Gaps = 14/119 (11%)

Query: 336 GRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEE 395
           G    RG+       + D+      S E S +EN +A +                 ++ E
Sbjct: 1   GPGKPRGQPGRWNYDSSDE------SPEGSRDENFDAERDDFLTPLGSTS-----EATSE 49

Query: 396 EQEDIPPPPPPPSLKKVEPAPP--LPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPAS 452
           + +D+ PP    S   V  +    +P  P   P      PD +   +   P    +P  
Sbjct: 50  DDDDLYPPRETGSGGGVATSTIYTVPRPPR-GPEQTLDKPDSLPASRELPPGPTPVPPG 107


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 34.6 bits (79), Expect = 0.35
 Identities = 18/31 (58%), Positives = 18/31 (58%), Gaps = 10/31 (32%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           PPPPPPP     EP    PSQP   PPP PP
Sbjct: 10  PPPPPPPGF---EP----PSQP---PPPPPP 30



 Score = 34.6 bits (79), Expect = 0.35
 Identities = 18/31 (58%), Positives = 18/31 (58%), Gaps = 10/31 (32%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           PPPPPPP     EP    PSQP   PPP PP
Sbjct: 10  PPPPPPPGF---EP----PSQP---PPPPPP 30


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 34.1 bits (78), Expect = 0.35
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 3/103 (2%)

Query: 300 EEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGT 359
           E     EE   +R P D        + +   +     R   R R     E + D+  E  
Sbjct: 319 EPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDE-DEDD 377

Query: 360 TSEEESDNENEEARKSKEAKDNTQVQDME--EDSSSEEEQEDI 400
           + E+E D E E  R+ +E K+  +++ +E  + +   E  E I
Sbjct: 378 SDEDEVDYEKERKRRREEDKNFLRLKALELSKYAGVNERMEKI 420


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 34.3 bits (79), Expect = 0.36
 Identities = 42/192 (21%), Positives = 73/192 (38%), Gaps = 18/192 (9%)

Query: 280 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGR-- 337
           DE  + +L  + M   E    + EE + E  + R E +  +   +EEN+  + +  GR  
Sbjct: 367 DEENSGLLSMKFMQRAEA--RKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRK 424

Query: 338 -----NANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARK-----SKEAKDNTQVQDM 387
                       +  ++ N+++ +E   S+EE + E+EE  K     +K  K + + Q  
Sbjct: 425 FGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE 484

Query: 388 EEDS-SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
           EE+    EE            S KK +      S+       +  A  KV  KK    K+
Sbjct: 485 EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKK---KKE 541

Query: 447 EKIPASKVQEHM 458
           + I         
Sbjct: 542 KSIDLDDDLIDE 553


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 34.0 bits (78), Expect = 0.37
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKK 411
           ++D  E   S E SD+E E   K KE +   +++ M ED   +EE E +P  P     + 
Sbjct: 268 DEDEDEPKPSGERSDSEEETEEKEKEKRK--RLKKMMEDEDEDEEMEIVPESPVEEE-ES 324

Query: 412 VEPAPPLPSQPSIQPPPLPPAPD 434
            EP PP   +   +   +  +PD
Sbjct: 325 EEPEPPPLPKKEEEKEEVTVSPD 347



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 26/114 (22%), Positives = 44/114 (38%), Gaps = 16/114 (14%)

Query: 614 EEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLE 673
           EE E+   KR               ++ +  + P P             ER D+ EE  E
Sbjct: 242 EESEEESGKRDVILEDESAEPTGL-DEDEDEDEPKP-----------SGERSDSEEETEE 289

Query: 674 EDEEMEEEYGQRRPEDDDMQIDMEED--SSSEEEQEDIPPPPPPPSLKKVEPAP 725
           +++E  +    ++  +D+ + +  E    S  EE+E   P PPP   K+ E   
Sbjct: 290 KEKEKRKR--LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341



 Score = 29.8 bits (67), Expect = 7.5
 Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 3/97 (3%)

Query: 649 PTTPDEVGARVLMQERMDNGEEDLEEDE--EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQ 706
               ++  A     +  ++ +E     E  + EEE  ++  E       M ED   +EE 
Sbjct: 252 DVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEM 311

Query: 707 EDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
           E +P  P     +  EP PP   +   +   +  +PD
Sbjct: 312 EIVPESPVEEE-ESEEPEPPPLPKKEEEKEEVTVSPD 347


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 33.4 bits (76), Expect = 0.39
 Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)

Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
           A  P + R     +  +  +  PP  PD V          +  EE LEE +  EEE  + 
Sbjct: 146 AEKPRVTRFNIVWDNDEDNDEAPPAQPD-VD---------NEEEERLEESDGREEEEDEE 195

Query: 686 RPEDDDMQIDMEEDSSSEEEQE 707
              D   + + E +   EEE E
Sbjct: 196 VGSDSYGEGNRELNEEEEEEAE 217



 Score = 30.3 bits (68), Expect = 4.5
 Identities = 27/123 (21%), Positives = 48/123 (39%), Gaps = 25/123 (20%)

Query: 293 DNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINE 352
           +  EE LEE +  EEE      ED+++   S  E +                    E+NE
Sbjct: 176 NEEEERLEESDGREEE------EDEEVGSDSYGEGN-------------------RELNE 210

Query: 353 DDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKV 412
           ++ +E   S++  D  + E  +  + +   +  + E  +  E E E+  P    P   + 
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEE 270

Query: 413 EPA 415
            PA
Sbjct: 271 SPA 273


>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 211

 Score = 33.3 bits (76), Expect = 0.40
 Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 3/75 (4%)

Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
           DDMQ+D   D  S +E     P  P P  +    A P+ S P+  P P P +P     + 
Sbjct: 75  DDMQLD-GIDKLSRDEPRRTEPQEPDPLDESPASAAPVASAPA--PAPSPQSPKPASRRA 131

Query: 750 GYDPKQARPPVVKPS 764
             D ++  P  +  S
Sbjct: 132 SRDMRRIAPFGMNAS 146


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 34.0 bits (78), Expect = 0.43
 Identities = 38/239 (15%), Positives = 63/239 (26%), Gaps = 41/239 (17%)

Query: 606 KVKGLLPDEEKEKIGPKRSHAPNP--PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQE 663
             + +L  EE E    +R  AP    P  + +P    P+   FP    P           
Sbjct: 149 PAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPP----------- 197

Query: 664 RMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPS---LKK 720
                        E    Y Q      +     +   +  +     P PP  P      +
Sbjct: 198 -------------EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQ 244

Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIP 780
               P L  Q    PP  P    +    +   P Q +P      P  +    P       
Sbjct: 245 QPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPP----PQ 300

Query: 781 ASKVQEHMRIGLLDPRWVEQRDKHLNEKINQETVYAPEFLLDPRWVEQRDKHLNEKINQ 839
             ++   ++      R  + R++ +     Q    + E        E +      KI  
Sbjct: 301 QPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQE--------EAKRAKRRHKIVS 351


>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
          Length = 313

 Score = 33.6 bits (77), Expect = 0.44
 Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 6/91 (6%)

Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME---EDSSSEEEQEDIPPPPPPPS 408
           +D +  G   EEE   E EEA    +A+   Q + +E   E+   EE  E+  P      
Sbjct: 90  DDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVP 149

Query: 409 LKKVEPAPPLPSQPSIQPPPLP-PAPDKVVV 438
               E  P   ++  +  P  P P P+  V+
Sbjct: 150 --MAEVQPEEETEIEVDEPEEPKPEPELDVI 178



 Score = 32.5 bits (74), Expect = 0.85
 Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 6/99 (6%)

Query: 673 EEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPP--PPPPPSLKKVEPAPPLPSQ 730
           E D   E  +     +D     ++EE+    E++E   P      P  +KVEP    P  
Sbjct: 75  EPDFGRENSFHDPLIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRD 134

Query: 731 PSIQPPPLPPAPD----KVVVKKGYDPKQARPPVVKPSP 765
             +   P P A      +V  ++  + +   P   KP P
Sbjct: 135 EEVLEEPEPVAAKVPMAEVQPEEETEIEVDEPEEPKPEP 173



 Score = 31.7 bits (72), Expect = 1.6
 Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 6/107 (5%)

Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPP--PPPPPSL 409
           EDD +      +E D   E +       D     ++EE+    E++E   P      P  
Sbjct: 63  EDDFEIIRKERKEPDFGRENSFHDPLIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQP 122

Query: 410 KKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
           +KVEP    P    +   P P A    +     + + E+    +V E
Sbjct: 123 EKVEPQVEEPRDEEVLEEPEPVAAKVPMA----EVQPEEETEIEVDE 165


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 33.5 bits (77), Expect = 0.46
 Identities = 21/127 (16%), Positives = 39/127 (30%), Gaps = 10/127 (7%)

Query: 316 DDDMQIGS-----DEENDYYNA----RMRGRNANRGRGRMQEEINEDDMQEGTTSEEESD 366
           D DM+ G      DE  +   +    R   R  +       E +   +  E ++  +   
Sbjct: 219 DRDMEFGLISVVLDEHLEASRSSNHVRRPPRPGHLLVVSTPELVLPPEPDEQSSERQTFK 278

Query: 367 NENEEARKSK-EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ 425
              +    S  E    + V + +++  S + + D PP     +    E     P  P   
Sbjct: 279 TPPQPESSSDAENGLVSLVDEDDKEEVSRDSESDAPPDDTDLTRALAEYEAAAPEVPDAA 338

Query: 426 PPPLPPA 432
              L   
Sbjct: 339 RTVLQGK 345


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 33.5 bits (77), Expect = 0.49
 Identities = 25/143 (17%), Positives = 40/143 (27%), Gaps = 11/143 (7%)

Query: 613 DEEKEKIGPKRSHAPNP-PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEED 671
            E+ E +G  R H  N  P         +P P +   P        + + Q      E  
Sbjct: 57  VEDDEGVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPP----EAQ 112

Query: 672 LEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP------PPPSLKKVEPAP 725
           +             +P          E    +     + P P      P P+ +  +PA 
Sbjct: 113 VPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAE 172

Query: 726 PLPSQPSIQPPPLPPAPDKVVVK 748
           P+ +          P  DK   K
Sbjct: 173 PVAAPQPEPVAEPAPVMDKPKRK 195



 Score = 33.1 bits (76), Expect = 0.61
 Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 5/90 (5%)

Query: 684 QRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
            R+P     +  +    +    Q    P   P    + E     P  P + P P P    
Sbjct: 101 PRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSA 160

Query: 744 KVVVKKGYDPKQARP-----PVVKPSPTDE 768
               ++ + P +        PV +P+P  +
Sbjct: 161 PQPAQQAFQPAEPVAAPQPEPVAEPAPVMD 190



 Score = 29.6 bits (67), Expect = 6.9
 Identities = 20/159 (12%), Positives = 41/159 (25%), Gaps = 28/159 (17%)

Query: 296 EEDLEEDEEMEEEYGQRR---------PEDDDMQIGSDEENDYYNARMRGRNANRGRGRM 346
           ++  +ED E +E  G+ R            +        ++ Y       +     +   
Sbjct: 50  DDSYDEDVEDDEGVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPP 109

Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP-- 404
           + ++             +                    Q   E    +     + P P  
Sbjct: 110 EAQVPPQHAPRPAQPAPQ-------------PVQQPAYQPQPEQPLQQPVSPQVAPAPQP 156

Query: 405 ----PPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVK 439
               P P+ +  +PA P+ +          P  DK   K
Sbjct: 157 VHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKRK 195



 Score = 29.6 bits (67), Expect = 8.7
 Identities = 28/124 (22%), Positives = 46/124 (37%), Gaps = 10/124 (8%)

Query: 669 EEDLEEDEEMEEEYGQRR-----PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEP 723
           ++  +ED E +E  G+ R         + Q   E +++    Q    PP      ++   
Sbjct: 50  DDSYDEDVEDDEGVGEVRVHRVNHAPANAQ---EHEAARPSPQHQYQPPYASAQPRQPVQ 106

Query: 724 APPL-PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPAS 782
            PP     P   P P  PAP + V +  Y P+  +P     SP       P+     PA 
Sbjct: 107 QPPEAQVPPQHAPRPAQPAP-QPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQ 165

Query: 783 KVQE 786
           +  +
Sbjct: 166 QAFQ 169



 Score = 29.2 bits (66), Expect = 8.8
 Identities = 12/76 (15%), Positives = 24/76 (31%), Gaps = 2/76 (2%)

Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL-PSQPSIQPPPLPPAPDKVVVKKG 441
             +       +    +  P P   P+ +     P   P  P + P P P        ++ 
Sbjct: 108 PPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167

Query: 442 YDPKQEKIPASKVQEH 457
           + P  E + A + +  
Sbjct: 168 FQP-AEPVAAPQPEPV 182


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 33.8 bits (77), Expect = 0.52
 Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 14/109 (12%)

Query: 298 DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQE 357
           ++    E+ EE G     +D+      E+ D         +      + ++   E + QE
Sbjct: 192 EMLRSMELAEEMGDDTESEDE------EDGD--------DDQPTENEQEEQGEGEGEGQE 237

Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPP 406
           G+  +E    + E     +E   + Q    +E     E   +   P  P
Sbjct: 238 GSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPARP 286


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 32.2 bits (73), Expect = 0.54
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
            PPPPP P    V P PPLP  P + PPP P 
Sbjct: 97  PPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPF 128



 Score = 32.2 bits (73), Expect = 0.54
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
            PPPPP P    V P PPLP  P + PPP P 
Sbjct: 97  PPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPF 128



 Score = 32.2 bits (73), Expect = 0.59
 Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 438
           IPPPPP P    + P PPLP    I PPP PP P    V
Sbjct: 72  IPPPPPLPGAAGIPPPPPLPGGAGIPPPP-PPLPGGAAV 109



 Score = 32.2 bits (73), Expect = 0.59
 Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
           IPPPPP P    + P PPLP    I PPP PP P    V
Sbjct: 72  IPPPPPLPGAAGIPPPPPLPGGAGIPPPP-PPLPGGAAV 109



 Score = 30.3 bits (68), Expect = 2.3
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           IPPPPP P +  + P PPLP   +I PPP  P
Sbjct: 48  IPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79



 Score = 30.3 bits (68), Expect = 2.3
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           IPPPPP P +  + P PPLP   +I PPP  P
Sbjct: 48  IPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79



 Score = 30.3 bits (68), Expect = 2.9
 Identities = 17/32 (53%), Positives = 19/32 (59%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           IPPPPP P    + P PPLP   +I PPP  P
Sbjct: 24  IPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55



 Score = 30.3 bits (68), Expect = 2.9
 Identities = 17/32 (53%), Positives = 19/32 (59%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           IPPPPP P    + P PPLP   +I PPP  P
Sbjct: 24  IPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 60  IPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 60  IPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91



 Score = 29.5 bits (66), Expect = 4.2
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 36  IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67



 Score = 29.5 bits (66), Expect = 4.2
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 36  IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67



 Score = 29.5 bits (66), Expect = 4.2
 Identities = 14/32 (43%), Positives = 14/32 (43%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
            PPPP P       P PP P  P I PPP   
Sbjct: 110 PPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGM 141



 Score = 29.1 bits (65), Expect = 6.2
 Identities = 16/31 (51%), Positives = 17/31 (54%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           PPPPP P    + P PPLP    I PPP  P
Sbjct: 1   PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31



 Score = 29.1 bits (65), Expect = 6.2
 Identities = 16/31 (51%), Positives = 17/31 (54%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           PPPPP P    + P PPLP    I PPP  P
Sbjct: 1   PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31



 Score = 28.4 bits (63), Expect = 9.8
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 12  IPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43



 Score = 28.4 bits (63), Expect = 9.8
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           IPPPPP P    + P PPLP    I PPP  P
Sbjct: 12  IPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 33.2 bits (76), Expect = 0.57
 Identities = 11/72 (15%), Positives = 21/72 (29%), Gaps = 6/72 (8%)

Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQE 447
           E + ++                     AP   +      P L PA  ++  + G D    
Sbjct: 76  EGNDATAAPPAKSGEEKEETPAASAAAAPTAAA----NRPSLSPAARRLAKEHGIDL--S 129

Query: 448 KIPASKVQEHMR 459
            +P + V   + 
Sbjct: 130 AVPGTGVTGRVT 141



 Score = 32.8 bits (75), Expect = 0.88
 Identities = 13/91 (14%), Positives = 25/91 (27%), Gaps = 8/91 (8%)

Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
           E + ++                     AP   +      P L PA  ++  + G D    
Sbjct: 76  EGNDATAAPPAKSGEEKEETPAASAAAAPTAAA----NRPSLSPAARRLAKEHGIDLSAV 131

Query: 757 RPPVVKPSPTDE----FLISPITGEKIPASK 783
               V    T E       +P + ++   + 
Sbjct: 132 PGTGVTGRVTKEDIIKKTEAPASAQQPAPAA 162


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 33.4 bits (77), Expect = 0.58
 Identities = 21/83 (25%), Positives = 33/83 (39%), Gaps = 6/83 (7%)

Query: 280 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMR--GR 337
           DE       ++  ++ ++D E DEE EE       EDDD     D+++D          R
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDD---DDDDDDDIATRERSLERR 168

Query: 338 NANRGRGRMQEEINEDDMQEGTT 360
              R     + E+ E    +GT 
Sbjct: 169 RRRREWEEKRAEL-EFYYYQGTY 190



 Score = 31.1 bits (71), Expect = 3.4
 Identities = 16/55 (29%), Positives = 26/55 (47%)

Query: 653 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQE 707
           DE       ++  ++ ++D E DEE EE       EDDD   D ++ ++ E   E
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166



 Score = 29.6 bits (67), Expect = 9.2
 Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 665 MDNGEEDLEEDEEMEEEYGQRR---PEDDDMQIDMEEDSSSEEEQED 708
            D+G+ + E  +E   +   R     +DDD + D E++ SS+ E ++
Sbjct: 101 FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDE 147


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score = 33.4 bits (77), Expect = 0.58
 Identities = 12/57 (21%), Positives = 17/57 (29%), Gaps = 3/57 (5%)

Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPA 451
           E   D    P   +           +  + +  P P AP     +   DP    IPA
Sbjct: 82  ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADP---DIPA 135



 Score = 33.0 bits (76), Expect = 0.68
 Identities = 9/50 (18%), Positives = 14/50 (28%)

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP 753
           E   D    P   +           +  + +  P P AP     +   DP
Sbjct: 82  ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADP 131



 Score = 30.7 bits (70), Expect = 3.6
 Identities = 8/44 (18%), Positives = 15/44 (34%)

Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
           ++     E     P   +    + A P P+ P+     +   PD
Sbjct: 89  AAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132



 Score = 30.7 bits (70), Expect = 3.6
 Identities = 8/44 (18%), Positives = 15/44 (34%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
           ++     E     P   +    + A P P+ P+     +   PD
Sbjct: 89  AAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 130

 Score = 31.7 bits (72), Expect = 0.59
 Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 3/40 (7%)

Query: 396 EQEDI--PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           E E+I  P     P+     P P  P++ S Q     P P
Sbjct: 17  EMEEIGAPAQAAAPAQPASTPVPV-PTEASPQVEAQAPQP 55



 Score = 31.7 bits (72), Expect = 0.59
 Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 3/40 (7%)

Query: 705 EQEDI--PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           E E+I  P     P+     P P  P++ S Q     P P
Sbjct: 17  EMEEIGAPAQAAAPAQPASTPVPV-PTEASPQVEAQAPQP 55


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 33.1 bits (76), Expect = 0.60
 Identities = 25/108 (23%), Positives = 40/108 (37%), Gaps = 18/108 (16%)

Query: 657 ARVLMQERMDNGEE-DLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPP 715
            R    + M+  +E D  E EE+ ++YG     D+  Q  M      +E ++ +P    P
Sbjct: 152 DRTRESQDMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSR---PQEVKQSVPAQQAP 208

Query: 716 PSLKKVEPAPPLPSQPSIQ--------------PPPLPPAPDKVVVKK 749
           P+ ++  P+    S  S                PP   PAP      K
Sbjct: 209 PNPQQPMPSASSESATSKSASTSRESSPQPQSPPPRRVPAPTVDSTAK 256


>gnl|CDD|214563 smart00213, UBQ, Ubiquitin homologues.  Ubiquitin-mediated
           proteolysis is involved in the regulated turnover of
           proteins required for controlling cell cycle
           progression.
          Length = 72

 Score = 30.3 bits (69), Expect = 0.61
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 12/49 (24%)

Query: 831 KHLNEKINQET------------GLFFKDSNSLAYYNITPGTKVHLQLK 867
             L EKI + T            G   +D  +LA Y I  G+ +HL L+
Sbjct: 24  SELKEKIAELTGIPPEQQRLIYKGKVLEDDRTLADYGIQDGSTIHLVLR 72


>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
          Length = 293

 Score = 32.8 bits (75), Expect = 0.61
 Identities = 12/95 (12%), Positives = 31/95 (32%), Gaps = 6/95 (6%)

Query: 346 MQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP 405
            + +++E D+   +    E         K+ + +   +  + ++   + +  E  P    
Sbjct: 67  KEPQLDEHDLPSMSARPRER----RRDTKTAKQQKRGRGSEPQQGDLNLDLDEVEPALFS 122

Query: 406 PPSLKKVEPAPPLPSQPSIQPPP--LPPAPDKVVV 438
                          +     PP  LPP  + +V+
Sbjct: 123 DRDDDFTPDKRKSKGREPRIEPPKELPPVEEVLVI 157


>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
          Length = 472

 Score = 32.8 bits (75), Expect = 0.77
 Identities = 22/94 (23%), Positives = 37/94 (39%), Gaps = 7/94 (7%)

Query: 367 NENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP--PPSLKKVEPAPPLPSQPSI 424
            +N  A+  KE   NT +    ++ + ++    I P  P   P +     + PLP     
Sbjct: 55  KQNAPAQGPKEKAGNTSM--FAKEIAGDQTGGLIEPASPLKVPEMPTGPASAPLPI-ARP 111

Query: 425 QPPPLPPAPDKVVVKKGY--DPKQEKIPASKVQE 456
             P  PP P       G   D + ++I  +K+Q 
Sbjct: 112 DNPDAPPTPPANPGNPGQVNDDEAQRIRMAKLQM 145


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 33.0 bits (75), Expect = 0.81
 Identities = 26/124 (20%), Positives = 38/124 (30%), Gaps = 2/124 (1%)

Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARV--LMQERMDNGEEDLEEDEEM 678
              S + +P    S    +              E         +E     EE+  EDEE 
Sbjct: 413 GTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472

Query: 679 EEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 738
           EEE       +++M+   E D   EE +ED        +          P   S+QP   
Sbjct: 473 EEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESP 532

Query: 739 PPAP 742
              P
Sbjct: 533 QEEP 536



 Score = 33.0 bits (75), Expect = 0.91
 Identities = 37/200 (18%), Positives = 59/200 (29%), Gaps = 12/200 (6%)

Query: 229 RYQEAQRKQEEA----RVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTTPDEVGA 284
           +Y   Q   EE     R E++R   +           T    P             E   
Sbjct: 390 KYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEE 449

Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIG------SDEENDYYNARMRGRN 338
               +E  +  E + EE E+ EEE         + ++        D E    +A  R   
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509

Query: 339 ANRGRGRMQEEIN-EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
                   + +      +Q  +  EE    E+ +A    E  D   + + E   SS  E 
Sbjct: 510 MAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAE-ESPLSSHTEL 568

Query: 398 EDIPPPPPPPSLKKVEPAPP 417
           E +  P         +  PP
Sbjct: 569 EGVATPVETKISSSRKLPPP 588


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 32.2 bits (73), Expect = 0.81
 Identities = 31/113 (27%), Positives = 45/113 (39%), Gaps = 7/113 (6%)

Query: 627 PNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRR 686
           P  P P S PP   P PG+  PP T D     + +Q      +++ ++ E  +   G  +
Sbjct: 54  PTTPHPSSQPPPCPPSPGH--PPQTNDTHEKDLALQPPPGGKKKEKKKKETEKPAQGGEK 111

Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 739
           P D   +   E +    E+      PPPP    +VE  P     P   P PL 
Sbjct: 112 P-DQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGP----SPGPGPGPLD 159


>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
           Provisional.
          Length = 445

 Score = 32.9 bits (75), Expect = 0.92
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 5/33 (15%)

Query: 399 DIPPPPPPPSL--KKVEP---APPLPSQPSIQP 426
           D+PPPPPPP +  K+VE      P  S+P+  P
Sbjct: 128 DVPPPPPPPPVVAKRVETPAVVAPRVSEPARNP 160



 Score = 32.9 bits (75), Expect = 0.92
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 5/33 (15%)

Query: 708 DIPPPPPPPSL--KKVEP---APPLPSQPSIQP 735
           D+PPPPPPP +  K+VE      P  S+P+  P
Sbjct: 128 DVPPPPPPPPVVAKRVETPAVVAPRVSEPARNP 160


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 32.8 bits (75), Expect = 0.92
 Identities = 31/205 (15%), Positives = 48/205 (23%), Gaps = 29/205 (14%)

Query: 586 AATRAARANITLEDQIHQIHKVKGLLPDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGN 645
           A  RA+  +IT  D +     +      E    + P          P  +      +   
Sbjct: 744 ARARASAWDIT--DALFSNPSLVPAKLAEALALLEPAEPQRGAGSSP-PVRAEAAFRRPG 800

Query: 646 FPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEY-----GQRRPED----------- 689
               + P    A     +R           E            +  P             
Sbjct: 801 RLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAA 860

Query: 690 --DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
                +           E    P    PP       A P    P+ +P P P      + 
Sbjct: 861 AGGRARGKNGRRRPRPPEPRARPGAAAPPKAA---AAAPPAGAPAPRPRPAPRVKLGPMP 917

Query: 748 KKGYDPKQA--RPPV---VKPSPTD 767
             G DP+    R P      P+P+ 
Sbjct: 918 PGGPDPRGGFRRVPPGDLHTPAPSA 942



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 21/145 (14%), Positives = 27/145 (18%), Gaps = 29/145 (20%)

Query: 626 APNPPLPRSMPPMNKPKPGN------FPPPTTPDEVGARVLMQERMDNGEEDLEEDEEME 679
           + +   P S      P P         PP   P                  DL E     
Sbjct: 91  SLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP---------APDLSEMLRPV 141

Query: 680 EEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 739
              G   P             ++                   + A PL S       P  
Sbjct: 142 GSPG---PPPAASPPAAGASPAAVAS-----DAASSR-----QAALPLSSPEETARAPSS 188

Query: 740 PAPDKVVVKKGYDPKQARPPVVKPS 764
           P  +             RPP     
Sbjct: 189 PPAE-PPPSTPPAAASPRPPRRSSP 212



 Score = 31.3 bits (71), Expect = 3.2
 Identities = 16/82 (19%), Positives = 17/82 (20%), Gaps = 4/82 (4%)

Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
                       + S        P PPPP       PA P PS        L P      
Sbjct: 91  SLSTLAPASPAREGSPTPPGPSSPDPPPPTP----PPASPPPSPAPDLSEMLRPVGSPGP 146

Query: 747 VKKGYDPKQARPPVVKPSPTDE 768
                 P     P    S    
Sbjct: 147 PPAASPPAAGASPAAVASDAAS 168



 Score = 30.9 bits (70), Expect = 3.9
 Identities = 17/59 (28%), Positives = 22/59 (37%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQEHMR 459
           PPP P P L ++      P  P    PP   A    V       +Q  +P S  +E  R
Sbjct: 126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETAR 184



 Score = 29.8 bits (67), Expect = 8.0
 Identities = 16/81 (19%), Positives = 20/81 (24%)

Query: 353 DDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKV 412
            D  E  T           A +S+     +         + E       P  P P     
Sbjct: 62  CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTP 121

Query: 413 EPAPPLPSQPSIQPPPLPPAP 433
            PA P PS        L P  
Sbjct: 122 PPASPPPSPAPDLSEMLRPVG 142



 Score = 29.8 bits (67), Expect = 9.6
 Identities = 23/127 (18%), Positives = 31/127 (24%), Gaps = 9/127 (7%)

Query: 620 GPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGA---RVLMQERMDNGEEDLEEDE 676
            P R  +P PP P S  P     P   PPP+   ++      V                 
Sbjct: 99  SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158

Query: 677 EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 736
                          + +   E++         P  PP        PA   P  P    P
Sbjct: 159 PAAVASDAASSRQAALPLSSPEET------ARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212

Query: 737 PLPPAPD 743
               A  
Sbjct: 213 ISASASS 219


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 32.5 bits (74), Expect = 0.96
 Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 7/90 (7%)

Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
           RM+ + ++D   E     ++         +  EA   TQ  +    S   + Q +     
Sbjct: 42  RMKSKRDDDPYDE---VADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQ 98

Query: 405 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
               +    P PP  S P   P P  PA D
Sbjct: 99  IKIPV----PQPPQISDPPAHPQPTQPALD 124



 Score = 32.1 bits (73), Expect = 1.1
 Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 11/126 (8%)

Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
            +R D+  +++ +DE + E   +   +++  Q   E +++ +  Q    P      +K  
Sbjct: 45  SKRDDDPYDEVADDEGVGEV--RVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIP 102

Query: 722 EPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPA 781
            P PP  S P   P P  PA D+         +          P +     P+     P 
Sbjct: 103 VPQPPQISDPPAHPQPTQPALDQ--------EQPPEEARQPVLPQEAPAPQPVHSAA-PQ 153

Query: 782 SKVQEH 787
             VQ  
Sbjct: 154 PAVQTV 159


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.9 bits (75), Expect = 1.0
 Identities = 20/119 (16%), Positives = 27/119 (22%), Gaps = 3/119 (2%)

Query: 627 PNPPLPRS--MPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQ 684
           P            +  P    +  P +      R          E D    +        
Sbjct: 159 PRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDW 218

Query: 685 RRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
            RP  D      E    +       P PP       V   P  P   + QP P P   +
Sbjct: 219 DRPRRDRTDRP-EPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGE 276


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 32.5 bits (74), Expect = 1.0
 Identities = 21/119 (17%), Positives = 45/119 (37%), Gaps = 3/119 (2%)

Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG 344
           R  ++E  +  +E+       + E     P+ D+ +   DEE          ++    R 
Sbjct: 158 RQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERR 217

Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPP 403
              E    + ++    + EE      + RK +E ++N +    +ED   E+ ++    P
Sbjct: 218 AAVERRLAELVEMINWNLEERR---RDLRKEQELEENVERDSDDEDEHGEDSEDGETKP 273


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           P P  P +     PAPP+P  P IQ    PP P
Sbjct: 228 PSPAQPSAPPASIPAPPIP--PVIQYVAPPPVP 258



 Score = 32.6 bits (74), Expect = 1.1
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           P P  P +     PAPP+P  P IQ    PP P
Sbjct: 228 PSPAQPSAPPASIPAPPIP--PVIQYVAPPPVP 258


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 26/160 (16%), Positives = 47/160 (29%), Gaps = 15/160 (9%)

Query: 306 EEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEES 365
            E  G   P     Q G+    D  N   +  + +               QE T  +EE 
Sbjct: 493 RERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTS----LRYQELTPVQEED 548

Query: 366 DNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ 425
           + E++      +      ++  ++  S  E+  D+    PP  + + E        P+  
Sbjct: 549 EPEDQ---TDDDDSSLPPLESDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQN 605

Query: 426 PPPLPPAPDKVVVKKGYDPKQEKIPASKV--QEHMRIGLL 463
           P       D        D    +  +      E   + LL
Sbjct: 606 P------QDPTGSIGNVDSDILRSNSKPSAPLEETYMHLL 639



 Score = 30.7 bits (69), Expect = 4.4
 Identities = 31/153 (20%), Positives = 46/153 (30%), Gaps = 17/153 (11%)

Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARV-----LMQERMDNGEEDLEED 675
            +R   P PP     P     +  N       DE    +       QE     EED  ED
Sbjct: 493 RERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPED 552

Query: 676 EEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP 735
           +  +++      E DD      +  S  E+  D+    PP  + + E        P+  P
Sbjct: 553 QTDDDDSSLPPLESDD------DPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNP 606

Query: 736 PPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
                  D        D    R      +P +E
Sbjct: 607 ------QDPTGSIGNVDSDILRSNSKPSAPLEE 633


>gnl|CDD|220462 pfam09899, DUF2126, Putative amidoligase enzyme (DUF2126).  Members
           of this family of bacterial domains are predominantly
           found in transglutaminase and transglutaminase-like
           proteins. Their exact function is, as yet, unknown, but
           they are likely to act as amidoligase enzymes Protein in
           this family are found in conserved gene neighborhoods
           encoding a glutamine amidotransferase-like thiol
           peptidase (in proteobacteria) or an Aig2 family
           cyclotransferase protein (in firmicutes).
          Length = 817

 Score = 32.6 bits (75), Expect = 1.3
 Identities = 19/88 (21%), Positives = 25/88 (28%), Gaps = 21/88 (23%)

Query: 726 PLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQ 785
           PL S P + P   P    +       DP   RPP+                   PA    
Sbjct: 260 PLDSLPWVPPADYPYIHPR-------DPFAPRPPLPPAQREQA-------AASAPAGAAG 305

Query: 786 EHMRIGLLDPRW-------VEQRDKHLN 806
           + +R  +            VE RD  L 
Sbjct: 306 QDVREQVPHAEGVVRTALCVEPRDGRLY 333



 Score = 29.9 bits (68), Expect = 7.8
 Identities = 15/64 (23%), Positives = 18/64 (28%), Gaps = 5/64 (7%)

Query: 699 DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARP 758
           DS       D P   P        P PPLP     Q     PA      +   +      
Sbjct: 262 DSLPWVPPADYPYIHPRDPF---APRPPLPPAQREQAAASAPA--GAAGQDVREQVPHAE 316

Query: 759 PVVK 762
            VV+
Sbjct: 317 GVVR 320


>gnl|CDD|185411 PTZ00044, PTZ00044, ubiquitin; Provisional.
          Length = 76

 Score = 29.4 bits (66), Expect = 1.4
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 847 DSNSLAYYNITPGTKVHLQLKERGG 871
           D   L+ Y + PG+ +H+ L+ RGG
Sbjct: 52  DDLKLSDYKVVPGSTIHMVLQLRGG 76


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 32.4 bits (73), Expect = 1.4
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 335 RGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKE 377
           R  +   G  R + E ++++  E ++SE  S+NENE   K +E
Sbjct: 549 REDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591


>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
           Reviewed.
          Length = 275

 Score = 31.8 bits (73), Expect = 1.4
 Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 16/54 (29%)

Query: 390 DSSSEEEQEDIPPPPPPP-----------SLKKVEPAPPLPSQPSIQPPPLPPA 432
            S   E+Q      PPPP           S    EP  P        PPPLPPA
Sbjct: 219 GSPECEDQ-----APPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267



 Score = 31.8 bits (73), Expect = 1.4
 Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 16/54 (29%)

Query: 699 DSSSEEEQEDIPPPPPPP-----------SLKKVEPAPPLPSQPSIQPPPLPPA 741
            S   E+Q      PPPP           S    EP  P        PPPLPPA
Sbjct: 219 GSPECEDQ-----APPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 32.3 bits (73), Expect = 1.5
 Identities = 17/74 (22%), Positives = 24/74 (32%)

Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 429
           E A +   A      +   + ++        PP P P   K   PA   P       P +
Sbjct: 209 EPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYV 268

Query: 430 PPAPDKVVVKKGYD 443
            P   K+   KG D
Sbjct: 269 TPLVRKLAKDKGVD 282



 Score = 29.6 bits (66), Expect = 9.4
 Identities = 14/43 (32%), Positives = 16/43 (37%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYD 752
           PP P P   K   PA   P       P + P   K+   KG D
Sbjct: 240 PPAPAPAPAKTAAPAAAAPVSSGDSGPYVTPLVRKLAKDKGVD 282


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 32.1 bits (73), Expect = 1.5
 Identities = 23/118 (19%), Positives = 36/118 (30%), Gaps = 1/118 (0%)

Query: 649 PTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
             +  EV    LM+               + +    R  E    +   +E SS + E+  
Sbjct: 195 TNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSS 254

Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSI-QPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
            P P      K  E APP+ +     + P   P   +    +   P    P     SP
Sbjct: 255 APKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEASARKRSPSLLSPSPKAESP 312



 Score = 29.4 bits (66), Expect = 8.4
 Identities = 11/72 (15%), Positives = 24/72 (33%), Gaps = 3/72 (4%)

Query: 368 ENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 427
            + ++ K+K  K   +   M+ + SS  +  +   P  P   +K  P      +      
Sbjct: 229 SSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSP---EKAPPIDTTEEELKSPEA 285

Query: 428 PLPPAPDKVVVK 439
               + +    K
Sbjct: 286 SPKESEEASARK 297


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members of
           this family confer resistance to the metalloid element
           tellurium and its salts.
          Length = 98

 Score = 30.1 bits (68), Expect = 1.5
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 413 EPAPPLPSQPSIQPPPLPPAPDKV 436
             AP  P  P+   P  PPA   V
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPPV 24



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 722 EPAPPLPSQPSIQPPPLPPAPDKV 745
             AP  P  P+   P  PPA   V
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPPV 24


>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
           outer membrane].
          Length = 352

 Score = 31.8 bits (72), Expect = 1.6
 Identities = 12/58 (20%), Positives = 19/58 (32%), Gaps = 6/58 (10%)

Query: 689 DDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
           D +  I +E D++ ++      P            A P  + P    P   P   KV 
Sbjct: 168 DHEYTIRLEFDTAIDQAANAATPNDDKN------VAQPGTTTPKTPTPQTTPNKPKVE 219



 Score = 29.5 bits (66), Expect = 8.4
 Identities = 9/48 (18%), Positives = 14/48 (29%), Gaps = 6/48 (12%)

Query: 390 DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 437
           D++ ++      P            A P  + P    P   P   KV 
Sbjct: 178 DTAIDQAANAATPNDDKN------VAQPGTTTPKTPTPQTTPNKPKVE 219


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 32.3 bits (73), Expect = 1.6
 Identities = 29/111 (26%), Positives = 40/111 (36%), Gaps = 21/111 (18%)

Query: 638 MNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDME 697
           M+K +PG   P    +EV +   +QE+   G                    D +  +  +
Sbjct: 1   MDKDRPGLPAPDDNIEEVPSTSGVQEQASGG--------------------DWENVLIED 40

Query: 698 EDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVK 748
            DSSSEEE ED    PP    K+             QPP  PP  D    +
Sbjct: 41  SDSSSEEEAEDAHLEPPQKGKKRKRVDDDAGGSAPAQPPC-PPQLDLPGRE 90



 Score = 32.0 bits (72), Expect = 2.0
 Identities = 23/78 (29%), Positives = 30/78 (38%), Gaps = 3/78 (3%)

Query: 362 EEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQ 421
           EE       + + S    +N  ++D   DSSSEEE ED    PP    K+          
Sbjct: 16  EEVPSTSGVQEQASGGDWENVLIED--SDSSSEEEAEDAHLEPPQKGKKRKRVDDDAGGS 73

Query: 422 PSIQPPPLPPAPDKVVVK 439
              QPP  PP  D    +
Sbjct: 74  APAQPPC-PPQLDLPGRE 90


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 26/124 (20%), Positives = 41/124 (33%), Gaps = 10/124 (8%)

Query: 669 EEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEP----- 723
           EE+ EEDE + +        ++D +   +EDS  +EE  D  P P   S    +P     
Sbjct: 185 EEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILK 244

Query: 724 -----APPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEK 778
                    PS    +         +  +      K+A P     SP+     S      
Sbjct: 245 KSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQTA 304

Query: 779 IPAS 782
             +S
Sbjct: 305 GNSS 308



 Score = 30.9 bits (70), Expect = 2.9
 Identities = 30/135 (22%), Positives = 54/135 (40%), Gaps = 8/135 (5%)

Query: 281 EVGA-RVLMQERMDNGEEDLEEDEEMEEEYGQR------RPEDDDMQIGSDEENDYYNAR 333
           E GA R    +++D  +E+  E E  EE  G        R  D   ++   E  +     
Sbjct: 101 ESGATRNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKE- 159

Query: 334 MRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSS 393
           ++ R A+     M E +   + +E    EEE +   +      E +++ +  D E+    
Sbjct: 160 LQSRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDD 219

Query: 394 EEEQEDIPPPPPPPS 408
           EE+ ++ P P    S
Sbjct: 220 EEDNDNTPSPKSGSS 234


>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
          Length = 248

 Score = 31.5 bits (71), Expect = 1.7
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLP 419
           E     PPPPPPP   + EPAPP+ 
Sbjct: 113 ERAAVSPPPPPPPPPARAEPAPPVA 137



 Score = 31.5 bits (71), Expect = 1.7
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLP 728
           E     PPPPPPP   + EPAPP+ 
Sbjct: 113 ERAAVSPPPPPPPPPARAEPAPPVA 137


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 31.7 bits (72), Expect = 1.7
 Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 12/70 (17%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPP--------LPPAPDKVVVKKG 750
            + + E++ I    PP ++++  P  PPLP  P + PP         LP    +    + 
Sbjct: 44  IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQ- 102

Query: 751 YDPKQARPPV 760
             P++ R   
Sbjct: 103 --PRRLRAAE 110



 Score = 30.9 bits (70), Expect = 2.8
 Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 9/57 (15%)

Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPP--------LPPAPDKVVV 438
            + + E++ I    PP ++++  P  PPLP  P + PP         LP    +   
Sbjct: 44  IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAK 100


>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
            This family of eukaryotic proteins includes
           Saccharomyces cerevisiae Ydl063c and Chaetomium
           thermophilum Syo1, which mediate the co-import of two
           ribosomal proteins, Rpl5 and Rpl11 (which both interact
           with 5S rRNA) into the nucleus. Import precedes their
           association with rRNA and subsequent ribosome assembly
           in the nucleolus. The primary structure of syo1 is a
           mixture of Armadillo- (ARM, N-terminal part of syo1) and
           HEAT-repeats (C-terminal part of syo1).
          Length = 597

 Score = 31.9 bits (73), Expect = 1.8
 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 11/75 (14%)

Query: 669 EEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP-----PPPSLKKVEP 723
           E D ++DE ME+E      + D+ + + E+D +SE    ++ P           L+ + P
Sbjct: 304 ELDKKDDEAMEDE------QSDEDEEEDEDDEASETLVPELSPLEISLDLLTALLELIAP 357

Query: 724 APPLPSQPSIQPPPL 738
            P             
Sbjct: 358 TPESVEPILPSSTIE 372


>gnl|CDD|237928 PRK15220, PRK15220, fimbrial chaperone protein; Provisional.
          Length = 178

 Score = 30.9 bits (70), Expect = 1.9
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 4/42 (9%)

Query: 33 IVDKTANFVARNGPEFESRIRQNEIGNPKFNFLSPGDPYHAY 74
          +V  T  F+  N       I  N++G  +   L+PG  Y  Y
Sbjct: 36 VVGTTCQFLGTNSAT----IELNQVGVDRLANLNPGQIYTGY 73


>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
           Provisional.
          Length = 432

 Score = 31.6 bits (72), Expect = 1.9
 Identities = 12/49 (24%), Positives = 17/49 (34%), Gaps = 1/49 (2%)

Query: 694 IDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           + M  D    +  E +  PP   +      A   PS   +  PP  PA 
Sbjct: 1   MLMLVDDFIADFDESLLGPPESETAPAP-YATSAPSSAQLALPPPRPAS 48



 Score = 31.6 bits (72), Expect = 2.3
 Identities = 12/48 (25%), Positives = 16/48 (33%), Gaps = 1/48 (2%)

Query: 386 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
            M  D    +  E +  PP   +      A   PS   +  PP  PA 
Sbjct: 2   LMLVDDFIADFDESLLGPPESETAPAP-YATSAPSSAQLALPPPRPAS 48



 Score = 30.8 bits (70), Expect = 3.3
 Identities = 13/51 (25%), Positives = 18/51 (35%), Gaps = 2/51 (3%)

Query: 685 RRPEDDDMQIDMEED--SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSI 733
                DD   D +E      E E    P     PS  ++   PP P+ P+ 
Sbjct: 1   MLMLVDDFIADFDESLLGPPESETAPAPYATSAPSSAQLALPPPRPASPAA 51


>gnl|CDD|188533 TIGR04018, Bthiol_YpdA, putative bacillithiol system
           oxidoreductase, YpdA family.  Members of this protein
           family, including YpdA from Bacillus subtilis, are
           apparent oxidoreductases present only in species with an
           active bacillithiol system. They have been suggested
           actually to be thiol disulfide oxidoreductases (TDOR),
           although the evidence is incomplete [Unknown function,
           Enzymes of unknown specificity].
          Length = 316

 Score = 31.4 bits (72), Expect = 2.0
 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 14/42 (33%)

Query: 46  PEFESRIRQNEIGNPKFNFLSPGDPYHAYYQHRVKDIREGKV 87
           P+ E+RI++  I               AY+  RVK+I E  V
Sbjct: 197 PDIENRIKEGSI--------------KAYFNSRVKEITEDSV 224


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 31.2 bits (71), Expect = 2.0
 Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 2/44 (4%)

Query: 356 QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
            + T   EE   E +EA  S++ +D       +ED  SEEE E+
Sbjct: 56  DQETAEIEEVKEEEKEAANSEDKEDKGD--AEKEDEESEEENEE 97



 Score = 29.3 bits (66), Expect = 6.4
 Identities = 9/46 (19%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 358 GTTSEEESDNENEEARKS-KEAKDNTQVQDMEEDSSSEEEQEDIPP 402
            + S++ + +E E  +   +E  +  +V++ E+++++ E++ED   
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGD 83


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.8 bits (73), Expect = 2.0
 Identities = 13/78 (16%), Positives = 20/78 (25%), Gaps = 8/78 (10%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV-VKKGYDPKQARPPVVKPSPTDE 768
           P     P      P PP  +       PLP    +++  ++     Q      K  P   
Sbjct: 387 PTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEP--- 443

Query: 769 FLISPITGEKIPASKVQE 786
                      P +   E
Sbjct: 444 ----AAASRARPVNSALE 457



 Score = 31.0 bits (71), Expect = 3.0
 Identities = 20/152 (13%), Positives = 35/152 (23%), Gaps = 10/152 (6%)

Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
            P      +        P     P     V                +   E   +    R
Sbjct: 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPP---PASAPQQAPAVPLPETTSQLLAAR 426

Query: 686 RPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
                  Q+   + ++  ++ E        P    +E    +   PS      P   +  
Sbjct: 427 ------QQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKA-PAKKEAY 479

Query: 746 VVKKGYDPKQARPPVVKPSPTDEFLISPITGE 777
             K     +  + PV  P    + L    T E
Sbjct: 480 RWKATNPVEVKKEPVATPKALKKALEHEKTPE 511


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 30.0 bits (68), Expect = 2.3
 Identities = 28/102 (27%), Positives = 51/102 (50%), Gaps = 9/102 (8%)

Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMR-GRNANRGRGRMQ 347
            ER++  +E  +++ E  EE+ Q+R E    +   DEE        R  +   + + +  
Sbjct: 49  YERLELMDEKWKKETED-EEFQQKREE----KKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEE 389
           ++ N+ + +EG+ S EES +E EE    +E K    V+ ME+
Sbjct: 104 KKGNKKEEKEGSKSSEESSDEEEE---GEEDKQEEPVEIMEK 142


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 31.6 bits (72), Expect = 2.4
 Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 3/56 (5%)

Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
           P PP  +     PA P    P+  PP  P AP +        P  A  P    +P 
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPA---AARPAPAPAPPAAAAPP 434



 Score = 30.5 bits (69), Expect = 4.8
 Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           +++ +E  + +        P P  P+      APP+P     +P    PAP
Sbjct: 373 RLEALERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAP 423



 Score = 30.1 bits (68), Expect = 6.5
 Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 1/35 (2%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 435
            PP  P       PAP  P+ P+   PP   A   
Sbjct: 408 VPPAAPARPAAARPAPA-PAPPAAAAPPARSADPA 441



 Score = 30.1 bits (68), Expect = 6.5
 Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 1/35 (2%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
            PP  P       PAP  P+ P+   PP   A   
Sbjct: 408 VPPAAPARPAAARPAPA-PAPPAAAAPPARSADPA 441


>gnl|CDD|234048 TIGR02887, spore_ger_x_C, germination protein, Ger(x)C family.
           Members of this protein family are restricted to
           endospore-forming members of the Firmicutes lineage of
           bacteria, including the genera Bacillus, Clostridium,
           Thermoanaerobacter, Carboxydothermus, etc. Members are
           nearly all predicted lipoproteins and belong to probable
           transport operons, some of which have been characterized
           as crucial to germination in response to alanine.
           Members typically have been gene symbols gerKC, gerAC,
           gerYC, etc [Transport and binding proteins, Amino acids,
           peptides and amines, Cellular processes, Sporulation and
           germination].
          Length = 371

 Score = 31.5 bits (72), Expect = 2.4
 Identities = 15/89 (16%), Positives = 38/89 (42%), Gaps = 18/89 (20%)

Query: 468 VEQRDKHLNEKINQETVYAPGQAIEASLKQLAERRTDIFGVGDEETAIGKKIGEEDTRKD 527
           ++ +    + KI+ +        +E  + +  + + ++     +     +K  E++  K+
Sbjct: 261 IKVKGGKPHFKIHIK--------LEGRITESQDPKENLEK--PKNLKKIEKEAEKEIEKE 310

Query: 528 DKLFPGQAIEASLKQLAERRTDIFGVGDE 556
                   IE  +K+L + + D  G+GDE
Sbjct: 311 --------IEQLIKKLQKYKIDPLGLGDE 331


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 31.4 bits (71), Expect = 2.6
 Identities = 16/60 (26%), Positives = 28/60 (46%)

Query: 377 EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
           E +D  + +D +  SS+        P PPPP  ++VE     P   + +P   P + D++
Sbjct: 190 EEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI 249



 Score = 30.2 bits (68), Expect = 5.9
 Identities = 14/51 (27%), Positives = 23/51 (45%)

Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
           D +  SS+        P PPPP  ++VE     P   + +P   P + D++
Sbjct: 199 DYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI 249



 Score = 30.2 bits (68), Expect = 6.4
 Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 688 EDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           E+D  +    + SSS         P PPP  ++    P    +P    P  PP+ 
Sbjct: 191 EEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245



 Score = 29.8 bits (67), Expect = 8.2
 Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 19/78 (24%)

Query: 704 EEQEDI-------PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
           EE+EDI       P     P+  K +P+PP P +  ++ P   P P            +A
Sbjct: 189 EEEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEP------------KA 236

Query: 757 RPPVVKPSPTDEFLISPI 774
             P   PS  D    SP+
Sbjct: 237 SKPSAPPSSGDRIFASPL 254


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 31.4 bits (71), Expect = 2.6
 Identities = 19/117 (16%), Positives = 28/117 (23%), Gaps = 8/117 (6%)

Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
           AP      +              P       A           ++D    EE+  E+   
Sbjct: 457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA-------PADDDPPPWEELPPEFASP 509

Query: 686 RPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
            P   D         S  +      P     +L     A P P   +   P + P P
Sbjct: 510 APAQPDAAPAGWVAESIPDPA-TADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565



 Score = 30.2 bits (68), Expect = 6.5
 Identities = 14/66 (21%), Positives = 18/66 (27%), Gaps = 7/66 (10%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQP-------PPLPPAPDKVVVKKGYDPKQARPPVVK 762
           P    P  +     A P  + P+  P       PP    P +        P  A    V 
Sbjct: 464 PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523

Query: 763 PSPTDE 768
            S  D 
Sbjct: 524 ESIPDP 529


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 31.3 bits (71), Expect = 2.7
 Identities = 14/67 (20%), Positives = 18/67 (26%), Gaps = 9/67 (13%)

Query: 698 EDSSSEEEQEDIPPPPP-PPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
             +S    Q      P  P S  +    PP  S       P+ P            P+  
Sbjct: 396 AAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPP--------STAPQAV 447

Query: 757 RPPVVKP 763
           RP   K 
Sbjct: 448 RPAQFKE 454



 Score = 30.9 bits (70), Expect = 3.8
 Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 6/70 (8%)

Query: 389 EDSSSEEEQEDIPPPPP-PPSLKKVEPAPPLPSQPSIQPPPLPP---APDKVVVKKGYDP 444
             +S    Q      P  P S  +    PP  S       P+ P   AP  V   +  + 
Sbjct: 396 AAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEE 455

Query: 445 KQEKIPASKV 454
           K  KIP SKV
Sbjct: 456 K--KIPVSKV 463


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 31.2 bits (71), Expect = 2.7
 Identities = 11/86 (12%), Positives = 17/86 (19%), Gaps = 2/86 (2%)

Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFL 770
           P     +      AP   +     PP     P  V       P  A              
Sbjct: 393 PVAQAAAAPAPAAAPAAAASAPAAPPA-AAPPAPVAAPAAAAPAAAPAAAPAAVALAPAP 451

Query: 771 ISPITGEKIPASKVQEHMRIGLLDPR 796
            +     +  A  V+      +    
Sbjct: 452 PAQAA-PETVAIPVRVAPEPAVASAA 476



 Score = 30.8 bits (70), Expect = 4.0
 Identities = 15/86 (17%), Positives = 20/86 (23%), Gaps = 4/86 (4%)

Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP-TDEF 769
           P   P +      APP  + P    P   PA           P         P+    E 
Sbjct: 403 PAAAPAAAASAPAAPPAAAPP---APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET 459

Query: 770 LISPITGEKIPASKVQEHMRIGLLDP 795
           +  P+     PA              
Sbjct: 460 VAIPVRVAPEPAVASAAPAPAAAPAA 485



 Score = 30.1 bits (68), Expect = 6.5
 Identities = 12/59 (20%), Positives = 16/59 (27%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
           PP P          A P  +  ++   P PPA                P V   +P   
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPA 480


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 30.9 bits (70), Expect = 2.7
 Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 16/73 (21%)

Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP-------------PLPPAPDKVVVKKGY 751
           E+  +P  P PPS ++     P P +  +                 LPPAP         
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVV-- 195

Query: 752 DPKQARPPVVKPS 764
            P+ +   V + S
Sbjct: 196 -PQPSTMVVHRRS 207



 Score = 30.5 bits (69), Expect = 3.0
 Identities = 13/55 (23%), Positives = 19/55 (34%), Gaps = 13/55 (23%)

Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP-------------PLPPAPDKVV 437
           E+  +P  P PPS ++     P P +  +                 LPPAP    
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPP 192


>gnl|CDD|178744 PLN03205, PLN03205, ATR interacting protein; Provisional.
          Length = 652

 Score = 31.3 bits (70), Expect = 2.9
 Identities = 25/93 (26%), Positives = 37/93 (39%), Gaps = 4/93 (4%)

Query: 688 EDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLP--SQPSIQPPPLPPAPDKV 745
           ED+D  I   +   + E + D+    PPPSL    PAPP+   + PS +       P ++
Sbjct: 7   EDEDDFIKAIDIIEAIEAKADMYRLLPPPSLPTFLPAPPVSEMTTPSTKISSSLSHPMQL 66

Query: 746 VVKKGYDPK--QARPPVVKPSPTDEFLISPITG 776
               G      Q   P +  SP  E     + G
Sbjct: 67  QSSAGQKRDQIQVPDPFISYSPPRELSQRVVGG 99


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 767

 Score = 31.3 bits (71), Expect = 3.0
 Identities = 14/51 (27%), Positives = 19/51 (37%)

Query: 386 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
           D +  + +   Q   PP P P     V PAP   S  +    P P A   +
Sbjct: 45  DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGDL 95



 Score = 31.3 bits (71), Expect = 3.0
 Identities = 14/51 (27%), Positives = 19/51 (37%)

Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
           D +  + +   Q   PP P P     V PAP   S  +    P P A   +
Sbjct: 45  DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGDL 95


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.1 bits (70), Expect = 3.1
 Identities = 15/70 (21%), Positives = 24/70 (34%), Gaps = 16/70 (22%)

Query: 689 DDDMQIDMEEDSSSEEEQEDIPPPPPP----------PSLK-KVEPAP---PLPSQ--PS 732
           D  +          +++  +   P P           P  +    P+P   P P Q  P 
Sbjct: 377 DGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPV 436

Query: 733 IQPPPLPPAP 742
            +  PLPP+P
Sbjct: 437 ARSAPLPPSP 446



 Score = 31.1 bits (70), Expect = 3.1
 Identities = 14/61 (22%), Positives = 22/61 (36%), Gaps = 16/61 (26%)

Query: 389 EDSSSEEEQEDIPPPPPP----------PSLK-KVEPAP---PLPSQ--PSIQPPPLPPA 432
                +++  +   P P           P  +    P+P   P P Q  P  +  PLPP+
Sbjct: 386 GSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLPPS 445

Query: 433 P 433
           P
Sbjct: 446 P 446


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
           protein. 
          Length = 314

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 22/104 (21%), Positives = 34/104 (32%), Gaps = 11/104 (10%)

Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP------PPSLKK 411
           G+  ++ S         S   K   Q+ + E    S     D             PS  +
Sbjct: 64  GSKKKKTSKKSTANLASSSYQKIVIQL-NRENLKKSVPRPSDTTLSCANLSSLREPSPGQ 122

Query: 412 VEPAPP---LPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPAS 452
             PAPP   L  + +   P    AP   +   G  P++  + AS
Sbjct: 123 P-PAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRRVIVQAS 165



 Score = 30.2 bits (68), Expect = 4.4
 Identities = 19/64 (29%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSL--KKVEPAPPLPSQPSIQPPPLPPAPDK 744
           P   D  +     SS  E     PP PPP  L  K     P   + PSI       +P +
Sbjct: 100 PRPSDTTLSCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRR 159

Query: 745 VVVK 748
           V+V+
Sbjct: 160 VIVQ 163


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 31.2 bits (70), Expect = 3.4
 Identities = 24/110 (21%), Positives = 51/110 (46%), Gaps = 4/110 (3%)

Query: 291 RMDNGEEDLEEDE-EMEEEYGQRRPEDDDMQIGSDEE-NDYYNARMR-GRNANRGRGRMQ 347
           ++DN  + L+  + +ME++  +   + + +  G+DE+ ND Y+   R  R   R     Q
Sbjct: 266 KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQ 325

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
            E+ + + +    ++E+++   E+ R   +A D  Q      DS  +   
Sbjct: 326 RELEKLNKERRLLNQEKTELLVEQGRLQLQA-DRHQEHIRARDSLIQSLA 374


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 9/49 (18%), Positives = 20/49 (40%)

Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKI 449
             P P  +  + +  P    +P + P P    P+   V+  +   ++K+
Sbjct: 398 AAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDKV 446



 Score = 29.8 bits (67), Expect = 8.7
 Identities = 23/118 (19%), Positives = 45/118 (38%), Gaps = 3/118 (2%)

Query: 685 RRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
           +R E  + ++DM   ++          P P  +  + +  P    +P + P P    P+ 
Sbjct: 374 QRVERIETRLDMSIPANLLHNAPQ-AAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNA 432

Query: 745 VVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEHMRIGLLDPRWVEQRD 802
             V+  +     R  V + S T E +++  T   +  + +        L  R  EQR+
Sbjct: 433 AAVRSMW--STVRDKVRQRSRTTEVMLAGATVRALEGNTLVLTHESAPLARRLSEQRN 488


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 30.6 bits (69), Expect = 3.6
 Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 5/91 (5%)

Query: 701 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
               ++E+ P PP        +P  P P    I  PP PPA  + V        Q+ P  
Sbjct: 50  GGRSKKENEPAPPSTMIATNTKPFHPAP----IDVPPDPPAAQEAVQPTAPPSAQSEPER 105

Query: 761 VKPSPTDEFLISPITGEKIPASKVQEHMRIG 791
            +P P +  + +  +G++  + +   H    
Sbjct: 106 NEPRPEETPIFAYSSGDQGGSKRAG-HGDTD 135


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 30.7 bits (69), Expect = 3.8
 Identities = 39/186 (20%), Positives = 62/186 (33%), Gaps = 34/186 (18%)

Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEE 349
           +  D  + D  +DE  EE+Y       D    G+D E              R      E 
Sbjct: 267 DDADEYDSDDGDDEGREEDY-----ISDSSASGNDPEE-------------REDKLSPEI 308

Query: 350 IN-EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQ----DMEEDSSSEEEQED----- 399
               +  Q+  + E E +   EE   SK+ K   +++     +++D S   +  D     
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDID 368

Query: 400 ------IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASK 453
                 +         KK EP    PS P    P  P    K   K+    +  K PAS 
Sbjct: 369 GEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASV 428

Query: 454 VQEHMR 459
             + ++
Sbjct: 429 PAKKLK 434


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 31.0 bits (70), Expect = 3.8
 Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 18/106 (16%)

Query: 298 DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQE 357
           D+    +M EE G     DD ++  +DEE+   +      ++  GR        E+    
Sbjct: 211 DMLGSMDMAEETG-----DDGIEEDADEEDGDDDQPDNNEDSEAGR--------EESEGS 257

Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPP 403
             + E+E++  + E  + +         +  EDS S+E  ED   P
Sbjct: 258 DESEEDEAEATDGEGEEGEM-----DAAEASEDSESDESDEDTETP 298


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.8 bits (69), Expect = 3.8
 Identities = 15/56 (26%), Positives = 22/56 (39%)

Query: 349 EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
           E N+D     T SEE+ D+  EE  KS  +KD    +  +   S+           
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDL 159


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 30.4 bits (69), Expect = 4.2
 Identities = 16/82 (19%), Positives = 24/82 (29%), Gaps = 3/82 (3%)

Query: 690 DDMQIDMEEDSSSEEE---QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
           + MQ DM +  +   E    E  P  P      + +     P   + QP P+ P      
Sbjct: 89  EQMQADMRQQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTA 148

Query: 747 VKKGYDPKQARPPVVKPSPTDE 768
             +   P   R     P     
Sbjct: 149 PVQPQTPAPVRTQPAAPVTQAV 170


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.5 bits (69), Expect = 4.2
 Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 4/42 (9%)

Query: 394 EEEQEDIPPPPPPPSLKKVEPAPPLPSQ----PSIQPPPLPP 431
            E +   PPPPPPP   + E  P  P Q       +   +P 
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308



 Score = 30.5 bits (69), Expect = 4.2
 Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 4/42 (9%)

Query: 703 EEEQEDIPPPPPPPSLKKVEPAPPLPSQ----PSIQPPPLPP 740
            E +   PPPPPPP   + E  P  P Q       +   +P 
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 30.7 bits (69), Expect = 4.2
 Identities = 25/127 (19%), Positives = 37/127 (29%), Gaps = 16/127 (12%)

Query: 651 TPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIP 710
            P  +   V +  R     EDL+  E     Y          ++  E DS SE E E   
Sbjct: 21  IPSSLEDEVFLDPRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGR 80

Query: 711 ---------PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVV 761
                       P    K  +  P      ++      PA        G  PK + PP +
Sbjct: 81  RAGMGGRNKATKPSRRNKTTQCRPT---SLALATAATMPATPSS----GKSPKVSSPPSI 133

Query: 762 KPSPTDE 768
                ++
Sbjct: 134 PSLSEED 140


>gnl|CDD|152901 pfam12467, CMV_1a, Cucumber mosaic virus 1a protein.  This domain
           family is found in viruses, and is typically between 156
           and 171 amino acids in length. The family is found in
           association with pfam01443, pfam01660. 1a protein is the
           major virulence factor of the cucumber mosaic virus
           (CMV). The Ns strain of CMV causes necrotic lesions to
           Nicotiana spp. while other strains cause systemic
           mosaic. The determinant of the pathogenesis of these
           different strains is the specific amino acid residue at
           the 461 residue of the 1a protein.
          Length = 175

 Score = 29.8 bits (67), Expect = 4.3
 Identities = 15/81 (18%), Positives = 23/81 (28%), Gaps = 2/81 (2%)

Query: 351 NEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLK 410
           +  D+ +      E        +K K  +   Q    +  S   E+  D P       +K
Sbjct: 60  SAADVLKRAAWAVEDGKTLRAEKKKKLEEALQQPVQEDSVSEEFEDAPDAPSESVRDDVK 119

Query: 411 KVEPAPPLPSQPSIQPPPLPP 431
              P       P   PP L  
Sbjct: 120 PENPVVGQTQAPE--PPELKS 138


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 30.3 bits (68), Expect = 4.3
 Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 2/58 (3%)

Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP--KQARPPVVKPSP 765
            PPP  P  ++ +  P  P      PP    AP    +     P   Q+ PP      
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPS 199


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 30.5 bits (69), Expect = 4.3
 Identities = 14/68 (20%), Positives = 22/68 (32%)

Query: 714 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISP 773
            P  + + +P  P     +++ P  P          G  P    P    P+P    LI+ 
Sbjct: 673 RPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIAS 732

Query: 774 ITGEKIPA 781
            T    P 
Sbjct: 733 PTAPPEPE 740


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 30.6 bits (69), Expect = 4.4
 Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 2/55 (3%)

Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 442
           EED     ++  +  PP      +V P P      S    P P  P K +   GY
Sbjct: 167 EEDVEISSDRHALIVPPSTGHGNRVHPMPF--PDSSASLQPRPMVPQKDLAVYGY 219



 Score = 30.6 bits (69), Expect = 4.4
 Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 2/55 (3%)

Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 751
           EED     ++  +  PP      +V P P      S    P P  P K +   GY
Sbjct: 167 EEDVEISSDRHALIVPPSTGHGNRVHPMPF--PDSSASLQPRPMVPQKDLAVYGY 219


>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase.
          Length = 1085

 Score = 30.7 bits (69), Expect = 4.5
 Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 8/108 (7%)

Query: 340 NRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
           ++G  R++ + +EDD+       ++ +NE   A+ + +A+   Q +D+E  SSS  E + 
Sbjct: 91  HKGSPRVEGDEDEDDV-------DDLENEFNYAQGNGKARHQWQGEDIELSSSSRHESQP 143

Query: 400 IPP-PPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
           IP      P   ++  A P          PL PA          DP+Q
Sbjct: 144 IPLLTHGQPVSGEIPCATPDNQSVRTTSGPLGPAERNANSSPYIDPRQ 191


>gnl|CDD|237538 PRK13876, PRK13876, conjugal transfer coupling protein TraG;
           Provisional.
          Length = 663

 Score = 30.7 bits (70), Expect = 4.6
 Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 405 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
           PPP L      P  P   S  P P  PAP 
Sbjct: 555 PPPDLAPPAGPPARPDDWSGLPIPAVPAPA 584



 Score = 30.7 bits (70), Expect = 4.6
 Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 714 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
           PPP L      P  P   S  P P  PAP 
Sbjct: 555 PPPDLAPPAGPPARPDDWSGLPIPAVPAPA 584


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 30.1 bits (68), Expect = 4.6
 Identities = 33/147 (22%), Positives = 50/147 (34%), Gaps = 23/147 (15%)

Query: 609 GLLPDEEKEKIGPKRSHAP---NPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERM 665
             +  +E+E     RS      +   P  + PM   +      P+ P+       +Q   
Sbjct: 60  AWVAADEEESYARFRSDTRPNRHEWKPSPLLPMKLSRQ-----PSVPNLRQEEPRLQSLK 114

Query: 666 DNGEEDLEEDEEMEEEYGQRRPEDDDMQI----DMEEDSSS----EEEQEDIPPPPPPPS 717
              EE L++   +E+E    R      QI      +E S+S        ++  P     S
Sbjct: 115 PPNEEALQKISALEDELAFLRA-----QIAKIVAAQEQSNSTTSDLLSSDESVPSSSTTS 169

Query: 718 LKKVEP--APPLPSQPSIQPPPLPPAP 742
                P   P L   P   PPP PP P
Sbjct: 170 FPISPPTEEPVLEVPPPPPPPPPPPPP 196



 Score = 29.3 bits (66), Expect = 7.4
 Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 393 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 429
           +EE   ++PPPPPPP      P PP   Q S     L
Sbjct: 176 TEEPVLEVPPPPPPPP-----PPPPPSLQQSTSAIDL 207


>gnl|CDD|182338 PRK10255, PRK10255, PTS system N-acetyl glucosamine specific
           transporter subunits IIABC; Provisional.
          Length = 648

 Score = 30.6 bits (69), Expect = 4.7
 Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 13/67 (19%)

Query: 718 LKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGE 777
           +KKV    P+ +  S +  P   AP             A+P  V  + +   L+SPITG+
Sbjct: 462 MKKVVARGPVAAA-SAEATPATAAP------------VAKPQAVPNAVSIAELVSPITGD 508

Query: 778 KIPASKV 784
            +   +V
Sbjct: 509 VVALDQV 515


>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family. 
          Length = 850

 Score = 30.5 bits (69), Expect = 4.8
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 422
           E  + SE E+ED P  P PP   + EP P  P  P
Sbjct: 620 ELPTDSEFEREDEPADPRPPPPVETEPPPRPPPPP 654



 Score = 30.5 bits (69), Expect = 4.8
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
           E  + SE E+ED P  P PP   + EP P  P  P
Sbjct: 620 ELPTDSEFEREDEPADPRPPPPVETEPPPRPPPPP 654


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 30.0 bits (68), Expect = 4.9
 Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 12/76 (15%)

Query: 703 EEEQEDIPPPPPPPSLKKVEPAP-PLPSQPSIQPPPLP---------PAPDKVVVKKGYD 752
            + +      PPP  + + EP P P+P  P   P  +P         P P    VKK   
Sbjct: 56  ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKK--V 113

Query: 753 PKQARPPVVKPSPTDE 768
            +Q +  V    P   
Sbjct: 114 EEQPKREVKPVEPRPA 129



 Score = 29.7 bits (67), Expect = 5.7
 Identities = 20/67 (29%), Positives = 26/67 (38%), Gaps = 2/67 (2%)

Query: 693 QIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYD 752
           Q   E     E E E IP PP    +  V P P    +P  +P P P    +   K+   
Sbjct: 65  QPPPEPVVEPEPEPEPIPEPPKEAPV--VIPKPEPKPKPKPKPKPKPVKKVEEQPKREVK 122

Query: 753 PKQARPP 759
           P + RP 
Sbjct: 123 PVEPRPA 129


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 30.5 bits (69), Expect = 5.1
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 663 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
           ER D  EED EE+EE +E+ G  +   DD + + E+D  S+ E  D
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFE-EDDVESKYEDSD 431


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 30.4 bits (68), Expect = 5.2
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 5/53 (9%)

Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
           +EE++E +      S+E  ++E     +S E+ ++    + E DSS EE+ ED
Sbjct: 935 EEEVSEYEASSDDESDETDEDE-----ESDESSEDLSEDESENDSSDEEDGED 982


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 30.4 bits (68), Expect = 5.2
 Identities = 8/38 (21%), Positives = 13/38 (34%)

Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           +Q  +  P   P  +  +       QP+ Q P   P  
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQ 251



 Score = 30.4 bits (68), Expect = 5.2
 Identities = 8/38 (21%), Positives = 13/38 (34%)

Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           +Q  +  P   P  +  +       QP+ Q P   P  
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQ 251



 Score = 30.0 bits (67), Expect = 6.4
 Identities = 15/68 (22%), Positives = 24/68 (35%)

Query: 374 KSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           + + A   TQ   ++  +     Q     P   P  + V+PA     Q   Q PP     
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQG 255

Query: 434 DKVVVKKG 441
            K   ++G
Sbjct: 256 HKRSREQG 263


>gnl|CDD|237410 PRK13524, PRK13524, outer membrane receptor FepA; Provisional.
          Length = 744

 Score = 30.4 bits (69), Expect = 5.2
 Identities = 19/51 (37%), Positives = 23/51 (45%), Gaps = 14/51 (27%)

Query: 501 RRTDIFGVGDEETAI---GKKI-----------GEEDTRKDDKLFPGQAIE 537
           R+ DI G+G E T I   GK +           GE DTR D    P + IE
Sbjct: 97  RQIDIRGMGPENTLILIDGKPVSSRNSVRYGWRGERDTRGDTNWVPPEMIE 147


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 30.1 bits (68), Expect = 5.3
 Identities = 20/74 (27%), Positives = 29/74 (39%), Gaps = 4/74 (5%)

Query: 300 EEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGT 359
            ED ++ E   QR  E+   +    E  +   A    R     R R    + ED+ ++  
Sbjct: 380 AEDAKVAELISQREEEEALQR----EAEERLEAEQAERAEEDARLRELYPLPEDEFEDED 435

Query: 360 TSEEESDNENEEAR 373
             EE    E EEAR
Sbjct: 436 ELEEAQPEEEEEAR 449



 Score = 29.8 bits (67), Expect = 8.6
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
           ++ E I++ + +E    E E   E E+A +++E     ++  + ED   +E++ +   P 
Sbjct: 384 KVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPE 443

Query: 405 P 405
            
Sbjct: 444 E 444


>gnl|CDD|233677 TIGR01995, PTS-II-ABC-beta, PTS system, beta-glucoside-specific
           IIABC component.  This model represents a family of PTS
           enzyme II proteins in which all three domains are found
           in the same polypeptide chain and which appear to have a
           broad specificity for beta-glucosides including salicin
           (beta-D-glucose-1-salicylate) and arbutin
           (Hydroquinone-O-beta-D-glucopyranoside). These are
           distinct from the closely related sucrose-specific and
           trehalose-specific PTS transporters.
          Length = 610

 Score = 30.4 bits (69), Expect = 5.3
 Identities = 9/36 (25%), Positives = 16/36 (44%)

Query: 752 DPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEH 787
           D +            DE L +P+ GE +P ++V + 
Sbjct: 447 DQEDVVTQEAAALSKDESLYAPVAGEMLPLNEVPDE 482


>gnl|CDD|236122 PRK07877, PRK07877, hypothetical protein; Provisional.
          Length = 722

 Score = 30.3 bits (69), Expect = 5.4
 Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 4/53 (7%)

Query: 694 IDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
           +D++E     ++    PPPPP  +  + E APP P            A + V 
Sbjct: 333 VDLDELLDRLDD----PPPPPSGAGWEAESAPPAPPADRAAEALPQDALEIVA 381


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 29.2 bits (66), Expect = 5.6
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 12/55 (21%)

Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPP 716
           Q  + + E++ ++DEE EEE      EDD      EED   +E +E+  P     
Sbjct: 104 QHLVASEEDESDDDEEDEEE------EDD------EEDDDEDESEEEESPVKKVK 146


>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
          Length = 955

 Score = 30.5 bits (68), Expect = 5.7
 Identities = 24/76 (31%), Positives = 30/76 (39%), Gaps = 15/76 (19%)

Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPS-IQPPPLPPA-PDKVVVKKGYDPKQAR 757
           +S  E  E I P P       V+P  P P  P  + P P+ P  PD V+      P    
Sbjct: 555 TSYIEPDEPIIPDP-------VDPVIPDPVIPDPVDPDPVDPVIPDPVI------PDPVD 601

Query: 758 PPVVKPSPTDEFLISP 773
           P  V P P D  +  P
Sbjct: 602 PDPVDPEPVDPVIPDP 617


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 28.5 bits (64), Expect = 5.8
 Identities = 17/59 (28%), Positives = 23/59 (38%)

Query: 708 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
           D PPP   P    V+  PP+ +   + PP  P AP          P     P+  P+P 
Sbjct: 38  DDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPA 96



 Score = 28.5 bits (64), Expect = 7.1
 Identities = 14/36 (38%), Positives = 15/36 (41%)

Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
           E    PP P PP      P  PL      + PPLPP
Sbjct: 68  EPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPP 103



 Score = 28.5 bits (64), Expect = 7.1
 Identities = 14/36 (38%), Positives = 15/36 (41%)

Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
           E    PP P PP      P  PL      + PPLPP
Sbjct: 68  EPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPP 103


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 30.4 bits (68), Expect = 5.9
 Identities = 21/137 (15%), Positives = 36/137 (26%), Gaps = 9/137 (6%)

Query: 624 SHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYG 683
            +       +     N+P      P        A    Q+             E   +  
Sbjct: 379 GYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQ------PYYAPAPEQPAQQP 432

Query: 684 QRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
              P     +  +  ++   EEQ+    P      ++    P        QP P+   P 
Sbjct: 433 YYAPAP---EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPV 489

Query: 744 KVVVKKGYDPKQARPPV 760
                   + K ARPP+
Sbjct: 490 VEPEPVVEETKPARPPL 506



 Score = 30.4 bits (68), Expect = 5.9
 Identities = 22/105 (20%), Positives = 40/105 (38%), Gaps = 1/105 (0%)

Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKK 720
           M+  +D+G  +      +E     ++P     Q    +   + + Q   P  P  P  + 
Sbjct: 732 MKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQY 791

Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
            +P  P+  QP  Q P  P AP +   ++   P   +P   +P  
Sbjct: 792 QQPQQPVAPQPQYQQPQQPVAP-QPQYQQPQQPVAPQPQYQQPQQ 835


>gnl|CDD|226435 COG3921, COG3921, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 300

 Score = 29.8 bits (67), Expect = 6.3
 Identities = 18/100 (18%), Positives = 29/100 (29%), Gaps = 10/100 (10%)

Query: 677 EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 736
            + E+      E    Q + ++  + E   +  P   P  +   V P    P +P   P 
Sbjct: 19  TVPEQDVMPGAEPVSGQANEQKRIAEEAHPQ--PVARPSSTDDPVTPTEGKPVRPKGLPI 76

Query: 737 PLPPAP--------DKVVVKKGYDPKQARPPVVKPSPTDE 768
                P        D        DP     P   P+  +E
Sbjct: 77  LALAGPVGELGQPMDLPAPANPGDPLALPEPPSPPTKPEE 116


>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
           Members of this protein family are FHA
           (forkhead-associated) domain-containing proteins that
           are part of type VI secretion loci in a considerable
           number of bacteria, most of which are known pathogens.
           Species include Pseudomonas aeruginosa PAO1, Aeromonas
           hydrophila, Yersinia pestis, Burkholderia mallei, etc
           [Protein fate, Protein and peptide secretion and
           trafficking, Cellular processes, Pathogenesis].
          Length = 396

 Score = 30.0 bits (68), Expect = 6.3
 Identities = 11/51 (21%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 384 VQDMEEDSSSEEEQEDIPPP-PPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           ++ ++++  S  + +D+  P  PP   +    A P+ ++P++ PP +P   
Sbjct: 141 LKALDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPA 191



 Score = 29.6 bits (67), Expect = 7.0
 Identities = 11/48 (22%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 696 MEEDSSSEEEQEDIPPP-PPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           ++++  S  + +D+  P  PP   +    A P+ ++P++ PP +P   
Sbjct: 144 LDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPA 191


>gnl|CDD|227464 COG5135, COG5135, Uncharacterized conserved protein [Function
           unknown].
          Length = 245

 Score = 29.6 bits (66), Expect = 6.5
 Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 1/68 (1%)

Query: 361 SEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 420
           SE   D  +EE R      D+   +++E D   E+E +  PPP      K +    P P 
Sbjct: 125 SETNDDLTDEEIRTPINWDDDDDKRNIENDEHHEDEADFYPPPQLSRHQKSLYRK-PAPG 183

Query: 421 QPSIQPPP 428
           Q       
Sbjct: 184 QKLTSETS 191


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 30.2 bits (68), Expect = 6.7
 Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 1/83 (1%)

Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG-RGRMQ 347
           ++  D  E D +EDE    E  +   E D      DE+ D   A       ++  R  + 
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALP 709

Query: 348 EEINEDDMQEGTTSEEESDNENE 370
           + +N  D  +G  SE+E   ++E
Sbjct: 710 KVLNLPDALDGGDSEDEEGMDDE 732



 Score = 29.5 bits (66), Expect = 9.9
 Identities = 12/48 (25%), Positives = 24/48 (50%)

Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
           + E  +  E+DLEE ++ E+E       + + + D E+    E+E + 
Sbjct: 644 LFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 30.2 bits (68), Expect = 6.9
 Identities = 18/63 (28%), Positives = 26/63 (41%)

Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
           +E   N + + EEDEE  E   +    D D      ED   +EE ++    P    + K 
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 722 EPA 724
           E A
Sbjct: 61  EKA 63


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.3 bits (66), Expect = 7.5
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 5/43 (11%)

Query: 287 LMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDY 329
           L  E +D  +E  EE+EE EEE  +   +DDD     D+++DY
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD-----DDDDDY 200


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 29.8 bits (67), Expect = 7.5
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 359 TTSEEESDNENEEA-RKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSL 409
           T S   S  E     RKS E+  ++  +     + SEEE+E    P P  S+
Sbjct: 72  TESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPDPIASI 123


>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 165 and 237 amino acids in length.
           CD99L2 and CD99 are involved in trans-endothelial
           migration of neutrophils in vitro and in the recruitment
           of neutrophils into inflamed peritoneum.
          Length = 154

 Score = 29.0 bits (65), Expect = 7.5
 Identities = 12/64 (18%), Positives = 16/64 (25%), Gaps = 7/64 (10%)

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
           ++   + P   P + KK    P            L    D     K   PK    P    
Sbjct: 8   DDDPAMKPTKKPATPKK----PSKGDDDFDLEDALDGGNDGGAGPK---PKPRPQPDPGN 60

Query: 764 SPTD 767
              D
Sbjct: 61  PGGD 64


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 29.6 bits (66), Expect = 7.6
 Identities = 25/124 (20%), Positives = 48/124 (38%), Gaps = 11/124 (8%)

Query: 291 RMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEI 350
           + +   +  + +E  +EE      E++  + GS EEN+      + ++      R   E 
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEG 193

Query: 351 NEDDM-----------QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
            + +            QE     EE   + EE RK  E ++  + Q+  +  S EEE++ 
Sbjct: 194 AQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKR 253

Query: 400 IPPP 403
               
Sbjct: 254 RLKE 257


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.6 bits (66), Expect = 8.1
 Identities = 34/150 (22%), Positives = 44/150 (29%), Gaps = 37/150 (24%)

Query: 274 PPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNAR 333
           P P  P   G +V  +   + G    E D E  EE   R  E ++   G           
Sbjct: 46  PAPPAPTTSGPQV--RAVAEQGHRQTESDTETAEE--SRHGEKEERGQGGPS-------- 93

Query: 334 MRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSS 393
                     G   E +         ++EE +   + E                   SS 
Sbjct: 94  ----------GSGSESVGSPTPSPSGSAEELASGLSPEN---------------TSGSSP 128

Query: 394 EEEQEDIPPPPPPPSLKKVEPAPPLPSQPS 423
           E      PPP PP      EPAPP    PS
Sbjct: 129 ESPASHSPPPSPPSHPGPHEPAPPESHNPS 158


>gnl|CDD|219005 pfam06387, Calcyon, D1 dopamine receptor-interacting protein
           (calcyon).  This family consists of several D1 dopamine
           receptor-interacting (calcyon) proteins. D1/D5 dopamine
           receptors in the basal ganglia, hippocampus, and
           cerebral cortex modulate motor, reward, and cognitive
           behaviour. D1-like dopamine receptors likely modulate
           neocortical and hippocampal neuronal excitability and
           synaptic function via Ca(2+) as well as cAMP-dependent
           signaling. Defective calcyon proteins have been
           implicated in both attention-deficit/hyperactivity
           disorder (ADHD) and schizophrenia.
          Length = 186

 Score = 29.2 bits (65), Expect = 8.3
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 739 PPAPDKVVVKKG--YDPKQARPPVVKPSPTDEFLIS--PITGEKIPAS 782
           P  PDKVVVK    Y P  A     +     EF +   P  G ++P +
Sbjct: 38  PSFPDKVVVKTQTEYQPTSADKGKKRAPLIAEFTVCSHPEEGRRLPTA 85


>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
           Provisional.
          Length = 517

 Score = 29.6 bits (67), Expect = 8.7
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 393 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
           SEEE E+I    PPP    +  A P  ++PS       PAP
Sbjct: 108 SEEEGEEILASSPPP----IPEAVPATAEPSESANAQFPAP 144



 Score = 29.6 bits (67), Expect = 8.7
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 702 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
           SEEE E+I    PPP    +  A P  ++PS       PAP
Sbjct: 108 SEEEGEEILASSPPP----IPEAVPATAEPSESANAQFPAP 144


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 29.3 bits (66), Expect = 9.0
 Identities = 18/86 (20%), Positives = 28/86 (32%)

Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
           E       +    +E E+    E   +  E    T  ++ E+    E  +   PP P P 
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVPKPY 258

Query: 408 SLKKVEPAPPLPSQPSIQPPPLPPAP 433
                 P+ PLP +      P    P
Sbjct: 259 ISPDEYPSAPLPPELPQLLQPSLVIP 284


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 29.3 bits (65), Expect = 9.0
 Identities = 21/76 (27%), Positives = 35/76 (46%)

Query: 323 SDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNT 382
           S   N    A  +  N N         I +++ + GT  EEE D + EE  + +E +   
Sbjct: 69  SACGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVE 128

Query: 383 QVQDMEEDSSSEEEQE 398
           +  D +EDS  ++E+E
Sbjct: 129 EEYDDDEDSEKDDEKE 144


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 28.6 bits (64), Expect = 9.1
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP-PPLPP 431
           + Q+ + P PP   +  + P PPLP    +QP PPL P
Sbjct: 109 QPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146



 Score = 28.6 bits (64), Expect = 9.1
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP-PPLPP 740
           + Q+ + P PP   +  + P PPLP    +QP PPL P
Sbjct: 109 QPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146


>gnl|CDD|129265 TIGR00161, TIGR00161, TIGR00161 family protein.  This model
           represents one out of two closely related ortholgous
           sets of proteins that, so far, are found only in the
           Archaea. This ortholog set includes MJ0106 from
           Methanococcus jannaschii and AF1251 from Archaeoglobus
           fulgidus, but not MJ1210 or AF0525 [Hypothetical
           proteins, Conserved].
          Length = 238

 Score = 29.0 bits (65), Expect = 9.2
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 4/41 (9%)

Query: 464 DPR----WVEQRDKHLNEKINQETVYAPGQAIEASLKQLAE 500
           DPR     VE  +K LN  ++ E +    +AIE+ LK+LAE
Sbjct: 182 DPRAAASLVEVLNKMLNTNVDPEPLLKEAEAIESRLKKLAE 222


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 29.5 bits (66), Expect = 9.3
 Identities = 27/129 (20%), Positives = 45/129 (34%), Gaps = 15/129 (11%)

Query: 270 PGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDY 329
           P   P P   +E    +++   ++  E+D        E  G R  E  D         DY
Sbjct: 272 PNLMPLPGDYNERRGGIILDISLNEFEQDAVPLLVDLERNGNRIKELADF-------GDY 324

Query: 330 YNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEE 389
           Y                QE   +   +E T  +EES++E E      E     ++ +++ 
Sbjct: 325 Y----EDIFEVVEVVEKQE--GDVVTEEST--DEESEDEVEIDESVIEEVAEMELLEVQV 376

Query: 390 DSSSEEEQE 398
           D  +E   E
Sbjct: 377 DDLAERLAE 385


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 28.3 bits (63), Expect = 9.4
 Identities = 14/54 (25%), Positives = 17/54 (31%)

Query: 713 PPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
           PP   + +     P PS        +   P        Y P  A PPV  P  T
Sbjct: 19  PPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGST 72


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.9 bits (65), Expect = 9.5
 Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 10/71 (14%)

Query: 335 RGRNANRGRGRMQ------EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME 388
           R R +N G  RM+       E +E         EEE D E E     +E ++     D +
Sbjct: 4   RARRSNAGN-RMKKLLEEELEEDEFFWTYLLFEEEEDDEEFEI---EEEEEEEEVDSDFD 59

Query: 389 EDSSSEEEQED 399
           +    E E +D
Sbjct: 60  DSEDDEPESDD 70


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 29.5 bits (67), Expect = 9.5
 Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 12/90 (13%)

Query: 487 PGQAI-EASLKQLAERRTDIFGVGDEETAIGKKIG--EEDTRKDDKLFPGQAIEASLKQL 543
             +AI +  L++L     +   +  E   + K+I   E+    +++L     I+  L ++
Sbjct: 419 QAEAILDLRLRRLT--GLEEEKIEKELKELEKEIADLEKILASEERLL--DIIKKELLEI 474

Query: 544 A-----ERRTDIFGVGDEETAIGKKIGEED 568
                 ERRT+I    ++E      I EED
Sbjct: 475 KKKFGDERRTEIVEEEEDEIEDEDLIAEED 504


>gnl|CDD|119406 cd00225, API3, Ascaris pepsin inhibitor-3 (API3); protein inhibitor
           that reversibly inhibits aspartic proteinase cathepsin
           E, and gastric enzymes pepsin and gastricsin.
          Length = 159

 Score = 28.8 bits (64), Expect = 9.5
 Identities = 12/62 (19%), Positives = 24/62 (38%), Gaps = 2/62 (3%)

Query: 349 EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDS--SSEEEQEDIPPPPPP 406
           E+  D+ QE     E+  +  EE +++ + +              +    +E +P  P  
Sbjct: 30  ELTPDEQQELAQYVEDVADYKEEVKQALKERQEGLKLRRAGKKKKAVTLAEEKLPKAPKK 89

Query: 407 PS 408
           PS
Sbjct: 90  PS 91


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 29.5 bits (66), Expect = 9.8
 Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 9/67 (13%)

Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
           R     N D MQ       E D+E ++ +   E K+  +V D +E+  ++     +    
Sbjct: 7   REANINNNDRMQ-------EKDDEKQDQKNRMELKE--KVLDKKEEVVTDNVDSPVKEQS 57

Query: 405 PPPSLKK 411
              +LK 
Sbjct: 58  SQENLKI 64


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.132    0.385 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 49,515,871
Number of extensions: 5175565
Number of successful extensions: 16100
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12157
Number of HSP's successfully gapped: 1082
Length of query: 910
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 804
Effective length of database: 6,236,078
Effective search space: 5013806712
Effective search space used: 5013806712
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 64 (28.6 bits)