RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17824
         (272 letters)



>gnl|CDD|238907 cd01926, cyclophilin_ABH_like, cyclophilin_ABH_like: Cyclophilin
           A, B and H-like cyclophilin-type peptidylprolyl cis-
           trans isomerase (PPIase) domain. This family represents
           the archetypal cystolic cyclophilin similar to human
           cyclophilins A, B and H. PPIase is an enzyme which
           accelerates protein folding by catalyzing the cis-trans
           isomerization of the peptide bonds preceding proline
           residues. These enzymes have been implicated in protein
           folding processes which depend on catalytic
           /chaperone-like activities. As cyclophilins, Human
           hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae
           Cpr1 and C. elegans Cyp-3, are inhibited by the
           immunosuppressive drug cyclopsporin A (CsA). CsA binds
           to the PPIase active site. Cyp-3. S. cerevisiae Cpr1
           interacts with the Rpd3 - Sin3 complex and in addition
           is a component of the Set3 complex. S. cerevisiae Cpr1
           has also been shown to have a role in Zpr1p nuclear
           transport. Human cyclophilin H associates with the
           [U4/U6.U5] tri-snRNP particles of the splicesome.
          Length = 164

 Score =  260 bits (667), Expect = 9e-89
 Identities = 97/167 (58%), Positives = 113/167 (67%), Gaps = 4/167 (2%)

Query: 6   PRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHR 65
           P+ +FD+ I   P GRIV EL++D+ P T ENFRALCTGEKG G   GK   Y    FHR
Sbjct: 1   PKVFFDITIGGEPAGRIVMELFADVVPKTAENFRALCTGEKGKG---GKPFGYKGSTFHR 57

Query: 66  VVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFIT 125
           V+  FMIQ GDF+ GNGTGG+SIYG  F DENF+LKH  P LLSMAN G +TNGSQFFIT
Sbjct: 58  VIPDFMIQGGDFTRGNGTGGKSIYGEKFPDENFKLKHTGPGLLSMANAGPNTNGSQFFIT 117

Query: 126 TQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
           T   P LDG HVVFG VV G +VV  IEN+      +P + V +  C
Sbjct: 118 TVKTPWLDGKHVVFGKVVEGMDVVKKIENVGSGN-GKPKKKVVIADC 163


>gnl|CDD|240249 PTZ00060, PTZ00060, cyclophilin; Provisional.
          Length = 183

 Score =  227 bits (581), Expect = 1e-75
 Identities = 98/171 (57%), Positives = 118/171 (69%), Gaps = 3/171 (1%)

Query: 5   RPRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFH 64
           RP+ +FD+ I + P GRIVFEL+SD+ P T ENFRALC G+K    ++GKNLHY   +FH
Sbjct: 15  RPKVFFDISIDNAPAGRIVFELFSDVTPKTAENFRALCIGDKV--GSSGKNLHYKGSIFH 72

Query: 65  RVVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFI 124
           R++  FM Q GD +  NGTGGESIYG  F DENF+LKHD+P LLSMAN G +TNGSQFFI
Sbjct: 73  RIIPQFMCQGGDITNHNGTGGESIYGRKFTDENFKLKHDQPGLLSMANAGPNTNGSQFFI 132

Query: 125 TTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNEL 175
           TT P P LDG HVVFG V+ G  VV  +E     +   P + V V  C EL
Sbjct: 133 TTVPCPWLDGKHVVFGKVIEGMEVVRAMEKEGT-QSGYPKKPVVVTDCGEL 182


>gnl|CDD|238194 cd00317, cyclophilin, cyclophilin: cyclophilin-type peptidylprolyl
           cis- trans isomerases. This family contains eukaryotic,
           bacterial and archeal proteins which exhibit a
           peptidylprolyl cis- trans isomerases activity (PPIase,
           Rotamase) and in addition bind the immunosuppressive
           drug cyclosporin (CsA).  Immunosuppression in
           vertebrates is believed to be the result of the
           cyclophilin A-cyclosporin protein drug complex binding
           to and inhibiting the protein-phosphatase calcineurin.  
           PPIase is an enzyme which accelerates protein folding by
           catalyzing the cis-trans isomerization of the peptide
           bonds preceding proline residues. Cyclophilins are a
           diverse family in terms of function and have been
           implicated in protein folding processes which depend on
           catalytic /chaperone-like activities. This group
           contains human cyclophilin 40, a co-chaperone of the
           hsp90 chaperone system;  human cyclophilin A, a
           chaperone in the HIV-1 infectious process and; human
           cyclophilin H, a component of the U4/U6 snRNP, whose
           isomerization or chaperoning activities may play a role
           in RNA splicing. .
          Length = 146

 Score =  192 bits (491), Expect = 2e-62
 Identities = 77/157 (49%), Positives = 86/157 (54%), Gaps = 14/157 (8%)

Query: 16  DIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSG 75
           D   GRIV ELY D  P T ENF +L  G             Y    FHRV+ GFMIQ G
Sbjct: 3   DTTKGRIVIELYGDEAPKTVENFLSLARGG-----------FYDGTTFHRVIPGFMIQGG 51

Query: 76  DFSVGNGTGGESIYGGTFEDENFELK-HDEPFLLSMANRGKDTNGSQFFITTQPAPHLDG 134
           D +     GG S  G  F DENF LK H     LSMAN G +TNGSQFFITT P PHLDG
Sbjct: 52  DPT--GTGGGGSGPGYKFPDENFPLKYHHRRGTLSMANAGPNTNGSQFFITTAPTPHLDG 109

Query: 135 VHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVK 171
            H VFG VV G +VV  IE    D+  RP++ V +  
Sbjct: 110 KHTVFGKVVEGMDVVDKIERGDTDENGRPIKPVTISD 146


>gnl|CDD|215759 pfam00160, Pro_isomerase, Cyclophilin type peptidyl-prolyl
           cis-trans isomerase/CLD.  The peptidyl-prolyl cis-trans
           isomerases, also known as cyclophilins, share this
           domain of about 109 amino acids. Cyclophilins have been
           found in all organisms studied so far and catalyze
           peptidyl-prolyl isomerisation during which the peptide
           bond preceding proline (the peptidyl-prolyl bond) is
           stabilised in the cis conformation. Mammalian
           cyclophilin A (CypA) is a major cellular target for the
           immunosuppressive drug cyclosporin A (CsA). Other roles
           for cyclophilins may include chaperone and cell
           signalling function.
          Length = 144

 Score =  185 bits (471), Expect = 2e-59
 Identities = 71/164 (43%), Positives = 90/164 (54%), Gaps = 20/164 (12%)

Query: 9   YFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVK 68
           + D+  +   LGRIV EL+ D  P T ENF +LC           K   Y   +FHRV+ 
Sbjct: 1   FLDITTN---LGRIVIELFGDEAPKTVENFLSLC-----------KKGFYDGTIFHRVIP 46

Query: 69  GFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFITTQP 128
           GFMIQ GD + G GTGG+     +  DE       +   LSMA  G ++ GSQFFIT   
Sbjct: 47  GFMIQGGDPT-GPGTGGK----RSIPDEFVSKLKHKRGTLSMARSGPNSAGSQFFITLGD 101

Query: 129 APHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
           APHLDG + VFG VV G +VV  IE +  D   RPL+DV++  C
Sbjct: 102 APHLDGGYTVFGRVVEGMDVVEKIEKVETDG-DRPLKDVKITSC 144


>gnl|CDD|178694 PLN03149, PLN03149, peptidyl-prolyl isomerase H (cyclophilin H);
           Provisional.
          Length = 186

 Score =  183 bits (466), Expect = 3e-58
 Identities = 88/171 (51%), Positives = 101/171 (59%), Gaps = 4/171 (2%)

Query: 6   PRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHR 65
           P  +FDV I  IP GRI  EL++DI P T ENFR  CTGE    +  G    Y    FHR
Sbjct: 19  PVVFFDVTIGGIPAGRIKMELFADIAPKTAENFRQFCTGEF---RKAGLPQGYKGCQFHR 75

Query: 66  VVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFIT 125
           V+K FMIQ GDF  G+GTG  SIYG  FEDENF  KH  P LLSMAN G +TNG QFFIT
Sbjct: 76  VIKDFMIQGGDFLKGDGTGCVSIYGSKFEDENFIAKHTGPGLLSMANSGPNTNGCQFFIT 135

Query: 126 TQPAPHLDGVHVVFGHVV-SGQNVVSHIENLPVDKMSRPLQDVRVVKCNEL 175
                 LD  HVVFG V+  G  VV  IEN+     +RP     + +C E+
Sbjct: 136 CAKCDWLDNKHVVFGRVLGDGLLVVRKIENVATGPNNRPKLACVISECGEM 186


>gnl|CDD|223725 COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) -
           cyclophilin family [Posttranslational modification,
           protein turnover, chaperones].
          Length = 158

 Score =  177 bits (452), Expect = 3e-56
 Identities = 67/169 (39%), Positives = 82/169 (48%), Gaps = 20/169 (11%)

Query: 14  ISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQ 73
           I +   G I  ELY D  P T  NF  L            K   Y   +FHRV+ GFMIQ
Sbjct: 3   ILETNKGDITIELYPDKAPKTVANFLQLV-----------KEGFYDGTIFHRVIPGFMIQ 51

Query: 74  SGDFSVGNGTGGESIYGGTFEDENFELKHD--EPFLLSMANRG-KDTNGSQFFITTQPAP 130
            GD + G+GTGG       F+DENF L  D  +   LSMA  G  ++NGSQFFIT    P
Sbjct: 52  GGDPTGGDGTGGPGP---PFKDENFALNGDRHKRGTLSMARAGDPNSNGSQFFITVVDNP 108

Query: 131 HLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQ---DVRVVKCNELV 176
            LDG + VFG VV G +VV  I+N   D          D   +   ++V
Sbjct: 109 FLDGKYTVFGQVVEGMDVVDKIKNGDTDDSGYVQDVPADPVKILSVKIV 157


>gnl|CDD|238908 cd01927, cyclophilin_WD40, cyclophilin_WD40: cyclophilin-type
           peptidylprolyl cis- trans isomerases (cyclophilins)
           having a WD40 domain. This group consists of several
           hypothetical and putative eukaryotic and bacterial
           proteins which have a cyclophilin domain and a WD40
           domain. Function of the protein is not known.
          Length = 148

 Score =  175 bits (445), Expect = 2e-55
 Identities = 77/153 (50%), Positives = 96/153 (62%), Gaps = 13/153 (8%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
            G I   L+ +  P T ENF               +N +Y N +FHRV+KGFMIQ+GD  
Sbjct: 6   KGDIHIRLFPEEAPKTVENFTTHA-----------RNGYYNNTIFHRVIKGFMIQTGD-P 53

Query: 79  VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
            G+GTGGESI+G  FEDE +  LKHD P+ LSMAN G +TNGSQFFITT   P LD  H 
Sbjct: 54  TGDGTGGESIWGKEFEDEFSPSLKHDRPYTLSMANAGPNTNGSQFFITTVATPWLDNKHT 113

Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVV 170
           VFG VV G +VV  IEN+  DK  RP +D++++
Sbjct: 114 VFGRVVKGMDVVQRIENVKTDKNDRPYEDIKII 146


>gnl|CDD|238909 cd01928, Cyclophilin_PPIL3_like, Cyclophilin_PPIL3_like. Proteins
           similar to Human cyclophilin-like peptidylprolyl cis-
           trans isomerase (PPIL3). Members of this family lack a
           key residue important for cyclosporin binding: the
           tryptophan residue corresponding to W121 in human
           hCyP-18a; most members have a histidine at this
           position. The exact function of the protein is not
           known.
          Length = 153

 Score =  165 bits (420), Expect = 1e-51
 Identities = 77/152 (50%), Positives = 95/152 (62%), Gaps = 13/152 (8%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
           LG I  EL+ D CP  CENF ALC              +Y   +FHR +KGFM+Q+GD  
Sbjct: 9   LGDIKIELFCDDCPKACENFLALCASG-----------YYNGCIFHRNIKGFMVQTGD-P 56

Query: 79  VGNGTGGESIYGGTFEDENFE-LKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
            G G GGESI+G  FEDE  E LKHD   ++SMAN G +TNGSQFFIT    PHLDG + 
Sbjct: 57  TGTGKGGESIWGKKFEDEFRETLKHDSRGVVSMANNGPNTNGSQFFITYAKQPHLDGKYT 116

Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
           VFG V+ G   +  +E LPVDK  RPL+++R+
Sbjct: 117 VFGKVIDGFETLDTLEKLPVDKKYRPLEEIRI 148


>gnl|CDD|238904 cd01923, cyclophilin_RING, cyclophilin_RING: cyclophilin-type
           peptidylprolyl cis- trans isomerases (cyclophilins)
           having a modified RING finger domain. This group
           includes the nuclear proteins, Human hCyP-60 and
           Caenorhabditis elegans MOG-6 which, compared to the
           archetypal cyclophilin Human cyclophilin A exhibit
           reduced peptidylprolyl cis- trans isomerase activity and
           lack a residue important for cyclophilin binding. Human
           hCyP-60 has been shown to physically interact with the
           proteinase inhibitor peptide eglin c and; C. elegans
           MOG-6 to physically interact with MEP-1, a nuclear zinc
           finger protein. MOG-6 has been shown to function in
           germline sex determination.
          Length = 159

 Score =  159 bits (403), Expect = 5e-49
 Identities = 67/152 (44%), Positives = 87/152 (57%), Gaps = 13/152 (8%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
            G +  EL+ D  P  CENF  LC           K  +Y   +FHR ++ FMIQ GD +
Sbjct: 8   KGDLNLELHCDKAPKACENFIKLC-----------KKGYYDGTIFHRSIRNFMIQGGDPT 56

Query: 79  VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
            G G GGESI+G  F+DE    L HD   +LSMAN G +TNGSQFFIT +   HLDG H 
Sbjct: 57  -GTGRGGESIWGKPFKDEFKPNLSHDGRGVLSMANSGPNTNGSQFFITYRSCKHLDGKHT 115

Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
           VFG VV G   +  +EN+P     RP +++++
Sbjct: 116 VFGRVVGGLETLEAMENVPDPGTDRPKEEIKI 147


>gnl|CDD|238903 cd01922, cyclophilin_SpCYP2_like, cyclophilin_SpCYP2_like:
           cyclophilin 2-like peptidylprolyl cis- trans isomerase
           (PPIase) domain similar to Schizosaccharomyces pombe
           cyp-2. These proteins bind their respective SNW
           chromatin binding protein in autologous systems, in a
           CsA independent manner indicating interaction with a
           surface outside the PPIase active site. SNW proteins
           play a basic and broad range role in signaling.
          Length = 146

 Score =  145 bits (368), Expect = 7e-44
 Identities = 69/154 (44%), Positives = 90/154 (58%), Gaps = 14/154 (9%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
           +G I  ELY +  P TC+NF  L    +G         +Y   +FHR++K FMIQ GD  
Sbjct: 6   MGEITLELYWNHAPKTCKNFYELAK--RG---------YYNGTIFHRLIKDFMIQGGD-P 53

Query: 79  VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
            G G GG SIYG  FEDE + ELKH    +LSMAN G +TNGSQFFIT  P P LDG H 
Sbjct: 54  TGTGRGGASIYGKKFEDEIHPELKHTGAGILSMANAGPNTNGSQFFITLAPTPWLDGKHT 113

Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVK 171
           +FG V  G  V+ ++  +      RP+ +V+++K
Sbjct: 114 IFGRVSKGMKVIENMVEVQTQT-DRPIDEVKILK 146


>gnl|CDD|238906 cd01925, cyclophilin_CeCYP16-like, cyclophilin_CeCYP16-like:
           cyclophilin-type peptidylprolyl cis- trans isomerase)
           (PPIase) domain similar to Caenorhabditis elegans
           cyclophilin 16. C. elegans CeCYP-16, compared to the
           archetypal cyclophilin Human cyclophilin A has, a
           reduced peptidylprolyl cis- trans isomerase activity, is
           cyclosporin insensitive and shows an altered substrate
           preference favoring, hydrophobic, acidic or amide amino
           acids. Most members of this subfamily have a glutamate
           residue in the active site at the position equivalent to
           a tryptophan (W121 in Human cyclophilin A), which has
           been shown to be important for cyclophilin binding.
          Length = 171

 Score =  124 bits (314), Expect = 2e-35
 Identities = 62/157 (39%), Positives = 83/157 (52%), Gaps = 16/157 (10%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
            G I  EL+S   P  C NF  LC              +Y N +FHRVV GF+IQ GD +
Sbjct: 14  AGDIDIELWSKEAPKACRNFIQLCL-----------EGYYDNTIFHRVVPGFIIQGGDPT 62

Query: 79  VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
            G GTGGESIYG  F+DE +  L+ +   L+ MAN G D+NGSQFF T   A  L+  H 
Sbjct: 63  -GTGTGGESIYGEPFKDEFHSRLRFNRRGLVGMANAGDDSNGSQFFFTLDKADELNNKHT 121

Query: 138 VFGHVV--SGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
           +FG V   +  N++  +  +  DK  RP+   ++   
Sbjct: 122 LFGKVTGDTIYNLLK-LAEVETDKDERPVYPPKITSV 157


>gnl|CDD|238902 cd01921, cyclophilin_RRM, cyclophilin_RRM: cyclophilin-type
           peptidylprolyl cis- trans isomerase domain occuring with
           a C-terminal RNA recognition motif domain (RRM). This
           subfamily of the cyclophilin domain family contains a
           number of eukaryotic cyclophilins having the RRM domain
           including the nuclear proteins: human hCyP-57,
           Arabidopsis thaliana AtCYP59, Caenorhabditis elegans
           CeCyP-44 and Paramecium tetrurelia Kin241. The Kin241
           protein has been shown to have a role in cell
           morphogenesis.
          Length = 166

 Score =  117 bits (294), Expect = 1e-32
 Identities = 65/160 (40%), Positives = 88/160 (55%), Gaps = 21/160 (13%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
           LG +V +L++D CP+ C NF  LC           K  +Y   +F+ V K F+ Q+GD +
Sbjct: 6   LGDLVIDLFTDECPLACLNFLKLC-----------KLKYYNFCLFYNVQKDFIAQTGDPT 54

Query: 79  VGNGTGGESIYGGT-------FEDE-NFELKHDEPFLLSMANRGKDTNGSQFFIT-TQPA 129
            G G GGESIY          FE E    LKH +   +SM N G + NGSQF+IT  +  
Sbjct: 55  -GTGAGGESIYSQLYGRQARFFEPEILPLLKHSKKGTVSMVNAGDNLNGSQFYITLGENL 113

Query: 130 PHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
            +LDG H VFG VV G +V+  I +  VD   RPL+D+R+
Sbjct: 114 DYLDGKHTVFGQVVEGFDVLEKINDAIVDDDGRPLKDIRI 153


>gnl|CDD|140248 PTZ00221, PTZ00221, cyclophilin; Provisional.
          Length = 249

 Score =  118 bits (296), Expect = 5e-32
 Identities = 76/201 (37%), Positives = 104/201 (51%), Gaps = 7/201 (3%)

Query: 7   RCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRV 66
           R + D+ I D+  GR+VFEL+ D+ P T ENFRAL TG  GI   TG  L YL    H V
Sbjct: 54  RAFLDISIGDVLAGRLVFELFEDVVPETVENFRALITGSCGIDTNTGVKLDYLYTPVHHV 113

Query: 67  VKGF-MIQSGDF-SVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFI 124
            +   +I  G+  S    + G  I      DE +  +H E  LL+M + G  T+GS F I
Sbjct: 114 DRNNNIIVLGELDSFNVSSTGTPI-----ADEGYRHRHTERGLLTMISEGPHTSGSVFGI 168

Query: 125 TTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAK 184
           T  P+P LD   VVFG  V   +++  +E+LP+D + RPL  V V  C  L  +     +
Sbjct: 169 TLGPSPSLDFKQVVFGKAVDDLSLLEKLESLPLDDVGRPLLPVTVSFCGALTGEKPPGRQ 228

Query: 185 EKRKASPTSASSDESDKEEEE 205
           +   A+  SASS+      EE
Sbjct: 229 QLLAAADDSASSEHVACAAEE 249


>gnl|CDD|238901 cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like:
           cyclophilin-type A-like peptidylprolyl cis- trans
           isomerase (PPIase) domain similar to the cytosolic E.
           coli cyclophilin A and Streptomyces antibioticus
           SanCyp18. Compared to the archetypal cyclophilin Human
           cyclophilin A, these have reduced affinity for
           cyclosporin A.  E. coli cyclophilin A has a similar
           peptidylprolyl cis- trans isomerase activity to the
           human cyclophilin A. Most members of this subfamily
           contain a phenylalanine residue at the position
           equivalent to Human cyclophilin W121, where a tyrptophan
           has been shown to be important for cyclophilin binding.
          Length = 155

 Score = 80.2 bits (198), Expect = 1e-18
 Identities = 53/170 (31%), Positives = 65/170 (38%), Gaps = 47/170 (27%)

Query: 19  LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
           LG IV ELY D  PIT ENF A     KG          Y N +FHRV+ GF+IQ     
Sbjct: 6   LGDIVVELYDDKAPITVENFLAYV--RKG---------FYDNTIFHRVISGFVIQ----- 49

Query: 79  VGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNG--------------SQFFI 124
                      GG F  +  + +  +P      N   +T G              SQFFI
Sbjct: 50  -----------GGGFTPDLAQKETLKPIKNEAGNGLSNTRGTIAMARTNAPDSATSQFFI 98

Query: 125 TTQPAPHLD-----GVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
             +    LD       + VFG V  G +VV  I  +         QDV V
Sbjct: 99  NLKDNASLDYQNEQWGYTVFGEVTEGMDVVDKIAGVETYS-FGSYQDVPV 147


>gnl|CDD|182824 PRK10903, PRK10903, peptidyl-prolyl cis-trans isomerase A (rotamase
           A); Provisional.
          Length = 190

 Score = 56.4 bits (136), Expect = 1e-09
 Identities = 48/171 (28%), Positives = 64/171 (37%), Gaps = 51/171 (29%)

Query: 20  GRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFSV 79
           G I  EL S   P++ +NF                +  Y N  FHRV+ GFMIQ G F+ 
Sbjct: 38  GNIELELNSQKAPVSVKNFVDYVN-----------SGFYNNTTFHRVIPGFMIQGGGFT- 85

Query: 80  GNGTGGESIYGGTFEDENFELKHDEPFLLSMANRG---------------KDTNGSQFFI 124
                           E  + K   P + + A+ G               KD+  SQFFI
Sbjct: 86  ----------------EQMQQKKPNPPIKNEADNGLRNTRGTIAMARTADKDSATSQFFI 129

Query: 125 TTQPAPHLD------GVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
                  LD      G + VFG VV G +V   I  +P   +  P Q+V  
Sbjct: 130 NVADNAFLDHGQRDFG-YAVFGKVVKGMDVADKISQVPTHDVG-PYQNVPS 178


>gnl|CDD|182734 PRK10791, PRK10791, peptidyl-prolyl cis-trans isomerase B (rotamase
           B); Provisional.
          Length = 164

 Score = 51.8 bits (124), Expect = 3e-08
 Identities = 45/147 (30%), Positives = 63/147 (42%), Gaps = 27/147 (18%)

Query: 20  GRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFSV 79
           G IV + + D  P T +NF   C           +   Y N +FHRV+ GFMIQ G F  
Sbjct: 9   GDIVIKTFDDKAPETVKNFLDYC-----------REGFYNNTIFHRVINGFMIQGGGFEP 57

Query: 80  GNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNG--SQFFITTQP-------A 129
           G     +       ++E N  LK+     L+MA R +  +   +QFFI            
Sbjct: 58  GM---KQKATKEPIKNEANNGLKNTRG-TLAMA-RTQAPHSATAQFFINVVDNDFLNFSG 112

Query: 130 PHLDGV-HVVFGHVVSGQNVVSHIENL 155
             L G  + VF  VV G +VV  I+ +
Sbjct: 113 ESLQGWGYCVFAEVVEGMDVVDKIKGV 139


>gnl|CDD|238905 cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like:
           cyclophilin-type peptidylprolyl cis- trans isomerases
           (cyclophilins) similar ot the Spinach thylakoid lumen
           protein TLP40.  Compared to the archetypal cyclophilin
           Human cyclophilin A, these proteins have similar
           peptidylprolyl cis- trans isomerase activity and reduced
           affinity for cyclosporin A. Spinach TLP40 has been shown
           to have a dual function as a folding catalyst and
           regulator of dephosphorylation.
          Length = 176

 Score = 51.7 bits (124), Expect = 3e-08
 Identities = 47/167 (28%), Positives = 67/167 (40%), Gaps = 47/167 (28%)

Query: 21  RIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGD-FSV 79
            IV + Y+   P+T  NF  L   E+G          Y  + FHRV  GF++Q+GD    
Sbjct: 10  TIVLDGYN--APVTAGNFVDLV--ERGF---------YDGMEFHRVEGGFVVQTGDPQGK 56

Query: 80  GNG-----TG--------------GESIYGGTFE-----DENFELKHDEPFLLSMANRGK 115
             G     TG               + +YG T E     DE   L  +    ++MA    
Sbjct: 57  NPGFPDPETGKSRTIPLEIKPEGQKQPVYGKTLEEAGRYDEQPVLPFNAFGAIAMARTEF 116

Query: 116 DTNG--SQFFI-------TTQPAPHLDGVHVVFGHVVSGQNVVSHIE 153
           D N   SQFF        T      LDG + VFG+V  G +++  ++
Sbjct: 117 DPNSASSQFFFLLKDNELTPSRNNVLDGRYAVFGYVTDGLDILRELK 163


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 48.0 bits (115), Expect = 2e-06
 Identities = 24/74 (32%), Positives = 46/74 (62%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
             K+  +K +     +EK+ ++ KK KK K    KK + EEEEEKE+ E+   E+E +  
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 259 KKHKKDKKKRKKKS 272
           ++ +++++K+KK++
Sbjct: 464 EEKEEEEEKKKKQA 477



 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 22/73 (30%), Positives = 45/73 (61%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
             S ++ K+ ++  E  +   E+E K++KK     KKK+++ + E+E+++EE E+ + E 
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 254 EAKKRKKHKKDKK 266
           E +K ++ +K KK
Sbjct: 463 EEEKEEEEEKKKK 475



 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 20/72 (27%), Positives = 42/72 (58%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
             K +           E++ +EE ++   +  A K+K+ +++++K+KK+ + EEEEE+ E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 246 GEQTDSEKEAKK 257
            E+ + E++ KK
Sbjct: 464 EEKEEEEEKKKK 475



 Score = 44.1 bits (105), Expect = 4e-05
 Identities = 20/73 (27%), Positives = 42/73 (57%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
                A+       E+ +++ E+   EK+ K     KK+++++++K K EEE+E+EE E 
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 249 TDSEKEAKKRKKH 261
            + ++E +++KK 
Sbjct: 463 EEEKEEEEEKKKK 475



 Score = 43.8 bits (104), Expect = 5e-05
 Identities = 19/69 (27%), Positives = 42/69 (60%)

Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
            E+E    T S+K  KK KK  +  +KK+++ K E++++   G++ + E+E +K KK ++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 264 DKKKRKKKS 272
            +++ ++  
Sbjct: 455 KEEEEEEAE 463



 Score = 41.1 bits (97), Expect = 4e-04
 Identities = 17/76 (22%), Positives = 47/76 (61%), Gaps = 8/76 (10%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           + + K   KA++KR+        +E  +++++   G++ + E+E +K KK ++ ++++++
Sbjct: 410 KKIKKIVEKAEKKRE--------EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461

Query: 234 KSKYEEEEEKEEGEQT 249
             + +EEEE+++ +Q 
Sbjct: 462 AEEEKEEEEEKKKKQA 477



 Score = 40.3 bits (95), Expect = 7e-04
 Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 8/81 (9%)

Query: 200 DKEEEEKEEGEQTDSEKEAK-KRKKHKKDKKKKKKKSKY-------EEEEEKEEGEQTDS 251
            +EE E   G +  ++K  K   K  KK +++KK+K K        EEEEE+E+ ++ + 
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
           ++E ++  + +K++++ KKK 
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475



 Score = 33.4 bits (77), Expect = 0.11
 Identities = 15/58 (25%), Positives = 38/58 (65%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           K + + K+++K    +    E ++EEE++++ E+ + E+E  + +K ++++KKKK+ +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 47.2 bits (112), Expect = 5e-06
 Identities = 29/106 (27%), Positives = 55/106 (51%), Gaps = 1/106 (0%)

Query: 168 RVVKCNELVLKSKLK-AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
           RV K       +K K AKE +  S      ++   +EE+K++ E+   E + +K K+  K
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +K+  K+K K +E++ +E  ++ + +K  + R K +  K  +KK  
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP 184



 Score = 41.8 bits (98), Expect = 2e-04
 Identities = 28/109 (25%), Positives = 52/109 (47%), Gaps = 1/109 (0%)

Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD-SEKEAKKRKK 223
            D  V +  +   K      +  K     +  +E  ++E+ KEE ++     KE  K +K
Sbjct: 73  SDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132

Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K++ K+K+   + E+E+EK+  E  D E+E K+ +   K + K+  K 
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181



 Score = 41.4 bits (97), Expect = 3e-04
 Identities = 31/102 (30%), Positives = 53/102 (51%), Gaps = 1/102 (0%)

Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
           KC E  L S    K   K      ++     +E + E G++ + EKE  K +K KK +K 
Sbjct: 65  KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124

Query: 231 KK-KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K+  K +  +EE KE+    + EKE +K+ +  +D+++ KK+
Sbjct: 125 KEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166



 Score = 40.3 bits (94), Expect = 7e-04
 Identities = 22/101 (21%), Positives = 45/101 (44%), Gaps = 5/101 (4%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE----KEAKKRKKHKKDKKKKK 232
                K  +           ++  +E+++K+E  + + +    KE  K K+  K+K+K+K
Sbjct: 91  KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150

Query: 233 KKSKYEE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +K   E  + E+E+  +    K   K+   KK   K+K+  
Sbjct: 151 EKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191



 Score = 38.7 bits (90), Expect = 0.002
 Identities = 19/94 (20%), Positives = 47/94 (50%), Gaps = 1/94 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           +   + K+K+K  P     D   K EE KE+    + EKE +K+ +  +D++++KK+ + 
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPK-EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             +   ++  +     + K+  + +K ++  ++ 
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREA 203



 Score = 38.7 bits (90), Expect = 0.002
 Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             K K + K K  P     ++  K EE ++  E+   E+   + K   K   KKK  +K 
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV--RAKSRPKKPPKKKPPNKK 187

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +E  E+E+  Q   E    K ++   ++++ K++
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221



 Score = 34.9 bits (80), Expect = 0.038
 Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 4/94 (4%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K K K KEK+   P      E +K+ E      +     + KK    KK+  +++K+ + 
Sbjct: 144 KEKEKEKEKKVEEP---RDREEEKKRERVRAKSRPKKPPK-KKPPNKKKEPPEEEKQRQA 199

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             E  K + E+ D  +E +K +   KD++     
Sbjct: 200 AREAVKGKPEEPDVNEEREKEEDDGKDRETTTSP 233



 Score = 29.1 bits (65), Expect = 2.7
 Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 2/97 (2%)

Query: 173 NELVLKSKLKAKEK-RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK 231
           +    K + + + K R   P         KE  E+E+  Q   E    K ++   +++++
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE 218

Query: 232 KKKSKYEEEEE-KEEGEQTDSEKEAKKRKKHKKDKKK 267
           K++   ++ E      E+ +S + ++  ++     KK
Sbjct: 219 KEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKK 255


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 46.7 bits (110), Expect = 9e-06
 Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 3/107 (2%)

Query: 169  VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR---KKHK 225
              K        + +A E++  +      +   K +  K++ E+     EAKK+    K K
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406

Query: 226  KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             D+ KK   +K + +E K++ E+     EAKK+ +  K   + KKK+
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA 1453



 Score = 46.7 bits (110), Expect = 9e-06
 Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 5/99 (5%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKKK 232
            + K+KA+E +KA      ++E  K EEEK++ EQ   ++  +K+K     K +++ K K 
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663

Query: 233  KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             +   + EE+K++ E+    +E +K+      K+  + K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702



 Score = 46.3 bits (109), Expect = 1e-05
 Identities = 33/99 (33%), Positives = 57/99 (57%), Gaps = 5/99 (5%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK-----RKKHKKDKKKKK 232
            ++K KA+E +KA      ++E+ K +E K++ E+     EAKK     +KK  + KK  +
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507

Query: 233  KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             K K +E ++ EE ++ D  K+A++ KK  + KK  +KK
Sbjct: 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546



 Score = 45.9 bits (108), Expect = 2e-05
 Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 2/90 (2%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            KA+EK+KA      ++E+ K +E K++ E+  ++K+A   KK  ++ KK  + +K E E 
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEE--AKKKADAAKKKAEEAKKAAEAAKAEAEA 1354

Query: 242  EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              +E E  + + EA ++KK +  KK    K
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384



 Score = 45.5 bits (107), Expect = 2e-05
 Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 5/95 (5%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKKKKKSK 236
             A+E +KA      ++E+ K EE++++  +   ++  + +K     K + ++KKK ++ K
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK 1722

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              EEE K + E+   E E  K+K  +  K + +KK
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 45.5 bits (107), Expect = 2e-05
 Identities = 28/95 (29%), Positives = 49/95 (51%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            ++K KA   +K +  +  + E+ K E E    E   +E++A+  +K K++ KKK   +K 
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            + EE+K+  E     +E KK+    K     KKK+
Sbjct: 1386 KAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420



 Score = 44.4 bits (104), Expect = 4e-05
 Identities = 29/97 (29%), Positives = 54/97 (55%), Gaps = 2/97 (2%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKS 235
            ++K KA+EK+KA      ++E+ K +E K++ E+    +EAKK+  +  K D+ KKK + 
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 236  KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              + +E K++ E+   + +  K+    K K    KK+
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA 1518



 Score = 44.0 bits (103), Expect = 5e-05
 Identities = 29/92 (31%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            ++K KA+E +KA      ++E+ K EE K++ E+     EAKK+ +  K   + KKK+  
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA-- 1492

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
             EE +K+  E   + +  KK  + KK ++ +K
Sbjct: 1493 -EEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523



 Score = 43.6 bits (102), Expect = 8e-05
 Identities = 26/96 (27%), Positives = 50/96 (52%), Gaps = 5/96 (5%)

Query: 182  KAKEKRKASPTSAS-----SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            KA+EK+KA     +     +DE+ K+ EE ++ ++   + E  K+K     KK ++ K  
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             E  + + E    ++E   +K +  +K K++ KKK+
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380



 Score = 43.2 bits (101), Expect = 1e-04
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKSKY 237
            K  A  K+KA      ++E  K +E K++ E+     EAKK+  +  K ++ KKK +   
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            + +E K++ E+     EAKK+ +  K K    KK+
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 42.8 bits (100), Expect = 1e-04
 Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 1/94 (1%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            + K KA E +K +  +  +DE+ K+ EE K++ +    + E  K+       + +    +
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
             E  EEK E  +   E+  KK    KK  +++KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 42.4 bits (99), Expect = 2e-04
 Identities = 30/105 (28%), Positives = 55/105 (52%), Gaps = 10/105 (9%)

Query: 178  KSKLKAKEKRKASPTSAS-----SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
            ++ +KA+E RKA     +     +DE+ K EE+K+  E     +EAKK  + KK  ++ K
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328

Query: 233  KKS-----KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            KK+     K EE ++  E  + ++E  A + +  ++  +  +KK 
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 42.4 bits (99), Expect = 2e-04
 Identities = 31/95 (32%), Positives = 52/95 (54%), Gaps = 4/95 (4%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            + K KA E +KA+     +DE+ K+ EEK++ ++   + E  K    K D+ KKK +   
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK----KADEAKKKAEEAK 1457

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            + EE K++ E+     EAKK+ +  K   + KKK+
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492



 Score = 42.1 bits (98), Expect = 2e-04
 Identities = 29/96 (30%), Positives = 52/96 (54%), Gaps = 3/96 (3%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSK 236
            K K + K+KA      ++E  K +E K++ E+   + +  K+    K K D+ KKK + K
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             + +E K++ E+     EAKK+ +  K  ++ KKK+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466



 Score = 41.7 bits (97), Expect = 3e-04
 Identities = 28/100 (28%), Positives = 56/100 (56%), Gaps = 6/100 (6%)

Query: 178  KSKLKAKEKRKASPTSA--SSDESDKEEEEKEEGEQTDSEKEAKKRK----KHKKDKKKK 231
            + K KA+E +KA   +   +++E+ K EE+K++ E+    +E +K+     K + ++ KK
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 232  KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             ++ K +E EEK++ E+    +E  K K  +  K+  + K
Sbjct: 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743



 Score = 41.3 bits (96), Expect = 4e-04
 Identities = 29/98 (29%), Positives = 58/98 (59%), Gaps = 6/98 (6%)

Query: 178  KSKLKAKEKRKASPTSASSDES-DKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKK 231
            + K KA+E +KA      + E+  KE EE ++ E+   ++  +K+K     K +++ K K
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731

Query: 232  KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
             +++K E EE+K++ E+   ++E KK+  H K ++++K
Sbjct: 1732 AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769



 Score = 41.3 bits (96), Expect = 5e-04
 Identities = 28/87 (32%), Positives = 53/87 (60%), Gaps = 2/87 (2%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
            K KA+E +KA      ++E+ K +E K++ E+  ++K+A + KK  + KKK  +  K EE
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEE--AKKKADEAKKAAEAKKKADEAKKAEE 1520

Query: 240  EEEKEEGEQTDSEKEAKKRKKHKKDKK 266
             ++ +E ++ +  K+A + KK ++ KK
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKK 1547



 Score = 40.5 bits (94), Expect = 8e-04
 Identities = 28/95 (29%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 178  KSKLKAKE-KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            +   KA E K+KA      +DE+ K  E K++ ++    +EAKK  + KK ++ KK    
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             + EE+K+  E   +E+  K  +K K ++ K+ ++
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574



 Score = 39.4 bits (91), Expect = 0.002
 Identities = 25/98 (25%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR----KKHKKDKKKKKK 233
            +++++   K         ++E+ K EE K + E+    +E KK+    KK + ++KKK +
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 234  KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + K  EEE K +  +   + E  K+K  +  K +  +K
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 38.6 bits (89), Expect = 0.003
 Identities = 29/93 (31%), Positives = 49/93 (52%), Gaps = 3/93 (3%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSKYE 238
            KA+E +KA            EEE+K + E+    +EAK +    K  +++KKK ++ K +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641

Query: 239  EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            E EEK++ E+    +E  K K  ++ KK  + K
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674



 Score = 37.4 bits (86), Expect = 0.008
 Identities = 25/93 (26%), Positives = 51/93 (54%), Gaps = 1/93 (1%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            +   KA E +K +  +  +DE+ K+ EE K++ ++     EAKK+    K  ++ KK  +
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
             ++ EE ++ ++    +E KK  + KK ++ +K
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559



 Score = 37.0 bits (85), Expect = 0.010
 Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 4/99 (4%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            ++K KA E +KA+     +DE+ K EE K+  E   +E+  K  +  K ++KKK  + K 
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553

Query: 238  EEE----EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             EE    EEK++ E+    +E K     K ++ K+ +++
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEA 1592



 Score = 36.7 bits (84), Expect = 0.015
 Identities = 25/91 (27%), Positives = 53/91 (58%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            KA+EK+KA     + ++ +    + EE ++ +  +  +  K ++++KK K +++K  EE 
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

Query: 242  EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            + +  E   +E+E KK ++ KK + + KKK+
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649



 Score = 34.7 bits (79), Expect = 0.060
 Identities = 24/90 (26%), Positives = 47/90 (52%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            KA+  +KA      ++E+ K EEE+   E    E+        ++   K ++  K +E +
Sbjct: 1225 KAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELK 1284

Query: 242  EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + EE ++ D  K+A+++KK  + KKK ++ 
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEA 1314



 Score = 33.6 bits (76), Expect = 0.12
 Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 8/96 (8%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE--- 238
            KA+EK+KA     +  E  K+ EEK++ E+    +E K     K ++ KK ++++ E   
Sbjct: 1541 KAEEKKKADELKKA--EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVM 1598

Query: 239  ---EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
               EEE+K + E+    +EAK + +  K  ++ KKK
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634



 Score = 33.6 bits (76), Expect = 0.14
 Identities = 30/92 (32%), Positives = 56/92 (60%), Gaps = 10/92 (10%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            K +A+EK+KA       +E  K EEE   K E  + ++E++ KK ++ KKD+++KKK + 
Sbjct: 1709 KKEAEEKKKA-------EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761

Query: 237  YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
             ++EEEK+  E    ++   + +  ++D+K+R
Sbjct: 1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRR 1793



 Score = 32.0 bits (72), Expect = 0.34
 Identities = 27/133 (20%), Positives = 58/133 (43%), Gaps = 22/133 (16%)

Query: 162  RPLQDVRVVKCNELVLKSKLKAKEKRKA----------------------SPTSASSDES 199
            R  +D +  +  +   ++K  A+E +KA                         +  ++E+
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277

Query: 200  DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
             K +E K+  E+  +++  K  +K K D+ KKK +   + +E K++ E+   + +A K+K
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337

Query: 260  KHKKDKKKRKKKS 272
              +  K     K+
Sbjct: 1338 AEEAKKAAEAAKA 1350



 Score = 32.0 bits (72), Expect = 0.36
 Identities = 27/94 (28%), Positives = 54/94 (57%), Gaps = 1/94 (1%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            ++K KA E +KA   +  +DE+ K EE K+  E   +E++ K  +  K ++ KK ++ K 
Sbjct: 1507 EAKKKADEAKKAE-EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             EE +K E ++  + ++A++ KK ++ + +   K
Sbjct: 1566 AEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599



 Score = 31.6 bits (71), Expect = 0.53
 Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 1/92 (1%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
            KA+E RKA     +      EEE K EE  + +  K+A+  KK ++ KK  ++  K EEE
Sbjct: 1189 KAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248

Query: 241  EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
               EE  + +  + A   ++    K +  +K+
Sbjct: 1249 RNNEEIRKFEEARMAHFARRQAAIKAEEARKA 1280



 Score = 31.3 bits (70), Expect = 0.61
 Identities = 24/95 (25%), Positives = 53/95 (55%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            + K KA+E +KA   +    E  K+E E+++ +  +++K+ +++KK    KK+++KK++ 
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              +E++   E+   E++ K+R +  K  K      
Sbjct: 1773 IRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807



 Score = 30.9 bits (69), Expect = 0.94
 Identities = 24/94 (25%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            +   KA E +KA   +  +DE+ K EE+K+  E   +E+  K  +K K ++ KK ++ K 
Sbjct: 1519 EEAKKADEAKKAE-EAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
                + EE ++ +  +  +  K ++++KK + ++
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611



 Score = 30.1 bits (67), Expect = 1.6
 Identities = 24/90 (26%), Positives = 45/90 (50%)

Query: 182  KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            KA+E RKA     +      EE +K+  E   +E+E    +  K ++ +    ++ +   
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272

Query: 242  EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + EE  + D  K+A+++KK  + KK  +KK
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKK 1302



 Score = 28.6 bits (63), Expect = 5.2
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 8/103 (7%)

Query: 178  KSKLKAKEKRKAS---PTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKK 229
            + K KA+E +KA      +    E  K+ EE    E     +E KK K     K ++ K 
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 230  KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K ++  K EEE++K E  +    +E KK ++ KK +++ K K+
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663



 Score = 28.2 bits (62), Expect = 5.5
 Identities = 22/90 (24%), Positives = 46/90 (51%)

Query: 170  VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
             K  E + K++ + K K + +   A  D+   EE +K+E E+       K+ +K  ++ +
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774

Query: 230  KKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
            K+K+    EE +E++E  + + +K+ K   
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 7/73 (9%)

Query: 197 DESDKEEEEKEEGEQTD-SEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           D+    E E+EEGE  + S  E KK RKK +K +KK +K     EE EK   ++      
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-----EEAEKAAAKKKAEAAA 443

Query: 255 AKKRKKHKKDKKK 267
            K +    + KK 
Sbjct: 444 KKAKGPDGETKKV 456



 Score = 34.1 bits (79), Expect = 0.078
 Identities = 18/87 (20%), Positives = 36/87 (41%)

Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
           ++    +    L A+ + +       S    K+  +K+   +  +EKE  ++   KK  +
Sbjct: 381 IEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAK 256
              KK+K  + E K+       EK A+
Sbjct: 441 AAAKKAKGPDGETKKVDPDPLGEKLAR 467



 Score = 28.4 bits (64), Expect = 5.2
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKK 270
           K      EEEE E G    S  E KK RKK +K +KK +K
Sbjct: 390 KPLLAEGEEEEGENGNL--SPAERKKLRKKQRKAEKKAEK 427


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 44.4 bits (105), Expect = 3e-05
 Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 7/95 (7%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KKK 234
           K+K +A+ ++KA   +    E + + +   E ++    K A+ +KK + + K K   K K
Sbjct: 133 KAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK----KAAEAKKKAEAEAKAKAEAKAK 188

Query: 235 SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           +K EE + K E  +  +  EA  + + +       
Sbjct: 189 AKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAA 223



 Score = 41.0 bits (96), Expect = 4e-04
 Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 180 KLKAKEKRKASPTSASSDESD---KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           + K  E+R A+  +A   E      EE++K+  E    +    K K   + +KK K+++K
Sbjct: 90  RQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK 149

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
            + EEE +     +++K+A + KK  + + K K 
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183



 Score = 40.2 bits (94), Expect = 6e-04
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQT-----DSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
           K        +   K E++ EE E+        +KE ++R   +K  K+ ++ +K  EE++
Sbjct: 59  KKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118

Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K+  E     K+A + K   + + ++K K
Sbjct: 119 KQAEEA--KAKQAAEAKAKAEAEAEKKAK 145



 Score = 37.9 bits (88), Expect = 0.003
 Identities = 18/89 (20%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
            + +++  P +    E  K+ E++ E  +     E + R+K  + +   +K +K  E+  
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAE-QARQKELEQRAAAEKAAKQAEQAA 111

Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K+  E+    +EAK   K   + K + + 
Sbjct: 112 KQAEEKQKQAEEAK--AKQAAEAKAKAEA 138



 Score = 36.7 bits (85), Expect = 0.009
 Identities = 22/90 (24%), Positives = 48/90 (53%), Gaps = 1/90 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           ++  +A+EK+K +  + +   ++ + + + E E+   E+  K+ ++  K K   + K K 
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKA 168

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            E ++K E E   ++ EAK + K ++ K K
Sbjct: 169 AEAKKKAEAE-AKAKAEAKAKAKAEEAKAK 197



 Score = 34.4 bits (79), Expect = 0.056
 Identities = 16/70 (22%), Positives = 40/70 (57%), Gaps = 2/70 (2%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
           +  +KE  ++  +EK AK+ ++  K  ++K+K++  EE + K+  E     +   ++K  
Sbjct: 88  QARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQA--EEAKAKQAAEAKAKAEAEAEKKAK 145

Query: 262 KKDKKKRKKK 271
           ++ KK+ +++
Sbjct: 146 EEAKKQAEEE 155



 Score = 34.0 bits (78), Expect = 0.072
 Identities = 23/89 (25%), Positives = 40/89 (44%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            K K   + K K +  + +  E++ E++ KEE ++   E+   K     K K  + KK  
Sbjct: 116 EKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKA 175

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
             E + K E +     +EAK + +  K K
Sbjct: 176 EAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 29.8 bits (67), Expect = 1.3
 Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 8/91 (8%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            A  ++           + KE+E +++ EQ    +EA+K++  ++ ++K+ ++    E+ 
Sbjct: 46  GAVAQQANRIQQQKKPAAKKEQERQKKLEQQ--AEEAEKQRAAEQARQKELEQRAAAEKA 103

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K+       E+ AK+ ++ +K  ++ K K 
Sbjct: 104 AKQA------EQAAKQAEEKQKQAEEAKAKQ 128



 Score = 28.3 bits (63), Expect = 5.0
 Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 1/72 (1%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           D     ++       +K A K+++ ++ KK +++  + E++   E+  Q + E+ A   K
Sbjct: 44  DPGAVAQQANRIQQQKKPAAKKEQ-ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEK 102

Query: 260 KHKKDKKKRKKK 271
             K+ ++  K+ 
Sbjct: 103 AAKQAEQAAKQA 114


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 43.4 bits (103), Expect = 6e-05
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 221 RKKHKKDK-KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           R K K    KK  K SK    +  E+ E+   + + KKR +  K+  KR+K S
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPS 429



 Score = 34.5 bits (80), Expect = 0.048
 Identities = 27/90 (30%), Positives = 40/90 (44%), Gaps = 15/90 (16%)

Query: 157 VDKMSRPLQDV---RVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD 213
           + K+ R +++    RV+   EL  K+K    EK+   P   S     K  E+KE+     
Sbjct: 357 LGKIERYIEEPLKARVID--ELRPKTK-APSEKKTGKP---SKKVLAKRAEKKEK----- 405

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
            EKE  K KK  +D K   K+ K     E+
Sbjct: 406 -EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 33.4 bits (77), Expect = 0.12
 Identities = 17/72 (23%), Positives = 25/72 (34%), Gaps = 22/72 (30%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           K   EK+ G    S+K   KR + K+ +K+K K  K                   + R  
Sbjct: 381 KAPSEKKTG--KPSKKVLAKRAEKKEKEKEKPKVKK-------------------RHRDT 419

Query: 261 HKKDKKKRKKKS 272
                K+RK   
Sbjct: 420 KNI-GKRRKPSG 430


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 41.6 bits (98), Expect = 1e-04
 Identities = 24/70 (34%), Positives = 40/70 (57%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
             S   + S+ S+KE   K E E    E+E K++KK K+ KK+KK+K   +E+  + +G 
Sbjct: 124 PPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183

Query: 248 QTDSEKEAKK 257
           +   +K+ KK
Sbjct: 184 KKKKKKKKKK 193



 Score = 33.9 bits (78), Expect = 0.040
 Identities = 18/68 (26%), Positives = 37/68 (54%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
              L S+ +  EK   +     ++  ++E++EK++ ++   EK+ KK KK K  + K  K
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184

Query: 234 KSKYEEEE 241
           K K ++++
Sbjct: 185 KKKKKKKK 192


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 9/96 (9%)

Query: 185 EKRKASPTS--------ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           +K K  P S         S +      EE+ +   T S K+ KK  K+ + KKK +KK K
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENK-VATTSTKKDKKEDKNNESKKKSEKKKK 75

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            ++E+++ + E          +K  K  KK  K K 
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111



 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 27/111 (24%), Positives = 48/111 (43%), Gaps = 8/111 (7%)

Query: 158 DKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASP-------TSASSDESDKEEEEKEEGE 210
             +S    +V V+    L   S+ + K    ++         + S  +S+K++++K+E +
Sbjct: 22  QPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKK 81

Query: 211 QTDSEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +  SE E K   K  KK KK KKK  K +  E+ +      +E   K    
Sbjct: 82  EPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 23/84 (27%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
           A+P   +     K+++++   E    +KE ++++K K  KKK KKK    ++++K++ + 
Sbjct: 49  ATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK----KDKDKDKKDD 104

Query: 249 TDSEKEAKKRKKHKKDKKKRKKKS 272
              +K  KK +K  +DK +   KS
Sbjct: 105 KKDDKSEKKDEKEAEDKLEDLTKS 128



 Score = 37.0 bits (86), Expect = 0.005
 Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 6/89 (6%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEEE 241
           AK+K+K        +E +K ++E EE ++   +K+  K+KK K KDKK  KK  K E+++
Sbjct: 60  AKKKKKELA-----EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           EKE  ++ +   ++         + K +K
Sbjct: 115 EKEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 29.3 bits (66), Expect = 1.5
 Identities = 14/64 (21%), Positives = 28/64 (43%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             K+K + + K            K++++K++ +    +K  KK +K  +DK +   KS  
Sbjct: 71  IEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130

Query: 238 EEEE 241
           E   
Sbjct: 131 ETLS 134


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 41.3 bits (97), Expect = 2e-04
 Identities = 19/87 (21%), Positives = 41/87 (47%), Gaps = 1/87 (1%)

Query: 175 LVLKSKLKAKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           L   S    +          S D+   E EE KEE ++  + ++ + +   +K+ ++ ++
Sbjct: 34  LFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +++ E+EE  +E E+   EK     +K
Sbjct: 94  ENEEEDEESSDENEKETEEKTESNVEK 120



 Score = 37.0 bits (86), Expect = 0.005
 Identities = 20/83 (24%), Positives = 41/83 (49%), Gaps = 6/83 (7%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
               +A    + +  S   + ++ EE ++EE E  +SE +  K    K+D++ +++  + 
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98

Query: 238 EEEE------EKEEGEQTDSEKE 254
           +EE       E EE  +++ EKE
Sbjct: 99  DEESSDENEKETEEKTESNVEKE 121



 Score = 32.8 bits (75), Expect = 0.11
 Identities = 19/83 (22%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
           S  S  +   ++E ++ ++ E  + E  KE +K   + +DK+ K    K +EE E+E  E
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 248 QTDSEKEAKKRKKHKKDKKKRKK 270
           + +   +  +++  +K +   +K
Sbjct: 98  EDEESSDENEKETEEKTESNVEK 120



 Score = 31.2 bits (71), Expect = 0.34
 Identities = 12/74 (16%), Positives = 40/74 (54%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
                ++    EQ   + + ++  + ++ K+++K+ +  E++E+K + E+ D E E +  
Sbjct: 37  PSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96

Query: 259 KKHKKDKKKRKKKS 272
           ++ ++   + +K++
Sbjct: 97  EEDEESSDENEKET 110


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 41.9 bits (98), Expect = 2e-04
 Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 4/97 (4%)

Query: 178 KSKLKAKEKRKASPTSASSD----ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           K +      + A  +S   D    E+ K E EK + E   +++EA K K HK    K++ 
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQES 312

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           K+  +E E+KE   Q   E  A+  +K K   + +  
Sbjct: 313 KASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPT 349



 Score = 40.0 bits (93), Expect = 0.001
 Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 5/99 (5%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL  K     K ++KA     ++D+   E  +K++  +   +       K  K   + +K
Sbjct: 221 ELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQK 280

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +     E EK + E   +++EA K K HK    K++ K+
Sbjct: 281 R-----EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKA 314



 Score = 32.3 bits (73), Expect = 0.27
 Identities = 18/86 (20%), Positives = 45/86 (52%), Gaps = 1/86 (1%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
           K + S   A   +  KEE +K++ +   ++++A   + +  DK++ + + K +E +   +
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDN-ADKQRDEVRQKQQEAKNLPK 262

Query: 246 GEQTDSEKEAKKRKKHKKDKKKRKKK 271
              T S KE K+  +++K + ++ + 
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQI 288


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 42.0 bits (98), Expect = 3e-04
 Identities = 28/146 (19%), Positives = 56/146 (38%), Gaps = 5/146 (3%)

Query: 120 SQFFITTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKS 179
           +QF    +    +  +    G                +   SR       +  +E   ++
Sbjct: 7   TQFLSFPRCNADVPCLLQSLGFNKFRGERWRLQNKFLLAAGSRKKI-ESALAVDEEPDEN 65

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
              +K+K   S   A+     +  E  EEG +    ++A   K+ KK  ++ ++K+    
Sbjct: 66  GAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAAS 125

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDK 265
            + +EE     +EK+ +KR+K KK  
Sbjct: 126 SDVEEEK----TEKKVRKRRKVKKMD 147



 Score = 33.1 bits (75), Expect = 0.16
 Identities = 20/74 (27%), Positives = 36/74 (48%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           S K+ E     ++   E  A  +KK  +  K+  KK+  E  E  EEG +    ++A   
Sbjct: 48  SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107

Query: 259 KKHKKDKKKRKKKS 272
           K+ KK  ++ ++K+
Sbjct: 108 KESKKTPRRTRRKA 121



 Score = 31.2 bits (70), Expect = 0.63
 Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 8/77 (10%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K   K   + +    +ASSD  +++ E+K            +K KK  +D + +  +S+ 
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKR--------RKVKKMDEDVEDQGSESEV 159

Query: 238 EEEEEKEEGEQTDSEKE 254
            + EE E     ++E E
Sbjct: 160 SDVEESEFVTSLENESE 176



 Score = 31.2 bits (70), Expect = 0.68
 Identities = 17/108 (15%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 161 SRPLQDVR-VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK 219
            R   DV  +++          + + + K    + S  + +      EE ++  +  + K
Sbjct: 13  PRCNADVPCLLQSLGFNKFRGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKK 72

Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
             +  K+  KK   +     EE  E     D+  + + +K  ++ ++K
Sbjct: 73  PTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRK 120


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 40.0 bits (94), Expect = 3e-04
 Identities = 25/102 (24%), Positives = 55/102 (53%), Gaps = 3/102 (2%)

Query: 172 CNELVLKSKLKAKEKRKASPTSASSDES---DKEEEEKEEGEQTDSEKEAKKRKKHKKDK 228
            ++     K   K+K++++ +  S+      D+++E   E ++ +  K+AK++ +  K K
Sbjct: 19  KSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLK 78

Query: 229 KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
            KK++  K +E+EE+      ++EKE  + +K K + K  K+
Sbjct: 79  AKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKE 120


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 40.8 bits (96), Expect = 3e-04
 Identities = 22/67 (32%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
           S + +++EE++EE  +  ++ +AKK  K K ++K+K K+ K EE+  +E  E T  ++ A
Sbjct: 36  SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREK-EEKGLRELEEDTPEDELA 94

Query: 256 KKRKKHK 262
           +K +  K
Sbjct: 95  EKLRLRK 101



 Score = 31.9 bits (73), Expect = 0.23
 Identities = 18/92 (19%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           E       +   D+ D E+E+ +  +  D E++ +K ++  K   K K K   + + E++
Sbjct: 10  EPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEK 69

Query: 245 EGEQTDSEKEAKKRKKH-----KKDKKKRKKK 271
           E  + + E++  +  +      +  +K R +K
Sbjct: 70  EKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 21/66 (31%), Positives = 38/66 (57%)

Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
              E  ++E + +K++  +K  KKK KKSK + +++     +  SE +  + K++ K KK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 267 KRKKKS 272
           K+KKK 
Sbjct: 61  KKKKKK 66



 Score = 36.2 bits (84), Expect = 0.013
 Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE--EEEKEEGEQTDSEKEAKKRK 259
              E  E E    +++ K++   KK+KK KK     +    E   E ++ ++++  KK+K
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 260 KHKKDKKKRKKK 271
           K KK KKK+   
Sbjct: 61  KKKKKKKKKNLG 72



 Score = 35.5 bits (82), Expect = 0.023
 Identities = 19/88 (21%), Positives = 32/88 (36%), Gaps = 5/88 (5%)

Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
           +      K + K K  +K +  S    + D     +       SE +  + K++ K KKK
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELI-----SENQEAENKQNNKKKKK 61

Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           KKKK K +   E  +         +   
Sbjct: 62  KKKKKKKKNLGEAYDLAYDLPVVWSSAA 89



 Score = 34.7 bits (80), Expect = 0.049
 Identities = 23/66 (34%), Positives = 35/66 (53%)

Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
           E GE    +++ + ++K  K K KK KK   +++    E    + E E K+  K KK KK
Sbjct: 4   ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63

Query: 267 KRKKKS 272
           K+KKK 
Sbjct: 64  KKKKKK 69


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 39.4 bits (92), Expect = 4e-04
 Identities = 16/52 (30%), Positives = 27/52 (51%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           +S+  +E + EEE  +E EQ D  +   + K+ KK +  K K  K + + E 
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143



 Score = 31.3 bits (71), Expect = 0.26
 Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK----KDKKKKK 232
           +  +L + +  +      S+DE+++E+  + + E  + +K    + K +    K + KK 
Sbjct: 87  IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146

Query: 233 KKSK 236
           K SK
Sbjct: 147 KPSK 150



 Score = 29.0 bits (65), Expect = 1.7
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 17/74 (22%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           S  +EEE+ E E TD  +        ++D  + K +SK  E++++E         + K  
Sbjct: 93  SSDDEEEETEEESTDETE--------QEDPPETKTESK--EKKKRE-------VPKPKTE 135

Query: 259 KKHKKDKKKRKKKS 272
           K+  K + K+ K S
Sbjct: 136 KEKPKTEPKKPKPS 149


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 40.5 bits (95), Expect = 6e-04
 Identities = 21/97 (21%), Positives = 46/97 (47%), Gaps = 13/97 (13%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
                   SD+    E  + E E  +     K++ K +++K+K++K+ + E +EEK+  +
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304

Query: 248 QT-------------DSEKEAKKRKKHKKDKKKRKKK 271
           +                +++A+ RKK ++ ++  KKK
Sbjct: 305 KLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKK 341



 Score = 27.7 bits (62), Expect = 6.4
 Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL  ++K + + K+K +    +  +   +E  ++E  +   +++ K+R + KK K++K  
Sbjct: 291 ELEREAKEEKQLKKKLA--QLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLG 348

Query: 234 KSKYEE 239
           K KY E
Sbjct: 349 KHKYPE 354


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.7 bits (95), Expect = 7e-04
 Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 18/88 (20%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL  +++ +AK+  K       SD  D EEEE+EE E+ + E+E +              
Sbjct: 836 ELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEE-------------- 881

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
               EEEEE+EE E+  S +  + R+K 
Sbjct: 882 ----EEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 34.6 bits (79), Expect = 0.058
 Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 3/84 (3%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQ---TDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
              K      S  ++D  E + E GEQ    +++ EAK+ +K              EEEE
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDK 265
           E+EE E+ + E+E ++ ++ + ++
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 33.0 bits (75), Expect = 0.18
 Identities = 22/85 (25%), Positives = 35/85 (41%), Gaps = 5/85 (5%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKK 232
           EL +K +L    +R  +   A  D S  +  E E  GE+T  E E     + +  ++   
Sbjct: 609 ELWVKEQL---SRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGG 665

Query: 233 KKSKYEEEEEKEEGEQT-DSEKEAK 256
           +  +  E E K E E   +   E K
Sbjct: 666 EAEQEGETETKGENESEGEIPAERK 690



 Score = 32.7 bits (74), Expect = 0.22
 Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 7/97 (7%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKE----EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           K + +A+E      T A   E + E    EE +E  ++ + E E K   + + D+K+ + 
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           + + E E +++E E    E +A +  + K D+    K
Sbjct: 767 EGETEAEGKEDEDE---GEIQAGEDGEMKGDEGAEGK 800



 Score = 32.3 bits (73), Expect = 0.33
 Identities = 15/71 (21%), Positives = 33/71 (46%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
            E + E E KE   + ++E E  + +   + +  + +      EE +E  ++ + E E K
Sbjct: 693 QEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGK 752

Query: 257 KRKKHKKDKKK 267
              + + D+K+
Sbjct: 753 HEVETEGDRKE 763



 Score = 29.2 bits (65), Expect = 3.0
 Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
           +ES+ E   + +GEQ    +   K   HK + + ++ + + E E E  E E    E E  
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG---EIETG 735

Query: 257 KRKKHKKDK 265
           +  +  +D+
Sbjct: 736 EEGEEVEDE 744



 Score = 27.7 bits (61), Expect = 9.3
 Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 1/79 (1%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
            + + + +E E   E E  + E+   + ++  + + K + +S+ E   E++  ++ + E 
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEI 699

Query: 254 EAKKRKKHKKDKKKRKKKS 272
           EA K   HK + +  + + 
Sbjct: 700 EA-KEADHKGETEAEEVEH 717


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 39.9 bits (93), Expect = 8e-04
 Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 2/91 (2%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K K  A+ K+KA   + +++E+  + E      +  +E EAK   +  K + + K K++ 
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAA--AKKKAEAEAKAAAEKAKAEAEAKAKAEK 219

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
           + E   EE    + +K A K K  K     +
Sbjct: 220 KAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250



 Score = 37.6 bits (87), Expect = 0.005
 Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 3/93 (3%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           KAK   +A+   A+++   K EE  +  E+  ++ EA   KK  + + K   +    E E
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212

Query: 242 EKEEGEQT---DSEKEAKKRKKHKKDKKKRKKK 271
            K + E+     +E++A   KK    K K  K 
Sbjct: 213 AKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKA 245



 Score = 37.2 bits (86), Expect = 0.006
 Identities = 25/120 (20%), Positives = 53/120 (44%), Gaps = 7/120 (5%)

Query: 159 KMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
           K  +  +  R+ +  +  LK++ + K+  +A   +    +  +E+  K   EQ    + A
Sbjct: 93  KPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAA 152

Query: 219 K-------KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K        + K   + KKK +++    EE K + E   ++K+A+   K   +K K + +
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212



 Score = 33.0 bits (75), Expect = 0.17
 Identities = 15/89 (16%), Positives = 45/89 (50%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           K +++RK      + +   K+  E+E  +Q + E+   + ++ + ++ +K+ + + +++E
Sbjct: 76  KGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQE 135

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           E+      + +K+A+  K     +  + K
Sbjct: 136 EQARKAAAEQKKKAEAAKAKAAAEAAKLK 164



 Score = 32.6 bits (74), Expect = 0.21
 Identities = 20/91 (21%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           + +++S          KEE+  EE +   + ++ + ++  K+  K ++++ + EE E++ 
Sbjct: 68  QSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127

Query: 245 EGEQTDSEKEAKK---RKKHKKDKKKRKKKS 272
           + EQ   E++A+K    +K K +  K K  +
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158



 Score = 32.2 bits (73), Expect = 0.29
 Identities = 23/98 (23%), Positives = 44/98 (44%), Gaps = 5/98 (5%)

Query: 180 KLKAKEKRKASPTSASSDESDK-----EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           K  A++K+KA    A +           E +K+  E   + +EAK + +    KKK + +
Sbjct: 140 KAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAE 199

Query: 235 SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +K   E+ K E E     ++  +    +K   ++KK +
Sbjct: 200 AKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAA 237



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 13/109 (11%)

Query: 177 LKSKLKAKEKR----KASPTSASSDESDKEEEEKEEGEQTDSEKE------AKKRKKHKK 226
           LK K  A+++R    +     A   +   EE EK+   +   ++E      A+++KK + 
Sbjct: 92  LKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEA 151

Query: 227 DKKKKK---KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K K      K K   E +K+  E   + +EAK + +    KKK + ++
Sbjct: 152 AKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEA 200



 Score = 28.4 bits (63), Expect = 4.8
 Identities = 15/75 (20%), Positives = 35/75 (46%), Gaps = 1/75 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K++  A +K+  +   A+++++  E E K + E+  +E  A+++   +K K   K K+  
Sbjct: 186 KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKK-AEAAAEEKAAAEKKKAAAKAKADK 244

Query: 238 EEEEEKEEGEQTDSE 252
                K    +  + 
Sbjct: 245 AAAAAKAAERKAAAA 259


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 20/94 (21%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            +K KA+ + K +  +A    ++ +++ + E     +  EAKK K   +   K   ++K 
Sbjct: 144 AAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE-AAKKAAAEAKK-KAEAEAAAKAAAEAKK 201

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           + E E ++    +++K+A    K    K   + K
Sbjct: 202 KAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAK 235



 Score = 35.2 bits (81), Expect = 0.031
 Identities = 17/85 (20%), Positives = 37/85 (43%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           +   KR          +  +E ++K+  EQ   ++  K+R   ++ KK+ ++ +K    +
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKK 266
           +K+  E       A K K   + K+
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKR 155



 Score = 34.0 bits (78), Expect = 0.064
 Identities = 19/83 (22%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 196 SDESDKEEEEKEEGEQTDSE---KEAKKRKKHKKDKKKK----KKKSKYEEEEEKEEGEQ 248
                 EE+ K++ +Q   E   K+A ++++ K+ +K++    ++K + EE  ++   +Q
Sbjct: 72  KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131

Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
             +E+ A K     K K + + K
Sbjct: 132 KQAEEAAAKAAAAAKAKAEAEAK 154



 Score = 34.0 bits (78), Expect = 0.074
 Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK---KSKYEEEEEKEEGEQTDSEK 253
            E ++   ++++ +  ++ K+A  ++K  ++   K     K+K E E ++       +  
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA 165

Query: 254 EAKKRKKHKKDKKK 267
           EAKK+ + +  KK 
Sbjct: 166 EAKKKAEAEAAKKA 179



 Score = 32.9 bits (75), Expect = 0.15
 Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            +K  A ++++A   +A +  + K + E E      + K+A    K KK + +  KK+  
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK-KKAEAEAAKKAAA 181

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
           E +++ E      +  EAKK+ + +  KK
Sbjct: 182 EAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210



 Score = 32.5 bits (74), Expect = 0.21
 Identities = 21/93 (22%), Positives = 43/93 (46%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           ++ LK K+  +A+  +A++ ++  E E K             K+K   +  KK   ++K 
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK 185

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           + E E       +++K+A+   K K   + +KK
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKK 218



 Score = 31.7 bits (72), Expect = 0.37
 Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           E   K+  +AK+K +A   + ++ E+ K+ E E ++    +++K+A    K    K   +
Sbjct: 174 EAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233

Query: 233 KKSKYEEEEEKEEGEQTDSEKEA 255
            K+  E+    +  E+  + K A
Sbjct: 234 AKAAAEKAAAAKAAEKAAAAKAA 256



 Score = 31.3 bits (71), Expect = 0.50
 Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 9/88 (10%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
             KA  + K    + ++ ++  E ++K E E   ++K+A    K K   + K   +K   
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE---AKKKAAAEAKKKAAAEAKAAAAKAAA 232

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           E +        +EK A  +   K    K
Sbjct: 233 EAKA------AAEKAAAAKAAEKAAAAK 254



 Score = 30.9 bits (70), Expect = 0.60
 Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 4/98 (4%)

Query: 178 KSKLKAKEKRK-ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KK 233
           K +L A+E++K A   +  +    K+ EE        ++ +A+   K      KK   + 
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K K E E  K+   +   + EA+   K   + KK+ + 
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEA 205



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 16/58 (27%), Positives = 37/58 (63%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           EQ + +++ +K  K  ++++KKK++ + EE ++K+  EQ   ++  K+R   ++ KK+
Sbjct: 62  EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119



 Score = 28.2 bits (63), Expect = 5.2
 Identities = 15/79 (18%), Positives = 36/79 (45%), Gaps = 7/79 (8%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           +   E+E  ++ E+ +     +++K+ ++  K+   K K  EE   +      ++ EA+ 
Sbjct: 95  KQAAEQERLKQLEK-ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEA 153

Query: 258 R------KKHKKDKKKRKK 270
           +      KK   + KK+ +
Sbjct: 154 KRAAAAAKKAAAEAKKKAE 172



 Score = 28.2 bits (63), Expect = 5.4
 Identities = 13/75 (17%), Positives = 32/75 (42%), Gaps = 1/75 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ-TDSEKEAKKRKKHKKDKKKKKKKSK 236
           K K +A+   KA+  +    E++ +++   E ++   +E +A   K   + K   +K + 
Sbjct: 184 KKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243

Query: 237 YEEEEEKEEGEQTDS 251
            +  E+    +    
Sbjct: 244 AKAAEKAAAAKAAAE 258



 Score = 27.8 bits (62), Expect = 6.3
 Identities = 14/66 (21%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
           ++   Q   +K AK R + ++ KK++++  + ++++  E+      EKE    ++ KK  
Sbjct: 62  EQYNRQQQQQKSAK-RAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120

Query: 266 KKRKKK 271
           ++  K+
Sbjct: 121 EEAAKQ 126


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 39.9 bits (93), Expect = 0.001
 Identities = 20/65 (30%), Positives = 34/65 (52%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           E E E+   T+ E E KK+K+ K  +K+ KK    ++E + +   Q  S+     +K  K
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 263 KDKKK 267
           K +K+
Sbjct: 65  KSRKR 69



 Score = 36.8 bits (85), Expect = 0.011
 Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 16/90 (17%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD----------------KKK 230
            +    +     +++E E K++ E+   EKE KK K  +K+                 KK
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
            +KKS+  + E++   +  D +    ++K+
Sbjct: 62  SEKKSRKRDVEDENPEDFIDPDTPFGQKKR 91



 Score = 36.4 bits (84), Expect = 0.015
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
           S +  E++K+   +EE E+   ++E  K K+ KK K  +K+     + ++  +G     +
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 253 KEAKKRKKHKKDK 265
            E K RK+  +D+
Sbjct: 62  SEKKSRKRDVEDE 74



 Score = 33.3 bits (76), Expect = 0.12
 Identities = 21/65 (32%), Positives = 39/65 (60%), Gaps = 8/65 (12%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK------KK 267
           +E EA+K+   +++ ++KKKK   E+ +EKE  +   ++KEAK + + ++        KK
Sbjct: 4   TESEAEKKILTEEELERKKKKE--EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 268 RKKKS 272
            +KKS
Sbjct: 62  SEKKS 66



 Score = 32.9 bits (75), Expect = 0.19
 Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 209 GEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
           G +T+SE E K   + + ++KKKK++   E+E +K +  Q   E +AK + +   D    
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQ--KEAKAKLQAQQASDGTNV 58

Query: 269 KKKS 272
            KKS
Sbjct: 59  PKKS 62


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.6 bits (92), Expect = 0.001
 Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 7/77 (9%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA- 255
            E D++ EE EE +  +    +KK KK KK K KK    K + +   ++ + +D + E  
Sbjct: 314 IEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG-DDSDDSDIDGEDS 372

Query: 256 -----KKRKKHKKDKKK 267
                 K++K  K ++ 
Sbjct: 373 VSLVTAKKQKEPKKEEP 389



 Score = 38.4 bits (89), Expect = 0.003
 Identities = 22/106 (20%), Positives = 40/106 (37%), Gaps = 11/106 (10%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEE-----------GEQTDSEKEAKKRKKHKK 226
           K     K+ ++     A   +SD  ++E  E            +  + E +       K 
Sbjct: 253 KKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKP 312

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           + ++ +   + EEE+ +EEG  +   K+ KK K  K    K    S
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358



 Score = 33.8 bits (77), Expect = 0.096
 Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 20/114 (17%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK-------------------EA 218
           + K K K+K+ A       D+   +    ++ ++ DS+                    + 
Sbjct: 238 EEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDP 297

Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKRKKK 271
           ++R+     +   K + + +E+ E+ E E+ + E   +KK KK KK K K+   
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351



 Score = 33.4 bits (76), Expect = 0.11
 Identities = 17/58 (29%), Positives = 26/58 (44%)

Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           D E E +K    +    K K     +E++  E  +  +   E K +KK KK  K +KK
Sbjct: 197 DEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKK 254



 Score = 31.1 bits (70), Expect = 0.67
 Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 2/69 (2%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           K K          DESDK  E  +  E+   +K+ K  K  KK    KK K   +++ ++
Sbjct: 214 KIKDLEGDDEDDGDESDKGGE--DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271

Query: 244 EEGEQTDSE 252
            + +  D E
Sbjct: 272 YDSDDGDDE 280



 Score = 28.4 bits (63), Expect = 4.2
 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE--EEKEEGEQTDSEK 253
           E D E++  E  +  +   E K +KK KK  K KKK    ++      ++ ++ DS+ 
Sbjct: 219 EGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDD 276


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 18/77 (23%), Positives = 40/77 (51%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           S +E ++E  +K  G +    +  +K  + KK KK+ K + K ++E ++EE  + + E +
Sbjct: 407 SDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466

Query: 255 AKKRKKHKKDKKKRKKK 271
            +K       + ++ +K
Sbjct: 467 VEKVANKLLKRSEKAQK 483



 Score = 37.7 bits (88), Expect = 0.005
 Identities = 22/88 (25%), Positives = 44/88 (50%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           EK   S      ++++ +E+++ + E+   ++E  K +K      K+ +K++ EEEEE+ 
Sbjct: 431 EKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEEL 490

Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           + E    +  +   K  KK   K+K  S
Sbjct: 491 DEENPWLKTTSSVGKSAKKQDSKKKSSS 518



 Score = 35.0 bits (81), Expect = 0.034
 Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 3/94 (3%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+ + +AK ++ A+     S+++ KEEEE    E  +     K      K  KK+  K K
Sbjct: 459 LEDEEEAKVEKVANKLLKRSEKAQKEEEE---EELDEENPWLKTTSSVGKSAKKQDSKKK 515

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
              + +K   + + +  + KK+KK +K       
Sbjct: 516 SSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDD 549



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 23/106 (21%), Positives = 39/106 (36%), Gaps = 23/106 (21%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK--------------------- 228
           S +    DE    EEE E+ ++ D + E     + K  K                     
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQR 381

Query: 229 --KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              +KK+++  E EE + E E  +   E +  +  KK+  +RK   
Sbjct: 382 AEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 16/75 (21%), Positives = 41/75 (54%), Gaps = 1/75 (1%)

Query: 197 DESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
           +E+D E EE + E E  +   E +  +  KK+  ++K   +  E+E + +  + +++ E 
Sbjct: 388 EENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEF 447

Query: 256 KKRKKHKKDKKKRKK 270
           K++K+  ++++   +
Sbjct: 448 KEKKESDEEEELEDE 462



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 21/71 (29%), Positives = 36/71 (50%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
           E  +KEE +    E   +   + + D+++ ++ SK      K   E  + E E+KK KK 
Sbjct: 383 EARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKE 442

Query: 262 KKDKKKRKKKS 272
            K++ K KK+S
Sbjct: 443 NKNEFKEKKES 453



 Score = 28.1 bits (63), Expect = 6.6
 Identities = 18/89 (20%), Positives = 30/89 (33%), Gaps = 10/89 (11%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
            S D+   EE++ +E + +            KK  +KKK     E  +E E    ++   
Sbjct: 1   MSDDDESDEEDDDDEDKHSKLLSAISSLGGEKKKDEKKKADRSEESGKESEFNLPSEGAG 60

Query: 254 EA----------KKRKKHKKDKKKRKKKS 272
                       +     KK  K+  K S
Sbjct: 61  GKLTLSDLLGSIQDSSSLKKSLKQLDKIS 89


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 38.6 bits (90), Expect = 0.002
 Identities = 18/58 (31%), Positives = 32/58 (55%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           KE  K  P     +E ++++EE+E+    + E + ++ K+ KK K KK K++  E E 
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 37.8 bits (88), Expect = 0.004
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
           +E  +E++E+EE      E+  ++ +K +K KK KK K    E E 
Sbjct: 41  EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 36.3 bits (84), Expect = 0.016
 Identities = 18/51 (35%), Positives = 32/51 (62%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
             +E ++E+EEK+E E+  ++KE +  ++ +K++KKKK K   E   E E 
Sbjct: 36  PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 33.6 bits (77), Expect = 0.11
 Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 11/64 (17%)

Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           E++ E E  D E+E +K +K ++++K   K+ + +EEEEKEE           K+KK KK
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE-----------KKKKTKK 76

Query: 264 DKKK 267
            K+ 
Sbjct: 77  VKET 80



 Score = 31.3 bits (71), Expect = 0.49
 Identities = 19/61 (31%), Positives = 35/61 (57%), Gaps = 11/61 (18%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           E EKE       ++E ++ K+ KK++++K    + E +EE+E+      E++ KK KK K
Sbjct: 30  EVEKEV-----PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK------EEKKKKTKKVK 78

Query: 263 K 263
           +
Sbjct: 79  E 79



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 4/48 (8%)

Query: 228 KKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR----KKHKKDKKKRKKK 271
           K+ +K+    EEEEEKEE ++ + +   K+     ++ K++KKK+ KK
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 38.7 bits (91), Expect = 0.002
 Identities = 20/97 (20%), Positives = 40/97 (41%), Gaps = 1/97 (1%)

Query: 177 LKSKLKAKEKRKASPTSA-SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           L  KLK       S +S    +E+ KEE ++E  E+  +E E     + +++ K +   +
Sbjct: 20  LLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAA 79

Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              EE  +       + + A+       +   R+ K+
Sbjct: 80  APAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKA 116



 Score = 37.9 bits (89), Expect = 0.004
 Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 1/98 (1%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL + SK +  EK K       S  S  EEEE  + E     +E  K +  +    + ++
Sbjct: 12  ELGVSSK-ELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEE 70

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           ++K E        E  ++   A+   +  +D+  R  +
Sbjct: 71  EAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAE 108


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 38.6 bits (90), Expect = 0.003
 Identities = 21/102 (20%), Positives = 50/102 (49%), Gaps = 6/102 (5%)

Query: 157 VDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGE-----Q 211
            D++ + +    +   +  ++ S  K+    ++SP +++SD+  K+++ K         +
Sbjct: 377 YDQIEKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAE 436

Query: 212 TDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           +  + E    KK KK++KK + KS  +   + + G + +S K
Sbjct: 437 SIPDDEEDAPKKGKKNQKKGRDKSS-KVPSDSKAGGKKESVK 477



 Score = 33.6 bits (77), Expect = 0.12
 Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 1/83 (1%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
           + ++     S+K     E      +  +  K+KK K    K        ++EE    +  
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK 450

Query: 250 DSEKEAK-KRKKHKKDKKKRKKK 271
            ++K+ + K  K   D K   KK
Sbjct: 451 KNQKKGRDKSSKVPSDSKAGGKK 473



 Score = 33.3 bits (76), Expect = 0.14
 Identities = 24/96 (25%), Positives = 36/96 (37%), Gaps = 8/96 (8%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           +K      EK   + +  +S        EK  G    +E         K  KKKK K + 
Sbjct: 373 IKGIYDQIEKEMDAFSIQASSAGLIGSSEKSLG---SNESSPAASNSDKGSKKKKGKSTS 429

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
                 K    ++  + E    KK KK++KK + KS
Sbjct: 430 -----TKGGTAESIPDDEEDAPKKGKKNQKKGRDKS 460



 Score = 28.6 bits (64), Expect = 4.0
 Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 16/68 (23%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---------KKK 234
           K+K K++ T   + ES  ++EE       D+ K+ KK +K  +DK  K         KK+
Sbjct: 422 KKKGKSTSTKGGTAESIPDDEE-------DAPKKGKKNQKKGRDKSSKVPSDSKAGGKKE 474

Query: 235 SKYEEEEE 242
           S   +E+ 
Sbjct: 475 SVKSQEDN 482


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 38.2 bits (89), Expect = 0.004
 Identities = 25/95 (26%), Positives = 49/95 (51%), Gaps = 3/95 (3%)

Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           N+L L +  K  E+ K S T  S D+  ++E  KE  ++     +     K + D  ++ 
Sbjct: 342 NDLTLLALTKLYEEVK-SNTDLSGDK--RQELLKEYNKKLQDYTKKLGEVKDETDASEEA 398

Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           +    EE+ ++EE E+   E+  + ++K +KD++K
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 34.4 bits (79), Expect = 0.053
 Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 1/55 (1%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKKKK 233
               K       T AS +   K +EEK + E+ + + KE     K K+ K ++KK
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 32.8 bits (75), Expect = 0.17
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK----EAKKRKKHKKDKKKR 268
             ++  K+  K  +D  KK  + K E +  +E   +   EK    E +K++K + D+ K 
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425

Query: 269 KKK 271
           K++
Sbjct: 426 KRQ 428



 Score = 31.7 bits (72), Expect = 0.43
 Identities = 13/50 (26%), Positives = 26/50 (52%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
           L       +  + +   A  ++  +EE EK++ EQ D +KE +++ + KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 29.8 bits (67), Expect = 1.8
 Identities = 10/47 (21%), Positives = 29/47 (61%), Gaps = 2/47 (4%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           DE+D  EE + + +  + + + ++ +K +K++  + K+ + ++E +K
Sbjct: 390 DETDASEEAEAKAK--EEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 37.4 bits (87), Expect = 0.004
 Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 6/77 (7%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
           S   +    +EEE E+  Q +   + KKR K K  K+  KKK K      K+       +
Sbjct: 61  SEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKK------KDPTAAKSPK 114

Query: 253 KEAKKRKKHKKDKKKRK 269
             A + KK  +      
Sbjct: 115 AAAPRPKKKSERISWAP 131



 Score = 32.7 bits (75), Expect = 0.13
 Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 1/87 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
             K   K+K+K  PT+A S ++     + K E            R+K  +    + K++ 
Sbjct: 94  AYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEAT 153

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           +E  +E+E   +    K  K+++K K+
Sbjct: 154 HERLKEREIRRKKIQAKARKRKEKKKE 180



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
               +SD ++ E +E E  D E+  K+ ++ ++ KKKK+ K+K  +E  K++      +K
Sbjct: 51  EEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKK-----KKK 105

Query: 254 EAKKRKKHKKDKKKRKKKS 272
           +    K  K    + KKKS
Sbjct: 106 DPTAAKSPKAAAPRPKKKS 124



 Score = 28.5 bits (64), Expect = 3.1
 Identities = 22/96 (22%), Positives = 45/96 (46%), Gaps = 2/96 (2%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             ++        SP   SS  S  + +E       + E   KK +   + +K+KKK+ + 
Sbjct: 124 SERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKEL 183

Query: 238 EEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKRKKK 271
            +EE   E ++T+    K  ++ ++ +++KKK K +
Sbjct: 184 TQEERLAEAKETERINLKSLERYEEQEEEKKKAKIQ 219


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 36.8 bits (86), Expect = 0.004
 Identities = 17/59 (28%), Positives = 37/59 (62%), Gaps = 1/59 (1%)

Query: 214 SEKEAKKRKKHK-KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            +K+AKK KK K K +K+ +K +   ++E K+  E+  +EK  + R+ +++ + + ++K
Sbjct: 13  DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 38.0 bits (88), Expect = 0.005
 Identities = 25/89 (28%), Positives = 42/89 (47%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
              ++         ES K+E EKEE       KE K+ +K KK ++++ K    EEEE K
Sbjct: 241 DLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELK 300

Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            E  + +  K   + K  + +K+ +K + 
Sbjct: 301 SELLKLERRKVDDEEKLKESEKELKKLEK 329



 Score = 32.6 bits (74), Expect = 0.23
 Identities = 22/99 (22%), Positives = 47/99 (47%), Gaps = 1/99 (1%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
            L    K   +E+++            KEEEE +        ++    +K K+ +K+ KK
Sbjct: 267 ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKK 326

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             K E ++EKEE E+ + E +  + K+  +++++ + + 
Sbjct: 327 LEK-ELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364



 Score = 32.2 bits (73), Expect = 0.29
 Identities = 18/108 (16%), Positives = 52/108 (48%), Gaps = 2/108 (1%)

Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH 224
           Q ++  +  E   + K + +E+ K+  +    + +++EE+ ++   +   EKE K + + 
Sbjct: 744 QKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE--EKEEKLKAQE 801

Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           ++ +  +++  +  E  E+E+      EK  ++  +    + K ++K 
Sbjct: 802 EELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 849



 Score = 31.9 bits (72), Expect = 0.37
 Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 3/98 (3%)

Query: 173  NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
             E   + + +  E+        S  E    EE  E+ ++ D+++E ++R K     K++ 
Sbjct: 914  EEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEEL 973

Query: 233  KKSK---YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
                     E EEKEE    D  K+ +  ++ K+  ++
Sbjct: 974  GNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLRE 1011



 Score = 31.9 bits (72), Expect = 0.40
 Identities = 18/119 (15%), Positives = 50/119 (42%), Gaps = 1/119 (0%)

Query: 153 ENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQT 212
           +    ++  + LQ+  +    +   +       K +           + E+E K+  ++ 
Sbjct: 273 KENKEEEKEKKLQEEELKLLAKE-EEELKSELLKLERRKVDDEEKLKESEKELKKLEKEL 331

Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             EKE  +  + +  + + K++++ EEEE+ E+ ++   + E +   K K + ++    
Sbjct: 332 KKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSA 390



 Score = 31.5 bits (71), Expect = 0.58
 Identities = 26/105 (24%), Positives = 49/105 (46%), Gaps = 7/105 (6%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
              L+ +L AK+K ++   S+++   ++E E K E E+       +  ++ +   K++KK
Sbjct: 369 LEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKL-LLELSEQEEDLLKEEKK 427

Query: 234 KSKYEEEEEKEEGE------QTDSEKEAKKRKKHKKDKKKRKKKS 272
           +     EE +E  E        + E+  K+  K  KDK + KK  
Sbjct: 428 EELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSE 472



 Score = 31.1 bits (70), Expect = 0.65
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 2/91 (2%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK--HKKDKKKKKKKSKYEEEE 241
           K K +A          +K E E+E     D  K  ++R     +  + ++++    ++E 
Sbjct: 202 KLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEL 261

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           EKEE       KE K+ +K KK +++  K  
Sbjct: 262 EKEEEILAQVLKENKEEEKEKKLQEEELKLL 292



 Score = 30.7 bits (69), Expect = 1.0
 Identities = 20/109 (18%), Positives = 51/109 (46%)

Query: 164 LQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
           LQ++   +  E+    +   KE+   +     + E +KE++ +EE  +  +++E + + +
Sbjct: 243 LQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSE 302

Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             K +++K    +  +E EKE  +     K+ K+  +  + + K  +  
Sbjct: 303 LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIK 351



 Score = 29.2 bits (65), Expect = 2.9
 Identities = 18/97 (18%), Positives = 45/97 (46%)

Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           +   K +    +        ++E    E++ +E E+ + +   KK ++ ++  +   K+ 
Sbjct: 716 LKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEK 775

Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +  EEEEK E  + + EKE K + + ++ +   ++  
Sbjct: 776 ELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELK 812



 Score = 28.4 bits (63), Expect = 5.2
 Identities = 18/102 (17%), Positives = 48/102 (47%), Gaps = 6/102 (5%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+ +    E++             + ++EKEE E+ + E +  + K+  +++++++ +  
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKL 365

Query: 237 YEEEEEKEE------GEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            E+ E+ EE        +++    A K K+ + + K  ++K 
Sbjct: 366 QEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKE 407


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 37.3 bits (87), Expect = 0.005
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 19/101 (18%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ-------------------TDS 214
           EL+ K K + ++K++A     S D+ D+EEE + E ++                   +DS
Sbjct: 73  ELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDS 132

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
           E E +K +  KK K+   ++   E+EEE  E E+ ++EKE 
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 37.8 bits (88), Expect = 0.006
 Identities = 17/82 (20%), Positives = 33/82 (40%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
           K +        + +  + +    GE+   + +  + + H K +   +     EE+  +E 
Sbjct: 860 KHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQER 919

Query: 246 GEQTDSEKEAKKRKKHKKDKKK 267
               DS +E  KR K  +DK K
Sbjct: 920 MPGADSPEELMKRAKEYQDKHK 941



 Score = 29.7 bits (67), Expect = 1.8
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK 231
           +E+  +E     DS +E  KR K  +DK K 
Sbjct: 912 EEKSAQERMPGADSPEELMKRAKEYQDKHKG 942


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 37.3 bits (87), Expect = 0.006
 Identities = 15/91 (16%), Positives = 32/91 (35%), Gaps = 1/91 (1%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            A  K + +    +  +  K   + +       E+  +  +  KK  ++  +   +    
Sbjct: 3   TASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGM 62

Query: 242 EKEEGEQTDSEKEAKKRKK-HKKDKKKRKKK 271
            K+  + T+S+   KK K   K    K   K
Sbjct: 63  VKDTDDATESDIPKKKTKTAAKAAAAKAPAK 93



 Score = 36.5 bits (85), Expect = 0.013
 Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 3/89 (3%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSKYE 238
           +A E +K +P              K+  + T+S+   KK K   K    K   KKK K E
Sbjct: 40  EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99

Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            +  K+  ++   +K+         D   
Sbjct: 100 LDSSKKAEKKNALDKDDDLNYVKDIDVLN 128



 Score = 32.7 bits (75), Expect = 0.18
 Identities = 18/95 (18%), Positives = 37/95 (38%), Gaps = 1/95 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K++L A+E+ K      ++    K    KEE  +   E + K  ++  +         K 
Sbjct: 7   KAELAAEEEAKKKLKKLAAKSKSKGFITKEE-IKEALESKKKTPEQIDQVLIFLSGMVKD 65

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            ++  + +  +  ++  AK        KKK K + 
Sbjct: 66  TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100



 Score = 32.7 bits (75), Expect = 0.22
 Identities = 17/91 (18%), Positives = 31/91 (34%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+SK K  E+        S    D ++  + +  +  ++  AK        KKK K +  
Sbjct: 42  LESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELD 101

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
             ++ EK+     D +    K          
Sbjct: 102 SSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 37.4 bits (87), Expect = 0.007
 Identities = 18/73 (24%), Positives = 31/73 (42%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL     L  K    A       +E+ +EE+++   +   +      +K     KK K+ 
Sbjct: 825 ELSNNGPLFQKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEG 884

Query: 234 KSKYEEEEEKEEG 246
           KS   ++EE+E G
Sbjct: 885 KSVLIKQEERETG 897


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 37.4 bits (87), Expect = 0.007
 Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 3/91 (3%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           +  K S  S   +E + ++++ +     DS      + +K +++ KKK K +    +++ 
Sbjct: 199 KFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDS 258

Query: 244 EEGEQTD--SEKEAKKRKKHKKDKKKRKKKS 272
            E   +D     E  K KK KK KKK+KK+ 
Sbjct: 259 TETPASDYYDVSEMVKFKKPKKKKKKKKKRR 289



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 26/97 (26%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K + + K+K K +     S + D  E    +        + KK KK KK KKK++K    
Sbjct: 238 KKRQEVKKKLKIN---NVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDE 294

Query: 238 EEEEEKEEG---EQTDSEKEAKKRKKHKKDKKKRKKK 271
           +E E + EG     + S K+ ++     +D  K++K+
Sbjct: 295 DELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKE 331



 Score = 31.7 bits (72), Expect = 0.41
 Identities = 19/85 (22%), Positives = 43/85 (50%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
           K S  +A+ +   K +EE E   + +  +E   + + K+++  K    K   E++ ++ +
Sbjct: 31  KESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDD 90

Query: 248 QTDSEKEAKKRKKHKKDKKKRKKKS 272
                K++KKR+K K+ ++K+    
Sbjct: 91  TKAWLKKSKKRQKKKEAERKKALLL 115


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 36.9 bits (85), Expect = 0.007
 Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 4/75 (5%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK- 236
           K+K K  EK   S  S  + ES++EE  + E ++ + E    K +K K D  KK +K   
Sbjct: 225 KTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHI 284

Query: 237 ---YEEEEEKEEGEQ 248
              YE+E E E+   
Sbjct: 285 HIEYEQERENEKIPA 299



 Score = 35.4 bits (81), Expect = 0.020
 Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 14/132 (10%)

Query: 147 NVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEK 206
               +IE   + ++   +     +   E +       +EKR+         E   EEEE+
Sbjct: 156 KFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQ------DEKERYVEEEEE 209

Query: 207 EEGE----QTDSEKEAKKRKKHKK----DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
            + E      DSEKE  K+K  +K    D+  +  +S+ EE  E E  E  D + + K R
Sbjct: 210 SDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIR 269

Query: 259 KKHKKDKKKRKK 270
           K+   D KK +K
Sbjct: 270 KRKTDDAKKSRK 281


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 35.8 bits (83), Expect = 0.007
 Identities = 24/52 (46%), Positives = 37/52 (71%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           ++++EE ++ D EK AKKR K +K K+KKKKK K ++  +KEE E + S +E
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEE 120



 Score = 31.9 bits (73), Expect = 0.15
 Identities = 21/46 (45%), Positives = 32/46 (69%)

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           D+K KK+    E ++++EE ++ D EK AKKR K +K K+K+KKK 
Sbjct: 56  DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 37.2 bits (86), Expect = 0.009
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYEE 239
           K K +R       SSDE + + +E E     D EKE K+R++  K+  + K  + SKY  
Sbjct: 357 KRKRRRVPPLPEYSSDEDEDDSDEDE----VDYEKERKRRREEDKNFLRLKALELSKYAG 412

Query: 240 EEEKEE 245
             E+ E
Sbjct: 413 VNERME 418



 Score = 28.3 bits (63), Expect = 4.6
 Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 4/74 (5%)

Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
                SD          + E +DS    K++++          +   +E+E+  + ++ D
Sbjct: 329 RERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVP----PLPEYSSDEDEDDSDEDEVD 384

Query: 251 SEKEAKKRKKHKKD 264
            EKE K+R++  K+
Sbjct: 385 YEKERKRRREEDKN 398


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 36.2 bits (83), Expect = 0.010
 Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 3/63 (4%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
           +S S   S + +  K + EQ  S+ +    ++HK ++K+ +++   EEEE +EE +Q + 
Sbjct: 182 SSLSLGGSVQVKAPKPKQEQLLSKLQEYL-QEHKTEEKQPQEEQ--EEEEVEEEAKQEEG 238

Query: 252 EKE 254
           +  
Sbjct: 239 QGT 241


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 35.6 bits (82), Expect = 0.013
 Identities = 20/64 (31%), Positives = 31/64 (48%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            K K K K+ R   P    +    +  +  E+  +    ++ K+RKK KK+KKKKKK+  
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170

Query: 237 YEEE 240
            E  
Sbjct: 171 PEHP 174



 Score = 32.9 bits (75), Expect = 0.11
 Identities = 23/83 (27%), Positives = 31/83 (37%), Gaps = 18/83 (21%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
               P      +  K   +    E+T S+ E  K  + K  KKK +              
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHED------------- 151

Query: 247 EQTDSEKEAKKRKKHKKDKKKRK 269
                +KE KK+KK KK KKKR 
Sbjct: 152 -----DKERKKKKKEKKKKKKRH 169



 Score = 32.1 bits (73), Expect = 0.18
 Identities = 17/50 (34%), Positives = 25/50 (50%)

Query: 222 KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           KKHK   KK + +    EE   +       EK+ KK+K     ++K+KKK
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKK 160



 Score = 31.7 bits (72), Expect = 0.23
 Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            +  K+K   K KK + +    EE     EG +   +K   K+KKH+ DK+++KKK
Sbjct: 106 IQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKK--HKKKKHEDDKERKKKK 159


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 35.0 bits (81), Expect = 0.017
 Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 3/76 (3%)

Query: 164 LQDVRVVKCNEL--VLKSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
            +D++V+   +   +LK + K  K            +E + E EE +E EQ D   E + 
Sbjct: 69  CKDLKVLGKKDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128

Query: 221 RKKHKKDKKKKKKKSK 236
            K  ++ +++ ++K K
Sbjct: 129 AKLKREKRRENERKQK 144



 Score = 34.2 bits (79), Expect = 0.030
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              K+K++++ +  E EE +E EQ D   E +  K  ++ +++ ++K
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERK 142



 Score = 27.3 bits (61), Expect = 5.7
 Identities = 11/61 (18%), Positives = 32/61 (52%), Gaps = 7/61 (11%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           +L+   K + +E+ +        DE ++ +E  E       ++ AK +++ +++ ++K+K
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLE-------KELAKLKREKRRENERKQK 144

Query: 234 K 234
           +
Sbjct: 145 E 145


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 36.1 bits (83), Expect = 0.017
 Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 3/93 (3%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE-- 239
           KAK K KA+  + +   +  +++ +   E T+ EK A K K     K K    +K +   
Sbjct: 90  KAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREG 149

Query: 240 -EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            EE  EE E+TD EK   K     K K     K
Sbjct: 150 TEEVTEEEEETDKEKAKAKAAAAAKAKAAALAK 182



 Score = 35.3 bits (81), Expect = 0.028
 Identities = 30/90 (33%), Positives = 37/90 (41%), Gaps = 1/90 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            +K KA    KA   + +  + +  EE  EE E+TD EK   K     K K     K K 
Sbjct: 126 AAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKA 185

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            E  E  E E T+ EK   K K     K K
Sbjct: 186 AEAGEGTE-EVTEEEKAKAKAKAAAAAKAK 214



 Score = 32.3 bits (73), Expect = 0.29
 Identities = 23/83 (27%), Positives = 39/83 (46%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K+  KAK    A   +A+  +  +E  E+   E+ +++KE  K K     K K    +K 
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183

Query: 238 EEEEEKEEGEQTDSEKEAKKRKK 260
           +  E  E  E+   E++AK + K
Sbjct: 184 KAAEAGEGTEEVTEEEKAKAKAK 206



 Score = 30.7 bits (69), Expect = 0.72
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 2/91 (2%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK--KKSKYEEE 240
            + KR+A+  + +   +  +++ +   E T+ EK   K K     K K     K K E  
Sbjct: 57  EEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT 116

Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           EE  E E+  ++ +A    K K     ++K+
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQKR 147



 Score = 28.0 bits (62), Expect = 6.7
 Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 10/92 (10%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQT------DSEKEAKKRKKHKKDKKKKKKKS 235
           +AKE+ +    +    E  K EEE  E E+        + +EAK+R         K K +
Sbjct: 17  RAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAA----AAAKAKAA 72

Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
              +++ +   E T+ EK   K K     K K
Sbjct: 73  ALAKQKREGTEEVTEEEKAKAKAKAAAAAKAK 104



 Score = 27.6 bits (61), Expect = 6.9
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 8/67 (11%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE-- 252
              E D E+ +KE   +  +++EA+KR   K   +  K +   EE  EKE+    + +  
Sbjct: 1   MDPEKDLEDLKKEAARR--AKEEARKRLVAKHGAEISKLE---EENREKEKALPKNDDMT 55

Query: 253 -KEAKKR 258
            +EAK+R
Sbjct: 56  IEEAKRR 62


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 34.7 bits (80), Expect = 0.022
 Identities = 13/65 (20%), Positives = 28/65 (43%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           KEE E EE  + D+  +A          +K+ ++    ++ +    ++  S K+ +   +
Sbjct: 72  KEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQ 131

Query: 261 HKKDK 265
              DK
Sbjct: 132 DVVDK 136



 Score = 29.7 bits (67), Expect = 1.0
 Identities = 12/59 (20%), Positives = 29/59 (49%), Gaps = 4/59 (6%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           SE  A  R++    + K++ +++ E E +        + + +  RK+ ++    +K+KS
Sbjct: 60  SEIHALLRER----ELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKS 114



 Score = 27.0 bits (60), Expect = 7.6
 Identities = 14/81 (17%), Positives = 28/81 (34%), Gaps = 5/81 (6%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           +   +  E  +     AS D      E    ++E EQ  +    K++    K++K  K  
Sbjct: 69  RELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAA--QKRKSCRDKERKSAKDP 126

Query: 235 SKYEEEEEKEEGEQTDSEKEA 255
               ++   +     D  +E 
Sbjct: 127 RGGTQDVVDKSQASLDYGEEE 147


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 35.6 bits (82), Expect = 0.026
 Identities = 17/88 (19%), Positives = 36/88 (40%), Gaps = 2/88 (2%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K++ KA++K  A    A+  E ++E   ++E     +   A++     K K         
Sbjct: 340 KNEAKARKKEIAQKRRAAEREINREA--RQERAAAMARARARRAAVKAKKKGLIDASPNE 397

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
           +   E EE + +  + EA    +  ++ 
Sbjct: 398 DTPSENEESKGSPPQVEATTTAEPNREP 425



 Score = 33.7 bits (77), Expect = 0.082
 Identities = 15/93 (16%), Positives = 35/93 (37%), Gaps = 5/93 (5%)

Query: 178 KSKLKAKEKRKA----SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           K+ L    + +     +P  A    ++  E +K    + +  K  KK    K+ +  +++
Sbjct: 302 KAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR-RAAERE 360

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
            ++   +E      +  + + A K KK      
Sbjct: 361 INREARQERAAAMARARARRAAVKAKKKGLIDA 393



 Score = 33.3 bits (76), Expect = 0.13
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 210 EQTDSEKEAKKRKKHKK-DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
           E  + +K     K   K  KK+  +K +  E E   E  Q  +   A+ R +    K K+
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387

Query: 269 K 269
           K
Sbjct: 388 K 388



 Score = 29.8 bits (67), Expect = 1.5
 Identities = 14/75 (18%), Positives = 31/75 (41%), Gaps = 10/75 (13%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK----------SKYEEEEEKEEGEQTDSE 252
           +E KE   +  ++   K+++K       ++ +           +   E  + +  +T  +
Sbjct: 281 DEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEK 340

Query: 253 KEAKKRKKHKKDKKK 267
            EAK RKK    K++
Sbjct: 341 NEAKARKKEIAQKRR 355


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 35.4 bits (81), Expect = 0.027
 Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 2/82 (2%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
           S      D  D +E+ K EG Q   ++E      +  D ++    S + E  E++E  + 
Sbjct: 722 SDDEPKLDAID-DEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDS-FAESSEEDESSEE 779

Query: 250 DSEKEAKKRKKHKKDKKKRKKK 271
           + E+E  K    K+ KKK++K 
Sbjct: 780 EKEEEENKEVSAKRAKKKQRKN 801


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 35.6 bits (82), Expect = 0.028
 Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 10/85 (11%)

Query: 197  DESD-KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE-------- 247
            D+ D K EE +E+ ++  +  E+   KK  +   KK    K  ++  + E          
Sbjct: 1169 DKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSA 1228

Query: 248  -QTDSEKEAKKRKKHKKDKKKRKKK 271
             +T++  E  K K     KKK    
Sbjct: 1229 METENVAEVVKPKGRAGAKKKAPAA 1253



 Score = 31.8 bits (72), Expect = 0.47
 Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 4/70 (5%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKEEG----EQTDSEKEAKKRKKHKKDKKKKKKKS 235
            K+  +  +K +P   +   S+ E  E+  G    E  +  +  K + +    KK      
Sbjct: 1196 KVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAK 1255

Query: 236  KYEEEEEKEE 245
            + EEE+E  +
Sbjct: 1256 EKEEEDEILD 1265



 Score = 31.8 bits (72), Expect = 0.49
 Identities = 19/103 (18%), Positives = 31/103 (30%), Gaps = 13/103 (12%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKEE-------------GEQTDSEKEAKKRKKHKK 226
            +  A  K+  +  S  SD  D +++   E                  ++K AK     KK
Sbjct: 1295 RRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKK 1354

Query: 227  DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
                  +  +    E  +  E      E K RK       K+ 
Sbjct: 1355 RGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKS 1397


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 35.3 bits (81), Expect = 0.029
 Identities = 12/74 (16%), Positives = 28/74 (37%), Gaps = 1/74 (1%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
           ++ + +E+ + + E+E+      +  A   +   +      ++      E  +E E    
Sbjct: 359 STPAVEETSEADIEREQPGDLAGQAPAAH-QVDAEAASAAPEEPAALASEAHDETEPEVP 417

Query: 252 EKEAKKRKKHKKDK 265
           EK A      K D+
Sbjct: 418 EKAAPIPDPAKPDE 431


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 35.4 bits (82), Expect = 0.030
 Identities = 25/98 (25%), Positives = 38/98 (38%), Gaps = 1/98 (1%)

Query: 175  LVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
            L  + K K   KR  S    S     K  + +  G    S    KK KK  +      KK
Sbjct: 1265 LSKEGKPKNAPKR-VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKK 1323

Query: 235  SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             K  E++   + +     K+A   +  +  ++ RKKKS
Sbjct: 1324 KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKS 1361



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 12/84 (14%), Positives = 31/84 (36%), Gaps = 11/84 (13%)

Query: 176  VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
            +  S    K+K+K+   +A   +S    +           + +  +      + +KKK  
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKSKTRVK-----------QASASQSSRLLRRPRKKKSD 1362

Query: 236  KYEEEEEKEEGEQTDSEKEAKKRK 259
               E+++  E + ++ E +     
Sbjct: 1363 SSSEDDDDSEVDDSEDEDDEDDED 1386



 Score = 33.5 bits (77), Expect = 0.12
 Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query: 197  DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
            +E +  +E++ + +      + +K K  KK+KKK KK S  + ++    G     + + K
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKK-KKSSADKSKKASVVGNSKRVDSDEK 1206

Query: 257  KRKKHKKDKKKRKKKS 272
            ++   K D KK     
Sbjct: 1207 RKLDDKPDNKKSNSSG 1222



 Score = 31.9 bits (73), Expect = 0.41
 Identities = 24/107 (22%), Positives = 49/107 (45%), Gaps = 11/107 (10%)

Query: 177  LKSKLKAKEK-----RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK- 230
            L ++L+ KEK     +  +P     ++ DK EE  EE E+ + ++ AK+++   K K K 
Sbjct: 1107 LNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKA 1166

Query: 231  -----KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
                  K K K +++++    +   +      ++    +K+K   K 
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213



 Score = 30.8 bits (70), Expect = 0.89
 Identities = 14/106 (13%), Positives = 35/106 (33%)

Query: 161  SRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
                 D    +        ++ A +     P+     ES+   +     ++   ++    
Sbjct: 1258 DEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317

Query: 221  RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
                KK KK +KK ++ ++ + + +           +R + KK   
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 23/95 (24%), Positives = 40/95 (42%), Gaps = 6/95 (6%)

Query: 178  KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            K KLK KEK+K          +DK ++    G     + + K++   K D KK       
Sbjct: 1171 KPKLKKKEKKK------KKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSD 1224

Query: 238  EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            +E++E+++ +   S  +  K KK+   K       
Sbjct: 1225 QEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDE 1259



 Score = 29.6 bits (67), Expect = 1.9
 Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 1/101 (0%)

Query: 173  NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
            ++   K+K K    ++      +S +S ++ +E    + +   K     K+    +    
Sbjct: 1227 DDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPP 1286

Query: 233  KKSKYEEEEEKEEGEQTDSEKEAK-KRKKHKKDKKKRKKKS 272
              SK  + E     + +   K+   KR +      K+KKKS
Sbjct: 1287 PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKS 1327


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 33.4 bits (77), Expect = 0.035
 Identities = 13/64 (20%), Positives = 34/64 (53%)

Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           E E+  +  + +  K   K+K K +KK + ++ E ++  + +  EK   + +K +K+ ++
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 268 RKKK 271
            + +
Sbjct: 61  LEAE 64



 Score = 32.2 bits (74), Expect = 0.096
 Identities = 18/74 (24%), Positives = 36/74 (48%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E +K   +KE  +    EK   ++K+ +K  + +K      EE+ + E E+ + E E  +
Sbjct: 3   EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62

Query: 258 RKKHKKDKKKRKKK 271
            +  +++ K   KK
Sbjct: 63  AELARRELKAEAKK 76



 Score = 31.5 bits (72), Expect = 0.16
 Identities = 18/65 (27%), Positives = 34/65 (52%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
            +E+ K E +Q + + EA+K  K   ++K + +  K E+E E+ E E    E +A+ +K 
Sbjct: 18  AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKM 77

Query: 261 HKKDK 265
             +  
Sbjct: 78  LSEKG 82



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 15/68 (22%), Positives = 32/68 (47%)

Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKD 264
           E EE +    ++  K   K K   +KK+++ K E E+  +   +  +E E +K +K  ++
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 265 KKKRKKKS 272
            +    + 
Sbjct: 61  LEAELARR 68


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 34.5 bits (79), Expect = 0.036
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
           + +        +  ++EEEE EE E+TDSE + + R K    +KK +      +E E+E 
Sbjct: 1   ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITI---QEREREA 57

Query: 246 GEQTDSEKEAKKRKKHKKDKKKR 268
            ++   E+EAK++ + +K +  +
Sbjct: 58  AKEKALEEEAKRKAEERKRETLK 80


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 34.6 bits (80), Expect = 0.036
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           EEE E  +    E+ AKK+KK  K KKK KK +      E        +E  +   +  +
Sbjct: 166 EEEVELLKARLEEERAKKKKK--KKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223

Query: 263 KDKKKRKKK 271
             K K+K+ 
Sbjct: 224 AKKLKKKRS 232



 Score = 31.9 bits (73), Expect = 0.28
 Identities = 19/71 (26%), Positives = 25/71 (35%), Gaps = 6/71 (8%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +EE E  +    +   + KK+KK KK KK     S  E               E      
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSA------VPTELSSGAG 219

Query: 261 HKKDKKKRKKK 271
              + KK KKK
Sbjct: 220 QVGEAKKLKKK 230



 Score = 30.4 bits (69), Expect = 0.80
 Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           + +AK+K+K        + +     E        +E  +   +   + KK KKK+S   +
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG-EAKKLKKKRSIAPD 236

Query: 240 EEEKE 244
            E+ E
Sbjct: 237 NEKSE 241



 Score = 27.3 bits (61), Expect = 8.1
 Identities = 16/60 (26%), Positives = 25/60 (41%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K K K K+ +K + T +S++ +       E         EAKK KK +      +K   Y
Sbjct: 184 KKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSEVY 243



 Score = 27.3 bits (61), Expect = 8.3
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 5/80 (6%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----DKKKKKKKSKYEEEEEKEEGEQTDSE 252
           E + E  +    E+   +K+ KK+KK KK        +    S    E     G+  +++
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225

Query: 253 KEAKKRKKHKKDKKKRKKKS 272
           K  KKR     ++K    KS
Sbjct: 226 KLKKKRSIAPDNEKSEVYKS 245


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 35.1 bits (82), Expect = 0.037
 Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 1/96 (1%)

Query: 175 LVLKSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
            V K   +AK K          +E+ KE E  K+E      E+  K R + +K+ ++++ 
Sbjct: 23  FVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           + +  E+   ++ E  D + E  ++++ + +KK+++
Sbjct: 83  ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118



 Score = 32.4 bits (75), Expect = 0.27
 Identities = 22/103 (21%), Positives = 53/103 (51%), Gaps = 7/103 (6%)

Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ--TDSEKEAKKRKKH--KKDKKKK 231
           + +++ +AK   + +   A + + +   E KEE  +   + EKE ++R+    K +K+  
Sbjct: 33  IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLL 92

Query: 232 KKKSKYEEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +K+   + + E   K E E    EKE +++++  + K++  ++
Sbjct: 93  QKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135



 Score = 27.8 bits (63), Expect = 8.0
 Identities = 17/70 (24%), Positives = 39/70 (55%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           + E EKE  E+ +  ++ +KR   K++   +K +   + EEE E+ E+   +K+ +  KK
Sbjct: 70  RNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKK 129

Query: 261 HKKDKKKRKK 270
            ++ ++  ++
Sbjct: 130 EEELEELIEE 139


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 34.3 bits (79), Expect = 0.041
 Identities = 22/110 (20%), Positives = 45/110 (40%), Gaps = 19/110 (17%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR-----KKHKKDKKKKKKKS 235
           L AK+ ++        +E  K+E EKE  E    ++ AK++     ++  +   K++   
Sbjct: 93  LSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPK 152

Query: 236 KYEEEEEKEEGEQTDSEKEAKK--------------RKKHKKDKKKRKKK 271
             +E  E      + S K  +                KK K+ ++KR+++
Sbjct: 153 HKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEE 202



 Score = 28.2 bits (63), Expect = 4.3
 Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 188 KASPTSASSDES---DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
            AS  S SS  S    + + E++  E  ++   AK+ ++ KK +K  ++K K E E+E+E
Sbjct: 62  TASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKERE 121

Query: 245 EGEQTDSEKEAK-----KRKKHKKDKKKRKKKS 272
           E E      + K     ++K  +  K++  K  
Sbjct: 122 EAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHK 154



 Score = 28.2 bits (63), Expect = 5.0
 Identities = 21/83 (25%), Positives = 43/83 (51%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
           S T++S  +S       +E +     +EA +     K  +++KK  K  EE++K+E E+ 
Sbjct: 60  SSTASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKE 119

Query: 250 DSEKEAKKRKKHKKDKKKRKKKS 272
             E E ++R   +K ++  ++K+
Sbjct: 120 REEAELRQRLAKEKYEEWCRQKA 142



 Score = 27.8 bits (62), Expect = 6.5
 Identities = 20/91 (21%), Positives = 50/91 (54%), Gaps = 5/91 (5%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS-EKEAKKRKKHKKDKKKKKKKSK 236
             K K +    AS + + S + ++   ++E  ++    E +  K+++ K++++++K++ K
Sbjct: 150 TPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKK 209

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            +EEEE+    +  +E+  +K  K+ K + K
Sbjct: 210 QQEEEER----KQKAEEAWQKWMKNVKQRPK 236


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 34.4 bits (79), Expect = 0.042
 Identities = 17/57 (29%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           E E  K  K K + +   K +  ++  EK +        E  K K H ++KK+ KKK
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVK-KLHSQEKKEEKKK 72



 Score = 32.5 bits (74), Expect = 0.18
 Identities = 15/73 (20%), Positives = 29/73 (39%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           +     K+K      +  + +    E+ K   +   +E+  K   + KK++KKK KK K 
Sbjct: 19  EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78

Query: 238 EEEEEKEEGEQTD 250
             +    +    D
Sbjct: 79  PLQVNPAQLFVDD 91



 Score = 30.1 bits (68), Expect = 0.97
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
            E+ K   ++ + E   K   + K  +K K        E  K+   Q   ++E KK KK 
Sbjct: 18  SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQE-KKEEKKKPKKK 76

Query: 262 K 262
           K
Sbjct: 77  K 77



 Score = 29.0 bits (65), Expect = 2.2
 Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 2/65 (3%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           ES+K +  K++G      K   + K    +K K   K++  E  +K   ++   EK+  K
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKA--IEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74

Query: 258 RKKHK 262
           +KK  
Sbjct: 75  KKKVP 79



 Score = 29.0 bits (65), Expect = 2.4
 Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 7/67 (10%)

Query: 205 EKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           E E+ +    +   +   K   +DK  +K K         +       +K   + KK +K
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFK------LLIKAQMAERVKKLHSQEKKEEK 70

Query: 264 DKKKRKK 270
            K K+KK
Sbjct: 71  KKPKKKK 77


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 35.2 bits (81), Expect = 0.042
 Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 7/93 (7%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
           L   ++    P    S    KEE+   E  + +S+K AK++K    D K KK K+  +  
Sbjct: 171 LSDSDQEAVLPLVKKSPSEAKEEKA--EERKQESKKGAKRKKDASGDDKSKKAKTDRDVS 228

Query: 241 EEKEEGEQTDSEKEAKKRKKHK-----KDKKKR 268
                 ++  S+ E+K   + K     KD  K+
Sbjct: 229 TSTAASQKKSSDLESKLEAQSKELWSLKDDLKK 261



 Score = 29.8 bits (67), Expect = 1.6
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           KKS  E +EEK E  + +S+K AK++K    D K +K K+
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKT 223


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 33.9 bits (78), Expect = 0.042
 Identities = 15/69 (21%), Positives = 43/69 (62%), Gaps = 2/69 (2%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           +  +++ E  + E+E +K+ + K++ ++K+++   EE E+K+E E  + ++  ++ +K +
Sbjct: 15  QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE--ERKEREEQARKEQ 72

Query: 263 KDKKKRKKK 271
           ++ +K K  
Sbjct: 73  EEYEKLKSS 81



 Score = 33.9 bits (78), Expect = 0.047
 Identities = 18/67 (26%), Positives = 44/67 (65%), Gaps = 2/67 (2%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           K+  + EE +    ++EA++ ++ ++ K ++K++ + +EEEE E  E+ + +KE ++RK+
Sbjct: 6   KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELE--EEREKKKEEEERKE 63

Query: 261 HKKDKKK 267
            ++  +K
Sbjct: 64  REEQARK 70



 Score = 33.5 bits (77), Expect = 0.062
 Identities = 17/72 (23%), Positives = 43/72 (59%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
            + EEK+   Q    +E ++ ++ K ++K++ ++ + EE EE+ E ++ + E++ ++ + 
Sbjct: 9   AKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQA 68

Query: 261 HKKDKKKRKKKS 272
            K+ ++  K KS
Sbjct: 69  RKEQEEYEKLKS 80



 Score = 32.0 bits (73), Expect = 0.20
 Identities = 13/52 (25%), Positives = 38/52 (73%)

Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            +K+ K ++K+ +++ +  EEEE+EE ++ + ++E +++++ + ++++ KKK
Sbjct: 5   AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKK 56



 Score = 31.6 bits (72), Expect = 0.25
 Identities = 21/85 (24%), Positives = 50/85 (58%), Gaps = 13/85 (15%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           KL+ K+ R+         + + EEEE+EE ++ + ++E +++++ + +++++KKK     
Sbjct: 10  KLEEKQARRQ--------QREAEEEEREERKKLEEKREGERKEEEELEEEREKKK----- 56

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKD 264
           EEE+ +  +  + KE ++ +K K  
Sbjct: 57  EEEERKEREEQARKEQEEYEKLKSS 81



 Score = 30.1 bits (68), Expect = 0.87
 Identities = 16/69 (23%), Positives = 37/69 (53%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           +E++K         + ++E EE+ E ++ + E++ ++ +  K+ ++ +K KS +  EEE 
Sbjct: 29  EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEG 88

Query: 244 EEGEQTDSE 252
            +    D E
Sbjct: 89  TDKLSADEE 97


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 34.6 bits (79), Expect = 0.044
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 1/72 (1%)

Query: 202 EEEEKEEGEQTDSEKEAKKR-KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +E+EKE  +    E+EA  R K     K K + K+K E E E         +K  +  KK
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295

Query: 261 HKKDKKKRKKKS 272
             K +KK  K +
Sbjct: 296 ALKMEKKAIKNA 307



 Score = 27.7 bits (61), Expect = 7.7
 Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           + K+ RK    + +  ++    + K E  +  +E EA+        KKK K+  K   + 
Sbjct: 241 EMKKIRKWEREAGARLKALAALKGKAE-AKNKAEIEAEALASATAVKKKAKEVMKKALKM 299

Query: 242 EKE 244
           EK+
Sbjct: 300 EKK 302


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 34.6 bits (80), Expect = 0.048
 Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           E +   P+    D  DKEE ++E+ E+   +++ + RK  K+++KKKK+  K E+
Sbjct: 536 EDKPDGPSVWKLD--DKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 34.4 bits (79), Expect = 0.049
 Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 8/86 (9%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K K K K+++K +  S   +ES++E  +++   + +S +     +   +D+ K   +   
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKK 263
            EEE +        EKE +KRK+ KK
Sbjct: 283 SEEETE--------EKEKEKRKRLKK 300



 Score = 34.0 bits (78), Expect = 0.062
 Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 4/99 (4%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK----EAKKRKKHKKDKKKKKK 233
           KS  K ++  K + T  +  ++  +    +      S        KK K+ K+ K+  + 
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
             K E EEE  + +    ++ A+     + + +   K S
Sbjct: 239 TVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPS 277



 Score = 31.7 bits (72), Expect = 0.37
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 15/109 (13%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK------ 231
           +S  +       S      DE + E+E K  GE++DSE+E ++++K K+ + KK      
Sbjct: 247 ESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDED 306

Query: 232 -------KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK--KKRKKK 271
                    +S  EEEE +E       +KE +K +        ++R ++
Sbjct: 307 EDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRR 355



 Score = 28.6 bits (64), Expect = 3.2
 Identities = 19/95 (20%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             K    +K     +       +K+E+++        E E +  K+    + +  + +  
Sbjct: 206 SLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGL 265

Query: 238 EEEEEKEE----GEQTDSEKEAKKRKKHKKDKKKR 268
           +E+E+++E    GE++DSE+E ++++K K+ + K+
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300



 Score = 27.9 bits (62), Expect = 6.1
 Identities = 26/111 (23%), Positives = 36/111 (32%), Gaps = 23/111 (20%)

Query: 180 KLKAKEKRKASPTSASSD--------ESDKEEEEKEEGEQTDSEKE----------AKKR 221
           K K K KR         +        ES  EEEE EE E     K+              
Sbjct: 290 KEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGG 349

Query: 222 KKHKKDKKKKKKKSKYEE-----EEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           ++  + +  KKK  K EE     ++  E    ++ E E    K   K    
Sbjct: 350 RRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTP 400


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 34.1 bits (78), Expect = 0.051
 Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 4/56 (7%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE----EEEKEEGEQTDSEKE 254
           +  K+E E   S  +  K+KK KKDKKKK  KS        EE K + E  + + E
Sbjct: 73  QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSE 128



 Score = 28.7 bits (64), Expect = 2.5
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +  K+E E   S  +  K+KK KKDKKK+  KS
Sbjct: 73  QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKS 105


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 34.2 bits (79), Expect = 0.052
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              +     K  K+ KKKK++  +  E  E    ++  S++E KK+K  +    K  KK
Sbjct: 216 LIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 33.5 bits (77), Expect = 0.10
 Identities = 25/94 (26%), Positives = 35/94 (37%), Gaps = 19/94 (20%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            K  LK+    +  P      ES K++ E EE  +    +  KKRK  ++ KKKK K+SK
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
                                +   K   K  KK
Sbjct: 268 -------------------GVKALKKVVAKGMKK 282



 Score = 32.7 bits (75), Expect = 0.15
 Identities = 15/62 (24%), Positives = 24/62 (38%)

Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
             L++       +  K S       E D E  E    ++  S++E KK+K  +    K  
Sbjct: 213 KSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKAL 272

Query: 233 KK 234
           KK
Sbjct: 273 KK 274


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 34.2 bits (79), Expect = 0.054
 Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 11/100 (11%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD-----KKKK 231
           ++ KL A+   K +     S+E+ K+ E K+ G+Q   EK  ++R K KKD     KK K
Sbjct: 120 VRKKLLAEAAAKKA-----SEEARKQRELKKFGKQVQVEKL-QERAKEKKDMLEKIKKLK 173

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           KK+    +E +  +    D+ K   K+K  K    + K  
Sbjct: 174 KKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPN 213


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 33.8 bits (77), Expect = 0.056
 Identities = 15/58 (25%), Positives = 31/58 (53%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           SK +AK +R A  +   + E+ +   E+ + EQ + +K+  +++K     K+ K + K
Sbjct: 57  SKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVK 114



 Score = 29.2 bits (65), Expect = 1.7
 Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           S+K AK ++  KK +   ++  +   EE K   EQ + +K+  +++K     K+ K
Sbjct: 57  SKKRAKVQRLAKKSRVDAREAYRIAAEENKA--EQLERDKQLSEQQKQAALAKEYK 110


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 34.3 bits (77), Expect = 0.059
 Identities = 24/93 (25%), Positives = 55/93 (59%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           K+K  EK+KA    A   E++++   K+  E+     EA+ +K+  ++++ K+ K++ E 
Sbjct: 69  KVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQER 128

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            ++++E ++ D +K   ++KK K  K ++ +K+
Sbjct: 129 IQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKA 161


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.2 bits (79), Expect = 0.063
 Identities = 24/94 (25%), Positives = 43/94 (45%), Gaps = 21/94 (22%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             KL  +  RK   T    +E   +  E+E  E      EA+++K+ KK ++++ K +K 
Sbjct: 250 NLKLSPEVLRKVDKTREEEEEKILKAAEEERQE------EAQEKKEEKKKEEREAKLAKL 303

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             EE+               RK  +K++KK+ +K
Sbjct: 304 SPEEQ---------------RKLEEKERKKQARK 322



 Score = 30.7 bits (70), Expect = 0.76
 Identities = 13/54 (24%), Positives = 29/54 (53%), Gaps = 1/54 (1%)

Query: 220 KRKKHKKDKKKKKKKSKYEEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              K   +  +K  K++ EEEE+  +  E+   E+  +K+++ KK++++ K   
Sbjct: 249 ANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 29.1 bits (66), Expect = 2.3
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 168 RVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD 227
           +V K  E   +  LKA E+ +        +E  KEE E +  +   S +E +K +  +K+
Sbjct: 260 KVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK--LSPEEQRKLE--EKE 315

Query: 228 KKKKKKK 234
           +KK+ +K
Sbjct: 316 RKKQARK 322


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 33.2 bits (76), Expect = 0.064
 Identities = 21/66 (31%), Positives = 37/66 (56%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           ++ E+E++E  +   EKEA K KK KK+KK+KK   K  +++  +  + T  +   K   
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152

Query: 260 KHKKDK 265
           K ++ K
Sbjct: 153 KKEEGK 158



 Score = 29.7 bits (67), Expect = 0.96
 Identities = 16/66 (24%), Positives = 31/66 (46%)

Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
           EE E+   E   +  +K     KK+KK+ K ++  EK   +++    +   +K  KK   
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152

Query: 267 KRKKKS 272
           K+++  
Sbjct: 153 KKEEGK 158



 Score = 27.8 bits (62), Expect = 4.8
 Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKR 268
            +   +++ +  K+ ++ +  K KK K E++E+K  E        +  K    K  KK  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 269 KKK 271
            KK
Sbjct: 152 TKK 154



 Score = 27.0 bits (60), Expect = 7.1
 Identities = 12/58 (20%), Positives = 32/58 (55%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            +E++K +K    + ++K+  K ++E+++++ ++   +   KK  K  K+  K+  K 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 34.3 bits (78), Expect = 0.066
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK--KKKKKSKYEEEEEKEEGEQTDSEKE 254
           +E D ++EE +  ++ + E +  K  +  +  K   K K  K + +EE+EE +       
Sbjct: 491 EEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMS 550

Query: 255 AKKRKKHKKDKKKRKKK 271
            K++K +KK K    KK
Sbjct: 551 NKQKKLYKKMKYSNAKK 567



 Score = 33.1 bits (75), Expect = 0.16
 Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 10/83 (12%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
           K S TS +  + +K + +K +    D E+E KK K      K+KK   K +    K+   
Sbjct: 514 KYSETSEADKDVNKSKNKKRK---VDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKK--- 567

Query: 248 QTDSEKEAKKRKKHKKDKKKRKK 270
               E++A+  KK KK   K+KK
Sbjct: 568 ----EEQAENLKKKKKQIAKQKK 586


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 33.5 bits (77), Expect = 0.067
 Identities = 12/36 (33%), Positives = 16/36 (44%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           D   KEE   EE ++  +    K     K + KKKK
Sbjct: 161 DAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 30.4 bits (69), Expect = 0.75
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 4/42 (9%)

Query: 228 KKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           +K  K+K ++  EE KE           K     K + KK+K
Sbjct: 159 RKDAKQKEEFAAEERKEALA----AAAKKSATPQKVETKKKK 196


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 33.5 bits (77), Expect = 0.071
 Identities = 18/68 (26%), Positives = 30/68 (44%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           K +  +E   Q   + +AKK+   KK K   K+   + +E  K E E    ++ AKK  K
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65

Query: 261 HKKDKKKR 268
                + +
Sbjct: 66  FYVPAEHK 73


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.0 bits (79), Expect = 0.074
 Identities = 30/131 (22%), Positives = 55/131 (41%), Gaps = 32/131 (24%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKK 231
           +L+ +++ +A++  K +   A     +  + +K       +    EA+KR     +KK+K
Sbjct: 566 KLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEK 625

Query: 232 KKKSKYEEEEEKEEGE--------------QTDSEKEA----------------KKRKKH 261
           KKK + E++EE + G+                  +KEA                +K +K 
Sbjct: 626 KKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKP 685

Query: 262 KKDKKKRKKKS 272
           KK KKK+ K  
Sbjct: 686 KKKKKKKPKTV 696



 Score = 30.6 bits (70), Expect = 0.94
 Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 9/65 (13%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           AS +E ++E E+K E      E EA  ++  K  ++ ++KK K +EEE+K   E    EK
Sbjct: 523 ASLEELERELEQKAE------EAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA---EK 573

Query: 254 EAKKR 258
           EA++ 
Sbjct: 574 EAQQA 578


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 33.5 bits (77), Expect = 0.077
 Identities = 19/76 (25%), Positives = 38/76 (50%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           ++ E     E +    Q +      +++K K  KKK+K   + +E +E+E+  +   ++ 
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRL 204

Query: 255 AKKRKKHKKDKKKRKK 270
             +R+  KK K K+KK
Sbjct: 205 ELQRELMKKGKGKKKK 220



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 1/81 (1%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK-SKYEEEEEKEEGEQTDS 251
           +       +E   +    +  S  + K++KK  K K+K  K+  + +E E+K +  +   
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRL 204

Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
           E + +  KK K  KKK  K  
Sbjct: 205 ELQRELMKKGKGKKKKIVKDK 225



 Score = 28.5 bits (64), Expect = 3.5
 Identities = 19/70 (27%), Positives = 39/70 (55%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
            + E+     E+   +   KKRK +K+ K++K+++ K ++ E++ E ++   +K   K+K
Sbjct: 160 SQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKK 219

Query: 260 KHKKDKKKRK 269
           K  KDK  + 
Sbjct: 220 KIVKDKDGKV 229


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 33.7 bits (77), Expect = 0.083
 Identities = 28/106 (26%), Positives = 42/106 (39%), Gaps = 13/106 (12%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----------DK 228
           K K K+       S         E+E ++ E  D ++  KK  K K            +K
Sbjct: 262 KPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPVKRKNRRGQRARQAIWEK 321

Query: 229 K--KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           K     K  K E E+E++E E   SE EA++ K+   D K  +   
Sbjct: 322 KYGSGAKHVKKEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAE 367



 Score = 32.9 bits (75), Expect = 0.15
 Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK--------KSKYEEEE 241
           +  ++ S+  + + E  E    +DSE+ +   KK K+ K              S  E+E+
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDED 288

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           + +E    D   +   ++K+++ ++ R+ 
Sbjct: 289 DDDEDIDPDQVVKKPVKRKNRRGQRARQA 317



 Score = 28.3 bits (63), Expect = 4.3
 Identities = 20/78 (25%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           + E+E++E E   SE EA++ K+   D K  +         +++       EK+  K+  
Sbjct: 333 EREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPS 392

Query: 261 HKKD-------KKKRKKK 271
             K        KKK K+K
Sbjct: 393 VDKPLHPSWEAKKKAKEK 410



 Score = 28.3 bits (63), Expect = 4.5
 Identities = 20/99 (20%), Positives = 42/99 (42%), Gaps = 10/99 (10%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQT----------DSEKEAKKRKKHKKDKKKK 231
           +++ + ++    ++ D+SD EEEE  + E              E+E ++      ++   
Sbjct: 174 ESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTS 233

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           + +S   + E  E    +DSE+ +   KK K+ K     
Sbjct: 234 ESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTF 272


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 33.1 bits (76), Expect = 0.089
 Identities = 14/36 (38%), Positives = 18/36 (50%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           D    EE   EE E   + KE +  +  KK+ KKKK
Sbjct: 161 DAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.0 bits (68), Expect = 0.83
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K  +  E+   E+ ++   AK+ +  + +KK+ KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 28.8 bits (65), Expect = 2.0
 Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 4/41 (9%)

Query: 229 KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           K  K   ++  EE +         KE +  +  KK+ KK+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAA----KEEESAEGEKKESKKKK 196



 Score = 27.7 bits (62), Expect = 6.1
 Identities = 9/36 (25%), Positives = 19/36 (52%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           K+ +  EE    ++E  A  +++   + +KK+ K K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 31.8 bits (73), Expect = 0.12
 Identities = 12/54 (22%), Positives = 31/54 (57%)

Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           +A  R + +   ++K+++    E+E KE   + D++K  KK++K  + + + ++
Sbjct: 61  DAGLRAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 33.1 bits (75), Expect = 0.12
 Identities = 22/94 (23%), Positives = 38/94 (40%), Gaps = 4/94 (4%)

Query: 182 KAKEKRKASPTSASSD----ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             +   K   T+ S      E+ K+  EK+  +   +E +    +  K   K K+ K K 
Sbjct: 128 DMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQ 187

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +   ++    Q++S+      K  K DK K K K
Sbjct: 188 KALPKQTAETQSNSKPIETAPKADKADKTKPKPK 221


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 33.2 bits (76), Expect = 0.13
 Identities = 14/72 (19%), Positives = 30/72 (41%), Gaps = 6/72 (8%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
             +  ++    ++D  +E        K  +K   +         E+ E  D E+  ++ +
Sbjct: 302 RAKPSDEPSLPESDIHEE------IPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVE 355

Query: 260 KHKKDKKKRKKK 271
           K +K KKK +K+
Sbjct: 356 KKQKVKKKPRKR 367



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 14/90 (15%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           ++K +  R       S  ESD  EE  +  E++ SE                      +E
Sbjct: 295 RVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFL--------------GYMGGIDE 340

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           ++E E+ E++  E E K++ K K  K+K  
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.1 bits (74), Expect = 0.15
 Identities = 12/35 (34%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
            +KE K     K +KK +K K+K E+++ K+E  +
Sbjct: 64  DKKELKAW--EKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 28.3 bits (64), Expect = 3.0
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
              K  KK +K K K +KK   +EE + 
Sbjct: 70  AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 27.5 bits (62), Expect = 4.6
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 1/54 (1%)

Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           A+K+K  K + +      +Y++ +E  E      +KE K  +K +K  +K K K
Sbjct: 32  AQKKKGKKGELEITDLNEEYKDLKESLE-AALLDKKELKAWEKAEKKAEKAKAK 84


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 32.7 bits (75), Expect = 0.15
 Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 1/49 (2%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-DKKKKKKKSKYEE 239
            S S++   KE E++EE ++   EK+ ++ +  KK +K+ +K+++KY E
Sbjct: 146 ESKSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSE 194



 Score = 30.4 bits (69), Expect = 0.87
 Identities = 12/40 (30%), Positives = 24/40 (60%)

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +K  + E++EEK+E       + ++  KK +K+ +KR+ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 32.9 bits (75), Expect = 0.15
 Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 1/94 (1%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
               KA  K K++    S  ++ K     ++  +T ++   K +K  KK  KK  K  K 
Sbjct: 210 SGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
             ++  +   +  ++  AK  K   K KKK  KK
Sbjct: 270 AAKKAAKAAAKA-AKGAAKATKGKAKAKKKAGKK 302



 Score = 32.2 bits (73), Expect = 0.23
 Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 8/102 (7%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS--------EKEAKKRKKHKKDKK 229
           K+   A   +KA+ T+A + +  K+  +K   +   +         K A K  K      
Sbjct: 230 KAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K K K+K +  ++   G +  +  +A KR    K  KK  KK
Sbjct: 290 KGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331



 Score = 27.5 bits (61), Expect = 7.1
 Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 8/99 (8%)

Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           VL +   A  K+ A   +A S       + K   ++   +K AK     KK  K   K +
Sbjct: 190 VLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAA 249

Query: 236 KYEEEEEKEEGEQTDSEKEAKK--RKKHKKDKKKRKKKS 272
           K  ++  K+        K+A K  +K  KK  K   K +
Sbjct: 250 KKAKKTAKKAL------KKAAKAVKKAAKKAAKAAAKAA 282


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.0 bits (75), Expect = 0.16
 Identities = 21/75 (28%), Positives = 37/75 (49%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
            ++ DE ++EE+E  E E  D E EA    + +   +        ++EE+K E E+   E
Sbjct: 113 DSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEE 172

Query: 253 KEAKKRKKHKKDKKK 267
           KE+ ++   +K   K
Sbjct: 173 KESVEQATREKKFDK 187



 Score = 28.4 bits (63), Expect = 4.9
 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 5/69 (7%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE-----EEEKEEGE 247
           S+  D SD E E+ EE +  + E   +K    +  ++KK  KS  ++     +E  E  E
Sbjct: 146 SSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLE 205

Query: 248 QTDSEKEAK 256
            T++E+EA 
Sbjct: 206 ATEAEEEAA 214


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 8/88 (9%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           K KE+ +            +  EE+E  E T  EK  KK ++  +  +++  + K E EE
Sbjct: 396 KVKEEERPRE--------KEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE 447

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
            K E E+ +SE E  +R+   K +K R+
Sbjct: 448 LKREIEKLESELERFRREVRDKVRKDRE 475


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 32.4 bits (74), Expect = 0.16
 Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 1/79 (1%)

Query: 177 LKSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           L S +   K+  K    S    +     EE+E+ ++  S  E K ++   +D  ++ +K 
Sbjct: 116 LYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKD 175

Query: 236 KYEEEEEKEEGEQTDSEKE 254
           + EEEEE+EE E  D + +
Sbjct: 176 EEEEEEEEEEDEDFDDDDD 194


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 32.1 bits (73), Expect = 0.16
 Identities = 19/85 (22%), Positives = 43/85 (50%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
              KA EK +    S    +S +EE++ E  E    ++  +  ++   ++++K++ ++ E
Sbjct: 32  IIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91

Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKK 263
           E+EE+ E  +       K++K  K+
Sbjct: 92  EKEEEAEDVKQQEVFSFKRKKPFKE 116



 Score = 27.1 bits (60), Expect = 8.4
 Identities = 22/85 (25%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
            +KA       + SD+  + KEE EQ    +EA+K K+  + +++   + + +EE  +EE
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEE-EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91

Query: 246 GEQTDSEKEAKKRKKHKKDKKKRKK 270
            ++ ++E   ++     K KK  K+
Sbjct: 92  EKEEEAEDVKQQEVFSFKRKKPFKE 116


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 32.1 bits (73), Expect = 0.16
 Identities = 25/84 (29%), Positives = 38/84 (45%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           +   K++  S      D   E+ E   +  S+K+    +K K  KK KKKK + EE  E 
Sbjct: 21  RAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80

Query: 244 EEGEQTDSEKEAKKRKKHKKDKKK 267
              E +DSE+  +  KK   +  K
Sbjct: 81  AAEELSDSEENEENDKKVDYELPK 104



 Score = 30.5 bits (69), Expect = 0.63
 Identities = 20/91 (21%), Positives = 39/91 (42%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
           ++A  K         +D+   E+ E    + +  +K   ++KK  K  KKKKK+ +   E
Sbjct: 20  IRAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79

Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              EE   ++  +E  K+  ++  K +    
Sbjct: 80  LAAEELSDSEENEENDKKVDYELPKVQNTAA 110


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.18
 Identities = 18/42 (42%), Positives = 31/42 (73%), Gaps = 3/42 (7%)

Query: 221 RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           + +H+++K++KKK+SK ++EEEKE   Q    K+ KK+ KH+
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQL---KQQKKKAKHR 130


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 33.1 bits (76), Expect = 0.18
 Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 2/86 (2%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+S+L+  E R    +   SD S K  E ++E EQ + E+E  K +   ++ ++     +
Sbjct: 693 LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKER--LEELEEDLSSLE 750

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHK 262
            E E  K E ++ ++  E  +   HK
Sbjct: 751 QEIENVKSELKELEARIEELEEDLHK 776



 Score = 30.8 bits (70), Expect = 0.97
 Identities = 17/84 (20%), Positives = 39/84 (46%), Gaps = 4/84 (4%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
            +  + E +  E+E +E ++   + + + +   K+ +    KK + EEE E+ E    D 
Sbjct: 821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880

Query: 252 EKE----AKKRKKHKKDKKKRKKK 271
           E       K+R + +   ++ ++K
Sbjct: 881 ESRLGDLKKERDELEAQLRELERK 904


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 31.6 bits (72), Expect = 0.18
 Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 2/95 (2%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           +L+ +     +  SA +  S K  +  +E  +   E   KKR K   ++ +KK+K K +E
Sbjct: 45  RLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKE 104

Query: 240 EEEKEEGEQT-DSEKE-AKKRKKHKKDKKKRKKKS 272
           EE  +   +  D EK+ A   K  K  +++   K+
Sbjct: 105 EENDDPSRRPFDREKDLAVGGKISKAQRRELINKA 139


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 33.1 bits (75), Expect = 0.19
 Identities = 18/75 (24%), Positives = 38/75 (50%), Gaps = 5/75 (6%)

Query: 195  SSDESDKEEEEKEEGE-----QTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
            S DESD+ EEE  E E     ++D   E ++  +  +D  + + ++   +EE+ E+ ++ 
Sbjct: 927  SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 250  DSEKEAKKRKKHKKD 264
            +S+     R   ++ 
Sbjct: 987  ESKAAYDSRPGKRRK 1001


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 17/103 (16%), Positives = 46/103 (44%), Gaps = 8/103 (7%)

Query: 178 KSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK-------K 229
           K++++A ++++K +  + +  E+ +   E+E+  +    K+A + +  K          +
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALAR 494

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K KK+   +    + G + D+      R+  K   + R+ + 
Sbjct: 495 VKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEK 537



 Score = 28.4 bits (64), Expect = 5.2
 Identities = 17/102 (16%), Positives = 40/102 (39%), Gaps = 10/102 (9%)

Query: 180 KLKAKEKRKASPTSASSD----------ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
           K +A+ ++     +A++D             K ++  ++    ++E+E   +K       
Sbjct: 527 KAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAI 586

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + K  K  ++    E E+  +E + KK        + + KK
Sbjct: 587 ARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKK 628



 Score = 28.0 bits (63), Expect = 7.0
 Identities = 19/93 (20%), Positives = 32/93 (34%), Gaps = 5/93 (5%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEE 240
           KA E R A    A +    + + +K    Q    K   +         ++ +K      +
Sbjct: 475 KAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQ 534

Query: 241 EEKEEGEQTDSEKE----AKKRKKHKKDKKKRK 269
            EK+     D +K     A  R K KK  ++  
Sbjct: 535 AEKQAAAAADPKKAAVAAAIARAKAKKAAQQAA 567


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.8 bits (75), Expect = 0.21
 Identities = 15/75 (20%), Positives = 40/75 (53%), Gaps = 8/75 (10%)

Query: 205  EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG-------EQTDSEKEAKK 257
            +K++G    +EK++ + +   +++K+   + + E ++EK+         ++ + E++  +
Sbjct: 1504 QKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563

Query: 258  RKKHKKDKKKRKKKS 272
                KK K K++ KS
Sbjct: 1564 SDI-KKRKNKKQYKS 1577



 Score = 32.8 bits (75), Expect = 0.25
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 180  KLKAKEKRKASPTSASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
                +EK  A      SD+  K   E     ++ + E++  +    KK K KK+ KS  E
Sbjct: 1522 NRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDI-KKRKNKKQYKSNTE 1580

Query: 239  EE 240
             E
Sbjct: 1581 AE 1582



 Score = 30.8 bits (70), Expect = 0.86
 Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 215  EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            EKE   + + + DK+KK         +EK   E+  +E + KKRK  K+ K  
Sbjct: 1527 EKEPAGQGELESDKEKKGNLESVLSNQEKNI-EEDYAESDIKKRKNKKQYKSN 1578



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 17/44 (38%), Positives = 25/44 (56%)

Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
           + E   S+   +K KK KK +KKK+++ K EE+   E  E  DS
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEAWDS 775



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 16/74 (21%)

Query: 214  SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK-------KHKK--- 263
            ++K  +K+K      +KK  + +   +EEKE   Q + E + +K+          +K   
Sbjct: 1499 NKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIE 1558

Query: 264  ------DKKKRKKK 271
                  D KKRK K
Sbjct: 1559 EDYAESDIKKRKNK 1572


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 32.7 bits (74), Expect = 0.22
 Identities = 20/76 (26%), Positives = 37/76 (48%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
           SDE DK+  +       D +K     +K    K +  + SK  E+ +    E+   + + 
Sbjct: 145 SDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKK 204

Query: 256 KKRKKHKKDKKKRKKK 271
           K++K+ +K++ K KKK
Sbjct: 205 KEKKEKEKERDKDKKK 220



 Score = 32.0 bits (72), Expect = 0.38
 Identities = 14/60 (23%), Positives = 30/60 (50%)

Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
           P    SD+  K EEE+   ++ + +K  KK+++ +K  +++      E +E+    +  D
Sbjct: 73  PGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132



 Score = 30.8 bits (69), Expect = 0.81
 Identities = 32/126 (25%), Positives = 53/126 (42%), Gaps = 45/126 (35%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKY------ 237
           +K + + TS S ++ D    EK+      S+K  KK KK K K++ K KKK         
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKK------SKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229

Query: 238 -------------------------------EEEEEKEEGEQTDSEKEAK-KRKKHKKDK 265
                                           E +E ++ E  +++K  K K+KK +K+K
Sbjct: 230 LALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEK 289

Query: 266 KKRKKK 271
           +++KKK
Sbjct: 290 EEKKKK 295



 Score = 28.9 bits (64), Expect = 3.1
 Identities = 16/56 (28%), Positives = 29/56 (51%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
           +A   E D+ ++ + E  +   + + KK++K K++KKKKKK   +         EQ
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQ 312



 Score = 28.9 bits (64), Expect = 3.1
 Identities = 12/40 (30%), Positives = 26/40 (65%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           E+E + R++ +KDK++KKK+ K +    +     T+S+++
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 32.4 bits (74), Expect = 0.22
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 6/64 (9%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEK------EEGEQTDSEKEAKKRKKHKKDKKK 267
           +  EA K  + +K+    KK+S   + +E        E ++T S++E  K +  K ++ K
Sbjct: 349 TAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELK 408

Query: 268 RKKK 271
           R + 
Sbjct: 409 RLEN 412



 Score = 31.2 bits (71), Expect = 0.53
 Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 3/68 (4%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK---EAKKR 258
             E  +E  +      AKK+    K K+   K S  E ++   + EQ   E    E  KR
Sbjct: 350 AYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKR 409

Query: 259 KKHKKDKK 266
            ++ K K+
Sbjct: 410 LENGKQKR 417


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 32.2 bits (73), Expect = 0.23
 Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 8/95 (8%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K+  + K   K + TS  + ++ K         +T S K AKK  K KK K  K    K 
Sbjct: 11  KTTEEKKPAAKKATTSKETAKTKKTA-------KTTSTKAAKKAAKVKKTKSVKTTTKKV 63

Query: 238 EEEEEKEEG-EQTDSEKEAKKRKKHKKDKKKRKKK 271
             + EK E  ++    K+  K++    +  +   K
Sbjct: 64  TVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNK 98


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 32.1 bits (74), Expect = 0.23
 Identities = 14/45 (31%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           +++ +E KEE +    +K+  K   HK  KKK+K+++K  + + K
Sbjct: 46  NEQYKEMKEELKAALLDKKELK-AWHKAQKKKEKQEAKAAKAKSK 89



 Score = 28.6 bits (65), Expect = 3.4
 Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 1/55 (1%)

Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKKK 271
             +RKK +K + +    ++  +E ++E       +KE K   K  KK +K+  K 
Sbjct: 29  LAQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKA 83



 Score = 28.3 bits (64), Expect = 4.2
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 7/46 (15%)

Query: 216 KEAKKR-KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           KE K+  K    DKK+ K   K ++++EK+E       K AK + K
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQE------AKAAKAKSK 89


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 31.3 bits (71), Expect = 0.24
 Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 5/77 (6%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K+K+     R  +      +++D EE+E++    +   +  KK +K     +++K   K 
Sbjct: 60  KAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARK-----RQRKVIRKL 114

Query: 238 EEEEEKEEGEQTDSEKE 254
            E EE+   +Q D E E
Sbjct: 115 LEAEEQLREDQYDDEDE 131


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 32.6 bits (74), Expect = 0.25
 Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 6/102 (5%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKKK 232
           +L L S++     +K  P      E D     K++  +T SEK + K ++KHK D++  K
Sbjct: 703 DLDLLSRIPGHPYKKGVP--PKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADK 760

Query: 233 ---KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
              KK + EE+        + S   +   K+ +K  + ++++
Sbjct: 761 IESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEE 802


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 32.7 bits (75), Expect = 0.25
 Identities = 16/70 (22%), Positives = 34/70 (48%)

Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
            T    + S+ EE E+E   + ++  E K  K+ ++   ++      EE+E+ ++ E  D
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281

Query: 251 SEKEAKKRKK 260
             +  K++K 
Sbjct: 282 KLEILKEKKD 291



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           +KKK+ K   Y EE+ K+E ++  +++E K+R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679



 Score = 28.4 bits (64), Expect = 4.4
 Identities = 14/62 (22%), Positives = 31/62 (50%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E+++ EEE +   +T SE +  K+++    ++     S+ +E+ +K E        + KK
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKK 290

Query: 258 RK 259
            +
Sbjct: 291 DE 292



 Score = 28.4 bits (64), Expect = 4.5
 Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 1/77 (1%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
           R  SP      +   E EE+EE    + E    + K  K++++   ++      EEKE+ 
Sbjct: 216 RIPSPFFTKKLKETSETEEREEETDVEIETT-SETKGTKQEQEGSTEEDPSLFSEEKEDP 274

Query: 247 EQTDSEKEAKKRKKHKK 263
           ++T+   + +  K+ K 
Sbjct: 275 DKTEDLDKLEILKEKKD 291


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 32.5 bits (74), Expect = 0.26
 Identities = 19/73 (26%), Positives = 31/73 (42%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           K  E R  +   A+ +      ++K E ++ + E+  KK KK KK         K  E+ 
Sbjct: 1   KVHETRSQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDA 60

Query: 242 EKEEGEQTDSEKE 254
            KE  E   + +E
Sbjct: 61  VKEFEEFCKAIEE 73


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.5 bits (75), Expect = 0.26
 Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 4/85 (4%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK----KRKKHKKDKKKKKKKSKYEEE 240
            +   +   +  +E D +E+E+EE ++ D    A       K  +K K   K+  K  + 
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKA 237

Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDK 265
           +EK+   +    K+  K ++  K++
Sbjct: 238 QEKKVEGRLAQHKKYAKLREKLKEE 262



 Score = 31.7 bits (73), Expect = 0.45
 Identities = 17/91 (18%), Positives = 37/91 (40%), Gaps = 3/91 (3%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
                   +  P    S+  + +++E EE E  + E +        +  +K  +K K   
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEE--EDENDDSLAADESELPEKVLEKFK-AL 227

Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
            ++ ++  +   +K   +  +HKK  K R+K
Sbjct: 228 AKQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258


>gnl|CDD|187777 cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily
           protein Cas7.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas7 is a RAMP superfamily protein; Subunit of the
           Cascade complex; also known as Cse4/CasC family.
          Length = 325

 Score = 32.1 bits (73), Expect = 0.27
 Identities = 17/79 (21%), Positives = 20/79 (25%), Gaps = 4/79 (5%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE----EGEQTDSE 252
            E  K   E    E       AK     KK   KK  K       E         +   E
Sbjct: 68  TELAKRLLELGYDEDIAEPDAAKIAYGLKKASGKKSDKLLLLSAPEAAWVARYAAELADE 127

Query: 253 KEAKKRKKHKKDKKKRKKK 271
            +A        DK+  K  
Sbjct: 128 FDAAAAADAVADKELDKNF 146


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 32.4 bits (73), Expect = 0.29
 Identities = 14/53 (26%), Positives = 31/53 (58%)

Query: 219  KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + R+KH+   +  K++ K +EE+E+ E  Q   E+E  K++K ++ + ++   
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIH 1062



 Score = 31.2 bits (70), Expect = 0.62
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 6/89 (6%)

Query: 155  LPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS 214
            LP+DK S  +   R+       L    + +EK +           D+EE+E+ E  Q   
Sbjct: 989  LPLDKRSIAVVSRRIE------LPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAK 1042

Query: 215  EKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
            E+E  K++K ++ + +K     Y+E  +K
Sbjct: 1043 EEEIGKKEKEREQRIRKTIHDNYKEMAKK 1071



 Score = 29.3 bits (65), Expect = 2.8
 Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 209  GEQTDSEKEAKKR--KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH---KK 263
              +   + E K R  K+  KD+++K++    +  +E+E G++    ++  ++  H   K+
Sbjct: 1008 PPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067

Query: 264  DKKKRKKK 271
              KKR KK
Sbjct: 1068 MAKKRLKK 1075



 Score = 27.4 bits (60), Expect = 9.6
 Identities = 16/71 (22%), Positives = 32/71 (45%)

Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
             A+ D  + EEE  ++ E+ + E    +      + ++ +  S   E EE  +    ++
Sbjct: 582 KDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYET 641

Query: 252 EKEAKKRKKHK 262
           E+E   RKK +
Sbjct: 642 EREENARKKEE 652


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 31.8 bits (72), Expect = 0.30
 Identities = 20/78 (25%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
            A S +   +E  K  G+Q +  +  +  +KHK+D++KK+++ +  +E+++  G    S 
Sbjct: 148 RALSRKKSDDEHRKRSGKQKEKRRV-EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSG 206

Query: 253 KEAKKRKKHKKDKKKRKK 270
            ++    K +  K+KR+K
Sbjct: 207 GQSGLSTKDEPPKEKRQK 224



 Score = 29.8 bits (67), Expect = 1.2
 Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 4/95 (4%)

Query: 182 KAKEKRKASPTSASSDESDK---EEEEKEEGEQTDSEKEAKKRKKHKKDKK-KKKKKSKY 237
           K +++ +A     S DE  K   +++EK   E +   KE +++K+ +K +  + K+    
Sbjct: 142 KFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGG 201

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
                 + G  T  E   +KR+KH   +++ + +S
Sbjct: 202 GGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 30.7 bits (70), Expect = 0.31
 Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           K+   K E E    +++ KKRK+  K ++   +  +   +E  E+      EK  +K ++
Sbjct: 65  KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEK------EKRTRKNRE 118

Query: 261 HKKDKKKRKKK 271
            KK K+++K+K
Sbjct: 119 -KKFKRRQKEK 128


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 31.5 bits (72), Expect = 0.34
 Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 200 DKEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           ++EEEEK +E E+    + AK R +  + K+K K+  K E+EE
Sbjct: 123 EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 30.0 bits (68), Expect = 0.96
 Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 9/69 (13%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYEEEEEKEEGEQTDSEKEA---- 255
           E+ E+EE E+ D E+  +  +  K    + + K+K+K  ++EE EE     +   A    
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRAANATALAAI 179

Query: 256 ---KKRKKH 261
              KK+K+ 
Sbjct: 180 GGRKKKKRR 188


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 32.1 bits (73), Expect = 0.34
 Identities = 21/96 (21%), Positives = 35/96 (36%), Gaps = 6/96 (6%)

Query: 183 AKEKRKASPTSASSDESDKEE---EEKEEGEQTDSEKEAKKRKKHKK---DKKKKKKKSK 236
               +K     AS D ++ +E   E  +E EQ++  K  KK K+  +    +  KK    
Sbjct: 12  KAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKP 71

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            E      E +     K A+  +      K    +S
Sbjct: 72  TESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTES 107



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 1/82 (1%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           EL    K K  +K K +    +     K  +  E    +  EK AK RK   +  +    
Sbjct: 40  ELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKS-AESTRSSHP 98

Query: 234 KSKYEEEEEKEEGEQTDSEKEA 255
           KSK    E +EE E  ++    
Sbjct: 99  KSKAPSTESEEEEEPEETPDPI 120


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 31.4 bits (72), Expect = 0.34
 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 13/84 (15%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
           +K K + +R              E++ K+     + EK  ++ +K K    +KK+K + +
Sbjct: 106 AKAKVQAQRA-------------EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152

Query: 239 EEEEKEEGEQTDSEKEAKKRKKHK 262
           + E +       SE    +  K K
Sbjct: 153 KPEPQHTPVSDISELTVGQAVKVK 176



 Score = 31.4 bits (72), Expect = 0.35
 Identities = 12/57 (21%), Positives = 30/57 (52%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +EAK + + ++ +++ KK+    E+E+    E+    K  +K++K +  K + +   
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 31.8 bits (73), Expect = 0.44
 Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 5/77 (6%)

Query: 181 LKAKEKRKASPTSASSDESDKE-----EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           +K  E+  ASP   +  E   +     E+E ++ +        ++++K K+ KK KKKK 
Sbjct: 355 VKLCEQAAASPEYDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKY 414

Query: 236 KYEEEEEKEEGEQTDSE 252
           K    +  +  ++   +
Sbjct: 415 KVPRGKIYKVLKEATRQ 431


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.4 bits (72), Expect = 0.48
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
           K  L  + K   S  + ++   ++EEEE+EE E+  SE+EA
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEA 321


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 31.2 bits (71), Expect = 0.50
 Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 20/91 (21%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
            K ++KR+A       +E D++E+E+EE    + E+E                    E E
Sbjct: 352 SKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEE--------------------EGE 391

Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           + +EEG Q+  +  ++       D + +  K
Sbjct: 392 DSEEEGSQSREDGSSESSSDVGSDSESKADK 422



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 12/72 (16%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
           D  + +E+E EE EQ   E E            +++ +   EE  +  E   ++S  +  
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHE------------EEEGEDSEEEGSQSREDGSSESSSDVG 413

Query: 257 KRKKHKKDKKKR 268
              + K DK+  
Sbjct: 414 SDSESKADKESA 425



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 3/59 (5%)

Query: 197 DESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
           DE ++EE+     EE E  DSE+E  + ++    +      S  E + +KE    +DSE
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.4 bits (71), Expect = 0.53
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 10/79 (12%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
               KA      SP+ AS +  ++E  E+EE E+ + E+E ++ ++ + +          
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE---------- 468

Query: 238 EEEEEKEEGEQTDSEKEAK 256
           +EEEE+E      SE+E +
Sbjct: 469 DEEEEEEVEADNGSEEEME 487


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 31.5 bits (72), Expect = 0.53
 Identities = 9/56 (16%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 193 SASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
               +ES K E++++ + +  D +   ++R   ++ ++++ ++ + E E    +G 
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQGT 189



 Score = 31.1 bits (71), Expect = 0.60
 Identities = 8/51 (15%), Positives = 27/51 (52%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
              +S S D+ D ++++ ++   T      ++R++ + ++K+ + +  Y +
Sbjct: 137 DEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQ 187



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 10/78 (12%), Positives = 30/78 (38%), Gaps = 23/78 (29%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
              ++ D +EE  EE E++                          E++E ++ +  D + 
Sbjct: 122 DLEEDDDDDEESDEEDEESSKS-----------------------EDDEDDDDDDDDDDI 158

Query: 254 EAKKRKKHKKDKKKRKKK 271
             ++R   ++ +++  ++
Sbjct: 159 ATRERSLERRRRRREWEE 176


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 30.5 bits (69), Expect = 0.54
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           +K E + TD +   +K +   +  +K +++     ++  E  EQ   +KE+ K+K
Sbjct: 139 DKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQV--QKESSKKK 191


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 29.6 bits (67), Expect = 0.54
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
           A+  +A++ E ++EEEE+EE E+ +SE+EA
Sbjct: 67  AAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 31.4 bits (70), Expect = 0.56
 Identities = 19/80 (23%), Positives = 42/80 (52%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+ + +  E+ K        + ++ + + ++E ++T+ EK+  +++K K      K   +
Sbjct: 151 LEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIE 210

Query: 237 YEEEEEKEEGEQTDSEKEAK 256
            E+E++K E E+ D  KE K
Sbjct: 211 LEQEKQKTENEKQDLIKEQK 230



 Score = 30.6 bits (68), Expect = 0.88
 Identities = 18/75 (24%), Positives = 40/75 (53%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E  K E+EK++ E+   E    + K  ++ +K +++K K E+E++K       +  E ++
Sbjct: 154 EEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQ 213

Query: 258 RKKHKKDKKKRKKKS 272
            K+  +++K+   K 
Sbjct: 214 EKQKTENEKQDLIKE 228


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 30.9 bits (70), Expect = 0.57
 Identities = 16/67 (23%), Positives = 32/67 (47%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
            +  K E  +E+  + +  KE  K K+   + ++ +K+   E+ EE  E +  + + E  
Sbjct: 5   CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64

Query: 257 KRKKHKK 263
           K K+  K
Sbjct: 65  KLKEENK 71



 Score = 28.6 bits (64), Expect = 3.0
 Identities = 13/51 (25%), Positives = 26/51 (50%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
           ++  KEE+ + E  + +   E  +     K ++ K + +K +EE +K E E
Sbjct: 26  EDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 31.1 bits (71), Expect = 0.58
 Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 19/57 (33%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           EK  K  KK +++KK +K+K K                   KKRKK  K +KK+ +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKK-------------------KKRKKKGKKRKKKGRK 414



 Score = 31.1 bits (71), Expect = 0.69
 Identities = 13/46 (28%), Positives = 29/46 (63%)

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +K+ ++ K KY +  +K+  E+   +++ KK++K K  K+K+K + 
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 28.4 bits (64), Expect = 3.9
 Identities = 14/42 (33%), Positives = 27/42 (64%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           EE KE+  +   +K  +K+ + +K KKK+KKK K  +++ ++
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 28.4 bits (64), Expect = 4.0
 Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 15/52 (28%)

Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K K  K  KKK+++K                 +K  KK+K+ KK KK++KK 
Sbjct: 376 KEKYPKPPKKKREEKKP---------------QKRKKKKKRKKKGKKRKKKG 412


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 31.3 bits (71), Expect = 0.60
 Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 2/69 (2%)

Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKK--KKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
           E         + ++AKK    KK  K++   KK  +E +      ++   +   K  +K 
Sbjct: 728 ERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKA 787

Query: 262 KKDKKKRKK 270
           KK   K +K
Sbjct: 788 KKPSAKTQK 796



 Score = 29.3 bits (66), Expect = 2.2
 Identities = 15/75 (20%), Positives = 33/75 (44%), Gaps = 7/75 (9%)

Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKK-------KKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
            +E+ ++ D+ K+  KR++  K          + +KK+K +  ++     +  S K  K 
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKI 797

Query: 258 RKKHKKDKKKRKKKS 272
               K  +  +KK +
Sbjct: 798 AAATKAKRAAKKKVA 812



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 13/62 (20%), Positives = 24/62 (38%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
             + +  +K    P SA   E   + +  ++  +   +  AK +K     K K+  K K 
Sbjct: 752 GKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811

Query: 238 EE 239
            E
Sbjct: 812 AE 813



 Score = 28.2 bits (63), Expect = 5.4
 Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 7/74 (9%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKH-------KKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           +E+ +K +  +   ++    +K +       + +KK K K +K +  + K+   +T    
Sbjct: 739 REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIA 798

Query: 254 EAKKRKKHKKDKKK 267
            A K K+  K K  
Sbjct: 799 AATKAKRAAKKKVA 812



 Score = 27.8 bits (62), Expect = 7.0
 Identities = 18/89 (20%), Positives = 26/89 (29%), Gaps = 20/89 (22%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           K K+  +           ++   E       EK+AK +   K  +K KK           
Sbjct: 741 KAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPS--------- 791

Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
                      AK +K     K KR  K 
Sbjct: 792 -----------AKTQKIAAATKAKRAAKK 809


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.2 bits (71), Expect = 0.62
 Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 5/91 (5%)

Query: 183 AKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
             E  K+   S ++    K+ ++ K      D +    K      +  K       E+ E
Sbjct: 352 KIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTAL----EKAE 407

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K+  E+   E   +   K KK K+K+K+  
Sbjct: 408 GKKAIEEIREELIEEGLLKSKKKKRKKKEWF 438


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 31.1 bits (71), Expect = 0.62
 Identities = 7/36 (19%), Positives = 25/36 (69%)

Query: 212 TDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
                +++K+K+ KK ++K+ K+ + +++++++E +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 31.0 bits (70), Expect = 0.63
 Identities = 21/96 (21%), Positives = 44/96 (45%), Gaps = 6/96 (6%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-----KDKKKK 231
           L S+L+  EK + +       + + +  E    E  DSE +  K+++ +     ++ +K+
Sbjct: 34  LDSELRDAEKERDT-YKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKE 92

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
                 E  E +EE EQ ++E+    R+ +  D+  
Sbjct: 93  DDDLDGELVELQEEKEQLENEELQYLREYNLFDRNN 128


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.7 bits (70), Expect = 0.64
 Identities = 26/56 (46%), Positives = 37/56 (66%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           K+A KRK+ +K K KK+ K + E+ E+K+   Q   E+  KKRK  KK+KKK+K K
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAK 200



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 30/67 (44%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
           VK +E +LK  LK KEK+K        +  +K E++K E  Q   E+  KKRK  KK+KK
Sbjct: 137 VKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAE-RQKKREENLKKRKDDKKNKK 195

Query: 230 KKKKKSK 236
           KKK K K
Sbjct: 196 KKKAKKK 202



 Score = 28.0 bits (63), Expect = 4.9
 Identities = 16/51 (31%), Positives = 32/51 (62%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
           E+  +KR++ K  KK+K+K++K +E+ +K E E+  +E+   K+K    + 
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIEN 59


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 30.9 bits (70), Expect = 0.64
 Identities = 14/52 (26%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           K   +Q  S+ +  +R + K++KK ++ + + ++  EK E        EAKK
Sbjct: 11  KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE--------EAKK 54


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 30.7 bits (70), Expect = 0.66
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKKK 271
           EAK RKK +  K+ +K K K E   E E+  + +  K+ KK  KK KK KKKRKK 
Sbjct: 115 EAKARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKT 170


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.8 bits (70), Expect = 0.66
 Identities = 23/96 (23%), Positives = 46/96 (47%), Gaps = 14/96 (14%)

Query: 177 LKSKLKAKEKRKASPTSA---SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           L+ K+KA + +              SD + + K + E+ + + +A+ ++K K+ KK++KK
Sbjct: 282 LQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341

Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           K + E  EE+            +K +    DK++ K
Sbjct: 342 KKQIERLEER-----------IEKLEVQATDKEENK 366



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 15/70 (21%), Positives = 37/70 (52%), Gaps = 5/70 (7%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           K + ++ +      E  +  ++K K   ++  +K   E +E+K+E      +KE KK+K+
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKE-----KKKEEKKKKQ 344

Query: 261 HKKDKKKRKK 270
            ++ +++ +K
Sbjct: 345 IERLEERIEK 354



 Score = 27.7 bits (62), Expect = 8.2
 Identities = 13/58 (22%), Positives = 24/58 (41%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           EK  +K K  K   K+ KK     E     + +     +   ++   +  +KK++KK 
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK 337


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 31.1 bits (70), Expect = 0.67
 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           A  +  +      +  ++ D EK+ +K +   K+K   KK+    +  +    EQ+  E 
Sbjct: 2   AYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE- 60

Query: 254 EAKKRKKHKKDKKKRKKK 271
             K   + KK  K+  K+
Sbjct: 61  NLKIADEVKKSTKEESKQ 78



 Score = 27.6 bits (61), Expect = 9.7
 Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 13/84 (15%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----DKKKKKKKSKYEEEEEKEEGEQ 248
           A+ + +D+ +E+ +E +   +  E K++   KK     D      K +  +E  K   E 
Sbjct: 9   ANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEV 68

Query: 249 TDSEKE--------AKKRKKHKKD 264
             S KE         K +++H+K+
Sbjct: 69  KKSTKEESKQLLEVLKTKEEHQKE 92


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 30.0 bits (68), Expect = 0.69
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
           S SSDE ++EE E+EE E  D E  +K + +
Sbjct: 52  SESSDEEEEEEAEEEEAETGDVEAGSKSQSR 82


>gnl|CDD|240061 cd04710, BAH_fungalPHD, BAH, or Bromo Adjacent Homology domain, as
           present in fungal proteins containing PHD domains. BAH
           domains are found in a variety of proteins playing roles
           in transcriptional silencing and the remodeling of
           chromatin. It is assumed that in most or all of these
           instances the BAH domain mediates protein-protein
           interactions.
          Length = 135

 Score = 29.6 bits (67), Expect = 0.72
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 10/42 (23%)

Query: 5   RPRCYFDVEISDIPLG--RIVF-ELYSDICPITCENFRALCT 43
           RPR     +IS   +   R+++  ++SDICPI   + R  CT
Sbjct: 66  RPR-----DISRRVVADSRLLYASMHSDICPI--GSVRGKCT 100


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.2 bits (70), Expect = 0.77
 Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 4/91 (4%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKS 235
           + K  A    +ASPT+     S +           DS+ E+ K+  KK K++     K +
Sbjct: 21  RKKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKIKEEATSPLKST 80

Query: 236 KYEEEEEKEEGEQTD--SEKEAKKRKKHKKD 264
           K + E+   + E+ +  + K++K ++  + +
Sbjct: 81  KRQREKPASDTEEPERVTAKKSKTQELSRPN 111



 Score = 30.4 bits (68), Expect = 1.2
 Identities = 17/50 (34%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 214 SEKEAKKRKK-HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           S K AKKR++  +K K++ ++K++ E E EKE+ ++ + E+E +  +  K
Sbjct: 574 SSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 29.3 bits (65), Expect = 2.7
 Identities = 18/63 (28%), Positives = 37/63 (58%), Gaps = 3/63 (4%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           ASS  + K EE  E+ ++   E E K R++ +++K+K+K++ +  E E +   + + S  
Sbjct: 573 ASSKLAKKREEAVEKAKR---EAEQKAREEREREKEKEKEREREREREAERAAKASSSSH 629

Query: 254 EAK 256
           E++
Sbjct: 630 ESR 632


>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein.  Members of this family are
           part of the Paf1/RNA polymerase II complex. The Paf1
           complex probably functions during the elongation phase
           of transcription. The Leo1 subunit of the yeast
           Paf1-complex binds RNA and contributes to complex
           recruitment. The subunit acts by co-ordinating
           co-transcriptional chromain modifications and helping
           recruitment of mRNA 3prime-end processing factors.
          Length = 312

 Score = 30.6 bits (69), Expect = 0.77
 Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 14/81 (17%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE----------EKEEGE 247
           E +K E EK+E ++      A++R+++++  K K                       E E
Sbjct: 236 EHEKREREKKEEQKL----RARRRRQNREKMKNKPPNRPGHGSGSDSNVAKAATTYSEDE 291

Query: 248 QTDSEKEAKKRKKHKKDKKKR 268
              S+ E +K K+ KK+   R
Sbjct: 292 DEGSDCETRKAKEVKKEGAGR 312


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 30.8 bits (70), Expect = 0.77
 Identities = 19/121 (15%), Positives = 40/121 (33%), Gaps = 17/121 (14%)

Query: 151 HIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGE 210
            ++ L + +    ++   +   NE     K + K +           E  + EEE +   
Sbjct: 818 RLKALRIQREEMLMRPEELELINEEQKNLKQEIKLE---------LSEIQEAEEEIQN-- 866

Query: 211 QTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
              +E + +  +     KK  K     +   +         ++  +  K  KK KK  +K
Sbjct: 867 --INENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISS----DEIKQDEKTTKKKKKDLEK 920

Query: 271 K 271
            
Sbjct: 921 T 921


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 30.5 bits (69), Expect = 0.79
 Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD----KKKKKKKSKYEEEEEKEEGEQTDS 251
            +E++K EE +E+ ++   + E   ++  K      KK  K K  YE++  +++  +   
Sbjct: 84  REEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAEQAY 143

Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
           EK +   +  + +K K+K   
Sbjct: 144 EKSSSGAQPKEAEKLKKKAAQ 164


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 31.1 bits (70), Expect = 0.80
 Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 10/83 (12%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK-------KKKKKSKYEEEEEKEEGEQ 248
            ++SD E E+  + ++   E E  +    K   K       K+++K+  EE + + +  Q
Sbjct: 385 EEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLEEYDYRVKLLQ 444

Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
               +E  KR    K  KK  K+
Sbjct: 445 KKQWREELKR---MKMMKKFGKE 464



 Score = 29.2 bits (65), Expect = 2.7
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 5/68 (7%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK-----YE 238
           K   KA     S ++     EE +   +   +K+ ++  K  K  KK  K+       Y 
Sbjct: 414 KPLTKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYS 473

Query: 239 EEEEKEEG 246
           EE ++E G
Sbjct: 474 EEVDEENG 481


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 29.6 bits (67), Expect = 0.81
 Identities = 16/49 (32%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           S  + D E+EE +   ++  E+  ++RKK ++++K+K++  KY EE  K
Sbjct: 52  SDGKYDDEDEEADRIYESIDERMDERRKK-RREQKEKEEIEKYREENPK 99


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 28.9 bits (65), Expect = 0.85
 Identities = 16/61 (26%), Positives = 25/61 (40%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
            + +  K   K K  K DKK+   K K +E + K+ G         +   +  K KK +K
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 270 K 270
            
Sbjct: 62  D 62



 Score = 28.2 bits (63), Expect = 1.7
 Identities = 14/45 (31%), Positives = 20/45 (44%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
             K ++      QT   +   ++ K KKDKK  KK  K+E    K
Sbjct: 31  EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75



 Score = 27.4 bits (61), Expect = 3.2
 Identities = 13/63 (20%), Positives = 24/63 (38%)

Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKD 264
            K+   +T+ +++  K  K + D K K K+ K ++                K + K  K 
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 265 KKK 267
             K
Sbjct: 62  DCK 64


>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 230

 Score = 30.3 bits (68), Expect = 0.87
 Identities = 16/84 (19%), Positives = 36/84 (42%), Gaps = 4/84 (4%)

Query: 162 RPLQDVRVVKCNELVLKSKL-KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
            P+  V +  C+++ L  ++ +A   R  +     +     E E+ +  E+ + E+E   
Sbjct: 150 TPMDPVTLGLCSQIDLDKEVERASRGRPTAKELGPTLLPASETEQLDYEEEREEERELDA 209

Query: 221 RKKHKKDKKKKKKKSKYEEEEEKE 244
                   K    KS  ++EE+++
Sbjct: 210 DAVF---AKLSSLKSGNKDEEDED 230


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.1 bits (68), Expect = 0.89
 Identities = 14/58 (24%), Positives = 35/58 (60%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
            +S  S SSD+   +++ ++E      E E  K+++ ++ ++++++K+  EE+  +EE
Sbjct: 123 DSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 17/104 (16%), Positives = 44/104 (42%), Gaps = 9/104 (8%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK---KKKKK 233
            +++   K K++       +D+S   +   E  E  D E E K+++  +  +        
Sbjct: 64  EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123

Query: 234 KSKYEEEEEKEEGEQTDSEKEA------KKRKKHKKDKKKRKKK 271
            S   +  + +  +    ++ A      +K KK + ++K+R+++
Sbjct: 124 SSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEE 167



 Score = 28.5 bits (64), Expect = 2.8
 Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 6/65 (9%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           S   SD +  + +  +    ++ A   ++ +K KK++       EE+E+EE E+   E++
Sbjct: 122 SDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKER------AEEKEREEEEKAAEEEK 175

Query: 255 AKKRK 259
           A++ +
Sbjct: 176 AREEE 180


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 30.7 bits (69), Expect = 0.94
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 5/53 (9%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
             DESDK+++  +E  Q   EK+   +K      KKK+++ +  EE+   +G+
Sbjct: 1   MVDESDKKQQTIDEQSQISPEKQTPNKK-----DKKKEEEEQLSEEDAMLKGD 48


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.9 bits (68), Expect = 0.95
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 10/75 (13%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE------ 254
           + E+ K E EQ   E +  + +K + +K+  + ++K E  E++EE E+   EK       
Sbjct: 117 QAEQGKSELEQ---EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173

Query: 255 -AKKRKKHKKDKKKR 268
             KK+ +  K + ++
Sbjct: 174 FLKKQNQQLKSQLEQ 188



 Score = 27.6 bits (62), Expect = 5.2
 Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 2/63 (3%)

Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           + EQ  SE E + +K   +++K++ +K   E E + E  E+ + E+   + K+H  +   
Sbjct: 117 QAEQGKSELEQEIKKL--EEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAF 174

Query: 268 RKK 270
            KK
Sbjct: 175 LKK 177


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.4 bits (68), Expect = 0.96
 Identities = 21/79 (26%), Positives = 34/79 (43%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L  K     K+      +  DE ++EEEE+EE E  ++E+       +  +       S 
Sbjct: 118 LPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSG 177

Query: 237 YEEEEEKEEGEQTDSEKEA 255
            +  EE EE   T++E E 
Sbjct: 178 GDNGEEGEEESVTEAEAEG 196


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.1 bits (63), Expect = 0.98
 Identities = 16/37 (43%), Positives = 23/37 (62%)

Query: 226 KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
             K K KKK    EEE++EE E+    +E K+R+K+K
Sbjct: 19  LAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 30.2 bits (68), Expect = 1.00
 Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 24/97 (24%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ--------- 248
           E  KE+E + +  + +  K   K  +   D + +K+K + +    KE  E+         
Sbjct: 110 EQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEAREHF 169

Query: 249 ---------------TDSEKEAKKRKKHKKDKKKRKK 270
                             EKE KK+ K  K ++K +K
Sbjct: 170 GYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEK 206



 Score = 27.5 bits (61), Expect = 6.0
 Identities = 16/73 (21%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK-EEGEQT--DSEKEAKKR 258
           EE E+EE E   S  +  +  + ++ +K+ +++++  E  +   +  Q   D   + +KR
Sbjct: 88  EELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKR 147

Query: 259 KKHKKDKKKRKKK 271
           ++  +  K+RK++
Sbjct: 148 EQKARAAKERKER 160



 Score = 27.1 bits (60), Expect = 8.3
 Identities = 13/72 (18%), Positives = 33/72 (45%), Gaps = 1/72 (1%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG-EQTDSEKEAKKRK 259
            E+ E+ E E+ +      +  +  ++++K+K+  +   E E  +   +        + +
Sbjct: 84  AEKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQ 143

Query: 260 KHKKDKKKRKKK 271
           K K+++K R  K
Sbjct: 144 KRKREQKARAAK 155


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.3 bits (68), Expect = 1.0
 Identities = 14/67 (20%), Positives = 31/67 (46%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           D EEEE+ E      E+E ++       +  ++   + EEE E  +  +   + E ++  
Sbjct: 175 DNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERID 234

Query: 260 KHKKDKK 266
           K + +++
Sbjct: 235 KKQGEEE 241



 Score = 28.4 bits (63), Expect = 3.7
 Identities = 16/72 (22%), Positives = 34/72 (47%), Gaps = 4/72 (5%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E ++EE E  +  +   + E ++  K + ++++ +++     E E EE    +S  E   
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEE----ESPSEEVP 265

Query: 258 RKKHKKDKKKRK 269
           R   +   KK+K
Sbjct: 266 RNNEESPAKKQK 277


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 29.7 bits (66), Expect = 1.1
 Identities = 15/83 (18%), Positives = 48/83 (57%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
           A+    ++  +D EE  +   E+    +E +++++ ++ +++++ + + EE + +   E+
Sbjct: 9   AALGKPTAGTTDAEEATRLLAEKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEER 68

Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
              E+EA+++++ +  +K+ K K
Sbjct: 69  LRREEEARRQEEERAREKEEKAK 91



 Score = 28.5 bits (63), Expect = 2.4
 Identities = 17/71 (23%), Positives = 46/71 (64%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +E+E +E+ EQ   E+E  KR+  ++  +++++  + EEE  +E+ E+   + E +++++
Sbjct: 42  EEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQE 101

Query: 261 HKKDKKKRKKK 271
            ++ ++ +K+K
Sbjct: 102 QEEQERIQKQK 112


>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
           (TLP20).  This family consists of several
           Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
           sequences. The function of this family is unknown but
           TLP20 is known to shares some antigenic similarities to
           the smooth muscle protein telokin although the amino
           acid sequence shows no homologies to telokin.
          Length = 162

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 5/65 (7%)

Query: 169 VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK-----EAKKRKK 223
           V+K    + K      +K  ++P    S E + +E+E++  ++   E           KK
Sbjct: 94  VLKKGGCIFKIVYWNSKKFVSAPVPHHSSEDENDEDEEDNADRAGIESGIDDSAPPSPKK 153

Query: 224 HKKDK 228
            K D+
Sbjct: 154 QKLDE 158


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 30.2 bits (68), Expect = 1.2
 Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSK----YEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           S K+ KK+K  +K KKK ++ SK     E  EE  EG   +   E  + K  KK
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.0 bits (67), Expect = 1.2
 Identities = 18/90 (20%), Positives = 49/90 (54%), Gaps = 2/90 (2%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           + KE +  S   + S+ S + +E+     +T  E+E ++ ++ +++ ++ +  +K E++ 
Sbjct: 80  RQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKN 139

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +  + E  + +KE K+ +  +++K KR   
Sbjct: 140 DWRDAE--ECQKEEKEPEPEEEEKPKRGSL 167



 Score = 29.6 bits (66), Expect = 2.1
 Identities = 19/83 (22%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 184 KEKRKASPTSASSDESDKEEEEK----EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           K K+K    +   +E  K+ EE+    EE EQ   ++EA ++ + +++K++ K++ +   
Sbjct: 204 KLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRR 263

Query: 240 EEEKEEGEQTDSEKEAKKRKKHK 262
            E  E+ ++   +  ++ +K  K
Sbjct: 264 AEAAEKRQKVPEDGLSEDKKPFK 286



 Score = 28.5 bits (63), Expect = 4.5
 Identities = 19/86 (22%), Positives = 45/86 (52%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           K K   +  S    E  + E  KE  +    ++EA    +  K K+++++K   EEE+ +
Sbjct: 178 KLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRR 237

Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRK 269
           ++ E     +E +++++ K++ ++R+
Sbjct: 238 KQEEADRKSREEEEKRRLKEEIERRR 263


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 14/72 (19%), Positives = 44/72 (61%), Gaps = 1/72 (1%)

Query: 201 KEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           +EE+E E  E+ + +K+ ++   +  +++ ++K++ K  E EE+ +  +   EK  ++ +
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEE 168

Query: 260 KHKKDKKKRKKK 271
           +  + ++++++K
Sbjct: 169 REAERRERKEEK 180


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 215 EKEAKKRKKHKKDKKKKKKK-----SKYEEEEEKEEGEQTDSEKEAKKR 258
            K AKKR   K  +KKK+K       KY +E    E ++   ++EAKK+
Sbjct: 14  AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKK 62


>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
          Length = 373

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 13/53 (24%), Positives = 23/53 (43%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
              L  +E R+ + T     + D+E   +   E ++ E +AK   + K   KK
Sbjct: 6   DELLLLEEDRQPAATDDLDSDEDEESSSENADELSEKEAKAKALARIKIGPKK 58


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 9/92 (9%)

Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD-----SEKEAKKRK--- 222
           K ++L    K K K+K+K        D+ D   + K   +Q       S     K K   
Sbjct: 65  KTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEK 124

Query: 223 -KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
            K K    K   K K +    K+E  + D+ K
Sbjct: 125 LKKKITVNKSTNKKKKKVLSSKDELIKYDNNK 156



 Score = 29.8 bits (67), Expect = 1.6
 Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 4/75 (5%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
             +K +++ ++  +TD  K++KK     K K+KKK K K   +++ +    + +  +   
Sbjct: 52  LHEKLDKKNKKFNKTDDLKDSKK----TKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107

Query: 258 RKKHKKDKKKRKKKS 272
                   +K K K+
Sbjct: 108 GPLAISLMRKPKPKT 122


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 29.4 bits (66), Expect = 1.3
 Identities = 18/91 (19%), Positives = 41/91 (45%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           K  E  K        +E +++EEE EE E  + E+E  + ++ +++ ++     K  E++
Sbjct: 38  KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              +   +  +  A+        K ++ KK+
Sbjct: 98  NINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128



 Score = 26.7 bits (59), Expect = 9.9
 Identities = 14/63 (22%), Positives = 34/63 (53%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
             E++  ++EK+E ++ + E++ ++ ++ +  + +++     EEEEE EE      + E 
Sbjct: 37  IKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEK 96

Query: 256 KKR 258
           K  
Sbjct: 97  KNI 99


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 21/75 (28%), Positives = 42/75 (56%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
            +E +K E+E +E E+   E E  +++    +  K+K + K  E EE+ E  + + E+  
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELE 279

Query: 256 KKRKKHKKDKKKRKK 270
           +K K+ K+ K+K ++
Sbjct: 280 EKVKELKELKEKAEE 294


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 9/70 (12%)

Query: 177 LKSKLKAKEKRKASPTSASSD--ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           L    KA EKRK +P +  +      K+ +++   E T++  E       K++++K++KK
Sbjct: 22  LPPLDKAPEKRKTNPITGVAQYLSEFKDYKDEPPPEPTETWLE-------KREREKREKK 74

Query: 235 SKYEEEEEKE 244
            K E++ E+E
Sbjct: 75  EKLEKKLEEE 84


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 30.2 bits (69), Expect = 1.4
 Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)

Query: 197 DESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           D  ++ E+ + + E+ D +  KEA +  K  + +KKK+ K+   +E   + G++ D E+
Sbjct: 167 DSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225



 Score = 27.5 bits (62), Expect = 9.3
 Identities = 13/49 (26%), Positives = 26/49 (53%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
           D+S +E E+ E  ++ + EK AK+  +  K  + +KKK     + ++  
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 15/78 (19%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KKKSKYEEEEE 242
           ++    T+ + +    E  EK   +    +   ++R++ + D+K++   + K+   EE+ 
Sbjct: 644 RQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQS 703

Query: 243 KEEGEQTDSEKEAKKRKK 260
            +E EQ +  ++ + R+K
Sbjct: 704 VQETEQEERVQQVQPRRK 721


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 15/66 (22%), Positives = 30/66 (45%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
           ++ D E       E+T+ +KE    ++ KK   +K K+   +EE + +   +  +E    
Sbjct: 32  EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91

Query: 257 KRKKHK 262
           K  K +
Sbjct: 92  KATKKQ 97



 Score = 27.7 bits (62), Expect = 4.3
 Identities = 10/48 (20%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 227 DKKKKKKKSKYEEEEEKEEGE--QTDSEKEAKKRKKHKKDKKKRKKKS 272
           +KK  +  + Y +EE + + E    + +K+   +K  +K +++  K  
Sbjct: 32  EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79


>gnl|CDD|72638 pfam09222, Fim-adh_lectin, Fimbrial adhesin F17-AG, lectin
          domain.  Members of this family are
          carbohydrate-specific lectin domains found in bacterial
          fimbrial adhesins. They adopt a compact, elongated
          structure consisting of a beta-sandwich with two major
          sheets: one consisting of five long strands in mixed
          orientations, and a front sheet with four antiparallel
          strands, forming an immunoglobin-like fold.
          Length = 170

 Score = 29.3 bits (65), Expect = 1.5
 Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 3/46 (6%)

Query: 52 TGKNLHYLNIVFHRVVKGFMI---QSGDFSVGNGTGGESIYGGTFE 94
          TG N+ Y N    R+  GF +      D  V     G+SIYG T E
Sbjct: 33 TGYNIGYQNANVWRISGGFCVGLDGKVDLPVVGSLDGQSIYGLTEE 78


>gnl|CDD|238558 cd01138, FeuA, Periplasmic binding protein FeuA.  These proteins
           have predicted to function as initial receptors in ABC
           transport of metal ions in some eubacterial species.
           They belong to the TroA superfamily of periplasmic metal
           binding proteins that share a distinct fold and ligand
           binding mechanism. A typical TroA protein is comprised
           of two globular subdomains connected by a single helix
           and can bind their ligands in the cleft between these
           domains.
          Length = 248

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 6/53 (11%)

Query: 188 KASPT-SASSDESDKEEEEKEEGEQTDSEKEAKK-----RKKHKKDKKKKKKK 234
           K +PT   S + SD EE+ KE G+  + E EA+K     ++K K+ K+K KKK
Sbjct: 85  KIAPTVPVSYNSSDWEEQLKEIGKLLNKEDEAEKWLADYKQKAKEAKEKIKKK 137


>gnl|CDD|235089 PRK02946, aceK, bifunctional isocitrate dehydrogenase
           kinase/phosphatase protein; Validated.
          Length = 575

 Score = 29.8 bits (68), Expect = 1.7
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 20  GRIVFELYSDICPITCENFRAL 41
           GR+VF  Y +IC +T  NFR +
Sbjct: 470 GRVVFYDYDEICYLTECNFRDI 491


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 7/48 (14%), Positives = 13/48 (27%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
               +E +     +    KKKK        E      E   +    ++
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 10/58 (17%), Positives = 26/58 (44%)

Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
           D E   +K K  +  K+K  ++   ++++  +  +  D E   +K K   +  + ++ 
Sbjct: 5   DREPIGRKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGKRG 62


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 6/55 (10%)

Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           E + R++++++ K+      YE E+E  E E+   EK+AKK  K  KD++   ++
Sbjct: 67  EEELREEYEEELKE------YEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           ++E  E+ E E  + E E +  +  KK  +KK KK   ++ +++E   +   E EA++
Sbjct: 67  EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKK-AIKKGKDEEALAEELLELEAEE 123


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 29.8 bits (67), Expect = 1.8
 Identities = 17/74 (22%), Positives = 34/74 (45%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E ++   +  E E  +  +   +      D  +   + + E+EE+ EE E     + +K+
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 258 RKKHKKDKKKRKKK 271
            K   K+ KK+KK+
Sbjct: 61  EKARLKELKKQKKQ 74



 Score = 29.8 bits (67), Expect = 2.1
 Identities = 16/62 (25%), Positives = 33/62 (53%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
           A+     S       E + E+EE +E  ++ ++ E  KR+K +  + KK+KK + ++  E
Sbjct: 22  ARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILE 81

Query: 243 KE 244
           ++
Sbjct: 82  QQ 83


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 28.7 bits (64), Expect = 1.9
 Identities = 22/69 (31%), Positives = 42/69 (60%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           E+E  E +   S+++ K  KK K D+KK+  K +  E+ EKE  ++    ++ +  KK +
Sbjct: 51  EKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQ 110

Query: 263 KDKKKRKKK 271
           K++++R+KK
Sbjct: 111 KERERRRKK 119


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 29.3 bits (65), Expect = 1.9
 Identities = 18/82 (21%), Positives = 43/82 (52%), Gaps = 3/82 (3%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
           K+K+         +++ +E EE EE  + +  +E    ++ ++++++++++   EE+ + 
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175

Query: 244 EEG---EQTDSEKEAKKRKKHK 262
            E    EQ +SE E+   K  K
Sbjct: 176 SEHEIIEQDESETESDDDKTEK 197



 Score = 28.9 bits (64), Expect = 2.1
 Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
           +K  + +K        ++ +K  +E+EE E    E+   +R++ K D+++ +++ + E E
Sbjct: 110 VKKDDIKKKKSLIIRQEQIEKARQEREELE----ERMEWERREEKIDEREDQEEQERERE 165

Query: 241 EEKEEGEQTDSEKE 254
           E+  E +  DSE E
Sbjct: 166 EQTIEEQSDDSEHE 179


>gnl|CDD|115441 pfam06783, UPF0239, Uncharacterized protein family (UPF0239). 
          Length = 85

 Score = 27.7 bits (61), Expect = 1.9
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           S E++ E  E E  E T+ E  A   + HK  K++KKK+
Sbjct: 47  SHEAEAENSEPESAEVTEKEGAAVPSRLHKIKKEEKKKR 85


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 28.0 bits (63), Expect = 2.0
 Identities = 15/51 (29%), Positives = 28/51 (54%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           EQ + E + K  + +   KKKKK+K+K + +E        ++ K+  ++KK
Sbjct: 31  EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKK 81



 Score = 26.1 bits (58), Expect = 8.4
 Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           EE+EEKE  ++ D    + K+KK +K KKKR +  
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 29.5 bits (65), Expect = 2.0
 Identities = 15/73 (20%), Positives = 38/73 (52%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           +++++E++  EQ  +++   ++ +HK  K++ +K+ K  E   +E+     + K+    K
Sbjct: 147 NEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREKDRERARREDAAAAAAAKQKAAAK 206

Query: 260 KHKKDKKKRKKKS 272
           K      K+  K+
Sbjct: 207 KAAAPSGKKSAKA 219


>gnl|CDD|220521 pfam10014, BsmA, Biofilm formation and stress response factor.
           YjfO-BsmA (biofilm stress and motility) is a protein
           that is expressed at high levels in biofilms. It is
           found to be up-regulated during biofilm formation and pH
           and oxidative stress.
          Length = 194

 Score = 29.1 bits (66), Expect = 2.0
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 10/60 (16%)

Query: 124 ITTQPAP---HLDGVHVVFGHVVSGQNV---VSHIENLPVDKMSRPLQDVRVVKCNELVL 177
              +P P   H DGV  VF H+V   N+    + I +         L    +++  + +L
Sbjct: 108 ELGEPTPEGLHQDGVDFVFVHLVGRHNIEGGETRIYDDD----GEILFRFTLLEPLDTLL 163


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 29.5 bits (66), Expect = 2.1
 Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 1/58 (1%)

Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
           PT    +E  + E E +EG     E EA  R+    +++  +       +E  ++ E 
Sbjct: 220 PTENEQEEQGEGEGEGQEGSAPQ-ESEATDRESESGEEEMVQSDQDDLPDESDDDSET 276



 Score = 28.4 bits (63), Expect = 4.4
 Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 18/105 (17%)

Query: 157 VDKMSRPLQD----VRVVKCNELVLKSKLKAKEKRKASPTSASSDESD---KEEEEKEEG 209
           +D++S  + D     RVV+    +L+S   A+E    + +    D  D    E E++E+G
Sbjct: 173 LDRLSAAIDDQQAFARVVR---EMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQG 229

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
           E     +E            ++ + +  E E  +EE  Q+D +  
Sbjct: 230 EGEGEGQEGSA--------PQESEATDRESESGEEEMVQSDQDDL 266


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.0 bits (65), Expect = 2.2
 Identities = 15/79 (18%), Positives = 29/79 (36%), Gaps = 2/79 (2%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYE 238
            + KE +  +     S      ++   E E+  +E  A+   + K         +++   
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAA 166

Query: 239 EEEEKEEGEQTDSEKEAKK 257
           EEEE EE    ++  E   
Sbjct: 167 EEEEAEEAPAEEAPAEESA 185


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.5 bits (64), Expect = 2.3
 Identities = 20/72 (27%), Positives = 35/72 (48%)

Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
           VL+S L  +E R         +ES +EEEE  E E  + E+E ++    +++++      
Sbjct: 59  VLESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYD 118

Query: 236 KYEEEEEKEEGE 247
           +    EE+EE  
Sbjct: 119 ELPTPEEREELR 130


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 20/75 (26%), Positives = 39/75 (52%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E   +E E+E  E     +E K+  +  + +K++ +    E EEEKEE E+   E E++ 
Sbjct: 841 EEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESEL 900

Query: 258 RKKHKKDKKKRKKKS 272
            +  ++ +K R++  
Sbjct: 901 AELKEEIEKLRERLE 915



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 17/90 (18%), Positives = 44/90 (48%), Gaps = 1/90 (1%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           + KEK +A        E+  EE E+   E  ++++E +++     ++ ++  ++   EE 
Sbjct: 327 ELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA-LREEL 385

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            + E E  +   E ++ K+  +  ++R ++
Sbjct: 386 AELEAELAEIRNELEELKREIESLEERLER 415



 Score = 28.1 bits (63), Expect = 5.7
 Identities = 18/75 (24%), Positives = 45/75 (60%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
           +E ++ E+E EE ++   E EA+K +   + K+ +++K + EEE  + E E  + ++E +
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIE 908

Query: 257 KRKKHKKDKKKRKKK 271
           K ++  ++ + + ++
Sbjct: 909 KLRERLEELEAKLER 923



 Score = 27.8 bits (62), Expect = 7.7
 Identities = 15/95 (15%), Positives = 38/95 (40%), Gaps = 4/95 (4%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+ +L   E+          +   + EE K E E    E   +  +  ++  + K++  +
Sbjct: 244 LEEELSRLEEELEELQEELEEAEKEIEELKSELE----ELREELEELQEELLELKEEIEE 299

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            E E         + E E ++ ++  ++ K++ + 
Sbjct: 300 LEGEISLLRERLEELENELEELEERLEELKEKIEA 334



 Score = 27.8 bits (62), Expect = 7.9
 Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           LK +L+  E  K        DE  + EEEKEE E+   E E++  +  ++ +K +++  +
Sbjct: 861 LKEELEELEAEKEE----LEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916

Query: 237 YEEEEEKEEGEQTDSEKEAK 256
            E + E+ E E  + E+E +
Sbjct: 917 LEAKLERLEVELPELEEELE 936


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 29.4 bits (65), Expect = 2.3
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 190  SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
            S +S+SSD S  EE+  E+ E+ D E+    +++ ++
Sbjct: 1263 SSSSSSSDSSSSEEDGDEKNEKEDRERAGGGKRRGRQ 1299


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 9/38 (23%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 219 KKRKKHKKDKKKKKKK-SKYEEEEEKEEGEQTDSEKEA 255
           +++KK+KKD +K+ +    Y +++ KE    ++ ++  
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121



 Score = 28.4 bits (64), Expect = 5.1
 Identities = 7/16 (43%), Positives = 13/16 (81%)

Query: 256 KKRKKHKKDKKKRKKK 271
           +++KK+KKD +KR + 
Sbjct: 84  REKKKYKKDVEKRNRS 99


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.9 bits (65), Expect = 2.4
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 225 KKDKKKKKKK-SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           KK KKKKK K S  +++EE++E E  D +K  K+  +  +      KK
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKK 48



 Score = 28.1 bits (63), Expect = 4.3
 Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 11/76 (14%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK---------- 233
           K+K+K   +  S  + D+EE+E E  ++    KE+ +  +   +  KKK           
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSF 60

Query: 234 -KSKYEEEEEKEEGEQ 248
              K  EE+E E  E+
Sbjct: 61  LPDKAREEKEAELREE 76


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 11/59 (18%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           +EK E   +  E+E   R + K+   +K+ + ++E E+E+   ++ + +   +  ++ +
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELR-KKYEEKLRQELERQAE 252



 Score = 27.7 bits (62), Expect = 8.4
 Identities = 21/113 (18%), Positives = 57/113 (50%), Gaps = 5/113 (4%)

Query: 157 VDKMSRPLQDVRVVKCNELVLKSKLKAK-EKRKASPTSASSDESDKEEEEKEEGEQTDSE 215
           +D++S+ L +++     E  L+  LK K E+  +          + +E   E+  + + E
Sbjct: 172 LDQLSKKLAELK--AEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFE 229

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
           +E ++ +K  ++K +++ + + E  E+K + E     +  + +++  K+ K++
Sbjct: 230 REKEELRKKYEEKLRQELERQAEAHEQKLKNELA--LQAIELQREFNKEIKEK 280


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 17/76 (22%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK--KSKYEEEEEKEEGEQTDSEKE 254
           +E +KE  E  E E  + E+E  + K+  + +++ ++  K   E EEE  E  + +   +
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528

Query: 255 AKKRKKHKKDKKKRKK 270
            +  +K +K +   ++
Sbjct: 529 EELEEKLEKLENLLEE 544



 Score = 29.0 bits (65), Expect = 3.5
 Identities = 16/95 (16%), Positives = 42/95 (44%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           LK   K +E ++         E +  EE   E E+     E  K +  + +  + +    
Sbjct: 212 LKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKI 271

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
            EEE  + E    + E++ ++ ++ +++ ++ +++
Sbjct: 272 REEELRELERLLEELEEKIERLEELEREIEELEEE 306



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 7/99 (7%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK------- 229
           L+ +L+  E+             ++ EE+ E+ E    E E  K K   +  K       
Sbjct: 506 LEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLE 565

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
            + ++ K   EE +    + +  +E ++R K  K K K 
Sbjct: 566 DRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604



 Score = 27.8 bits (62), Expect = 8.2
 Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 2/100 (2%)

Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
            EL     L  + +       +  +  +K EE+ E+ E     +E  + K       +++
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLES--ELEELAEEKNELAKLLEER 362

Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K   E  EE E+  +   E+  +  +  ++ K++  + S
Sbjct: 363 LKELEERLEELEKELEKALERLKQLEEAIQELKEELAELS 402



 Score = 27.4 bits (61), Expect = 8.9
 Identities = 15/98 (15%), Positives = 36/98 (36%), Gaps = 3/98 (3%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           L+      E+ +       S E   E+ E++  +     +E  + K       +++ K  
Sbjct: 307 LEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL 366

Query: 237 YEEEEEKEEGEQTDSEKE---AKKRKKHKKDKKKRKKK 271
            E  EE E+  +   E+     +  ++ K++  +    
Sbjct: 367 EERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 28.8 bits (65), Expect = 2.4
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 9/58 (15%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           L  K++  +K++            KE E   + +++++ +E KK+KK K  KKKK KK
Sbjct: 149 LLKKVERLKKKE---------LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197



 Score = 28.4 bits (64), Expect = 3.0
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 204 EEKEEGEQT----DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           + KEEG+        EK      K K ++ KKK+    E E  +++ +  ++E+E KK+K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLK-KVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKK 186

Query: 260 KHKKDKKKRKKKS 272
           K    KKK KK +
Sbjct: 187 KKSAKKKKLKKVA 199


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 29.3 bits (65), Expect = 2.5
 Identities = 31/125 (24%), Positives = 48/125 (38%), Gaps = 23/125 (18%)

Query: 167 VRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKE-----------EGEQTDSE 215
            R+ K        KL  K K+K +P     ++SDK +E  E            G++   E
Sbjct: 474 TRISKIQFTQEIKKLIKKSKKKLAPI--EEEDSDKHDEPPEGPEASGLPPKAPGDKEGEE 531

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD----------SEKEAKKRKKHKKDK 265
            E +  K+  + K+  K     E E  K+ G   +           + E  K  KH KD 
Sbjct: 532 GEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDP 591

Query: 266 KKRKK 270
           ++ KK
Sbjct: 592 EEPKK 596


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.6 bits (64), Expect = 2.5
 Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           +K    KK KKK K+K   EE++ + + ++T+ E E       +    +
Sbjct: 60  RKPATTKKSKKKDKEKLTEEEKKPESDDDKTE-ENENDPDNNEESGDSQ 107


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 27.8 bits (62), Expect = 2.6
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
           A+  +A     +KEEE+K+E E+ + E+EA
Sbjct: 69  AAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 29.1 bits (65), Expect = 2.6
 Identities = 7/81 (8%), Positives = 28/81 (34%)

Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
            R++   S   D   ++    +          + +R + +  ++ +  +S  +      +
Sbjct: 38  HRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRD 97

Query: 246 GEQTDSEKEAKKRKKHKKDKK 266
              ++  ++  K++     K 
Sbjct: 98  RSPSNQWRKDDKKRSLWDIKP 118


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 29.1 bits (65), Expect = 2.6
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 1/89 (1%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
            +E+    P     + S K   +      + + K A + K+  K+K    K+ K E    
Sbjct: 760 LEEEEFEYP-RMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSV 818

Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           + EGE+   ++  +K          +KKK
Sbjct: 819 ETEGERCTIKQREEKGIDAPAILNVKKKK 847



 Score = 28.8 bits (64), Expect = 3.8
 Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 18/108 (16%)

Query: 148 VVSHIENLPVDKMSRPLQDVR-----VVKCNELVL----KSKLKAKEKRKASPTSASSDE 198
                E+L  ++   P          V K N  VL    K+  +AK+K K        ++
Sbjct: 753 KFCFNEDLEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPK--------EK 804

Query: 199 SDKEEEEKEEGEQTDSEKEA-KKRKKHKKDKKKKKKKSKYEEEEEKEE 245
              ++E K E    ++E E    +++ +K        +  +++  K +
Sbjct: 805 GPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 29.0 bits (65), Expect = 2.7
 Identities = 10/62 (16%), Positives = 24/62 (38%), Gaps = 2/62 (3%)

Query: 177 LKSKLKAKEKRKA--SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           +  + K  EK K    P  +   + D E  + E      +E+E ++ +      +   + 
Sbjct: 65  IPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEG 124

Query: 235 SK 236
           ++
Sbjct: 125 AR 126



 Score = 28.2 bits (63), Expect = 5.0
 Identities = 12/59 (20%), Positives = 26/59 (44%), Gaps = 5/59 (8%)

Query: 219 KKRKKHKKDKKKKKKKSK-----YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           K++K+ +K KK+KK K         E  + E      +E+E ++ +      +   + +
Sbjct: 67  KRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEGA 125


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.2 bits (65), Expect = 2.7
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
           R  S  S    E + E        + +S K+ KK KK K  KKK+  K++   E++ EE 
Sbjct: 93  RHISSVSTVDSELNDEYGNPIWKNRVESWKD-KKNKKKKSAKKKEAHKAQIPPEQQMEEK 151

Query: 247 EQTDS 251
              D+
Sbjct: 152 PSADA 156


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 29.1 bits (66), Expect = 2.7
 Identities = 27/132 (20%), Positives = 49/132 (37%), Gaps = 6/132 (4%)

Query: 146 QNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEE 205
           + ++S I NL  +    P   ++VV      L+ +L+A E       +    E    + E
Sbjct: 728 KEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPE 787

Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE------QTDSEKEAKKRK 259
           +   E T     A+             + ++ E+E EK E E      +  +E    K  
Sbjct: 788 EPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAP 847

Query: 260 KHKKDKKKRKKK 271
           +   +K+K K  
Sbjct: 848 EEVVEKEKEKLA 859


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 28.9 bits (64), Expect = 2.7
 Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 4/98 (4%)

Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           E+    K + +E+ + +       +SD+ +E  EE E+ D  +E ++  +   D+  + +
Sbjct: 288 EVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELE 347

Query: 234 KSKYEEEEEKEEGE----QTDSEKEAKKRKKHKKDKKK 267
           K   E+  + E       Q   E +     K  + K+K
Sbjct: 348 KGIEEKRRQMESATNPILQKRFESQLNVLLKELELKRK 385


>gnl|CDD|218985 pfam06315, AceK, Isocitrate dehydrogenase kinase/phosphatase
           (AceK).  This family consists of several bacterial
           isocitrate dehydrogenase kinase/phosphatase (AceK)
           proteins (EC:2.7.1.116).
          Length = 562

 Score = 29.1 bits (66), Expect = 2.7
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 20  GRIVFELYSDICPITCENFRAL 41
           GR+VF  Y +I  +T  NFR +
Sbjct: 463 GRVVFYDYDEIEYLTDCNFRDI 484


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 28.8 bits (65), Expect = 2.8
 Identities = 15/42 (35%), Positives = 17/42 (40%)

Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
             +K + K K KKKKK        K E  Q       KK KK
Sbjct: 352 PIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 28.8 bits (65), Expect = 3.2
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           A K+   K + K K KK K +         +T  E ++  +KK KK
Sbjct: 348 ATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 8/91 (8%)

Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH 224
           +DVR     ELVL        +R+  P      + +K+ EEK E    D EK  +     
Sbjct: 295 EDVREAA--ELVLP------HRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDA 346

Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
           ++   +       ++E+E +  E+ D +  +
Sbjct: 347 EQSGPRGHPTPGNDDEKEPDPQEEADGQGSS 377


>gnl|CDD|219743 pfam08202, Mis12_component, Mis12-Mtw1 protein family.  Mis12-Mtw1
           is a eukaryotic conserved kinetochore protein that is
           involved in chromosome segregation.
          Length = 296

 Score = 28.7 bits (64), Expect = 2.9
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           +K E +KEE +++ + + A+  KK      K  K S   E  E+EEG+  DSE      +
Sbjct: 80  EKLESKKEETDESTARRIARVIKKEFLRDLKAGKVSISWENREEEEGDDNDSELT-NLFE 138

Query: 260 KHKKDKK 266
            + + K 
Sbjct: 139 DNDEKKL 145


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.1 bits (63), Expect = 3.1
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHK 225
           +E ++EE+++E+ ++ +SE+E    KK K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 28.1 bits (63), Expect = 3.5
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHK 262
            EEE+++E+ ++ +SE+E    KK K
Sbjct: 121 EEEEDDEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 28.1 bits (63), Expect = 3.1
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            +E  +   +K+ KK+KK KK  KK  KK K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 28.8 bits (64), Expect = 3.1
 Identities = 13/63 (20%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAK------KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
           E    E E ++  +   + + K       +K++   K  K+ + +  EEE+ +E +  D 
Sbjct: 4   EFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDD 63

Query: 252 EKE 254
           + E
Sbjct: 64  DDE 66


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 29.1 bits (65), Expect = 3.2
 Identities = 24/98 (24%), Positives = 34/98 (34%), Gaps = 15/98 (15%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEA----KKRKKHKKDKKKKKKKSKYEEEEEK 243
           +A   S S  ESD E+ E++E E      E      K  +    K      +    + E 
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSED 725

Query: 244 EEGEQTDSEK---------EAKKRKKHKKDKKKRKKKS 272
           EEG   D E+         +  K KK +       KK 
Sbjct: 726 EEG--MDDEQMMRLDTYLAQIFKEKKERIQAGGETKKE 761


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 28.7 bits (64), Expect = 3.2
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
           E+  K+    K  K+  K   K E E +K++ EQ  +E   KK+KK KK+K
Sbjct: 28  EEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELS-KKKKKFKKEK 77


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 28.6 bits (64), Expect = 3.2
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 5/82 (6%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEE-EKEEGE----QTDSEKEAKKRKKHKKDKKKKK 232
           K  L         P S+       EE  EKE  +    Q D  K +  + K   D++   
Sbjct: 195 KDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDD 254

Query: 233 KKSKYEEEEEKEEGEQTDSEKE 254
           +   Y EE+E +E E+ +  +E
Sbjct: 255 QDGDYVEEKELKEDEEEEETEE 276


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 27.4 bits (61), Expect = 3.3
 Identities = 19/71 (26%), Positives = 36/71 (50%), Gaps = 1/71 (1%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           + E+  E+      EKE K  K+ ++ ++ +  K +   +EEKE  E+  + K   K+ +
Sbjct: 33  RMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKM-AAKMHAKKVE 91

Query: 261 HKKDKKKRKKK 271
             K ++KR K 
Sbjct: 92  RLKRREKRNKL 102


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 28.5 bits (64), Expect = 3.3
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
           DKEEE K++ +       A K     K  KKK  
Sbjct: 271 DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 20/94 (21%), Positives = 48/94 (51%), Gaps = 4/94 (4%)

Query: 178 KSKLKAKEKR---KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           + +L+ +E+R   K     A +++ D  E + EE E+  S +E +  +  K+   +  + 
Sbjct: 76  REELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRV 135

Query: 235 SKYEEEEEKEE-GEQTDSEKEAKKRKKHKKDKKK 267
           +    E+ ++   +  D+E E +K ++ KK +++
Sbjct: 136 AGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEE 169



 Score = 27.8 bits (62), Expect = 6.6
 Identities = 18/88 (20%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 183 AKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
            K +R A        E+ KE EE+ E       E   ++R + +++ ++++++ + EEE 
Sbjct: 26  KKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEER 85

Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
             ++ EQ D+  E     +++ +++++ 
Sbjct: 86  LVQKEEQLDARAEKLDNLENQLEEREKA 113


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 28.9 bits (65), Expect = 3.4
 Identities = 15/68 (22%), Positives = 27/68 (39%)

Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
           KL   EK   +    +  ++ KEE +K + E    E+   + +       +     + EE
Sbjct: 17  KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEE 76

Query: 240 EEEKEEGE 247
           E + E  E
Sbjct: 77  EAKAEAAE 84


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 28.7 bits (64), Expect = 3.5
 Identities = 13/52 (25%), Positives = 24/52 (46%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
           K+  K ++H   KK  +K +  +EE+   +G     +K+    K  +  K K
Sbjct: 348 KDLVKLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEGGKGK 399


>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family. 
          Length = 296

 Score = 28.6 bits (63), Expect = 3.5
 Identities = 22/87 (25%), Positives = 42/87 (48%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K   +A+    ASP  A  + +   E+ ++    + S +  KK+KK    KK  K     
Sbjct: 93  KEPAEAEPAEPASPAEAEGEAATSTEKAEDGATPSPSSETPKKKKKRFSFKKSFKLSGFS 152

Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKD 264
            ++ +KE GE  ++E  A +++  K++
Sbjct: 153 FKKNKKEAGEGAEAEGAAAEKEGAKEE 179


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 28.6 bits (64), Expect = 3.5
 Identities = 25/93 (26%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 184 KEKRKASPTSASSDES-----DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
           K  R+  P S    ES       E +EKE  EQ + ++  +++KK K+   K K++ +  
Sbjct: 258 KPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317

Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
           +  +K +  +   E+ AK RK+ K+ +K+ +++
Sbjct: 318 DPAQKLQ--EERKEQLAKLRKEEKEREKEYEQE 348


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 27.9 bits (63), Expect = 3.5
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           K+ KKKKKKK + E  EEK++      E++  ++ 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 27.7 bits (62), Expect = 3.7
 Identities = 13/51 (25%), Positives = 31/51 (60%)

Query: 222 KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +    ++K+++K+ + EE+E KEE E+   +   +K ++ K+ ++  K K+
Sbjct: 10  RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 7/52 (13%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
            KEEEE+    + +   E         ++ + K+K+K E++ E EE     +
Sbjct: 122 QKEEEERRVERRRELGLE-------DPEQLRLKQKAKEEQKAESEETRHRAA 166


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 28.2 bits (63), Expect = 3.7
 Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKY--EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           K+ K++K+  K +KK+    KY  EE  +K   E    + E          KKK K K 
Sbjct: 163 KKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAKSKY 221


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 28.7 bits (65), Expect = 3.8
 Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 193 SASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE--KEEGEQT 249
            A+ D+  KE  E K E E+       +K  K + DK  +  K K EE  E  KEE E  
Sbjct: 516 YAAEDKKRKERIEAKNEAEEY--VYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGE 573

Query: 250 DSEKEAKKRKK 260
           D E+   K ++
Sbjct: 574 DKEEIEAKTEE 584



 Score = 27.6 bits (62), Expect = 8.3
 Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 13/87 (14%)

Query: 193 SASSDESDKEEEEKEEGEQTDSEKE----AKKRKKHKKDKKKKKKKSKYEEEEEKEEGE- 247
           SA    + KE++         S+ E     K  +++  + KK+K     E  E K E E 
Sbjct: 481 SAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRK-----ERIEAKNEAEE 535

Query: 248 ---QTDSEKEAKKRKKHKKDKKKRKKK 271
                +   + +  K  + DKKK ++ 
Sbjct: 536 YVYSLEKSLKEEGDKLPEADKKKVEEA 562


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 28.9 bits (65), Expect = 3.8
 Identities = 10/52 (19%), Positives = 22/52 (42%)

Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
               K +K+       +  +  +E+++G    +E E       + DK+K +K
Sbjct: 216 ININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEK 267


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 16/65 (24%), Positives = 30/65 (46%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
           S      ++       + E  +   E EAKK+K+ ++ K  +KK  +Y +E++     QT
Sbjct: 70  SDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQT 129

Query: 250 DSEKE 254
              +E
Sbjct: 130 KLTEE 134


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 28.4 bits (63), Expect = 3.8
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKE 217
            +  P S+SS  SD E EE EE E+ + E+E
Sbjct: 212 LRTPPNSSSSSGSDSESEEDEEEEEEEEEEE 242


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 28.3 bits (63), Expect = 3.9
 Identities = 13/61 (21%), Positives = 18/61 (29%)

Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           E           KE KK KK +  K  K  K K  +++               + K   K
Sbjct: 133 EVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192

Query: 264 D 264
            
Sbjct: 193 S 193


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 28.2 bits (63), Expect = 4.2
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           S+  ++  K+ +    K KK+      EE +  ++  SE    +++     +K RK
Sbjct: 4   SKLSSESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKSESADGRKRCRFLLRKTRK 59


>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
          Length = 539

 Score = 28.7 bits (64), Expect = 4.2
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK 222
           R A+P S  S+ + +++E +E+  + +  +E+   K
Sbjct: 49  RVAAPPSVPSEAALQQDEGEEQRVEQELGQESSDSK 84


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 17/53 (32%), Positives = 21/53 (39%), Gaps = 3/53 (5%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
           E+  +E   K  KK      K  KKS       K+E E        KKR KH+
Sbjct: 265 EKALNELLNKLLKKSHL---KTNKKSGKLTPSRKKEFEILPEYVPDKKRPKHR 314


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
            E  +E++E+ +  +     +  +RKK KK K KKK K
Sbjct: 17  GEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
            EE+ E+ + +   + +++ ++ KK KKK KKKSK
Sbjct: 20  GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 28.3 bits (63), Expect = 4.4
 Identities = 19/119 (15%), Positives = 39/119 (32%), Gaps = 15/119 (12%)

Query: 160 MSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK 219
           M  P  D      +E          E  K           ++   E  E    + E+   
Sbjct: 184 MGGPDSDEDENPLDEEA---PDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEE--- 237

Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH------KKDKKKRKKKS 272
              + +  +K+++ +   E + + E+    DSE    K + +      ++  + RK K 
Sbjct: 238 ---RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKD 293


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 28.5 bits (64), Expect = 4.4
 Identities = 16/73 (21%), Positives = 23/73 (31%), Gaps = 7/73 (9%)

Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           SD      +       E   KK       K +++     EEEEE+E             R
Sbjct: 8   SDSPYSSDDSPSLEPPELPPKK-------KGRRRVSPVEEEEEEEEAEVVAVGFSYPPVR 60

Query: 259 KKHKKDKKKRKKK 271
               KD K+  + 
Sbjct: 61  ISRGKDGKRPVRP 73


>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 329

 Score = 28.2 bits (63), Expect = 4.4
 Identities = 11/59 (18%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKRKKKS 272
            +E +  KK K   + + K    E    +    +  ++++  +R++ K + + KR+K  
Sbjct: 268 AREERLAKKGKDHGESEGKVVAPEGAVAQTTEVREQTKRQTVQRQQPKRQSRAKRQKGG 326


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.5 bits (64), Expect = 4.4
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 190 SPTSASSDESDKEE----EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           +  S  S E+              E+  +    +KR+  KK KKKK KK K
Sbjct: 324 TSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 28.0 bits (62), Expect = 4.5
 Identities = 13/63 (20%), Positives = 31/63 (49%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
            +S++  ++ +EK   ++ +  KE  ++ +  K +     +  Y+EE+   E  +   E+
Sbjct: 168 TASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEE 227

Query: 254 EAK 256
            AK
Sbjct: 228 LAK 230


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 27.9 bits (62), Expect = 4.6
 Identities = 20/91 (21%), Positives = 36/91 (39%), Gaps = 18/91 (19%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK------------HKKDKKKKK 232
           +K +    S     S+KE          D +K  ++++               K  K K+
Sbjct: 52  QKLRPRWRSLQKRSSEKEIT------VFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKE 105

Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
           K+ + EEE+ +++ E  D  +E K  K   K
Sbjct: 106 KQKQLEEEKRRQKIEMWDRMQEGKSYKVSAK 136



 Score = 27.2 bits (60), Expect = 7.7
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 2/82 (2%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
           + +E  +A+      D   K  + KE+  Q   E+E +++K    D+ ++ K  K   + 
Sbjct: 80  RRQEALEAARLRMQEDLDAKAAKFKEK--QKQLEEEKRRQKIEMWDRMQEGKSYKVSAKL 137

Query: 242 EKEEGEQTDSEKEAKKRKKHKK 263
            +E  E+T S     K K  KK
Sbjct: 138 AQEATEETSSSSVLPKPKTEKK 159


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.5 bits (62), Expect = 4.6
 Identities = 7/38 (18%), Positives = 19/38 (50%)

Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
           + K ++Y++E+ +   E  + E +AK+ +        +
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 28.3 bits (64), Expect = 4.7
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKH-KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
           K+  +  +GE  + E +AK  KK  K  KK+ +K  K  E+E KE  +  + E+ A+ R 
Sbjct: 591 KKIRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKEAAKNLEFEEAARLRD 650

Query: 260 KHKKDKKKRKKKS 272
           + K+ K++   KS
Sbjct: 651 EIKELKEELLGKS 663


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 21/105 (20%), Positives = 48/105 (45%), Gaps = 5/105 (4%)

Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           NE + + +L  +++          +    +E E  E E  +  KE +K+ ++  +  +KK
Sbjct: 85  NEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKK 144

Query: 233 KKSKYEEE-----EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +++   EE      EK+  E  + E+E  + + H+ +     K++
Sbjct: 145 RENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKEN 189


>gnl|CDD|236559 PRK09548, PRK09548, PTS system ascorbate-specific transporter
           subunits  IICB; Provisional.
          Length = 602

 Score = 28.6 bits (64), Expect = 4.8
 Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 178 KSKLKAKEKRKA--SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
             +L+AKE   A    T    D  + +EE     ++  S+++  K  +
Sbjct: 461 SKQLRAKEAAAAAGGKTVDQLDGYNLDEEVINAVQKLISKQDGGKPVR 508


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
             +  D+EEEE E+ +   S+ E +         ++    S+   +  KE+ E+ +    
Sbjct: 134 DPESEDEEEEEDEDDDDDGSDDEDEDEDGVGA-TEEVAASSESGVDRVKEDDEEDEDADL 192

Query: 255 AKKRKKHKK 263
           +KK    + 
Sbjct: 193 SKKDVLEEP 201


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 16/71 (22%), Positives = 34/71 (47%), Gaps = 11/71 (15%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK----KHKKDKKKKK 232
            +SKL+A  K ++    +        EE     E+ D E   ++++    +++K  ++KK
Sbjct: 346 WRSKLQALAKVESRSLLSV-------EERLPTIEKEDLELYEQRQQLWFFENRKLWQRKK 398

Query: 233 KKSKYEEEEEK 243
           K  +  +EE +
Sbjct: 399 KLREQADEEYQ 409


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 2/81 (2%)

Query: 194 ASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ-TDS 251
                 +KE E++ E  E+   E+E  K++  ++ ++ K+++   + E EKE  E     
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577

Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
           +KE +   +  K+KK  K K 
Sbjct: 578 KKEVESIIRELKEKKIHKAKE 598



 Score = 27.9 bits (62), Expect = 7.0
 Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 13/80 (16%)

Query: 199 SDKEEEEKEEGEQTDS--------EKEAKKRKKHKKDKKKKKKKSKYEEEEE-----KEE 245
           S  E+E +++ E  +         +KE ++  +  K++++ KK    +E +E     K+E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580

Query: 246 GEQTDSEKEAKKRKKHKKDK 265
            E    E + KK  K K+ K
Sbjct: 581 VESIIRELKEKKIHKAKEIK 600


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 28.3 bits (63), Expect = 5.0
 Identities = 14/84 (16%), Positives = 25/84 (29%), Gaps = 1/84 (1%)

Query: 190 SPTSASSD-ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
           S T   S  E    +E+ +     D   +  +         +  K+S      EK   E+
Sbjct: 113 SSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEK 172

Query: 249 TDSEKEAKKRKKHKKDKKKRKKKS 272
               +   K      D ++   K 
Sbjct: 173 KGQTENTFKFYNFGNDGEEAAAKD 196


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 18/88 (20%), Positives = 32/88 (36%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
            K  A  + A S   ++E+ EK   E     +E  K            + +  E+ EE E
Sbjct: 321 AKEPAGESPAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYE 380

Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
                D  + A K    +  ++  ++K 
Sbjct: 381 PLFPEDESEIAVKPPTEESSRRPEEEKH 408


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 19/90 (21%), Positives = 35/90 (38%), Gaps = 12/90 (13%)

Query: 183  AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
            A+  RK    S       K    +   ++  ++K   K+   KK   KK          +
Sbjct: 971  AEAVRKTVRRSVK-----KAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAA-------K 1018

Query: 243  KEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            K+  ++  ++K A+K    K  KK  +K +
Sbjct: 1019 KKVAKKAPAKKVARKPAAKKAAKKPARKAA 1048


>gnl|CDD|236034 PRK07502, PRK07502, cyclohexadienyl dehydrogenase; Validated.
          Length = 307

 Score = 28.0 bits (63), Expect = 5.1
 Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 1/23 (4%)

Query: 124 ITTQPAPHL-DGVHVVFGHVVSG 145
           +    APHL +GVH + GH ++G
Sbjct: 106 VIAAMAPHLPEGVHFIPGHPLAG 128


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 26.8 bits (59), Expect = 5.2
 Identities = 17/51 (33%), Positives = 24/51 (47%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
            KE K  K      K KKK SK +++EE       D E   K RK+  +++
Sbjct: 8   SKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKEVPRER 58


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 28.3 bits (63), Expect = 5.4
 Identities = 23/114 (20%), Positives = 38/114 (33%), Gaps = 39/114 (34%)

Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS--------------------- 235
              D  E E+++       ++A    K  K+KK + KK                      
Sbjct: 207 TYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKKIRE 266

Query: 236 ----------------KYEEE--EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
                           +YEEE  E +        E+EAK R++    +K RK++
Sbjct: 267 ERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLRKTRKQR 320


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.4 bits (63), Expect = 5.5
 Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 200  DKEEEEKEEGEQTDSEKEA----KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
            +  EE+  E EQ  +E+ A          + D K  + K + E+E+E+E  +    + E 
Sbjct: 3910 EPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEI 3969

Query: 256  K 256
            +
Sbjct: 3970 Q 3970



 Score = 28.0 bits (62), Expect = 6.9
 Identities = 12/71 (16%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 196  SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE----EEKEEGEQTDS 251
            SD ++ +E+  E+G + + ++  +  +   K  ++ ++    E++      K + + T +
Sbjct: 4053 SDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFA 4112

Query: 252  EKEAKKRKKHK 262
              EA +    K
Sbjct: 4113 SAEADEENTDK 4123


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 28.0 bits (62), Expect = 5.5
 Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 4/84 (4%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK---S 235
            +L+ KE       S+   E ++ + EKE   +    +   K ++   + ++ +K+   +
Sbjct: 297 EELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAA 356

Query: 236 KYEEEEEKEEGEQT-DSEKEAKKR 258
           + E  + KEE EQT  + + A+ R
Sbjct: 357 REELRKLKEEAEQTKAALETAELR 380


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 28.2 bits (64), Expect = 5.7
 Identities = 11/59 (18%), Positives = 17/59 (28%), Gaps = 17/59 (28%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
            EK         KD   KK   K+                 ++K K+     +K  K +
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKF-----------------SRKTKQQYVASEKDGKAT 859


>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein.  This family
           is found in Trypanosome species, and appears to be one
           of two invariant surface glycoproteins, ISG65 and ISG75.
           that are found in the mammalian stage of the parasitic
           protozoan. the sequence suggests the two families are
           polypeptides with N-terminal signal sequences,
           hydrophilic extracellular domains, single trans-membrane
           alpha-helices and short cytoplasmic domains. they are
           both expressed in the bloodstream form but not in the
           midgut stage. Both polypeptides are distributed over the
           entire surface of the parasite.
          Length = 286

 Score = 27.9 bits (62), Expect = 5.8
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 1/76 (1%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH-KKDKKKKKKKS 235
           +K  L+  E R  S  S S  +  KE  EK + +  ++  EAKK  +  +K  ++ K+ +
Sbjct: 78  VKYYLEKLESRSDSKLSDSDKKKIKEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEAA 137

Query: 236 KYEEEEEKEEGEQTDS 251
                   +EG    +
Sbjct: 138 AKALGWALDEGTSNST 153


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 28.0 bits (63), Expect = 5.8
 Identities = 15/67 (22%), Positives = 22/67 (32%), Gaps = 19/67 (28%)

Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
            E   +  S+   KKR K K+ +++K                     KE  K        
Sbjct: 1   AESNGEPLSKNALKKRLKAKQAEEEKA-------------------AKEEAKAAAAAAAA 41

Query: 266 KKRKKKS 272
           K R +KS
Sbjct: 42  KGRSRKS 48


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 1/81 (1%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           E++       ++DE+ +  EE EE +  +  + A++  K +  KK  +  S  E+ + + 
Sbjct: 43  EEQVEQLREEANDEAKEIIEEAEE-DAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQI 101

Query: 245 EGEQTDSEKEAKKRKKHKKDK 265
           E EQ + E+E ++  +  K +
Sbjct: 102 EREQEEWEEELERLIEEAKAE 122


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           + E E+E  E+ +  +  ++R   +++   +K +S  ++EE  E+ E+  S KE    +K
Sbjct: 64  RAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEK 123

Query: 261 HKKDKKKRKKK 271
            ++ ++   ++
Sbjct: 124 EEELEELIAEQ 134


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
            SDE + EE+E  + E  + E+E +K KK KK  +  + +  +
Sbjct: 351 LSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393



 Score = 27.7 bits (62), Expect = 9.3
 Identities = 11/47 (23%), Positives = 27/47 (57%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
              ++ D  +EE++E ++   +++ ++ ++ +K+KKKKK       E
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 28.0 bits (62), Expect = 6.1
 Identities = 8/77 (10%), Positives = 26/77 (33%), Gaps = 6/77 (7%)

Query: 184 KEKRKASPTSASSDESDKEEEEK------EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           + +  A+  +  +  +      +      +   QT+S+ E  +  +H + +++ +     
Sbjct: 35  RRRGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSG 94

Query: 238 EEEEEKEEGEQTDSEKE 254
              E       + S   
Sbjct: 95  SGSESVGSPTPSPSGSA 111


>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
          Length = 251

 Score = 27.7 bits (62), Expect = 6.2
 Identities = 10/42 (23%), Positives = 22/42 (52%), Gaps = 1/42 (2%)

Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           KKK SK  + +  E G+    +   KK  K   + +++++++
Sbjct: 208 KKKSSKEYKLQNGEFGKSAIEQ-FLKKSPKGNIELQEQEQQA 248


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 27.6 bits (61), Expect = 6.2
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
           KA   + +S ++  + E+    ++ D +   KKRK +KK++ +KK
Sbjct: 16  KAYDKAVASFKNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKK 60


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 26.8 bits (59), Expect = 6.3
 Identities = 11/71 (15%), Positives = 35/71 (49%), Gaps = 1/71 (1%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
               A+  + +++E++  + +++  E  +++   +  ++ + D+   +   + + E E+ 
Sbjct: 62  PAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEA 121

Query: 245 EGEQTDSEKEA 255
           + E  D E EA
Sbjct: 122 DEE-EDEEAEA 131


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 27.8 bits (62), Expect = 6.3
 Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 2/94 (2%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
            +LK  + R+A     S  E+    E+KEE E+   E++    K      + ++ + + +
Sbjct: 155 EELKELQSRRADVDVNSMLEALFRREKKEEEEE--EEEDEALIKSLSFGPETEEDRRRAD 212

Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +E+ +++ E  D+    K             KKS
Sbjct: 213 DEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKS 246


>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
           protein (MOBP).  MOBP is abundantly expressed in central
           nervous system myelin, and shares several
           characteristics with myelin basic protein (MBP), in
           terms of regional distribution and function. This family
           is the middle and C-terminal regions of MOBP which has
           been shown to be essential for normal arrangement of the
           radial component in central nervous system myelin. Most
           member-proteins carry a FVHE-PHD type zinc-finger at
           their N-terminus.
          Length = 710

 Score = 28.2 bits (62), Expect = 6.3
 Identities = 16/74 (21%), Positives = 39/74 (52%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           E+D   E ++    TDS +E  + + ++   K  +K++   E++E E   + +++KE+  
Sbjct: 413 ETDISNEAQDPQSLTDSTEEKLRNRLYELAMKMSEKETSSGEDQESEPKAEPENQKESLS 472

Query: 258 RKKHKKDKKKRKKK 271
            + + +  ++  KK
Sbjct: 473 SEDNNQGVQEELKK 486


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 27.6 bits (61), Expect = 6.4
 Identities = 16/54 (29%), Positives = 31/54 (57%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           ++ ++EKEE  +   E EA+  +  ++ K+ + + S+ EE  +K  GE  D +K
Sbjct: 145 EELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKK 198


>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
          Length = 437

 Score = 27.8 bits (62), Expect = 6.4
 Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 9/59 (15%)

Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
            K++P +  S E+++E+   EE E          R K +K K+ ++K+     E E+ E
Sbjct: 214 GKSAPVNRLSPETEEEKLLFEEAEA---------RNKLRKKKRGEQKREFENGEAERLE 263


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 27.8 bits (62), Expect = 6.5
 Identities = 25/104 (24%), Positives = 39/104 (37%), Gaps = 15/104 (14%)

Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQT-DSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
           +E  K S   A S ++ +  EE ++ ++T DS  E K ++   K K++KK +      E 
Sbjct: 95  RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154

Query: 243 K---------EEGEQTDSEKE-----AKKRKKHKKDKKKRKKKS 272
                              KE     AKK  K         +KS
Sbjct: 155 SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 27.6 bits (62), Expect = 6.7
 Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 3/33 (9%)

Query: 223 KHKKDKKKKKKK---SKYEEEEEKEEGEQTDSE 252
            +   KK KKK     K E+EE +EE      +
Sbjct: 146 TYLPKKKVKKKNLLGGKSEKEEPEEEKTPAPDK 178


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 27.7 bits (62), Expect = 6.8
 Identities = 21/92 (22%), Positives = 36/92 (39%), Gaps = 27/92 (29%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKK---------------------------RK 222
           +P      E ++EEEE+EE  + D  +E                              R 
Sbjct: 280 APRGLYQLEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRC 339

Query: 223 KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
                ++K +++ + +EEEE+EE E  + E E
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 27.8 bits (62), Expect = 6.8
 Identities = 7/51 (13%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
           +   +++ E      EQ   ++    +   ++ K   +  S+   +++K E
Sbjct: 178 AEIAAEQAELTTLLSEQRA-QQAKLAQLLEERKKTLAQLNSELSADQKKLE 227



 Score = 27.8 bits (62), Expect = 7.1
 Identities = 11/69 (15%), Positives = 31/69 (44%), Gaps = 10/69 (14%)

Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
             E+ E     SE+ A++ K  +  +++KK  ++            ++   + KK ++ +
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLN----------SELSADQKKLEELR 230

Query: 263 KDKKKRKKK 271
            ++ + K +
Sbjct: 231 ANESRLKNE 239


>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
           II mediates cortical contraction in cell motility, and
           is the motor in smooth and skeletal muscle. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 693

 Score = 27.9 bits (63), Expect = 6.8
 Identities = 15/29 (51%), Positives = 16/29 (55%), Gaps = 2/29 (6%)

Query: 36  ENFRALCTGEKGIGKT--TGKNLHYLNIV 62
           EN   L TGE G GKT  T K + YL  V
Sbjct: 90  ENQSILITGESGAGKTENTKKVIQYLASV 118


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 27.5 bits (60), Expect = 6.9
 Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 10/85 (11%)

Query: 182 KAKEK-RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK---------KDKKKK 231
           KAK+K +KA   +   + + KE EE +   +T   KEAK+ K  K         ++K+ +
Sbjct: 17  KAKDKDKKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGE 76

Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAK 256
           K+ +  +EE  K E E+T+   EAK
Sbjct: 77  KEAAAAKEEAPKAEPEKTEGAAEAK 101


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
           family of proteins with unknown function appears to be
           restricted to Gammaproteobacteria.
          Length = 103

 Score = 26.5 bits (59), Expect = 7.0
 Identities = 12/44 (27%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 217 EAKKRKKHKK---DK-KKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
           E KK++K K    DK +K++ K+K  +    +E +  +S  +++
Sbjct: 38  EYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQ 81



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           AK R+  K  K++ K K +    +++E+  ++ ++ ++ K
Sbjct: 45  AKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSNK 84


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 27.5 bits (61), Expect = 7.1
 Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
           + +KEE  K+   +       +   K +   +         EE +     +  SE +  K
Sbjct: 132 QREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKI--ISKAWSELDESK 189

Query: 258 RKKHKKDKKKRKKKS 272
           +KK+    KK K++ 
Sbjct: 190 KKKYIDKYKKLKEEY 204


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.5 bits (61), Expect = 7.2
 Identities = 19/98 (19%), Positives = 47/98 (47%), Gaps = 6/98 (6%)

Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
           K K++  +K        + ++S +E+EE EE  + + E+E ++R   +K++++++   + 
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178

Query: 238 EEEEEKEEGEQTDS------EKEAKKRKKHKKDKKKRK 269
            ++   +E E +         +  K   K +   +K K
Sbjct: 179 NKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPK 216



 Score = 27.5 bits (61), Expect = 8.0
 Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 4/64 (6%)

Query: 212 TDSEKEAKKRKKHKKDKK----KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
            D E   KK + ++K+ K    K K+KS  E+EE +E  E    E+E ++    K+++++
Sbjct: 113 IDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQ 172

Query: 268 RKKK 271
           +  K
Sbjct: 173 QMNK 176


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 27.9 bits (62), Expect = 7.3
 Identities = 11/39 (28%), Positives = 24/39 (61%)

Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
           E+E ++RK+  K K ++++K+K E  +++E  E     +
Sbjct: 13  EREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 27.2 bits (61), Expect = 7.4
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQ 248
            + +K+KKK  SK E++  KEE ++
Sbjct: 95  FEAEKEKKKAMSKEEKKAIKEEKDK 119


>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region.  Family of
           higher eukaryotic proteins. SIN was identified as a
           protein that interacts specifically with SXL (sex
           lethal) in a yeast two-hybrid assay. The interaction is
           mediated by one of the SXL RNA binding domains.
          Length = 422

 Score = 27.8 bits (62), Expect = 7.4
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 15/90 (16%)

Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
             L  K+KRK    +A  DE   EE E EE +Q        K  + + +K++K+++  Y 
Sbjct: 137 QYLDKKDKRKKEEDTADEDEDPDEEAE-EELKQVTV-----KFSRPETEKQRKRREQSYN 190

Query: 239 EEEEKEEGEQ---------TDSEKEAKKRK 259
             ++K   E           DSE E +++K
Sbjct: 191 FLQKKIAEEPWIELKYHGKKDSESELERQK 220


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 27.8 bits (61), Expect = 7.7
 Identities = 17/89 (19%), Positives = 37/89 (41%), Gaps = 4/89 (4%)

Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
            K +   L+ +  +    +  P     D+ D E+ ++ E E     K A++RK       
Sbjct: 364 AKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKA----GL 419

Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
              ++    EEE+K   ++    ++ K++
Sbjct: 420 ASPEEEALGEEEQKAPPKKKQLNQKNKQQ 448



 Score = 27.4 bits (60), Expect = 9.4
 Identities = 19/80 (23%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
            + +R+A+  S   +E + E ++  +GEQ D   E +     K  +++K   +  EEE  
Sbjct: 369 TQLRRRAASLSGEEEEPEDELKDDVDGEQADES-EHETLALRKNARQRKAGLASPEEEAL 427

Query: 243 KEEGEQTDSEKEAKKRKKHK 262
            EE ++   +K+   +K  +
Sbjct: 428 GEEEQKAPPKKKQLNQKNKQ 447


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 27.6 bits (61), Expect = 7.8
 Identities = 15/59 (25%), Positives = 26/59 (44%)

Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           ++E E K+    + S    K+       +K + +    EE E +EE E  + E +  KR
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKR 401


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 27.7 bits (61), Expect = 8.0
 Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 6/78 (7%)

Query: 198 ESDKEEEEKEEGEQTD------SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
           E D+E  EKE G+  D       E+E K  K     + + KK S  +  +         S
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFS 208

Query: 252 EKEAKKRKKHKKDKKKRK 269
           +K  K R+K K  K +  
Sbjct: 209 KKLQKWRQKQKLKKPRSG 226


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 27.1 bits (60), Expect = 8.1
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
           KE ++R+K   D+K + +K K  +E+ K      + E  A K 
Sbjct: 53  KETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKE 95


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 27.4 bits (61), Expect = 8.1
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
           +K+ +K K K  +   K + E+   E+  + R+K K+ K K +++ 
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 27.4 bits (61), Expect = 8.2
 Identities = 13/63 (20%), Positives = 33/63 (52%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           +K K    ++++  S  +E +K+  E  D E   ++  +  + +  K+   K +E++++E
Sbjct: 50  QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109

Query: 245 EGE 247
           E +
Sbjct: 110 EKQ 112


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 27.5 bits (61), Expect = 8.4
 Identities = 17/33 (51%), Positives = 19/33 (57%), Gaps = 2/33 (6%)

Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
           E  E  E  Q ++E EA K  K  KDKK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDK--KDKKSKKKK 258


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 27.5 bits (61), Expect = 8.4
 Identities = 9/28 (32%), Positives = 11/28 (39%)

Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
             K K  K       KK K +E E K +
Sbjct: 330 VTKEKNKKVPVMFLGKKPKEKEVESKSQ 357


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 27.6 bits (61), Expect = 8.6
 Identities = 17/47 (36%), Positives = 20/47 (42%), Gaps = 7/47 (14%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKS-------KYEEEEEKEEGEQTDSEK 253
           S K++ K  K   DKKK  K           E+ EE  EG   D EK
Sbjct: 662 SRKKSSKSSKKGSDKKKSGKHVDPTVPVFNLEDIEEGVEGAGFDDEK 708


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 27.5 bits (61), Expect = 8.6
 Identities = 15/74 (20%), Positives = 28/74 (37%), Gaps = 2/74 (2%)

Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
             + E EE+      D E+E   R   +  ++K    S+  ++EE             K+
Sbjct: 397 LFEDEAEEEARQRLADDEEE--VRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQKQ 454

Query: 258 RKKHKKDKKKRKKK 271
           R   +K     ++K
Sbjct: 455 RFLREKQTAFERQK 468


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 27.5 bits (61), Expect = 8.6
 Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 2/76 (2%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           + S +   E+    P SA  D  D E+E+  + ++ D + E ++        +     S+
Sbjct: 202 MLSSMDMAEELGDEPESA--DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSE 259

Query: 237 YEEEEEKEEGEQTDSE 252
             E  E E  E +  +
Sbjct: 260 EMESGEMEAAEASADD 275


>gnl|CDD|205150 pfam12925, APP_E2, E2 domain of amyloid precursor protein.  The E2
           domain is the largest of the conserved domains of the
           amyloid precursor protein. The structure of E2 consists
           of two coiled-coil sub-structures connected through a
           continuous helix, and bears an unexpected resemblance to
           the spectrin family of protein structures.E 2 can
           reversibly dimerise in solution, and the dimerisation
           occurs along the longest dimension of the molecule in an
           antiparallel orientation, which enables the N-terminal
           substructure of one monomer to pack against the
           C-terminal substructure of a second monomer. The high
           degree of conservation of residues at the putative dimer
           interface suggests that the E2 dimer observed in the
           crystal could be physiologically relevant. Heparin
           sulfate proteoglycans, the putative ligands for the
           precursor present in extracellular matrix, bind to E2 at
           a conserved and positively charged site near the dimer
           interface.
          Length = 193

 Score = 27.0 bits (60), Expect = 8.7
 Identities = 19/71 (26%), Positives = 33/71 (46%)

Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
              S +SD  D    E +E  + +  KEAKKR + K  ++  +   ++EE E + +    
Sbjct: 3   PTPSPTSDAVDPYFSEPKEDNEHERYKEAKKRLEEKHRERMTQVMKEWEEAESQYKNLPK 62

Query: 250 DSEKEAKKRKK 260
              K A+  +K
Sbjct: 63  ADPKAAQLMRK 73


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 26.7 bits (59), Expect = 8.8
 Identities = 17/85 (20%), Positives = 39/85 (45%)

Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
           +K     +  +S E  + E E++  EQ   E+E  +++K +  +++++K+ +  + E+K 
Sbjct: 21  QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80

Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRK 269
           E E+                + K K
Sbjct: 81  EDEKLQETWHEHNLALANFIRTKTK 105


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 27.2 bits (61), Expect = 8.9
 Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 10/68 (14%)

Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
           LK KL                  +K EEE+EE      E+E ++ ++ KK+K     K+K
Sbjct: 5   LKEKLS----------KFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAK 54

Query: 237 YEEEEEKE 244
             E +EK+
Sbjct: 55  ITEIKEKD 62


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 27.4 bits (61), Expect = 8.9
 Identities = 11/46 (23%), Positives = 25/46 (54%)

Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
             D K+  +  +  ++ E ++ E +D E++  K+K  KK ++K + 
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES 196


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 27.5 bits (61), Expect = 9.2
 Identities = 18/102 (17%), Positives = 37/102 (36%), Gaps = 11/102 (10%)

Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEG--EQTDSEKEAKKRKKHK---KDKKKKKKKSK 236
           KA +  K    +   +  D+ ++         T +     + K       + K K+K SK
Sbjct: 3   KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62

Query: 237 YEEE------EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
              +      EE+E+G ++ S+     +K        +K + 
Sbjct: 63  LYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQVSTQKWRK 104


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 27.4 bits (60), Expect = 9.3
 Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 10/89 (11%)

Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKR---KKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
               E+ KE + K+  + +D + + KK    KK+KK K   ++ +  +EE E  + +  D
Sbjct: 442 RERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELIKADDED 501

Query: 251 SE-------KEAKKRKKHKKDKKKRKKKS 272
            E       K   K +K KK++K +KK S
Sbjct: 502 DEQLDHFDMKSILKAEKFKKNRKLKKKAS 530


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 26.4 bits (58), Expect = 9.3
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 214 SEKEAKKRKKHKKDKKKKKKKSK 236
           +EK  KK+KK KK KK KK   K
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 27.4 bits (61), Expect = 9.3
 Identities = 12/60 (20%), Positives = 20/60 (33%), Gaps = 8/60 (13%)

Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
           E + SE + K + K K    K+             E       + A   K+     +K+K
Sbjct: 237 ENSSSEAKPKAKAKPKTAGIKRPVVKM--------EKTAEKGGRPASNGKRFTSPGRKKK 288



 Score = 27.0 bits (60), Expect = 9.8
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 4/71 (5%)

Query: 162 RPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR 221
           R L D  ++   +L+  S  +AK K KA P +A      K    K E       + A   
Sbjct: 222 RDLTDDELIDLFKLIENSSSEAKPKAKAKPKTAGI----KRPVVKMEKTAEKGGRPASNG 277

Query: 222 KKHKKDKKKKK 232
           K+     +KKK
Sbjct: 278 KRFTSPGRKKK 288


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 27.1 bits (60), Expect = 9.4
 Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 3/84 (3%)

Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
            A+    +  E  + E E  E +     +     +      K+K K    E++ +K + +
Sbjct: 44  LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKP---EKKPKKPKPK 100

Query: 248 QTDSEKEAKKRKKHKKDKKKRKKK 271
                K   K K   K KK   K 
Sbjct: 101 PKPKPKPKPKVKPQPKPKKPPSKT 124


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 27.1 bits (60), Expect = 9.6
 Identities = 13/88 (14%), Positives = 38/88 (43%), Gaps = 10/88 (11%)

Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
           ++  +KR+    +   D   +++E  +E E+    ++ ++  + ++     + +   +E+
Sbjct: 31  IQVMKKRQQRIANRWQDAEQRQQEAGQEAER---YRQKQQSLEQQRASFMAQAQEAADEQ 87

Query: 241 ------EEKEEGEQTDSE-KEAKKRKKH 261
                 E +E+      E  E  +R+K 
Sbjct: 88  RQHLLNEAREDVATARDEWLEQLEREKQ 115


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 27.3 bits (60), Expect = 9.8
 Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 5/98 (5%)

Query: 163 PLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK 222
           P + V  +     V KS + + +  K  PT  S     +    +   E    E E +   
Sbjct: 49  PFKTVGALSKATKVWKSAVSSSDDSKTVPTPVS-----EPNITRSFQEPVSQESEVQDNT 103

Query: 223 KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
           +  +D K  K  S+ ++++ +EE  ++ S K+ K  KK
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKK 141


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.131    0.365 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,494,132
Number of extensions: 1314821
Number of successful extensions: 13824
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8446
Number of HSP's successfully gapped: 1769
Length of query: 272
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 177
Effective length of database: 6,723,972
Effective search space: 1190143044
Effective search space used: 1190143044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 58 (26.2 bits)