RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13068
         (950 letters)



>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
          Length = 761

 Score =  319 bits (820), Expect = 4e-96
 Identities = 107/195 (54%), Positives = 144/195 (73%), Gaps = 1/195 (0%)

Query: 749 GAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEAAAVIMG 808
             P P G +  +QF+T YQ  +G+ ++RVTT+ R W + +   + + +GFDQEAAAV+M 
Sbjct: 469 SNPQPPGQQFFLQFLTRYQHSNGQTRLRVTTVTRRWVEGS-SSEELVAGFDQEAAAVVMA 527

Query: 809 RMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRS 868
           R+  ++ E ++  D  RW DR LIRLC KFG Y KDDP SF+L  NF +YPQFM++LRRS
Sbjct: 528 RLASHKMETEEEFDATRWLDRALIRLCSKFGDYRKDDPSSFRLSPNFSLYPQFMFNLRRS 587

Query: 869 QFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCI 928
           QF+QVFNNSPDET+Y+R +L RE+++ SL+MIQP L SY FNGPP PVLLD +SI  D I
Sbjct: 588 QFVQVFNNSPDETAYFRMMLNRENVTNSLVMIQPTLISYSFNGPPEPVLLDVASIAADRI 647

Query: 929 LLMDTFFQLLIFHGE 943
           LL+D++F ++IFHG 
Sbjct: 648 LLLDSYFSVVIFHGS 662



 Score =  195 bits (498), Expect = 4e-52
 Identities = 71/138 (51%), Positives = 89/138 (64%), Gaps = 29/138 (21%)

Query: 519 CQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
           C KFG Y KDDP SF+L  NF +YPQFM++LRRSQF+QVF                    
Sbjct: 554 CSKFGDYRKDDPSSFRLSPNFSLYPQFMFNLRRSQFVQVF-------------------- 593

Query: 579 SRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKP 638
                    NNSPDET+Y+R +L RE+++ SL+MIQP L SY FNGPP PVLLD +SI  
Sbjct: 594 ---------NNSPDETAYFRMMLNRENVTNSLVMIQPTLISYSFNGPPEPVLLDVASIAA 644

Query: 639 DCILLMDTFFQLLIFHGE 656
           D ILL+D++F ++IFHG 
Sbjct: 645 DRILLLDSYFSVVIFHGS 662



 Score =  189 bits (482), Expect = 4e-50
 Identities = 71/132 (53%), Positives = 96/132 (72%), Gaps = 7/132 (5%)

Query: 203 EERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCRAILN 262
           E  DG+R +WNVWPSSK+EAS+ V+P+  LY PLK  P+LP L Y+PL C    CRA+LN
Sbjct: 7   EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRT--CRAVLN 64

Query: 263 PLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPAELHPQFTTIEYTIPVSQCFPKYP 322
           P C+VD+++K+W+C FCFQRN FPP Y++I+E + PAEL PQ+TT+EYT+P        P
Sbjct: 65  PYCRVDFQAKIWICPFCFQRNHFPPHYSSISETNLPAELFPQYTTVEYTLPPGSGGAPSP 124

Query: 323 LRSLPVFLPFLI 334
               PVF+ F++
Sbjct: 125 ----PVFV-FVV 131



 Score =  128 bits (324), Expect = 2e-30
 Identities = 44/75 (58%), Positives = 59/75 (78%), Gaps = 2/75 (2%)

Query: 351 SPNLPPLQYEPLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQP 410
            P LP L Y+PL C    CRA+LNP C+VD+++K+W+C FCFQRN FPP Y++I+E + P
Sbjct: 43  LPELPVLPYDPLRCRT--CRAVLNPYCRVDFQAKIWICPFCFQRNHFPPHYSSISETNLP 100

Query: 411 AELHPQFTTIEYTIP 425
           AEL PQ+TT+EYT+P
Sbjct: 101 AELFPQYTTVEYTLP 115



 Score = 99.2 bits (248), Expect = 4e-21
 Identities = 37/64 (57%), Positives = 47/64 (73%), Gaps = 2/64 (3%)

Query: 12 EERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCRAILN 71
          E  DG+R +WNVWPSSK+EAS+ V+P+  LY PLK  P+LP L Y+PL C    CRA+LN
Sbjct: 7  EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRT--CRAVLN 64

Query: 72 PLCQ 75
          P C+
Sbjct: 65 PYCR 68



 Score = 77.3 bits (191), Expect = 2e-14
 Identities = 30/62 (48%), Positives = 39/62 (62%)

Query: 656 EEPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNA 715
            EPIRSH D+ K    Y K A K YEGLA +    GH +D+++C+LDQ G+ EMK+    
Sbjct: 307 SEPIRSHKDLDKDAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVER 366

Query: 716 TG 717
           TG
Sbjct: 367 TG 368


>gnl|CDD|227380 COG5047, SEC23, Vesicle coat complex COPII, subunit SEC23
           [Intracellular trafficking and secretion].
          Length = 755

 Score =  261 bits (669), Expect = 4e-75
 Identities = 118/201 (58%), Positives = 140/201 (69%), Gaps = 3/201 (1%)

Query: 743 STTAAHGAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEA 802
           +  AA G+   +     IQFIT YQ  SG  ++RVTT+AR + D    L  I+  FDQEA
Sbjct: 459 ALGAASGSA-QRPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDG--GLPKINRSFDQEA 515

Query: 803 AAVIMGRMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHSFQLPENFQMYPQFM 862
           AAV M R+   +AE +D  DV RW DR LIRLCQKF  Y KDDP SF+L  NF +YPQFM
Sbjct: 516 AAVFMARIAAFKAETEDIIDVFRWIDRNLIRLCQKFADYRKDDPSSFRLDPNFTLYPQFM 575

Query: 863 YHLRRSQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSS 922
           YHLRRS FL VFNNSPDET++YRH+L   D++ SLIMIQP L SY F    VPVLLD+ S
Sbjct: 576 YHLRRSPFLSVFNNSPDETAFYRHMLNNADVNDSLIMIQPTLQSYSFEKGGVPVLLDSVS 635

Query: 923 IKPDCILLMDTFFQLLIFHGE 943
           +KPD ILL+DTFF +LIFHG 
Sbjct: 636 VKPDVILLLDTFFHILIFHGS 656



 Score =  158 bits (400), Expect = 8e-40
 Identities = 74/138 (53%), Positives = 85/138 (61%), Gaps = 29/138 (21%)

Query: 519 CQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
           CQKF  Y KDDP SF+L  NF +YPQFMYHLRRS FL VFNNSPDET             
Sbjct: 548 CQKFADYRKDDPSSFRLDPNFTLYPQFMYHLRRSPFLSVFNNSPDET------------- 594

Query: 579 SRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKP 638
                           ++YRH+L   D++ SLIMIQP L SY F    VPVLLD+ S+KP
Sbjct: 595 ----------------AFYRHMLNNADVNDSLIMIQPTLQSYSFEKGGVPVLLDSVSVKP 638

Query: 639 DCILLMDTFFQLLIFHGE 656
           D ILL+DTFF +LIFHG 
Sbjct: 639 DVILLLDTFFHILIFHGS 656



 Score =  151 bits (384), Expect = 7e-38
 Identities = 60/132 (45%), Positives = 80/132 (60%), Gaps = 8/132 (6%)

Query: 199 FYQNEERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCR 258
           F   EE DGIR TWNV+P+++ +A+R V+P+ CLY PL E   L    YEP+ C    C+
Sbjct: 3   FEIIEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTA-PCK 61

Query: 259 AILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPAELHPQFTTIEYTIPVSQCF 318
           A+LNP C +D +++ W+C FC QRN  PPQY  I+  + P EL PQ +TIEYT+      
Sbjct: 62  AVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLPLELLPQSSTIEYTLSKPVIL 121

Query: 319 PKYPLRSLPVFL 330
           P       PVF 
Sbjct: 122 P-------PVFF 126



 Score = 95.7 bits (238), Expect = 4e-20
 Identities = 33/67 (49%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 8  FYQNEERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCR 67
          F   EE DGIR TWNV+P+++ +A+R V+P+ CLY PL E   L    YEP+ C    C+
Sbjct: 3  FEIIEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTA-PCK 61

Query: 68 AILNPLC 74
          A+LNP C
Sbjct: 62 AVLNPYC 68



 Score = 89.2 bits (221), Expect = 4e-18
 Identities = 34/74 (45%), Positives = 46/74 (62%), Gaps = 1/74 (1%)

Query: 352 PNLPPLQYEPLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPA 411
             L    YEP+ C    C+A+LNP C +D +++ W+C FC QRN  PPQY  I+  + P 
Sbjct: 44  DALTVNYYEPVKCTA-PCKAVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLPL 102

Query: 412 ELHPQFTTIEYTIP 425
           EL PQ +TIEYT+ 
Sbjct: 103 ELLPQSSTIEYTLS 116



 Score = 62.2 bits (151), Expect = 8e-10
 Identities = 30/61 (49%), Positives = 43/61 (70%)

Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
           EP+RSH+DI   + ++ K ATK Y+GLA R A  GHA+DI++  LDQ G++EM+    +T
Sbjct: 303 EPMRSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTST 362

Query: 717 G 717
           G
Sbjct: 363 G 363


>gnl|CDD|218277 pfam04815, Sec23_helical, Sec23/Sec24 helical domain.  COPII-coated
           vesicles carry proteins from the endoplasmic reticulum
           to the Golgi complex. This vesicular transport can be
           reconstituted by using three cytosolic components
           containing five proteins: the small GTPase Sar1p, the
           Sec23p/24p complex, and the Sec13p/Sec31p complex. This
           domain is composed of five alpha helices.
          Length = 103

 Score =  138 bits (350), Expect = 2e-38
 Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 2/103 (1%)

Query: 800 QEAAAVIMGRMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHS--FQLPENFQM 857
           QEA AV++ +  V +A      D  RW D+ L+ +   +  Y      S    LPE+ ++
Sbjct: 1   QEAIAVLLAKKAVEKALTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKL 60

Query: 858 YPQFMYHLRRSQFLQVFNNSPDETSYYRHVLMREDLSQSLIMI 900
            P +M  L +S  L+  N SPDE +Y R +L+   +   L+MI
Sbjct: 61  LPLYMLALLKSPALRGGNVSPDERAYARCLLLSLPVENLLLMI 103



 Score = 85.3 bits (212), Expect = 6e-20
 Identities = 22/116 (18%), Positives = 39/116 (33%), Gaps = 31/116 (26%)

Query: 500 QEDGNEEIEDERETKMEEECQKFGHYNKDDPHS--FQLPENFQMYPQFMYHLRRSQFLQV 557
                ++     + K+ +    +  Y      S    LPE+ ++ P +M  L +S     
Sbjct: 17  LTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKLLPLYMLALLKS----- 71

Query: 558 FNNSPDETSYYRTLLIHLPHYSRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMI 613
                                     L+  N SPDE +Y R +L+   +   L+MI
Sbjct: 72  ------------------------PALRGGNVSPDERAYARCLLLSLPVENLLLMI 103


>gnl|CDD|200443 cd11287, Sec23_C, C-terminal Actin depolymerization factor-homology
           domain of Sec23.  The C-terminal domain of the Sec23
           subunit of the coat protein complex II (COPII) is
           distantly related to gelsolin-like repeats and the actin
           depolymerizing domains found in cofilin and similar
           proteins. Sec23 forms a tight complex with Sec24. The
           cytoplasmic Sec23/24 complex is recruited together with
           Sar1-GTP and Sec13/31 to induce coat polymerization and
           membrane deformation in the forming of COPII-coated
           endoplasmic reticulum vesicles. The function of the
           Sec23 C-terminal domain is unclear.
          Length = 121

 Score =  105 bits (263), Expect = 1e-26
 Identities = 41/53 (77%), Positives = 46/53 (86%)

Query: 604 EDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGE 656
           ED+S SLIMIQP LYSY FNGPP PVLLD+SSI PD ILL+DTFF +LI+HGE
Sbjct: 1   EDVSNSLIMIQPTLYSYSFNGPPEPVLLDSSSILPDRILLLDTFFHILIYHGE 53



 Score =  105 bits (263), Expect = 1e-26
 Identities = 41/53 (77%), Positives = 46/53 (86%)

Query: 891 EDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGE 943
           ED+S SLIMIQP LYSY FNGPP PVLLD+SSI PD ILL+DTFF +LI+HGE
Sbjct: 1   EDVSNSLIMIQPTLYSYSFNGPPEPVLLDSSSILPDRILLLDTFFHILIYHGE 53


>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
           eukaryotes is mediated by at least in part by the
           budding and fusion of intracellular transport vesicles
           that selectively carry cargo proteins and lipids from
           donor to acceptor organelles. The two main classes of
           vesicular carriers within the endocytic and the
           biosynthetic pathways are COP- and clathrin-coated
           vesicles. Formation of COPII vesicles requires the
           ordered assembly of the coat built from several
           cytosolic components GTPase Sar1, complexes of
           Sec23-Sec24 and Sec13-Sec31. The process is initiated by
           the conversion of GDP to GTP by the GTPase Sar1 which
           then recruits the heterodimeric complex of Sec23 and
           Sec24. This heterodimeric complex generates the
           pre-budding complex. The final step leading to membrane
           deformation and budding of COPII-coated vesicles is
           carried by the heterodimeric complex Sec13-Sec31. The
           members of this CD belong to the Sec23-like family. Sec
           23 is very similar to Sec24. The Sec23 and Sec24
           polypeptides fold into five distinct domains: a
           beta-barrel, a zinc finger, a vWA or trunk, an all
           helical region and a carboxy Gelsolin domain. The
           members of this subgroup lack the consensus MIDAS motif
           but have the overall Para-Rossmann type fold that is
           characteristic of this superfamily.
          Length = 267

 Score =  102 bits (257), Expect = 4e-24
 Identities = 39/72 (54%), Positives = 50/72 (69%), Gaps = 2/72 (2%)

Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
           +PIRSH+DI K N KY K A K Y+ LA R A  GHA+DI++  LDQ GLLEMK+  N+T
Sbjct: 187 DPIRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIFAGCLDQVGLLEMKVLVNST 246

Query: 717 GKPSHVLIAIEF 728
           G   HV+++  F
Sbjct: 247 G--GHVVLSDSF 256


>gnl|CDD|203092 pfam04810, zf-Sec23_Sec24, Sec23/Sec24 zinc finger.  COPII-coated
           vesicles carry proteins from the endoplasmic reticulum
           to the Golgi complex. This vesicular transport can be
           reconstituted by using three cytosolic components
           containing five proteins: the small GTPase Sar1p, the
           Sec23p/24p complex, and the Sec13p/Sec31p complex. This
           domain is found to be zinc binding domain.
          Length = 39

 Score = 66.4 bits (163), Expect = 5e-14
 Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 249 PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 289
           P+ C R  CRA LNP CQ D+  + W CN C QRN  PP+Y
Sbjct: 1   PVRCRR--CRAYLNPYCQFDFGGRRWTCNLCGQRNDLPPEY 39



 Score = 66.4 bits (163), Expect = 5e-14
 Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 361 PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 401
           P+ C R  CRA LNP CQ D+  + W CN C QRN  PP+Y
Sbjct: 1   PVRCRR--CRAYLNPYCQFDFGGRRWTCNLCGQRNDLPPEY 39



 Score = 28.3 bits (64), Expect = 1.5
 Identities = 11/18 (61%), Positives = 12/18 (66%), Gaps = 2/18 (11%)

Query: 58 PLLCMRNQCRAILNPLCQ 75
          P+ C R  CRA LNP CQ
Sbjct: 1  PVRCRR--CRAYLNPYCQ 16


>gnl|CDD|147125 pfam04811, Sec23_trunk, Sec23/Sec24 trunk domain.  COPII-coated
           vesicles carry proteins from the endoplasmic reticulum
           to the Golgi complex. This vesicular transport can be
           reconstituted by using three cytosolic components
           containing five proteins: the small GTPase Sar1p, the
           Sec23p/24p complex, and the Sec13p/Sec31p complex. This
           domain is known as the trunk domain and has an
           alpha/beta vWA fold and forms the dimer interface.
          Length = 241

 Score = 70.7 bits (174), Expect = 2e-13
 Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 15/91 (16%)

Query: 642 LLMDTFFQ---LLIFHGEEPI------------RSHNDIHKGNNKYQKMATKHYEGLALR 686
           LL+   F    +++F G  P              SH+D  K   K  K A K Y+ LA  
Sbjct: 125 LLLKAAFTGGKIMLFQGGLPTVGPGGKLKSRLDESHHDTDKEKAKLVKKADKFYKSLAKE 184

Query: 687 AAIVGHAIDIYSCALDQTGLLEMKLCCNATG 717
               GH++D+++ +LD   + E+      TG
Sbjct: 185 CVAQGHSVDLFAFSLDYVDVAELGCLSRLTG 215


>gnl|CDD|219707 pfam08033, Sec23_BS, Sec23/Sec24 beta-sandwich domain. 
          Length = 86

 Score = 55.2 bits (134), Expect = 1e-09
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 748 HGAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWA 785
              P+  G +  IQF   Y   SGE+++RVTT+A    
Sbjct: 49  IDEPLDSGKQAYIQFALLYTHSSGERRIRVTTVALPVT 86


>gnl|CDD|238745 cd01468, trunk_domain, trunk domain. COPII-coated vesicles carry
           proteins from the endoplasmic reticulum to the Golgi
           complex. This vesicular transport can be reconstituted
           by using three cytosolic components containing five
           proteins: the small GTPase Sar1p, the Sec23p/24p
           complex, and the Sec13p/Sec31p complex. This domain is
           known as the trunk domain and has an alpha/beta vWA fold
           and forms the dimer interface. Some members of this
           family possess a partial MIDAS motif that is a
           characteristic feature of most vWA domain proteins.
          Length = 239

 Score = 58.8 bits (143), Expect = 2e-09
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 5/61 (8%)

Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
           EPIRSH++      +  K ATK Y+ LA      G  +D+++ +LD   +  +K    +T
Sbjct: 160 EPIRSHDEA-----QLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKST 214

Query: 717 G 717
           G
Sbjct: 215 G 215


>gnl|CDD|200436 cd11280, gelsolin_like, Tandemly repeated domains found in
           gelsolin, severin, villin, and related proteins.
           Gelsolin repeats occur in gelsolin, severin, villin,
           advillin, villidin, supervillin, flightless, quail,
           fragmin, and other proteins, usually in several copies.
           They co-occur with villin headpiece domains,
           leucine-rich repeats, and several other domains. These
           gelsolin-related actin binding proteins (GRABPs) play
           regulatory roles in the assembly and disassembly of
           actin filaments; they are involved in F-actin capping,
           uncapping, severing, or the nucleation of actin
           filaments. Severing of actin filaments is Ca2+
           dependent. Villins are also linked to generating bundles
           of F-actin with uniform filament polarity, which is most
           likely mediated by their extra villin headpiece domain.
           Many family members have also adopted functions in the
           nucleus, including the regulation of transcription.
           Supervillin, gelsolin, and flightless I are involved in
           intracellular signaling via nuclear hormone receptors.
           The gelsolin-like domain is distantly related to the
           actin depolymerizing domains found in cofilin and
           similar proteins.
          Length = 88

 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 904 LYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGEVSN 946
                       V L +SS+  D + ++DT  ++ I+ G  S+
Sbjct: 6   RVRGSKAIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRASS 48



 Score = 40.8 bits (96), Expect = 2e-04
 Identities = 9/42 (21%), Positives = 17/42 (40%)

Query: 617 LYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGEEP 658
                       V L +SS+  D + ++DT  ++ I+ G   
Sbjct: 6   RVRGSKAIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRAS 47


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 44.6 bits (105), Expect = 3e-04
 Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           G QE N E + +    E+  +       GDS E+EE EE+  E+EE++E+  EE E+E E
Sbjct: 833 GEQELNAENQGEAKQDEKGVDGGGGSDGGDS-EEEEEEEEEEEEEEEEEEEEEEEEEENE 891

Query: 513 TKMEEE 518
             +  E
Sbjct: 892 EPLSLE 897



 Score = 37.3 bits (86), Expect = 0.043
 Identities = 22/119 (18%), Positives = 50/119 (42%), Gaps = 5/119 (4%)

Query: 436 NDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE 495
           ++    + E   +       E   + +      E   E  +QE   ++Q + + +E+ ++
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853

Query: 496 DEEDQEDGNEEIEDERETKMEEECQKFGHYNKDD-----PHSFQLPENFQMYPQFMYHL 549
                + G+ E E+E E + EEE ++     +++     P S + PE  Q    +++ L
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLL 912



 Score = 33.8 bits (77), Expect = 0.48
 Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 8/70 (11%)

Query: 455 QEENIE-EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-------EE 506
            E  I  E+  +   E   EA++ + +G+++ +E   E   E E  +++G        EE
Sbjct: 681 SEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEE 740

Query: 507 IEDERETKME 516
           +EDE E + E
Sbjct: 741 VEDEGEGEAE 750



 Score = 33.4 bits (76), Expect = 0.72
 Identities = 19/81 (23%), Positives = 38/81 (46%), Gaps = 6/81 (7%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQ------EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
           + +  + K +  ++E   E E +      EG+ ++ E+ E  E   E E + +   E   
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759

Query: 509 DERETKMEEECQKFGHYNKDD 529
           D +ET+ E E +  G  ++D+
Sbjct: 760 DRKETEHEGETEAEGKEDEDE 780



 Score = 32.3 bits (73), Expect = 1.6
 Identities = 20/69 (28%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE---D 509
           G  E   E + +     E     +QEG+G+ +  E   +   E EE + +G  E E   D
Sbjct: 671 GETETKGENESEGEIPAE--RKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTED 728

Query: 510 ERETKMEEE 518
           E E +  EE
Sbjct: 729 EGEIETGEE 737



 Score = 31.5 bits (71), Expect = 2.7
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 455 QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
            E    E+ ++   E   EAE   + E +GD +E E   E   E +ED+++G  +  ++ 
Sbjct: 730 GEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789

Query: 512 ETKMEEECQKFG 523
           E K +E  +   
Sbjct: 790 EMKGDEGAEGKV 801



 Score = 31.1 bits (70), Expect = 3.2
 Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 5/70 (7%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE----EIE 508
             + E  E++ +  + EE  E  + EG+G+++   E+E +    E + E   E    E E
Sbjct: 720 ETEAEGTEDEGEIETGEEG-EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778

Query: 509 DERETKMEEE 518
           DE E +  E+
Sbjct: 779 DEGEIQAGED 788



 Score = 31.1 bits (70), Expect = 3.3
 Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 13/76 (17%)

Query: 455 QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQED------EEMEEQRIEDEEDQEDGNE 505
           QE   E K ++ S+ E   AE   +QEG+G+ +        E   E+   + E + +G E
Sbjct: 669 QEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727

Query: 506 ---EIEDERETKMEEE 518
              EIE   E +  E+
Sbjct: 728 DEGEIETGEEGEEVED 743



 Score = 30.0 bits (67), Expect = 9.2
 Identities = 20/55 (36%), Positives = 25/55 (45%), Gaps = 4/55 (7%)

Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE-EIEDERETKMEEE 518
           D S+ +  EAE     G+   +E       E E  +E G E E E E ETK E E
Sbjct: 629 DLSKGDVAEAEHT---GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENE 680


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 43.8 bits (104), Expect = 4e-04
 Identities = 27/107 (25%), Positives = 51/107 (47%), Gaps = 14/107 (13%)

Query: 456 EENIEEKFDDSSQEEY-------FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
           E + ++ F     EE         + ED+E   D  E+EE +    ++EED+ED + + E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366

Query: 509 DERETKMEEECQK----FGHYNKDDPHSFQLPENFQMYPQFMYHLRR 551
           D+ E + EE+ +K          + P +F  P++   + +F+  L+ 
Sbjct: 367 DDEEEEEEEKEKKKKKSAESTRSELPFTFPCPKS---HEEFLELLKG 410



 Score = 38.1 bits (89), Expect = 0.027
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 2/65 (3%)

Query: 456 EENIEEKFDDSSQEEYFEAED--QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           EE+ +E  DD   E   + +D    GQG+  E+EE +    EDEED +D  EE E++ + 
Sbjct: 292 EEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDL 351

Query: 514 KMEEE 518
             EEE
Sbjct: 352 SDEEE 356



 Score = 35.7 bits (83), Expect = 0.14
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           EE   ++  D   +E+   +D        E++E EE+   D+ED+ED ++++E+E E
Sbjct: 291 EEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347



 Score = 33.4 bits (77), Expect = 0.75
 Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 5/60 (8%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEM-EEQRIEDEEDQEDG----NEEIEDERETKMEEE 518
           D+  ++    A+D + + +  +D+     Q  EDEE++EDG    +EE +D+   + EE+
Sbjct: 289 DEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEED 348



 Score = 30.7 bits (70), Expect = 4.6
 Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 9/64 (14%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK-- 514
             I++  D    ++     D +  GD     +         ED E+  EE  + +++K  
Sbjct: 124 SEIDKDDDVRDDDD----FDDDDLGDL--ASDDRAAHFGGGEDDEEDEEEQPERKKSKKE 177

Query: 515 -MEE 517
            M+E
Sbjct: 178 VMKE 181


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 41.4 bits (97), Expect = 5e-04
 Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 6/69 (8%)

Query: 473 EAED--QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDP 530
           E ED   E Q D +E+EE +E+ IE+ ED ED  E +EDE E + +EE       +  D 
Sbjct: 39  ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNV----DLKDI 94

Query: 531 HSFQLPENF 539
               + + F
Sbjct: 95  EKKNINDIF 103



 Score = 41.0 bits (96), Expect = 8e-04
 Identities = 19/61 (31%), Positives = 36/61 (59%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           E+  +EK +D  +EE  + E+ E   D +++EE+ E   E+EED+ED  +  + E++   
Sbjct: 41  EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100

Query: 516 E 516
           +
Sbjct: 101 D 101



 Score = 38.7 bits (90), Expect = 0.005
 Identities = 21/64 (32%), Positives = 38/64 (59%), Gaps = 3/64 (4%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE--EDQEDGNEEIEDERETK 514
           E+I ++ +D   E+  + E++E + D +E EE E+   E+E  ED+E+  E+ ED  + K
Sbjct: 34  EDIIKENEDVKDEKQEDDEEEE-EEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLK 92

Query: 515 MEEE 518
             E+
Sbjct: 93  DIEK 96



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 433 ITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
           I EN++V    +          +++  EE+ D+   EE  + ED+E   + +E+EE +E+
Sbjct: 37  IKENEDVKDEKQ----------EDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEE 86

Query: 493 RIEDEEDQEDGNEEIEDERETKMEEECQKF--GHYNKDD 529
              D +D E   + I D   +  ++  Q     +Y K++
Sbjct: 87  DNVDLKDIE--KKNINDIFNSTQDDNAQNLISKNYKKNE 123


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 39.9 bits (93), Expect = 6e-04
 Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 10/64 (15%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-----EEIEDE 510
           EEN  ++ DD       E E++  + D +E+ E EE+  E+EE+ E        E+ ED+
Sbjct: 40  EENGAQEGDDEM-----EEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDD 94

Query: 511 RETK 514
            ETK
Sbjct: 95  AETK 98



 Score = 38.4 bits (89), Expect = 0.002
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           E   ++        E+   +GD + +EE E    ++EE+ E   EE E+E ET
Sbjct: 27  ENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79



 Score = 35.3 bits (81), Expect = 0.025
 Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 4/62 (6%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ---RI-EDEEDQEDGNEEIE 508
           G QE + E + ++   E+  E E +  + + +E+EE E     R  EDEED  +  ++  
Sbjct: 43  GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKT 102

Query: 509 DE 510
           DE
Sbjct: 103 DE 104



 Score = 33.8 bits (77), Expect = 0.093
 Identities = 12/44 (27%), Positives = 26/44 (59%)

Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           E++E      +DE  EE+ +++++++E+G  E E+  E +  E 
Sbjct: 38  ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEG 81



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 14/50 (28%), Positives = 25/50 (50%)

Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           EE    ++    G+  E+   +E   E EE++E   ++ E+E E + EE 
Sbjct: 24  EEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEG 73


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score = 42.5 bits (100), Expect = 0.001
 Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 9/170 (5%)

Query: 759 CIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEAAAVIMGRMVVNRAEQD 818
             Q    Y    GE+++RV  ++      ++ +  + +  DQ A A I+ +    +A   
Sbjct: 568 YFQVALLYTLNDGERRIRVVNLS---LPTSSSIREVYASADQLAIACILAKKASTKALNS 624

Query: 819 DGPDVMRWADRTLIRLCQ--KFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNN 876
              +     +++++ + +  K      +      LP N ++ P  M  L +S   +  + 
Sbjct: 625 SLKEARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGST 684

Query: 877 SPDETSYYRHVLMREDLSQSLIMIQPVLYS----YGFNGPPVPVLLDTSS 922
             D      + L    L Q +  I P LY+        G P   LL   S
Sbjct: 685 PSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAGLPDEGLLVLPS 734



 Score = 42.1 bits (99), Expect = 0.002
 Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 14/118 (11%)

Query: 182 QLSLETLKISMTTYEEFF---YQNEERDGIRCTWNVWP--SSKLEASRLVMPVGCLYQP- 235
            L L    I       F      N     +R T    P  +  L+ S++  P G + +P 
Sbjct: 124 DLFLPPPPIVPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI--PFGLVIRPF 181

Query: 236 --LKEKPDLPPLQYE--PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 289
             L  + D  PL  +   + C R  CR+ +NP  Q   + + W CN C  +N  P  +
Sbjct: 182 LELYPEEDPVPLVEDGSIVRCRR--CRSYINPFVQFIEQGRKWRCNICRSKNDVPEGF 237



 Score = 39.8 bits (93), Expect = 0.007
 Identities = 35/158 (22%), Positives = 56/158 (35%), Gaps = 38/158 (24%)

Query: 298 PAELHPQFTTIEYTIPVSQCFPKYPLRSLPVFLPFLIQFVFFLAITHLINQVGSP----- 352
           P  + P  TT       S C PKY +RS    +P             L+ +   P     
Sbjct: 129 PPPIVPPLTTNFVGSEQSNCSPKY-VRSTMYAIP---------ETNDLLKKSKIPFGLVI 178

Query: 353 ----------NLPPLQYE--PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQ 400
                     +  PL  +   + C R  CR+ +NP  Q   + + W CN C  +N  P  
Sbjct: 179 RPFLELYPEEDPVPLVEDGSIVRCRR--CRSYINPFVQFIEQGRKWRCNICRSKNDVPEG 236

Query: 401 YAAITEQHQP-------AELHPQFTTIEYTIPLLYSIT 431
           +   +  + P        EL  +   +++  P  YS+ 
Sbjct: 237 FDNPSGPNDPRSDRYSRPEL--KSGVVDFLAPKEYSLR 272


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 41.8 bits (98), Expect = 0.001
 Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 7/98 (7%)

Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQ--RIEDEEDQEDGN-EEIEDERETKMEEECQKFGHY 525
              +EA D E     +E+E + ++   +E E+D  DG   E+++E+E    EE Q    Y
Sbjct: 62  ISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREY 121

Query: 526 NKDDPHSFQLPENFQ-MYPQFMYHLRRSQFLQ---VFN 559
           N  D ++ QL +N Q +  Q+ Y L +   L+   +FN
Sbjct: 122 NLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRKTNIFN 159


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 2/64 (3%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           + E   E  DD  +EE +E E+ E   D  E  ++E  +  +  D ED  EE +DE   K
Sbjct: 87  EAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSED--EEEKDEAAKK 144

Query: 515 MEEE 518
            +E+
Sbjct: 145 AKED 148



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 444 EIHIKPRICGL----QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
           E  +   I GL    + +  E K  ++ Q    + +D E +    E++E  +   E  + 
Sbjct: 62  EEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDV 121

Query: 500 QEDGNEEIEDERETKMEEECQK 521
           + D   E  D  + + ++E  K
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAK 143



 Score = 36.6 bits (85), Expect = 0.047
 Identities = 20/68 (29%), Positives = 39/68 (57%), Gaps = 5/68 (7%)

Query: 456 EENIEEKFDDSSQEEYF----EAEDQEG-QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           E+ +E   DD  +EE+     E  D EG   D + D+E+E    EDEE++++  ++ +++
Sbjct: 89  EQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKED 148

Query: 511 RETKMEEE 518
            + ++ EE
Sbjct: 149 SDEELSEE 156



 Score = 30.4 bits (69), Expect = 4.2
 Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-EEIEDERETKMEEECQK 521
           DD  +    E++ +    DS+++EE +E   + +ED ++   EE E+E   + E E +K
Sbjct: 113 DDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEK 171


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 40.4 bits (94), Expect = 0.003
 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 7/84 (8%)

Query: 440 IRRNEIHIKPRIC--GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE 497
           + R  +  KPR+    +  +N +E  D++   +     D E +   +E +  EE+  EDE
Sbjct: 140 VEREILAEKPRVTRFNIVWDN-DEDNDEAPPAQ--PDVDNEEEERLEESDGREEE--EDE 194

Query: 498 EDQEDGNEEIEDERETKMEEECQK 521
           E   D   E   E   + EEE + 
Sbjct: 195 EVGSDSYGEGNRELNEEEEEEAEG 218



 Score = 40.4 bits (94), Expect = 0.003
 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           + +  EE+ D    EE       EG  +  E+EE E +  +D ED  D   E  D+++ +
Sbjct: 184 ESDGREEEED----EEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGE 239

Query: 515 MEEE 518
            EE 
Sbjct: 240 EEEM 243



 Score = 39.6 bits (92), Expect = 0.005
 Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 12/77 (15%)

Query: 453 GLQEENIEEKFDDSSQEEYFEA-----EDQEGQGDSQEDEEMEEQRIED------EEDQE 501
           G +EE  EE   DS  E   E      E+ EG  D ++  + E +RI+       E ++E
Sbjct: 187 GREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEE 246

Query: 502 DGN-EEIEDERETKMEE 517
             N  EIE E E+  EE
Sbjct: 247 VINLFEIEWEEESPSEE 263



 Score = 39.2 bits (91), Expect = 0.006
 Identities = 17/64 (26%), Positives = 33/64 (51%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +EE +EE      +E+     D  G+G+ + +EE EE+    ++ ++  + E E   + +
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQ 237

Query: 515 MEEE 518
            EEE
Sbjct: 238 GEEE 241



 Score = 38.8 bits (90), Expect = 0.008
 Identities = 18/86 (20%), Positives = 38/86 (44%), Gaps = 4/86 (4%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGD----SQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
              +   D+  +E   E++ +E + D    S    E   +  E+EE++ +G+++ ED  +
Sbjct: 168 PPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVD 227

Query: 513 TKMEEECQKFGHYNKDDPHSFQLPEN 538
            + E   +K G   + +     L E 
Sbjct: 228 YEGERIDKKQGEEEEMEEEVINLFEI 253



 Score = 33.0 bits (75), Expect = 0.54
 Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 2/69 (2%)

Query: 449 PRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
           P    +  E  E   +   +EE  + E           E  EE+  E+E +  D  E++ 
Sbjct: 169 PAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEE--EEEAEGSDDGEDVV 226

Query: 509 DERETKMEE 517
           D    ++++
Sbjct: 227 DYEGERIDK 235



 Score = 33.0 bits (75), Expect = 0.57
 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 3/63 (4%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +EE   E  DD      +E E  +     Q +EE  E+ + +  + E   E   +E    
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERID---KKQGEEEEMEEEVINLFEIEWEEESPSEEVPRN 267

Query: 515 MEE 517
            EE
Sbjct: 268 NEE 270


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 40.7 bits (95), Expect = 0.003
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 3/90 (3%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           + + E+  DD  +EE  E+ + E   D  E +   E   E   +    +E  ED+ E KM
Sbjct: 108 DMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNES--ESSLEDLSDDET-EDDEEKKM 164

Query: 516 EEECQKFGHYNKDDPHSFQLPENFQMYPQF 545
           EEE       + +     +  +   +  +F
Sbjct: 165 EEEEAGEEKESVEQATREKKFDKSGVDDKF 194



 Score = 38.8 bits (90), Expect = 0.015
 Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 7/80 (8%)

Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQEG-QGDSQEDEEMEEQRIEDE------EDQED 502
            I  L +    E  +D S     ++ED    + + +EDE +E++ I+DE       + E 
Sbjct: 87  HIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESES 146

Query: 503 GNEEIEDERETKMEEECQKF 522
             E++ D+     EE+  + 
Sbjct: 147 SLEDLSDDETEDDEEKKMEE 166



 Score = 33.0 bits (75), Expect = 0.84
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           +E ++++ D   +EE  E ++QE   D +E  E    +++ +E   +G +    +  +  
Sbjct: 282 KEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSF 341

Query: 516 E 516
           E
Sbjct: 342 E 342



 Score = 32.7 bits (74), Expect = 1.2
 Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 2/57 (3%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
           +EK       +  E ED E   ++ + E   +   EDEED E   E+ +DE E    
Sbjct: 261 KEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDE--QEDDQDEEEPPEA 315



 Score = 30.3 bits (68), Expect = 5.9
 Identities = 15/87 (17%), Positives = 37/87 (42%), Gaps = 1/87 (1%)

Query: 444 EIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEM-EEQRIEDEEDQED 502
           ++  KP +  L ++ +E   +        + +D E   D  +  +M  E   +DEE++E+
Sbjct: 64  KLQNKPILDDLNQKYVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEE 123

Query: 503 GNEEIEDERETKMEEECQKFGHYNKDD 529
                ++  + + E +       + +D
Sbjct: 124 DESLEDEMIDDEDEADLFNESESSLED 150


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 37.4 bits (86), Expect = 0.005
 Identities = 12/31 (38%), Positives = 21/31 (67%)

Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
           ED +   D +ED++ E++  +DE+D ED +E
Sbjct: 10  EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 31.7 bits (71), Expect = 0.37
 Identities = 13/33 (39%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 478 EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           EG+ DS  D + EE   +D+ED+ED +E+ +++
Sbjct: 7   EGEEDSDSDSDEEED--DDDEDEEDDDEDDDED 37


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 38.9 bits (90), Expect = 0.005
 Identities = 24/84 (28%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 440 IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED--- 496
           +++++I  K  +   QE+  + + +    EE  E E +E + D +ED+E +E+  E+   
Sbjct: 110 VKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169

Query: 497 EEDQEDGNEEI--EDERETKMEEE 518
           EE  +D   EI  +DE ET+ +++
Sbjct: 170 EEQSDDSEHEIIEQDESETESDDD 193



 Score = 30.8 bits (69), Expect = 2.3
 Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 445 IHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDG 503
           I I+ ++  +++++I++K     ++E  E   QE +   + +E ME E+R E  +++ED 
Sbjct: 101 IRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQERE---ELEERMEWERREEKIDEREDQ 157

Query: 504 NEEIEDERETKMEEECQKFGH 524
            E+  +  E  +EE+     H
Sbjct: 158 EEQEREREEQTIEEQSDDSEH 178


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.9 bits (94), Expect = 0.006
 Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 6/89 (6%)

Query: 435 ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQ-EEYFEAEDQEGQGDSQEDEEMEEQR 493
           E +   RRNE+    R    +EE ++ K +   + EE  E +++E     +  +E EE+ 
Sbjct: 68  ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL 127

Query: 494 IEDEEDQEDGNEEI-----EDERETKMEE 517
            E   +Q +  E I     E+ +E  +EE
Sbjct: 128 EELIAEQREELERISGLTQEEAKEILLEE 156


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 38.6 bits (90), Expect = 0.008
 Identities = 19/63 (30%), Positives = 36/63 (57%), Gaps = 4/63 (6%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            +  E K +D  +EE  E E+ E +   ++ EE  E +IE+ +D+   N ++++E + K+
Sbjct: 18  CKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE---NNKLKEENK-KL 73

Query: 516 EEE 518
           E E
Sbjct: 74  ENE 76



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 9/73 (12%)

Query: 456 EENIEEKFDDSSQEEYF-EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE-- 512
           E+  ++   ++ +E+   E E++E     +ED E EE  IE EE  ED  E  E + E  
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEE--IEKEEIIEDSEESNEVKIEEL 59

Query: 513 ----TKMEEECQK 521
                K++EE +K
Sbjct: 60  KDENNKLKEENKK 72



 Score = 37.8 bits (88), Expect = 0.013
 Identities = 19/63 (30%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 452 CGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE-EDQEDGNEEIEDE 510
           C  + EN EE        E+ E E +E   DS+E  E++ + ++DE    ++ N+++E+E
Sbjct: 17  CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76

Query: 511 RET 513
            E 
Sbjct: 77  LEA 79


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 38.6 bits (90), Expect = 0.008
 Identities = 13/56 (23%), Positives = 30/56 (53%)

Query: 463 FDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           F +  ++   +    E +    E E+++E+  +DEE++E+  EE ED  +   +++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 34.7 bits (80), Expect = 0.15
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
              E     E+ E   + +E+EE E++  +D++D +D +   E
Sbjct: 161 KELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAE 203



 Score = 33.6 bits (77), Expect = 0.32
 Identities = 18/56 (32%), Positives = 34/56 (60%), Gaps = 5/56 (8%)

Query: 456 EENIEEKFDDSSQEEYF-EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           EE+I+EK     ++    EAED + + +  E+EE EE    +EED++  +++ +D+
Sbjct: 146 EEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE----EEEDEDFDDDDDDDD 197



 Score = 29.3 bits (66), Expect = 7.4
 Identities = 11/54 (20%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
            L     + K  ++   +  + +D+E + + +E++E  +   +D++D +D N E
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDD--DDDDDDDDYNAE 203


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 37.7 bits (88), Expect = 0.009
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 2/66 (3%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDS-QEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           E   E++ D+SS EE  +  D E   +S  E+E++   R E+E D E    E + E E  
Sbjct: 15  ELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA-EAEFDREFEKM 73

Query: 515 MEEECQ 520
           M E  +
Sbjct: 74  MAESLE 79



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 4/59 (6%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-DQEDGNEEI---EDERETKMEEE 518
            D  +E+    E+ E    S E+E       +DEE D E+    +   E+E + + E E
Sbjct: 7   SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAE 65



 Score = 33.5 bits (77), Expect = 0.27
 Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 464 DDSSQEEYFEAEDQ-EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
            +S  ++  E E+  E   D +  +E E    +DE+D+E  +EE +     + EE
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57



 Score = 29.7 bits (67), Expect = 4.5
 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 465 DSSQEEYFEAEDQEGQGDSQEDEE---MEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
             S+ E  + E+ E   +  ED+E    EE  + D+E  E+ + E E    T+ EEE
Sbjct: 1   SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 38.6 bits (90), Expect = 0.010
 Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 10/83 (12%)

Query: 443 NEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE---MEE-QRIEDEE 498
            E  IK +       N +   DDS      ++ D +   D  EDE    + E ++I+ E 
Sbjct: 101 EEDEIKRKRIEEDARNSDA--DDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKER 158

Query: 499 DQEDGNEEIEDERETKMEEECQK 521
            +    E+  +E E   EEE  +
Sbjct: 159 AE----EKEREEEEKAAEEEKAR 177


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 38.1 bits (89), Expect = 0.013
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 2/74 (2%)

Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI--EDEEDQEDGNEEI 507
           R+  L EE +EE     +   + E ED E     +E+EE E      + E+D+ + ++E 
Sbjct: 13  RMKKLLEEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEE 72

Query: 508 EDERETKMEEECQK 521
           E E+E + EE  +K
Sbjct: 73  EGEKELQREERLKK 86


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 38.2 bits (88), Expect = 0.024
 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED-EEDQEDGNEEIEDERETK 514
            EENIEE  ++  +E   E E+   + D +  EE+EE   E+ EE+ E+  EE ++E   +
Sbjct: 1012 EENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE 1071

Query: 515  MEEECQK 521
            +EE  ++
Sbjct: 1072 IEENIEE 1078



 Score = 37.1 bits (85), Expect = 0.062
 Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 6/100 (6%)

Query: 432  YITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEE 491
            Y  EN   +  N            EEN EE  +++  EE  E  D+E   + +E EE  E
Sbjct: 1022 YDEENVEEVEENVEEYDEENVEEIEENAEENVEENI-EENIEEYDEE---NVEEIEENIE 1077

Query: 492  QRIED--EEDQEDGNEEIEDERETKMEEECQKFGHYNKDD 529
            + IE+  EE+ E+  EEIE+  E  +EE  ++    N ++
Sbjct: 1078 ENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEE 1117



 Score = 35.5 bits (81), Expect = 0.18
 Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            EENIEE  +++ +E   E  ++  + + +E EE  E+   DEE+ E+  E  E+  E  +
Sbjct: 1000 EENIEENVEENVEENIEENVEEYDEENVEEVEENVEEY--DEENVEEIEENAEENVEENI 1057

Query: 516  EEECQKFGHYNKDD 529
            EE  +++   N ++
Sbjct: 1058 EENIEEYDEENVEE 1071



 Score = 34.4 bits (78), Expect = 0.40
 Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 5/78 (6%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQ-EGQGDSQEDEEMEEQRIEDEEDQEDGNEE-IEDERET 513
            EENIEE  ++  +E   E E+  E   +   +E +EE   E EE+ E+  EE  E+  E 
Sbjct: 1054 EENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEE 1113

Query: 514  KMEEECQKFGHYNKDDPH 531
              EE  ++   Y+ ++P 
Sbjct: 1114 NAEENAEE---YDDENPE 1128



 Score = 33.6 bits (76), Expect = 0.59
 Identities = 30/88 (34%), Positives = 50/88 (56%), Gaps = 5/88 (5%)

Query: 437  DNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQRIE 495
            +NV    E +I+  +    EEN+EE   + + EEY E   +E + +++E+ EE  E+ IE
Sbjct: 1005 ENVEENVEENIEENVEEYDEENVEEV--EENVEEYDEENVEEIEENAEENVEENIEENIE 1062

Query: 496  --DEEDQEDGNEEIEDERETKMEEECQK 521
              DEE+ E+  E IE+  E  +EE  ++
Sbjct: 1063 EYDEENVEEIEENIEENIEENVEENVEE 1090



 Score = 33.2 bits (75), Expect = 0.92
 Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 6/69 (8%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE------MEEQRIEDEEDQEDGNEEIED 509
           +E++EE   D S E+  + E  E Q  ++  EE      +EE  + D  +QE  +E +E+
Sbjct: 455 DEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVEQETVDEHVEE 514

Query: 510 ERETKMEEE 518
               + EE+
Sbjct: 515 PAVDENEEQ 523



 Score = 33.2 bits (75), Expect = 0.93
 Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 465 DSSQEEYFEAEDQEGQGDSQE-----DEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           D S++E  E  ++E  GD+ E     DE +EE   +D  +Q+  +E  E +   ++ EE
Sbjct: 429 DGSEDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEE 487



 Score = 32.8 bits (74), Expect = 1.2
 Identities = 18/62 (29%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            EENIEE  +++ +E   E E+   +   +  EE  E+    EE+ E+ ++E  +E   + 
Sbjct: 1077 EENIEENVEENVEENVEEIEENVEENVEENAEENAEENA--EENAEEYDDENPEEHNEEY 1134

Query: 516  EE 517
            +E
Sbjct: 1135 DE 1136



 Score = 32.8 bits (74), Expect = 1.2
 Identities = 21/77 (27%), Positives = 46/77 (59%), Gaps = 3/77 (3%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQRIED--EEDQEDGNEEIEDERE 512
            EEN+EE  +++ +E   E  ++  + + +E+ EE  E+ +E+  EE+ E+  EE ++E  
Sbjct: 968  EENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENV 1027

Query: 513  TKMEEECQKFGHYNKDD 529
             ++EE  +++   N ++
Sbjct: 1028 EEVEENVEEYDEENVEE 1044



 Score = 32.5 bits (73), Expect = 1.2
 Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 4/99 (4%)

Query: 435  ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQR 493
            +N+NV    + H +  I    EEN+EE  +++ +E   E  ++  + + +E+ EE  E+ 
Sbjct: 931  QNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEEN 990

Query: 494  IED--EEDQEDGNEE-IEDERETKMEEECQKFGHYNKDD 529
            +E+  EE+ E+  EE +E+  E  +EE  +++   N ++
Sbjct: 991  VEENVEENVEENIEENVEENVEENIEENVEEYDEENVEE 1029



 Score = 31.7 bits (71), Expect = 2.4
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 10/76 (13%)

Query: 456  EENIEEKFDDSSQEEYFE----------AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
            EEN+EE  +++ +E   E           E+ E   +  ++E +EE     EE  E+  E
Sbjct: 984  EENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVE 1043

Query: 506  EIEDERETKMEEECQK 521
            EIE+  E  +EE  ++
Sbjct: 1044 EIEENAEENVEENIEE 1059



 Score = 31.3 bits (70), Expect = 3.5
 Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 457  ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED--EEDQEDGNEEIEDERETK 514
            E IEE  +++ +E   E  ++  +   +  EE  E+  E+  EE+ E+  EE +DE   +
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEE 1129

Query: 515  MEEE 518
              EE
Sbjct: 1130 HNEE 1133


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 36.0 bits (83), Expect = 0.025
 Identities = 12/41 (29%), Positives = 26/41 (63%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           + ++ +     +EDEE++E   E EE+++  + ++E+ERE 
Sbjct: 23  KEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 37.6 bits (87), Expect = 0.032
 Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 2/70 (2%)

Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
           +  DD  +EE + ++      D +E E+      E     E   +E  +E E +  EE  
Sbjct: 275 DDGDDEGREEDYISDSSASGNDPEEREDKLSP--EIPAKPEIEQDEDSEESEEEKNEEEG 332

Query: 521 KFGHYNKDDP 530
                 K   
Sbjct: 333 GLSKKGKKLK 342



 Score = 37.6 bits (87), Expect = 0.035
 Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 1/62 (1%)

Query: 456 EENIEEKF-DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +E  EE +  DSS       E ++        +   EQ  + EE +E+ NEE     +  
Sbjct: 279 DEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKG 338

Query: 515 ME 516
            +
Sbjct: 339 KK 340



 Score = 31.5 bits (71), Expect = 2.3
 Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 2/61 (3%)

Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
           N ++  DD   +   + +  E   D  +DE  EE  I D      GN+  E E +   E 
Sbjct: 251 NKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSA--SGNDPEEREDKLSPEI 308

Query: 518 E 518
            
Sbjct: 309 P 309


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 37.6 bits (87), Expect = 0.036
 Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +    EE+ ++  +EE  E E +E +G+ +E+EE  E     EE+ E  +E   D  E +
Sbjct: 443 ESVEEEEEEEEEEEEE--EQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPE 500

Query: 515 MEEE 518
            + E
Sbjct: 501 EDAE 504



 Score = 37.6 bits (87), Expect = 0.039
 Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 7/68 (10%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED-------QEDGNEEIE 508
           EE++EE+ ++  +EE  E E +E +G+ +E+EE  E     EE+         DG E  E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEE 501

Query: 509 DERETKME 516
           D      E
Sbjct: 502 DAERRNSE 509



 Score = 37.2 bits (86), Expect = 0.052
 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
           +SQE   E E+   + + +E+EE EE++  +EE+ ED  EE E E +   EEE +
Sbjct: 435 ASQES--EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 34.1 bits (78), Expect = 0.43
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           +E  ++ S EE  E E++E + + + +EE  E   E+EE + D   E E E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 33.3 bits (76), Expect = 0.67
 Identities = 21/75 (28%), Positives = 39/75 (52%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
           E+      +S +EE  E E++E + + +E++E EE+  EDEE++E+   +   E E +  
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489

Query: 517 EECQKFGHYNKDDPH 531
            E    G   ++D  
Sbjct: 490 SEGDGDGEEPEEDAE 504



 Score = 31.4 bits (71), Expect = 2.7
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           SS      +   E    + ++ E EE   E+EE++E+  EE ++  E + E+E
Sbjct: 418 SSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470



 Score = 31.4 bits (71), Expect = 2.8
 Identities = 16/66 (24%), Positives = 27/66 (40%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           ++E  EE+ +  +  E       EG GD +E EE  E+R  +       +E  +    + 
Sbjct: 468 EDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSV 527

Query: 515 MEEECQ 520
             E  Q
Sbjct: 528 QPESPQ 533



 Score = 30.7 bits (69), Expect = 4.2
 Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 477 QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHS 532
           QE + +   +EE EE+  E+EE++E  +EE E E E + EE     G   + +  S
Sbjct: 437 QESEEEESVEEEEEEE--EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSS 490


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 37.4 bits (87), Expect = 0.038
 Identities = 16/63 (25%), Positives = 28/63 (44%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           EE   E  +   + E   A  +E +   ++ + +EE+  + EE  E    E+E+  E K 
Sbjct: 294 EELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353

Query: 516 EEE 518
           E  
Sbjct: 354 ELA 356



 Score = 31.3 bits (71), Expect = 3.3
 Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 5/74 (6%)

Query: 450 RICGLQE-ENIEEKFDDSSQEEYFEAEDQEGQ-GDSQEDEEMEEQRIEDEEDQEDGNEEI 507
            + GL++ E + E   +  +E   + E+ EGQ  +  ED E   + +E+E  +    EEI
Sbjct: 162 ELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEI 221

Query: 508 ED---ERETKMEEE 518
           ++   E E + E E
Sbjct: 222 QEEQEEEELEQEIE 235


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 36.9 bits (86), Expect = 0.039
 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 13/66 (19%)

Query: 447 IKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
            +P + G  EE++EE  +        +AE++E + D  E E+ +E     E++ E+  E+
Sbjct: 267 GEPEVVGGDEEDLEELLE--------KAEEEEEEDDYSESEDEDE-----EDEDEEEEED 313

Query: 507 IEDERE 512
            ++  +
Sbjct: 314 DDEGDK 319



 Score = 36.2 bits (84), Expect = 0.075
 Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           + ED E   +  E+EE EE    + ED+++ +E+ E+E +    ++
Sbjct: 275 DEEDLEELLEKAEEEE-EEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 35.4 bits (82), Expect = 0.041
 Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 4/37 (10%)

Query: 478 EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
             + +S +DEE EE    +E+D+ED +E+  +E E+ 
Sbjct: 109 SEEDESDDDEEDEE----EEDDEEDDDEDESEEEESP 141



 Score = 29.6 bits (67), Expect = 3.9
 Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
           S Q      ED+    +  E+EE +E+  +++E +E+
Sbjct: 102 SGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 37.1 bits (86), Expect = 0.043
 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 3/58 (5%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
           EE  DD  +E+  E +  + Q D+ ED    E   E+ E  ++  E+  +  + + EE
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNED---SEAGREESEGSDESEEDEAEATDGEGEE 274



 Score = 36.0 bits (83), Expect = 0.11
 Identities = 18/83 (21%), Positives = 36/83 (43%), Gaps = 2/83 (2%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +E+  +++ D++   E     ++    D  E++E E    E EE + D  E  ED    +
Sbjct: 233 EEDGDDDQPDNNEDSE--AGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290

Query: 515 MEEECQKFGHYNKDDPHSFQLPE 537
            +E+ +  G   +      +L E
Sbjct: 291 SDEDTETPGEDARPATPFTELME 313



 Score = 34.8 bits (80), Expect = 0.22
 Identities = 14/65 (21%), Positives = 28/65 (43%), Gaps = 3/65 (4%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
            + EE  ++  ++ + EE  + +D +   +   +   EE    DE  +ED  E  + E E
Sbjct: 217 DMAEETGDDGIEEDADEE--DGDDDQPDNNEDSEAGREESEGSDES-EEDEAEATDGEGE 273

Query: 513 TKMEE 517
               +
Sbjct: 274 EGEMD 278


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 37.0 bits (86), Expect = 0.046
 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 13/75 (17%)

Query: 460 EEKFDDSSQEE--Y----FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG-------NEE 506
           ++K+  + + E  Y    F A + E     ++DEE E++  E+EED+++G       +EE
Sbjct: 358 KQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEE 417

Query: 507 IEDERETKMEEECQK 521
            E++      E+   
Sbjct: 418 FEEDDVESKYEDSDG 432



 Score = 35.5 bits (82), Expect = 0.14
 Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 4/56 (7%)

Query: 453 GLQEENIEEKFDD----SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN 504
            L+ E+   + DD      +EE  E ED+    +  +DEE EE  +E + +  DGN
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGN 433



 Score = 33.6 bits (77), Expect = 0.65
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           E  ++   D+      E  + E   + +EDEE EE   E++ED+    E  +DE
Sbjct: 366 ETEQDYILDAFSALEIEDANTERDDEEEEDEEEEE---EEDEDEGPSKEHSDDE 416



 Score = 32.0 bits (73), Expect = 1.7
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
           I + F     E+     D E + D +E+EE +E     +E  +D  E  ED+ E+K E+
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD-EEFEEDDVESKYED 429


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 34.5 bits (79), Expect = 0.046
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
           +    EE  ++  +EE  E E+ EG+ +   D+E  E+ IED ED
Sbjct: 53  KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97



 Score = 31.8 bits (72), Expect = 0.40
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 458 NIEEKFDDSSQEEYFEAEDQE--GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           +       S+ ++  EAE+ E   + + +E+EE +E  IE   D E+  EEIED 
Sbjct: 41  DEPYNISQSAFKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDL 95



 Score = 30.6 bits (69), Expect = 0.86
 Identities = 14/50 (28%), Positives = 30/50 (60%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIED 509
           +  F  + + E  E  D+E + + +E++E E + + D+E+ E+  E++ED
Sbjct: 48  QSAFKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 36.9 bits (85), Expect = 0.053
 Identities = 25/75 (33%), Positives = 34/75 (45%), Gaps = 8/75 (10%)

Query: 455 QEENIEEKFDDSSQE-EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG----NEEIED 509
           Q+E      D S  E      ED   + ++ E+EE EE R E EE +E      +E+  D
Sbjct: 81  QKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKND 140

Query: 510 ERETKMEEECQKFGH 524
            R+    EECQK   
Sbjct: 141 WRD---AEECQKEEK 152



 Score = 32.7 bits (74), Expect = 0.86
 Identities = 19/86 (22%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           G + E +EE+  + S+EE    E +E +G ++ +++ + +  E+ + +E   E  E+E+ 
Sbjct: 105 GAENETVEEEEKEESREE--REEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKP 162

Query: 513 TKMEEECQKFGHYNKDDPHSFQLPEN 538
            +   E        +   H  +  EN
Sbjct: 163 KRGSLEENN----GEFMTHKLKHTEN 184


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 36.8 bits (85), Expect = 0.054
 Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 1/61 (1%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEEC 519
           EE  DD+  E+  + +D +   + QE++   E     E      +E  + E E+  EE  
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGE-GQEGSAPQESEATDRESESGEEEMV 259

Query: 520 Q 520
           Q
Sbjct: 260 Q 260



 Score = 36.1 bits (83), Expect = 0.098
 Identities = 9/60 (15%), Positives = 24/60 (40%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           + E  E+   +   +E    ++ E      E  E E  + + ++  ++ +++ E   E  
Sbjct: 222 ENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGA 281



 Score = 32.6 bits (74), Expect = 1.1
 Identities = 14/89 (15%), Positives = 31/89 (34%), Gaps = 3/89 (3%)

Query: 455 QEENIEEKFDD---SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
            E   EE  DD   +  E+  + E +    +    +E E    E E  +E+  +  +D+ 
Sbjct: 207 TESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDL 266

Query: 512 ETKMEEECQKFGHYNKDDPHSFQLPENFQ 540
             + +++ +  G   +             
Sbjct: 267 PDESDDDSETPGEGARPARPFTSTGGEPD 295


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 36.8 bits (85), Expect = 0.054
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 1/58 (1%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQE-DEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
                + Y +  +     + QE D   EE   E+ ED+ + +E + +E       E Q
Sbjct: 318 LADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQ 375


>gnl|CDD|219236 pfam06949, DUF1292, Protein of unknown function (DUF1292).  This
           family consists of several hypothetical bacterial
           proteins of around 90 residues in length. The function
           of this family is unknown.
          Length = 74

 Score = 33.3 bits (77), Expect = 0.057
 Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 9/69 (13%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQE------GQGDSQEDEEMEEQRIEDEEDQEDGNEEIED 509
           EE  E +F+     + FE E +E         +  ED E+   +IE++ED E+    IED
Sbjct: 1   EEGNEVEFE---VLDTFEVEGKEYILLLPVGEEEDEDIEVYIFKIEEDEDGEEELLPIED 57

Query: 510 ERETKMEEE 518
           + E  M EE
Sbjct: 58  DEEWDMVEE 66


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 36.9 bits (86), Expect = 0.058
 Identities = 17/83 (20%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 437 DNVIRRNEIHIKPRICGLQEENIEEKFDD-SSQEEYFEAEDQEGQGDSQEDEEMEEQRIE 495
           DNV   +++ I        + +IEE+  D    ++ +   +++   D + DEE EE    
Sbjct: 90  DNVFGSDQVVI------FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKS 143

Query: 496 DEEDQEDGNEEIEDERETKMEEE 518
           ++++ +D +++ +D    +   E
Sbjct: 144 EDDEDDDDDDDDDDIATRERSLE 166


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 34.5 bits (79), Expect = 0.059
 Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           I   F    +  Y++  D++   D+ E   ME Q+   EE +      +EDERE   EEE
Sbjct: 49  IWAIFGKGRKRSYYDRYDEDDALDNMEATFMEIQK---EERRSARMARLEDERELAREEE 105


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 35.5 bits (82), Expect = 0.065
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQG---DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
           E   D+   +E  E E+ +      D  +DE+ ++  I +   ++D  +E ED+ E + +
Sbjct: 112 EGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEED 171

Query: 517 EE 518
           EE
Sbjct: 172 EE 173



 Score = 30.9 bits (70), Expect = 1.8
 Identities = 15/70 (21%), Positives = 36/70 (51%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEEC 519
           ++K  D+ +E   +  D +   + +ED+E++    E++++ ED +E +E   E    +E 
Sbjct: 103 KDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDED 162

Query: 520 QKFGHYNKDD 529
           +      +D+
Sbjct: 163 EDDDEDEEDE 172



 Score = 30.1 bits (68), Expect = 3.2
 Identities = 14/54 (25%), Positives = 30/54 (55%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           E+ EE+ DD       E +D++   D   +  +E+  ++++ED ++  E+ ED+
Sbjct: 122 EDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175



 Score = 30.1 bits (68), Expect = 3.2
 Identities = 16/67 (23%), Positives = 37/67 (55%), Gaps = 3/67 (4%)

Query: 454 LQEENI--EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           + EE     EK D  ++++  +A ++  + +   DE+ EE+  ++ +  +D N++ EDE 
Sbjct: 87  IDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDD-EDED 145

Query: 512 ETKMEEE 518
           + ++ E 
Sbjct: 146 DDEIVEI 152



 Score = 29.0 bits (65), Expect = 8.8
 Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 5/61 (8%)

Query: 456 EENIEEKFD---DSSQEEYFEAED-QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           EE  E++ D   D  +EE  E +   +   D  ED++ E   I  E+D+ D +E+ +++ 
Sbjct: 111 EEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD-EIVEILIEDDEVDEDEDDDEDE 169

Query: 512 E 512
           E
Sbjct: 170 E 170


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 35.8 bits (83), Expect = 0.073
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 5/68 (7%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEM----EEQRIEDEEDQEDGNEEIEDERE 512
           EN+ ++   S     F+ E ++G  ++ + EE      E    D E  E+  EEI +E E
Sbjct: 115 ENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEE 174

Query: 513 TKME-EEC 519
              E E+C
Sbjct: 175 EVKEPEDC 182


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 36.0 bits (83), Expect = 0.077
 Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 9/62 (14%)

Query: 456 EENIEEKFDDSSQEEY----FEAEDQEGQ-----GDSQEDEEMEEQRIEDEEDQEDGNEE 506
           ++ IE    ++S + Y       ED EG+      +  E+EE+EE   ++EE+ E+ +EE
Sbjct: 42  DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101

Query: 507 IE 508
            E
Sbjct: 102 FE 103



 Score = 33.7 bits (77), Expect = 0.46
 Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 4/62 (6%)

Query: 464 DDS-SQEEYFEAEDQEGQGD-SQEDEEMEEQ-RIEDEEDQEDGNEEIEDERETKMEEECQ 520
           DD+    E   ++D  G  D   ED+E   + RI ++ ++E+  EE E++ E + EEE +
Sbjct: 42  DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEE-VEEGEEDEEEEGEEESE 100

Query: 521 KF 522
           +F
Sbjct: 101 EF 102


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 35.8 bits (83), Expect = 0.099
 Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEG-----QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           E   E+K +  + E     E  EG       D +E+ + E      E + E  N+ +  +
Sbjct: 217 ERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPK 276

Query: 511 RETK 514
           R+TK
Sbjct: 277 RKTK 280


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 36.2 bits (83), Expect = 0.11
 Identities = 22/121 (18%), Positives = 48/121 (39%), Gaps = 21/121 (17%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM-----------EEECQK 521
            AED+E +     DEE     I+D++ +E+ NEE+  + E+ +           EEE ++
Sbjct: 392 WAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQ 451

Query: 522 FGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQF----------LQVFNNSPDETSYYRTL 571
              +   +    + P+  ++ P      R  ++           +     P     +++L
Sbjct: 452 LREFRDMEKEDREFPDEAELQPSESAIERYKEYRGLRNLYTCSWRSDEKDPSFPEEWKSL 511

Query: 572 L 572
           +
Sbjct: 512 V 512


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 34.5 bits (80), Expect = 0.12
 Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 30/123 (24%)

Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER-----E 512
            +  K     +EE    E    +      E+     +++E D+E+ +  +E+ R     E
Sbjct: 12  ILPPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAE 71

Query: 513 TKMEEECQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLL 572
            K   E  KFG                      +Y + +  +++    +  +T     ++
Sbjct: 72  MKALAEKSKFGE---------------------VYEISKPDYVREVTEASKDT----WVV 106

Query: 573 IHL 575
           +HL
Sbjct: 107 VHL 109


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 33.8 bits (77), Expect = 0.14
 Identities = 11/66 (16%), Positives = 30/66 (45%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
                 E  +++   +  E  D+  + D+ ++ + EE+  E  ++  D  +   +E + +
Sbjct: 65  AATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEE 124

Query: 515 MEEECQ 520
            +EE +
Sbjct: 125 EDEEAE 130



 Score = 33.0 bits (75), Expect = 0.25
 Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 1/64 (1%)

Query: 456 EENIEEKFDDSSQEEYFEA-EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
                   + + + E  +A ED +   ++   +E +E+   DE   E  +E   +  E  
Sbjct: 63  AAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEAD 122

Query: 515 MEEE 518
            EE+
Sbjct: 123 EEED 126



 Score = 31.8 bits (72), Expect = 0.60
 Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 11/67 (16%)

Query: 463 FDDSSQEE-YF---EAEDQE-------GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           F DS  E+ Y    EA D E        +G +       E   E E    D + +   E 
Sbjct: 32  FVDSKCEKNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEA 91

Query: 512 ETKMEEE 518
           +   E +
Sbjct: 92  DAADEAD 98


>gnl|CDD|201354 pfam00626, Gelsolin, Gelsolin repeat. 
          Length = 76

 Score = 32.3 bits (74), Expect = 0.15
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 626 PVPVLLDTSSIKPDCILLMDTFFQLLIFHGEEP 658
           P PV L   S+      L+DT F + ++ G+  
Sbjct: 5   PPPVPLSQESLNSGDCYLLDTGFTIFLWVGKGS 37



 Score = 31.1 bits (71), Expect = 0.35
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 913 PVPVLLDTSSIKPDCILLMDTFFQLLIFHGEVSN 946
           P PV L   S+      L+DT F + ++ G+ S+
Sbjct: 5   PPPVPLSQESLNSGDCYLLDTGFTIFLWVGKGSS 38


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 35.5 bits (82), Expect = 0.15
 Identities = 22/70 (31%), Positives = 33/70 (47%), Gaps = 3/70 (4%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE---R 511
           +EE +  + ++   EE  EA  +    DS +DE   E   EDEED E+     + E   R
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 512 ETKMEEECQK 521
           E    +E +K
Sbjct: 61  EKARLKELKK 70


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 35.8 bits (82), Expect = 0.16
 Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 454 LQEENI--EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQR-IEDEEDQEDGNEEIEDE 510
           L EE++   + F     +E  + +D + + D +E+EE EE +  +DE+++++G E+   E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199

Query: 511 RETKMEEECQKF 522
           +    + +C KF
Sbjct: 200 KSEVDKTDCFKF 211



 Score = 33.5 bits (76), Expect = 0.67
 Identities = 14/71 (19%), Positives = 33/71 (46%), Gaps = 8/71 (11%)

Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK--------FGHYN 526
           ED   + +   D++ E++  +D+++++D  EE E+E     ++E ++        +    
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202

Query: 527 KDDPHSFQLPE 537
            D    F+  E
Sbjct: 203 VDKTDCFKFIE 213



 Score = 30.8 bits (69), Expect = 4.4
 Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 17/107 (15%)

Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQE-----------DEEM 489
           +   + IK R C   +E    +    +   ++    +   GD               EE 
Sbjct: 85  KDTSVKIKQRTCPFHKEGSSFEMGSKNITCFYPIHGKAEIGDLDMIIIKRRRARHLAEED 144

Query: 490 EEQR----IEDEEDQE--DGNEEIEDERETKMEEECQKFGHYNKDDP 530
              R    I+D+++ E  D ++E +DE E + EEE + F   +++D 
Sbjct: 145 MSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 35.1 bits (81), Expect = 0.17
 Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 11/87 (12%)

Query: 435 ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI 494
           EN     + E+ +  R+  L+E     +  D       EA  +  +      +E EE+  
Sbjct: 138 ENRTADSKREMEVLERLEELKELQ--SRRADVDVNSMLEALFRREK------KEEEEEEE 189

Query: 495 EDEEDQED---GNEEIEDERETKMEEE 518
           EDE   +    G E  ED R    E+ 
Sbjct: 190 EDEALIKSLSFGPETEEDRRRADDEDS 216


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 35.2 bits (81), Expect = 0.17
 Identities = 26/89 (29%), Positives = 35/89 (39%), Gaps = 9/89 (10%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQED---------EEMEEQRIEDEEDQEDGNEEIEDERETK 514
           DD  +EE   +ED+      +E+          E E QR E E D +   EE    +E  
Sbjct: 471 DDEEEEEPLLSEDRVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQEEEAAAAKEIT 530

Query: 515 MEEECQKFGHYNKDDPHSFQLPENFQMYP 543
            EE        +K    S  L E+ Q YP
Sbjct: 531 EEENVLAKAVESKTSITSSDLSESQQFYP 559


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 35.4 bits (82), Expect = 0.17
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query: 462 KFDDSSQ-EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
           K  ++S+ EE  E  D E +  S+     +EQ    EED    +EE ED  +T+  ++ +
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284

Query: 521 KFGHYNKDDPHSFQLP 536
                  ++   F+ P
Sbjct: 285 ILKEKKDEELFWFEKP 300



 Score = 30.0 bits (68), Expect = 7.1
 Identities = 14/61 (22%), Positives = 26/61 (42%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           EE  EE   +       +   QE +G ++ED  +  +  ED +  ED ++    + +   
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292

Query: 516 E 516
           E
Sbjct: 293 E 293


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 34.7 bits (79), Expect = 0.18
 Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 9/53 (16%)

Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQE-DGNEEIEDERE 512
           +   DSS+E         G GDS E+E  EE   E+E +++ DGNE+ E E E
Sbjct: 45  QSGSDSSEEN--------GDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAE 89


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 35.4 bits (81), Expect = 0.19
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 13/91 (14%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQED-----EEMEEQRIEDEEDQEDGNEEIEDE 510
            EE++ E  +  S E+   A + E    S+ED     E+ + Q  EDEE+  D +  I+DE
Sbjct: 3913 EEDLLET-EQKSNEQ--SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD-DVGIDDE 3968

Query: 511  RETKMEEECQKFGHYNKDDPHSFQLPENFQM 541
             +  ++E   +       +     LPE+ ++
Sbjct: 3969 IQPDIQENNSQ----PPPENEDLDLPEDLKL 3995



 Score = 33.0 bits (75), Expect = 0.88
 Identities = 17/75 (22%), Positives = 40/75 (53%), Gaps = 11/75 (14%)

Query: 455  QEENIEEKFDDSSQEEYFE------AEDQEGQGDSQEDEEMEEQRIEDEE---DQEDGNE 505
            ++  ++EK  D S++   E      A++ + + D+++DE M+++   +E    D++   +
Sbjct: 3991 EDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050

Query: 506  EIED--ERETKMEEE 518
            +  D  E + KM E+
Sbjct: 4051 DFSDLAEDDEKMNED 4065



 Score = 32.7 bits (74), Expect = 1.4
 Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 454  LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE-MEEQRIEDEEDQEDGNEEIEDERE 512
            +Q+E+  E+ +   ++   +      + D + +E+  EE   E+EE  EDG  + ++E E
Sbjct: 4031 MQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDG-VKSDEELE 4089

Query: 513  TKMEEECQKFGHYNKDDPHS 532
                 E Q   ++ K D  S
Sbjct: 4090 QGEVPEDQAIDNHPKMDAKS 4109



 Score = 32.3 bits (73), Expect = 1.8
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 9/82 (10%)

Query: 456  EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDE--EMEEQ---RIEDEEDQEDGNEEIEDE 510
            E+      +D   EE    ED   Q D  E E  E+E       +D    E+ N + E+E
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884

Query: 511  RETKMEEECQKFGHYNKDDPHS 532
             +  ++EE        +D  +S
Sbjct: 3885 NQ-DLDEEVNDI---PEDLSNS 3902



 Score = 32.3 bits (73), Expect = 1.9
 Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 14/81 (17%)

Query: 455  QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE--IED 509
            +EE   EK +    E+  E     D++ Q D  +  ++ E   +DE+  EDG EE   E+
Sbjct: 4020 KEEADAEKDEPMQDEDPLEENNTLDEDIQQD--DFSDLAE---DDEKMNEDGFEENVQEN 4074

Query: 510  ERET----KMEEECQKFGHYN 526
            E  T    K +EE ++     
Sbjct: 4075 EESTEDGVKSDEELEQGEVPE 4095



 Score = 31.5 bits (71), Expect = 2.7
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 31/98 (31%)

Query: 455  QEENIEEKFDDSSQEEYFEAEDQEG----------------------QGDSQEDEEMEEQ 492
            ++E+ EE  DD   ++  + + QE                       +GD  +D ++E+ 
Sbjct: 3952 EKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDM 4011

Query: 493  RIE---------DEEDQEDGNEEIEDERETKMEEECQK 521
             +E         D E  E   +E   E    ++E+ Q+
Sbjct: 4012 DMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQ 4049



 Score = 31.5 bits (71), Expect = 2.8
 Identities = 14/86 (16%), Positives = 33/86 (38%), Gaps = 11/86 (12%)

Query: 441  RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEA---EDQEGQGDSQEDEEM---EEQRI 494
            + N IH +       +        D S+  + +    ED +  G++++D+      ++  
Sbjct: 4181 QANRIH-EWEDLTESQSQAF----DDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESA 4235

Query: 495  EDEEDQEDGNEEIEDERETKMEEECQ 520
                D  +     EDE +   +++ Q
Sbjct: 4236 NQNPDSMNSTNIAEDEADEVGDKQLQ 4261



 Score = 30.4 bits (68), Expect = 6.1
 Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 2/56 (3%)

Query: 465  DSSQEEYFEAEDQEGQG--DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
            D  QE+    E + G G       E++     ED++ +E  NEE    +    E E
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESE 3856


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 34.8 bits (80), Expect = 0.19
 Identities = 9/60 (15%), Positives = 32/60 (53%), Gaps = 2/60 (3%)

Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
             +K  D +++        E     + + + +E++++ EE+++   E+ ++++E + ++E
Sbjct: 374 YNKKLQDYTKKL--GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 32.1 bits (73), Expect = 1.7
 Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 7/60 (11%)

Query: 454 LQE--ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           LQ+  + + E  D++   E  EA+ +E +   +E+E  ++Q+   E+  ED  +  +DER
Sbjct: 378 LQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE--KKQK---EQADEDKEKRQKDER 432


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 34.7 bits (80), Expect = 0.23
 Identities = 17/60 (28%), Positives = 25/60 (41%)

Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
              + D    EE  E ED+E +  S E EE E +  E+E  Q   +   E   +   + E
Sbjct: 358 RRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 11/63 (17%), Positives = 29/63 (46%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            E ++E  D+  ++   E E++EG+   +E  +  E    +       + E + ++E+  
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426

Query: 516 EEE 518
           + +
Sbjct: 427 DSD 429



 Score = 31.6 bits (72), Expect = 2.0
 Identities = 17/61 (27%), Positives = 22/61 (36%), Gaps = 6/61 (9%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQG---DSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           E+  EE+  D  +EE  E ED E +G         E       D E + D  E   D   
Sbjct: 374 EDEEEEQRSDEHEEE--EGEDSEEEGSQSREDGSSESSSDVGSDSESKAD-KESASDSDS 430

Query: 513 T 513
            
Sbjct: 431 E 431


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 35.0 bits (80), Expect = 0.26
 Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 4/71 (5%)

Query: 455  QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE-EDQEDGNEEIEDERET 513
             EE + E    S  E     ED+E     +  E++ E   E++  D+EDG +  E E + 
Sbjct: 934  SEEEVSEYEASSDDESDETDEDEES---DESSEDLSEDESENDSSDEEDGEDWDELESKA 990

Query: 514  KMEEECQKFGH 524
              +    K   
Sbjct: 991  AYDSRPGKRRK 1001


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 34.4 bits (80), Expect = 0.28
 Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 10/84 (11%)

Query: 434 TENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQR 493
            E +   RRNE+    +    +EEN++ K       E  E  ++E     ++ E+  EQ+
Sbjct: 73  FEKELRERRNELQKLEKRLLQKEENLDRK------LELLEKREEE----LEKKEKELEQK 122

Query: 494 IEDEEDQEDGNEEIEDERETKMEE 517
            ++ E +E+  EE+ +E+  ++E 
Sbjct: 123 QQELEKKEEELEELIEEQLQELER 146


>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
          Length = 903

 Score = 34.5 bits (79), Expect = 0.29
 Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 9/95 (9%)

Query: 425 PLLYSITYI-TENDNVIRRNE---IHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQ 480
           P LY IT    +  NV  +N    I+        +E N+  K  +            E +
Sbjct: 722 PPLYRITNNRMKQFNVSTKNSKKYIYTWSD----EELNVLIKLLNKDYSSKETTRSVEEK 777

Query: 481 GDSQ-EDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           G++   D E E+++++++  +E+  +E+E + E  
Sbjct: 778 GNAPDLDNEYEDEKLDNKNMRENNVDEVELKTELG 812


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 34.0 bits (78), Expect = 0.32
 Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 3/62 (4%)

Query: 457 ENIEEKFDD--SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           +  EE  D+  S+  E  E ED   + D  +D+  EE+    +   ED +   E+    +
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDP-KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265

Query: 515 ME 516
           ME
Sbjct: 266 ME 267



 Score = 32.5 bits (74), Expect = 0.81
 Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 6/62 (9%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQ------EDEEMEEQRIEDEEDQEDGNEEIE 508
            E+N +E      +++    E++ G  DS         EEME   +E  E   D   + +
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSD 280

Query: 509 DE 510
           D 
Sbjct: 281 DA 282



 Score = 31.7 bits (72), Expect = 1.4
 Identities = 6/46 (13%), Positives = 21/46 (45%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
            A+ ++ + +    E+ ++ + E+EE     +   + +  ++  E 
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263



 Score = 31.7 bits (72), Expect = 1.7
 Identities = 15/80 (18%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 447 IKPRICGLQEENIEEK--FDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED---EEDQE 501
               +  L    ++++  F    ++     +  E  GD  E  + E+   ED   E++ +
Sbjct: 178 AGKDLDRL-AACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDD 236

Query: 502 DGNEEIEDERETKMEEECQK 521
           D  EE E      + E+   
Sbjct: 237 DQGEEEESGSSDSLSEDSDA 256


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 33.9 bits (77), Expect = 0.35
 Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 5/62 (8%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            + + E   +  QEE   AE  + +  S   +E+ E   E EED E  NEE E   E   
Sbjct: 286 NKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGE---EKEEDDE--NEENERHTELLA 340

Query: 516 EE 517
           +E
Sbjct: 341 DE 342



 Score = 33.1 bits (75), Expect = 0.63
 Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 7/65 (10%)

Query: 461 EKFDDSSQEEYFEAEDQEG-------QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           +K    ++EE  E   +E          + +  E  +EQ+ E+ E+ E   EE++ +R  
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317

Query: 514 KMEEE 518
           ++ EE
Sbjct: 318 EIGEE 322



 Score = 30.1 bits (67), Expect = 7.1
 Identities = 14/64 (21%), Positives = 29/64 (45%), Gaps = 2/64 (3%)

Query: 460 EEKFDDSSQEEY--FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
           EE  +  S+E+     AE +  +    + E+ +E+    E  +E+   +  DE   + EE
Sbjct: 266 EEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEE 325

Query: 518 ECQK 521
           + + 
Sbjct: 326 DDEN 329


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 33.2 bits (76), Expect = 0.40
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDGNEEIEDERET 513
           QE    E+  +  +E     + ++     +EDEE E E   EDEE  ++  +E E++ E+
Sbjct: 57  QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116

Query: 514 KMEEE 518
            +E+E
Sbjct: 117 NVEKE 121



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
            +   +E+    S ++   AE +E + + +E    E++  + + ++ED   E E+E E +
Sbjct: 42  DQAAADEQEAKKSDDQE-TAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100

Query: 515 MEEE 518
              +
Sbjct: 101 ESSD 104



 Score = 31.2 bits (71), Expect = 2.0
 Identities = 15/64 (23%), Positives = 29/64 (45%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           QE    +  + +  EE  E E +    + +ED+   E+  E+ E++ +  +E   +   K
Sbjct: 49  QEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108

Query: 515 MEEE 518
             EE
Sbjct: 109 ETEE 112


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 31.3 bits (71), Expect = 0.43
 Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 9/77 (11%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDS---QEDEEMEEQRIEDEEDQED------GNEE 506
           EE  EE   +   +E    ++ E + D    + D   E++  ED E+  +      G+E 
Sbjct: 14  EEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYLRKRYGDEA 73

Query: 507 IEDERETKMEEECQKFG 523
             D  ++  +E  Q+  
Sbjct: 74  DADADDSDSDEVPQRLL 90



 Score = 31.3 bits (71), Expect = 0.47
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 9/63 (14%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-----DQEDGNEEIEDERETKMEEE 518
           DD  +EE  E E+ + +  S EDE ++E   ED+      D+    EE ED  E  + E 
Sbjct: 9   DDEEEEE--EEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEE--LAEY 64

Query: 519 CQK 521
            +K
Sbjct: 65  LRK 67


>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
          Length = 487

 Score = 33.5 bits (77), Expect = 0.49
 Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 11/79 (13%)

Query: 444 EIHIKPRICGLQEENIE-----------EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
           E  IKPR   + ++ I+           +KF+     ++ E E+ E         EM   
Sbjct: 405 ESRIKPRYERVAKKGIKCSLAWFLNCSDDKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGS 464

Query: 493 RIEDEEDQEDGNEEIEDER 511
               +E+ +D ++E+   R
Sbjct: 465 ESVSDEEDDDDDDEVLYRR 483


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 33.9 bits (77), Expect = 0.51
 Identities = 12/64 (18%), Positives = 30/64 (46%), Gaps = 2/64 (3%)

Query: 468 QEEYFEAEDQEGQGDSQEDEEMEEQRIEDE--EDQEDGNEEIEDERETKMEEECQKFGHY 525
           +      E+ +   +       E+ R E    E+ +D +EE++ ++E ++E +  K+   
Sbjct: 459 RASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSET 518

Query: 526 NKDD 529
           ++ D
Sbjct: 519 SEAD 522



 Score = 31.6 bits (71), Expect = 2.3
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 17/79 (21%)

Query: 461 EKFDDSSQEE---YFEAEDQE---GQGDSQEDEEMEEQR-----------IEDEEDQEDG 503
           E+    S+E+    FE    E   G+ D  +DEE++ Q+            E  E  +D 
Sbjct: 466 EETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDV 525

Query: 504 NEEIEDERETKMEEECQKF 522
           N+    +R+   EEE +K 
Sbjct: 526 NKSKNKKRKVDEEEEEKKL 544


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 33.3 bits (76), Expect = 0.54
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 11/66 (16%)

Query: 464 DDSSQEEYFEAEDQEG---QGDSQEDEEMEEQRIEDEED-QEDGNE-------EIEDERE 512
            + S EEY+E E  +    Q D  +  +++ ++ +DEED  +DG+        E E+E E
Sbjct: 214 KNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273

Query: 513 TKMEEE 518
           T+ EEE
Sbjct: 274 TEEEEE 279



 Score = 29.4 bits (66), Expect = 8.1
 Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 4/57 (7%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
              +       + +  + E+     D Q+ + +EE+ ++++E++E+  EE E+E E 
Sbjct: 232 SQDDGPKGSDVKNKKSDDEED----DDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284



 Score = 29.4 bits (66), Expect = 8.3
 Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 4/35 (11%)

Query: 475 EDQEGQGDSQEDEEM----EEQRIEDEEDQEDGNE 505
           ED +  GD  E++E+    EE+  E+EE++ED +E
Sbjct: 251 EDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 33.6 bits (77), Expect = 0.54
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query: 474 AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK 521
            E ++ +   Q +E  ++Q  E E  ++   E +  + + K  EE  K
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125



 Score = 33.2 bits (76), Expect = 0.63
 Identities = 12/52 (23%), Positives = 25/52 (48%), Gaps = 5/52 (9%)

Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE-----EIEDERETKMEEECQK 521
           + Q+    ++E  + +EQ+  +E  Q+   E     ++E ER    E++ Q 
Sbjct: 69  QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 31.6 bits (72), Expect = 0.54
 Identities = 21/83 (25%), Positives = 37/83 (44%), Gaps = 11/83 (13%)

Query: 433 ITENDNV--------IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQ 484
           I E +N+        +  +E  +K  +  L+  NIEE    ++      A        + 
Sbjct: 17  INE-ENLKAVLEAAGVEVDEARVKALVAALEGVNIEEAIKKAAAAPVAAAAAAAAAAAAA 75

Query: 485 EDEEMEEQRIEDEEDQEDGNEEI 507
           E+EE EE+  E+EE++ +  EE 
Sbjct: 76  EEEEEEEEEEEEEEEESE--EEA 96


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 31.1 bits (71), Expect = 0.56
 Identities = 11/44 (25%), Positives = 26/44 (59%), Gaps = 4/44 (9%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
           +E + + F D    +Y+E ED+E +    +DE++ +   +++E+
Sbjct: 60  DEEMAKLFGD----DYYEEEDEEEKPTWDDDEDIGDLVPDEDEE 99



 Score = 29.5 bits (67), Expect = 2.3
 Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 11/75 (14%)

Query: 441 RRNEI-----HIKPRICGLQEENI-EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI 494
           +R EI      IK ++ GL+  ++ EE   D      F+ E  + +      ++  E+  
Sbjct: 22  KREEIEEKLEKIK-KVAGLRGADLSEEDLADGD----FDPEKWDEEMAKLFGDDYYEEED 76

Query: 495 EDEEDQEDGNEEIED 509
           E+E+   D +E+I D
Sbjct: 77  EEEKPTWDDDEDIGD 91


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 33.1 bits (75), Expect = 0.58
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 10/64 (15%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE--DQEDGNEEIEDE 510
            L++  I+    D         E  E +    +DE+ +EQRI D E   ++ G +E E +
Sbjct: 14  ELKQRQIQINLVD-------WFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKE-ESD 65

Query: 511 RETK 514
           +ET+
Sbjct: 66  KETE 69


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 33.7 bits (77), Expect = 0.59
 Identities = 13/60 (21%), Positives = 26/60 (43%)

Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
             +E      Q    E ED++   ++ +DE+  E   + E + E   E+ E++ +    E
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692



 Score = 33.3 bits (76), Expect = 0.70
 Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 5/61 (8%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           EE  E+  +++  +E      ++ + +S+ D E  E   EDE++ +   E  E       
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGEDGE---EDEQEDDA--EANEGVVPIDK 702

Query: 516 E 516
            
Sbjct: 703 A 703



 Score = 32.2 bits (73), Expect = 1.7
 Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 1/75 (1%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDGNEEIEDERETK 514
           E+++EE  DD  + E  E  + E + D ++ EE E E   E  E     ++ +       
Sbjct: 652 EDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKV 711

Query: 515 MEEECQKFGHYNKDD 529
           +       G  ++D+
Sbjct: 712 LNLPDALDGGDSEDE 726


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 30.5 bits (69), Expect = 0.65
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
           E ED +   D  +D++ +++  +D++D ED
Sbjct: 49  EPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 31.4 bits (71), Expect = 0.65
 Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 439 VIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE 498
           +I+  E  I   I  ++ E  E +  ++  +   +A+  E +  +  +  + E R E E+
Sbjct: 14  IIKEKEESIDKEIANIKNEQ-ENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEK 72


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 31.5 bits (72), Expect = 0.68
 Identities = 11/52 (21%), Positives = 31/52 (59%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
           +E    E+  D  ++++ + +D     D   D++ +E+ +++E+D+++ +EE
Sbjct: 78  EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 9/57 (15%), Positives = 23/57 (40%)

Query: 462 KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
             D+  +E     E  + +     D + +    +   D +   E++++E +   E+E
Sbjct: 72  PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 28.8 bits (65), Expect = 6.3
 Identities = 14/59 (23%), Positives = 31/59 (52%), Gaps = 8/59 (13%)

Query: 465 DSSQEEYFEAEDQE----GQGDSQEDEEMEEQRIEDEE-DQEDGNEEIEDERETKMEEE 518
           D   EE    E++      +    +D+ +++  + D++ D+ED +EE +++ E   +EE
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE---DEE 129


>gnl|CDD|241092 cd12648, RRM3_UHM_PUF60, RNA recognition motif 3 in UHM domain of
           poly(U)-binding-splicing factor PUF60 and similar
           proteins.  This subgroup corresponds to the RRM3 of
           PUF60, also termed FUSE-binding protein-interacting
           repressor (FBP-interacting repressor or FIR), or
           Ro-binding protein 1 (RoBP1), or Siah-binding protein 1
           (Siah-BP1), an essential splicing factor that functions
           as a poly-U RNA-binding protein required to reconstitute
           splicing in depleted nuclear extracts. Its function is
           enhanced through interaction with U2 auxiliary factor
           U2AF65. PUF60 also controls human c-myc gene expression
           by binding and inhibiting the transcription factor far
           upstream sequence element (FUSE)-binding-protein (FBP),
           an activator of c-myc promoters. PUF60 contains two
           central RNA recognition motifs (RRMs), also termed RBDs
           (RNA binding domains) or RNPs (ribonucleoprotein
           domains), and a C-terminal U2AF (U2 auxiliary factor)
           homology motifs (UHM) that harbors another RRM and binds
           to tryptophan-containing linear peptide motifs (UHM
           ligand motifs, ULMs) in several nuclear proteins. The
           research indicates that PUF60 binds FUSE as a dimer, and
           only the first two RRM domains participate in the
           single-stranded DNA recognition. .
          Length = 98

 Score = 30.8 bits (70), Expect = 0.69
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 503 GNEEIEDERETKMEEECQKFGHYNK 527
           G E+++D+ E ++ EEC KFG  N+
Sbjct: 11  GPEDLDDDLEGEVTEECGKFGAVNR 35


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 33.0 bits (75), Expect = 0.72
 Identities = 14/50 (28%), Positives = 28/50 (56%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
           EE  +++ DD   ++  E +D +   +  EDE+ ++  + D+   +DGNE
Sbjct: 51  EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNE 100


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 33.1 bits (76), Expect = 0.74
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 8/70 (11%)

Query: 477 QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHSFQLP 536
            EG+  S+EDE+  E    +EED++D  ++ + E    + ++  K      D+ +S    
Sbjct: 318 IEGKSVSEEDED--EDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENS---- 371

Query: 537 ENFQMYPQFM 546
               M   FM
Sbjct: 372 GLLSMK--FM 379



 Score = 32.7 bits (75), Expect = 1.1
 Identities = 22/111 (19%), Positives = 38/111 (34%), Gaps = 12/111 (10%)

Query: 434 TENDNVIRRNEIHIKPRICGLQEENIEE-KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
            EN  ++R+    +K      +   +   KF   ++    E  D E +   +E E  EE 
Sbjct: 348 GENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEES 407

Query: 493 RIEDEEDQEDGNE----------EIEDERETKMEEECQKF-GHYNKDDPHS 532
             E+ E+    N           E E E +   +E   +F      D+   
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEE 458


>gnl|CDD|240678 cd12232, RRM3_U2AF65, RNA recognition motif 3 found in U2 large
           nuclear ribonucleoprotein auxiliary factor U2AF 65 kDa
           subunit (U2AF65) and similar proteins.  This subfamily
           corresponds to the RRM3 of U2AF65 and dU2AF50. U2AF65,
           also termed U2AF2, is the large subunit of U2 small
           nuclear ribonucleoprotein (snRNP) auxiliary factor
           (U2AF), which has been implicated in the recruitment of
           U2 snRNP to pre-mRNAs and is a highly conserved
           heterodimer composed of large and small subunits. U2AF65
           specifically recognizes the intron polypyrimidine tract
           upstream of the 3' splice site and promotes binding of
           U2 snRNP to the pre-mRNA branchpoint. U2AF65 also plays
           an important role in the nuclear export of mRNA. It
           facilitates the formation of a messenger
           ribonucleoprotein export complex, containing both the
           NXF1 receptor and the RNA substrate. Moreover, U2AF65
           interacts directly and specifically with expanded CAG
           RNA, and serves as an adaptor to link expanded CAG RNA
           to NXF1 for RNA export. U2AF65 contains an N-terminal RS
           domain rich in arginine and serine, followed by a
           proline-rich segment and three C-terminal RNA
           recognition motifs (RRMs), also termed RBDs (RNA binding
           domains) or RNPs (ribonucleoprotein domains). The
           N-terminal RS domain stabilizes the interaction of U2
           snRNP with the branch point (BP) by contacting the
           branch region, and further promotes base pair
           interactions between U2 snRNA and the BP. The
           proline-rich segment mediates protein-protein
           interactions with the RRM domain of the small U2AF
           subunit (U2AF35 or U2AF1). The RRM1 and RRM2 are
           sufficient for specific RNA binding, while RRM3 is
           responsible for protein-protein interactions. The family
           also includes Splicing factor U2AF 50 kDa subunit
           (dU2AF50), the Drosophila ortholog of U2AF65. dU2AF50
           functions as an essential pre-mRNA splicing factor in
           flies. It associates with intronless mRNAs and plays a
           significant and unexpected role in the nuclear export of
           a large number of intronless mRNAs.
          Length = 89

 Score = 30.6 bits (70), Expect = 0.75
 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 5/31 (16%)

Query: 494 IEDEEDQEDGNEEIEDERETKMEEECQKFGH 524
            E+ ED E+  E +ED     ++EEC K+G 
Sbjct: 12  PEELEDDEEYEEILED-----VKEECGKYGK 37


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 31.1 bits (70), Expect = 0.79
 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQ--EGQGDSQEDEEMEEQRIEDEEDQ-EDGNEE 506
           ++ GL+     EK   +SQEE  + +DQ        +  +   EQR+ + + + ++  + 
Sbjct: 17  KLAGLENSLESEK---ASQEELAKQKDQLRSLLASLEAQKAEREQRLRELQAKIDELLKN 73

Query: 507 IEDERETKMEEECQK 521
           +E E+  KME E + 
Sbjct: 74  LELEKLAKMELESRL 88


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 32.0 bits (73), Expect = 0.89
 Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 8/73 (10%)

Query: 454 LQEENIEE--KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED-QEDGNEE---- 506
            +EE  EE  K ++  + E  E E+ E + + +++EE  ++R E     QE+  +     
Sbjct: 23  AEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSF 82

Query: 507 -IEDERETKMEEE 518
            +E+E   K+  +
Sbjct: 83  VVEEEGTDKLSAD 95



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 449 PRICGLQEENIEEKFDDSSQEEYFEAEDQE----GQGDSQEDEEMEEQRIEDEEDQEDGN 504
            +I   +   +EEK     Q E  E E +E     +    E +E EE   E E+ +E+  
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 505 EEIEDERETKMEEECQK 521
            +  +E+  K +EE +K
Sbjct: 61  RKEREEQARKEQEEYEK 77


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 33.0 bits (75), Expect = 0.89
 Identities = 25/122 (20%), Positives = 50/122 (40%), Gaps = 9/122 (7%)

Query: 433  ITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
             ++ DN++   E  I+ RI   +E  I  K++   +E   E  D++ + +  ++EE E  
Sbjct: 906  ESQKDNLLEEKENEIEERIA--EEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERN 963

Query: 493  RIEDEEDQEDGNEEIEDERETKMEEECQKFGH-------YNKDDPHSFQLPENFQMYPQF 545
            +      +E GN  +    E + +EE               K +     + E  Q + +F
Sbjct: 964  KRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEF 1023

Query: 546  MY 547
            + 
Sbjct: 1024 LE 1025



 Score = 31.1 bits (70), Expect = 3.3
 Identities = 19/124 (15%), Positives = 36/124 (29%), Gaps = 8/124 (6%)

Query: 403  AITEQHQPAELHPQFTTIEYTIPLLYSITYITENDNVIRRNEIHIKP---RICGLQEENI 459
             + E+    E       I            + E  +   + E + +    R   L     
Sbjct: 912  LLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKE 971

Query: 460  EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE-----DEEDQEDGNEEIEDERETK 514
            E    +      FE +++    D  + E +EE++ E      EE  +   E +E      
Sbjct: 972  ELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSIN 1031

Query: 515  MEEE 518
                
Sbjct: 1032 RGLN 1035


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 32.4 bits (74), Expect = 0.93
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 484 QEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
           +E +E+EE+  + EED     +E+E+  ET +E E +
Sbjct: 5   REQQELEERMEQMEEDMRRAQKELEEYEETALELEEK 41


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 32.2 bits (73), Expect = 1.0
 Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 14/103 (13%)

Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQE-----------GQGDSQEDEEM 489
           ++ E+ ++  +  + E++     +D   +E   +   E           G+ +  +DE  
Sbjct: 164 KKQELPVEETLSTIHEDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELE 223

Query: 490 EEQ--RIEDEEDQEDGNEEIEDE-RETKMEEECQKFGHYNKDD 529
            E    ++D E+  +  E+I  E RE K ++          + 
Sbjct: 224 AEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDES 266



 Score = 29.5 bits (66), Expect = 7.7
 Identities = 28/143 (19%), Positives = 49/143 (34%), Gaps = 26/143 (18%)

Query: 393 QRNAFPPQYAAITEQHQPAELHPQFTTI----EYTIPLLYSITYITENDNVIRRNEIHIK 448
           Q +A PP       + Q   +    +TI      T+     +    + +     NE+  +
Sbjct: 152 QSDAPPPVP--DMPKKQELPVEETLSTIHEDDASTLHEDDEL----DEEVTSYLNELDDE 205

Query: 449 PRICGLQEENIEEK--FDDSSQEEYFEAED------------QEGQGDSQEDEEMEEQRI 494
                   EN E     +        + E+            +E +GD     E E +  
Sbjct: 206 VTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDE 265

Query: 495 EDEEDQEDGNEEIEDERETKMEE 517
            ++E +E+  EE E+E   K EE
Sbjct: 266 SEDEYEEEAGEEQENE--DKGEE 286


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 32.7 bits (75), Expect = 1.0
 Identities = 10/62 (16%), Positives = 36/62 (58%), Gaps = 4/62 (6%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
             +++    +  ++  + +D     D  +D+++++   ++++D++D +++++DE E K E
Sbjct: 119 NYVKDIDVLNQADDDDDDDDD----DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174

Query: 517 EE 518
            +
Sbjct: 175 AK 176



 Score = 31.1 bits (71), Expect = 2.7
 Identities = 10/55 (18%), Positives = 28/55 (50%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           +++ ++  DD   ++  + +D E   D   D+E EE++   E ++   +++   +
Sbjct: 136 DDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWD 190


>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family.  The TOR signalling pathway
           activates a cell-growth program in response to
           nutrients. TIP41 (pfam04176) interacts with TAP42 and
           negatively regulates the TOR signaling pathway.
          Length = 335

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 467 SQEEYFEAEDQEGQGDSQEDEEMEEQRIE--DEEDQEDGNEEIEDERE 512
           + +EY E E + G       E       E  DE+D+ED +EE    R 
Sbjct: 279 TVDEYLEQEMKRGGVPQGGGEAAASAEEEEDDEDDEEDDDEETLKARA 326


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 31.6 bits (71), Expect = 1.1
 Identities = 18/67 (26%), Positives = 33/67 (49%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           QE   +E+ D   +EE      +E     +E    EE+R  ++E++     E E+++E +
Sbjct: 44  QERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQE 103

Query: 515 MEEECQK 521
            +E  QK
Sbjct: 104 EQERIQK 110


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 10/66 (15%), Positives = 15/66 (22%), Gaps = 8/66 (12%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           GL  EN      +S       A               E      E      N++     +
Sbjct: 117 GLSPENTSGSSPES------PASHSPPPSPPSHPGPHEPA--PPESHNPSPNQQPSSFLQ 168

Query: 513 TKMEEE 518
              E+ 
Sbjct: 169 PSHEDS 174



 Score = 30.3 bits (68), Expect = 5.9
 Identities = 9/65 (13%), Positives = 19/65 (29%), Gaps = 2/65 (3%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQED--EEMEEQRIEDEEDQEDGNEEIEDERET 513
             +     + +  E +  + +Q+     Q    +  EE      E + D     + E  T
Sbjct: 140 PPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199

Query: 514 KMEEE 518
                
Sbjct: 200 SSPPP 204



 Score = 29.6 bits (66), Expect = 9.7
 Identities = 8/65 (12%), Positives = 18/65 (27%), Gaps = 2/65 (3%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQE--GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
             +       S    +  A  +      + Q    ++    +  E+ E    E E +   
Sbjct: 132 ASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPG 191

Query: 514 KMEEE 518
             + E
Sbjct: 192 PPQSE 196


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 18/66 (27%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK-MEEECQKFGH 524
             +E   E  D+E + + +E +E EE+  + EE+ ++  E+ E +++TK ++E   ++  
Sbjct: 27  VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86

Query: 525 YNKDDP 530
            NK  P
Sbjct: 87  LNKTKP 92



 Score = 31.7 bits (72), Expect = 1.9
 Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           E + +   +EE  E E+++ + +   D+E E    ++EE++E+  ++ +  +ET  E E
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV---DEEEEKEEKKKKTKKVKETTTEWE 85


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 11/63 (17%), Positives = 29/63 (46%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
            E I ++ ++ + +   E   +  Q +  E++    +     ED+ +  +E+E+ +  + 
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445

Query: 516 EEE 518
           EE 
Sbjct: 446 EEA 448



 Score = 30.5 bits (69), Expect = 4.3
 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE------DEEDQEDGNEEIEDERETKMEEE 518
           D   E+   AE    + + +  +   E+R+E       EED          E E + E+E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436



 Score = 29.4 bits (66), Expect = 10.0
 Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 6/64 (9%)

Query: 462 KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK 521
           K D     EY    DQE +    +  E+  QR E+E  Q +  E +E E+  + EE+ + 
Sbjct: 367 KIDIKDSAEY----DQEAED--AKVAELISQREEEEALQREAEERLEAEQAERAEEDARL 420

Query: 522 FGHY 525
              Y
Sbjct: 421 RELY 424


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 6/70 (8%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG------NEEIE 508
           +E       DD  + E  +    E   +   D+E EE    ++  Q DG      +EE  
Sbjct: 161 KEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220

Query: 509 DERETKMEEE 518
           +E  +    E
Sbjct: 221 EEAPSINYNE 230


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 474 AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
            E +  +   Q+ EE E+QR  ++  Q++  +    E+  K  E+
Sbjct: 65  KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ 109


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 32.4 bits (73), Expect = 1.5
 Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 5/77 (6%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED-EEDQEDGNEEIEDERETKMEEECQKF 522
            + S+ E  E+  +E Q  S +  E+EEQ  +   E+ E+ + + E        EE  + 
Sbjct: 92  GEESELEEAESVFKEKQMLSADVSEIEEQSNDSLSENDEEPSMDDEKTSAEAAREEFAEE 151

Query: 523 GHYNKDDPHSFQLPENF 539
               K  P  + + + F
Sbjct: 152 ----KRIPDPYGINDKF 164


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.3 bits (74), Expect = 1.6
 Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 6/90 (6%)

Query: 429 SITYITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE 488
           +   + E +  +      ++  +  L+++  EE++++   EEY E   +E  G   E EE
Sbjct: 631 AFEELAETEKRLEE----LRKELEELEKKYSEEEYEELR-EEYLELS-RELAGLRAELEE 684

Query: 489 MEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           +E++R E ++  E   EE+E+  + K E E
Sbjct: 685 LEKRREEIKKTLEKLKEELEEREKAKKELE 714


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.0 bits (70), Expect = 1.7
 Identities = 23/106 (21%), Positives = 41/106 (38%), Gaps = 21/106 (19%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHS 532
           E    + +    E EE E+++   E ++ED  E+ E + E   EEE ++           
Sbjct: 48  EHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQE-KEEIAQEEEKEE----------- 95

Query: 533 FQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
                      +     R+  F ++   + +E   +   L HLPHY
Sbjct: 96  ---EAEDVKQQEVFSFKRKKPFKEM---NLEEKIDF---LAHLPHY 132


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 29.1 bits (66), Expect = 1.8
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 487 EEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           EE+EE++ + EE++E   E  + ER    +  
Sbjct: 49  EEIEEKKRKQEEEKERRKEARKAERAEARKRG 80


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 5/49 (10%)

Query: 470 EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           +    + +E + + +E+ E+EE    DEE+Q D  E +E E      E+
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEE---LDEEEQID--ELLEKELAKLKREK 135



 Score = 29.6 bits (67), Expect = 4.2
 Identities = 12/46 (26%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 473 EAEDQEGQGDSQEDEEM-EEQRIEDEEDQEDGNEEIEDERETKMEE 517
           + E +E + +  E EE+ EE++I++  ++E    + E  RE + ++
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 32.0 bits (72), Expect = 2.0
 Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 2/60 (3%)

Query: 456 EENIEEKFDDS-SQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           E NI   F +  SQE   +   ++ Q       + EE   +D E++++ +   +D + +K
Sbjct: 82  EPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDD-DDSEEEDNKSTSSKDGKGSK 140


>gnl|CDD|240820 cd12374, RRM_UHM_SPF45_PUF60, RNA recognition motif in UHM domain
           of 45 kDa-splicing factor (SPF45) and similar proteins. 
           This subfamily corresponds to the RRM found in UHM
           domain of 45 kDa-splicing factor (SPF45 or RBM17),
           poly(U)-binding-splicing factor PUF60 (FIR or Hfp or
           RoBP1 or Siah-BP1), and similar proteins. SPF45 is an
           RNA-binding protein consisting of an unstructured
           N-terminal region, followed by a G-patch motif and a
           C-terminal U2AF (U2 auxiliary factor) homology motifs
           (UHM) that harbors a RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain) and an Arg-Xaa-Phe sequence
           motif. SPF45 regulates alternative splicing of the
           apoptosis regulatory gene FAS (also known as CD95). It
           induces exon 6 skipping in FAS pre-mRNA through the UHM
           domain that binds to tryptophan-containing linear
           peptide motifs (UHM ligand motifs, ULMs) present in the
           3' splice site-recognizing factors U2AF65, SF1 and
           SF3b155. PUF60 is an essential splicing factor that
           functions as a poly-U RNA-binding protein required to
           reconstitute splicing in depleted nuclear extracts. Its
           function is enhanced through interaction with U2
           auxiliary factor U2AF65. PUF60 also controls human c-myc
           gene expression by binding and inhibiting the
           transcription factor far upstream sequence element
           (FUSE)-binding-protein (FBP), an activator of c-myc
           promoters. PUF60 contains two central RRMs and a
           C-terminal UHM domain. .
          Length = 85

 Score = 29.5 bits (67), Expect = 2.0
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 505 EEIEDERETKMEEECQKFGHYNK 527
            EI+++ + ++EEEC+K+G    
Sbjct: 13  GEIDEDLKDEIEEECEKYGKVLN 35


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 31.9 bits (72), Expect = 2.1
 Identities = 10/51 (19%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 467 SQEEYFEAEDQ----EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           S   + +AE++    + Q   + + E   +R       E+  E  E ++E 
Sbjct: 407 STYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQ 457


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.6 bits (71), Expect = 2.2
 Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 443 NEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
           NE  ++     +  +N EE+    +Q+E  E ED E   + +E E +EE+  +DE+ ++D
Sbjct: 84  NENDVELEGLNIIVKNEEER---GTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKD 140

Query: 503 GNEEIEDE 510
             +E + E
Sbjct: 141 DEKESDAE 148



 Score = 30.8 bits (69), Expect = 3.3
 Identities = 30/116 (25%), Positives = 48/116 (41%), Gaps = 19/116 (16%)

Query: 405 TEQHQPAELHPQFTTIEYT----IPLLYSITYITEND------NVIRRNEIHIKPRICGL 454
            E+   +E        EY+    +  + +     END      N+I +NE   + R  G 
Sbjct: 51  PEEQDLSEACDYIAIEEYSACGNVASIANDKSDNENDVELEGLNIIVKNE---EER--GT 105

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           Q+E  E++      EE  E E+ E   +  +D+E  E+  E E D E    E+  E
Sbjct: 106 QKEEEEDE----DVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGE 157


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 31.5 bits (71), Expect = 2.2
 Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 6/63 (9%)

Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE----ECQKFGHYNKDDP 530
            ++E + + +EDEE  E+  E  +  ++ +EE E+E    M++    E Q      KD+ 
Sbjct: 140 LEEEEEVEMEEDEEYYEK--EPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNG 197

Query: 531 HSF 533
            SF
Sbjct: 198 KSF 200


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 31.6 bits (71), Expect = 2.3
 Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 1/64 (1%)

Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           L + N  E   DS++        +  + +SQED E E Q  ED ++ +D + E   E + 
Sbjct: 283 LDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSE-ENQSQEDSQEVQDPSSESSQEADL 341

Query: 514 KMEE 517
             +E
Sbjct: 342 PSQE 345



 Score = 30.8 bits (69), Expect = 4.2
 Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 6/73 (8%)

Query: 454 LQEENIEEKFD-DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE-DEEDQEDGNEEIEDER 511
             EE+ E   D +SS +E  ++     +  SQE +  ++ R E D+ D +D +   ED  
Sbjct: 409 SSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSN 468

Query: 512 ETK----MEEECQ 520
            T+     EE+ Q
Sbjct: 469 STESASSSEEDGQ 481


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 28.7 bits (65), Expect = 2.3
 Identities = 15/69 (21%), Positives = 32/69 (46%), Gaps = 12/69 (17%)

Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQ 500
            +    I PR   L ++     +D  S  E+ E E+    G+  E E+       +E+++
Sbjct: 19  TKPSSVILPR-NPLSQDLPGLDYDYDSDAEWEEEEE----GEDLESED-------EEDEE 66

Query: 501 EDGNEEIED 509
           ED +++++ 
Sbjct: 67  EDDDDDMDG 75


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 31.7 bits (72), Expect = 2.5
 Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 3/63 (4%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDS---QEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           +  +  FD +++ E    ED E  GD    +E E  +E  ++         E I+ +   
Sbjct: 336 KKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKVCRVDRQVTEIIDGDEPV 395

Query: 514 KME 516
              
Sbjct: 396 DYC 398


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.6 bits (71), Expect = 2.5
 Identities = 20/67 (29%), Positives = 31/67 (46%), Gaps = 1/67 (1%)

Query: 455  QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
             EE  + + D    EE  +AE+ E +      +E EE + + EE ++   EE +   E K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK-KAEELKKKEAEEKKKAEELK 1722

Query: 515  MEEECQK 521
              EE  K
Sbjct: 1723 KAEEENK 1729



 Score = 30.5 bits (68), Expect = 5.2
 Identities = 22/86 (25%), Positives = 42/86 (48%), Gaps = 9/86 (10%)

Query: 440  IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAED----QEGQGDSQEDEEMEEQRIE 495
            +R+ E   K      +E  IEE      +E+  +AE+    +E +  ++E ++ EE++ +
Sbjct: 1580 LRKAEEAKK-----AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634

Query: 496  DEEDQEDGNEEIEDERETKMEEECQK 521
             E+ ++   EE +   E K  EE  K
Sbjct: 1635 VEQLKKKEAEEKKKAEELKKAEEENK 1660


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 27.7 bits (62), Expect = 2.6
 Identities = 13/25 (52%), Positives = 18/25 (72%), Gaps = 1/25 (4%)

Query: 490 EEQRIEDEEDQEDGNEEIEDERETK 514
           EE++ EDEE+ ED  EE E++ E K
Sbjct: 22  EEEKREDEEENEDE-EEGEEQSEVK 45


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 30.9 bits (70), Expect = 2.7
 Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 3/88 (3%)

Query: 433 ITENDNVIRRNEIHIKP-RICGLQEENIEEKFDDSSQEEYF--EAEDQEGQGDSQEDEEM 489
           I    +VI   E  I P R      E  +E  D+  + E    +AE +  Q   Q + + 
Sbjct: 5   IKSTKSVIPAQERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQA 64

Query: 490 EEQRIEDEEDQEDGNEEIEDERETKMEE 517
              R + E+++    EE E   +   +E
Sbjct: 65  AAIREQIEQERAQWEEERERLIQEAKQE 92


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 31.6 bits (72), Expect = 2.7
 Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 3/61 (4%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
             IE+K +  + E+  E  ++E Q   ++  +++EQ    E++ E+ N + E E E ++E
Sbjct: 815 REIEQKLNRLTLEK--EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELE 871

Query: 517 E 517
           E
Sbjct: 872 E 872


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 31.5 bits (71), Expect = 2.7
 Identities = 17/92 (18%), Positives = 33/92 (35%), Gaps = 11/92 (11%)

Query: 455 QEENIEEKFDDSSQEEYFEAED---------QEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
           Q +   E+ D   ++ Y + ED          E      ED    E+  ED+  +E    
Sbjct: 186 QSKQKRERSD-EREKSYQDDEDDTYRQNNIPYEDVV-GGEDWNPIEEGEEDQTQEEVKRS 243

Query: 506 EIEDERETKMEEECQKFGHYNKDDPHSFQLPE 537
           +    +   + +E ++ G    +D    +  E
Sbjct: 244 KERTHKGRSLPDESKRSGQLGLEDEAEEEKEE 275



 Score = 30.3 bits (68), Expect = 6.0
 Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEE-QRIEDEEDQEDGNEEIEDER 511
           G     IEE  +D +QEE   ++++  +G S  DE     Q   ++E +E+  E+ ++ R
Sbjct: 222 GEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKGQESR 281


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 31.5 bits (71), Expect = 2.8
 Identities = 26/115 (22%), Positives = 42/115 (36%), Gaps = 20/115 (17%)

Query: 406 EQHQPAELHPQFTTIEYTIPLLYSITYITENDNVIRRNEIHIKPRICGLQEENIEEKFDD 465
           E+ Q  EL P+ ++        Y+   I   DNV+   +++           N EE   +
Sbjct: 53  EEEQDYELRPRVSSSWNN--ESYNRLPIKTKDNVVA--DVN-----------NGEEFLSE 97

Query: 466 SSQEEYFE---AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
           S  E   E       E Q   +E +   E  ++ + D E   E I       +EE
Sbjct: 98  SESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSE--KERIASICTKIIEE 150


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 31.2 bits (71), Expect = 2.9
 Identities = 22/96 (22%), Positives = 38/96 (39%), Gaps = 21/96 (21%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEG-QGDSQEDE--------------------EMEEQRI 494
           EE  EE+ + + ++E  E  D EG +     D                       E++  
Sbjct: 290 EEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQKDE 349

Query: 495 EDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDP 530
           E+E++ E+  EE E+  E + EE        ++D P
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAP 385


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 31.2 bits (70), Expect = 3.2
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE----DEEDQEDGNEEIEDERETKM---E 516
           D++   +Y E    EG   S+E++     R+E    DE D+       E ERE+ +   E
Sbjct: 41  DNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESILFERE 100

Query: 517 EECQKF 522
           EE  K 
Sbjct: 101 EEISKL 106


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 30.1 bits (68), Expect = 3.2
 Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           QEE    + ++   EE  E E++E Q     +EE E+    DE    +  EE+ +E  ++
Sbjct: 77  QEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREELREEFISE 136

Query: 515 MEEECQKF 522
           M    Q+F
Sbjct: 137 MH---QRF 141


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 30.8 bits (70), Expect = 3.2
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 2/82 (2%)

Query: 437 DNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED 496
           +N + + E  I  +    + +  EEK      E    A + E Q   +    +E++  E 
Sbjct: 58  ENQVSQLESEI--QEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115

Query: 497 EEDQEDGNEEIEDERETKMEEE 518
            E+ E   +EIED +E     E
Sbjct: 116 MEEIEKLEKEIEDLKERLERLE 137


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 30.7 bits (69), Expect = 3.2
 Identities = 18/69 (26%), Positives = 25/69 (36%), Gaps = 4/69 (5%)

Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE----EDQEDGNEEIED 509
            QE     K  + +++ Y   E    Q      E+  EQ  ED     E  E    E +D
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDD 205

Query: 510 ERETKMEEE 518
             E   +EE
Sbjct: 206 VTEEDYDEE 214


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 31.1 bits (70), Expect = 3.4
 Identities = 11/53 (20%), Positives = 22/53 (41%)

Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           N E +  D   +      +++       ++   E R  +E + E+  E+ EDE
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395



 Score = 30.7 bits (69), Expect = 5.0
 Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 6/56 (10%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
             +E++FDD     YFE  + E   +   +E   E+ +ED ED+ D ++ I D+ E
Sbjct: 358 VPLEKQFDD---ILYFEKMEIE---NRNPEESEHEEEVEDYEDENDHSKRICDDDE 407


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 30.2 bits (69), Expect = 3.4
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
            A        + E E  EE   +D++D ED +E+ E+E + 
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADL 73


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 30.8 bits (69), Expect = 3.7
 Identities = 16/86 (18%), Positives = 33/86 (38%), Gaps = 17/86 (19%)

Query: 446 HIKPRICGLQEENIEEKFDDSSQEEY------FEAEDQEGQ-----------GDSQEDEE 488
           H+  +    +E+  +EK     +EE          +D E +           G  Q  E 
Sbjct: 184 HLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMET 243

Query: 489 MEEQRIEDEEDQEDGNEEIEDERETK 514
            E +  E  E + D +E+ +++ + +
Sbjct: 244 SESEEEESSESESDEDEDEDNKGKIR 269


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 29.9 bits (67), Expect = 3.9
 Identities = 16/72 (22%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 454 LQEENI---EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           L++E     E++  +   +   E    E +    E +E+ +QR    E +E    + + E
Sbjct: 42  LEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQR--KREQREKELAKRQKE 99

Query: 511 RETKMEEECQKF 522
            E K+E   +K 
Sbjct: 100 LE-KIELSKKKQ 110


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 30.4 bits (69), Expect = 4.3
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 482 DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           + +E EEMEE+  ED  D++      E+ +E + 
Sbjct: 153 EEEEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 30.6 bits (70), Expect = 4.5
 Identities = 9/52 (17%), Positives = 26/52 (50%)

Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           E +  ++ ++  F  +D   +   + +EE ++   +D ++ +D   E+  +R
Sbjct: 339 EAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVEVIYKR 390


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 29.2 bits (66), Expect = 4.7
 Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 2/28 (7%)

Query: 485 EDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           EDE  EE+ +ED+ D  D +EE E+E +
Sbjct: 1   EDEPDEEEELEDDID--DLDEEAEEEED 26



 Score = 29.2 bits (66), Expect = 5.2
 Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 5/42 (11%)

Query: 470 EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED-GNEEIEDE 510
           E    E++E + D  + +E      E+EED ED   + +E +
Sbjct: 1   EDEPDEEEELEDDIDDLDEEA----EEEEDGEDLFGDNMERD 38


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
           Willebrand factor type A (vWA) domain [General function
           prediction only].
          Length = 437

 Score = 30.4 bits (69), Expect = 4.7
 Identities = 29/184 (15%), Positives = 63/184 (34%), Gaps = 21/184 (11%)

Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
           E++++  Q+   E  +   +G+ +     +++ I    + E   EE+E+    ++ +   
Sbjct: 110 ERWEELLQDLQREGSEDFLEGEREGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSA 169

Query: 521 KFGHYNKDDPHSFQLPENFQMYPQFM-YHLRRSQFLQVFNNSPDETSYYRTLLIHLPHYS 579
                        +L      Y  F+       + L+       E    +T +       
Sbjct: 170 GELKKG-----DLELLGK---YADFLLKFPELRKILEQLGREDLEIEEGKTKIGRSFERR 221

Query: 580 -RFQFLQVFNN----SPDET-SYYRHVLMREDLSQSLIMIQPVLYSYGFNG---PPVPVL 630
                +   ++     P E  S    +L  E   +   +++  L +Y   G    PV +L
Sbjct: 222 GEVSGITQSDDLLRLLPIERASLADPILSVEFYRK---LVEKKLLTYRLQGKSEGPVILL 278

Query: 631 LDTS 634
           LD S
Sbjct: 279 LDKS 282


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 30.8 bits (70), Expect = 5.0
 Identities = 15/65 (23%), Positives = 26/65 (40%)

Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
           L+E   +   + +  E   E  ++  +   +E  E E +  E E   E   EE++  RE 
Sbjct: 745 LEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA 804

Query: 514 KMEEE 518
             E  
Sbjct: 805 LDELR 809


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 30.4 bits (68), Expect = 5.3
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 1/58 (1%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
           Q E  E   +D + ++    E      +  E EE + +  E E    D ++E  D + 
Sbjct: 135 QPEKNESGNNDHATDKENLLESDASSSNDSESEESDSES-EIESSDSDHDDENSDSKL 191


>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
           nuclear protein NIP30.  This is a the N-terminal 100
           amino acids of a family of proteins conserved from
           plants to humans. The full-length protein has putatively
           been called NEFA-interacting nuclear protein NIP30,
           however no reference could be found to confirm this.
          Length = 99

 Score = 28.5 bits (64), Expect = 5.3
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE--EIEDERETKMEEE 518
           E+E  E +   QE+         + E++ DG    E   E + K +EE
Sbjct: 4   ESELDEARKRRQEEVRAPRDPKAEPEEEYDGRSLYERLQENKDKKQEE 51


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 30.1 bits (68), Expect = 6.1
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 4/59 (6%)

Query: 456 EENIEEKFDDSSQ---EEYFEAEDQE-GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
           ++  E+   DS+    EEY +  D E  Q DS  + ++        ED +D  E +E  
Sbjct: 496 DDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDDDDIEGVEPV 554


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 30.2 bits (68), Expect = 6.3
 Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 10/79 (12%)

Query: 454 LQEENIEEKFDDSSQEEY-FEAEDQEGQGDSQEDEEME--EQRIEDEEDQEDGNEE---- 506
           LQ+E  + +  +    E   E E  E + D    E  +  +   E  E + + N+     
Sbjct: 95  LQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERL 154

Query: 507 ---IEDERETKMEEECQKF 522
               E + E  +E E +KF
Sbjct: 155 KFENEKKLEESLELEREKF 173



 Score = 29.4 bits (66), Expect = 9.2
 Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 18/93 (19%)

Query: 443 NEIHIKPRICGLQEENI------------EEKFDDSSQEEYFEAEDQEGQGDS-----QE 485
             I  KP      E++I            E++    +  E FE E  E +        +E
Sbjct: 6   VPIQTKPFTIPKCEDSIKGEQARYKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKE 65

Query: 486 DEEMEEQRIEDEEDQED-GNEEIEDERETKMEE 517
             ++EEQ I  +++Q++  NE+I+       +E
Sbjct: 66  LSQLEEQLINQKKEQKNLFNEQIKQFELALQDE 98


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 30.5 bits (69), Expect = 6.3
 Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 6/66 (9%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQ-----GDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           + ++E+ ++  +EE  EAE +           ++  E  EQ IE+ E++ +  EE  DE 
Sbjct: 789 QALQEELEEL-EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847

Query: 512 ETKMEE 517
           E ++EE
Sbjct: 848 EEELEE 853


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 28.5 bits (64), Expect = 6.5
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
           ++  + +D E  G   ++EE    R+   ED+E+   E E+ERE +
Sbjct: 56  DDDDDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKR 101


>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           palm domain and Zn-ribbon; MTH326-like has inactivated
           polymerase catalytic domain; alr1562 and slr7011 -
           predicted only on the basis of size, presence of HD
           domain, and location with RAMPs in one operon; signature
           gene for type III; also known as Crm2 family.
          Length = 475

 Score = 30.1 bits (68), Expect = 6.7
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 2/66 (3%)

Query: 454 LQEENIEEKFDDSSQ--EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
           L+EE  EE ++      +EY + E+ E      +  E    RIED  +  +  EE+E  R
Sbjct: 96  LKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAEELEAAR 155

Query: 512 ETKMEE 517
           +   + 
Sbjct: 156 KNVRDF 161


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 28.8 bits (65), Expect = 7.0
 Identities = 13/56 (23%), Positives = 25/56 (44%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
            E+ ++E+ +     E    ++++ + +  ED E +E     E    D  EE  DE
Sbjct: 65  LEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDE 120


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 29.6 bits (66), Expect = 7.3
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
           EE ++         +    + ++ Q  S+  E ++E + E+++ QE+  EE E E E K 
Sbjct: 177 EELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEE-EVEEEAKQ 235

Query: 516 EE 517
           EE
Sbjct: 236 EE 237


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 29.3 bits (65), Expect = 7.4
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 4/69 (5%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQED--EEMEEQRIEDEEDQEDGNEEIEDERETKME-EECQ 520
           D+ ++ E  + E +E      ED  EE+E+Q  E+  +QE+ NEE + + E     +E +
Sbjct: 19  DELAEREKLKKEVEEIPEQKPEDIVEELEDQP-EEPPEQEEENEEQKPKEEIDYPIQENK 77

Query: 521 KFGHYNKDD 529
            F   N DD
Sbjct: 78  SFDEKNLDD 86


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 29.2 bits (66), Expect = 7.5
 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-------DQEDGNEEI 507
           +E    EK ++   EE    E+ E + + QE  E+E Q  E ++       + E+  +  
Sbjct: 14  EETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQAEFENLRKRT 73

Query: 508 EDERETKMEEECQKF 522
           E ERE   +   +KF
Sbjct: 74  EREREEAKKYAIEKF 88


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 29.8 bits (67), Expect = 7.6
 Identities = 11/52 (21%), Positives = 20/52 (38%)

Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
           ++ +     DD   + Y  ++      +S  D   E+ R E E +    NE 
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNEL 333


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 29.7 bits (67), Expect = 8.0
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 2/74 (2%)

Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE-ECQKF 522
           DD   +  FE   +E Q D +E++E++E  +E+  D    N  +++  ++ M   +  K 
Sbjct: 142 DDIIYDGIFEDRTRESQ-DMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQ 200

Query: 523 GHYNKDDPHSFQLP 536
               +  P + Q P
Sbjct: 201 SVPAQQAPPNPQQP 214


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 29.7 bits (67), Expect = 8.4
 Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 15/69 (21%)

Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN---------------EEIED 509
           +  +EE  + +D E   +  E+E   E+  +DE  QE                  E + D
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKDVLESLID 188

Query: 510 ERETKMEEE 518
             E K+E E
Sbjct: 189 AAEWKLEVE 197


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 29.8 bits (67), Expect = 8.7
 Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 431  TYITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME 490
              ++  DN I+  E   K ++  L++  IEE      + E  E+E QE   D ++    +
Sbjct: 2549 ELLSYIDNEIKELENE-KLKL--LEKAKIEE---SRKERERIESETQEDNTDEEQINRQQ 2602

Query: 491  EQRI-EDEEDQEDGNEEIEDERET 513
            ++R+ ++EE +    E +  E   
Sbjct: 2603 QERLQKEEEQKAYSQERLNREVSG 2626


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 29.5 bits (66), Expect = 8.9
 Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 12/75 (16%)

Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME----------EQRIEDEEDQEDGNEE 506
           E +E + +D S EE  E E+ E + ++  +++ME          + RI  +E + +  +E
Sbjct: 3   EVLELEEEDESGEE--EEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKE 60

Query: 507 IEDERETKMEEECQK 521
              E E K + E +K
Sbjct: 61  KALEEEAKRKAEERK 75


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.8 bits (68), Expect = 9.3
 Identities = 13/47 (27%), Positives = 22/47 (46%)

Query: 472 FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
           F   + E        E  E    EDEE++ED N++     E+++ E+
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 29.5 bits (67), Expect = 9.4
 Identities = 21/59 (35%), Positives = 36/59 (61%), Gaps = 4/59 (6%)

Query: 448 KPRICGLQEE--NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN 504
           K R+  L++E   +EE++ D   EE ++AE    QG  Q  EE+E+ R+E E+ + +G+
Sbjct: 433 KERLEDLEKELAELEEEYAD--LEEQWKAEKAAIQGIQQIKEEIEQVRLELEQAEREGD 489


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.138    0.431 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,885,622
Number of extensions: 4836736
Number of successful extensions: 7219
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6054
Number of HSP's successfully gapped: 360
Length of query: 950
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 844
Effective length of database: 6,236,078
Effective search space: 5263249832
Effective search space used: 5263249832
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (28.5 bits)