RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2633
         (764 letters)



>gnl|CDD|202677 pfam03531, SSrecog, Structure-specific recognition protein (SSRP1).
            SSRP1 has been implicated in transcriptional initiation
           and elongation and in DNA replication and repair. This
           domain belongs to the Pleckstrin homology fold
           superfamily.
          Length = 213

 Score =  318 bits (818), Expect = e-104
 Identities = 126/215 (58%), Positives = 161/215 (74%), Gaps = 2/215 (0%)

Query: 74  FAGFKETEIDKICSFFKTNFKIDVPEKELSVRGWNWGGTKFIGNVLSFDVNNTTAFEIPL 133
           F GF++ ++DK+ +F K+NF   + EKELSV+GWNWG   F G+ L+FDV +  AFE+PL
Sbjct: 1   FDGFRDQDLDKLQNFIKSNFSKSIDEKELSVKGWNWGEADFKGSNLTFDVGSKPAFEVPL 60

Query: 134 RDVSQCTTGKNEVTVEFHPNDEAAESLIEMRFYIPTNEIAGDTDPVEAFKDSVMNQASVI 193
            +VSQ   GKNEVT+EFH ND++  SL+EMRF++P      D D VE FK++V+ +A VI
Sbjct: 61  SNVSQTLQGKNEVTLEFHQNDDSGVSLMEMRFHVPVTNTEDDGDAVEVFKETVLAKADVI 120

Query: 194 NATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFTLPHKDGRQ 253
            AT DAI  F EI  LTPRGRYDIK++ +F +LHGKT+DYKIP S++ R+F LPHKD RQ
Sbjct: 121 QATEDAIVTFEEILILTPRGRYDIKLYPTFLRLHGKTYDYKIPYSSINRLFLLPHKDQRQ 180

Query: 254 NFFVISLDPPIKQGQTRYHFLTLLFNQDETSEMEL 288
            FFVISLDPPI+QGQTRY +L L F +DE  EMEL
Sbjct: 181 VFFVISLDPPIRQGQTRYPYLVLQFEKDE--EMEL 213


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score =  324 bits (831), Expect = e-102
 Identities = 173/506 (34%), Positives = 265/506 (52%), Gaps = 34/506 (6%)

Query: 20  PGKLKITDQNIVFK-NKKTGKVEQINGPDLDFVNWQKLVGTWALRIFLKNGSLHRFAGFK 78
            G ++I    + +K +    K   +   ++    W + V  + L+I +K  +++   GF 
Sbjct: 16  KGTVRIARSGLGWKASDSERKPFTLPRNEVKDAEWSRGVRGYKLKIRVKGNAVYELDGFS 75

Query: 79  ETEIDKICSFFKTNFKIDVPEKELSVRGWNWGGTKFIGNVLSFDVNNTTAFEIPLRDV-S 137
           + +ID++ + F   F+I + +KELS+ GWNWG     G    F  N    FEIP+ D+ +
Sbjct: 76  QNDIDELKNIFSEYFRITLEQKELSIAGWNWGELGINGQEAVFFRNTKPIFEIPVDDIEN 135

Query: 138 QCTTGKNEVTVEFHPNDE----AAESLIEMRFYIP----TNEIAGDTD----PVEAFKDS 185
                KNEV+VEF   DE    A + L+EMRFY P      +IAG         EAF + 
Sbjct: 136 TNLDIKNEVSVEFRIQDEEYQPAGDELVEMRFYSPGVKTKEDIAGGESVEKSMAEAFYEE 195

Query: 186 VMNQASVINATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFT 245
           +  +A +  + GDAI  F  +   TPRGRYDI  +  + +L GKT+DYKI   ++  ++ 
Sbjct: 196 LKEKADIGESAGDAIVSFEGLSLATPRGRYDIDFYRDYLRLRGKTYDYKIYYKSIKMLYV 255

Query: 246 LPHKDGRQNFFVISLDPPIKQGQTRYHFLTLLFNQDETSEMELPFSEQELKEKYEGKLDK 305
           LP  D    + VI  +PP++QGQTRY FL + F +DE  E+EL   +++ +E Y+ KL  
Sbjct: 256 LPKIDDGHRYVVIGAEPPLRQGQTRYPFLVVQFQKDEDVEVELNVEDEDYEENYKDKLKG 315

Query: 306 ELSGPTYEVMAKIMKVIVNRKITVPGSFRGHTGTSAVTCSYKAAAGYIYPLERGFIFIHK 365
           E  G   EV +++M+ +  RK+  P  F    G  AV CS KA  G +YPL+  F+F+ K
Sbjct: 316 EYDGLLSEVFSEVMEGLTVRKVVRPSEFESRDGMRAVRCSMKANEGQLYPLDDCFLFLPK 375

Query: 366 PPIHIRFEEIATVNFARSGGST---RSFDFEIELKSGVLHTFSSIEKEEYGKLFDFIREK 422
           P + +   +I+ V F+R G S+   R+FD  + L+S   +TF++I K+E G L  F+  K
Sbjct: 376 PTLRLDLSDISLVEFSRIGLSSMQARTFDLTLFLRSPGSYTFNNISKDEQGALEQFLHSK 435

Query: 423 KLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNP 482
            ++ +N    E+   + D GS  D             E        +EDE E  DEDF  
Sbjct: 436 GIKARNEEVQER--LQTDLGSISDS------------EDINMGSAGEEDESE--DEDFQM 479

Query: 483 DQAESDVAEEYDSNPTPTDSDEDSDA 508
             ++SDVAEEYD     +D++  SD 
Sbjct: 480 -VSDSDVAEEYDLQAALSDAEGGSDE 504


>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
           (SSRP1) Pleckstrin homology (PH) domain, repeat 1.
           SSRP1 is a component of FACT (facilitator of chromatin
           transcription), an essential chromatin reorganizing
           factor. In yeast FACT (yFACT) is composed of three
           proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
           Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
           human and mouse, respectively. The middle domain of the
           Pob3 subunit (Pob3-M) has an unusual double pleckstrin
           homology (PH) architecture. yFACT interacts in a
           physiologically important way with the central
           single-strand DNA binding factor RPA to promote a step
           in DNA Replication. Coordinated function by yFACT and
           RPA is important during nucleosome deposition. These
           results support the model that the FACT family has an
           essential role in constructing nucleosomes during DNA
           replication, and suggest that RPA contributes to this
           process. Members of this cd are composed of the first
           PH-like repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 137

 Score =  234 bits (599), Expect = 2e-73
 Identities = 83/137 (60%), Positives = 108/137 (78%)

Query: 195 ATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFTLPHKDGRQN 254
           ATGDAI  F ++ CLTPRGRYDI ++ SF +LHGKT+DYKIP S++ R+F LP  D R  
Sbjct: 1   ATGDAIVTFEDVLCLTPRGRYDIDMYPSFLRLHGKTYDYKIPYSSISRLFLLPKPDDRHV 60

Query: 255 FFVISLDPPIKQGQTRYHFLTLLFNQDETSEMELPFSEQELKEKYEGKLDKELSGPTYEV 314
           FFVISLDPPI+QGQTRY FL + F++DE  E+EL  +E+EL+EKY+GKL+KE  GP YEV
Sbjct: 61  FFVISLDPPIRQGQTRYPFLVMQFDKDEEIELELNLTEEELEEKYKGKLEKEYEGPLYEV 120

Query: 315 MAKIMKVIVNRKITVPG 331
           ++++ K +  +KITVPG
Sbjct: 121 VSRVFKALTGKKITVPG 137


>gnl|CDD|241385 cd13231, PH2_SSRP1-like, Structure Specific Recognition protein 1
           (SSRP1) Pleckstrin homology (PH) domain, repeat 2.
           SSRP1 is a component of FACT (facilitator of chromatin
           transcription), an essential chromatin reorganizing
           factor. In yeast FACT (yFACT) is composed of three
           proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
           Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
           human and mouse, respectively.The middle domain of the
           Pob3 subunit (Pob3-M) has an unusual double pleckstrin
           homology (PH) architecture. yFACT interacts in a
           physiologically important way with the central
           single-strand DNA binding factor RPA to promote a step
           in DNA Replication. Coordinated function by yFACT and
           RPA is important during nucleosome deposition. These
           results support the model that the FACT family has an
           essential role in constructing nucleosomes during DNA
           replication, and suggest that RPA contributes to this
           process. Members of this cd are composed of the second
           PH-like repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 100

 Score =  164 bits (418), Expect = 4e-48
 Identities = 56/100 (56%), Positives = 76/100 (76%), Gaps = 3/100 (3%)

Query: 333 FRGHTGTSAVTCSYKAAAGYIYPLERGFIFIHKPPIHIRFEEIATVNFARSGG---STRS 389
           F+  +G  A+ CSYKA  G +YPLE+ F+F+HKPPI IRF++I++V F+R GG   S+R+
Sbjct: 1   FKSSSGQHAIKCSYKANEGLLYPLEKSFLFVHKPPILIRFDDISSVEFSRVGGSSTSSRT 60

Query: 390 FDFEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNT 429
           FD EIELK G  HTFSSI++EEYG L DF++ K L++KN 
Sbjct: 61  FDLEIELKGGTEHTFSSIDREEYGPLEDFLKSKGLKIKNE 100


>gnl|CDD|219876 pfam08512, Rtt106, Histone chaperone Rttp106-like.  This family
           includes Rttp106, a histone chaperone involved in
           heterochromatin-mediated silencing. This domain belongs
           to the Pleckstrin homology domain superfamily.
          Length = 93

 Score =  118 bits (297), Expect = 1e-31
 Identities = 40/93 (43%), Positives = 59/93 (63%), Gaps = 2/93 (2%)

Query: 339 TSAVTCSYKAAAGYIYPLERGFIF-IHKPPIHIRFEEIATVNFARSGGSTRSFDFEIELK 397
              V CSYKA+ G +YPLE+G +F   KPP+ I  ++I +V+F R  G  R+FD  +  K
Sbjct: 1   FPGVPCSYKASEGLLYPLEKGLLFLFEKPPLVITLDDIESVSFERVSGGLRTFDLVVVFK 60

Query: 398 -SGVLHTFSSIEKEEYGKLFDFIREKKLRVKNT 429
            SG +H FSSI++EE   L DF++ K +++K  
Sbjct: 61  YSGPVHEFSSIDREELDVLKDFLKSKGIKIKEG 93


>gnl|CDD|189580 pfam00505, HMG_box, HMG (high mobility group) box. 
          Length = 69

 Score = 84.2 bits (209), Expect = 4e-20
 Identities = 31/69 (44%), Positives = 45/69 (65%)

Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
           PK+P+SA+ ++  + R K+K +NPG+   EI+K  GE WK +S +EK  Y+EK  K K  
Sbjct: 1   PKRPLSAFFLFSQEQRAKLKAENPGLKNAEISKILGEKWKNLSEEEKKPYEEKAEKEKAR 60

Query: 614 YNEALKAYK 622
           Y +A  AYK
Sbjct: 61  YEKAYPAYK 69


>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
           members of the HMG-box superfamily of DNA-binding
           proteins. These proteins bind the minor groove of DNA in
           a non-sequence specific fashion and contain two or more
           tandem HMG boxes. Class II members include non-histone
           chromosomal proteins, HMG1 and HMG2, which bind to bent
           or distorted DNA such as four-way DNA junctions,
           synthetic DNA cruciforms, kinked cisplatin-modified DNA,
           DNA bulges, cross-overs in supercoiled DNA, and can
           cause looping of linear DNA. Class III members include
           nucleolar and mitochondrial transcription factors, UBF
           and mtTF1, which bind four-way DNA junctions.
          Length = 66

 Score = 81.5 bits (202), Expect = 4e-19
 Identities = 31/66 (46%), Positives = 45/66 (68%)

Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
           PK+P+SAY ++  + R K+KK+NP  S TE+ K  GE WK +S +EK +Y+EK  K KE 
Sbjct: 1   PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKER 60

Query: 614 YNEALK 619
           Y + +K
Sbjct: 61  YEKEMK 66


>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
           variety of eukaryotic chromosomal proteins and
           transcription factors. HMGs bind to the minor groove of
           DNA and have been classified by DNA binding preferences.
           Two phylogenically distinct groups of Class I proteins
           bind DNA in a sequence specific fashion and contain a
           single HMG box. One group (SOX-TCF) includes
           transcription factors, TCF-1, -3, -4; and also SRY and
           LEF-1, which bind four-way DNA junctions and duplex DNA
           targets. The second group (MATA) includes fungal mating
           type gene products MC, MATA1 and Ste11. Class II and III
           proteins (HMGB-UBF) bind DNA in a non-sequence specific
           fashion and contain two or more tandem HMG boxes. Class
           II members include non-histone chromosomal proteins,
           HMG1 and HMG2, which bind to bent or distorted DNA such
           as four-way DNA junctions, synthetic DNA cruciforms,
           kinked cisplatin-modified DNA, DNA bulges, cross-overs
           in supercoiled DNA, and can cause looping of linear DNA.
           Class III members include nucleolar and mitochondrial
           transcription factors, UBF and mtTF1, which bind
           four-way DNA junctions.
          Length = 66

 Score = 80.4 bits (199), Expect = 8e-19
 Identities = 29/66 (43%), Positives = 47/66 (71%)

Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
           PK+P+SAY ++  + R ++K +NPG+S  EI+K  GE+WK++S +EK +Y+EK  K KE 
Sbjct: 1   PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKER 60

Query: 614 YNEALK 619
           Y + + 
Sbjct: 61  YEKEMP 66


>gnl|CDD|197700 smart00398, HMG, high mobility group. 
          Length = 70

 Score = 78.9 bits (195), Expect = 4e-18
 Identities = 36/70 (51%), Positives = 48/70 (68%)

Query: 553 APKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKE 612
            PK+PMSA+M++  + R KIK +NP +S  EI+KK GE WK +S +EKA Y+EK  K KE
Sbjct: 1   KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE 60

Query: 613 DYNEALKAYK 622
            Y E +  YK
Sbjct: 61  RYEEEMPEYK 70


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 79.9 bits (197), Expect = 6e-17
 Identities = 53/179 (29%), Positives = 81/179 (45%), Gaps = 23/179 (12%)

Query: 529 KPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKG 588
           KPRK  K K        RKKKD N PK+P+SAY ++  + R++I+K+NP ++F E+ K  
Sbjct: 51  KPRKKTKSK-----RLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLL 105

Query: 589 GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
            E WK ++ +EK  Y ++ N  +E Y    + Y +          K           E  
Sbjct: 106 SEKWKELTDEEKEPYYKEANSDRERYQREKEEYNK----------KLPNKAPIGPFIENE 155

Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
                  K++ K    S   +    ++ K  SK   + DE  K K   K +K + E DS
Sbjct: 156 P------KIRPKVEGPSPDKA--LVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDS 206


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 68.1 bits (167), Expect = 1e-11
 Identities = 57/263 (21%), Positives = 86/263 (32%), Gaps = 18/263 (6%)

Query: 504  EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMM 563
            E+ +  E  +  K+++ +          +  K K+      K+KK   +  KK       
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKK---KEKKKKKSSADKSKKASVVGNS 1198

Query: 564  WFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
               D  EK K D+         KK           E+ +   K +  K   ++   + K 
Sbjct: 1199 KRVDSDEKRKLDD-----KPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKS 1253

Query: 624  SGGGQDSDDGKTSKSKKPAKKKETASTAV-SPHKVKSKEFIESNGSSSDSDDDKKASSKR 682
            S    +      SK  KP    +  S    SP     +   ESNG S  S   KK   KR
Sbjct: 1254 SEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKR 1313

Query: 683  KRDSDEDSKAKTKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
               S    K K K +K+ +   +S +   +      S+      K KS    E DD+   
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSE- 1372

Query: 742  EPVESTPPSSDAGSDASSSEDDD 764
                          D    EDDD
Sbjct: 1373 -------VDDSEDEDDEDDEDDD 1388



 Score = 43.5 bits (103), Expect = 5e-04
 Identities = 26/164 (15%), Positives = 57/164 (34%)

Query: 590  ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETAS 649
            ++W     K +   +E+    +++  +  +   ++ G          K K+  KKK +A 
Sbjct: 1128 DMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD 1187

Query: 650  TAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGE 709
             +     V + + ++S+      D      S       ED + +  K K+ S     S +
Sbjct: 1188 KSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKK 1247

Query: 710  SEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
            +   K S+   +        + + +   +       S PP S  
Sbjct: 1248 NNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKR 1291


>gnl|CDD|238684 cd01388, SOX-TCF_HMG-box, SOX-TCF_HMG-box, class I member of the
           HMG-box superfamily of DNA-binding proteins. These
           proteins contain a single HMG box, and bind the minor
           groove of DNA in a highly sequence-specific manner.
           Members include SRY and its homologs in insects and
           vertebrates, and transcription factor-like proteins,
           TCF-1, -3, -4, and LEF-1. They appear to bind the minor
           groove of the A/T C A A A G/C-motif.
          Length = 72

 Score = 53.5 bits (129), Expect = 4e-09
 Identities = 23/68 (33%), Positives = 39/68 (57%)

Query: 555 KKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDY 614
           K+PM+A+M++    R K+ ++ P      I+K  G+ WK +S++EK  Y E+  K KE +
Sbjct: 3   KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELH 62

Query: 615 NEALKAYK 622
            +    YK
Sbjct: 63  MKLYPDYK 70


>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
          Length = 94

 Score = 53.7 bits (129), Expect = 5e-09
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 532 KSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGIS--FTEIAKKGG 589
            +KK  +      KRKKKD NAPK+ +SAYM +  + R +I  +NP ++     + K  G
Sbjct: 1   MAKKQGKVLVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVG 60

Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
           E W  +S +EKA Y++K  + K  Y +    Y +
Sbjct: 61  EAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYAK 94


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 57.5 bits (138), Expect = 2e-08
 Identities = 73/305 (23%), Positives = 130/305 (42%), Gaps = 22/305 (7%)

Query: 431  KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
            K ++   K +     DE +  A  A+ K +AA++  E+ +   E+   +      E++ A
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362

Query: 491  EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
            EE          +    A     KKK E+ +K     +K  + KK  +    +   KKK 
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAA--KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420

Query: 551  KNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKA 610
              A KK             EK K D       E  KK  E  K   +K+KAE  +K  +A
Sbjct: 1421 DEAKKKA-----------EEKKKAD-------EAKKKAEEAKKADEAKKKAEEAKKAEEA 1462

Query: 611  KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
            K+   EA KA +     +++     +K K    KK+      +    K  +  +    + 
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522

Query: 671  DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
             +D+ KKA   +K  +DE  KA+ KKK ++ +   +  ++E++KK++   K ++ K+ + 
Sbjct: 1523 KADEAKKAEEAKK--ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580

Query: 731  KEDEI 735
            ++ E 
Sbjct: 1581 RKAEE 1585



 Score = 55.9 bits (134), Expect = 7e-08
 Identities = 75/295 (25%), Positives = 128/295 (43%), Gaps = 19/295 (6%)

Query: 458  KREAAERDDEDDEDEDESTDEDFNPDQA-----ESDVAEEYDSNPTPTDSDEDSDASEGT 512
            K+   ER++E+    +E+    F   QA     E+  A+E         +DE   A E  
Sbjct: 1243 KKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK 1302

Query: 513  K----KKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDM 568
            K    KKK E+++K     +K  ++KK  +++    +  KK   A K    A        
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362

Query: 569  REKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQ 628
             EK +         E  KK     K    K+KA  DE   KA+ED  +A +  K +   +
Sbjct: 1363 EEKAEAAEKKKE--EAKKKADAAKKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKK 1418

Query: 629  DSDDGKTSKSKK----PAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKR 684
             +D+ K    +K     AKKK   +      K K++E  ++  +   +++ KKA   +K+
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478

Query: 685  DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
              +     + KKK E+++ ++D  E++K  ++K K    K   ++KK DE    E
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKAD--EAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531



 Score = 51.7 bits (123), Expect = 2e-06
 Identities = 73/310 (23%), Positives = 133/310 (42%), Gaps = 15/310 (4%)

Query: 439  EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPT 498
            E+   + +  + +A  A  + EAAE   E  E + E  +     D A+    E+  ++  
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE--EAKKKADAAKKKAEEKKKADEA 1396

Query: 499  PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
               ++ED   ++  KK    K  K    ++K  + KK  + +    +  KK   A KK  
Sbjct: 1397 KKKAEEDKKKADELKKAAAAK--KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454

Query: 559  SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL 618
             A         E  KK        E  KK  E  K   +K+KA   E+  K  ++  +A 
Sbjct: 1455 EA-----KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA---EEAKKKADEAKKAA 1506

Query: 619  KAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA 678
            +A K++   + +++ K +   K A++ + A  A    + K  + ++       +++ KKA
Sbjct: 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKA 1566

Query: 679  SSKRKRDSDEDS---KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEI 735
               +K + D++    KA+  KK E++  E      E+EKK K++      ++K K E+  
Sbjct: 1567 EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626

Query: 736  DDEEYDEPVE 745
              EE  + VE
Sbjct: 1627 KAEEEKKKVE 1636



 Score = 48.6 bits (115), Expect = 1e-05
 Identities = 63/309 (20%), Positives = 122/309 (39%), Gaps = 10/309 (3%)

Query: 457  VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKK 516
            V++    R  ED    + +   +      E+  AE+         ++E    +E  KK +
Sbjct: 1187 VRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAE 1246

Query: 517  KEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDN 576
            +E++ + I   E+ R +   +  +    +  +K     K              EK K D 
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD- 1305

Query: 577  PGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTS 636
                  E  KK  E  K   +K+KAE  +K  KA     +A +A K +   +   +    
Sbjct: 1306 ------EAKKKAEEAKKADEAKKKAEEAKK--KADAAKKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 637  KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE-DSKAKTK 695
            +++   +K E A       K K+    +       +D+ KK + + K+ +DE    A  K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417

Query: 696  KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
            KK ++++ +++  +   E K K++      ++K K E+    EE  +  E    + +A  
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477

Query: 756  DASSSEDDD 764
             A  ++  D
Sbjct: 1478 KAEEAKKAD 1486



 Score = 48.6 bits (115), Expect = 1e-05
 Identities = 66/302 (21%), Positives = 122/302 (40%), Gaps = 21/302 (6%)

Query: 458  KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK-- 515
            K E A + ++  + E     E+   D  E+  AEE  +N      +E   A    ++   
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272

Query: 516  KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKD 575
            K E++RK   + +   K K D+       + KKK   A KK             E  K D
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKK---AEEKKKADEAKKKA-----------EEAKKAD 1318

Query: 576  NPGISFTEIAKKGGELWKTVS-SKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK 634
                   E  KK     K    +K+ AE  +   +A  D  EA +   E+   +  +  K
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378

Query: 635  TS----KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDS 690
             +    K  +  KK + A       K K+ E  ++  +   +D+ KK + ++K+  +   
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438

Query: 691  KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
            KA+  KK ++++ +++  +  +E K K++      ++K K E+    +E  +  E     
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498

Query: 751  SD 752
            +D
Sbjct: 1499 AD 1500



 Score = 45.9 bits (108), Expect = 1e-04
 Identities = 80/350 (22%), Positives = 156/350 (44%), Gaps = 22/350 (6%)

Query: 408  EKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDE 467
            +KEE  K  D  ++K    K   + +K   +ED   +D+ K+  A  A+ K + A++  E
Sbjct: 1372 KKEEAKKKADAAKKKAEEKKKADEAKK-KAEEDKKKADELKKAAA--AKKKADEAKKKAE 1428

Query: 468  DDEDEDES---TDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
            + +  DE+    +E    D+A+    E   +      ++E   A E   KKK E+++K  
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA--KKKAEEAKKAD 1486

Query: 525  TISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEI 584
               +K  ++KK  + +  + + KKK   A K   +          E  K D       E 
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE--AKKAEE 1544

Query: 585  AKKGGELWKT--VSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDS---------DDG 633
             KK  EL K   +   E+ +  E+  KA+ED N AL+  +E+   +++         ++ 
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604

Query: 634  KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
            K  K+++  K +E    A    K + ++         ++++ KKA   +K + +   KA 
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664

Query: 694  TKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
             + KK + +  +++  +  +E + K+     K   ++KK +E+  +E +E
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714



 Score = 45.5 bits (107), Expect = 1e-04
 Identities = 64/330 (19%), Positives = 135/330 (40%), Gaps = 21/330 (6%)

Query: 428  NTGKGEKPNYKE-DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
               +G KP+YK+ DF + +D +  +A      +    +  E  + E+    E+      +
Sbjct: 1067 GQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAED 1126

Query: 487  SDVAEEYDSNPTPTDSDEDSDASEGTK---KKKKEKSRKTITISEKPRKSKKDKESSGGS 543
            +  AEE         ++E   A +  +    +K E +RK    + K   +KK + +    
Sbjct: 1127 ARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARK-AEEARKAEDAKKAEAARKAE 1185

Query: 544  TKRKKKDKNAPKKPMSAYMMWFNDMREKIK--------KDNPGISFTEIAKKGGELWKTV 595
              RK ++    +    A      +   K +        K    +   E AKK  E  K  
Sbjct: 1186 EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245

Query: 596  SSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA--------KKKET 647
              +   E   K  +A+  +    +A  ++   + +D+ K ++ KK A        KKK  
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305

Query: 648  ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
             +   +    K+ E  +    +    D  K  ++  + + E +KA+ +   +++E+  + 
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365

Query: 708  GESEKEKKSKSKPKGDKSKSKSKKEDEIDD 737
             E+ ++KK ++K K D +K K++++ + D+
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395



 Score = 44.4 bits (104), Expect = 3e-04
 Identities = 71/338 (21%), Positives = 137/338 (40%), Gaps = 7/338 (2%)

Query: 421  EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDF 480
            E K + +   K ++   K +     DE +  A  A+ K + A++  E  +  DE+   + 
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520

Query: 481  NPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESS 540
                 E+  AEE         ++E   A E  K ++ +K+ +     E  +  +    + 
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580

Query: 541  GGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEK 600
              + + KK ++   ++ M  Y        E+ KK        E  KK  E  K V   +K
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640

Query: 601  AEYDEK-----VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
             E +EK     + KA+E+         +          +  K+++  KK   A    +  
Sbjct: 1641 KEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700

Query: 656  KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
              K++E  +         ++ K + +  +   E++K + ++ K+K+E E+   E EK+K 
Sbjct: 1701 AKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE-EAKKDEEEKKKI 1759

Query: 716  SKSKPKGDKSKSKSKKEDE-IDDEEYDEPVESTPPSSD 752
            +  K + +K   + +KE E + +EE DE  E      D
Sbjct: 1760 AHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797



 Score = 44.4 bits (104), Expect = 3e-04
 Identities = 74/331 (22%), Positives = 130/331 (39%), Gaps = 39/331 (11%)

Query: 421  EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDF 480
            E   + +   K E+    E+   ++D K+ +A    VK+    + D ++  + E    + 
Sbjct: 1197 EDARKAEAARKAEEERKAEEARKAEDAKKAEA----VKKAEEAKKDAEEAKKAEEERNNE 1252

Query: 481  NPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPR--KSKKDKE 538
               + E      +        ++E   A E  K ++K+K+ +     EK +  ++KK  E
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAE 1312

Query: 539  SSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSK 598
             +  + + KKK + A KK  +A         E  K      +  E A    E     +++
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAK----KKAEEAKKAAEAAKAEAEAAADEAE-----AAE 1363

Query: 599  EKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA---KKKETASTAVSPH 655
            EKAE  EK  +  +   +A K   E     D    K  + KK A   KK   A       
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423

Query: 656  KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKE--------KSESESDS 707
            K K++E              KKA   +K+  +     + KKK E        K ++E   
Sbjct: 1424 KKKAEE-------------KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470

Query: 708  GESEKEKKSKSKPKGDKSKSKSKKEDEIDDE 738
               E +KK++   K D++K K+++  +  DE
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501



 Score = 42.4 bits (99), Expect = 0.001
 Identities = 68/349 (19%), Positives = 138/349 (39%), Gaps = 9/349 (2%)

Query: 421  EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTD-ED 479
            E K +     K      K D      E++  A  A+ K E A++ DE  +  +E+   E+
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461

Query: 480  FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
                  E+  A+E           +++       KKK ++++K     +K  ++KK +E+
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521

Query: 540  SGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKE 599
                  +K ++     +   A      D  +K ++        +  +          +  
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALR 1581

Query: 600  KAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA---KKKETASTAVSPHK 656
            KAE  +K  +A+ +  E +K Y+E    +  +  K  ++K  A   KK E     V   K
Sbjct: 1582 KAEEAKKAEEARIE--EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLK 1639

Query: 657  VKSKEFI---ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKE 713
             K  E     E    + + +  K A   +K + D+    + KK +E  +  +++ + E E
Sbjct: 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699

Query: 714  KKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
            +  K++    K   + KK +E+   E +  +++     +A  D   +E+
Sbjct: 1700 EAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748



 Score = 37.4 bits (86), Expect = 0.039
 Identities = 64/340 (18%), Positives = 126/340 (37%), Gaps = 11/340 (3%)

Query: 431  KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
            K E+   +E     ++EK+  A  A+   EA  + +E  + E+E        +Q +   A
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKK----KVEQLKKKEA 1643

Query: 491  EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
            EE         ++E++      + KK E+ +K    ++K  + +K    +      + K 
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 551  KNAPKKPMSAYMMWFNDMRE-----KIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDE 605
                KK  +       ++++     KIK +       E  KK  E  K    K+K  + +
Sbjct: 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK 1763

Query: 606  KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIES 665
            K  + K +     K         + D+ +  +  K  K        +     +    I  
Sbjct: 1764 KEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVIND 1823

Query: 666  NGSSSDSDDDKKASSK--RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
            +    DS   + A SK  +  ++D   K K  K  E  E  +   +  KEK  K   + +
Sbjct: 1824 SKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEE 1883

Query: 724  KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDD 763
              ++   ++ + DD E + P  +    ++   D    +D+
Sbjct: 1884 IEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDE 1923



 Score = 36.7 bits (84), Expect = 0.056
 Identities = 61/304 (20%), Positives = 117/304 (38%), Gaps = 11/304 (3%)

Query: 446  DEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDED 505
            DE     +  + + +A    DE  +   +  D D   D    +  EE           E 
Sbjct: 1049 DEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTET 1108

Query: 506  SDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWF 565
              A E  K ++ +K  +    +E+ RK++  +++         K     +K   A     
Sbjct: 1109 GKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEE 1168

Query: 566  NDMREKIKKDNPGISFTEIAK----KGGELWKTVSSKEKAEYDEKVNKAK--EDYNEALK 619
                E  KK        E+ K    +  E  +   +  KAE + K  +A+  ED  +A  
Sbjct: 1169 ARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEA 1228

Query: 620  AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
              K     +D+++ K ++ ++  ++      A   H  + +  I++   +  +D+ KKA 
Sbjct: 1229 VKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE-EARKADELKKAE 1287

Query: 680  SKRK----RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEI 735
             K+K    + ++E  KA   KKK +   ++D  + + E+  K      K   ++KK  E 
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA 1347

Query: 736  DDEE 739
               E
Sbjct: 1348 AKAE 1351



 Score = 31.6 bits (71), Expect = 2.0
 Identities = 65/345 (18%), Positives = 141/345 (40%), Gaps = 23/345 (6%)

Query: 408  EKEEYGKLFDFIREKKLRVKNTGKGEKPNYK-EDFGSSDDEKEPDAYLARVKREAAERDD 466
            E      +  +  EKK++ +   K E+   K E+   +++EK+    L + + E  ++ +
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 467  EDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITI 526
            E  + E+E+        +      E+         ++ED   +    KK+ E+++K   +
Sbjct: 1651 ELKKAEEENK---IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707

Query: 527  SEKPRKSKKDKES-SGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIA 585
             +K  + KK  E       + K K + A K+            +++ +K        E  
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767

Query: 586  KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKES------GGGQDSDDGKTSKSK 639
            KK  E+ K   +  + E DE+  K + + ++ +K   ++      GG + +     SK  
Sbjct: 1768 KKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEM 1827

Query: 640  KPAKKKETASTAVS----PHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK 695
            + +  KE A +           +  +F ++N +  D +  K+A   +++D  ED + + +
Sbjct: 1828 EDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGN--KEADFNKEKDLKEDDEEEIE 1885

Query: 696  KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
            +  E  + + D  E E    +    K +        +D++D +EY
Sbjct: 1886 EADEIEKIDKDDIEREI-PNNNMAGKNNDI-----IDDKLDKDEY 1924


>gnl|CDD|204115 pfam09011, DUF1898, Domain of unknown function (DUF1898).  This
           domain is predominantly found in Maelstrom homolog
           proteins. It has no known function.
          Length = 69

 Score = 48.6 bits (116), Expect = 2e-07
 Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 552 NAPKKPMSAYMMWFNDMREKIKKDNPGIS-FTEIAKKGGELWKTVSSKEKAEYDEKVNKA 610
           N PK   +AY  +   MR ++K++ P +    E +K   E WK +S +EK +Y+EK  + 
Sbjct: 2   NKPKAKRNAYFFFVQTMRPELKREGPQVPGVAEFSKLCSEKWKAMSEEEKEKYEEKARED 61

Query: 611 KEDY 614
           K+ Y
Sbjct: 62  KKRY 65


>gnl|CDD|238685 cd01389, MATA_HMG-box, MATA_HMG-box, class I member of the HMG-box
           superfamily of DNA-binding proteins. These proteins
           contain a single HMG box, and bind the minor groove of
           DNA in a highly sequence-specific manner. Members
           include the fungal mating type gene products MC, MATA1
           and Ste11.
          Length = 77

 Score = 45.3 bits (108), Expect = 3e-06
 Identities = 20/69 (28%), Positives = 39/69 (56%)

Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
             +P +A++++  D   ++K +NPG++  EI++  G +W++ S + KA Y E   + KE 
Sbjct: 2   IPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKER 61

Query: 614 YNEALKAYK 622
           +      YK
Sbjct: 62  HAREYPDYK 70


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 45.0 bits (107), Expect = 1e-04
 Identities = 27/204 (13%), Positives = 60/204 (29%), Gaps = 9/204 (4%)

Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
           M   +   E   ++       ++A K         SK     +E + +A E   +  +  
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKS-------KSKGFITKEE-IKEALESKKKTPEQI 52

Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDD-DKKASS 680
            +           T  + +    K+   TA      K+    +       S   +KK + 
Sbjct: 53  DQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112

Query: 681 KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
            +  D +         + +  + + D  + + +         D  +     + + +DEE 
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172

Query: 741 DEPVESTPPSSDAGSDASSSEDDD 764
            E  E    S D        + + 
Sbjct: 173 KEAKELEKLSDDDDFVWDEDDSEA 196



 Score = 41.1 bits (97), Expect = 0.002
 Identities = 23/153 (15%), Positives = 53/153 (34%), Gaps = 5/153 (3%)

Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
           K  + ++  +    ++   +D ++A ++       + +     +K+    K K+   ++ 
Sbjct: 45  KKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKK-----EKSESESDS 707
              K  + +  +      D D   +A      D D+D              +  + + D 
Sbjct: 105 KAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDD 164

Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
            + E E+K ++K     S       DE D E  
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDDSEAL 197



 Score = 40.0 bits (94), Expect = 0.005
 Identities = 25/234 (10%), Positives = 58/234 (24%), Gaps = 26/234 (11%)

Query: 507 DASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFN 566
             +  TK +   +      + +   KSK     +    K   + K    +          
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE---------- 50

Query: 567 DMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGG 626
             +             +             +K  A+       AK+   + L + K++  
Sbjct: 51  --QIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108

Query: 627 GQDSDDGKTSKSKKPAKKKETASTAVSP---HKVKSKEFIESNGSSSDSDDDKKASSKRK 683
               D        K       A           +   +  + +    D +DD       +
Sbjct: 109 KNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDE 168

Query: 684 RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDD 737
            +  +++K        +  S+ D    +++     +        K  K     D
Sbjct: 169 DEEKKEAKE------LEKLSDDDDFVWDEDDSEALRQ-----ARKDAKLTATAD 211



 Score = 36.9 bits (86), Expect = 0.038
 Identities = 22/117 (18%), Positives = 43/117 (36%)

Query: 409 KEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDED 468
           K    K      +   + +     +K +        D   + D        +  + DD D
Sbjct: 89  KAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDID 148

Query: 469 DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTIT 525
           D+D+DE  DED + D  + +  E+ ++      SD+D    +    +   ++RK   
Sbjct: 149 DDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205



 Score = 36.5 bits (85), Expect = 0.059
 Identities = 30/136 (22%), Positives = 57/136 (41%)

Query: 403 TFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAA 462
           T S I K++          K    K        + K +  ++ D+ +   Y+  +     
Sbjct: 70  TESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQ 129

Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
             DD+DD+D+D+  D+D + D  + D  E+ D +    + +E  +A E  K    +    
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVW 189

Query: 523 TITISEKPRKSKKDKE 538
               SE  R+++KD +
Sbjct: 190 DEDDSEALRQARKDAK 205



 Score = 32.3 bits (74), Expect = 1.0
 Identities = 25/148 (16%), Positives = 59/148 (39%), Gaps = 4/148 (2%)

Query: 420 REKKLRVKNTGKG---EKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDEST 476
              K + K   K    + P  K+     D  K+ +     + ++      +D +  +++ 
Sbjct: 73  DIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK-KNALDKDDDLNYVKDIDVLNQAD 131

Query: 477 DEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKD 536
           D+D + D  + D  +  D +    D ++D D     + ++K+++++   +S+       +
Sbjct: 132 DDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191

Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMW 564
            +S      RK     A   P+ AY+  
Sbjct: 192 DDSEALRQARKDAKLTATADPVKAYLKQ 219



 Score = 30.3 bits (69), Expect = 4.3
 Identities = 28/182 (15%), Positives = 58/182 (31%), Gaps = 14/182 (7%)

Query: 470 EDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEK 529
           E  +         DQ    ++         T+SD     ++   K    K+     + ++
Sbjct: 40  EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99

Query: 530 PRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGG 589
              SKK         ++K              +   N   +    D+      ++     
Sbjct: 100 LDSSKK--------AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDD----DDDLDDDDI 147

Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKK--KET 647
           +        ++ + D+ V+   E+  EA +  K S       D   S++ + A+K  K T
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207

Query: 648 AS 649
           A+
Sbjct: 208 AT 209


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 44.2 bits (104), Expect = 2e-04
 Identities = 49/275 (17%), Positives = 92/275 (33%), Gaps = 24/275 (8%)

Query: 444 SDDEKEPDAYLARVKREAAER--------------DDEDDEDEDESTDEDFNPDQAESDV 489
            D EK+    L  +K+EAA R               +    +E+    E   P   +  +
Sbjct: 1   MDPEKD----LEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTI 56

Query: 490 AEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
            E          +   + A      K+K +  + +T  EK +   K   ++        K
Sbjct: 57  EE------AKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAK 110

Query: 550 DKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNK 609
            K    + ++         +          +  +  ++G E       +   E  +    
Sbjct: 111 QKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAA 170

Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSS 669
           A      A  A +++    +  +  T + K  AK K  A+       +  ++  + NG S
Sbjct: 171 AAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDS 230

Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE 704
            D D   KA +  K  +   ++AKTK  + K E E
Sbjct: 231 GDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEE 265



 Score = 31.1 bits (70), Expect = 2.7
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 7/140 (5%)

Query: 583 EIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA 642
            +AK+  E  + V+ +EKA+   K   A +    AL   K  G         T + K  A
Sbjct: 73  ALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTE-----EVTEEEKAAA 127

Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK-KKKEKS 701
           K K  A+       +  ++   +   + + ++  K  +K K  +   +KA    K+K   
Sbjct: 128 KAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAE 187

Query: 702 ESESDSGESEKEKKSKSKPK 721
             E  + E  +E+K+K+K K
Sbjct: 188 AGE-GTEEVTEEEKAKAKAK 206


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 44.5 bits (105), Expect = 2e-04
 Identities = 30/184 (16%), Positives = 63/184 (34%), Gaps = 8/184 (4%)

Query: 584 IAKKGGELWKTVSSKEKAEYD-----EKVNKAKEDYNEALKAYKE-SGGGQDSDDGKTSK 637
             K    L+ ++   +K         + ++   +D  + L    E   G   S +  T  
Sbjct: 58  EGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDD 117

Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD--EDSKAKTK 695
                            +  ++ +F + + S+S +D   K S          E+ K +T+
Sbjct: 118 DSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTE 177

Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
              +     +D  E+  +   KSK       + S  + +  D E  E  ++   +  A +
Sbjct: 178 NTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYT 237

Query: 756 DASS 759
            A+S
Sbjct: 238 KATS 241



 Score = 34.5 bits (79), Expect = 0.25
 Identities = 37/185 (20%), Positives = 66/185 (35%), Gaps = 14/185 (7%)

Query: 392 FEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPD 451
             + L+S  L   S++E  E     D      LR +         Y     +        
Sbjct: 25  LRLGLRSNYLQ--STVEDIEGPSAVDERTSGVLRDEGKHANIL--YNSILCNQKKHASFL 80

Query: 452 AYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEG 511
                 +R++ + DD+D+ D     DED   +    + + + DS     D DED+D    
Sbjct: 81  N-----QRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDAD---- 131

Query: 512 TKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKK-KDKNAPKKPMSAYMMWFNDMRE 570
           T+    + +      S   +  +  K+S   S+  K  ++K    +    +  + ND  E
Sbjct: 132 TQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEE 191

Query: 571 KIKKD 575
              KD
Sbjct: 192 AAAKD 196



 Score = 31.8 bits (72), Expect = 1.6
 Identities = 20/111 (18%), Positives = 37/111 (33%), Gaps = 12/111 (10%)

Query: 665 SNGSSSDSDD-------DKKASSKRKRDS----DEDSKAKTKKKKEKSESESDSGESEKE 713
           S  SS+D D        D+ A ++   D     D+D  + ++  +   +S + S   EK 
Sbjct: 110 SGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSA-EKL 168

Query: 714 KKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
            + K     +  K  +   D  +    D     +             +D D
Sbjct: 169 IEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGD 219



 Score = 30.6 bits (69), Expect = 3.7
 Identities = 17/119 (14%), Positives = 38/119 (31%), Gaps = 12/119 (10%)

Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
            +       H      F+    S  D DDD+        + DED        +  ++ +S
Sbjct: 67  NSILCNQKKHAS----FLNQRKSLDDDDDDEFDFL---YEDDEDDAGNATSGESSTDDDS 119

Query: 706 DSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
                ++++ + ++   D        + + DD    +  +    S +  S     E+  
Sbjct: 120 LLELPDRDEDADTQANND-----QTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKK 173


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 44.3 bits (104), Expect = 2e-04
 Identities = 47/169 (27%), Positives = 76/169 (44%), Gaps = 30/169 (17%)

Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKK-KETASTAVSPHK 656
           K+ A  D + +   ED   + +    S     S+D K   +KKP KK KE A+   SP K
Sbjct: 23  KQTASPDGRASPTNEDQRSSGR-NSPSAASTSSNDSKAESTKKPNKKIKEEAT---SPLK 78

Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
             +K   E    +SD+++ ++ ++K         K+KT++    +      GE E E   
Sbjct: 79  -STKRQREK--PASDTEEPERVTAK---------KSKTQELSRPNSPSEGEGEGEGE--- 123

Query: 717 KSKPKGDKSKSKSKKED-EIDDEEYDEPVESTPPS----SDAGSDASSS 760
                G+ S S+S  E+   D ++ D+   S+ PS     D  SD+ SS
Sbjct: 124 -----GESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSS 167



 Score = 38.5 bits (89), Expect = 0.013
 Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 1/128 (0%)

Query: 594 TVSSKEKAEYDEKVNKA-KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
           T S+  KAE  +K NK  KE+    LK+ K       SD  +  +      K +  S   
Sbjct: 52  TSSNDSKAESTKKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPN 111

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
           SP + + +   E   S S S +++ +S  +  D D  S + +    + +ES+SDS   ++
Sbjct: 112 SPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQ 171

Query: 713 EKKSKSKP 720
             + +  P
Sbjct: 172 LLQPQGPP 179


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 43.7 bits (103), Expect = 4e-04
 Identities = 50/283 (17%), Positives = 91/283 (32%), Gaps = 51/283 (18%)

Query: 504  EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMM 563
            ED+ A E  +K ++  +R     ++K  +    K +   +TK+  + +   +   S+ M 
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230

Query: 564  WFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
              N       K            + G   K  ++ ++ E ++++     D  + L AY  
Sbjct: 1231 TENVAEVVKPK-----------GRAGAKKKAPAAAKEKEEEDEI----LDLKDRLAAYNL 1275

Query: 624  SGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDD-------- 675
                  S   + +    PA++       ++   V S         S D DDD        
Sbjct: 1276 DSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISD--------SDDDDDDFAVEVSLA 1327

Query: 676  ----KKASSKRKRDSDEDSKAKTKKKKE----------------KSESESDSGESEKEKK 715
                KK   K    + + +K     KK                 K          +K +K
Sbjct: 1328 ERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRK 1387

Query: 716  SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDAS 758
             ++ P   KS S   +     + E  E V  +  S     D S
Sbjct: 1388 MRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVS 1430



 Score = 39.1 bits (91), Expect = 0.010
 Identities = 47/268 (17%), Positives = 89/268 (33%), Gaps = 21/268 (7%)

Query: 456  RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
            +V R+A ++       +  S  E        S +  E  +          +        K
Sbjct: 1196 KVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAK 1255

Query: 516  KKEKSRKTITISEKPRKSKKDKESSGGSTKRK------KKDKNAPKKPMSAYMMWFNDMR 569
            +KE+  + + + ++      D   +  +   +       +   A KKP+++  +  +   
Sbjct: 1256 EKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDD 1315

Query: 570  EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYD---------EKVNKAKEDYNEALKA 620
            +             + KKGG   K  ++ +KA              V   ++   E LK 
Sbjct: 1316 DDDDFAVEVSLAERLKKKGGR--KPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKP 1373

Query: 621  YKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE-SNGSSSDSDDDKKAS 679
             +  G    S + K  K +     K++ S        K  E  E  +GSSS   D+   S
Sbjct: 1374 AEAIG---ISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVS 1430

Query: 680  SKRKRDSDEDSKAKTKKKKEKSESESDS 707
            +K +       +        +SES  DS
Sbjct: 1431 AKPRPQRANRKQTTYVLSDSESESADDS 1458


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 43.4 bits (102), Expect = 5e-04
 Identities = 61/343 (17%), Positives = 129/343 (37%), Gaps = 37/343 (10%)

Query: 438  KEDFGSSDDEKEPDAY-----LARVKREA--AERDDEDDEDEDESTDE---DFNPDQAES 487
            +ED  +  D  E +A      +  V +++  +E ++ D E+E++  DE   D   D + S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNS 3902

Query: 488  ------DVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
                  D   E D   T   S+E S A+  +    KE   K +   ++  K  +++ S  
Sbjct: 3903 LNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962

Query: 542  GSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKA 601
                 + +                 D+ E +K D          +K G+      SK+  
Sbjct: 3963 VGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLD----------EKEGD-----VSKDSD 4007

Query: 602  EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKE 661
              D  +  A E+  EA     E    QD D  + + +     +++  S      +  +++
Sbjct: 4008 LEDMDMEAADENKEEADAEKDEPM--QDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNED 4065

Query: 662  FIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
              E N   ++   +    S  + +  E  + +      K +++S    +E ++++  K  
Sbjct: 4066 GFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGI 4125

Query: 722  GDKSKSKSKKED----EIDDEEYDEPVESTPPSSDAGSDASSS 760
              +++   +++        D E+++  E T    +A S+A   
Sbjct: 4126 VGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQ 4168



 Score = 38.8 bits (90), Expect = 0.014
 Identities = 57/330 (17%), Positives = 121/330 (36%), Gaps = 37/330 (11%)

Query: 443  SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
              + +   D+ L  +  EAA      DE+++E+  E   P Q E D  EE ++       
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAA------DENKEEADAEKDEPMQDE-DPLEENNTLDEDIQQ 4049

Query: 503  DEDSDASEGTKKKKKEKSRKTITISEKPRK--SKKDKESSGGSTKRKKKDKNAPKKPMSA 560
            D+ SD +E  +K  ++   + +  +E+  +   K D+E   G     +   N PK    +
Sbjct: 4050 DDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKS 4109

Query: 561  YMMWFNDMREKIKKDNP----GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNE 616
                      +  ++N          E+ ++ G      +  E  +  E  +  KE  +E
Sbjct: 4110 -----TFASAEADEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE 4164

Query: 617  ALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEF--IESNGS------ 668
            A + Y+  G     D  +  +      + E  + + S       EF  ++ +        
Sbjct: 4165 ADRQYQSLG-----DHLREWQQANRIHEWEDLTESQS-QAFDDSEFMHVKEDEEEDLQAL 4218

Query: 669  -SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKS 727
             +++ D  K        + + DS   T   +++++   D    + +  S  K  G+ +  
Sbjct: 4219 GNAEKDQIKSIDRDESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLP 4278

Query: 728  ----KSKKEDEIDDEEYDEPVESTPPSSDA 753
                   + +++ +   DE +E   P  + 
Sbjct: 4279 TEFGSINQSEKVFELSEDEDIEDELPDYNV 4308



 Score = 32.7 bits (74), Expect = 0.98
 Identities = 17/100 (17%), Positives = 43/100 (43%), Gaps = 8/100 (8%)

Query: 665  SNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDK 724
            +N   + +  D   S  R+ +SD +   K     E   S+S+    + +++    P+   
Sbjct: 3841 ANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLS 3900

Query: 725  SKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
                    + ++++ +DEP E     ++  S+  S+ +++
Sbjct: 3901 --------NSLNEKLWDEPNEEDLLETEQKSNEQSAANNE 3932


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 43.1 bits (101), Expect = 7e-04
 Identities = 49/274 (17%), Positives = 112/274 (40%), Gaps = 27/274 (9%)

Query: 502 SDEDSDASEGT-KKKKKEKSRKTITISEKPRKSKKDKESSG-----------GSTKRKKK 549
           S  D   +E T ++  +E  R T    E   +S  + E  G           G    ++K
Sbjct: 631 SKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690

Query: 550 DKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEK-------AE 602
            +   +  + A         E  + ++ G +  E  +  GE+      +E        AE
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750

Query: 603 YDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK-- 660
              +V    +      +   E+ G +D D+G+    +    K +  +     H+ +++  
Sbjct: 751 GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAG 810

Query: 661 ---EFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
              E    + + +D  + K  + +++ +++   +AK  +K       SD G+SE+E++ +
Sbjct: 811 EKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEE 870

Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
            +   ++ + + ++E+E ++EE +EP+    P +
Sbjct: 871 EE---EEEEEEEEEEEEEEEEENEEPLSLEWPET 901



 Score = 32.7 bits (74), Expect = 0.92
 Identities = 23/161 (14%), Positives = 61/161 (37%), Gaps = 13/161 (8%)

Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH------KVKSKEFI 663
           A  D ++   A  E  G +  ++G+     +    +E+   A          + +S+  I
Sbjct: 626 ALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEI 685

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
            +        + +  + +     + +++    + + ++E   D GE E  ++ +      
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745

Query: 724 KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
           + +++ K E E + +  +   E         ++A   ED+D
Sbjct: 746 EGEAEGKHEVETEGDRKETEHE-------GETEAEGKEDED 779



 Score = 29.6 bits (66), Expect = 7.9
 Identities = 28/180 (15%), Positives = 70/180 (38%), Gaps = 8/180 (4%)

Query: 586 KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKK 645
           + GGE  +   ++ K E     N+++ +     K  +E  G  ++ +       +  + +
Sbjct: 662 ESGGEAEQEGETETKGE-----NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716

Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
               T     + + +      G   + + + +A  K + +++ D K    + + ++E + 
Sbjct: 717 HEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE 776

Query: 706 DSGESEKEKKSKSKPKGDK---SKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
           D  E E +     + KGD+    K + + E E  +++  E    T        D +  ++
Sbjct: 777 DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQE 836


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 42.2 bits (99), Expect = 8e-04
 Identities = 29/148 (19%), Positives = 54/148 (36%), Gaps = 4/148 (2%)

Query: 606 KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIES 665
           K  ++K   +EA+K  +   GG      KT  +K+P  +             + K+  + 
Sbjct: 65  KCAESKLSSDEAVK--RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKK--KK 120

Query: 666 NGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKS 725
                +   D+K   + K       K K K+KK +   + +  +  +  ++KS+PK    
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180

Query: 726 KSKSKKEDEIDDEEYDEPVESTPPSSDA 753
           K    K+ E  +EE              
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKP 208



 Score = 40.6 bits (95), Expect = 0.003
 Identities = 32/152 (21%), Positives = 58/152 (38%), Gaps = 4/152 (2%)

Query: 595 VSSKEKAEYDEKVNKAKEDYNEAL-KAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVS 653
            S     E  ++V K       A  K  KE       ++ K  +  K  KKK+       
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127

Query: 654 PHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKK--EKSESESDSGESE 711
           P   K KE  +      + + +K+   +  RD +E+ K +  + K   K   +      +
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187

Query: 712 KEKKSKSKPKGDKSKS-KSKKEDEIDDEEYDE 742
           KE   + K +    ++ K K E+   +EE ++
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219



 Score = 39.1 bits (91), Expect = 0.009
 Identities = 22/129 (17%), Positives = 48/129 (37%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           ++  S   A K+     +  P           N S  + + +K+   + K+   E  K +
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
            K +K K E++      EKEK+ + K +  + + + KK + +  +   +      P +  
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187

Query: 754 GSDASSSED 762
                  + 
Sbjct: 188 KEPPEEEKQ 196



 Score = 35.6 bits (82), Expect = 0.088
 Identities = 39/271 (14%), Positives = 92/271 (33%), Gaps = 13/271 (4%)

Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSK-KDKESSGGSTKRKKK 549
           E  + +    + +++    E  KKK+K K        ++  K K   KE      K+ ++
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156

Query: 550 DKNAPKKPMSAYMMWFN--DMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAE-YDEK 606
            ++  ++     +   +      K K  N      E  K+     + V  K +    +E+
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE 216

Query: 607 VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESN 666
             K ++D  +           +     + S+    + KK   S +++  + +        
Sbjct: 217 REKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETR-------- 268

Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
             SS   + +  +S R   +   S      + ++ E  +   +++   K  S    +  K
Sbjct: 269 -ESSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQGVGKIVSNVILEGKK 327

Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDA 757
           S+ + ++    E   +  +      D   D 
Sbjct: 328 SEDEDDENFVVEAAAQAPDIVAGGEDEAEDG 358



 Score = 32.6 bits (74), Expect = 0.99
 Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 7/155 (4%)

Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
           K  ++K K   + K    KE+     +  KE    +     +  K +   +K +  +   
Sbjct: 86  KGPAAKTKPAKEPKNESGKEE-----EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK--TKKKKEKSESESDSGES 710
            P K K KE  +      D +++KK    R +   +    K    KKKE  E E     +
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200

Query: 711 EKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
            +  K K +      + + +++D  D E    P+E
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDDGKDRETTTSPME 235



 Score = 32.2 bits (73), Expect = 1.0
 Identities = 42/257 (16%), Positives = 78/257 (30%), Gaps = 33/257 (12%)

Query: 502 SDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAY 561
           S   SD +    +K   K     T   K  K++  KE      + K++ K   +KP    
Sbjct: 69  SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE-- 126

Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
                    K +K                  +    +   E +++  K  E+  +  +  
Sbjct: 127 -------EPKDRKPKE---------------EAKEKRPPKEKEKEKEKKVEEPRDREEE- 163

Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSK 681
           K+    +     K    KKP  KK+         K +        G   + D +++   +
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPE---EEKQRQAAREAVKGKPEEPDVNEEREKE 220

Query: 682 RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
                 +D K +        E ES        + S S  K D S S +  E     +  +
Sbjct: 221 E-----DDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275

Query: 742 EPVESTPPSSDAGSDAS 758
               ++     A   ++
Sbjct: 276 TRPRTSLRPPSARPASA 292


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 42.4 bits (100), Expect = 9e-04
 Identities = 53/290 (18%), Positives = 93/290 (32%), Gaps = 45/290 (15%)

Query: 457 VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDAS------- 509
            K  + E +DED + E+E  D+D + D  E+               D++ ++        
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFM 379

Query: 510 EGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMR 569
           +  + +KKE++   I   E+ R+  + +E S      +   KN  ++             
Sbjct: 380 QRAEARKKEENDAEI---EELRRELEGEEESDEEENEEPSKKNVGRRKFGP--------- 427

Query: 570 EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQD 629
                                  +  S K K E   +  + KE          E    +D
Sbjct: 428 ------------------ENGEKEAESKKLKKENKNEFKEKKES--------DEEEELED 461

Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
            ++ K  K      K+   +      +   +E      +SS     KK  SK+K  S  D
Sbjct: 462 EEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLD 521

Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
             A    K      +    E   +       + D  K     E++ DDEE
Sbjct: 522 KAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEE 571



 Score = 33.5 bits (77), Expect = 0.47
 Identities = 22/147 (14%), Positives = 53/147 (36%), Gaps = 3/147 (2%)

Query: 413 GKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDE 472
            +  +   E++    +     +  +  + G  + E +      + + +  +  DE++E E
Sbjct: 401 LEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELE 460

Query: 473 DESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
           DE   E+   ++  + + +  +      + +E  + +   K             S+K   
Sbjct: 461 DE---EEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSS 517

Query: 533 SKKDKESSGGSTKRKKKDKNAPKKPMS 559
           SK DK ++  S    K  K   K+   
Sbjct: 518 SKLDKAANKISKAAVKVKKKKKKEKSI 544



 Score = 32.7 bits (75), Expect = 0.96
 Identities = 36/191 (18%), Positives = 74/191 (38%), Gaps = 15/191 (7%)

Query: 582 TEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKP 641
            E  ++G EL + +  K  +E DE  +   E+        ++     D  +      KK 
Sbjct: 305 EEQLRQGEELRRKIEGKSVSEEDEDEDSDSEE------EDEDDDEDDDDGENPWMLRKK- 357

Query: 642 AKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK-----KASSKRKRDSDEDS---KAK 693
             K +          + S +F++   +    ++D      +   + + +SDE+     +K
Sbjct: 358 LGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK 417

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
               + K   E+   E+E +K  K      K K +S +E+E++DEE  +  +        
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477

Query: 754 GSDASSSEDDD 764
              A   E+++
Sbjct: 478 SEKAQKEEEEE 488


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 41.2 bits (97), Expect = 9e-04
 Identities = 27/113 (23%), Positives = 43/113 (38%), Gaps = 15/113 (13%)

Query: 460 EAAERDDEDDED----EDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEG---- 511
           E    +DE        E+E  DE+F  ++ E +   + D + +  D  E  D  EG    
Sbjct: 19  EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78

Query: 512 -------TKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
                   KK+ K K+ K  T  +K +     K     + + KKK +     P
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 42.3 bits (100), Expect = 0.001
 Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 8/100 (8%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
            + K    +     +P+  ++FG    E++ +      +    + D+EDD+D+ E  +ED
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEE----EEEDGVDDEDEEDDDDDLEEEEED 348

Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEK 519
            +    E D  +E   +    + DE+ +  E  KKKKK  
Sbjct: 349 VDLSDEEEDEEDEDSDD----EDDEEEEEEEKEKKKKKSA 384



 Score = 40.4 bits (95), Expect = 0.004
 Identities = 59/301 (19%), Positives = 102/301 (33%), Gaps = 65/301 (21%)

Query: 462 AERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
           +E D +DD  +D+  D+D   D A  D A  +         D++ D  E  ++KK   S+
Sbjct: 124 SEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGG-----GEDDEEDEEEQPERKK---SK 175

Query: 522 KTITISEKPRKSK-------KDKE-----------------SSGGSTKRKKKDKNAPKKP 557
           K + + E   KSK       K KE                 S   + K   K    P++ 
Sbjct: 176 KEV-MKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK 234

Query: 558 MSAYMMWFN----DMR----EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNK 609
              Y         D R    ++ K +       E+AK+  E        +K E  E++ +
Sbjct: 235 DDEYDQRVRELTFDRRAQPTDRTKTEE------ELAKEEAE------RLKKLE-AERLRR 281

Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSS 669
            + +  +  +         D DD                           +E    +   
Sbjct: 282 MRGEEEDDEEEEDSKESADDLDDEFEPDDDD--NFGLGQGEEDEE----EEEDGVDDEDE 335

Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
            D DDD +       + D D   + + ++++   + D  E E+E+K K K K  +S    
Sbjct: 336 EDDDDDLEEE-----EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390

Query: 730 K 730
            
Sbjct: 391 L 391



 Score = 40.4 bits (95), Expect = 0.004
 Identities = 32/115 (27%), Positives = 58/115 (50%), Gaps = 4/115 (3%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDA----YLARVKREAAERDDEDDEDEDES 475
           R +++R +     E+ + KE     DDE EPD      L + + +  E +D  D++++E 
Sbjct: 278 RLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEED 337

Query: 476 TDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKP 530
            D+D   ++ + D+++E +        DED +  E  +K+KK+K     T SE P
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELP 392


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 41.4 bits (97), Expect = 0.002
 Identities = 15/78 (19%), Positives = 34/78 (43%)

Query: 687 DEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVES 746
            E    K K KK+  +S+    +   +K  + + + +      +  ++  D+E +E  +S
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDS 201

Query: 747 TPPSSDAGSDASSSEDDD 764
              S   G    SS++++
Sbjct: 202 EDYSQYDGMLVDSSDEEE 219



 Score = 35.2 bits (81), Expect = 0.14
 Identities = 32/128 (25%), Positives = 53/128 (41%), Gaps = 3/128 (2%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDD--EDEDESTD 477
            +K    K T K +K   KE     D+E+  ++       E+AE D +D+  ED D    
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEES-ESEDESKSEESAEDDSDDEEEEDSDSEDY 204

Query: 478 EDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDK 537
             ++    +S   EE +  P+   +++ S++       +  +SR      E    SKK K
Sbjct: 205 SQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPK 264

Query: 538 ESSGGSTK 545
           E    ST 
Sbjct: 265 EKKTSSTF 272



 Score = 34.0 bits (78), Expect = 0.29
 Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 20/166 (12%)

Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
           K     +K++  E    + +D  E  ++  ES   + ++D    + ++ +  ++ +    
Sbjct: 150 KAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDG 209

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
                          SS + + ++  S     D+ E    ++  +  +S S SDS ES  
Sbjct: 210 -----------MLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSP 258

Query: 713 EKKSKSKPKGDKSK---------SKSKKEDEIDDEEYDEPVESTPP 749
             K   + K   +          S S+ ED+ D++   + V   P 
Sbjct: 259 PSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPV 304



 Score = 33.7 bits (77), Expect = 0.33
 Identities = 15/109 (13%), Positives = 33/109 (30%)

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
             K K   ++  S      +       +    ED     +  ++ S+ E +     ++  
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYS 205

Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
                  D S  +  +E    +   D     +  S    S++ S  D +
Sbjct: 206 QYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254



 Score = 30.2 bits (68), Expect = 4.5
 Identities = 24/120 (20%), Positives = 45/120 (37%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           +T   K  AKKK   S      +   K+  E + S  +S  ++ A      + +EDS ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
              + +    +S   E  +E  S +  +        + + EI +       E + P S  
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKK 262


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 20/68 (29%), Positives = 33/68 (48%)

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
              GS S++ + +  +   K    E+ + K KKKK++ + E    + +KEK  + K    
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185

Query: 724 KSKSKSKK 731
           K K K KK
Sbjct: 186 KKKKKKKK 193



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 19/66 (28%), Positives = 36/66 (54%)

Query: 668 SSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKS 727
             S+S+  +K ++ +     E  + + K+KK+K E + +  E + +K+   +PKG K K 
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187

Query: 728 KSKKED 733
           K KK+ 
Sbjct: 188 KKKKKK 193



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 20/70 (28%), Positives = 39/70 (55%)

Query: 482 PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
           PD   S++  E +++   T +  + +A    ++KK++K +K +   +K +K KK+K    
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180

Query: 542 GSTKRKKKDK 551
             +K+KKK K
Sbjct: 181 KGSKKKKKKK 190



 Score = 32.4 bits (74), Expect = 0.55
 Identities = 15/63 (23%), Positives = 26/63 (41%)

Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
               D            S++++ AK +K+ E  E E    + +KE K + K K DK +  
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177

Query: 729 SKK 731
            + 
Sbjct: 178 VEP 180


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 41.1 bits (97), Expect = 0.002
 Identities = 18/78 (23%), Positives = 38/78 (48%)

Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
             + +  K KK  EK+E + +  + EK+KK+ +  K ++ + + K++ E + EE +E  E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 746 STPPSSDAGSDASSSEDD 763
                 +      ++  D
Sbjct: 464 EEKEEEEEKKKKQATLFD 481



 Score = 39.1 bits (92), Expect = 0.007
 Identities = 12/79 (15%), Positives = 39/79 (49%)

Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           ++++ +  +  K+ + +  K   K +K++ E + +  +     K K + + ++ + K ++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 732 EDEIDDEEYDEPVESTPPS 750
           ++E ++E  +E  E     
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473



 Score = 37.2 bits (87), Expect = 0.031
 Identities = 12/77 (15%), Positives = 38/77 (49%)

Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
           +  EF+  +  ++         +++KR+ ++  K K     +K E E +  + +KE++ +
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456

Query: 718 SKPKGDKSKSKSKKEDE 734
            + +  + + + ++E +
Sbjct: 457 EEEEEAEEEKEEEEEKK 473



 Score = 33.7 bits (78), Expect = 0.42
 Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 5/75 (6%)

Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPP 749
           SK  TKK K+  E      E EK++K K    G     K K+E+E +++E  E  +    
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAG-----KKKEEEEEEEKEKKEEEKEEEE 459

Query: 750 SSDAGSDASSSEDDD 764
                      E   
Sbjct: 460 EEAEEEKEEEEEKKK 474



 Score = 32.6 bits (75), Expect = 0.85
 Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 12/83 (14%)

Query: 637 KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK 696
            SKK  KK           K + +E  E           KK +   K+  +E+ + K KK
Sbjct: 404 GSKKATKKI--KKIVEKAEKKREEEKKE----------KKKKAFAGKKKEEEEEEEKEKK 451

Query: 697 KKEKSESESDSGESEKEKKSKSK 719
           ++EK E E ++ E ++E++ K K
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 31.4 bits (72), Expect = 1.9
 Identities = 13/70 (18%), Positives = 34/70 (48%)

Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
                 KK     ++   +  + K +KKK+    +    E E+EK+ K + K ++ +   
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 730 KKEDEIDDEE 739
           ++++E ++++
Sbjct: 464 EEKEEEEEKK 473


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 15/82 (18%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           S    +A++  +     D +   + ++ K E +  +   +KE K  ++ + ++S+ ++++
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 732 EDEIDDEEYDEPVESTPPSSDA 753
           EDE   +E +E        S+ 
Sbjct: 98  EDEESSDE-NEKETEEKTESNV 118



 Score = 37.4 bits (87), Expect = 0.014
 Identities = 16/94 (17%), Positives = 41/94 (43%), Gaps = 3/94 (3%)

Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
           S     + + +   KK          +VK +E      ++S+  +DK  + K   +S+E+
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEE---KEAANSEDKEDKGDAEKEDEESEEE 94

Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
           ++ + ++  +++E E++       +K  + P   
Sbjct: 95  NEEEDEESSDENEKETEEKTESNVEKEITNPSWK 128



 Score = 36.6 bits (85), Expect = 0.024
 Identities = 16/90 (17%), Positives = 44/90 (48%), Gaps = 4/90 (4%)

Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
            S +       +  SDD + A  +  ++ ++++ A ++ K++K ++E +  ESE+E +  
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEA-ANSEDKEDKGDAEKEDEESEEENE-- 96

Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
            +   + S    K+ +E  +   ++ + + 
Sbjct: 97  -EEDEESSDENEKETEEKTESNVEKEITNP 125



 Score = 29.3 bits (66), Expect = 5.9
 Identities = 18/78 (23%), Positives = 32/78 (41%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
           ++ DE+E      +   E  E  +E+ E  +    ED    + E + +EE +       S
Sbjct: 44  AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103

Query: 503 DEDSDASEGTKKKKKEKS 520
           DE+   +E   +   EK 
Sbjct: 104 DENEKETEEKTESNVEKE 121



 Score = 29.3 bits (66), Expect = 6.6
 Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 427 KNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
                  +   +E+  +++ E + D   A  + E +E ++E++++E    +E    ++ E
Sbjct: 56  DQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115

Query: 487 SDVAEEY---DSNPTPTDSDEDSDAS 509
           S+V +E       P  T+       +
Sbjct: 116 SNVEKEITNPSWKPVGTEQTGPHAMT 141


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.9 bits (93), Expect = 0.005
 Identities = 34/168 (20%), Positives = 64/168 (38%), Gaps = 11/168 (6%)

Query: 582 TEIAKKGGELWKTVSSKEKAEYDE----KVNKAKEDYNEALKAYKESGGGQDSDDGKTSK 637
           TE  K GG   K +  K+    DE    + +K  ED +E     K+    ++       K
Sbjct: 200 TEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDK 259

Query: 638 SKKPAKKKETASTAVSPHKVKSKEFI---ESNGSSSDSDDDKKASSKRKRDSDEDSKAKT 694
             K     +           + +E     +S+ S +D ++ +   S       E  + + 
Sbjct: 260 KGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDED 319

Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
            ++ E+ ++E + G S+K K    K K  K K     +D+ D  +  +
Sbjct: 320 SEESEEEKNEEEGGLSKKGK----KLKKLKGKKNGLDKDDSDSGDDSD 363



 Score = 37.2 bits (86), Expect = 0.036
 Identities = 27/181 (14%), Positives = 69/181 (38%), Gaps = 11/181 (6%)

Query: 592 WKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTA 651
           +KT++++E  E  ++  K    +   +    ++G     D+ + ++ +K    +      
Sbjct: 156 YKTLTAEEAEEEMKRRKKTANGFQLMMMKAAKNGPAAFGDEDEETEGEKGGGGR---GKD 212

Query: 652 VSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS--------ES 703
           +    ++  +  + + S    +D  +  SK+K+     +K K    K+          E 
Sbjct: 213 LKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEY 272

Query: 704 ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDD 763
           +SD G+ E  ++              ++ED++  E   +P       S+   +  + E+ 
Sbjct: 273 DSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332

Query: 764 D 764
            
Sbjct: 333 G 333



 Score = 36.9 bits (85), Expect = 0.039
 Identities = 61/310 (19%), Positives = 99/310 (31%), Gaps = 48/310 (15%)

Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
             K   A   DED+E E E        D    D+  + + +   +D   +    E +KKK
Sbjct: 185 AAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKK 244

Query: 516 KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKD 575
           KK+ +        K +K   D +       ++  D +A +             RE+    
Sbjct: 245 KKKLA--------KNKKKLDDDKK-----GKRGGDDDADEYDSDDGDDE---GREEDYIS 288

Query: 576 NPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKT 635
           +   S  +              +E     E   K + + +E  +  +E     + + G +
Sbjct: 289 DSSASGND-----------PEEREDKLSPEIPAKPEIEQDEDSEESEEE--KNEEEGGLS 335

Query: 636 SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK 695
            K KK  K K   +                +   SDS DD   S     DS     AK +
Sbjct: 336 KKGKKLKKLKGKKN--------------GLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQ 381

Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEP-----VESTPPS 750
           K+ +K E    +  S             K K K K  +E+       P      E+ P S
Sbjct: 382 KEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTENAPKS 441

Query: 751 SDAGSDASSS 760
           S   S   + 
Sbjct: 442 SSGKSTPQTF 451



 Score = 30.7 bits (69), Expect = 3.6
 Identities = 39/191 (20%), Positives = 62/191 (32%), Gaps = 14/191 (7%)

Query: 587 KGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK--------TSKS 638
           KGGE      SK+K +   K  K  +D  +  +   +     DSDDG          S S
Sbjct: 231 KGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDS 290

Query: 639 KKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK-----KASSKRKRDSDEDSKAK 693
                  E     +SP    +K  IE +  S +S+++K       S K K+      K  
Sbjct: 291 SASGNDPEEREDKLSPEIP-AKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
              K +    +         + S S     K K   K+E    +           PS ++
Sbjct: 350 GLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPES 409

Query: 754 GSDASSSEDDD 764
                    ++
Sbjct: 410 KDKGKRKAANE 420



 Score = 29.2 bits (65), Expect = 8.8
 Identities = 27/120 (22%), Positives = 48/120 (40%), Gaps = 4/120 (3%)

Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
           K +K   K      DD+K           +A E D +D +DE    D   +   + +D  
Sbjct: 243 KKKKKLAKNKKKLDDDKKGKRG----GDDDADEYDSDDGDDEGREEDYISDSSASGNDPE 298

Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
           E  D       +  + +  E +++ ++EK+ +   +S+K +K KK K    G  K     
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 39.8 bits (93), Expect = 0.005
 Identities = 32/125 (25%), Positives = 46/125 (36%), Gaps = 5/125 (4%)

Query: 631 DDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDS 690
           D     K+     KK  AS   +       EF++    S  +   KK        S   +
Sbjct: 6   DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEA----SRPGT 61

Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
               KK  + +ES + S E +  K  KS      S  KSK      +EE +EP E+  P 
Sbjct: 62  PRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE-EEPEETPDPI 120

Query: 751 SDAGS 755
           +  G 
Sbjct: 121 ASIGG 125



 Score = 38.2 bits (89), Expect = 0.013
 Identities = 27/111 (24%), Positives = 40/111 (36%), Gaps = 11/111 (9%)

Query: 661 EFIESNGSSSDSDDDKK--ASSKRKRDSDEDS-------KAKTKKKKEKSESESDSGESE 711
           EF++S           K   +SK   + DED        +   K K  K   E+    + 
Sbjct: 3   EFLDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTP 62

Query: 712 KEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
           +  K  SKP    + S  +K  +       E   S+ P S A S  S  E+
Sbjct: 63  RNPKKSSKPTESSAASSEEKPAKPRKSA--ESTRSSHPKSKAPSTESEEEE 111



 Score = 30.5 bits (69), Expect = 3.6
 Identities = 30/119 (25%), Positives = 39/119 (32%), Gaps = 11/119 (9%)

Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPT 498
           E   S  DEK P       K +A++   EDDED  E  DE             E    P 
Sbjct: 3   EFLDSLPDEKAPTK--KPKKGDASKDSTEDDEDILEFLDEL---------EQSEKAKPPK 51

Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
                         KK  K       +  EKP K +K  ES+  S  + K      ++ 
Sbjct: 52  KPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE 110


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 39.4 bits (92), Expect = 0.006
 Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 6/129 (4%)

Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED-- 689
           +GK   SK P           S  K +      +   +      K AS KR      +  
Sbjct: 159 NGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIM 218

Query: 690 ---SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVES 746
               K KTK+KKEK E+   + + E E++S  +         ++     +DE+ DEP  S
Sbjct: 219 SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV-ILEDESAEPTGLDEDEDEDEPKPS 277

Query: 747 TPPSSDAGS 755
              S     
Sbjct: 278 GERSDSEEE 286



 Score = 38.3 bits (89), Expect = 0.014
 Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 6/115 (5%)

Query: 637 KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK 696
             KK  +KKE    + S  K +S+E  ES       +D+    +    D DED     K 
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEE--ESGKRDVILEDESAEPTGLDEDEDEDEP---KP 276

Query: 697 KKEKSESESDSGESEKEK-KSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
             E+S+SE ++ E EKEK K   K   D+ + +  +       E +E  E  PP 
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPP 331



 Score = 32.9 bits (75), Expect = 0.61
 Identities = 19/66 (28%), Positives = 31/66 (46%)

Query: 457 VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKK 516
            K  +     E+ E+E    D     + AE    +E +    P  S E SD+ E T++K+
Sbjct: 232 KKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKE 291

Query: 517 KEKSRK 522
           KEK ++
Sbjct: 292 KEKRKR 297



 Score = 32.9 bits (75), Expect = 0.61
 Identities = 28/153 (18%), Positives = 61/153 (39%), Gaps = 10/153 (6%)

Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH----------KVKSKEFIESNGSSSD 671
           K +   QD+    T++  +     + AS   +P           K K+KE  E   +S  
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238

Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           +  ++      KRD   + ++      ++ E E +   S +   S+ + +  + + + + 
Sbjct: 239 TVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRL 298

Query: 732 EDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
           +  ++DE+ DE +E  P S     ++   E   
Sbjct: 299 KKMMEDEDEDEEMEIVPESPVEEEESEEPEPPP 331



 Score = 32.5 bits (74), Expect = 0.78
 Identities = 24/121 (19%), Positives = 47/121 (38%), Gaps = 11/121 (9%)

Query: 633 GKTSKSKKPAKKKETA-STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
               K     K+  +    ++   +VK K       S+  + D  K ++  K +     K
Sbjct: 151 SPALKPTANGKRPSSKPPKSIMSPEVKVK-------SAKKTQDTSKETTTEKTEGKTSVK 203

Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
           A + K+    +S   S   +K+ K K +    K  S+S  ++E ++E     V     S+
Sbjct: 204 AASLKRNPPKKSNIMSSFFKKKTKEKKE---KKEASESTVKEESEEESGKRDVILEDESA 260

Query: 752 D 752
           +
Sbjct: 261 E 261



 Score = 32.5 bits (74), Expect = 0.81
 Identities = 43/256 (16%), Positives = 93/256 (36%), Gaps = 30/256 (11%)

Query: 482 PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
             ++     +      T     + S  +   K+   +KS    +  +K  K KK+K+ + 
Sbjct: 177 KVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEAS 236

Query: 542 GSTKRKKKDKNAPKKPM------SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTV 595
            ST +++ ++ + K+ +      +       D  E   K +   S +E   +        
Sbjct: 237 ESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEE------- 289

Query: 596 SSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
             KEK +  +K+ + +++  E     +     ++S++ +     K  ++KE  + +    
Sbjct: 290 KEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGG 349

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
           + + +  +                 K+K   DE+    TKK  E  ES S+        K
Sbjct: 350 RRRGRRRVM----------------KKKTFKDEEGYLVTKKVYE-WESFSEDEAEPPPTK 392

Query: 716 SKSKPKGDKSKSKSKK 731
            K K       + +KK
Sbjct: 393 PKPKVSTPAVPAAAKK 408


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 39.3 bits (92), Expect = 0.006
 Identities = 28/110 (25%), Positives = 45/110 (40%), Gaps = 4/110 (3%)

Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
            G T KSK        ++  V   ++ S    E N  ++ S        K++  ++E  K
Sbjct: 13  SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTS----TKKDKKEDKNNESKK 68

Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
              KKKK+K E +    E E +   K+  K  K+K K  K    +D +  
Sbjct: 69  KSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118



 Score = 33.2 bits (76), Expect = 0.54
 Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 6/89 (6%)

Query: 491 EEYDSNPTPTDSDEDSDASEGTKKK-KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
           EE     T T  D+  D +  +KKK +K+K +K     +K  KS+ + +    + K+ KK
Sbjct: 45  EENKVATTSTKKDKKEDKNNESKKKSEKKKKKK---KEKKEPKSEGETKLGFKTPKKSKK 101

Query: 550 DKNAPKKPMSAYMMW--FNDMREKIKKDN 576
            K  P KP     +   FN + E  +K N
Sbjct: 102 TKKKPPKPKPNEDVDNAFNKIAELAEKSN 130


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 39.2 bits (91), Expect = 0.008
 Identities = 23/113 (20%), Positives = 41/113 (36%), Gaps = 9/113 (7%)

Query: 452 AYLARVKRE------AAERDDEDDEDEDESTDEDFNP-DQAESDVAEEYDSNPTPTDSDE 504
              ARV RE       AE   +D E EDE   +D  P +  + +  E        +   E
Sbjct: 184 QAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQE 243

Query: 505 DSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
                  ++  ++E  +      + P +S  D E+ G   +  +   +   +P
Sbjct: 244 SEATDRESESGEEEMVQ--SDQDDLPDESDDDSETPGEGARPARPFTSTGGEP 294



 Score = 29.1 bits (65), Expect = 9.1
 Identities = 23/97 (23%), Positives = 35/97 (36%), Gaps = 16/97 (16%)

Query: 430 GKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDE---------DDEDEDESTDEDF 480
           G  ++P   E     + E E     A  + EA +R+ E         D +D  + +D+D 
Sbjct: 215 GDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDS 274

Query: 481 -------NPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
                   P +  +    E D     T  DE  DA E
Sbjct: 275 ETPGEGARPARPFTSTGGEPDYKVFTTAFDETVDAEE 311


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 37.7 bits (88), Expect = 0.008
 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 674 DDKKASSKRKRDSDEDSKAKTKK-----KKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
           D +   +K+K+    +   K KK     +K K + +    + +K+K  K   K DKS+ K
Sbjct: 54  DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113

Query: 729 SKKEDEIDDEEYDEPVESTPPSSD 752
            +KE E   E+  +    T  +  
Sbjct: 114 DEKEAEDKLEDLTKSYSETLSTLS 137



 Score = 33.5 bits (77), Expect = 0.24
 Identities = 13/59 (22%), Positives = 28/59 (47%)

Query: 681 KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
           ++ +   + SK K  K K+K + + D    +K++K       D +KS S+    + + +
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140



 Score = 32.7 bits (75), Expect = 0.36
 Identities = 18/71 (25%), Positives = 28/71 (39%), Gaps = 1/71 (1%)

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
           KVK KE+ E           KK   K K   D+    K++KK EK   +     ++   +
Sbjct: 73  KVK-KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSE 131

Query: 716 SKSKPKGDKSK 726
           + S     K +
Sbjct: 132 TLSTLSELKPR 142


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 38.4 bits (89), Expect = 0.009
 Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 9/101 (8%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           KT++ KKPA KK T S   +  K  +K        ++ +   KKA+  +K  S + +  K
Sbjct: 11  KTTEEKKPAAKKATTSKETAKTKKTAK--------TTSTKAAKKAAKVKKTKSVKTTTKK 62

Query: 694 TKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
              K EK+ES + +S   +  KK     +  ++ +K  K  
Sbjct: 63  VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNT 103



 Score = 30.7 bits (69), Expect = 3.1
 Identities = 31/120 (25%), Positives = 45/120 (37%), Gaps = 13/120 (10%)

Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
           K  +++EK    +K   +KE       A   S          T  +KK AK K+T S   
Sbjct: 9   KKKTTEEKKPAAKKATTSKETAKTKKTAKTTS----------TKAAKKAAKVKKTKSVKT 58

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
           +  KV  K        S   +   K + K++  S E  +A  K  K  S+       SEK
Sbjct: 59  TTKKVTVKF---EKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 39.0 bits (91), Expect = 0.010
 Identities = 49/290 (16%), Positives = 88/290 (30%), Gaps = 34/290 (11%)

Query: 469 DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISE 528
           D+ + +ST +  + D  +S+  E   S  T +   + +D +  + +   +K   TI  S 
Sbjct: 28  DDPQKDSTAKTTSHDSKKSNDDET--SKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSST 85

Query: 529 KPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKG 588
               +  D            +     K                    +   S T + +  
Sbjct: 86  SDSNNIIDFIYKNLPQTNINQLLTKNKY-------------------DDNYSLTTLIQNL 126

Query: 589 GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
             L   +S  E+    EK        +++  + K     Q S   K    K P+      
Sbjct: 127 FNLNSDISDYEQPRNSEKSTNDSNKNSDS--SIKNDTDTQSSKQDKADNQKAPSSNNTKP 184

Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDS-----------DEDSKAKTKKK 697
           ST+              + S   SDD     S  K +            D+ S+   K +
Sbjct: 185 STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQ 244

Query: 698 KEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
           K+ +             K+   P  D+ K KSK     +++       ST
Sbjct: 245 KDYASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRST 294



 Score = 38.2 bits (89), Expect = 0.016
 Identities = 25/164 (15%), Positives = 46/164 (28%), Gaps = 30/164 (18%)

Query: 629 DSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRD--- 685
           D+D    + +       +   + +      S   I+    +    +  +  +K K D   
Sbjct: 59  DTDKADNNNTSNQ-DNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNY 117

Query: 686 ----------------SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
                           SD +    ++K    S   SDS          SK     ++   
Sbjct: 118 SLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP 177

Query: 730 KKEDEIDDEEYDEP----------VESTPPSSDAGSDASSSEDD 763
              +        +P            S P S D  +  SSS+D+
Sbjct: 178 SSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDN 221



 Score = 29.8 bits (67), Expect = 7.7
 Identities = 11/104 (10%), Positives = 32/104 (30%), Gaps = 3/104 (2%)

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
           ++    S   +D + S        + +       ++ ++ +  + +S     +       
Sbjct: 37  KTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIY 96

Query: 724 KSKSKSKKEDEIDDEEYDEP---VESTPPSSDAGSDASSSEDDD 764
           K+  ++     +   +YD+            +  SD S  E   
Sbjct: 97  KNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPR 140


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 38.8 bits (90), Expect = 0.011
 Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 443  SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
            S + E+E   Y A       E D+ D+++E + + ED + D++E+D ++E D        
Sbjct: 931  SDESEEEVSEYEAS---SDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELE 987

Query: 503  DEDSDASEGTKKKK 516
             + +  S   K++K
Sbjct: 988  SKAAYDSRPGKRRK 1001



 Score = 30.0 bits (67), Expect = 5.5
 Identities = 18/80 (22%), Positives = 36/80 (45%), Gaps = 7/80 (8%)

Query: 461  AAERDDEDDED-----EDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
                DDE DE      E E++ +D + +  E + ++E   + +  +S+ DS   E  +  
Sbjct: 924  MVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDW 983

Query: 516  KKEKSRKTITISEKPRKSKK 535
             + +S+       +P K +K
Sbjct: 984  DELESKA--AYDSRPGKRRK 1001


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 39.0 bits (91), Expect = 0.011
 Identities = 45/169 (26%), Positives = 67/169 (39%), Gaps = 12/169 (7%)

Query: 599 EKAEYDEKVN---KAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
           ++ E DE  N     KE   + L+  K+       DD K +K    +K  E         
Sbjct: 161 DEDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKK-- 218

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
           K  +   ++S GS+ D  + K+   K+K   +  S      +   S+   D  E  K KK
Sbjct: 219 KSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASD-YYDVSEMVKFKK 277

Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
            K K      K K KK  +  DE+  EP      SSD+GS     E++ 
Sbjct: 278 PKKK------KKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENA 320



 Score = 32.4 bits (74), Expect = 0.93
 Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 17/198 (8%)

Query: 545 KRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYD 604
           K KK  +   KKP        +D  +  K+        EI  K  +     +       D
Sbjct: 178 KDKKNLELKKKKPDYDP----DDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233

Query: 605 EKVNKAKEDYNEALKAYKESGGGQD--------SDDGKTSKSKKPAKKKETASTAVSPHK 656
           ++  K +++  + LK    S              D  +  K KKP KKK+         +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKK-----R 288

Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
            K  +  E    +         S K   + +   +   KK+KE+ E +    + +  + S
Sbjct: 289 RKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQAS 348

Query: 717 KSKPKGDKSKSKSKKEDE 734
            +K +    K + K   E
Sbjct: 349 LAKQRRLAQKKRKKLRPE 366



 Score = 30.1 bits (68), Expect = 5.5
 Identities = 30/162 (18%), Positives = 54/162 (33%), Gaps = 12/162 (7%)

Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDD------GKTSKSKKPAKKKE 646
           K +  K+K    +  +  K +    L  Y E   G+               +   A+KK 
Sbjct: 181 KNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKR 240

Query: 647 TASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESD 706
                       S +   +   +SD  D  +    +K    +  K K +K  ++ E E +
Sbjct: 241 QEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300

Query: 707 SGESEKEK-KSKSKPKGDK-----SKSKSKKEDEIDDEEYDE 742
           +         S+   + +      S  K K+E E DD   D+
Sbjct: 301 AEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDD 342


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 37.8 bits (88), Expect = 0.013
 Identities = 37/160 (23%), Positives = 64/160 (40%), Gaps = 18/160 (11%)

Query: 605 EKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE 664
            K  KA   + E+ K+ K        +        K  ++K+   +A SP         E
Sbjct: 105 AKSPKAMRTFEESKKSKKTVDSM--IERKPKEPGLKRKQRKKAQESATSP---------E 153

Query: 665 SNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDK 724
           S+ SS+ +          K      +K    ++ +K+   S +  S  EK  KSK    K
Sbjct: 154 SSPSSTPNSSRPSTPHLLK------AKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKK 207

Query: 725 SKSKSKKEDEIDDEEYDEPVE-STPPSSDAGSDASSSEDD 763
           +  K +K D   DE+ D  ++ S P ++D  +DA    ++
Sbjct: 208 AGKKMRKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEE 247



 Score = 29.3 bits (66), Expect = 7.0
 Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 2/74 (2%)

Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
           E    K  K  +   +  +S +S+K   S  + K  +   K K+  +    E     ES+
Sbjct: 98  EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQ--ESATSPESS 155

Query: 748 PPSSDAGSDASSSE 761
           P S+   S  S+  
Sbjct: 156 PSSTPNSSRPSTPH 169


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 37.7 bits (87), Expect = 0.022
 Identities = 21/154 (13%), Positives = 46/154 (29%), Gaps = 12/154 (7%)

Query: 607 VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESN 666
           V++A E     L A   +     +         +P +  + A    +     + E    +
Sbjct: 292 VDEALEREPAELAAAAVAS---AASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES 348

Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
                  D +   +      +E S+A      E+ +    +G++    +  ++      +
Sbjct: 349 VVQVADRDGESTPA-----VEETSEADI----EREQPGDLAGQAPAAHQVDAEAASAAPE 399

Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
             +    E  DE   E  E   P  D       +
Sbjct: 400 EPAALASEAHDETEPEVPEKAAPIPDPAKPDELA 433



 Score = 36.5 bits (84), Expect = 0.052
 Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 8/102 (7%)

Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNP--------DQAESDVA 490
           E   + ++  E D    +    A +       D + ++     P        D+ E +V 
Sbjct: 358 ESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVP 417

Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
           E+    P P   DE + A  G    + +  R+    S + R+
Sbjct: 418 EKAAPIPDPAKPDELAVAGPGDDPAEPDGIRRQDDFSSRRRR 459



 Score = 35.3 bits (81), Expect = 0.12
 Identities = 18/112 (16%), Positives = 29/112 (25%), Gaps = 1/112 (0%)

Query: 449 EPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDA 508
           +PD     VK E AE  DE   +      +         +   E D        D    A
Sbjct: 325 QPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIE-REQPGDLAGQA 383

Query: 509 SEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSA 560
               +   +  S      +    ++  + E           D   P +   A
Sbjct: 384 PAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVA 435


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 37.3 bits (87), Expect = 0.023
 Identities = 25/128 (19%), Positives = 51/128 (39%), Gaps = 10/128 (7%)

Query: 422 KKLRVKNTGKGEKPNYKEDFGSSDDE---------KEPDAYLARVKREAAERDDEDDEDE 472
           KK R K    G +   K+ +G  +           ++      + K      + +DD+DE
Sbjct: 41  KKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDE 100

Query: 473 DESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
           +E  + + + D  +     + +S+     SD + D  E  +  KK K      +SE+  +
Sbjct: 101 EEEWEVEEDEDSDDEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEE 159

Query: 533 SKKDKESS 540
              ++E +
Sbjct: 160 EAAEEEEA 167


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 37.4 bits (86), Expect = 0.031
 Identities = 58/280 (20%), Positives = 107/280 (38%), Gaps = 19/280 (6%)

Query: 465 DDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
           DD D    DE+ D DF   +   +  +E D+     DSD   D SEG  + K+  + K  
Sbjct: 431 DDSDVSTSDENEDVDFTGKKGAINNEDESDNEEVAFDSDSQFDESEGNLRWKEGLASKL- 489

Query: 525 TISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREK--IKKDNPGISFT 582
                  +S K   +     K    +  +P++ +  Y        E   + +D P   F 
Sbjct: 490 ----AYSQSGKRGRNIQ---KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFF- 541

Query: 583 EIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA 642
           +++K   E   +   K      E++ K        LK+  +     DS +G+    +   
Sbjct: 542 DVSKVANESISSNHEKLMESEFEELKKKWSSL-AQLKSRFQKDATLDSIEGEEELIQD-- 598

Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
            +K               E  ES GSS  +++++ A         E++  K ++ +   E
Sbjct: 599 DEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFE 658

Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
            E + G+ EK    K      + K K +++ +I+  E++ 
Sbjct: 659 LE-ERGDPEK----KDVDWYTEEKRKIEEQLKINRSEFET 693


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 36.8 bits (86), Expect = 0.044
 Identities = 41/218 (18%), Positives = 76/218 (34%), Gaps = 43/218 (19%)

Query: 528 EKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKK 587
            +  + KK  E       R  KDK+A    +            ++K           AKK
Sbjct: 468 AREARHKKAAE------ARAAKDKDAVAAAL-----------ARVK-----------AKK 499

Query: 588 GGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKET 647
                  V        +  V  A+E      +A +       + D           KK  
Sbjct: 500 AAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAAD----------PKKAA 549

Query: 648 ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
            + A++  K K      +N  + +  D KKA       +   ++AK KK  +++ S    
Sbjct: 550 VAAAIARAKAKKAAQQAANAEAEEEVDPKKA-----AVAAAIARAKAKKAAQQAASAEPE 604

Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
            +  +    K+      +++K+KK ++  + E +EPV+
Sbjct: 605 EQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVD 642



 Score = 36.1 bits (84), Expect = 0.083
 Identities = 46/261 (17%), Positives = 91/261 (34%), Gaps = 43/261 (16%)

Query: 447 EKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDS 506
           E+E  A  AR K+ A  R  +D +    +        +A +       +   P +S   +
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAAL-ARVKAKKAAATQPIVIKAGARPDNSAVIA 521

Query: 507 DASEGTKKKKKEKSRKTITISEKPRKS-----------KKDKESSGGSTKRKKKDKNAPK 555
                  + +  ++ K    +  P+K+           KK  + +  +   ++ D   PK
Sbjct: 522 AREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVD---PK 578

Query: 556 KPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYN 615
           K   A  +     R K KK     +  E  ++           +KA     + +AK    
Sbjct: 579 KAAVAAAI----ARAKAKKAAQQAASAEPEEQV-----AEVDPKKAAVAAAIARAK---- 625

Query: 616 EALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDD 675
            A KA ++               +    +K   + A++  K +     ++N    +++D 
Sbjct: 626 -AKKAEQQ---------ANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDP 675

Query: 676 KKASSKRKRDSDEDSKAKTKK 696
           KKA       +   ++AK KK
Sbjct: 676 KKA-----AVAAAIARAKAKK 691


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 36.5 bits (84), Expect = 0.047
 Identities = 15/64 (23%), Positives = 33/64 (51%)

Query: 454 LARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTK 513
           L +   E A  +++DDE++D+  D++   D  + D  ++ D +   +   +DS A +G +
Sbjct: 41  LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNE 100

Query: 514 KKKK 517
              +
Sbjct: 101 TDNE 104



 Score = 31.9 bits (72), Expect = 1.2
 Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 23/151 (15%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
           +E  +R       E+   +ED    DD+ + D      + E  + DD+D++DEDE  D+ 
Sbjct: 34  KENAIRKLGKEAEEEAMEEEDDDEEDDDDDDD------EDEDDDDDDDDEDDEDEDDDDS 87

Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSD------------ASEGTKKKKKEKSRKTITIS 527
              D + +D   E D+     DSD++SD             +  T  +K E  R+     
Sbjct: 88  TLHDDSSADDGNETDNEAGFADSDDESDDGSEYVFWAPGTTTAATSPRKLETMRRKSR-- 145

Query: 528 EKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
              R++      S    K+++K K   + P+
Sbjct: 146 ---RRTSDSSADSLNERKQRRKWKRPRRSPI 173


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 36.8 bits (85), Expect = 0.048
 Identities = 27/131 (20%), Positives = 43/131 (32%), Gaps = 2/131 (1%)

Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSK 681
           K+    + S     S S           +  S    K +E + S  S   S   K     
Sbjct: 764 KKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRN-KEEEMLPSPSSPLSSSSPKPEHPS 822

Query: 682 RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG-DKSKSKSKKEDEIDDEEY 740
           RKR   ++  + +      S ++S S  S   K  K++ KG   SK       +  ++  
Sbjct: 823 RKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKAS 882

Query: 741 DEPVESTPPSS 751
             PV      S
Sbjct: 883 SFPVPPLSNGS 893



 Score = 31.4 bits (71), Expect = 2.4
 Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK 676
           K+S      K+K  + + + P   KS+EFIE++ SSSDS +D+
Sbjct: 595 KSSVPPAAEKRKYKSPSKIVP---KSREFIETDSSSSDSPEDE 634



 Score = 29.9 bits (67), Expect = 6.3
 Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 7/107 (6%)

Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
           +K+   S+  S +       +E +   S S+D  +  +  K  S     +      E + 
Sbjct: 336 QKQYDPSSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAA 395

Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPP 749
           S S S  S    +S        S S S+ E    D E +EP  +  P
Sbjct: 396 SSSGSSSSSSGSES-------SSGSDSESESSSSDSEENEPPRTASP 435


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 36.4 bits (84), Expect = 0.053
 Identities = 45/209 (21%), Positives = 69/209 (33%), Gaps = 11/209 (5%)

Query: 517 KEKSRK-TITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIK-- 573
           KE++   ++T  EK   S K   S+      +    N      S       +  EK K  
Sbjct: 203 KEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDH 262

Query: 574 KDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDG 633
             +       I           S  +  E  E  +      + +    KE+     S   
Sbjct: 263 HHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAA 322

Query: 634 KT--SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
            +  SKS K AK      +  SP    S      + S SD++      SK+KR S   S 
Sbjct: 323 GSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNE------SKQKRASKSSSG 376

Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKP 720
           A+  KK     S + + E+   +   S P
Sbjct: 377 ARDSKKDASGMSANGTVENCIPENKISTP 405



 Score = 30.3 bits (68), Expect = 5.0
 Identities = 51/255 (20%), Positives = 93/255 (36%), Gaps = 29/255 (11%)

Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG----GSTKR-KKKDKNAPKKPM 558
             S A+ G+   K  KS K    ++     K    ++G     S    + K K A K   
Sbjct: 316 NSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSSS 375

Query: 559 SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL 618
            A            KKD  G+S     +      K  +       ++ + K + +  +A 
Sbjct: 376 GARD---------SKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQAR 426

Query: 619 KAYKE--SGGGQDSDDGKTSKSKKPAKKKE-----TASTAVSPHKVKSKEFIESNGSSSD 671
           +   E  +     +   ++ KS     KKE     T   ++   K K K+ ++S      
Sbjct: 427 QNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLK 486

Query: 672 SDDDKKAS------SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKS 725
           S+ D + +       ++KR  +E+  A  +   + + S  +  ES K+ K   + +  K 
Sbjct: 487 SEADSRVNAEKQLAEEKKRKKEEEETAA-RAAAQAAASREECAESLKQAKQDLEMEIKKL 545

Query: 726 KSKSK-KEDEIDDEE 739
           +   K KE+E    E
Sbjct: 546 EHDLKLKEEECRMLE 560


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 35.6 bits (82), Expect = 0.058
 Identities = 16/96 (16%), Positives = 40/96 (41%)

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
           +   KE    +   SD +  K++  ++++   EDS+   + +++K E +  + E ++   
Sbjct: 141 EKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGG 200

Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
                 G    S   +  +   +++ +P     P S
Sbjct: 201 GGGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 36.2 bits (83), Expect = 0.062
 Identities = 28/141 (19%), Positives = 54/141 (38%), Gaps = 14/141 (9%)

Query: 626 GGQDSDDGKTSKSKKPAKKKET--ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRK 683
              D +   + K    + K+ T      +S    +  E +     + D+  DK++    +
Sbjct: 60  EEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELV----VNEDAALDKESKKTPR 115

Query: 684 RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEP 743
           R   + + A +  ++EK+E        +K +K +   K D+       E E+ D E  E 
Sbjct: 116 RTRRKAAAASSDVEEEKTE--------KKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167

Query: 744 VESTPPSSDAGSDASSSEDDD 764
           V S    S+   D    + +D
Sbjct: 168 VTSLENESEEELDLEKDDGED 188



 Score = 29.6 bits (66), Expect = 6.6
 Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 8/67 (11%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
           +S D +E        KR   ++ DED ED+   ++         SDV E         +S
Sbjct: 124 ASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESE--------VSDVEESEFVTSLENES 175

Query: 503 DEDSDAS 509
           +E+ D  
Sbjct: 176 EEELDLE 182


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 36.2 bits (83), Expect = 0.067
 Identities = 70/331 (21%), Positives = 138/331 (41%), Gaps = 29/331 (8%)

Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD----ASEGTKK 514
           R   +  D+ D+DED+S D+ F  D +     E      +   SDEDS     +SE +  
Sbjct: 93  RSTGKEGDDKDDDEDDSGDDTFGDDDSGPGPEERQWGGNSRLGSDEDSADTTQSSEDSTP 152

Query: 515 KKKEKSRKTITIS-EKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMRE--- 570
           + +  ++ T + S +   + + D    GG + +  + +       S       D  E   
Sbjct: 153 QGENSAQDTTSESRDLDNEDEVDSRPEGGDSTQDSESEEHWVGGGSEGESSHGDGSEFDD 212

Query: 571 -KIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALK---------- 619
             ++ D+P  + +E            S + K E +E+ +      +++++          
Sbjct: 213 EGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKS 272

Query: 620 AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
              E  G  + DD  T + K  + +    S +    + +S+E  E N S  DS + +  S
Sbjct: 273 RISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPS 332

Query: 680 SKRKRDSDEDSKAKTKKKKEKSESES---------DSGESEKEKKSKSKPKGDK-SKSKS 729
           S+  +++D  S+  + + +E+  SES            E +++ +S  +   D  S S+S
Sbjct: 333 SESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSES 392

Query: 730 KKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
           +  +E  D E +E + S+  S ++  D +SS
Sbjct: 393 QSTEEQADSESNESLSSSEESPESTEDENSS 423



 Score = 32.7 bits (74), Expect = 0.78
 Identities = 29/137 (21%), Positives = 65/137 (47%), Gaps = 3/137 (2%)

Query: 628 QDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD 687
           +DS+  +      P+  +  ++        +S E + S+  S +S +D+ +SS+    S 
Sbjct: 374 EDSESSEEDSLDTPSSSESQSTE--EQADSESNESLSSSEESPESTEDENSSSQEGLQS- 430

Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
             +  +++ ++ +SE +S S E + + +  S+ K D + ++S    E D +  +  +ES 
Sbjct: 431 HSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIESR 490

Query: 748 PPSSDAGSDASSSEDDD 764
             + DA  +    + DD
Sbjct: 491 KLTVDAYHNKPIGDQDD 507


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 36.0 bits (83), Expect = 0.076
 Identities = 23/130 (17%), Positives = 54/130 (41%), Gaps = 5/130 (3%)

Query: 429 TGKGEKPNYK--EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
           +  GE P+    E       E+E +      + E    ++E +++E+E   E  N  + E
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEE 485

Query: 487 SDVAEEYDSNPTPTDSDE---DSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGS 543
            + + E D +    + D    +S+ +  ++  + ++ R +    E P++     ES    
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545

Query: 544 TKRKKKDKNA 553
           +  ++ D+  
Sbjct: 546 SVGEESDEEL 555



 Score = 30.3 bits (68), Expect = 4.8
 Identities = 14/90 (15%), Positives = 37/90 (41%)

Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
           +++      R SD    + T  +     S+    E   E++ + + + ++ + +S++E+ 
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467

Query: 735 IDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
            D+EE +E         +    +    D +
Sbjct: 468 EDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 26/117 (22%), Positives = 49/117 (41%), Gaps = 5/117 (4%)

Query: 643 KKKETASTAVSPH-KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS 701
           K++E      S      SK    S  S S +  + +     + + +E+ +    +++E+ 
Sbjct: 405 KRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE----EEEEEQ 460

Query: 702 ESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDAS 758
           ESE + GE E+E++      G + + +   E + D EE +E  E         S  S
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 35.8 bits (82), Expect = 0.11
 Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 5/108 (4%)

Query: 439 EDFGSSDDEKEPDAYLARVK----REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYD 494
           +D G     ++     A  K     +A   +DE++ED  +  DE+      + D  +E D
Sbjct: 364 DDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEED-GQCNDEESTMSAIDDDDPKEND 422

Query: 495 SNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGG 542
           +     D +   D +EG ++   E+  + +       K  ++      
Sbjct: 423 NEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEAE 470


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 35.5 bits (82), Expect = 0.12
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 3/89 (3%)

Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKT---KKKKEKSESESDSGESEKEKKS 716
           K  +       D+D    +    + D ++D + +        E+ E   D  ++E ++  
Sbjct: 467 KNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTD 526

Query: 717 KSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
            S      +   S  ED+ DD E  EPV+
Sbjct: 527 SSNNADISNSGSSGAEDDDDDIEGVEPVD 555


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 32.7 bits (75), Expect = 0.17
 Identities = 14/51 (27%), Positives = 26/51 (50%)

Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDED 505
           A+   +  + DD   +D+D+  D+D  PD  + DV  + D +    D D++
Sbjct: 54  AKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104



 Score = 29.6 bits (67), Expect = 1.7
 Identities = 15/54 (27%), Positives = 26/54 (48%)

Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDA 508
           A+ +  AA+ +D   +DEDE  ++D   D  + D  ++   +    D D D D 
Sbjct: 41  AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDD 94


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 35.0 bits (80), Expect = 0.17
 Identities = 18/65 (27%), Positives = 29/65 (44%)

Query: 433 EKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
           E+ +Y  D  S   EK+ D  L   K E   R+ E+ E E+E  D +   D ++    ++
Sbjct: 347 ERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDD 406

Query: 493 YDSNP 497
              N 
Sbjct: 407 ELENH 411



 Score = 33.8 bits (77), Expect = 0.35
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDV-AEEYDSNPTPTDSD 503
           ++E E   Y+        E+  +D    ++   E+ NP+++E +   E+Y+     +   
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRI 402

Query: 504 EDSDASEGTKKKKKEKSR 521
            D D  E   +   EK+ 
Sbjct: 403 CDDDELENHFRAADEKNS 420


>gnl|CDD|153272 cd07588, BAR_Amphiphysin, The Bin/Amphiphysin/Rvs (BAR) domain of
           Amphiphysins.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Amphiphysins
           function primarily in endocytosis and other membrane
           remodeling events. They contain an N-terminal BAR domain
           with an additional N-terminal amphipathic helix (an
           N-BAR), a variable central domain, and a C-terminal SH3
           domain. This subfamily is composed of different isoforms
           of amphiphysin and Bridging integrator 2 (Bin2).
           Amphiphysin I proteins, enriched in the brain and
           nervous system, contain domains that bind clathrin,
           Adaptor Protein complex 2 (AP2), dynamin and
           synaptojanin. They function in synaptic vesicle
           endocytosis. Some amphiphysin II isoforms, also called
           Bridging integrator 1 (Bin1), are localized in many
           different tissues and may function in intracellular
           vesicle trafficking. In skeletal muscle, Bin1 plays a
           role in the organization and maintenance of the T-tubule
           network. Bin2 is mainly expressed in hematopoietic cells
           and is upregulated during granulocyte differentiation.
           The N-BAR domains of amphiphysins form a curved dimer
           with a positively-charged concave face that can drive
           membrane bending and curvature.
          Length = 211

 Score = 34.3 bits (79), Expect = 0.17
 Identities = 19/67 (28%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 557 PMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNE 616
           P++AY   F +++++I K    +   + A+   E    + +K+K + D+K+ KA+E+  +
Sbjct: 95  PLTAYQSQFPEVKKRIAKRGRKLVDYDSARHNLE---ALKAKKKVD-DQKLTKAEEELQQ 150

Query: 617 ALKAYKE 623
           A K Y+E
Sbjct: 151 AKKVYEE 157


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 34.4 bits (79), Expect = 0.20
 Identities = 12/59 (20%), Positives = 25/59 (42%), Gaps = 3/59 (5%)

Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
           E  DED +   +  D+D++   +    + + +S+    D++E+ D  E      K    
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSD---VDTNEEDDRGEKESNGAKSNEL 333



 Score = 31.3 bits (71), Expect = 2.0
 Identities = 11/56 (19%), Positives = 28/56 (50%), Gaps = 5/56 (8%)

Query: 467 EDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
            D++ +  S ++D + D  +S      DS  + ++SD D++  +   +K+   ++ 
Sbjct: 280 IDEDSDYSSDEDDDDYDSYDSS-----DSASSDSNSDVDTNEEDDRGEKESNGAKS 330


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.7 bits (80), Expect = 0.20
 Identities = 14/61 (22%), Positives = 32/61 (52%)

Query: 677 KASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
           K   K   D +E+ + + KK++E+  ++ +    E+E+K + K K  K K  + + + ++
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88

Query: 737 D 737
            
Sbjct: 89  K 89



 Score = 30.1 bits (68), Expect = 5.0
 Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 10/72 (13%)

Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
           E + E  ++E+E   E+           +E +   T  + + D +  +  KKKK +K ++
Sbjct: 30  EVEKEVPDEEEEEEKEE----------KKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79

Query: 523 TITISEKPRKSK 534
           T T  E   K+K
Sbjct: 80  TTTEWELLNKTK 91


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 34.7 bits (79), Expect = 0.21
 Identities = 33/155 (21%), Positives = 57/155 (36%), Gaps = 9/155 (5%)

Query: 422 KKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREA-AERDDEDDEDEDESTDEDF 480
           +KL V+     E     E       EK+      + K+E   ERD +  ++ +       
Sbjct: 171 EKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLL 230

Query: 481 NPDQ---AESDVAEEYDSNPTPTDSDEDSDAS-EGTKKKKKEKSRKTITISEKPRKSKKD 536
             D    + + VAE  +++   T S    D+  +  K  + E+++K    S K +K K+ 
Sbjct: 231 ALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKK----SPKHKKKKQR 286

Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREK 571
           KE      K+K            A     N   E+
Sbjct: 287 KEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEE 321



 Score = 32.7 bits (74), Expect = 0.75
 Identities = 51/255 (20%), Positives = 79/255 (30%), Gaps = 46/255 (18%)

Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKK 517
           KRE  +R           +DED  P Q    V EE   N  P+D D D D ++  +    
Sbjct: 103 KREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDED-DKDPNDPYRALDI 161

Query: 518 EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNP 577
           +  +      + P               +K ++    K P               K D P
Sbjct: 162 DLDKPLADSEKLP--------------VQKHRNAETSKSPE--------------KGDVP 193

Query: 578 GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSK 637
            +      KK     K    KEK E +++ +K K       K   E           +  
Sbjct: 194 AVE-----KKS----KKPKKKEKKEKEKERDKDK-------KKEVEGFKSLLLALDDSPA 237

Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE-DSKAKTKK 696
           S     + + AS A +          +    +   +  K    K+K+   E + K K KK
Sbjct: 238 SAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKK 297

Query: 697 KKEKSESESDSGESE 711
                   SD G  +
Sbjct: 298 HHHHRCHHSDGGAEQ 312



 Score = 32.0 bits (72), Expect = 1.5
 Identities = 34/136 (25%), Positives = 53/136 (38%), Gaps = 12/136 (8%)

Query: 628 QDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD 687
           +  D     K  K  KKKE         K K KE +E   S   + DD  AS+    ++D
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE-VEGFKSLLLALDDSPASAASVAEAD 246

Query: 688 EDSKAKT-----------KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
           E S A T           + K  ++E    S + +K+K+ K K +  K K          
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHS 306

Query: 737 DEEYDEPVESTPPSSD 752
           D   ++PV++     +
Sbjct: 307 DGGAEQPVQNGAVEEE 322



 Score = 31.6 bits (71), Expect = 1.7
 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
           D D DK  +   K    +   A+T K  EK +  +   +S+K  K K K + +K + K K
Sbjct: 160 DIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKK-PKKKEKKEKEKERDKDK 218

Query: 731 KEDEIDDEEYDEPVESTPPSSDAGSDA 757
           K++    +     ++ +P S+ + ++A
Sbjct: 219 KKEVEGFKSLLLALDDSPASAASVAEA 245


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.9 bits (78), Expect = 0.24
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           ++++ +    + R  +E +K K KKKK+K++  + +G S +   S + P    S +    
Sbjct: 165 TEEEVEL--LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG 222

Query: 732 ED 733
           E 
Sbjct: 223 EA 224



 Score = 28.9 bits (65), Expect = 9.5
 Identities = 16/71 (22%), Positives = 25/71 (35%), Gaps = 4/71 (5%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           K  K KK  KK     ++       +     S+G+    +  K    +     +E S+  
Sbjct: 184 KKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSEVY 243

Query: 694 ----TKKKKEK 700
               T  KKEK
Sbjct: 244 KSLFTSHKKEK 254


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 34.2 bits (78), Expect = 0.25
 Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 14/86 (16%)

Query: 671 DSDDDKKASSKRKR-DSDEDSKAKTKKKKEKSESESDSG--------ESEKEKKSKSKPK 721
           + D D  A +   R DSDED   + +   E+++ E DS          SE E  S     
Sbjct: 436 EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSW 495

Query: 722 GDKSKSKSKKEDE-----IDDEEYDE 742
           GD+S S  + +D      I++EE +E
Sbjct: 496 GDESDSGIEHDDGGVGQAIEEEEEEE 521



 Score = 33.8 bits (77), Expect = 0.37
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
             S + D D++ S   + D ++DS +        SE ESDS +     +S S  + D   
Sbjct: 450 ADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGG 509

Query: 727 SKSKKEDEIDDE 738
                E+E ++E
Sbjct: 510 VGQAIEEEEEEE 521



 Score = 33.4 bits (76), Expect = 0.54
 Identities = 18/101 (17%), Positives = 40/101 (39%), Gaps = 9/101 (8%)

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
            +  +++ +D D +A +    + D+D  A     +  S+ +    +   ++++  +   D
Sbjct: 417 AAAAANAPADQDVEAEAHD--EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSD 474

Query: 724 KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
            S S S    E + +  D+       S    SD+    DD 
Sbjct: 475 SSYSYSTTSSEDESDSADD-------SWGDESDSGIEHDDG 508



 Score = 31.1 bits (70), Expect = 2.4
 Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 1/62 (1%)

Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDE 504
           + E   +          A+R D D++D DE    D   D  E D    Y  + T ++ + 
Sbjct: 430 EAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERAD-GEDDSDSSYSYSTTSSEDES 488

Query: 505 DS 506
           DS
Sbjct: 489 DS 490



 Score = 30.7 bits (69), Expect = 3.7
 Identities = 18/82 (21%), Positives = 28/82 (34%)

Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKK 514
            R    AA  +   D+D +    ++F+ D      A+  DS+    D  E  D     + 
Sbjct: 412 GRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGED 471

Query: 515 KKKEKSRKTITISEKPRKSKKD 536
                   + T SE    S  D
Sbjct: 472 DSDSSYSYSTTSSEDESDSADD 493


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 34.4 bits (79), Expect = 0.27
 Identities = 38/166 (22%), Positives = 55/166 (33%), Gaps = 26/166 (15%)

Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETAS 649
           +  K  S K + E D K +    D N+ L          +  D K  K  K    K++  
Sbjct: 24  KNLKHSSYKIRLESDIKDSLLNLDINKKL---------HEKLDKKNKKFNKTDDLKDSKK 74

Query: 650 TAVSPHKVKSKEFIESNGSSSDSDDD----KKASSKRKRDSDEDSKAKTKKKKEKSESES 705
           T     K+K K+ I+      D  D+    K  S +           K K K EK     
Sbjct: 75  T-----KLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEK----- 124

Query: 706 DSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
                +K   +KS  K  K    SK E    D    + +    P +
Sbjct: 125 ---LKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 33.9 bits (78), Expect = 0.29
 Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 1/63 (1%)

Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSN-PTPTDSDEDSDASEGTKKKKKE 518
                +DED+E+E  S + +    +   +   +   +  + + SD  SD+     K+   
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426

Query: 519 KSR 521
            S 
Sbjct: 427 DSD 429


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 34.0 bits (78), Expect = 0.32
 Identities = 16/91 (17%), Positives = 31/91 (34%), Gaps = 4/91 (4%)

Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
           +P    +     D     A      R       +DD++ D+  D+D+     + D  + Y
Sbjct: 394 RPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVY 453

Query: 494 DSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
           D      D DE          ++ E  ++ +
Sbjct: 454 D----LKDVDEWRLERRIATAERWELGQRLL 480


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.8 bits (77), Expect = 0.39
 Identities = 49/275 (17%), Positives = 97/275 (35%), Gaps = 34/275 (12%)

Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDED-EDESTDEDFNPDQAESDVAEEYDSNPT 498
                 D +  +        ++ +  D+++E+ EDES +++   D+ E+D+  E +S+  
Sbjct: 90  VLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESS-- 147

Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
             +   D +  +  +KK +E+       S +    +K  + SG   K  K D+       
Sbjct: 148 -LEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDE------- 199

Query: 559 SAYMMWFNDMREKIKKDNPGISFTEIAKKG--------GELWKTVSSKEKAEYDEKVNKA 610
                  N+  E  + +       E   +         G+  +   S E  E DE+ N  
Sbjct: 200 ------MNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIE 253

Query: 611 KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
            ED+ +  +  K+   G D++       K+  KK+            K +E  E +    
Sbjct: 254 YEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEA---------DSKPEEEDEEDDEQE 304

Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
           D  D+++           D         E  +  S
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELS 339



 Score = 33.4 bits (76), Expect = 0.44
 Identities = 52/277 (18%), Positives = 106/277 (38%), Gaps = 48/277 (17%)

Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
             DDE++E+EDES +++   D+ E+D+  E +S+    +   D +  +  +KK +E+   
Sbjct: 114 SADDEEEEEEDESLEDEMIDDEDEADLFNESESS---LEDLSDDETEDDEEKKMEEEEAG 170

Query: 523 TITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFT 582
               S +    +K  + SG   K  K D+              N+  E  + +       
Sbjct: 171 EEKESVEQATREKKFDKSGVDDKFFKLDE-------------MNEFLEATEAEEEAALGD 217

Query: 583 EIAKKG--------GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK 634
           E   +         G+  +   S E  E DE+ N   ED+ +  +  K+   G D++   
Sbjct: 218 EDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELED 277

Query: 635 TSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKT 694
               K+  KK+                        +DS  +++     +++ D+D +   
Sbjct: 278 DEPDKEAVKKE------------------------ADSKPEEEDEEDDEQEDDQDEEEPP 313

Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           +   +K + +    E    +  K     +K ++K K+
Sbjct: 314 EAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQ 350



 Score = 30.7 bits (69), Expect = 3.0
 Identities = 30/123 (24%), Positives = 53/123 (43%), Gaps = 5/123 (4%)

Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
           K +K +  +D    DDE + +A       +  E D+EDDE ED+  +E+      +    
Sbjct: 263 KDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKL 322

Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
           +E    P     D +S     + +K++ K ++ I   EK   + K      G    KK+ 
Sbjct: 323 DE----PVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKSWTLK-GEVTAKKRP 377

Query: 551 KNA 553
           +N+
Sbjct: 378 QNS 380



 Score = 29.2 bits (65), Expect = 9.0
 Identities = 17/96 (17%), Positives = 36/96 (37%)

Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
           S D  DD+   S    + DE+   + +   +  E +      +  +    +P  +  K +
Sbjct: 229 SEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKE 288

Query: 729 SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
           +  + E +DEE DE  +         +     + D+
Sbjct: 289 ADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDE 324


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 33.5 bits (76), Expect = 0.40
 Identities = 24/80 (30%), Positives = 38/80 (47%)

Query: 663 IESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG 722
           +E+    S+ +  KK   ++   SD+  +    +++E SESESD  E E  K    K K 
Sbjct: 214 LEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKT 273

Query: 723 DKSKSKSKKEDEIDDEEYDE 742
           D +K   K    I+ E+  E
Sbjct: 274 DDAKKSRKPHIHIEYEQERE 293



 Score = 33.5 bits (76), Expect = 0.40
 Identities = 25/91 (27%), Positives = 37/91 (40%), Gaps = 3/91 (3%)

Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
             DEKE   Y+   +    E +   D+ E E T +        SD + E   +     S+
Sbjct: 196 RQDEKER--YVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSE 253

Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSK 534
            +SD  E    K K + RKT   ++K RK  
Sbjct: 254 SESDEDEDEDNKGKIRKRKTDD-AKKSRKPH 283



 Score = 32.7 bits (74), Expect = 0.68
 Identities = 33/173 (19%), Positives = 70/173 (40%), Gaps = 9/173 (5%)

Query: 545 KRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELW-KTVSSKEKAEY 603
           K  +++K   +K ++A     N  +E + +   GI           LW K  + +EK + 
Sbjct: 139 KMNRREKLRERKALNAAKFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQD 198

Query: 604 DEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFI 663
           +++    +E+ ++             +DD +  K+KK   +K   S          +E  
Sbjct: 199 EKERYVEEEEESDTELEA-------VTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEES 251

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
             + S  D D+D K   ++ R +D+  K++      + E E ++ +    + S
Sbjct: 252 SESESDEDEDEDNKGKIRK-RKTDDAKKSRKPHIHIEYEQERENEKIPAVQHS 303



 Score = 30.4 bits (68), Expect = 3.4
 Identities = 28/138 (20%), Positives = 56/138 (40%), Gaps = 18/138 (13%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
           +E  LR+K+   G+ P        +  E   +      ++   E++   +E+E+  T+ +
Sbjct: 163 KELMLRLKSGIYGDTP-------LNVREHLWNKAATEREKRQDEKERYVEEEEESDTELE 215

Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
              D +E +  ++ D              S  T + ++E+S      SE      +D+++
Sbjct: 216 AVTDDSEKEKTKKKDLEKWLGSDQ-----SMETSESEEEES------SESESDEDEDEDN 264

Query: 540 SGGSTKRKKKDKNAPKKP 557
            G   KRK  D    +KP
Sbjct: 265 KGKIRKRKTDDAKKSRKP 282


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 33.7 bits (77), Expect = 0.41
 Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 667 GSSSDSDDDKKASS----KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG 722
           GS ++S+ +KK  +    +RK+  +E +K K  KK + ++ E+ + + + ++ S      
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA-KLQAQQASDGTNVP 59

Query: 723 DKSKSKSKKEDEIDDEEYDEPVESTP 748
            KS+ KS+K D  D+   D     TP
Sbjct: 60  KKSEKKSRKRDVEDENPEDFIDPDTP 85



 Score = 30.6 bits (69), Expect = 3.8
 Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 4/69 (5%)

Query: 495 SNPTPTDSDEDSDASEGTKKKK---KEKSRKTITISEKPRKSK-KDKESSGGSTKRKKKD 550
           +           +  E  KKK+   KEK  K +  ++K  K+K + +++S G+   KK +
Sbjct: 4   TESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63

Query: 551 KNAPKKPMS 559
           K + K+ + 
Sbjct: 64  KKSRKRDVE 72


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 33.4 bits (77), Expect = 0.47
 Identities = 15/79 (18%), Positives = 33/79 (41%), Gaps = 12/79 (15%)

Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
            + + EP    A    E  + DDE+ ++EDE + +  + +  + D            D D
Sbjct: 108 EELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDD------------DDD 155

Query: 504 EDSDASEGTKKKKKEKSRK 522
           +D    E + ++++ +   
Sbjct: 156 DDIATRERSLERRRRRREW 174



 Score = 30.0 bits (68), Expect = 5.5
 Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 5/85 (5%)

Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTP 499
           D G  ++E + +       R+  E DD+D+E ++E  +   + D  + D  ++ D   T 
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161

Query: 500 TDSDEDSDASEGTKKKKKEKSRKTI 524
             S E        ++++ E+ R  +
Sbjct: 162 ERSLERRR-----RRREWEEKRAEL 181



 Score = 29.6 bits (67), Expect = 8.1
 Identities = 15/78 (19%), Positives = 31/78 (39%), Gaps = 3/78 (3%)

Query: 465 DDEDDEDEDE---STDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
           DD D E+E +     D+ +   + + D  EE D     +   ED +  +         +R
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161

Query: 522 KTITISEKPRKSKKDKES 539
           +      + R+  ++K +
Sbjct: 162 ERSLERRRRRREWEEKRA 179


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 33.6 bits (77), Expect = 0.47
 Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 7/84 (8%)

Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
           + E+ K  T+K+K +++        +K KK K +        KS ++   + EE+ + +E
Sbjct: 13  AAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIE 72

Query: 746 STPPSSDA-------GSDASSSED 762
                          G D+S S+D
Sbjct: 73  EHLSIEQMREILEENGQDSSGSDD 96


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 33.5 bits (76), Expect = 0.50
 Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 10/99 (10%)

Query: 430 GKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDV 489
           GKG+K N K  F S D+  E + + A VK      DD DD + D   ++DF      SD 
Sbjct: 669 GKGKKSN-KASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELD-FAEDDF------SDS 720

Query: 490 AEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISE 528
               D  P     D++   SEG+++  +E+    I  S 
Sbjct: 721 TS--DDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSF 757



 Score = 30.4 bits (68), Expect = 4.3
 Identities = 20/105 (19%), Positives = 48/105 (45%), Gaps = 1/105 (0%)

Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
           +P+ ++D   S+ +   D +      +  + D  DDED      ++ + ++   ++   +
Sbjct: 698 RPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSF 757

Query: 494 DSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKE 538
           D     +DS  +S   + + +++KE+  +   +S K  K K+ K 
Sbjct: 758 DGEQDNSDSFAESSEEDESSEEEKEE-EENKEVSAKRAKKKQRKN 801


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 33.2 bits (76), Expect = 0.52
 Identities = 23/129 (17%), Positives = 36/129 (27%), Gaps = 9/129 (6%)

Query: 645 KETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE 704
               S +  P   + +   +   S  D +D+    S +   S         KK     SE
Sbjct: 193 ALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSE 252

Query: 705 SDSGESEKEKKSKSKPKGDKSKS--KSKKE-------DEIDDEEYDEPVESTPPSSDAGS 755
           + S    K+         D   S    KK             E   E  ++   SS +  
Sbjct: 253 APSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSL 312

Query: 756 DASSSEDDD 764
              S  D+ 
Sbjct: 313 GDYSDSDES 321



 Score = 30.5 bits (69), Expect = 3.6
 Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 34/201 (16%)

Query: 385 GSTRSFDFEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKE--DFG 442
           G+TR+++ +   +          E+E  G       E +              +E  +  
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGDAM-KKLENRTADSKREMEVLERLEELKELQ 161

Query: 443 SSDDEKEPDAYL---ARVKREAAERDDED--------------------DEDEDESTDED 479
           S   + + ++ L    R +++  E ++E+                     +DED   DE+
Sbjct: 162 SRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEE 221

Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
            N +           + PT   S     A++ ++     K++K      KPR +      
Sbjct: 222 DNDNTPSPKSGSSSPAKPT---SILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA-----L 273

Query: 540 SGGSTKRKKKDKNAPKKPMSA 560
           S    ++K   ++  + P SA
Sbjct: 274 SSLVVRKKAAPESTSQSPSSA 294


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 32.7 bits (74), Expect = 0.53
 Identities = 26/106 (24%), Positives = 48/106 (45%), Gaps = 6/106 (5%)

Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS-ESESDSGESEKEKKSKS 718
           KE  +  G  SD  D +K   K+   ++E+ +   + +K++      +  E       +S
Sbjct: 150 KEVDDILGEGSDDSDSEK---KKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERS 206

Query: 719 KPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
              G   +   +K +E  ++E D   ES+  SS+   + SSSE D+
Sbjct: 207 NIGGRGPRGHKRKLNE--EDEEDAESESSFESSNEDEEGSSSEADE 250


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 32.9 bits (75), Expect = 0.53
 Identities = 23/106 (21%), Positives = 34/106 (32%), Gaps = 13/106 (12%)

Query: 625 GGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKR 684
           G        KT  +K  AKK    + + +    K K               K A S +K 
Sbjct: 195 GAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKA-------------AKTAVSAKKA 241

Query: 685 DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
                  AK  KK  K   +  +   +K  K  +K     +K  +K
Sbjct: 242 AKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAK 287



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 26/99 (26%), Positives = 42/99 (42%), Gaps = 3/99 (3%)

Query: 633 GKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKA 692
            KT+ S K A K  TA+ A    K  +K+ ++    +      KKA+    + +   +KA
Sbjct: 232 AKTAVSAKKAAK--TAAKAAKKAKKTAKKALKKAAKAV-KKAAKKAAKAAAKAAKGAAKA 288

Query: 693 KTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
              K K K ++   +    K K +   PK      K+KK
Sbjct: 289 TKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKK 327



 Score = 30.6 bits (69), Expect = 3.4
 Identities = 28/155 (18%), Positives = 46/155 (29%), Gaps = 20/155 (12%)

Query: 506 SDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWF 565
           ++A +  K    +   K  +   K    K  K+ +  +    KK      K         
Sbjct: 197 ANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAK------ 250

Query: 566 NDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESG 625
                K KK          AKK  +       K   +  +   KA +   +A K   ++ 
Sbjct: 251 -----KAKKT---------AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296

Query: 626 GGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK 660
                     SK+K  AK  +  +      KV  K
Sbjct: 297 KKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331



 Score = 30.2 bits (68), Expect = 4.1
 Identities = 11/58 (18%), Positives = 20/58 (34%)

Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           + KKA+      S     +   K   K  S+  + ++    K  +K     +K   K 
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255



 Score = 30.2 bits (68), Expect = 4.4
 Identities = 31/136 (22%), Positives = 48/136 (35%), Gaps = 10/136 (7%)

Query: 606 KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHK--VKSKEFI 663
             N  K     A K+  +    +     K    KK AK   +A  A        K  +  
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSK--AKTKKKKEKSESE------SDSGESEKEKK 715
                   +   KKA+ K  + + + +K  AK  K K K++ +      + S      K 
Sbjct: 256 AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315

Query: 716 SKSKPKGDKSKSKSKK 731
            K   KG K+K  +KK
Sbjct: 316 PKRGAKGKKAKKVTKK 331



 Score = 29.5 bits (66), Expect = 7.6
 Identities = 24/157 (15%), Positives = 50/157 (31%), Gaps = 2/157 (1%)

Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
           MM  +++  +I +    +      K          +K+ +   +   K       A  A 
Sbjct: 177 MMTIDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV 236

Query: 622 KESGGGQDSDDGKTSKSKKPAK--KKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
                 + +        K   K  KK   +   +  K        + G++  +    KA 
Sbjct: 237 SAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296

Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
            K  + +   SKAK   K  K  ++    +   +K++
Sbjct: 297 KKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 33.1 bits (76), Expect = 0.55
 Identities = 18/108 (16%), Positives = 39/108 (36%), Gaps = 1/108 (0%)

Query: 644 KKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED-SKAKTKKKKEKSE 702
           + E         +  +K+  + +    D DD   A    +    E+    K KKKK+K +
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
            + + GE+              +   +    ++ +    E  ++ PP+
Sbjct: 67  KKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPT 114


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 32.7 bits (74), Expect = 0.57
 Identities = 17/118 (14%), Positives = 42/118 (35%)

Query: 438 KEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNP 497
            ++        +PD      +R       E++EDE+  +D     ++  ++  EE     
Sbjct: 160 NDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGS 219

Query: 498 TPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPK 555
              +   D +     KK+ +E+  +   I+    + +++  S       ++      K
Sbjct: 220 DDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQK 277


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 31.5 bits (71), Expect = 0.58
 Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 1/77 (1%)

Query: 429 TGKGEKPNYKEDFGSSDDEKEPDAYLA-RVKREAAERDDEDDEDEDESTDEDFNPDQAES 487
            G+ EK        +++  +E +A  A     EAAE D  D+ DE+E TDE  +    E+
Sbjct: 55  AGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114

Query: 488 DVAEEYDSNPTPTDSDE 504
           D   E        +++ 
Sbjct: 115 DAEAEEADEEEDEEAEA 131



 Score = 29.9 bits (67), Expect = 1.9
 Identities = 11/52 (21%), Positives = 18/52 (34%)

Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
             A    +  +E E    DED +         E  +   T    DE +D ++
Sbjct: 64  AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEAD 115



 Score = 29.5 bits (66), Expect = 3.1
 Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 2/93 (2%)

Query: 427 KNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERD-DEDDEDEDESTDEDFNPDQA 485
           KN   G +    E   +   EK P A        A E +  + DED DE+ + D   D+A
Sbjct: 39  KNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEAD-AADEA 97

Query: 486 ESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKE 518
           + +   +   + T  ++D +++ ++  + ++ E
Sbjct: 98  DEEEETDEAVDETADEADAEAEEADEEEDEEAE 130


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 33.3 bits (76), Expect = 0.59
 Identities = 25/136 (18%), Positives = 48/136 (35%), Gaps = 5/136 (3%)

Query: 631 DDGKTSKSKKPAKKKETASTAVSPHKV---KSKEFIES--NGSSSDSDDDKKASSKRKRD 685
           D   T    +P    E   T  +P KV   +S+    S   G+S  S    + SS   R 
Sbjct: 156 DSSPTVIGPRPRSFSELNLTDRTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRR 215

Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
               +  + K     +  +  S  S+ +   +++    KS  KS  + +         + 
Sbjct: 216 LRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLR 275

Query: 746 STPPSSDAGSDASSSE 761
           ++P  +   S +  + 
Sbjct: 276 ASPKDTSKSSKSEVAP 291


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 32.8 bits (75), Expect = 0.63
 Identities = 12/45 (26%), Positives = 18/45 (40%)

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEK 700
           K+  K   E  G     D+D +     +   + + K K KKK  K
Sbjct: 322 KLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRK 366



 Score = 32.1 bits (73), Expect = 1.2
 Identities = 28/132 (21%), Positives = 54/132 (40%), Gaps = 2/132 (1%)

Query: 472 EDESTDEDFN-PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKP 530
           E+ES D++ +   + E++ A          DS   +D      KKK +K R T  +  +P
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQK-RTTRRVKMRP 300

Query: 531 RKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGE 590
            ++K   E S   +   ++     +K +S ++ +   + E  + ++   S  E+ KK   
Sbjct: 301 VRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKV 360

Query: 591 LWKTVSSKEKAE 602
             K    K    
Sbjct: 361 KKKPRKRKVNPV 372



 Score = 30.5 bits (69), Expect = 3.4
 Identities = 20/103 (19%), Positives = 38/103 (36%), Gaps = 3/103 (2%)

Query: 622 KESGGGQDSDDGKTSKSKKPAKKKET--ASTAVSPHKVKSKEFI-ESNGSSSDSDDDKKA 678
           +E    Q ++D      KK  +K+ T          K   +  + ES+        D+K+
Sbjct: 268 EEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKS 327

Query: 679 SSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
            S+         +    +  E+S+ E +  +  K+K  K K  
Sbjct: 328 LSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 33.2 bits (76), Expect = 0.66
 Identities = 25/141 (17%), Positives = 58/141 (41%), Gaps = 16/141 (11%)

Query: 602 EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK- 660
           E +     A  D ++     ++    ++ D+   S +K    K+E A       + K + 
Sbjct: 17  ELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEI 76

Query: 661 -EFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
            + +E   ++ D+D               ++K K + K    ++E  +  ++ ++ + +K
Sbjct: 77  QKILEQQNAAIDADM--------------NNKGKGRLKYLLQQTEIFAHFAKGDQSASAK 122

Query: 720 PKGDKSKSKSKKEDEIDDEEY 740
               + +  SK  +E +DEEY
Sbjct: 123 KAKGRGRHASKLTEEEEDEEY 143



 Score = 32.5 bits (74), Expect = 0.98
 Identities = 31/156 (19%), Positives = 59/156 (37%), Gaps = 12/156 (7%)

Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNP---TPTDSDEDSDASEGTKKK--KK 517
           E +  + +  +E  +E+       +    + D  P      D ++D +A    K +  K+
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 518 EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNP 577
           EK+R       K +K ++ ++           D N   K    Y++   ++     K + 
Sbjct: 61  EKARLK---ELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQ 117

Query: 578 GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
             S    AKK     +  S   + E DE+  K +ED
Sbjct: 118 SAS----AKKAKGRGRHASKLTEEEEDEEYLKEEED 149



 Score = 30.5 bits (69), Expect = 4.7
 Identities = 13/54 (24%), Positives = 27/54 (50%)

Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
           E   +  +E+E +  L  V R A    D+D+   ++  +++ + ++AES    E
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAE 56


>gnl|CDD|215584 PLN03113, PLN03113, DNA ligase 1; Provisional.
          Length = 744

 Score = 33.0 bits (75), Expect = 0.70
 Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 9/121 (7%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
             +++    K+ +T S + SP K K  E  ++N   ++  +     ++   +   DS   
Sbjct: 13  SNARAAAKKKQPQTQSQSSSPKKRKIGETQDANLGKTNVSEGTLPKTEDTIEPKSDSA-- 70

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK----SKKEDEIDDEE--YDEPVEST 747
            K +   S    DS    K+ ++ SKPK D+ KSK     KK ++ D E+  Y E  E  
Sbjct: 71  -KPRSSTSSIAEDSKTGTKKAQTLSKPKKDEMKSKIGLLKKKPNDFDPEKVAYWEKGERV 129

Query: 748 P 748
           P
Sbjct: 130 P 130


>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain.  This is
           the C-terminus of a family of proteins conserved from
           plants to humans. The plant members are localised to the
           Golgi proteins and appear to regulate membrane
           trafficking, as they are required for rapid vesicle
           accumulation at the tip of the pollen tube. The
           C-terminus probably contains the Golgi localisation
           signal and it is well-conserved.
          Length = 451

 Score = 32.6 bits (75), Expect = 0.76
 Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 2/76 (2%)

Query: 623 ESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKR 682
           +S   ++  D  +S S    +   T+ ++ S   + S         SSD   DK+ S K 
Sbjct: 306 DSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSLLS--SSSILSKSSDKSKDKRFSLKL 363

Query: 683 KRDSDEDSKAKTKKKK 698
            +   E+S    +   
Sbjct: 364 SKSEKEESDDLEEMIS 379


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 32.6 bits (74), Expect = 0.86
 Identities = 52/337 (15%), Positives = 95/337 (28%), Gaps = 33/337 (9%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDD--EDEDESTDEDFNPDQAESDVAEEYDSNPTPT 500
           S+ D    +  L   +R    +    D  ED+ ES  +     +  S+          P 
Sbjct: 280 SAQDNHYDNTQLQTPERSTKRKSPIWDLKEDQKESKIKSGTNLKLSSE------KESIPE 333

Query: 501 DSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSA 560
            S  +    E +   + +K +   + S+     K+  E    S K K+            
Sbjct: 334 TSYVNVLEEEQSPLVRLQKRKLARSTSKTLESLKEVFEDQASSVKNKQAQSEENLNESPK 393

Query: 561 YMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKA 620
             +        +K          + K   E         K     K    K +    +  
Sbjct: 394 TPIAVTG-DPHLKDPTIIAGKKLMNKLTSEKINNPVKVVKVS---KYKGNKSEKKRDINV 449

Query: 621 YKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSD-------SD 673
                    S        KK  K K+T      P   KSK+ + +N S +        + 
Sbjct: 450 LDTIFASPVS----KELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAK 505

Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK------SKPKGDKSKS 727
            + KA+ +   +       K K  K+ S S S         + +       K +     +
Sbjct: 506 TNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNA 565

Query: 728 KSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
           K     + +     +       S DA + A  +   +
Sbjct: 566 KFPTVSKNNAYTLVDI----STSEDAVNSADDTRSVN 598


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 31.3 bits (71), Expect = 0.98
 Identities = 19/67 (28%), Positives = 28/67 (41%)

Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
           K F E + S  + ++ ++ S+      D        K+K+K E      E EK K    K
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145

Query: 720 PKGDKSK 726
           PK  K K
Sbjct: 146 PKPSKPK 152



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 12/79 (15%)

Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
           ++ RE +  DDE++E E+ESTDE    D  E                 ++    E  K K
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPE------------TKTESKEKKKREVPKPK 133

Query: 516 KKEKSRKTITISEKPRKSK 534
            +++  KT     KP K K
Sbjct: 134 TEKEKPKTEPKKPKPSKPK 152



 Score = 28.6 bits (64), Expect = 6.6
 Identities = 17/61 (27%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 498 TPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
           + +D +E+    E T + ++E   +T T S++ +K +  K         K+K K  PKKP
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----PKTEKEKPKTEPKKP 146

Query: 558 M 558
            
Sbjct: 147 K 147


>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
           TFIIE alpha.  TFIIE is compiled of two subunits, alpha
           and beta. This family of proteins are the C terminal
           domain of the alpha subunit of the protein which is the
           largest subunit and contains several functional domains
           which are important for basal transcription and cell
           growth. The C terminal end of the protein binds directly
           to the amino-terminal PH domain of p62/Tfb1 (of IIH)
           which is involved in the recruitment of the general
           transcription factor IIH to the transcription
           preinitiation complex. P53 competes for the same binding
           site as TFIIE alpha which shows their structural
           similarity. Like p53, TFIIE alpha 336-439 can activate
           transcription in vivo.
          Length = 86

 Score = 29.8 bits (67), Expect = 1.0
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPV 744
           SESD+ ES+ +  S   P G     +   ED+  +E  DEP 
Sbjct: 1   SESDTSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEPT 42



 Score = 27.9 bits (62), Expect = 5.6
 Identities = 7/37 (18%), Positives = 19/37 (51%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
           +S+ + +  +      +   E +D++D++ +E  DE 
Sbjct: 5   TSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEP 41


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 25/119 (21%), Positives = 45/119 (37%), Gaps = 15/119 (12%)

Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE-SDVAEEYDSNPT 498
           D   S D  E               DD  +ED DE   +D  PD  E S+   E      
Sbjct: 211 DMLGSMDMAEETG------------DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD 258

Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
            ++ DE ++A++G + ++ E      +   +  +S +D E+ G   +         ++ 
Sbjct: 259 ESEEDE-AEATDG-EGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFTELMEEV 315



 Score = 31.4 bits (71), Expect = 2.3
 Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 4/83 (4%)

Query: 432 GEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDE-DESTD---EDFNPDQAES 487
           G + +   D    D+ +  D      + +AAE  ++ + DE DE T+   ED  P    +
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFT 309

Query: 488 DVAEEYDSNPTPTDSDEDSDASE 510
           ++ EE D      + DE   A E
Sbjct: 310 ELMEEVDYKVFTREFDEIVLAEE 332



 Score = 31.0 bits (70), Expect = 2.5
 Identities = 23/98 (23%), Positives = 40/98 (40%), Gaps = 8/98 (8%)

Query: 659 SKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGE-SEKEKKSK 717
             + IE +    D DDD+        D++EDS+A  ++ +   ESE D  E ++ E +  
Sbjct: 223 GDDGIEEDADEEDGDDDQP-------DNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275

Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
                + S+     E + D E   E      P ++   
Sbjct: 276 EMDAAEASEDSESDESDEDTETPGEDARPATPFTELME 313


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 32.3 bits (73), Expect = 1.1
 Identities = 35/160 (21%), Positives = 67/160 (41%), Gaps = 6/160 (3%)

Query: 604 DEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFI 663
           D K  +++ED   A +  +E    Q   D K  +    A+            K +  + +
Sbjct: 202 DLKERESQEDAKRAQQLKEELDKKQIDAD-KAQQKADFAQDNADKQRDEVRQKQQEAKNL 260

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSG--ESEKEKKSKSKPK 721
                +S   +DK+ +  +KR+  E ++ + KK  E++    D    + ++E K+  K  
Sbjct: 261 PKPADTSSPKEDKQVAENQKREI-EKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEA 319

Query: 722 GDKSKSKSKKEDEI--DDEEYDEPVESTPPSSDAGSDASS 759
            DK     KK + +  D ++    VE+ P S +  +  SS
Sbjct: 320 EDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAIDSS 359



 Score = 29.2 bits (65), Expect = 10.0
 Identities = 26/115 (22%), Positives = 47/115 (40%), Gaps = 9/115 (7%)

Query: 455 ARVKREAAERDDEDDEDEDE-STDEDFNPDQAESDVAE------EYDSNPTPTDSDEDSD 507
           A+  ++  E  D+   D D+     DF  D A+    E      E  + P P D+    +
Sbjct: 212 AKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE 271

Query: 508 ASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKR--KKKDKNAPKKPMSA 560
             +  + +K+E  +  I I +   ++ K K+      K+  K  +K A  K + A
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEA 326


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 17/75 (22%), Positives = 29/75 (38%), Gaps = 4/75 (5%)

Query: 659 SKEFIESNGSSSDSDDDKK----ASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEK 714
            KE ++S         DK        KR+ + D ++     +KK KS+ E    + ++ K
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267

Query: 715 KSKSKPKGDKSKSKS 729
             K+  K      K 
Sbjct: 268 GVKALKKVVAKGMKK 282


>gnl|CDD|234027 TIGR02824, quinone_pig3, putative NAD(P)H quinone oxidoreductase,
           PIG3 family.  Members of this family are putative
           quinone oxidoreductases that belong to the broader
           superfamily (modeled by Pfam pfam00107) of
           zinc-dependent alcohol (of medium chain length)
           dehydrogenases and quinone oxiooreductases. The
           alignment shows no motif of conserved Cys residues as
           are found in zinc-binding members of the superfamily,
           and members are likely to be quinone oxidoreductases
           instead. A member of this family in Homo sapiens, PIG3,
           is induced by p53 but is otherwise uncharacterized
           [Unknown function, Enzymes of unknown specificity].
          Length = 325

 Score = 31.8 bits (73), Expect = 1.1
 Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 9/58 (15%)

Query: 578 GISFTEI--AK-KGGELWKTVSSKEKAEYDEK------VNKAKEDYNEALKAYKESGG 626
           GI  T I  AK  G  ++ T  S EK    E       +N  +ED+ E +KA     G
Sbjct: 151 GIGTTAIQLAKAFGARVFTTAGSDEKCAACEALGADIAINYREEDFVEVVKAETGGKG 208


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 4/90 (4%)

Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
           +K+  +   + S       ++K    +ES   +  S+K  K K   KG  + +K    + 
Sbjct: 381 EKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKK---KGKSTSTKGGTAES 437

Query: 735 IDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
           I D+E D P +    +   G D SS    D
Sbjct: 438 IPDDEEDAPKKGKK-NQKKGRDKSSKVPSD 466



 Score = 32.1 bits (73), Expect = 1.2
 Identities = 24/108 (22%), Positives = 42/108 (38%), Gaps = 11/108 (10%)

Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEK-KSKSKPKG 722
            SN SS  + +  K S K+K  S        +   +  E     G+  ++K + KS    
Sbjct: 405 GSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVP 464

Query: 723 DKSKSKSKKE-DEIDDEEYDEP---------VESTPPSSDAGSDASSS 760
             SK+  KKE  +  ++  + P         +E  P   + G++   S
Sbjct: 465 SDSKAGGKKESVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGTEDPGS 512



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 20/98 (20%), Positives = 34/98 (34%), Gaps = 4/98 (4%)

Query: 447 EKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQA----ESDVAEEYDSNPTPTDS 502
           EKE DA+  +          E     +ES+    N D+     +                
Sbjct: 381 EKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPD 440

Query: 503 DEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESS 540
           DE+    +G K +KK + + +   S+     KK+   S
Sbjct: 441 DEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 28/142 (19%), Positives = 54/142 (38%), Gaps = 32/142 (22%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
            E K R +   +GE+   ++D   +DD+++        + EA E  + + E + E  +ED
Sbjct: 635 DENKSRHQQLFEGEE-EDEDDLEETDDDED--------ECEAIEDSESESESDGEDGEED 685

Query: 480 FNPDQAES-----DVAEEYDS------NPTPTDSDEDSDASEGTKKKK------------ 516
              D AE+      + +          N        DS+  EG   ++            
Sbjct: 686 EQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQMMRLDTYLAQIF 745

Query: 517 KEKSRKTITISEKPRKSKKDKE 538
           KEK  +     E  ++++  K+
Sbjct: 746 KEKKERIQAGGETKKEAQSQKQ 767


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 32.1 bits (73), Expect = 1.1
 Identities = 13/62 (20%), Positives = 24/62 (38%)

Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
            +K    K+ + SD         + E   S S+  E  + +   SK +     ++ K  +
Sbjct: 72  TEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEGARVKNIN 131

Query: 734 EI 735
           EI
Sbjct: 132 EI 133


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 32.4 bits (73), Expect = 1.2
 Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 2/85 (2%)

Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
           D K   S     KK TA      +K    + I+S+   +D++     SS+ + + D  + 
Sbjct: 648 DKKKDISTNRKVKKRTAKRKTVGYKTDKSKKIKSDSLPTDTNV-IVISSESEDEEDGFNI 706

Query: 692 AKTKKKKEKSESESDSGESEKEKKS 716
            K  + K+K +SE  S ES  E   
Sbjct: 707 IKKSQLKKKIKSELKS-ESSSESDD 730


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.8 bits (73), Expect = 1.3
 Identities = 18/82 (21%), Positives = 33/82 (40%), Gaps = 13/82 (15%)

Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
            P   + +E  E    +    + KK   K++       KA+ K +KE++E       + K
Sbjct: 390 KPLLAEGEEE-EGENGNLSPAERKKLRKKQR-------KAEKKAEKEEAEK-----AAAK 436

Query: 713 EKKSKSKPKGDKSKSKSKKEDE 734
           +K   +  K      ++KK D 
Sbjct: 437 KKAEAAAKKAKGPDGETKKVDP 458



 Score = 31.1 bits (71), Expect = 2.9
 Identities = 14/70 (20%), Positives = 31/70 (44%)

Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
                 + + ++   S A+ KK ++K        E E+ +K+ +K K + +  K+K  D 
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451

Query: 735 IDDEEYDEPV 744
              +   +P+
Sbjct: 452 ETKKVDPDPL 461


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.7 bits (72), Expect = 1.4
 Identities = 22/149 (14%), Positives = 55/149 (36%), Gaps = 15/149 (10%)

Query: 611 KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
           K    +  K +      Q  +  K  K+     ++      ++P +             +
Sbjct: 276 KYPLTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELH--------A 327

Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKE-KSESESDSGESEKEKKSKSKPKGDKSKSKS 729
           ++ + KK  +  K ++    K   +K++  + E   ++ +      ++++ +    K+K 
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387

Query: 730 KKEDEIDDEEYDEPVES-----TPPSSDA 753
           K   +    E D P E+     +PP  +A
Sbjct: 388 KGLIDASPNE-DTPSENEESKGSPPQVEA 415


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 31.3 bits (71), Expect = 1.5
 Identities = 19/73 (26%), Positives = 34/73 (46%)

Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
           K  T  +K  +  +S   + EK  + + KP+ D  K++  + D  ++EE  +  ES   +
Sbjct: 54  KRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASAN 113

Query: 751 SDAGSDASSSEDD 763
           S +  D     DD
Sbjct: 114 SLSDIDNEDDMDD 126


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 55/316 (17%), Positives = 102/316 (32%), Gaps = 52/316 (16%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAA---ERDDEDDEDEDEST 476
             +KLR     + E      D G    +KE +A+ A          +   E+        
Sbjct: 512 WRQKLRNNK--EMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLEE 569

Query: 477 DEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKD 536
               N      + +E  ++         D+   E         SR     SE       +
Sbjct: 570 RSKTN----IKERSENVNTIYGLDKLARDT---ENRDVTYVPTSRYDGIESEIDDVYTYE 622

Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMR-----------EKIKKDNPGISFTEIA 585
            +S   ++  +++ K   ++   AY+ +  D +           E +   + G    E+ 
Sbjct: 623 NDSESIASSERRRIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAEVG 682

Query: 586 KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQD----SDDGKTSKSKKP 641
              G   K   S    +  +  N+AK+    A K       G D     + G+  +  + 
Sbjct: 683 VHNGGRLKRALSPPLEKLHQITNEAKKSKYYAYKV------GNDLIIEDESGEVFRRYR- 735

Query: 642 AKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA------SSKRKRDSDEDSKAKTK 695
                     +SPH  K K  I+    S  S D++KA        +R       S+ +  
Sbjct: 736 ----------ISPHGGKRK--IKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMA 783

Query: 696 KKKEKSESESDSGESE 711
             + +SE+  D  +S+
Sbjct: 784 DDEAESENMDDYEDSD 799


>gnl|CDD|241458 cd13304, PH2_Rtt106, Histone chaperone RTT106, regulator of Ty1
           transposition protein 106 Pleckstrin homology (PH)
           domain.  Rtt106 is a histone chaperone. Rtt106 contains
           an N-terminal homodimerization domain and two C-terminal
           pleckstrin-homology (PH) domains (PH1 and PH2). The
           binding of Rtt106 to H3K56-acetylated (H3-H4)2 tetramers
           contributes to nucleosome assembly in terms of DNA
           replication, gene silencing and maintenance of genomic
           stability. The N-terminal domain homodimerizes
           homodimerizes and interacts with H3-H4 independently of
           acetylation while the double PH domain binds the
           K56-containing region of H3. Rtt106 also interacts with
           both the SWI/SNF and RSC chromatin remodeling complexes
           and is involved in their cell-cycle dependent
           recruitment to histone gene pairs regulated by the HIR
           co-repressor complex (HTA1-HTB1, HHT1-HHF1, and
           HHT2-HHF2). This cd contains the second PH domain
           repeat. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 89

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 7/84 (8%)

Query: 342 VTCSYKAAAGYIYPLERGFIFIHKPPIHI-RFEEIATVNFARSGGSTRSFDFEIELKSGV 400
           V     A  G +  L+   IF  K PI +    +I ++++  S  +  +F+  + +K+  
Sbjct: 7   VEAHKGAKEGVLLFLQDYIIFGFKKPILLFDISQIESISY--SNITRLTFNLNLTVKNAK 64

Query: 401 LHT----FSSIEKEEYGKLFDFIR 420
                  FS I++  +  + DFI+
Sbjct: 65  KEEETLEFSMIDQAFFQVIDDFIK 88


>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
          Length = 307

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 22/68 (32%)

Query: 489 VAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKK 548
           +AEE+  NP               K  K    RK +T S+    +           K+KK
Sbjct: 259 IAEEFAKNPP--------------KAYKSPSGRKDVTPSQNQAIASL--------PKKKK 296

Query: 549 KDKNAPKK 556
           K++NA K+
Sbjct: 297 KNRNAGKQ 304


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 31.7 bits (72), Expect = 1.8
 Identities = 17/80 (21%), Positives = 40/80 (50%), Gaps = 2/80 (2%)

Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKE 518
           RE  E +DE++E++++  D+  + +  + D     +     ++S  D    +  + +  +
Sbjct: 132 REDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDAD 191

Query: 519 KSRKTITISEKPRKSKKDKE 538
            S+K +   E+P+  KK +E
Sbjct: 192 LSKKDVL--EEPKMFKKPEE 209



 Score = 29.4 bits (66), Expect = 8.3
 Identities = 19/62 (30%), Positives = 27/62 (43%)

Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKK 517
           + E  + DD+DD  +DE  DED      E   + E   +    D +ED DA    K   +
Sbjct: 140 EEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLE 199

Query: 518 EK 519
           E 
Sbjct: 200 EP 201


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.2 bits (71), Expect = 1.8
 Identities = 15/80 (18%), Positives = 33/80 (41%), Gaps = 3/80 (3%)

Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK---EDEIDDEEYDEPV 744
           E+++   K KKE   +  D+ +S     S    + D  + + K+   E++  + + D+  
Sbjct: 64  EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123

Query: 745 ESTPPSSDAGSDASSSEDDD 764
            S+   S          +D+
Sbjct: 124 SSSDSDSSDDDSDDDDSEDE 143



 Score = 28.9 bits (65), Expect = 7.9
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
           E  E DDE+DE + +  +ED     A+   +     +   +D D D D SE
Sbjct: 94  EGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDS---SDDDSDDDDSE 141


>gnl|CDD|238859 cd01821, Rhamnogalacturan_acetylesterase_like,
           Rhamnogalacturan_acetylesterase_like subgroup of
           SGNH-hydrolases. Rhamnogalacturan acetylesterase removes
           acetyl esters from rhamnogalacturonan substrates, and
           renders them susceptible to degradation by
           rhamnogalacturonases. Rhamnogalacturonans are highly
           branched regions in pectic polysaccharides, consisting
           of repeating -(1,2)-L-Rha-(1,4)-D-GalUA disaccharide
           units, with many rhamnose residues substituted by
           neutral oligosaccharides such as arabinans, galactans
           and arabinogalactans. Extracellular enzymes
           participating in the degradation of plant cell wall
           polymers, such as Rhamnogalacturonan acetylesterase,
           would typically be found in saprophytic and plant
           pathogenic fungi and bacteria.
          Length = 198

 Score = 30.6 bits (70), Expect = 1.8
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 373 EEIATVNFARSGGSTRSFDFE 393
             I  VN A+ G S+RSF  E
Sbjct: 33  TGITVVNHAKGGRSSRSFRDE 53


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 31.6 bits (70), Expect = 1.8
 Identities = 23/98 (23%), Positives = 52/98 (53%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           K  KS+K   +K+ A    +  KV +K+  E     ++++  K+A+ + +    +  + +
Sbjct: 69  KVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQER 128

Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
            +K++EK E++    ++EK+K+   K +  +   K+KK
Sbjct: 129 IQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKK 166


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 31.6 bits (72), Expect = 2.0
 Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
           K  + +Y  D  +    +  DA   R   E  + ++E++EDEDE   ++     ++ +  
Sbjct: 365 KETEQDYILD--AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEH----SDDEEF 418

Query: 491 EEYDSNPTPTDSDEDSDASEGTKK 514
           EE D      DSD +S  + G K 
Sbjct: 419 EEDDVESKYEDSDGNSSLAVGYKN 442


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 3/117 (2%)

Query: 620 AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
           A + S    +      +  + PA     +S   +P       + E       SD    +S
Sbjct: 285 ARRISTPMTEEIKSWQTPLQTPA---MYSSDYQAPKPEPIYTWEELLRERFPSDLFAISS 341

Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
                    DS    K+K+ +     +    E E  S       + + K ++E++ +
Sbjct: 342 LPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDKN 398


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 31.3 bits (71), Expect = 2.1
 Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 11/70 (15%)

Query: 425 RVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQ 484
           + K   K  K          D + E D             DD+DDED  ES  +D + D 
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGD-----------GSDDDDDEDAIESDLDDSDDDV 276

Query: 485 AESDVAEEYD 494
           ++ D  + +D
Sbjct: 277 SDEDGEDLFD 286



 Score = 30.5 bits (69), Expect = 3.5
 Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 10/70 (14%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
               ++      +  +R  A+ D  D +DE + +D+D + D  ESD            DS
Sbjct: 223 MKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESD----------LDDS 272

Query: 503 DEDSDASEGT 512
           D+D    +G 
Sbjct: 273 DDDVSDEDGE 282


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 31.3 bits (71), Expect = 2.2
 Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 9/89 (10%)

Query: 641 PAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEK 700
               +E     V     ++KE               K  +KRK+D+  D K+K K K ++
Sbjct: 172 SDSDQEAVLPLVKKSPSEAKEEKAEERKQE-----SKKGAKRKKDASGDDKSK-KAKTDR 225

Query: 701 SESESDSGESEKEKKSKSKPKGDKSKSKS 729
             S      +  +KKS       +++SK 
Sbjct: 226 DVST---STAASQKKSSDLESKLEAQSKE 251



 Score = 30.5 bits (69), Expect = 4.5
 Identities = 14/60 (23%), Positives = 30/60 (50%)

Query: 497 PTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKK 556
            +P+++ E+       + KK  K +K  +  +K +K+K D++ S  +   +KK  +   K
Sbjct: 185 KSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESK 244


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.1 bits (70), Expect = 2.3
 Identities = 24/135 (17%), Positives = 64/135 (47%), Gaps = 1/135 (0%)

Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
            ++A   E++ + +++  +A +  K++   +     +  + ++ A+K   A         
Sbjct: 94  PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARK-AAAEQKKKAEAA 152

Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
           K+K   E+    + ++  KKA    K   +  +KA+    K+K+E+E+ +   + + +++
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212

Query: 718 SKPKGDKSKSKSKKE 732
           +K K +K    + +E
Sbjct: 213 AKAKAEKKAEAAAEE 227



 Score = 30.3 bits (68), Expect = 4.3
 Identities = 27/143 (18%), Positives = 60/143 (41%), Gaps = 1/143 (0%)

Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
           KE+ +  E+  +A+E   +A    K+          +  K  + AK K  A  A      
Sbjct: 108 KERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAA 167

Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK-KKEKSESESDSGESEKEKKS 716
           ++K+  E    +++    K  ++  K+ ++ ++KA  +K K E         ++E   + 
Sbjct: 168 EAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227

Query: 717 KSKPKGDKSKSKSKKEDEIDDEE 739
           K+  +  K+ +K+K +      +
Sbjct: 228 KAAAEKKKAAAKAKADKAAAAAK 250


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 31.5 bits (71), Expect = 2.3
 Identities = 14/64 (21%), Positives = 25/64 (39%), Gaps = 1/64 (1%)

Query: 689 DSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTP 748
             KA  + K++  E   +  E + E  S    +G++   K ++E  ID        +  P
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVET-EGERCTIKQREEKGIDAPAILNVKKKKP 848

Query: 749 PSSD 752
              D
Sbjct: 849 YKVD 852


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 18/83 (21%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSG 708
             A    K +S     S  +  + D+D +++++ KR  +   KAK K +++K +++ +  
Sbjct: 26  KKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKR-PEGRKKAKEKLRRDKLKAKKEEA 84

Query: 709 ESEKEKKSKSKPKGDKSKSKSKK 731
           E EKEK+ +      +++ +  +
Sbjct: 85  EKEKEKEERFMKALAEAEKERAE 107


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.5 bits (69), Expect = 2.4
 Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 9/97 (9%)

Query: 418 FIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAE------RDDEDDED 471
            I +K  +     K ++          D +++      ++K   AE        DE++E+
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180

Query: 472 EDESTDEDFNPDQAESD---VAEEYDSNPTPTDSDED 505
           E+E  DEDF+ D  + D    AE Y  N    D D+D
Sbjct: 181 EEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDYDDD 217


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 31.0 bits (70), Expect = 2.5
 Identities = 13/96 (13%), Positives = 38/96 (39%), Gaps = 7/96 (7%)

Query: 654 PHKVKSKEFIESNGSSSDSDDDK-------KASSKRKRDSDEDSKAKTKKKKEKSESESD 706
           P K   K      G  SD D++              K+D  E+ +A  +++  +     +
Sbjct: 173 PMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMIN 232

Query: 707 SGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
               E+ +  + + + +++  +   +++   E+ ++
Sbjct: 233 WNLEERRRDLRKEQELEENVERDSDDEDEHGEDSED 268


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 31.2 bits (70), Expect = 2.5
 Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 9/104 (8%)

Query: 668 SSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK-EKKSKSKPKGDKSK 726
            +   ++D     +   +SD  S   ++ ++  SESE +S +S+  ++ S SK    ++ 
Sbjct: 138 KNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLRNY 197

Query: 727 SKSKKEDE--------IDDEEYDEPVESTPPSSDAGSDASSSED 762
             S K+DE         DD +  E ++  P  ++  S +S + D
Sbjct: 198 IVSLKKDEADAESVLSSDDNDSIEEIKYDPHETNKESGSSETID 241


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 30.8 bits (70), Expect = 2.8
 Identities = 18/99 (18%), Positives = 36/99 (36%), Gaps = 6/99 (6%)

Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS---DEDSDASEGT 512
           R +R+  E+    DE + E   E+  PD  + D  E+      P        D +     
Sbjct: 308 RRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDP 367

Query: 513 KKKKKEKSRKTITISE--KPRK-SKKDKESSGGSTKRKK 548
           +++   +   T    +  + R  +     + G S +R +
Sbjct: 368 QEEADGQGSSTDPAGDIFRIRVLAPPQARARGASGRRSR 406


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.9 bits (70), Expect = 2.8
 Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 4/86 (4%)

Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
            K  E ++SN    D   DK+     K  + +      K  + K E+++      K K+ 
Sbjct: 350 TKLYEEVKSNT---DLSGDKR-QELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEE 405

Query: 717 KSKPKGDKSKSKSKKEDEIDDEEYDE 742
           K K + ++ K K + +++ +  + DE
Sbjct: 406 KLKQEENEKKQKEQADEDKEKRQKDE 431


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 30.7 bits (70), Expect = 3.0
 Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 8/61 (13%)

Query: 462 AERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
            E DD DDE+E+    ++   D  + D   + D      D +E+SD         KE S 
Sbjct: 220 NELDDIDDEEEERVLADE---DDDDEDDMFDMD-----DDDEEESDPEVERTSTIKEVSE 271

Query: 522 K 522
           K
Sbjct: 272 K 272


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.4 bits (69), Expect = 3.0
 Identities = 24/102 (23%), Positives = 41/102 (40%)

Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
            DD +  KS  PA+  +T    +   + + +       S  D    KK++ K+++   E 
Sbjct: 128 VDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKEL 187

Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
            + K ++KK K   +    + E  KK K K K        K 
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKV 229


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 20/102 (19%), Positives = 32/102 (31%), Gaps = 19/102 (18%)

Query: 636 SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA-----SSKRKRDSDEDS 690
           S  + P    +TA       + K+K+              KK      S+ R     +  
Sbjct: 726 SSERAPRNVGKTA-------REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK 778

Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKE 732
            AK   +K K         S K +K  +  K  ++  K   E
Sbjct: 779 AAKKDARKAKK-------PSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|148314 pfam06632, XRCC4, DNA double-strand break repair and V(D)J
           recombination protein XRCC4.  This family consists of
           several eukaryotic DNA double-strand break repair and
           V(D)J recombination protein XRCC4 sequences. In the
           non-homologous end joining pathway of DNA double-strand
           break repair, the ligation step is catalyzed by a
           complex of XRCC4 and DNA ligase IV. It is thought that
           XRCC4 and ligase IV are essential for alignment-based
           gap filling, as well as for final ligation of the
           breaks.
          Length = 331

 Score = 30.6 bits (69), Expect = 3.3
 Identities = 22/68 (32%), Positives = 30/68 (44%)

Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAG 754
           KK K +S  +  S   E EK  K K +      K+  ED   D   DE  E+ P  S++ 
Sbjct: 187 KKAKIRSLHKLLSEAQELEKSIKQKKETSACSDKTPDEDSKYDGSTDEEQEAPPKPSESM 246

Query: 755 SDASSSED 762
             A S +D
Sbjct: 247 PAAVSKDD 254


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 30.8 bits (69), Expect = 3.3
 Identities = 29/119 (24%), Positives = 46/119 (38%), Gaps = 7/119 (5%)

Query: 448 KEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDS----NPTPTDSD 503
           K+    LA ++ E +++ DE  E  + S      P   E +  E  DS     P      
Sbjct: 490 KKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKP 549

Query: 504 EDSDASEGTKK---KKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMS 559
            ++   E  KK    K+ K  K  T+S+KP   K  K        +K K   + ++P  
Sbjct: 550 GETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTR 608


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 30.2 bits (68), Expect = 3.5
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 2/69 (2%)

Query: 685 DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPV 744
               DS+    +   K + + D GE E+   S S  +   + S+  +  E++  E     
Sbjct: 216 PESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASA-- 273

Query: 745 ESTPPSSDA 753
           + TP S DA
Sbjct: 274 DDTPDSDDA 282


>gnl|CDD|236289 PRK08563, PRK08563, DNA-directed RNA polymerase subunit E';
           Provisional.
          Length = 187

 Score = 29.8 bits (68), Expect = 3.6
 Identities = 10/15 (66%), Positives = 14/15 (93%)

Query: 293 QELKEKYEGKLDKEL 307
           + L+EKYEG++DKEL
Sbjct: 28  EVLREKYEGRIDKEL 42


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.8 bits (67), Expect = 3.6
 Identities = 14/74 (18%), Positives = 26/74 (35%), Gaps = 6/74 (8%)

Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
           +E +E N       D  +A      DS +      K +      + +  E  K     ++
Sbjct: 125 EEMLEENAKKGGKIDKMEA------DSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNE 178

Query: 720 PKGDKSKSKSKKED 733
            +    K  SKK++
Sbjct: 179 LREQVQKESSKKKE 192


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 24/135 (17%), Positives = 58/135 (42%), Gaps = 1/135 (0%)

Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
           KE+    E+  +A+E   +A    K++     +     +K+K  A+ K  A+ A      
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAA-AKAAAAAKAKAEAEAKRAAAAAKKAAAE 166

Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
             K+        + ++  KKA ++    +  ++K K + + +K  +     ++  E K+ 
Sbjct: 167 AKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAA 226

Query: 718 SKPKGDKSKSKSKKE 732
           +     ++K+ ++K 
Sbjct: 227 AAKAAAEAKAAAEKA 241


>gnl|CDD|176221 cd08260, Zn_ADH6, Alcohol dehydrogenases of the MDR family.
           NAD(P)(H)-dependent oxidoreductases are the major
           enzymes in the interconversion of alcohols and
           aldehydes, or ketones. This group has the characteristic
           catalytic and structural zinc sites of the
           zinc-dependent alcohol dehydrogenases.  Alcohol
           dehydrogenase in the liver converts ethanol and NAD+ to
           acetaldehyde and NADH, while in yeast and some other
           microorganisms ADH catalyzes the conversion acetaldehyde
           to ethanol in alcoholic fermentation. ADH is a member of
           the medium chain alcohol dehydrogenase family (MDR),
           which has a NAD(P)(H)-binding domain in a Rossmann fold
           of a beta-alpha form.  The NAD(H)-binding region is
           comprised of 2 structurally similar halves, each of
           which contacts a mononucleotide. A GxGxxG motif after
           the first mononucleotide contact half allows the close
           contact of the coenzyme with the ADH backbone. The
           N-terminal catalytic domain has a distant homology to
           GroES. These proteins typically form dimers (typically
           higher plants, mammals) or tetramers (yeast, bacteria),
           and have 2 tightly bound zinc atoms per subunit, a
           catalytic zinc at the active site and a structural zinc
           in a lobe of the catalytic domain. NAD(H)-binding occurs
           in the cleft between the catalytic  and coenzyme-binding
           domains at the active site, and coenzyme binding induces
           a conformational closing of this cleft. Coenzyme binding
           typically precedes and contributes to substrate binding.
           In human ADH catalysis, the zinc ion helps coordinate
           the alcohol, followed by deprotonation of  a histidine,
           the ribose of NAD, a serine, then the alcohol, which
           allows the transfer of a hydride to NAD+, creating NADH
           and a zinc-bound aldehyde or ketone. In yeast and some
           bacteria, the active site zinc binds an aldehyde,
           polarizing it, and leading to the reverse reaction.
          Length = 345

 Score = 30.3 bits (69), Expect = 3.8
 Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 1/43 (2%)

Query: 173 AGDTDPVEAFKDSVMNQASV-INATGDAIAVFNEIQCLTPRGR 214
           +   D   A +D     A V ++A G      N +  L  RGR
Sbjct: 217 SEVEDVAAAVRDLTGGGAHVSVDALGIPETCRNSVASLRKRGR 259


>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
           recombination, and repair].
          Length = 1041

 Score = 30.8 bits (70), Expect = 3.9
 Identities = 19/104 (18%), Positives = 34/104 (32%), Gaps = 3/104 (2%)

Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSD-DDKKASSKRKRDSDEDSKAKTKK 696
           +K      E    +    +V+ K       S  + D  D    S       +++  + K 
Sbjct: 4   TKNFRVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKA 63

Query: 697 KKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
             + SE ES +     + +  SK   D +       D + D E 
Sbjct: 64  VDKHSEIESLNVGMRLDVRLSSKELSDVAVPDD--YDMVPDAES 105


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 30.2 bits (68), Expect = 4.1
 Identities = 19/82 (23%), Positives = 36/82 (43%), Gaps = 7/82 (8%)

Query: 633 GKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKA 692
           GK SK+ K   K    +T     ++   + +    +       + + ++ K +S  +   
Sbjct: 2   GKASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKN-------RNSHTENKMESGTNDNN 54

Query: 693 KTKKKKEKSESESDSGESEKEK 714
           K K+K  K  S+ DS  SE+E+
Sbjct: 55  KNKEKLSKLYSDVDSSSSEEEE 76


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.5 bits (70), Expect = 4.1
 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 5/70 (7%)

Query: 411 EYGKLFDFIREKKLRVKNTGKGEKPNYKEDF-GSSDDEKEPDAYLARVKREAAERDDEDD 469
           E     D I E   R++N     +   +E   G  D   E D      + E  + D++++
Sbjct: 144 ESPLTIDAILEWYDRLEN----GERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEE 199

Query: 470 EDEDESTDED 479
           E+EDE+ D  
Sbjct: 200 EEEDENDDSL 209


>gnl|CDD|239823 cd04331, RNAP_E_N, RNAP_E_N: RpoE, N-terminal ribonucleoprotein
           (RNP) domain. RpoE (subunit E) is a subunit of the
           archaeal RNA polymerase (RNAP) that is homologous to
           Rpb7 of eukaryotic RNAP II, Rpc25 of eukaryotic RNAP
           III, and Rpa43 of eukaryotic RNAP I. RpoE
           heterodimerizes with RpoF, another RNA polymerase
           subunit. RpoE has an elongated two-domain structure that
           includes an N-terminal RNP domain and a C-terminal
           oligonucleotide-binding (OB) domain. Both domains of
           RpoE bind single-stranded RNA.
          Length = 80

 Score = 27.9 bits (63), Expect = 4.1
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query: 294 ELKEKYEGKLDKEL 307
            LKEKYEG+LDK+L
Sbjct: 28  ILKEKYEGRLDKDL 41


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
           plant proteins with unknown function.
          Length = 308

 Score = 30.1 bits (68), Expect = 4.3
 Identities = 20/94 (21%), Positives = 36/94 (38%)

Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
           SS S  D + +S RK   + D+    ++  +   S   +     + +  +         +
Sbjct: 91  SSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDRR 150

Query: 729 SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
           S+   E +  E      S P SS + S +SS E 
Sbjct: 151 SRDSSEAESTELSREEISPPSSSSSPSSSSSPET 184


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
           A+ K  K   E  ++  E+    K +   +G+K +SK KK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|221750 pfam12745, HGTP_anticodon2, Anticodon binding domain of tRNAs.
           This is an HGTP_anticodon binding domain, found largely
           on Gcn2 proteins which bind tRNA to down regulate
           translation in certain stress situations.
          Length = 260

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 15/104 (14%)

Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
           +EK L+VKN G+ E           D + + D  +  ++ E  ER+ +  E     +   
Sbjct: 72  KEKPLKVKNLGRKE-----------DTDIDYDELVPWLRSEIRERNGQKREGSALRSTLL 120

Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDS----DASEGTKKKKKEK 519
            +P Q E   +    S    T+  +       A+   K  +++K
Sbjct: 121 RHPSQPEDANSSSGSSQIGDTERQKVVVVLNAATRSKKSNRRDK 164


>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
           Provisional.
          Length = 593

 Score = 30.3 bits (68), Expect = 4.5
 Identities = 20/120 (16%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 592 WKTVSSKEKA---EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
           W+ +   +K    +  ++   A  +  EA   Y+ S    D+     S      KK   A
Sbjct: 111 WQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAK---SVYDAAEKKLTQA 167

Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDD-KKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
              +         + ++  +   +  +  +A     + +D   KA T   K K+E   + 
Sbjct: 168 QNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGT-DAKAKAEKADNI 226


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 29.1 bits (66), Expect = 4.5
 Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 5/48 (10%)

Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
           +DE   +   A       E DD+D++D D  +D        +SD   E
Sbjct: 8   EDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISD-----DSDFENE 50


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 30.1 bits (68), Expect = 4.6
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
           E AE D  D+++EDE+  E+   +  E +  EE +   +P D+D
Sbjct: 38  EEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASP-DAD 80



 Score = 29.4 bits (66), Expect = 7.3
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD 507
            +A+   ED+E ED+  DED      E D A   +     T+ +ED +
Sbjct: 29  ASAQDLTEDEEAEDDVVDED------EEDEAVVEEDENELTEEEEDEE 70


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.5 bits (69), Expect = 4.6
 Identities = 23/86 (26%), Positives = 39/86 (45%), Gaps = 13/86 (15%)

Query: 658  KSKEFIESNGSSS-----DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSES--ESDSGES 710
            K K FI SN   S      + ++K+ + + + +SD++ K   +      E   E D  ES
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 711  EKEKKSKSKPKGDKSKSKSKKEDEID 736
            + +K+   K      + KS  E E+D
Sbjct: 1565 DIKKRKNKK------QYKSNTEAELD 1584


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 29.8 bits (67), Expect = 4.6
 Identities = 22/122 (18%), Positives = 48/122 (39%), Gaps = 17/122 (13%)

Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
           ++ ++ +    P      +    NGSS++   D          S +D     ++  EK  
Sbjct: 177 ERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPD----------SSQDKNKSLEEYYEKES 226

Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
           S++ + + +        PKG   K+K   ++E DD++ D   E      +   +    E+
Sbjct: 227 SDAAASQDDG-------PKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEE 279

Query: 763 DD 764
           ++
Sbjct: 280 EE 281


>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Validated.
          Length = 339

 Score = 30.1 bits (69), Expect = 4.6
 Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 4/44 (9%)

Query: 580 SFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
             TE+ K        +  +E+ E    +N+ K+    AL+  KE
Sbjct: 38  ELTELLKG----LGKLPPEERKEAGALINELKQAIEAALEERKE 77


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 30.3 bits (69), Expect = 4.7
 Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 4/58 (6%)

Query: 676 KKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
           K  S K+     +   AK  +KKEK + +       K++   +K  G + K     E+
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV----KKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 29.7 bits (67), Expect = 5.2
 Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 9/77 (11%)

Query: 471 DEDESTDEDF--NPDQAES-DVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITIS 527
           D  ES    F  NP    S  +A          D D  S++S G K++ K+ S      +
Sbjct: 86  DVKESDPMSFITNPSLTVSVPIASGESDLENLNDEDTSSESSYGFKEESKKGS------A 139

Query: 528 EKPRKSKKDKESSGGST 544
           E  +  +KD  SS GST
Sbjct: 140 ETLKLEEKDSSSSEGST 156


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 30.0 bits (68), Expect = 5.5
 Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)

Query: 690 SKAKTKK--KKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
           SK   KK  K +++E E  + E  K   + +  KG   KS +  ++ +D  +Y  
Sbjct: 9   SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRA 63


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 28.8 bits (65), Expect = 5.7
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 437 YKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDE 478
           + +D    D + +  A   R +R+  E  D DDED D   +E
Sbjct: 90  FLDDDDDDDGDFDLTAQPRRRRRQYDEDRDADDEDIDPLEEE 131


>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 975

 Score = 29.9 bits (67), Expect = 5.8
 Identities = 20/128 (15%), Positives = 40/128 (31%), Gaps = 9/128 (7%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAE-EYDSNPTPTD 501
           SS+ E E  A  +R   + A    ED  D   +  +     +  ++  + + +  P    
Sbjct: 1   SSNPEMEDKAEDSRRSTKQASVWYEDRNDSRGNVSQWEATPRTANESKDLKQEQTPLGYI 60

Query: 502 SDEDSDAS--------EGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNA 553
              +   S        +    KK+E         E P   +  +    G    ++ +   
Sbjct: 61  QVREDLYSKTRMDPGFQPDSDKKRELELNNTWDMEYPDSKRSSRWDEMGYEPPQEINMCL 120

Query: 554 PKKPMSAY 561
           P +   A 
Sbjct: 121 PARGYKAL 128


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 29.9 bits (66), Expect = 5.9
 Identities = 29/127 (22%), Positives = 59/127 (46%), Gaps = 4/127 (3%)

Query: 605 EKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE 664
           +K +  K+D  ++    KE+    +S   KT  +K    + E++ST  +     ++   +
Sbjct: 493 QKQDSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAK 552

Query: 665 SNGSSSDSDDD---KKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
              +SS++  D     A S   +DS    KA  K   +   ++S + ++ +E K+KS P+
Sbjct: 553 PTTASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTND-GHTQSQNNKNTQENKAKSLPQ 611

Query: 722 GDKSKSK 728
             +  +K
Sbjct: 612 TGEESNK 618


>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
          Length = 259

 Score = 29.4 bits (65), Expect = 5.9
 Identities = 22/107 (20%), Positives = 35/107 (32%)

Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
           SSDD   PD   + V  +   R     E++ E   +         D   E  ++      
Sbjct: 114 SSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLTIDAKTEEITSEEDCCV 173

Query: 503 DEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
            EDS  SE    +   + R  +   +K  K            K ++K
Sbjct: 174 QEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPPRKSRRK 220


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
           family contains the human KIAA1549 protein which has
           been found to be fused fused to BRAF gene in many cases
           of pilocytic astrocytomas. The fusion is due mainly to a
           tandem duplication of 2 Mb at 7q34. Although nothing is
           known about the function of KIAA1549 protein, the BRAF
           protein is a well characterized oncoprotein. It is a
           serine/threonine protein kinase which is implicated in
           MAP/ERK signalling, a critical pathway for the
           regulation of cell division, differentiation and
           secretion.
          Length = 684

 Score = 29.9 bits (67), Expect = 6.0
 Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 17/94 (18%)

Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESD-------SGESEKEKKSK-SKPKGDKSK 726
           D +  + + + S + S  K +++  KS S+ D       S  S +EK  + S      ++
Sbjct: 359 DAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQ 418

Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
            K  KE         E  +   P S      SS+
Sbjct: 419 QKPTKE---------EGRKKPAPPSGTDEQLSSA 443


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 29.6 bits (67), Expect = 6.0
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAES 487
             DE++ +  L + + E  E D  + EDEDE  +++   +  + 
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316


>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein.  This entry carries part
           of the crucial 144 N-terminal residues of the FmiP
           protein, which is essential for the binding of the
           protein to the cytoplasmic domain of activated
           Fms-molecules in M-CSF induced haematopoietic
           differentiation of macrophages. The C-terminus contains
           a putative nuclear localisation sequence and a leucine
           zipper which suggest further, as yet unknown, nuclear
           functions. The level of FMIP expression might form a
           threshold that determines whether cells differentiate
           into macrophages or into granulocytes.
          Length = 352

 Score = 29.6 bits (67), Expect = 6.3
 Identities = 14/47 (29%), Positives = 24/47 (51%)

Query: 489 VAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKK 535
             ++ +S+    D D DSD  E  K++KK +S     + +K  K+ K
Sbjct: 205 FEQQKESSEQSQDDDSDSDEEEEQKRQKKRRSTDQEKLDDKRLKNSK 251


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.0 bits (67), Expect = 6.3
 Identities = 39/162 (24%), Positives = 63/162 (38%), Gaps = 10/162 (6%)

Query: 470 EDEDESTDEDFNPDQAESDVAEEYDSNPT---PTDSDEDSDASEGTKKKKKEKSRKTITI 526
           E++D   D +F  +  E+    +   N      + SD  S AS  +  K   K  ++ + 
Sbjct: 112 EEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSS 171

Query: 527 SEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTE--- 583
            E    S K   S  G     ++   APK  ++     +          +P     E   
Sbjct: 172 DESATDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEID 231

Query: 584 IAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL-KAYKES 624
           I K  G+    V+ +E+ E DE+  +A+ED  EAL K   E 
Sbjct: 232 ILKNDGDK---VAEEEEFELDEEHEEAEEDKKEALEKIGAEG 270


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 29.0 bits (65), Expect = 6.4
 Identities = 14/74 (18%), Positives = 28/74 (37%)

Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
           K +  KK    ++  +T  +  K K KE +    +   SD ++   + +K D +      
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107

Query: 694 TKKKKEKSESESDS 707
           T  +    +    S
Sbjct: 108 TAAEVNHEDVIDLS 121


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 29.7 bits (67), Expect = 7.0
 Identities = 30/173 (17%), Positives = 58/173 (33%), Gaps = 23/173 (13%)

Query: 603 YDEKVNKAKEDYNEA---LKAYKES----GGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
             E     + DY +    L AY++          +      +     +  ++A+ A    
Sbjct: 531 TLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSS 590

Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDS---DEDSKAKTKKKKEKSESESDSGESEK 712
           +  S     +  ++S +DDD   +    RDS   D D           S  E D  +S  
Sbjct: 591 QSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLD---------ALSPKEGDGKKSSA 641

Query: 713 EKKSKSKPKG----DKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSE 761
           ++K K+ P        SK  S  +       +D   +    +  +  +A+ + 
Sbjct: 642 DRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALAS 694


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 29.8 bits (67), Expect = 7.1
 Identities = 28/137 (20%), Positives = 55/137 (40%), Gaps = 19/137 (13%)

Query: 438 KEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDED-ESTDEDFNPDQAESDVAEEYDSN 496
           K++    +     D   A   R +   D++ D+DE+ +  + +  P+  +++  ++    
Sbjct: 416 KKNQKPQESSPSLDQTNA-ASRNSVMMDEDSDDDEEFQIPESEQEPETTKNET-KDTAME 473

Query: 497 PTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKK 556
             P D  + S  S   ++  ++K+R    IS+        K  S    K    DK+  K 
Sbjct: 474 EEPQDKIDLSCFSGNLRRDDRDKARDCWRISD----GNNFKVRS----KNFCYDKS--KI 523

Query: 557 PMSAYMM------WFND 567
           P   ++M      WF D
Sbjct: 524 PAGKHLMDLVAVDWFKD 540


>gnl|CDD|111764 pfam02912, Phe_tRNA-synt_N, Aminoacyl tRNA synthetase class II,
           N-terminal domain. 
          Length = 73

 Score = 27.1 bits (61), Expect = 7.2
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 580 SFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKES 624
             TE+ K  G+L    S +E+ +    +N+AKE   EAL+  K +
Sbjct: 22  PLTELLKGLGKL----SPEERPKVGALINEAKEAVEEALEERKAA 62


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 29.5 bits (65), Expect = 7.6
 Identities = 18/83 (21%), Positives = 38/83 (45%)

Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
              D DKK    + +   E+++ +  K   + E E    E EK+K  K   +   S+ K+
Sbjct: 120 FADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKA 179

Query: 730 KKEDEIDDEEYDEPVESTPPSSD 752
           ++E +  ++E  +  +    +S+
Sbjct: 180 EQEKQKTEQEKQKTEQEKQKTSN 202


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
           Provisional.
          Length = 418

 Score = 29.5 bits (66), Expect = 7.8
 Identities = 15/108 (13%), Positives = 31/108 (28%), Gaps = 8/108 (7%)

Query: 662 FIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE--SDSGESEKEKKSKSK 719
             E +              +R R    ++  +       +E++  +    S   K S S 
Sbjct: 298 GSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSKGSSSA 357

Query: 720 PKGDKSKSK------SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSE 761
              D   +       +      DD+    P++ T       S + +S 
Sbjct: 358 QNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSA 405


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 29.5 bits (66), Expect = 8.1
 Identities = 17/74 (22%), Positives = 31/74 (41%), Gaps = 3/74 (4%)

Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
           KP   +  GS +  K   + ++  K+ A+ ++        E T +D   D+   +  E+ 
Sbjct: 241 KPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDD---DEDAIETEEDD 297

Query: 494 DSNPTPTDSDEDSD 507
                  D D+DSD
Sbjct: 298 VDESAIEDDDDDSD 311


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 29.5 bits (66), Expect = 8.2
 Identities = 54/311 (17%), Positives = 90/311 (28%), Gaps = 32/311 (10%)

Query: 459 REAAERDDED----DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE-GTK 513
            ++ E DD      DED+D+  D      +   D+      +      DED D S    +
Sbjct: 272 EDSLEYDDPGLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAPE 331

Query: 514 KKKKEKSRKTITISEKPRKSKKDKESSGG--STKRKKKDKNAPKKPMSAYMMWFNDMREK 571
              K  SR   T+ +  R     K  S     T     +      P S            
Sbjct: 332 PLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSG----------- 380

Query: 572 IKKDNPGISFTEIAKKGGELWKTV----SSKEKAEYDEKVNKAKEDYNEA-LKAYKESGG 626
             K+ PGI  TE          +       K         +       +    A  +   
Sbjct: 381 --KEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDERS 438

Query: 627 GQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDS 686
              S DG T     P+  +  +S  V        +   S  +S   D  +   +     S
Sbjct: 439 IAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLDDAKRPEVTATPESS 498

Query: 687 DEDSKAKTKKKKEKSESESDSGESEKEKKSKSK-------PKGDKSKSKSKKEDEIDDEE 739
             DS+     +++++ S+++S  S KE + +         PK      ++          
Sbjct: 499 GSDSEGGASGREDETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASP 558

Query: 740 YDEPVESTPPS 750
                    PS
Sbjct: 559 STAGSPRPKPS 569


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 29.5 bits (66), Expect = 8.2
 Identities = 16/75 (21%), Positives = 27/75 (36%), Gaps = 4/75 (5%)

Query: 466 DEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTIT 525
               E   ES  E      +      E DS+    DS + S   +   K KK +  K  T
Sbjct: 106 LSPGESLPESLSETKQVTVS-KRKGSELDSDS---DSSDSSKGKKLVNKPKKRQKYKKAT 161

Query: 526 ISEKPRKSKKDKESS 540
           I      ++++  ++
Sbjct: 162 IQSPTSLTEEENYNT 176


>gnl|CDD|193106 pfam12628, Inhibitor_I71, Falstatin, cysteine peptidase inhibitor. 
           This family of peptidase inhibitors is expressed from
           plasmodial protozoal species. Falstatin is found to be a
           potent reversible inhibitor of the P. falciparum
           cysteine proteases falcipain-2 and falcipain-3, as well
           as other parasite- and non-parasite-derived cysteine
           proteases, but is only a relatively weak inhibitor of
           the P. falciparum cysteine proteases falcipain-1 and
           dipeptidyl aminopeptidase 1. Thus, P. falciparum
           requires expression of falstatin to limit proteolysis by
           certain host or parasite cysteine proteases during
           erythrocyte invasion.
          Length = 173

 Score = 28.7 bits (64), Expect = 8.5
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 533 SKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISF 581
           S K+  ++  S+ + + D N P KP    +   + +R  IK   PG  F
Sbjct: 100 SSKNVINNNQSSSQNETDSNKPPKPRMQLVGGPSMLRSVIKGHKPGKYF 148


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.0 bits (63), Expect = 8.9
 Identities = 18/64 (28%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
              ++  +++ + + K++KE+ E E   G  +KE++ K   + +K+K+K       DDE+
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA-RYADDED 71

Query: 740 YDEP 743
           YDE 
Sbjct: 72  YDEE 75


>gnl|CDD|236615 PRK09709, PRK09709, exonuclease VIII; Reviewed.
          Length = 877

 Score = 29.6 bits (66), Expect = 9.4
 Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 5/103 (4%)

Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
           RV  E  + D+E  +    + DE    +  E D A E+  +  P D+    ++ +   ++
Sbjct: 388 RVTEEGKQDDEEKPQPSGTTADEQATAETVEPD-ATEHHQDTQPLDAQSQVNSVDAKYQE 446

Query: 516 KK---EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPK 555
            +    ++RK I  S+ P  + K   +S G       D N PK
Sbjct: 447 LRAELHEARKNIP-SKNPVDADKLLAASRGEFVEGISDPNDPK 488


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 12/25 (48%), Positives = 13/25 (52%)

Query: 709 ESEKEKKSKSKPKGDKSKSKSKKED 733
           E  K+KK K K K  K K  SKK  
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAA 131


>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor.  This
           product is encoded by astrovirus ORF2, one of the three
           astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
           undergoes an intracellular cleavage to form a 79kD
           protein. Subsequently, extracellular trypsin cleavage
           yields the three proteins forming the infectious virion.
          Length = 787

 Score = 29.4 bits (66), Expect = 9.4
 Identities = 11/43 (25%), Positives = 18/43 (41%)

Query: 465 DDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD 507
           D  +D    E TD +    + E D  + +D + +     ED D
Sbjct: 664 DPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDD 706


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 29.4 bits (65), Expect = 9.6
 Identities = 44/155 (28%), Positives = 65/155 (41%), Gaps = 16/155 (10%)

Query: 590 ELWKTVSSKEKAEYDEKVNKAKE----DYNEALKAYKESGGGQDSDDGKTSKSKKPAKKK 645
           E  + VSS++  E  EK NK K+    D    LKA  E G   ++++   S+  + A   
Sbjct: 56  ETAREVSSRDIKEL-EKSNKVKDANKADLIAMLKAKAEKGPNINNNNNNYSEQSENAAIN 114

Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE--------DSKAKTKKK 697
           E AS A  P     +      G  SDS  + K   K    SD         D   KT KK
Sbjct: 115 EEASGADRPAIQVERR---HPGLPSDSAAEIKKRRKAIASSDSELESLTYPDKPTKTNKK 171

Query: 698 KEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKE 732
           K   ES +D+ ES+ +   +S  +      K+ ++
Sbjct: 172 KVAKESIADASESDLDSSMQSADESSPQPLKANQQ 206


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.128    0.357 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,314,376
Number of extensions: 3817270
Number of successful extensions: 10802
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7295
Number of HSP's successfully gapped: 968
Length of query: 764
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 660
Effective length of database: 6,324,786
Effective search space: 4174358760
Effective search space used: 4174358760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 63 (28.1 bits)