RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12216
         (646 letters)



>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
           other proteins. 
          Length = 312

 Score =  176 bits (448), Expect = 4e-50
 Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 32/268 (11%)

Query: 67  GMWWYKSIRYTGDKVLL-ETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIER 125
           G+W     R      +   TI  +     + P M  K +L IL  S EFD+         
Sbjct: 1   GIWPTDLGRIASYYYISYTTIRTF--NELLKPKMTTKDLLRILSMSSEFDQ--------- 49

Query: 126 PSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIV 185
                E   L+ ++        E P      +KA  L+ AHLS ++L S  LE+D +Y++
Sbjct: 50  IPVRHEEDLLLEELAEKLPIRLENPSLDDPHVKANLLLQAHLSRLKLPSFALESDTVYVL 109

Query: 186 KKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLP 245
           +    LLQ MV    +   L+ A             N + L  MI+Q +W   +PLLQLP
Sbjct: 110 QNAGRLLQAMVDIALERGWLSTA------------LNALNLSQMIIQALWPTDSPLLQLP 157

Query: 246 YVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHV 305
           ++ ++ LK    +K+ + SL+   ++++EER  ++  + D +   + KVL  +P +++ +
Sbjct: 158 HLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNIEI 214

Query: 306 QPEVIDDEATTEYTAGAIITVTCTLMRK 333
             E I          G  +T+T  L   
Sbjct: 215 SLEPITRT-----VLGVEVTLTVDLTWD 237



 Score = 50.3 bits (121), Expect = 1e-06
 Identities = 17/70 (24%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 373 LPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDL 432
           LP++ ++ LK    +K+ + SL+   ++++EER  ++  + D +   + KVL  +P +++
Sbjct: 156 LPHLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNI 212

Query: 433 HVQPEVIDDE 442
            +  E I   
Sbjct: 213 EISLEPITRT 222


>gnl|CDD|217270 pfam02889, Sec63, Sec63 Brl domain.  This domain (also known as the
           Brl domain) is required for assembly of functional
           endoplasmic reticulum translocons.
          Length = 309

 Score =  141 bits (358), Expect = 1e-37
 Identities = 61/255 (23%), Positives = 116/255 (45%), Gaps = 40/255 (15%)

Query: 63  PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 119
           PT +G     +Y         +L ET+  +     + P+  LK +L IL ++ EF+    
Sbjct: 1   PTELGRIASHYY---------ILYETMETF--NTSLKPNTTLKDLLEILSSASEFE---- 45

Query: 120 SEIIERPSDEIEVPQLIRQIPN-LGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 178
            EI  R  ++ E+ +L+ ++P  +   + + P       K   L+ A+LS ++L   +L 
Sbjct: 46  -EIPVREEEKKELKKLLERVPIPVKSPSIDDP-----HAKVNLLLQAYLSRLKLPDFSLV 99

Query: 179 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 238
           +D  YI++    LL+ +       I L+          +    N ++L  MI Q +W   
Sbjct: 100 SDLNYILQNAGRLLRALF-----EIALSKG-------WLSPALNALELSKMIEQRLWPSD 147

Query: 239 NPLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNM 298
           +PL QLP++  + LK    +K+ I SL+    +  EE   ++   N +Q  D+ + +   
Sbjct: 148 SPLRQLPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRF 204

Query: 299 PYVDLHVQPEVIDDE 313
           P +++  + + I  E
Sbjct: 205 PRLEIEAEVQPITRE 219



 Score = 49.9 bits (120), Expect = 2e-06
 Identities = 19/89 (21%), Positives = 39/89 (43%), Gaps = 5/89 (5%)

Query: 373 LPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDL 432
           LP++  + LK    +K+ I SL+    +  EE   ++   N +Q  D+ + +   P +++
Sbjct: 153 LPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRFPRLEI 209

Query: 433 HVQPEVIDDEATTEYTAGAIITPEVIDDE 461
             + + I  E          +T +   DE
Sbjct: 210 EAEVQPITRE--VSLRVEVTLTRDFEWDE 236


>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
           trafficking and secretion].
          Length = 610

 Score = 98.5 bits (245), Expect = 2e-21
 Identities = 70/329 (21%), Positives = 126/329 (38%), Gaps = 52/329 (15%)

Query: 2   KLTKAYQALTDEESRRNFEKYGNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVA 61
            +TKAY  LTD++ R N+  YG PD P   S GIALP  IV  E S++   +Y+L+  V 
Sbjct: 148 TITKAYGLLTDKKRRENYLNYGTPDSPQHTSEGIALPKVIVRSERSMYAFVMYSLLLGVF 207

Query: 62  LPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSE 121
           LP  V  WW +   YT   V   T+  +Y   +I   + L  +L +  +S E  R     
Sbjct: 208 LPYWVYRWWREIRDYTKVGVHFVTMEMFYE--RIDGSLTLDELLSLFASSKELTRMN--- 262

Query: 122 IIERPSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADR 181
                             P   +   E+            L+  HL N      ++E + 
Sbjct: 263 ------------------PKGPKCTLEK------------LLGDHL-NRA---RSVEFNE 288

Query: 182 MYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPL 241
             I      LL  ++      I   +A  +         + +++  P+      D    +
Sbjct: 289 YRIKSNVEGLLGALLR-----IASNFAFPLKECGKENKGQYIVQAIPL------DHLFRI 337

Query: 242 LQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYV 301
           LQLP    ++ +      R I+ ++    +  +   +V+K +   +  ++  V   +P  
Sbjct: 338 LQLPRSDVEYAQRV--SLRLIEGMKAIGSLIAKRYGNVLKNLVVLELMEIQAVADGIPRY 395

Query: 302 DLHVQPEVIDDEATTEYTAGAIITVTCTL 330
            L   P ++ ++      +   +++   L
Sbjct: 396 FLLQAPFLVFEDLFITEKSKERVSLKGYL 424


>gnl|CDD|214946 smart00973, Sec63, Sec63 Brl domain.  This domain was named after
           the yeast Sec63 (or NPL1) (also known as the Brl domain)
           protein in which it was found. This protein is required
           for assembly of functional endoplasmic reticulum
           translocons. Other yeast proteins containing this domain
           include pre-mRNA splicing helicase BRR2, HFM1 protein
           and putative helicases.
          Length = 314

 Score = 83.2 bits (206), Expect = 2e-17
 Identities = 59/256 (23%), Positives = 104/256 (40%), Gaps = 39/256 (15%)

Query: 63  PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 119
           PT +G    ++Y  I Y       ETI  +     + P   LK +L IL  + EF     
Sbjct: 1   PTELGRIASYYY--ISY-------ETIETFNQ--SLKPTTTLKDILEILSRASEFK---- 45

Query: 120 SEIIERPSDEIEVPQLIRQIP-NLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 178
            EI  R +++ E+ +L +++P  + E   + P       K   L+ AHLS + L    L 
Sbjct: 46  -EIPVRHNEKKELNELNKRVPIPVKEGIIDSP-----HAKVNLLLQAHLSRLPLPDFDLV 99

Query: 179 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 238
           +D  YI++  P +L+ +V        L  A             N + L  M+VQ +W+  
Sbjct: 100 SDLKYILQNAPRILRALVDIALSKGWLRTAL------------NALDLSQMVVQRLWEDS 147

Query: 239 N-PLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGN 297
           + PL QLP+   + +   +  K   +S +    M   E    +  +         ++L  
Sbjct: 148 DSPLKQLPHFLIEDVYDKLELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIY-ELLRR 206

Query: 298 MPYVDLHVQPEVIDDE 313
            P +++  +   I  +
Sbjct: 207 FPKIEVEAEVLPITRD 222


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 40.2 bits (94), Expect = 0.004
 Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 25/105 (23%)

Query: 542 KKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV 601
           K    K++H +   G+++  + +E+T+          +D  E  +D  SESE ESD  D 
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDD--------DEDECEAIED--SESESESDGEDG 682

Query: 602 ENDEVVDKKEKNEDSNNKSEE---------------SSDDDDDWE 631
           E DE  D  E NE      +                   D +D E
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727



 Score = 29.5 bits (66), Expect = 6.8
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 578 KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQ 634
            ++     +    E E+E DL + ++DE  D+ E  EDS ++SE   +D ++ E+  
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDE--DECEAIEDSESESESDGEDGEEDEQED 689


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 39.4 bits (92), Expect = 0.006
 Identities = 24/112 (21%), Positives = 40/112 (35%), Gaps = 18/112 (16%)

Query: 536 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK---------DTPEESK 586
             K  K K   K K  KK   ++      E+     D +K E+          +   +S+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 587 DV---------SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           D          SS+ EE  +   +  +E   + E +E  +  SE  S  D +
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254



 Score = 31.3 bits (71), Expect = 1.5
 Identities = 27/137 (19%), Positives = 50/137 (36%), Gaps = 20/137 (14%)

Query: 517 ITPVVNKEKQQTHRPVWMKNKKGGKKK--KFTKNKHDKKSSG---------QKKITNTVE 565
           IT    K   ++     + N K  K    +      +K  S           K + N +E
Sbjct: 71  ITSKPTKAWAESPEFRQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLE 130

Query: 566 QTEAPADVA-----KVEKKDTPEESKDVSS----ESEEESDLSDVENDEVVDKKEKNEDS 616
                 D       K +K    +++K        ES ++ D  + E+++    +E  ED 
Sbjct: 131 SIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDD 190

Query: 617 NNKSEESSDDDDDWEKY 633
           ++  EE   D +D+ +Y
Sbjct: 191 SDDEEEEDSDSEDYSQY 207



 Score = 29.8 bits (67), Expect = 5.3
 Identities = 23/109 (21%), Positives = 42/109 (38%), Gaps = 26/109 (23%)

Query: 540 GKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS 599
           G + K  K K  KK+   KK     ++ +  +D    E+ ++ +ESK    ES E+    
Sbjct: 141 GIETKAKKGKAKKKTKKSKK-----KEAKESSDKDDEEESESEDESKS--EESAEDDSDD 193

Query: 600 DVE-------------------NDEVVDKKEKNEDSNNKSEESSDDDDD 629
           + E                   ++E  ++      + + SE  SD+ D 
Sbjct: 194 EEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDS 242


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 39.0 bits (91), Expect = 0.007
 Identities = 26/134 (19%), Positives = 49/134 (36%), Gaps = 3/134 (2%)

Query: 498 EPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 557
           + +K+ +   +E           V     + + P          KKK TK K +KK + +
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKK-TKEKKEKKEASE 237

Query: 558 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESE--EESDLSDVENDEVVDKKEKNED 615
             +    E+     DV   ++   P    +   E E     + SD E +    +KEK + 
Sbjct: 238 STVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297

Query: 616 SNNKSEESSDDDDD 629
                E+  +D++ 
Sbjct: 298 LKKMMEDEDEDEEM 311


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 36.9 bits (86), Expect = 0.022
 Identities = 15/63 (23%), Positives = 36/63 (57%)

Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNK 639
           D+ +E + +  ES++E + SD E++E  D+  K    ++  E S +D+++  + +    +
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170

Query: 640 RDK 642
           ++K
Sbjct: 171 KEK 173



 Score = 29.2 bits (66), Expect = 5.8
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 1/82 (1%)

Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
           KK  G++++ + +   E      + E+K    E    S + ++E +  +VE DE  D + 
Sbjct: 57  KKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116

Query: 612 KNED-SNNKSEESSDDDDDWEK 632
           +  D  ++K  ESSD +D+ EK
Sbjct: 117 EWIDVESDKEIESSDSEDEEEK 138


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 37.5 bits (87), Expect = 0.029
 Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 23/132 (17%)

Query: 500 SKEGEGEIE-----EEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 554
           SK G   I+     ++++P G++  +                 K  K  K  K   +   
Sbjct: 10  SKRGLKVIQIGDDSDDDEPIGSLFKL-----------------KRPKNSKKVKVGLESTG 52

Query: 555 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 614
             ++K++   E +E   D     +K      K V S S   S  + +EN E VD    NE
Sbjct: 53  KREEKLSALDEDSEGMDDTLASFRKRLKGPKKGVGSVSARMSQGALLENQEEVDTV-LNE 111

Query: 615 DSNNKSEESSDD 626
            S    + + +D
Sbjct: 112 GSKRSLDGNLED 123


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 37.3 bits (86), Expect = 0.032
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 556  GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
            G + ITNT+ + +   ++A  E     +   D S   E ESD++ V  D VV + E N D
Sbjct: 3823 GAEDITNTLNEDDDLEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENE-NSD 3880

Query: 616  SNNKSEESSDDDDD 629
            S  ++++  ++ +D
Sbjct: 3881 SEEENQDLDEEVND 3894



 Score = 33.0 bits (75), Expect = 0.73
 Identities = 17/71 (23%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query: 565  EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
             Q        + E  D PE+ K +  +  + S  SD+E+ ++    E  E+++ + +E  
Sbjct: 3973 IQENNSQPPPENEDLDLPEDLK-LDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPM 4031

Query: 625  DDDDDWEKYQT 635
             D+D  E+  T
Sbjct: 4032 QDEDPLEENNT 4042



 Score = 32.7 bits (74), Expect = 0.80
 Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 5/134 (3%)

Query: 502  EGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKIT 561
            E     EE +  +  +  +        +  +W +  +    +   K+     ++ +  + 
Sbjct: 3877 ENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLV 3936

Query: 562  NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK---NEDSN- 617
            +  +  +A  D  + EK+D  E S DV  + E + D+ +  N +   + E     ED   
Sbjct: 3937 SKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQE-NNSQPPPENEDLDLPEDLKL 3995

Query: 618  NKSEESSDDDDDWE 631
            ++ E     D D E
Sbjct: 3996 DEKEGDVSKDSDLE 4009


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 36.8 bits (85), Expect = 0.034
 Identities = 23/134 (17%), Positives = 55/134 (41%), Gaps = 2/134 (1%)

Query: 498 EPSKEGEGEIEE--EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
           E  KE   E ++  +EKP         KE+ +  RP   K K+  KK +  +++ ++K  
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166

Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
            + +  +  ++        K ++    E+ +  + E+ +        N+E   +++  +D
Sbjct: 167 ERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKD 226

Query: 616 SNNKSEESSDDDDD 629
               +    +D+  
Sbjct: 227 RETTTSPMEEDESR 240



 Score = 29.9 bits (67), Expect = 4.8
 Identities = 17/99 (17%), Positives = 34/99 (34%), Gaps = 4/99 (4%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEES-KDVSSESE 593
           K K+  K++K  K +  K+    +K     ++   P +  K ++K   E   ++   + E
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167

Query: 594 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEK 632
                S  +         K ++     EE        E 
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKE---PPEEEKQRQAAREA 203


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 36.5 bits (85), Expect = 0.036
 Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 14/92 (15%)

Query: 31  MSFGIALPSYIVEKENSV----WVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVLLETI 86
                ALP    +   S      ++  ++L + +     +  +   S R+   +VLL  +
Sbjct: 16  GLLSPALPLLAEDLGLSASQAGLIVSAFSLGYALGSL--LAGYL--SDRFGRRRVLLLGL 71

Query: 87  ------NFYYAFFQITPHMALKRVLMILGASL 112
                 +   AF      + + R L+ LG   
Sbjct: 72  LLFALGSLLLAFASSLWLLLVGRFLLGLGGGA 103


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 33.8 bits (78), Expect = 0.054
 Identities = 18/63 (28%), Positives = 28/63 (44%)

Query: 567 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 626
            E P +VAK        E      E EE+ D   +++D+  D  +   D ++   +  DD
Sbjct: 34  EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDD 93

Query: 627 DDD 629
           DDD
Sbjct: 94  DDD 96


>gnl|CDD|178151 PLN02536, PLN02536, diaminopimelate epimerase.
          Length = 267

 Score = 35.5 bits (82), Expect = 0.065
 Identities = 22/62 (35%), Positives = 26/62 (41%)

Query: 321 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVLPYVTDDH 380
           G    VTC  M  P  V FG+  +KV  L   K G      E  P  T T  +  V+  H
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGPKFEHHEMFPARTNTEFVQVVSRSH 194

Query: 381 LK 382
           LK
Sbjct: 195 LK 196



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 19/53 (35%), Positives = 21/53 (39%)

Query: 469 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVV 521
           G    VTC  M  P  V FG+  +KV  L   K G      E  P  T T  V
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGPKFEHHEMFPARTNTEFV 187


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 35.4 bits (82), Expect = 0.092
 Identities = 31/131 (23%), Positives = 54/131 (41%), Gaps = 10/131 (7%)

Query: 501 KEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKI 560
           +E EGE E +E+ N         +K    R      + G K+ +  K K + K+  ++K 
Sbjct: 399 RELEGEEESDEEENE-----EPSKKNVGRRK--FGPENGEKEAESKKLKKENKNEFKEKK 451

Query: 561 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE---DSN 617
            +  E+     + AKVEK       +   ++ EEE +  D EN  +       +     +
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511

Query: 618 NKSEESSDDDD 628
           +K + SS  D 
Sbjct: 512 SKKKSSSKLDK 522



 Score = 32.7 bits (75), Expect = 0.69
 Identities = 23/132 (17%), Positives = 50/132 (37%), Gaps = 3/132 (2%)

Query: 501 KEGEGEIEEEEKPNGTITPVVN--KEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQK 558
           K G    E+E +         N  KEK+++     +++++  K +K       +    QK
Sbjct: 424 KFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQK 483

Query: 559 KITNTVEQTEAPADVAKVE-KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
           +        E P         K   ++     S S+ +   + +    V  KK+K ++ +
Sbjct: 484 EEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKS 543

Query: 618 NKSEESSDDDDD 629
              ++   D++D
Sbjct: 544 IDLDDDLIDEED 555



 Score = 30.8 bits (70), Expect = 2.5
 Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 575 KVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           K+E K   EE +D  S+SEEE +  D ++D+  +          K +E  DD+++
Sbjct: 317 KIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEEN 370



 Score = 29.3 bits (66), Expect = 7.2
 Identities = 31/161 (19%), Positives = 52/161 (32%), Gaps = 31/161 (19%)

Query: 489 DDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPV--WMKNKKGGKKKK-- 544
               KV      +  + + EEEE+      P +               K K   K  K  
Sbjct: 465 AKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAA 524

Query: 545 FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV--- 601
              +K   K   +KK   +++  +   D     K D  +E      E E++ +L  +   
Sbjct: 525 NKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE------EDEDDEELPFLFKQ 578

Query: 602 --------ENDEVVDK--KEKNEDSNNKSEESSDDDDDWEK 632
                     D+VV +  KEK E          +++D  E 
Sbjct: 579 KDLIKEAFAGDDVVAEFEKEKKEV--------IEEEDPKEI 611


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 35.3 bits (81), Expect = 0.099
 Identities = 17/65 (26%), Positives = 26/65 (40%)

Query: 565 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
           E   +P   AK E +   +  +    ++EEE  LS         K +KN    + S+   
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGD 360

Query: 625 DDDDD 629
           D DD 
Sbjct: 361 DSDDS 365



 Score = 31.1 bits (70), Expect = 2.0
 Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 9/109 (8%)

Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
            +   KKK  K   +KK     K      +     D  + +  D  +E ++    S+  +
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292

Query: 597 DLSDVENDEVV---DKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDK 642
             +D E  E     +   K E   ++  E S+++ + E  + GL+K+ K
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGK 339


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 34.3 bits (79), Expect = 0.10
 Identities = 17/99 (17%), Positives = 34/99 (34%), Gaps = 4/99 (4%)

Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 592
                    +    + +  KKS  Q+       + E        +K+D  +  K+     
Sbjct: 34  LFFPSSPSDQAAADEQE-AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92

Query: 593 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
           EE  +  +  +DE   + E+  +SN    E    +  W+
Sbjct: 93  EENEEEDEESSDENEKETEEKTESN---VEKEITNPSWK 128


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 35.3 bits (82), Expect = 0.11
 Identities = 21/71 (29%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 572 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
           D   +E++   E   D +    EE D  D E+DE  ++  K+ED  +  ++  DDDDD  
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDED--DDDDDDDDDIA 159

Query: 632 KYQTGLNKRDK 642
             +  L +R +
Sbjct: 160 TRERSLERRRR 170


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 35.0 bits (81), Expect = 0.13
 Identities = 16/95 (16%), Positives = 36/95 (37%), Gaps = 1/95 (1%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
            + K  +KK       D        + N  +  +   D   ++  D  ++  D   + E+
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI-DDDDDDEDDDED 159

Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           + D    + DE   + ++ E  ++  +   D+DD 
Sbjct: 160 DDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDS 194



 Score = 34.2 bits (79), Expect = 0.21
 Identities = 19/145 (13%), Positives = 45/145 (31%), Gaps = 8/145 (5%)

Query: 493 KVQFLEPSKEGEGEIEEEEKPNGTI---TPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
           +++    SK+   E  ++     +          E     +      K    K    K  
Sbjct: 37  EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96

Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS-----DVEND 604
            D+  S +K         +   +  K        +  D   + ++  D       D E+D
Sbjct: 97  KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156

Query: 605 EVVDKKEKNEDSNNKSEESSDDDDD 629
           +  D  +  +D + + +E+ + +  
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKL 181



 Score = 32.3 bits (74), Expect = 0.88
 Identities = 26/145 (17%), Positives = 55/145 (37%), Gaps = 5/145 (3%)

Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
              ++   E +K+   ++  + K  G IT    K +    +    +             K
Sbjct: 6   TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIK-EALESKKKTPEQIDQVLIFLSGMVK 64

Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
            D   + +  I     +T A A  AK   K   ++  D S ++E+++ L   ++   V  
Sbjct: 65  -DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123

Query: 610 KEKNEDSNNKSEESSDD---DDDWE 631
            +    +++  ++  DD   DDD +
Sbjct: 124 IDVLNQADDDDDDDDDDDLDDDDID 148


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 35.0 bits (81), Expect = 0.15
 Identities = 21/125 (16%), Positives = 38/125 (30%), Gaps = 3/125 (2%)

Query: 507  IEEEEKPNGTITPV--VNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTV 564
            + +E KP      V  V        +    ++  G K    TK K  KK           
Sbjct: 1265 LSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKK-VKKRLEGSLAALKK 1323

Query: 565  EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
            ++        K + K   +++    S            +    D  +   D +   ++  
Sbjct: 1324 KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDED 1383

Query: 625  DDDDD 629
            D+DDD
Sbjct: 1384 DEDDD 1388


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 35.0 bits (81), Expect = 0.15
 Identities = 11/53 (20%), Positives = 30/53 (56%)

Query: 577 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           E ++  E+  D   E +++ DL + E D  +  +E++E+  +  +E  +++++
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373



 Score = 33.4 bits (77), Expect = 0.41
 Identities = 18/66 (27%), Positives = 33/66 (50%)

Query: 577 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTG 636
            ++D  EE   V  E EE+ D    E +E VD  ++ ED  ++  +  DD+++ E+ +  
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378

Query: 637 LNKRDK 642
             K+  
Sbjct: 379 KKKKSA 384


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 34.0 bits (78), Expect = 0.21
 Identities = 24/128 (18%), Positives = 38/128 (29%), Gaps = 19/128 (14%)

Query: 510 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 569
            +       PVV++            +  G K  K        ++   K       +T A
Sbjct: 274 GKPERDRSNPVVSR----------ALSWVGLKGTKHD-FIKQIRNLDPKARREVDLETNA 322

Query: 570 PADVAKVEKKDTPE----ESKDVSSESEEESDLSD----VENDEVVDKKEKNEDSNNKSE 621
           P  V    + +          D  SE + E DL+      E  E    +E +  S     
Sbjct: 323 PERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRP 382

Query: 622 ESSDDDDD 629
            S  D +D
Sbjct: 383 RSRRDSED 390


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.9 bits (77), Expect = 0.31
 Identities = 18/88 (20%), Positives = 30/88 (34%), Gaps = 4/88 (4%)

Query: 547 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 606
           K         ++K        E      KV K+   ++  +   +   ES++SDVE  E 
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167

Query: 607 VDKKEKNEDSNNKSEESSDDD----DDW 630
           V   E   +     E+   +D      W
Sbjct: 168 VTSLENESEEELDLEKDDGEDISHTYGW 195



 Score = 32.3 bits (73), Expect = 0.94
 Identities = 24/117 (20%), Positives = 38/117 (32%), Gaps = 9/117 (7%)

Query: 510 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 569
           E        P  N            K K     K+ TK    + S   ++ +  V   +A
Sbjct: 53  ESALAVDEEPDENGAV--------SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDA 104

Query: 570 PADVAKVEKKD-TPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 625
             D    +    T  ++   SS+ EEE     V     V K +++ +      E SD
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 33.7 bits (77), Expect = 0.33
 Identities = 21/110 (19%), Positives = 44/110 (40%), Gaps = 5/110 (4%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
             KK   K      K  KK   ++   N   ++       +      PE+  D  S+++E
Sbjct: 34  PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDP-SKNKE 92

Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKYL 644
           E +    E  +   K + ++ +N  +++ +++  D+      L+K  K L
Sbjct: 93  EIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSD----LDKLKKEL 138


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 33.1 bits (76), Expect = 0.34
 Identities = 26/140 (18%), Positives = 41/140 (29%), Gaps = 11/140 (7%)

Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKK---FT 546
           D  K    +  KE + + E E+         +       H                   T
Sbjct: 27  DFFKKLLKKKPKEIDEQTEAEDNLLDKAVDDILSN----HYLKEGNGTTSVAIGDGMEKT 82

Query: 547 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 606
            +   K+S     ITN       P  +A  E        +D SSES       +      
Sbjct: 83  IHGDVKESDPMSFITNPSLTVSVP--IASGESDLENLNDEDTSSESSYGF--KEESKKGS 138

Query: 607 VDKKEKNEDSNNKSEESSDD 626
            +  +  E  ++ SE S+ D
Sbjct: 139 AETLKLEEKDSSSSEGSTVD 158


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 32.9 bits (75), Expect = 0.36
 Identities = 12/71 (16%), Positives = 24/71 (33%)

Query: 558 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
           KK   T  + +      K +K+            S+ E +  + +  +    K +N  + 
Sbjct: 52  KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111

Query: 618 NKSEESSDDDD 628
              E+  D  D
Sbjct: 112 VNHEDVIDLSD 122


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 32.5 bits (74), Expect = 0.37
 Identities = 21/80 (26%), Positives = 44/80 (55%), Gaps = 4/80 (5%)

Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
           KK   +KK  N +E  +  +     +K+D  +E++DV  E +E+ +  + E++E +++ E
Sbjct: 11  KKPEEEKKDENLLEHVKITS----WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66

Query: 612 KNEDSNNKSEESSDDDDDWE 631
             ED     E+  ++++D E
Sbjct: 67  DIEDEEEIVEDEEEEEEDEE 86



 Score = 31.4 bits (71), Expect = 0.70
 Identities = 21/101 (20%), Positives = 48/101 (47%), Gaps = 5/101 (4%)

Query: 539 GGKKKKFTKNKHDKKSSGQKKIT-----NTVEQTEAPADVAKVEKKDTPEESKDVSSESE 593
           G  +KK  + K D+      KIT     + +++ E   D  + + ++  EE ++   E E
Sbjct: 7   GAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66

Query: 594 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQ 634
           +  D  ++  DE  +++++ ++ + K  E  + +D +   Q
Sbjct: 67  DIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.8 bits (77), Expect = 0.39
 Identities = 34/195 (17%), Positives = 74/195 (37%), Gaps = 25/195 (12%)

Query: 435 QPEVIDDEATTEYTAGAIITPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKV 494
           + E  +DE   E         E ++DE   E      +                +   + 
Sbjct: 722 EAEGTEDEGEIETGEEG----EEVEDEGEGEAEGKHEVETEGDRKE-------TEHEGET 770

Query: 495 QFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 554
           +      E EGEI+  E  +G         + +       K +  G+ +   K++H+ +S
Sbjct: 771 EAEGKEDEDEGEIQAGE--DG---------EMKGDEGAEGKVEHEGETEAGEKDEHEGQS 819

Query: 555 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 614
             Q   T   ++T    ++    + +  ++ K V  +    SD  D E +E  +++E+ E
Sbjct: 820 ETQADDTEVKDETGE-QELNAENQGEAKQDEKGV--DGGGGSDGGDSEEEEEEEEEEEEE 876

Query: 615 DSNNKSEESSDDDDD 629
           +   + EE  +++++
Sbjct: 877 EEEEEEEEEEEEENE 891



 Score = 30.3 bits (68), Expect = 4.5
 Identities = 32/145 (22%), Positives = 59/145 (40%), Gaps = 11/145 (7%)

Query: 478 LMRKPMS-VLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKN 536
           L R+P++ V+   D  K    E    GE   EE E+P  T     N E+           
Sbjct: 616 LSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERP--TEAEGENGEESGGEA------ 667

Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
           ++ G+ +   +N+ + +   ++K     E  E  A  A   K +T  E  +   E+E E 
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGE-GEIEAKEADH-KGETEAEEVEHEGETEAEG 725

Query: 597 DLSDVENDEVVDKKEKNEDSNNKSE 621
              + E +   + +E  ++   ++E
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAE 750


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 33.5 bits (76), Expect = 0.40
 Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 4/106 (3%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
              K  KKK  + N+  KK + ++K   TV      +   K +   T      +SSESE+
Sbjct: 643 TFYKRDKKKDISTNRKVKKRTAKRK---TVGYKTDKSKKIKSDSLPTDTNVIVISSESED 699

Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSD-DDDDWEKYQTGLNK 639
           E D  ++     + KK K+E  +  S ES D   +D E + +  +K
Sbjct: 700 EEDGFNIIKKSQLKKKIKSELKSESSSESDDCTSEDNELHLSDYDK 745


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 33.2 bits (76), Expect = 0.50
 Identities = 18/130 (13%), Positives = 45/130 (34%), Gaps = 13/130 (10%)

Query: 518 TPVVNKEKQQTHRPVWMKNKKGGKK--KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAK 575
           +    K K Q    ++       K+    F++ ++   ++  KK     +  E+     K
Sbjct: 13  SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72

Query: 576 VEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQT 635
            +KK           + E++   S+ E        +K++ +  K  +   ++D    +  
Sbjct: 73  KKKK-----------KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121

Query: 636 GLNKRDKYLL 645
                +K  +
Sbjct: 122 IAELAEKSNV 131


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 32.5 bits (74), Expect = 0.57
 Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 4/63 (6%)

Query: 571 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE----VVDKKEKNEDSNNKSEESSDD 626
           AD    E +D P+E +D     EEES  SD  +++      + +    ++   S + + D
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278

Query: 627 DDD 629
            DD
Sbjct: 279 SDD 281



 Score = 30.9 bits (70), Expect = 1.8
 Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 8/85 (9%)

Query: 551 DKKSSGQKKITNTVEQTEAPADVAK--VEKKDTPEESKDVSSESEEE------SDLSDVE 602
           DK      ++   V+  +A A V +  +   D  EE  D    ++ E          D +
Sbjct: 176 DKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235

Query: 603 NDEVVDKKEKNEDSNNKSEESSDDD 627
           +D+  +++  + DS ++  ++S ++
Sbjct: 236 DDQGEEEESGSSDSLSEDSDASSEE 260


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.72
 Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 4/64 (6%)

Query: 574 AKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           AK +KK+  EE + V  E EE+          +  +  DK +K++  ++KSE+  + + +
Sbjct: 60  AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119

Query: 630 WEKY 633
            +  
Sbjct: 120 DKLE 123


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.2 bits (74), Expect = 0.98
 Identities = 19/72 (26%), Positives = 34/72 (47%)

Query: 541 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 600
             KK TK         +KK     ++ +  A   K ++++  EE +    E EEE + ++
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 601 VENDEVVDKKEK 612
            E +E  +KK+K
Sbjct: 464 EEKEEEEEKKKK 475



 Score = 29.1 bits (66), Expect = 7.2
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 4/80 (5%)

Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEE--SKDVSSESEEESDLSDVENDEVV 607
             KK++  KKI   VE+ E   +  K EKK        K+   E E+E    + E +E  
Sbjct: 404 GSKKAT--KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461

Query: 608 DKKEKNEDSNNKSEESSDDD 627
            ++EK E+   K ++++  D
Sbjct: 462 AEEEKEEEEEKKKKQATLFD 481


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 31.8 bits (72), Expect = 0.98
 Identities = 19/133 (14%), Positives = 44/133 (33%)

Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
              K +  E  KE    + E+   +        KE Q       +      +++     K
Sbjct: 95  RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154

Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
             +++     +  T  +        K  +++  +  + V      +++  DV  ++  ++
Sbjct: 155 VHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEE 214

Query: 610 KEKNEDSNNKSEE 622
               EDS    EE
Sbjct: 215 DNPVEDSKAIKEE 227


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.1 bits (73), Expect = 0.99
 Identities = 19/91 (20%), Positives = 35/91 (38%)

Query: 539 GGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDL 598
              +K+  +N  DK+ S     + +V   E  ++  +  + D    S+    E E E ++
Sbjct: 21  ESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEI 80

Query: 599 SDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
              +N   V K        + S +    DDD
Sbjct: 81  EVEQNQSDVLKSSSIVKEESISTDMDGIDDD 111


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 31.6 bits (72), Expect = 1.0
 Identities = 16/52 (30%), Positives = 27/52 (51%)

Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
            T EE       S  E  L ++E ++V ++ EK+E+   + EE  +D DD +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query: 568 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
           E+  + +  E+++  EE ++   ESEEE    + E +EV       E+    SE   D +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497

Query: 628 DDWE 631
           +  E
Sbjct: 498 EPEE 501



 Score = 31.8 bits (72), Expect = 1.4
 Identities = 15/95 (15%), Positives = 43/95 (45%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
             ++  ++K+  + +    S        +    E+P+  ++  +++   E ++   E EE
Sbjct: 397 DTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEE 456

Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           E +    E +   +++E+  +++N SEE  +   +
Sbjct: 457 EEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 31.7 bits (71), Expect = 1.1
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 574 AKVEKKDTPEESKDVSSESEEESDL----SDV-ENDEVVDKKEKNEDSNNKSEESSDDD 627
           A+ +   +P ESK + + S+   DL    +   EN   +  +E +ED ++ SEE  DDD
Sbjct: 34  AQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDD 92


>gnl|CDD|219928 pfam08602, Mgr1, Mgr1-like, i-AAA protease complex subunit.  The S.
           cerevisiae Mgr1 protein has been shown to be required
           for mitochondrial viability in yeast lacking
           mitochondrial DNA. It is a mitochondrial inner membrane
           protein, which interacts with Yme1 and is a new subunit
           of the i-AAA protease complex.
          Length = 362

 Score = 31.8 bits (72), Expect = 1.1
 Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 4/68 (5%)

Query: 53  LYALVFMVALPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASL 112
           L  L  +V L T +G++++K  R  G K+    I    A F      +     M +   +
Sbjct: 54  LPGLYTLVGLQTGIGLFFFKRFRTLGKKLRKRDI----ADFPSLNRFSTTHGDMHMAPVI 109

Query: 113 EFDRRFNS 120
            + RRF S
Sbjct: 110 PWQRRFVS 117


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 31.9 bits (72), Expect = 1.3
 Identities = 22/107 (20%), Positives = 41/107 (38%), Gaps = 7/107 (6%)

Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ------TEAPADVAKVEKKDTPEESK 586
           W+       K+K T N   KK +  +K+    E+       +   D    + +D  ++  
Sbjct: 15  WV-RGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73

Query: 587 DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKY 633
           D   + E+E D     +D+         D+     +S D+ DD  +Y
Sbjct: 74  DDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEY 120


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 31.9 bits (72), Expect = 1.3
 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 582 PEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS--EESSDDDDDWEK 632
            E S+  +S  +E  +  + E  +   +    ++S N S  EE  +D D+ E 
Sbjct: 936 EEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988



 Score = 30.8 bits (69), Expect = 2.8
 Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 11/58 (18%)

Query: 585 SKDVSSESEEESD----LSDVENDEVVDKKEKNEDSNNKSEESSDDD-------DDWE 631
           S D S ESEEE       SD E+DE  + +E +E S + SE+ S++D       +DW+
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984



 Score = 29.2 bits (65), Expect = 9.8
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED-SNNKSEESSDDDDDWEKYQ 634
           D  +ES++  SE E  SD    E DE  +  E +ED S ++SE  S D++D E + 
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984


>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 2  KLTKAYQALTDEESRRNFEKYGNPDGPG 29
          ++ +AY  L+D E R +++++G   G G
Sbjct: 46 QINEAYAVLSDAEKRAHYDRFGTAPGAG 73


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 17/74 (22%), Positives = 26/74 (35%), Gaps = 14/74 (18%)

Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
            Q K      +    A +  ++  D  + S D   E   ESDL D ++D           
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDV---------- 276

Query: 616 SNNKSEESSDDDDD 629
               S+E  +D  D
Sbjct: 277 ----SDEDGEDLFD 286


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 31.8 bits (72), Expect = 1.4
 Identities = 24/111 (21%), Positives = 42/111 (37%), Gaps = 4/111 (3%)

Query: 540 GKKKKFTKNKHDKKSS---GQKKITNTVEQTEAPADVAKVEKKDTPE-ESKDVSSESEEE 595
           G  +   K+ +   +S    QKK  + + Q ++  D    E     E +  D  + +  E
Sbjct: 53  GVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGE 112

Query: 596 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKYLLT 646
           S   D    E+ D+ E  +   N  + +  D DD    QT    +    L+
Sbjct: 113 SSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLS 163


>gnl|CDD|220361 pfam09721, Exosortase_EpsH, Transmembrane exosortase
          (Exosortase_EpsH).  Members of this family are
          designated exosortase, analogous to sortase in cell
          wall sorting mediated by LPXTG domains in Gram-positive
          bacteria. The phylogenetic distribution of the proteins
          in this entry is nearly perfectly correlated with the
          distribution of the proteins having the PEP-CTERM
          anchor motif, IPR013424. Members of this entry are
          integral membrane proteins with eight predicted
          transmembrane helices in common. Some members of this
          family have long trailing sequences past the region
          described by this model. This model does not include
          the region of the first predicted transmembrane region.
          The best characterized member is EpsH of
          Methylobacillus sp. 12S, where it is part of a locus
          associated with biosynthesis of the exopolysaccharide
          methanol-an.
          Length = 254

 Score = 31.1 bits (71), Expect = 1.5
 Identities = 12/55 (21%), Positives = 23/55 (41%), Gaps = 2/55 (3%)

Query: 16 RRNFEKYGNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVALPTAVGMWW 70
          R    K      PG +   +AL  Y++ + + V +L L +L  +  L   + +  
Sbjct: 39 RDELAKLPPRPSPGLLLLALALLLYLLGRLSYVLLLALLSLPLL--LAGLLLLLL 91


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 31.6 bits (71), Expect = 1.5
 Identities = 25/94 (26%), Positives = 35/94 (37%), Gaps = 9/94 (9%)

Query: 536 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE 595
            KK    K    NK    S          E  E     A V+ +   E+  D S     E
Sbjct: 663 TKKTADGKGKKSNKASFDSDD--------EMDENEIWSALVKSRPDVEDDSDDSELDFAE 714

Query: 596 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
            D SD  +D+   K +  +D + KSE S + D +
Sbjct: 715 DDFSDSTSDD-EPKLDAIDDEDAKSEGSQESDQE 747


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 570 PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           PA +        P  S    +E EE  D  DV +++  D+    ED N  +EE  D++ +
Sbjct: 15  PAVLLTFGNGKGPFASAQDLTEDEEAED--DVVDEDEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 31.1 bits (70), Expect = 1.8
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 562 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
           N V   +   D A   + D   E ++   ES+   +  D E        E N + N + E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSY-GEGNRELNEEEE 213

Query: 622 ESSDDDDDWE 631
           E ++  DD E
Sbjct: 214 EEAEGSDDGE 223


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 31.1 bits (70), Expect = 1.9
 Identities = 14/52 (26%), Positives = 21/52 (40%)

Query: 592 SEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKY 643
           S +  D   V      D  E +  ++   EE  DDDD+ +   T  N  D+ 
Sbjct: 258 SNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDED 309


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 2/78 (2%)

Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
            K   +K+ T T  +        K +K+   EE K   S+ ++  +  +   D   +  +
Sbjct: 48  PKQPKKKRPT-TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDP-DNNEESGD 105

Query: 612 KNEDSNNKSEESSDDDDD 629
             E ++  S    D++DD
Sbjct: 106 SQESASANSLSDIDNEDD 123


>gnl|CDD|183953 PRK13290, ectC, L-ectoine synthase; Reviewed.
          Length = 125

 Score = 29.4 bits (67), Expect = 2.1
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 5/33 (15%)

Query: 354 EGEGEIEEEEKPNGTITPVLP---YVTDDHLKH 383
           EGEGE+E+     G + P+ P   Y  D H +H
Sbjct: 63  EGEGEVEDLA--TGEVHPIRPGTMYALDKHDRH 93


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 29.4 bits (66), Expect = 2.1
 Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 5/65 (7%)

Query: 500 SKEGEGEIEEEEKPNGTI-TPVVNKE---KQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
            KEG    E++EK        V NK+    Q    P WM  ++  KK K  + K  KK  
Sbjct: 60  KKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQ-AKKLKKKREKKKKKKK 118

Query: 556 GQKKI 560
           G K +
Sbjct: 119 GAKNL 123


>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
           hydrolase.  Members of this family are
           2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
           HOPD hydrolase, the BphD protein of biphenyl
           degradation. BphD acts on the product of ring
           meta-cleavage by BphC. Many species carrying bphC and
           bphD are capable of degrading polychlorinated biphenyls
           as well as biphenyl itself.
          Length = 282

 Score = 30.7 bits (69), Expect = 2.2
 Identities = 21/83 (25%), Positives = 43/83 (51%), Gaps = 12/83 (14%)

Query: 230 IVQGMWDFKNPLLQLPYVTDDHLKHFI--CKKRYIKSLQQFAQMKNEERRSVVKFMNDEQ 287
           ++QG W     + + P    +HLK+F+   +K  + +    A++   + +++V +  D++
Sbjct: 183 LLQGRW---ENIQRQP----EHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDR 235

Query: 288 YSDM---LKVLGNMPYVDLHVQP 307
           +  +   LK+L NMP   LHV  
Sbjct: 236 FVPLDHGLKLLWNMPDAQLHVFS 258


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 31.2 bits (70), Expect = 2.3
 Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 4/84 (4%)

Query: 547 KNKHDKKSSGQ--KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 604
           + KH +K   Q   KI ++ E+  A  D  ++E  D    S  V+SE +EE D  +  ++
Sbjct: 5   QGKHQRKGKKQLENKILHSYEEESAGFDSEELEDNDEQGYSFGVNSEDDEEIDSDEAFDE 64

Query: 605 EVVDKKEKNEDSNNKSEESSDDDD 628
           E  D+K   + S N S+    + D
Sbjct: 65  E--DEKRFADWSFNASKSGKSNKD 86


>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
           Tom22.  The mitochondrial protein translocase (MPT)
           family, which brings nuclearly encoded preproteins into
           mitochondria, is very complex with 19 currently
           identified protein constituents.These proteins include
           several chaperone proteins, four proteins of the outer
           membrane translocase (Tom) import receptor, five
           proteins of the Tom channel complex, five proteins of
           the inner membrane translocase (Tim) and three "motor"
           proteins. This family is specific for the Tom22 proteins
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 145

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 559 KITNTVEQTEA-PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
           ++T   ++    P     VE ++  E+  DV SE   +SD   +E +   D+
Sbjct: 3   QLTEVEDEHFQQPEQQTVVEPEEDDEDFTDVDSEDSVDSDFESLEEETFTDR 54


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 30.9 bits (70), Expect = 2.5
 Identities = 24/138 (17%), Positives = 54/138 (39%), Gaps = 6/138 (4%)

Query: 496 FLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
             E  +E E EI+ ++    +I  +  KEK+  +    ++ K   K  +      D K+ 
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENE-NDIEGKSDTKNLEEGFETKDNKNK 413

Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV-DKKEKNE 614
               I  T           ++ +K T      ++ E+ +E +   + ND +  +  + +E
Sbjct: 414 NSSFINKTENILTNSPLKDELLEKTTEI----INIENPQEFEFGQIGNDIISTEIAQLDE 469

Query: 615 DSNNKSEESSDDDDDWEK 632
           + N       D ++++  
Sbjct: 470 NQNLIDTGEFDLENNFSN 487


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.5 bits (69), Expect = 2.5
 Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 36/140 (25%)

Query: 528 THRPVWM----KNKKGGKKKK-------------FTKNKHDKKSSG------QKKITNTV 564
            HRP +     K ++GG +                TK K+ +   G      ++ +   +
Sbjct: 4   AHRPTFDPARGKEEQGGNRLSGPSSIYSSRDLPAHTKLKYRQPGQGTEDELRKRDLRAEL 63

Query: 565 EQTEAPADVAKVEKK-----DTPEESKDVSSESEEESD--------LSDVENDEVVDKKE 611
           E+ E      K  K      D        +   E++ +        + +   +   D  +
Sbjct: 64  EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123

Query: 612 KNEDSNNKSEESSDDDDDWE 631
            + DS++  ++S DDD + E
Sbjct: 124 SSSDSDSSDDDSDDDDSEDE 143


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 30.6 bits (69), Expect = 2.6
 Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 5/71 (7%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSG----QKKITNTVEQTEAPADVAKVEKKD-TPEESKDVS 589
           K  K   K    + K   +S      + + ++T  + E+  +     K+  +   S   S
Sbjct: 10  KFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDS 69

Query: 590 SESEEESDLSD 600
           S SEEE D S+
Sbjct: 70  SSSEEEEDGSE 80


>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
          Length = 391

 Score = 30.6 bits (69), Expect = 2.7
 Identities = 14/35 (40%), Positives = 25/35 (71%), Gaps = 3/35 (8%)

Query: 2  KLTKAYQALTDEESRRNFEKYGNPDGP--GAMSFG 34
          ++++AY+ L+D + R ++++YG  DGP  GA  FG
Sbjct: 46 EVSEAYEVLSDAQKRESYDRYGK-DGPFAGAGGFG 79


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 30.8 bits (69), Expect = 2.8
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 568 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV----DKKEKNEDSNNKSEES 623
           E  +  A V K  +  ++K    E EE+   +D E+        D KE + +     EES
Sbjct: 373 EQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEES 432

Query: 624 SDDDDD 629
           + DD++
Sbjct: 433 AIDDNE 438


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 30.5 bits (69), Expect = 2.9
 Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 5/57 (8%)

Query: 571 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
           A     +K  T + +   S +S ++   S   + +  DK     D+NN S + ++D 
Sbjct: 25  AYADDPQKDSTAKTTSHDSKKSNDDET-SKDTSSKDTDK----ADNNNTSNQDNNDK 76



 Score = 29.4 bits (66), Expect = 6.6
 Identities = 5/47 (10%), Positives = 19/47 (40%)

Query: 583 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
            +    +  +  +S  S+ +        +  + ++N +  + D++D 
Sbjct: 30  PQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDK 76


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.5 bits (69), Expect = 2.9
 Identities = 15/59 (25%), Positives = 34/59 (57%)

Query: 573 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
           V K  +K+ P+E ++   E ++E +    + +E VD++E+ E+   K+++  +   +WE
Sbjct: 27  VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 30.7 bits (69), Expect = 2.9
 Identities = 22/98 (22%), Positives = 42/98 (42%), Gaps = 13/98 (13%)

Query: 534 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKD---VSS 590
           ++++ G   +++  + H  K   +K+  + V   E  A    +E      E  +   + S
Sbjct: 723 IEDESGEVFRRYRISPHGGKRKIKKRNDSVVSVDEEKA----IEGPSRVPERGNHDLLHS 778

Query: 591 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
           E E   D ++ EN       +  EDS++ + ES D  D
Sbjct: 779 EDEMADDEAESENM------DDYEDSDDNAYESKDHAD 810


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 30.6 bits (68), Expect = 3.0
 Identities = 28/123 (22%), Positives = 56/123 (45%), Gaps = 12/123 (9%)

Query: 473 TVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPV 532
           T T T+++K   V       K+  L  +K  E  I E +       P + K+ +   +P+
Sbjct: 454 TTTKTVLKKITPV---KIAPKLAELPATKPQETAIGENK------APFIEKQAETNKQPI 504

Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 592
            M +    +K+   ++K + K   ++K+   VE++E   D    ++    EE K ++  +
Sbjct: 505 DMPSLPVIQKEATERDKEEMKPQTEEKM---VEESEPANDANGKKRSAGIEEGKLIAKSA 561

Query: 593 EEE 595
           E+E
Sbjct: 562 EDE 564


>gnl|CDD|221403 pfam12064, DUF3544, Domain of unknown function (DUF3544).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 198 to 216 amino acids in length. This domain is
           found associated with pfam00628, pfam01753, pfam00439,
           pfam00855.
          Length = 207

 Score = 30.0 bits (67), Expect = 3.1
 Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 3/79 (3%)

Query: 546 TKNKHDKKSSGQKKITNTVEQTEAPADVAKV-EKKDTPEESKDVSSESEEESDLSDVEND 604
           T  K   +S  Q  +  T+E  +A   + ++ E   T  E  D  SE  E+SD SD E  
Sbjct: 131 TSPKRQIRSRFQLNLDKTIESCKAQLGINEISEDAYTGVEHSD--SEDSEKSDSSDSEYS 188

Query: 605 EVVDKKEKNEDSNNKSEES 623
              ++K KNE  +   +E+
Sbjct: 189 SDDEQKAKNEQEDASDKEA 207


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 30.4 bits (68), Expect = 3.2
 Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 7/70 (10%)

Query: 577 EKKDTPEESKDV---SSESEEESDLSDVEND----EVVDKKEKNEDSNNKSEESSDDDDD 629
           E+  + E+S++V   SSES +E+DL   EN     E V  + + ++ +N +  S D +D 
Sbjct: 317 EENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDS 376

Query: 630 WEKYQTGLNK 639
               +  L+ 
Sbjct: 377 ESSEEDSLDT 386


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.9 bits (69), Expect = 3.2
 Identities = 20/106 (18%), Positives = 46/106 (43%), Gaps = 6/106 (5%)

Query: 523  KEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTP 582
            K+ ++  +       +        +   +K  + +KK      + +A A   K E+K   
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKA 1393

Query: 583  EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
            +E+K  + E ++++D    E  +    K+K +++  K+EE    D+
Sbjct: 1394 DEAKKKAEEDKKKAD----ELKKAAAAKKKADEAKKKAEEKKKADE 1435



 Score = 29.3 bits (65), Expect = 8.1
 Identities = 32/143 (22%), Positives = 54/143 (37%), Gaps = 4/143 (2%)

Query: 490  DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
            D  K +  E  K  E    E E           K +    +    K K    KKK  + K
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391

Query: 550  H----DKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 605
                  KK+   KK  + +++  A    A   KK   E+ K   ++ + E      E  +
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451

Query: 606  VVDKKEKNEDSNNKSEESSDDDD 628
              ++ +K E++  K+EE+   D+
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADE 1474



 Score = 29.3 bits (65), Expect = 9.0
 Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 6/145 (4%)

Query: 498  EPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 557
            E  K+ E   ++E +       +   E++   +      K    KKK  + K  ++   +
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE-K 1688

Query: 558  KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
            K      ++ E      +++KK+  E+ K    +  EE +    E     + K++ E+  
Sbjct: 1689 KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE-----EAKKEAEEDK 1743

Query: 618  NKSEESSDDDDDWEKYQTGLNKRDK 642
             K+EE+  D+++ +K      + +K
Sbjct: 1744 KKAEEAKKDEEEKKKIAHLKKEEEK 1768


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 30.6 bits (69), Expect = 3.4
 Identities = 28/158 (17%), Positives = 51/158 (32%), Gaps = 25/158 (15%)

Query: 497  LEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 556
            L          ++++     ++     +K+   +P     K         K       SG
Sbjct: 1304 LASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSG 1363

Query: 557  QKKITNTVEQTEA-----PADVAKVEKKDTPEESKDV--------SSESEEESDLSDVEN 603
            QK +T  ++  EA        V K+      ++S  V         +ES E    S    
Sbjct: 1364 QKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSE 1423

Query: 604  DEVVD------------KKEKNEDSNNKSEESSDDDDD 629
             + +D            K+     S+++SE + D D D
Sbjct: 1424 KDEIDVSAKPRPQRANRKQTTYVLSDSESESADDSDFD 1461


>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
           C-terminal ATPase domain [Signal transduction
           mechanisms].
          Length = 456

 Score = 30.1 bits (68), Expect = 3.7
 Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 10/134 (7%)

Query: 88  FYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGEKNR 147
                   +    L  +L ++         + SE+ E  S+ I   ++  ++  L E+N 
Sbjct: 191 VILLLSSFSSRSILLPILGLILLLSRIALGYLSELKEI-SETIN--EMEERLEYLIEENY 247

Query: 148 ERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQLILLAY 207
                     + RAL      N      TLE  RM   +  P    ++V  +S+L+  + 
Sbjct: 248 SLEQEQLE-AELRAL--QSQINPHFLYNTLETIRMLAEEDDPEEAAKVVKALSKLLRYSL 304

Query: 208 AQRVPRLIHIETLE 221
           +     L +I TLE
Sbjct: 305 S----NLDNIVTLE 314


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.1 bits (68), Expect = 3.7
 Identities = 15/93 (16%), Positives = 26/93 (27%), Gaps = 4/93 (4%)

Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
           K     + F ++K  KK+          E         K ++  T  ES   S+ +    
Sbjct: 106 KSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRP 165

Query: 597 DLSDVENDEVV----DKKEKNEDSNNKSEESSD 625
               +   +       KK     S   S +   
Sbjct: 166 STPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198


>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin. 
          Length = 293

 Score = 30.1 bits (67), Expect = 3.8
 Identities = 24/101 (23%), Positives = 47/101 (46%), Gaps = 4/101 (3%)

Query: 539 GGKKKKFTKNKHDKKSSGQKKITNTVEQTE-APADVAKVEKKDTPEESK--DVSSESEEE 595
           G  ++K   NKH    +   K   + +Q   AP +    E+ D  ++      S+ES E 
Sbjct: 6   GSSEEKQLYNKHPDAVATWLKPDPSQKQNLLAPQNSVSSEETDDLKQETLPSKSNESHEH 65

Query: 596 -SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQT 635
             D  D ++D+ VD ++ N + ++ ++ + D D   E + +
Sbjct: 66  MDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHS 106


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 28.8 bits (64), Expect = 3.9
 Identities = 9/72 (12%), Positives = 27/72 (37%)

Query: 563 TVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEE 622
                 A    A+  ++    ++ + + E+ E     + + +E  D+          +E 
Sbjct: 59  EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEA 118

Query: 623 SSDDDDDWEKYQ 634
              D+++ E+ +
Sbjct: 119 EEADEEEDEEAE 130


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 29.3 bits (65), Expect = 4.3
 Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 10/112 (8%)

Query: 534 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQT--EAPADVAKVEKKDTPEESKDVSSE 591
           M N     K K  K +   + + ++K+   VE+   + P D+ + E +D PEE  +   E
Sbjct: 1   MGNYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVE-ELEDQPEEPPEQEEE 59

Query: 592 SEEESDLSDVENDEVVDK--KEKNEDSNNKS-----EESSDDDDDWEKYQTG 636
           +EE+    +++     +K   EKN D   +S      ES   D+  +K +TG
Sbjct: 60  NEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYDNASDKLETG 111


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 30.0 bits (67), Expect = 4.5
 Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 8/68 (11%)

Query: 569 APADVAKVEKKDTPEESKDVSS-------ESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
           APAD   VE +   E  +D  +       +S+E+        DE  D ++ ++ S + S 
Sbjct: 423 APAD-QDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYST 481

Query: 622 ESSDDDDD 629
            SS+D+ D
Sbjct: 482 TSSEDESD 489


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.0 bits (67), Expect = 4.7
 Identities = 11/67 (16%), Positives = 25/67 (37%), Gaps = 3/67 (4%)

Query: 573 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE---KNEDSNNKSEESSDDDDD 629
           +   +  +   E KD   +S         ++    +K E   +N + +   EE  D +D+
Sbjct: 336 LNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395

Query: 630 WEKYQTG 636
            +  +  
Sbjct: 396 NDHSKRI 402


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 30.1 bits (68), Expect = 4.7
 Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 2/60 (3%)

Query: 572 DVAKVEKKDTPEESKDVSSESEEESD--LSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
           D A  +  +  +E +D SS+S  E      D  + E+      N    + S  S  +DDD
Sbjct: 486 DDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDD 545



 Score = 29.3 bits (66), Expect = 8.6
 Identities = 19/122 (15%), Positives = 43/122 (35%), Gaps = 25/122 (20%)

Query: 537 KKGGKKKKFTKNKHDKKSSGQ-KKITNTVEQ-TEAPADVAKVEKKDTPEESKDVSSESEE 594
           ++ G KK    NK D       +++ +++E+     +     E  D  +E  D   +SE+
Sbjct: 393 ERQGDKKDLKSNKEDANEVDDLEEVVSSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSED 452

Query: 595 ESDLSDVEN-----------------------DEVVDKKEKNEDSNNKSEESSDDDDDWE 631
           +    D +                         +  D+ +  +D  ++   S    ++ E
Sbjct: 453 KDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELE 512

Query: 632 KY 633
           +Y
Sbjct: 513 EY 514


>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
           N-terminus.  Sec16 is a multi-domain vesicle coat
           protein. The overall function of Sec16 is in mediating
           the movement of protein-cargo between the organelles of
           the secretory pathway. Over-expression of truncated
           mutants of only the N-terminus are lethal, and this
           portion does not appear to be essential for function so
           may act as a stabilising region.
          Length = 246

 Score = 29.5 bits (66), Expect = 4.9
 Identities = 21/151 (13%), Positives = 49/151 (32%), Gaps = 13/151 (8%)

Query: 489 DDTIKVQFLEPSKEGEGEIEE---EEKPNGT---ITPVVNKEKQQTHRPVWMKNKKGGKK 542
           D+  + + +E S      I+    E++ +        +   E  +  +P     +K   +
Sbjct: 47  DNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSDFFNSLHEGEAVEEQQPPPHLTRKSTSQ 106

Query: 543 KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVE 602
              +   +    S            +            + + + +  S+SE  S+     
Sbjct: 107 VLDSLGLNPDSLSSPASAEPLDPTAQNEFSNVL---AASTDGNPESESQSEPSSEEELAA 163

Query: 603 NDEVVDKK-EKNEDSNNKSEESS---DDDDD 629
             E+ D + E     ++ +E      D+DDD
Sbjct: 164 RAELSDDESESTPTEDDLAERWQAFLDNDDD 194


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 29.7 bits (67), Expect = 5.3
 Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK-DTPEESKDV--- 588
           W+K  K  +KKK  + K       ++K     E T       KV  K +  EE +DV   
Sbjct: 94  WLKKSKKRQKKKEAERKKALLLDEKEK-ERAAEYTSEDLAGLKVGHKVEEFEEGEDVILT 152

Query: 589 -----SSESEEESDLSDVENDEVVDKK--EKNEDSNNKSEESSDDDDDWEKYQTGLNKRD 641
                  E E+E D   +EN E+V+K+  +KN +   K  +   DDDD    ++ L+K D
Sbjct: 153 LKDTGVLEDEDEGDE--LENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYD 210

Query: 642 K 642
           +
Sbjct: 211 E 211


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 29.9 bits (67), Expect = 5.5
 Identities = 38/183 (20%), Positives = 60/183 (32%), Gaps = 19/183 (10%)

Query: 456 EVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNG 515
           EV +D+A++     A          K    + GD  +K   +   K+   ++  E+  N 
Sbjct: 368 EVFEDQASSVKNKQAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNP 427

Query: 516 TITPVVNKEKQQTH--------------RPVWMKNKKGGKKKKFTKNKHDKKSSGQKKIT 561
                V+K K                   PV  + +K   K K TK K+ K    + K  
Sbjct: 428 VKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKKQ 487

Query: 562 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
                ++      KV K  T  ++                 N E  DK+     S  KS+
Sbjct: 488 LANNNSQNI-KSKKVVKAKTNNKANLQDVGECSSPPN----NKEKNDKQTSTSSSVLKSD 542

Query: 622 ESS 624
            SS
Sbjct: 543 RSS 545


>gnl|CDD|148137 pfam06339, Ectoine_synth, Ectoine synthase.  This family consists
           of several bacterial ectoine synthase proteins. The
           ectABC genes encode the diaminobutyric acid
           acetyltransferase (EctA), the diaminobutyric acid
           aminotransferase (EctB), and the ectoine synthase
           (EctC). Together these proteins constitute the ectoine
           biosynthetic pathway.
          Length = 127

 Score = 28.3 bits (64), Expect = 5.8
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 5/33 (15%)

Query: 354 EGEGEIEEEEKPNGTITPVLP---YVTDDHLKH 383
           EGEGEI  E+   G + P+ P   Y  D H +H
Sbjct: 63  EGEGEI--EDLATGEVHPIKPGTMYALDKHDRH 93


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 29.7 bits (67), Expect = 6.0
 Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 576 VEKKDTPEESKDVSSESEEESDLSDVEND---EVVDKKEKNEDSNNKSEESSD 625
           +E +D   E  D   E EEE +  D +     E  D +E  ED      E SD
Sbjct: 379 LEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
           K KK   KK   K K  KK +  K+   T E+ E     A+ + K+  E+  +   E EE
Sbjct: 13  KGKKIDVKK---KKKKKKKKNKSKEEVVT-EKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68

Query: 595 E 595
           +
Sbjct: 69  D 69


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 29.4 bits (65), Expect = 6.5
 Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 11/133 (8%)

Query: 499 PSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQK 558
                  E E ++K   TI  +  K +++       + KK   K     +  DKK S  K
Sbjct: 416 SHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKK--TKDDSDVDLKDKKESINK 473

Query: 559 KITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNN 618
           K                +E+    +E  ++    +E+ +  D  + + + K EK + +  
Sbjct: 474 KNKKGKHA---------IERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRK 524

Query: 619 KSEESSDDDDDWE 631
             +++S+ ++ + 
Sbjct: 525 LKKKASNLEEGFV 537


>gnl|CDD|221883 pfam12985, DUF3869, Domain of unknown function (DUF3869).  A family
           based on the N-terminal domain of 3KOG, which shows weak
           but consistent remote homology with adhesive families
           such as immunoglobulins and cadherins, suggesting it
           might form an attachment module.
          Length = 108

 Score = 27.8 bits (62), Expect = 6.9
 Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 6/52 (11%)

Query: 441 DEATTEYTAGAIITPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTI 492
           + A     A A I P V D  ATT    G I+T T T+     SV  G+ TI
Sbjct: 12  NVAVEPAPAKATIAPTVYD--ATT----GTIVTGTATVTISAGSVTTGNGTI 57


>gnl|CDD|221193 pfam11735, CAP59_mtransfer, Cryptococcal mannosyltransferase 1.
           The capsule of pathogenic fungi is a complex
           polysaccharide whose formation is determined by a number
           of enzymes including, most importantly,
           alpha-1,3-mannosyltransferase 1, EC:2.4.1.-.
          Length = 244

 Score = 28.8 bits (65), Expect = 7.5
 Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 7/56 (12%)

Query: 99  MALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGE-KNRE-RPLY 152
            AL   L  LG      RR  +       DE  +P  +R+IP L E +N    PLY
Sbjct: 54  RALDPELDALGV-----RRSITLSEITHLDEDSLPPRLRRIPYLAELRNLALEPLY 104


>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 29.0 bits (65), Expect = 7.6
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 3  LTKAYQALTDEESRRNFEKYGNPDGPGAMSFG 34
          + +AYQ L+D + +  ++++G  D  GA  FG
Sbjct: 50 INEAYQVLSDPQKKAQYDQFGTADFNGAGGFG 81


>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 129

 Score = 27.9 bits (62), Expect = 8.4
 Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 6/61 (9%)

Query: 574 AKVEKKDTPEESKDVSSESEEE------SDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
             V++ D   E  +V S  + +       DL D+ +DE VD  + ++D+  + EE  DDD
Sbjct: 56  EVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFLEDEEDDDDD 115

Query: 628 D 628
            
Sbjct: 116 V 116


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 28.8 bits (64), Expect = 8.8
 Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 4/82 (4%)

Query: 535 KNKKGGKKKKFTKNKHDKKSSGQK----KITNTVEQTEAPADVAKVEKKDTPEESKDVSS 590
           + KK   KK  T  +  K     K    K      + +    V    KK T +  K  S 
Sbjct: 14  EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESV 73

Query: 591 ESEEESDLSDVENDEVVDKKEK 612
           + E  +  +  +     +  E 
Sbjct: 74  KKESVAKKTVKKEAVSAEVFEA 95


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 28.9 bits (64), Expect = 9.1
 Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 560 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK----NED 615
           I N     E   ++  +      EE +    E EE+ D+ ++E  E V+  E+    +ED
Sbjct: 77  IANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDED 136

Query: 616 SNNKSEESSDDDDD 629
           S    E+ SD + D
Sbjct: 137 SEKDDEKESDAEGD 150


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 28.9 bits (63), Expect = 9.1
 Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 26/143 (18%)

Query: 500 SKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 559
           +K+ + + E E  P  + TPV          PV        + +K  KNK DKK S  +K
Sbjct: 259 NKKNKRKSESENVPAASETPV---------EPVVETTPPASENQK--KNKKDKKKSESEK 307

Query: 560 ITNTVEQTEAPAD---------------VAKVEKKDTPEESKDVSSESEEESDLSDVEND 604
           +     Q EAP                  AK E KD P E+     E   E+ + +V   
Sbjct: 308 VVEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETPAAPVEEVVENVVENVVEK 367

Query: 605 EVVDKKEKNEDSNNKSEESSDDD 627
                  +N+  N K ++ S+ +
Sbjct: 368 STTPPATENKKKNKKDKKKSESE 390


>gnl|CDD|147374 pfam05160, DSS1_SEM1, DSS1/SEM1 family.  This family contains the
           breast cancer tumour suppressor BRCA2-interacting
           protein DSS1 and its homologue SEM1, both of which are
           short acidic proteins. DSS1 has been shown to be a
           conserved component of the Rae1 mediated mRNA export
           pathway in Schizosaccharomyces pombe.
          Length = 63

 Score = 26.5 bits (59), Expect = 9.2
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 593 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
           EE+ +  D   ++  +++ + E+ NN  EE  DDDD
Sbjct: 12  EEDDEFEDFPVEDWPEEETEKENGNNLWEEDWDDDD 47


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.378 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,118,197
Number of extensions: 3311735
Number of successful extensions: 3913
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3441
Number of HSP's successfully gapped: 264
Length of query: 646
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 543
Effective length of database: 6,369,140
Effective search space: 3458443020
Effective search space used: 3458443020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.7 bits)