RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13406
         (1043 letters)



>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 48.2 bits (114), Expect = 2e-05
 Identities = 48/201 (23%), Positives = 76/201 (37%), Gaps = 18/201 (8%)

Query: 173 HEGIKPFQCSVCSESFTQRSNLNIHLRIHDGIR-PFQCNVCYICFTNKSDLNRHSRIHN- 230
               +  + S+ + S    S          G   P +   C I F+  S L RH R  N 
Sbjct: 255 SSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNH 314

Query: 231 ---GIKPFLC--SMCAKCFSRKDDLNRHMKIHEGSKRYLCTMCPKCFTRKDDLN------ 279
               +KPF C  S+C K FSR D L RH+ +H         +          LN      
Sbjct: 315 SGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQS 374

Query: 280 -RHMRNHDGLKPFHCSV--CFESFTQKALLNIHLRIHTNSKPHAC--SICTESFSQKSNL 334
            +  ++    K        C  +F + + L++H+  H + +P+ C    C++SF++  NL
Sbjct: 375 LQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNL 434

Query: 335 YIHLKLQHGVNPFKCDEKSFN 355
             H K+     P  C      
Sbjct: 435 IPHKKIHTNHAPLLCSILKSF 455



 Score = 47.0 bits (111), Expect = 4e-05
 Identities = 38/178 (21%), Positives = 59/178 (33%), Gaps = 17/178 (9%)

Query: 149 KVFLCSVCSKTFTDKVKFNRHMRAH----EGIKPFQC--SVCSESFTQRSNLNIHLRIHD 202
                  C+ +F+      RH+R+     E +KPF C  S+C + F++   L  H+ +H 
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347

Query: 203 GIRPFQC--NVCYICFTNKSDLNRHSRIH-----NGIKPFLCSMCAKCFSRKDDLNR-HM 254
            I P +         F+   +      +         K       +   + K D N    
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH 407

Query: 255 KIHEGSKRYLCTMCPKC---FTRKDDLNRHMRNHDGLKPFHCSVCFESFTQKALLNIH 309
            I   S R      P C   F R  +L  H + H    P  CS+         L N  
Sbjct: 408 IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKSFRRDLDLSNHG 465



 Score = 46.2 bits (109), Expect = 8e-05
 Identities = 57/342 (16%), Positives = 99/342 (28%), Gaps = 54/342 (15%)

Query: 57  FKCNICPKRYARKNRLTNHLRTHEAGSGDEKSEGGEGSFTCSQ--CPKTFVDKWHLNRHL 114
             C  C   ++R   LT H+R+H           GE    CS   C K+F     L+RHL
Sbjct: 34  DSCPNCTDSFSRLEHLTRHIRSHT----------GEKPSQCSYSGCDKSFSRPLELSRHL 83

Query: 115 KSHSENKVFRCEQCRFDFYVK-REYNRHMKIHDGVKVFLCSVCSKTFTDKVKFNRHMRAH 173
           ++H  N      +       K    +      +     L S  S   + +      + + 
Sbjct: 84  RTHHNNPSDLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSI 143

Query: 174 EGIKPFQCSVCSESFTQRSN----------------------LNIHLRIHDGIRPFQCNV 211
             ++       + S                            L I   +   I     N 
Sbjct: 144 SNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENS 203

Query: 212 CYICFTNKSDLNRHSRIHNGIKPFLCSMCAKCFSRKDDLNRHMKIHEGSKRYLCTMCPKC 271
                 +    +    + N       +  ++       L         S     +     
Sbjct: 204 PLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLS--PKSLLSQSPSSLSSSDSSSSASESP 261

Query: 272 FTRKDDLNRHMRNHDGL---------KPFHCSVCFESFTQKALLNIHLR--IHT--NSKP 318
            +     +    + +            P     C  SF++ + L  HLR   H+  + KP
Sbjct: 262 RSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKP 321

Query: 319 HAC--SICTESFSQKSNLYIHLKLQHGVNPFKCDEKSFNCSK 358
            +C  S+C + FS+   L  H+ L   ++P K   K  N S 
Sbjct: 322 FSCPYSLCGKLFSRNDALKRHILLHTSISPAKE--KLLNSSS 361



 Score = 44.3 bits (104), Expect = 3e-04
 Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 7/87 (8%)

Query: 844 NRFKQKSNLNIHMKIHEGIKA-HQCNVCMMKFTNKSDLNRHMRK--HDG--VKPFLC--S 896
              +  S         +G     +   C + F+  S L RH+R   H G  +KPF C  S
Sbjct: 268 ASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYS 327

Query: 897 ICAKNFSRKDDLNRHMRKHDGIKPFLC 923
           +C K FSR D L RH+  H  I P   
Sbjct: 328 LCGKLFSRNDALKRHILLHTSISPAKE 354



 Score = 43.9 bits (103), Expect = 3e-04
 Identities = 44/194 (22%), Positives = 66/194 (34%), Gaps = 18/194 (9%)

Query: 149 KVFLCSVCSKTFTDKVKFNRHMRAHEGI-KPFQCSVCSESFTQRSNLNIHLR--IHDG-- 203
           +    S+ + +            + +G   P +   C+ SF++ S L  HLR   H G  
Sbjct: 259 ESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGES 318

Query: 204 IRPFQCNV--CYICFTNKSDLNRHSRIHNGIKPFLCSMCAKCFSRKDDLN-------RHM 254
           ++PF C    C   F+    L RH  +H  I P    +          LN       +  
Sbjct: 319 LKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQY 378

Query: 255 KIHEGSKRYLCTMCPKCFTRKDDLNRHMRNH----DGLKPFHCSVCFESFTQKALLNIHL 310
           K  +  K+            K D N  +                 C +SF +   L  H 
Sbjct: 379 KDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHK 438

Query: 311 RIHTNSKPHACSIC 324
           +IHTN  P  CSI 
Sbjct: 439 KIHTNHAPLLCSIL 452



 Score = 43.1 bits (101), Expect = 6e-04
 Identities = 45/188 (23%), Positives = 63/188 (33%), Gaps = 18/188 (9%)

Query: 190 QRSNLNIHLRIHDGIRPFQCNVCYICFTNKSDLNRHSRIHNGI-KPFLCSMCAKCFSRKD 248
             S+L+            + ++      + S     S    G   P     C   FSR  
Sbjct: 244 SPSSLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSS 303

Query: 249 DLNRHM--KIH--EGSKRYLCTM--CPKCFTRKDDLNRHMRNHDGLKPFHCSVCFESFTQ 302
            L RH+    H  E  K + C    C K F+R D L RH+  H  + P    +   S   
Sbjct: 304 PLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKF 363

Query: 303 KALLNIHLR-------IHTNSKPHAC--SICTESFSQKSNLYIHLKLQHGVNPFKCDEKS 353
             LLN              N K      + C  +F + SNL +H  + H         K+
Sbjct: 364 SPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH-IITHLSFRPYNC-KN 421

Query: 354 FNCSKCLN 361
             CSK  N
Sbjct: 422 PPCSKSFN 429



 Score = 42.4 bits (99), Expect = 0.001
 Identities = 41/206 (19%), Positives = 79/206 (38%), Gaps = 19/206 (9%)

Query: 752 FMEKSHLNRHLRKHSGVVDNMEAEA----PYACSICDRKFTIKGQLSRHLRA-HENEKPE 806
               S      +  S    +  +E     P     C+  F+    L+RHLR+ + + +  
Sbjct: 260 SPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESL 319

Query: 807 KPQTC--EVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNI----HMKIHE 860
           KP +C   +C K F +    KRH+ +H  +   + ++  +  K    LN      ++ ++
Sbjct: 320 KPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYK 379

Query: 861 GIKAHQCNVCMMKFTNKSDLNRH-MRKH------DGVKPFLCSICAKNFSRKDDLNRHMR 913
            +K  + +  +     ++      +  H                C+K+F+R  +L  H +
Sbjct: 380 DLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKK 439

Query: 914 KHDGIKPFLCSMCAEAFSRKDHLKKH 939
            H    P LCS   ++F R   L  H
Sbjct: 440 IHTNHAPLLCS-ILKSFRRDLDLSNH 464



 Score = 41.2 bits (96), Expect = 0.003
 Identities = 32/210 (15%), Positives = 58/210 (27%), Gaps = 31/210 (14%)

Query: 177 KPFQCSVCSESFTQRSNLNIHLRIHDGIRPFQCNVCYICFTNKSD---LNRHSRIHNGIK 233
           +P  C  C++SF++  +L  H+R H G +P QC+    C  + S    L+RH R H+   
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYS-GCDKSFSRPLELSRHLRTHHNNP 90

Query: 234 PFLCSMCAKCFSRKDDLNRHMKIHEGSKRYLCTMCPKC---------FTRKDDLNRHMRN 284
             L S            +        S      +                      ++RN
Sbjct: 91  SDLNSKSLP--LSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRN 148

Query: 285 HD----GLKPFHCSVCFESFTQK------------ALLNIHLRIHTNSKPHACSICTESF 328
           +          +                         L I   + T+    + +    S 
Sbjct: 149 NPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSPLSSS 208

Query: 329 SQKSNLYIHLKLQHGVNPFKCDEKSFNCSK 358
               +      L++  +       S    K
Sbjct: 209 YSIPSSSSDQNLENSSSSLPLTTNSQLSPK 238



 Score = 39.7 bits (92), Expect = 0.008
 Identities = 41/177 (23%), Positives = 61/177 (34%), Gaps = 18/177 (10%)

Query: 94  SFTCSQCPKTFVDKWHLNRHLKSHS----ENKVFRC--EQCRFDFYVKREYNRHMKIHDG 147
                QC  +F     L RHL+S +      K F C    C   F       RH+ +H  
Sbjct: 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348

Query: 148 VKVFLC--SVCSKTFTDKVKFNRHMRAH-----EGIKPFQCSV--CSESFTQRSNLNIHL 198
           +          S  F+  +              +  K  +     C  +F + SNL++H+
Sbjct: 349 ISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHI 408

Query: 199 RIH--DGIRPFQCNVCYICFTNKSDLNRHSRIHNGIKPFLCSMCAKCFSRKDDLNRH 253
             H        +   C   F    +L  H +IH    P LCS+    F R  DL+ H
Sbjct: 409 ITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS-FRRDLDLSNH 464



 Score = 37.4 bits (86), Expect = 0.038
 Identities = 42/207 (20%), Positives = 63/207 (30%), Gaps = 41/207 (19%)

Query: 715 FPCSECGKVFTRKTTLKRHVR--IHTG--IKEFQC--WICSKCFMEKSHLNRHLRKHSGV 768
               +C   F+R + L RH+R   H+G  +K F C   +C K F     L RH+  H+ +
Sbjct: 290 IKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSI 349

Query: 769 VDNMEAEAPYACSICDRKFTIKGQLSRHLRAHEN--EKPEKPQTCEV--CQKTFDKMAKF 824
               E            KF+            +    K +K        C + F + +  
Sbjct: 350 SPAKEKL-----LNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNL 404

Query: 825 KRHLKVHDGVRDHQCQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMKFTNKSDLNRHM 884
             H+  H   R + C+                             C   F    +L  H 
Sbjct: 405 SLHIITHLSFRPYNCK--------------------------NPPCSKSFNRHYNLIPHK 438

Query: 885 RKHDGVKPFLCSICAKNFSRKDDLNRH 911
           + H    P LCSI        D  N  
Sbjct: 439 KIHTNHAPLLCSILKSFRRDLDLSNHG 465



 Score = 36.6 bits (84), Expect = 0.062
 Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 865 HQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCS--ICAKNFSRKDDLNRHMRKHDGIKPFL 922
             C  C   F+    L RH+R H G KP  CS   C K+FSR  +L+RH+R H      L
Sbjct: 34  DSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPSDL 93

Query: 923 CSMC 926
            S  
Sbjct: 94  NSKS 97



 Score = 36.2 bits (83), Expect = 0.096
 Identities = 26/134 (19%), Positives = 41/134 (30%), Gaps = 13/134 (9%)

Query: 839 CQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMK------FTNKSDLNRHMRKH-DGVK 891
                    +         +     +   +             + S              
Sbjct: 229 LTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSL 288

Query: 892 PFLCSICAKNFSRKDDLNRHMRK--HDG--IKPFLC--SMCAEAFSRKDHLKKHMRKCYG 945
           P     C  +FSR   L RH+R   H G  +KPF C  S+C + FSR D LK+H+     
Sbjct: 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348

Query: 946 PPTGEYRPRRRAKK 959
               + +    + K
Sbjct: 349 ISPAKEKLLNSSSK 362



 Score = 34.3 bits (78), Expect = 0.41
 Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 891 KPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCS--MCAEAFSRKDHLKKHMRK 942
           +P  C  C  +FSR + L RH+R H G KP  CS   C ++FSR   L +H+R 
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRT 85



 Score = 32.4 bits (73), Expect = 1.3
 Identities = 30/142 (21%), Positives = 53/142 (37%), Gaps = 11/142 (7%)

Query: 708 KGDTPQEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWI--CSKCFMEKSHLNRH--LR 763
           +   P   P S CGK+F+R   LKRH+ +HT I   +  +   S  F    +      L+
Sbjct: 317 ESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQ 376

Query: 764 KHSGVVDNMEAEAPYACSI--CDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKM 821
           ++  + ++ ++E      I    R   +   +  HL                C K+F++ 
Sbjct: 377 QYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLS-----FRPYNCKNPPCSKSFNRH 431

Query: 822 AKFKRHLKVHDGVRDHQCQVCG 843
                H K+H       C +  
Sbjct: 432 YNLIPHKKIHTNHAPLLCSILK 453



 Score = 30.0 bits (67), Expect = 7.8
 Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 710 DTPQEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCW--ICSKCFMEKSHLNRHLRKHSG 767
           + P+   C  C   F+R   L RH+R HTG K  QC    C K F     L+RHLR H  
Sbjct: 29  NAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88

Query: 768 VVDNMEA 774
              ++ +
Sbjct: 89  NPSDLNS 95


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 37.0 bits (86), Expect = 0.001
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 277 DLNRHMRNHDGLKPFHCSVCFESFTQ 302
           +L RHMR H G KP+ C VC +SF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 35.1 bits (81), Expect = 0.005
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 249 DLNRHMKIHEGSKRYLCTMCPKCFTR 274
           +L RHM+ H G K Y C +C K F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 35.1 bits (81), Expect = 0.006
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 907 DLNRHMRKHDGIKPFLCSMCAEAFSR 932
           +L RHMR H G KP+ C +C ++FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 34.7 bits (80), Expect = 0.007
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 879 DLNRHMRKHDGVKPFLCSICAKNFSR 904
           +L RHMR H G KP+ C +C K+FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 34.7 bits (80), Expect = 0.007
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 730 LKRHVRIHTGIKEFQCWICSKCF 752
           L+RH+R HTG K ++C +C K F
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSF 24



 Score = 33.9 bits (78), Expect = 0.013
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 306 LNIHLRIHTNSKPHACSICTESFSQ 330
           L  H+R HT  KP+ C +C +SFS 
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 33.5 bits (77), Expect = 0.017
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 166 FNRHMRAHEGIKPFQCSVCSESFTQ 190
             RHMR H G KP++C VC +SF+ 
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 33.1 bits (76), Expect = 0.029
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 137 EYNRHMKIHDGVKVFLCSVCSKTFT 161
              RHM+ H G K + C VC K+F+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 32.7 bits (75), Expect = 0.040
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 221 DLNRHSRIHNGIKPFLCSMCAKCFSR 246
           +L RH R H G KP+ C +C K FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 31.6 bits (72), Expect = 0.079
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 824 FKRHLKVHDGVRDHQCQVCGNRFKQ 848
            +RH++ H G + ++C VCG  F  
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 31.6 bits (72), Expect = 0.087
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 851 NLNIHMKIHEGIKAHQCNVCMMKFTN 876
           NL  HM+ H G K ++C VC   F++
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 31.6 bits (72), Expect = 0.10
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 109 HLNRHLKSHSENKVFRCEQCRFDF 132
           +L RH+++H+  K ++C  C   F
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSF 24



 Score = 30.4 bits (69), Expect = 0.22
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 193 NLNIHLRIHDGIRPFQCNVCYICFTN 218
           NL  H+R H G +P++C VC   F++
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 28.9 bits (65), Expect = 0.84
 Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 7/32 (21%)

Query: 757 HLNRHLRKHSGVVDNMEAEAPYACSICDRKFT 788
           +L RH+R H+G       E PY C +C + F+
Sbjct: 1   NLRRHMRTHTG-------EKPYKCPVCGKSFS 25


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 33.9 bits (78), Expect = 0.015
 Identities = 13/21 (61%), Positives = 17/21 (80%)

Query: 717 CSECGKVFTRKTTLKRHVRIH 737
           C +CGK F+RK+ LKRH+R H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 33.5 bits (77), Expect = 0.019
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 265 CTMCPKCFTRKDDLNRHMRNH 285
           C  C K F+RK +L RH+R H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 32.7 bits (75), Expect = 0.035
 Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 895 CSICAKNFSRKDDLNRHMRKH 915
           C  C K+FSRK +L RH+R H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 32.3 bits (74), Expect = 0.053
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 838 QCQVCGNRFKQKSNLNIHMKIH 859
           +C  CG  F +KSNL  H++ H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 31.9 bits (73), Expect = 0.060
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 237 CSMCAKCFSRKDDLNRHMKIH 257
           C  C K FSRK +L RH++ H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 31.2 bits (71), Expect = 0.10
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 180 QCSVCSESFTQRSNLNIHLRIH 201
           +C  C +SF+++SNL  HLR H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 31.2 bits (71), Expect = 0.13
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 744 QCWICSKCFMEKSHLNRHLRKH 765
           +C  C K F  KS+L RHLR H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 30.8 bits (70), Expect = 0.18
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 96  TCSQCPKTFVDKWHLNRHLKSH 117
            C  C K+F  K +L RHL++H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 30.0 bits (68), Expect = 0.27
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 923 CSMCAEAFSRKDHLKKHMRKC 943
           C  C ++FSRK +LK+H+R  
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 30.0 bits (68), Expect = 0.31
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 153 CSVCSKTFTDKVKFNRHMRAH 173
           C  C K+F+ K    RH+R H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 30.0 bits (68), Expect = 0.32
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 58 KCNICPKRYARKNRLTNHLRTH 79
          KC  C K ++RK+ L  HLRTH
Sbjct: 1  KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.6 bits (67), Expect = 0.43
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 208 QCNVCYICFTNKSDLNRHSRIH 229
           +C  C   F+ KS+L RH R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 810 TCEVCQKTFDKMAKFKRHLKVH 831
            C  C K+F + +  KRHL+ H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.1 bits (63), Expect = 1.5
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 866 QCNVCMMKFTNKSDLNRHMRKH 887
           +C  C   F+ KS+L RH+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.1 bits (63), Expect = 1.7
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 293 CSVCFESFTQKALLNIHLRIH 313
           C  C +SF++K+ L  HLR H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 27.3 bits (61), Expect = 2.4
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 780 CSICDRKFTIKGQLSRHLRAH 800
           C  C + F+ K  L RHLR H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 27.3 bits (61), Expect = 3.1
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 490 KCALCEKTFVRKQTLDRHVTVH 511
           KC  C K+F RK  L RH+  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 26.5 bits (59), Expect = 5.7
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 321 CSICTESFSQKSNLYIHLK 339
           C  C +SFS+KSNL  HL+
Sbjct: 2   CPDCGKSFSRKSNLKRHLR 20


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 33.6 bits (77), Expect = 0.017
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 865 HQCNVCMMKFTNKSDLNRHMRKH 887
           ++C  C   F +KS L  HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 32.8 bits (75), Expect = 0.036
 Identities = 11/23 (47%), Positives = 11/23 (47%)

Query: 263 YLCTMCPKCFTRKDDLNRHMRNH 285
           Y C  C K F  K  L  HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 32.8 bits (75), Expect = 0.036
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 715 FPCSECGKVFTRKTTLKRHVRIH 737
           + C ECGKVF  K+ L+ H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 32.4 bits (74), Expect = 0.039
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 837 HQCQVCGNRFKQKSNLNIHMKIH 859
           ++C  CG  FK KS L  HM+ H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 31.7 bits (72), Expect = 0.070
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 95  FTCSQCPKTFVDKWHLNRHLKSH 117
           + C +C K F  K  L  H+++H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 31.7 bits (72), Expect = 0.088
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 179 FQCSVCSESFTQRSNLNIHLRIH 201
           ++C  C + F  +S L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 31.3 bits (71), Expect = 0.097
 Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 235 FLCSMCAKCFSRKDDLNRHMKIH 257
           + C  C K F  K  L  HM+ H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 30.9 bits (70), Expect = 0.15
 Identities = 10/23 (43%), Positives = 11/23 (47%)

Query: 893 FLCSICAKNFSRKDDLNRHMRKH 915
           + C  C K F  K  L  HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 30.9 bits (70), Expect = 0.16
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 743 FQCWICSKCFMEKSHLNRHLRKH 765
           ++C  C K F  KS L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 30.5 bits (69), Expect = 0.18
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 151 FLCSVCSKTFTDKVKFNRHMRAH 173
           + C  C K F  K     HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 30.1 bits (68), Expect = 0.27
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 57 FKCNICPKRYARKNRLTNHLRTH 79
          ++C  C K +  K+ L  H+RTH
Sbjct: 1  YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.4 bits (66), Expect = 0.52
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 207 FQCNVCYICFTNKSDLNRHSRIH 229
           ++C  C   F +KS L  H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.4 bits (66), Expect = 0.53
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 123 FRCEQCRFDFYVKREYNRHMKIH 145
           +RC +C   F  K     HM+ H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 28.6 bits (64), Expect = 1.0
 Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 778 YACSICDRKFTIKGQLSRHLRAH 800
           Y C  C + F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 28.6 bits (64), Expect = 1.1
 Identities = 6/22 (27%), Positives = 10/22 (45%)

Query: 810 TCEVCQKTFDKMAKFKRHLKVH 831
            C  C K F   +  + H++ H
Sbjct: 2   RCPECGKVFKSKSALREHMRTH 23



 Score = 28.2 bits (63), Expect = 1.4
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 291 FHCSVCFESFTQKALLNIHLRIH 313
           + C  C + F  K+ L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 1.8
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 489 FKCALCEKTFVRKQTLDRHVTVH 511
           ++C  C K F  K  L  H+  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 6.3
 Identities = 8/22 (36%), Positives = 12/22 (54%)

Query: 921 FLCSMCAEAFSRKDHLKKHMRK 942
           + C  C + F  K  L++HMR 
Sbjct: 1   YRCPECGKVFKSKSALREHMRT 22


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
            overall function of the full-length Med25 is efficiently
            to coordinate the transcriptional activation of RAR/RXR
            (retinoic acid receptor/retinoic X receptor) in higher
            eukaryotic cells. Human Med25 consists of several domains
            with different binding properties, the N-terminal, VWA,
            domain, this SD1 - synapsin 1 - domain from residues
            229-381, a PTOV(B) or ACID domain from 395-545, an SD2
            domain from residues 564-645 and a C-terminal NR
            box-containing domain (646-650) from 646-747. This The
            function of the SD domains is unclear.
          Length = 168

 Score = 36.4 bits (83), Expect = 0.032
 Identities = 24/112 (21%), Positives = 34/112 (30%), Gaps = 18/112 (16%)

Query: 946  PPTGEYRPRRRAKKLPATDVPPTVVGSQLNTMISQDLTVREPPQYRSVQNSIVS---EHA 1002
            P  G   P     K P +  P  V+  Q        L    PPQ +  QN  +    + A
Sbjct: 2    PVGGGSVPGPLQSKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQMQVPQNVSLHAAHDAA 61

Query: 1003 HQVLQASVGQAAASG-----LFWCAAAQYYTT----------PPTIEFAPKP 1039
             + ++A+  Q          +     A               PP    APKP
Sbjct: 62   QKAVEAAKNQKQGLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKP 113


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 31.1 bits (70), Expect = 0.13
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 263 YLCTMCPKCFTRKDDLNRHMRNH 285
           + C +C K F+ KD L RH+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 31.1 bits (70), Expect = 0.14
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 893 FLCSICAKNFSRKDDLNRHMRKH 915
           F C +C K+FS KD L RH+RKH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 30.7 bits (69), Expect = 0.16
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 95  FTCSQCPKTFVDKWHLNRHLKSH 117
           F C  C K+F  K  L RHL+ H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 30.3 bits (68), Expect = 0.26
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 715 FPCSECGKVFTRKTTLKRHVRIH 737
           F C  CGK F+ K  LKRH+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 29.9 bits (67), Expect = 0.31
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 865 HQCNVCMMKFTNKSDLNRHMRKH 887
            +C +C   F++K  L RH+RKH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 29.5 bits (66), Expect = 0.45
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 743 FQCWICSKCFMEKSHLNRHLRKH 765
           F+C +C K F  K  L RHLRKH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 29.5 bits (66), Expect = 0.49
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 151 FLCSVCSKTFTDKVKFNRHMRAH 173
           F C +C K+F+ K    RH+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 29.1 bits (65), Expect = 0.61
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 235 FLCSMCAKCFSRKDDLNRHMKIH 257
           F C +C K FS KD L RH++ H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.4 bits (63), Expect = 1.1
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 57 FKCNICPKRYARKNRLTNHLRTH 79
          FKC +C K ++ K+ L  HLR H
Sbjct: 1  FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.0 bits (62), Expect = 1.6
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 778 YACSICDRKFTIKGQLSRHLRAH 800
           + C +C + F+ K  L RHLR H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.0 bits (62), Expect = 1.8
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 921 FLCSMCAEAFSRKDHLKKHMRKC 943
           F C +C ++FS KD LK+H+RK 
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.8 bits (59), Expect = 4.8
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 179 FQCSVCSESFTQRSNLNIHLRIH 201
           F+C +C +SF+ +  L  HLR H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.4 bits (58), Expect = 5.2
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 837 HQCQVCGNRFKQKSNLNIHMKIH 859
            +C +CG  F  K  L  H++ H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.4 bits (58), Expect = 5.5
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 489 FKCALCEKTFVRKQTLDRHVTVH 511
           FKC LC K+F  K  L RH+  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.4 bits (58), Expect = 6.5
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 419 YTCEMCEKTFKYKLPLVKHCKKQH 442
           + C +C K+F  K  L +H +K H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 26.1 bits (57), Expect = 8.4
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 207 FQCNVCYICFTNKSDLNRHSRIH 229
           F+C +C   F++K  L RH R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score = 35.1 bits (81), Expect = 0.22
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 916 DGIKPFLCSMCAEAFSRKDHLKKHMRKC--YGPPTGE-YRPRRRAKKL 960
           D  K + C  C +   RK+ L++HM+KC    PP  E YR   R + L
Sbjct: 194 DCSKLYFCEFCLKFMKRKEQLQRHMKKCDLKHPPGDEIYRHPTRQEGL 241



 Score = 31.6 bits (72), Expect = 2.6
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 888 DGVKPFLCSICAKNFSRKDDLNRHMRKHD 916
           D  K + C  C K   RK+ L RHM+K D
Sbjct: 194 DCSKLYFCEFCLKFMKRKEQLQRHMKKCD 222


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 33.1 bits (76), Expect = 0.26
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 27  EESASEDSDDEEDEDFENDENSSGKKMVK 55
           EE   E+ D+E+D++ E++E  S  K VK
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 32.5 bits (74), Expect = 0.32
 Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 15/68 (22%)

Query: 773 EAEAPYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKMAKFKRHLKVHD 832
           +A +PY C +C   F+    L +H+R  E+ K      C VC K F             D
Sbjct: 69  KAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSK-----VCPVCGKEFRNT----------D 113

Query: 833 GVRDHQCQ 840
              DH C+
Sbjct: 114 STLDHVCK 121



 Score = 32.5 bits (74), Expect = 0.32
 Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 2/51 (3%)

Query: 232 IKPFLCSMCAKCFSRKDDLNRHMKIHEGSKRYLCTMCPKCFTRKDDLNRHM 282
           + P++C +C   FS    L +H++  E SK  +C +C K F   D    H+
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119



 Score = 31.8 bits (72), Expect = 0.58
 Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 862 IKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHM-RKHD 916
           +  + C +C+M F++   L +H+R  +  K  +C +C K F   D    H+ +KH+
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHVCKKHN 124



 Score = 31.4 bits (71), Expect = 0.84
 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 890 VKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHLKKHMRK 942
           V P++C +C   FS    L +H+R  +  K  +C +C + F   D    H+ K
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHVCK 121



 Score = 29.1 bits (65), Expect = 5.4
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 2/55 (3%)

Query: 290 PFHCSVCFESFTQKALLNIHLRIHTNSKPHACSICTESFSQKSNLYIHLKLQHGV 344
           P+ C +C   F+    L  H+R   +SK   C +C + F    +   H+  +H +
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHVCKKHNI 125


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.9 bits (77), Expect = 0.45
 Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 13/93 (13%)

Query: 610 ENDDDDVPLSKRTSYYASSA----DEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTT 665
             D +     +RT   A++A    +EE+   K R R+K     +K     +   S+ + +
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRK----VKKMDEDVEDQGSESEVS 160

Query: 666 DYESAMDTDYVDDNEASESRNEDSGVEGKENED 698
           D E +     +++        E+  +E  + ED
Sbjct: 161 DVEESEFVTSLENES-----EEELDLEKDDGED 188



 Score = 31.2 bits (70), Expect = 3.7
 Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 8/109 (7%)

Query: 619 SKRTSYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDD 678
           S++    A + DEE        +KK   + +++T       S+      E  ++ D   D
Sbjct: 48  SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107

Query: 679 NEASESRNE--------DSGVEGKENEDGRRKRGKRKKGDTPQEFPCSE 719
            E+ ++            S VE ++ E   RKR K KK D   E   SE
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 34.0 bits (78), Expect = 0.46
 Identities = 25/138 (18%), Positives = 50/138 (36%), Gaps = 13/138 (9%)

Query: 570 PKQEPDSIVKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSA 629
            K+  +S VKEE   +    D          +DE  +    ++D+D    K +   + S 
Sbjct: 232 KKEASESTVKEESEEESGKRD-------VILEDESAEPTGLDEDEDEDEPKPSGERSDSE 284

Query: 630 DEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEASESRNEDS 689
           +E     K++ ++       +          +    + ES       +       + E+ 
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEES------EEPEPPPLPKKEEE 338

Query: 690 GVEGKENEDGRRKRGKRK 707
             E   + DG R+RG+R+
Sbjct: 339 KEEVTVSPDGGRRRGRRR 356


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.8 bits (77), Expect = 0.47
 Identities = 28/129 (21%), Positives = 47/129 (36%), Gaps = 34/129 (26%)

Query: 611 NDDDDVPLSKRTSYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYES- 669
           +D+DD   S +        D +    KK+ +K     ++     +     D D  +Y+S 
Sbjct: 221 DDEDDGDESDK-----GGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275

Query: 670 -----AMDTDYVDDNEASES--------------------RNEDSGV---EGKENEDGRR 701
                  + DY+ D+ AS +                    ++EDS     E  E E G  
Sbjct: 276 DGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLS 335

Query: 702 KRGKRKKGD 710
           K+GK+ K  
Sbjct: 336 KKGKKLKKL 344


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 34.2 bits (79), Expect = 0.49
 Identities = 12/47 (25%), Positives = 28/47 (59%)

Query: 12  DANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKKFK 58
           + ++++  + E+  ++ S  E+ +++ED D E+DE    ++  KK K
Sbjct: 335 EEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKK 381


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The
          domain is found in the primary vegetative sigma factor.
          The function of this domain is unclear and can be
          removed without loss of function.
          Length = 211

 Score = 32.9 bits (76), Expect = 0.51
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 10 YVDANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSS 49
          ++D N   +      +  E   ED +D++D+D + DE+  
Sbjct: 29 FIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDE 68



 Score = 30.2 bits (69), Expect = 4.5
 Identities = 11/38 (28%), Positives = 19/38 (50%), Gaps = 3/38 (7%)

Query: 12 DANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSS 49
           A   ES+  E+   +    +D D++EDED E + +  
Sbjct: 40 TAAAIESELDEE---DLEDDDDDDEDEDEDDEEEADLG 74


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 34.2 bits (78), Expect = 0.55
 Identities = 31/153 (20%), Positives = 50/153 (32%), Gaps = 21/153 (13%)

Query: 580  EEMLSDYDDNDNSMMTATQDDD------DEPFQSE-----------VENDDDDVPLSKRT 622
             ++  D D  D  M  A ++ +      DEP Q E               DD   L++  
Sbjct: 4000 GDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDD 4059

Query: 623  SYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEAS 682
                    EE     +   + GV +  +     +    +    D    MD      +  +
Sbjct: 4060 EKMNEDGFEENVQENEESTEDGVKSDEEL---EQGEVPEDQAIDNHPKMDAKSTFASAEA 4116

Query: 683  ESRNEDSGVEGKENEDGRRKRGKRKKGDTPQEF 715
            +  N D G+   ENE+   + G R  G    EF
Sbjct: 4117 DEENTDKGI-VGENEELGEEDGVRGNGTADGEF 4148



 Score = 30.4 bits (68), Expect = 7.9
 Identities = 22/132 (16%), Positives = 42/132 (31%), Gaps = 5/132 (3%)

Query: 572  QEPDSIVKEEMLSDYDDNDNSMMTATQDDDDEP----FQSEVENDDDDVPLSKRTSYYAS 627
            +  D +  ++ +      +NS      +D D P       +  +   D  L       A 
Sbjct: 3958 EMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAAD 4017

Query: 628  SADEERPAPKKRGRKKGVPTRRKST-GSRKSAKSDVDTTDYESAMDTDYVDDNEASESRN 686
               EE  A K    +   P    +T           D  + +  M+ D  ++N      +
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEES 4077

Query: 687  EDSGVEGKENED 698
             + GV+  E  +
Sbjct: 4078 TEDGVKSDEELE 4089


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 33.5 bits (76), Expect = 0.58
 Identities = 51/296 (17%), Positives = 88/296 (29%), Gaps = 52/296 (17%)

Query: 513 ITGAEHDEIMGESNPATAQVNASETSGDLNTSLAPSDGTIPEIDINKSDTGNMYIVTPKQ 572
           ITGA    +     P   Q+N++  S   N+++   +  + +              + K 
Sbjct: 169 ITGAPGLLVDAVDVPHNKQINSNNNSTSHNSNMGSKNAKMND--------------SSKL 214

Query: 573 EPDSIVKEEMLSDYDDNDNSMMTATQDDDDEP---FQSEVENDDDDVPLSKRTSYYASSA 629
           +  S+++E     + +   S         D      +     DDD +  S     Y   +
Sbjct: 215 KKLSLIEE---KKFPERVRSRYIDIGHMMDTHQFYLEDVDLMDDDILGPSNEEMLYKYIS 271

Query: 630 DEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEASESRNEDS 689
             +  A                    +     V   +    + T  + D       N  S
Sbjct: 272 PSQGSAELF--------EESSLGFDYEFIHKSVGNKEIRGGISTGEMIDVRKLPCTNSSS 323

Query: 690 GVEGKENEDGRR-----KRGKRKKGDTPQEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQ 744
              GK    G R      R  + K   P + P   C K +  +  LK H      +    
Sbjct: 324 --NGKLAHGGERNIDTPSRMLKVKDGKPYKCPVEGCNKKYKNQNGLKYH------MLHGH 375

Query: 745 CWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACSICDRKFTIKGQLSRHLRAH 800
                    +K H N    K +        + PY C +CD+++     L  H R H
Sbjct: 376 Q-------NQKLHENPSPEKMNIFS---AKDKPYRCEVCDKRYKNLNGLKYH-RKH 420



 Score = 32.4 bits (73), Expect = 1.5
 Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 775 EAPYACSI--CDRKFTIKGQLSRHL-------RAHENEKPE---------KPQTCEVCQK 816
             PY C +  C++K+  +  L  H+       + HEN  PE         KP  CEVC K
Sbjct: 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDK 406

Query: 817 TFDKMAKFKRHLK 829
            +  +   K H K
Sbjct: 407 RYKNLNGLKYHRK 419


>gnl|CDD|227480 COG5151, SSL1, RNA polymerase II transcription
           initiation/nucleotide excision repair factor TFIIH,
           subunit SSL1 [Transcription / DNA replication,
           recombination, and repair].
          Length = 421

 Score = 33.4 bits (76), Expect = 0.66
 Identities = 25/93 (26%), Positives = 37/93 (39%), Gaps = 11/93 (11%)

Query: 777 PYACSICDRKFTIKGQLSRHLRAHENEKP--EKPQT-------CEVCQKTFDKMAKFKRH 827
           P +C IC  +  +   L+R        KP  EKP+        C VCQ  F K       
Sbjct: 322 PISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPVSPFD 381

Query: 828 LKVHDGVRDHQCQVCGNRFKQKSNLNIHMKIHE 860
                G   +QC++C + F    ++ IH  +H 
Sbjct: 382 ESTSSGR--YQCELCKSTFCSDCDVFIHETLHF 412


>gnl|CDD|165332 PHA03035, PHA03035, hypothetical protein; Provisional.
          Length = 158

 Score = 32.1 bits (72), Expect = 0.80
 Identities = 18/80 (22%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 602 DEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRGRKKGVPTR-RKSTGSRKSAKS 660
           D+   S +++DDDD+ ++   +Y   S    +   K        P+   K   + K A +
Sbjct: 79  DQQSGSAIKDDDDDIGINSADAYGGKSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAAN 138

Query: 661 DVDTTDYESAMDTDYVDDNE 680
             D +D++  +  D +D++E
Sbjct: 139 STDDSDHKQNLMGDMIDEDE 158


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.8 bits (76), Expect = 0.98
 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 4/49 (8%)

Query: 10  YVDANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKKFK 58
           +VD N EE              E  DDE++E+ E++ + S      +  
Sbjct: 173 FVDPNAEEDPAHV----GSELEELDDDEDEEEEEDENDDSLAADESELP 217


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 33.0 bits (76), Expect = 1.1
 Identities = 12/124 (9%), Positives = 30/124 (24%), Gaps = 4/124 (3%)

Query: 595 TATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRGRKK----GVPTRRK 650
                   E   +           +       +    ER    +RG  +    G      
Sbjct: 95  AEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPAT 154

Query: 651 STGSRKSAKSDVDTTDYESAMDTDYVDDNEASESRNEDSGVEGKENEDGRRKRGKRKKGD 710
              +  + +++ +  D             EA           G++ +D  R+  + +   
Sbjct: 155 EARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDR 214

Query: 711 TPQE 714
             + 
Sbjct: 215 REER 218


>gnl|CDD|177288 PHA00616, PHA00616, hypothetical protein.
          Length = 44

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 7/28 (25%), Positives = 14/28 (50%)

Query: 418 VYTCEMCEKTFKYKLPLVKHCKKQHQVD 445
           +Y C  C   F+ K  +++H    H+ +
Sbjct: 1   MYQCLRCGGIFRKKKEVIEHLLSVHKQN 28


>gnl|CDD|224908 COG1997, RPL43A, Ribosomal protein L37AE/L43A [Translation,
           ribosomal structure and biogenesis].
          Length = 89

 Score = 30.0 bits (68), Expect = 1.5
 Identities = 12/35 (34%), Positives = 13/35 (37%), Gaps = 13/35 (37%)

Query: 717 CSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKC 751
           C  CG+            RI TGI     W C KC
Sbjct: 38  CPFCGR--------TTVKRIATGI-----WKCRKC 59


>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 493

 Score = 32.3 bits (73), Expect = 1.7
 Identities = 28/135 (20%), Positives = 46/135 (34%), Gaps = 26/135 (19%)

Query: 814 CQKTFDKMAKFKRHLKVHDGVRDHQCQVC-GNRFKQKSNLNIHMKIHEGIKAHQCNVCMM 872
           C +    + + K+H K   G     C  C GN+    + + +       ++ H+      
Sbjct: 159 CHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRS--STLRDHK------ 208

Query: 873 KFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMR-KHDGIKPFLCSMCAEA-- 929
              N        + H      LC  C   F   D+L RH R +H+     +C M      
Sbjct: 209 ---NGGLEEEGFKGHP-----LCIFCKIYFYDDDELRRHCRLRHE--ACHICDMVGPIRY 258

Query: 930 --FSRKDHLKKHMRK 942
             F   + L+ H R 
Sbjct: 259 QYFKSYEDLEAHFRN 273


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 31.3 bits (71), Expect = 1.7
 Identities = 11/46 (23%), Positives = 19/46 (41%)

Query: 13 ANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKKFK 58
           N+EE K  E+ +  E   ++   E+ E+    +    K    K K
Sbjct: 22 ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLK 67


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 32.1 bits (73), Expect = 1.8
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 15  EEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMV 54
           EEES+  ++   EESA +DSDDEE+ED ++++ S    M+
Sbjct: 172 EEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 31.9 bits (73), Expect = 2.1
 Identities = 23/114 (20%), Positives = 38/114 (33%), Gaps = 32/114 (28%)

Query: 581 EMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRG 640
           E+ ++ DD D+        D+D+  + ++ + DDD              +E  P      
Sbjct: 217 EIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDD------------DEEESDP------ 258

Query: 641 RKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDT--DYVDDNEASESRNEDSGVE 692
                   R ST    S K D       + MD    ++D        +E  GV 
Sbjct: 259 -----EVERTSTIKEVSEKLD-------AIMDLLFTHLDSCFTDGELDEGKGVN 300


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 31.8 bits (72), Expect = 2.2
 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 17/67 (25%)

Query: 628 SADEERPAPKKRGRKKGV-------PTRRKST----------GSRKSAKSDVDTTDYESA 670
           +A  E P P +RGRKK          T R+ T          G+ +  ++  D  DY+  
Sbjct: 58  AAVPESPKPSRRGRKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDDG 117

Query: 671 MDTDYVD 677
           +D  Y D
Sbjct: 118 IDFPYDD 124


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 31.8 bits (72), Expect = 2.4
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 12  DANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKK 56
           D +E+E + IE     ES S+  D EEDE  ++ E + G   + K
Sbjct: 659 DDDEDECEAIED-SESESESDGEDGEEDEQEDDAEANEGVVPIDK 702


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 31.2 bits (70), Expect = 2.8
 Identities = 13/56 (23%), Positives = 20/56 (35%), Gaps = 7/56 (12%)

Query: 9   KYVDANEEESKDIEKLVIEESASEDSDDEEDEDFE-------NDENSSGKKMVKKF 57
           K V   ++E +  E    E    E   D  DE  E       N+EN    +++   
Sbjct: 287 KEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 31.3 bits (71), Expect = 3.1
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 9  KYVDANEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKK 56
          K V   EEE +  EK   EE  + D ++E DE+ E +E     K VK+
Sbjct: 33 KEVPDEEEEEEKEEKKE-EEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79


>gnl|CDD|202452 pfam02892, zf-BED, BED zinc finger. 
          Length = 43

 Score = 27.6 bits (62), Expect = 3.2
 Identities = 7/27 (25%), Positives = 10/27 (37%), Gaps = 3/27 (11%)

Query: 55 KKFKCNICPKRYARK---NRLTNHLRT 78
           K +C  C K  +     + L  HL  
Sbjct: 14 TKARCKYCGKILSGGGGTSNLKRHLVR 40


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
          complex subunit was formerly known as Srb4 in yeasts or
          Trap80 in Drosophila and human. The Med17 subunit is
          located within the head domain and is essential for
          cell viability to the extent that a mutant strain of
          cerevisiae lacking it shows all RNA polymerase
          II-dependent transcription ceasing at non-permissive
          temperatures.
          Length = 454

 Score = 31.2 bits (71), Expect = 3.2
 Identities = 18/52 (34%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 8  GKYVDANEEE-SKDIEKLVIEESASEDSD-DEEDEDFENDENSSGKKMVKKF 57
          G + D  EE   ++I K   +   SE+SD +E+DE+ +ND++   K  V++F
Sbjct: 44 GSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEF 95


>gnl|CDD|227293 COG4957, COG4957, Predicted transcriptional regulator
           [Transcription].
          Length = 148

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 707 KKGDTPQEFPCSECGKVFTRKTTLKRHVRIHTG 739
           KK  TP    C E GK F    +LKRH+  H G
Sbjct: 69  KKSVTPDYIICLEDGKKFK---SLKRHLTTHYG 98


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
          N-terminal.  This is the very acidic N-terminal region
          of the early transcription elongation factor Spt5. The
          Spt5-Spt4 complex regulates early transcription
          elongation by RNA polymerase II and has an imputed role
          in pre-mRNA processing via its physical association
          with mRNA capping enzymes. The actual function of this
          N-terminal domain is not known although it is
          dispensable for binding to Spt4.
          Length = 92

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 7/28 (25%), Positives = 15/28 (53%)

Query: 26 IEESASEDSDDEEDEDFENDENSSGKKM 53
          ++  A  D ++EE+E+ E+D      + 
Sbjct: 2  LDTEAEVDDEEEEEEEEEDDLEDLSDED 29


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 31.1 bits (70), Expect = 3.4
 Identities = 31/150 (20%), Positives = 51/150 (34%), Gaps = 25/150 (16%)

Query: 578 VKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPK 637
            +EE + + DD++        +D+D+    + + DD+D      T +  SSAD+      
Sbjct: 45  AEEEAMEEEDDDEEDDDDDDDEDEDDD-DDDDDEDDEDEDDDDSTLHDDSSADD------ 97

Query: 638 KRGRKKGVPTRRKSTGSRKSAKSDVDTTDYE-----SAMDTDYVDD----NEASESRNED 688
                 G  T  +  G   S     D ++Y      +                S  R  D
Sbjct: 98  ------GNETDNE-AGFADSDDESDDGSEYVFWAPGTTTAATSPRKLETMRRKSRRRTSD 150

Query: 689 SGVEGKENEDGRR--KRGKRKKGDTPQEFP 716
           S  +       RR  KR +R     P+  P
Sbjct: 151 SSADSLNERKQRRKWKRPRRSPIKPPKIRP 180


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 31.1 bits (70), Expect = 3.5
 Identities = 11/35 (31%), Positives = 24/35 (68%), Gaps = 3/35 (8%)

Query: 12  DANEEESKDIEKLVIEESASEDSDDEEDEDFENDE 46
           + ++E S+D+ +   +ES ++ SD+E+ ED++  E
Sbjct: 956 EESDESSEDLSE---DESENDSSDEEDGEDWDELE 987


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 30.1 bits (68), Expect = 3.7
 Identities = 23/85 (27%), Positives = 38/85 (44%), Gaps = 14/85 (16%)

Query: 575 DSIVKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSA----D 630
           DS    +  SD DD++       +++D+E  +S    +  DV   K+++Y   SA    +
Sbjct: 3   DSDDIIDYESDDDDSEEY---EEEEEDEEDAESL---ESSDVSSLKQSNYKIESASENIE 56

Query: 631 EERPAPKKRGRK----KGVPTRRKS 651
           +    PK+   K    K   TRR S
Sbjct: 57  DATSNPKQVSEKISAIKRRYTRRIS 81


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 9/38 (23%), Positives = 21/38 (55%)

Query: 9   KYVDANEEESKDIEKLVIEESASEDSDDEEDEDFENDE 46
           K  +   E+  + ++   EE   E+ +DE+ +D ++D+
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 30.9 bits (70), Expect = 4.0
 Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 7/91 (7%)

Query: 595 TATQDDDD--EPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRGRKKGVPTRRKST 652
            +T+DD+D  E F  E+E  +   P  K     AS     R  PKK  +           
Sbjct: 26  DSTEDDEDILE-FLDELEQSEKAKPPKKPKE--ASRPGTPR-NPKKSSKPTESSAASSEE 81

Query: 653 GSRKSAKSDVDTTDYESAMDTDYVDDNEASE 683
              K  KS   T             ++E  E
Sbjct: 82  KPAKPRKSAESTRSSHPKSKAP-STESEEEE 111


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 31.1 bits (70), Expect = 4.0
 Identities = 12/78 (15%), Positives = 22/78 (28%), Gaps = 2/78 (2%)

Query: 621 RTSYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNE 680
             + Y +      P    R R     T R   G   +A  +     Y    D  +   + 
Sbjct: 497 SPATYYTRMGGGPPRLPPRNRA--TETLRPDWGPPAAAPPEQMEDPYLEPDDDRFDRRDG 554

Query: 681 ASESRNEDSGVEGKENED 698
           A+ +           ++D
Sbjct: 555 AAAAATSHPREAPAPDDD 572


>gnl|CDD|178770 PLN03231, PLN03231, putative alpha-galactosidase; Provisional.
          Length = 357

 Score = 30.7 bits (69), Expect = 4.3
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 5/73 (6%)

Query: 502 QTLDRHVTVHGITGAEHDEIMGESNPATAQ-VNASE----TSGDLNTSLAPSDGTIPEID 556
           Q+L       GI   +HD + G  NP   + +  S+    +   +  SL+P DG  P + 
Sbjct: 166 QSLYDQYASWGIDFIKHDCVFGAENPQLDEILTVSKAIRNSGRPMIYSLSPGDGATPGLA 225

Query: 557 INKSDTGNMYIVT 569
              +   NMY VT
Sbjct: 226 ARVAQLVNMYRVT 238


>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 129

 Score = 29.5 bits (66), Expect = 4.6
 Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 583 LSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDV 616
           L D  D+++  +    DDDD+ F  + E+DDDDV
Sbjct: 86  LPDLGDDEDVDL---DDDDDDTFLEDEEDDDDDV 116


>gnl|CDD|166325 PLN02684, PLN02684, Probable galactinol--sucrose
           galactosyltransferase.
          Length = 750

 Score = 30.7 bits (69), Expect = 4.8
 Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 5/58 (8%)

Query: 717 CSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLN----RHLRKHSGVVD 770
           C E G V T++++    + IH G   FQ  I       K HL     RH +K  G+VD
Sbjct: 146 CLESGDVDTKRSSFTHSLFIHAGTDPFQT-ITDAIRAVKLHLKSFRQRHEKKLPGIVD 202


>gnl|CDD|218591 pfam05443, ROS_MUCR, ROS/MUCR transcriptional regulator protein.
           This family consists of several ROS/MUCR transcriptional
           regulator proteins. The ros chromosomal gene is present
           in octopine and nopaline strains of Agrobacterium
           tumefaciens as well as in Rhizobium meliloti. This gene
           encodes a 15.5-kDa protein that specifically represses
           the virC and virD operons in the virulence region of the
           Ti plasmid and is necessary for succinoglycan
           production. Sinorhizobium meliloti can produce two types
           of acidic exopolysaccharides, succinoglycan and
           galactoglucan, that are interchangeable for infection of
           alfalfa nodules. MucR from Sinorhizobium meliloti acts
           as a transcriptional repressor that blocks the
           expression of the exp genes responsible for
           galactoglucan production therefore allowing the
           exclusive production of succinoglycan.
          Length = 130

 Score = 29.1 bits (66), Expect = 4.8
 Identities = 17/35 (48%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 707 KKGDTPQEFPCSECGKVFTRKTTLKRHVRIHTGIK 741
           KK  TP    C E GK F    TLKRH+R H G+ 
Sbjct: 63  KKSVTPDYIICLEDGKKFK---TLKRHLRTHHGLT 94


>gnl|CDD|224178 COG1258, COG1258, Predicted pseudouridylate synthase [Translation,
           ribosomal structure and biogenesis].
          Length = 398

 Score = 30.4 bits (69), Expect = 5.2
 Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 792 QLSRHLRAHENEKPE-KPQTCEVCQKTFDKMAKF 824
           +LS  L   E  K E +P+ C +C   FD++ +F
Sbjct: 42  KLSLALELDEEYKNEVEPEPCYICGGIFDRVEEF 75


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 30.5 bits (69), Expect = 5.4
 Identities = 26/140 (18%), Positives = 55/140 (39%), Gaps = 22/140 (15%)

Query: 581 EMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRG 640
           E++ + +  D+ +    + + +E     +E +  D           ++ DE R   KK+G
Sbjct: 238 ELVQEEESIDDELDVLREIEAEEAGIGPIEEEVVDS---------QAANDEPRRVFKKKG 288

Query: 641 RKKGV------PTRRKSTGSRKSAKSD-------VDTTDYESAMDTDYVDDNEASESRNE 687
           +K+        P R K +      +SD       +D       +      D +  +  +E
Sbjct: 289 QKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDE 348

Query: 688 DSGVEGKENEDGRRKRGKRK 707
           +S  E ++ +  ++K  KRK
Sbjct: 349 ESKEEVEKKQKVKKKPRKRK 368


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 30.4 bits (68), Expect = 5.7
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 584 SDYDDNDNSMMTATQDD--DDEPFQSEVEND-DDDVPLSKRTSYYASSADEERPA 635
           S  D  D +M +A ++D  +DE FQ   ++D  ++  L    S     +DEERP+
Sbjct: 454 SISDSEDINMGSAGEEDESEDEDFQMVSDSDVAEEYDLQAALSDAEGGSDEERPS 508


>gnl|CDD|219443 pfam07515, DUF1528, Protein of unknown function (DUF1528). 
          Length = 106

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 13/48 (27%), Positives = 19/48 (39%), Gaps = 2/48 (4%)

Query: 824 FKRHLKVHDGVRDHQCQVCGNRFKQKS--NLNIHMKIHEGIKAHQCNV 869
           FKR+ + H  V          ++ QK    L +H K   G+    C V
Sbjct: 25  FKRYAQEHPNVPAAGNDEEDWQWVQKRFEKLGLHRKQRNGLNIWTCKV 72



 Score = 28.0 bits (63), Expect = 7.9
 Identities = 14/53 (26%), Positives = 23/53 (43%), Gaps = 5/53 (9%)

Query: 795 RHLRAHENE--KPEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNR 845
           R+ + H N        +  +  QK F+K+     H K  +G+    C+V G R
Sbjct: 27  RYAQEHPNVPAAGNDEEDWQWVQKRFEKL---GLHRKQRNGLNIWTCKVYGPR 76


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.9 bits (67), Expect = 6.5
 Identities = 21/142 (14%), Positives = 46/142 (32%), Gaps = 18/142 (12%)

Query: 567 IVTPKQEPDSIVKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYA 626
           IV  KQE D   KE  L               D+D      + ++  + +   K      
Sbjct: 116 IVVSKQEEDGPGKEPQL---------------DEDKFLLAEDSDDRQETLEAGKVHEETE 160

Query: 627 SSADEERPA-PKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEASESR 685
            S   E  A  + +   K   + +++  S++  +    T       D    D +E     
Sbjct: 161 DSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERT--KAETDDVTEEDYDEEDNPV 218

Query: 686 NEDSGVEGKENEDGRRKRGKRK 707
            +   ++ +  ++   ++ +  
Sbjct: 219 EDSKAIKEELAKEPVEEQQEVP 240


>gnl|CDD|182033 PRK09697, PRK09697, protein secretion protein GspB; Provisional.
          Length = 139

 Score = 28.9 bits (64), Expect = 6.5
 Identities = 14/42 (33%), Positives = 23/42 (54%)

Query: 15  EEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVKK 56
           E   ++ E  V  E+A   S+DEE+   E+DE +  ++ VK 
Sbjct: 91  ESVEEEDEPGVAVENAPSSSEDEENTVEESDEKAGLRERVKN 132


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 29.3 bits (66), Expect = 6.7
 Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 5/41 (12%)

Query: 10  YVDANEEESKD---IEKLVIEESASEDSDDEEDEDFENDEN 47
             D N++E +D   I +++IE+   ++  DE+D++ E DE 
Sbjct: 135 LDDENDDEDEDDDEIVEILIEDDEVDE--DEDDDEDEEDEE 173


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 30.3 bits (68), Expect = 6.9
 Identities = 34/151 (22%), Positives = 44/151 (29%), Gaps = 40/151 (26%)

Query: 578 VKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPK 637
           V  EML         M  A +  DD   + E + DDD  P             E      
Sbjct: 189 VVREMLRS-------MELAEEMGDDTESEDEEDGDDDQ-PTENEQEEQGEGEGE------ 234

Query: 638 KRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEASESRNEDSGVEGKEN- 696
                          G   SA  + + TD ES        + E  +S  +D   E  ++ 
Sbjct: 235 ---------------GQEGSAPQESEATDRESES-----GEEEMVQSDQDDLPDESDDDS 274

Query: 697 -EDGRRKRGKRKKGDTPQEFPCSECGKVFTR 726
              G   R  R    T  E       KVFT 
Sbjct: 275 ETPGEGARPARPFTSTGGEPDY----KVFTT 301


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 30.3 bits (68), Expect = 7.2
 Identities = 14/42 (33%), Positives = 27/42 (64%), Gaps = 2/42 (4%)

Query: 14  NEEESKDIEKLVIEESASEDSDDEEDEDFENDENSSGKKMVK 55
           ++E++ + E+  ++ESA E  DD++D D+E+    SG+  V 
Sbjct: 286 DDEDAIETEEDDVDESAIE--DDDDDSDWEDSVEESGRSSVD 325


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 29.8 bits (66), Expect = 7.2
 Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 1/67 (1%)

Query: 643 KGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVD-DNEASESRNEDSGVEGKENEDGRR 701
           KG+ T  + T   K   +           + D  D D+++ E  ++D G E +E E  R 
Sbjct: 46  KGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDGGAEDEEEEKVRG 105

Query: 702 KRGKRKK 708
           + G+   
Sbjct: 106 QSGQEGT 112


>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger. 
          Length = 27

 Score = 26.0 bits (58), Expect = 7.4
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 150 VFLCSVCSKTFTDKVKFNRHMRAH 173
           V  C VC KTF+       H ++H
Sbjct: 1   VHTCGVCGKTFSSLQALGGHKKSH 24


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 29.6 bits (66), Expect = 7.9
 Identities = 41/189 (21%), Positives = 65/189 (34%), Gaps = 30/189 (15%)

Query: 553 PEIDINKSDTGNMYIVTPK-QEPDSIVKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVEN 611
           PE+ I    TG +    P+   P         S  D  + S   A Q    + F  E   
Sbjct: 54  PEVRIYDPVTGKLIRRGPQASAPAPFHAPGPSSLPDHGEPSSFRAVQPHQPQMFGEEA-- 111

Query: 612 DDDDVPLSKRTSYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAM 671
                          SS D E+P P +R R+   P+  ++       K D+++ D E   
Sbjct: 112 --------------PSSQDGEQPGPSRRKRQ---PSMSETMPLYTLCKEDLESMDKE--- 151

Query: 672 DTDYVDD---NEASESRNEDSGVEGKENEDGRRKRGKRKKGDTPQEFPCSECGKVFTRKT 728
               VDD     + +S +E     G E E  +  + ++++   P+  P         R  
Sbjct: 152 ----VDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSN 207

Query: 729 TLKRHVRIH 737
              R  R H
Sbjct: 208 IGGRGPRGH 216


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 30.0 bits (67), Expect = 8.2
 Identities = 35/164 (21%), Positives = 59/164 (35%), Gaps = 20/164 (12%)

Query: 559 KSDTGNMYIVTPKQEPDSIVK-----EEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDD 613
           K +     +       D++ +     E++ SDYD++       T   DD  F    E   
Sbjct: 317 KLELDLQTVFESNMNRDTLDEYAPEGEDLRSDYDEDFEYDGLTTVRIDDHGFLPGREQTS 376

Query: 614 DDVPLSKRTSYY-ASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMD 672
               + K TS+Y A  A++E     +                 +S  S +D  D +   D
Sbjct: 377 KKAAVPKGTSFYQAKWAEDEEEEDGQ-------------CNDEESTMSAIDDDDPKE-ND 422

Query: 673 TDYVDDNEASESRNEDSGVEGKENEDGRRKRGKRKKGDTPQEFP 716
            + V  +E S   + +   E    E+ R+ R  R      +EFP
Sbjct: 423 NEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFP 466


>gnl|CDD|214746 smart00614, ZnF_BED, BED zinc finger.  DNA-binding domain in
          chromatin-boundary-element-binding proteins and
          transposases.
          Length = 50

 Score = 26.6 bits (59), Expect = 8.2
 Identities = 9/23 (39%), Positives = 15/23 (65%), Gaps = 1/23 (4%)

Query: 58 KCNICPKRYARKNRL-TNHLRTH 79
          KC  C K+ +R ++  T++LR H
Sbjct: 20 KCKYCGKKLSRSSKGGTSNLRRH 42


>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
          Length = 259

 Score = 29.4 bits (65), Expect = 8.6
 Identities = 27/136 (19%), Positives = 51/136 (37%), Gaps = 10/136 (7%)

Query: 600 DDDEPFQSEVENDDDDVPLSKRTSYYASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAK 659
           ++D+ F  + +++D     S    +  S    +  A      ++      K   S +   
Sbjct: 99  NNDKDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMS-QPLT 157

Query: 660 SDVDTTDYESAMDTDYVDDNEASESRNEDSGVEGKENEDGRRKRGKRKKGDTPQEFPCSE 719
            D  T +  S  D    +D+  SE    ++ +  +    G++K+GKR    +  E P   
Sbjct: 158 IDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPP--- 214

Query: 720 CGKVFTRKTTLKRHVR 735
                 RK+  KRH  
Sbjct: 215 ------RKSRRKRHSH 224


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 29.6 bits (66), Expect = 9.1
 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 7/51 (13%)

Query: 15  EEESKDIEKL-------VIEESASEDSDDEEDEDFENDENSSGKKMVKKFK 58
           EEE +D+E++       V+EE   +D D E+D++ E+D      ++  ++ 
Sbjct: 109 EEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEYI 159


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 29.9 bits (67), Expect = 9.4
 Identities = 32/179 (17%), Positives = 61/179 (34%), Gaps = 9/179 (5%)

Query: 573 EPDSIVKEEMLSDYDDNDNSMMTATQDDDDEPFQSEVENDDDDVPLSKRTSYYASSADEE 632
           E +  V+EE   + ++ +    +  ++ +DE  + EVE D+      + +S      +E 
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEP 499

Query: 633 RPAPKKRGRK-KGVPTRRKSTGSRKSAKSDVDTTDYES---AMDTDYVDDNEASESRNED 688
               ++R  +  G+    +    R S+       +      +MD + V +    E   E+
Sbjct: 500 EEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEE 559

Query: 689 SGVEG-----KENEDGRRKRGKRKKGDTPQEFPCSECGKVFTRKTTLKRHVRIHTGIKE 742
           S +               K    +K   P      E        TT   +V  HT   E
Sbjct: 560 SPLSSHTELEGVATPVETKISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSPHTPQDE 618


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.8 bits (67), Expect = 9.4
 Identities = 26/135 (19%), Positives = 44/135 (32%), Gaps = 21/135 (15%)

Query: 581 EMLSD-YDDNDNSMMTATQDDDDEPFQSEVEND---------------DDDVPLSKRTSY 624
            ML + YD  + S  T  +D   E    E                      + L +    
Sbjct: 130 AMLEEKYDKANFSKRTDKEDAPLE----EAVALLVREKLTGDAPPDSAGKVLDLWRDEIE 185

Query: 625 YASSADEERPAPKKRGRKKGVPTRRKSTGSRKSAKSDVDTTDYESAMDTDYVDDNEASES 684
             +  D +  A +   ++      R   GS   A+   D    E A + D  DD++   +
Sbjct: 186 DKAGEDLDGLAAEIDDQQAFARVVRDMLGSMDMAEETGDDGIEEDADEED-GDDDQPDNN 244

Query: 685 RNEDSGVEGKENEDG 699
            + ++G E  E  D 
Sbjct: 245 EDSEAGREESEGSDE 259


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 29.4 bits (66), Expect = 9.6
 Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 3/45 (6%)

Query: 661 DVDTTDYESAMDTDYVDDNEASESRNEDSGVEGKENEDGRRKRGK 705
           D D   Y+S+ D+   D N   ++  ED    G++  +G +    
Sbjct: 292 DDDYDSYDSS-DSASSDSNSDVDTNEEDDR--GEKESNGAKSNEL 333


>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
          Length = 55

 Score = 26.9 bits (59), Expect = 9.7
 Identities = 8/24 (33%), Positives = 17/24 (70%)

Query: 743 FQCWICSKCFMEKSHLNRHLRKHS 766
           ++C IC + ++++  +  HLRKH+
Sbjct: 6   YECPICGEIYIKRKSMITHLRKHN 29


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.131    0.399 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 49,923,666
Number of extensions: 4682959
Number of successful extensions: 6556
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6281
Number of HSP's successfully gapped: 249
Length of query: 1043
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 936
Effective length of database: 6,191,724
Effective search space: 5795453664
Effective search space used: 5795453664
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)