RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14234
         (779 letters)



>gnl|CDD|99929 cd05497, Bromo_Brdt_I_like, Bromodomain, Brdt_like subfamily,
           repeat I. Human Brdt is a testis-specific member of the
           BET subfamily of bromodomain proteins; the first
           bromodomain in Brdt has been shown to be essential for
           male germ cell differentiation. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 107

 Score =  212 bits (541), Expect = 2e-65
 Identities = 80/107 (74%), Positives = 92/107 (85%)

Query: 120 RNTNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENN 179
           R+TNQLQY++K V+KA+WKH  AWPF +PVDA+ LNLPDYHK+I  PMDLGTIKKRLENN
Sbjct: 1   RHTNQLQYLLKVVLKALWKHKFAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENN 60

Query: 180 YYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGM 226
           YYWS  E IQDF TMFTNCY+YNKPG+DVVLMAQ LE+LFL K+  M
Sbjct: 61  YYWSASECIQDFNTMFTNCYIYNKPGDDVVLMAQTLEKLFLQKLAQM 107



 Score =  111 bits (279), Expect = 5e-29
 Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 1/90 (1%)

Query: 388 KHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFS 447
           KH+ +AWPF  PVD  KL L DY  IIK PMDLGT++K++ N  Y +A E   D   +F+
Sbjct: 19  KHK-FAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQDFNTMFT 77

Query: 448 NCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
           NCY YN P  +VV MA+ L  +F  + A+M
Sbjct: 78  NCYIYNKPGDDVVLMAQTLEKLFLQKLAQM 107


>gnl|CDD|99930 cd05498, Bromo_Brdt_II_like, Bromodomain, Brdt_like subfamily,
           repeat II. Human Brdt is a testis-specific member of the
           BET subfamily of bromodomain proteins; the first
           bromodomain in Brdt has been shown to be essential for
           male germ cell differentiation. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 102

 Score =  187 bits (476), Expect = 3e-56
 Identities = 69/102 (67%), Positives = 79/102 (77%)

Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
             K+CS I+ ELF KKH+ YAWPFY PVD E LGL DY DIIK PMDL T++KK+ NR Y
Sbjct: 1   QLKFCSGILKELFSKKHKAYAWPFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREY 60

Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
             A+EFA DVRL+FSNCYKYNPPDH V  MAR+L  VFEDR+
Sbjct: 61  ADAQEFAADVRLMFSNCYKYNPPDHPVHAMARKLQDVFEDRW 102



 Score =  111 bits (280), Expect = 3e-29
 Identities = 39/78 (50%), Positives = 49/78 (62%)

Query: 142 AWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVY 201
           AWPF++PVD   L L DYH +I  PMDL TIKK+L+N  Y   +E   D   MF+NCY Y
Sbjct: 21  AWPFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKY 80

Query: 202 NKPGEDVVLMAQNLEQLF 219
           N P   V  MA+ L+ +F
Sbjct: 81  NPPDHPVHAMARKLQDVF 98


>gnl|CDD|99931 cd05499, Bromo_BDF1_2_II, Bromodomain. BDF1/BDF2 like subfamily,
           restricted to fungi, repeat II. BDF1 and BDF2 are yeast
           transcription factors involved in the expression of a
           wide range of genes, including snRNAs; they are required
           for sporulation and DNA repair and protect histone H4
           from deacetylation. Bromodomains are 110 amino acid long
           domains, that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 102

 Score =  145 bits (368), Expect = 3e-41
 Identities = 53/99 (53%), Positives = 65/99 (65%)

Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYK 433
            K+C E++ EL   KH  Y WPF  PVD   L + +YF IIKKPMDLGT+ KK++N  Y+
Sbjct: 2   LKFCEEVLKELMKPKHSAYNWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQ 61

Query: 434 TAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFED 472
           +AKEF  DVRLIF NCY +NP   +V  M  QL  VF D
Sbjct: 62  SAKEFERDVRLIFKNCYTFNPEGTDVYMMGHQLEEVFND 100



 Score =  101 bits (254), Expect = 9e-26
 Identities = 38/78 (48%), Positives = 52/78 (66%)

Query: 142 AWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVY 201
            WPF +PVD + LN+P+Y  +I +PMDLGTI K+L+N  Y S KE  +D   +F NCY +
Sbjct: 21  NWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSAKEFERDVRLIFKNCYTF 80

Query: 202 NKPGEDVVLMAQNLEQLF 219
           N  G DV +M   LE++F
Sbjct: 81  NPEGTDVYMMGHQLEEVF 98


>gnl|CDD|99938 cd05506, Bromo_plant1, Bromodomain, uncharacterized subfamily
           specific to plants. Might function as a global
           transcription factor. Bromodomains are 110 amino acid
           long domains, that are found in many chromatin
           associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 99

 Score =  135 bits (342), Expect = 1e-37
 Identities = 53/102 (51%), Positives = 70/102 (68%), Gaps = 3/102 (2%)

Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
             K C  ++ +L   KH+ + W F  PVDV  LGL DYFDIIKKPMDLGTV+KK+    Y
Sbjct: 1   VMKQCGTLLRKLM--KHK-WGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEY 57

Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
            + +EFA DVRL F+N  +YNPP ++V TMA++L  +FE R+
Sbjct: 58  SSPEEFAADVRLTFANAMRYNPPGNDVHTMAKELLKIFETRW 99



 Score =  110 bits (276), Expect = 8e-29
 Identities = 40/92 (43%), Positives = 55/92 (59%)

Query: 131 NVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQD 190
            +++ + KH   W F+ PVD + L LPDY  +I +PMDLGT+KK+LE   Y S +E   D
Sbjct: 7   TLLRKLMKHKWGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEFAAD 66

Query: 191 FTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTK 222
               F N   YN PG DV  MA+ L ++F T+
Sbjct: 67  VRLTFANAMRYNPPGNDVHTMAKELLKIFETR 98


>gnl|CDD|197636 smart00297, BROMO, bromo domain. 
          Length = 107

 Score =  122 bits (309), Expect = 4e-33
 Identities = 41/108 (37%), Positives = 60/108 (55%), Gaps = 3/108 (2%)

Query: 371 PLWYKYCSEIIAELFHKKHQN-YAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRN 429
           P   K   E++  +  K   +  +WPF  PV  ++    DY+DIIKKPMDL T++KK+ N
Sbjct: 2   PKLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEAP--DYYDIIKKPMDLKTIKKKLEN 59

Query: 430 RTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
             Y + +EF  D  L+FSN   YN PD  V   A++L   FE +  ++
Sbjct: 60  GKYSSVEEFVADFNLMFSNARTYNGPDSEVYKDAKKLEKFFEKKLREL 107



 Score =  119 bits (301), Expect = 6e-32
 Identities = 43/105 (40%), Positives = 62/105 (59%), Gaps = 2/105 (1%)

Query: 120 RNTNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENN 179
           +   +LQ ++K V+  +  HP +WPF +PV       PDY+ +I +PMDL TIKK+LEN 
Sbjct: 3   KLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEA--PDYYDIIKKPMDLKTIKKKLENG 60

Query: 180 YYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKIT 224
            Y S +E + DF  MF+N   YN P  +V   A+ LE+ F  K+ 
Sbjct: 61  KYSSVEEFVADFNLMFSNARTYNGPDSEVYKDAKKLEKFFEKKLR 105


>gnl|CDD|99922 cd04369, Bromodomain, Bromodomain. Bromodomains are found in many
           chromatin-associated proteins and in nuclear histone
           acetyltransferases. They interact specifically with
           acetylated lysine.
          Length = 99

 Score =  119 bits (299), Expect = 7e-32
 Identities = 39/102 (38%), Positives = 57/102 (55%), Gaps = 3/102 (2%)

Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
             K    ++  L  K  ++ + PF  PVD       DY+++IK PMDL T++KK++N  Y
Sbjct: 1   LKKKLRSLLDAL-KKLKRDLSEPFLEPVD--PKEAPDYYEVIKNPMDLSTIKKKLKNGEY 57

Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
           K+ +EF  DVRLIFSN   YN P   +   A++L  +FE   
Sbjct: 58  KSLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99



 Score =  108 bits (273), Expect = 2e-28
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 4/101 (3%)

Query: 125 LQYIVKNVMKAVWKH--PHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYW 182
           L+  +++++ A+ K     + PF EPVD      PDY++VI  PMDL TIKK+L+N  Y 
Sbjct: 1   LKKKLRSLLDALKKLKRDLSEPFLEPVDP--KEAPDYYEVIKNPMDLSTIKKKLKNGEYK 58

Query: 183 SGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKI 223
           S +E   D   +F+N   YN PG  +   A+ LE+LF   +
Sbjct: 59  SLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99


>gnl|CDD|99941 cd05509, Bromo_gcn5_like, Bromodomain; Gcn5_like subfamily. Gcn5p
           is a histone acetyltransferase (HAT) which mediates
           acetylation of histones at lysine residues; such
           acetylation is generally correlated with the activation
           of transcription. Bromodomains are 110 amino acid long
           domains, that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 101

 Score =  114 bits (287), Expect = 3e-30
 Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 124 QLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
            L   +K V+ ++  H  AWPF EPVD      PDY+ VI +PMDL T++++LEN YY +
Sbjct: 1   PLYTQLKKVLDSLKNHKSAWPFLEPVDK--EEAPDYYDVIKKPMDLSTMEEKLENGYYVT 58

Query: 184 GKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGM 226
            +E + D   +F NC +YN P  +    A  LE+ F  K+  +
Sbjct: 59  LEEFVADLKLIFDNCRLYNGPDTEYYKCANKLEKFFWKKLKEL 101



 Score =  105 bits (264), Expect = 5e-27
 Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 3/91 (3%)

Query: 387 KKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIF 446
           K H++ AWPF  PVD E+    DY+D+IKKPMDL T+ +K+ N  Y T +EF  D++LIF
Sbjct: 14  KNHKS-AWPFLEPVDKEEAP--DYYDVIKKPMDLSTMEEKLENGYYVTLEEFVADLKLIF 70

Query: 447 SNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
            NC  YN PD      A +L   F  +  ++
Sbjct: 71  DNCRLYNGPDTEYYKCANKLEKFFWKKLKEL 101


>gnl|CDD|99932 cd05500, Bromo_BDF1_2_I, Bromodomain. BDF1/BDF2 like subfamily,
           restricted to fungi, repeat I. BDF1 and BDF2 are yeast
           transcription factors involved in the expression of a
           wide range of genes, including snRNAs; they are required
           for sporulation and DNA repair and protect histone H4
           from deacetylation. Bromodomains are 110 amino acid long
           domains, that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 103

 Score =  100 bits (251), Expect = 2e-25
 Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 3/102 (2%)

Query: 370 TPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRN 429
           T   +K+    I  L   K    A PF  PVD  KL +  Y  IIKKPMDLGT+ +K+++
Sbjct: 2   TKHQHKFLLSSIRSL---KRLKDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKS 58

Query: 430 RTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFE 471
             Y + +EF  D  L+  NC  +N P+H V  M ++L A FE
Sbjct: 59  NVYTSVEEFTADFNLMVDNCLTFNGPEHPVSQMGKRLQAAFE 100



 Score = 92.4 bits (230), Expect = 2e-22
 Identities = 35/96 (36%), Positives = 58/96 (60%), Gaps = 1/96 (1%)

Query: 124 QLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
           Q ++++ ++ +++ +   A PF  PVD + LN+P Y  +I +PMDLGTI+++L++N Y S
Sbjct: 5   QHKFLLSSI-RSLKRLKDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKSNVYTS 63

Query: 184 GKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLF 219
            +E   DF  M  NC  +N P   V  M + L+  F
Sbjct: 64  VEEFTADFNLMVDNCLTFNGPEHPVSQMGKRLQAAF 99


>gnl|CDD|99936 cd05504, Bromo_Acf1_like, Bromodomain; Acf1_like or BAZ1A_like
           subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was
           identified as a novel human bromodomain gene by cDNA
           library screening. The Drosophila homologue, Acf1, is
           part of the CHRAC (chromatin accessibility complex) and
           regulates ISWI-induced nucleosome remodeling.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 115

 Score = 96.3 bits (240), Expect = 1e-23
 Identities = 42/101 (41%), Positives = 64/101 (63%), Gaps = 5/101 (4%)

Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
             +++ E+   KH++ +WPF  PV   K+ + DY+DIIKKPMDLGT+++K+    YK A+
Sbjct: 17  LEQLLVEI--VKHKD-SWPFLRPV--SKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAE 71

Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
           EF  D++L+FSNC+ YNP   +V     +L   F  R  K+
Sbjct: 72  EFLSDIQLVFSNCFLYNPEHTSVYKAGTRLQRFFIKRCRKL 112



 Score = 81.7 bits (202), Expect = 1e-18
 Identities = 32/85 (37%), Positives = 49/85 (57%), Gaps = 2/85 (2%)

Query: 138 KHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
           KH  +WPF  PV  I +  PDY+ +I +PMDLGTIK++L    Y   +E + D   +F+N
Sbjct: 26  KHKDSWPFLRPVSKIEV--PDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSDIQLVFSN 83

Query: 198 CYVYNKPGEDVVLMAQNLEQLFLTK 222
           C++YN     V      L++ F+ +
Sbjct: 84  CFLYNPEHTSVYKAGTRLQRFFIKR 108


>gnl|CDD|99927 cd05495, Bromo_cbp_like, Bromodomain, cbp_like subfamily. Cbp (CREB
           binding protein or CREBBP) is an acetyltransferase
           acting on histone, which gives a specific tag for
           transcriptional activation and also acetylates
           non-histone proteins. CREBBP binds specifically to
           phosphorylated CREB protein and augments the activity of
           phosphorylated CREB to activate transcription of
           cAMP-responsive genes. Bromodomains are 110 amino acid
           long domains, that are found in many chromatin
           associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 108

 Score = 90.6 bits (225), Expect = 8e-22
 Identities = 35/86 (40%), Positives = 49/86 (56%)

Query: 388 KHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFS 447
           K    + PF  PVD + LG+ DYFDI+K PMDL T+R+K+    Y+   ++ DDV L+F 
Sbjct: 17  KQDPESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFD 76

Query: 448 NCYKYNPPDHNVVTMARQLSAVFEDR 473
           N + YN     V     +L+ VFE  
Sbjct: 77  NAWLYNRKTSRVYKYCTKLAEVFEQE 102



 Score = 75.2 bits (185), Expect = 2e-16
 Identities = 30/99 (30%), Positives = 51/99 (51%)

Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
            Q ++  + K   + P + PF +PVD   L +PDY  ++  PMDL TI+++L+   Y   
Sbjct: 5   RQALMPTLEKLYKQDPESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDP 64

Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKI 223
            + + D   MF N ++YN+    V      L ++F  +I
Sbjct: 65  WQYVDDVWLMFDNAWLYNRKTSRVYKYCTKLAEVFEQEI 103


>gnl|CDD|215921 pfam00439, Bromodomain, Bromodomain.  Bromodomains are 110 amino
           acid long domains, that are found in many chromatin
           associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 84

 Score = 89.4 bits (222), Expect = 1e-21
 Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
           C EI+ +L   +H             E     DY+++IK+PMDL T+R+K+++  YK+  
Sbjct: 1   CLEILEDL--MEHPLAEPFLEPVDPEE---YPDYYEVIKEPMDLSTIRQKLKSGKYKSLA 55

Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQ 465
           EF  DV LIFSN   YN  D ++   A++
Sbjct: 56  EFLKDVELIFSNAITYNGEDSDIYKDAKK 84



 Score = 84.0 bits (208), Expect = 8e-20
 Identities = 31/84 (36%), Positives = 50/84 (59%), Gaps = 2/84 (2%)

Query: 130 KNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQ 189
             +++ + +HP A PF EPVD      PDY++VI +PMDL TI+++L++  Y S  E ++
Sbjct: 2   LEILEDLMEHPLAEPFLEPVD--PEEYPDYYEVIKEPMDLSTIRQKLKSGKYKSLAEFLK 59

Query: 190 DFTTMFTNCYVYNKPGEDVVLMAQ 213
           D   +F+N   YN    D+   A+
Sbjct: 60  DVELIFSNAITYNGEDSDIYKDAK 83


>gnl|CDD|99942 cd05510, Bromo_SPT7_like, Bromodomain; SPT7_like subfamily. SPT7 is
           a yeast protein that functions as a component of the
           transcription regulatory histone acetylation (HAT)
           complexes SAGA, SALSA, and SLIK. SAGA is involved in the
           RNA polymerase II-dependent transcriptional regulation
           of about 10% of all yeast genes. The SPT7 bromodomain
           has been shown to weakly interact with acetylated
           histone H3, but not H4. The human representative of this
           subfamily is cat eye syndrome critical region protein 2
           (CECR2). Bromodomains are 110 amino acid long domains,
           that are found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 112

 Score = 89.4 bits (222), Expect = 2e-21
 Identities = 43/107 (40%), Positives = 60/107 (56%), Gaps = 5/107 (4%)

Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYK 433
           Y+   +++ EL  K +  ++ PF T V   K    DY+DIIKKPMDLGT+ KK++N  YK
Sbjct: 9   YESLDKVLNEL--KTYTEHSTPFLTKV--SKREAPDYYDIIKKPMDLGTMLKKLKNLQYK 64

Query: 434 TAKEFADDVRLIFSNCYKYNP-PDHNVVTMARQLSAVFEDRFAKMPD 479
           +  EF DD+ LI+ NC  YN  P H +   A  +    E     +PD
Sbjct: 65  SKAEFVDDLNLIWKNCLLYNSDPSHPLRRHANFMKKKAEHLLKLIPD 111



 Score = 61.7 bits (150), Expect = 1e-11
 Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 140 PHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCY 199
            H+ PF   V       PDY+ +I +PMDLGT+ K+L+N  Y S  E + D   ++ NC 
Sbjct: 24  EHSTPFLTKVSKREA--PDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCL 81

Query: 200 VYN 202
           +YN
Sbjct: 82  LYN 84


>gnl|CDD|99943 cd05511, Bromo_TFIID, Bromodomain, TFIID-like subfamily. Human
           TAFII250 (or TAF250) is the largest subunit of TFIID, a
           large multi-domain complex, which initiates the assembly
           of the transcription machinery. TAFII250 contains two
           bromodomains that specifically bind to acetylated
           histone H4. Bromodomains are 110 amino acid long
           domains, that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 112

 Score = 89.2 bits (222), Expect = 3e-21
 Identities = 39/103 (37%), Positives = 63/103 (61%), Gaps = 5/103 (4%)

Query: 378 SEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKE 437
            EI+ EL   K+   +WPF+TPV+ +K+   DY+ IIK+PMDL T+RKK+    Y++ +E
Sbjct: 6   DEIVNEL---KNLPDSWPFHTPVNKKKV--PDYYKIIKRPMDLQTIRKKISKHKYQSREE 60

Query: 438 FADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKMPDE 480
           F +D+ LI  N   YN PD      A+++  + E+  A+  ++
Sbjct: 61  FLEDIELIVDNSVLYNGPDSVYTKKAKEMLELAEELLAEREEK 103



 Score = 76.1 bits (188), Expect = 1e-16
 Identities = 31/94 (32%), Positives = 54/94 (57%), Gaps = 2/94 (2%)

Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
           L +I+  ++  +   P +WPFH PV+     +PDY+K+I +PMDL TI+K++  + Y S 
Sbjct: 1   LSFILDEIVNELKNLPDSWPFHTPVNK--KKVPDYYKIIKRPMDLQTIRKKISKHKYQSR 58

Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQL 218
           +E ++D   +  N  +YN P       A+ + +L
Sbjct: 59  EEFLEDIELIVDNSVLYNGPDSVYTKKAKEMLEL 92


>gnl|CDD|99934 cd05502, Bromo_tif1_like, Bromodomain; tif1_like subfamily. Tif1
           (transcription intermediary factor 1) is a member of the
           tripartite motif (TRIM) protein family, which is
           characterized by a particular domain architecture. It
           functions by recruiting coactivators and/or corepressors
           to modulate transcription. Vertebrate Tif1-gamma, also
           labeled E3 ubiquitin-protein ligase TRIM33, plays a role
           in the control of hematopoiesis. Its homologue in
           Xenopus laevis, Ectodermin, has been shown to function
           in germ-layer specification and control of cell growth
           during embryogenesis. Bromodomains are 110 amino acid
           long domains, that are found in many chromatin
           associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 109

 Score = 86.2 bits (214), Expect = 3e-20
 Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 11/111 (9%)

Query: 370 TPLWYKYCSEIIAELF-HKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMR 428
           +P+  + C  ++ EL+ H+       PF+ PV      + +Y+ IIK PMDL  +RKK++
Sbjct: 2   SPIDQRKCERLLLELYCHELSL----PFHEPV---SPSVPNYYKIIKTPMDLSLIRKKLQ 54

Query: 429 NRT---YKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAK 476
            ++   Y + +EF  DVRL+F NCYK+N  D  V    ++L   FE++  +
Sbjct: 55  PKSPQHYSSPEEFVADVRLMFKNCYKFNEEDSEVAQAGKELELFFEEQLKE 105



 Score = 76.6 bits (189), Expect = 7e-17
 Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 6/82 (7%)

Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLE---NNYYWSGKEAIQDFTTMFTNCYV 200
           PFHEPV       P+Y+K+I  PMDL  I+K+L+     +Y S +E + D   MF NCY 
Sbjct: 24  PFHEPVSPSV---PNYYKIIKTPMDLSLIRKKLQPKSPQHYSSPEEFVADVRLMFKNCYK 80

Query: 201 YNKPGEDVVLMAQNLEQLFLTK 222
           +N+   +V    + LE  F  +
Sbjct: 81  FNEEDSEVAQAGKELELFFEEQ 102


>gnl|CDD|99935 cd05503, Bromo_BAZ2A_B_like, Bromodomain, BAZ2A/BAZ2B_like
           subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and
           2B (BAZ2B) were identified as a novel human bromodomain
           gene by cDNA library screening. BAZ2A is also known as
           Tip5 (Transcription termination factor I-interacting
           protein 5) and hWALp3. The proteins may play roles in
           transcriptional regulation. Human Tip5 is part of a
           complex termed NoRC (nucleolar remodeling complex),
           which induces nucleosome sliding and may play a role in
           the regulation of the rDNA locus. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 97

 Score = 83.2 bits (206), Expect = 2e-19
 Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 5/97 (5%)

Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
           C  I+ E+  + H++ AWPF  PV+  KL +  Y  IIKKPMD  T+R+K+ +  YKT +
Sbjct: 5   CETILDEM--EAHED-AWPFLEPVNT-KL-VPGYRKIIKKPMDFSTIREKLESGQYKTLE 59

Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDR 473
           EFA+DVRL+F NC  +N  D  V      +   FE R
Sbjct: 60  EFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKR 96



 Score = 73.2 bits (180), Expect = 8e-16
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 4/87 (4%)

Query: 137 WKHPHAWPFHEPVDAINLNL-PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMF 195
             H  AWPF EPV   N  L P Y K+I +PMD  TI+++LE+  Y + +E  +D   +F
Sbjct: 13  EAHEDAWPFLEPV---NTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVF 69

Query: 196 TNCYVYNKPGEDVVLMAQNLEQLFLTK 222
            NC  +N+   +V     N+ + F  +
Sbjct: 70  DNCETFNEDDSEVGRAGHNMRKFFEKR 96


>gnl|CDD|99928 cd05496, Bromo_WDR9_II, Bromodomain; WDR9 repeat II_like subfamily.
           WDR9 is a human gene located in the Down Syndrome
           critical region-2 of chromosome 21. It encodes for a
           nuclear protein containing WD40 repeats and two
           bromodomains, which may function as a transcriptional
           regulator involved in chromatin remodeling and play a
           role in embryonic development. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 119

 Score = 79.0 bits (195), Expect = 1e-17
 Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 6/106 (5%)

Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
           W K C E++  ++  +      PF  PVD+ K    DY DII  PMDLGTV++ +    Y
Sbjct: 6   WKKQCKELVNLMWDCEDSE---PFRQPVDLLKY--PDYRDIIDTPMDLGTVKETLFGGNY 60

Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHN-VVTMARQLSAVFEDRFAKM 477
               EFA DVRLIFSN   Y P   + + +M  +LSA+FE+   K+
Sbjct: 61  DDPMEFAKDVRLIFSNSKSYTPNKRSRIYSMTLRLSALFEEHIKKI 106



 Score = 55.9 bits (135), Expect = 2e-09
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 130 KNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQ 189
           K ++  +W    + PF +PVD   L  PDY  +I  PMDLGT+K+ L    Y    E  +
Sbjct: 11  KELVNLMWDCEDSEPFRQPVD--LLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPMEFAK 68

Query: 190 DFTTMFTNCYVYN 202
           D   +F+N   Y 
Sbjct: 69  DVRLIFSNSKSYT 81


>gnl|CDD|99950 cd05519, Bromo_SNF2, Bromodomain, SNF2-like subfamily, specific to
           fungi. SNF2 is a yeast protein involved in
           transcriptional activation, it is the catalytic
           component of the SWI/SNF ATP-dependent chromatin
           remodeling complex. The protein is essential for the
           regulation of gene expression (both positive and
           negative) of a large number of genes. The SWI/SNF
           complex changes chromatin structure by altering
           DNA-histone contacts within the nucleosome, which
           results in a re-positioning of the nucleosome and
           facilitates or represses the binding of gene-specific
           transcription factors. Bromodomains are 110 amino acid
           long domains, that are found in many chromatin
           associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 103

 Score = 73.1 bits (180), Expect = 8e-16
 Identities = 26/67 (38%), Positives = 38/67 (56%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           PDY+ +I +P+ L  IK+R+E   Y S +E ++DF  MF N   YN+ G  V   A  +E
Sbjct: 37  PDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYNQEGSIVYEDAVEME 96

Query: 217 QLFLTKI 223
           + F  K 
Sbjct: 97  KAFKKKY 103



 Score = 53.9 bits (130), Expect = 6e-09
 Identities = 18/45 (40%), Positives = 31/45 (68%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
           DY+ IIK+P+ L  +++++  R YK+ +EF +D  L+F+N   YN
Sbjct: 38  DYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYN 82


>gnl|CDD|99937 cd05505, Bromo_WSTF_like, Bromodomain; Williams syndrome
           transcription factor-like subfamily (WSTF-like). The
           Williams-Beuren syndrome deletion transcript 9 is a
           putative transcriptional regulator. WSTF was found to
           play a role in vitamin D-mediated transcription as part
           of two chromatin remodeling complexes, WINAC and WICH.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 97

 Score = 70.3 bits (172), Expect = 1e-14
 Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 5/89 (5%)

Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
           C EI++++   +    +WPF  PV  ++    DY  +I  PMDL T++ K    +Y + +
Sbjct: 5   CEEILSKILKYRF---SWPFREPVTADEA--EDYKKVITNPMDLQTMQTKCSCGSYSSVQ 59

Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQ 465
           EF DD++L+FSN  KY      V++  R+
Sbjct: 60  EFLDDMKLVFSNAEKYYENGSYVLSCMRK 88



 Score = 61.0 bits (148), Expect = 1e-11
 Identities = 30/82 (36%), Positives = 42/82 (51%), Gaps = 2/82 (2%)

Query: 138 KHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
           K+  +WPF EPV A      DY KVIT PMDL T++ +     Y S +E + D   +F+N
Sbjct: 14  KYRFSWPFREPVTADEA--EDYKKVITNPMDLQTMQTKCSCGSYSSVQEFLDDMKLVFSN 71

Query: 198 CYVYNKPGEDVVLMAQNLEQLF 219
              Y + G  V+   +  EQ  
Sbjct: 72  AEKYYENGSYVLSCMRKTEQCC 93


>gnl|CDD|227408 COG5076, COG5076, Transcription factor involved in chromatin
           remodeling, contains bromodomain [Chromatin structure
           and dynamics / Transcription].
          Length = 371

 Score = 75.6 bits (186), Expect = 2e-14
 Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 2/112 (1%)

Query: 369 KTPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMR 428
                 K  ++   +LF +  +  +  F       K    DY++IIK PMDL T++KK++
Sbjct: 142 LLYADNKAIAKFKKQLFLRDGRFLSSIF--LGLPSKREYPDYYEIIKSPMDLLTIQKKLK 199

Query: 429 NRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKMPDE 480
           N  YK+ +EF  D+ L+F NC  YN PD +V   A++L   F     ++P+E
Sbjct: 200 NGRYKSFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIPEE 251



 Score = 74.5 bits (183), Expect = 4e-14
 Identities = 49/191 (25%), Positives = 74/191 (38%), Gaps = 14/191 (7%)

Query: 123 NQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYW 182
           N+     K  +        +  F           PDY+++I  PMDL TI+K+L+N  Y 
Sbjct: 147 NKAIAKFKKQLFLRDGRFLSSIFLGLPSKREY--PDYYEIIKSPMDLLTIQKKLKNGRYK 204

Query: 183 SGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGMPSEEVVLDAPQPRSSK 242
           S +E + D   MF NC +YN P   V + A+ LE+ FL  I  +P EE++  + +P   +
Sbjct: 205 SFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIP-EEMLELSIKPGREE 263

Query: 243 KKP-----------PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTT 291
           ++             V A P L PV    V    K          +             T
Sbjct: 264 REERESVLITNSQAHVGAWPFLRPVSDEEVPDYYKDIRDPMDLSTKELKLRNNYYRPEET 323

Query: 292 APKVNHLNSMN 302
             +   L   N
Sbjct: 324 FVRDAKLFFDN 334



 Score = 32.9 bits (75), Expect = 0.70
 Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 3/88 (3%)

Query: 399 PVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHN 458
           P   E+ G  + F    K M    +   + +R   +     DD+         Y+P + N
Sbjct: 37  PNAPEEEGSKNLFQKQLKRMPKEYITSIVDDREPGSMANVNDDLE--NVGGITYSPFEKN 94

Query: 459 VVTMARQLSAVFEDRFAKMPDESNLASR 486
                R    VF    +  P ES L S 
Sbjct: 95  RPESLRFDEIVFLAIESVTP-ESGLGSL 121



 Score = 30.2 bits (68), Expect = 4.7
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 378 SEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKE 437
                 + + +    AWPF  PV  E++   DY+  I+ PMDL T   K+RN  Y+  + 
Sbjct: 266 ERESVLITNSQAHVGAWPFLRPVSDEEV--PDYYKDIRDPMDLSTKELKLRNNYYRPEET 323

Query: 438 FADDVRLIFSNCYKYN 453
           F  D +L F NC  YN
Sbjct: 324 FVRDAKLFFDNCVMYN 339


>gnl|CDD|99939 cd05507, Bromo_brd8_like, Bromodomain, brd8_like subgroup. In
           mammals, brd8 (bromodomain containing 8) interacts with
           the thyroid hormone receptor in a ligand-dependent
           fashion and enhances thyroid hormone-dependent
           activation from thyroid response elements. Brd8 is
           thought to be a nuclear receptor coactivator.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 104

 Score = 67.4 bits (165), Expect = 9e-14
 Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 9/88 (10%)

Query: 130 KNVMKAVWK----HPHAWPFHEPV-DAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
           K ++  V++    H +A  F +PV + I    P YH V+ +PMDL TIKK +EN    S 
Sbjct: 6   KAIL-LVYRTLASHRYASVFLKPVTEDIA---PGYHSVVYRPMDLSTIKKNIENGTIRST 61

Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMA 212
            E  +D   MF N  +YN    DV LMA
Sbjct: 62  AEFQRDVLLMFQNAIMYNSSDHDVYLMA 89



 Score = 61.2 bits (149), Expect = 1e-11
 Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 392 YAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYK 451
           YA  F  PV  +      Y  ++ +PMDL T++K + N T ++  EF  DV L+F N   
Sbjct: 20  YASVFLKPVTED--IAPGYHSVVYRPMDLSTIKKNIENGTIRSTAEFQRDVLLMFQNAIM 77

Query: 452 YNPPDHNVVTMARQL 466
           YN  DH+V  MA ++
Sbjct: 78  YNSSDHDVYLMAVEM 92


>gnl|CDD|99957 cd05528, Bromo_AAA, Bromodomain; sub-family co-occurring with AAA
           domains. Bromodomains are 110 amino acid long domains,
           that are found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine. The structure(2DKW) in this alignment is an
           uncharacterized protein predicted from analysis of cDNA
           clones from human fetal liver.
          Length = 112

 Score = 67.8 bits (166), Expect = 1e-13
 Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 15/104 (14%)

Query: 395 PFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNP 454
            F  PVD E++   DY++IIK+PMDL T+ +K+    Y TAK+F  D+ LI +N  +YNP
Sbjct: 23  AFTKPVDEEEV--PDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYNP 80

Query: 455 PDHNVVTMARQLSAVFEDRFAKMPDESNLASRAAASVSSDDDSE 498
                  + R  +               L     A + ++ D  
Sbjct: 81  DRDPADKLIRSRA-------------CELRDEVHAMIEAELDPL 111



 Score = 53.5 bits (129), Expect = 9e-09
 Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 6/92 (6%)

Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYN- 202
            F +PVD     +PDY+++I QPMDL TI ++L+ + Y + K+ ++D   + TN   YN 
Sbjct: 23  AFTKPVDE--EEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYNP 80

Query: 203 ---KPGEDVVLMAQNLEQLFLTKITGMPSEEV 231
                 + +   A  L       I        
Sbjct: 81  DRDPADKLIRSRACELRDEVHAMIEAELDPLF 112


>gnl|CDD|99944 cd05512, Bromo_brd1_like, Bromodomain; brd1_like subfamily. BRD1 is
           a mammalian gene which encodes for a nuclear protein
           assumed to be a transcriptional regulator. BRD1 has been
           implicated with brain development and susceptibility to
           schizophrenia and bipolar affective disorder.
           Bromodomains are 110 amino acid long domains that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 98

 Score = 65.1 bits (159), Expect = 5e-13
 Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 2/61 (3%)

Query: 396 FYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPP 455
           F  PVD+ ++   DY D IK+PMD  T+RKK+ ++ Y+T ++F  D  LI +NC  YN  
Sbjct: 22  FSEPVDLSEV--PDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYNAK 79

Query: 456 D 456
           D
Sbjct: 80  D 80



 Score = 57.4 bits (139), Expect = 3e-10
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 145 FHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYN 202
           F EPVD     +PDY   I QPMD  T++K+LE+  Y + ++   DF  +  NC  YN
Sbjct: 22  FSEPVDL--SEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYN 77


>gnl|CDD|99958 cd05529, Bromo_WDR9_I_like, Bromodomain; WDR9 repeat I_like
           subfamily. WDR9 is a human gene located in the Down
           Syndrome critical region-2 of chromosome 21. It encodes
           for a nuclear protein containing WD40 repeats and two
           bromodomains, which may function as a transcriptional
           regulator involved in chromatin remodeling and play a
           role in embryonic development. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 128

 Score = 65.1 bits (159), Expect = 1e-12
 Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNK 203
            F  PVD +    PDY   +  PMDL TI+ RLEN YY S +    D   + +N   +N+
Sbjct: 47  YFEYPVD-LRAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRHDVRLILSNAETFNE 105

Query: 204 PGEDVVLMAQNLEQLFL 220
           P  ++   A+ L    L
Sbjct: 106 PNSEIAKKAKRLSDWLL 122



 Score = 63.9 bits (156), Expect = 3e-12
 Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 381 IAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFAD 440
           + +L        A  F  PVD+ +    DY++ +  PMDL T+R ++ NR Y++ +    
Sbjct: 33  LDKLLLSLQLEIAEYFEYPVDL-RAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRH 91

Query: 441 DVRLIFSNCYKYNPPDHNVVTMARQLSAVFED 472
           DVRLI SN   +N P+  +   A++LS     
Sbjct: 92  DVRLILSNAETFNEPNSEIAKKAKRLSDWLLR 123


>gnl|CDD|99945 cd05513, Bromo_brd7_like, Bromodomain, brd7_like subgroup. The BRD7
           gene encodes a nuclear protein that has been shown to
           inhibit cell growth and the progression of the cell
           cycle by regulating cell-cycle genes at the
           transcriptional level. BRD7 has been identified as a
           gene involved in nasopharyngeal carcinoma. The protein
           interacts with acetylated histone H3 via its
           bromodomain. Bromodomains are 110 amino acid long
           domains that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 98

 Score = 61.7 bits (150), Expect = 1e-11
 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 8/82 (9%)

Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPV-DAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
           L+ +++ + +   K PH + F  PV D I    P Y  +I  PMD  T+K++++NN Y S
Sbjct: 6   LEQLIRQLQR---KDPHGF-FAFPVTDFI---APGYSSIIKHPMDFSTMKEKIKNNDYQS 58

Query: 184 GKEAIQDFTTMFTNCYVYNKPG 205
            +E   DF  M  N   YNKP 
Sbjct: 59  IEEFKDDFKLMCENAMKYNKPD 80



 Score = 59.3 bits (144), Expect = 5e-11
 Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 8/85 (9%)

Query: 374 YKYCSEIIAELFHKK--HQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRT 431
            +   E +     +K  H  +A+P      V       Y  IIK PMD  T+++K++N  
Sbjct: 2   LQKALEQLIRQLQRKDPHGFFAFP------VTDFIAPGYSSIIKHPMDFSTMKEKIKNND 55

Query: 432 YKTAKEFADDVRLIFSNCYKYNPPD 456
           Y++ +EF DD +L+  N  KYN PD
Sbjct: 56  YQSIEEFKDDFKLMCENAMKYNKPD 80


>gnl|CDD|99946 cd05515, Bromo_polybromo_V, Bromodomain, polybromo repeat V.
           Polybromo is a nuclear protein of unknown function,
           which contains 6 bromodomains. The human ortholog BAF180
           is part of a SWI/SNF chromatin-remodeling complex, and
           it may carry out the functions of Yeast Rsc-1 and Rsc-2.
           It was shown that polybromo bromodomains bind to histone
           H3 at specific acetyl-lysine positions. Bromodomains are
           found in many chromatin-associated proteins and in
           nuclear histone acetyltransferases. They interact
           specifically with acetylated lysine, but not all the
           bromodomains in polybromo may bind to acetyl-lysine.
          Length = 105

 Score = 61.2 bits (149), Expect = 2e-11
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPD 456
           DY+D+IKKP+D+  +R K+    Y++  +   D  L+F N  KYN PD
Sbjct: 38  DYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPD 85



 Score = 60.4 bits (147), Expect = 3e-11
 Identities = 23/70 (32%), Positives = 37/70 (52%)

Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
            PDY+ VI +P+D+  I+ ++E N Y S  + + DF  MF N   YN+P   +   A  L
Sbjct: 36  YPDYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQIYKDALTL 95

Query: 216 EQLFLTKITG 225
           +++ L     
Sbjct: 96  QKVLLETKRE 105


>gnl|CDD|99949 cd05518, Bromo_polybromo_IV, Bromodomain, polybromo repeat IV.
           Polybromo is a nuclear protein of unknown function,
           which contains 6 bromodomains. The human ortholog BAF180
           is part of a SWI/SNF chromatin-remodeling complex, and
           it may carry out the functions of Yeast Rsc-1 and Rsc-2.
           It was shown that polybromo bromodomains bind to histone
           H3 at specific acetyl-lysine positions. Bromodomains are
           found in many chromatin-associated proteins and in
           nuclear histone acetyltransferases. They interact
           specifically with acetylated lysine, but not all the
           bromodomains in polybromo may bind to acetyl-lysine.
          Length = 103

 Score = 59.8 bits (145), Expect = 5e-11
 Identities = 26/66 (39%), Positives = 39/66 (59%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           PDY+K+I +P+DL TI+  + N+ Y + +E + DF  MF N   YN+ G  V   A  LE
Sbjct: 37  PDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILE 96

Query: 217 QLFLTK 222
           ++   K
Sbjct: 97  KVLKEK 102



 Score = 54.8 bits (132), Expect = 3e-09
 Identities = 24/66 (36%), Positives = 35/66 (53%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
           DY+ II +P+DL T+   +RN  Y T +E  DD +L+F N   YN     V   A  L  
Sbjct: 38  DYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILEK 97

Query: 469 VFEDRF 474
           V +++ 
Sbjct: 98  VLKEKR 103


>gnl|CDD|99940 cd05508, Bromo_RACK7, Bromodomain, RACK7_like subfamily. RACK7
           (also called human protein kinase C-binding protein) was
           identified as a potential tumor suppressor genes, it
           shares domain architecture with BS69/ZMYND11; both have
           been implicated in the regulation of cellular
           proliferation. Bromodomains are 110 amino acid long
           domains, that are found in many chromatin associated
           proteins. Bromodomains can interact specifically with
           acetylated lysine.
          Length = 99

 Score = 58.9 bits (143), Expect = 9e-11
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 390 QNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNC 449
           Q  A PF  PVD+E+    DY   + KPMDL T+ K +R + Y +   F  D + I  N 
Sbjct: 17  QPGAEPFLKPVDLEQF--PDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNA 74

Query: 450 YKYNPPDHNVVTMARQL 466
             YN  DH +   A+ +
Sbjct: 75  IIYNGGDHKLTQAAKAI 91



 Score = 44.3 bits (105), Expect = 1e-05
 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 122 TNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYY 181
            +QL  ++K  ++ + K P A PF +PVD      PDY + + +PMDL T++K +    Y
Sbjct: 1   VDQLSKLLKFALERM-KQPGAEPFLKPVDL--EQFPDYAQYVFKPMDLSTLEKNVRKKAY 57

Query: 182 WSGKEAIQDFTTMFTNCYVYN 202
            S    + D   +  N  +YN
Sbjct: 58  GSTDAFLADAKWILHNAIIYN 78


>gnl|CDD|99933 cd05501, Bromo_SP100C_like, Bromodomain, SP100C_like subfamily. The
           SP100C protein is a splice variant of SP100, a major
           component of PML-SP100 nuclear bodies (NBs), which are
           poorly understood. It is covalently modified by SUMO-1
           and may play a role in processes at the chromatin level.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 102

 Score = 57.1 bits (138), Expect = 5e-10
 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
           DY   IK+PM L  V++++  R Y T + F  D+RLIF N   +   D +   +   L  
Sbjct: 32  DYCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYKDD-DFGQVGITLEK 90

Query: 469 VFEDRF 474
            FE  F
Sbjct: 91  KFEKNF 96



 Score = 42.8 bits (101), Expect = 4e-05
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 158 DYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQ 217
           DY + I +PM L  +K+RL    Y + +  ++D   +F N  ++ K  +D   +   LE+
Sbjct: 32  DYCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYK-DDDFGQVGITLEK 90

Query: 218 LF 219
            F
Sbjct: 91  KF 92


>gnl|CDD|99951 cd05520, Bromo_polybromo_III, Bromodomain, polybromo repeat III.
           Polybromo is a nuclear protein of unknown function,
           which contains 6 bromodomains. The human ortholog BAF180
           is part of a SWI/SNF chromatin-remodeling complex, and
           it may carry out the functions of Yeast Rsc-1 and Rsc-2.
           It was shown that polybromo bromodomains bind to histone
           H3 at specific acetyl-lysine positions. Bromodomains are
           found in many chromatin-associated proteins and in
           nuclear histone acetyltransferases. They interact
           specifically with acetylated lysine, but not all the
           bromodomains in polybromo may bind to acetyl-lysine.
          Length = 103

 Score = 54.7 bits (132), Expect = 3e-09
 Identities = 21/61 (34%), Positives = 35/61 (57%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
           DY+  IK P+ L  +R K++N  Y+T +E   D+ L+F N  +YN P+  +   A +L  
Sbjct: 38  DYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQK 97

Query: 469 V 469
           +
Sbjct: 98  L 98



 Score = 46.9 bits (112), Expect = 2e-06
 Identities = 22/66 (33%), Positives = 34/66 (51%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           PDY++ I  P+ L  I+ +L+N  Y + +E   D   MF N   YN P   +   A+ L+
Sbjct: 37  PDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQ 96

Query: 217 QLFLTK 222
           +L   K
Sbjct: 97  KLMQAK 102


>gnl|CDD|99953 cd05522, Bromo_Rsc1_2_II, Bromodomain, repeat II in Rsc1/2_like
           subfamily, specific to fungi. Rsc1 and Rsc2 are
           components of the RSC complex (remodeling the structure
           of chromatin), are essential for transcriptional
           control, and have a specific domain architecture
           including two bromodomains. The RSC complex has also
           been linked to homologous recombination and
           nonhomologous end-joining repair of DNA double strand
           breaks. Bromodomains are 110 amino acid long domains,
           that are found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 104

 Score = 53.0 bits (128), Expect = 1e-08
 Identities = 19/58 (32%), Positives = 31/58 (53%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQL 466
           +Y+  I  P+ L  ++KK++ R YK+  +F +D+ L+F N   YN  D      A  L
Sbjct: 39  EYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLL 96



 Score = 52.6 bits (127), Expect = 1e-08
 Identities = 19/63 (30%), Positives = 32/63 (50%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           P+Y++ I+ P+ L  IKK+++   Y S  + + D   MF N  +YN+        A  LE
Sbjct: 38  PEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLLE 97

Query: 217 QLF 219
           +  
Sbjct: 98  KEA 100


>gnl|CDD|99947 cd05516, Bromo_SNF2L2, Bromodomain, SNF2L2-like subfamily, specific
           to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related
           matrix-associated actin-dependent regulator of chromatin
           subfamily A member 2 is a global transcriptional
           activator, which cooperates with nuclear hormone
           receptors to boost transcriptional activation.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 107

 Score = 51.7 bits (124), Expect = 4e-08
 Identities = 18/64 (28%), Positives = 36/64 (56%)

Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
           LP+Y+++I +P+D   IK+R+ N+ Y S ++  +D   +  N   +N  G  +   +  L
Sbjct: 37  LPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNLEGSLIYEDSIVL 96

Query: 216 EQLF 219
           + +F
Sbjct: 97  QSVF 100



 Score = 46.3 bits (110), Expect = 3e-06
 Identities = 14/47 (29%), Positives = 31/47 (65%)

Query: 407 LTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
           L +Y+++I+KP+D   +++++RN  Y++ ++   DV L+  N   +N
Sbjct: 37  LPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFN 83


>gnl|CDD|99952 cd05521, Bromo_Rsc1_2_I, Bromodomain, repeat I in Rsc1/2_like
           subfamily, specific to fungi. Rsc1 and Rsc2 are
           components of the RSC complex (remodeling the structure
           of chromatin), are essential for transcriptional
           control, and have a specific domain architecture
           including two bromodomains. The RSC complex has also
           been linked to homologous recombination and
           nonhomologous end-joining repair of DNA double strand
           breaks. Bromodomains are 110 amino acid long domains,
           that are found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 106

 Score = 49.2 bits (118), Expect = 2e-07
 Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 2/60 (3%)

Query: 404 KLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMA 463
           +    DY+ IIK P+ L TV+K++ +  Y  A+EF +D+  I  N   YN     +   A
Sbjct: 34  RKDYPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYA 91



 Score = 46.5 bits (111), Expect = 2e-06
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)

Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
            PDY+K+I  P+ L T+KKRL +  Y + +E + D   +  N  +YN  G  +   A  L
Sbjct: 37  YPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYALIL 94

Query: 216 EQLFLTKI 223
           E+     I
Sbjct: 95  EKYINDVI 102


>gnl|CDD|99954 cd05524, Bromo_polybromo_I, Bromodomain, polybromo repeat I.
           Polybromo is a nuclear protein of unknown function,
           which contains 6 bromodomains. The human ortholog BAF180
           is part of a SWI/SNF chromatin-remodeling complex, and
           it may carry out the functions of Yeast Rsc-1 and Rsc-2.
           It was shown that polybromo bromodomains bind to histone
           H3 at specific acetyl-lysine positions. Bromodomains are
           found in many chromatin-associated proteins and in
           nuclear histone acetyltransferases. They interact
           specifically with acetylated lysine, but not all the
           bromodomains in polybromo may bind to acetyl-lysine.
          Length = 113

 Score = 46.2 bits (110), Expect = 3e-06
 Identities = 12/48 (25%), Positives = 26/48 (54%)

Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPD 456
           +Y++++  P+DL  +++K++   Y    +   D  L+ +N   Y  PD
Sbjct: 40  EYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPD 87



 Score = 41.6 bits (98), Expect = 1e-04
 Identities = 20/64 (31%), Positives = 33/64 (51%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           P+Y++V++ P+DL  I+++L+   Y    +   DF  +  N   Y KP       A  L 
Sbjct: 39  PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPDSPEHKDACKLW 98

Query: 217 QLFL 220
           +LFL
Sbjct: 99  ELFL 102


>gnl|CDD|99948 cd05517, Bromo_polybromo_II, Bromodomain, polybromo repeat II.
           Polybromo is a nuclear protein of unknown function,
           which contains 6 bromodomains. The human ortholog BAF180
           is part of a SWI/SNF chromatin-remodeling complex, and
           it may carry out the functions of Yeast Rsc-1 and Rsc-2.
           It was shown that polybromo bromodomains bind to histone
           H3 at specific acetyl-lysine positions. Bromodomains are
           found in many chromatin-associated proteins and in
           nuclear histone acetyltransferases. They interact
           specifically with acetylated lysine, but not all the
           bromodomains in polybromo may bind to acetyl-lysine.
          Length = 103

 Score = 45.9 bits (109), Expect = 4e-06
 Identities = 24/94 (25%), Positives = 46/94 (48%), Gaps = 13/94 (13%)

Query: 380 IIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFA 439
           +I+ELF K      +P             DY+ +IK+P+DL T+ +++++  YK+ ++  
Sbjct: 22  LISELFQKLPSKVLYP-------------DYYAVIKEPIDLKTIAQRIQSGYYKSIEDME 68

Query: 440 DDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDR 473
            D+ L+  N   +N P   V   A  +  +F  +
Sbjct: 69  KDLDLMVKNAKTFNEPGSQVYKDANAIKKIFTAK 102



 Score = 41.7 bits (98), Expect = 1e-04
 Identities = 23/63 (36%), Positives = 39/63 (61%)

Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
           PDY+ VI +P+DL TI +R+++ YY S ++  +D   M  N   +N+PG  V   A  ++
Sbjct: 37  PDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANAIK 96

Query: 217 QLF 219
           ++F
Sbjct: 97  KIF 99


>gnl|CDD|99926 cd05494, Bromodomain_1, Bromodomain; uncharacterized subfamily.
           Bromodomains are found in many chromatin-associated
           proteins and in nuclear histone acetyltransferases. They
           interact specifically with acetylated lysine.
          Length = 114

 Score = 45.1 bits (107), Expect = 9e-06
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRK---KMRNR 430
           Y+    ++ EL   +    AWPF  PV+  + G  DY D+IK+PM  GT      +   R
Sbjct: 2   YEALERVLRELKRHRRNEDAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVNNIVETGAR 61

Query: 431 TYKTA 435
             +  
Sbjct: 62  DLEDL 66



 Score = 44.0 bits (104), Expect = 2e-05
 Identities = 18/51 (35%), Positives = 24/51 (47%)

Query: 127 YIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLE 177
             V   +K   ++  AWPF EPV+      PDY  VI +PM  GT    + 
Sbjct: 6   ERVLRELKRHRRNEDAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVNNIV 56


>gnl|CDD|99955 cd05525, Bromo_ASH1, Bromodomain; ASH1_like sub-family. ASH1
           (absent, small, or homeotic 1) is a member of the
           trithorax-group in Drosophila melanogaster, an
           epigenetic transcriptional regulator of HOX genes.
           Drosophila ASH1 has been shown to methylate specific
           lysines in histones H3 and H4. Mammalian ASH1 has been
           shown to methylate histone H3. Bromodomains are 110
           amino acid long domains, that are found in many
           chromatin associated proteins. Bromodomains can interact
           specifically with acetylated lysine.
          Length = 106

 Score = 43.9 bits (104), Expect = 2e-05
 Identities = 18/50 (36%), Positives = 27/50 (54%)

Query: 404 KLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
           K    DY++ I  P+DL T+ K++    YKT + F  D+  +F N  KY 
Sbjct: 35  KKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYY 84



 Score = 36.6 bits (85), Expect = 0.006
 Identities = 18/70 (25%), Positives = 30/70 (42%)

Query: 152 INLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLM 211
                PDY++ IT P+DL TI+K++   YY + +    D   +F N   Y      +   
Sbjct: 34  SKKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYYGRKSPIGRD 93

Query: 212 AQNLEQLFLT 221
              L + +  
Sbjct: 94  VCRLRKAYYQ 103


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 47.0 bits (112), Expect = 4e-05
 Identities = 61/382 (15%), Positives = 125/382 (32%), Gaps = 40/382 (10%)

Query: 401  DVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVV 460
            +++KLG   + DIIKK  +  T  ++        A +  D+  L  +  Y Y      ++
Sbjct: 1039 ELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDY------LL 1092

Query: 461  TMARQLSAVFEDRFAKMPDESNLASRAAASVSSDDDSE---DERQNQLKYLQEQLKSLTD 517
            +M   + ++ +++  K+  E     +    + +    +   ++     + L+EQ +   +
Sbjct: 1093 SMP--IWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEE--VE 1148

Query: 518  QIRLLVEDSTKPKKKKK--KNRDQPKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKS 575
            +  +  E   K K K K  K R     K    +  +  +          +   +  + + 
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRK 1208

Query: 576  LNNVRKPQASNPQQAKKPKPNNANTVAAKKQV-RTFDSEDEDVAKPMSYDEKRQLSLDIN 634
            L++    + SN   + +       T   K  V R    ++         DE     L   
Sbjct: 1209 LDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKE 1268

Query: 635  KLPGDKLGKVVHIIQSREPSLREPNPDEIEIDFETLKPSTLRELEQYVSSCLRKRTYKKT 694
              P +   K V  +Q   P   +    E     +   P+              K+  KK 
Sbjct: 1269 GKPKNAP-KRVSAVQYSPPPPSKRPDGESNGGSKPSSPT--------------KKKVKKR 1313

Query: 695  PKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSD 754
             +       +KK   +K  +   S       K +  + + S +           S SSS+
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKS-------KTRVKQASASQSSRLLRRPRKKKSDSSSE 1366

Query: 755  SD--SSGQSTSSSSSESSDSEG 774
             D  S    +     E  + + 
Sbjct: 1367 DDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 41.8 bits (98), Expect = 0.002
 Identities = 31/105 (29%), Positives = 43/105 (40%), Gaps = 5/105 (4%)

Query: 680 QYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAG- 738
           Q+VS   + +                   LE  L+  +S+     +  +KP    +    
Sbjct: 327 QHVSPGTQNQKQYDPSSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSA 386

Query: 739 ----PTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEKGPH 779
               P  A+  S SSSSSS S+SS  S S S S SSDSE  + P 
Sbjct: 387 PSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPPR 431


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 38.1 bits (89), Expect = 0.013
 Identities = 16/78 (20%), Positives = 25/78 (32%), Gaps = 1/78 (1%)

Query: 15  NQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPS 74
           +QN  G   P+               +   ++P  S T       +PA     ++  APS
Sbjct: 155 SQNS-GQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS 213

Query: 75  HHNESNSGNSTKAASVEP 92
             N   +     AA   P
Sbjct: 214 QANVDTAATPAPAAPATP 231



 Score = 35.4 bits (82), Expect = 0.094
 Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 7/79 (8%)

Query: 27  AEQDVKVNNNNTSDS----TSSSSPPPSSTEVTPVKKSPAPPSSSS---SAPAPSHHNES 79
           A+Q     + N+  S    TS+++ P ++        +    S +    +APAP+   + 
Sbjct: 147 ADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206

Query: 80  NSGNSTKAASVEPPPRDEP 98
           N+  +   A+V+      P
Sbjct: 207 NAVVAPSQANVDTAATPAP 225



 Score = 34.2 bits (79), Expect = 0.21
 Identities = 20/102 (19%), Positives = 34/102 (33%), Gaps = 7/102 (6%)

Query: 2   SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
           SA  ++ +   V     T             V+   T+  T + +  P+   V P + + 
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPA-VDPQQNAV 209

Query: 62  APPSSSS------SAPAPSHHNESNSGNSTKAASVEPPPRDE 97
             PS ++       APA     +  +   T  A V  P  D 
Sbjct: 210 VAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251


>gnl|CDD|99924 cd05492, Bromo_ZMYND11, Bromodomain; ZMYND11_like sub-family.
           ZMYND11 or BS69 is a ubiquitously expressed nuclear
           protein that has been shown to associate with chromatin.
           It interacts with chromatin remodeling factors and might
           play a role in chromatin remodeling and gene expression.
           Bromodomains are 110 amino acid long domains, that are
           found in many chromatin associated proteins.
           Bromodomains can interact specifically with acetylated
           lysine.
          Length = 109

 Score = 35.4 bits (82), Expect = 0.021
 Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 8/57 (14%)

Query: 133 MKAVWKHPHAW-PFHEPVDAINLN-------LPDYHKVITQPMDLGTIKKRLENNYY 181
           +K +     +W P      AI LN       LP   ++I   +D+  I++++ +  Y
Sbjct: 5   LKFIVSRMKSWLPPDTTNRAIVLNKRGKATKLPKRRRLIHTHLDVADIQEKINSEKY 61



 Score = 30.8 bits (70), Expect = 0.86
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 414 IKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSN 448
           I   +D+  +++K+ +  Y + +EF  D  L+  N
Sbjct: 43  IHTHLDVADIQEKINSEKYTSLEEFKADALLLLHN 77


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 37.2 bits (87), Expect = 0.043
 Identities = 20/98 (20%), Positives = 36/98 (36%), Gaps = 8/98 (8%)

Query: 38  TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
            + + ++++   S+    P  K+PA P+ +  A A        +  +  AA+   PP   
Sbjct: 44  PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAA 103

Query: 98  PRLEPVDGIVQPPVVPPKHRPGRNTNQLQYIVKNVMKA 135
               P    V+  V P +            + KN M A
Sbjct: 104 AAAAPAAAAVEDEVTPLRGAAAA-------VAKN-MDA 133



 Score = 30.6 bits (70), Expect = 4.5
 Identities = 10/73 (13%), Positives = 19/73 (26%)

Query: 1   MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
                T   T                              + ++ + PP          +
Sbjct: 36  YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95

Query: 61  PAPPSSSSSAPAP 73
           PA P ++++A AP
Sbjct: 96  PAAPPAAAAAAAP 108


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 36.2 bits (84), Expect = 0.047
 Identities = 18/115 (15%), Positives = 28/115 (24%)

Query: 10  TDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSS 69
            +   + +            +     + +     SSSP   S    P   S    SS   
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214

Query: 70  APAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
           AP+    +       +      PP    P   PV  +       P       T  
Sbjct: 215 APSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPA 269



 Score = 30.4 bits (69), Expect = 3.2
 Identities = 20/111 (18%), Positives = 33/111 (29%), Gaps = 2/111 (1%)

Query: 1   MSAAETETATDPVVNQNGTGDPDPV--KAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVK 58
            ++  +    D         DP               + +S S SS  P PSS +     
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPP 225

Query: 59  KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQP 109
            SP  P++ S  P P+             ++ +P P          G +  
Sbjct: 226 PSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 36.5 bits (85), Expect = 0.053
 Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 2/83 (2%)

Query: 35  NNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
                 +  +S+P P++ +  P+   P  P   + APA      + +  +  AA+  P P
Sbjct: 719 AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778

Query: 95  RDEPRLEPVDGIVQPPVVPPKHR 117
             E      D    P +     R
Sbjct: 779 PSEEEEMAEDDA--PSMDDEDRR 799



 Score = 33.4 bits (77), Expect = 0.55
 Identities = 23/121 (19%), Positives = 35/121 (28%), Gaps = 5/121 (4%)

Query: 3   AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSP---PPSSTEVTPVKK 59
           A E       V + +  GD  P KA               + ++P    P+     P   
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708

Query: 60  SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPG 119
            PA  +   +A  P     +++ +      V  PP  EP   P          PP     
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP--EPDDPPDPAGAPAQPPPPPAPAP 766

Query: 120 R 120
            
Sbjct: 767 A 767



 Score = 30.0 bits (68), Expect = 6.1
 Identities = 13/79 (16%), Positives = 23/79 (29%), Gaps = 3/79 (3%)

Query: 44  SSSPPPSSTEVTPVKKSPAPPSSSSS---APAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
           +++P  +         + A P+ +++   APAP+      S      A   P P      
Sbjct: 403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462

Query: 101 EPVDGIVQPPVVPPKHRPG 119
                        P   P 
Sbjct: 463 SAQPAPAPAAAPEPTAAPA 481


>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin.  NC10
           stands for Non-helical region 10 and is taken from human
           COL15A1. A mutation in this region in human COL18A1 is
           associated with an increased risk of prostrate cancer.
           This domain is cleaved from the precursor and forms
           endostatin. Endostatin is a key tumour suppressor and
           has been used highly successfully to treat cancer. It is
           a potent angiogenesis inhibitor. Endostatin also binds a
           zinc ion near the N-terminus; this is likely to be of
           structural rather than functional importance according
           to.
          Length = 291

 Score = 36.3 bits (84), Expect = 0.055
 Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 9/88 (10%)

Query: 230 EVVLDAPQP---RSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGST 286
           +V+L    P    ++ + PPV  +P  +P    P  P    P   S    R P+P L S 
Sbjct: 46  KVLLGELVPLPGTTATQPPPVVLTPWSDPR--LPDPPHLPDPQTHSATAHRNPHPPLNSP 103

Query: 287 ATTTTAP----KVNHLNSMNAPDTPDMK 310
           A           V HL ++N+P + +M+
Sbjct: 104 ARIAHVHQDFQPVLHLIALNSPLSGNMR 131


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 36.4 bits (84), Expect = 0.065
 Identities = 18/77 (23%), Positives = 26/77 (33%)

Query: 43  SSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
           +   PPPSS    P   S   PS+SS+          N      + +    P       P
Sbjct: 185 AIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPP 244

Query: 103 VDGIVQPPVVPPKHRPG 119
           +  ++Q    PP   P 
Sbjct: 245 IPPVIQYVAPPPVPPPQ 261



 Score = 29.9 bits (67), Expect = 6.4
 Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 10/73 (13%)

Query: 56  PVKKSPAPPSSSSSAPAPSHHNE----SNSGNSTKAASVEPPPRDEPRLEP------VDG 105
           P    P PPSS    P  S        S  GN     S  P        +P      +  
Sbjct: 183 PGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPA 242

Query: 106 IVQPPVVPPKHRP 118
              PPV+     P
Sbjct: 243 PPIPPVIQYVAPP 255


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 36.2 bits (84), Expect = 0.065
 Identities = 18/116 (15%), Positives = 36/116 (31%), Gaps = 13/116 (11%)

Query: 3   AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
            AE +T   P           P  A           + + + ++ P ++        + A
Sbjct: 375 PAEKKTPARPEA-------AAPAAAPVAQ------AAAAPAPAAAPAAAASAPAAPPAAA 421

Query: 63  PPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
           PP+  ++  A +      +  +  A +  PP +  P    +   V P        P
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAP 477


>gnl|CDD|178145 PLN02530, PLN02530, histidine-tRNA ligase.
          Length = 487

 Score = 35.9 bits (83), Expect = 0.095
 Identities = 14/70 (20%), Positives = 22/70 (31%), Gaps = 2/70 (2%)

Query: 41  STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
           S +  S         P+    +   S+SS         + +G      +  PP   E   
Sbjct: 6   SATRVSASAIRRPSLPLSSRCSFLLSASSPRGGRCAASAAAGGGRSGGTTAPPSVQEDGK 65

Query: 101 EPVDGIVQPP 110
             +D  V PP
Sbjct: 66  PKID--VNPP 73


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 35.9 bits (83), Expect = 0.099
 Identities = 26/118 (22%), Positives = 35/118 (29%), Gaps = 7/118 (5%)

Query: 2   SAAETETATDPVVNQNGT-GDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
            A   + A D +    G         A   V             + PPP      P  +S
Sbjct: 25  PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANES 84

Query: 61  PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
            + P+ S S  AP+      S      +S +PPP   P   P      PP   P    
Sbjct: 85  RSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP------PPSPAPDLSE 136



 Score = 35.1 bits (81), Expect = 0.19
 Identities = 20/96 (20%), Positives = 31/96 (32%), Gaps = 9/96 (9%)

Query: 36  NNTSDSTSSSSPPPSS-------TEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAA 88
           +++S    S SP PSS       +       S +   SSSS+ + S  +   +  S   +
Sbjct: 287 SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS 346

Query: 89  SVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
               P    P   P      P   P   R   +   
Sbjct: 347 PSRSPSPSRPP--PPADPSSPRKRPRPSRAPSSPAA 380



 Score = 33.6 bits (77), Expect = 0.53
 Identities = 15/94 (15%), Positives = 21/94 (22%), Gaps = 7/94 (7%)

Query: 36  NNTSDSTSSSSPPPSSTEV-------TPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAA 88
            + +    SS  PP  T              S       S  P P+    +   +    A
Sbjct: 104 GSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163

Query: 89  SVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNT 122
           S     R                  P   P  +T
Sbjct: 164 SDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197



 Score = 33.2 bits (76), Expect = 0.72
 Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 4/64 (6%)

Query: 39  SDSTSSSSPPPSSTEVTPVK---KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
           S STSSSS       V+P     +SP+P      A   S         +  + +     R
Sbjct: 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA-SAGR 384

Query: 96  DEPR 99
              R
Sbjct: 385 PTRR 388



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 15/106 (14%), Positives = 30/106 (28%), Gaps = 7/106 (6%)

Query: 18  GTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHN 77
                + ++           +  +  +S    +S   +  + +    S   +A APS   
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190

Query: 78  ESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTN 123
                ++       PP    PR       +      P   PGR+  
Sbjct: 191 AEPPPST-------PPAAASPRPPRRSSPISASASSPAPAPGRSAA 229



 Score = 31.7 bits (72), Expect = 2.1
 Identities = 20/45 (44%), Positives = 25/45 (55%)

Query: 730 PKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEG 774
           P P+ S+ G   A     +SSSSS S  S  S++SSSSESS    
Sbjct: 296 PSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340



 Score = 31.3 bits (71), Expect = 2.8
 Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 1/121 (0%)

Query: 1   MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
             ++ +E++      +N    P P       ++   +  +  SS   P SS+     +  
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296

Query: 61  PAPPSSSSSAPAPS-HHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPG 119
              PSS  S PAPS     S+S +S +++S       E            P   P     
Sbjct: 297 SPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356

Query: 120 R 120
            
Sbjct: 357 P 357



 Score = 30.9 bits (70), Expect = 3.2
 Identities = 21/84 (25%), Positives = 28/84 (33%), Gaps = 13/84 (15%)

Query: 37  NTSDSTSSSSPPPSSTEVTPVKK--SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
             ++S S+ +   S+       +  SP PP  SS  P P                  PPP
Sbjct: 80  PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP-----------TPPPASPPP 128

Query: 95  RDEPRLEPVDGIVQPPVVPPKHRP 118
              P L  +   V  P  PP   P
Sbjct: 129 SPAPDLSEMLRPVGSPGPPPAASP 152



 Score = 30.1 bits (68), Expect = 5.1
 Identities = 20/105 (19%), Positives = 28/105 (26%), Gaps = 2/105 (1%)

Query: 1   MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPP-PSSTEVTPVKK 59
           + A                G      A +       + S    +S     S T   P   
Sbjct: 55  VVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSP 114

Query: 60  SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVD 104
            P PP+   ++P PS   +  S       S  PPP   P      
Sbjct: 115 DPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGAS 158


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 36.1 bits (83), Expect = 0.100
 Identities = 18/79 (22%), Positives = 22/79 (27%)

Query: 48   PPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIV 107
            PPS +          PP    + PA S   ES +    +      P    P         
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925

Query: 108  QPPVVPPKHRPGRNTNQLQ 126
             P   PP   P R    L 
Sbjct: 2926 PPQPQPPPPPPPRPQPPLA 2944



 Score = 34.5 bits (79), Expect = 0.32
 Identities = 26/108 (24%), Positives = 34/108 (31%), Gaps = 10/108 (9%)

Query: 21   DPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSS---APAP---- 73
            DP    A           + S +   PPP+S + T     P PP  S     + AP    
Sbjct: 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862

Query: 74   SHHNESNSGNSTKAASVEPPPRDEPR---LEPVDGIVQPPVVPPKHRP 118
                 S S  +  AA   PP R   R       +    PP  P +   
Sbjct: 2863 RRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910



 Score = 31.4 bits (71), Expect = 2.7
 Identities = 16/88 (18%), Positives = 29/88 (32%)

Query: 38   TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
            TS +    +PP S+    PV     P   +  +P P   +  +    + + +   P    
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642

Query: 98   PRLEPVDGIVQPPVVPPKHRPGRNTNQL 125
            P   P     +    P +    R   +L
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRL 2670



 Score = 30.3 bits (68), Expect = 4.9
 Identities = 20/70 (28%), Positives = 23/70 (32%), Gaps = 1/70 (1%)

Query: 220  LTKITGMPSEEVVLDAPQPRSSKKKP-PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRP 278
            LT+       E     P P      P  V A  +  P   +P  PL    SA  T  P P
Sbjct: 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842

Query: 279  PNPVLGSTAT 288
            P P   S   
Sbjct: 2843 PGPPPPSLPL 2852


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 34.2 bits (78), Expect = 0.15
 Identities = 11/65 (16%), Positives = 22/65 (33%), Gaps = 4/65 (6%)

Query: 33  VNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEP 92
           V  +      ++ S   ++T   P   +  P  + S   +  H       N+T    V P
Sbjct: 94  VVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTT----VTP 149

Query: 93  PPRDE 97
             + +
Sbjct: 150 TSQPD 154


>gnl|CDD|99925 cd05493, Bromo_ALL-1, Bromodomain, ALL-1 like proteins. ALL-1 is a
           vertebrate homologue of Drosophila trithorax and is
           often affected in chromosomal rearrangements that are
           linked to acute leukemias, such as acute lymphocytic
           leukemia (ALL). Bromodomains are found in many
           chromatin-associated proteins and in nuclear histone
           acetyltransferases. They interact specifically with
           acetylated lysine.
          Length = 131

 Score = 33.2 bits (76), Expect = 0.15
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 416 KPMDLGTVRKKMRNRTYKTAKEFADDVRLIF 446
            P+DL  V KK+    Y +  +F+DD+  I 
Sbjct: 58  PPLDLEAVGKKLEAGFYTSVLDFSDDIVKII 88


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 34.4 bits (79), Expect = 0.20
 Identities = 19/63 (30%), Positives = 23/63 (36%), Gaps = 5/63 (7%)

Query: 235 APQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPK 294
           AP P S   +P  SA    N  +KTP       P A +     P    +  TA    APK
Sbjct: 239 APVPPSEAARPK-SADAETNAALKTPA----TAPKAAAKNAKAPEAQPVSGTAAAEPAPK 293

Query: 295 VNH 297
              
Sbjct: 294 EAA 296


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 34.7 bits (80), Expect = 0.24
 Identities = 16/70 (22%), Positives = 24/70 (34%), Gaps = 1/70 (1%)

Query: 213 QNLEQLFLTKITGMPSEEVVLDAPQPRSSKKKPPVSASPSLNPVIKTP-VIPLNKLPSAT 271
            +         +   S         P SS    P    PS + ++ TP ++PL KL S +
Sbjct: 1   SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPS 60

Query: 272 STPKPRPPNP 281
                 PP  
Sbjct: 61  PPSVTLPPAA 70



 Score = 30.1 bits (68), Expect = 5.7
 Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 8/64 (12%)

Query: 39 SDSTSSSSPPPSSTEVTPVKKSP---APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
          S S+SS S PPSST   P ++ P   +   + S  P P       S  S  + ++ P   
Sbjct: 17 SSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKL-----SSPSPPSVTLPPAAT 71

Query: 96 DEPR 99
           +  
Sbjct: 72 TQTP 75



 Score = 29.7 bits (67), Expect = 6.7
 Identities = 17/96 (17%), Positives = 26/96 (27%), Gaps = 5/96 (5%)

Query: 41  STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
           S+SS+    S +  +    S +  S  SS              S  +    P     P+L
Sbjct: 1   SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRP----RRRKPSASSLLHTPSILPLPKL 56

Query: 101 EPVDGIVQPPVVPPKHRPGRNTNQLQYIVKNVMKAV 136
                           +     N LQ      + AV
Sbjct: 57  SSPSPPSVTLPPAATTQT-PQLNPLQRAAAAALDAV 91


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 32.0 bits (73), Expect = 0.26
 Identities = 17/55 (30%), Positives = 30/55 (54%)

Query: 485 SRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQ 539
           S  +AS +S   S     +Q+K LQ+Q++SLT Q++ L        ++K+K +  
Sbjct: 1   SSVSASSTSSSASSGSSDSQIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQL 55



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 7/68 (10%)

Query: 492 SSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSA 551
           SS+D + +E+Q Q + +Q+Q++ L  QI  L       ++++ +   Q +        S 
Sbjct: 40  SSEDLTAEEKQKQQQLIQQQIQMLQAQIAQL-------QQQQSEEAQQQQQSSIEDSPSQ 92

Query: 552 MMNDHVNK 559
              + VN+
Sbjct: 93  TSAEGVNR 100


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 34.5 bits (80), Expect = 0.26
 Identities = 23/117 (19%), Positives = 37/117 (31%), Gaps = 4/117 (3%)

Query: 2   SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
           + A    A  P         P     +         TS   ++      +   T  KKS 
Sbjct: 383 ATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKS- 441

Query: 62  APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP-RLEPVDGIVQP--PVVPPK 115
            P ++S + P  S      S     +A  + P + E  R +  + +     PV  PK
Sbjct: 442 EPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 34.2 bits (79), Expect = 0.29
 Identities = 18/72 (25%), Positives = 31/72 (43%)

Query: 31  VKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASV 90
           +K+N  + S S S+++  P   + +P   +P PP    +A AP          +T+A+S 
Sbjct: 506 IKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSN 565

Query: 91  EPPPRDEPRLEP 102
            P         P
Sbjct: 566 APAQIPADSSPP 577



 Score = 30.7 bits (70), Expect = 3.5
 Identities = 16/95 (16%), Positives = 32/95 (33%)

Query: 14  VNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAP 73
           +N           A+       +    + +   P P++T   P    P P ++ +S+ AP
Sbjct: 508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAP 567

Query: 74  SHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQ 108
           +     +S          P P  +   E +D   +
Sbjct: 568 AQIPADSSPPPPIPEEPTPSPTKDSSPEEIDKAAK 602



 Score = 30.7 bits (70), Expect = 4.0
 Identities = 18/96 (18%), Positives = 29/96 (30%), Gaps = 8/96 (8%)

Query: 22  PDPVKAEQDVKVNNNNT------SDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSH 75
           P    +E        N       S   +   P    T+      SP P  +S   P P+ 
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPA- 419

Query: 76  HNESNSGNSTKAASVEPPPRD-EPRLEPVDGIVQPP 110
                S      A+  PP  + E   + +   ++ P
Sbjct: 420 EPTEPSPTPPANAANAPPSLNLEELWQQILAKLELP 455



 Score = 29.9 bits (68), Expect = 5.7
 Identities = 17/86 (19%), Positives = 24/86 (27%), Gaps = 4/86 (4%)

Query: 3   AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
            A+T               P P    Q        T      ++   SS     +    +
Sbjct: 520 TAKTPPPPQKSPPPPAPTPPLP----QPTATAPPPTPPPPPPTATQASSNAPAQIPADSS 575

Query: 63  PPSSSSSAPAPSHHNESNSGNSTKAA 88
           PP      P PS   +S+     KAA
Sbjct: 576 PPPPIPEEPTPSPTKDSSPEEIDKAA 601


>gnl|CDD|191582 pfam06679, DUF1180, Protein of unknown function (DUF1180).  This
           family consists of several hypothetical mammalian
           proteins of around 190 residues in length. The function
           of this family is unknown.
          Length = 163

 Score = 33.0 bits (75), Expect = 0.33
 Identities = 18/61 (29%), Positives = 25/61 (40%)

Query: 41  STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
           S +SS P P +    P   +PA P++  S  A      S S     A    P  RD+P  
Sbjct: 35  SAASSQPGPETQPPLPPGPTPANPTALGSGDAAGGGGSSTSDGGALATRPSPLLRDKPTT 94

Query: 101 E 101
           +
Sbjct: 95  Q 95


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 34.1 bits (79), Expect = 0.33
 Identities = 23/125 (18%), Positives = 51/125 (40%), Gaps = 13/125 (10%)

Query: 613 EDEDVAKPMSYDEKRQLSLDINKLPGDKLGKV----VHIIQSREPSLREPNPDEIEIDF- 667
           ++ +  +P          LD++ L G K   V    V   +  E  L     +   I   
Sbjct: 121 QEIERLEPWGN-----FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTD 175

Query: 668 ETLKPS---TLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTN 724
           +         L+EL   V   L+K  +++    ++   +E   E+++ L+++  + +S  
Sbjct: 176 KGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLL 235

Query: 725 KKLKK 729
           ++LK+
Sbjct: 236 EELKE 240


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 33.5 bits (77), Expect = 0.49
 Identities = 42/253 (16%), Positives = 85/253 (33%), Gaps = 35/253 (13%)

Query: 494 DDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSAMM 553
            D +ED    Q K++ E ++   +  R L +  +K KK     +D    K  +   S   
Sbjct: 13  RDLNEDVSAFQRKFVNE-VRRCDEVERKLRKLESKIKKLGIPLKD-TGGKPDV-PPSKEF 69

Query: 554 NDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNPQQAKKPKPNNANTVAAKKQVRTFDSE 613
            D   ++    A +   +  ++L ++ K      +         +      +++    + 
Sbjct: 70  LDLEEEILDLEAEI--KEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNL 127

Query: 614 DEDVAKPMSYDEKRQLSLDINKLPGDKLGKVVHIIQ-SREPSLR---------EPNPDEI 663
           D D       +             G KLG V  +I   +  +                EI
Sbjct: 128 DIDFKYLRGAE-------------GLKLGFVAGVINREKLEAFERELWRACRGYIRQAEI 174

Query: 664 EIDFETLKPST------LRELEQYVSSCLRKRTYKKTPKPKDEK-FAEKKHELEKRLQDV 716
           E   E  K +        +E    V   L    ++    P+ E   +E   ++ KR++++
Sbjct: 175 EEPLEDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEEL 234

Query: 717 TSQIDSTNKKLKK 729
              ++ T   L+K
Sbjct: 235 QRVLEQTESHLEK 247


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.5 bits (76), Expect = 0.51
 Identities = 21/83 (25%), Positives = 26/83 (31%), Gaps = 3/83 (3%)

Query: 38  TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
           +       SP P     T  ++SP PP+ SS  P  SHH               P     
Sbjct: 231 SLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQG--PVFLQH 288

Query: 98  PRLEPVDGI-VQPPVVPPKHRPG 119
           P   P     +    VPP   P 
Sbjct: 289 PSSNPPQPFGLAQSQVPPLPLPS 311



 Score = 30.4 bits (68), Expect = 4.6
 Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 7/121 (5%)

Query: 5   ETETATDPVVNQNGTGDP-----DPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKK 59
           E E++    VN+ G+ DP     D   +   +    +N SDS SS+       +  P  +
Sbjct: 123 EGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQ 182

Query: 60  SP--APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHR 117
            P  A  + S+  P PS       G+   A     P +  P        + P  +P  H 
Sbjct: 183 VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHP 242

Query: 118 P 118
           P
Sbjct: 243 P 243


>gnl|CDD|99923 cd05491, Bromo_TBP7_like, Bromodomain; TBP7_like subfamily, limited
           to fungi. TBP7, or TAT-binding protein homolog 7, is a
           yeast protein of unknown function that contains
           AAA-superfamily ATP-ase domains and a bromodomain.
           Bromodomains are found in many chromatin-associated
           proteins and in nuclear histone acetyltransferases. They
           interact specifically with acetylated lysine.
          Length = 119

 Score = 31.7 bits (72), Expect = 0.52
 Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 14/89 (15%)

Query: 368 SKTPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKP--------MD 419
            +  L ++      A +      N   P+    D +        D I +         MD
Sbjct: 13  QRKYLKFEKPPTEDAFVRPDNENNEERPYEIAKDKQG------VDTILETASGKKFYNMD 66

Query: 420 LGTVRKKMRNRTYKTAKEFADDVRLIFSN 448
           L T+ +++ N  Y T K+F  D++ I  +
Sbjct: 67  LDTIEERLWNGYYATPKDFLKDIKRIVRD 95



 Score = 29.0 bits (65), Expect = 4.4
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 167 MDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
           MDL TI++RL N YY + K+ ++D   +  +
Sbjct: 65  MDLDTIEERLWNGYYATPKDFLKDIKRIVRD 95


>gnl|CDD|240349 PTZ00295, PTZ00295, glucosamine-fructose-6-phosphate
           aminotransferase; Provisional.
          Length = 640

 Score = 33.1 bits (76), Expect = 0.60
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 194 MFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGMPSEEVVLDAPQP 238
            +TN Y+  K GE   L  +N+  L+  +      EEV+  +P+P
Sbjct: 230 KYTNEYISLKDGEIAELSLENVNDLYTQRRVEKIPEEVIEKSPEP 274


>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
          Provisional.
          Length = 170

 Score = 32.0 bits (73), Expect = 0.62
 Identities = 16/55 (29%), Positives = 19/55 (34%), Gaps = 2/55 (3%)

Query: 46 SPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVE--PPPRDEP 98
          S   SS    P   S    SSS  +   S    S    ++  A  E  P  RD P
Sbjct: 1  SSSASSGSSAPAVNSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVP 55


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 30.8 bits (70), Expect = 0.71
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 741 GASRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
           G+S  S +S S S S S+  S+SS+SS    S
Sbjct: 15  GSSNTSVTSGSVSSSGSNSSSSSSNSSNGGSS 46



 Score = 28.4 bits (64), Expect = 3.5
 Identities = 12/41 (29%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 733 TTSAAGPTGA-SRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
           + S++G   + S  ++S+  SS S SSG S+S +   +   
Sbjct: 25  SVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITTESE 65



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 742 ASRLSASSSSSSDSDSSGQSTSSSSSESS 770
           +  +S+S S+SS S S+  +  SSSS SS
Sbjct: 23  SGSVSSSGSNSSSSSSNSSNGGSSSSSSS 51



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 733 TTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESS 770
           T+  +G   +S  ++SSSSS+ S+    S+SSS   SS
Sbjct: 19  TSVTSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSS 56



 Score = 28.4 bits (64), Expect = 4.4
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 740 TGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEG 774
           +G+   S S+SSSS S+SS   +SSSSS    S G
Sbjct: 23  SGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57



 Score = 28.1 bits (63), Expect = 5.9
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 743 SRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
           + +++ S SSS S+SS  S++SS+  SS S
Sbjct: 19  TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS 48


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 32.9 bits (76), Expect = 0.73
 Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 483 LASRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKS 542
           L  +AAAS   D + E   Q +++ L+++LK L  Q    V    K KKK+KK   + K 
Sbjct: 357 LCEQAAASPEYDTELEVLLQ-RVEQLEQELKQLKAQPV-GVAPEQKEKKKEKKKNKKKKY 414

Query: 543 KMPMGQNSAMMND 555
           K+P G+   ++ +
Sbjct: 415 KVPRGKIYKVLKE 427


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.82
 Identities = 26/121 (21%), Positives = 58/121 (47%), Gaps = 10/121 (8%)

Query: 604 KKQVRTFDSEDEDVAKPMSYDEKRQLSLDINKLPG-DKLGKVVHIIQSREPSLREPNPDE 662
           +K+++  + ++  + K     E R+L   + K     KL ++   ++  E  L++ N +E
Sbjct: 465 EKELKEIEEKERKLRK-----ELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEE 519

Query: 663 IEID---FETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQ 719
           +E     +E LK   L +L+  + S  ++    +  K K  +  +K  ELE+ L ++  +
Sbjct: 520 LEKKAEEYEKLK-EKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578

Query: 720 I 720
           +
Sbjct: 579 L 579


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 32.8 bits (75), Expect = 0.86
 Identities = 13/81 (16%), Positives = 21/81 (25%)

Query: 44  SSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPV 103
           S SP  SS    P     A   + +          +   N    A     P      + +
Sbjct: 399 SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKI 458

Query: 104 DGIVQPPVVPPKHRPGRNTNQ 124
                  + P   RP +   +
Sbjct: 459 PVSKVSSLGPSTLRPIQEKAE 479



 Score = 31.7 bits (72), Expect = 1.6
 Identities = 24/85 (28%), Positives = 29/85 (34%), Gaps = 7/85 (8%)

Query: 38  TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
           T     +S        + PV   PA     S+A A S    S S +S  A    P    +
Sbjct: 363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAAS---PSPSQSSAAAQPSAPQSATQ 419

Query: 98  PRLEPVDGIVQP----PVVPPKHRP 118
           P   P    V P    PV PP   P
Sbjct: 420 PAGTPPTVSVDPPAAVPVNPPSTAP 444



 Score = 30.1 bits (68), Expect = 5.5
 Identities = 17/109 (15%), Positives = 36/109 (33%), Gaps = 5/109 (4%)

Query: 212 AQNLEQLFLTKITGMPSEEVVLDAPQPRSSKKKP-PVSASPSLNPVIKTPVIPLNKLPSA 270
            Q+++ +F               +P P  S     P +   +  P    P + ++   + 
Sbjct: 376 KQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAV 435

Query: 271 TSTPKPRPPNPVLGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADG 319
              P    P  V  +         V+ ++S+     P   + I+ KA+ 
Sbjct: 436 PVNPPSTAPQAVRPAQFKEEKKIPVSKVSSL----GPSTLRPIQEKAEQ 480


>gnl|CDD|220337 pfam09669, Phage_pRha, Phage regulatory protein Rha (Phage_pRha).
           Members of this protein family are found in temperate
           phage and bacterial prophage regions. Members include
           the product of the rha gene of the lambdoid phage
           phi-80, a late operon gene. The presence of this gene
           interferes with infection of bacterial strains that lack
           integration host factor (IHF), which regulates the rha
           gene. It is suggested that Rha is a phage regulatory
           protein.
          Length = 88

 Score = 29.8 bits (68), Expect = 1.0
 Identities = 9/14 (64%), Positives = 11/14 (78%)

Query: 378 SEIIAELFHKKHQN 391
           S  IAELF K+H+N
Sbjct: 4   SLEIAELFGKRHKN 17


>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
           C-terminus.  Intimin and its translocated intimin
           receptor (Tir) are bacterial proteins that mediate
           adhesion between mammalian cells and attaching and
           effacing (A/E) pathogens. A unique and essential feature
           of A/E bacterial pathogens is the formation of
           actin-rich pedestals beneath the intimately adherent
           bacteria and localised destruction of the intestinal
           brush border. The bacterial outer membrane adhesin,
           intimin, is necessary for the production of the A/E
           lesion and diarrhoea. The A/E bacteria translocate their
           own receptor for intimin, Tir, into the membrane of
           mammalian cells using the type III secretion system. The
           translocated Tir triggers additional host signalling
           events and actin nucleation, which are essential for
           lesion formation. This family represents the Tir
           C-terminal domain which has been reported to bind
           uninfected host cells and beta-1 integrins although the
           role of intimin binding to integrins is unclear. This
           intimin C-terminal domain has also been shown to be
           sufficient for Tir recognition.
          Length = 222

 Score = 31.4 bits (71), Expect = 1.2
 Identities = 18/62 (29%), Positives = 26/62 (41%)

Query: 710 EKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSES 769
           E+     T+   +      KP   T A G T  S    ++SS   S +S  ST+ S + S
Sbjct: 58  EQTTTTTTTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDTSS 117

Query: 770 SD 771
            D
Sbjct: 118 ID 119


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 32.0 bits (73), Expect = 1.6
 Identities = 9/65 (13%), Positives = 18/65 (27%), Gaps = 2/65 (3%)

Query: 41  STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
             + + P  +    TP   +  PP+++   P  +      +      A   P     P  
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAP--ARPAAARPAPAPAPPAAAAPPAR 436

Query: 101 EPVDG 105
                
Sbjct: 437 SADPA 441


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 31.8 bits (72), Expect = 1.6
 Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 8/66 (12%)

Query: 721 DSTNKKLKKPKPTTSAAGPTGASRLSAS--------SSSSSDSDSSGQSTSSSSSESSDS 772
            STN   +    ++SAA  +  S+ S S        S+SS  S SS   +  SSS S + 
Sbjct: 305 SSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNE 364

Query: 773 EGEKGP 778
             +K  
Sbjct: 365 SKQKRA 370


>gnl|CDD|223544 COG0468, RecA, RecA/RadA recombinase [DNA replication,
           recombination, and repair].
          Length = 279

 Score = 31.5 bits (72), Expect = 1.7
 Identities = 15/73 (20%), Positives = 32/73 (43%), Gaps = 11/73 (15%)

Query: 491 VSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDS-TKPKKKKKKNRDQPKSKMPMGQN 549
           VS  D  E     QL+  ++  +S  ++I LLV DS     + ++           +G  
Sbjct: 118 VSQPDTGEQ----QLEIAEKLARSGAEKIDLLVVDSVAALVRAEEIEDGH------LGLR 167

Query: 550 SAMMNDHVNKMNK 562
           + +++  + K+ +
Sbjct: 168 ARLLSKALRKLTR 180


>gnl|CDD|177486 PHA02752, PHA02752, hypothetical protein; Provisional.
          Length = 242

 Score = 31.2 bits (70), Expect = 1.8
 Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 3/53 (5%)

Query: 37 NTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAAS 89
          N   + + +S PP+ T++ P   + A  +          H   + G  +  AS
Sbjct: 29 NQQPTITGTSAPPAVTDLYP---NAALANGFDGILIDDGHFNCDGGQCSITAS 78


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 29.7 bits (67), Expect = 1.9
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 710 EKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSS 767
           +++ Q  +SQ  +  KK KK +  +S +    +++    SSSSS S SS  S+SS SS
Sbjct: 67  KRKEQQKSSQ--THKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSESS 122


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 31.4 bits (72), Expect = 2.0
 Identities = 20/110 (18%), Positives = 42/110 (38%), Gaps = 21/110 (19%)

Query: 484 ASRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKP----KKKKKKNRDQ 539
           ++ A A+ + D  +E + Q QL  L +Q K L  + +L+ +D  +      K  ++  + 
Sbjct: 24  SAFARAASNGDLPTEADVQAQLDALNKQ-KLLEAEDKLVQQDLEQTLALLDKIDRQKEE- 81

Query: 540 PKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNPQQ 589
                              ++ +APA L   Q    L  ++       ++
Sbjct: 82  -------------TEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRE 116


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 31.4 bits (71), Expect = 2.1
 Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 9/57 (15%)

Query: 46  SPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
           +P P S    P    PA  ++ ++  A           +  A   E P +D+PR E 
Sbjct: 18  APAPPSPAAAPAPPPPAKTAAPATKAAAP---------AAAAPRAEKPKKDKPRRER 65


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 31.4 bits (71), Expect = 2.2
 Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 6/93 (6%)

Query: 19  TGDPDPVKAEQDVKVNNNNTSDSTSSSSP------PPSSTEVTPVKKSPAPPSSSSSAPA 72
           T     +   +++      TS     ++P        +    TPV  + A  + SSS  A
Sbjct: 19  TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETA 78

Query: 73  PSHHNESNSGNSTKAASVEPPPRDEPRLEPVDG 105
            +    S + ++T + +    P      +PV+G
Sbjct: 79  ETSDPTSEATDTTTSEARTVTPAATETSKPVEG 111


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 31.7 bits (71), Expect = 2.2
 Identities = 24/47 (51%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 730  PKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEK 776
            P P  S +  T  S  S+SSSSSS S SS  S SSSS E  D + EK
Sbjct: 1240 PCPDLSESSSTMHS--SSSSSSSSCSSSSSSSDSSSSEEDGDEKNEK 1284


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 31.4 bits (71), Expect = 2.2
 Identities = 14/54 (25%), Positives = 20/54 (37%), Gaps = 3/54 (5%)

Query: 61  PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPP 114
           PAP  + ++APA +    +       AA+   P R  P   P          PP
Sbjct: 448 PAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA---PADDDPPP 498



 Score = 30.6 bits (69), Expect = 4.0
 Identities = 19/112 (16%), Positives = 33/112 (29%), Gaps = 1/112 (0%)

Query: 3   AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
           A     A   V        P P  A    +  +        + +P P++      + + A
Sbjct: 409 APAAAAAARAVAAAPARRSPAPE-ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAA 467

Query: 63  PPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPP 114
            P   ++A A +    + +     A    PP  + P         QP   P 
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPA 519


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.9 bits (70), Expect = 2.3
 Identities = 32/189 (16%), Positives = 69/189 (36%), Gaps = 25/189 (13%)

Query: 437 EFADDVRLIFSNCYKYNPPDHNV-VTMARQLSAVFEDRFAKMPDESNLASRAAASVSSDD 495
           +  DDVR IF + YK          T   +    F+ +  ++  ES   +++  ++ + +
Sbjct: 56  KLLDDVRTIFRDLYKNQLRQEKARTTYDEEFDEYFDQQLRELEKESKKQAKSPKAMRTFE 115

Query: 496 DSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSAMMND 555
           +S+              K++   I    ++    +K++KK ++   S      ++   + 
Sbjct: 116 ESKK-----------SKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSR 164

Query: 556 HVNK--MNKAPAPLNNGQKPKSL-NNVRKPQASNPQQAKKPKPNNANTVAAKKQVRTFD- 611
                 +     P    +K   L +        +P+    PK        A K++R +D 
Sbjct: 165 PSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKK-------AGKKMRKWDL 217

Query: 612 --SEDEDVA 618
              ED+D  
Sbjct: 218 DGDEDDDAV 226


>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P).
          Length = 167

 Score = 30.3 bits (68), Expect = 2.4
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 109 PPVVPPKHRPGRNTNQLQYIVKNVM 133
           PP V   +RP + T   + I +NV+
Sbjct: 36  PPPVQSHNRPAKATAYCRKIARNVV 60


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 31.3 bits (70), Expect = 2.4
 Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 6/37 (16%)

Query: 38   TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPS 74
                 S++S PP++T  TP      PPSSSSS+ A S
Sbjct: 1224 AQLQLSAASSPPAATTPTP------PPSSSSSSSAQS 1254


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 31.5 bits (72), Expect = 2.4
 Identities = 17/106 (16%), Positives = 27/106 (25%), Gaps = 6/106 (5%)

Query: 2    SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS- 60
                    T P V      +   V  E DV V       +     P   + ++    ++ 
Sbjct: 907  EEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETA 966

Query: 61   ---PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPV 103
                A P   +   AP     +         +VEP        E  
Sbjct: 967  EVVVAEPEVVAQPAAPVVAEVAAEV--ETVTAVEPEVAPAQVPEAT 1010


>gnl|CDD|233041 TIGR00596, rad1, DNA repair protein (rad1).  All proteins in this
           family for which functions are known are components in a
           multiprotein endonuclease complex (usually made up of
           Rad1 and Rad10 homologs). This complex is used primarily
           for nucleotide excision repair but also for some aspects
           of recombinational repair in some species. Most Archaeal
           species also have homologs of these genes, but the
           function of these Archaeal genes is not known, so we
           have set our cutoff to only pick up the eukaryotic
           genes.This family is based on the phylogenomic analysis
           of JA Eisen (1999, Ph.D. Thesis, Stanford Universit [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 814

 Score = 31.3 bits (71), Expect = 2.6
 Identities = 14/93 (15%), Positives = 38/93 (40%), Gaps = 13/93 (13%)

Query: 674 TLRELEQYVSSC----------LRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDST 723
           T  +L  Y+++           L K  + +  + +  K A++    +   ++ +S +   
Sbjct: 307 TCLQLRDYLTTSNKKRGSRAFLLNKLRWYRKWREETSKLAKEVQSQDTFPENASSNV--- 363

Query: 724 NKKLKKPKPTTSAAGPTGASRLSASSSSSSDSD 756
           NK  +K +  T      G S ++     +++++
Sbjct: 364 NKTFRKEQVPTKRRRVRGGSEVAVEKLRNANTN 396


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 30.9 bits (70), Expect = 2.8
 Identities = 25/127 (19%), Positives = 35/127 (27%), Gaps = 20/127 (15%)

Query: 217 QLFLTKITGMPSEEVVLDA-------PQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPS 269
           QL  T    +P E  V++A       PQP       P    P+  P  +        +P 
Sbjct: 340 QLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP 399

Query: 270 ATSTPKPRPPNPVLGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADGSIDHTPSSLH 329
                +   P PV                   +AP           +A   +D  P    
Sbjct: 400 KEPVRETATPPPVPPRPVAPPVPHT-----PESAPKLT--------RAAIPVDEKPKYTP 446

Query: 330 PTPVKSA 336
           P P K  
Sbjct: 447 PAPPKEE 453



 Score = 30.9 bits (70), Expect = 3.1
 Identities = 16/105 (15%), Positives = 26/105 (24%), Gaps = 10/105 (9%)

Query: 1   MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
           ++  E      P           P              + ST   +   ++       + 
Sbjct: 353 LAVIEALLVPVPAPQPAKPTAAAPSPVRP-------TPAPSTRPKAAAAANIPPKEPVRE 405

Query: 61  PAPPSSSSSAPA---PSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
            A P      P      H  ES    +  A  V+  P+  P   P
Sbjct: 406 TATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPP 450


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 30.8 bits (69), Expect = 2.8
 Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 10/124 (8%)

Query: 6   TETATDPVVNQNGTGDPDPVKAEQDVKVNNN-------NTSDSTSSSSPPPSSTEVTPVK 58
           TE +     +Q         +  Q    N          T  +T+SS+P P   +  P  
Sbjct: 182 TEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP---QTEPPP 238

Query: 59  KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
               P  S    P+ +  ++S +G+  +       P              PP    +   
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET 298

Query: 119 GRNT 122
           GR T
Sbjct: 299 GRPT 302


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 30.6 bits (69), Expect = 2.9
 Identities = 22/102 (21%), Positives = 37/102 (36%), Gaps = 4/102 (3%)

Query: 4   AETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAP 63
           +E     + + + NG G+P+P    +D   N ++T+    SS     S E    K+S   
Sbjct: 170 SELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDA 229

Query: 64  PSSSSSAPAPSH----HNESNSGNSTKAASVEPPPRDEPRLE 101
            +S    P  S      ++    +      VE     E   E
Sbjct: 230 AASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 30.8 bits (69), Expect = 2.9
 Identities = 17/76 (22%), Positives = 28/76 (36%)

Query: 38  TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
           +S  T SS+  PSS  +      P+  ++ S+    +     +S     +    P     
Sbjct: 307 SSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVAL 366

Query: 98  PRLEPVDGIVQPPVVP 113
           P  EPV+   QP    
Sbjct: 367 PAAEPVNMQPQPMSTT 382


>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex
           component. 
          Length = 233

 Score = 30.3 bits (69), Expect = 3.1
 Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 3/80 (3%)

Query: 495 DDSEDERQNQLKYLQEQLKSLTDQIRLL---VEDSTKPKKKKKKNRDQPKSKMPMGQNSA 551
           D  E E+     +L   +  L  QI LL   VE  +  +KK+K +  + +    + +   
Sbjct: 114 DPKEKEKAEVCDFLSNTIDELNKQIELLEAEVESLSVQRKKRKADAAKQERIAELEELLE 173

Query: 552 MMNDHVNKMNKAPAPLNNGQ 571
               H++K+      L NG+
Sbjct: 174 RHKFHISKLELILRLLENGE 193


>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
           Provisional.
          Length = 599

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 14/76 (18%)

Query: 481 SNLASRAAASVSSDDD--------------SEDERQNQLKYLQEQLKSLTDQIRLLVEDS 526
                R   ++SSD D              S+D+R+N     QE  ++  D    +V D 
Sbjct: 307 GGHEDRNKDAISSDSDYDVDGLEDAPGSVSSKDDRRNLQPVAQEPERARDDAPNQVVPDK 366

Query: 527 TKPKKKKKKNRDQPKS 542
            K KK +K+ R + +S
Sbjct: 367 EKTKKPRKRKRPRGRS 382


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 30.8 bits (70), Expect = 3.2
 Identities = 13/97 (13%), Positives = 27/97 (27%), Gaps = 8/97 (8%)

Query: 30  DVKVNNNNTSDSTSSSSPPPSSTEV--------TPVKKSPAPPSSSSSAPAPSHHNESNS 81
           +V+V+  N + + +        +          +   + P      +  P       +  
Sbjct: 65  EVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQP 124

Query: 82  GNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
                      P  ++P  +PV   V P   P    P
Sbjct: 125 APQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAP 161


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 29.9 bits (67), Expect = 3.4
 Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 6/82 (7%)

Query: 39  SDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP 98
           S+ TS S    S     P ++S  PPS  S   + SH     S +S+  A   PPP + P
Sbjct: 106 SEHTSRSGSRRSGGRRAPSERSGPPPSEGSVRSSLSHP----SSHSSYGAPGVPPPYNPP 161

Query: 99  RLEPVDGIVQPPVVPPKHRPGR 120
            L        P   PP   P R
Sbjct: 162 MLMMKPPP--PSPGPPGAPPVR 181


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 30.1 bits (68), Expect = 3.9
 Identities = 23/111 (20%), Positives = 35/111 (31%), Gaps = 8/111 (7%)

Query: 670 LKPSTLRELEQYVSSCL--RKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKL 727
           L    LR+ E+ + S L  R RT       K      +  E         S      + L
Sbjct: 46  LTLENLRKFEEELQSQLEKRVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRIL 105

Query: 728 KKP---KPTTSAAGPTGASRLSASSSSSSD---SDSSGQSTSSSSSESSDS 772
             P   K   S +  +  S     S +S     + S+    S+  S  S+ 
Sbjct: 106 STPELPKRKRSFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSRSNP 156


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 29.8 bits (67), Expect = 3.9
 Identities = 18/101 (17%), Positives = 29/101 (28%), Gaps = 1/101 (0%)

Query: 17  NGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHH 76
           +  GD     A  D     +   D+  ++    +     P   S +    +++ P  +  
Sbjct: 76  DDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAA 135

Query: 77  NESNSGNSTKAASVEPPPRDEPRLEPVDGIVQP-PVVPPKH 116
                     A    P  R  PR  P      P P  P  H
Sbjct: 136 ARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAH 176


>gnl|CDD|233143 TIGR00834, ae, anion exchange protein.  The Anion Exchanger (AE)
           Family (TC 2.A.31)Characterized protein members of the
           AE family are found only in animals.They preferentially
           catalyze anion exchange (antiport) reactions, typically
           acting as HCO3-:Cl- antiporters, but also transporting a
           range of other inorganic and organic anions.
           Additionally, renal Na+:HCO3- cotransporters have been
           found to be members of the AE family. They catalyze the
           reabsorption of HCO3- in the renal proximal tubule
           [Transport and binding proteins, Anions].
          Length = 900

 Score = 30.5 bits (69), Expect = 3.9
 Identities = 27/102 (26%), Positives = 38/102 (37%), Gaps = 14/102 (13%)

Query: 283 LGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADGSIDHTPSS------LHPTPVKSA 336
           +G    T  + +V H  +  A D  D+   I    D SI   P        L P P    
Sbjct: 239 IGRAIATLMSDEVFHDAAYLADDRDDLLAGIDEFLDCSIVLPPGEWDPEIRLEP-PAPLQ 297

Query: 337 KQLNTRRESGSITKKPQRIS------EEGGGGSGLGGSKTPL 372
           ++L  +R   S T +P+  +       E GG  G  G   PL
Sbjct: 298 RELLRKRYEPS-TVRPENPTMGGDTEPEDGGSEGPHGDDDPL 338


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 30.4 bits (68), Expect = 3.9
 Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 39  SDSTSSSSPPPSSTEVTPVKKSPAP-PSSSSSAPAPSHHNESNSGNST 85
            ++ S  +P     E  P  ++PAP P+ ++ APAP+      SG +T
Sbjct: 80  GEAGSEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAGGSGEAT 127


>gnl|CDD|153369 cd07685, F-BAR_Fes, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma)
           tyrosine kinase.  F-BAR domains are dimerization modules
           that bind and bend membranes and are found in proteins
           involved in membrane dynamics and actin reorganization.
           Fes (feline sarcoma), also called Fps (Fujinami poultry
           sarcoma), is a cytoplasmic (or nonreceptor) tyrosine
           kinase whose gene was first isolated from tumor-causing
           retroviruses. It is expressed in myeloid, vascular
           endothelial, epithelial, and neuronal cells, and plays
           important roles in cell growth and differentiation,
           angiogenesis, inflammation and immunity, and
           cytoskeletal regulation. Fes kinase has also been
           implicated as a tumor suppressor in colorectal cancer.
           It contains an N-terminal F-BAR domain, an SH2 domain,
           and a C-terminal catalytic kinase domain. F-BAR domains
           form banana-shaped dimers with a positively-charged
           concave surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules. The F-BAR domain of Fes is
           critical in its role in microtubule nucleation and
           bundling.
          Length = 237

 Score = 29.9 bits (67), Expect = 4.2
 Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 5/48 (10%)

Query: 501 RQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKK-----KKKNRDQPKSK 543
           +Q   K  Q+ ++ L  Q R L +DS + K+K     K K+RD+ K K
Sbjct: 122 KQEYTKTTQQDIEKLKSQYRSLAKDSAQAKRKYQEASKDKDRDKAKEK 169


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 30.5 bits (69), Expect = 4.3
 Identities = 9/54 (16%), Positives = 20/54 (37%), Gaps = 9/54 (16%)

Query: 42  TSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
           T++ +P   +        +  P +++++   P          +  A   EPP R
Sbjct: 274 TAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPP---------NPPATPPEPPAR 318


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 30.2 bits (68), Expect = 4.4
 Identities = 20/86 (23%), Positives = 26/86 (30%), Gaps = 6/86 (6%)

Query: 219 FLTKITGMPSEEVVLDAPQPRSSKKKPPVSA----SPSLNPVIKTPVIPLNKLPSATSTP 274
           FLTK     ++  V     P   KK  P S      P  NPV        +      S P
Sbjct: 82  FLTKTFEDKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPV--PAQAESSGSKPVPSIP 139

Query: 275 KPRPPNPVLGSTATTTTAPKVNHLNS 300
              P         T +   ++  L S
Sbjct: 140 VSTPEVKAPAPALTPSQKDRLETLLS 165


>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
          Length = 593

 Score = 30.5 bits (69), Expect = 4.4
 Identities = 12/46 (26%), Positives = 24/46 (52%)

Query: 727 LKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
           +    P             ++S+++S  S SSG +++S+S+ESS +
Sbjct: 531 VLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSNSNESSPT 576


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 30.4 bits (69), Expect = 4.5
 Identities = 11/61 (18%), Positives = 23/61 (37%)

Query: 648 IQSREPSLREPNPDEIEIDFETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKH 707
           +Q R   L + + ++I  D + LK    +   +   + L        P P ++   E + 
Sbjct: 473 LQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDPIEKTSLETEV 532

Query: 708 E 708
            
Sbjct: 533 S 533


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 30.5 bits (68), Expect = 4.5
 Identities = 17/94 (18%), Positives = 26/94 (27%), Gaps = 11/94 (11%)

Query: 2   SAAETETATDPVV--NQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKK 59
                  A DP V      +  PD  +A          T  ++    P P+  +      
Sbjct: 46  PHQAASRAPDPAVAPTSAASRKPDLAQAP---------TPAASEKFDPAPAPHQAASRAP 96

Query: 60  SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPP 93
            PA     ++AP P       S      A  +  
Sbjct: 97  DPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAG 130


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 29.7 bits (67), Expect = 4.7
 Identities = 22/107 (20%), Positives = 36/107 (33%), Gaps = 7/107 (6%)

Query: 669 TLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLK 728
            LK    R+  Q     LRKR  +   +  +     KK      ++D      STN    
Sbjct: 40  KLKY---RQPGQGTEDELRKRDLRAELEEAERAHKSKKENK-LAIEDADK---STNLDAS 92

Query: 729 KPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGE 775
                          +     + +SD+D S  S+ S SS+    + +
Sbjct: 93  NEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDD 139


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 29.9 bits (67), Expect = 4.8
 Identities = 20/73 (27%), Positives = 23/73 (31%), Gaps = 3/73 (4%)

Query: 44  SSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP- 102
           S SP P      P    P  P +S     PS      +      A+    PR  PR  P 
Sbjct: 6   SRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPG-PGAEPPPGRAAGPAAPRRRPRGCPA 64

Query: 103 -VDGIVQPPVVPP 114
            V      P  PP
Sbjct: 65  GVTFSSSAPPRPP 77


>gnl|CDD|237783 PRK14667, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 567

 Score = 30.1 bits (68), Expect = 4.9
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 5/44 (11%)

Query: 403 EKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEF--ADDVRL 444
           EK GL D  D IK    +G V+K++  R +KT  +F  ADD  L
Sbjct: 508 EKEGLKDILDKIK---GIGEVKKEIIYRNFKTLYDFLKADDEEL 548


>gnl|CDD|205499 pfam13319, DUF4090, Protein of unknown function (DUF4090).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
          Length = 84

 Score = 27.8 bits (62), Expect = 4.9
 Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 16/58 (27%)

Query: 398 TPVDVEKLGLTDYFDIIKKPMDL------GTVRKKMRNRTYKT-AKEFADDV---RLI 445
           +P+  E L L        + MDL        V+K MRNR  +T AK F  +    RLI
Sbjct: 18  SPIPPEMLDL------YNEVMDLEAQRQRSGVKKSMRNRIVRTGAKHFDQETLNQRLI 69


>gnl|CDD|218275 pfam04812, HNF-1B_C, Hepatocyte nuclear factor 1 (HNF-1), beta
           isoform C terminus.  This family consists of a region
           found within the alpha isoform and at the C terminus of
           the beta isoform of the homeobox-containing
           transcription factor of HNF-1. Different isoforms of
           HNF-1 are generated by the differential use of
           polyadenylation sites and by alternative splicing. The
           C-terminal region of HNF-1 is responsible for the
           activation of transcription. Mutations and polymorphisms
           in HNF-1 cause the type 3 form of maturity-onset
           diabetes of the young (MODY3).
          Length = 237

 Score = 29.9 bits (67), Expect = 5.0
 Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 3/81 (3%)

Query: 34  NNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPP 93
           + N T    S S  P   +++  V+ S       +S+   SH + ++         +EP 
Sbjct: 13  STNPTLSHHSGSPSPGPPSKLHGVRYSQQGSCEVTSSSTISHGDRASVLQQVSPGGLEPS 72

Query: 94  P---RDEPRLEPVDGIVQPPV 111
                 + ++  V G   PPV
Sbjct: 73  HSLLSTDAKMISVSGGPLPPV 93


>gnl|CDD|187552 cd05241, 3b-HSD-like_SDR_e, 3beta-hydroxysteroid dehydrogenases
           (3b-HSD)-like, extended (e) SDRs.  Extended SDR family
           domains belonging to this subgroup have the
           characteristic active site tetrad and a fairly
           well-conserved NAD(P)-binding motif. 3b-HSD catalyzes
           the NAD-dependent conversion of various steroids, such
           as pregnenolone to progesterone, or androstenediol to
           testosterone. This subgroup includes an unusual
           bifunctional 3b-HSD/C-4 decarboxylase from Arabidopsis
           thaliana, and Saccharomyces cerevisiae ERG26, a
           3b-HSD/C-4 decarboxylase, involved in the synthesis of
           ergosterol, the major sterol of yeast. It also includes
           human 3 beta-HSD/HSD3B1 and C(27) 3beta-HSD/
           [3beta-hydroxy-delta(5)-C(27)-steroid oxidoreductase;
           HSD3B7].  C(27) 3beta-HSD/HSD3B7 is a membrane-bound
           enzyme of the endoplasmic reticulum, that catalyzes the
           isomerization and oxidation of 7alpha-hydroxylated
           sterol intermediates, an early step in bile acid
           biosynthesis. Mutations in the human NSDHL (NAD(P)H
           steroid dehydrogenase-like protein) cause CHILD syndrome
           (congenital hemidysplasia with ichthyosiform nevus and
           limb defects), an X-linked dominant, male-lethal trait.
           Mutations in the human gene encoding C(27) 3beta-HSD
           underlie a rare autosomal recessive form of neonatal
           cholestasis. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid sythase have
           a GGXGXXG NAD(P)-binding motif and an altered active
           site motif (YXXXN). Fungal type ketoacyl reductases have
           a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 331

 Score = 30.1 bits (68), Expect = 5.2
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 2/51 (3%)

Query: 371 PLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLG 421
                YC+ +++EL       Y    ++P  V  L    YF I K   DLG
Sbjct: 265 SGPLAYCAALLSELVSFMLGPY--FVFSPFYVRALVTPMYFSIAKAQKDLG 313


>gnl|CDD|219741 pfam08193, INO80_Ies4, INO80 complex subunit Ies4.  The INO80
           ATPase is a member of the SNF2 family of ATPases and
           functions as an integral component of a multisubunit
           ATP-dependent chromatin remodelling complex. This family
           of proteins corresponds to the fungal Ies4 subunit of
           INO80.
          Length = 228

 Score = 29.6 bits (66), Expect = 5.4
 Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 7/116 (6%)

Query: 9   ATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSS 68
           A  PV     +  P    A+  V  + +N SD+ S+ +   S+T+ TP +K    P    
Sbjct: 37  APSPVKEDEPSDSPASSAADPPVPSSVDNASDAASTPAAGTSATD-TPRRKGGPGPKKGE 95

Query: 69  SAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
              A     +  +  +T     +P P+ +PRL+  DG ++       HR G   N 
Sbjct: 96  KRSA----GQGTTSETTSKPRGKPGPKKKPRLD--DGTLEGAKGTSSHRLGPKANT 145


>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47. 
          Length = 627

 Score = 30.2 bits (67), Expect = 5.4
 Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 7/109 (6%)

Query: 246 PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPKVN-HLNSMNAP 304
           P S   S +  I +P IP      +T+ P P PP      T   T   KV  +  ++ A 
Sbjct: 241 PSSTPSSTSASITSPHIP------STNIPTPEPPPVTKNFTELHTDTIKVTPNTPTITAQ 294

Query: 305 DTPDMKKAIKRKADGSIDHTPSSLHPTPVKSAKQLNTRRESGSITKKPQ 353
            T  +KK +KR       +TP+ +    ++    + T + + + T+ P+
Sbjct: 295 TTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPK 343


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 30.0 bits (67), Expect = 5.4
 Identities = 24/113 (21%), Positives = 39/113 (34%), Gaps = 2/113 (1%)

Query: 2   SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
           S+ E E   DP   +  T D D ++A      ++++   S    S   S +E    +   
Sbjct: 23  SSLEDEVFLDPR-PRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRR 81

Query: 62  APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR-DEPRLEPVDGIVQPPVVP 113
           A     + A  PS  N++     T  A          P       +  PP +P
Sbjct: 82  AGMGGRNKATKPSRRNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIP 134


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.0 bits (67), Expect = 5.6
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 2/52 (3%)

Query: 33  VNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAP--APSHHNESNSG 82
           V   +++ S S++ P  S+T  TP    P+ P    S+   A +   +S SG
Sbjct: 268 VRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISG 319


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 30.1 bits (67), Expect = 5.7
 Identities = 21/80 (26%), Positives = 30/80 (37%), Gaps = 13/80 (16%)

Query: 45  SSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHN----ESNSGNSTKAASVEPPPRDEPRL 100
           S    ++++VTP +K P       S PA          +      KA S  PPP+   + 
Sbjct: 166 SKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAK- 224

Query: 101 EPVDGIVQPPVVPPKHRPGR 120
                    P +PPK R  R
Sbjct: 225 --------EPQLPPKERERR 236


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 29.7 bits (68), Expect = 5.9
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 691 YKKTPKPKDEKFAEKKHELEKRLQDVTSQI----DSTNKKLKK 729
           ++  P+ + E+  EK  ELE+ LQ++  Q+        +KL++
Sbjct: 169 FEALPEEEREELEEKIDELEEELQEILRQLRELEREAREKLRE 211


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 29.7 bits (67), Expect = 6.1
 Identities = 14/65 (21%), Positives = 24/65 (36%), Gaps = 10/65 (15%)

Query: 41  STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
           + ++++    +T   P   +P P   ++S P P             A   EP P  +P  
Sbjct: 130 AAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPP----------AAAKPPEPAPAAKPPP 179

Query: 101 EPVDG 105
            PV  
Sbjct: 180 TPVAR 184


>gnl|CDD|131850 TIGR02803, ExbD_1, TonB system transport protein ExbD, group 1.
           Members of this family are Gram-negative bacterial inner
           membrane proteins, generally designated ExbD, related to
           the TolR family modeled by TIGRFAMs TIGR02801. Members
           always are encoded next to a protein designated ExbB
           (TIGR02797), related to the TolQ family modeled by
           TIGRFAMs TIGR02796. ExbD and ExbB together form a proton
           channel through which they can harness the proton-motive
           force to energize TonB, which in turn energizes
           TonB-dependent receptors in the outer membrane.
           TonB-dependent receptors with known specificity tend to
           import siderophores or vitamin B12. A TonB system and
           Tol-Pal system often will co-exist in a single bacterial
           genome.
          Length = 122

 Score = 28.3 bits (63), Expect = 6.9
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 267 LPSATSTPKPRPPNPV 282
           LP++T+ P+PRP  PV
Sbjct: 35  LPASTAAPQPRPEKPV 50


>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
          Length = 278

 Score = 29.3 bits (66), Expect = 7.3
 Identities = 20/131 (15%), Positives = 47/131 (35%), Gaps = 11/131 (8%)

Query: 648 IQSREPSLREPNPDEI-----EIDFETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKF 702
            +    + R+ NP+ +     E   E    + L  L + V     ++  K+     DE+F
Sbjct: 139 AKHNPLARRKQNPENLFSVACEAKEEFEPEAELAALFEKVKKLFSRK--KEDDAMTDEQF 196

Query: 703 AEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQST 762
           A+    + + ++ V  +ID T+ ++   +                ++   S    +    
Sbjct: 197 AD----VHEAVEGVAEKIDHTSAQVATTETAEPPQEGKDTEENGETAEQFSALKETLDKL 252

Query: 763 SSSSSESSDSE 773
           +S  +E   + 
Sbjct: 253 ASKFNEEVPAP 263


>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
           Saccharomyces cerevisiae Reduced viability upon
           starvation protein 167 and similar proteins.  BAR
           domains are dimerization, lipid binding and curvature
           sensing modules found in many different proteins with
           diverse functions. This subfamily is composed of fungal
           proteins with similarity to Saccharomyces cerevisiae
           Reduced viability upon starvation protein 167 (Rvs167p)
           and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
           cerevisiae Rvs167p plays a role in regulation of the
           actin cytoskeleton, endocytosis, and sporulation. It
           forms a heterodimer with another BAR domain protein
           Rvs161p. Rvs161p and Rvs167p share common functions but
           are not interchangeable. Their BAR domains cannot be
           replaced with each other and the overexpression of one
           cannot suppress the mutant phenotypes of the other.
           Rvs167p also interacts with the GTPase activating
           protein (GAP) Gyp5p, which is involved in ER to Golgi
           vesicle trafficking. BAR domains form dimers that bind
           to membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 216

 Score = 29.1 bits (66), Expect = 7.6
 Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 677 ELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKK 729
           + ++  +   +    KK    KDEK   +  +LE++L++   + ++ N+ LK 
Sbjct: 125 DYDKLQNKLNKLLQKKKELSLKDEK---QLAKLERKLEEAKEEYEALNELLKS 174


>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
           domain.  Spiders use fibroins to make silk strands. This
           family includes tubuliform silk fibroins which are used
           to protect egg cases. This domain is a structural domain
           which is found in repeats of up to 20 in many
           individuals (although this is not necessarily the case).
           RP1 makes up structural domains in the N terminal while
           RP2 makes up structural domains in the C terminal.
          Length = 167

 Score = 28.6 bits (64), Expect = 7.8
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 735 SAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESS 770
           + A    A+  +A S +++ + S   S SSS S +S
Sbjct: 132 AQAASASAAAAAAQSQAAASAFSQAASQSSSQSAAS 167



 Score = 28.6 bits (64), Expect = 9.3
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 740 TGASRLSASSSSSSDSDSSGQSTSSSSSESSD 771
             AS+ ++S+SSSS + +  QS SS+ + SS 
Sbjct: 1   AAASQAASSASSSSSASAFAQSLSSALASSSQ 32


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 29.8 bits (67), Expect = 7.8
 Identities = 17/108 (15%), Positives = 33/108 (30%), Gaps = 8/108 (7%)

Query: 3   AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
            A            +      PV A+ + + + ++  D   +  P  S +   P   +P 
Sbjct: 427 PAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPP 486

Query: 63  P----PSSSSSAPAPSHHNESNSGNSTKAAS----VEPPPRDEPRLEP 102
                P+  ++AP+ +         +  AAS            P   P
Sbjct: 487 DAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARP 534


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 29.5 bits (66), Expect = 7.8
 Identities = 15/64 (23%), Positives = 23/64 (35%)

Query: 31  VKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASV 90
           V V+   T     +      ++  TP    PA P+S  SA   S   E +    +  A+ 
Sbjct: 72  VMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATA 131

Query: 91  EPPP 94
              P
Sbjct: 132 PESP 135


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 29.4 bits (66), Expect = 7.8
 Identities = 14/73 (19%), Positives = 24/73 (32%), Gaps = 4/73 (5%)

Query: 236 PQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPKV 295
           P  R+S K    S   S N    +      +     S   P P  P   ++ +      +
Sbjct: 154 PSWRTSIKPFSSSHYGSHNS---SAYSDHLRPLQPYSGELPPPLGPNWQASNSKKIR-GL 209

Query: 296 NHLNSMNAPDTPD 308
                ++ P+ PD
Sbjct: 210 EFSMWVSQPNQPD 222


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 29.4 bits (66), Expect = 8.1
 Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 8/52 (15%)

Query: 47 PPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP 98
            P+  E TP   + + P+ +++APAP          +   A+  P P DEP
Sbjct: 9  EEPAKEEATPPAPAASAPAPAAAAPAPVA--------AAAPAAAGPRPDDEP 52


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 29.8 bits (67), Expect = 8.2
 Identities = 33/249 (13%), Positives = 76/249 (30%), Gaps = 19/249 (7%)

Query: 528  KPKKKKKKNRDQPKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNP 587
            +    KK +R  PK   P             +     + +      +    V KP+    
Sbjct: 1190 ESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAE----VVKPKGRAG 1245

Query: 588  QQAKKPKPNNANTVAAKKQVRTFDSEDEDVAKPMSYDEKRQLSLDINKLPGDKLGKVVHI 647
             + K P         A  + +  + E  D+   ++            K+           
Sbjct: 1246 AKKKAP---------AAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARR 1296

Query: 648  IQSREPSLR---EPNPDEIEIDFETLKPSTLRELEQYV---SSCLRKRTYKKTPKPKDEK 701
              +R+  L      +  + + D   ++ S    L++      +   K+  K     K   
Sbjct: 1297 AAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRG 1356

Query: 702  FAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQS 761
             A  +   +   + +           KK +   ++     +  +   ++++ +++SS   
Sbjct: 1357 PATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENV 1416

Query: 762  TSSSSSESS 770
            + SSSSE  
Sbjct: 1417 SGSSSSEKD 1425


>gnl|CDD|177952 PLN02318, PLN02318, phosphoribulokinase/uridine kinase.
          Length = 656

 Score = 29.4 bits (66), Expect = 8.4
 Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 3/74 (4%)

Query: 709 LEKRLQDVTSQIDSTNKKLKKPKPTTSA---AGPTGASRLSASSSSSSDSDSSGQSTSSS 765
           L +R+ + TS+I+  N KL   K + S    A    A   SA +S       +G  T S 
Sbjct: 509 LNERMDEFTSRIEELNSKLSIKKNSPSQQNLALQAEACNGSAPTSYFVSGLGNGSLTGSI 568

Query: 766 SSESSDSEGEKGPH 779
              SS S       
Sbjct: 569 LPLSSSSSQLAKES 582


>gnl|CDD|227625 COG5309, COG5309, Exo-beta-1,3-glucanase [Carbohydrate transport
           and metabolism].
          Length = 305

 Score = 29.0 bits (65), Expect = 8.6
 Identities = 15/45 (33%), Positives = 22/45 (48%)

Query: 733 TTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEKG 777
           ++     T +S   A+ SSSS + SS  S  SSSS  + + G   
Sbjct: 3   SSMQFSSTSSSAALATLSSSSSALSSSASEVSSSSSRASASGFLA 47


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 29.2 bits (65), Expect = 8.6
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 2/57 (3%)

Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
           S +++SSS   S      ++ +PA P+ ++S         ++  NST A   E   
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTD--NSTNANGTESTT 79


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 29.3 bits (65), Expect = 8.7
 Identities = 26/146 (17%), Positives = 47/146 (32%), Gaps = 11/146 (7%)

Query: 4   AETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAP 63
            E ETA  PV   +    P   K    V    NN        +P  + T     + S   
Sbjct: 398 PEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDV-PPWEDAPDEAQTAAGTAQTSAKS 456

Query: 64  PSSSSSA--PAPSHHNESNSGNSTKAASVEPPPRDEPRL-----EPVDGIVQPPVVPPKH 116
             ++S A  P  +  +++ + ++   A +   P + P       E V+        P + 
Sbjct: 457 IQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEP 516

Query: 117 RPGRNTNQLQYIVKN---VMKAVWKH 139
             G          ++   +    W+H
Sbjct: 517 FYGYGFPDNDCPPEDGAEIPPPDWEH 542


>gnl|CDD|183064 PRK11267, PRK11267, biopolymer transport protein ExbD; Provisional.
          Length = 141

 Score = 28.2 bits (63), Expect = 8.8
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 267 LPSATSTPKPRPPNPV 282
           LP++TSTP+PRP  PV
Sbjct: 49  LPASTSTPQPRPEKPV 64


>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
           protein, TolC family.  Members of This model are outer
           membrane proteins from the TolC subfamily within the RND
           (Resistance-Nodulation-cell Division) efflux systems.
           These proteins, unlike the NodT subfamily, appear not to
           be lipoproteins. All are believed to participate in type
           I protein secretion, an ABC transporter system for
           protein secretion without cleavage of a signal sequence,
           although they may, like TolC, participate also in the
           efflux of smaller molecules as well. This family
           includes the well-documented examples TolC (E. coli),
           PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
           [Protein fate, Protein and peptide secretion and
           trafficking, Transport and binding proteins, Porins].
          Length = 415

 Score = 29.3 bits (66), Expect = 8.9
 Identities = 11/57 (19%), Positives = 18/57 (31%), Gaps = 3/57 (5%)

Query: 460 VTMARQLSAVFEDRFAKMPDESNLASRAAAS---VSSDDDSEDERQNQLKYLQEQLK 513
             +    +A    R        +L  R A +   V    +     +  L  L+EQL 
Sbjct: 88  NAVRAAEAAALAARETLRATAQDLILRTAEAYMEVLRAQEILALAEANLAALKEQLD 144


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 29.4 bits (66), Expect = 9.0
 Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 3/82 (3%)

Query: 44  SSSPPPSSTEVTPVKKSPAPPSS-SSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
            S      T   P  +      S  S++P  S   ++ SG + + AS+ PP   + +LE 
Sbjct: 348 VSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLED 407

Query: 103 VDGIVQPPVVPPKHRPGRNTNQ 124
                  P  P      +   Q
Sbjct: 408 ASDKQ--PPTPGSSLVPQPDEQ 427


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.124    0.354 

Gapped
Lambda     K      H
   0.267   0.0798    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,426,153
Number of extensions: 3727658
Number of successful extensions: 7401
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6570
Number of HSP's successfully gapped: 526
Length of query: 779
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 674
Effective length of database: 6,280,432
Effective search space: 4233011168
Effective search space used: 4233011168
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 63 (27.9 bits)