RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9598
(1915 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 181 bits (462), Expect = 7e-51
Identities = 95/320 (29%), Positives = 148/320 (46%), Gaps = 37/320 (11%)
Query: 474 GHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEISDIAIDNRNILL 533
GH V CV F G+++ TG+ D IK+W G+LL TL+G +G + D+A L
Sbjct: 7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYL 66
Query: 534 AAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWKY 593
A+G+ DKTIR+W+L+T + L GH ++ V F P L+S+S D I W
Sbjct: 67 ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG----RILSSSSRDKTIKVW-- 120
Query: 594 KLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATGSGDHHVRVYKMDGVDSP 653
D++ K + +R + +FSP G F+A+ S D ++++ +
Sbjct: 121 --DVETGKCLT--------TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT-GKC 169
Query: 654 LGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPHTKE 713
+ L H+ +V+S+ +S + +S S DGT +W + + T H
Sbjct: 170 VATL--TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC------LGTLRGHENG 221
Query: 714 TSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESH 773
V VA+ + + + + I++WD G+ VQ L GHTN V L
Sbjct: 222 ----------VNSVAFSPDGYLLASG-SEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS 270
Query: 774 PFDSRVLLSAGHDGLIIIWD 793
P D + L S DG I IWD
Sbjct: 271 P-DGKRLASGSADGTIRIWD 289
Score = 147 bits (372), Expect = 7e-39
Identities = 83/326 (25%), Positives = 135/326 (41%), Gaps = 58/326 (17%)
Query: 510 LLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFC 569
L TL+G +G ++ +A LLA G+ D TI++W+L+T + L GH G + V
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 570 PLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPG 629
YLAS S+D I W + + H +++ +FSP
Sbjct: 61 A----DGTYLASGSSDKTIRLWDLETGECVRTLTGHT------------SYVSSVAFSPD 104
Query: 630 GLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALI 689
G L++ S D ++V+ V++ + + H+D V+S+ +S S S+DGT +
Sbjct: 105 GRILSSSSRDKTIKVWD---VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
Query: 690 WYYKCSQWRFIRLDMNTCLP------HTKETSDETNKKIKVTMVAWDASDRWVITAINFN 743
W D+ T HT E V VA+ ++++ +
Sbjct: 162 W------------DLRTGKCVATLTGHTGE----------VNSVAFSPDGEKLLSSSSDG 199
Query: 744 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 803
IK+WD G + L+GH N V + P D +L S DG I +WD+ + +
Sbjct: 200 -TIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQ-- 255
Query: 804 WFHNIVEGQG-EGALFDGKWSPDGTT 828
G ++ WSPDG
Sbjct: 256 ------TLSGHTNSVTSLAWSPDGKR 275
Score = 138 bits (349), Expect = 9e-36
Identities = 71/293 (24%), Positives = 118/293 (40%), Gaps = 44/293 (15%)
Query: 462 KYSNLRQHYVNLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEI 521
GH V V G + +G+ D I+LW G+ + TL G + +
Sbjct: 37 DLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV 96
Query: 522 SDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLAS 581
S +A +L++ + DKTI++W+++T ++ L GH + V F P ++AS
Sbjct: 97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG----TFVAS 152
Query: 582 TSTDGCIGFWKYKLDIDISKTV----FHEPVMFNERIRPGNAHILCSSFSPGGLFLATGS 637
+S DG I W D+ K V H + +FSP G L + S
Sbjct: 153 SSQDGTIKLW----DLRTGKCVATLTGHTGE------------VNSVAFSPDGEKLLSSS 196
Query: 638 GDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQW 697
D ++++ + LG L H + V+S+ +S SGS DGT +W
Sbjct: 197 SDGTIKLWDLST-GKCLGTLRG--HENGVNSVAFSPDGYLLASGSEDGTIRVW------- 246
Query: 698 RFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWD 750
L C+ ++ VT +AW + + + + I+IWD
Sbjct: 247 ---DLRTGECVQTLSGHTNS------VTSLAWSPDGKRLASGS-ADGTIRIWD 289
>gnl|CDD|99928 cd05496, Bromo_WDR9_II, Bromodomain; WDR9 repeat II_like subfamily.
WDR9 is a human gene located in the Down Syndrome
critical region-2 of chromosome 21. It encodes for a
nuclear protein containing WD40 repeats and two
bromodomains, which may function as a transcriptional
regulator involved in chromatin remodeling and play a
role in embryonic development. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 119
Score = 152 bits (386), Expect = 5e-43
Identities = 57/115 (49%), Positives = 81/115 (70%)
Query: 117 NWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPI 176
+WK +C+E++ +W C DSEPFR+PVD L +PDY I+ PMDLGT++E L G NY+ P+
Sbjct: 5 DWKKQCKELVNLMWDCEDSEPFRQPVDLLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPM 64
Query: 177 DFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENK 231
+F KDV+ +F NS++Y NKRS+IY MT RLS EEH++++ W S + K
Sbjct: 65 EFAKDVRLIFSNSKSYTPNKRSRIYSMTLRLSALFEEHIKKIISDWKSALKRNEK 119
Score = 152 bits (386), Expect = 5e-43
Identities = 57/115 (49%), Positives = 81/115 (70%)
Query: 1532 NWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPI 1591
+WK +C+E++ +W C DSEPFR+PVD L +PDY I+ PMDLGT++E L G NY+ P+
Sbjct: 5 DWKKQCKELVNLMWDCEDSEPFRQPVDLLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPM 64
Query: 1592 DFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENK 1646
+F KDV+ +F NS++Y NKRS+IY MT RLS EEH++++ W S + K
Sbjct: 65 EFAKDVRLIFSNSKSYTPNKRSRIYSMTLRLSALFEEHIKKIISDWKSALKRNEK 119
Score = 41.7 bits (98), Expect = 4e-04
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 1391 AEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFN 1449
+EPF PVD+ +YP Y +++ P+DL T++ Y DVR + +N++ +
Sbjct: 23 SEPFRQPVDLLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPMEFAKDVRLIFSNSKSYT 81
>gnl|CDD|99958 cd05529, Bromo_WDR9_I_like, Bromodomain; WDR9 repeat I_like
subfamily. WDR9 is a human gene located in the Down
Syndrome critical region-2 of chromosome 21. It encodes
for a nuclear protein containing WD40 repeats and two
bromodomains, which may function as a transcriptional
regulator involved in chromatin remodeling and play a
role in embryonic development. Bromodomains are 110 amino
acid long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 128
Score = 127 bits (322), Expect = 3e-34
Identities = 52/128 (40%), Positives = 78/128 (60%), Gaps = 12/128 (9%)
Query: 1358 LYHPRPDEWPQGD--------RDATCEKIITGLEEVMG---LSVAEPFLVPVDI-SQYPA 1405
LY+P EW D RD E++I+GL++++ L +AE F PVD+ + YP
Sbjct: 1 LYNPLSSEWELFDPGWEQPHIRDEERERLISGLDKLLLSLQLEIAEYFEYPVDLRAWYPD 60
Query: 1406 YAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARIITDL 1465
Y V P+DL T+R+R +NR+YR + + + DVR + +NAE FNEP+S I K A+ ++D
Sbjct: 61 YWNRVPVPMDLETIRSRLENRYYRSLEALRHDVRLILSNAETFNEPNSEIAKKAKRLSDW 120
Query: 1466 LLRIIKEP 1473
LLRI+
Sbjct: 121 LLRILSSL 128
Score = 82.8 bits (205), Expect = 2e-18
Identities = 27/58 (46%), Positives = 41/58 (70%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARIITDLLLRIIKEP 58
L T+R+R +NR+YR + + + DVR + +NAE FNEP+S I K A+ ++D LLRI+
Sbjct: 71 LETIRSRLENRYYRSLEALRHDVRLILSNAETFNEPNSEIAKKAKRLSDWLLRILSSL 128
Score = 43.1 bits (102), Expect = 1e-04
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 133 PDSEPFRKPVD-KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 191
+E F PVD + +PDY V PMDL TIR L Y S DV+ + N+
Sbjct: 43 EIAEYFEYPVDLRAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRHDVRLILSNAET 102
Query: 192 YNQNKRSKIYIMTTRLSE 209
+N S+I RLS+
Sbjct: 103 FN-EPNSEIAKKAKRLSD 119
Score = 43.1 bits (102), Expect = 1e-04
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 1548 PDSEPFRKPVD-KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 1606
+E F PVD + +PDY V PMDL TIR L Y S DV+ + N+
Sbjct: 43 EIAEYFEYPVDLRAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRHDVRLILSNAET 102
Query: 1607 YNQNKRSKIYIMTTRLSE 1624
+N S+I RLS+
Sbjct: 103 FN-EPNSEIAKKAKRLSD 119
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 124 bits (312), Expect = 1e-29
Identities = 94/397 (23%), Positives = 163/397 (41%), Gaps = 58/397 (14%)
Query: 448 YSGPLSRSLMISSHKYSNLRQHYVNLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARD 507
SG L +L+ S + GH ++ + F GE++++G+ D IKLW +
Sbjct: 37 ESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDN 96
Query: 508 G-KLLATLRGCSG----EISDIAIDNRNILLAAGTVDKTIRIWNLQTLA-PISVLVGHQG 561
G KL+ +L G +++ + D +ILLA+ ++D T+++W+L T I L GH
Sbjct: 97 GEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSE 156
Query: 562 IITGVNFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHI 621
+T + F P +G + +S DG I W + +S H + + P +
Sbjct: 157 SVTSLAFSP---DGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLL 213
Query: 622 LCS-------------------------------SFSPGGLFLATGSGDHHVRVYKMDGV 650
+ S SFSP G LA+GS D +R++ +
Sbjct: 214 IASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSS 273
Query: 651 DSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPH 710
S L L HS V S+ +S SGS DGT +W +
Sbjct: 274 SSLLRTLSG--HSSSVLSVAFSPDGKLLASGSSDGTVRLW--------------DLETGK 317
Query: 711 TKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVL 770
+ + V+ +++ +++ + + I++WD G ++ L+GH+N + V
Sbjct: 318 LLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSV- 376
Query: 771 ESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYKWFHN 807
S D RV+ S DG + +WD+ + H
Sbjct: 377 -SFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHT 412
Score = 90.1 bits (222), Expect = 2e-18
Identities = 61/289 (21%), Positives = 110/289 (38%), Gaps = 31/289 (10%)
Query: 567 NFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSF 626
+ +N + L+ S++ I D +S + +R I +F
Sbjct: 19 SELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLS-----SLLLRGHEDSITSIAF 73
Query: 627 SPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSH---SHLRFVSGSR 683
SP G L +GS D ++++ +D + + LE H V + S + + S S
Sbjct: 74 SPDGELLLSGSSDGTIKLWDLDNGEKLIKSLE-GLHDSSVSKLALSSPDGNSILLASSSL 132
Query: 684 DGTALIWYYKCSQWRFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFN 743
DGT +W + T H++ VT +A+ + + + + +
Sbjct: 133 DGTVKLW-----DLSTPGKLIRTLEGHSES----------VTSLAFSPDGKLLASGSSLD 177
Query: 744 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 803
IK+WD G + L GHT+ V L P ++ S DG I +WD+ + K
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTG-----K 232
Query: 804 WFHNIVEGQGEGALFDGKWSPDGTTCALTDSYGHLLVYGLGGFKRPQNI 852
+ + G + + +SPDG+ A S G + ++ L
Sbjct: 233 LLRSTLSGHSDSVV--SSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRT 279
Score = 87.5 bits (215), Expect = 2e-17
Identities = 73/304 (24%), Positives = 119/304 (39%), Gaps = 39/304 (12%)
Query: 474 GHLSAVFCVLFDKLGE-VIVTGADDLLIKLWRARDGKLLATLRGCSGEISDIAIDNRNIL 532
GH V + F G +I +G+ D I+LW GKLL + + + L
Sbjct: 196 GHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSL 255
Query: 533 LAAGTVDKTIRIWNLQTLA-PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFW 591
LA+G+ D TIR+W+L++ + + L GH + V F P LAS S+DG + W
Sbjct: 256 LASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDG----KLLASGSSDGTVRLW 311
Query: 592 KYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATG-SGDHHVRVYKMDGV 650
D++ K + + +E + SFSP G L +G S D +R++ +
Sbjct: 312 ----DLETGKLLSSLTLKGHEGP------VSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361
Query: 651 DSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPH 710
+ E V S+ +S SGS DGT +W L
Sbjct: 362 KPLKTL----EGHSNVLSVSFSPDGRVVSSGSTDGTVRLW----------------DLST 401
Query: 711 TKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVL 770
+ +VT + + + + + + N I++WD L V +V
Sbjct: 402 GSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT-IRLWDLKT-SLKSVSFSPDGKVLAS 459
Query: 771 ESHP 774
+S
Sbjct: 460 KSSD 463
Score = 48.9 bits (115), Expect = 2e-05
Identities = 26/92 (28%), Positives = 41/92 (44%)
Query: 465 NLRQHYVNLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEISDI 524
+ L S V V F G V+ +G+ D ++LW G LL L G + ++ +
Sbjct: 359 RTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSL 418
Query: 525 AIDNRNILLAAGTVDKTIRIWNLQTLAPISVL 556
LA+G+ D TIR+W+L+T
Sbjct: 419 DFSPDGKSLASGSSDNTIRLWDLKTSLKSVSF 450
>gnl|CDD|99922 cd04369, Bromodomain, Bromodomain. Bromodomains are found in many
chromatin-associated proteins and in nuclear histone
acetyltransferases. They interact specifically with
acetylated lysine.
Length = 99
Score = 93.2 bits (232), Expect = 2e-22
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 3/100 (3%)
Query: 118 WKNECREVLETLW--VCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESP 175
K + R +L+ L SEPF +PVD + PDY +++ PMDL TI+++L+ Y+S
Sbjct: 1 LKKKLRSLLDALKKLKRDLSEPFLEPVDPKEAPDYYEVIKNPMDLSTIKKKLKNGEYKSL 60
Query: 176 IDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 215
+F DV+ +F N++ YN S IY +L + E+ L
Sbjct: 61 EEFEADVRLIFSNAKTYNG-PGSPIYKDAKKLEKLFEKLL 99
Score = 93.2 bits (232), Expect = 2e-22
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 3/100 (3%)
Query: 1533 WKNECREVLETLW--VCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESP 1590
K + R +L+ L SEPF +PVD + PDY +++ PMDL TI+++L+ Y+S
Sbjct: 1 LKKKLRSLLDALKKLKRDLSEPFLEPVDPKEAPDYYEVIKNPMDLSTIKKKLKNGEYKSL 60
Query: 1591 IDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 1630
+F DV+ +F N++ YN S IY +L + E+ L
Sbjct: 61 EEFEADVRLIFSNAKTYNG-PGSPIYKDAKKLEKLFEKLL 99
Score = 63.5 bits (155), Expect = 5e-12
Identities = 30/98 (30%), Positives = 58/98 (59%), Gaps = 5/98 (5%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSA 1434
++ L+++ ++EPFL PVD + P Y V++ P+DL+T++ + +N Y+ +
Sbjct: 5 LRSLLDALKKLKR-DLSEPFLEPVDPKEAPDYYEVIKNPMDLSTIKKKLKNGEYK--SLE 61
Query: 1435 QF--DVRYLAANAEKFNEPHSNIVKYARIITDLLLRII 1470
+F DVR + +NA+ +N P S I K A+ + L +++
Sbjct: 62 EFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99
Score = 34.3 bits (79), Expect = 0.10
Identities = 17/57 (29%), Positives = 34/57 (59%), Gaps = 4/57 (7%)
Query: 1 LTTLRARFQNRFYRRITSAQF--DVRYLAANAEKFNEPHSNIVKYARIITDLLLRII 55
L+T++ + +N Y+ + +F DVR + +NA+ +N P S I K A+ + L +++
Sbjct: 45 LSTIKKKLKNGEYK--SLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99
>gnl|CDD|197636 smart00297, BROMO, bromo domain.
Length = 107
Score = 93.1 bits (232), Expect = 3e-22
Identities = 41/109 (37%), Positives = 60/109 (55%), Gaps = 3/109 (2%)
Query: 110 KKLKRPFNWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRG 169
KL++ + + VL+ L P S PF KPV + + PDY I+ PMDL TI+++L
Sbjct: 2 PKLQKKL--QELLKAVLDKLDSHPLSWPFLKPVSRKEAPDYYDIIKKPMDLKTIKKKLEN 59
Query: 170 DNYESPIDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 218
Y S +FV D +F N+R YN S++Y +L +F E+ LR L
Sbjct: 60 GKYSSVEEFVADFNLMFSNARTYNG-PDSEVYKDAKKLEKFFEKKLREL 107
Score = 93.1 bits (232), Expect = 3e-22
Identities = 41/109 (37%), Positives = 60/109 (55%), Gaps = 3/109 (2%)
Query: 1525 KKLKRPFNWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRG 1584
KL++ + + VL+ L P S PF KPV + + PDY I+ PMDL TI+++L
Sbjct: 2 PKLQKKL--QELLKAVLDKLDSHPLSWPFLKPVSRKEAPDYYDIIKKPMDLKTIKKKLEN 59
Query: 1585 DNYESPIDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 1633
Y S +FV D +F N+R YN S++Y +L +F E+ LR L
Sbjct: 60 GKYSSVEEFVADFNLMFSNARTYNG-PDSEVYKDAKKLEKFFEKKLREL 107
Score = 67.7 bits (166), Expect = 2e-13
Identities = 22/82 (26%), Positives = 42/82 (51%)
Query: 1391 AEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNE 1450
+ PFL PV + P Y +++ P+DL T++ + +N Y + D + +NA +N
Sbjct: 25 SWPFLKPVSRKEAPDYYDIIKKPMDLKTIKKKLENGKYSSVEEFVADFNLMFSNARTYNG 84
Query: 1451 PHSNIVKYARIITDLLLRIIKE 1472
P S + K A+ + + ++E
Sbjct: 85 PDSEVYKDAKKLEKFFEKKLRE 106
Score = 40.3 bits (95), Expect = 9e-04
Identities = 13/57 (22%), Positives = 27/57 (47%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARIITDLLLRIIKE 57
L T++ + +N Y + D + +NA +N P S + K A+ + + ++E
Sbjct: 50 LKTIKKKLENGKYSSVEEFVADFNLMFSNARTYNGPDSEVYKDAKKLEKFFEKKLRE 106
>gnl|CDD|99941 cd05509, Bromo_gcn5_like, Bromodomain; Gcn5_like subfamily. Gcn5p
is a histone acetyltransferase (HAT) which mediates
acetylation of histones at lysine residues; such
acetylation is generally correlated with the activation
of transcription. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 101
Score = 89.9 bits (224), Expect = 3e-21
Identities = 34/100 (34%), Positives = 56/100 (56%), Gaps = 1/100 (1%)
Query: 119 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 178
+ ++VL++L + PF +PVDK + PDY ++ PMDL T+ E+L Y + +F
Sbjct: 3 YTQLKKVLDSLKNHKSAWPFLEPVDKEEAPDYYDVIKKPMDLSTMEEKLENGYYVTLEEF 62
Query: 179 VKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 218
V D+K +F N R YN ++ Y +L +F + L+ L
Sbjct: 63 VADLKLIFDNCRLYNG-PDTEYYKCANKLEKFFWKKLKEL 101
Score = 89.9 bits (224), Expect = 3e-21
Identities = 34/100 (34%), Positives = 56/100 (56%), Gaps = 1/100 (1%)
Query: 1534 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 1593
+ ++VL++L + PF +PVDK + PDY ++ PMDL T+ E+L Y + +F
Sbjct: 3 YTQLKKVLDSLKNHKSAWPFLEPVDKEEAPDYYDVIKKPMDLSTMEEKLENGYYVTLEEF 62
Query: 1594 VKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 1633
V D+K +F N R YN ++ Y +L +F + L+ L
Sbjct: 63 VADLKLIFDNCRLYNG-PDTEYYKCANKLEKFFWKKLKEL 101
Score = 52.2 bits (126), Expect = 6e-08
Identities = 25/82 (30%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 1393 PFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQF--DVRYLAANAEKFNE 1450
PFL PVD + P Y V++ P+DL+T+ + +N +Y +T +F D++ + N +N
Sbjct: 21 PFLEPVDKEEAPDYYDVIKKPMDLSTMEEKLENGYY--VTLEEFVADLKLIFDNCRLYNG 78
Query: 1451 PHSNIVKYARIITDLLLRIIKE 1472
P + K A + + +KE
Sbjct: 79 PDTEYYKCANKLEKFFWKKLKE 100
>gnl|CDD|99927 cd05495, Bromo_cbp_like, Bromodomain, cbp_like subfamily. Cbp (CREB
binding protein or CREBBP) is an acetyltransferase
acting on histone, which gives a specific tag for
transcriptional activation and also acetylates
non-histone proteins. CREBBP binds specifically to
phosphorylated CREB protein and augments the activity of
phosphorylated CREB to activate transcription of
cAMP-responsive genes. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 108
Score = 79.8 bits (197), Expect = 1e-17
Identities = 39/85 (45%), Positives = 52/85 (61%), Gaps = 4/85 (4%)
Query: 133 PDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSR 190
P+S PFR+PVD L PDY IV PMDL TIR +L Y+ P +V DV +F N+
Sbjct: 20 PESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFDNAW 79
Query: 191 NYNQNKRSKIYIMTTRLSE-FAEEH 214
YN+ K S++Y T+L+E F +E
Sbjct: 80 LYNR-KTSRVYKYCTKLAEVFEQEI 103
Score = 79.8 bits (197), Expect = 1e-17
Identities = 39/85 (45%), Positives = 52/85 (61%), Gaps = 4/85 (4%)
Query: 1548 PDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSR 1605
P+S PFR+PVD L PDY IV PMDL TIR +L Y+ P +V DV +F N+
Sbjct: 20 PESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFDNAW 79
Query: 1606 NYNQNKRSKIYIMTTRLSE-FAEEH 1629
YN+ K S++Y T+L+E F +E
Sbjct: 80 LYNR-KTSRVYKYCTKLAEVFEQEI 103
Score = 41.7 bits (98), Expect = 3e-04
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 2/82 (2%)
Query: 1391 AEPFLVPVD--ISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKF 1448
+ PF PVD + P Y +V+ P+DL+T+R + Y+ DV + NA +
Sbjct: 22 SLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFDNAWLY 81
Query: 1449 NEPHSNIVKYARIITDLLLRII 1470
N S + KY + ++ + I
Sbjct: 82 NRKTSRVYKYCTKLAEVFEQEI 103
>gnl|CDD|99932 cd05500, Bromo_BDF1_2_I, Bromodomain. BDF1/BDF2 like subfamily,
restricted to fungi, repeat I. BDF1 and BDF2 are yeast
transcription factors involved in the expression of a
wide range of genes, including snRNAs; they are required
for sporulation and DNA repair and protect histone H4
from deacetylation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 103
Score = 79.3 bits (196), Expect = 2e-17
Identities = 31/85 (36%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 133 PDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSR 190
D+ PF PVD KL+ P Y I+ PMDLGTI +L+ + Y S +F D + N
Sbjct: 20 KDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKSNVYTSVEEFTADFNLMVDNCL 79
Query: 191 NYNQNKRSKIYIMTTRLSEFAEEHL 215
+N + M RL E+HL
Sbjct: 80 TFN-GPEHPVSQMGKRLQAAFEKHL 103
Score = 79.3 bits (196), Expect = 2e-17
Identities = 31/85 (36%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 1548 PDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSR 1605
D+ PF PVD KL+ P Y I+ PMDLGTI +L+ + Y S +F D + N
Sbjct: 20 KDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKSNVYTSVEEFTADFNLMVDNCL 79
Query: 1606 NYNQNKRSKIYIMTTRLSEFAEEHL 1630
+N + M RL E+HL
Sbjct: 80 TFN-GPEHPVSQMGKRLQAAFEKHL 103
Score = 42.3 bits (100), Expect = 2e-04
Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 8/71 (11%)
Query: 1388 LSVAEPFLVPVD-----ISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLA 1442
L A PFLVPVD I YP ++ P+DL T+ + ++ Y + D +
Sbjct: 19 LKDARPFLVPVDPVKLNIPHYPTI---IKKPMDLGTIERKLKSNVYTSVEEFTADFNLMV 75
Query: 1443 ANAEKFNEPHS 1453
N FN P
Sbjct: 76 DNCLTFNGPEH 86
>gnl|CDD|215921 pfam00439, Bromodomain, Bromodomain. Bromodomains are 110 amino
acid long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 84
Score = 75.5 bits (186), Expect = 2e-16
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 122 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 181
C E+LE L P +EPF +PVD ++PDY +++ PMDL TIR++L+ Y+S +F+KD
Sbjct: 1 CLEILEDLMEHPLAEPFLEPVDPEEYPDYYEVIKEPMDLSTIRQKLKSGKYKSLAEFLKD 60
Query: 182 VKRVFHNSRNYNQNKRSKIYIM 203
V+ +F N+ YN + S IY
Sbjct: 61 VELIFSNAITYNG-EDSDIYKD 81
Score = 75.5 bits (186), Expect = 2e-16
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 1537 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 1596
C E+LE L P +EPF +PVD ++PDY +++ PMDL TIR++L+ Y+S +F+KD
Sbjct: 1 CLEILEDLMEHPLAEPFLEPVDPEEYPDYYEVIKEPMDLSTIRQKLKSGKYKSLAEFLKD 60
Query: 1597 VKRVFHNSRNYNQNKRSKIYIM 1618
V+ +F N+ YN + S IY
Sbjct: 61 VELIFSNAITYNG-EDSDIYKD 81
Score = 68.6 bits (168), Expect = 5e-14
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 3/87 (3%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSA 1434
C +I LE++M +AEPFL PVD +YP Y V++ P+DL+T+R + ++ Y+ +
Sbjct: 1 CLEI---LEDLMEHPLAEPFLEPVDPEEYPDYYEVIKEPMDLSTIRQKLKSGKYKSLAEF 57
Query: 1435 QFDVRYLAANAEKFNEPHSNIVKYARI 1461
DV + +NA +N S+I K A+
Sbjct: 58 LKDVELIFSNAITYNGEDSDIYKDAKK 84
Score = 39.7 bits (93), Expect = 8e-04
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARI 46
L+T+R + ++ Y+ + DV + +NA +N S+I K A+
Sbjct: 39 LSTIRQKLKSGKYKSLAEFLKDVELIFSNAITYNGEDSDIYKDAKK 84
>gnl|CDD|99938 cd05506, Bromo_plant1, Bromodomain, uncharacterized subfamily
specific to plants. Might function as a global
transcription factor. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 99
Score = 73.1 bits (180), Expect = 2e-15
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 121 ECREVLETLWVCPDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 178
+C +L L F PVD L PDY I+ PMDLGT++++L Y SP +F
Sbjct: 4 QCGTLLRKLMKHKWGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEF 63
Query: 179 VKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEE 213
DV+ F N+ YN + ++ M L + E
Sbjct: 64 AADVRLTFANAMRYNP-PGNDVHTMAKELLKIFET 97
Score = 73.1 bits (180), Expect = 2e-15
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 1536 ECREVLETLWVCPDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 1593
+C +L L F PVD L PDY I+ PMDLGT++++L Y SP +F
Sbjct: 4 QCGTLLRKLMKHKWGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEF 63
Query: 1594 VKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEE 1628
DV+ F N+ YN + ++ M L + E
Sbjct: 64 AADVRLTFANAMRYNP-PGNDVHTMAKELLKIFET 97
Score = 44.2 bits (105), Expect = 3e-05
Identities = 19/61 (31%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 1393 PFLVPVDISQY--PAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNE 1450
F PVD+ P Y +++ P+DL T++ + + Y DVR ANA ++N
Sbjct: 20 VFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEFAADVRLTFANAMRYNP 79
Query: 1451 P 1451
P
Sbjct: 80 P 80
>gnl|CDD|99929 cd05497, Bromo_Brdt_I_like, Bromodomain, Brdt_like subfamily,
repeat I. Human Brdt is a testis-specific member of the
BET subfamily of bromodomain proteins; the first
bromodomain in Brdt has been shown to be essential for
male germ cell differentiation. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 107
Score = 71.3 bits (175), Expect = 1e-14
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 3/95 (3%)
Query: 123 REVLETLWVCPDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVK 180
+ VL+ LW + PF++PVD KL+ PDY KI+ PMDLGTI++ L + Y S + ++
Sbjct: 11 KVVLKALWKHKFAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQ 70
Query: 181 DVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 215
D +F N YN+ + +M L + + L
Sbjct: 71 DFNTMFTNCYIYNKPGD-DVVLMAQTLEKLFLQKL 104
Score = 71.3 bits (175), Expect = 1e-14
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 3/95 (3%)
Query: 1538 REVLETLWVCPDSEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVK 1595
+ VL+ LW + PF++PVD KL+ PDY KI+ PMDLGTI++ L + Y S + ++
Sbjct: 11 KVVLKALWKHKFAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQ 70
Query: 1596 DVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 1630
D +F N YN+ + +M L + + L
Sbjct: 71 DFNTMFTNCYIYNKPGD-DVVLMAQTLEKLFLQKL 104
Score = 40.1 bits (94), Expect = 0.001
Identities = 22/82 (26%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 1393 PFLVPVDIS--QYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNE 1450
PF PVD P Y +++ P+DL T++ R +N +Y + D + N +N+
Sbjct: 25 PFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQDFNTMFTNCYIYNK 84
Query: 1451 PHSNIVKYARIITDLLLRIIKE 1472
P ++V A+ + L L+ + +
Sbjct: 85 PGDDVVLMAQTLEKLFLQKLAQ 106
>gnl|CDD|99930 cd05498, Bromo_Brdt_II_like, Bromodomain, Brdt_like subfamily,
repeat II. Human Brdt is a testis-specific member of the
BET subfamily of bromodomain proteins; the first
bromodomain in Brdt has been shown to be essential for
male germ cell differentiation. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 102
Score = 68.1 bits (167), Expect = 1e-13
Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 137 PFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
PF KPVD L DY I+ PMDL TI+++L Y +F DV+ +F N YN
Sbjct: 23 PFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKYN- 81
Query: 195 NKRSKIYIMTTRLSEFAEEH 214
++ M +L + E+
Sbjct: 82 PPDHPVHAMARKLQDVFEDR 101
Score = 68.1 bits (167), Expect = 1e-13
Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 1552 PFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
PF KPVD L DY I+ PMDL TI+++L Y +F DV+ +F N YN
Sbjct: 23 PFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKYN- 81
Query: 1610 NKRSKIYIMTTRLSEFAEEH 1629
++ M +L + E+
Sbjct: 82 PPDHPVHAMARKLQDVFEDR 101
Score = 47.3 bits (113), Expect = 3e-06
Identities = 27/90 (30%), Positives = 44/90 (48%), Gaps = 6/90 (6%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQ--YPAYAYVVEYPVDLTTLRARFQNRFYRRIT 1432
C I+ L + A PF PVD Y ++++P+DL+T++ + NR Y
Sbjct: 5 CSGILKELFSKKHKAYAWPFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREYA--D 62
Query: 1433 SAQF--DVRYLAANAEKFNEPHSNIVKYAR 1460
+ +F DVR + +N K+N P + AR
Sbjct: 63 AQEFAADVRLMFSNCYKYNPPDHPVHAMAR 92
>gnl|CDD|99935 cd05503, Bromo_BAZ2A_B_like, Bromodomain, BAZ2A/BAZ2B_like
subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and
2B (BAZ2B) were identified as a novel human bromodomain
gene by cDNA library screening. BAZ2A is also known as
Tip5 (Transcription termination factor I-interacting
protein 5) and hWALp3. The proteins may play roles in
transcriptional regulation. Human Tip5 is part of a
complex termed NoRC (nucleolar remodeling complex),
which induces nucleosome sliding and may play a role in
the regulation of the rDNA locus. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 97
Score = 67.8 bits (166), Expect = 2e-13
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 119 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 178
C +L+ + D+ PF +PV+ P Y KI+ PMD TIRE+L Y++ +F
Sbjct: 2 LALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEF 61
Query: 179 VKDVKRVFHNSRNYNQN 195
+DV+ VF N +N++
Sbjct: 62 AEDVRLVFDNCETFNED 78
Score = 67.8 bits (166), Expect = 2e-13
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 1534 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 1593
C +L+ + D+ PF +PV+ P Y KI+ PMD TIRE+L Y++ +F
Sbjct: 2 LALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEF 61
Query: 1594 VKDVKRVFHNSRNYNQN 1610
+DV+ VF N +N++
Sbjct: 62 AEDVRLVFDNCETFNED 78
Score = 53.1 bits (128), Expect = 2e-08
Identities = 22/65 (33%), Positives = 36/65 (55%)
Query: 1391 AEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNE 1450
A PFL PV+ P Y +++ P+D +T+R + ++ Y+ + DVR + N E FNE
Sbjct: 18 AWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNE 77
Query: 1451 PHSNI 1455
S +
Sbjct: 78 DDSEV 82
Score = 28.9 bits (65), Expect = 8.0
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNI 40
+T+R + ++ Y+ + DVR + N E FNE S +
Sbjct: 43 FSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNEDDSEV 82
>gnl|CDD|99943 cd05511, Bromo_TFIID, Bromodomain, TFIID-like subfamily. Human
TAFII250 (or TAF250) is the largest subunit of TFIID, a
large multi-domain complex, which initiates the assembly
of the transcription machinery. TAFII250 contains two
bromodomains that specifically bind to acetylated
histone H4. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 112
Score = 68.1 bits (167), Expect = 2e-13
Identities = 28/61 (45%), Positives = 40/61 (65%)
Query: 133 PDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNY 192
PDS PF PV+K PDY KI+ PMDL TIR+++ Y+S +F++D++ + NS Y
Sbjct: 16 PDSWPFHTPVNKKKVPDYYKIIKRPMDLQTIRKKISKHKYQSREEFLEDIELIVDNSVLY 75
Query: 193 N 193
N
Sbjct: 76 N 76
Score = 68.1 bits (167), Expect = 2e-13
Identities = 28/61 (45%), Positives = 40/61 (65%)
Query: 1548 PDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNY 1607
PDS PF PV+K PDY KI+ PMDL TIR+++ Y+S +F++D++ + NS Y
Sbjct: 16 PDSWPFHTPVNKKKVPDYYKIIKRPMDLQTIRKKISKHKYQSREEFLEDIELIVDNSVLY 75
Query: 1608 N 1608
N
Sbjct: 76 N 76
Score = 46.5 bits (111), Expect = 8e-06
Identities = 23/97 (23%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
Query: 1376 EKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQ 1435
++I+ L+ + + PF PV+ + P Y +++ P+DL T+R + Y+
Sbjct: 6 DEIVNELKNLPD---SWPFHTPVNKKKVPDYYKIIKRPMDLQTIRKKISKHKYQSREEFL 62
Query: 1436 FDVRYLAANAEKFNEPHSNIVKYARIITDLLLRIIKE 1472
D+ + N+ +N P S K A+ + +L ++ E
Sbjct: 63 EDIELIVDNSVLYNGPDSVYTKKAKEMLELAEELLAE 99
>gnl|CDD|99936 cd05504, Bromo_Acf1_like, Bromodomain; Acf1_like or BAZ1A_like
subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was
identified as a novel human bromodomain gene by cDNA
library screening. The Drosophila homologue, Acf1, is
part of the CHRAC (chromatin accessibility complex) and
regulates ISWI-induced nucleosome remodeling.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 115
Score = 67.0 bits (164), Expect = 4e-13
Identities = 36/97 (37%), Positives = 55/97 (56%), Gaps = 1/97 (1%)
Query: 122 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 181
++L + DS PF +PV K++ PDY I+ PMDLGTI+E+L Y+ +F+ D
Sbjct: 17 LEQLLVEIVKHKDSWPFLRPVSKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSD 76
Query: 182 VKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 218
++ VF N YN + + +Y TRL F + R+L
Sbjct: 77 IQLVFSNCFLYNP-EHTSVYKAGTRLQRFFIKRCRKL 112
Score = 67.0 bits (164), Expect = 4e-13
Identities = 36/97 (37%), Positives = 55/97 (56%), Gaps = 1/97 (1%)
Query: 1537 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 1596
++L + DS PF +PV K++ PDY I+ PMDLGTI+E+L Y+ +F+ D
Sbjct: 17 LEQLLVEIVKHKDSWPFLRPVSKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSD 76
Query: 1597 VKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRL 1633
++ VF N YN + + +Y TRL F + R+L
Sbjct: 77 IQLVFSNCFLYNP-EHTSVYKAGTRLQRFFIKRCRKL 112
Score = 46.2 bits (110), Expect = 9e-06
Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 7/93 (7%)
Query: 1382 LEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQF--DVR 1439
L E++ + PFL PV + P Y +++ P+DL T++ + Y+ + +F D++
Sbjct: 21 LVEIVKHKDSWPFLRPVSKIEVPDYYDIIKKPMDLGTIKEKLNMGEYK--LAEEFLSDIQ 78
Query: 1440 YLAANAEKFNEPHSNIVKYARIITDLLLRIIKE 1472
+ +N +N H+++ K L IK
Sbjct: 79 LVFSNCFLYNPEHTSVYKAGT---RLQRFFIKR 108
>gnl|CDD|99937 cd05505, Bromo_WSTF_like, Bromodomain; Williams syndrome
transcription factor-like subfamily (WSTF-like). The
Williams-Beuren syndrome deletion transcript 9 is a
putative transcriptional regulator. WSTF was found to
play a role in vitamin D-mediated transcription as part
of two chromatin remodeling complexes, WINAC and WICH.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 97
Score = 63.7 bits (155), Expect = 4e-12
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 119 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 178
+C E+L + S PFR+PV + DY K++ PMDL T++ + +Y S +F
Sbjct: 2 LQKCEEILSKILKYRFSWPFREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYSSVQEF 61
Query: 179 VKDVKRVFHNSRNYNQN 195
+ D+K VF N+ Y +N
Sbjct: 62 LDDMKLVFSNAEKYYEN 78
Score = 63.7 bits (155), Expect = 4e-12
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 1534 KNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDF 1593
+C E+L + S PFR+PV + DY K++ PMDL T++ + +Y S +F
Sbjct: 2 LQKCEEILSKILKYRFSWPFREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYSSVQEF 61
Query: 1594 VKDVKRVFHNSRNYNQN 1610
+ D+K VF N+ Y +N
Sbjct: 62 LDDMKLVFSNAEKYYEN 78
Score = 44.1 bits (104), Expect = 3e-05
Identities = 22/82 (26%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSA 1434
CE+I L +++ + PF PV + Y V+ P+DL T++ + Y +
Sbjct: 5 CEEI---LSKILKYRFSWPFREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYSSVQEF 61
Query: 1435 QFDVRYLAANAEKFNEPHSNIV 1456
D++ + +NAEK+ E S ++
Sbjct: 62 LDDMKLVFSNAEKYYENGSYVL 83
>gnl|CDD|99942 cd05510, Bromo_SPT7_like, Bromodomain; SPT7_like subfamily. SPT7 is
a yeast protein that functions as a component of the
transcription regulatory histone acetylation (HAT)
complexes SAGA, SALSA, and SLIK. SAGA is involved in the
RNA polymerase II-dependent transcriptional regulation
of about 10% of all yeast genes. The SPT7 bromodomain
has been shown to weakly interact with acetylated
histone H3, but not H4. The human representative of this
subfamily is cat eye syndrome critical region protein 2
(CECR2). Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 112
Score = 60.9 bits (148), Expect = 5e-11
Identities = 27/87 (31%), Positives = 44/87 (50%)
Query: 133 PDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNY 192
S PF V K + PDY I+ PMDLGT+ ++L+ Y+S +FV D+ ++ N Y
Sbjct: 24 EHSTPFLTKVSKREAPDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCLLY 83
Query: 193 NQNKRSKIYIMTTRLSEFAEEHLRRLT 219
N + + + + AE L+ +
Sbjct: 84 NSDPSHPLRRHANFMKKKAEHLLKLIP 110
Score = 60.9 bits (148), Expect = 5e-11
Identities = 27/87 (31%), Positives = 44/87 (50%)
Query: 1548 PDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNY 1607
S PF V K + PDY I+ PMDLGT+ ++L+ Y+S +FV D+ ++ N Y
Sbjct: 24 EHSTPFLTKVSKREAPDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCLLY 83
Query: 1608 NQNKRSKIYIMTTRLSEFAEEHLRRLT 1634
N + + + + AE L+ +
Sbjct: 84 NSDPSHPLRRHANFMKKKAEHLLKLIP 110
Score = 32.0 bits (73), Expect = 0.87
Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 2/80 (2%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSA 1434
+K++ L+ S PFL V + P Y +++ P+DL T+ + +N Y+
Sbjct: 12 LDKVLNELKTYTEHS--TPFLTKVSKREAPDYYDIIKKPMDLGTMLKKLKNLQYKSKAEF 69
Query: 1435 QFDVRYLAANAEKFNEPHSN 1454
D+ + N +N S+
Sbjct: 70 VDDLNLIWKNCLLYNSDPSH 89
>gnl|CDD|99931 cd05499, Bromo_BDF1_2_II, Bromodomain. BDF1/BDF2 like subfamily,
restricted to fungi, repeat II. BDF1 and BDF2 are yeast
transcription factors involved in the expression of a
wide range of genes, including snRNAs; they are required
for sporulation and DNA repair and protect histone H4
from deacetylation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 102
Score = 60.0 bits (146), Expect = 1e-10
Identities = 35/100 (35%), Positives = 55/100 (55%), Gaps = 8/100 (8%)
Query: 122 CREVLETLWVCPD----SEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESP 175
C EVL+ L + P + PF PVD L+ P+Y I+ PMDLGTI ++L+ Y+S
Sbjct: 5 CEEVLKEL-MKPKHSAYNWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSA 63
Query: 176 IDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 215
+F +DV+ +F N +N + +Y+M +L E +
Sbjct: 64 KEFERDVRLIFKNCYTFNPEG-TDVYMMGHQLEEVFNDKW 102
Score = 60.0 bits (146), Expect = 1e-10
Identities = 35/100 (35%), Positives = 55/100 (55%), Gaps = 8/100 (8%)
Query: 1537 CREVLETLWVCPD----SEPFRKPVD--KLDHPDYDKIVIAPMDLGTIREELRGDNYESP 1590
C EVL+ L + P + PF PVD L+ P+Y I+ PMDLGTI ++L+ Y+S
Sbjct: 5 CEEVLKEL-MKPKHSAYNWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSA 63
Query: 1591 IDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHL 1630
+F +DV+ +F N +N + +Y+M +L E +
Sbjct: 64 KEFERDVRLIFKNCYTFNPEG-TDVYMMGHQLEEVFNDKW 102
Score = 48.8 bits (117), Expect = 8e-07
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Query: 1374 TCEKIITGLEEVMGLSVAEPFLVPVD--ISQYPAYAYVVEYPVDLTTLRARFQNRFYRRI 1431
CE+++ L + + PFL PVD P Y +++ P+DL T+ + QN Y+
Sbjct: 4 FCEEVLKELMKPKHSAYNWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSA 63
Query: 1432 TSAQFDVRYLAANAEKFNEPHSNIVKY 1458
+ DVR + N FN P V
Sbjct: 64 KEFERDVRLIFKNCYTFN-PEGTDVYM 89
>gnl|CDD|99940 cd05508, Bromo_RACK7, Bromodomain, RACK7_like subfamily. RACK7
(also called human protein kinase C-binding protein) was
identified as a potential tumor suppressor genes, it
shares domain architecture with BS69/ZMYND11; both have
been implicated in the regulation of cellular
proliferation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 99
Score = 59.7 bits (145), Expect = 1e-10
Identities = 27/66 (40%), Positives = 35/66 (53%)
Query: 132 CPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 191
P +EPF KPVD PDY + V PMDL T+ + +R Y S F+ D K + HN+
Sbjct: 17 QPGAEPFLKPVDLEQFPDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNAII 76
Query: 192 YNQNKR 197
YN
Sbjct: 77 YNGGDH 82
Score = 59.7 bits (145), Expect = 1e-10
Identities = 27/66 (40%), Positives = 35/66 (53%)
Query: 1547 CPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 1606
P +EPF KPVD PDY + V PMDL T+ + +R Y S F+ D K + HN+
Sbjct: 17 QPGAEPFLKPVDLEQFPDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNAII 76
Query: 1607 YNQNKR 1612
YN
Sbjct: 77 YNGGDH 82
Score = 44.3 bits (105), Expect = 3e-05
Identities = 23/59 (38%), Positives = 34/59 (57%)
Query: 1391 AEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFN 1449
AEPFL PVD+ Q+P YA V P+DL+TL + + Y + D +++ NA +N
Sbjct: 20 AEPFLKPVDLEQFPDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNAIIYN 78
>gnl|CDD|227408 COG5076, COG5076, Transcription factor involved in chromatin
remodeling, contains bromodomain [Chromatin structure
and dynamics / Transcription].
Length = 371
Score = 64.4 bits (157), Expect = 2e-10
Identities = 33/158 (20%), Positives = 65/158 (41%), Gaps = 1/158 (0%)
Query: 75 ETSSSEEEEEEDSKSRQRRVRPTAVTRAQREAMNQKKLKRPFNWKNECREVLETLWVCPD 134
+ E S T+V + + + + L + ++ L
Sbjct: 106 FLAIESVTPESGLGSLLMAHLKTSVKKRKTPKIEDELLYADNKAIAKFKKQLFLRDGRFL 165
Query: 135 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
S F K ++PDY +I+ +PMDL TI+++L+ Y+S +FV D+ +F N + YN
Sbjct: 166 SSIFLGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDNCKLYNG 225
Query: 195 NKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENKK 232
S +Y+ L ++ + + + K ++
Sbjct: 226 -PDSSVYVDAKELEKYFLKLIEEIPEEMLELSIKPGRE 262
Score = 64.4 bits (157), Expect = 2e-10
Identities = 33/158 (20%), Positives = 65/158 (41%), Gaps = 1/158 (0%)
Query: 1490 ETSSSEEEEEEDSKSRQRRVRPTAVTRAQREAMNQKKLKRPFNWKNECREVLETLWVCPD 1549
+ E S T+V + + + + L + ++ L
Sbjct: 106 FLAIESVTPESGLGSLLMAHLKTSVKKRKTPKIEDELLYADNKAIAKFKKQLFLRDGRFL 165
Query: 1550 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
S F K ++PDY +I+ +PMDL TI+++L+ Y+S +FV D+ +F N + YN
Sbjct: 166 SSIFLGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDNCKLYNG 225
Query: 1610 NKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENKK 1647
S +Y+ L ++ + + + K ++
Sbjct: 226 -PDSSVYVDAKELEKYFLKLIEEIPEEMLELSIKPGRE 262
Score = 60.2 bits (146), Expect = 5e-09
Identities = 55/221 (24%), Positives = 85/221 (38%), Gaps = 44/221 (19%)
Query: 1393 PFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPH 1452
FL +YP Y +++ P+DL T++ + +N Y+ D+ + N + +N P
Sbjct: 168 IFLGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDNCKLYNGPD 227
Query: 1453 SNIVKYARIITDLLLRIIKEPNPVDVPVVYHELLQNYETSSSEEEEEEDSKSRQRRVRPT 1512
S++ A+ + L++I+E E+L+ EE EE +S V T
Sbjct: 228 SSVYVDAKELEKYFLKLIEEIPE--------EMLELSIKPGREEREERES------VLIT 273
Query: 1513 AVTRAQREAMNQKKLKRPFNWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAP 1572
A A PF +PV + PDY K + P
Sbjct: 274 NSQ-AHVGAW-----------------------------PFLRPVSDEEVPDYYKDIRDP 303
Query: 1573 MDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRS 1613
MDL T +LR + Y FV+D K F N YN
Sbjct: 304 MDLSTKELKLRNNYYRPEETFVRDAKLFFDNCVMYNGEVTD 344
Score = 44.0 bits (104), Expect = 5e-04
Identities = 48/198 (24%), Positives = 73/198 (36%), Gaps = 44/198 (22%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARIITDLLLRIIKEPNP 60
L T++ + +N Y+ D+ + N + +N P S++ A+ + L++I+E
Sbjct: 191 LLTIQKKLKNGRYKSFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIPE 250
Query: 61 VDVPVVYHELLQNYETSSSEEEEEEDSKSRQRRVRPTAVTRAQREAMNQKKLKRPFNWKN 120
E+L+ EE EE +S V T A A
Sbjct: 251 --------EMLELSIKPGREEREERES------VLITNSQ-AHVGAW------------- 282
Query: 121 ECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVK 180
PF +PV + PDY K + PMDL T +LR + Y FV+
Sbjct: 283 ----------------PFLRPVSDEEVPDYYKDIRDPMDLSTKELKLRNNYYRPEETFVR 326
Query: 181 DVKRVFHNSRNYNQNKRS 198
D K F N YN
Sbjct: 327 DAKLFFDNCVMYNGEVTD 344
>gnl|CDD|99934 cd05502, Bromo_tif1_like, Bromodomain; tif1_like subfamily. Tif1
(transcription intermediary factor 1) is a member of the
tripartite motif (TRIM) protein family, which is
characterized by a particular domain architecture. It
functions by recruiting coactivators and/or corepressors
to modulate transcription. Vertebrate Tif1-gamma, also
labeled E3 ubiquitin-protein ligase TRIM33, plays a role
in the control of hematopoiesis. Its homologue in
Xenopus laevis, Ectodermin, has been shown to function
in germ-layer specification and control of cell growth
during embryogenesis. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 109
Score = 58.5 bits (142), Expect = 5e-10
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 134 DSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDN---YESPIDFVKDVKRVFHNSR 190
S PF +PV P+Y KI+ PMDL IR++L+ + Y SP +FV DV+ +F N
Sbjct: 21 LSLPFHEPV-SPSVPNYYKIIKTPMDLSLIRKKLQPKSPQHYSSPEEFVADVRLMFKNCY 79
Query: 191 NYNQNKRSKIYIMTTRLSEFAEEHLRRL 218
+N+ S++ L F EE L+ +
Sbjct: 80 KFNEED-SEVAQAGKELELFFEEQLKEI 106
Score = 58.5 bits (142), Expect = 5e-10
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 1549 DSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDN---YESPIDFVKDVKRVFHNSR 1605
S PF +PV P+Y KI+ PMDL IR++L+ + Y SP +FV DV+ +F N
Sbjct: 21 LSLPFHEPV-SPSVPNYYKIIKTPMDLSLIRKKLQPKSPQHYSSPEEFVADVRLMFKNCY 79
Query: 1606 NYNQNKRSKIYIMTTRLSEFAEEHLRRL 1633
+N+ S++ L F EE L+ +
Sbjct: 80 KFNEED-SEVAQAGKELELFFEEQLKEI 106
Score = 37.7 bits (88), Expect = 0.008
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 7/89 (7%)
Query: 1375 CEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQ-NRFYRRITS 1433
CE++ L E+ ++ PF PV S P Y +++ P+DL+ +R + Q +
Sbjct: 9 CERL---LLELYCHELSLPFHEPVSPSV-PNYYKIIKTPMDLSLIRKKLQPKSPQHYSSP 64
Query: 1434 AQF--DVRYLAANAEKFNEPHSNIVKYAR 1460
+F DVR + N KFNE S + + +
Sbjct: 65 EEFVADVRLMFKNCYKFNEEDSEVAQAGK 93
>gnl|CDD|99939 cd05507, Bromo_brd8_like, Bromodomain, brd8_like subgroup. In
mammals, brd8 (bromodomain containing 8) interacts with
the thyroid hormone receptor in a ligand-dependent
fashion and enhances thyroid hormone-dependent
activation from thyroid response elements. Brd8 is
thought to be a nuclear receptor coactivator.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 104
Score = 55.4 bits (134), Expect = 4e-09
Identities = 26/79 (32%), Positives = 37/79 (46%)
Query: 117 NWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPI 176
WK V TL + F KPV + P Y +V PMDL TI++ + S
Sbjct: 3 AWKKAILLVYRTLASHRYASVFLKPVTEDIAPGYHSVVYRPMDLSTIKKNIENGTIRSTA 62
Query: 177 DFVKDVKRVFHNSRNYNQN 195
+F +DV +F N+ YN +
Sbjct: 63 EFQRDVLLMFQNAIMYNSS 81
Score = 55.4 bits (134), Expect = 4e-09
Identities = 26/79 (32%), Positives = 37/79 (46%)
Query: 1532 NWKNECREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPI 1591
WK V TL + F KPV + P Y +V PMDL TI++ + S
Sbjct: 3 AWKKAILLVYRTLASHRYASVFLKPVTEDIAPGYHSVVYRPMDLSTIKKNIENGTIRSTA 62
Query: 1592 DFVKDVKRVFHNSRNYNQN 1610
+F +DV +F N+ YN +
Sbjct: 63 EFQRDVLLMFQNAIMYNSS 81
Score = 46.2 bits (110), Expect = 8e-06
Identities = 23/80 (28%), Positives = 35/80 (43%)
Query: 1391 AEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNE 1450
A FL PV P Y VV P+DL+T++ +N R Q DV + NA +N
Sbjct: 21 ASVFLKPVTEDIAPGYHSVVYRPMDLSTIKKNIENGTIRSTAEFQRDVLLMFQNAIMYNS 80
Query: 1451 PHSNIVKYARIITDLLLRII 1470
++ A + ++ I
Sbjct: 81 SDHDVYLMAVEMQREVMSQI 100
>gnl|CDD|99946 cd05515, Bromo_polybromo_V, Bromodomain, polybromo repeat V.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 105
Score = 54.6 bits (132), Expect = 9e-09
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 122 CREVLETLWVCPD------SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESP 175
E+ + D S F + K ++PDY ++ P+D+ IR ++ G+ Y+S
Sbjct: 5 LWELYNAVKNYTDGRGRRLSLIFMRLPSKSEYPDYYDVIKKPIDMEKIRSKIEGNQYQSL 64
Query: 176 IDFVKDVKRVFHNSRNYNQNKRSKIY 201
D V D +F N+ YN+ S+IY
Sbjct: 65 DDMVSDFVLMFDNACKYNE-PDSQIY 89
Score = 54.6 bits (132), Expect = 9e-09
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 1537 CREVLETLWVCPD------SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESP 1590
E+ + D S F + K ++PDY ++ P+D+ IR ++ G+ Y+S
Sbjct: 5 LWELYNAVKNYTDGRGRRLSLIFMRLPSKSEYPDYYDVIKKPIDMEKIRSKIEGNQYQSL 64
Query: 1591 IDFVKDVKRVFHNSRNYNQNKRSKIY 1616
D V D +F N+ YN+ S+IY
Sbjct: 65 DDMVSDFVLMFDNACKYNE-PDSQIY 89
Score = 49.6 bits (119), Expect = 4e-07
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 1401 SQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYAR 1460
S+YP Y V++ P+D+ +R++ + Y+ + D + NA K+NEP S I K A
Sbjct: 34 SEYPDYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQIYKDAL 93
Query: 1461 IITDLLLR 1468
+ +LL
Sbjct: 94 TLQKVLLE 101
Score = 33.0 bits (76), Expect = 0.27
Identities = 15/53 (28%), Positives = 26/53 (49%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARIITDLLLR 53
+ +R++ + Y+ + D + NA K+NEP S I K A + +LL
Sbjct: 49 MEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQIYKDALTLQKVLLE 101
>gnl|CDD|99948 cd05517, Bromo_polybromo_II, Bromodomain, polybromo repeat II.
Polybromo is a nuclear protein of unknown function, which
contains 6 bromodomains. The human ortholog BAF180 is
part of a SWI/SNF chromatin-remodeling complex, and it
may carry out the functions of Yeast Rsc-1 and Rsc-2. It
was shown that polybromo bromodomains bind to histone H3
at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 53.2 bits (128), Expect = 3e-08
Identities = 23/61 (37%), Positives = 35/61 (57%)
Query: 1402 QYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARI 1461
YP Y V++ P+DL T+ R Q+ +Y+ I + D+ + NA+ FNEP S + K A
Sbjct: 35 LYPDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANA 94
Query: 1462 I 1462
I
Sbjct: 95 I 95
Score = 35.9 bits (83), Expect = 0.026
Identities = 20/60 (33%), Positives = 36/60 (60%)
Query: 135 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
SE F+K K+ +PDY ++ P+DL TI + ++ Y+S D KD+ + N++ +N+
Sbjct: 24 SELFQKLPSKVLYPDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNE 83
Score = 35.9 bits (83), Expect = 0.026
Identities = 20/60 (33%), Positives = 36/60 (60%)
Query: 1550 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
SE F+K K+ +PDY ++ P+DL TI + ++ Y+S D KD+ + N++ +N+
Sbjct: 24 SELFQKLPSKVLYPDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNE 83
Score = 34.7 bits (80), Expect = 0.069
Identities = 17/47 (36%), Positives = 26/47 (55%)
Query: 1 LTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKYARII 47
L T+ R Q+ +Y+ I + D+ + NA+ FNEP S + K A I
Sbjct: 49 LKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANAI 95
>gnl|CDD|99949 cd05518, Bromo_polybromo_IV, Bromodomain, polybromo repeat IV.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 51.3 bits (123), Expect = 1e-07
Identities = 26/62 (41%), Positives = 41/62 (66%), Gaps = 2/62 (3%)
Query: 140 KPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRSK 199
KP K D+PDY KI++ P+DL TI +R D Y + + + D K +F N+R+YN+ + S+
Sbjct: 30 KPSKK-DYPDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNE-EGSQ 87
Query: 200 IY 201
+Y
Sbjct: 88 VY 89
Score = 51.3 bits (123), Expect = 1e-07
Identities = 26/62 (41%), Positives = 41/62 (66%), Gaps = 2/62 (3%)
Query: 1555 KPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRSK 1614
KP K D+PDY KI++ P+DL TI +R D Y + + + D K +F N+R+YN+ + S+
Sbjct: 30 KPSKK-DYPDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNE-EGSQ 87
Query: 1615 IY 1616
+Y
Sbjct: 88 VY 89
Score = 35.1 bits (81), Expect = 0.052
Identities = 21/83 (25%), Positives = 36/83 (43%)
Query: 1384 EVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAA 1443
E G + + F+ YP Y ++ P+DL T+ +N Y D + +
Sbjct: 17 EGSGRRLCDLFMEKPSKKDYPDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFR 76
Query: 1444 NAEKFNEPHSNIVKYARIITDLL 1466
NA +NE S + + A I+ +L
Sbjct: 77 NARHYNEEGSQVYEDANILEKVL 99
>gnl|CDD|99957 cd05528, Bromo_AAA, Bromodomain; sub-family co-occurring with AAA
domains. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine. The structure(2DKW) in this alignment is an
uncharacterized protein predicted from analysis of cDNA
clones from human fetal liver.
Length = 112
Score = 48.1 bits (115), Expect = 2e-06
Identities = 24/57 (42%), Positives = 35/57 (61%)
Query: 137 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 193
F KPVD+ + PDY +I+ PMDL TI ++L Y + DF+KD+ + N+ YN
Sbjct: 23 AFTKPVDEEEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYN 79
Score = 48.1 bits (115), Expect = 2e-06
Identities = 24/57 (42%), Positives = 35/57 (61%)
Query: 1552 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 1608
F KPVD+ + PDY +I+ PMDL TI ++L Y + DF+KD+ + N+ YN
Sbjct: 23 AFTKPVDEEEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYN 79
Score = 38.1 bits (89), Expect = 0.006
Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 4/59 (6%)
Query: 1393 PFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQF--DVRYLAANAEKFN 1449
F PVD + P Y +++ P+DL T+ + ++ +T+ F D+ + NA ++N
Sbjct: 23 AFTKPVDEEEVPDYYEIIKQPMDLQTILQKLDT--HQYLTAKDFLKDIDLIVTNALEYN 79
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 45.8 bits (109), Expect = 2e-06
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 753 NGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 793
+G+L++ LKGHT V + P D + L S DG I +WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
Score = 42.3 bits (100), Expect = 3e-05
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 507 DGKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 546
G+LL TL+G +G ++ +A LA+G+ D TI++W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 41.1 bits (97), Expect = 9e-05
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 656 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 690
+ ++ H+ V S+ +S SGS DGT +W
Sbjct: 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 38.8 bits (91), Expect = 5e-04
Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 4/44 (9%)
Query: 549 TLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 592
+ + L GH G +T V F P +G YLAS S DG I W
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSP---DG-KYLASGSDDGTIKLWD 40
Score = 36.5 bits (85), Expect = 0.004
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 474 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 503
GH V V F G+ + +G+DD IKLW
Sbjct: 10 GHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 36.1 bits (84), Expect = 0.004
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 618 NAHILCSSFSPGGLFLATGSGDHHVRVYK 646
+ +FSP G +LA+GS D ++++
Sbjct: 12 TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 52.4 bits (126), Expect = 3e-06
Identities = 46/358 (12%), Positives = 103/358 (28%), Gaps = 59/358 (16%)
Query: 1578 IREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRLTVRW 1637
I E E +D + + +Y+ Y+++ + +E + +L
Sbjct: 1059 ITAEEEEGAEEDDEADDEDDEEELGAAVSYD-------YLLSMPIWSLTKEKVEKLN--- 1108
Query: 1638 NSHKNKEN--------------KKDSKRGEKKIKRMKRRRRKIRRT---LVESDDEVLSR 1680
+ KE +D + E+ ++ + K L S+
Sbjct: 1109 AELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASK 1168
Query: 1681 KRRGKSTYENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEED 1740
R+ K + + K K+ + + V +R S+ R +
Sbjct: 1169 LRKPKLKKKE--------KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNS 1220
Query: 1741 VDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTE 1800
D D E+ + S+ + + + +E +S+ E
Sbjct: 1221 SGSDQEDDEEQKTKPKKSSVKRLKSKKNNS--SKSSEDNDEFSSDDLSKEGKPKNAPKRV 1278
Query: 1801 QPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTR 1860
+ P +R +G +S + +
Sbjct: 1279 SAVQYSP-PPPSKRPDGESNGGSKPSSPTKKKVKKR-----------------LEGSLAA 1320
Query: 1861 LVKSNEPEDEEDDDDESDEDYSPSTHA----VTKRKRSVRVEKSNEVDEDEESDEDYK 1914
L K + E + +S ++ + + +R R + + S+E D+D E D+
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSED 1378
Score = 50.4 bits (121), Expect = 9e-06
Identities = 44/277 (15%), Positives = 89/277 (32%), Gaps = 47/277 (16%)
Query: 1641 KNKENKKDSKRGEK-KIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDF----N 1695
KE + SK K R + ++K ++ S D+ G S + + +
Sbjct: 1152 IAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDD 1211
Query: 1696 PAGETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPST 1755
K+ + +++ +E + ++SS + + ++N ED D + D
Sbjct: 1212 KPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKP 1271
Query: 1756 SYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRG 1815
A S + + + + S P + T ++ K+L+
Sbjct: 1272 KNAPKRVSAVQ--------YSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLE---GSLAA 1320
Query: 1816 QQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDD 1875
+ + S + + K + R R + + + +DDD
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASAS-----------QSSRLLRRPRKKKSDSSSEDDD 1369
Query: 1876 ESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDED 1912
+S+ D S DED+E DED
Sbjct: 1370 DSEVDDS--------------------EDEDDEDDED 1386
Score = 33.1 bits (76), Expect = 2.0
Identities = 45/263 (17%), Positives = 88/263 (33%), Gaps = 27/263 (10%)
Query: 1494 SEEEEEEDSKSRQRRVRPTAVTRAQREAMNQKKLKRPFNWKNECREVLETLWVCPDSEPF 1553
+EE EE ++++R++ +A + + K K K+ + + V
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVD 1202
Query: 1554 RKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRS 1613
KLD +K + EE + +S + +K K S N S
Sbjct: 1203 SDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSS 1262
Query: 1614 KIYIMTTRLSEFAEEHLRRLTVRWNSHK------NKENKKDSKRGEKKIKRMKRRRRKIR 1667
+ +R++ S + E+ SK K++K+R
Sbjct: 1263 D----DLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSL 1318
Query: 1668 RTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSR 1727
L + R+ KS T+ K S ++ +R+ RR +
Sbjct: 1319 AALKKKKKSEKKTARKKKSK------------TRVKQASASQS-----SRLLRRPRKKKS 1361
Query: 1728 SNSQDNEENDEEDVDIDNFDPEQ 1750
+S +++++ E D D D +
Sbjct: 1362 DSSSEDDDDSEVDDSEDEDDEDD 1384
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 45.0 bits (107), Expect = 4e-06
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 754 GDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 793
G L++ LKGHT V + P D +L S DG + +WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSP-DGNLLASGSDDGTVRVWD 39
Score = 42.3 bits (100), Expect = 3e-05
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 508 GKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 546
GKLL TL+G +G ++ +A LLA+G+ D T+R+W+
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 41.6 bits (98), Expect = 5e-05
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 656 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 690
+ ++ H+ V S+ +S SGS DGT +W
Sbjct: 4 LRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 36.9 bits (86), Expect = 0.003
Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 4/41 (9%)
Query: 552 PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 592
+ L GH G +T V F P +G N LAS S DG + W
Sbjct: 3 LLRTLKGHTGPVTSVAFSP---DG-NLLASGSDDGTVRVWD 39
Score = 36.9 bits (86), Expect = 0.003
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 474 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 503
GH V V F G ++ +G+DD +++W
Sbjct: 9 GHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 35.0 bits (81), Expect = 0.012
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 618 NAHILCSSFSPGGLFLATGSGDHHVRVYK 646
+ +FSP G LA+GS D VRV+
Sbjct: 11 TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|99952 cd05521, Bromo_Rsc1_2_I, Bromodomain, repeat I in Rsc1/2_like
subfamily, specific to fungi. Rsc1 and Rsc2 are
components of the RSC complex (remodeling the structure
of chromatin), are essential for transcriptional
control, and have a specific domain architecture
including two bromodomains. The RSC complex has also
been linked to homologous recombination and
nonhomologous end-joining repair of DNA double strand
breaks. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 106
Score = 45.8 bits (109), Expect = 9e-06
Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 109 QKKLKRPFNWKNEC-REVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREEL 167
+ + K E E+ V P + D+PDY KI+ P+ L T+++ L
Sbjct: 7 KPLYDGIYTLKEENGIEIHPIFNVLPL---------RKDYPDYYKIIKNPLSLNTVKKRL 57
Query: 168 RGDNYESPIDFVKDVKRVFHNSRNYNQNKRSKIY 201
+Y + +FV D+ ++ N+R YN K S IY
Sbjct: 58 P--HYTNAQEFVNDLAQIPWNARLYN-TKGSVIY 88
Score = 45.8 bits (109), Expect = 9e-06
Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 1524 QKKLKRPFNWKNEC-REVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREEL 1582
+ + K E E+ V P + D+PDY KI+ P+ L T+++ L
Sbjct: 7 KPLYDGIYTLKEENGIEIHPIFNVLPL---------RKDYPDYYKIIKNPLSLNTVKKRL 57
Query: 1583 RGDNYESPIDFVKDVKRVFHNSRNYNQNKRSKIY 1616
+Y + +FV D+ ++ N+R YN K S IY
Sbjct: 58 P--HYTNAQEFVNDLAQIPWNARLYN-TKGSVIY 88
Score = 41.2 bits (97), Expect = 4e-04
Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 2/89 (2%)
Query: 1382 LEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYL 1441
L+E G+ + F V YP Y +++ P+ L T++ R + Y D+ +
Sbjct: 16 LKEENGIEIHPIFNVLPLRKDYPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQI 73
Query: 1442 AANAEKFNEPHSNIVKYARIITDLLLRII 1470
NA +N S I KYA I+ + +I
Sbjct: 74 PWNARLYNTKGSVIYKYALILEKYINDVI 102
>gnl|CDD|99944 cd05512, Bromo_brd1_like, Bromodomain; brd1_like subfamily. BRD1 is
a mammalian gene which encodes for a nuclear protein
assumed to be a transcriptional regulator. BRD1 has been
implicated with brain development and susceptibility to
schizophrenia and bipolar affective disorder.
Bromodomains are 110 amino acid long domains that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 98
Score = 45.5 bits (108), Expect = 1e-05
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 123 REVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDV 182
R+ L+ L +E F +PVD + PDY + PMD T+R++L Y + DF D
Sbjct: 7 RKTLDQLQEKDTAEIFSEPVDLSEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADF 66
Query: 183 KRVFHNSRNYNQNKRSKIYIMTTRLSE 209
+ +N YN K + Y RL +
Sbjct: 67 NLIINNCLAYN-AKDTIFYRAAVRLRD 92
Score = 45.5 bits (108), Expect = 1e-05
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 1538 REVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDV 1597
R+ L+ L +E F +PVD + PDY + PMD T+R++L Y + DF D
Sbjct: 7 RKTLDQLQEKDTAEIFSEPVDLSEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADF 66
Query: 1598 KRVFHNSRNYNQNKRSKIYIMTTRLSE 1624
+ +N YN K + Y RL +
Sbjct: 67 NLIINNCLAYN-AKDTIFYRAAVRLRD 92
Score = 44.7 bits (106), Expect = 2e-05
Identities = 19/68 (27%), Positives = 37/68 (54%)
Query: 1382 LEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYL 1441
L+++ AE F PVD+S+ P Y ++ P+D +T+R + +++ YR + + D +
Sbjct: 10 LDQLQEKDTAEIFSEPVDLSEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLI 69
Query: 1442 AANAEKFN 1449
N +N
Sbjct: 70 INNCLAYN 77
>gnl|CDD|99947 cd05516, Bromo_SNF2L2, Bromodomain, SNF2L2-like subfamily, specific
to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related
matrix-associated actin-dependent regulator of chromatin
subfamily A member 2 is a global transcriptional
activator, which cooperates with nuclear hormone
receptors to boost transcriptional activation.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 107
Score = 45.1 bits (107), Expect = 2e-05
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 135 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
+E F + + + P+Y +++ P+D I+E +R Y S D KDV + N++ +N
Sbjct: 25 AEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNL 84
Query: 195 NKRSKIY 201
+ S IY
Sbjct: 85 -EGSLIY 90
Score = 45.1 bits (107), Expect = 2e-05
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 1550 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
+E F + + + P+Y +++ P+D I+E +R Y S D KDV + N++ +N
Sbjct: 25 AEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNL 84
Query: 1610 NKRSKIY 1616
+ S IY
Sbjct: 85 -EGSLIY 90
Score = 35.9 bits (83), Expect = 0.028
Identities = 22/69 (31%), Positives = 33/69 (47%)
Query: 1387 GLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAE 1446
G +AE F+ + P Y ++ PVD ++ R +N YR + + DV L NA+
Sbjct: 21 GRQLAEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQ 80
Query: 1447 KFNEPHSNI 1455
FN S I
Sbjct: 81 TFNLEGSLI 89
>gnl|CDD|99950 cd05519, Bromo_SNF2, Bromodomain, SNF2-like subfamily, specific to
fungi. SNF2 is a yeast protein involved in
transcriptional activation, it is the catalytic component
of the SWI/SNF ATP-dependent chromatin remodeling
complex. The protein is essential for the regulation of
gene expression (both positive and negative) of a large
number of genes. The SWI/SNF complex changes chromatin
structure by altering DNA-histone contacts within the
nucleosome, which results in a re-positioning of the
nucleosome and facilitates or represses the binding of
gene-specific transcription factors. Bromodomains are 110
amino acid long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 103
Score = 42.7 bits (101), Expect = 1e-04
Identities = 19/80 (23%), Positives = 35/80 (43%)
Query: 1374 TCEKIITGLEEVMGLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITS 1433
+ E+ G ++E FL YP Y +++ P+ L ++ R + R Y+ +
Sbjct: 7 EIYDAVLNCEDETGRKLSELFLEKPSKKLYPDYYVIIKRPIALDQIKRRIEGRAYKSLEE 66
Query: 1434 AQFDVRYLAANAEKFNEPHS 1453
D + ANA +N+ S
Sbjct: 67 FLEDFHLMFANARTYNQEGS 86
Score = 42.7 bits (101), Expect = 1e-04
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 135 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
SE F + K +PDY I+ P+ L I+ + G Y+S +F++D +F N+R YNQ
Sbjct: 24 SELFLEKPSKKLYPDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYNQ 83
Score = 42.7 bits (101), Expect = 1e-04
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 1550 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
SE F + K +PDY I+ P+ L I+ + G Y+S +F++D +F N+R YNQ
Sbjct: 24 SELFLEKPSKKLYPDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYNQ 83
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 46.4 bits (110), Expect = 2e-04
Identities = 50/304 (16%), Positives = 99/304 (32%), Gaps = 84/304 (27%)
Query: 1643 KENKKDSKRGE----KKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAG 1698
++ ++ + RGE KK+ R ++ ++T ++ + + + G S E +
Sbjct: 1180 EKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVA---- 1235
Query: 1699 ETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDE-----EDVDIDNFDPEQPGP 1753
E VK + R + ++ + +++ EE DE + + N D P
Sbjct: 1236 -------------EVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLD-SAPAQ 1281
Query: 1754 STSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKK-------- 1805
S E V + R A R + +D++ E E+ KK
Sbjct: 1282 SAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAA 1341
Query: 1806 -----LKVLYPKRRGQQSEDGSGDNASEMSELKQD-----------------------PL 1837
K+RG + +EM + + L
Sbjct: 1342 NKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVL 1401
Query: 1838 SLHDENEDEDNTNEM---------------------GKRKRTTRLVKSNEPEDEEDDDDE 1876
N++ +++ + RK+TT ++ +E E +D D +
Sbjct: 1402 GRAATNKETESSENVSGSSSSEKDEIDVSAKPRPQRANRKQTTYVLSDSESESADDSDFD 1461
Query: 1877 SDED 1880
DED
Sbjct: 1462 DDED 1465
Score = 33.3 bits (76), Expect = 1.9
Identities = 25/136 (18%), Positives = 41/136 (30%), Gaps = 17/136 (12%)
Query: 1773 AAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQS--------EDGSGD 1824
A +K ++++ +E TE A+ +K PK R E D
Sbjct: 1205 PAPKKTTKKASESETTEETYGSSAMETENVAEVVK---PKGRAGAKKKAPAAAKEKEEED 1261
Query: 1825 NASEMSE------LKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESD 1878
++ + L P E R L + D +DDDD+
Sbjct: 1262 EILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFA 1321
Query: 1879 EDYSPSTHAVTKRKRS 1894
+ S + K R
Sbjct: 1322 VEVSLAERLKKKGGRK 1337
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 44.4 bits (105), Expect = 3e-04
Identities = 29/161 (18%), Positives = 62/161 (38%), Gaps = 14/161 (8%)
Query: 1720 RRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFK 1779
S+ NS D N I+ + + P + + E + S + K
Sbjct: 138 GERSDSENGNSLDGVMNGVSLSKIEWLNEDFPSELSPFIERIDSVNGPGEPEPEDDPKDS 197
Query: 1780 EESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSL 1839
+ S ++ D+++ K L+ + E S D A+ + +
Sbjct: 198 LGNGS------STNGLPDSSQDKNKSLE--------EYYEKESSDAAASQDDGPKGSDVK 243
Query: 1840 HDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDED 1880
+ +++DE++ ++ G L + E E+ E++++E DED
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284
>gnl|CDD|99953 cd05522, Bromo_Rsc1_2_II, Bromodomain, repeat II in Rsc1/2_like
subfamily, specific to fungi. Rsc1 and Rsc2 are
components of the RSC complex (remodeling the structure
of chromatin), are essential for transcriptional
control, and have a specific domain architecture
including two bromodomains. The RSC complex has also
been linked to homologous recombination and
nonhomologous end-joining repair of DNA double strand
breaks. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 104
Score = 41.1 bits (97), Expect = 5e-04
Identities = 18/59 (30%), Positives = 34/59 (57%)
Query: 137 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQN 195
F K DK P+Y + + P+ L I+++++ Y+S F+ D+ +F N++ YN+N
Sbjct: 27 HFEKLPDKAREPEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNEN 85
Score = 41.1 bits (97), Expect = 5e-04
Identities = 18/59 (30%), Positives = 34/59 (57%)
Query: 1552 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQN 1610
F K DK P+Y + + P+ L I+++++ Y+S F+ D+ +F N++ YN+N
Sbjct: 27 HFEKLPDKAREPEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNEN 85
Score = 31.4 bits (72), Expect = 0.98
Identities = 15/63 (23%), Positives = 29/63 (46%)
Query: 1399 DISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHSNIVKY 1458
D ++ P Y + P+ L ++ + + R Y+ D+ + NA+ +NE S K
Sbjct: 33 DKAREPEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKD 92
Query: 1459 ARI 1461
A +
Sbjct: 93 AVL 95
>gnl|CDD|99945 cd05513, Bromo_brd7_like, Bromodomain, brd7_like subgroup. The BRD7
gene encodes a nuclear protein that has been shown to
inhibit cell growth and the progression of the cell
cycle by regulating cell-cycle genes at the
transcriptional level. BRD7 has been identified as a
gene involved in nasopharyngeal carcinoma. The protein
interacts with acetylated histone H3 via its
bromodomain. Bromodomains are 110 amino acid long
domains that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 98
Score = 40.9 bits (96), Expect = 5e-04
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 138 FRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 193
F PV P Y I+ PMD T++E+++ ++Y+S +F D K + N+ YN
Sbjct: 22 FAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNNDYQSIEEFKDDFKLMCENAMKYN 77
Score = 40.9 bits (96), Expect = 5e-04
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 1553 FRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 1608
F PV P Y I+ PMD T++E+++ ++Y+S +F D K + N+ YN
Sbjct: 22 FAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNNDYQSIEEFKDDFKLMCENAMKYN 77
Score = 36.2 bits (84), Expect = 0.017
Identities = 19/67 (28%), Positives = 38/67 (56%)
Query: 1394 FLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHS 1453
F PV P Y+ ++++P+D +T++ + +N Y+ I + D + + NA K+N+P +
Sbjct: 22 FAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNNDYQSIEEFKDDFKLMCENAMKYNKPDT 81
Query: 1454 NIVKYAR 1460
K A+
Sbjct: 82 IYYKAAK 88
>gnl|CDD|99951 cd05520, Bromo_polybromo_III, Bromodomain, polybromo repeat III.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 40.8 bits (96), Expect = 5e-04
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 135 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 194
+EPF K K +PDY + + P+ L IR +L+ YE+ + D+ +F N++ YN
Sbjct: 24 AEPFLKLPSKRKYPDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNV 83
Query: 195 NKRSKIYIMTTRL 207
S+IY +L
Sbjct: 84 -PNSRIYKDAEKL 95
Score = 40.8 bits (96), Expect = 5e-04
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 1550 SEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQ 1609
+EPF K K +PDY + + P+ L IR +L+ YE+ + D+ +F N++ YN
Sbjct: 24 AEPFLKLPSKRKYPDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNV 83
Query: 1610 NKRSKIYIMTTRL 1622
S+IY +L
Sbjct: 84 -PNSRIYKDAEKL 95
Score = 35.8 bits (83), Expect = 0.031
Identities = 23/73 (31%), Positives = 39/73 (53%)
Query: 1387 GLSVAEPFLVPVDISQYPAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAE 1446
G +AEPFL +YP Y ++ P+ L +R + +N Y + + D+ + NA+
Sbjct: 20 GQLLAEPFLKLPSKRKYPDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAK 79
Query: 1447 KFNEPHSNIVKYA 1459
++N P+S I K A
Sbjct: 80 RYNVPNSRIYKDA 92
>gnl|CDD|99933 cd05501, Bromo_SP100C_like, Bromodomain, SP100C_like subfamily. The
SP100C protein is a splice variant of SP100, a major
component of PML-SP100 nuclear bodies (NBs), which are
poorly understood. It is covalently modified by SUMO-1
and may play a role in processes at the chromatin level.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 102
Score = 40.5 bits (95), Expect = 8e-04
Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 122 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 181
C +L ++ S F DY + + PM L ++E L Y + FV+D
Sbjct: 7 CEFLLLKVYCMSKSGFFIS--KPYYIRDYCQGIKEPMWLNKVKERLNERVYHTVEGFVRD 64
Query: 182 VKRVFHNSRNYNQNKR 197
++ +FHN + + ++
Sbjct: 65 MRLIFHNHKLFYKDDD 80
Score = 40.5 bits (95), Expect = 8e-04
Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 1537 CREVLETLWVCPDSEPFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKD 1596
C +L ++ S F DY + + PM L ++E L Y + FV+D
Sbjct: 7 CEFLLLKVYCMSKSGFFIS--KPYYIRDYCQGIKEPMWLNKVKERLNERVYHTVEGFVRD 64
Query: 1597 VKRVFHNSRNYNQNKR 1612
++ +FHN + + ++
Sbjct: 65 MRLIFHNHKLFYKDDD 80
>gnl|CDD|99923 cd05491, Bromo_TBP7_like, Bromodomain; TBP7_like subfamily, limited
to fungi. TBP7, or TAT-binding protein homolog 7, is a
yeast protein of unknown function that contains
AAA-superfamily ATP-ase domains and a bromodomain.
Bromodomains are found in many chromatin-associated
proteins and in nuclear histone acetyltransferases. They
interact specifically with acetylated lysine.
Length = 119
Score = 39.4 bits (92), Expect = 0.002
Identities = 15/34 (44%), Positives = 22/34 (64%)
Query: 158 MDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 191
MDL TI E L Y +P DF+KD+KR+ +++
Sbjct: 65 MDLDTIEERLWNGYYATPKDFLKDIKRIVRDAKT 98
Score = 39.4 bits (92), Expect = 0.002
Identities = 15/34 (44%), Positives = 22/34 (64%)
Query: 1573 MDLGTIREELRGDNYESPIDFVKDVKRVFHNSRN 1606
MDL TI E L Y +P DF+KD+KR+ +++
Sbjct: 65 MDLDTIEERLWNGYYATPKDFLKDIKRIVRDAKT 98
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 41.6 bits (97), Expect = 0.004
Identities = 44/226 (19%), Positives = 77/226 (34%), Gaps = 40/226 (17%)
Query: 1704 AESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVAS 1763
A +G + + R R +S +E+N+E ++ E G +E
Sbjct: 10 ALAGIDDSDVASNRKRAHDDLDDVLSSSSDEDNNEN----VDYAEESGGEGNEKSEEEKF 65
Query: 1764 TSRGRFAGRAAKRKFKEESASE------SQEDENSIPETDTTEQPAKKLKVLYPKRRGQQ 1817
+ R G KFK+E+ E E+ + E + + + L + Q
Sbjct: 66 KNPYRLEG-----KFKDEADRAKIMAMTEIERESILFEREEEISKLMERRELAIRMEQQH 120
Query: 1818 SEDGSGD-NASEMSELKQDPLSLHDENEDEDNTNEMGKRK-RTTRLVKSN----EPEDEE 1871
G D S E PL+ E E + E+ K + R RL + +
Sbjct: 121 RSSGCTDTRRSTRYE----PLTSAAE-EKKKKLLELKKTREREERLYSERHIELQRFKDY 175
Query: 1872 DDDDES----DEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDY 1913
+ +ES E+Y+PS E ++ ++ E Y
Sbjct: 176 KELEESEQGLQEEYTPSY----------AEEAVEDISRTDDFAELY 211
>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
Length = 313
Score = 39.4 bits (92), Expect = 0.013
Identities = 31/132 (23%), Positives = 47/132 (35%), Gaps = 12/132 (9%)
Query: 1787 QEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDE 1846
+DE PE + P +++ R ++ E G S L DPL +E
Sbjct: 47 SDDEQPAPERGFAQAPEDDFEII----RKERKEPDFGRENSFHDPLIDDPLF--GGELEE 100
Query: 1847 DNTNEMGKRKRTTRLVKSN------EPEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKS 1900
+ + +S EP+ EE D+E E+ P V + E
Sbjct: 101 EEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEETE 160
Query: 1901 NEVDEDEESDED 1912
EVDE EE +
Sbjct: 161 IEVDEPEEPKPE 172
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.0 bits (93), Expect = 0.015
Identities = 38/220 (17%), Positives = 74/220 (33%), Gaps = 21/220 (9%)
Query: 1697 AGETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTS 1756
GE + +A+ + + E V + + E E+ + + E
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKH 753
Query: 1757 YAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQ 1816
E G ++E E Q E+ + D + +
Sbjct: 754 EVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVE--------HEG 805
Query: 1817 QSEDGSGDNASEMSELKQDPLSLHDENEDED-NTNEMGKRKRTTRLVKSNEPEDEEDDDD 1875
++E G D SE + D + DE +++ N G+ K+ + V D D ++
Sbjct: 806 ETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEE 865
Query: 1876 ESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
E +E+ E+ E +E+EE +E+ +P
Sbjct: 866 EEEEEEEEEE------------EEEEEEEEEEEEEENEEP 893
>gnl|CDD|99955 cd05525, Bromo_ASH1, Bromodomain; ASH1_like sub-family. ASH1
(absent, small, or homeotic 1) is a member of the
trithorax-group in Drosophila melanogaster, an
epigenetic transcriptional regulator of HOX genes.
Drosophila ASH1 has been shown to methylate specific
lysines in histones H3 and H4. Mammalian ASH1 has been
shown to methylate histone H3. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 106
Score = 36.6 bits (85), Expect = 0.018
Identities = 19/57 (33%), Positives = 31/57 (54%)
Query: 137 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 193
PF K +PDY + + P+DL TI +++ Y++P F D+ +VF N+ Y
Sbjct: 28 PFINLPSKKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYY 84
Score = 36.6 bits (85), Expect = 0.018
Identities = 19/57 (33%), Positives = 31/57 (54%)
Query: 1552 PFRKPVDKLDHPDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYN 1608
PF K +PDY + + P+DL TI +++ Y++P F D+ +VF N+ Y
Sbjct: 28 PFINLPSKKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYY 84
>gnl|CDD|99954 cd05524, Bromo_polybromo_I, Bromodomain, polybromo repeat I.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 113
Score = 36.5 bits (85), Expect = 0.022
Identities = 16/71 (22%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 148 PDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRL 207
P+Y ++V P+DL I+++L+ + Y+ D D + + +N++ Y + S + +L
Sbjct: 39 PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYK-PDSPEHKDACKL 97
Query: 208 SEFAEEHLRRL 218
E +
Sbjct: 98 WELFLSARNEV 108
Score = 36.5 bits (85), Expect = 0.022
Identities = 16/71 (22%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 1563 PDYDKIVIAPMDLGTIREELRGDNYESPIDFVKDVKRVFHNSRNYNQNKRSKIYIMTTRL 1622
P+Y ++V P+DL I+++L+ + Y+ D D + + +N++ Y + S + +L
Sbjct: 39 PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYK-PDSPEHKDACKL 97
Query: 1623 SEFAEEHLRRL 1633
E +
Sbjct: 98 WELFLSARNEV 108
Score = 30.0 bits (68), Expect = 3.8
Identities = 14/50 (28%), Positives = 23/50 (46%)
Query: 1404 PAYAYVVEYPVDLTTLRARFQNRFYRRITSAQFDVRYLAANAEKFNEPHS 1453
P Y VV P+DL ++ + + Y + D L NA+ + +P S
Sbjct: 39 PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPDS 88
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit of
transcription factor IIF (TFIIF), which is essential for
accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.2 bits (91), Expect = 0.025
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 10/92 (10%)
Query: 1829 MSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAV 1888
M K P + DE+E+ + G R + ++ + D+EDD DESD+
Sbjct: 183 MKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKI--KDLEGDDEDDGDESDKGGEDGDEEK 240
Query: 1889 TKRK--------RSVRVEKSNEVDEDEESDED 1912
+K+K + + +K + D+++DE
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEY 272
Score = 34.6 bits (79), Expect = 0.68
Identities = 40/200 (20%), Positives = 70/200 (35%), Gaps = 46/200 (23%)
Query: 1705 ESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVAST 1764
+ G + EE + +++ + + D + D D D +D +
Sbjct: 231 KGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYD----------------S 274
Query: 1765 SRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGD 1824
G GR +E+ S+S N E + P K +Q ED
Sbjct: 275 DDGDDEGR------EEDYISDSSASGNDPEEREDKLSPEIP-----AKPEIEQDED---- 319
Query: 1825 NASEMSELKQDPLSLHDENEDEDNTNEMGKR-----KRTTRLVKSNEPEDEEDDDDESDE 1879
SE SE ++NE+E ++ GK+ + L K + ++ DD + D
Sbjct: 320 --SEESE--------EEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDG 369
Query: 1880 DYSPSTHAVTKRKRSVRVEK 1899
+ S S K+K + E
Sbjct: 370 EDSVSLVTAKKQKEPKKEEP 389
Score = 31.1 bits (70), Expect = 6.8
Identities = 27/146 (18%), Positives = 54/146 (36%), Gaps = 20/146 (13%)
Query: 1782 SASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQ----------QSEDGSG-----DNA 1826
+ + ED + ++ AK K L ++G+ S+DG D
Sbjct: 228 ESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYI 287
Query: 1827 SEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTH 1886
S+ S DP D+ E ++ + + +E E E++ S +
Sbjct: 288 SDSSASGNDPEEREDKLSPEIPAKPEIEQDEDS---EESEEEKNEEEGGLSKKGKKLKK- 343
Query: 1887 AVTKRKRSVRVEKSNEVDEDEESDED 1912
+ +K + + S+ D+ ++SD D
Sbjct: 344 -LKGKKNGLDKDDSDSGDDSDDSDID 368
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 38.9 bits (90), Expect = 0.034
Identities = 19/78 (24%), Positives = 38/78 (48%)
Query: 717 ETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFD 776
E + K++ + W++ + + + NF +++WD LV +K H V+ ++ D
Sbjct: 528 ELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSAD 587
Query: 777 SRVLLSAGHDGLIIIWDI 794
+L S DG + +W I
Sbjct: 588 PTLLASGSDDGSVKLWSI 605
Score = 35.8 bits (82), Expect = 0.26
Identities = 53/227 (23%), Positives = 91/227 (40%), Gaps = 35/227 (15%)
Query: 477 SAVFCVL-FDKLGEVIVTGADDLLIKLWRA----RDGK-------LLATLRGCSGEISDI 524
S + C + FD+ GE T + IK++ +DG+ LA+ SG +
Sbjct: 483 SNLVCAIGFDRDGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNS 542
Query: 525 AIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTST 584
I ++ +A+ + +++W++ ++ + H+ + +++ + LAS S
Sbjct: 543 YIKSQ---VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPT---LLASGSD 596
Query: 585 DGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSF-SPGGLFLATGSGDHHVR 643
DG + W + I A+I C F S G LA GS DH V
Sbjct: 597 DGSVKLWSINQGVSIGTI-------------KTKANICCVQFPSESGRSLAFGSADHKVY 643
Query: 644 VYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 690
Y + PL + HS V +++ S VS S D T +W
Sbjct: 644 YYDLRNPKLPLCTM--IGHSKTVSYVRFVDSS-TLVSSSTDNTLKLW 687
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.5 bits (90), Expect = 0.036
Identities = 40/207 (19%), Positives = 86/207 (41%), Gaps = 21/207 (10%)
Query: 1719 RRR--SSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPST-----------SYAEPVASTS 1765
RR+ + S + ++ +++EED D D D + P E +
Sbjct: 315 RRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLL 374
Query: 1766 RGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRR--GQQSEDGSG 1823
+F RA RK +E A + E ++ E+ ++ RR G ++ +
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEA 434
Query: 1824 DNASEMSELKQDPLSLHDENEDEDNTNEMGKR------KRTTRLVKSNEPEDEEDDDDES 1877
++ E K + + +E+E+ +E + K R K+ + E+EE+ D+E+
Sbjct: 435 ESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEEN 494
Query: 1878 DEDYSPSTHAVTKRKRSVRVEKSNEVD 1904
+ S+ + +K+ + + S+++D
Sbjct: 495 PWLKTTSSVGKSAKKQDSKKKSSSKLD 521
Score = 33.9 bits (78), Expect = 0.94
Identities = 31/152 (20%), Positives = 58/152 (38%), Gaps = 25/152 (16%)
Query: 1771 GRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMS 1830
G++ + ++E + +EDE+ + D E P R + + G++ E S
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWM--------LRKKLGKLKEGEDDEENS 371
Query: 1831 ELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDED---------- 1880
L E ++ + + R E E EE+ D+E +E+
Sbjct: 372 GLLSMKFMQRAEARKKEENDAEIEELR-------RELEGEEESDEEENEEPSKKNVGRRK 424
Query: 1881 YSPSTHAVTKRKRSVRVEKSNEVDEDEESDED 1912
+ P + ++ E NE E +ESDE+
Sbjct: 425 FGPENGEKEAESKKLKKENKNEFKEKKESDEE 456
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein,
which interacts with both microtubules and TRAF3 (tumour
necrosis factor receptor-associated factor 3), is
conserved from worms to humans. The N-terminal region is
the microtubule binding domain and is well-conserved; the
C-terminal 100 residues, also well-conserved, constitute
the coiled-coil region which binds to TRAF3. The central
region of the protein is rich in lysine and glutamic acid
and carries KKE motifs which may also be necessary for
tubulin-binding, but this region is the least
well-conserved.
Length = 506
Score = 37.9 bits (88), Expect = 0.054
Identities = 30/260 (11%), Positives = 72/260 (27%), Gaps = 34/260 (13%)
Query: 1640 HKNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGE 1699
K KE K+ + ++K K +++ + R E E + K R K +
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKK-----PPN 185
Query: 1700 TKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAE 1759
K + +Q + + V+ + + ++ EE+D +D +
Sbjct: 186 KKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRE----------------T 229
Query: 1760 PVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSE 1819
+ + + S + + + + R
Sbjct: 230 TTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPP---- 285
Query: 1820 DGSGDNASEMSELKQDP-LSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESD 1878
+A S P + + + +GK V + E++DD+
Sbjct: 286 -----SARPASARPAPPRVKRKEIVTVLQDAQGVGKIVSN---VILEGKKSEDEDDENFV 337
Query: 1879 EDYSPSTHAVTKRKRSVRVE 1898
+ + + +
Sbjct: 338 VEAAAQAPDIVAGGEDEAED 357
Score = 34.9 bits (80), Expect = 0.46
Identities = 36/232 (15%), Positives = 73/232 (31%), Gaps = 24/232 (10%)
Query: 1626 AEEHLRRLTVRWNS-HKNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRG 1684
EE +R VR S K KK + ++ + K+R+ + ++ ++ +R
Sbjct: 160 REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219
Query: 1685 KSTYENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDID 1744
+ G+ + S E+ E ++ R S+ S + + +
Sbjct: 220 EED---------DGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPS-PSMASPETRE 269
Query: 1745 NFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAK 1804
+ + P TS P A + R A KRK ++Q + +
Sbjct: 270 SSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQGVGKIVSNVILEGK--- 326
Query: 1805 KLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRK 1856
+ ED ++ DE ED + + ++
Sbjct: 327 ----------KSEDEDDENFVVEAAAQAPDIVAGGEDEAEDGEQQGGLVQKI 368
Score = 32.9 bits (75), Expect = 2.1
Identities = 29/154 (18%), Positives = 53/154 (34%), Gaps = 13/154 (8%)
Query: 1771 GRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKR--RGQQSEDGSGDNASE 1828
++K K+E E +D E P +K K K + E+ +
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRA 171
Query: 1829 MSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEP------EDEEDDDDESDEDYS 1882
S K+ P N+ ++ E +R+ VK E E+++DD D + +
Sbjct: 172 KSRPKKPPKK-KPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230
Query: 1883 PSTHAVTKRKRSV----RVEKSNEVDEDEESDED 1912
S + ++S R S + + S
Sbjct: 231 TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMAS 264
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 37.3 bits (87), Expect = 0.077
Identities = 37/222 (16%), Positives = 78/222 (35%), Gaps = 32/222 (14%)
Query: 1700 TKTKAESGNEQVEEVKT-RVRRRSSNRSRSNSQDN----EENDEEDVDIDNFDPEQPGPS 1754
TKAE E+ + K ++ +S ++ ++ E + ID G
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63
Query: 1755 TSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDE-NSIPETDTTEQPAKKLKVLYPKR 1813
+ S + AAK + A + +DE +S + + K + Y K
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123
Query: 1814 RGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDD 1873
++ D+ + +L D + D++ED+D ++ +D+ DD
Sbjct: 124 IDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE----------------DDDDDDVDD 167
Query: 1874 DDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
+DE ++ ++ ++ DE+ E +
Sbjct: 168 EDEEKKEAKELE----------KLSDDDDFVWDEDDSEALRQ 199
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 37.5 bits (87), Expect = 0.085
Identities = 43/206 (20%), Positives = 68/206 (33%), Gaps = 33/206 (16%)
Query: 1717 RVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAK- 1775
RV + +++ EE D NF E VA R + G A
Sbjct: 112 RVEAIGAAAMPGVAKNLSAMLEEKYDKANFSKRTDKEDAPLEEAVALLVREKLTGDAPPD 171
Query: 1776 ---------RKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNA 1826
R E+ A E D + + +Q R + GS D A
Sbjct: 172 SAGKVLDLWRDEIEDKAGE---DLDGLAAEIDDQQAFA---------RVVRDMLGSMDMA 219
Query: 1827 SEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTH 1886
E + + + ++ +D+ N S +E + DES+ED + +T
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNE----------DSEAGREESEGSDESEEDEAEATD 269
Query: 1887 AVTKRKRSVRVEKSNEVDEDEESDED 1912
+ E S + E +ESDED
Sbjct: 270 GEGEEGEMDAAEASED-SESDESDED 294
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes the
fifth essential DNA polymerase in yeast EC:2.7.7.7. Pol5p
is localised exclusively to the nucleolus and binds near
or at the enhancer region of rRNA-encoding DNA repeating
units.
Length = 784
Score = 37.5 bits (87), Expect = 0.086
Identities = 20/94 (21%), Positives = 37/94 (39%), Gaps = 13/94 (13%)
Query: 1819 EDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESD 1878
E+ D+ E + + + ++ D + ++ E G+ EDE++DD E++
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGEDGE-------------EDEQEDDAEAN 694
Query: 1879 EDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDED 1912
E P AV + V D E +E
Sbjct: 695 EGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEG 728
Score = 31.4 bits (71), Expect = 6.6
Identities = 29/156 (18%), Positives = 52/156 (33%), Gaps = 30/156 (19%)
Query: 1772 RAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSE 1831
K + ++ E +EDE+ + ETD E + ++ SE S + E E
Sbjct: 635 DENKSRHQQLFEGE-EEDEDDLEETDDDEDECEAIE---------DSESESESD-GEDGE 683
Query: 1832 LKQDPLSLHDENEDEDNTNEMGKRKRTTR-----LVKSNEPEDEEDDDDES--DE----- 1879
E++ E N + K R L + + + +D+E DE
Sbjct: 684 ------EDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQMMRL 737
Query: 1880 -DYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYK 1914
Y K + E E +++ +K
Sbjct: 738 DTYLAQIFKEKKERIQAGGETKKEAQSQKQNVISFK 773
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 37.2 bits (86), Expect = 0.092
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 760 LKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDI 794
L+GHT +V ++ HP VL SAG D ++ +WD+
Sbjct: 121 LQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDV 155
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region [General
function prediction only].
Length = 303
Score = 36.2 bits (83), Expect = 0.15
Identities = 35/122 (28%), Positives = 55/122 (45%), Gaps = 9/122 (7%)
Query: 1768 RFAGRAAKRKFKEESASESQEDENSIPE--TDTTEQPAKKLKVLYPKRRGQQSEDGSGDN 1825
+ A KR+ ++E E +E+ ++ E TD +E+ K K L QS + S
Sbjct: 188 KAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESE 247
Query: 1826 ASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPST 1885
E SE S DE+EDEDN ++ KRK KS +P + + E + + P+
Sbjct: 248 EEESSE------SESDEDEDEDNKGKIRKRKTDDA-KKSRKPHIHIEYEQERENEKIPAV 300
Query: 1886 HA 1887
Sbjct: 301 QH 302
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits in
yeasts and in human cells. Twelve of these are akin to
RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
and Rpc82 form a cluster of enzyme-specific subunits that
contribute to transcription initiation in S.cerevisiae
and H.sapiens. There is evidence that these subunits are
anchored at or near the N-terminal Zn-fold of Rpc1,
itself prolonged by a highly conserved but RNA polymerase
III-specific domain.
Length = 221
Score = 35.5 bits (82), Expect = 0.16
Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 10/89 (11%)
Query: 1793 IPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEM 1852
I + + K K R+ + D ++S L++ L E+ DE++ +
Sbjct: 122 INKKAGKKLALSKFK-----RKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDE 176
Query: 1853 GKRKRTTRLVKSNEPEDEEDDDDESDEDY 1881
+ + + E ED +DDDD+ D+DY
Sbjct: 177 EEEEE-----EEEEDEDFDDDDDDDDDDY 200
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the maturation
of the 18S rRNA and for 40S ribosome production.
Length = 809
Score = 36.5 bits (85), Expect = 0.17
Identities = 34/162 (20%), Positives = 62/162 (38%), Gaps = 23/162 (14%)
Query: 1772 RAAKRKFKEESASESQEDE--NSIPE---------TDTT-------EQPAKKLKVLYPKR 1813
R K K E ++DE + E TD T ++ A++LK L +R
Sbjct: 219 RTVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAER 278
Query: 1814 RGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDD 1873
+ + D E S+ D L DE E +D+ N + E + +D+
Sbjct: 279 LRRMRGEEEDDEEEEDSKESAD--DLDDEFEPDDDDNFGLGQGEEDE---EEEEDGVDDE 333
Query: 1874 DDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
D+E D+D E+ + D++++ +E+ +
Sbjct: 334 DEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEE 375
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by an
N-terminal arginine-rich, low complexity domain followed
by three (or in the case of 4 H. sapiens paralogs, two)
RNA recognition domains (rrm: pfam00706). These splicing
factors are closely related to the U2AF splicing factor
family (TIGR01642). A homologous gene from Plasmodium
falciparum was identified in the course of the analysis
of that genome at TIGR and was included in the seed.
Length = 457
Score = 36.4 bits (84), Expect = 0.17
Identities = 21/101 (20%), Positives = 38/101 (37%), Gaps = 22/101 (21%)
Query: 1640 HKNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGE 1699
+N + D R E+ +R + R R SR+RR + Y G
Sbjct: 10 LRNDTRRSDKGR-ERSRRRSRSRDR--------------SRRRRDRDYYRG-----RRGR 49
Query: 1700 TKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEED 1740
+++++ + + R RR RS N+++ E D
Sbjct: 50 SRSRSPNRYYRPRG--DRSYRRDDRRSGRNTKEPLTEAERD 88
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 35.4 bits (81), Expect = 0.47
Identities = 43/241 (17%), Positives = 85/241 (35%), Gaps = 33/241 (13%)
Query: 1687 TYENDFDFNPAGETKTKAESGNEQVEEVK------TRVRRRSSNRSRSNSQDNEENDEED 1740
T D D + A + E + V + S NS EEN + D
Sbjct: 3830 TLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLD 3889
Query: 1741 VDIDNFDPEQPGP--STSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDT 1798
++++ + + EP + ES S+ED+N E
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKD 3949
Query: 1799 TEQPAKKLKVLYPKRRGQQSEDGSGDNAS------EMSELKQDPLSLHDENEDEDNTNEM 1852
++ + ++ + + +N S E +L +D L L ++ D +++
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPED-LKLDEKEGDVSKDSDL 4008
Query: 1853 GKRKRTTRLVKSNEPEDEED-DDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDE 1911
+ ++E ++E D + DE +D P +E++N +DED + D+
Sbjct: 4009 ----EDMDMEAADENKEEADAEKDEPMQDEDP-------------LEENNTLDEDIQQDD 4051
Query: 1912 D 1912
Sbjct: 4052 F 4052
Score = 34.6 bits (79), Expect = 0.75
Identities = 28/202 (13%), Positives = 71/202 (35%), Gaps = 34/202 (16%)
Query: 1727 RSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASES 1786
+ +++ + N+E D+ D + R K +E S
Sbjct: 3921 QKSNEQSAANNESDLVSKEDDNKALEDKD----------------RQEKEDEEEMSDDVG 3964
Query: 1787 QEDENSIPETDTTEQPAKKLKVL-YPKRRGQQSEDGSGDNASEMSEL------------- 1832
+DE + QP + + L P+ ++G S++ ++
Sbjct: 3965 IDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEAD 4024
Query: 1833 --KQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEE--DDDDESDEDYSPSTHAV 1888
K +P+ D E+ + +E ++ + L + +E +E+ +++ + +E+ +
Sbjct: 4025 AEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKS 4084
Query: 1889 TKRKRSVRVEKSNEVDEDEESD 1910
+ V + +D + D
Sbjct: 4085 DEELEQGEVPEDQAIDNHPKMD 4106
Score = 32.3 bits (73), Expect = 3.9
Identities = 32/230 (13%), Positives = 77/230 (33%), Gaps = 27/230 (11%)
Query: 1705 ESGNEQVEEV---KTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPE--QPGPSTSYAE 1759
E+ +E EE K + +N+ D + ++ D+ D + + G + E
Sbjct: 4014 EAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQE 4073
Query: 1760 PVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDT--TEQPAKKLKVLYPKRRGQQ 1817
ST G + E+ + ++ P+ D T A+ + K +
Sbjct: 4074 NEESTEDG-----VKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGE 4128
Query: 1818 SED--------GSGDNASEMSELKQDPLSLHDENEDEDNT-NEMG------KRKRTTRLV 1862
+E+ G+G E ++++D + + + D +G ++
Sbjct: 4129 NEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHEW 4188
Query: 1863 KSNEPEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDED 1912
+ + DD + + ++ +D DE ++++
Sbjct: 4189 EDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQN 4238
Score = 31.9 bits (72), Expect = 5.0
Identities = 37/240 (15%), Positives = 75/240 (31%), Gaps = 21/240 (8%)
Query: 1689 ENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEEN-DEEDVDIDNFD 1747
E+D N G + E+ + VK+ + DN D +
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEA 4116
Query: 1748 PEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTT-EQPAKKL 1806
E+ E G E+ ++ + ++ E D + L
Sbjct: 4117 DEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHL 4176
Query: 1807 KVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMG-----KRKRTTRL 1861
+ R + ED S+ D +H + ++E++ +G + K R
Sbjct: 4177 REWQQANRIHEWED----LTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRD 4232
Query: 1862 VKSNEPEDE-------EDDDDESDEDYSPSTHAVTKRKRSVRVEKSNE---VDEDEESDE 1911
+N+ D ED+ DE + ++ K++ E +++ E+ E
Sbjct: 4233 ESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFE 4292
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 33.2 bits (76), Expect = 0.61
Identities = 31/124 (25%), Positives = 45/124 (36%), Gaps = 37/124 (29%)
Query: 1789 DENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDN 1848
DE T+ + AK +V + ED + + E + SL DEN+DED
Sbjct: 88 DEEIQAMTEKKDIKAKDKEV---DAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDE 144
Query: 1849 TNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEE 1908
++ E + +DDE DED + DEDEE
Sbjct: 145 DDD--------------EIVEILIEDDEVDEDE--------------------DDDEDEE 170
Query: 1909 SDED 1912
+ED
Sbjct: 171 DEED 174
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 34.2 bits (78), Expect = 0.66
Identities = 49/281 (17%), Positives = 95/281 (33%), Gaps = 25/281 (8%)
Query: 1653 EKKIKRMKRRRRKIRRTLVESDDEVL---------------SRKRRGKSTYENDFDFNPA 1697
E++ R +RRR + + + ++ L + ST +
Sbjct: 4 EEEAARERRRREREEQLRQKQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERL 63
Query: 1698 GETKTKAESG----NEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGP 1753
+ + + E+ +E K +S + Q++ + E V+ + + +
Sbjct: 64 ARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREER 123
Query: 1754 STSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPET---DTTEQPAKKLKVLY 1810
+ S + R A+ KEE E +E+E + + E KLK
Sbjct: 124 EEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTE 183
Query: 1811 PKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDE 1870
+E + E +LKQ E E+ E +R + + E
Sbjct: 184 NTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKRE---ERRKVLEEEEQRRKQE 240
Query: 1871 EDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDE 1911
E D +E+ +R+R+ EK +V ED S++
Sbjct: 241 EADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSED 281
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 34.3 bits (78), Expect = 0.75
Identities = 36/192 (18%), Positives = 71/192 (36%), Gaps = 23/192 (11%)
Query: 1732 DNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGR-------AAKRKFKEESAS 1784
D+ + D + DP G ST + + +A EE ++
Sbjct: 64 DSIDGDLIQGIQELKDPSLDG-STLNSSSGEESELEEAESVFKEKQMLSADVSEIEEQSN 122
Query: 1785 ESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENE 1844
+S + + P D + A+ + + + + G D ++ + +D L+ D NE
Sbjct: 123 DSLSENDEEPSMDDEKTSAEAAREEFAEEKRIPDPYGINDKFFDLEKFNRDTLAAEDSNE 182
Query: 1845 ----DEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDY-SPSTHAVTKRKRSVRVEK 1899
ED + + + P D+E+++ ED+ T K+ V+ K
Sbjct: 183 ASEGSEDEDIDYFQ----------DMPSDDEEEEAIYYEDFFDKPTKEPVKKHSDVKDPK 232
Query: 1900 SNEVDEDEESDE 1911
+E ++EE D
Sbjct: 233 EDEELDEEEHDS 244
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 34.2 bits (78), Expect = 0.81
Identities = 13/41 (31%), Positives = 27/41 (65%)
Query: 759 VLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQ 799
+LKGH ++ +++ +P + ++ S+G D + IWDI + K+
Sbjct: 120 ILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 32.2 bits (73), Expect = 3.4
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 19/83 (22%)
Query: 532 LLAAGTVDKTIRIWNL--------QTLAPISVLVGHQGIITGVNFCPLEVNGFNY--LAS 581
+LA+G+ D TIR+W + + P +L GH+ I+ +++ P+ NY + S
Sbjct: 89 ILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPM-----NYYIMCS 143
Query: 582 TSTDGCIGFWKYKLDIDISKTVF 604
+ D + W DI+ K F
Sbjct: 144 SGFDSFVNIW----DIENEKRAF 162
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 34.2 bits (78), Expect = 0.89
Identities = 30/145 (20%), Positives = 54/145 (37%), Gaps = 16/145 (11%)
Query: 1651 RGEKKIKR--MKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGN 1708
+G K I + K +K+ + L + DE R ++D DF + ++ E
Sbjct: 13 KGAKDIAKEVKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGE-AA 71
Query: 1709 EQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVA------ 1762
+ EE + SS+ + + +D+E + E I + + P A
Sbjct: 72 DDEEEGEA-----SSDATEGHDEDDEIYEGEYQGIPRAENGGKDEIMADGAPKAGVRDEL 126
Query: 1763 STSRGRFAGR--AAKRKFKEESASE 1785
G G A +R +EE A +
Sbjct: 127 KDGEGPPGGEGEAERRADEEELAQQ 151
Score = 32.3 bits (73), Expect = 3.2
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 10/81 (12%)
Query: 1044 VKKEKKKKVVRQA-RTRAEYSEHIQDQYE---DPDNLSLSGSEESASDSDQSQVDVSDLC 1099
VKK KKV + R + EYS+ ++E D D+ S ++ D
Sbjct: 22 VKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAAD------DEE 75
Query: 1100 ESSSSSGGTSSSDYSDWIHEG 1120
E +SS T D D I+EG
Sbjct: 76 EGEASSDATEGHDEDDEIYEG 96
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 34.1 bits (79), Expect = 1.00
Identities = 34/201 (16%), Positives = 71/201 (35%), Gaps = 19/201 (9%)
Query: 1618 MTTRLSEFAEE----------HLRRLTVRWNSHKNKENKKDSKRGEKKIKRMKRRRRKIR 1667
R+ E A+E L+ L + SH + ++++++ E K + + + +
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAE-- 59
Query: 1668 RTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSR 1727
+ + K+ A E AE+ E+ R ++ R +
Sbjct: 60 ----AEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPK 115
Query: 1728 SNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQ 1787
+ ++ + P++ R R R ++K K++ +E
Sbjct: 116 AKKAAKKKKGPKP---KKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKI 172
Query: 1788 EDENSIPETDTTEQPAKKLKV 1808
E IPET T + A+K+ V
Sbjct: 173 PREVVIPETITVAELAEKMAV 193
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 1.0
Identities = 52/325 (16%), Positives = 127/325 (39%), Gaps = 19/325 (5%)
Query: 1594 VKDVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENKK--DSKR 1651
+ VK+ ++ + K+++ + +F E + R + K +E +K + K+
Sbjct: 1226 AEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
Query: 1652 GEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNEQV 1711
E+K K + ++ + ++ E+ K++ + + D A E K KA++ ++
Sbjct: 1286 AEEKKKADEAKKAEEKKKADEA-------KKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Query: 1712 EEVKTR--VRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRF 1769
EE K + + + ++ EE E ++ + AE +
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK- 1397
Query: 1770 AGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEM 1829
+A + K K + ++ + E + KK K E D A +
Sbjct: 1398 -KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK----AEEAKKADEAKKK 1452
Query: 1830 SELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAVT 1889
+E + + E+ +E K+ + K++E + + ++ + ++ + A
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAK--KADEAKKKAEEAKKKADEAKKAAEAKK 1510
Query: 1890 KRKRSVRVEKSNEVDEDEESDEDYK 1914
K + + E++ + DE ++++E K
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 31.3 bits (70), Expect = 8.2
Identities = 31/151 (20%), Positives = 57/151 (37%), Gaps = 22/151 (14%)
Query: 1628 EHLRRLTVRWNSHKNKENKKDSKRG--EKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGK 1685
HL++ + KE + + E+ KR +KI+ + + +
Sbjct: 1760 AHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDI--FDNFANIIEGGKEG 1817
Query: 1686 STYENDFDFNPAGETKTKAESGNEQVEEVKT----RVRRRSSNRSRSNSQ-------DNE 1734
+ ND K A+S N Q+EE + + + N N + D +
Sbjct: 1818 NLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLK 1877
Query: 1735 ENDEEDVD-------IDNFDPEQPGPSTSYA 1758
E+DEE+++ ID D E+ P+ + A
Sbjct: 1878 EDDEEEIEEADEIEKIDKDDIEREIPNNNMA 1908
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 33.7 bits (77), Expect = 1.1
Identities = 21/109 (19%), Positives = 40/109 (36%), Gaps = 23/109 (21%)
Query: 1775 KRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQ 1834
KR+ +E + S+ + S + + E P+ Q+SE+ E E ++
Sbjct: 405 KRQERERQGTSSRSSDPSKASSTSGESPSMA---------SQESEEEESVEEEEEEEEEE 455
Query: 1835 DPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSP 1883
+ E E+ ++ E E E E D+ E + + S
Sbjct: 456 EEEEQESEEEEGEDEEE--------------EEEVEADNGSEEEMEGSS 490
Score = 33.3 bits (76), Expect = 1.7
Identities = 35/208 (16%), Positives = 65/208 (31%), Gaps = 23/208 (11%)
Query: 1645 NKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKA 1704
K+D E+ R KR+ R+ + T S D + G+S + + +
Sbjct: 393 MKQDDTEEEE---RRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEE 449
Query: 1705 ESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVAST 1764
E E+ EE + + + + EE+++ + AE S
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509
Query: 1765 SRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGD 1824
G + Q +S+ E+P + + + G+
Sbjct: 510 MAG-----------ISRMSEGQQPRGSSVQPESPQEEPLQPESM---------DAESVGE 549
Query: 1825 NASEMSELKQDPLSLHDENEDEDNTNEM 1852
+ E ++ PLS H E E E
Sbjct: 550 ESDEELLAEESPLSSHTELEGVATPVET 577
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to support
this.
Length = 283
Score = 33.4 bits (76), Expect = 1.1
Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 9/87 (10%)
Query: 1034 DSSTTTQPLVVKKEKKKKVVRQARTRAEY-SEHIQDQYEDPDNLSLSGSEESAS------ 1086
TT+ P+ ++E +K+ Q+ R E + + +Y N S +A+
Sbjct: 58 LMRTTSLPVETEEEWRKRKEMQSLRRLEAKRKRSEKEYNGVSNGDDMDSINAANGGGSGR 117
Query: 1087 --DSDQSQVDVSDLCESSSSSGGTSSS 1111
+S + S + SSSS G +SS
Sbjct: 118 DKESSKGVETGSPIATSSSSGGPSSSG 144
>gnl|CDD|99926 cd05494, Bromodomain_1, Bromodomain; uncharacterized subfamily.
Bromodomains are found in many chromatin-associated
proteins and in nuclear histone acetyltransferases. They
interact specifically with acetylated lysine.
Length = 114
Score = 31.6 bits (72), Expect = 1.3
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 2/34 (5%)
Query: 134 DSEPFRKPVDK--LDHPDYDKIVIAPMDLGTIRE 165
D+ PF +PV+ PDY ++ PM GT
Sbjct: 20 DAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVN 53
Score = 31.6 bits (72), Expect = 1.3
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 2/34 (5%)
Query: 1549 DSEPFRKPVDK--LDHPDYDKIVIAPMDLGTIRE 1580
D+ PF +PV+ PDY ++ PM GT
Sbjct: 20 DAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVN 53
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 33.7 bits (77), Expect = 1.3
Identities = 17/108 (15%), Positives = 40/108 (37%), Gaps = 5/108 (4%)
Query: 1641 KNKENKKDSKRGEKKIKRMKRRRRKIRRTL-VESDDEVLSRKRRGKSTYENDFDFNPAGE 1699
KNK+ K + K K ++++KI++ L ++ D + + + + +
Sbjct: 59 KNKKFNKTDDLKDSK-KTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRK 117
Query: 1700 TKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQD--NEENDEEDVDIDN 1745
K K E + V ++ S + + N + + I +
Sbjct: 118 PKPKTEKLKK-KITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHS 164
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function of
NS2 is not known, but it may be involved in the transport
and condensation of viral mRNAs.
Length = 363
Score = 33.3 bits (76), Expect = 1.3
Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 12/143 (8%)
Query: 1772 RAAKRKFKEESASESQEDENSI----------PETDTTEQPAKKL-KVLYPKRRGQQSED 1820
R ++K +EE E +E P++D E P + + P+ Q ++
Sbjct: 155 RELRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKE 214
Query: 1821 GSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDED 1880
+ ++ + E D E + R + E+ +D E+ +
Sbjct: 215 ERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGET-KP 273
Query: 1881 YSPSTHAVTKRKRSVRVEKSNEV 1903
S T +R +R K +
Sbjct: 274 ESYITSEYIERISEIRKMKDERL 296
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome
duplication is precisely regulated by cyclin-dependent
kinases CDKs, which bring about the onset of S phase by
activating replication origins and then prevent
relicensing of origins until mitosis is completed. The
optimum sequence motif for CDK phosphorylation is
S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
potential phosphorylation sites. Drc1 is required for DNA
synthesis and S-M replication checkpoint control. Drc1
associates with Cdc2 and is phosphorylated at the onset
of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
replication by phosphorylating Drc1 and regulating its
association with Cut5. Sld2 and Sld3 represent the
minimal set of S-CDK substrates required for DNA
replication.
Length = 397
Score = 33.2 bits (76), Expect = 1.4
Identities = 18/101 (17%), Positives = 37/101 (36%), Gaps = 12/101 (11%)
Query: 1826 ASEMSELKQDPLSLHDENEDEDNTNEMGKR---KRTTRLVK------SNEPEDEEDDDDE 1876
+E + + + D D + K+ KRTTR VK E + D
Sbjct: 257 EAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDI 316
Query: 1877 SDEDYSPSTHAVTKRKRS---VRVEKSNEVDEDEESDEDYK 1914
+E ++++ + + +E DE+ + + + K
Sbjct: 317 HEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKK 357
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46. Homologues
of CDC45 have been identified in human, mouse and smut
fungus among others.
Length = 583
Score = 33.0 bits (76), Expect = 1.8
Identities = 19/98 (19%), Positives = 40/98 (40%), Gaps = 16/98 (16%)
Query: 1824 DNASEMSELKQDP--------LSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDD 1875
D+ EL+ +P L D++++E + + +E ++++DDDD
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEED--------EESSKSEDDEDDDDDD 153
Query: 1876 ESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDY 1913
+ D+ + +R+R EK E++ Y
Sbjct: 154 DDDDIATRERSLERRRRRREWEEKRAELEFYYYQGTYY 191
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 32.7 bits (74), Expect = 2.1
Identities = 36/167 (21%), Positives = 56/167 (33%), Gaps = 29/167 (17%)
Query: 1747 DPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKL 1806
P + A +R RA K+ E S + E + E ++ +KK
Sbjct: 56 LAVDEEPDENGAVSKKKPTRSV--KRATKKTVVEISEPLEEGSELVVNEDAALDKESKK- 112
Query: 1807 KVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNE 1866
P+R +++ S D E +E K+ R R VK +
Sbjct: 113 ---TPRRTRRKAAAASSDVEEEKTE----------------------KKVRKRRKVKKMD 147
Query: 1867 PEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDY 1913
ED ED ES+ + VT + E E D+ E+ Y
Sbjct: 148 -EDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGEDISHTY 193
Score = 32.7 bits (74), Expect = 2.2
Identities = 32/150 (21%), Positives = 58/150 (38%), Gaps = 24/150 (16%)
Query: 971 SRANGESSAATEGTS-----PSGTAPTAQNKNFVKRLPLSQ-------HKIVVERINAFA 1018
SR ES+ A + S PT K K+ + ++VV A
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 1019 EIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSEHIQDQYEDPDNLSL 1078
+ + + +K +S+ V++EK +K VR+ R + E ++DQ + + +
Sbjct: 108 KESKKTPRRTRRKAAAASSD-----VEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDV 162
Query: 1079 SGSE-------ESASDSDQSQVDVSDLCES 1101
SE ES + D + D D+ +
Sbjct: 163 EESEFVTSLENESEEELDLEKDDGEDISHT 192
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1 protein
homologues. SDA1 is a Saccharomyces cerevisiae protein
which is involved in the control of the actin
cytoskeleton. The protein is essential for cell viability
and is localised in the nucleus.
Length = 317
Score = 32.3 bits (74), Expect = 2.4
Identities = 25/147 (17%), Positives = 58/147 (39%), Gaps = 28/147 (19%)
Query: 1766 RGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDN 1825
RG+ A + K+ E E + IP + E+ ++ + K+ +Q + D+
Sbjct: 44 RGKPASMGLEAGEKKGYGEE--EVVDGIPGLELLEKWKEEER---KKKEAEQGLESDDDD 98
Query: 1826 ASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPST 1885
E ++ DE+ D++ V+S++ + D +DE ++D
Sbjct: 99 DEEEEWEVEE-----DEDSDDEGEWID---------VESDKEIESSDSEDEEEKD----- 139
Query: 1886 HAVTKRKRSVRVEKSNEVDEDEESDED 1912
+ + + + E+ E++E +
Sbjct: 140 ----EAAKKAKEDSDEELSEEDEEEAA 162
>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family. The
proteins of the Trk family are derived from Gram-negative
and Gram-positive bacteria, yeast and wheat. The proteins
of E. coli K12 TrkH and TrkG as well as several yeast
proteins have been functionally characterized.The E. coli
TrkH and TrkG proteins are complexed to two peripheral
membrane proteins, TrkA, an NAD-binding protein, and
TrkE, an ATP-binding protein. This complex forms the
potassium uptake system. This family is specific for the
eukaryotic Trk system [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 800
Score = 32.6 bits (74), Expect = 2.6
Identities = 44/270 (16%), Positives = 78/270 (28%), Gaps = 18/270 (6%)
Query: 1650 KRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNE 1709
+R I+ ++R +RRT E+ R G + P G K +
Sbjct: 131 ERYFDGIRDSSKQRFFLRRTKTLLQRELEDRPETGVAGRVT----VPHGSAKRRDFQDKL 186
Query: 1710 QVEEVKTRVRRRSSNRSRSN--------SQDNEENDEEDVDIDNFDPEQPGPSTSYAEPV 1761
E R ++ + S E + + DPE S + +
Sbjct: 187 FSGEFVKRDEPDQNSPDVKSDTRADESISDLEFEKFAKRRGSRDVDPEDLYRSIMMLQGI 246
Query: 1762 ASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDG 1821
R + + + E S+ QE P T E+ ++ L Y + ++
Sbjct: 247 HERIREKSSANSRSD---ERSSESIQEQVERRPSTSDIERNSQSLTRRYDDKSFDKAVRL 303
Query: 1822 SGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDY 1881
+ +E D L + E T + K R K+ P ++
Sbjct: 304 RRSKTIDRAEAC-DLEELDRAKDFEKMTYDN--WKAHHRKKKNFRPRGWNLKFRKASRFP 360
Query: 1882 SPSTHAVTKRKRSVRVEKSNEVDEDEESDE 1911
S + S +E S E
Sbjct: 361 KDSDRNYEDNGNHLSASSSFGSEEPSLSSE 390
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 32.6 bits (74), Expect = 2.7
Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 16/105 (15%)
Query: 1816 QQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDD 1875
Q+ ED S + + S Q+ + +E+E ED T++ + E +D+ D+
Sbjct: 521 QKQEDESTNPIGKTSLRYQELTPVQEEDEPEDQTDD------DDSSLPPLESDDDPGSDN 574
Query: 1876 ESDEDYS-----PSTHAVTKRKRSVRVEKSNEVDE-----DEESD 1910
E D + + K + + N D + +SD
Sbjct: 575 EQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSD 619
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 32.3 bits (73), Expect = 2.7
Identities = 38/183 (20%), Positives = 76/183 (41%), Gaps = 37/183 (20%)
Query: 1736 NDEEDVDIDNFDPEQPGPSTS--YAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSI 1793
N E++ N++PE+ S + Y ++ G A A++ ++EN +
Sbjct: 39 NTREEIISVNYEPEEQDLSEACDYIAIEEYSACGNVA----------SIANDKSDNENDV 88
Query: 1794 PETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMG 1853
E + V + RG Q E+ ++ E+ E+++ + + ++DED
Sbjct: 89 ------ELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDED------ 136
Query: 1854 KRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAVTKRKRSVRV--EKSNEVDEDEESDE 1911
+E +DE++ D E DE+ + + + V + E N +DE E+ +
Sbjct: 137 -----------SEKDDEKESDAEGDENELAGEYIIEEVDDDVAILNENINSIDELEDPLD 185
Query: 1912 DYK 1914
YK
Sbjct: 186 SYK 188
>gnl|CDD|220023 pfam08801, Nucleoporin_N, Nup133 N terminal like. Nup133 is a
nucleoporin that is crucial for nuclear pore complex
(NPC) biogenesis. The N terminal forms a seven-bladed
beta propeller structure. This family now contains other
sized nucleoporins, including Nup155, Nup8, Nuo132,
Nup15 and Nup170.
Length = 411
Score = 32.4 bits (74), Expect = 2.8
Identities = 17/81 (20%), Positives = 28/81 (34%), Gaps = 11/81 (13%)
Query: 514 LRGCSGEISDIAIDN-RNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNF--CP 570
+ EI + +D+ R LL T TIR+W+L + ++ V+
Sbjct: 186 VPSSDREIVSLRVDSSRGRLLYTLTSKGTIRVWDLSP--------SGEKLVREVDIYEII 237
Query: 571 LEVNGFNYLASTSTDGCIGFW 591
L + S ST
Sbjct: 238 LRDLQESIPKSASTLSSKSLK 258
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with the
process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli synthesizes
cobalamin only when it is supplied with the precursor
cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with a
MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The other
two subunits are the P. denitrificans CobS (TIGR01650)
and CobN (pfam02514 CobN/Magnesium Chelatase) proteins.
To avoid potential confusion with the nonhomologous
Salmonella typhimurium/E.coli cobT gene product, the P.
denitrificans gene symbol is not used in the name of this
model [Biosynthesis of cofactors, prosthetic groups, and
carriers, Heme, porphyrin, and cobalamin].
Length = 600
Score = 32.2 bits (73), Expect = 2.9
Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 12/85 (14%)
Query: 1828 EMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHA 1887
E++E D DE + +D+ NE E++ + + E E +P
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQP------------TENEQEEQGEGEGEGQEGSAPQESE 245
Query: 1888 VTKRKRSVRVEKSNEVDEDEESDED 1912
T R+ E+ + D+D+ DE
Sbjct: 246 ATDRESESGEEEMVQSDQDDLPDES 270
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction that
requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.8 bits (70), Expect = 3.0
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 19/55 (34%)
Query: 1861 LVKSNEPEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
LV S E E ++D++DE +ED E D+++ES+E+ P
Sbjct: 106 LVASEEDESDDDEEDEEEED-------------------DEEDDDEDESEEEESP 141
Score = 30.0 bits (68), Expect = 7.0
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 13/46 (28%)
Query: 1838 SLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSP 1883
S DE++D++ E E ++E+DD+DES+E+ SP
Sbjct: 109 SEEDESDDDEEDEEE-------------EDDEEDDDEDESEEEESP 141
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 32.3 bits (73), Expect = 3.2
Identities = 44/261 (16%), Positives = 74/261 (28%), Gaps = 32/261 (12%)
Query: 1672 ESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNEQVEEVKTRVRRRSSNRSR---- 1727
E + + S E ++ + + NE SS S
Sbjct: 245 EKESDSYIDSEPNSSITEPYDHDIQVKNSEPEFKPSNEISSHQVNSTDNESSIISFPLHI 304
Query: 1728 ---------------SNSQDNEENDEEDVDIDNFDPEQ-PGPSTSYAEPVASTSRGRF-A 1770
S Q + E ++I P +S + S+ F A
Sbjct: 305 ADSSDSVSEISLTEPSRPQSIDSTIEPPIEIPRKMSTPFFTPRSSILDKHIELSQDSFSA 364
Query: 1771 GRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMS 1830
A FK SA D + +T P ++ K+R + + ++
Sbjct: 365 VSTATSPFKVSSAQIINSDGDVPLTRTSTSIPTRQSGTAAYKKRKKLNTSRYEISSKLRV 424
Query: 1831 ELKQDPLSLHDENEDEDNTNEMGKRKRTTRL--VKSNEPEDEEDDDDESDEDYSPSTHAV 1888
+ Q+ + N ++ T RL NE D E DDD S +V
Sbjct: 425 KDYQEDKT---------NNKAKLLKEETKRLPVDNLNEIADSESDDDSSLSIIEIVDTSV 475
Query: 1889 TKRKRSVRVEKSNEVDEDEES 1909
+ S + S E +
Sbjct: 476 LQVPSSPSSDTSGRELEATNA 496
>gnl|CDD|176394 cd01799, Hoil1_N, Ubiquitin-like domain of HOIL1. HOIL1_N HOIL-1
(heme-oxidized IRP2 ubiquitin ligase-1) is an E3
ubiquitin-protein ligase that recognizes heme-oxidized
IRP2 (iron regulatory protein2) and is thought to affect
the turnover of oxidatively damaged proteins. Hoil-1
has an amino-terminal ubiquitin-like domain as well as
an RBR signature consisting of two RING finger domains
separated by an IBR/DRIL domain.
Length = 75
Score = 29.4 bits (66), Expect = 3.3
Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 900 YPPEVQRLVPGRENADDTELLPD--ITTNDD 928
+PP VQR V G+ A D E L I TN D
Sbjct: 38 FPPAVQRWVIGQRLARDQETLYSHGIRTNGD 68
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 32.1 bits (73), Expect = 3.3
Identities = 21/114 (18%), Positives = 39/114 (34%), Gaps = 16/114 (14%)
Query: 208 SEFAEEHLRRLTVRWNSHKNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKR 267
++F + R + ++ E I+R K E +DE+ +
Sbjct: 659 AQFIDLSTLFCVQRLRVAAELQKEEARLALEGNIERTK-----------ELNDELRAELE 707
Query: 268 RGKSTYENDFDFNPAGETKTKAEKFLMSAMETERLTRLDQELMLLIIKYLEKGP 321
+ + FD A + F +A E ++LT+ + L L LE
Sbjct: 708 LHRKEILDLFDCGTADT----EDAFREAAREEQQLTQRESRLESL-EAQLEGVA 756
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This family
consists of several bovine specific leukaemia virus
receptors which are thought to function as transmembrane
proteins, although their exact function is unknown.
Length = 561
Score = 32.0 bits (72), Expect = 3.7
Identities = 55/263 (20%), Positives = 91/263 (34%), Gaps = 60/263 (22%)
Query: 1641 KNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGET 1700
K +E ++ +R EK + K+R ++ R R+R E+D D PA
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKR-----------GRRRHHSLGTESDEDIAPAQMV 131
Query: 1701 KTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEE---DVDIDNFDPEQPGPSTSY 1757
E E N S+ D + ND D+D+D
Sbjct: 132 DIVTEEMPE--------------NALPSDEDDKDPNDPYRALDIDLD------------- 164
Query: 1758 AEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQ 1817
+P+A + + +K + S+S E + +++P KK K K R +
Sbjct: 165 -KPLADSE------KLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKD 217
Query: 1818 SEDGSGDNASEMSELKQDPLSLHDENE-DEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDE 1876
+ S + L P S E DE + T + + E +E D E
Sbjct: 218 KKKEVEGFKSLLLALDDSPASAASVAEADEASLA-------NTVSGTAPDSEPDEPKDAE 270
Query: 1877 SDEDYSPSTHAVTKRKRSVRVEK 1899
++E H +K+ R EK
Sbjct: 271 AEETKKSPKH----KKKKQRKEK 289
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 31.9 bits (73), Expect = 3.9
Identities = 38/204 (18%), Positives = 70/204 (34%), Gaps = 10/204 (4%)
Query: 1626 AEEHLRRLTVRWNSHKNKENKKDSKRGEKKIKRMKRRRRKIRRTLVESDDEVLSRKRRGK 1685
AE R R +N N++D R E++ R R RR+ R E+ +E R RR
Sbjct: 596 AEAKPERQQDRRKPRQN--NRRD--RNERRDTRDNRTRREGR----ENREEN-RRNRRQA 646
Query: 1686 STYENDFDFNPAGETKTKAESGNEQVEEVKT-RVRRRSSNRSRSNSQDNEENDEEDVDID 1744
+ + E KA + +EQ + + R RRR+ + ++ + N EE +
Sbjct: 647 QQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQE 706
Query: 1745 NFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAK 1804
E+ + + R A+ + + K
Sbjct: 707 TEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVK 766
Query: 1805 KLKVLYPKRRGQQSEDGSGDNASE 1828
+ + +Q E+ + +N
Sbjct: 767 VPLPVVAQTAPEQDEENNAENRDN 790
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.1 bits (73), Expect = 3.9
Identities = 34/170 (20%), Positives = 63/170 (37%), Gaps = 24/170 (14%)
Query: 1705 ESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVAST 1764
E N Q E + + RS + D++E ED D D D E+
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREK 62
Query: 1765 SRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLY------------PK 1812
+R + +K K++ + E +N+ + D + +LK L +
Sbjct: 63 ARLK-----ELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQ 117
Query: 1813 RRGQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLV 1862
+ G G +AS+++E ++D L +E + + TRL+
Sbjct: 118 SASAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGS-------GGTRLL 160
>gnl|CDD|107161 PHA02264, PHA02264, hypothetical protein.
Length = 152
Score = 30.9 bits (69), Expect = 4.0
Identities = 8/68 (11%), Positives = 24/68 (35%), Gaps = 9/68 (13%)
Query: 1199 YELSRLRSSPWGNLILQEPELVKVVGIKYEIAPPRMCCLKLAVMKPNGKLTNEFFNIKYH 1258
Y + + ++++ P+++ V G L + + ++ ++N
Sbjct: 84 YNNTDYNGNIERRIVVKHPKVITVEGKP--------SYQFLTISLEDARVITVWYN-SVD 134
Query: 1259 DMPDVIDF 1266
D +D
Sbjct: 135 DTHRTLDL 142
>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis, outer
membrane].
Length = 284
Score = 31.3 bits (71), Expect = 4.2
Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 1/40 (2%)
Query: 1329 WDLEPPKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQ 1368
EPP G EP P P Q +L P P W +
Sbjct: 246 SWFEPPPPGTTKPEPKAP-PPLPPSCQAVLDEPAPSSWLE 284
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 31.9 bits (72), Expect = 4.3
Identities = 28/209 (13%), Positives = 66/209 (31%), Gaps = 25/209 (11%)
Query: 1723 SNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEES 1782
S+ S +S+D+ +++EE+ + ++ + E S S + E
Sbjct: 104 SDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESS-----LEDLSDDETED 158
Query: 1783 ASESQEDENSIPETDTTEQPAKKLKVLYPK----------------RRGQQSEDGSGDNA 1826
E + +E E + + A + K + E+ + +
Sbjct: 159 DEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDE 218
Query: 1827 SEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTH 1886
+ + QD E+ +D G+ + + + + + +D
Sbjct: 219 DDFEDYFQDD----SEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAE 274
Query: 1887 AVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
V+K + +EE +ED +
Sbjct: 275 LEDDEPDKEAVKKEADSKPEEEDEEDDEQ 303
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 31.6 bits (72), Expect = 4.7
Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 5/89 (5%)
Query: 970 RSRANGESSAATEGTSPSGTAPTAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFIKES 1029
RSR++ SS+ + PS + ++N R S +V + A A++ + S
Sbjct: 155 RSRSSSRSSSRSNSRGPSRGSSRNNSRN---RNSSSPDDLVAAVLAALAKLGFGKQKSSS 211
Query: 1030 KKLFDSSTTTQPLVVKKEKKKKVVRQART 1058
KK + + KK+ K R RT
Sbjct: 212 KKPSRVTKKSAAEAAKKQLNKP--RWKRT 238
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.3 bits (71), Expect = 5.0
Identities = 19/105 (18%), Positives = 44/105 (41%), Gaps = 3/105 (2%)
Query: 1812 KRRGQQSEDGSGDNASEMSELKQDPLS-LHDENEDEDNTNEMGKRKRTTRLVKSNEPEDE 1870
K+ ++S D + SE + + S D +++E+ ++ + ++ + DE
Sbjct: 160 KKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSS--DE 217
Query: 1871 EDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYKP 1915
E+ ++ +Y+ T + + +S V + EES K
Sbjct: 218 EEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKK 262
Score = 30.6 bits (69), Expect = 9.1
Identities = 25/149 (16%), Positives = 47/149 (31%), Gaps = 27/149 (18%)
Query: 1763 STSRGRFAGRAAKRKFKEESASESQE-DENSIPETDTTEQPAKKLKVLYPKRRGQQSEDG 1821
S + EESA + + +E +++ Q Y S++
Sbjct: 167 SDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQ--------YDGMLVDSSDEE 218
Query: 1822 SGDNA----------SEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPE--- 1868
G+ A S+ +S D + ++ K+ + + + P
Sbjct: 219 EGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMG 278
Query: 1869 -----DEEDDDDESDEDYSPSTHAVTKRK 1892
E++DDD+ D D KRK
Sbjct: 279 GYFSGSEDEDDDDEDIDPDQVVKKPVKRK 307
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.8 bits (72), Expect = 5.1
Identities = 13/63 (20%), Positives = 28/63 (44%)
Query: 1852 MGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDE 1911
M R R + +N+ E+DD+ + ++ V +K V + + +++ S E
Sbjct: 1 MAYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE 60
Query: 1912 DYK 1914
+ K
Sbjct: 61 NLK 63
>gnl|CDD|180374 PRK06060, PRK06060, acyl-CoA synthetase; Validated.
Length = 705
Score = 31.5 bits (71), Expect = 5.1
Identities = 15/57 (26%), Positives = 25/57 (43%), Gaps = 10/57 (17%)
Query: 1341 TEPGGSLPVEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEPFLVP 1397
TE G + V+P E++ L+ + +A E + + E G S + FLV
Sbjct: 390 TEVIGGVNVDPREVERLII----------EDEAVAEAAVVAVRESTGASTLQAFLVA 436
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1, of
polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 31.3 bits (71), Expect = 5.5
Identities = 36/206 (17%), Positives = 66/206 (32%), Gaps = 40/206 (19%)
Query: 1699 ETKTKAESGNEQVEEVKTRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYA 1758
E K +ES ++ E ++ R + +E+++ED + + S S
Sbjct: 231 EKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGER-----SDSEE 285
Query: 1759 EPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQS 1818
E R KR K + E+ +PE+ E ++S
Sbjct: 286 ETEEKEKEKR------KRLKKMMEDEDEDEEMEIVPESPVEE---------------EES 324
Query: 1819 EDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEE----DDD 1874
E+ + E K E+ + + G+R+ R++K +DEE
Sbjct: 325 EEPEPPPLPKKEEEK----------EEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKK 374
Query: 1875 DESDEDYSPSTHAVTKRKRSVRVEKS 1900
E +S K +V
Sbjct: 375 VYEWESFSEDEAEPPPTKPKPKVSTP 400
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 31.6 bits (71), Expect = 5.7
Identities = 15/81 (18%), Positives = 35/81 (43%), Gaps = 3/81 (3%)
Query: 1815 GQQSEDGSGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNE--PEDEED 1872
E E ++ + S +E DE + G++ + +S+E EE+
Sbjct: 721 TSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEE 780
Query: 1873 DDDESDEDYSPSTHAVTKRKR 1893
++E +++ S + A K+++
Sbjct: 781 KEEEENKEVS-AKRAKKKQRK 800
>gnl|CDD|130065 TIGR00992, 3a0901s03IAP75, chloroplast envelope protein translocase,
IAP75 family. Two families of proteins are involved in
the chloroplast envelope import appartus.They are the
three proteins of the outer membrane (TOC) and four
proteins in the inner membrane (TIC). This family is
specific for the TOC IAP75 protein [Transport and binding
proteins, Amino acids, peptides and amines].
Length = 718
Score = 31.5 bits (71), Expect = 5.8
Identities = 18/106 (16%), Positives = 32/106 (30%), Gaps = 15/106 (14%)
Query: 1740 DVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSIPETDTT 1799
+ + P G A + G F + A + DEN + D+
Sbjct: 4 GGGVGDLPPMGGGGGGELAG-----AHGNFWAQVASP-------VAAVADENLSADWDSH 51
Query: 1800 EQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLSLHDENED 1845
PA ++V P G + S + ++ D E+
Sbjct: 52 NLPA-HIEVQLPLLSGFKKYKES--DIVFFDRTEETTGVTEDSFEE 94
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 31.6 bits (71), Expect = 6.7
Identities = 34/162 (20%), Positives = 63/162 (38%), Gaps = 19/162 (11%)
Query: 1767 GRFAGRAAKRKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGD-- 1824
G + A R K+ S Q E + E +K + YP ++E G D
Sbjct: 73 GIYNLGDAIRMGKDTSVKIKQRTCPFHKEGSSFEMGSKNITCFYPIH--GKAEIGDLDMI 130
Query: 1825 --------NASEMSELKQDPLSLHDENED----EDNTNEMGKRKRTTRLVKSNEPEDEED 1872
+ +E +D + D++ED +D+ + + + +K D+ED
Sbjct: 131 IIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIK---GFDDED 187
Query: 1873 DDDESDEDYSPSTHAVTKRKRSVRVEKSNEVDEDEESDEDYK 1914
++DE ED++ V K +E D ++ DE ++
Sbjct: 188 EEDEGGEDFTYEKSEVDKTDCFKFIEAGAGDDVFDKIDEAFE 229
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim have
been identified in both mice and humans (mTim and hTIM,
respectively). Mammalian TIM is not the true orthologue
of Drosophila TIM, but is the likely orthologue of a fly
gene, timeout (also called tim-2). mTim has been shown to
be essential for embryonic development, but does not have
substantiated circadian function. Some family members
contain a SANT domain in this region.
Length = 507
Score = 31.2 bits (70), Expect = 6.9
Identities = 61/318 (19%), Positives = 108/318 (33%), Gaps = 27/318 (8%)
Query: 1596 DVKRVFHNSRNYNQNKRSKIYIMTTRLSEFAEEHLRRLTVRWNSHKNKENKKDSKRGEKK 1655
+++R++ R+ + + +T + S A+ + L + S K + +R + K
Sbjct: 176 ELQRLYEEFRDSDDVLGHIMKNITGKRSR-AQVVDKLLQLGLVSEKRELRGGKKRRKKLK 234
Query: 1656 IKRMKRRRRKIRRTLVESDDEVLSRKRRGKSTYENDFDFNPAGETKTKAESGNEQVEEVK 1715
K+ + ++E + + + T + F G T AE+ +Q+++ K
Sbjct: 235 PKQPNGEESGEDDFQEDPEEEEQLPESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEK 294
Query: 1716 TRVRRRSSNRSRSNSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAK 1775
T + D+ E DE D Q P E R K
Sbjct: 295 TSWPLLWLQSCLIRAADDREEDECD---------QAVPLVPLTEENEEAMENEQFQRLLK 345
Query: 1776 RKFKEESASESQEDENSIPETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQD 1835
S QE IP ++ Q ++ L SG+ ELK D
Sbjct: 346 ALGLRPPRSG-QEGFWRIPAKLSSTQLRRRAASL------------SGEEEEPEDELKDD 392
Query: 1836 -PLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDYSPSTHAVTKRKRS 1894
DE+E E R+R L PE+E ++E ++++
Sbjct: 393 VDGEQADESEHETLALRKNARQRKAGLAS---PEEEALGEEEQKAPPKKKQLNQKNKQQT 449
Query: 1895 VRVEKSNEVDEDEESDED 1912
S+E +D DED
Sbjct: 450 GSGTNSDEERDDTSLDED 467
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic phosphoprotein
1 (DMP1) gene has been mapped to human chromosome 4q21.
DMP1 is a bone and teeth specific protein initially
identified from mineralised dentin. DMP1 is primarily
localised in the nuclear compartment of undifferentiated
osteoblasts. In the nucleus, DMP1 acts as a
transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates mineralised
matrix formation extracellularly, at later stages of
osteoblast maturation. The DMP1 gene has been found to be
ectopically expressed in lung cancer although the reason
for this is unknown.
Length = 514
Score = 30.8 bits (69), Expect = 8.7
Identities = 56/253 (22%), Positives = 97/253 (38%), Gaps = 19/253 (7%)
Query: 1672 ESDDEVLSRKRRGKSTYENDFDFNPAG---ETKTKAESGNEQVEEVKTRVRRRSSNRSRS 1728
+++DEV SR G ST +++ + + G E ++ G+E +E S+ R
Sbjct: 169 DNEDEVDSRPEGGDSTQDSESEEHWVGGGSEGESSHGDGSEFDDEGMQSDDPESTRSERG 228
Query: 1729 NSQDNEENDEEDVDIDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQE 1788
NS+ + + + D EQ S +F K + EE +
Sbjct: 229 NSRMSSAGLKSK-ESKGEDEEQASTQDSGESQSVEYPSRKFF---RKSRISEEDGRGELD 284
Query: 1789 DENSIP-ETDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSELKQDPLS--------L 1839
D N++ ++D+TE L R + ED + + E S+ QDP S
Sbjct: 285 DSNTMEVKSDSTENAG--LSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLP 342
Query: 1840 HDENEDEDNTNEMGKRKRTTRLVKSNEPEDEED-DDDESDEDYSPSTHAVTKRKRSVRVE 1898
EN E + + + ++ ED+ED + E D +PS+ + E
Sbjct: 343 SQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSE 402
Query: 1899 KSNEVDEDEESDE 1911
+ + EES E
Sbjct: 403 SNESLSSSEESPE 415
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 30.2 bits (68), Expect = 8.8
Identities = 18/107 (16%), Positives = 38/107 (35%), Gaps = 11/107 (10%)
Query: 1775 KRKFKEESASESQEDENSIPE---TDTTEQPAKKLKVLYPKRRGQQSEDGSGDNASEMSE 1831
+ K + S + +++ PE + + KK K+ +++ + +
Sbjct: 21 RAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKT--------KKKKK 72
Query: 1832 LKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESD 1878
K++ L E + NE +K L K E + +D D
Sbjct: 73 EKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDVID 119
>gnl|CDD|205277 pfam13096, ShortName, Family description.
Length = 177
Score = 29.9 bits (67), Expect = 9.4
Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 14/84 (16%)
Query: 1010 VVERINAFAEIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSE----- 1064
+ E+ N + I E + D S V + E+ + ++ R+ +SE
Sbjct: 23 IQEKENLSRRVTDLNIIMEGVEFSDLSKF----VSRAEEDRDLLLFFRSLRSFSEWCEHR 78
Query: 1065 -----HIQDQYEDPDNLSLSGSEE 1083
H +++Y D L S E
Sbjct: 79 RETFEHFKEKYPDVVRLPEGVSGE 102
>gnl|CDD|227649 COG5345, COG5345, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 358
Score = 30.6 bits (69), Expect = 9.5
Identities = 27/114 (23%), Positives = 40/114 (35%), Gaps = 19/114 (16%)
Query: 1691 DFDFNPAGETKTKAESGNEQ------VEEVKTRVRRRSSNRSRSNSQDNEENDEEDVD-- 1742
D+D PA + K + G Q +E R R ++ Q N + D
Sbjct: 127 DWDITPALDNKASFQRGQNQAVRRTDLELADPLGRVRGTSWILPALQSARGNGQFSEDRW 186
Query: 1743 ---IDNFDPEQPGPSTSYAEPVASTSRGRFAGRAAKRKFKEESASESQEDENSI 1793
+ F P+ P PS A A+ + F G A AS Q +N +
Sbjct: 187 YFGLSPFGPKTPTPSFYRA---AARNLRNFNGDLATCL-----ASFDQRADNLV 232
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein. This
family represents Cwf15/Cwc15 (from Schizosaccharomyces
pombe and Saccharomyces cerevisiae respectively) and
their homologues. The function of these proteins is
unknown, but they form part of the spliceosome and are
thus thought to be involved in mRNA splicing.
Length = 241
Score = 30.1 bits (68), Expect = 9.8
Identities = 19/91 (20%), Positives = 32/91 (35%), Gaps = 18/91 (19%)
Query: 1822 SGDNASEMSELKQDPLSLHDENEDEDNTNEMGKRKRTTRLVKSNEPEDEEDDDDESDEDY 1881
S E +L NE +++ +E + KR R+ + D +D D SD D
Sbjct: 72 SKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRK-RIEEDARNSDADDSDSSSDSD- 129
Query: 1882 SPSTHAVTKRKRSVRVEKSNEVDEDEESDED 1912
++ D+D+ DE
Sbjct: 130 ----------------SSDDDSDDDDSEDET 144
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.133 0.390
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 98,989,621
Number of extensions: 10042622
Number of successful extensions: 10339
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9656
Number of HSP's successfully gapped: 350
Length of query: 1915
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1804
Effective length of database: 6,014,308
Effective search space: 10849811632
Effective search space used: 10849811632
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 66 (29.2 bits)