RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14234
(779 letters)
>gnl|CDD|99929 cd05497, Bromo_Brdt_I_like, Bromodomain, Brdt_like subfamily,
repeat I. Human Brdt is a testis-specific member of the
BET subfamily of bromodomain proteins; the first
bromodomain in Brdt has been shown to be essential for
male germ cell differentiation. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 107
Score = 212 bits (541), Expect = 2e-65
Identities = 80/107 (74%), Positives = 92/107 (85%)
Query: 120 RNTNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENN 179
R+TNQLQY++K V+KA+WKH AWPF +PVDA+ LNLPDYHK+I PMDLGTIKKRLENN
Sbjct: 1 RHTNQLQYLLKVVLKALWKHKFAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENN 60
Query: 180 YYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGM 226
YYWS E IQDF TMFTNCY+YNKPG+DVVLMAQ LE+LFL K+ M
Sbjct: 61 YYWSASECIQDFNTMFTNCYIYNKPGDDVVLMAQTLEKLFLQKLAQM 107
Score = 111 bits (279), Expect = 5e-29
Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 1/90 (1%)
Query: 388 KHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFS 447
KH+ +AWPF PVD KL L DY IIK PMDLGT++K++ N Y +A E D +F+
Sbjct: 19 KHK-FAWPFQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQDFNTMFT 77
Query: 448 NCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
NCY YN P +VV MA+ L +F + A+M
Sbjct: 78 NCYIYNKPGDDVVLMAQTLEKLFLQKLAQM 107
>gnl|CDD|99930 cd05498, Bromo_Brdt_II_like, Bromodomain, Brdt_like subfamily,
repeat II. Human Brdt is a testis-specific member of the
BET subfamily of bromodomain proteins; the first
bromodomain in Brdt has been shown to be essential for
male germ cell differentiation. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 102
Score = 187 bits (476), Expect = 3e-56
Identities = 69/102 (67%), Positives = 79/102 (77%)
Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
K+CS I+ ELF KKH+ YAWPFY PVD E LGL DY DIIK PMDL T++KK+ NR Y
Sbjct: 1 QLKFCSGILKELFSKKHKAYAWPFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREY 60
Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
A+EFA DVRL+FSNCYKYNPPDH V MAR+L VFEDR+
Sbjct: 61 ADAQEFAADVRLMFSNCYKYNPPDHPVHAMARKLQDVFEDRW 102
Score = 111 bits (280), Expect = 3e-29
Identities = 39/78 (50%), Positives = 49/78 (62%)
Query: 142 AWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVY 201
AWPF++PVD L L DYH +I PMDL TIKK+L+N Y +E D MF+NCY Y
Sbjct: 21 AWPFYKPVDPEALGLHDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKY 80
Query: 202 NKPGEDVVLMAQNLEQLF 219
N P V MA+ L+ +F
Sbjct: 81 NPPDHPVHAMARKLQDVF 98
>gnl|CDD|99931 cd05499, Bromo_BDF1_2_II, Bromodomain. BDF1/BDF2 like subfamily,
restricted to fungi, repeat II. BDF1 and BDF2 are yeast
transcription factors involved in the expression of a
wide range of genes, including snRNAs; they are required
for sporulation and DNA repair and protect histone H4
from deacetylation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 102
Score = 145 bits (368), Expect = 3e-41
Identities = 53/99 (53%), Positives = 65/99 (65%)
Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYK 433
K+C E++ EL KH Y WPF PVD L + +YF IIKKPMDLGT+ KK++N Y+
Sbjct: 2 LKFCEEVLKELMKPKHSAYNWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQ 61
Query: 434 TAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFED 472
+AKEF DVRLIF NCY +NP +V M QL VF D
Sbjct: 62 SAKEFERDVRLIFKNCYTFNPEGTDVYMMGHQLEEVFND 100
Score = 101 bits (254), Expect = 9e-26
Identities = 38/78 (48%), Positives = 52/78 (66%)
Query: 142 AWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVY 201
WPF +PVD + LN+P+Y +I +PMDLGTI K+L+N Y S KE +D +F NCY +
Sbjct: 21 NWPFLDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSAKEFERDVRLIFKNCYTF 80
Query: 202 NKPGEDVVLMAQNLEQLF 219
N G DV +M LE++F
Sbjct: 81 NPEGTDVYMMGHQLEEVF 98
>gnl|CDD|99938 cd05506, Bromo_plant1, Bromodomain, uncharacterized subfamily
specific to plants. Might function as a global
transcription factor. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 99
Score = 135 bits (342), Expect = 1e-37
Identities = 53/102 (51%), Positives = 70/102 (68%), Gaps = 3/102 (2%)
Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
K C ++ +L KH+ + W F PVDV LGL DYFDIIKKPMDLGTV+KK+ Y
Sbjct: 1 VMKQCGTLLRKLM--KHK-WGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEY 57
Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
+ +EFA DVRL F+N +YNPP ++V TMA++L +FE R+
Sbjct: 58 SSPEEFAADVRLTFANAMRYNPPGNDVHTMAKELLKIFETRW 99
Score = 110 bits (276), Expect = 8e-29
Identities = 40/92 (43%), Positives = 55/92 (59%)
Query: 131 NVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQD 190
+++ + KH W F+ PVD + L LPDY +I +PMDLGT+KK+LE Y S +E D
Sbjct: 7 TLLRKLMKHKWGWVFNAPVDVVALGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEFAAD 66
Query: 191 FTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTK 222
F N YN PG DV MA+ L ++F T+
Sbjct: 67 VRLTFANAMRYNPPGNDVHTMAKELLKIFETR 98
>gnl|CDD|197636 smart00297, BROMO, bromo domain.
Length = 107
Score = 122 bits (309), Expect = 4e-33
Identities = 41/108 (37%), Positives = 60/108 (55%), Gaps = 3/108 (2%)
Query: 371 PLWYKYCSEIIAELFHKKHQN-YAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRN 429
P K E++ + K + +WPF PV ++ DY+DIIKKPMDL T++KK+ N
Sbjct: 2 PKLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEAP--DYYDIIKKPMDLKTIKKKLEN 59
Query: 430 RTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
Y + +EF D L+FSN YN PD V A++L FE + ++
Sbjct: 60 GKYSSVEEFVADFNLMFSNARTYNGPDSEVYKDAKKLEKFFEKKLREL 107
Score = 119 bits (301), Expect = 6e-32
Identities = 43/105 (40%), Positives = 62/105 (59%), Gaps = 2/105 (1%)
Query: 120 RNTNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENN 179
+ +LQ ++K V+ + HP +WPF +PV PDY+ +I +PMDL TIKK+LEN
Sbjct: 3 KLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEA--PDYYDIIKKPMDLKTIKKKLENG 60
Query: 180 YYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKIT 224
Y S +E + DF MF+N YN P +V A+ LE+ F K+
Sbjct: 61 KYSSVEEFVADFNLMFSNARTYNGPDSEVYKDAKKLEKFFEKKLR 105
>gnl|CDD|99922 cd04369, Bromodomain, Bromodomain. Bromodomains are found in many
chromatin-associated proteins and in nuclear histone
acetyltransferases. They interact specifically with
acetylated lysine.
Length = 99
Score = 119 bits (299), Expect = 7e-32
Identities = 39/102 (38%), Positives = 57/102 (55%), Gaps = 3/102 (2%)
Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
K ++ L K ++ + PF PVD DY+++IK PMDL T++KK++N Y
Sbjct: 1 LKKKLRSLLDAL-KKLKRDLSEPFLEPVD--PKEAPDYYEVIKNPMDLSTIKKKLKNGEY 57
Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRF 474
K+ +EF DVRLIFSN YN P + A++L +FE
Sbjct: 58 KSLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99
Score = 108 bits (273), Expect = 2e-28
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 4/101 (3%)
Query: 125 LQYIVKNVMKAVWKH--PHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYW 182
L+ +++++ A+ K + PF EPVD PDY++VI PMDL TIKK+L+N Y
Sbjct: 1 LKKKLRSLLDALKKLKRDLSEPFLEPVDP--KEAPDYYEVIKNPMDLSTIKKKLKNGEYK 58
Query: 183 SGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKI 223
S +E D +F+N YN PG + A+ LE+LF +
Sbjct: 59 SLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99
>gnl|CDD|99941 cd05509, Bromo_gcn5_like, Bromodomain; Gcn5_like subfamily. Gcn5p
is a histone acetyltransferase (HAT) which mediates
acetylation of histones at lysine residues; such
acetylation is generally correlated with the activation
of transcription. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 101
Score = 114 bits (287), Expect = 3e-30
Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 124 QLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
L +K V+ ++ H AWPF EPVD PDY+ VI +PMDL T++++LEN YY +
Sbjct: 1 PLYTQLKKVLDSLKNHKSAWPFLEPVDK--EEAPDYYDVIKKPMDLSTMEEKLENGYYVT 58
Query: 184 GKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGM 226
+E + D +F NC +YN P + A LE+ F K+ +
Sbjct: 59 LEEFVADLKLIFDNCRLYNGPDTEYYKCANKLEKFFWKKLKEL 101
Score = 105 bits (264), Expect = 5e-27
Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 3/91 (3%)
Query: 387 KKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIF 446
K H++ AWPF PVD E+ DY+D+IKKPMDL T+ +K+ N Y T +EF D++LIF
Sbjct: 14 KNHKS-AWPFLEPVDKEEAP--DYYDVIKKPMDLSTMEEKLENGYYVTLEEFVADLKLIF 70
Query: 447 SNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
NC YN PD A +L F + ++
Sbjct: 71 DNCRLYNGPDTEYYKCANKLEKFFWKKLKEL 101
>gnl|CDD|99932 cd05500, Bromo_BDF1_2_I, Bromodomain. BDF1/BDF2 like subfamily,
restricted to fungi, repeat I. BDF1 and BDF2 are yeast
transcription factors involved in the expression of a
wide range of genes, including snRNAs; they are required
for sporulation and DNA repair and protect histone H4
from deacetylation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 103
Score = 100 bits (251), Expect = 2e-25
Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 3/102 (2%)
Query: 370 TPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRN 429
T +K+ I L K A PF PVD KL + Y IIKKPMDLGT+ +K+++
Sbjct: 2 TKHQHKFLLSSIRSL---KRLKDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKS 58
Query: 430 RTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFE 471
Y + +EF D L+ NC +N P+H V M ++L A FE
Sbjct: 59 NVYTSVEEFTADFNLMVDNCLTFNGPEHPVSQMGKRLQAAFE 100
Score = 92.4 bits (230), Expect = 2e-22
Identities = 35/96 (36%), Positives = 58/96 (60%), Gaps = 1/96 (1%)
Query: 124 QLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
Q ++++ ++ +++ + A PF PVD + LN+P Y +I +PMDLGTI+++L++N Y S
Sbjct: 5 QHKFLLSSI-RSLKRLKDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKSNVYTS 63
Query: 184 GKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLF 219
+E DF M NC +N P V M + L+ F
Sbjct: 64 VEEFTADFNLMVDNCLTFNGPEHPVSQMGKRLQAAF 99
>gnl|CDD|99936 cd05504, Bromo_Acf1_like, Bromodomain; Acf1_like or BAZ1A_like
subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was
identified as a novel human bromodomain gene by cDNA
library screening. The Drosophila homologue, Acf1, is
part of the CHRAC (chromatin accessibility complex) and
regulates ISWI-induced nucleosome remodeling.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 115
Score = 96.3 bits (240), Expect = 1e-23
Identities = 42/101 (41%), Positives = 64/101 (63%), Gaps = 5/101 (4%)
Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
+++ E+ KH++ +WPF PV K+ + DY+DIIKKPMDLGT+++K+ YK A+
Sbjct: 17 LEQLLVEI--VKHKD-SWPFLRPV--SKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAE 71
Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKM 477
EF D++L+FSNC+ YNP +V +L F R K+
Sbjct: 72 EFLSDIQLVFSNCFLYNPEHTSVYKAGTRLQRFFIKRCRKL 112
Score = 81.7 bits (202), Expect = 1e-18
Identities = 32/85 (37%), Positives = 49/85 (57%), Gaps = 2/85 (2%)
Query: 138 KHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
KH +WPF PV I + PDY+ +I +PMDLGTIK++L Y +E + D +F+N
Sbjct: 26 KHKDSWPFLRPVSKIEV--PDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSDIQLVFSN 83
Query: 198 CYVYNKPGEDVVLMAQNLEQLFLTK 222
C++YN V L++ F+ +
Sbjct: 84 CFLYNPEHTSVYKAGTRLQRFFIKR 108
>gnl|CDD|99927 cd05495, Bromo_cbp_like, Bromodomain, cbp_like subfamily. Cbp (CREB
binding protein or CREBBP) is an acetyltransferase
acting on histone, which gives a specific tag for
transcriptional activation and also acetylates
non-histone proteins. CREBBP binds specifically to
phosphorylated CREB protein and augments the activity of
phosphorylated CREB to activate transcription of
cAMP-responsive genes. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 108
Score = 90.6 bits (225), Expect = 8e-22
Identities = 35/86 (40%), Positives = 49/86 (56%)
Query: 388 KHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFS 447
K + PF PVD + LG+ DYFDI+K PMDL T+R+K+ Y+ ++ DDV L+F
Sbjct: 17 KQDPESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFD 76
Query: 448 NCYKYNPPDHNVVTMARQLSAVFEDR 473
N + YN V +L+ VFE
Sbjct: 77 NAWLYNRKTSRVYKYCTKLAEVFEQE 102
Score = 75.2 bits (185), Expect = 2e-16
Identities = 30/99 (30%), Positives = 51/99 (51%)
Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
Q ++ + K + P + PF +PVD L +PDY ++ PMDL TI+++L+ Y
Sbjct: 5 RQALMPTLEKLYKQDPESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDP 64
Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKI 223
+ + D MF N ++YN+ V L ++F +I
Sbjct: 65 WQYVDDVWLMFDNAWLYNRKTSRVYKYCTKLAEVFEQEI 103
>gnl|CDD|215921 pfam00439, Bromodomain, Bromodomain. Bromodomains are 110 amino
acid long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 84
Score = 89.4 bits (222), Expect = 1e-21
Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
C EI+ +L +H E DY+++IK+PMDL T+R+K+++ YK+
Sbjct: 1 CLEILEDL--MEHPLAEPFLEPVDPEE---YPDYYEVIKEPMDLSTIRQKLKSGKYKSLA 55
Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQ 465
EF DV LIFSN YN D ++ A++
Sbjct: 56 EFLKDVELIFSNAITYNGEDSDIYKDAKK 84
Score = 84.0 bits (208), Expect = 8e-20
Identities = 31/84 (36%), Positives = 50/84 (59%), Gaps = 2/84 (2%)
Query: 130 KNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQ 189
+++ + +HP A PF EPVD PDY++VI +PMDL TI+++L++ Y S E ++
Sbjct: 2 LEILEDLMEHPLAEPFLEPVD--PEEYPDYYEVIKEPMDLSTIRQKLKSGKYKSLAEFLK 59
Query: 190 DFTTMFTNCYVYNKPGEDVVLMAQ 213
D +F+N YN D+ A+
Sbjct: 60 DVELIFSNAITYNGEDSDIYKDAK 83
>gnl|CDD|99942 cd05510, Bromo_SPT7_like, Bromodomain; SPT7_like subfamily. SPT7 is
a yeast protein that functions as a component of the
transcription regulatory histone acetylation (HAT)
complexes SAGA, SALSA, and SLIK. SAGA is involved in the
RNA polymerase II-dependent transcriptional regulation
of about 10% of all yeast genes. The SPT7 bromodomain
has been shown to weakly interact with acetylated
histone H3, but not H4. The human representative of this
subfamily is cat eye syndrome critical region protein 2
(CECR2). Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 112
Score = 89.4 bits (222), Expect = 2e-21
Identities = 43/107 (40%), Positives = 60/107 (56%), Gaps = 5/107 (4%)
Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYK 433
Y+ +++ EL K + ++ PF T V K DY+DIIKKPMDLGT+ KK++N YK
Sbjct: 9 YESLDKVLNEL--KTYTEHSTPFLTKV--SKREAPDYYDIIKKPMDLGTMLKKLKNLQYK 64
Query: 434 TAKEFADDVRLIFSNCYKYNP-PDHNVVTMARQLSAVFEDRFAKMPD 479
+ EF DD+ LI+ NC YN P H + A + E +PD
Sbjct: 65 SKAEFVDDLNLIWKNCLLYNSDPSHPLRRHANFMKKKAEHLLKLIPD 111
Score = 61.7 bits (150), Expect = 1e-11
Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 140 PHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCY 199
H+ PF V PDY+ +I +PMDLGT+ K+L+N Y S E + D ++ NC
Sbjct: 24 EHSTPFLTKVSKREA--PDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCL 81
Query: 200 VYN 202
+YN
Sbjct: 82 LYN 84
>gnl|CDD|99943 cd05511, Bromo_TFIID, Bromodomain, TFIID-like subfamily. Human
TAFII250 (or TAF250) is the largest subunit of TFIID, a
large multi-domain complex, which initiates the assembly
of the transcription machinery. TAFII250 contains two
bromodomains that specifically bind to acetylated
histone H4. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 112
Score = 89.2 bits (222), Expect = 3e-21
Identities = 39/103 (37%), Positives = 63/103 (61%), Gaps = 5/103 (4%)
Query: 378 SEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKE 437
EI+ EL K+ +WPF+TPV+ +K+ DY+ IIK+PMDL T+RKK+ Y++ +E
Sbjct: 6 DEIVNEL---KNLPDSWPFHTPVNKKKV--PDYYKIIKRPMDLQTIRKKISKHKYQSREE 60
Query: 438 FADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKMPDE 480
F +D+ LI N YN PD A+++ + E+ A+ ++
Sbjct: 61 FLEDIELIVDNSVLYNGPDSVYTKKAKEMLELAEELLAEREEK 103
Score = 76.1 bits (188), Expect = 1e-16
Identities = 31/94 (32%), Positives = 54/94 (57%), Gaps = 2/94 (2%)
Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
L +I+ ++ + P +WPFH PV+ +PDY+K+I +PMDL TI+K++ + Y S
Sbjct: 1 LSFILDEIVNELKNLPDSWPFHTPVNK--KKVPDYYKIIKRPMDLQTIRKKISKHKYQSR 58
Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQL 218
+E ++D + N +YN P A+ + +L
Sbjct: 59 EEFLEDIELIVDNSVLYNGPDSVYTKKAKEMLEL 92
>gnl|CDD|99934 cd05502, Bromo_tif1_like, Bromodomain; tif1_like subfamily. Tif1
(transcription intermediary factor 1) is a member of the
tripartite motif (TRIM) protein family, which is
characterized by a particular domain architecture. It
functions by recruiting coactivators and/or corepressors
to modulate transcription. Vertebrate Tif1-gamma, also
labeled E3 ubiquitin-protein ligase TRIM33, plays a role
in the control of hematopoiesis. Its homologue in
Xenopus laevis, Ectodermin, has been shown to function
in germ-layer specification and control of cell growth
during embryogenesis. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 109
Score = 86.2 bits (214), Expect = 3e-20
Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 11/111 (9%)
Query: 370 TPLWYKYCSEIIAELF-HKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMR 428
+P+ + C ++ EL+ H+ PF+ PV + +Y+ IIK PMDL +RKK++
Sbjct: 2 SPIDQRKCERLLLELYCHELSL----PFHEPV---SPSVPNYYKIIKTPMDLSLIRKKLQ 54
Query: 429 NRT---YKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAK 476
++ Y + +EF DVRL+F NCYK+N D V ++L FE++ +
Sbjct: 55 PKSPQHYSSPEEFVADVRLMFKNCYKFNEEDSEVAQAGKELELFFEEQLKE 105
Score = 76.6 bits (189), Expect = 7e-17
Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 6/82 (7%)
Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLE---NNYYWSGKEAIQDFTTMFTNCYV 200
PFHEPV P+Y+K+I PMDL I+K+L+ +Y S +E + D MF NCY
Sbjct: 24 PFHEPVSPSV---PNYYKIIKTPMDLSLIRKKLQPKSPQHYSSPEEFVADVRLMFKNCYK 80
Query: 201 YNKPGEDVVLMAQNLEQLFLTK 222
+N+ +V + LE F +
Sbjct: 81 FNEEDSEVAQAGKELELFFEEQ 102
>gnl|CDD|99935 cd05503, Bromo_BAZ2A_B_like, Bromodomain, BAZ2A/BAZ2B_like
subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and
2B (BAZ2B) were identified as a novel human bromodomain
gene by cDNA library screening. BAZ2A is also known as
Tip5 (Transcription termination factor I-interacting
protein 5) and hWALp3. The proteins may play roles in
transcriptional regulation. Human Tip5 is part of a
complex termed NoRC (nucleolar remodeling complex),
which induces nucleosome sliding and may play a role in
the regulation of the rDNA locus. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 97
Score = 83.2 bits (206), Expect = 2e-19
Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 5/97 (5%)
Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
C I+ E+ + H++ AWPF PV+ KL + Y IIKKPMD T+R+K+ + YKT +
Sbjct: 5 CETILDEM--EAHED-AWPFLEPVNT-KL-VPGYRKIIKKPMDFSTIREKLESGQYKTLE 59
Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDR 473
EFA+DVRL+F NC +N D V + FE R
Sbjct: 60 EFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKR 96
Score = 73.2 bits (180), Expect = 8e-16
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 137 WKHPHAWPFHEPVDAINLNL-PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMF 195
H AWPF EPV N L P Y K+I +PMD TI+++LE+ Y + +E +D +F
Sbjct: 13 EAHEDAWPFLEPV---NTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVF 69
Query: 196 TNCYVYNKPGEDVVLMAQNLEQLFLTK 222
NC +N+ +V N+ + F +
Sbjct: 70 DNCETFNEDDSEVGRAGHNMRKFFEKR 96
>gnl|CDD|99928 cd05496, Bromo_WDR9_II, Bromodomain; WDR9 repeat II_like subfamily.
WDR9 is a human gene located in the Down Syndrome
critical region-2 of chromosome 21. It encodes for a
nuclear protein containing WD40 repeats and two
bromodomains, which may function as a transcriptional
regulator involved in chromatin remodeling and play a
role in embryonic development. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 119
Score = 79.0 bits (195), Expect = 1e-17
Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 6/106 (5%)
Query: 373 WYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTY 432
W K C E++ ++ + PF PVD+ K DY DII PMDLGTV++ + Y
Sbjct: 6 WKKQCKELVNLMWDCEDSE---PFRQPVDLLKY--PDYRDIIDTPMDLGTVKETLFGGNY 60
Query: 433 KTAKEFADDVRLIFSNCYKYNPPDHN-VVTMARQLSAVFEDRFAKM 477
EFA DVRLIFSN Y P + + +M +LSA+FE+ K+
Sbjct: 61 DDPMEFAKDVRLIFSNSKSYTPNKRSRIYSMTLRLSALFEEHIKKI 106
Score = 55.9 bits (135), Expect = 2e-09
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 130 KNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQ 189
K ++ +W + PF +PVD L PDY +I PMDLGT+K+ L Y E +
Sbjct: 11 KELVNLMWDCEDSEPFRQPVD--LLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPMEFAK 68
Query: 190 DFTTMFTNCYVYN 202
D +F+N Y
Sbjct: 69 DVRLIFSNSKSYT 81
>gnl|CDD|99950 cd05519, Bromo_SNF2, Bromodomain, SNF2-like subfamily, specific to
fungi. SNF2 is a yeast protein involved in
transcriptional activation, it is the catalytic
component of the SWI/SNF ATP-dependent chromatin
remodeling complex. The protein is essential for the
regulation of gene expression (both positive and
negative) of a large number of genes. The SWI/SNF
complex changes chromatin structure by altering
DNA-histone contacts within the nucleosome, which
results in a re-positioning of the nucleosome and
facilitates or represses the binding of gene-specific
transcription factors. Bromodomains are 110 amino acid
long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 103
Score = 73.1 bits (180), Expect = 8e-16
Identities = 26/67 (38%), Positives = 38/67 (56%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
PDY+ +I +P+ L IK+R+E Y S +E ++DF MF N YN+ G V A +E
Sbjct: 37 PDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYNQEGSIVYEDAVEME 96
Query: 217 QLFLTKI 223
+ F K
Sbjct: 97 KAFKKKY 103
Score = 53.9 bits (130), Expect = 6e-09
Identities = 18/45 (40%), Positives = 31/45 (68%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
DY+ IIK+P+ L +++++ R YK+ +EF +D L+F+N YN
Sbjct: 38 DYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYN 82
>gnl|CDD|99937 cd05505, Bromo_WSTF_like, Bromodomain; Williams syndrome
transcription factor-like subfamily (WSTF-like). The
Williams-Beuren syndrome deletion transcript 9 is a
putative transcriptional regulator. WSTF was found to
play a role in vitamin D-mediated transcription as part
of two chromatin remodeling complexes, WINAC and WICH.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 97
Score = 70.3 bits (172), Expect = 1e-14
Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 377 CSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAK 436
C EI++++ + +WPF PV ++ DY +I PMDL T++ K +Y + +
Sbjct: 5 CEEILSKILKYRF---SWPFREPVTADEA--EDYKKVITNPMDLQTMQTKCSCGSYSSVQ 59
Query: 437 EFADDVRLIFSNCYKYNPPDHNVVTMARQ 465
EF DD++L+FSN KY V++ R+
Sbjct: 60 EFLDDMKLVFSNAEKYYENGSYVLSCMRK 88
Score = 61.0 bits (148), Expect = 1e-11
Identities = 30/82 (36%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Query: 138 KHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
K+ +WPF EPV A DY KVIT PMDL T++ + Y S +E + D +F+N
Sbjct: 14 KYRFSWPFREPVTADEA--EDYKKVITNPMDLQTMQTKCSCGSYSSVQEFLDDMKLVFSN 71
Query: 198 CYVYNKPGEDVVLMAQNLEQLF 219
Y + G V+ + EQ
Sbjct: 72 AEKYYENGSYVLSCMRKTEQCC 93
>gnl|CDD|227408 COG5076, COG5076, Transcription factor involved in chromatin
remodeling, contains bromodomain [Chromatin structure
and dynamics / Transcription].
Length = 371
Score = 75.6 bits (186), Expect = 2e-14
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 2/112 (1%)
Query: 369 KTPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMR 428
K ++ +LF + + + F K DY++IIK PMDL T++KK++
Sbjct: 142 LLYADNKAIAKFKKQLFLRDGRFLSSIF--LGLPSKREYPDYYEIIKSPMDLLTIQKKLK 199
Query: 429 NRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDRFAKMPDE 480
N YK+ +EF D+ L+F NC YN PD +V A++L F ++P+E
Sbjct: 200 NGRYKSFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIPEE 251
Score = 74.5 bits (183), Expect = 4e-14
Identities = 49/191 (25%), Positives = 74/191 (38%), Gaps = 14/191 (7%)
Query: 123 NQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYW 182
N+ K + + F PDY+++I PMDL TI+K+L+N Y
Sbjct: 147 NKAIAKFKKQLFLRDGRFLSSIFLGLPSKREY--PDYYEIIKSPMDLLTIQKKLKNGRYK 204
Query: 183 SGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGMPSEEVVLDAPQPRSSK 242
S +E + D MF NC +YN P V + A+ LE+ FL I +P EE++ + +P +
Sbjct: 205 SFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIP-EEMLELSIKPGREE 263
Query: 243 KKP-----------PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTT 291
++ V A P L PV V K + T
Sbjct: 264 REERESVLITNSQAHVGAWPFLRPVSDEEVPDYYKDIRDPMDLSTKELKLRNNYYRPEET 323
Query: 292 APKVNHLNSMN 302
+ L N
Sbjct: 324 FVRDAKLFFDN 334
Score = 32.9 bits (75), Expect = 0.70
Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 3/88 (3%)
Query: 399 PVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHN 458
P E+ G + F K M + + +R + DD+ Y+P + N
Sbjct: 37 PNAPEEEGSKNLFQKQLKRMPKEYITSIVDDREPGSMANVNDDLE--NVGGITYSPFEKN 94
Query: 459 VVTMARQLSAVFEDRFAKMPDESNLASR 486
R VF + P ES L S
Sbjct: 95 RPESLRFDEIVFLAIESVTP-ESGLGSL 121
Score = 30.2 bits (68), Expect = 4.7
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 378 SEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKE 437
+ + + AWPF PV E++ DY+ I+ PMDL T K+RN Y+ +
Sbjct: 266 ERESVLITNSQAHVGAWPFLRPVSDEEV--PDYYKDIRDPMDLSTKELKLRNNYYRPEET 323
Query: 438 FADDVRLIFSNCYKYN 453
F D +L F NC YN
Sbjct: 324 FVRDAKLFFDNCVMYN 339
>gnl|CDD|99939 cd05507, Bromo_brd8_like, Bromodomain, brd8_like subgroup. In
mammals, brd8 (bromodomain containing 8) interacts with
the thyroid hormone receptor in a ligand-dependent
fashion and enhances thyroid hormone-dependent
activation from thyroid response elements. Brd8 is
thought to be a nuclear receptor coactivator.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 104
Score = 67.4 bits (165), Expect = 9e-14
Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 9/88 (10%)
Query: 130 KNVMKAVWK----HPHAWPFHEPV-DAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSG 184
K ++ V++ H +A F +PV + I P YH V+ +PMDL TIKK +EN S
Sbjct: 6 KAIL-LVYRTLASHRYASVFLKPVTEDIA---PGYHSVVYRPMDLSTIKKNIENGTIRST 61
Query: 185 KEAIQDFTTMFTNCYVYNKPGEDVVLMA 212
E +D MF N +YN DV LMA
Sbjct: 62 AEFQRDVLLMFQNAIMYNSSDHDVYLMA 89
Score = 61.2 bits (149), Expect = 1e-11
Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 392 YAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYK 451
YA F PV + Y ++ +PMDL T++K + N T ++ EF DV L+F N
Sbjct: 20 YASVFLKPVTED--IAPGYHSVVYRPMDLSTIKKNIENGTIRSTAEFQRDVLLMFQNAIM 77
Query: 452 YNPPDHNVVTMARQL 466
YN DH+V MA ++
Sbjct: 78 YNSSDHDVYLMAVEM 92
>gnl|CDD|99957 cd05528, Bromo_AAA, Bromodomain; sub-family co-occurring with AAA
domains. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine. The structure(2DKW) in this alignment is an
uncharacterized protein predicted from analysis of cDNA
clones from human fetal liver.
Length = 112
Score = 67.8 bits (166), Expect = 1e-13
Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 15/104 (14%)
Query: 395 PFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNP 454
F PVD E++ DY++IIK+PMDL T+ +K+ Y TAK+F D+ LI +N +YNP
Sbjct: 23 AFTKPVDEEEV--PDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYNP 80
Query: 455 PDHNVVTMARQLSAVFEDRFAKMPDESNLASRAAASVSSDDDSE 498
+ R + L A + ++ D
Sbjct: 81 DRDPADKLIRSRA-------------CELRDEVHAMIEAELDPL 111
Score = 53.5 bits (129), Expect = 9e-09
Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 6/92 (6%)
Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYN- 202
F +PVD +PDY+++I QPMDL TI ++L+ + Y + K+ ++D + TN YN
Sbjct: 23 AFTKPVDE--EEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYNP 80
Query: 203 ---KPGEDVVLMAQNLEQLFLTKITGMPSEEV 231
+ + A L I
Sbjct: 81 DRDPADKLIRSRACELRDEVHAMIEAELDPLF 112
>gnl|CDD|99944 cd05512, Bromo_brd1_like, Bromodomain; brd1_like subfamily. BRD1 is
a mammalian gene which encodes for a nuclear protein
assumed to be a transcriptional regulator. BRD1 has been
implicated with brain development and susceptibility to
schizophrenia and bipolar affective disorder.
Bromodomains are 110 amino acid long domains that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 98
Score = 65.1 bits (159), Expect = 5e-13
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 2/61 (3%)
Query: 396 FYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPP 455
F PVD+ ++ DY D IK+PMD T+RKK+ ++ Y+T ++F D LI +NC YN
Sbjct: 22 FSEPVDLSEV--PDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYNAK 79
Query: 456 D 456
D
Sbjct: 80 D 80
Score = 57.4 bits (139), Expect = 3e-10
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 145 FHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYN 202
F EPVD +PDY I QPMD T++K+LE+ Y + ++ DF + NC YN
Sbjct: 22 FSEPVDL--SEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYN 77
>gnl|CDD|99958 cd05529, Bromo_WDR9_I_like, Bromodomain; WDR9 repeat I_like
subfamily. WDR9 is a human gene located in the Down
Syndrome critical region-2 of chromosome 21. It encodes
for a nuclear protein containing WD40 repeats and two
bromodomains, which may function as a transcriptional
regulator involved in chromatin remodeling and play a
role in embryonic development. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 128
Score = 65.1 bits (159), Expect = 1e-12
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 144 PFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNK 203
F PVD + PDY + PMDL TI+ RLEN YY S + D + +N +N+
Sbjct: 47 YFEYPVD-LRAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRHDVRLILSNAETFNE 105
Query: 204 PGEDVVLMAQNLEQLFL 220
P ++ A+ L L
Sbjct: 106 PNSEIAKKAKRLSDWLL 122
Score = 63.9 bits (156), Expect = 3e-12
Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 381 IAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFAD 440
+ +L A F PVD+ + DY++ + PMDL T+R ++ NR Y++ +
Sbjct: 33 LDKLLLSLQLEIAEYFEYPVDL-RAWYPDYWNRVPVPMDLETIRSRLENRYYRSLEALRH 91
Query: 441 DVRLIFSNCYKYNPPDHNVVTMARQLSAVFED 472
DVRLI SN +N P+ + A++LS
Sbjct: 92 DVRLILSNAETFNEPNSEIAKKAKRLSDWLLR 123
>gnl|CDD|99945 cd05513, Bromo_brd7_like, Bromodomain, brd7_like subgroup. The BRD7
gene encodes a nuclear protein that has been shown to
inhibit cell growth and the progression of the cell
cycle by regulating cell-cycle genes at the
transcriptional level. BRD7 has been identified as a
gene involved in nasopharyngeal carcinoma. The protein
interacts with acetylated histone H3 via its
bromodomain. Bromodomains are 110 amino acid long
domains that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 98
Score = 61.7 bits (150), Expect = 1e-11
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 8/82 (9%)
Query: 125 LQYIVKNVMKAVWKHPHAWPFHEPV-DAINLNLPDYHKVITQPMDLGTIKKRLENNYYWS 183
L+ +++ + + K PH + F PV D I P Y +I PMD T+K++++NN Y S
Sbjct: 6 LEQLIRQLQR---KDPHGF-FAFPVTDFI---APGYSSIIKHPMDFSTMKEKIKNNDYQS 58
Query: 184 GKEAIQDFTTMFTNCYVYNKPG 205
+E DF M N YNKP
Sbjct: 59 IEEFKDDFKLMCENAMKYNKPD 80
Score = 59.3 bits (144), Expect = 5e-11
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 8/85 (9%)
Query: 374 YKYCSEIIAELFHKK--HQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRT 431
+ E + +K H +A+P V Y IIK PMD T+++K++N
Sbjct: 2 LQKALEQLIRQLQRKDPHGFFAFP------VTDFIAPGYSSIIKHPMDFSTMKEKIKNND 55
Query: 432 YKTAKEFADDVRLIFSNCYKYNPPD 456
Y++ +EF DD +L+ N KYN PD
Sbjct: 56 YQSIEEFKDDFKLMCENAMKYNKPD 80
>gnl|CDD|99946 cd05515, Bromo_polybromo_V, Bromodomain, polybromo repeat V.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 105
Score = 61.2 bits (149), Expect = 2e-11
Identities = 20/48 (41%), Positives = 30/48 (62%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPD 456
DY+D+IKKP+D+ +R K+ Y++ + D L+F N KYN PD
Sbjct: 38 DYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPD 85
Score = 60.4 bits (147), Expect = 3e-11
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
PDY+ VI +P+D+ I+ ++E N Y S + + DF MF N YN+P + A L
Sbjct: 36 YPDYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQIYKDALTL 95
Query: 216 EQLFLTKITG 225
+++ L
Sbjct: 96 QKVLLETKRE 105
>gnl|CDD|99949 cd05518, Bromo_polybromo_IV, Bromodomain, polybromo repeat IV.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 59.8 bits (145), Expect = 5e-11
Identities = 26/66 (39%), Positives = 39/66 (59%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
PDY+K+I +P+DL TI+ + N+ Y + +E + DF MF N YN+ G V A LE
Sbjct: 37 PDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILE 96
Query: 217 QLFLTK 222
++ K
Sbjct: 97 KVLKEK 102
Score = 54.8 bits (132), Expect = 3e-09
Identities = 24/66 (36%), Positives = 35/66 (53%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
DY+ II +P+DL T+ +RN Y T +E DD +L+F N YN V A L
Sbjct: 38 DYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILEK 97
Query: 469 VFEDRF 474
V +++
Sbjct: 98 VLKEKR 103
>gnl|CDD|99940 cd05508, Bromo_RACK7, Bromodomain, RACK7_like subfamily. RACK7
(also called human protein kinase C-binding protein) was
identified as a potential tumor suppressor genes, it
shares domain architecture with BS69/ZMYND11; both have
been implicated in the regulation of cellular
proliferation. Bromodomains are 110 amino acid long
domains, that are found in many chromatin associated
proteins. Bromodomains can interact specifically with
acetylated lysine.
Length = 99
Score = 58.9 bits (143), Expect = 9e-11
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 390 QNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNC 449
Q A PF PVD+E+ DY + KPMDL T+ K +R + Y + F D + I N
Sbjct: 17 QPGAEPFLKPVDLEQF--PDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNA 74
Query: 450 YKYNPPDHNVVTMARQL 466
YN DH + A+ +
Sbjct: 75 IIYNGGDHKLTQAAKAI 91
Score = 44.3 bits (105), Expect = 1e-05
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 122 TNQLQYIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLENNYY 181
+QL ++K ++ + K P A PF +PVD PDY + + +PMDL T++K + Y
Sbjct: 1 VDQLSKLLKFALERM-KQPGAEPFLKPVDL--EQFPDYAQYVFKPMDLSTLEKNVRKKAY 57
Query: 182 WSGKEAIQDFTTMFTNCYVYN 202
S + D + N +YN
Sbjct: 58 GSTDAFLADAKWILHNAIIYN 78
>gnl|CDD|99933 cd05501, Bromo_SP100C_like, Bromodomain, SP100C_like subfamily. The
SP100C protein is a splice variant of SP100, a major
component of PML-SP100 nuclear bodies (NBs), which are
poorly understood. It is covalently modified by SUMO-1
and may play a role in processes at the chromatin level.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 102
Score = 57.1 bits (138), Expect = 5e-10
Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
DY IK+PM L V++++ R Y T + F D+RLIF N + D + + L
Sbjct: 32 DYCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYKDD-DFGQVGITLEK 90
Query: 469 VFEDRF 474
FE F
Sbjct: 91 KFEKNF 96
Score = 42.8 bits (101), Expect = 4e-05
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query: 158 DYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLEQ 217
DY + I +PM L +K+RL Y + + ++D +F N ++ K +D + LE+
Sbjct: 32 DYCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYK-DDDFGQVGITLEK 90
Query: 218 LF 219
F
Sbjct: 91 KF 92
>gnl|CDD|99951 cd05520, Bromo_polybromo_III, Bromodomain, polybromo repeat III.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 54.7 bits (132), Expect = 3e-09
Identities = 21/61 (34%), Positives = 35/61 (57%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQLSA 468
DY+ IK P+ L +R K++N Y+T +E D+ L+F N +YN P+ + A +L
Sbjct: 38 DYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQK 97
Query: 469 V 469
+
Sbjct: 98 L 98
Score = 46.9 bits (112), Expect = 2e-06
Identities = 22/66 (33%), Positives = 34/66 (51%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
PDY++ I P+ L I+ +L+N Y + +E D MF N YN P + A+ L+
Sbjct: 37 PDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQ 96
Query: 217 QLFLTK 222
+L K
Sbjct: 97 KLMQAK 102
>gnl|CDD|99953 cd05522, Bromo_Rsc1_2_II, Bromodomain, repeat II in Rsc1/2_like
subfamily, specific to fungi. Rsc1 and Rsc2 are
components of the RSC complex (remodeling the structure
of chromatin), are essential for transcriptional
control, and have a specific domain architecture
including two bromodomains. The RSC complex has also
been linked to homologous recombination and
nonhomologous end-joining repair of DNA double strand
breaks. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 104
Score = 53.0 bits (128), Expect = 1e-08
Identities = 19/58 (32%), Positives = 31/58 (53%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMARQL 466
+Y+ I P+ L ++KK++ R YK+ +F +D+ L+F N YN D A L
Sbjct: 39 EYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLL 96
Score = 52.6 bits (127), Expect = 1e-08
Identities = 19/63 (30%), Positives = 32/63 (50%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
P+Y++ I+ P+ L IKK+++ Y S + + D MF N +YN+ A LE
Sbjct: 38 PEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLLE 97
Query: 217 QLF 219
+
Sbjct: 98 KEA 100
>gnl|CDD|99947 cd05516, Bromo_SNF2L2, Bromodomain, SNF2L2-like subfamily, specific
to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related
matrix-associated actin-dependent regulator of chromatin
subfamily A member 2 is a global transcriptional
activator, which cooperates with nuclear hormone
receptors to boost transcriptional activation.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 107
Score = 51.7 bits (124), Expect = 4e-08
Identities = 18/64 (28%), Positives = 36/64 (56%)
Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
LP+Y+++I +P+D IK+R+ N+ Y S ++ +D + N +N G + + L
Sbjct: 37 LPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNLEGSLIYEDSIVL 96
Query: 216 EQLF 219
+ +F
Sbjct: 97 QSVF 100
Score = 46.3 bits (110), Expect = 3e-06
Identities = 14/47 (29%), Positives = 31/47 (65%)
Query: 407 LTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
L +Y+++I+KP+D +++++RN Y++ ++ DV L+ N +N
Sbjct: 37 LPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFN 83
>gnl|CDD|99952 cd05521, Bromo_Rsc1_2_I, Bromodomain, repeat I in Rsc1/2_like
subfamily, specific to fungi. Rsc1 and Rsc2 are
components of the RSC complex (remodeling the structure
of chromatin), are essential for transcriptional
control, and have a specific domain architecture
including two bromodomains. The RSC complex has also
been linked to homologous recombination and
nonhomologous end-joining repair of DNA double strand
breaks. Bromodomains are 110 amino acid long domains,
that are found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 106
Score = 49.2 bits (118), Expect = 2e-07
Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 2/60 (3%)
Query: 404 KLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVVTMA 463
+ DY+ IIK P+ L TV+K++ + Y A+EF +D+ I N YN + A
Sbjct: 34 RKDYPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYA 91
Score = 46.5 bits (111), Expect = 2e-06
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 156 LPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNL 215
PDY+K+I P+ L T+KKRL + Y + +E + D + N +YN G + A L
Sbjct: 37 YPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYALIL 94
Query: 216 EQLFLTKI 223
E+ I
Sbjct: 95 EKYINDVI 102
>gnl|CDD|99954 cd05524, Bromo_polybromo_I, Bromodomain, polybromo repeat I.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 113
Score = 46.2 bits (110), Expect = 3e-06
Identities = 12/48 (25%), Positives = 26/48 (54%)
Query: 409 DYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPD 456
+Y++++ P+DL +++K++ Y + D L+ +N Y PD
Sbjct: 40 EYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPD 87
Score = 41.6 bits (98), Expect = 1e-04
Identities = 20/64 (31%), Positives = 33/64 (51%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
P+Y++V++ P+DL I+++L+ Y + DF + N Y KP A L
Sbjct: 39 PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPDSPEHKDACKLW 98
Query: 217 QLFL 220
+LFL
Sbjct: 99 ELFL 102
>gnl|CDD|99948 cd05517, Bromo_polybromo_II, Bromodomain, polybromo repeat II.
Polybromo is a nuclear protein of unknown function,
which contains 6 bromodomains. The human ortholog BAF180
is part of a SWI/SNF chromatin-remodeling complex, and
it may carry out the functions of Yeast Rsc-1 and Rsc-2.
It was shown that polybromo bromodomains bind to histone
H3 at specific acetyl-lysine positions. Bromodomains are
found in many chromatin-associated proteins and in
nuclear histone acetyltransferases. They interact
specifically with acetylated lysine, but not all the
bromodomains in polybromo may bind to acetyl-lysine.
Length = 103
Score = 45.9 bits (109), Expect = 4e-06
Identities = 24/94 (25%), Positives = 46/94 (48%), Gaps = 13/94 (13%)
Query: 380 IIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFA 439
+I+ELF K +P DY+ +IK+P+DL T+ +++++ YK+ ++
Sbjct: 22 LISELFQKLPSKVLYP-------------DYYAVIKEPIDLKTIAQRIQSGYYKSIEDME 68
Query: 440 DDVRLIFSNCYKYNPPDHNVVTMARQLSAVFEDR 473
D+ L+ N +N P V A + +F +
Sbjct: 69 KDLDLMVKNAKTFNEPGSQVYKDANAIKKIFTAK 102
Score = 41.7 bits (98), Expect = 1e-04
Identities = 23/63 (36%), Positives = 39/63 (61%)
Query: 157 PDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLMAQNLE 216
PDY+ VI +P+DL TI +R+++ YY S ++ +D M N +N+PG V A ++
Sbjct: 37 PDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANAIK 96
Query: 217 QLF 219
++F
Sbjct: 97 KIF 99
>gnl|CDD|99926 cd05494, Bromodomain_1, Bromodomain; uncharacterized subfamily.
Bromodomains are found in many chromatin-associated
proteins and in nuclear histone acetyltransferases. They
interact specifically with acetylated lysine.
Length = 114
Score = 45.1 bits (107), Expect = 9e-06
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 3/65 (4%)
Query: 374 YKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLGTVRK---KMRNR 430
Y+ ++ EL + AWPF PV+ + G DY D+IK+PM GT + R
Sbjct: 2 YEALERVLRELKRHRRNEDAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVNNIVETGAR 61
Query: 431 TYKTA 435
+
Sbjct: 62 DLEDL 66
Score = 44.0 bits (104), Expect = 2e-05
Identities = 18/51 (35%), Positives = 24/51 (47%)
Query: 127 YIVKNVMKAVWKHPHAWPFHEPVDAINLNLPDYHKVITQPMDLGTIKKRLE 177
V +K ++ AWPF EPV+ PDY VI +PM GT +
Sbjct: 6 ERVLRELKRHRRNEDAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVNNIV 56
>gnl|CDD|99955 cd05525, Bromo_ASH1, Bromodomain; ASH1_like sub-family. ASH1
(absent, small, or homeotic 1) is a member of the
trithorax-group in Drosophila melanogaster, an
epigenetic transcriptional regulator of HOX genes.
Drosophila ASH1 has been shown to methylate specific
lysines in histones H3 and H4. Mammalian ASH1 has been
shown to methylate histone H3. Bromodomains are 110
amino acid long domains, that are found in many
chromatin associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 106
Score = 43.9 bits (104), Expect = 2e-05
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 404 KLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYN 453
K DY++ I P+DL T+ K++ YKT + F D+ +F N KY
Sbjct: 35 KKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYY 84
Score = 36.6 bits (85), Expect = 0.006
Identities = 18/70 (25%), Positives = 30/70 (42%)
Query: 152 INLNLPDYHKVITQPMDLGTIKKRLENNYYWSGKEAIQDFTTMFTNCYVYNKPGEDVVLM 211
PDY++ IT P+DL TI+K++ YY + + D +F N Y +
Sbjct: 34 SKKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYYGRKSPIGRD 93
Query: 212 AQNLEQLFLT 221
L + +
Sbjct: 94 VCRLRKAYYQ 103
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 47.0 bits (112), Expect = 4e-05
Identities = 61/382 (15%), Positives = 125/382 (32%), Gaps = 40/382 (10%)
Query: 401 DVEKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSNCYKYNPPDHNVV 460
+++KLG + DIIKK + T ++ A + D+ L + Y Y ++
Sbjct: 1039 ELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDY------LL 1092
Query: 461 TMARQLSAVFEDRFAKMPDESNLASRAAASVSSDDDSE---DERQNQLKYLQEQLKSLTD 517
+M + ++ +++ K+ E + + + + ++ + L+EQ + +
Sbjct: 1093 SMP--IWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEE--VE 1148
Query: 518 QIRLLVEDSTKPKKKKK--KNRDQPKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKS 575
+ + E K K K K K R K + + + + + + +
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRK 1208
Query: 576 LNNVRKPQASNPQQAKKPKPNNANTVAAKKQV-RTFDSEDEDVAKPMSYDEKRQLSLDIN 634
L++ + SN + + T K V R ++ DE L
Sbjct: 1209 LDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKE 1268
Query: 635 KLPGDKLGKVVHIIQSREPSLREPNPDEIEIDFETLKPSTLRELEQYVSSCLRKRTYKKT 694
P + K V +Q P + E + P+ K+ KK
Sbjct: 1269 GKPKNAP-KRVSAVQYSPPPPSKRPDGESNGGSKPSSPT--------------KKKVKKR 1313
Query: 695 PKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSD 754
+ +KK +K + S K + + + S + S SSS+
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKS-------KTRVKQASASQSSRLLRRPRKKKSDSSSE 1366
Query: 755 SD--SSGQSTSSSSSESSDSEG 774
D S + E + +
Sbjct: 1367 DDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 41.8 bits (98), Expect = 0.002
Identities = 31/105 (29%), Positives = 43/105 (40%), Gaps = 5/105 (4%)
Query: 680 QYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAG- 738
Q+VS + + LE L+ +S+ + +KP +
Sbjct: 327 QHVSPGTQNQKQYDPSSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSA 386
Query: 739 ----PTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEKGPH 779
P A+ S SSSSSS S+SS S S S S SSDSE + P
Sbjct: 387 PSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPPR 431
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 38.1 bits (89), Expect = 0.013
Identities = 16/78 (20%), Positives = 25/78 (32%), Gaps = 1/78 (1%)
Query: 15 NQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPS 74
+QN G P+ + ++P S T +PA ++ APS
Sbjct: 155 SQNS-GQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS 213
Query: 75 HHNESNSGNSTKAASVEP 92
N + AA P
Sbjct: 214 QANVDTAATPAPAAPATP 231
Score = 35.4 bits (82), Expect = 0.094
Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 7/79 (8%)
Query: 27 AEQDVKVNNNNTSDS----TSSSSPPPSSTEVTPVKKSPAPPSSSS---SAPAPSHHNES 79
A+Q + N+ S TS+++ P ++ + S + +APAP+ +
Sbjct: 147 ADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206
Query: 80 NSGNSTKAASVEPPPRDEP 98
N+ + A+V+ P
Sbjct: 207 NAVVAPSQANVDTAATPAP 225
Score = 34.2 bits (79), Expect = 0.21
Identities = 20/102 (19%), Positives = 34/102 (33%), Gaps = 7/102 (6%)
Query: 2 SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
SA ++ + V T V+ T+ T + + P+ V P + +
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPA-VDPQQNAV 209
Query: 62 APPSSSS------SAPAPSHHNESNSGNSTKAASVEPPPRDE 97
PS ++ APA + + T A V P D
Sbjct: 210 VAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251
>gnl|CDD|99924 cd05492, Bromo_ZMYND11, Bromodomain; ZMYND11_like sub-family.
ZMYND11 or BS69 is a ubiquitously expressed nuclear
protein that has been shown to associate with chromatin.
It interacts with chromatin remodeling factors and might
play a role in chromatin remodeling and gene expression.
Bromodomains are 110 amino acid long domains, that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 109
Score = 35.4 bits (82), Expect = 0.021
Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 8/57 (14%)
Query: 133 MKAVWKHPHAW-PFHEPVDAINLN-------LPDYHKVITQPMDLGTIKKRLENNYY 181
+K + +W P AI LN LP ++I +D+ I++++ + Y
Sbjct: 5 LKFIVSRMKSWLPPDTTNRAIVLNKRGKATKLPKRRRLIHTHLDVADIQEKINSEKY 61
Score = 30.8 bits (70), Expect = 0.86
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 414 IKKPMDLGTVRKKMRNRTYKTAKEFADDVRLIFSN 448
I +D+ +++K+ + Y + +EF D L+ N
Sbjct: 43 IHTHLDVADIQEKINSEKYTSLEEFKADALLLLHN 77
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 37.2 bits (87), Expect = 0.043
Identities = 20/98 (20%), Positives = 36/98 (36%), Gaps = 8/98 (8%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
+ + ++++ S+ P K+PA P+ + A A + + AA+ PP
Sbjct: 44 PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAA 103
Query: 98 PRLEPVDGIVQPPVVPPKHRPGRNTNQLQYIVKNVMKA 135
P V+ V P + + KN M A
Sbjct: 104 AAAAPAAAAVEDEVTPLRGAAAA-------VAKN-MDA 133
Score = 30.6 bits (70), Expect = 4.5
Identities = 10/73 (13%), Positives = 19/73 (26%)
Query: 1 MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
T T + ++ + PP +
Sbjct: 36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95
Query: 61 PAPPSSSSSAPAP 73
PA P ++++A AP
Sbjct: 96 PAAPPAAAAAAAP 108
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 36.2 bits (84), Expect = 0.047
Identities = 18/115 (15%), Positives = 28/115 (24%)
Query: 10 TDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSS 69
+ + + + + + SSSP S P S SS
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214
Query: 70 APAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
AP+ + + PP P PV + P T
Sbjct: 215 APSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPA 269
Score = 30.4 bits (69), Expect = 3.2
Identities = 20/111 (18%), Positives = 33/111 (29%), Gaps = 2/111 (1%)
Query: 1 MSAAETETATDPVVNQNGTGDPDPV--KAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVK 58
++ + D DP + +S S SS P PSS +
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPP 225
Query: 59 KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQP 109
SP P++ S P P+ ++ +P P G +
Sbjct: 226 PSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 36.5 bits (85), Expect = 0.053
Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 2/83 (2%)
Query: 35 NNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
+ +S+P P++ + P+ P P + APA + + + AA+ P P
Sbjct: 719 AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
Query: 95 RDEPRLEPVDGIVQPPVVPPKHR 117
E D P + R
Sbjct: 779 PSEEEEMAEDDA--PSMDDEDRR 799
Score = 33.4 bits (77), Expect = 0.55
Identities = 23/121 (19%), Positives = 35/121 (28%), Gaps = 5/121 (4%)
Query: 3 AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSP---PPSSTEVTPVKK 59
A E V + + GD P KA + ++P P+ P
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708
Query: 60 SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPG 119
PA + +A P +++ + V PP EP P PP
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP--EPDDPPDPAGAPAQPPPPPAPAP 766
Query: 120 R 120
Sbjct: 767 A 767
Score = 30.0 bits (68), Expect = 6.1
Identities = 13/79 (16%), Positives = 23/79 (29%), Gaps = 3/79 (3%)
Query: 44 SSSPPPSSTEVTPVKKSPAPPSSSSS---APAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
+++P + + A P+ +++ APAP+ S A P P
Sbjct: 403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462
Query: 101 EPVDGIVQPPVVPPKHRPG 119
P P
Sbjct: 463 SAQPAPAPAAAPEPTAAPA 481
>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin. NC10
stands for Non-helical region 10 and is taken from human
COL15A1. A mutation in this region in human COL18A1 is
associated with an increased risk of prostrate cancer.
This domain is cleaved from the precursor and forms
endostatin. Endostatin is a key tumour suppressor and
has been used highly successfully to treat cancer. It is
a potent angiogenesis inhibitor. Endostatin also binds a
zinc ion near the N-terminus; this is likely to be of
structural rather than functional importance according
to.
Length = 291
Score = 36.3 bits (84), Expect = 0.055
Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 9/88 (10%)
Query: 230 EVVLDAPQP---RSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGST 286
+V+L P ++ + PPV +P +P P P P S R P+P L S
Sbjct: 46 KVLLGELVPLPGTTATQPPPVVLTPWSDPR--LPDPPHLPDPQTHSATAHRNPHPPLNSP 103
Query: 287 ATTTTAP----KVNHLNSMNAPDTPDMK 310
A V HL ++N+P + +M+
Sbjct: 104 ARIAHVHQDFQPVLHLIALNSPLSGNMR 131
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 36.4 bits (84), Expect = 0.065
Identities = 18/77 (23%), Positives = 26/77 (33%)
Query: 43 SSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
+ PPPSS P S PS+SS+ N + + P P
Sbjct: 185 AIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPP 244
Query: 103 VDGIVQPPVVPPKHRPG 119
+ ++Q PP P
Sbjct: 245 IPPVIQYVAPPPVPPPQ 261
Score = 29.9 bits (67), Expect = 6.4
Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 10/73 (13%)
Query: 56 PVKKSPAPPSSSSSAPAPSHHNE----SNSGNSTKAASVEPPPRDEPRLEP------VDG 105
P P PPSS P S S GN S P +P +
Sbjct: 183 PGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPA 242
Query: 106 IVQPPVVPPKHRP 118
PPV+ P
Sbjct: 243 PPIPPVIQYVAPP 255
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 36.2 bits (84), Expect = 0.065
Identities = 18/116 (15%), Positives = 36/116 (31%), Gaps = 13/116 (11%)
Query: 3 AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
AE +T P P A + + + ++ P ++ + A
Sbjct: 375 PAEKKTPARPEA-------AAPAAAPVAQ------AAAAPAPAAAPAAAASAPAAPPAAA 421
Query: 63 PPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
PP+ ++ A + + + A + PP + P + V P P
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAP 477
>gnl|CDD|178145 PLN02530, PLN02530, histidine-tRNA ligase.
Length = 487
Score = 35.9 bits (83), Expect = 0.095
Identities = 14/70 (20%), Positives = 22/70 (31%), Gaps = 2/70 (2%)
Query: 41 STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
S + S P+ + S+SS + +G + PP E
Sbjct: 6 SATRVSASAIRRPSLPLSSRCSFLLSASSPRGGRCAASAAAGGGRSGGTTAPPSVQEDGK 65
Query: 101 EPVDGIVQPP 110
+D V PP
Sbjct: 66 PKID--VNPP 73
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 35.9 bits (83), Expect = 0.099
Identities = 26/118 (22%), Positives = 35/118 (29%), Gaps = 7/118 (5%)
Query: 2 SAAETETATDPVVNQNGT-GDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
A + A D + G A V + PPP P +S
Sbjct: 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANES 84
Query: 61 PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
+ P+ S S AP+ S +S +PPP P P PP P
Sbjct: 85 RSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP------PPSPAPDLSE 136
Score = 35.1 bits (81), Expect = 0.19
Identities = 20/96 (20%), Positives = 31/96 (32%), Gaps = 9/96 (9%)
Query: 36 NNTSDSTSSSSPPPSS-------TEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAA 88
+++S S SP PSS + S + SSSS+ + S + + S +
Sbjct: 287 SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS 346
Query: 89 SVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
P P P P P R +
Sbjct: 347 PSRSPSPSRPP--PPADPSSPRKRPRPSRAPSSPAA 380
Score = 33.6 bits (77), Expect = 0.53
Identities = 15/94 (15%), Positives = 21/94 (22%), Gaps = 7/94 (7%)
Query: 36 NNTSDSTSSSSPPPSSTEV-------TPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAA 88
+ + SS PP T S S P P+ + + A
Sbjct: 104 GSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
Query: 89 SVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNT 122
S R P P +T
Sbjct: 164 SDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197
Score = 33.2 bits (76), Expect = 0.72
Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 4/64 (6%)
Query: 39 SDSTSSSSPPPSSTEVTPVK---KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
S STSSSS V+P +SP+P A S + + + R
Sbjct: 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA-SAGR 384
Query: 96 DEPR 99
R
Sbjct: 385 PTRR 388
Score = 32.1 bits (73), Expect = 1.3
Identities = 15/106 (14%), Positives = 30/106 (28%), Gaps = 7/106 (6%)
Query: 18 GTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHN 77
+ ++ + + +S +S + + + S +A APS
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190
Query: 78 ESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTN 123
++ PP PR + P PGR+
Sbjct: 191 AEPPPST-------PPAAASPRPPRRSSPISASASSPAPAPGRSAA 229
Score = 31.7 bits (72), Expect = 2.1
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 730 PKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEG 774
P P+ S+ G A +SSSSS S S S++SSSSESS
Sbjct: 296 PSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340
Score = 31.3 bits (71), Expect = 2.8
Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 1/121 (0%)
Query: 1 MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
++ +E++ +N P P ++ + + SS P SS+ +
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296
Query: 61 PAPPSSSSSAPAPS-HHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPG 119
PSS S PAPS S+S +S +++S E P P
Sbjct: 297 SPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356
Query: 120 R 120
Sbjct: 357 P 357
Score = 30.9 bits (70), Expect = 3.2
Identities = 21/84 (25%), Positives = 28/84 (33%), Gaps = 13/84 (15%)
Query: 37 NTSDSTSSSSPPPSSTEVTPVKK--SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
++S S+ + S+ + SP PP SS P P PPP
Sbjct: 80 PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP-----------TPPPASPPP 128
Query: 95 RDEPRLEPVDGIVQPPVVPPKHRP 118
P L + V P PP P
Sbjct: 129 SPAPDLSEMLRPVGSPGPPPAASP 152
Score = 30.1 bits (68), Expect = 5.1
Identities = 20/105 (19%), Positives = 28/105 (26%), Gaps = 2/105 (1%)
Query: 1 MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPP-PSSTEVTPVKK 59
+ A G A + + S +S S T P
Sbjct: 55 VVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSP 114
Query: 60 SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVD 104
P PP+ ++P PS + S S PPP P
Sbjct: 115 DPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGAS 158
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 36.1 bits (83), Expect = 0.100
Identities = 18/79 (22%), Positives = 22/79 (27%)
Query: 48 PPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIV 107
PPS + PP + PA S ES + + P P
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
Query: 108 QPPVVPPKHRPGRNTNQLQ 126
P PP P R L
Sbjct: 2926 PPQPQPPPPPPPRPQPPLA 2944
Score = 34.5 bits (79), Expect = 0.32
Identities = 26/108 (24%), Positives = 34/108 (31%), Gaps = 10/108 (9%)
Query: 21 DPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSS---APAP---- 73
DP A + S + PPP+S + T P PP S + AP
Sbjct: 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862
Query: 74 SHHNESNSGNSTKAASVEPPPRDEPR---LEPVDGIVQPPVVPPKHRP 118
S S + AA PP R R + PP P +
Sbjct: 2863 RRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
Score = 31.4 bits (71), Expect = 2.7
Identities = 16/88 (18%), Positives = 29/88 (32%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
TS + +PP S+ PV P + +P P + + + + + P
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
Query: 98 PRLEPVDGIVQPPVVPPKHRPGRNTNQL 125
P P + P + R +L
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRL 2670
Score = 30.3 bits (68), Expect = 4.9
Identities = 20/70 (28%), Positives = 23/70 (32%), Gaps = 1/70 (1%)
Query: 220 LTKITGMPSEEVVLDAPQPRSSKKKP-PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRP 278
LT+ E P P P V A + P +P PL SA T P P
Sbjct: 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
Query: 279 PNPVLGSTAT 288
P P S
Sbjct: 2843 PGPPPPSLPL 2852
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
This family consists of several MGC-24 (or Cd164
antigen) proteins from eukaryotic organisms.
MGC-24/CD164 is a sialomucin expressed in many normal
and cancerous tissues. In humans, soluble and
transmembrane forms of MGC-24 are produced by
alternative splicing.
Length = 187
Score = 34.2 bits (78), Expect = 0.15
Identities = 11/65 (16%), Positives = 22/65 (33%), Gaps = 4/65 (6%)
Query: 33 VNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEP 92
V + ++ S ++T P + P + S + H N+T V P
Sbjct: 94 VVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTT----VTP 149
Query: 93 PPRDE 97
+ +
Sbjct: 150 TSQPD 154
>gnl|CDD|99925 cd05493, Bromo_ALL-1, Bromodomain, ALL-1 like proteins. ALL-1 is a
vertebrate homologue of Drosophila trithorax and is
often affected in chromosomal rearrangements that are
linked to acute leukemias, such as acute lymphocytic
leukemia (ALL). Bromodomains are found in many
chromatin-associated proteins and in nuclear histone
acetyltransferases. They interact specifically with
acetylated lysine.
Length = 131
Score = 33.2 bits (76), Expect = 0.15
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 416 KPMDLGTVRKKMRNRTYKTAKEFADDVRLIF 446
P+DL V KK+ Y + +F+DD+ I
Sbjct: 58 PPLDLEAVGKKLEAGFYTSVLDFSDDIVKII 88
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 34.4 bits (79), Expect = 0.20
Identities = 19/63 (30%), Positives = 23/63 (36%), Gaps = 5/63 (7%)
Query: 235 APQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPK 294
AP P S +P SA N +KTP P A + P + TA APK
Sbjct: 239 APVPPSEAARPK-SADAETNAALKTPA----TAPKAAAKNAKAPEAQPVSGTAAAEPAPK 293
Query: 295 VNH 297
Sbjct: 294 EAA 296
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 34.7 bits (80), Expect = 0.24
Identities = 16/70 (22%), Positives = 24/70 (34%), Gaps = 1/70 (1%)
Query: 213 QNLEQLFLTKITGMPSEEVVLDAPQPRSSKKKPPVSASPSLNPVIKTP-VIPLNKLPSAT 271
+ + S P SS P PS + ++ TP ++PL KL S +
Sbjct: 1 SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPS 60
Query: 272 STPKPRPPNP 281
PP
Sbjct: 61 PPSVTLPPAA 70
Score = 30.1 bits (68), Expect = 5.7
Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 8/64 (12%)
Query: 39 SDSTSSSSPPPSSTEVTPVKKSP---APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
S S+SS S PPSST P ++ P + + S P P S S + ++ P
Sbjct: 17 SSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKL-----SSPSPPSVTLPPAAT 71
Query: 96 DEPR 99
+
Sbjct: 72 TQTP 75
Score = 29.7 bits (67), Expect = 6.7
Identities = 17/96 (17%), Positives = 26/96 (27%), Gaps = 5/96 (5%)
Query: 41 STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
S+SS+ S + + S + S SS S + P P+L
Sbjct: 1 SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRP----RRRKPSASSLLHTPSILPLPKL 56
Query: 101 EPVDGIVQPPVVPPKHRPGRNTNQLQYIVKNVMKAV 136
+ N LQ + AV
Sbjct: 57 SSPSPPSVTLPPAATTQT-PQLNPLQRAAAAALDAV 91
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 32.0 bits (73), Expect = 0.26
Identities = 17/55 (30%), Positives = 30/55 (54%)
Query: 485 SRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQ 539
S +AS +S S +Q+K LQ+Q++SLT Q++ L ++K+K +
Sbjct: 1 SSVSASSTSSSASSGSSDSQIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQL 55
Score = 29.3 bits (66), Expect = 2.3
Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 7/68 (10%)
Query: 492 SSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSA 551
SS+D + +E+Q Q + +Q+Q++ L QI L ++++ + Q + S
Sbjct: 40 SSEDLTAEEKQKQQQLIQQQIQMLQAQIAQL-------QQQQSEEAQQQQQSSIEDSPSQ 92
Query: 552 MMNDHVNK 559
+ VN+
Sbjct: 93 TSAEGVNR 100
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 34.5 bits (80), Expect = 0.26
Identities = 23/117 (19%), Positives = 37/117 (31%), Gaps = 4/117 (3%)
Query: 2 SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
+ A A P P + TS ++ + T KKS
Sbjct: 383 ATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKS- 441
Query: 62 APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP-RLEPVDGIVQP--PVVPPK 115
P ++S + P S S +A + P + E R + + + PV PK
Sbjct: 442 EPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 34.2 bits (79), Expect = 0.29
Identities = 18/72 (25%), Positives = 31/72 (43%)
Query: 31 VKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASV 90
+K+N + S S S+++ P + +P +P PP +A AP +T+A+S
Sbjct: 506 IKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSN 565
Query: 91 EPPPRDEPRLEP 102
P P
Sbjct: 566 APAQIPADSSPP 577
Score = 30.7 bits (70), Expect = 3.5
Identities = 16/95 (16%), Positives = 32/95 (33%)
Query: 14 VNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAP 73
+N A+ + + + P P++T P P P ++ +S+ AP
Sbjct: 508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAP 567
Query: 74 SHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQ 108
+ +S P P + E +D +
Sbjct: 568 AQIPADSSPPPPIPEEPTPSPTKDSSPEEIDKAAK 602
Score = 30.7 bits (70), Expect = 4.0
Identities = 18/96 (18%), Positives = 29/96 (30%), Gaps = 8/96 (8%)
Query: 22 PDPVKAEQDVKVNNNNT------SDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSH 75
P +E N S + P T+ SP P +S P P+
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPA- 419
Query: 76 HNESNSGNSTKAASVEPPPRD-EPRLEPVDGIVQPP 110
S A+ PP + E + + ++ P
Sbjct: 420 EPTEPSPTPPANAANAPPSLNLEELWQQILAKLELP 455
Score = 29.9 bits (68), Expect = 5.7
Identities = 17/86 (19%), Positives = 24/86 (27%), Gaps = 4/86 (4%)
Query: 3 AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
A+T P P Q T ++ SS + +
Sbjct: 520 TAKTPPPPQKSPPPPAPTPPLP----QPTATAPPPTPPPPPPTATQASSNAPAQIPADSS 575
Query: 63 PPSSSSSAPAPSHHNESNSGNSTKAA 88
PP P PS +S+ KAA
Sbjct: 576 PPPPIPEEPTPSPTKDSSPEEIDKAA 601
>gnl|CDD|191582 pfam06679, DUF1180, Protein of unknown function (DUF1180). This
family consists of several hypothetical mammalian
proteins of around 190 residues in length. The function
of this family is unknown.
Length = 163
Score = 33.0 bits (75), Expect = 0.33
Identities = 18/61 (29%), Positives = 25/61 (40%)
Query: 41 STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
S +SS P P + P +PA P++ S A S S A P RD+P
Sbjct: 35 SAASSQPGPETQPPLPPGPTPANPTALGSGDAAGGGGSSTSDGGALATRPSPLLRDKPTT 94
Query: 101 E 101
+
Sbjct: 95 Q 95
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 34.1 bits (79), Expect = 0.33
Identities = 23/125 (18%), Positives = 51/125 (40%), Gaps = 13/125 (10%)
Query: 613 EDEDVAKPMSYDEKRQLSLDINKLPGDKLGKV----VHIIQSREPSLREPNPDEIEIDF- 667
++ + +P LD++ L G K V V + E L + I
Sbjct: 121 QEIERLEPWGN-----FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTD 175
Query: 668 ETLKPS---TLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTN 724
+ L+EL V L+K +++ ++ +E E+++ L+++ + +S
Sbjct: 176 KGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLL 235
Query: 725 KKLKK 729
++LK+
Sbjct: 236 EELKE 240
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 33.5 bits (77), Expect = 0.49
Identities = 42/253 (16%), Positives = 85/253 (33%), Gaps = 35/253 (13%)
Query: 494 DDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSAMM 553
D +ED Q K++ E ++ + R L + +K KK +D K + S
Sbjct: 13 RDLNEDVSAFQRKFVNE-VRRCDEVERKLRKLESKIKKLGIPLKD-TGGKPDV-PPSKEF 69
Query: 554 NDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNPQQAKKPKPNNANTVAAKKQVRTFDSE 613
D ++ A + + ++L ++ K + + +++ +
Sbjct: 70 LDLEEEILDLEAEI--KEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNL 127
Query: 614 DEDVAKPMSYDEKRQLSLDINKLPGDKLGKVVHIIQ-SREPSLR---------EPNPDEI 663
D D + G KLG V +I + + EI
Sbjct: 128 DIDFKYLRGAE-------------GLKLGFVAGVINREKLEAFERELWRACRGYIRQAEI 174
Query: 664 EIDFETLKPST------LRELEQYVSSCLRKRTYKKTPKPKDEK-FAEKKHELEKRLQDV 716
E E K + +E V L ++ P+ E +E ++ KR++++
Sbjct: 175 EEPLEDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEEL 234
Query: 717 TSQIDSTNKKLKK 729
++ T L+K
Sbjct: 235 QRVLEQTESHLEK 247
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.5 bits (76), Expect = 0.51
Identities = 21/83 (25%), Positives = 26/83 (31%), Gaps = 3/83 (3%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
+ SP P T ++SP PP+ SS P SHH P
Sbjct: 231 SLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQG--PVFLQH 288
Query: 98 PRLEPVDGI-VQPPVVPPKHRPG 119
P P + VPP P
Sbjct: 289 PSSNPPQPFGLAQSQVPPLPLPS 311
Score = 30.4 bits (68), Expect = 4.6
Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 7/121 (5%)
Query: 5 ETETATDPVVNQNGTGDP-----DPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKK 59
E E++ VN+ G+ DP D + + +N SDS SS+ + P +
Sbjct: 123 EGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQ 182
Query: 60 SP--APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHR 117
P A + S+ P PS G+ A P + P + P +P H
Sbjct: 183 VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHP 242
Query: 118 P 118
P
Sbjct: 243 P 243
>gnl|CDD|99923 cd05491, Bromo_TBP7_like, Bromodomain; TBP7_like subfamily, limited
to fungi. TBP7, or TAT-binding protein homolog 7, is a
yeast protein of unknown function that contains
AAA-superfamily ATP-ase domains and a bromodomain.
Bromodomains are found in many chromatin-associated
proteins and in nuclear histone acetyltransferases. They
interact specifically with acetylated lysine.
Length = 119
Score = 31.7 bits (72), Expect = 0.52
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 14/89 (15%)
Query: 368 SKTPLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKP--------MD 419
+ L ++ A + N P+ D + D I + MD
Sbjct: 13 QRKYLKFEKPPTEDAFVRPDNENNEERPYEIAKDKQG------VDTILETASGKKFYNMD 66
Query: 420 LGTVRKKMRNRTYKTAKEFADDVRLIFSN 448
L T+ +++ N Y T K+F D++ I +
Sbjct: 67 LDTIEERLWNGYYATPKDFLKDIKRIVRD 95
Score = 29.0 bits (65), Expect = 4.4
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 167 MDLGTIKKRLENNYYWSGKEAIQDFTTMFTN 197
MDL TI++RL N YY + K+ ++D + +
Sbjct: 65 MDLDTIEERLWNGYYATPKDFLKDIKRIVRD 95
>gnl|CDD|240349 PTZ00295, PTZ00295, glucosamine-fructose-6-phosphate
aminotransferase; Provisional.
Length = 640
Score = 33.1 bits (76), Expect = 0.60
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 194 MFTNCYVYNKPGEDVVLMAQNLEQLFLTKITGMPSEEVVLDAPQP 238
+TN Y+ K GE L +N+ L+ + EEV+ +P+P
Sbjct: 230 KYTNEYISLKDGEIAELSLENVNDLYTQRRVEKIPEEVIEKSPEP 274
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 32.0 bits (73), Expect = 0.62
Identities = 16/55 (29%), Positives = 19/55 (34%), Gaps = 2/55 (3%)
Query: 46 SPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVE--PPPRDEP 98
S SS P S SSS + S S ++ A E P RD P
Sbjct: 1 SSSASSGSSAPAVNSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVP 55
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 30.8 bits (70), Expect = 0.71
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 741 GASRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
G+S S +S S S S S+ S+SS+SS S
Sbjct: 15 GSSNTSVTSGSVSSSGSNSSSSSSNSSNGGSS 46
Score = 28.4 bits (64), Expect = 3.5
Identities = 12/41 (29%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 733 TTSAAGPTGA-SRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
+ S++G + S ++S+ SS S SSG S+S + +
Sbjct: 25 SVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITTESE 65
Score = 28.4 bits (64), Expect = 3.7
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 742 ASRLSASSSSSSDSDSSGQSTSSSSSESS 770
+ +S+S S+SS S S+ + SSSS SS
Sbjct: 23 SGSVSSSGSNSSSSSSNSSNGGSSSSSSS 51
Score = 28.4 bits (64), Expect = 4.3
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 733 TTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESS 770
T+ +G +S ++SSSSS+ S+ S+SSS SS
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSS 56
Score = 28.4 bits (64), Expect = 4.4
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 740 TGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEG 774
+G+ S S+SSSS S+SS +SSSSS S G
Sbjct: 23 SGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57
Score = 28.1 bits (63), Expect = 5.9
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 743 SRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
+ +++ S SSS S+SS S++SS+ SS S
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS 48
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 32.9 bits (76), Expect = 0.73
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 483 LASRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKS 542
L +AAAS D + E Q +++ L+++LK L Q V K KKK+KK + K
Sbjct: 357 LCEQAAASPEYDTELEVLLQ-RVEQLEQELKQLKAQPV-GVAPEQKEKKKEKKKNKKKKY 414
Query: 543 KMPMGQNSAMMND 555
K+P G+ ++ +
Sbjct: 415 KVPRGKIYKVLKE 427
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.82
Identities = 26/121 (21%), Positives = 58/121 (47%), Gaps = 10/121 (8%)
Query: 604 KKQVRTFDSEDEDVAKPMSYDEKRQLSLDINKLPG-DKLGKVVHIIQSREPSLREPNPDE 662
+K+++ + ++ + K E R+L + K KL ++ ++ E L++ N +E
Sbjct: 465 EKELKEIEEKERKLRK-----ELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEE 519
Query: 663 IEID---FETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQ 719
+E +E LK L +L+ + S ++ + K K + +K ELE+ L ++ +
Sbjct: 520 LEKKAEEYEKLK-EKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578
Query: 720 I 720
+
Sbjct: 579 L 579
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 32.8 bits (75), Expect = 0.86
Identities = 13/81 (16%), Positives = 21/81 (25%)
Query: 44 SSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPV 103
S SP SS P A + + + N A P + +
Sbjct: 399 SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKI 458
Query: 104 DGIVQPPVVPPKHRPGRNTNQ 124
+ P RP + +
Sbjct: 459 PVSKVSSLGPSTLRPIQEKAE 479
Score = 31.7 bits (72), Expect = 1.6
Identities = 24/85 (28%), Positives = 29/85 (34%), Gaps = 7/85 (8%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
T +S + PV PA S+A A S S S +S A P +
Sbjct: 363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAAS---PSPSQSSAAAQPSAPQSATQ 419
Query: 98 PRLEPVDGIVQP----PVVPPKHRP 118
P P V P PV PP P
Sbjct: 420 PAGTPPTVSVDPPAAVPVNPPSTAP 444
Score = 30.1 bits (68), Expect = 5.5
Identities = 17/109 (15%), Positives = 36/109 (33%), Gaps = 5/109 (4%)
Query: 212 AQNLEQLFLTKITGMPSEEVVLDAPQPRSSKKKP-PVSASPSLNPVIKTPVIPLNKLPSA 270
Q+++ +F +P P S P + + P P + ++ +
Sbjct: 376 KQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAV 435
Query: 271 TSTPKPRPPNPVLGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADG 319
P P V + V+ ++S+ P + I+ KA+
Sbjct: 436 PVNPPSTAPQAVRPAQFKEEKKIPVSKVSSL----GPSTLRPIQEKAEQ 480
>gnl|CDD|220337 pfam09669, Phage_pRha, Phage regulatory protein Rha (Phage_pRha).
Members of this protein family are found in temperate
phage and bacterial prophage regions. Members include
the product of the rha gene of the lambdoid phage
phi-80, a late operon gene. The presence of this gene
interferes with infection of bacterial strains that lack
integration host factor (IHF), which regulates the rha
gene. It is suggested that Rha is a phage regulatory
protein.
Length = 88
Score = 29.8 bits (68), Expect = 1.0
Identities = 9/14 (64%), Positives = 11/14 (78%)
Query: 378 SEIIAELFHKKHQN 391
S IAELF K+H+N
Sbjct: 4 SLEIAELFGKRHKN 17
>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
C-terminus. Intimin and its translocated intimin
receptor (Tir) are bacterial proteins that mediate
adhesion between mammalian cells and attaching and
effacing (A/E) pathogens. A unique and essential feature
of A/E bacterial pathogens is the formation of
actin-rich pedestals beneath the intimately adherent
bacteria and localised destruction of the intestinal
brush border. The bacterial outer membrane adhesin,
intimin, is necessary for the production of the A/E
lesion and diarrhoea. The A/E bacteria translocate their
own receptor for intimin, Tir, into the membrane of
mammalian cells using the type III secretion system. The
translocated Tir triggers additional host signalling
events and actin nucleation, which are essential for
lesion formation. This family represents the Tir
C-terminal domain which has been reported to bind
uninfected host cells and beta-1 integrins although the
role of intimin binding to integrins is unclear. This
intimin C-terminal domain has also been shown to be
sufficient for Tir recognition.
Length = 222
Score = 31.4 bits (71), Expect = 1.2
Identities = 18/62 (29%), Positives = 26/62 (41%)
Query: 710 EKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSES 769
E+ T+ + KP T A G T S ++SS S +S ST+ S + S
Sbjct: 58 EQTTTTTTTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDTSS 117
Query: 770 SD 771
D
Sbjct: 118 ID 119
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 32.0 bits (73), Expect = 1.6
Identities = 9/65 (13%), Positives = 18/65 (27%), Gaps = 2/65 (3%)
Query: 41 STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
+ + P + TP + PP+++ P + + A P P
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAP--ARPAAARPAPAPAPPAAAAPPAR 436
Query: 101 EPVDG 105
Sbjct: 437 SADPA 441
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 31.8 bits (72), Expect = 1.6
Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 8/66 (12%)
Query: 721 DSTNKKLKKPKPTTSAAGPTGASRLSAS--------SSSSSDSDSSGQSTSSSSSESSDS 772
STN + ++SAA + S+ S S S+SS S SS + SSS S +
Sbjct: 305 SSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNE 364
Query: 773 EGEKGP 778
+K
Sbjct: 365 SKQKRA 370
>gnl|CDD|223544 COG0468, RecA, RecA/RadA recombinase [DNA replication,
recombination, and repair].
Length = 279
Score = 31.5 bits (72), Expect = 1.7
Identities = 15/73 (20%), Positives = 32/73 (43%), Gaps = 11/73 (15%)
Query: 491 VSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDS-TKPKKKKKKNRDQPKSKMPMGQN 549
VS D E QL+ ++ +S ++I LLV DS + ++ +G
Sbjct: 118 VSQPDTGEQ----QLEIAEKLARSGAEKIDLLVVDSVAALVRAEEIEDGH------LGLR 167
Query: 550 SAMMNDHVNKMNK 562
+ +++ + K+ +
Sbjct: 168 ARLLSKALRKLTR 180
>gnl|CDD|177486 PHA02752, PHA02752, hypothetical protein; Provisional.
Length = 242
Score = 31.2 bits (70), Expect = 1.8
Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 3/53 (5%)
Query: 37 NTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAAS 89
N + + +S PP+ T++ P + A + H + G + AS
Sbjct: 29 NQQPTITGTSAPPAVTDLYP---NAALANGFDGILIDDGHFNCDGGQCSITAS 78
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 29.7 bits (67), Expect = 1.9
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 710 EKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSS 767
+++ Q +SQ + KK KK + +S + +++ SSSSS S SS S+SS SS
Sbjct: 67 KRKEQQKSSQ--THKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSESS 122
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 31.4 bits (72), Expect = 2.0
Identities = 20/110 (18%), Positives = 42/110 (38%), Gaps = 21/110 (19%)
Query: 484 ASRAAASVSSDDDSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKP----KKKKKKNRDQ 539
++ A A+ + D +E + Q QL L +Q K L + +L+ +D + K ++ +
Sbjct: 24 SAFARAASNGDLPTEADVQAQLDALNKQ-KLLEAEDKLVQQDLEQTLALLDKIDRQKEE- 81
Query: 540 PKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNPQQ 589
++ +APA L Q L ++ ++
Sbjct: 82 -------------TEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRE 116
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 31.4 bits (71), Expect = 2.1
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 9/57 (15%)
Query: 46 SPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
+P P S P PA ++ ++ A + A E P +D+PR E
Sbjct: 18 APAPPSPAAAPAPPPPAKTAAPATKAAAP---------AAAAPRAEKPKKDKPRRER 65
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 31.4 bits (71), Expect = 2.2
Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 6/93 (6%)
Query: 19 TGDPDPVKAEQDVKVNNNNTSDSTSSSSP------PPSSTEVTPVKKSPAPPSSSSSAPA 72
T + +++ TS ++P + TPV + A + SSS A
Sbjct: 19 TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETA 78
Query: 73 PSHHNESNSGNSTKAASVEPPPRDEPRLEPVDG 105
+ S + ++T + + P +PV+G
Sbjct: 79 ETSDPTSEATDTTTSEARTVTPAATETSKPVEG 111
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 31.7 bits (71), Expect = 2.2
Identities = 24/47 (51%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 730 PKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEK 776
P P S + T S S+SSSSSS S SS S SSSS E D + EK
Sbjct: 1240 PCPDLSESSSTMHS--SSSSSSSSCSSSSSSSDSSSSEEDGDEKNEK 1284
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 31.4 bits (71), Expect = 2.2
Identities = 14/54 (25%), Positives = 20/54 (37%), Gaps = 3/54 (5%)
Query: 61 PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPP 114
PAP + ++APA + + AA+ P R P P PP
Sbjct: 448 PAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA---PADDDPPP 498
Score = 30.6 bits (69), Expect = 4.0
Identities = 19/112 (16%), Positives = 33/112 (29%), Gaps = 1/112 (0%)
Query: 3 AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
A A V P P A + + + +P P++ + + A
Sbjct: 409 APAAAAAARAVAAAPARRSPAPE-ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAA 467
Query: 63 PPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPP 114
P ++A A + + + A PP + P QP P
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 30.9 bits (70), Expect = 2.3
Identities = 32/189 (16%), Positives = 69/189 (36%), Gaps = 25/189 (13%)
Query: 437 EFADDVRLIFSNCYKYNPPDHNV-VTMARQLSAVFEDRFAKMPDESNLASRAAASVSSDD 495
+ DDVR IF + YK T + F+ + ++ ES +++ ++ + +
Sbjct: 56 KLLDDVRTIFRDLYKNQLRQEKARTTYDEEFDEYFDQQLRELEKESKKQAKSPKAMRTFE 115
Query: 496 DSEDERQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKKKKKNRDQPKSKMPMGQNSAMMND 555
+S+ K++ I ++ +K++KK ++ S ++ +
Sbjct: 116 ESKK-----------SKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSR 164
Query: 556 HVNK--MNKAPAPLNNGQKPKSL-NNVRKPQASNPQQAKKPKPNNANTVAAKKQVRTFD- 611
+ P +K L + +P+ PK A K++R +D
Sbjct: 165 PSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKK-------AGKKMRKWDL 217
Query: 612 --SEDEDVA 618
ED+D
Sbjct: 218 DGDEDDDAV 226
>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P).
Length = 167
Score = 30.3 bits (68), Expect = 2.4
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 109 PPVVPPKHRPGRNTNQLQYIVKNVM 133
PP V +RP + T + I +NV+
Sbjct: 36 PPPVQSHNRPAKATAYCRKIARNVV 60
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 31.3 bits (70), Expect = 2.4
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 6/37 (16%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPS 74
S++S PP++T TP PPSSSSS+ A S
Sbjct: 1224 AQLQLSAASSPPAATTPTP------PPSSSSSSSAQS 1254
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 31.5 bits (72), Expect = 2.4
Identities = 17/106 (16%), Positives = 27/106 (25%), Gaps = 6/106 (5%)
Query: 2 SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS- 60
T P V + V E DV V + P + ++ ++
Sbjct: 907 EEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETA 966
Query: 61 ---PAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPV 103
A P + AP + +VEP E
Sbjct: 967 EVVVAEPEVVAQPAAPVVAEVAAEV--ETVTAVEPEVAPAQVPEAT 1010
>gnl|CDD|233041 TIGR00596, rad1, DNA repair protein (rad1). All proteins in this
family for which functions are known are components in a
multiprotein endonuclease complex (usually made up of
Rad1 and Rad10 homologs). This complex is used primarily
for nucleotide excision repair but also for some aspects
of recombinational repair in some species. Most Archaeal
species also have homologs of these genes, but the
function of these Archaeal genes is not known, so we
have set our cutoff to only pick up the eukaryotic
genes.This family is based on the phylogenomic analysis
of JA Eisen (1999, Ph.D. Thesis, Stanford Universit [DNA
metabolism, DNA replication, recombination, and repair].
Length = 814
Score = 31.3 bits (71), Expect = 2.6
Identities = 14/93 (15%), Positives = 38/93 (40%), Gaps = 13/93 (13%)
Query: 674 TLRELEQYVSSC----------LRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDST 723
T +L Y+++ L K + + + + K A++ + ++ +S +
Sbjct: 307 TCLQLRDYLTTSNKKRGSRAFLLNKLRWYRKWREETSKLAKEVQSQDTFPENASSNV--- 363
Query: 724 NKKLKKPKPTTSAAGPTGASRLSASSSSSSDSD 756
NK +K + T G S ++ +++++
Sbjct: 364 NKTFRKEQVPTKRRRVRGGSEVAVEKLRNANTN 396
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.9 bits (70), Expect = 2.8
Identities = 25/127 (19%), Positives = 35/127 (27%), Gaps = 20/127 (15%)
Query: 217 QLFLTKITGMPSEEVVLDA-------PQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPS 269
QL T +P E V++A PQP P P+ P + +P
Sbjct: 340 QLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP 399
Query: 270 ATSTPKPRPPNPVLGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADGSIDHTPSSLH 329
+ P PV +AP +A +D P
Sbjct: 400 KEPVRETATPPPVPPRPVAPPVPHT-----PESAPKLT--------RAAIPVDEKPKYTP 446
Query: 330 PTPVKSA 336
P P K
Sbjct: 447 PAPPKEE 453
Score = 30.9 bits (70), Expect = 3.1
Identities = 16/105 (15%), Positives = 26/105 (24%), Gaps = 10/105 (9%)
Query: 1 MSAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKS 60
++ E P P + ST + ++ +
Sbjct: 353 LAVIEALLVPVPAPQPAKPTAAAPSPVRP-------TPAPSTRPKAAAAANIPPKEPVRE 405
Query: 61 PAPPSSSSSAPA---PSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
A P P H ES + A V+ P+ P P
Sbjct: 406 TATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPP 450
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 30.8 bits (69), Expect = 2.8
Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 10/124 (8%)
Query: 6 TETATDPVVNQNGTGDPDPVKAEQDVKVNNN-------NTSDSTSSSSPPPSSTEVTPVK 58
TE + +Q + Q N T +T+SS+P P + P
Sbjct: 182 TEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP---QTEPPP 238
Query: 59 KSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
P S P+ + ++S +G+ + P PP +
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET 298
Query: 119 GRNT 122
GR T
Sbjct: 299 GRPT 302
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 30.6 bits (69), Expect = 2.9
Identities = 22/102 (21%), Positives = 37/102 (36%), Gaps = 4/102 (3%)
Query: 4 AETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAP 63
+E + + + NG G+P+P +D N ++T+ SS S E K+S
Sbjct: 170 SELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDA 229
Query: 64 PSSSSSAPAPSH----HNESNSGNSTKAASVEPPPRDEPRLE 101
+S P S ++ + VE E E
Sbjct: 230 AASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 30.8 bits (69), Expect = 2.9
Identities = 17/76 (22%), Positives = 28/76 (36%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDE 97
+S T SS+ PSS + P+ ++ S+ + +S + P
Sbjct: 307 SSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVAL 366
Query: 98 PRLEPVDGIVQPPVVP 113
P EPV+ QP
Sbjct: 367 PAAEPVNMQPQPMSTT 382
>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex
component.
Length = 233
Score = 30.3 bits (69), Expect = 3.1
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 3/80 (3%)
Query: 495 DDSEDERQNQLKYLQEQLKSLTDQIRLL---VEDSTKPKKKKKKNRDQPKSKMPMGQNSA 551
D E E+ +L + L QI LL VE + +KK+K + + + + +
Sbjct: 114 DPKEKEKAEVCDFLSNTIDELNKQIELLEAEVESLSVQRKKRKADAAKQERIAELEELLE 173
Query: 552 MMNDHVNKMNKAPAPLNNGQ 571
H++K+ L NG+
Sbjct: 174 RHKFHISKLELILRLLENGE 193
>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
Provisional.
Length = 599
Score = 31.0 bits (70), Expect = 3.1
Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 14/76 (18%)
Query: 481 SNLASRAAASVSSDDD--------------SEDERQNQLKYLQEQLKSLTDQIRLLVEDS 526
R ++SSD D S+D+R+N QE ++ D +V D
Sbjct: 307 GGHEDRNKDAISSDSDYDVDGLEDAPGSVSSKDDRRNLQPVAQEPERARDDAPNQVVPDK 366
Query: 527 TKPKKKKKKNRDQPKS 542
K KK +K+ R + +S
Sbjct: 367 EKTKKPRKRKRPRGRS 382
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 30.8 bits (70), Expect = 3.2
Identities = 13/97 (13%), Positives = 27/97 (27%), Gaps = 8/97 (8%)
Query: 30 DVKVNNNNTSDSTSSSSPPPSSTEV--------TPVKKSPAPPSSSSSAPAPSHHNESNS 81
+V+V+ N + + + + + + P + P +
Sbjct: 65 EVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQP 124
Query: 82 GNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRP 118
P ++P +PV V P P P
Sbjct: 125 APQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAP 161
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 29.9 bits (67), Expect = 3.4
Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 6/82 (7%)
Query: 39 SDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP 98
S+ TS S S P ++S PPS S + SH S +S+ A PPP + P
Sbjct: 106 SEHTSRSGSRRSGGRRAPSERSGPPPSEGSVRSSLSHP----SSHSSYGAPGVPPPYNPP 161
Query: 99 RLEPVDGIVQPPVVPPKHRPGR 120
L P PP P R
Sbjct: 162 MLMMKPPP--PSPGPPGAPPVR 181
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 30.1 bits (68), Expect = 3.9
Identities = 23/111 (20%), Positives = 35/111 (31%), Gaps = 8/111 (7%)
Query: 670 LKPSTLRELEQYVSSCL--RKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKL 727
L LR+ E+ + S L R RT K + E S + L
Sbjct: 46 LTLENLRKFEEELQSQLEKRVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRIL 105
Query: 728 KKP---KPTTSAAGPTGASRLSASSSSSSD---SDSSGQSTSSSSSESSDS 772
P K S + + S S +S + S+ S+ S S+
Sbjct: 106 STPELPKRKRSFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSRSNP 156
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 29.8 bits (67), Expect = 3.9
Identities = 18/101 (17%), Positives = 29/101 (28%), Gaps = 1/101 (0%)
Query: 17 NGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHH 76
+ GD A D + D+ ++ + P S + +++ P +
Sbjct: 76 DDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAA 135
Query: 77 NESNSGNSTKAASVEPPPRDEPRLEPVDGIVQP-PVVPPKH 116
A P R PR P P P P H
Sbjct: 136 ARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAH 176
>gnl|CDD|233143 TIGR00834, ae, anion exchange protein. The Anion Exchanger (AE)
Family (TC 2.A.31)Characterized protein members of the
AE family are found only in animals.They preferentially
catalyze anion exchange (antiport) reactions, typically
acting as HCO3-:Cl- antiporters, but also transporting a
range of other inorganic and organic anions.
Additionally, renal Na+:HCO3- cotransporters have been
found to be members of the AE family. They catalyze the
reabsorption of HCO3- in the renal proximal tubule
[Transport and binding proteins, Anions].
Length = 900
Score = 30.5 bits (69), Expect = 3.9
Identities = 27/102 (26%), Positives = 38/102 (37%), Gaps = 14/102 (13%)
Query: 283 LGSTATTTTAPKVNHLNSMNAPDTPDMKKAIKRKADGSIDHTPSS------LHPTPVKSA 336
+G T + +V H + A D D+ I D SI P L P P
Sbjct: 239 IGRAIATLMSDEVFHDAAYLADDRDDLLAGIDEFLDCSIVLPPGEWDPEIRLEP-PAPLQ 297
Query: 337 KQLNTRRESGSITKKPQRIS------EEGGGGSGLGGSKTPL 372
++L +R S T +P+ + E GG G G PL
Sbjct: 298 RELLRKRYEPS-TVRPENPTMGGDTEPEDGGSEGPHGDDDPL 338
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 30.4 bits (68), Expect = 3.9
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 39 SDSTSSSSPPPSSTEVTPVKKSPAP-PSSSSSAPAPSHHNESNSGNST 85
++ S +P E P ++PAP P+ ++ APAP+ SG +T
Sbjct: 80 GEAGSEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAGGSGEAT 127
>gnl|CDD|153369 cd07685, F-BAR_Fes, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma)
tyrosine kinase. F-BAR domains are dimerization modules
that bind and bend membranes and are found in proteins
involved in membrane dynamics and actin reorganization.
Fes (feline sarcoma), also called Fps (Fujinami poultry
sarcoma), is a cytoplasmic (or nonreceptor) tyrosine
kinase whose gene was first isolated from tumor-causing
retroviruses. It is expressed in myeloid, vascular
endothelial, epithelial, and neuronal cells, and plays
important roles in cell growth and differentiation,
angiogenesis, inflammation and immunity, and
cytoskeletal regulation. Fes kinase has also been
implicated as a tumor suppressor in colorectal cancer.
It contains an N-terminal F-BAR domain, an SH2 domain,
and a C-terminal catalytic kinase domain. F-BAR domains
form banana-shaped dimers with a positively-charged
concave surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules. The F-BAR domain of Fes is
critical in its role in microtubule nucleation and
bundling.
Length = 237
Score = 29.9 bits (67), Expect = 4.2
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 501 RQNQLKYLQEQLKSLTDQIRLLVEDSTKPKKK-----KKKNRDQPKSK 543
+Q K Q+ ++ L Q R L +DS + K+K K K+RD+ K K
Sbjct: 122 KQEYTKTTQQDIEKLKSQYRSLAKDSAQAKRKYQEASKDKDRDKAKEK 169
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 30.5 bits (69), Expect = 4.3
Identities = 9/54 (16%), Positives = 20/54 (37%), Gaps = 9/54 (16%)
Query: 42 TSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR 95
T++ +P + + P +++++ P + A EPP R
Sbjct: 274 TAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPP---------NPPATPPEPPAR 318
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 30.2 bits (68), Expect = 4.4
Identities = 20/86 (23%), Positives = 26/86 (30%), Gaps = 6/86 (6%)
Query: 219 FLTKITGMPSEEVVLDAPQPRSSKKKPPVSA----SPSLNPVIKTPVIPLNKLPSATSTP 274
FLTK ++ V P KK P S P NPV + S P
Sbjct: 82 FLTKTFEDKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPV--PAQAESSGSKPVPSIP 139
Query: 275 KPRPPNPVLGSTATTTTAPKVNHLNS 300
P T + ++ L S
Sbjct: 140 VSTPEVKAPAPALTPSQKDRLETLLS 165
>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 30.5 bits (69), Expect = 4.4
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 727 LKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDS 772
+ P ++S+++S S SSG +++S+S+ESS +
Sbjct: 531 VLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSNSNESSPT 576
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 30.4 bits (69), Expect = 4.5
Identities = 11/61 (18%), Positives = 23/61 (37%)
Query: 648 IQSREPSLREPNPDEIEIDFETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKH 707
+Q R L + + ++I D + LK + + + L P P ++ E +
Sbjct: 473 LQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDPIEKTSLETEV 532
Query: 708 E 708
Sbjct: 533 S 533
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 30.5 bits (68), Expect = 4.5
Identities = 17/94 (18%), Positives = 26/94 (27%), Gaps = 11/94 (11%)
Query: 2 SAAETETATDPVV--NQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKK 59
A DP V + PD +A T ++ P P+ +
Sbjct: 46 PHQAASRAPDPAVAPTSAASRKPDLAQAP---------TPAASEKFDPAPAPHQAASRAP 96
Query: 60 SPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPP 93
PA ++AP P S A +
Sbjct: 97 DPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAG 130
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 29.7 bits (67), Expect = 4.7
Identities = 22/107 (20%), Positives = 36/107 (33%), Gaps = 7/107 (6%)
Query: 669 TLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLK 728
LK R+ Q LRKR + + + KK ++D STN
Sbjct: 40 KLKY---RQPGQGTEDELRKRDLRAELEEAERAHKSKKENK-LAIEDADK---STNLDAS 92
Query: 729 KPKPTTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGE 775
+ + +SD+D S S+ S SS+ + +
Sbjct: 93 NEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDD 139
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 29.9 bits (67), Expect = 4.8
Identities = 20/73 (27%), Positives = 23/73 (31%), Gaps = 3/73 (4%)
Query: 44 SSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP- 102
S SP P P P P +S PS + A+ PR PR P
Sbjct: 6 SRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPG-PGAEPPPGRAAGPAAPRRRPRGCPA 64
Query: 103 -VDGIVQPPVVPP 114
V P PP
Sbjct: 65 GVTFSSSAPPRPP 77
>gnl|CDD|237783 PRK14667, uvrC, excinuclease ABC subunit C; Provisional.
Length = 567
Score = 30.1 bits (68), Expect = 4.9
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 5/44 (11%)
Query: 403 EKLGLTDYFDIIKKPMDLGTVRKKMRNRTYKTAKEF--ADDVRL 444
EK GL D D IK +G V+K++ R +KT +F ADD L
Sbjct: 508 EKEGLKDILDKIK---GIGEVKKEIIYRNFKTLYDFLKADDEEL 548
>gnl|CDD|205499 pfam13319, DUF4090, Protein of unknown function (DUF4090). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
Length = 84
Score = 27.8 bits (62), Expect = 4.9
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 16/58 (27%)
Query: 398 TPVDVEKLGLTDYFDIIKKPMDL------GTVRKKMRNRTYKT-AKEFADDV---RLI 445
+P+ E L L + MDL V+K MRNR +T AK F + RLI
Sbjct: 18 SPIPPEMLDL------YNEVMDLEAQRQRSGVKKSMRNRIVRTGAKHFDQETLNQRLI 69
>gnl|CDD|218275 pfam04812, HNF-1B_C, Hepatocyte nuclear factor 1 (HNF-1), beta
isoform C terminus. This family consists of a region
found within the alpha isoform and at the C terminus of
the beta isoform of the homeobox-containing
transcription factor of HNF-1. Different isoforms of
HNF-1 are generated by the differential use of
polyadenylation sites and by alternative splicing. The
C-terminal region of HNF-1 is responsible for the
activation of transcription. Mutations and polymorphisms
in HNF-1 cause the type 3 form of maturity-onset
diabetes of the young (MODY3).
Length = 237
Score = 29.9 bits (67), Expect = 5.0
Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 3/81 (3%)
Query: 34 NNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPP 93
+ N T S S P +++ V+ S +S+ SH + ++ +EP
Sbjct: 13 STNPTLSHHSGSPSPGPPSKLHGVRYSQQGSCEVTSSSTISHGDRASVLQQVSPGGLEPS 72
Query: 94 P---RDEPRLEPVDGIVQPPV 111
+ ++ V G PPV
Sbjct: 73 HSLLSTDAKMISVSGGPLPPV 93
>gnl|CDD|187552 cd05241, 3b-HSD-like_SDR_e, 3beta-hydroxysteroid dehydrogenases
(3b-HSD)-like, extended (e) SDRs. Extended SDR family
domains belonging to this subgroup have the
characteristic active site tetrad and a fairly
well-conserved NAD(P)-binding motif. 3b-HSD catalyzes
the NAD-dependent conversion of various steroids, such
as pregnenolone to progesterone, or androstenediol to
testosterone. This subgroup includes an unusual
bifunctional 3b-HSD/C-4 decarboxylase from Arabidopsis
thaliana, and Saccharomyces cerevisiae ERG26, a
3b-HSD/C-4 decarboxylase, involved in the synthesis of
ergosterol, the major sterol of yeast. It also includes
human 3 beta-HSD/HSD3B1 and C(27) 3beta-HSD/
[3beta-hydroxy-delta(5)-C(27)-steroid oxidoreductase;
HSD3B7]. C(27) 3beta-HSD/HSD3B7 is a membrane-bound
enzyme of the endoplasmic reticulum, that catalyzes the
isomerization and oxidation of 7alpha-hydroxylated
sterol intermediates, an early step in bile acid
biosynthesis. Mutations in the human NSDHL (NAD(P)H
steroid dehydrogenase-like protein) cause CHILD syndrome
(congenital hemidysplasia with ichthyosiform nevus and
limb defects), an X-linked dominant, male-lethal trait.
Mutations in the human gene encoding C(27) 3beta-HSD
underlie a rare autosomal recessive form of neonatal
cholestasis. Extended SDRs are distinct from classical
SDRs. In addition to the Rossmann fold (alpha/beta
folding pattern with a central beta-sheet) core region
typical of all SDRs, extended SDRs have a less conserved
C-terminal extension of approximately 100 amino acids.
Extended SDRs are a diverse collection of proteins, and
include isomerases, epimerases, oxidoreductases, and
lyases; they typically have a TGXXGXXG cofactor binding
motif. SDRs are a functionally diverse family of
oxidoreductases that have a single domain with a
structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid sythase have
a GGXGXXG NAD(P)-binding motif and an altered active
site motif (YXXXN). Fungal type ketoacyl reductases have
a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 331
Score = 30.1 bits (68), Expect = 5.2
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 2/51 (3%)
Query: 371 PLWYKYCSEIIAELFHKKHQNYAWPFYTPVDVEKLGLTDYFDIIKKPMDLG 421
YC+ +++EL Y ++P V L YF I K DLG
Sbjct: 265 SGPLAYCAALLSELVSFMLGPY--FVFSPFYVRALVTPMYFSIAKAQKDLG 313
>gnl|CDD|219741 pfam08193, INO80_Ies4, INO80 complex subunit Ies4. The INO80
ATPase is a member of the SNF2 family of ATPases and
functions as an integral component of a multisubunit
ATP-dependent chromatin remodelling complex. This family
of proteins corresponds to the fungal Ies4 subunit of
INO80.
Length = 228
Score = 29.6 bits (66), Expect = 5.4
Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 7/116 (6%)
Query: 9 ATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSS 68
A PV + P A+ V + +N SD+ S+ + S+T+ TP +K P
Sbjct: 37 APSPVKEDEPSDSPASSAADPPVPSSVDNASDAASTPAAGTSATD-TPRRKGGPGPKKGE 95
Query: 69 SAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEPVDGIVQPPVVPPKHRPGRNTNQ 124
A + + +T +P P+ +PRL+ DG ++ HR G N
Sbjct: 96 KRSA----GQGTTSETTSKPRGKPGPKKKPRLD--DGTLEGAKGTSSHRLGPKANT 145
>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47.
Length = 627
Score = 30.2 bits (67), Expect = 5.4
Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 7/109 (6%)
Query: 246 PVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPKVN-HLNSMNAP 304
P S S + I +P IP +T+ P P PP T T KV + ++ A
Sbjct: 241 PSSTPSSTSASITSPHIP------STNIPTPEPPPVTKNFTELHTDTIKVTPNTPTITAQ 294
Query: 305 DTPDMKKAIKRKADGSIDHTPSSLHPTPVKSAKQLNTRRESGSITKKPQ 353
T +KK +KR +TP+ + ++ + T + + + T+ P+
Sbjct: 295 TTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPK 343
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 30.0 bits (67), Expect = 5.4
Identities = 24/113 (21%), Positives = 39/113 (34%), Gaps = 2/113 (1%)
Query: 2 SAAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSP 61
S+ E E DP + T D D ++A ++++ S S S +E +
Sbjct: 23 SSLEDEVFLDPR-PRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRR 81
Query: 62 APPSSSSSAPAPSHHNESNSGNSTKAASVEPPPR-DEPRLEPVDGIVQPPVVP 113
A + A PS N++ T A P + PP +P
Sbjct: 82 AGMGGRNKATKPSRRNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIP 134
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.0 bits (67), Expect = 5.6
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 33 VNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAP--APSHHNESNSG 82
V +++ S S++ P S+T TP P+ P S+ A + +S SG
Sbjct: 268 VRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISG 319
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 30.1 bits (67), Expect = 5.7
Identities = 21/80 (26%), Positives = 30/80 (37%), Gaps = 13/80 (16%)
Query: 45 SSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHN----ESNSGNSTKAASVEPPPRDEPRL 100
S ++++VTP +K P S PA + KA S PPP+ +
Sbjct: 166 SKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAK- 224
Query: 101 EPVDGIVQPPVVPPKHRPGR 120
P +PPK R R
Sbjct: 225 --------EPQLPPKERERR 236
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 29.7 bits (68), Expect = 5.9
Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query: 691 YKKTPKPKDEKFAEKKHELEKRLQDVTSQI----DSTNKKLKK 729
++ P+ + E+ EK ELE+ LQ++ Q+ +KL++
Sbjct: 169 FEALPEEEREELEEKIDELEEELQEILRQLRELEREAREKLRE 211
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 29.7 bits (67), Expect = 6.1
Identities = 14/65 (21%), Positives = 24/65 (36%), Gaps = 10/65 (15%)
Query: 41 STSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRL 100
+ ++++ +T P +P P ++S P P A EP P +P
Sbjct: 130 AAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPP----------AAAKPPEPAPAAKPPP 179
Query: 101 EPVDG 105
PV
Sbjct: 180 TPVAR 184
>gnl|CDD|131850 TIGR02803, ExbD_1, TonB system transport protein ExbD, group 1.
Members of this family are Gram-negative bacterial inner
membrane proteins, generally designated ExbD, related to
the TolR family modeled by TIGRFAMs TIGR02801. Members
always are encoded next to a protein designated ExbB
(TIGR02797), related to the TolQ family modeled by
TIGRFAMs TIGR02796. ExbD and ExbB together form a proton
channel through which they can harness the proton-motive
force to energize TonB, which in turn energizes
TonB-dependent receptors in the outer membrane.
TonB-dependent receptors with known specificity tend to
import siderophores or vitamin B12. A TonB system and
Tol-Pal system often will co-exist in a single bacterial
genome.
Length = 122
Score = 28.3 bits (63), Expect = 6.9
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 267 LPSATSTPKPRPPNPV 282
LP++T+ P+PRP PV
Sbjct: 35 LPASTAAPQPRPEKPV 50
>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
Length = 278
Score = 29.3 bits (66), Expect = 7.3
Identities = 20/131 (15%), Positives = 47/131 (35%), Gaps = 11/131 (8%)
Query: 648 IQSREPSLREPNPDEI-----EIDFETLKPSTLRELEQYVSSCLRKRTYKKTPKPKDEKF 702
+ + R+ NP+ + E E + L L + V ++ K+ DE+F
Sbjct: 139 AKHNPLARRKQNPENLFSVACEAKEEFEPEAELAALFEKVKKLFSRK--KEDDAMTDEQF 196
Query: 703 AEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQST 762
A+ + + ++ V +ID T+ ++ + ++ S +
Sbjct: 197 AD----VHEAVEGVAEKIDHTSAQVATTETAEPPQEGKDTEENGETAEQFSALKETLDKL 252
Query: 763 SSSSSESSDSE 773
+S +E +
Sbjct: 253 ASKFNEEVPAP 263
>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
Saccharomyces cerevisiae Reduced viability upon
starvation protein 167 and similar proteins. BAR
domains are dimerization, lipid binding and curvature
sensing modules found in many different proteins with
diverse functions. This subfamily is composed of fungal
proteins with similarity to Saccharomyces cerevisiae
Reduced viability upon starvation protein 167 (Rvs167p)
and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
cerevisiae Rvs167p plays a role in regulation of the
actin cytoskeleton, endocytosis, and sporulation. It
forms a heterodimer with another BAR domain protein
Rvs161p. Rvs161p and Rvs167p share common functions but
are not interchangeable. Their BAR domains cannot be
replaced with each other and the overexpression of one
cannot suppress the mutant phenotypes of the other.
Rvs167p also interacts with the GTPase activating
protein (GAP) Gyp5p, which is involved in ER to Golgi
vesicle trafficking. BAR domains form dimers that bind
to membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 216
Score = 29.1 bits (66), Expect = 7.6
Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 677 ELEQYVSSCLRKRTYKKTPKPKDEKFAEKKHELEKRLQDVTSQIDSTNKKLKK 729
+ ++ + + KK KDEK + +LE++L++ + ++ N+ LK
Sbjct: 125 DYDKLQNKLNKLLQKKKELSLKDEK---QLAKLERKLEEAKEEYEALNELLKS 174
>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
domain. Spiders use fibroins to make silk strands. This
family includes tubuliform silk fibroins which are used
to protect egg cases. This domain is a structural domain
which is found in repeats of up to 20 in many
individuals (although this is not necessarily the case).
RP1 makes up structural domains in the N terminal while
RP2 makes up structural domains in the C terminal.
Length = 167
Score = 28.6 bits (64), Expect = 7.8
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 735 SAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESS 770
+ A A+ +A S +++ + S S SSS S +S
Sbjct: 132 AQAASASAAAAAAQSQAAASAFSQAASQSSSQSAAS 167
Score = 28.6 bits (64), Expect = 9.3
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 740 TGASRLSASSSSSSDSDSSGQSTSSSSSESSD 771
AS+ ++S+SSSS + + QS SS+ + SS
Sbjct: 1 AAASQAASSASSSSSASAFAQSLSSALASSSQ 32
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 29.8 bits (67), Expect = 7.8
Identities = 17/108 (15%), Positives = 33/108 (30%), Gaps = 8/108 (7%)
Query: 3 AAETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPA 62
A + PV A+ + + + ++ D + P S + P +P
Sbjct: 427 PAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPP 486
Query: 63 P----PSSSSSAPAPSHHNESNSGNSTKAAS----VEPPPRDEPRLEP 102
P+ ++AP+ + + AAS P P
Sbjct: 487 DAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARP 534
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 29.5 bits (66), Expect = 7.8
Identities = 15/64 (23%), Positives = 23/64 (35%)
Query: 31 VKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASV 90
V V+ T + ++ TP PA P+S SA S E + + A+
Sbjct: 72 VMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATA 131
Query: 91 EPPP 94
P
Sbjct: 132 PESP 135
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 29.4 bits (66), Expect = 7.8
Identities = 14/73 (19%), Positives = 24/73 (32%), Gaps = 4/73 (5%)
Query: 236 PQPRSSKKKPPVSASPSLNPVIKTPVIPLNKLPSATSTPKPRPPNPVLGSTATTTTAPKV 295
P R+S K S S N + + S P P P ++ + +
Sbjct: 154 PSWRTSIKPFSSSHYGSHNS---SAYSDHLRPLQPYSGELPPPLGPNWQASNSKKIR-GL 209
Query: 296 NHLNSMNAPDTPD 308
++ P+ PD
Sbjct: 210 EFSMWVSQPNQPD 222
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 29.4 bits (66), Expect = 8.1
Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 8/52 (15%)
Query: 47 PPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPPRDEP 98
P+ E TP + + P+ +++APAP + A+ P P DEP
Sbjct: 9 EEPAKEEATPPAPAASAPAPAAAAPAPVA--------AAAPAAAGPRPDDEP 52
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 29.8 bits (67), Expect = 8.2
Identities = 33/249 (13%), Positives = 76/249 (30%), Gaps = 19/249 (7%)
Query: 528 KPKKKKKKNRDQPKSKMPMGQNSAMMNDHVNKMNKAPAPLNNGQKPKSLNNVRKPQASNP 587
+ KK +R PK P + + + + V KP+
Sbjct: 1190 ESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAE----VVKPKGRAG 1245
Query: 588 QQAKKPKPNNANTVAAKKQVRTFDSEDEDVAKPMSYDEKRQLSLDINKLPGDKLGKVVHI 647
+ K P A + + + E D+ ++ K+
Sbjct: 1246 AKKKAP---------AAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARR 1296
Query: 648 IQSREPSLR---EPNPDEIEIDFETLKPSTLRELEQYV---SSCLRKRTYKKTPKPKDEK 701
+R+ L + + + D ++ S L++ + K+ K K
Sbjct: 1297 AAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRG 1356
Query: 702 FAEKKHELEKRLQDVTSQIDSTNKKLKKPKPTTSAAGPTGASRLSASSSSSSDSDSSGQS 761
A + + + + KK + ++ + + ++++ +++SS
Sbjct: 1357 PATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENV 1416
Query: 762 TSSSSSESS 770
+ SSSSE
Sbjct: 1417 SGSSSSEKD 1425
>gnl|CDD|177952 PLN02318, PLN02318, phosphoribulokinase/uridine kinase.
Length = 656
Score = 29.4 bits (66), Expect = 8.4
Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 3/74 (4%)
Query: 709 LEKRLQDVTSQIDSTNKKLKKPKPTTSA---AGPTGASRLSASSSSSSDSDSSGQSTSSS 765
L +R+ + TS+I+ N KL K + S A A SA +S +G T S
Sbjct: 509 LNERMDEFTSRIEELNSKLSIKKNSPSQQNLALQAEACNGSAPTSYFVSGLGNGSLTGSI 568
Query: 766 SSESSDSEGEKGPH 779
SS S
Sbjct: 569 LPLSSSSSQLAKES 582
>gnl|CDD|227625 COG5309, COG5309, Exo-beta-1,3-glucanase [Carbohydrate transport
and metabolism].
Length = 305
Score = 29.0 bits (65), Expect = 8.6
Identities = 15/45 (33%), Positives = 22/45 (48%)
Query: 733 TTSAAGPTGASRLSASSSSSSDSDSSGQSTSSSSSESSDSEGEKG 777
++ T +S A+ SSSS + SS S SSSS + + G
Sbjct: 3 SSMQFSSTSSSAALATLSSSSSALSSSASEVSSSSSRASASGFLA 47
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 29.2 bits (65), Expect = 8.6
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 2/57 (3%)
Query: 38 TSDSTSSSSPPPSSTEVTPVKKSPAPPSSSSSAPAPSHHNESNSGNSTKAASVEPPP 94
S +++SSS S ++ +PA P+ ++S ++ NST A E
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTD--NSTNANGTESTT 79
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 29.3 bits (65), Expect = 8.7
Identities = 26/146 (17%), Positives = 47/146 (32%), Gaps = 11/146 (7%)
Query: 4 AETETATDPVVNQNGTGDPDPVKAEQDVKVNNNNTSDSTSSSSPPPSSTEVTPVKKSPAP 63
E ETA PV + P K V NN +P + T + S
Sbjct: 398 PEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDV-PPWEDAPDEAQTAAGTAQTSAKS 456
Query: 64 PSSSSSA--PAPSHHNESNSGNSTKAASVEPPPRDEPRL-----EPVDGIVQPPVVPPKH 116
++S A P + +++ + ++ A + P + P E V+ P +
Sbjct: 457 IQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEP 516
Query: 117 RPGRNTNQLQYIVKN---VMKAVWKH 139
G ++ + W+H
Sbjct: 517 FYGYGFPDNDCPPEDGAEIPPPDWEH 542
>gnl|CDD|183064 PRK11267, PRK11267, biopolymer transport protein ExbD; Provisional.
Length = 141
Score = 28.2 bits (63), Expect = 8.8
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 267 LPSATSTPKPRPPNPV 282
LP++TSTP+PRP PV
Sbjct: 49 LPASTSTPQPRPEKPV 64
>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
protein, TolC family. Members of This model are outer
membrane proteins from the TolC subfamily within the RND
(Resistance-Nodulation-cell Division) efflux systems.
These proteins, unlike the NodT subfamily, appear not to
be lipoproteins. All are believed to participate in type
I protein secretion, an ABC transporter system for
protein secretion without cleavage of a signal sequence,
although they may, like TolC, participate also in the
efflux of smaller molecules as well. This family
includes the well-documented examples TolC (E. coli),
PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
[Protein fate, Protein and peptide secretion and
trafficking, Transport and binding proteins, Porins].
Length = 415
Score = 29.3 bits (66), Expect = 8.9
Identities = 11/57 (19%), Positives = 18/57 (31%), Gaps = 3/57 (5%)
Query: 460 VTMARQLSAVFEDRFAKMPDESNLASRAAAS---VSSDDDSEDERQNQLKYLQEQLK 513
+ +A R +L R A + V + + L L+EQL
Sbjct: 88 NAVRAAEAAALAARETLRATAQDLILRTAEAYMEVLRAQEILALAEANLAALKEQLD 144
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 29.4 bits (66), Expect = 9.0
Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 3/82 (3%)
Query: 44 SSSPPPSSTEVTPVKKSPAPPSS-SSSAPAPSHHNESNSGNSTKAASVEPPPRDEPRLEP 102
S T P + S S++P S ++ SG + + AS+ PP + +LE
Sbjct: 348 VSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLED 407
Query: 103 VDGIVQPPVVPPKHRPGRNTNQ 124
P P + Q
Sbjct: 408 ASDKQ--PPTPGSSLVPQPDEQ 427
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.124 0.354
Gapped
Lambda K H
0.267 0.0798 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,426,153
Number of extensions: 3727658
Number of successful extensions: 7401
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6570
Number of HSP's successfully gapped: 526
Length of query: 779
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 674
Effective length of database: 6,280,432
Effective search space: 4233011168
Effective search space used: 4233011168
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 63 (27.9 bits)