RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14676
(1827 letters)
>gnl|CDD|219941 pfam08623, TIP120, TATA-binding protein interacting (TIP20). TIP120
(also known as cullin-associated and
neddylation-dissociated protein 1) is a TATA binding
protein interacting protein that enhances transcription.
Length = 169
Score = 210 bits (537), Expect = 1e-62
Identities = 86/163 (52%), Positives = 117/163 (71%), Gaps = 6/163 (3%)
Query: 1598 PHIHILLPPVI----IIKTLIREVEMGPFKHTVDDGLDIRKAAFECMYTLLDTCLDRVDM 1653
P ++ LLP + + K LIREV+MGPFKH VDDGL++RK A+E MYTLLD+C R+D+
Sbjct: 5 PLLNELLPALYSETKVKKELIREVQMGPFKHVVDDGLELRKLAYETMYTLLDSCFSRLDI 64
Query: 1654 FEFLNHLESGLRDHYDIKMLTYLMLARLAQMTPAALLQRLERIVEPLRNTVTLKVKANSV 1713
EFL+ +E+GL D +DIKML LML++LA + P +L RL+R+ EPLR T++ KVK N+V
Sbjct: 65 PEFLDRIEAGLSDEHDIKMLALLMLSKLAVLAPEEVLARLDRLAEPLRKTLSKKVKENAV 124
Query: 1714 KQEYEKQDELKRSAMRAVAAL-LTIQDADKNPAL-NDFVATIK 1754
KQE EKQ EL RS +R V +L L + A +P + N+FV +K
Sbjct: 125 KQEIEKQQELIRSVLRLVVSLNLKLPSAQDSPVVWNEFVEQVK 167
>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region. This family
consists of the N terminal region of various alpha, beta
and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
protein complexes. The adaptor protein (AP) complexes are
involved in the formation of clathrin-coated pits and
vesicles. The N-terminal region of the various adaptor
proteins (APs) is constant by comparison to the
C-terminal which is variable within members of the AP-2
family; and it has been proposed that this constant
region interacts with another uniform component of the
coated vesicles.
Length = 522
Score = 47.6 bits (114), Expect = 5e-05
Identities = 72/440 (16%), Positives = 151/440 (34%), Gaps = 88/440 (20%)
Query: 1085 FLLSPVIASVGDNFYKITAEALLVLQQFVKVIRPLTSPSSYDFA-------PFTSDIYQC 1137
FL+ + + D + + A+ +L + K R + + Q
Sbjct: 151 FLVPELKELLSDKDPGVVSAAVALLYEIRKNDRLYLNKLLPLLVRRLCNLLTVCNPWLQV 210
Query: 1138 TLVRLKAADIDQEVKERAISCMGQIICNLGDYLKSELPVCLPILLDRLRNEITRLTTVKA 1197
++RL Q+ +E +++ ++ + L++ N VK
Sbjct: 211 KILRLLTRYAPQDPREPK-----ELLEDILNLLQN-------------SNNAVLYEAVKT 252
Query: 1198 LTKIAASPLRIDLRPILTDGIPILGSFLRKNQRALKLSTLLLLDTLLVNYPSAISTEMLN 1257
+ + P ++ + LG L L+ L L+ +L +P A + +
Sbjct: 253 IIHLDPEP------ELIVLAVNALGRLLSSPDENLRYVALRNLNKILEKHPPA----VQH 302
Query: 1258 KVTVELAPLVSESDLHIAQLTLTLLTSIAKIQPNALVEPAAASILPQIGRHTDLTSVADL 1317
+ + L ++ D+ I L LL + E I+ ++ ++ ++ +AD
Sbjct: 303 LDLI-IFCLKTDDDISIRLRALDLLYKLVD-------ESNVKEIVKELLKY--VSEIAD- 351
Query: 1318 KQVILNAFTPTSEEVKSAASYTLGSIAVGNLQ---QYVPFVLKEIENQPKR-QYLLLHSL 1373
E K +G +A + +L+ + ++ +
Sbjct: 352 ------------PEFKIKLVKAIGRLAEKFPTDAEWCIDVLLELLSLAGSYVVDEIVEVI 399
Query: 1374 KEIISWQSTSTSGVAQLQQFVPAIWAQLFLHCECQEEGTRNVVAECLGK---LTLIDPSN 1430
++II +L++++ +L + + R LG+ L PS+
Sbjct: 400 RDII-------RKYPELREYILEHLCELL--EDIESPEARAAALWILGEYGELIPNSPSD 450
Query: 1431 LLPRLQESLKSNSALMRTTVLTA-VKFTISDQMTSSDKDFRFMATNDLMSELQKDSIKLD 1489
LL + E S +R +LTA VK ++ T D+ N ++ + + +
Sbjct: 451 LLRSILEVFVLESLKVRLALLTALVKLSL----TFPDE----EVQNLIVQVVLSLATQDS 502
Query: 1490 DDSE---RKVVKMLLRLLED 1506
D E R V LRLL
Sbjct: 503 SDLELRDRAV--EYLRLLSL 520
Score = 32.6 bits (75), Expect = 2.5
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 5/69 (7%)
Query: 638 IPILGSFLRKNQRALKLSTLLLLDTLLVNYPSAISTEMLNKVTVELAPLVSESDLHIAQL 697
+ LG L L+ L L+ +L +P A + + + + L ++ D+ I
Sbjct: 267 VNALGRLLSSPDENLRYVALRNLNKILEKHPPA----VQHLDLI-IFCLKTDDDISIRLR 321
Query: 698 TLTLLTSIA 706
L LL +
Sbjct: 322 ALDLLYKLV 330
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 42.6 bits (101), Expect = 7e-05
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 13/46 (28%)
Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
D S AEW+EEE EG ED E E E+D E DDDDM
Sbjct: 41 DYDSDAEWEEEE--------EG-----EDLESEDEEDEEEDDDDDM 73
Score = 36.8 bits (86), Expect = 0.008
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 14/46 (30%)
Query: 214 NYNYEDEEEGLGGEGGGEGMDTEEEDGED---EGEDDNEYSDDDDM 256
+Y+Y+ + E EEE+GED E E+D E DDDDM
Sbjct: 39 DYDYDSDAEW-----------EEEEEGEDLESEDEEDEEEDDDDDM 73
Score = 31.4 bits (72), Expect = 0.54
Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 211 YDPNYNYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDN 248
YD + +E+EEEG E E E+E+ +D+ + D
Sbjct: 42 YDSDAEWEEEEEGEDLESEDE----EDEEEDDDDDMDG 75
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 38.2 bits (89), Expect = 0.001
Identities = 16/54 (29%), Positives = 20/54 (37%)
Query: 53 SVQLEALDILSDLLSRFGALLPSFHFVILTALLPQLSSPRQAVRKRTIVALSHL 106
V+ A L L LL +L ALLP L VR+ AL +
Sbjct: 2 EVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 29.7 bits (67), Expect = 1.3
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 1150 EVKERAISCMGQIICNLGDYLKSELPVCLPILLDRLR--NEITRLTTVKALTKI 1201
EV+E A +G + + L+ +P LP LL L+ ++ R AL +I
Sbjct: 2 EVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 28.6 bits (64), Expect = 4.1
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 258 WKVRRAAAKCLEAIIATRRDLIGDMYRLVSPPLIARFKEREENVK 302
W+VR AAA L A+ +L+ + P L+ K+ ++ V+
Sbjct: 1 WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVR 45
Score = 28.6 bits (64), Expect = 4.1
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 438 WKVRRAAAKCLEAIIATRRDLIGDMYRLVSPPLIARFKEREENVK 482
W+VR AAA L A+ +L+ + P L+ K+ ++ V+
Sbjct: 1 WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVR 45
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 41.5 bits (97), Expect = 0.006
Identities = 14/38 (36%), Positives = 28/38 (73%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
+ +E+G+ G GG +G D+EEE+ E+E E++ E ++++
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883
Score = 41.1 bits (96), Expect = 0.006
Identities = 14/37 (37%), Positives = 27/37 (72%)
Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
+E+G+ G GG +G D+EEE+ E+E E++ E ++++
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883
>gnl|CDD|182354 PRK10281, PRK10281, hypothetical protein; Provisional.
Length = 299
Score = 39.3 bits (92), Expect = 0.016
Identities = 30/102 (29%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 1193 TTVKALTKIAASPLRI-----DLRPILTDGIPILGSFLRKNQRALKLSTL-LLLDTLLVN 1246
TV T P+ I D R +T G P L RA ++ L L D +L
Sbjct: 94 CTVWQTTLAGILPVDIEKENDDYRISMTQGTPEFEPPLEGETRAAIINALGLTEDDILPG 153
Query: 1247 YPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTSIAKI 1288
P +++ +KV + PL E DL L LT+I+K
Sbjct: 154 LPIQVASTGHSKVMI---PLKPEVDLDALSPNLAALTAISKQ 192
Score = 35.8 bits (83), Expect = 0.17
Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Query: 629 DLRPILTDGIPILGSFLRKNQRALKLSTL-LLLDTLLVNYPSAISTEMLNKVTVELAPLV 687
D R +T G P L RA ++ L L D +L P +++ +KV + PL
Sbjct: 115 DYRISMTQGTPEFEPPLEGETRAAIINALGLTEDDILPGLPIQVASTGHSKVMI---PLK 171
Query: 688 SESDLHIAQLTLTLLTSIAKI 708
E DL L LT+I+K
Sbjct: 172 PEVDLDALSPNLAALTAISKQ 192
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 38.1 bits (89), Expect = 0.037
Identities = 14/50 (28%), Positives = 24/50 (48%)
Query: 389 IPDVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSW 438
IP + + +W EEE E +++D E+E E + + DD+ W
Sbjct: 69 IPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEW 118
Score = 32.7 bits (75), Expect = 1.6
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 217 YEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
+ +EE + +G++++++D E+E + E D DD
Sbjct: 76 EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 35.1 bits (81), Expect = 0.21
Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 8/47 (17%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYS--------DDDDM 256
DEE+ E E + +E+ +D+ +DD++Y+ +DDD
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDY 214
Score = 34.7 bits (80), Expect = 0.31
Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 8/46 (17%)
Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYS--------DDDDM 436
DEE+ E E + +E+ +D+ +DD++Y+ +DDD
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDY 214
Score = 33.6 bits (77), Expect = 0.74
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 220 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
E E + E + EEE+ E+E ED ++ DDDD
Sbjct: 162 ELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 33.6 bits (77), Expect = 0.74
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
E E + E + EEE+ E+E ED ++ DDDD
Sbjct: 162 ELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 32.4 bits (74), Expect = 1.4
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 256
+ +E + E EEE+ E+E ED++ DDDD
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 32.4 bits (74), Expect = 1.5
Identities = 14/40 (35%), Positives = 20/40 (50%)
Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
E +E + E EEE+ E+E ED++ DDDD
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 32.4 bits (74), Expect = 1.5
Identities = 14/38 (36%), Positives = 23/38 (60%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
E + + L E E + +EE+ E+E E+D ++ DDDD
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 32.4 bits (74), Expect = 1.8
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 2/38 (5%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
E E+ E E + EEE+ ED +DD++ DDDD
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD--DDDD 199
Score = 32.0 bits (73), Expect = 2.3
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 231 EGMDTEEEDGEDEGEDDNEYSDDDDMS 257
+ D ++E+ E+E E+++E DDDD
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195
Score = 32.0 bits (73), Expect = 2.3
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 411 EGMDTEEEDGEDEGEDDNEYSDDDDMS 437
+ D ++E+ E+E E+++E DDDD
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195
Score = 31.3 bits (71), Expect = 3.9
Identities = 15/36 (41%), Positives = 21/36 (58%)
Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
E E + E + + EEE+ EDE DD++ DDDD
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDD 199
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 34.6 bits (80), Expect = 0.40
Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 5/93 (5%)
Query: 345 PSLVKALQCQLREKSIRTRQDCLALLRELVQTLPGCLAPSLPALIPDVTSVAEWDEEEGL 404
P V C ++K + L ++E+++ G P+V E D EE L
Sbjct: 227 PQYVIVTTCSDKDKGMEIIGAALEAIKEVIKKKGGDFKVKGE---PEVVGGDEEDLEELL 283
Query: 405 GG--EGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
E E +E ED ++E ED+ E DDD+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316
>gnl|CDD|204198 pfam09324, DUF1981, Domain of unknown function (DUF1981). Members of
this family of functionally uncharacterized domains are
found in various plant and yeast protein transport
proteins.
Length = 86
Score = 31.8 bits (73), Expect = 0.49
Identities = 16/53 (30%), Positives = 21/53 (39%), Gaps = 9/53 (16%)
Query: 1129 PFTSDIYQCTLVRLKAADIDQEVKERAISCMGQIICNLGDYLKSELPVCLPIL 1181
PF I + EVKE + C+ QII + D LKS +L
Sbjct: 21 PF-EYIMSNNQ--------NVEVKELVLECISQIILSRADNLKSGWKTIFGVL 64
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 34.7 bits (80), Expect = 0.60
Identities = 25/114 (21%), Positives = 39/114 (34%), Gaps = 6/114 (5%)
Query: 645 LRKNQRALKLSTLLLLDTLLVNYPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTS 704
Q L L VN E + A L+ + QL+ L
Sbjct: 22 AESAQLEAALQLLQEA----VNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQ 77
Query: 705 IAKIQPNALVEPAAASILPQVLQIAKSPLLQGAALTSMLEFFKSL--VLAELPG 756
++ A + A++L Q+LQ +++ Q A L L + L L LP
Sbjct: 78 TERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRILLQQLGPLPE 131
>gnl|CDD|218375 pfam05004, IFRD, Interferon-related developmental regulator (IFRD).
Interferon-related developmental regulator (IFRD1) is
the human homologue of the rat early response protein
PC4 and its murine homologue TIS7. The exact function of
IFRD1 is unknown but it has been shown that PC4 is
necessary to muscle differentiation and that it might
have a role in signal transduction. This family also
contains IFRD2 and its murine equivalent SKMc15 which
are highly expressed soon after gastrulation and in the
hepatic primordium, suggesting an involvement in early
hematopoiesis.
Length = 309
Score = 34.0 bits (78), Expect = 0.69
Identities = 24/86 (27%), Positives = 37/86 (43%), Gaps = 8/86 (9%)
Query: 519 LLSTQVPSLVKALQCQLREKSIRTRQDCLALLRELMTSSDKDFRFMATNDLMSEL----- 573
+L+T P L L + I + LALL EL ++DF L+S+L
Sbjct: 225 ILATHQPKLPTLLSSDDLDVRIAAGE-TLALLFELARDEEEDFVEEDMEALLSQLRQLAT 283
Query: 574 --QKDSIKLDDDSERKVVKMLLRLLE 597
K K D ++R + +LR +E
Sbjct: 284 DSSKYRAKKDRRAQRSTFRDVLRFVE 309
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 34.0 bits (78), Expect = 0.86
Identities = 35/228 (15%), Positives = 71/228 (31%), Gaps = 29/228 (12%)
Query: 223 GLGGEGGGEGMDTEEEDGEDE-GEDDNEYSDDDDMSWKVRRAAAKCLEAIIATRRDLIG- 280
GL E +T E + E +++ ++D K ++ + E+ + + +
Sbjct: 148 GLADSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFEALNV 207
Query: 281 ---DMYRLVSP---PLIARFKEREENVKSDIFHAYMALLKGTRTF--SPLTPSDPNQMDQ 332
D+ V P A F +E + ++++ R S P + + D
Sbjct: 208 KKLDLEFAVDPLFKKTSADF---DEGGAKGLLLNHLSIDSEGRLVFDSSEDPEELSSEDD 264
Query: 333 EDSPLCLLSTQVPSLVKALQCQLREKSIRTRQDCLALLRELVQTLPGCLAPSLPALIPDV 392
E + L L ++ I L T PS P +
Sbjct: 265 EFPDDGEIDLTPLKFFPLLDR-LEKRPICPS------LAGFQLT-----DPSGSLDDPFL 312
Query: 393 TSVAEWDEEEGLGGEGGGEGMDTEEE----DGEDEGEDDNEYSDDDDM 436
+ + + E+ + ++ D E EDD + +D
Sbjct: 313 KAPEDTENEDESDDNADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGG 360
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 31.5 bits (71), Expect = 1.0
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)
Query: 213 PNYNYEDEEEGLGGEGGGEGMDTEEED------GEDEGEDDNEYSDDDDMSWKVRRAA 264
P E+EE G + G + M+ EEE E EGE++ +++ +RAA
Sbjct: 33 PANGNENEENG--AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA 88
Score = 30.3 bits (68), Expect = 2.3
Identities = 11/38 (28%), Positives = 18/38 (47%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
E +E+ EG GE + EEE+ + D++D
Sbjct: 56 EVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEED 93
Score = 29.9 bits (67), Expect = 4.3
Identities = 12/45 (26%), Positives = 23/45 (51%)
Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAA 444
+EG E +D ++E+ E EGE++ +++ +RAA
Sbjct: 44 AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA 88
Score = 29.5 bits (66), Expect = 4.6
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
D E +E+ EG GE + EEE+ + D++D
Sbjct: 49 DEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEED 93
Score = 29.5 bits (66), Expect = 5.2
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
E E G + G + M+ EEE ED+ E++ E +++
Sbjct: 38 ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEG 73
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 33.8 bits (78), Expect = 1.0
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 217 YEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
E+EE+G+ E + D EE+ ED D E ++D+ S
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363
Score = 33.4 bits (77), Expect = 1.5
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
+E +++ G G GE + EEEDG D D++E DDDD
Sbjct: 305 EFEPDDDDNFGLGQGEEDEEEEEDGVD---DEDEEDDDDD 341
Score = 33.0 bits (76), Expect = 1.7
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
E D+++ G G E + EE+ +DE E+D DDDD
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEED----DDDD 341
Score = 31.9 bits (73), Expect = 4.5
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 437
+EE+G+ E + D EE+ ED D E ++D+ S
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 31.5 bits (72), Expect = 1.1
Identities = 20/102 (19%), Positives = 33/102 (32%), Gaps = 10/102 (9%)
Query: 519 LLSTQVPSLVKALQCQLREKSIRT--------RQDCLALLRELMTSSDKDFRFMATNDLM 570
LLS+ ++ + L S L L +L+ S D++ A +
Sbjct: 15 LLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAAL-WAL 73
Query: 571 SELQKDSIKLDDDSERK-VVKMLLRLLEDKNGEVQNLAVKCL 611
L V L+ LL+ N ++Q A L
Sbjct: 74 RNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGAL 115
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 33.0 bits (75), Expect = 1.2
Identities = 16/32 (50%), Positives = 17/32 (53%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE 249
EDEE G G G EEE+ E EG DD E
Sbjct: 192 EDEEVGSDSYGEGNRELNEEEEEEAEGSDDGE 223
Score = 32.7 bits (74), Expect = 1.7
Identities = 16/33 (48%), Positives = 17/33 (51%)
Query: 397 EWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE 429
E DEE G G G EEE+ E EG DD E
Sbjct: 191 EEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGE 223
Score = 31.1 bits (70), Expect = 4.8
Identities = 14/70 (20%), Positives = 24/70 (34%), Gaps = 6/70 (8%)
Query: 200 SITELCLTYITYDPNYNYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWK 259
+T + + + N + + E E EE DG +E ED+ D +
Sbjct: 150 RVTRFNIVWDNDEDNDEAPPAQPDVDNE---EEERLEESDGREEEEDE---EVGSDSYGE 203
Query: 260 VRRAAAKCLE 269
R + E
Sbjct: 204 GNRELNEEEE 213
>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain. This putative domain is found
to be the most conserved region in mediator of
replication checkpoint protein 1.
Length = 145
Score = 32.0 bits (73), Expect = 1.3
Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 10/53 (18%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGED----DNEYSDDDDMSWKVRRAAAK 266
EDE GLGG D E+ D +D+ E D+EYS + + ++ A+
Sbjct: 18 EDEWAGLGG------ADDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAE 64
Score = 30.0 bits (68), Expect = 6.5
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 10/52 (19%)
Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGED----DNEYSDDDDMSWKVRRAAAK 446
DE GLGG D E+ D +D+ E D+EYS + + ++ A+
Sbjct: 19 DEWAGLGG------ADDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAE 64
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 33.0 bits (75), Expect = 1.5
Identities = 10/37 (27%), Positives = 14/37 (37%), Gaps = 1/37 (2%)
Query: 219 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
++ G G D +E D+ E DDDD
Sbjct: 259 NKYGDFKGVNYGYESDDDEGS-SSNDYDEEEDGDDDD 294
Score = 33.0 bits (75), Expect = 1.5
Identities = 10/37 (27%), Positives = 14/37 (37%), Gaps = 1/37 (2%)
Query: 399 DEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
++ G G D +E D+ E DDDD
Sbjct: 259 NKYGDFKGVNYGYESDDDEGS-SSNDYDEEEDGDDDD 294
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 32.1 bits (73), Expect = 2.1
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)
Query: 399 DEEEGLGGEGGGEGMDTEEED-GEDEGEDDNEYSDDD 434
DEE+G +EE+ ED+ E+ E D D
Sbjct: 196 DEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232
Score = 30.5 bits (69), Expect = 7.3
Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDE-GEDDNEYSDDD 254
EDEE+G +EE+ +++ E+ E D D
Sbjct: 195 EDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 32.7 bits (74), Expect = 2.1
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 224 LGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
LG E E M+ E++D ED+ +DD+E DDDD
Sbjct: 41 LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72
Score = 32.7 bits (74), Expect = 2.1
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 404 LGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
LG E E M+ E++D ED+ +DD+E DDDD
Sbjct: 41 LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72
Score = 31.9 bits (72), Expect = 4.0
Identities = 13/40 (32%), Positives = 24/40 (60%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
E EEE + E E D +++D +++ +DD++ DD+D
Sbjct: 44 EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83
Score = 30.3 bits (68), Expect = 9.8
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 400 EEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 437
EEE + E E D +++D +++ +DD++ DD+D
Sbjct: 46 EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 30.0 bits (68), Expect = 2.3
Identities = 19/65 (29%), Positives = 30/65 (46%), Gaps = 11/65 (16%)
Query: 548 ALLRELMTSSDKDFRFMATNDLMSELQKDSIKLDDDSERKVVKMLLRLLEDKNGEVQNLA 607
ALL L++ D + R A + EL + + + LL LL+D + EV+ A
Sbjct: 3 ALLEALLSDPDPEVRAAAAR-ALGELG----------DPEALPALLELLKDPDPEVRRAA 51
Query: 608 VKCLG 612
+ LG
Sbjct: 52 AEALG 56
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 32.6 bits (75), Expect = 2.3
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 3/42 (7%)
Query: 219 DEEEGLGGEGGGEGM---DTEEEDGEDEGEDDNEYSDDDDMS 257
+++G E GG G + E E + E E ++ + D+S
Sbjct: 2 ADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTGSDIS 43
Score = 32.6 bits (75), Expect = 2.3
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 3/42 (7%)
Query: 399 DEEEGLGGEGGGEGM---DTEEEDGEDEGEDDNEYSDDDDMS 437
+++G E GG G + E E + E E ++ + D+S
Sbjct: 2 ADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTGSDIS 43
>gnl|CDD|197630 smart00288, VHS, Domain present in VPS-27, Hrs and STAM. Unpublished
observations. Domain of unknown function.
Length = 133
Score = 30.7 bits (70), Expect = 2.9
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
Query: 1484 DSIKLDDDSERKVVKMLLRLLEDKNGEVQNLAVKCLGPLV-NKVKEFQVE 1532
D I D + V++L + L +KN V LA+ L V N +F +E
Sbjct: 26 DLINSTPDGPKDAVRLLKKRLNNKNPHVALLALTLLDACVKNCGSKFHLE 75
>gnl|CDD|219458 pfam07539, DRIM, Down-regulated in metastasis. These eukaryotic
proteins include DRIM (Down-Regulated In Metastasis),
which is differentially expressed in metastatic and
non-metastatic human breast carcinoma cells. It is
believed to be involved in processing of non-coding RNA.
Length = 140
Score = 30.6 bits (70), Expect = 3.3
Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 9/67 (13%)
Query: 1495 KVVKMLLRLLEDKNGEVQNLAVKCL----GPLVNKVKEFQVETIVDALCSN-----MVSD 1545
++ ++LL LL ++ EVQ LA+ C+ P + ++ + D + ++S
Sbjct: 17 ELYELLLTLLSSRDPEVQKLALDCILTYKNPAITPYRDNLKNLLDDKTFRDELTTFLLSS 76
Query: 1546 KEQLRDI 1552
E + D
Sbjct: 77 DEVIEDE 83
>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat. The HEAT repeat family is related to
armadillo/beta-catenin-like repeats (see pfam00514).
Length = 31
Score = 27.5 bits (62), Expect = 4.2
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 588 VVKMLLRLLEDKNGEVQNLAVKCLGPLFKAL 618
++ +LL LL D + EV+ A + LG L + L
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAEVL 31
Score = 27.5 bits (62), Expect = 5.6
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 1496 VVKMLLRLLEDKNGEVQNLAVKCLGPLVNKV 1526
++ +LL LL D + EV+ A + LG L +
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAEVL 31
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 29.7 bits (66), Expect = 4.3
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 225 GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 255
G E D EE+D +++ EDD+E D+DD
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDD 38
Score = 29.7 bits (66), Expect = 4.3
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 405 GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDD 435
G E D EE+D +++ EDD+E D+DD
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDD 38
>gnl|CDD|237614 PRK14113, PRK14113, urease accessory protein UreE; Provisional.
Length = 152
Score = 30.3 bits (68), Expect = 5.1
Identities = 28/129 (21%), Positives = 53/129 (41%), Gaps = 25/129 (19%)
Query: 1484 DSIKLDDDSERKVVKMLLRLLEDKNGEVQNLAVKCLGPLVNKVKEFQVETIVDALCSNMV 1543
DSI LD D+ +K ++ + D E+ G + + V I+ C ++
Sbjct: 17 DSITLDHDTRKKG---RIKSVTDNGEEI--------GVFLERGHPLLVGEILKTECGKII 65
Query: 1544 SDKEQLRDISSIGLKTVIS---VVYCLGDRNSSSNMK-----------IDTLAFTHTLLT 1589
K + D+++ + ++ Y LG+R++ + ++ LA + L
Sbjct: 66 QVKGKAEDVATASAEDWLAFSKACYHLGNRHTKLQIGERWVRFKPDHVLEDLAENYGLSV 125
Query: 1590 THPPAVFHP 1598
H PAVF P
Sbjct: 126 NHEPAVFEP 134
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.3 bits (71), Expect = 5.3
Identities = 12/51 (23%), Positives = 22/51 (43%), Gaps = 6/51 (11%)
Query: 391 DVTSVAEWDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEY------SDDDD 435
D E + E+ E E +EE+ + + ED ++Y S D++
Sbjct: 168 DKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEE 218
>gnl|CDD|222974 PHA03082, PHA03082, DNA-dependent RNA polymerase subunit;
Provisional.
Length = 63
Score = 28.2 bits (63), Expect = 5.9
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 9 RDISSIGLKTVISELPLGSNALA--ANVCRRITGKLSTAIEKQEDVSVQLEALDI 61
RD+S + +I + L L N C R+ KLST IE Q +++VQ LDI
Sbjct: 12 RDLSEERYRLLIKKKSL-KKVLRTVKNSCCRL--KLSTQIEPQRNLTVQ-PLLDI 62
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 28.9 bits (65), Expect = 6.7
Identities = 15/55 (27%), Positives = 22/55 (40%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAAAKCLEAII 272
E+EEE + + D +E E E EDD + D K A+ L +
Sbjct: 11 EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYL 65
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 31.0 bits (71), Expect = 6.9
Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 36/151 (23%)
Query: 303 SDIFHAYMALLKGTRTFSP----------LTPSDPNQMDQEDSPLCLLSTQVPSLVKALQ 352
SD ++ +SP D +D+E+ + ++V+AL
Sbjct: 253 SDPVENARIIINELEKYSPKLAEKPRWLVFNKID--LLDEEE-----AEERAKAIVEALG 305
Query: 353 CQLREKSIR--TRQDCLALLRELVQTLPGCLAPSLPALIPDVTSVAE-----WDEEEGLG 405
+ I + L +L+ + P + E WD+
Sbjct: 306 WEGPVYLISAASGLGVKELCWDLMTFIE-----ENPREEAEEAEAPEKVEFMWDDYHR-- 358
Query: 406 GEGGGEGMDTEEEDGEDEGEDDNEYSDDDDM 436
E ++ EE+ +D+ +DD + DD+ +
Sbjct: 359 -----EQLEEVEEEDDDDWDDDWDEDDDEGV 384
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 31.0 bits (70), Expect = 7.7
Identities = 14/46 (30%), Positives = 20/46 (43%), Gaps = 4/46 (8%)
Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGED----EGEDDNEYSDDDDMS 257
N ED E G G + + +E + D EGE D + +D S
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSES 288
>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
modification].
Length = 975
Score = 31.1 bits (70), Expect = 8.1
Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 5/163 (3%)
Query: 1129 PFTSDIYQCTLVRLKAADIDQEVKERAISCMGQIICNLGD-YLKSELPVCLPILLDRLRN 1187
P S I L L++ D V+ RA MG + L EL IL + L
Sbjct: 600 PHLSMIVSTILKLLRSKPPD--VRIRAADLMGSLAKVLKACGETKELAKLGNILYENLGE 657
Query: 1188 EITRL--TTVKALTKIAASPLRIDLRPILTDGIPILGSFLRKNQRALKLSTLLLLDTLLV 1245
+ + + +KA+ I + ++P ++ +P L LR + + +T+ L+ T+ +
Sbjct: 658 DYPEVLGSILKAICSIYSVHRFRSMQPPISGILPSLTPILRNKHQKVVANTIALVGTICM 717
Query: 1246 NYPSAISTEMLNKVTVELAPLVSESDLHIAQLTLTLLTSIAKI 1288
N P I ++ EL + + I + I++
Sbjct: 718 NSPEYIGVREWMRICFELVDSLKSWNKEIRRNATETFGCISRA 760
>gnl|CDD|215718 pfam00103, Hormone_1, Somatotropin hormone family.
Length = 198
Score = 30.0 bits (68), Expect = 8.1
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 9/45 (20%)
Query: 1460 DQMTSSDKDFRFMATNDLMSELQKDSIKLDDDSERKVVKMLLRLL 1504
+ SSD+D R A +L S L++DS K+D L+LL
Sbjct: 153 GSLKSSDEDRRLFAFYNLFSCLRRDSHKVD---------TYLKLL 188
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 30.7 bits (69), Expect = 8.3
Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 4/44 (9%)
Query: 218 EDEEEGLGGEGGGEGMDTEE----EDGEDEGEDDNEYSDDDDMS 257
E EE+G G G EG +E + + GE++ SD DD+
Sbjct: 224 EQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLP 267
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 30.9 bits (70), Expect = 8.4
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 2/48 (4%)
Query: 212 DPNYNYEDEEEGL--GGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMS 257
+ N E++EE L G D +E EDE ED+ + + + +
Sbjct: 6 NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPA 53
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 29.2 bits (66), Expect = 9.4
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 230 GEGMDTEEEDGEDEGEDDNEYSDDDD 255
D ++D EDE E+D+E DD+D
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDED 133
Score = 29.2 bits (66), Expect = 9.4
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 410 GEGMDTEEEDGEDEGEDDNEYSDDDD 435
D ++D EDE E+D+E DD+D
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDED 133
>gnl|CDD|200601 cd10979, CE4_PuuE_like, Putative catalytic domain of
uncharacterized prokaryotic polysaccharide deacetylases
similar to bacterial PuuE allantoinases. The family
includes a group of uncharacterized prokaryotic
polysaccharide deacetylases (DCAs) that show high
sequence similarity to the catalytic domain of bacterial
PuuE (purine utilization E) allantoinases. PuuE
allantoinase specifically catalyzes the hydrolysis of
(S)-allantoin into allantoic acid. It functions as a
homotetramer. Its monomer is composed of a 7-stranded
barrel with detectable sequence similarity to the
6-stranded barrel NodB homology domain of DCA-like
proteins in the CE4 superfamily, which removes N-linked
or O-linked acetyl groups from cell wall
polysaccharides. PuuE allantoinase appears to be
metal-independent and acts on a small substrate
molecule, which is distinct from the common feature of
DCAs which are normally metal ion dependent and
recognize multimeric substrates.
Length = 281
Score = 30.3 bits (69), Expect = 9.4
Identities = 8/44 (18%), Positives = 12/44 (27%)
Query: 143 AAICRQAGHRFAEHIERVMPLILKYSREEDDELREFCLQACEAF 186
R+ G F H L ++ E+ L E
Sbjct: 95 IEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKA 138
Score = 30.3 bits (69), Expect = 9.4
Identities = 8/44 (18%), Positives = 12/44 (27%)
Query: 881 AAICRQAGHRFAEHIERVMPLILKYSREEDDELREFCLQACEAF 924
R+ G F H L ++ E+ L E
Sbjct: 95 IEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKA 138
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 29.2 bits (66), Expect = 9.7
Identities = 20/66 (30%), Positives = 27/66 (40%), Gaps = 19/66 (28%)
Query: 218 EDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNE-----------YS----DDD----DMSW 258
++EEE + EEEDGED D+ E Y DD+ ++S
Sbjct: 5 DEEEELEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYEELSA 64
Query: 259 KVRRAA 264
RRAA
Sbjct: 65 AARRAA 70
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.5 bits (70), Expect = 9.8
Identities = 7/56 (12%), Positives = 20/56 (35%)
Query: 216 NYEDEEEGLGGEGGGEGMDTEEEDGEDEGEDDNEYSDDDDMSWKVRRAAAKCLEAI 271
+ D G ++ ++D ++E E+D ++ + +A+
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKAL 227
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.133 0.376
Gapped
Lambda K H
0.267 0.0737 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 91,384,114
Number of extensions: 9236771
Number of successful extensions: 11260
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10835
Number of HSP's successfully gapped: 136
Length of query: 1827
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1716
Effective length of database: 6,014,308
Effective search space: 10320552528
Effective search space used: 10320552528
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (29.2 bits)