RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15592
(294 letters)
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 97.4 bits (243), Expect = 2e-25
Identities = 41/113 (36%), Positives = 68/113 (60%), Gaps = 3/113 (2%)
Query: 38 EVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNAC 97
++S LS+ + ++ AA L +L ++ N Q GG+P LV+LL E +V + A
Sbjct: 11 ALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAAL 70
Query: 98 GALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
ALRNL+ G ++NK + AGG+P L+NLL +++ ++++ TG L NL+S
Sbjct: 71 WALRNLAAG--PEDNKLIVLEAGGVPKLVNLLD-SSNEDIQKNATGALSNLAS 120
Score = 75.8 bits (187), Expect = 3e-17
Identities = 46/130 (35%), Positives = 68/130 (52%), Gaps = 18/130 (13%)
Query: 76 GGIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADA 135
GG+P LV LL +V R A AL NLS G N++N +A+ AGG+P L+ LL K+ D
Sbjct: 7 GGLPALVSLLSSSDENVQREAAWALSNLSAG--NNDNIQAVVEAGGLPALVQLL-KSEDE 63
Query: 136 EVKELVTGVLWNLSSCEDLKKSIIDD--GLQVVVNHIIIPHSGWDPVSAGETCWSTIFRN 193
EV + L NL++ + K I+ + G+ +VN ++ S D I +N
Sbjct: 64 EVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVN--LLDSSNED-----------IQKN 110
Query: 194 TSGVLRNASS 203
+G L N +S
Sbjct: 111 ATGALSNLAS 120
Score = 38.1 bits (89), Expect = 0.001
Identities = 31/141 (21%), Positives = 56/141 (39%), Gaps = 27/141 (19%)
Query: 115 AIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSC-EDLKKSIID-DGLQVVVNHIII 172
A+ AGG+P L++LL + D V+ L NLS+ D +++++ GL +V +
Sbjct: 2 AVIQAGGLPALVSLLSSS-DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60
Query: 173 PHSGWDPVSAGETCWSTIFRNTSGVLRNASSAGEYARKKLRECEGL--IDSLLYVVKSAI 230
+ + LRN ++ E + + E G+ + +LL I
Sbjct: 61 ED-------------EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDI 107
Query: 231 EKSNIGNKSVENCVCILRNLS 251
+N L NL+
Sbjct: 108 ---------QKNATGALSNLA 119
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats bind
alpha-catenin and N-cadherin.
Length = 41
Score = 52.0 bits (126), Expect = 2e-09
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 65 DDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLSY 105
DD NKQ GG+P LV+LL E +V + A AL NLS
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Score = 45.5 bits (109), Expect = 4e-07
Identities = 20/42 (47%), Positives = 28/42 (66%), Gaps = 1/42 (2%)
Query: 109 NDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
+DENK+A+ +AGG+P L+ LL+ D EV + L NLSS
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSE-DEEVVKEAAWALSNLSS 41
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family does
not contain all known armadillo repeats.
Length = 41
Score = 50.5 bits (122), Expect = 5e-09
Identities = 16/40 (40%), Positives = 20/40 (50%)
Query: 65 DDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLS 104
NKQ G +PPLV+LL +V A AL NL+
Sbjct: 1 SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLA 40
Score = 44.4 bits (106), Expect = 9e-07
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 109 NDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
+ ENK+A+ AG +P L+ LL + D EV+E L NL++
Sbjct: 1 SPENKQAVIEAGAVPPLVQLL-SSPDEEVQEEAAWALSNLAA 41
>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
Provisional.
Length = 2102
Score = 55.9 bits (135), Expect = 1e-08
Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
Query: 39 VISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHE-SPDVFRNAC 97
++ LS+ N+ +ANAA+ L L + + K G + L+KLLG V A
Sbjct: 194 LVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQGNEVSVRAEAA 253
Query: 98 GALRNLSYGRQNDENKRAIKNAGGIPLLIN 127
GAL LS Q+ E K+AI +AGGIP LIN
Sbjct: 254 GALEALS--SQSKEAKQAIADAGGIPALIN 281
Score = 47.0 bits (112), Expect = 9e-06
Identities = 41/121 (33%), Positives = 54/121 (44%), Gaps = 12/121 (9%)
Query: 40 ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRS---LGG-IPPLVKLLGHESPDVFRN 95
+S L + K NAAA L LC +D R LGG IPPL+ LL S + +
Sbjct: 64 VSLLRSGTLGAKVNAAAVLGVLCKEED-----LRVKVLLGGCIPPLLSLLKSGSAEAQKA 118
Query: 96 ACGALRNLSYGRQNDENKRAI-KNAGGIPLLINLL--RKTADAEVKELVTGVLWNLSSCE 152
A A+ +S G +D I G +P L + L D V+ L+TG L NL
Sbjct: 119 AAEAIYAVSSGGLSDHVGSKIFSTEGVVPSLWDQLQPGNKQDKVVEGLLTGALRNLCGST 178
Query: 153 D 153
D
Sbjct: 179 D 179
Score = 45.1 bits (107), Expect = 3e-05
Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 73 RSLG---GIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLL 129
+LG G+ L+ LLG S A L L+ + DE+K AI AGGIP L+ LL
Sbjct: 440 EALGGREGVQLLISLLGLSSEQQQEYAVALLAILT--DEVDESKWAITAAGGIPPLVQLL 497
Query: 130 RKTADAEVKELVTGVLWNL-SSCEDLK 155
+T + KE VLWNL ED++
Sbjct: 498 -ETGSQKAKEDSATVLWNLCCHSEDIR 523
Score = 44.3 bits (105), Expect = 6e-05
Identities = 37/142 (26%), Positives = 65/142 (45%), Gaps = 4/142 (2%)
Query: 34 PNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVF 93
P+L + + + + V++ L++LC D T GG+ LVKLL + D
Sbjct: 147 PSLWDQLQPGNKQDKVVEGLLTGALRNLCGSTDGFWSATLEAGGVDILVKLLSSGNSDAQ 206
Query: 94 RNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS-CE 152
NA L L + + + +AG + L+ LL + + V+ G L LSS +
Sbjct: 207 ANAASLLARLM--MAFESSISKVLDAGAVKQLLKLLGQGNEVSVRAEAAGALEALSSQSK 264
Query: 153 DLKKSIID-DGLQVVVNHIIIP 173
+ K++I D G+ ++N + P
Sbjct: 265 EAKQAIADAGGIPALINATVAP 286
Score = 39.7 bits (93), Expect = 0.002
Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 4/92 (4%)
Query: 40 ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGA 99
IS L + + A A L L D +K + GGIPPLV+LL S ++
Sbjct: 452 ISLLGLSSEQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETGSQKAKEDSATV 511
Query: 100 LRNLSYGRQNDENKRA-IKNAGGIPLLINLLR 130
L NL + E+ RA +++AG +P L+ LL+
Sbjct: 512 LWNLC---CHSEDIRACVESAGAVPALLWLLK 540
Score = 35.5 bits (82), Expect = 0.037
Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 6/83 (7%)
Query: 40 ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGA 99
I L +P+ ++ AA L HL + + T + PLV+L G + + A A
Sbjct: 1534 IPLLESPSQAVQQLAAELLSHLLAEEHFQQDITTQNA-VVPLVRLAGIGILSLQQRAVKA 1592
Query: 100 LRNLSYGRQNDENKRAIKNAGGI 122
L ++S A+ +AGGI
Sbjct: 1593 LESISLSWPK-----AVADAGGI 1610
Score = 30.5 bits (69), Expect = 1.3
Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 18/134 (13%)
Query: 36 LTEVISFLSNPNNVIKANAAAYL-------QHLCYMDDPNKQKTRSLGGIPPLVKLLGHE 88
L +I LS+ + AA+ L Q LC + + I P +KLL +
Sbjct: 611 LRTLIQLLSSSKEETQEKAASVLADIFSSRQDLC--ESLATDEI-----INPCIKLLTNN 663
Query: 89 SPDVFRNACGALRNLSYG-RQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWN 147
+ V + AL LS ++N K + I LI L K++ EV E L N
Sbjct: 664 TEAVATQSARALAALSRSIKEN--RKVSYAAEDAIKPLIK-LAKSSSIEVAEQAVCALAN 720
Query: 148 LSSCEDLKKSIIDD 161
L S ++ + +
Sbjct: 721 LLSDPEVAAEALAE 734
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 41.2 bits (97), Expect = 4e-05
Identities = 25/98 (25%), Positives = 40/98 (40%), Gaps = 24/98 (24%)
Query: 42 FLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALR 101
LS+P+ ++A AA L L + +P L++LL P+V R A AL
Sbjct: 8 LLSDPDPEVRAAAARALGELGDPE-----------ALPALLELLKDPDPEVRRAAAEALG 56
Query: 102 NLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKE 139
+ + +P L+ LL+ DA V+
Sbjct: 57 K-------------LGDPEALPALLELLQDDDDAVVRA 81
>gnl|CDD|218281 pfam04826, Arm_2, Armadillo-like. This domain contains
armadillo-like repeats. Proteins containing this domain
interact with numerous other proteins, through these
interactions they are involved in a wide variety of
processes including carcinogenesis, control of cellular
ageing and survival, regulation of circadian rhythm and
lysosomal sorting of G protein-coupled receptors.
Length = 254
Score = 36.2 bits (84), Expect = 0.012
Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 18/86 (20%)
Query: 32 RDPNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPD 91
DP + E+ + N+AAY N+ R LGGIP + LL + +P+
Sbjct: 25 EDPFIHEIAL-------ITLGNSAAY--------SFNQDIIRDLGGIPIIANLLSNPNPE 69
Query: 92 VFRNACGALRNLSYGRQNDENKRAIK 117
+ A AL NLS N EN++ IK
Sbjct: 70 IKEKALNALNNLS---VNVENQKKIK 92
>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
trafficking and secretion].
Length = 526
Score = 34.5 bits (79), Expect = 0.055
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)
Query: 42 FLSNPNNV-IKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGAL 100
LS+ ++ + NA L +LC +P + +P L KL+ P+V +AC A+
Sbjct: 208 LLSSAIHISMLRNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAI 267
Query: 101 RNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSCEDLKKSII 159
LS G E +A+ + G L+ LL A+++ + N+ + D + +I
Sbjct: 268 SYLSDGPN--EKIQAVLDVGIPGRLVELL-SHESAKIQTPALRSVGNIVTGSDDQTQVI 323
Score = 33.7 bits (77), Expect = 0.11
Identities = 25/96 (26%), Positives = 42/96 (43%), Gaps = 4/96 (4%)
Query: 55 AAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFR-NACGALRNLSYGRQNDENK 113
+ + L P Q G +P V+ + D+ + A AL N++ G
Sbjct: 93 YKFRKLLSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQ--QT 150
Query: 114 RAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLS 149
+ + +AG +PL I LL T D +V+E L N++
Sbjct: 151 KVVVDAGAVPLFIQLLSSTED-DVREQAVWALGNIA 185
Score = 30.6 bits (69), Expect = 0.90
Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 3/91 (3%)
Query: 41 SFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGAL 100
S LS+P I+ A + ++ + Q IPPL+ LL + + AC A+
Sbjct: 334 SLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEYKIKKEACWAI 393
Query: 101 RNLSYG--RQNDENKRAIKNAGGIPLLINLL 129
N + G + D + + G I L +LL
Sbjct: 394 SNATSGGLNRPDIIRYLVSQ-GFIKPLCDLL 423
>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat. The HEAT repeat family is related to
armadillo/beta-catenin-like repeats (see pfam00514).
Length = 31
Score = 29.4 bits (67), Expect = 0.17
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 78 IPPLVKLLGHESPDVFRNACGALRNLS 104
+P L++LL P+V A AL L+
Sbjct: 2 LPLLLELLNDPDPEVREAAAEALGALA 28
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 29.0 bits (65), Expect = 0.47
Identities = 8/26 (30%), Positives = 13/26 (50%)
Query: 78 IPPLVKLLGHESPDVFRNACGALRNL 103
+P L+ LL + +V A AL +
Sbjct: 30 LPALLPLLKDDDDEVREAAAWALGRI 55
>gnl|CDD|147776 pfam05804, KAP, Kinesin-associated protein (KAP). This family
consists of several eukaryotic kinesin-associated (KAP)
proteins. Kinesins are intracellular multimeric
transport motor proteins that move cellular cargo on
microtubule tracks. It has been shown that the sea
urchin KRP85/95 holoenzyme associates with a KAP115
non-motor protein, forming a heterotrimeric complex in
vitro, called the Kinesin-II.
Length = 708
Score = 31.0 bits (70), Expect = 0.80
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 57 YLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAI 116
+L+ L D+ NK + G I L+KL + D+ L NLS+ + + +
Sbjct: 309 FLKKLSIFDE-NKNEMEENGIIEKLLKLFPCQHEDLLNITLRLLFNLSF---DTGLRPKM 364
Query: 117 KNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSCEDLKKSI 158
N G +P L++LL + + VL++L SC+D KS+
Sbjct: 365 VNGGLLPKLVSLL---DNDNHHGIALCVLYHL-SCDDKAKSM 402
>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
[Intracellular trafficking and secretion].
Length = 757
Score = 30.5 bits (69), Expect = 1.4
Identities = 26/98 (26%), Positives = 37/98 (37%), Gaps = 17/98 (17%)
Query: 43 LSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGG--IPPLVKLLGHESPDVFRNACGAL 100
L +PN I+ A L+ L + L G I P+ KLL V + A A+
Sbjct: 101 LQDPNEEIRGFA---LRTLSLLRVKE------LLGNIIDPIKKLLTDPHAYVRKTAALAV 151
Query: 101 RNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVK 138
L Y +K G I +L L+ + D V
Sbjct: 152 AKL-YRL----DKDLYHELGLIDILKELVADS-DPIVI 183
>gnl|CDD|224286 COG1367, COG1367, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 393
Score = 29.6 bits (66), Expect = 1.8
Identities = 13/57 (22%), Positives = 25/57 (43%)
Query: 93 FRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLS 149
R L++ YG +++AI ++ + A+ EVK+ + +L LS
Sbjct: 103 KRFTDDRLKDTKYGLYGLYDRKAIYEYDQEGKTFTIILQYANFEVKKEILSLLKLLS 159
>gnl|CDD|220789 pfam10508, Proteasom_PSMB, Proteasome non-ATPase 26S subunit. The
26S proteasome, a eukaryotic ATP-dependent, dumb-bell
shaped, protease complex with a molecular mass of approx
20kDa consists of a central 20S proteasome,functioning
as a catalytic machine, and two large V-shaped terminal
modules, having possible regulatory roles,composed of
multiple subunits of 25- 110 kDa attached to the central
portion in opposite orientations. It is responsible for
degradation of abnormal intracellular proteins,
including oxidatively damaged proteins, and may play a
role as a component of a cellular anti-oxidative system.
Expression of catalytic core subunits including PSMB5
and peptidase activities of the proteasome were elevated
following incubation with 3-methylcholanthrene. The 20S
proteasome comprises a cylindrical stack of four rings,
two outer rings formed by seven alpha-subunits
(alpha1-alpha7) and two inner rings of seven
beta-subunits (beta1-beta7). Two outer rings of alpha
subunits maintain structure, while the central beta
rings contain the proteolytic active core subunits beta1
(PSMB6), beta2 (PSMB7), and beta5 (PSMB5). Expression of
PSMB5 can be altered by chemical reactants, such as
3-methylcholanthrene.
Length = 504
Score = 29.7 bits (67), Expect = 1.8
Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 7/93 (7%)
Query: 78 IPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEV 137
L + L H +P V L + + + +KN + L+ + LR D V
Sbjct: 83 QVVLQEGLPHPTPAVRALVLKELLRIV--QSKEGALELLKNEELLSLIADCLRDE-DTSV 139
Query: 138 KELVTGVLWNLSS----CEDLKKSIIDDGLQVV 166
E +L LS ++L S + L+++
Sbjct: 140 GEAAINLLSKLSKSEPGLDELFDSNLQTKLKIL 172
>gnl|CDD|153216 cd01745, GATase1_2, Subgroup of proteins having the Type 1
glutamine amidotransferase (GATase1) domain. This group
contains a subgroup of proteins having the Type 1
glutamine amidotransferase (GATase1) domain. GATase
activity catalyses the transfer of ammonia from the
amide side chain of glutamine to an acceptor substrate.
Glutamine amidotransferases (GATase) includes the triad
family of amidotransferases which have a conserved
Cys-His-Glu catalytic triad in the glutaminase active
site. In this subgroup this triad is conserved. GATase
activity can be found in a range of biosynthetic
enzymes, including: glutamine amidotransferase,
formylglycinamide ribonucleotide, GMP synthetase ,
anthranilate synthase component II, glutamine-dependent
carbamoyl phosphate synthase, cytidine triphosphate
synthetase, gamma-glutamyl hydrolase, imidazole glycerol
phosphate synthase and, cobyric acid synthase. Glutamine
amidotransferase (GATase) domains can occur either as
single polypeptides, as in glutamine amidotransferases,
or as domains in a much larger multifunctional synthase
protein, such as CPSase.
Length = 189
Score = 29.1 bits (66), Expect = 2.2
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 5/63 (7%)
Query: 93 FRNACGALRNLSYGRQNDENKRAIKNAGGIPLLI-NLLRKTADAEVKELVTGVLWNLSSC 151
R G Y Q A++ AGG+P+L+ + + + EL+ G+L L+
Sbjct: 7 LREEEGGYERRDYLNQY--YVDAVRKAGGLPVLLPPVDDEEDLEQYLELLDGLL--LTGG 62
Query: 152 EDL 154
D+
Sbjct: 63 GDV 65
>gnl|CDD|236672 PRK10318, PRK10318, hypothetical protein; Provisional.
Length = 121
Score = 28.2 bits (63), Expect = 2.7
Identities = 13/60 (21%), Positives = 18/60 (30%), Gaps = 2/60 (3%)
Query: 180 VSAGETCWSTIFRNTSGVLRNASSAGEYARKKLRECEGLIDSLLYVVKSAIEKSNIGNKS 239
+ T RN G A + R KL ID+ + S+I K
Sbjct: 33 EGLAQKKDCTFVRN--GDEHTCKEAVSHLRLKLGNTRNRIDTAEQFIDKVASSSSISGKP 90
>gnl|CDD|217434 pfam03224, V-ATPase_H_N, V-ATPase subunit H. The yeast
Saccharomyces cerevisiae vacuolar H+-ATPase (V-ATPase)
is a multisubunit complex responsible for acidifying
organelles. It functions as an ATP dependent proton pump
that transports protons across a lipid bilayer. This
domain corresponds to the N terminal domain of the H
subunit of V-ATPase. The N-terminal domain is required
for the activation of the complex whereas the C-terminal
domain is required for coupling ATP hydrolysis to proton
translocation.
Length = 312
Score = 28.8 bits (65), Expect = 3.0
Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 122 IPLLINLLRKTADAEVKELVTGVLWNLSSCE--DLKKSIIDDGLQVVVNHI 170
IPLL+++L+ + +V + L NL S D + ++D+GL ++ ++
Sbjct: 241 IPLLLDILKTSIKEKVVRVSLATLRNLLSKAVKDFIEVMVDNGLLKLLQNL 291
>gnl|CDD|130022 TIGR00947, 2A73, putative bicarbonate transporter, IctB family.
This family of proteins is suggested to transport
inorganic carbon (HCO3-), based on the phenotype of a
mutant of IctB in Synechococcus sp. strain PCC 7942.
Bicarbonate uptake is used by many photosynthetic
organisms including cyanobacteria. These organisms are
able to concentrate CO2/HCO3- against a greater than
ten-fold concentration gradient. Cyanobacteria may have
several such carriers operating with different
efficiencies. Note that homology to various O-antigen
ligases, with possible implications for mutant cell
envelope structure, might allow alternatives to the
interpretation of IctB as a bicarbonate transport
protein [Transport and binding proteins, Carbohydrates,
organic alcohols, and acids].
Length = 425
Score = 28.9 bits (65), Expect = 3.3
Identities = 13/34 (38%), Positives = 15/34 (44%), Gaps = 10/34 (29%)
Query: 30 RWRDPN-----LTEVISFLSNPNNVIKANAAAYL 58
W DP T V S+L NPN + A YL
Sbjct: 150 TWVDPESVLAGTTRVYSYLGNPNLL-----AGYL 178
>gnl|CDD|185766 cd09243, BRO1_Brox_like, Protein-interacting Bro1-like domain of
human Brox1 and related proteins. This family contains
the Bro1-like domain of a single-domain protein, human
Brox, and related domains. It belongs to the
BRO1_Alix_like superfamily which also includes the
Bro1-like domains of mammalian Alix (apoptosis-linked
gene-2 interacting protein X), His-Domain type N23
protein tyrosine phosphatase (HD-PTP, also known as
PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Bro1
and Rim20 (also known as PalA) from Saccharomyces
cerevisiae, Ustilago maydis Rim23 (also known as PalC),
and related domains. Alix, HD-PTP, Brox, Bro1, Rim20,
and Rim23, interact with the ESCRT (Endosomal Sorting
Complexes Required for Transport) system. Bro1-like
domains are boomerang-shaped, and part of the domain is
a tetratricopeptide repeat (TPR)-like structure.
Bro1-like domains bind components of the ESCRT-III
complex: CHMP4 in the case of Brox. Human Brox can bind
to human immunodeficiency virus type 1 (HIV-1)
nucleocapsid. In addition to a Bro1-like domain, Brox
also has a C-terminal thioester-linkage site for
isoprenoid lipids (CaaX motif). This family lacks the
V-shaped (V) domain found in many members of the
BRO1_Alix_like superfamily.
Length = 353
Score = 28.8 bits (65), Expect = 3.3
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 17 SPSLNKKCRNSSLRWRDPNLTEVISFLSNPNNVIKANAAAYLQHLC 62
+P+ +K C S LR L E++S SN + +K AYL L
Sbjct: 23 TPAASKLC--SDLRTARARLLELLSDPSNDVDTVKTAFNAYLSLLQ 66
>gnl|CDD|213993 cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase
domain of Histidinol phosphate phosphatase (HisPPase)
AMP bound. The PHP domain of this HisPPase family has
an unknown function. It has a second domain inserted in
the middle that binds adenosine 5-monophosphate (AMP).
The PHP (also called histidinol phosphatase-2/HIS2)
domain is associated with several types of DNA
polymerases, such as PolIIIA and family X DNA
polymerases, stand alone histidinol phosphate
phosphatases (HisPPases), and a number of
uncharacterized protein families. HisPPase catalyzes the
eighth step of histidine biosynthesis, in which
L-histidinol phosphate undergoes dephosphorylation to
give histidinol. The PHP domain has four conserved
sequence motifs and contains an invariant histidine that
is involved in metal ion coordination. The PHP domain of
HisPPase is structurally homologous to the other members
of the PHP family that have a distorted (beta/alpha)7
barrel fold with a trinuclear metal site on the
C-terminal side of the barrel.
Length = 155
Score = 27.7 bits (63), Expect = 4.1
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 5/33 (15%)
Query: 114 RAIKNAGGI-----PLLINLLRKTADAEVKELV 141
I AGG+ P L L RK + ++EL
Sbjct: 82 ELIHAAGGVAVLAHPGLYKLSRKKLEELIEELK 114
>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
This family of eukaryotic proteins includes
Saccharomyces cerevisiae Ydl063c and Chaetomium
thermophilum Syo1, which mediate the co-import of two
ribosomal proteins, Rpl5 and Rpl11 (which both interact
with 5S rRNA) into the nucleus. Import precedes their
association with rRNA and subsequent ribosome assembly
in the nucleolus. The primary structure of syo1 is a
mixture of Armadillo- (ARM, N-terminal part of syo1) and
HEAT-repeats (C-terminal part of syo1).
Length = 597
Score = 28.8 bits (65), Expect = 4.5
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 3/75 (4%)
Query: 78 IPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEV 137
I PL++ L S + A GA+ NL ++ ++ + + +LI L ++ EV
Sbjct: 39 ILPLLEKLQSPSANDRSMALGAISNLI---EDPSMRKLLLKEKLVQILIERLLDDSNTEV 95
Query: 138 KELVTGVLWNLSSCE 152
G L NL+ E
Sbjct: 96 VVEAFGALRNLAIEE 110
>gnl|CDD|224982 COG2071, COG2071, Predicted glutamine amidotransferases [General
function prediction only].
Length = 243
Score = 28.1 bits (63), Expect = 4.6
Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 114 RAIKNAGGIPLLINLLRKTADA-EVKELVTGVL 145
AI AGGIP+L+ L DA + +L+ G++
Sbjct: 33 DAIIKAGGIPILLPALEDPEDARQYLDLIDGLI 65
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 28.2 bits (63), Expect = 4.7
Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 10/50 (20%)
Query: 215 CEGLIDSLLYVVKSAIEKSNIGNKSVENCVCILRNLSFRCQEVEDPNYDK 264
CE +D LL+V S S C LR +FR Q EDP +K
Sbjct: 31 CESCVD-LLFVRGSG---------SCPECDTPLRKNNFRVQLFEDPTVEK 70
>gnl|CDD|219535 pfam07722, Peptidase_C26, Peptidase C26. These peptidases have
gamma-glutamyl hydrolase activity; that is they catalyze
the cleavage of the gamma-glutamyl bond in
poly-gamma-glutamyl substrates. They are structurally
related to pfam00117, but contain extensions in four
loops and at the C terminus.
Length = 219
Score = 28.0 bits (63), Expect = 4.7
Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 114 RAIKNAGGIPLLINLLRKTADAEVK-ELVTGVL 145
+A++ AGG+P+L+ +L D E +LV G+L
Sbjct: 31 KAVEGAGGLPVLLPILEDPEDLEALLDLVDGLL 63
>gnl|CDD|204007 pfam08625, Utp13, Utp13 specific WD40 associated domain. Utp13 is
a component of the five protein Pwp2 complex that forms
part of a stable particle subunit independent of the U3
small nucleolar ribonucleoprotein that is essential for
the initial assembly steps of the 90S pre-ribosome. Pwp2
is capable of interacting directly with the 35 S
pre-rRNA 5' end.
Length = 138
Score = 27.1 bits (61), Expect = 7.4
Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 198 LRNASSAGEYARKKLRECEGL--IDSLLYVVKSAIEKSN-IGNKSVENCVCILRN 249
L N G+Y + L L LL V+KS IE + IG+ +E + LR
Sbjct: 4 LSNLLREGDY-SEALLLALTLDHPHRLLNVLKSVIEDEDSIGSDELEETIGRLRK 57
>gnl|CDD|185056 PRK15101, PRK15101, protease3; Provisional.
Length = 961
Score = 27.6 bits (62), Expect = 8.9
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 40 ISFLSNPNNVIKANAAAYLQHL 61
ISF +N NN + NA Y Q L
Sbjct: 592 ISFSTNANNGLMVNANGYTQRL 613
>gnl|CDD|218698 pfam05695, DUF825, Plant protein of unknown function (DUF825). This
family consists of several plant proteins greater than
1000 residues in length. The function of this family is
unknown.
Length = 1127
Score = 27.9 bits (62), Expect = 9.1
Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 19 SLNKKCRNSSLRWRDPNLTEVISFLSNPNNVIKANAAAYLQH 60
SLN+KC+N ++ D L ISF+ N + + + ++ H
Sbjct: 1087 SLNRKCKNMNINGIDEIL---ISFIPNEKRISFSESTRHVSH 1125
>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
conversion].
Length = 335
Score = 27.6 bits (61), Expect = 9.6
Identities = 22/119 (18%), Positives = 41/119 (34%), Gaps = 30/119 (25%)
Query: 32 RDPNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPD 91
E++ L + + +++ +AA L L + +P L +LL E P
Sbjct: 41 APEAADELLKLLEDEDLLVRLSAAVALGEL-------GSEEA----VPLLRELLSDEDPR 89
Query: 92 VFRNA---------CGALRNLSYGRQNDENK----------RAIKNAGGIPLLINLLRK 131
V A A+ L +NDEN+ + + + L+ L+
Sbjct: 90 VRDAAADALGELGDPEAVPPLVELLENDENEGVRAAAARALGKLGDERALDPLLEALQD 148
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.133 0.400
Gapped
Lambda K H
0.267 0.0877 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,607,246
Number of extensions: 1354505
Number of successful extensions: 1121
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1105
Number of HSP's successfully gapped: 45
Length of query: 294
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 198
Effective length of database: 6,679,618
Effective search space: 1322564364
Effective search space used: 1322564364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.2 bits)