RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10547
(269 letters)
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 49.5 bits (119), Expect = 2e-08
Identities = 25/59 (42%), Positives = 35/59 (59%)
Query: 163 LKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKIGKIPDDSFSTLNNLVTLKLSDNNL 221
LK L+L N L +P + L NL ++DLS N + I ++FS L +L +L LS NNL
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 41.8 bits (99), Expect = 1e-05
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 16/76 (21%)
Query: 186 NLTLIDLSKNKIGKIPDDSFSTLNNLVTLKLSDNNLTLYKNSFRGLELNNNLTLYKNSFR 245
NL +DLS N++ IPD +F L NL L LS NNLT ++ +F
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLT---------------SISPEAFS 45
Query: 246 GLELSLKNLNLKNTKL 261
GL SL++L+L L
Sbjct: 46 GLP-SLRSLDLSGNNL 60
Score = 35.2 bits (82), Expect = 0.003
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 14/73 (19%)
Query: 101 IKNLQLSHCRINSITPNAFRHLDAIRNINENTFNGIFIKNLQLSHCRINSITPNAFRHLE 160
+K+L LS+ R+ I AF+ L ++ ++ LS + SI+P AF L
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLD-------------LSGNNLTSISPEAFSGLP 48
Query: 161 FTLKHLNLQENDL 173
+L+ L+L N+L
Sbjct: 49 -SLRSLDLSGNNL 60
Score = 31.4 bits (72), Expect = 0.049
Identities = 17/55 (30%), Positives = 31/55 (56%), Gaps = 5/55 (9%)
Query: 78 LDLLYINNSAIRNINENTFNGIFIKNLQ---LSHCRINSITPNAFRHLDAIRNIN 129
L L ++N+ + I + F G+ NL+ LS + SI+P AF L ++R+++
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGL--PNLKVLDLSGNNLTSISPEAFSGLPSLRSLD 54
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 44.1 bits (104), Expect = 6e-05
Identities = 29/97 (29%), Positives = 48/97 (49%), Gaps = 4/97 (4%)
Query: 127 NINENTFNGIFIKNLQLSHCRINSITPNAFRHLEFTLKHLNLQENDLEQVPVETLRHLKN 186
IN ++ ++ L L+ + P++F L++L+L N L L
Sbjct: 443 RINSRKWDMPSLQMLSLARNKFFGGLPDSFGSKR--LENLDLSRNQFSGAVPRKLGSLSE 500
Query: 187 LTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDNNLT 222
L + LS+NK+ G+IPD+ S+ LV+L LS N L+
Sbjct: 501 LMQLKLSENKLSGEIPDE-LSSCKKLVSLDLSHNQLS 536
Score = 37.9 bits (88), Expect = 0.005
Identities = 28/63 (44%), Positives = 41/63 (65%), Gaps = 4/63 (6%)
Query: 162 TLKHLNLQENDLE-QVPVETLRHLKNLTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDN 219
+L HL+L N+L +P +L +LKNL + L +NK+ G IP FS L L++L LSDN
Sbjct: 237 SLNHLDLVYNNLTGPIPS-SLGNLKNLQYLFLYQNKLSGPIPPSIFS-LQKLISLDLSDN 294
Query: 220 NLT 222
+L+
Sbjct: 295 SLS 297
Score = 36.0 bits (83), Expect = 0.021
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 6/61 (9%)
Query: 183 HLKNLTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDNNLT--LYKNSFRGLE---LNNN 236
L + I+LS N++ G IPDD F+T ++L L LS+NN T + + S LE L+NN
Sbjct: 91 RLPYIQTINLSNNQLSGPIPDDIFTTSSSLRYLNLSNNNFTGSIPRGSIPNLETLDLSNN 150
Query: 237 L 237
+
Sbjct: 151 M 151
Score = 33.7 bits (77), Expect = 0.096
Identities = 31/123 (25%), Positives = 42/123 (34%), Gaps = 40/123 (32%)
Query: 101 IKNLQLSHCRINSITPNAFRHLDAIRNINENTFNGIFIKNLQLSHCRINSITPNAFRHLE 160
++NL LS + + P L
Sbjct: 477 LENLDLSRNQFSGAVPRKLGSLS------------------------------------- 499
Query: 161 FTLKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDN 219
L L L EN L + L K L +DLS N++ G+IP SFS + L L LS N
Sbjct: 500 -ELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIP-ASFSEMPVLSQLDLSQN 557
Query: 220 NLT 222
L+
Sbjct: 558 QLS 560
Score = 33.3 bits (76), Expect = 0.13
Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 17/127 (13%)
Query: 128 INENTFNGIFIKNLQLSHCRINSITPNAFRHLEFTLKHLNLQENDLE-QVPVETLRHLKN 186
I+ F +I+ + LS+ +++ P+ +L++LNL N+ +P + + N
Sbjct: 85 ISSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSLRYLNLSNNNFTGSIPRGS---IPN 141
Query: 187 LTLIDLSKNKI-GKIPDD--SFSTLNNLVTLKLSDNNLT----LYKNSFRGLELNNNLTL 239
L +DLS N + G+IP+D SFS +L L L N L + LE LTL
Sbjct: 142 LETLDLSNNMLSGEIPNDIGSFS---SLKVLDLGGNVLVGKIPNSLTNLTSLEF---LTL 195
Query: 240 YKNSFRG 246
N G
Sbjct: 196 ASNQLVG 202
Score = 29.0 bits (65), Expect = 3.1
Identities = 23/62 (37%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 162 TLKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDNN 220
L+ L+L N+ L L L ++ L NK G+IP + NNL L LS NN
Sbjct: 309 NLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKN-LGKHNNLTVLDLSTNN 367
Query: 221 LT 222
LT
Sbjct: 368 LT 369
Score = 29.0 bits (65), Expect = 3.4
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 136 IFIKNLQLSHCRINSIT---PNAFRHLEFTLKHLNLQENDLE-QVPVETLRHLKNLTLID 191
I ++NL++ H N+ T P A L L+ L L N ++P +H NLT++D
Sbjct: 305 IQLQNLEILHLFSNNFTGKIPVALTSLP-RLQVLQLWSNKFSGEIPKNLGKH-NNLTVLD 362
Query: 192 LSKNKI-GKIPDDSFSTLNNLVTLKLSDNNL 221
LS N + G+IP + + NL L L N+L
Sbjct: 363 LSTNNLTGEIP-EGLCSSGNLFKLILFSNSL 392
Score = 27.5 bits (61), Expect = 9.4
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Query: 163 LKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKI-GKIPDDSFSTLNNLVTLKLSDNNL 221
L L+L +N L E + L+NL ++ L N GKIP + ++L L L+L N
Sbjct: 286 LISLDLSDNSLSGEIPELVIQLQNLEILHLFSNNFTGKIP-VALTSLPRLQVLQLWSNKF 344
Query: 222 TLYKNSFRGLELNNNLTLYKNSFRGLELSLKNL 254
+ + L +NNLT+ L+LS NL
Sbjct: 345 S--GEIPKNLGKHNNLTV-------LDLSTNNL 368
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 41.2 bits (97), Expect = 3e-04
Identities = 33/136 (24%), Positives = 52/136 (38%), Gaps = 24/136 (17%)
Query: 139 KNLQLSHCRINSI----TPNAFRHLEFTLKHLNLQENDLE----QVPVETLRHLKNLTLI 190
+ L L R+ A R LK LNL N + + E L+ NL ++
Sbjct: 140 EKLVLGRNRLEGASCEALAKALRANR-DLKELNLANNGIGDAGIRALAEGLKANCNLEVL 198
Query: 191 DLSKNKI----GKIPDDSFSTLNNLVTLKLSDNNLTLYKNSFRGLELNNNLTLYKNSFRG 246
DL+ N + ++ ++L +L L L DNNLT G L ++
Sbjct: 199 DLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTD-----AGA-----AAL-ASALLS 247
Query: 247 LELSLKNLNLKNTKLK 262
+SL L+L +
Sbjct: 248 PNISLLTLSLSCNDIT 263
Score = 30.0 bits (68), Expect = 1.1
Identities = 19/103 (18%), Positives = 37/103 (35%), Gaps = 22/103 (21%)
Query: 118 AFRHLDAIRNINENTFN-------------GIFIKNLQLSHCRINSI----TPNAFRHLE 160
A +L+ + ++N N ++ L L + +A
Sbjct: 191 ANCNLEVL-DLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSPN 249
Query: 161 FTLKHLNLQENDLEQVPVETLRHL----KNLTLIDLSKNKIGK 199
+L L+L ND+ + L + ++L +DL NK G+
Sbjct: 250 ISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGE 292
>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
Length = 623
Score = 36.3 bits (84), Expect = 0.014
Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 9/90 (10%)
Query: 137 FIKNLQLSHCRINSITPN---AFRHLEFTLKHLNLQENDLEQVPVETLRHLKNLTLIDLS 193
FI L L + + PN RHL+ +NL N + +L + +L ++DLS
Sbjct: 419 FIDGLGLDNQGLRGFIPNDISKLRHLQ----SINLSGNSIRGNIPPSLGSITSLEVLDLS 474
Query: 194 KNKI-GKIPDDSFSTLNNLVTLKLSDNNLT 222
N G IP +S L +L L L+ N+L+
Sbjct: 475 YNSFNGSIP-ESLGQLTSLRILNLNGNSLS 503
>gnl|CDD|187627 cd05369, TER_DECR_SDR_a, Trans-2-enoyl-CoA reductase (TER) and
2,4-dienoyl-CoA reductase (DECR), atypical (a) SDR.
TTER is a peroxisomal protein with a proposed role in
fatty acid elongation. Fatty acid synthesis is known to
occur in the both endoplasmic reticulum and
mitochondria; peroxisomal TER has been proposed as an
additional fatty acid elongation system, it reduces the
double bond at C-2 as the last step of elongation. This
system resembles the mitochondrial system in that
acetyl-CoA is used as a carbon donor. TER may also
function in phytol metabolism, reducting phytenoyl-CoA
to phytanoyl-CoA in peroxisomes. DECR processes double
bonds in fatty acids to increase their utility in fatty
acid metabolism; it reduces 2,4-dienoyl-CoA to an
enoyl-CoA. DECR is active in mitochondria and
peroxisomes. This subgroup has the Gly-rich NAD-binding
motif of the classical SDR family, but does not display
strong identity to the canonical active site tetrad, and
lacks the characteristic Tyr at the usual position. SDRs
are a functionally diverse family of oxidoreductases
that have a single domain with a structurally conserved
Rossmann fold (alpha/beta folding pattern with a central
beta-sheet), an NAD(P)(H)-binding region, and a
structurally diverse C-terminal region. Classical SDRs
are typically about 250 residues long, while extended
SDRS are approximately 350 residues. Sequence identity
between different SDR enzymes are typically in the
15-30% range, but the enzymes share the Rossmann fold
NAD-binding motif and characteristic NAD-binding and
catalytic sequence patterns. These enzymes have a
3-glycine N-terminal NAD(P)(H)-binding pattern
(typically, TGxxxGxG in classical SDRs and TGxxGxxG in
extended SDRs), while substrate binding is in the
C-terminal region. A critical catalytic Tyr residue
(Tyr-151, human 15-hydroxyprostaglandin dehydrogenase
(15-PGDH) numbering), is often found in a conserved
YXXXK pattern. In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) or additional
Ser, contributing to the active site. Substrates for
these enzymes include sugars, steroids, alcohols, and
aromatic compounds. The standard reaction mechanism is a
proton relay involving the conserved Tyr and Lys, as
well as Asn (or Ser). Some SDR family members, including
17 beta-hydroxysteroid dehydrogenase contain an
additional helix-turn-helix motif that is not generally
found among SDRs.
Length = 249
Score = 34.1 bits (79), Expect = 0.053
Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 38/112 (33%)
Query: 39 STCICSYN------TANELS---------IQCNDLTNYPLFKATLNKHVNTKVPLDLLYI 83
S I A E+S IQC D+ + +A +++ + +D+L I
Sbjct: 29 SVAIAGRKPEVLEAAAEEISSATGGRAHPIQC-DVRDPEAVEAAVDETLKEFGKIDIL-I 86
Query: 84 NNSAIRNINENTFNGIFIKNLQLSHCRINSITPNAFRHLDAIRNINEN-TFN 134
NN+A G F+ S++PN F+ + +I+ N TFN
Sbjct: 87 NNAA----------GNFL-------APAESLSPNGFK---TVIDIDLNGTFN 118
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 28.6 bits (65), Expect = 0.33
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 163 LKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKIGKIP 201
L+ L+L N + +P L +L NL +DLS NKI +
Sbjct: 3 LETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLS 39
Score = 28.2 bits (64), Expect = 0.54
Identities = 18/38 (47%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 185 KNLTLIDLSKNKIGKIPDDSFSTLNNLVTLKLSDNNLT 222
NL +DLS N+I +P S L NL TL LS N +T
Sbjct: 1 TNLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKIT 36
>gnl|CDD|181770 PRK09301, PRK09301, circadian clock protein KaiB; Provisional.
Length = 103
Score = 29.4 bits (66), Expect = 0.62
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 7/37 (18%)
Query: 115 TPNAFRHLDAIRNINENTFNGIF-------IKNLQLS 144
TPN+ R L ++NI E F G++ +KN QL+
Sbjct: 17 TPNSVRALKTLKNILETEFKGVYALKVIDVLKNPQLA 53
>gnl|CDD|211759 TIGR02654, circ_KaiB, circadian clock protein KaiB. Members of
this protein family are the circadian clock protein KaiB
of Cyanobacteria, encoded in the circadian clock gene
cluster kaiABC. KaiB has homologs of unknown function in
some Archaea and Proteobacteria, and has paralogs of
unknown function in some Cyanobacteria. KaiB forms
homodimers, homotetramers, and multimeric complexes with
KaiA and/or KaiC [Cellular processes, Other].
Length = 87
Score = 28.8 bits (64), Expect = 0.75
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 7/37 (18%)
Query: 115 TPNAFRHLDAIRNINENTFNGIF-------IKNLQLS 144
TPN+ R L ++NI E F G++ +KN QL+
Sbjct: 14 TPNSVRALKTLKNILETEFQGVYALKVIDVLKNPQLA 50
>gnl|CDD|205486 pfam13306, LRR_5, Leucine rich repeats (6 copies). This family
includes a number of leucine rich repeats. This family
contains a large number of BSPA-like surface antigens
from Trichomonas vaginalis.
Length = 128
Score = 29.0 bits (66), Expect = 1.2
Identities = 23/126 (18%), Positives = 52/126 (41%), Gaps = 10/126 (7%)
Query: 127 NINENTFNGIFIKNLQLSHCRINSITPNAFRHLEFTLKHLNLQENDLEQVPVETLRHLKN 186
+I + F + ++ + + SI AF +LK + L + L + + +
Sbjct: 2 SIGDYAFYNCSLTSITIPS-SVTSIGEYAFSGCT-SLKSITLPSS-LTSIGSYAFYNCSS 58
Query: 187 LTLIDLSKNKIGKIPDDSFSTLNNLVTLKLSDNNLTLYKNSFRGLEL------NNNLTLY 240
LT I + + + I + +FS ++L ++ + N T+ +F L ++ T+
Sbjct: 59 LTSITIPSS-LTSIGEYAFSNCSSLTSITIPSNLTTIGSYAFSNCSLKSITIPSSVTTIG 117
Query: 241 KNSFRG 246
+F
Sbjct: 118 DYAFSN 123
>gnl|CDD|190374 pfam02643, DUF192, Uncharacterized ACR, COG1430. Two structures
have been solved for members of this large (>500
members) family of bacterial proteins present mostly in
environmental bacteria and metagenomes (distant
homologues are also present in several Plasmodium
species). TOPSAN analysis for pdb:3pjy shows that there
is much similarity with the other solved structure,
pdb:3m7a, solved for UniProt:Q2GA55 (Saro_0823), a
homologue of Thermotoga maritima TM1668,
UniProt:Q9X1Z6., The homologue in Caulobacter
crescentus (CC1388), UniProt:Q9A8G6, is associated with
CspD, a cold shock protein (CC1387), UniProt:Q9A8G7.
However, the genomic context of UniProt:Q2GA55 is most
conserved with a putative xylose isomerase, suggesting
a possible role in extracellular sugar processing.
Saro_0821, UniProt:Q2GA57, is annotated as an
AMP-dependent synthetase and ligase. PDB:3m7a structure
corresponds to the C-terminal (27-165) fragment of the
YP_496102 (Saro_0823) protein and it is structurally
unique, as the best hits from Dali have a Z-score of
3.8 (1nt0, 2j1t, 3kq4) and it is thus a likely
candidate for a new fold. Interestingly, many of the
top Dali hits are involved in sugar metabolism. There
are no obvious active site-like cavities on the protein
surface of 3m7a (http://www.topsan.org/Proteins/JCSG/).
Length = 104
Score = 28.7 bits (65), Expect = 1.3
Identities = 9/24 (37%), Positives = 15/24 (62%), Gaps = 1/24 (4%)
Query: 73 NTKVPLDLLYIN-NSAIRNINENT 95
NT +PLD+++I+ + I I E
Sbjct: 44 NTLIPLDIIFIDADGRIVAIEEMA 67
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 28.8 bits (64), Expect = 3.2
Identities = 39/145 (26%), Positives = 70/145 (48%), Gaps = 18/145 (12%)
Query: 78 LDLLYINNSAIRNINENTFNGIFIKNLQLSHCRINSITPNAFRHLDAIRNINENTFNGIF 137
L L ++++ I ++ N +KNL LS ++ + N
Sbjct: 142 LKELDLSDNKIESLPSPLRNLPNLKNLDLSFNDLSDLPKLL--------------SNLSN 187
Query: 138 IKNLQLSHCRINSITPNAFRHLEFTLKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKI 197
+ NL LS +I+ + P L+ L+L N + ++ +L +LKNL+ ++LS NK+
Sbjct: 188 LNNLDLSGNKISDLPPEIELLS--ALEELDLSNNSIIELL-SSLSNLKNLSGLELSNNKL 244
Query: 198 GKIPDDSFSTLNNLVTLKLSDNNLT 222
+P +S L+NL TL LS+N ++
Sbjct: 245 EDLP-ESIGNLSNLETLDLSNNQIS 268
Score = 28.8 bits (64), Expect = 3.5
Identities = 44/147 (29%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 126 RNINENTFNGIFIKNLQLSHCRINSITPNAFRHLEFTLKHLNLQENDLEQVPVETLRHLK 185
+K L LS +I S+ LK+L+L NDL +P L +L
Sbjct: 130 DIPPLIGLLKSNLKELDLSDNKIESLPSPLR--NLPNLKNLDLSFNDLSDLPKL-LSNLS 186
Query: 186 NLTLIDLSKNKIGKIPDDSFSTLNNLVTLKLSDNNLTLYKNSFR------GLELNNNLTL 239
NL +DLS NKI +P + L+ L L LS+N++ +S GLEL+NN
Sbjct: 187 NLNNLDLSGNKISDLPPEI-ELLSALEELDLSNNSIIELLSSLSNLKNLSGLELSNNKLE 245
Query: 240 YKNSFRGLELSLKNLNLKNTKLKSVTP 266
G +L+ L+L N ++ S++
Sbjct: 246 DLPESIGNLSNLETLDLSNNQISSISS 272
>gnl|CDD|153299 cd07615, BAR_Endophilin_A3, The Bin/Amphiphysin/Rvs (BAR) domain of
Endophilin-A3. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Endophilins
are accessory proteins localized at synapses that
interacts with the endocytic proteins, dynamin and
synaptojanin. They are essential for synaptic vesicle
formation from the plasma membrane. They interact with
voltage-gated calcium channels, thus linking vesicle
endocytosis to calcium regulation. They also play roles
in virus budding, mitochondrial morphology maintenance,
receptor-mediated endocytosis inhibition, and endosomal
sorting. Endophilins contain an N-terminal N-BAR domain
(BAR domain with an additional N-terminal amphipathic
helix), followed by a variable region containing proline
clusters, and a C-terminal SH3 domain. They are
classified into two types, A and B. Endophilin-A
proteins are enriched in the brain and play multiple
roles in receptor-mediated endocytosis. Endophilin-A3
(or endophilin-3) is also referred to as SH3P13 (SH3
domain containing protein 13) or SH3GL3 (SH3 domain
containing Grb2-like protein 3). It regulates
Arp2/3-dependent actin filament assembly during
endocytosis. It binds N-WASP through its SH3 domain and
enhances the ability of N-WASP to activate the Arp2/3
complex. Endophilin-A3 co-localizes with the vesicular
glutamate transporter 1 (VGLUT1), and may play an
important role in the synaptic release of glutamate.
Length = 223
Score = 28.1 bits (62), Expect = 5.0
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 8/41 (19%)
Query: 167 NLQENDLEQVPVETLRHLKNLT----LIDLSKNKIGKIPDD 203
LQ+ DL+ E HLK L D K + GKIPD+
Sbjct: 121 LLQDKDLK----EIGHHLKKLEGRRLDFDYKKKRQGKIPDE 157
>gnl|CDD|227253 COG4916, COG4916, Uncharacterized protein containing a TIR
(Toll-Interleukin 1-resistance) domain [Function
unknown].
Length = 329
Score = 27.7 bits (61), Expect = 7.2
Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 8/80 (10%)
Query: 43 CSYNTA-NELSIQCNDLTNYPLFKATLNKHVNTKVPLDLLYINNSAIRNINE-------N 94
C T N S C + + LN ++T + LY +N AI + +
Sbjct: 235 CVVTTVFNTGSYICKSTCHIEGLEGRLNPILDTGFRIKYLYADNIAIDGGKQTPGHFDID 294
Query: 95 TFNGIFIKNLQLSHCRINSI 114
+ + N + R+ +I
Sbjct: 295 SPIELEDSNGLVMLVRVLTI 314
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.7 bits (62), Expect = 7.2
Identities = 25/140 (17%), Positives = 43/140 (30%), Gaps = 35/140 (25%)
Query: 106 LSHCRINSITPNAF----------------RHLDAIRNINENTFNGIFIKNLQLSHCRIN 149
LS C + S+ A R++ + N L L +
Sbjct: 543 LSQCPVRSLRDIAKGLFLASLVLQYQRLSKRYVPELINFLLGAL-------LLLIPEKTK 595
Query: 150 SITPNAFRHLEFTLKHLNLQENDLEQVPVETLRHLKNLTLIDLSKNKIGKIPDDSFSTLN 209
+ F L L + L+++ L L DL + + S L+
Sbjct: 596 TEVTLPFFPSRPPSTLLRLSASKLKKLEPRKLS------LTDLFASSEEEDDSLKLSLLS 649
Query: 210 NLVTL--KLSDNNLTLYKNS 227
L++L K D L+K+
Sbjct: 650 TLLSLLDKAVD----LWKDL 665
>gnl|CDD|177038 CHL00113, rps4, ribosomal protein S4; Reviewed.
Length = 201
Score = 27.1 bits (61), Expect = 9.0
Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 9/82 (10%)
Query: 195 NKIGKIPDDSFS-TLNNLVTLKLSDNNLTLYKN---SFRGLELNNNLTLYKNSFRGLELS 250
+I IP S+ +++T+K + L +N S EL N+LTL K ++GL
Sbjct: 120 GRIVDIP--SYRCKPKDIITVKDKQKSKNLIQNNLESSPREELPNHLTLDKLQYKGLVNQ 177
Query: 251 L---KNLNLKNTKLKSVTPYWR 269
+ K + LK +L V Y R
Sbjct: 178 IIDRKWVGLKINELLVVEYYSR 199
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.134 0.392
Gapped
Lambda K H
0.267 0.0925 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,824,439
Number of extensions: 1182440
Number of successful extensions: 1145
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1097
Number of HSP's successfully gapped: 79
Length of query: 269
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 174
Effective length of database: 6,723,972
Effective search space: 1169971128
Effective search space used: 1169971128
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (25.8 bits)