RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4702
(138 letters)
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 41.9 bits (99), Expect = 8e-06
Identities = 23/97 (23%), Positives = 42/97 (43%)
Query: 25 DTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSA 84
D AA AL+ L++ + ++E + Q+L D V AL + N+ +
Sbjct: 20 DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAG 79
Query: 85 GKELAERVLSTELMEILMALTLLEGEEVRKLALQALA 121
++ VL + L+ L E+++K A AL+
Sbjct: 80 PEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALS 116
Score = 29.2 bits (66), Expect = 0.40
Identities = 18/65 (27%), Positives = 29/65 (44%)
Query: 56 VDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKL 115
+ +L+ D VQ A + N+ + + + V+ + L+ L E EEV K
Sbjct: 9 LPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKA 68
Query: 116 ALQAL 120
AL AL
Sbjct: 69 ALWAL 73
>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
trafficking and secretion].
Length = 526
Score = 32.6 bits (74), Expect = 0.049
Identities = 22/100 (22%), Positives = 42/100 (42%)
Query: 17 LFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALV 76
L L D E A A++ L+ +L+ ++L+H +Q AL
Sbjct: 248 LAKLIYSRDPEVLVDACWAISYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALR 307
Query: 77 IVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLA 116
VGNI++ + + +++ ++ +L E +RK A
Sbjct: 308 SVGNIVTGSDDQTQVIINCGALKAFRSLLSSPKENIRKEA 347
Score = 31.8 bits (72), Expect = 0.11
Identities = 22/90 (24%), Positives = 43/90 (47%)
Query: 15 RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRA 74
RF+ + + AA AL + S +T ++++ V +F Q+L+ + V+ +A
Sbjct: 118 RFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGAVPLFIQLLSSTEDDVREQA 177
Query: 75 LVIVGNILSAGKELAERVLSTELMEILMAL 104
+ +GNI + + VL +E L+ L
Sbjct: 178 VWALGNIAGDSEGCRDYVLQCGALEPLLGL 207
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 29.0 bits (65), Expect = 0.16
Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 70 VQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAK 122
V+ A + +G + G EL + EL+ L+ L + +EVR+ A AL +
Sbjct: 3 VREAAALALGALAGGGPELLRPAV-PELLPALLPLLKDDDDEVREAAAWALGR 54
>gnl|CDD|233841 TIGR02388, rpoC2_cyan, DNA-directed RNA polymerase, beta'' subunit.
The family consists of the product of the rpoC2 gene, a
subunit of DNA-directed RNA polymerase of cyanobacteria
and chloroplasts. RpoC2 corresponds largely to the
C-terminal region of the RpoC (the beta' subunit) of
other bacteria. Members of this family are designated
beta'' in chloroplasts/plastids, and beta' (confusingly)
in Cyanobacteria, where RpoC1 is called beta' in
chloroplasts/plastids and gamma in Cyanobacteria. We
prefer to name this family beta'', after its organellar
members, to emphasize that this RpoC1 and RpoC2 together
replace RpoC in other bacteria [Transcription,
DNA-dependent RNA polymerase].
Length = 1227
Score = 29.4 bits (66), Expect = 0.55
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 8/76 (10%)
Query: 26 TETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAG 85
T+ C AG + + S + +L+E+N + I A P V G+++ AG
Sbjct: 860 TQILCKEAGVVQGIDSGGESIRRLLVERNSDRLKVNIKAKPVVKT--------GDLVVAG 911
Query: 86 KELAERVLSTELMEIL 101
ELA+ V + E EI
Sbjct: 912 DELAKGVKAEESGEIE 927
>gnl|CDD|183129 PRK11425, PRK11425, PTS system N-acetylgalactosamine-specific
transporter subunit IIB; Provisional.
Length = 157
Score = 29.1 bits (65), Expect = 0.56
Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 4/56 (7%)
Query: 63 LAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQ 118
L H VGVQ +L A E+AE + LME+++A EG VR LQ
Sbjct: 13 LIHGQVGVQWVGFAGANLVLVANDEVAEDPVQQNLMEMVLA----EGIAVRFWTLQ 64
>gnl|CDD|220906 pfam10926, DUF2800, Protein of unknown function (DUF2800). This is
a family of uncharacterized proteins found in bacteria
and viruses. Some members of this family are annotated
as being Phi APSE P51-like proteins.
Length = 363
Score = 28.8 bits (65), Expect = 0.93
Identities = 15/50 (30%), Positives = 21/50 (42%), Gaps = 3/50 (6%)
Query: 80 NILSAGKELAERVLS-TELMEILMALTLLEG--EEVRKLALQALAKGEEY 126
N+ E L+ EL E+L LLE ++V AL G+E
Sbjct: 233 NLAKYDFEDPTLDLTDEELAELLEKADLLEKWAKDVEAYALDEARNGKEV 282
>gnl|CDD|215356 PLN02659, PLN02659, Probable galacturonosyltransferase.
Length = 534
Score = 28.8 bits (64), Expect = 1.1
Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 5 YLILFYCSQVRFLFLLTCEE--DTETACAAAGAL 36
Y +LF+ +RF+F+L+ + D ET C+ G L
Sbjct: 40 YSLLFFTFLLRFVFVLSTVDTIDGETKCSTLGCL 73
>gnl|CDD|130153 TIGR01081, mpl,
UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-
diaminopimelate ligase. Alternate name: murein
tripeptide ligase [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan].
Length = 448
Score = 27.1 bits (60), Expect = 3.7
Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 12/58 (20%)
Query: 45 PVCSMLLEKNWVDIFK-----QILAHPDVGVQHRALVIVGNILSAGKELAERVLSTEL 97
P S LE ++I + Q+ PD LV++GN + G E VL+ L
Sbjct: 35 PPMSTQLEAQGIEIIEGFDAAQLEPKPD-------LVVIGNAMKRGNPCVEAVLNLNL 85
>gnl|CDD|215132 PLN02238, PLN02238, hypoxanthine phosphoribosyltransferase.
Length = 189
Score = 26.5 bits (59), Expect = 4.6
Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 6/61 (9%)
Query: 70 VQHRALVIVGNILSAGKELAERVLSTELMEILMA----LTLLEGEEVRKLALQALAKGEE 125
V+ + +++V +I+ G L+ L L A LL+ RK+ + + G+E
Sbjct: 95 VKGKHVLLVEDIVDTGNTLSA--LVAHLEAKGAASVSVCALLDKRARRKVKYELVGDGKE 152
Query: 126 Y 126
Y
Sbjct: 153 Y 153
>gnl|CDD|235110 PRK03188, PRK03188, 4-diphosphocytidyl-2-C-methyl-D-erythritol
kinase; Provisional.
Length = 300
Score = 26.4 bits (59), Expect = 5.0
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 25 DTETACAAAGALAMLTSVSTPVCSML 50
T A AGALA + S S P C+ L
Sbjct: 234 RTLRAGEEAGALAGIVSGSGPTCAFL 259
>gnl|CDD|226991 COG4644, COG4644, Transposase and inactivated derivatives, TnpA
family [DNA replication, recombination, and repair].
Length = 323
Score = 26.3 bits (58), Expect = 6.4
Identities = 24/123 (19%), Positives = 43/123 (34%), Gaps = 11/123 (8%)
Query: 8 LFYCSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPD 67
L + RF L D + LA+L S+ ++ K + ++ K ++
Sbjct: 140 LRHLLGFRFAPRLKDIADQKLYIPEKNELALLPSIIGGSNIIIEIKQYDNMLKDAISIKV 199
Query: 68 VGVQHRALVIVGNILSAGKELAE------RVLSTELMEILMALTLLEGEEVRKLALQALA 121
A++ N S A+ R+ T + L E+ R+ L+ L
Sbjct: 200 GNADPSAILRRLNRASRQHPTAKALLELGRIEKT-----IFLCNYLSDEDFRRRILEGLN 254
Query: 122 KGE 124
GE
Sbjct: 255 VGE 257
>gnl|CDD|183563 PRK12508, PRK12508, putative monovalent cation/H+ antiporter
subunit B; Reviewed.
Length = 139
Score = 25.3 bits (56), Expect = 7.9
Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 33 AGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIV 78
AL +L T + SMLL N++ +L V QH + +V
Sbjct: 71 MAALGVLIYGGTGIASMLLGGNFLSYD--VLIFDSVTGQHLGIFLV 114
>gnl|CDD|219797 pfam08326, ACC_central, Acetyl-CoA carboxylase, central region.
The region featured in this family is found in various
eukaryotic acetyl-CoA carboxylases, N-terminal to the
catalytic domain (pfam01039). This enzyme (EC:6.4.1.2)
is involved in the synthesis of long-chain fatty acids,
as it catalyzes the rate-limiting step in this process.
Length = 707
Score = 26.1 bits (58), Expect = 8.2
Identities = 12/62 (19%), Positives = 26/62 (41%), Gaps = 2/62 (3%)
Query: 58 IFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLAL 117
+ +L+H V+ + +++ + L + L + L L+ L K+AL
Sbjct: 192 VVDIVLSHS--RVKAKNKLVLALLDQLVYPLLPSTVPASLRDALSRLSSLNSRAYAKVAL 249
Query: 118 QA 119
+A
Sbjct: 250 KA 251
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 26.1 bits (58), Expect = 8.2
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 3/37 (8%)
Query: 100 ILMALTL---LEGEEVRKLALQALAKGEEYGIIRRPG 133
LM + L L G V L + LA E + ++ PG
Sbjct: 476 FLMLVVLALFLVGGNVYTLVCEPLANNELFQVLDTPG 512
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.136 0.385
Gapped
Lambda K H
0.267 0.0786 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,170,413
Number of extensions: 659734
Number of successful extensions: 685
Number of sequences better than 10.0: 1
Number of HSP's gapped: 683
Number of HSP's successfully gapped: 34
Length of query: 138
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 51
Effective length of database: 7,078,804
Effective search space: 361019004
Effective search space used: 361019004
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.5 bits)
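
The length and cutoff figures in the statistics footer above are related by simple arithmetic, which the sketch below reproduces from the printed values. It assumes the standard BLAST conventions: the length adjustment is subtracted once from the query and once per database sequence, X drop-off values are scaled by lambda / ln 2, and score cutoffs are converted as (lambda * S - ln K) / ln 2. The variable and function names are illustrative and are not part of the report. Because the printed Lambda and K are rounded, the results are only checked to one decimal place.

```python
import math

# Values copied from the statistics footer of the report.
query_len = 138
db_letters = 10_937_602
db_seqs = 44_354
length_adjustment = 87

# Assumption: the adjustment is removed once from the query
# and once from every database sequence.
eff_query_len = query_len - length_adjustment
eff_db_len = db_letters - db_seqs * length_adjustment
search_space = eff_query_len * eff_db_len

# Karlin-Altschul parameters as printed (rounded) in the report.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.321, 0.136
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0786


def dropoff_bits(raw, lam):
    """Scale a raw X drop-off value to bits (no K term)."""
    return lam * raw / math.log(2)


def score_bits(raw, lam, k):
    """Convert a raw score to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


x1_bits = round(dropoff_bits(16, UNGAPPED_LAMBDA), 1)          # X1 uses ungapped lambda
x2_bits = round(dropoff_bits(38, GAPPED_LAMBDA), 1)            # X2 uses gapped lambda
s1_bits = round(score_bits(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1)
s2_bits = round(score_bits(54, GAPPED_LAMBDA, GAPPED_K), 1)

print(eff_query_len, eff_db_len, search_space)
print(x1_bits, x2_bits, s1_bits, s2_bits)
```

The per-hit Expect values are not recomputed here, since RPS-BLAST may apply additional corrections that the footer does not list.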