RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254781040|ref|YP_003065453.1| hypothetical protein
CLIBASIA_04710 [Candidatus Liberibacter asiaticus str. psy62]
(143 letters)
>gnl|CDD|32770 COG2947, COG2947, Uncharacterized conserved protein [Function
unknown].
Length = 156
Score = 174 bits (443), Expect = 6e-45
Identities = 76/157 (48%), Positives = 96/157 (61%), Gaps = 17/157 (10%)
Query: 1 MAYWLVKSEPSEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYHSN-KGRE 59
AYWL+KSEP +S +M + G E W GVRNYQARN MR M++GD GFFYHSN K
Sbjct: 2 AAYWLMKSEPDVFSIEMLKAVGE--EPWDGVRNYQARNFMRDMKIGDLGFFYHSNCKPPG 59
Query: 60 IVGIFEVITCTYPDPTAEQSSC-------------WECVDICAVCSMPCPVSLMAIKANP 106
IVG+ EV +PDPT + W CVD+ V +P PV+L +KANP
Sbjct: 60 IVGLAEVCALAHPDPTQFDPASPYYDPKATPEDPRWYCVDVRFVRKLPRPVTLKELKANP 119
Query: 107 RLSSMILIV-SSRLSVQPVTTDEYLEVCRMGKLSNPP 142
L+ M L+V +RLSVQPVT +E+ E+ R+G L PP
Sbjct: 120 ELAEMSLLVKGNRLSVQPVTPEEWKEILRLGGLDAPP 156
>gnl|CDD|145181 pfam01878, DUF55, Protein of unknown function DUF55. This family
of proteins have no known function.
Length = 143
Score = 155 bits (395), Expect = 3e-39
Identities = 66/145 (45%), Positives = 84/145 (57%), Gaps = 13/145 (8%)
Query: 2 AYWLVKSEPSEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYHSN-KGREI 60
AYWL+KSEP +S + +G G W GVRNYQARN +R M+VGD FFYHSN K I
Sbjct: 1 AYWLMKSEPDHYSIDDLKGEG--GTPWDGVRNYQARNFLRAMKVGDLVFFYHSNEKPPAI 58
Query: 61 VGIFEVITCTYPDPTAEQSSC---------WECVDICAVCSMPCPVSLMAIKANPRLSSM 111
VGI EV++ YPDPTA W VD+ V + PV+L +KA P L +
Sbjct: 59 VGIAEVVSEAYPDPTAFDPESPYYDPENPRWYRVDVKFVEKLKRPVTLKELKAEPELKDL 118
Query: 112 ILI-VSSRLSVQPVTTDEYLEVCRM 135
L+ SRLSV PVT +E+ + +
Sbjct: 119 SLVRQGSRLSVMPVTEEEWKLILEL 143
>gnl|CDD|38593 KOG3383, KOG3383, KOG3383, Uncharacterized conserved protein
[Function unknown].
Length = 187
Score = 111 bits (279), Expect = 5e-26
Identities = 60/166 (36%), Positives = 81/166 (48%), Gaps = 27/166 (16%)
Query: 3 YWLVKSEPS---------EWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYH 53
YWL+K+EP E+SW+ + G W GVRNYQARNN+R M++GD FFYH
Sbjct: 10 YWLMKTEPEPRYVKGINVEFSWEDLESNGGKISKWDGVRNYQARNNLRSMKLGDLAFFYH 69
Query: 54 SN-KGREIVGIFEVITCTYPDPTA-------------EQSSCWECVDICAVCSMPCPVSL 99
SN K IVG+ E+ YPD Q+ W VD+ A+ M V L
Sbjct: 70 SNCKSPCIVGLVEISREWYPDDAEGVENDPYYDPESTPQNPRWGAVDVQAIGEMRRCVDL 129
Query: 100 MAIKAN----PRLSSMILIVSSRLSVQPVTTDEYLEVCRMGKLSNP 141
+K + L M+L RLS+ PV +E+ +C +G P
Sbjct: 130 KELKMHKDKGIILKGMVLFRRPRLSIVPVEEEEWNFICELGNGFEP 175
>gnl|CDD|30596 COG0247, GlpC, Fe-S oxidoreductase [Energy production and
conversion].
Length = 388
Score = 27.6 bits (60), Expect = 1.4
Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 2/55 (3%)
Query: 80 SCWECVDICAVCSMPCPVSLMAIKANPR-LSSMILIVSSRLSVQPVTTDEYLEVC 133
S +CV C C+ CP +PR ++ V + E L+ C
Sbjct: 7 SLDKCVH-CGFCTNVCPSYRATEALSPRGRIVLVREVLRGKAPGDEEVYEALDTC 60
>gnl|CDD|112456 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81. Family of
eukaryotic beta-1,3-glucanases.
Length = 694
Score = 27.6 bits (61), Expect = 1.5
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 12 EWSWKMQQDKGRVGEAWTGV 31
EW+WK+ VGE W G+
Sbjct: 635 EWNWKIPPIINSVGEGWKGI 654
>gnl|CDD|39007 KOG3803, KOG3803, KOG3803, Transcription factor containing C2HC
type Zn finger [Transcription].
Length = 968
Score = 27.1 bits (59), Expect = 1.6
Identities = 9/37 (24%), Positives = 15/37 (40%)
Query: 44 RVGDKGFFYHSNKGREIVGIFEVITCTYPDPTAEQSS 80
++ HS+ GRE V + C+ D + S
Sbjct: 138 DEAEECLLIHSDDGREKVDDSQPPLCSSDDNESNSES 174
>gnl|CDD|38284 KOG3074, KOG3074, KOG3074, Transcriptional regulator of the PUR
family, single-stranded-DNA-binding [Transcription].
Length = 263
Score = 26.1 bits (57), Expect = 3.2
Identities = 9/49 (18%), Positives = 17/49 (34%), Gaps = 7/49 (14%)
Query: 39 NMRKMRVGDKGFFYHSNKGREIVGIFEVITCTYPDPTAEQS-----SCW 82
+ V +K F + K + G+F I+ + + S W
Sbjct: 179 EGTSITVDNKRFLFDVGKNKR--GVFVRISEVKSNGSYRNSITVPFKAW 225
>gnl|CDD|34275 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfC [Energy production and conversion].
Length = 529
Score = 26.0 bits (57), Expect = 3.5
Identities = 11/38 (28%), Positives = 15/38 (39%), Gaps = 9/38 (23%)
Query: 71 YPDPTAEQSS--CWECVDICAVCSMPCPVSLMAIKANP 106
+P EQS C C D CPV+L+ +
Sbjct: 358 LLEPIKEQSCIRCSLCADA-------CPVNLLPQQLYW 388
>gnl|CDD|38190 KOG2979, KOG2979, KOG2979, Protein involved in DNA repair [General
function prediction only].
Length = 262
Score = 26.2 bits (57), Expect = 3.5
Identities = 22/79 (27%), Positives = 29/79 (36%), Gaps = 3/79 (3%)
Query: 63 IFEVITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRLSSMILIVSSRLSVQ 122
+ I PDP E E + V S P+S I NP +S V R S+
Sbjct: 149 AAKDINMKIPDPEEEDED--EELIGQEVFSNRDPISKKPIV-NPVISKKCGHVYDRDSIM 205
Query: 123 PVTTDEYLEVCRMGKLSNP 141
+ DE C + NP
Sbjct: 206 QILCDEITIRCPVLGCENP 224
>gnl|CDD|34436 COG4827, COG4827, Predicted transporter [General function
prediction only].
Length = 239
Score = 26.0 bits (57), Expect = 3.7
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 6/46 (13%)
Query: 88 CAVCSMPCPVSLMAIKANPRLSSMILIVSSRLSVQPVTTDEYLEVC 133
SMPCPV L A ++ I+++S LS+ V + +
Sbjct: 107 FLAISMPCPVCLGA------VAVSIMLLSQILSLSGVKIGLLVGIA 146
>gnl|CDD|113611 pfam04845, PurA, PurA ssDNA and RNA-binding protein. This family
represents most of the length of the protein.
Length = 218
Score = 25.6 bits (56), Expect = 4.8
Identities = 9/30 (30%), Positives = 14/30 (46%), Gaps = 2/30 (6%)
Query: 39 NMRKMRVGDKGFFYH--SNKGREIVGIFEV 66
+ V +K F++ SNK + I EV
Sbjct: 160 EGTSLTVDNKRFYFDVGSNKRGVFMRISEV 189
>gnl|CDD|38865 KOG3661, KOG3661, KOG3661, Uncharacterized conserved protein
[Function unknown].
Length = 1019
Score = 25.4 bits (55), Expect = 5.0
Identities = 16/64 (25%), Positives = 24/64 (37%), Gaps = 5/64 (7%)
Query: 67 ITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRL-----SSMILIVSSRLSV 121
+T PT QS+ W + P+S+ ++ PR S IL S LS
Sbjct: 174 VTALPLHPTRAQSNLWNPQGPLSDGPGSLPLSIAPVRKRPRHDPGAPSPGILQDSDGLSG 233
Query: 122 QPVT 125
+
Sbjct: 234 SYLD 237
>gnl|CDD|36489 KOG1275, KOG1275, KOG1275, PAB-dependent poly(A) ribonuclease,
subunit PAN2 [Replication, recombination and repair].
Length = 1118
Score = 25.7 bits (56), Expect = 5.2
Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 13/52 (25%)
Query: 96 PVSLMA----IKANPRLSSMILIVSSRLSVQPVTTDEYLEVCRMGKLSNPPL 143
P+ ++ +P L++ + + S Q V T LSNPP
Sbjct: 261 PIQFPYGPQFLRFHPSLTTRLAVTSQSGQFQFVDTAT---------LSNPPA 303
>gnl|CDD|38461 KOG3251, KOG3251, KOG3251, Golgi SNAP receptor complex member
[Intracellular trafficking, secretion, and vesicular
transport].
Length = 213
Score = 25.6 bits (56), Expect = 5.3
Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 6/49 (12%)
Query: 5 LVKSEP-----SEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDK 48
LV EP + ++ Q V T +R RNN R+ + ++
Sbjct: 55 LVSKEPPKSRQAAR-LRVDQLLEDVEHLQTSLRTSMNRNNRREQQARER 102
>gnl|CDD|176946 CHL00001, rpoB, RNA polymerase beta subunit.
Length = 1070
Score = 25.6 bits (57), Expect = 5.6
Identities = 9/17 (52%), Positives = 12/17 (70%)
Query: 41 RKMRVGDKGFFYHSNKG 57
R+++VGDK H NKG
Sbjct: 812 REIQVGDKVAGRHGNKG 828
>gnl|CDD|39784 KOG4584, KOG4584, KOG4584, Uncharacterized conserved protein
[General function prediction only].
Length = 348
Score = 25.3 bits (55), Expect = 6.8
Identities = 13/45 (28%), Positives = 17/45 (37%)
Query: 65 EVITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRLS 109
EVI C P + E ++ A + C V L AI L
Sbjct: 230 EVILCANSSPALNDVTYSELKELAAELANDCNVLLKAIDTGQLLV 274
>gnl|CDD|31237 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75
kD subunit (chain G) [Energy production and conversion].
Length = 693
Score = 24.9 bits (54), Expect = 8.2
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 70 TYPDPTAEQSSCWECVDICAV 90
TY D E C CVD+C V
Sbjct: 178 TYLDQPLESELCGNCVDVCPV 198
>gnl|CDD|133131 cd06600, GH31_MGAM-like, This family includes the following closely
related glycosyl hydrolase family 31 (GH31) enzymes:
maltase-glucoamylase (MGAM), sucrase-isomaltase (SI),
lysosomal acid alpha-glucosidase (GAA), neutral
alpha-glucosidase C (GANC), the alpha subunit of neutral
alpha-glucosidase AB (GANAB), and alpha-glucosidase II.
MGAM is one of the two enzymes responsible for
catalyzing the last glucose-releasing step in starch
digestion. SI is implicated in the digestion of dietary
starch and major disaccharides such as sucrose and
isomaltose, while GAA degrades glycogen in the lysosome,
cleaving both alpha-1,4 and alpha-1,6 glucosidic
linkages. MGAM and SI are anchored to small-intestinal
brush-border epithelial cells. The absence of SI from
the brush border membrane or its malfunction is
associated with malabsorption disorders such as
congenital sucrase-isomaltase deficiency (CSID). The
domain architectures of MGAM and SI include two tandem
GH31 catalytic domains, an N-terminal domain found near
the membrane-bound end and a C-terminal luminal domain.
Both of the tandem GH31 domains of MGAM and SI are
included in this family. The domain architecture of GAA
includes an N-terminal TFF (trefoil factor family)
domain in addition to the GH31 catalytic domain.
Deficient GAA expression causes pompe disease, an
autosomal recessive genetic disorder also known as
glycogen storage disease type II (GSDII). GANC and GANAB
are key enzymes in glycogen metabolism that hydrolyze
terminal, non-reducing 1,4-linked alpha-D-glucose
residues from glycogen in the endoplasmic reticulum.
Alpha-glucosidase II is a GH31 enzyme, found in bacteria
and plants, which has exo-alpha-1,4-glucosidase and
oligo-1,6-glucosidase activities. Alpha-glucosidase II
has been characterized in Bacillus
thermoamyloliquefaciens where it forms a homohexamer.
This family also includes the MalA alpha-glucosidase
from Sulfolobus sulfataricus and the AglA
alpha-glucosidase from Picrophilus torridus. MalA is
part of the carbohydrate-metabolizing machinery that
allows this organism to utilize carbohydrates, such as
maltose, as the sole carbon and energy source. The
MGAM-like family corresponds to subgroup 1 in the Ernst
et al classification of GH31 enzymes.
Length = 317
Score = 24.8 bits (54), Expect = 9.1
Identities = 13/36 (36%), Positives = 14/36 (38%)
Query: 47 DKGFFYHSNKGREIVGIFEVITCTYPDPTAEQSSCW 82
DKG F G VG T YPD T + W
Sbjct: 102 DKGKFCEIESGELFVGKMWPGTTVYPDFTNPDTREW 137
>gnl|CDD|31788 COG1600, COG1600, Uncharacterized Fe-S protein [Energy production
and conversion].
Length = 337
Score = 24.6 bits (53), Expect = 9.9
Identities = 8/25 (32%), Positives = 9/25 (36%), Gaps = 4/25 (16%)
Query: 72 PDPTAEQSSCWECVDICAVCSMPCP 96
PD E+ C C C CP
Sbjct: 178 PDRPEEEDHCGSCT----RCLDACP 198
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.320 0.133 0.433
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 1,776,672
Number of extensions: 81345
Number of successful extensions: 206
Number of sequences better than 10.0: 1
Number of HSP's gapped: 195
Number of HSP's successfully gapped: 26
Length of query: 143
Length of database: 6,263,737
Length adjustment: 85
Effective length of query: 58
Effective length of database: 4,426,972
Effective search space: 256764376
Effective search space used: 256764376
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 52 (24.0 bits)