RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254781093|ref|YP_003065506.1| hypothetical protein
CLIBASIA_04975 [Candidatus Liberibacter asiaticus str. psy62]
(225 letters)
>gnl|CDD|132144 TIGR03100, hydr1_PEP, hydrolase, ortholog 1, exosortase system type
1 associated. This group of proteins are members of the
alpha/beta hydrolase superfamily. These proteins are
generally found in genomes containing the
exosortase/PEP-CTERM protein expoert system,
specifically the type 1 variant of this system described
by the Genome Property GenProp0652. When found in this
context they are invariably present in the vicinity of a
second, relatively unrelated enzyme (ortholog 2,
TIGR03101) of the same superfamily.
Length = 274
Score = 40.6 bits (95), Expect = 4e-04
Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 7/89 (7%)
Query: 57 GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQ 116
GF LRF++RG+G SEGE +G +D AAA+D + P + G A ++
Sbjct: 57 GFPVLRFDYRGMGDSEGENLGFEGIDADIAAAIDAFREAAPHLRRIVAWGLCDAASAALL 116
Query: 117 LLMRRPEINGFISVAP-------QPKSYD 138
+ G + + P Q S
Sbjct: 117 YAPADLRVAGLVLLNPWVRTEAAQAASRI 145
>gnl|CDD|162143 TIGR00976, /NonD, putative hydrolase, CocE/NonD family. This
model represents a protein subfamily that includes the
cocaine esterase CocE, several glutaryl-7-ACA acylases,
and the putative diester hydrolase NonD of Streptomyces
griseus (all hydrolases). This family shows extensive,
low-level similarity to a family of xaa-pro
dipeptidyl-peptidases, and local similarity by
PSI-BLAST to many other hydrolases.
Length = 550
Score = 38.2 bits (89), Expect = 0.002
Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 53 FQQRGFVSLRFNFRGIGRSEGEFDYGD-GELSDAAAALDWV 92
F +G+ + + RG G SEGEFD E +D +DW+
Sbjct: 49 FVAQGYAVVIQDTRGRGASEGEFDLLGSDEAADGYDLVDWI 89
>gnl|CDD|163354 TIGR03611, RutD, pyrimidine utilization protein D. This protein is
observed in operons extremely similar to that
characterized in E. coli K-12 responsible for the import
and catabolism of pyrimidines, primarily uracil. This
protein is a member of the hydrolase, alpha/beta fold
family defined by pfam00067.
Length = 257
Score = 34.3 bits (79), Expect = 0.028
Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 10/88 (11%)
Query: 52 LFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGA 111
+ QR F + ++ RG GRS GE G A L + +LN E G++ G
Sbjct: 35 VLTQR-FHVVTYDHRGTGRSPGELPPGYSIAHMADDVLQLLDALNIER--FHFVGHALGG 91
Query: 112 WISMQLLMRRPE-------INGFISVAP 132
I +QL +R PE IN + P
Sbjct: 92 LIGLQLALRYPERLLSLVLINAWSRPDP 119
>gnl|CDD|177511 PHA03061, PHA03061, putative DNA-binding virion core protein;
Provisional.
Length = 311
Score = 30.8 bits (70), Expect = 0.26
Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 21/68 (30%)
Query: 163 SDVKDLVNKLMNQKGISITHKVIPDANHFFIG---------KVDELINECAHYLDNSLDE 213
SDVK L +L A+ F K+++LIN+ LDE
Sbjct: 125 SDVKVLAARLKT------------SASSFEFNGHTYHLENDKIEDLINQLVKNGAIILDE 172
Query: 214 KFTLLKSI 221
K ++ S+
Sbjct: 173 KSSIKDSM 180
>gnl|CDD|163135 TIGR03101, hydr2_PEP, hydrolase, ortholog 2, exosortase system type
1 associated. This group of proteins are members of the
alpha/beta hydrolase superfamily. These proteins are
generally found in genomes containing the
exosortase/PEP-CTERM protein expoert system,
specifically the type 1 variant of this system described
by the Genome Property GenProp0652. When found in this
context they are invariably present in the vicinity of a
second, relatively unrelated enzyme (ortholog 1,
TIGR03100) of the same superfamily.
Length = 266
Score = 30.6 bits (69), Expect = 0.31
Identities = 29/129 (22%), Positives = 45/129 (34%), Gaps = 15/129 (11%)
Query: 8 GPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMND--NIVYQLFYLFQQRGFVSLRFNF 65
P G Y P P ++++ P F MN +V F GF L+ +
Sbjct: 7 APHGFRFCLYHPPVA-VGPRGVVIYLPP-FAEEMNKSRRMVALQARAFAAGGFGVLQIDL 64
Query: 66 RGIGRSEGEFDYG--DGELSDAAAALDWVQSLNPESKSC---WIAGYSFGAWISMQLLMR 120
G G S G+F D D AAA W+ + + G GA +++
Sbjct: 65 YGCGDSAGDFAAARWDVWKEDVAAAYRWL-----IEQGHPPVTLWGLRLGALLALDAANP 119
Query: 121 RP-EINGFI 128
+ N +
Sbjct: 120 LAAKCNRLV 128
>gnl|CDD|165939 PLN02298, PLN02298, hydrolase, alpha/beta fold family protein.
Length = 330
Score = 29.7 bits (67), Expect = 0.66
Identities = 40/142 (28%), Positives = 59/142 (41%), Gaps = 14/142 (9%)
Query: 6 FNGPSGR--LEGRYQPSTNPNAPIALILHPHPRFGGTMND-NIVYQLFYLF-QQRGFVSL 61
F P G + PS + + P ALI H G ND + +Q +F Q GF
Sbjct: 37 FTSPRGLSLFTRSWLPS-SSSPPRALIFMVH----GYGNDISWTFQSTAIFLAQMGFACF 91
Query: 62 RFNFRGIGRSEGEFDYG---DGELSDAAAALDWVQSLNP-ESKSCWIAGYSFGAWISMQL 117
+ G GRSEG Y D + D + + V+ + ++ G S G I + +
Sbjct: 92 ALDLEGHGRSEGLRAYVPNVDLVVEDCLSFFNSVKQREEFQGLPRFLYGESMGGAICLLI 151
Query: 118 LMRRPE-INGFISVAPQPKSYD 138
+ PE +G + VAP K D
Sbjct: 152 HLANPEGFDGAVLVAPMCKISD 173
>gnl|CDD|183825 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 29.3 bits (66), Expect = 0.76
Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 15/75 (20%)
Query: 150 LIING-SNDTVATTSDVKDLVNKLMNQ--------KGISITHKVIPDANHFFIGKVDELI 200
LII+G ++ V L K IS+T K I AN FF K+ L
Sbjct: 219 LIISGGQSNDSNLYLAADQFVRTLKEDDYKIDEETKAISLTEKGIKKANKFF--KLKNLY 276
Query: 201 N----ECAHYLDNSL 211
+ E H + N+L
Sbjct: 277 DIENSELVHRIQNAL 291
>gnl|CDD|131480 TIGR02427, protocat_pcaD, 3-oxoadipate enol-lactonase. Members of
this family are 3-oxoadipate enol-lactonase. Note that
the substrate is known as 3-oxoadipate enol-lactone,
2-oxo-2,3-dihydrofuran-5-acetate,
4,5-Dihydro-5-oxofuran-2-acetate, and
5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the
catalyzes the fourth step in the protocatechuate
degradation to beta-ketoadipate and then to succinyl-CoA
and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate,
and vanillate all can be converted in one step to
protocatechuate. This enzyme also acts in catechol
degradation. In genomes that catabolize both catechol
and protocatechuate, two forms of this enzyme may be
found. All members of the seed alignment for this model
were chosen from within protocatechuate degradation
operons of at least three genes of the pathway, from
genomes with the complete pathway through
beta-ketoadipate.
Length = 251
Score = 28.9 bits (65), Expect = 1.0
Identities = 24/71 (33%), Positives = 31/71 (43%), Gaps = 7/71 (9%)
Query: 56 RGFVSLRFNFRGIGRSE-GEFDYGDGELS-DAAAALDWVQSLNPESKSCWIAGYSFGAWI 113
F LR++ RG G S+ E Y +L+ D A LD L E G S G I
Sbjct: 38 PDFRVLRYDKRGHGLSDAPEGPYSIEDLADDVLALLD---HLGIERAV--FCGLSLGGLI 92
Query: 114 SMQLLMRRPEI 124
+ L RRP+
Sbjct: 93 AQGLAARRPDR 103
>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
dihydrolipoyllysine-residue acetyltransferase;
Provisional.
Length = 371
Score = 28.4 bits (64), Expect = 1.7
Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 7/59 (11%)
Query: 67 GIGRSEGEFDYGD-GELSDA-AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE 123
G G S G EL+ A A LD +L E + G+S G ++++L R P+
Sbjct: 167 GHGASSKAVGAGSLDELAAAVLAFLD---ALGIER--AHLVGHSMGGAVALRLAARAPQ 220
>gnl|CDD|184688 PRK14457, PRK14457, ribosomal RNA large subunit methyltransferase
N; Provisional.
Length = 345
Score = 28.0 bits (63), Expect = 1.9
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 6/35 (17%)
Query: 174 NQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208
NQK + +IP A ++ I + L+ +C HY+
Sbjct: 222 NQK---LRETLIPSAKNYPI---ENLLEDCRHYVA 250
>gnl|CDD|151520 pfam11075, DUF2780, Protein of unknown function VcgC/VcgE
(DUF2780). This is a bacterial family of
uncharacterized proteins.
Length = 168
Score = 26.9 bits (60), Expect = 3.9
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 156 NDTVATTSDVKDLVNKLMNQKGIS 179
VA T++ DLV LM+Q G+S
Sbjct: 36 AAQVAATTETSDLVGSLMSQLGVS 59
>gnl|CDD|181029 PRK07560, PRK07560, elongation factor EF-2; Reviewed.
Length = 731
Score = 27.1 bits (61), Expect = 4.0
Identities = 8/11 (72%), Positives = 8/11 (72%)
Query: 192 FIGKVDELINE 202
FI KVD LI E
Sbjct: 145 FINKVDRLIKE 155
>gnl|CDD|151780 pfam11339, DUF3141, Protein of unknown function (DUF3141). This
family of proteins with unknown function appears to be
restricted to Proteobacteria.
Length = 581
Score = 26.8 bits (60), Expect = 4.8
Identities = 18/52 (34%), Positives = 25/52 (48%)
Query: 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSY 137
AA L V L+P++ + G W + L RP++ G I VA P SY
Sbjct: 126 AAFLREVIELHPDAPKPAVIGNCQAGWALLMLAALRPDLFGPIIVAGAPLSY 177
>gnl|CDD|185393 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 26.3 bits (58), Expect = 6.4
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 177 GISITHKVIPDANHFFIGKVDE 198
GI+I K+IP A ++F+G+ +
Sbjct: 260 GITIRDKLIPYAVYWFLGEAID 281
>gnl|CDD|179529 PRK03057, PRK03057, hypothetical protein; Provisional.
Length = 180
Score = 25.9 bits (57), Expect = 7.7
Identities = 8/16 (50%), Positives = 10/16 (62%)
Query: 21 TNPNAPIALILHPHPR 36
NPN P+ LILH +
Sbjct: 87 LNPNEPLYLILHFPGK 102
>gnl|CDD|152581 pfam12146, Hydrolase_4, Putative lysophospholipase. This domain
is found in bacteria and eukaryotes and is
approximately 110 amino acids in length. It is found in
association with pfam00561. Many members are annotated
as being lysophospholipases, and others as alpha-beta
hydrolase fold-containing proteins.
Length = 79
Score = 26.0 bits (58), Expect = 8.0
Identities = 15/66 (22%), Positives = 27/66 (40%), Gaps = 8/66 (12%)
Query: 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQ-LFYLFQQRGFVSLRFNFRGIGRSEGE 74
R+ P+ P A + +++H G Y L +G+ ++ RG GRS G+
Sbjct: 8 RWLPAGPPKAVV-VLVH---GGGEHSG---RYAELAEELAAQGYAVYAYDHRGHGRSPGK 60
Query: 75 FDYGDG 80
+
Sbjct: 61 RGHVPS 66
>gnl|CDD|173608 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 25.9 bits (57), Expect = 8.2
Identities = 15/73 (20%), Positives = 29/73 (39%), Gaps = 10/73 (13%)
Query: 112 WIS--MQLLMRRPEINGFISVAPQPKSYDFS----FLAPCPSSGLIINGSN----DTVAT 161
WI ++ L+++ E + + P PK + + A GL+ N T
Sbjct: 432 WIESKIRFLIKKLETLNNLKIRPYPKFFKYQDDGWDYASSFFIGLVFFSKNVYNNSTFDL 491
Query: 162 TSDVKDLVNKLMN 174
++D V+ + N
Sbjct: 492 RYAIRDFVDIINN 504
>gnl|CDD|150036 pfam09220, LA-virus_coat, L-A virus, major coat protein. Members
of this family form the major coat protein of the
Saccharomyces cerevisiae L-A virus.
Length = 436
Score = 25.8 bits (56), Expect = 8.6
Identities = 25/114 (21%), Positives = 44/114 (38%), Gaps = 13/114 (11%)
Query: 64 NFRGIGRSEGEFDY---GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR 120
N G G + + GD L A A DW ++ + + + ++ L +R
Sbjct: 321 NIPKFGSIRGRYPFLLSGDAALIQATALEDW-SAIMAKPELVFTYAMQVSVALNTGLYLR 379
Query: 121 RPEINGFISVAP---------QPKSYDFSFLAPCPSSGLIINGSNDTVATTSDV 165
R + GF + QP+++ + LA C +NG +D T D+
Sbjct: 380 RVKKTGFGTTIDDSYEDGAFLQPETFVQAALACCTGQDAPLNGMSDVYVTYPDL 433
>gnl|CDD|140308 PTZ00285, PTZ00285, glucosamine-6-phosphate isomerase; Provisional.
Length = 253
Score = 25.9 bits (57), Expect = 9.5
Identities = 8/23 (34%), Positives = 10/23 (43%), Gaps = 1/23 (4%)
Query: 185 IPDAN-HFFIGKVDELINECAHY 206
I + N H G +L EC Y
Sbjct: 99 IKEENRHILNGTAPDLEEECRRY 121
>gnl|CDD|178176 PLN02562, PLN02562, UDP-glycosyltransferase.
Length = 448
Score = 25.6 bits (56), Expect = 9.5
Identities = 10/26 (38%), Positives = 14/26 (53%), Gaps = 4/26 (15%)
Query: 89 LDWVQSLNPESKSCWIAGYSFGAWIS 114
L W+Q P S + SFG+W+S
Sbjct: 264 LGWLQEQKPNS----VIYISFGSWVS 285
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.321 0.139 0.426
Gapped
Lambda K H
0.267 0.0779 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 3,839,980
Number of extensions: 241372
Number of successful extensions: 484
Number of sequences better than 10.0: 1
Number of HSP's gapped: 484
Number of HSP's successfully gapped: 23
Length of query: 225
Length of database: 5,994,473
Length adjustment: 90
Effective length of query: 135
Effective length of database: 4,049,753
Effective search space: 546716655
Effective search space used: 546716655
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (24.9 bits)