RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254781038|ref|YP_003065451.1| putative DNA-binding/iron
metalloprotein/AP endonuclease [Candidatus Liberibacter asiaticus str.
psy62]
(363 letters)
>gnl|CDD|181984 PRK09604, PRK09604, UGMP family protein; Validated.
Length = 332
Score = 481 bits (1241), Expect = e-136
Identities = 173/346 (50%), Positives = 224/346 (64%), Gaps = 15/346 (4%)
Query: 7 TVIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDIL 66
++GIETSCDET+VAVV D +L+ V SQID H YGGVVPE+A+RAHV+ + L
Sbjct: 2 LILGIETSCDETSVAVV---DDGRGLLSNVVASQIDLHARYGGVVPELASRAHVENIVPL 58
Query: 67 IKQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILT 126
I++ L A + + D+D+IAVTAGPGL+G L+VG+ AKA++ +KP +NHLEGH+L
Sbjct: 59 IEEALKEAGLTLEDIDAIAVTAGPGLVGALLVGVSFAKALALALNKPLIGVNHLEGHLLA 118
Query: 127 ARLTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPG 186
L ++ FP+L LLVSGGHTQ++LV+ + Y+ LG T+DDA GE FDK+AK LGL YPG
Sbjct: 119 PFLEEEPEFPFLALLVSGGHTQLVLVKGIGDYELLGETLDDAAGEAFDKVAKLLGLGYPG 178
Query: 187 GVEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIADICAS 246
G I+K A GD FKFP P+ + DFSFSGLKT+V TI +Q ADI AS
Sbjct: 179 GPAIDKLAKQGDPDAFKFPRPMDRPG-LDFSFSGLKTAVLNTI----EKSEQTKADIAAS 233
Query: 247 FQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVA 306
FQ VV +L + K+ K K LVV+GGVA+N +R L +L G
Sbjct: 234 FQAAVVDVLVIKTKRAL----KQTGVKT--LVVAGGVAANSGLRERLAELAKKRGIEVFI 287
Query: 307 PPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSRWPLDEKAPS 352
PP +LCTDNA MIA A ER++AG DL ++ R RWPLDE +
Sbjct: 288 PPLKLCTDNAAMIAAAGYERLKAGEF-SDLDLNARPRWPLDELSAL 332
>gnl|CDD|163435 TIGR03723, bact_gcp, putative glycoprotease GCP. This model
represents bacterial members of a protein family that is
widely distributed. In a few pathogenic species, the
protein is exported in a way that may represent an
exceptional secondary function. This model plus
companion (archaeal) model TIGR03722 together span the
prokaryotic member sequences of TIGR00329, a protein
family that appears universal in life, and whose broad
function is unknown. A member of TIGR03722 has been
characterized as a DNA-binding protein with apurinic
endopeptidase activity. In contrast, the rare
characterized members of the present family show
O-sialoglycoprotein endopeptidase (EC. 3.4.24.57)
activity after export. These include glycoprotease (gcp)
from Pasteurella haemolytica A1 and a cohemolysin from
Riemerella anatipestifer (GB|AAG39646.1). The member
from Staphylococcus aureus is essential and is related
to cell wall dynamics and the modulation of autolysis,
but members are also found in the Mycoplasmas (which
lack a cell wall). A reasonable hypothesis is that
virulence-related activities after export are secondary
to a bacterial domain-wide unknown function.
Length = 314
Score = 444 bits (1144), Expect = e-125
Identities = 163/323 (50%), Positives = 210/323 (65%), Gaps = 11/323 (3%)
Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67
++GIETSCDETAVA+V D +L+ V SQI+ H YGGVVPE+A+RAH++ + LI
Sbjct: 1 ILGIETSCDETAVAIV---DDGKGLLSNIVASQIELHARYGGVVPELASRAHLEAIPPLI 57
Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTA 127
++ L A + +SD+D+IAVTAGPGL+G L+VG+ AKA++ +KP +NHLEGH+L
Sbjct: 58 EEALAEAGLTLSDIDAIAVTAGPGLIGALLVGVSFAKALALALNKPLIGVNHLEGHLLAP 117
Query: 128 RLTD-DIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPG 186
L D + FP+L LLVSGGHTQ++LV+ V Y+ LG T+DDA GE FDK+A+ LGL YPG
Sbjct: 118 FLEDKPLEFPFLALLVSGGHTQLVLVKGVGDYELLGETLDDAAGEAFDKVARLLGLGYPG 177
Query: 187 GVEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFD-VLEKQDIADICA 245
G I+K A GD K FKFP P+ DFSFSGLKT+V I E+ D ADI A
Sbjct: 178 GPAIDKLAKEGDPKAFKFPRPMTGRPGLDFSFSGLKTAVLNLIEKLKQKGEELDKADIAA 237
Query: 246 SFQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFV 305
SFQ VV +L + K+ K K LVV+GGVA+N +R L +L G
Sbjct: 238 SFQAAVVDVLVEKTKRAL----KKTGLKT--LVVAGGVAANSRLRERLEELAEKAGLEVF 291
Query: 306 APPARLCTDNAVMIAWAALERME 328
PP LCTDNA MIA A ER++
Sbjct: 292 IPPLELCTDNAAMIAAAGYERLK 314
>gnl|CDD|129429 TIGR00329, gcp_kae1, metallohydrolase, glycoprotease/Kae1 family.
This subfamily includes the well-studied secreted
O-sialoglycoprotein endopeptidase (glycoprotease, EC
3.4.24.57) of Pasteurella haemolytica, a pathogen. A
member from Riemerella anatipestifer, associated with
cohemolysin activity, likewise is exported without
benefit of a classical signal peptide and shows
glycoprotease activity on the test substrate
glycophorin. However, archaeal members of this subfamily
show unrelated activities as demonstrated in Pyrococcus
abyssi: DNA binding, iron binding, apurinic endonuclease
activity, genomic association with a kinase domain, and
no glycoprotease activity. This family thus pulls
together a set of proteins as a homology group that
appears to be near-universal in life, yet heterogeneous
in assayed function between bacteria and archaea.
Length = 305
Score = 312 bits (802), Expect = 8e-86
Identities = 137/314 (43%), Positives = 182/314 (57%), Gaps = 11/314 (3%)
Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68
+GIETSCD+T VA+V D G +LA +SQI H YGGVVPE A+R H + + L++
Sbjct: 1 LGIETSCDDTGVAIV---DEEGNVLANIKISQIPLHAKYGGVVPEEASRHHAENIPPLLE 57
Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128
+ L+ +N+ S++D IA T GPGL G L VG A++++ KP +NHL GHI R
Sbjct: 58 RALIESNVDKSEIDLIAYTQGPGLGGSLRVGATFARSLALSLDKPLIGVNHLLGHIYAPR 117
Query: 129 LTDDI-AFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGG 187
L +I FP++ LLVSGGHTQI+ V+ + Y+ LG T+DDA+GE FDK+A+ LGL YPGG
Sbjct: 118 LDTNILQFPFVSLLVSGGHTQIIAVKGIGDYEVLGETLDDAVGEAFDKVARLLGLGYPGG 177
Query: 188 VEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAF-DVLEKQDIADICAS 246
+IE+ A GD F FP P + DFSFSGLKT+ + I L + DI S
Sbjct: 178 PKIEELAKKGDKLPFYFPLPYTVKPMLDFSFSGLKTAALRKIEKLKKNLNEATKEDIAYS 237
Query: 247 FQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVA 306
FQ T L + K+ LV+ GGV++N +R L LC F
Sbjct: 238 FQETAFDHLIEKTKRALKDTGP------KELVLVGGVSANKRLREMLETLCQELNVEFYY 291
Query: 307 PPARLCTDNAVMIA 320
PP C+DN MIA
Sbjct: 292 PPLEFCSDNGAMIA 305
>gnl|CDD|163434 TIGR03722, arch_KAE1, universal archaeal protein Kae1. This family
represents the archaeal protein Kae1. Its partner Bud32
is fused with it in about half of the known archaeal
genomes. The pair, which appears universal in the
archaea, corresponds to EKC/KEOPS complex in eukaryotes.
A recent characterization of the member from Pyrococcus
abyssi, as an iron-binding, atypical DNA-binding protein
with an apurinic lyase activity, challenges the common
annotation of close homologs as O-sialoglycoprotein
endopeptidase. The latter annotation is based on a
characterized protein from the bacterium Pasteurella
haemolytica.
Length = 322
Score = 179 bits (457), Expect = 7e-46
Identities = 120/348 (34%), Positives = 168/348 (48%), Gaps = 39/348 (11%)
Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68
+GIE + V +V D GEILA + + + G G+ P AA H +V LIK
Sbjct: 1 LGIEGTAHTFGVGIV---DEDGEILANVSDTYVPEKG---GIHPREAAEHHAEVAPKLIK 54
Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128
+ L A + + D+D++A + GPGL L VG A+A++ +KP +NH HI R
Sbjct: 55 EALEEAGVSLEDIDAVAFSQGPGLGPCLRVGATAARALALKLNKPLVGVNHCVAHIEIGR 114
Query: 129 LTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGV 188
LT P +VL VSGG+TQ++ R+ Y G T+D LG DK A+ +GL +PGG
Sbjct: 115 LTTGAKDP-VVLYVSGGNTQVIAYRN-GRYRVFGETLDIGLGNALDKFAREVGLGHPGGP 172
Query: 189 EIEKAALMGDGKRFKFPCPL---VQGTLCDFSFSGLKTSVQKTICAFDVLEKQ-DIADIC 244
+IE+ A G K L V+G D SFSGL T+ A +K + D+C
Sbjct: 173 KIEELAEKG-----KEYIELPYTVKGM--DLSFSGLLTA------ALRAYKKGARLEDVC 219
Query: 245 ASFQVT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLH 300
S Q T +V + + L A K+ VL+V GGVA+N +R L +
Sbjct: 220 YSLQETAFAMLVEVTERAL---------AHTGKKEVLLV-GGVAANRRLREMLELMAEDR 269
Query: 301 GFRFVAPPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSRWPLDE 348
G +F PP DN MIA+ L + G+ R RW DE
Sbjct: 270 GAKFYVPPPEYAGDNGAMIAYTGLLMYKHGVTIPVEESRVRQRWRTDE 317
>gnl|CDD|184878 PRK14878, PRK14878, UGMP family protein; Provisional.
Length = 323
Score = 173 bits (442), Expect = 5e-44
Identities = 115/346 (33%), Positives = 172/346 (49%), Gaps = 36/346 (10%)
Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68
+GIE++ V +V+ +LA + + + G G+ P AA+ H +V L++
Sbjct: 1 LGIESTAHTLGVGIVKEDK----VLANVRDTYVPEKG---GIHPREAAQHHAEVAPELLR 53
Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128
+ L +A + I D+D++AV+ GPGL L VG A+A++ +KP +NH HI R
Sbjct: 54 KALEKAGISIEDIDAVAVSQGPGLGPALRVGATAARALALKYNKPLVPVNHCIAHIEIGR 113
Query: 129 LTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGV 188
LT P +VL VSGG+TQ+L R Y G T+D A+G D A+ +GL PGG
Sbjct: 114 LTTGAKDP-VVLYVSGGNTQVLAFRG-GRYRVFGETLDIAIGNALDTFAREVGLAPPGGP 171
Query: 189 EIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIADICASFQ 248
IEK A G+ K + P V+G D SFSGL T+ + K+ + D+C S +
Sbjct: 172 AIEKCAEKGE-KYIELPYV-VKGQ--DLSFSGLLTAALRL-----YKGKERLEDVCYSLR 222
Query: 249 VT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRF 304
T +V + + L A K+ VL+V GGVA+N +R L + G +F
Sbjct: 223 ETAFAMLVEVTERAL---------AHTGKKEVLLV-GGVAANRRLREKLEIMAEDRGAKF 272
Query: 305 VAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSRWPLDE 348
P DN MIA+ L + G +P ++ + RW LDE
Sbjct: 273 YVVPPEYAGDNGAMIAYTGLLAYKHGVTIPPEESFVRQ--RWRLDE 316
>gnl|CDD|181985 PRK09605, PRK09605, bifunctional UGMP family
protein/serine/threonine protein kinase; Validated.
Length = 535
Score = 170 bits (432), Expect = 6e-43
Identities = 120/348 (34%), Positives = 178/348 (51%), Gaps = 45/348 (12%)
Query: 8 VIGIETSCDETAVAVVRRKDSHGEILA-EAVLSQIDQHGHYGGVVPEVAARAHVDVLDIL 66
V+GIE + +T+ +V DS G++L E+ + GG+ P AA H + + +
Sbjct: 3 VLGIEGTAWKTSAGIV---DSDGDVLFNESDPYKPPS----GGIHPREAAEHHAEAIPKV 55
Query: 67 IKQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILT 126
IK+ L A ++ D+D +A + GPGL L V A+A++ P +NH H+
Sbjct: 56 IKEALEEAGLKPEDIDLVAFSQGPGLGPCLRVVATAARALALSLDVPLIGVNHCVAHVEI 115
Query: 127 ARLTDDIAFPYLVLLVSGGHTQILLVRDVAH----YDRLGTTIDDALGECFDKIAKSLGL 182
RLT P + L VSGG+TQ+L A+ Y G T+D +G DK A+ +GL
Sbjct: 116 GRLTTGAEDP-VTLYVSGGNTQVL-----AYLNGRYRVFGETLDIGVGNALDKFARHVGL 169
Query: 183 PYPGGVEIEKAALMGDGKRFKFPCP-LVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIA 241
P+PGG +IEK A DGK++ P +V+G DFSFSGL T+ ++ A+D + +
Sbjct: 170 PHPGGPKIEKLAK--DGKKY-IDLPYVVKGM--DFSFSGLLTAAKR---AYD--AGEPLE 219
Query: 242 DICASFQVT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLC 297
D+C S Q T + + + L A K VL+V GGVA+N+ +R L ++C
Sbjct: 220 DVCYSLQETAFAMLTEVTERAL---------AHTGKDEVLLV-GGVAANNRLREMLKEMC 269
Query: 298 VLHGFRFVAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSR 343
G F P R C DN MIAW L +AG L +D ++P R
Sbjct: 270 EERGADFYVPEPRFCGDNGAMIAWLGLLMYKAGDTLDIEDTRVNPNFR 317
>gnl|CDD|185566 PTZ00340, PTZ00340, O-sialoglycoprotein endopeptidase-like protein;
Provisional.
Length = 345
Score = 143 bits (362), Expect = 8e-35
Identities = 107/351 (30%), Positives = 160/351 (45%), Gaps = 33/351 (9%)
Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67
+GIE S ++ V +V GEIL+ + I G G +P A+ H + + L+
Sbjct: 3 ALGIEGSANKLGVGIVTSD---GEILSNVRETYITPPGT--GFLPRETAQHHREHILSLV 57
Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTA 127
K+ L A + SD+ I T GPG+ L VG + A+ +S + KP +NH HI
Sbjct: 58 KEALEEAKITPSDISLICYTKGPGMGAPLSVGAVVARTLSLLWGKPLVGVNHCVAHIEMG 117
Query: 128 RLTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLP-YPG 186
RL P +VL VSGG+TQ++ + Y G TID A+G C D+ A+ L L P
Sbjct: 118 RLVTGAENP-VVLYVSGGNTQVIAYSEH-RYRIFGETIDIAVGNCLDRFARLLNLSNDPA 175
Query: 187 -GVEIEKAALMGDGKRFKFPCP-LVQGTLCDFSFSGLKTSVQKTI----------CAFDV 234
G IE+ A GK P +V+G D SFSG+ T ++ +
Sbjct: 176 PGYNIEQLA--KKGKNL-IELPYVVKGM--DMSFSGILTYIEDLVEHPQFKDVVSEIVPP 230
Query: 235 LEKQDIADICASFQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLI 294
E+ D+C S Q T+ +L ++ + VL+V GGV N ++ +
Sbjct: 231 EEEFFTDDLCFSLQETIFAMLVEVTERAM-----SHCGSNEVLIV-GGVGCNLRLQEMMQ 284
Query: 295 DLCVLHGFRFVAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSR 343
+ G + A R C DN MIA+A L +G P D +++ R R
Sbjct: 285 QMAKERGGKLFAMDERYCIDNGAMIAYAGLLEYLSGGFTPLKDATVTQRFR 335
>gnl|CDD|163437 TIGR03725, bact_YeaZ, universal bacterial protein YeaZ. This
family describes a protein family, YeaZ, that appears to
be universal in bacteria, but whose function is unknown.
This family is related to the gcp (glycoprotease)
protein family, also universal in bacteria and unknown
in function. In Gram-positive lineages, members of these
two related families often belong to the same operon,
along with the ribosomal-protein-alanine
acetyltransferase gene. Members of this family may occur
as fusions with gcp or the ribosomal protein
N-acetyltransferase rimI, and is frequently encoded next
to rimI.
Length = 202
Score = 58.0 bits (141), Expect = 4e-09
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 19/114 (16%)
Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67
++ I+TS + +VA++ GEILAE E A R H ++L +I
Sbjct: 1 ILAIDTSTEALSVALLD----DGEILAERS---------------EEAGRNHSEILLPMI 41
Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLE 121
++ L A + + D+D+IAV GPG GL +GL TAK ++ P ++ LE
Sbjct: 42 EELLAEAGLSLQDLDAIAVGVGPGSFTGLRIGLATAKGLALALGIPLVGVSSLE 95
>gnl|CDD|161730 TIGR00143, hypF, [NiFe] hydrogenase maturation protein HypF. A
previously described regulatory effect of HypF
mutatation is attributable to loss of activity of a
regulatory hydrogenase. A zinc finger-like region
CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported
the regulatory hypothesis. However, more recent work
(PUBMED:11375153) shows the direct effect is on the
activity of expressed hydrogenases with nickel/iron
centers, rather than on expression.
Length = 711
Score = 34.0 bits (78), Expect = 0.055
Identities = 38/161 (23%), Positives = 58/161 (36%), Gaps = 27/161 (16%)
Query: 156 AHYDRLGTTIDDALGECFDKIAKSLGL----PYPG--GVEIEKAALMGDGKRFKFPCPLV 209
A ++ + + G FD +A +LGL Y G + +E AL DG +P +
Sbjct: 550 ALEKKINAPLTTSTGRLFDAVAAALGLCGERTYEGEAAIALEALALRSDGIAN-YPFEIK 608
Query: 210 QGTLCDFSFSGLKTSVQKTICAFDVLE-KQDIADICASFQVTVVRILQARLKQGF--LLF 266
L LK Q+ + V E + IA I + G +
Sbjct: 609 NKVLD------LKEFYQRFLEDLLVGEDRSKIA-----------HIAHKFVASGLVEIAT 651
Query: 267 RKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVAP 307
A P +V+SGGV N + L GF+F+
Sbjct: 652 AIAVPFGIHKIVISGGVFYNRLLLERLAKYLKGLGFQFLFH 692
>gnl|CDD|178420 PLN02825, PLN02825, amino-acid N-acetyltransferase.
Length = 515
Score = 32.4 bits (74), Expect = 0.21
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
Query: 266 FRKAFPHKQA------VLVVSGGVASNHFIRASLIDLCVLH--GFRFVAPP 308
FR+A+P+ Q V+V+SG V + + L D+ +LH G +FV P
Sbjct: 5 FREAWPYIQGHRGSTFVVVISGEVVAGPHLDNILQDISLLHGLGIKFVLVP 55
>gnl|CDD|162802 TIGR02304, aden_form_hyp, probable adenylate-forming enzyme.
Members of this family form a distinct clade within a
larger family of proteins that also includes coenzyme
F390 synthetase, an enzyme known in Methanobacterium
thermoautotrophicum and a few other methanogenic
archaea. That enzyme adenylates coenzyme F420 to F390, a
reversible process, during oxygen stress. Other
informative homologies include domains of the
non-ribosomal peptide synthetases involved in activation
by adenylation. The family defined by this model is
likely to be of an adenylate-forming enzyme related to
but distinct from coenzyme F390 synthetase.
Length = 430
Score = 32.2 bits (73), Expect = 0.21
Identities = 18/44 (40%), Positives = 23/44 (52%), Gaps = 2/44 (4%)
Query: 221 LKTSVQKTICAFDVLEKQDIADICASFQVTVVRILQARLKQGFL 264
L +K I +VLE QD I F+ TV +I QA +GFL
Sbjct: 210 LTIKPKKVISVAEVLEPQDRELIRNVFKNTVHQIYQA--TEGFL 251
>gnl|CDD|162394 TIGR01507, hopene_cyclase, squalene-hopene cyclase. SHC is an
essential prokaryotic gene in hopanoid (triterpenoid)
biosynthesis. Squalene hopene cyclase, an integral
membrane protein, directly cyclizes squalene into
hopanoid products.
Length = 635
Score = 29.9 bits (67), Expect = 1.1
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 320 AWAALERMEAGLPADDLSISPRSRWPLDEK 349
A A L EAGLPAD ++ W LD++
Sbjct: 320 ALAVLALREAGLPADHDALVKAGEWLLDKQ 349
>gnl|CDD|179249 PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase II;
Provisional.
Length = 724
Score = 28.9 bits (66), Expect = 2.1
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 16/61 (26%)
Query: 43 QHGHYGGVVPEV---AARAHVDVLDILIKQTLLRANMQISDMDSIAVTAGPGLMGGLIVG 99
HGH GG P+V A + +++ I++ L+ + +S+ GGL V
Sbjct: 583 IHGHVGGRPPKVDLEAEKRLQELVREAIREGLVTSAHDVSE-------------GGLAVA 629
Query: 100 L 100
L
Sbjct: 630 L 630
>gnl|CDD|171495 PRK12434, PRK12434, tRNA pseudouridine synthase A; Reviewed.
Length = 245
Score = 28.2 bits (63), Expect = 3.3
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 215 DF-SFSGLKT---SVQKTICAFDVLEKQDIADICAS---FQVTVVRIL 255
DF SF+ K+ S + I + D++E+ I S F +VRI+
Sbjct: 153 DFTSFTNAKSKKKSTVREIYSIDIMEEDGFVQIRVSGNGFLHNMVRII 200
>gnl|CDD|178307 PLN02705, PLN02705, beta-amylase.
Length = 681
Score = 28.4 bits (63), Expect = 3.3
Identities = 14/32 (43%), Positives = 15/32 (46%), Gaps = 5/32 (15%)
Query: 188 VEIEKAALMGDGKRFKF-----PCPLVQGTLC 214
+EI K D F F P PLVQGT C
Sbjct: 630 IEIAKPRNHPDHYHFSFFVYQQPSPLVQGTTC 661
>gnl|CDD|129575 TIGR00484, EF-G, translation elongation factor EF-G. After peptide
bond formation, this elongation factor of bacteria and
organelles catalyzes the translocation of the tRNA-mRNA
complex, with its attached nascent polypeptide chain,
from the A-site to the P-site of the ribosome. Every
completed bacterial genome has at least one copy, but
some species have additional EF-G-like proteins. The
closest homolog to canonical (e.g. E. coli) EF-G in the
spirochetes clusters as if it is derived from
mitochondrial forms, while a more distant second copy is
also present. Synechocystis PCC6803 has a few proteins
more closely related to EF-G than to any other
characterized protein. Two of these resemble E. coli
EF-G more closely than does the best match from the
spirochetes; it may be that both function as authentic
EF-G.
Length = 689
Score = 28.2 bits (63), Expect = 3.4
Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 10/100 (10%)
Query: 248 QVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNH----FIRASLIDLCVLHGFR 303
Q+T VR+ LK G + K+ V + A+N +RA D+C G +
Sbjct: 323 QLTFVRVYSGVLKSGSYVKNSRKNKKERVGRLVKMHANNREEIKEVRAG--DICAAIGLK 380
Query: 304 FVAPPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSR 343
LC +I LERME P L++ P+++
Sbjct: 381 DTTTGDTLCDPKIDVI----LERMEFPEPVISLAVEPKTK 416
>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
Length = 481
Score = 27.6 bits (61), Expect = 4.8
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 318 MIAWA--ALERMEAGLPADDLSISPRSRWPLDEKAPSKIGSGKRGVKA 363
MIAW A + M A L +D+L I+ RS P + + SKI + R V
Sbjct: 379 MIAWPLFAEQNMNAALLSDELGIAVRSDDPKEVISRSKIEALVRKVMV 426
>gnl|CDD|129097 smart00864, Tubulin, Tubulin/FtsZ family, GTPase domain. This
domain is found in all tubulin chains, as well as the
bacterial FtsZ family of proteins. These proteins are
involved in polymer formation. Tubulin is the major
component of microtubules, while FtsZ is the
polymer-forming protein of bacterial cell division, it
is part of a ring in the middle of the dividing cell
that is required for constriction of cell membrane and
cell envelope to yield two daughter cells. FtsZ and
tubulin are GTPases, this entry is the GTPase domain.
FtsZ can polymerise into tubes, sheets, and rings in
vitro and is ubiquitous in bacteria and archaea.
Length = 192
Score = 27.8 bits (63), Expect = 4.9
Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 11/51 (21%)
Query: 45 GHYGGVVPEVAARAHVDVLDILIKQTLLRANMQISDMDSIAVTAGPGLMGG 95
G G PEV A + LD I++ L A+ + +TAG MGG
Sbjct: 55 GLGAGADPEVGREAAEESLDE-IREELEGADG-------VFITAG---MGG 94
>gnl|CDD|178547 PLN02962, PLN02962, hydroxyacylglutathione hydrolase.
Length = 251
Score = 27.5 bits (61), Expect = 6.0
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 5/52 (9%)
Query: 145 GHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGVEIEKAALM 196
G T + ++ + RL T D+ E F I ++L LPYP +++ A M
Sbjct: 195 GFTVSTVGEEMLYNPRL--TKDE---ETFKTIMENLNLPYPKMIDVAVPANM 241
>gnl|CDD|184195 PRK13635, cbiO, cobalt transporter ATP-binding subunit;
Provisional.
Length = 279
Score = 27.3 bits (61), Expect = 6.5
Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 6/43 (13%)
Query: 7 TVIGIETSCDETAVA---VVRRKDSHGEILAEAVLSQIDQHGH 46
TV+ I DE A A +V K GEIL E +I + GH
Sbjct: 193 TVLSITHDLDEAAQADRVIVMNK---GEILEEGTPEEIFKSGH 232
>gnl|CDD|181133 PRK07819, PRK07819, 3-hydroxybutyryl-CoA dehydrogenase;
Validated.
Length = 286
Score = 27.3 bits (61), Expect = 7.3
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 45 GHYGGVVPEVAARAHVDVLDILIKQTLLRA 74
G G + EV ARA VDVL + L A
Sbjct: 14 GQMGAGIAEVCARAGVDVLVFETTEELATA 43
>gnl|CDD|131857 TIGR02810, agaZ_gatZ, D-tagatose-bisphosphate aldolase, class II,
non-catalytic subunit. Aldolases specific for
D-tagatose-bisphosphate occur in distinct pathways in
Escherichia coli and other bacteria, one for the
degradation of galactitol (formerly dulcitol) and one
for degradation of N-acetyl-galactosamine and
D-galactosamine. This family represents a protein of
both systems that behaves as a non-catalytic subunit of
D-tagatose-bisphosphate aldolase, required both for
full activity and for good stability of the aldolase.
Note that members of this protein family appear in
public databases annotated as putative tagatose
6-phosphate kinases, possibly in error.
Length = 420
Score = 27.0 bits (60), Expect = 8.7
Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 8/56 (14%)
Query: 9 IGIETSCD------ETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARA 58
GI + C E A+ R + +L EA +Q++Q G Y G+ P
Sbjct: 13 RGIYSVCSAHPLVLEAAIRRARASGT--PVLIEATSNQVNQFGGYTGMTPADFRDF 66
>gnl|CDD|179206 PRK01022, PRK01022, hypothetical protein; Provisional.
Length = 167
Score = 26.8 bits (60), Expect = 9.8
Identities = 13/43 (30%), Positives = 23/43 (53%), Gaps = 13/43 (30%)
Query: 180 LGLPYPGGVEIEKAALMGDGKRFKFPCPL------VQGTLCDF 216
+G P+PGG ++E+ +G + CP+ V +LC+F
Sbjct: 123 VGTPFPGGFKVEEK----NG---VYYCPVKDKQKDVDFSLCEF 158
>gnl|CDD|185355 PRK15458, PRK15458, tagatose 6-phosphate aldolase subunit KbaZ;
Provisional.
Length = 426
Score = 26.6 bits (59), Expect = 9.9
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 2/36 (5%)
Query: 17 ETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVP 52
E A+ DS +L EA +Q+DQ G Y G+ P
Sbjct: 31 EAAIRYALANDS--PLLIEATSNQVDQFGGYTGMTP 64
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.324 0.139 0.411
Gapped
Lambda K H
0.267 0.0766 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 6,010,246
Number of extensions: 399614
Number of successful extensions: 788
Number of sequences better than 10.0: 1
Number of HSP's gapped: 752
Number of HSP's successfully gapped: 32
Length of query: 363
Length of database: 5,994,473
Length adjustment: 95
Effective length of query: 268
Effective length of database: 3,941,713
Effective search space: 1056379084
Effective search space used: 1056379084
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.0 bits)