RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254781038|ref|YP_003065451.1| putative DNA-binding/iron metalloprotein/AP endonuclease [Candidatus Liberibacter asiaticus str. psy62] (363 letters) >gnl|CDD|181984 PRK09604, PRK09604, UGMP family protein; Validated. Length = 332 Score = 481 bits (1241), Expect = e-136 Identities = 173/346 (50%), Positives = 224/346 (64%), Gaps = 15/346 (4%) Query: 7 TVIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDIL 66 ++GIETSCDET+VAVV D +L+ V SQID H YGGVVPE+A+RAHV+ + L Sbjct: 2 LILGIETSCDETSVAVV---DDGRGLLSNVVASQIDLHARYGGVVPELASRAHVENIVPL 58 Query: 67 IKQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILT 126 I++ L A + + D+D+IAVTAGPGL+G L+VG+ AKA++ +KP +NHLEGH+L Sbjct: 59 IEEALKEAGLTLEDIDAIAVTAGPGLVGALLVGVSFAKALALALNKPLIGVNHLEGHLLA 118 Query: 127 ARLTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPG 186 L ++ FP+L LLVSGGHTQ++LV+ + Y+ LG T+DDA GE FDK+AK LGL YPG Sbjct: 119 PFLEEEPEFPFLALLVSGGHTQLVLVKGIGDYELLGETLDDAAGEAFDKVAKLLGLGYPG 178 Query: 187 GVEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIADICAS 246 G I+K A GD FKFP P+ + DFSFSGLKT+V TI +Q ADI AS Sbjct: 179 GPAIDKLAKQGDPDAFKFPRPMDRPG-LDFSFSGLKTAVLNTI----EKSEQTKADIAAS 233 Query: 247 FQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVA 306 FQ VV +L + K+ K K LVV+GGVA+N +R L +L G Sbjct: 234 FQAAVVDVLVIKTKRAL----KQTGVKT--LVVAGGVAANSGLRERLAELAKKRGIEVFI 287 Query: 307 PPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSRWPLDEKAPS 352 PP +LCTDNA MIA A ER++AG DL ++ R RWPLDE + Sbjct: 288 PPLKLCTDNAAMIAAAGYERLKAGEF-SDLDLNARPRWPLDELSAL 332 >gnl|CDD|163435 TIGR03723, bact_gcp, putative glycoprotease GCP. This model represents bacterial members of a protein family that is widely distributed. In a few pathogenic species, the protein is exported in a way that may represent an exceptional secondary function. This model plus companion (archaeal) model TIGR03722 together span the prokaryotic member sequences of TIGR00329, a protein family that appears universal in life, and whose broad function is unknown. A member of TIGR03722 has been characterized as a DNA-binding protein with apurinic endopeptidase activity. In contrast, the rare characterized members of the present family show O-sialoglycoprotein endopeptidase (EC. 3.4.24.57) activity after export. These include glycoprotease (gcp) from Pasteurella haemolytica A1 and a cohemolysin from Riemerella anatipestifer (GB|AAG39646.1). The member from Staphylococcus aureus is essential and is related to cell wall dynamics and the modulation of autolysis, but members are also found in the Mycoplasmas (which lack a cell wall). A reasonable hypothesis is that virulence-related activities after export are secondary to a bacterial domain-wide unknown function. Length = 314 Score = 444 bits (1144), Expect = e-125 Identities = 163/323 (50%), Positives = 210/323 (65%), Gaps = 11/323 (3%) Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67 ++GIETSCDETAVA+V D +L+ V SQI+ H YGGVVPE+A+RAH++ + LI Sbjct: 1 ILGIETSCDETAVAIV---DDGKGLLSNIVASQIELHARYGGVVPELASRAHLEAIPPLI 57 Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTA 127 ++ L A + +SD+D+IAVTAGPGL+G L+VG+ AKA++ +KP +NHLEGH+L Sbjct: 58 EEALAEAGLTLSDIDAIAVTAGPGLIGALLVGVSFAKALALALNKPLIGVNHLEGHLLAP 117 Query: 128 RLTD-DIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPG 186 L D + FP+L LLVSGGHTQ++LV+ V Y+ LG T+DDA GE FDK+A+ LGL YPG Sbjct: 118 FLEDKPLEFPFLALLVSGGHTQLVLVKGVGDYELLGETLDDAAGEAFDKVARLLGLGYPG 177 Query: 187 GVEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFD-VLEKQDIADICA 245 G I+K A GD K FKFP P+ DFSFSGLKT+V I E+ D ADI A Sbjct: 178 GPAIDKLAKEGDPKAFKFPRPMTGRPGLDFSFSGLKTAVLNLIEKLKQKGEELDKADIAA 237 Query: 246 SFQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFV 305 SFQ VV +L + K+ K K LVV+GGVA+N +R L +L G Sbjct: 238 SFQAAVVDVLVEKTKRAL----KKTGLKT--LVVAGGVAANSRLRERLEELAEKAGLEVF 291 Query: 306 APPARLCTDNAVMIAWAALERME 328 PP LCTDNA MIA A ER++ Sbjct: 292 IPPLELCTDNAAMIAAAGYERLK 314 >gnl|CDD|129429 TIGR00329, gcp_kae1, metallohydrolase, glycoprotease/Kae1 family. This subfamily includes the well-studied secreted O-sialoglycoprotein endopeptidase (glycoprotease, EC 3.4.24.57) of Pasteurella haemolytica, a pathogen. A member from Riemerella anatipestifer, associated with cohemolysin activity, likewise is exported without benefit of a classical signal peptide and shows glycoprotease activity on the test substrate glycophorin. However, archaeal members of this subfamily show unrelated activities as demonstrated in Pyrococcus abyssi: DNA binding, iron binding, apurinic endonuclease activity, genomic association with a kinase domain, and no glycoprotease activity. This family thus pulls together a set of proteins as a homology group that appears to be near-universal in life, yet heterogeneous in assayed function between bacteria and archaea. Length = 305 Score = 312 bits (802), Expect = 8e-86 Identities = 137/314 (43%), Positives = 182/314 (57%), Gaps = 11/314 (3%) Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68 +GIETSCD+T VA+V D G +LA +SQI H YGGVVPE A+R H + + L++ Sbjct: 1 LGIETSCDDTGVAIV---DEEGNVLANIKISQIPLHAKYGGVVPEEASRHHAENIPPLLE 57 Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128 + L+ +N+ S++D IA T GPGL G L VG A++++ KP +NHL GHI R Sbjct: 58 RALIESNVDKSEIDLIAYTQGPGLGGSLRVGATFARSLALSLDKPLIGVNHLLGHIYAPR 117 Query: 129 LTDDI-AFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGG 187 L +I FP++ LLVSGGHTQI+ V+ + Y+ LG T+DDA+GE FDK+A+ LGL YPGG Sbjct: 118 LDTNILQFPFVSLLVSGGHTQIIAVKGIGDYEVLGETLDDAVGEAFDKVARLLGLGYPGG 177 Query: 188 VEIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAF-DVLEKQDIADICAS 246 +IE+ A GD F FP P + DFSFSGLKT+ + I L + DI S Sbjct: 178 PKIEELAKKGDKLPFYFPLPYTVKPMLDFSFSGLKTAALRKIEKLKKNLNEATKEDIAYS 237 Query: 247 FQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVA 306 FQ T L + K+ LV+ GGV++N +R L LC F Sbjct: 238 FQETAFDHLIEKTKRALKDTGP------KELVLVGGVSANKRLREMLETLCQELNVEFYY 291 Query: 307 PPARLCTDNAVMIA 320 PP C+DN MIA Sbjct: 292 PPLEFCSDNGAMIA 305 >gnl|CDD|163434 TIGR03722, arch_KAE1, universal archaeal protein Kae1. This family represents the archaeal protein Kae1. Its partner Bud32 is fused with it in about half of the known archaeal genomes. The pair, which appears universal in the archaea, corresponds to EKC/KEOPS complex in eukaryotes. A recent characterization of the member from Pyrococcus abyssi, as an iron-binding, atypical DNA-binding protein with an apurinic lyase activity, challenges the common annotation of close homologs as O-sialoglycoprotein endopeptidase. The latter annotation is based on a characterized protein from the bacterium Pasteurella haemolytica. Length = 322 Score = 179 bits (457), Expect = 7e-46 Identities = 120/348 (34%), Positives = 168/348 (48%), Gaps = 39/348 (11%) Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68 +GIE + V +V D GEILA + + + G G+ P AA H +V LIK Sbjct: 1 LGIEGTAHTFGVGIV---DEDGEILANVSDTYVPEKG---GIHPREAAEHHAEVAPKLIK 54 Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128 + L A + + D+D++A + GPGL L VG A+A++ +KP +NH HI R Sbjct: 55 EALEEAGVSLEDIDAVAFSQGPGLGPCLRVGATAARALALKLNKPLVGVNHCVAHIEIGR 114 Query: 129 LTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGV 188 LT P +VL VSGG+TQ++ R+ Y G T+D LG DK A+ +GL +PGG Sbjct: 115 LTTGAKDP-VVLYVSGGNTQVIAYRN-GRYRVFGETLDIGLGNALDKFAREVGLGHPGGP 172 Query: 189 EIEKAALMGDGKRFKFPCPL---VQGTLCDFSFSGLKTSVQKTICAFDVLEKQ-DIADIC 244 +IE+ A G K L V+G D SFSGL T+ A +K + D+C Sbjct: 173 KIEELAEKG-----KEYIELPYTVKGM--DLSFSGLLTA------ALRAYKKGARLEDVC 219 Query: 245 ASFQVT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLH 300 S Q T +V + + L A K+ VL+V GGVA+N +R L + Sbjct: 220 YSLQETAFAMLVEVTERAL---------AHTGKKEVLLV-GGVAANRRLREMLELMAEDR 269 Query: 301 GFRFVAPPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSRWPLDE 348 G +F PP DN MIA+ L + G+ R RW DE Sbjct: 270 GAKFYVPPPEYAGDNGAMIAYTGLLMYKHGVTIPVEESRVRQRWRTDE 317 >gnl|CDD|184878 PRK14878, PRK14878, UGMP family protein; Provisional. Length = 323 Score = 173 bits (442), Expect = 5e-44 Identities = 115/346 (33%), Positives = 172/346 (49%), Gaps = 36/346 (10%) Query: 9 IGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILIK 68 +GIE++ V +V+ +LA + + + G G+ P AA+ H +V L++ Sbjct: 1 LGIESTAHTLGVGIVKEDK----VLANVRDTYVPEKG---GIHPREAAQHHAEVAPELLR 53 Query: 69 QTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTAR 128 + L +A + I D+D++AV+ GPGL L VG A+A++ +KP +NH HI R Sbjct: 54 KALEKAGISIEDIDAVAVSQGPGLGPALRVGATAARALALKYNKPLVPVNHCIAHIEIGR 113 Query: 129 LTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGV 188 LT P +VL VSGG+TQ+L R Y G T+D A+G D A+ +GL PGG Sbjct: 114 LTTGAKDP-VVLYVSGGNTQVLAFRG-GRYRVFGETLDIAIGNALDTFAREVGLAPPGGP 171 Query: 189 EIEKAALMGDGKRFKFPCPLVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIADICASFQ 248 IEK A G+ K + P V+G D SFSGL T+ + K+ + D+C S + Sbjct: 172 AIEKCAEKGE-KYIELPYV-VKGQ--DLSFSGLLTAALRL-----YKGKERLEDVCYSLR 222 Query: 249 VT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRF 304 T +V + + L A K+ VL+V GGVA+N +R L + G +F Sbjct: 223 ETAFAMLVEVTERAL---------AHTGKKEVLLV-GGVAANRRLREKLEIMAEDRGAKF 272 Query: 305 VAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSRWPLDE 348 P DN MIA+ L + G +P ++ + RW LDE Sbjct: 273 YVVPPEYAGDNGAMIAYTGLLAYKHGVTIPPEESFVRQ--RWRLDE 316 >gnl|CDD|181985 PRK09605, PRK09605, bifunctional UGMP family protein/serine/threonine protein kinase; Validated. Length = 535 Score = 170 bits (432), Expect = 6e-43 Identities = 120/348 (34%), Positives = 178/348 (51%), Gaps = 45/348 (12%) Query: 8 VIGIETSCDETAVAVVRRKDSHGEILA-EAVLSQIDQHGHYGGVVPEVAARAHVDVLDIL 66 V+GIE + +T+ +V DS G++L E+ + GG+ P AA H + + + Sbjct: 3 VLGIEGTAWKTSAGIV---DSDGDVLFNESDPYKPPS----GGIHPREAAEHHAEAIPKV 55 Query: 67 IKQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILT 126 IK+ L A ++ D+D +A + GPGL L V A+A++ P +NH H+ Sbjct: 56 IKEALEEAGLKPEDIDLVAFSQGPGLGPCLRVVATAARALALSLDVPLIGVNHCVAHVEI 115 Query: 127 ARLTDDIAFPYLVLLVSGGHTQILLVRDVAH----YDRLGTTIDDALGECFDKIAKSLGL 182 RLT P + L VSGG+TQ+L A+ Y G T+D +G DK A+ +GL Sbjct: 116 GRLTTGAEDP-VTLYVSGGNTQVL-----AYLNGRYRVFGETLDIGVGNALDKFARHVGL 169 Query: 183 PYPGGVEIEKAALMGDGKRFKFPCP-LVQGTLCDFSFSGLKTSVQKTICAFDVLEKQDIA 241 P+PGG +IEK A DGK++ P +V+G DFSFSGL T+ ++ A+D + + Sbjct: 170 PHPGGPKIEKLAK--DGKKY-IDLPYVVKGM--DFSFSGLLTAAKR---AYD--AGEPLE 219 Query: 242 DICASFQVT----VVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLIDLC 297 D+C S Q T + + + L A K VL+V GGVA+N+ +R L ++C Sbjct: 220 DVCYSLQETAFAMLTEVTERAL---------AHTGKDEVLLV-GGVAANNRLREMLKEMC 269 Query: 298 VLHGFRFVAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSR 343 G F P R C DN MIAW L +AG L +D ++P R Sbjct: 270 EERGADFYVPEPRFCGDNGAMIAWLGLLMYKAGDTLDIEDTRVNPNFR 317 >gnl|CDD|185566 PTZ00340, PTZ00340, O-sialoglycoprotein endopeptidase-like protein; Provisional. Length = 345 Score = 143 bits (362), Expect = 8e-35 Identities = 107/351 (30%), Positives = 160/351 (45%), Gaps = 33/351 (9%) Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67 +GIE S ++ V +V GEIL+ + I G G +P A+ H + + L+ Sbjct: 3 ALGIEGSANKLGVGIVTSD---GEILSNVRETYITPPGT--GFLPRETAQHHREHILSLV 57 Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLEGHILTA 127 K+ L A + SD+ I T GPG+ L VG + A+ +S + KP +NH HI Sbjct: 58 KEALEEAKITPSDISLICYTKGPGMGAPLSVGAVVARTLSLLWGKPLVGVNHCVAHIEMG 117 Query: 128 RLTDDIAFPYLVLLVSGGHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLP-YPG 186 RL P +VL VSGG+TQ++ + Y G TID A+G C D+ A+ L L P Sbjct: 118 RLVTGAENP-VVLYVSGGNTQVIAYSEH-RYRIFGETIDIAVGNCLDRFARLLNLSNDPA 175 Query: 187 -GVEIEKAALMGDGKRFKFPCP-LVQGTLCDFSFSGLKTSVQKTI----------CAFDV 234 G IE+ A GK P +V+G D SFSG+ T ++ + Sbjct: 176 PGYNIEQLA--KKGKNL-IELPYVVKGM--DMSFSGILTYIEDLVEHPQFKDVVSEIVPP 230 Query: 235 LEKQDIADICASFQVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNHFIRASLI 294 E+ D+C S Q T+ +L ++ + VL+V GGV N ++ + Sbjct: 231 EEEFFTDDLCFSLQETIFAMLVEVTERAM-----SHCGSNEVLIV-GGVGCNLRLQEMMQ 284 Query: 295 DLCVLHGFRFVAPPARLCTDNAVMIAWAALERMEAG--LPADDLSISPRSR 343 + G + A R C DN MIA+A L +G P D +++ R R Sbjct: 285 QMAKERGGKLFAMDERYCIDNGAMIAYAGLLEYLSGGFTPLKDATVTQRFR 335 >gnl|CDD|163437 TIGR03725, bact_YeaZ, universal bacterial protein YeaZ. This family describes a protein family, YeaZ, that appears to be universal in bacteria, but whose function is unknown. This family is related to the gcp (glycoprotease) protein family, also universal in bacteria and unknown in function. In Gram-positive lineages, members of these two related families often belong to the same operon, along with the ribosomal-protein-alanine acetyltransferase gene. Members of this family may occur as fusions with gcp or the ribosomal protein N-acetyltransferase rimI, and is frequently encoded next to rimI. Length = 202 Score = 58.0 bits (141), Expect = 4e-09 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 19/114 (16%) Query: 8 VIGIETSCDETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARAHVDVLDILI 67 ++ I+TS + +VA++ GEILAE E A R H ++L +I Sbjct: 1 ILAIDTSTEALSVALLD----DGEILAERS---------------EEAGRNHSEILLPMI 41 Query: 68 KQTLLRANMQISDMDSIAVTAGPGLMGGLIVGLMTAKAISYVSHKPFYAINHLE 121 ++ L A + + D+D+IAV GPG GL +GL TAK ++ P ++ LE Sbjct: 42 EELLAEAGLSLQDLDAIAVGVGPGSFTGLRIGLATAKGLALALGIPLVGVSSLE 95 >gnl|CDD|161730 TIGR00143, hypF, [NiFe] hydrogenase maturation protein HypF. A previously described regulatory effect of HypF mutatation is attributable to loss of activity of a regulatory hydrogenase. A zinc finger-like region CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported the regulatory hypothesis. However, more recent work (PUBMED:11375153) shows the direct effect is on the activity of expressed hydrogenases with nickel/iron centers, rather than on expression. Length = 711 Score = 34.0 bits (78), Expect = 0.055 Identities = 38/161 (23%), Positives = 58/161 (36%), Gaps = 27/161 (16%) Query: 156 AHYDRLGTTIDDALGECFDKIAKSLGL----PYPG--GVEIEKAALMGDGKRFKFPCPLV 209 A ++ + + G FD +A +LGL Y G + +E AL DG +P + Sbjct: 550 ALEKKINAPLTTSTGRLFDAVAAALGLCGERTYEGEAAIALEALALRSDGIAN-YPFEIK 608 Query: 210 QGTLCDFSFSGLKTSVQKTICAFDVLE-KQDIADICASFQVTVVRILQARLKQGF--LLF 266 L LK Q+ + V E + IA I + G + Sbjct: 609 NKVLD------LKEFYQRFLEDLLVGEDRSKIA-----------HIAHKFVASGLVEIAT 651 Query: 267 RKAFPHKQAVLVVSGGVASNHFIRASLIDLCVLHGFRFVAP 307 A P +V+SGGV N + L GF+F+ Sbjct: 652 AIAVPFGIHKIVISGGVFYNRLLLERLAKYLKGLGFQFLFH 692 >gnl|CDD|178420 PLN02825, PLN02825, amino-acid N-acetyltransferase. Length = 515 Score = 32.4 bits (74), Expect = 0.21 Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 8/51 (15%) Query: 266 FRKAFPHKQA------VLVVSGGVASNHFIRASLIDLCVLH--GFRFVAPP 308 FR+A+P+ Q V+V+SG V + + L D+ +LH G +FV P Sbjct: 5 FREAWPYIQGHRGSTFVVVISGEVVAGPHLDNILQDISLLHGLGIKFVLVP 55 >gnl|CDD|162802 TIGR02304, aden_form_hyp, probable adenylate-forming enzyme. Members of this family form a distinct clade within a larger family of proteins that also includes coenzyme F390 synthetase, an enzyme known in Methanobacterium thermoautotrophicum and a few other methanogenic archaea. That enzyme adenylates coenzyme F420 to F390, a reversible process, during oxygen stress. Other informative homologies include domains of the non-ribosomal peptide synthetases involved in activation by adenylation. The family defined by this model is likely to be of an adenylate-forming enzyme related to but distinct from coenzyme F390 synthetase. Length = 430 Score = 32.2 bits (73), Expect = 0.21 Identities = 18/44 (40%), Positives = 23/44 (52%), Gaps = 2/44 (4%) Query: 221 LKTSVQKTICAFDVLEKQDIADICASFQVTVVRILQARLKQGFL 264 L +K I +VLE QD I F+ TV +I QA +GFL Sbjct: 210 LTIKPKKVISVAEVLEPQDRELIRNVFKNTVHQIYQA--TEGFL 251 >gnl|CDD|162394 TIGR01507, hopene_cyclase, squalene-hopene cyclase. SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products. Length = 635 Score = 29.9 bits (67), Expect = 1.1 Identities = 13/30 (43%), Positives = 17/30 (56%) Query: 320 AWAALERMEAGLPADDLSISPRSRWPLDEK 349 A A L EAGLPAD ++ W LD++ Sbjct: 320 ALAVLALREAGLPADHDALVKAGEWLLDKQ 349 >gnl|CDD|179249 PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase II; Provisional. Length = 724 Score = 28.9 bits (66), Expect = 2.1 Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 16/61 (26%) Query: 43 QHGHYGGVVPEV---AARAHVDVLDILIKQTLLRANMQISDMDSIAVTAGPGLMGGLIVG 99 HGH GG P+V A + +++ I++ L+ + +S+ GGL V Sbjct: 583 IHGHVGGRPPKVDLEAEKRLQELVREAIREGLVTSAHDVSE-------------GGLAVA 629 Query: 100 L 100 L Sbjct: 630 L 630 >gnl|CDD|171495 PRK12434, PRK12434, tRNA pseudouridine synthase A; Reviewed. Length = 245 Score = 28.2 bits (63), Expect = 3.3 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%) Query: 215 DF-SFSGLKT---SVQKTICAFDVLEKQDIADICAS---FQVTVVRIL 255 DF SF+ K+ S + I + D++E+ I S F +VRI+ Sbjct: 153 DFTSFTNAKSKKKSTVREIYSIDIMEEDGFVQIRVSGNGFLHNMVRII 200 >gnl|CDD|178307 PLN02705, PLN02705, beta-amylase. Length = 681 Score = 28.4 bits (63), Expect = 3.3 Identities = 14/32 (43%), Positives = 15/32 (46%), Gaps = 5/32 (15%) Query: 188 VEIEKAALMGDGKRFKF-----PCPLVQGTLC 214 +EI K D F F P PLVQGT C Sbjct: 630 IEIAKPRNHPDHYHFSFFVYQQPSPLVQGTTC 661 >gnl|CDD|129575 TIGR00484, EF-G, translation elongation factor EF-G. After peptide bond formation, this elongation factor of bacteria and organelles catalyzes the translocation of the tRNA-mRNA complex, with its attached nascent polypeptide chain, from the A-site to the P-site of the ribosome. Every completed bacterial genome has at least one copy, but some species have additional EF-G-like proteins. The closest homolog to canonical (e.g. E. coli) EF-G in the spirochetes clusters as if it is derived from mitochondrial forms, while a more distant second copy is also present. Synechocystis PCC6803 has a few proteins more closely related to EF-G than to any other characterized protein. Two of these resemble E. coli EF-G more closely than does the best match from the spirochetes; it may be that both function as authentic EF-G. Length = 689 Score = 28.2 bits (63), Expect = 3.4 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 10/100 (10%) Query: 248 QVTVVRILQARLKQGFLLFRKAFPHKQAVLVVSGGVASNH----FIRASLIDLCVLHGFR 303 Q+T VR+ LK G + K+ V + A+N +RA D+C G + Sbjct: 323 QLTFVRVYSGVLKSGSYVKNSRKNKKERVGRLVKMHANNREEIKEVRAG--DICAAIGLK 380 Query: 304 FVAPPARLCTDNAVMIAWAALERMEAGLPADDLSISPRSR 343 LC +I LERME P L++ P+++ Sbjct: 381 DTTTGDTLCDPKIDVI----LERMEFPEPVISLAVEPKTK 416 >gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase. Length = 481 Score = 27.6 bits (61), Expect = 4.8 Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 2/48 (4%) Query: 318 MIAWA--ALERMEAGLPADDLSISPRSRWPLDEKAPSKIGSGKRGVKA 363 MIAW A + M A L +D+L I+ RS P + + SKI + R V Sbjct: 379 MIAWPLFAEQNMNAALLSDELGIAVRSDDPKEVISRSKIEALVRKVMV 426 >gnl|CDD|129097 smart00864, Tubulin, Tubulin/FtsZ family, GTPase domain. This domain is found in all tubulin chains, as well as the bacterial FtsZ family of proteins. These proteins are involved in polymer formation. Tubulin is the major component of microtubules, while FtsZ is the polymer-forming protein of bacterial cell division, it is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases, this entry is the GTPase domain. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in bacteria and archaea. Length = 192 Score = 27.8 bits (63), Expect = 4.9 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 11/51 (21%) Query: 45 GHYGGVVPEVAARAHVDVLDILIKQTLLRANMQISDMDSIAVTAGPGLMGG 95 G G PEV A + LD I++ L A+ + +TAG MGG Sbjct: 55 GLGAGADPEVGREAAEESLDE-IREELEGADG-------VFITAG---MGG 94 >gnl|CDD|178547 PLN02962, PLN02962, hydroxyacylglutathione hydrolase. Length = 251 Score = 27.5 bits (61), Expect = 6.0 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 5/52 (9%) Query: 145 GHTQILLVRDVAHYDRLGTTIDDALGECFDKIAKSLGLPYPGGVEIEKAALM 196 G T + ++ + RL T D+ E F I ++L LPYP +++ A M Sbjct: 195 GFTVSTVGEEMLYNPRL--TKDE---ETFKTIMENLNLPYPKMIDVAVPANM 241 >gnl|CDD|184195 PRK13635, cbiO, cobalt transporter ATP-binding subunit; Provisional. Length = 279 Score = 27.3 bits (61), Expect = 6.5 Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 6/43 (13%) Query: 7 TVIGIETSCDETAVA---VVRRKDSHGEILAEAVLSQIDQHGH 46 TV+ I DE A A +V K GEIL E +I + GH Sbjct: 193 TVLSITHDLDEAAQADRVIVMNK---GEILEEGTPEEIFKSGH 232 >gnl|CDD|181133 PRK07819, PRK07819, 3-hydroxybutyryl-CoA dehydrogenase; Validated. Length = 286 Score = 27.3 bits (61), Expect = 7.3 Identities = 13/30 (43%), Positives = 15/30 (50%) Query: 45 GHYGGVVPEVAARAHVDVLDILIKQTLLRA 74 G G + EV ARA VDVL + L A Sbjct: 14 GQMGAGIAEVCARAGVDVLVFETTEELATA 43 >gnl|CDD|131857 TIGR02810, agaZ_gatZ, D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit. Aldolases specific for D-tagatose-bisphosphate occur in distinct pathways in Escherichia coli and other bacteria, one for the degradation of galactitol (formerly dulcitol) and one for degradation of N-acetyl-galactosamine and D-galactosamine. This family represents a protein of both systems that behaves as a non-catalytic subunit of D-tagatose-bisphosphate aldolase, required both for full activity and for good stability of the aldolase. Note that members of this protein family appear in public databases annotated as putative tagatose 6-phosphate kinases, possibly in error. Length = 420 Score = 27.0 bits (60), Expect = 8.7 Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 8/56 (14%) Query: 9 IGIETSCD------ETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVPEVAARA 58 GI + C E A+ R + +L EA +Q++Q G Y G+ P Sbjct: 13 RGIYSVCSAHPLVLEAAIRRARASGT--PVLIEATSNQVNQFGGYTGMTPADFRDF 66 >gnl|CDD|179206 PRK01022, PRK01022, hypothetical protein; Provisional. Length = 167 Score = 26.8 bits (60), Expect = 9.8 Identities = 13/43 (30%), Positives = 23/43 (53%), Gaps = 13/43 (30%) Query: 180 LGLPYPGGVEIEKAALMGDGKRFKFPCPL------VQGTLCDF 216 +G P+PGG ++E+ +G + CP+ V +LC+F Sbjct: 123 VGTPFPGGFKVEEK----NG---VYYCPVKDKQKDVDFSLCEF 158 >gnl|CDD|185355 PRK15458, PRK15458, tagatose 6-phosphate aldolase subunit KbaZ; Provisional. Length = 426 Score = 26.6 bits (59), Expect = 9.9 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 2/36 (5%) Query: 17 ETAVAVVRRKDSHGEILAEAVLSQIDQHGHYGGVVP 52 E A+ DS +L EA +Q+DQ G Y G+ P Sbjct: 31 EAAIRYALANDS--PLLIEATSNQVDQFGGYTGMTP 64 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.324 0.139 0.411 Gapped Lambda K H 0.267 0.0766 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 6,010,246 Number of extensions: 399614 Number of successful extensions: 788 Number of sequences better than 10.0: 1 Number of HSP's gapped: 752 Number of HSP's successfully gapped: 32 Length of query: 363 Length of database: 5,994,473 Length adjustment: 95 Effective length of query: 268 Effective length of database: 3,941,713 Effective search space: 1056379084 Effective search space used: 1056379084 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 58 (26.0 bits)