RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780479|ref|YP_003064892.1| A/G-specific adenine glycosylase [Candidatus Liberibacter asiaticus str. psy62] (356 letters) >gnl|CDD|31387 COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]. Length = 342 Score = 334 bits (857), Expect = 2e-92 Identities = 139/350 (39%), Positives = 193/350 (55%), Gaps = 19/350 (5%) Query: 3 QPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFK 62 Q +LDWYD N R LPWR + PY+VW+SEIMLQQT V TV PY++ Sbjct: 6 GDIEKFQEALLDWYDKNGRDLPWRETK--------DPYRVWVSEIMLQQTQVATVIPYYE 57 Query: 63 KFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKK 122 +F++++PTI L++A ++E+L AW GLGYY+RARNL K A +V+++ G FP E L Sbjct: 58 RFLERFPTIKALAAAPEDEVLKAWEGLGYYSRARNLHKAAQEVVERHGGEFPDDEEELAA 117 Query: 123 LPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYH--KTIKNYARK-IT 179 LPG+G YTA AI++ AFN V+D N++R++SR F I K + A + +T Sbjct: 118 LPGVGPYTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDIGKPKTKKELWELAEQLLT 177 Query: 180 STSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRT 239 RPGDF QAMMDLGA ICT+ KP C LCP++ NC + G + KKK P R Sbjct: 178 PDRRPGDFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLPRR- 236 Query: 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTI 299 A F+ + D +LL KR LL G+ P + D + A + Sbjct: 237 FAAFLILNRDGEVLLEKRPEKGLLGGLWCFPQFEDEA--DLLDWLAADGLAAEPL--GAF 292 Query: 300 THTFTHFTLTLFVWKTIVPQIVIIPDSTWHDAQNLANAALPTVMKKALSA 349 HTFTHF LT+ + + ++ D W++ +L + LP +KK L Sbjct: 293 RHTFTHFRLTIELRASASL---VLSDGRWYNLSDLESIGLPAPVKKLLQQ 339 >gnl|CDD|37668 KOG2457, KOG2457, KOG2457, A/G-specific adenine DNA glycosylase [Replication, recombination and repair]. Length = 555 Score = 216 bits (551), Expect = 7e-57 Identities = 126/408 (30%), Positives = 180/408 (44%), Gaps = 70/408 (17%) Query: 8 IQSKILDWYDTNHRVLPWRTSP-KTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ 66 + +LDWYD N R LPWR + EK Y+VW+SEIMLQQT V+TV Y+ ++MQ Sbjct: 93 FRMSLLDWYDVNKRDLPWRNRRSEDEKDWERRAYEVWVSEIMLQQTRVQTVMKYYTRWMQ 152 Query: 67 KWPTIFCLSSAK-DEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKK-LP 124 KWPT++ L+ A ++E+ WAGLGYY RAR L + A ++V EG FP L K +P Sbjct: 153 KWPTLYDLAQASLEKEVNELWAGLGYYRRARRLLEGAKMVVAGTEGEFPRTASSLMKGVP 212 Query: 125 GIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK--PAPLYHKTIKNYARKITSTS 182 G+G YTA AI +IAFN +VD N+ R++SR I + ++ A ++ S Sbjct: 213 GVGQYTAGAIASIAFNQVTGIVDGNVIRVLSRALAIHSDCSKGKFFQSSWKLAAQLVDPS 272 Query: 183 RPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHL----------------- 225 RPGDF QA+M+LGA +CT KP C CP+ C F + + Sbjct: 273 RPGDFNQALMELGATLCTPQKPSCSSCPVSSQCRAFQLSEQNRTISVGDTYPTDVEKAKP 332 Query: 226 ------LGINTIKKKRPMRTGAVFIAI------------------------TNDNRILLR 255 + + I ++G VF+ T N IL+ Sbjct: 333 RHDFCCVCVLEIPSLERNQSGGVFVLPKRPEQLAGREEFPSVILNEEADPATRRNAILVY 392 Query: 256 KRTNTRLLEGMDELPGSAWSSTKDG-NIDTHSAPFTANWILCNTIT------------HT 302 R LL G+ + P + +I H W+L +T HT Sbjct: 393 LRPAFGLLAGLWKFPTIVSRELSEFVHIFAHIQRKVYVWLLVVQLTGGTEDLFKGQAKHT 452 Query: 303 FTHFTLTLFVWKTIV----PQIVIIPDSTWHDAQNLAN-AALPTVMKK 345 F+H +T V T V W +L + ++K Sbjct: 453 FSHICVTSDVLSTAGLTSAVVTVPPFRLQWIKQLSLDHMVEKEQILKC 500 >gnl|CDD|28938 cd00056, ENDO3c, endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases. Length = 158 Score = 148 bits (374), Expect = 3e-36 Identities = 51/160 (31%), Positives = 86/160 (53%), Gaps = 6/160 (3%) Query: 40 YKVWISEIMLQQTTVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYTRARNL 98 ++V +SEI+ QQTT K V +++ +++ PT L++A +EE+ LGY +A+ L Sbjct: 1 FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYL 60 Query: 99 KKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIIS 155 K+ A IV+ + G + P E L LPG+G TA+ ++ A A VDT++ R++ Sbjct: 61 KELARAIVEGFGGLVLDDPDAREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLK 120 Query: 156 RYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG 195 R I K + ++ ++ G+ QA+MDLG Sbjct: 121 RLGLIPKKKT--PEELEELLEELLPKPYWGEANQALMDLG 158 >gnl|CDD|30526 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]. Length = 211 Score = 116 bits (291), Expect = 1e-26 Identities = 50/179 (27%), Positives = 89/179 (49%), Gaps = 4/179 (2%) Query: 38 SPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTR-AR 96 P+++ ++ I+ QTT + V +++PT L +A +EE+ +G Y A+ Sbjct: 28 DPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLNADEEELEELIKSIGLYRNKAK 87 Query: 97 NLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISR 156 N+K+ A I+++K+ G P E L LPG+G TA+ +++ AF A+ VDT++ R+ +R Sbjct: 88 NIKELARILLEKFGGEVPDTREELLSLPGVGRKTANVVLSFAFGIPAIAVDTHVHRVSNR 147 Query: 157 YFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNC 215 + P + + + I D ++ G IC + KP C CP+ C Sbjct: 148 LG--LVPGKTPEEVEEALMKLIPKELWT-DLHHWLILHGRYICKARKPRCEECPLADLC 203 >gnl|CDD|144361 pfam00730, HhH-GPD, HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family. Length = 144 Score = 108 bits (272), Expect = 2e-24 Identities = 43/143 (30%), Positives = 71/143 (49%), Gaps = 9/143 (6%) Query: 44 ISEIMLQQTTVKTVEPYFKKFMQKW--PTIFCLSSAKDEEILSAWAGLGYY-TRARNLKK 100 +S I+ QQT+ K K+ +++ PT L+ A +EE+ GLG+Y +A+ +K+ Sbjct: 1 VSAILSQQTSDKAANKITKRLFERYGFPTPEDLAEADEEELRELIKGLGFYRRKAKYIKE 60 Query: 101 CADIIVKKYEGNFPHKV---EILKKLPGIGDYTASAIVAIAFNHF--AVVVDTNIERIIS 155 A I+V+ Y G P + E L LPG+G +TA A++ A VDT++ R+ Sbjct: 61 LARILVEGYLGLVPLDLEELEALLALPGVGRWTAEAVLLFALGRPDVFPAVDTHVRRVAK 120 Query: 156 RYFDIIKPAPLYHKTIKNYARKI 178 R +IK P + + Sbjct: 121 RLG-LIKEKPTPEEVERELEELW 142 >gnl|CDD|72889 cd03431, DNA_Glycosylase_C, DNA glycosylase (MutY in bacteria and hMYH in humans) is responsible for repairing misread A*oxoG residues to C*G by removing the inappropriately paired adenine base from the DNA backbone. It belongs to the Nudix hydrolase superfamily and is important for the repair of various genotoxic lesions. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity. They are also recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA glycosylase does not seem to contain this signature motif. DNA glycosylase consists of 2 domains: the N-terminal domain contains the catalytic properties of the enzyme and the C-terminal domain affects substrate (oxoG) binding and enzymatic turnover. The C-terminal domain is highly similar to MutT, based on secondary structure and topology, despite low sequence identity. MutT sanitizes the nucleotide precursor pool by hydrolyzing oxo-dGTP to oxo-dGMO and inorganic pyrophosphate. The similarity strongly suggests that the two proteins share a common evolutionary origin.. Length = 118 Score = 88.5 bits (219), Expect = 3e-18 Identities = 42/116 (36%), Positives = 54/116 (46%), Gaps = 4/116 (3%) Query: 238 RTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL-- 295 R G + I ND R+LL KR LL G+ E P W DG SA A + Sbjct: 2 RRGIAVVVIRNDGRVLLEKRPEKGLLAGLWEFPSVEWEEEADGEEALLSALKKALRLSLE 61 Query: 296 -CNTITHTFTHFTLTLFVWKTIVPQ-IVIIPDSTWHDAQNLANAALPTVMKKALSA 349 T+ HTFTHF LTL V+ + ++ + W + L ALPTVM+K L Sbjct: 62 PLGTVKHTFTHFRLTLHVYLARLEGDLLAPDEGRWVPLEELDEYALPTVMRKILEL 117 >gnl|CDD|37132 KOG1921, KOG1921, KOG1921, Endonuclease III [Replication, recombination and repair]. Length = 286 Score = 54.2 bits (130), Expect = 5e-08 Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 11/131 (8%) Query: 89 LGYYTR-ARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFA-VVV 146 +G+YTR A+ LKK A I+ K++G+ P VE L LPG+G A + +A+N + V Sbjct: 129 VGFYTRKAKYLKKTAKILQDKFDGDIPDTVEDLLSLPGVGPKMAHLTMQVAWNKIVGICV 188 Query: 147 DTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDL----GALICTSN 202 DT++ RI +R + +T R P + L G ICT Sbjct: 189 DTHVHRICNRLGWVDTKTKSPEQT-----RVALQQWLPKSLWVEINHLLVGFGQTICTPR 243 Query: 203 KPLCPLCPIQK 213 +P C LC + + Sbjct: 244 RPRCGLCLLSR 254 >gnl|CDD|32412 COG2231, COG2231, Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]. Length = 215 Score = 51.8 bits (124), Expect = 3e-07 Identities = 43/187 (22%), Positives = 75/187 (40%), Gaps = 8/187 (4%) Query: 40 YKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFC-LSSAKDEEILSAWAGLGYY-TRARN 97 ++ I I+ Q T+ K VE + + + +EE+ G+Y +A+ Sbjct: 31 DEIIIGAILTQNTSWKNVEKALENLKNEGILNLKKILKLDEEELAELIRPSGFYNQKAKR 90 Query: 98 LKKCADIIVKKYEGNFPHK----VEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERI 153 LK + + K + K E L + GIG TA +I+ A + VVD R+ Sbjct: 91 LKALSKNLAKFFINLESFKSEVLREELLSIKGIGKETADSILLYALDRPVFVVDKYTRRL 150 Query: 154 ISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQK 213 +SR I + Y + + + + R A++ A KPLC CP+++ Sbjct: 151 LSRLGGIEEKK--YDEIKELFEENLPENLRLYQEFHALIVEHAKHFCKKKPLCEKCPLKE 208 Query: 214 NCLTFSE 220 C + Sbjct: 209 KCKKYRR 215 >gnl|CDD|30471 COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]. Length = 285 Score = 45.8 bits (108), Expect = 2e-05 Identities = 28/155 (18%), Positives = 50/155 (32%), Gaps = 20/155 (12%) Query: 23 LPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAK---- 78 LP +P P P++ + I+ QQ +V + + + + + + Sbjct: 89 LPLLRAPGLRLPLAPDPFEALVRAILSQQVSVAAAAKIWARLVSLYGNALEIYHSFPTPE 148 Query: 79 -----DEEILSAWAGLGYYTRARNLKKCADIIVKK-------YEGNFPHKVEILKKLPGI 126 DEE L G +A + A + + +E L L GI Sbjct: 149 QLAAADEEALRRCGLSGR--KAEYIISLARAAAEGELDLSELKPLSDEEAIEELTALKGI 206 Query: 127 GDYTASAIVAIAFNHFAV--VVDTNIERIISRYFD 159 G +TA + V D + R I + + Sbjct: 207 GPWTAEMFLLFGLGRPDVFPADDLGLRRAIKKLYR 241 >gnl|CDD|38086 KOG2875, KOG2875, KOG2875, 8-oxoguanine DNA glycosylase [Replication, recombination and repair]. Length = 323 Score = 43.4 bits (102), Expect = 1e-04 Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 14/102 (13%) Query: 68 WPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGN----------FPHKV 117 +P++ L+ + E L G GY RA+ + A + +K G + Sbjct: 161 FPSLQALAGPEVEAELRK-LGFGY--RAKYISATARALQEKQGGLAWLQSLRKSSYEEAR 217 Query: 118 EILKKLPGIGDYTASAIVAIAF-NHFAVVVDTNIERIISRYF 158 E L LPG+G A I ++ AV VD +I RI Y Sbjct: 218 EALCSLPGVGPKVADCICLMSLDKLSAVPVDVHIWRIAQDYI 259 >gnl|CDD|109681 pfam00633, HHH, Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain. Length = 30 Score = 39.3 bits (93), Expect = 0.002 Identities = 13/30 (43%), Positives = 16/30 (53%) Query: 108 KYEGNFPHKVEILKKLPGIGDYTASAIVAI 137 EG P E L LPG+G TA AI++ Sbjct: 1 SLEGLIPASREELLALPGVGPKTAEAILSY 30 >gnl|CDD|72883 cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi). This enzyme requires two divalent cations for activity; one coordinates the phosphoryl groups of the NTP/dNTP substrate, and the other coordinates to the enzyme. It also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as metal binding and catalytic site. MutT pyrophosphohydrolase is important in preventing errors in DNA replication by hydrolyzing mutagenic nucleotides such as 8-oxo-dGTP (a product of oxidative damage), which can mispair with template adenine during DNA replication, to guanine nucleotides.. Length = 124 Score = 34.0 bits (78), Expect = 0.066 Identities = 26/118 (22%), Positives = 43/118 (36%), Gaps = 26/118 (22%) Query: 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTA--------- 291 I I +D RIL+ +R + L G+ E PG G ++ P A Sbjct: 4 VAAIIIDDDGRILIAQRPAGKHLGGLWEFPG--------GKVEPGETPEQALVRELREEL 55 Query: 292 -----NWILCNTITHTFTHFTLTLFVWKTIV----PQIVIIPDSTWHDAQNLANAALP 340 L T+ H + +TL V+ + PQ++ + W + L + P Sbjct: 56 GIEVEVGELLATVEHDYPDKRVTLHVFLVELWSGEPQLLEHQELRWVPPEELDDLDFP 113 >gnl|CDD|144032 pfam00293, NUDIX, NUDIX domain. Length = 133 Score = 29.8 bits (67), Expect = 1.3 Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 1/35 (2%) Query: 238 RTGAVFIAITN-DNRILLRKRTNTRLLEGMDELPG 271 R AV + + N D +LL +R+ T G+ ELPG Sbjct: 1 RRRAVGVVLLNEDGEVLLVRRSRTPGTPGLWELPG 35 >gnl|CDD|31744 COG1555, ComEA, DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]. Length = 149 Score = 28.5 bits (63), Expect = 2.7 Identities = 10/20 (50%), Positives = 12/20 (60%) Query: 117 VEILKKLPGIGDYTASAIVA 136 E L+ LPGIG A AI+ Sbjct: 96 AEELQALPGIGPKKAQAIID 115 >gnl|CDD|30358 COG0008, GlnS, Glutamyl- and glutaminyl-tRNA synthetases [Translation, ribosomal structure and biogenesis]. Length = 472 Score = 28.4 bits (63), Expect = 2.9 Identities = 13/75 (17%), Positives = 21/75 (28%), Gaps = 18/75 (24%) Query: 141 HFAVVVDTNIERI------------------ISRYFDIIKPAPLYHKTIKNYARKITSTS 182 +FAVVVD ++ I + P + + N K S Sbjct: 195 NFAVVVDDHLMGITHVLRGEDHLDNTPRQIWLYEALGWPPPVYAHLPLLLNEDGKKLSKR 254 Query: 183 RPGDFVQAMMDLGAL 197 + + G L Sbjct: 255 KGAVSIGEYRVEGWL 269 >gnl|CDD|37217 KOG2006, KOG2006, KOG2006, WD40 repeat protein [General function prediction only]. Length = 1023 Score = 28.0 bits (62), Expect = 3.6 Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 4/31 (12%) Query: 147 DTNIERIISRYFDIIKPAPLYHKTIKNYARK 177 D + +I D I+ P + KT+ N ARK Sbjct: 719 DKLLPKI----IDFIREFPEFLKTVVNCARK 745 >gnl|CDD|145490 pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. This region is often found with pfam01548. The exact function of this region is uncertain. This family contains a HHH motif suggesting a DNA-binding function. Length = 87 Score = 28.3 bits (64), Expect = 3.7 Identities = 10/20 (50%), Positives = 15/20 (75%) Query: 117 VEILKKLPGIGDYTASAIVA 136 E+L +PGIG TA+A++A Sbjct: 1 AELLLSIPGIGPITAAALLA 20 >gnl|CDD|39373 KOG4171, KOG4171, KOG4171, Adenylate/guanylate kinase [Nucleotide transport and metabolism]. Length = 671 Score = 27.6 bits (61), Expect = 6.2 Identities = 15/57 (26%), Positives = 31/57 (54%), Gaps = 6/57 (10%) Query: 295 LCNTITHTFTHFTLTLFVWK--TIVPQIVI---IPDSTWHDAQNLANAALPTVMKKA 346 + N + F T V+K TI ++ +PD++ + A+++A+ AL +M++A Sbjct: 468 MLNELYTRFDRIIDTHDVYKVETIGDAYMVVSGLPDASDYHAEHIADLAL-GMMEEA 523 >gnl|CDD|33463 COG3665, COG3665, Uncharacterized conserved protein [Function unknown]. Length = 264 Score = 27.2 bits (60), Expect = 7.9 Identities = 14/39 (35%), Positives = 18/39 (46%), Gaps = 4/39 (10%) Query: 179 TSTSRPGDFVQ--AMMDLGALICTSNKPLCPLCPIQKNC 215 T SRPGD+V+ A MDL ++ S P I Sbjct: 203 TPPSRPGDYVELRAEMDL--IVVLSACPGGNDPAIGWAP 239 >gnl|CDD|144985 pfam01601, Corona_S2, Coronavirus S2 glycoprotein. The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2. Length = 609 Score = 26.9 bits (60), Expect = 8.1 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 23/66 (34%) Query: 87 AGLGYYTRARNLKKC------ADII-VKKYEGNFPHKVEILKKLPGIGD------YTASA 133 +GLG T + KKC AD+I + Y G + LP + D YTAS Sbjct: 162 SGLG--TVDEDYKKCTGGGSIADLICAQYYNG--------IMVLPPVVDAEMMAMYTASL 211 Query: 134 IVAIAF 139 + A+A Sbjct: 212 VGAMAL 217 >gnl|CDD|30702 COG0353, RecR, Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]. Length = 198 Score = 26.7 bits (59), Expect = 9.2 Identities = 9/28 (32%), Positives = 14/28 (50%) Query: 107 KKYEGNFPHKVEILKKLPGIGDYTASAI 134 Y ++ LKKLPG+G +A + Sbjct: 1 MMYPPPIEKLIDALKKLPGVGPKSAQRL 28 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.135 0.421 Gapped Lambda K H 0.267 0.0758 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 4,485,476 Number of extensions: 234637 Number of successful extensions: 557 Number of sequences better than 10.0: 1 Number of HSP's gapped: 533 Number of HSP's successfully gapped: 27 Length of query: 356 Length of database: 6,263,737 Length adjustment: 95 Effective length of query: 261 Effective length of database: 4,210,882 Effective search space: 1099040202 Effective search space used: 1099040202 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 58 (26.1 bits)