RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780858|ref|YP_003065271.1| NADH dehydrogenase I subunit F [Candidatus Liberibacter asiaticus str. psy62] (425 letters) >gnl|CDD|184170 PRK13596, PRK13596, NADH dehydrogenase I subunit F; Provisional. Length = 433 Score = 909 bits (2352), Expect = 0.0 Identities = 333/422 (78%), Positives = 365/422 (86%), Gaps = 1/422 (0%) Query: 1 MLTDQDRIFTNLYGLQGKSLSDSMSRGHWDNVDKILEKGRDWIINEVKASGLRGRGGAGF 60 ML D+DRIFTNLYGLQ SL + RG WD IL+KGRDWII E+KASGLRGRGGAGF Sbjct: 1 MLKDKDRIFTNLYGLQDWSLKGAKKRGDWDGTKAILDKGRDWIIEEMKASGLRGRGGAGF 60 Query: 61 STGMKWSFMPKVCSDRPHYLVVNADESEPGTCKDRDIMRHEPHTLIEGCVIASFAIGAHC 120 TG+KWSFMPK RPHYLVVNADESEPGTCKDRDI+RH+PH LIEGC+IASFA+GAH Sbjct: 61 PTGLKWSFMPKESDGRPHYLVVNADESEPGTCKDRDILRHDPHKLIEGCLIASFAMGAHA 120 Query: 121 AFIYVRGEFIRERESLQAAVDECYASGLLGSN-SKLGYDVDIIVHHGAGAYICGEETALL 179 A+IY+RGEFIRERE+LQAA+DE Y +GL+G N G+D DI VHHGAGAYICGEETALL Sbjct: 121 AYIYIRGEFIREREALQAAIDEAYEAGLIGKNACGSGWDFDIYVHHGAGAYICGEETALL 180 Query: 180 ESLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGASWYSGFGRENNRGT 239 ESLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGA+W++ GR NN GT Sbjct: 181 ESLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGAAWFASIGRPNNTGT 240 Query: 240 KLFSISGHVNYPCTVEESMSITFDELIEKHCGGIRGGWDNLLAVIPGGSSVPCLPAGQMR 299 KLF ISGHVN PC VEE+M I F ELIEKH GG+RGGWDNLLAVIPGGSSVP +PA Q Sbjct: 241 KLFCISGHVNKPCNVEEAMGIPFRELIEKHAGGVRGGWDNLLAVIPGGSSVPLIPAEQCE 300 Query: 300 GAIMDYDGLKEMGSGLGTAAVIVMDRSTDIIKAIWRLSVFYKHESCGQCTPCREGTGWMM 359 AIMD+D L+ +GSGLGTAAVIVMD+STDIIKAI RLS FYKHESCGQCTPCREGTGWM Sbjct: 301 DAIMDFDSLRAVGSGLGTAAVIVMDKSTDIIKAIARLSYFYKHESCGQCTPCREGTGWMW 360 Query: 360 RVMERLVKGIAQKREIDLLYEVSKNIEGRTICALGDAAAWPIQGLIKNFRPLIEERIDQY 419 RVMER+VKG AQKREID+L +V+K IEG TICALGDAAAWPIQGLI++FRP IEERIDQY Sbjct: 361 RVMERMVKGRAQKREIDMLLDVTKQIEGHTICALGDAAAWPIQGLIRHFRPEIEERIDQY 420 Query: 420 HR 421 Sbjct: 421 TA 422 >gnl|CDD|178678 PLN03132, PLN03132, NADH dehydrogenase (ubiquinone) flavoprotein 1; Provisional. Length = 461 Score = 717 bits (1851), Expect = 0.0 Identities = 287/418 (68%), Positives = 338/418 (80%), Gaps = 1/418 (0%) Query: 2 LTDQDRIFTNLYGLQGKSLSDSMSRGHWDNVDKILEKGRDWIINEVKASGLRGRGGAGFS 61 L D+DRIFTNLYGL L +M RG W ++ KG DWI+NE+K SGLRGRGGAGF Sbjct: 33 LKDEDRIFTNLYGLHDPFLKGAMKRGDWHRTKDLVLKGPDWIVNEMKKSGLRGRGGAGFP 92 Query: 62 TGMKWSFMPKVCSDRPHYLVVNADESEPGTCKDRDIMRHEPHTLIEGCVIASFAIGAHCA 121 +G+KWSFMPKV RP YLVVNADESEPGTCKDR+IMRH+PH L+EGC+IA + A A Sbjct: 93 SGLKWSFMPKVSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRARAA 152 Query: 122 FIYVRGEFIRERESLQAAVDECYASGLLGSNS-KLGYDVDIIVHHGAGAYICGEETALLE 180 +IY+RGE++ ER +L+ A E YA+GLLG N+ GYD D+ +H+GAGAYICGEETALLE Sbjct: 153 YIYIRGEYVNERLNLERARHEAYAAGLLGKNACGSGYDFDVYIHYGAGAYICGEETALLE 212 Query: 181 SLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGASWYSGFGRENNRGTK 240 SLEGK+G+PRLKPPFPANVGLYGCPTTV NVE++AV PTILRRG W++ FGR+NN GTK Sbjct: 213 SLEGKQGKPRLKPPFPANVGLYGCPTTVTNVETVAVSPTILRRGPEWFASFGRKNNAGTK 272 Query: 241 LFSISGHVNYPCTVEESMSITFDELIEKHCGGIRGGWDNLLAVIPGGSSVPCLPAGQMRG 300 LF ISGHVN PCTVEE MSI ELIE+HCGG+RGGWDNLLA+IPGGSSVP LP Sbjct: 273 LFCISGHVNKPCTVEEEMSIPLKELIERHCGGVRGGWDNLLAIIPGGSSVPLLPKKICDD 332 Query: 301 AIMDYDGLKEMGSGLGTAAVIVMDRSTDIIKAIWRLSVFYKHESCGQCTPCREGTGWMMR 360 +MD+D LK + SGLGTAAVIVMD+STD++ AI RLS FYKHESCGQCTPCREGTGW+ Sbjct: 333 VLMDFDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWD 392 Query: 361 VMERLVKGIAQKREIDLLYEVSKNIEGRTICALGDAAAWPIQGLIKNFRPLIEERIDQ 418 +MER+ G A+ EID+L EV+K IEG TICALGDAAAWP+QGLI++FRP +E RI + Sbjct: 393 IMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIKE 450 >gnl|CDD|185547 PTZ00304, PTZ00304, NADH dehydrogenase [ubiquinone] flavoprotein 1; Provisional. Length = 461 Score = 716 bits (1850), Expect = 0.0 Identities = 294/426 (69%), Positives = 336/426 (78%), Gaps = 2/426 (0%) Query: 2 LTDQDRIFTNLYGLQGKSLSDSMSRGHWDNVDKILEKGRDWIINEVKASGLRGRGGAGFS 61 L DQDRIFTNLY + ++ RG W IL KG DWII+E+K SGLRGRGGAGF Sbjct: 22 LKDQDRIFTNLYRDFDTYIDGALKRGDWYRTKDILLKGHDWIIDEIKKSGLRGRGGAGFP 81 Query: 62 TGMKWSFMPKVCSD-RPHYLVVNADESEPGTCKDRDIMRHEPHTLIEGCVIASFAIGAHC 120 +G+KWSFMPKV D RP YLVVNADESEPGTCKDR+IMRH+PH L+EG ++A FA+ A Sbjct: 82 SGLKWSFMPKVKPDGRPSYLVVNADESEPGTCKDREIMRHDPHKLVEGALLAGFAMRARA 141 Query: 121 AFIYVRGEFIRERESLQAAVDECYASGLLGSNS-KLGYDVDIIVHHGAGAYICGEETALL 179 A+IY+RGEF E +LQ A+DE Y G LG N+ GYD D+ VH GAGAYICGEETAL+ Sbjct: 142 AYIYIRGEFYNEARALQQAIDEAYKKGFLGKNACGSGYDFDVYVHRGAGAYICGEETALI 201 Query: 180 ESLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGASWYSGFGRENNRGT 239 ES+EGK G+PRLKPPFPANVGLYGCPTTV NVE++AV PTILRRG W++ FGR NN GT Sbjct: 202 ESIEGKPGKPRLKPPFPANVGLYGCPTTVTNVETVAVSPTILRRGPQWFASFGRPNNAGT 261 Query: 240 KLFSISGHVNYPCTVEESMSITFDELIEKHCGGIRGGWDNLLAVIPGGSSVPCLPAGQMR 299 KLF ISGHVN PCTVEE MSI ELIE+HCGG+RGGWDNLL VIPGGSSVP +P Sbjct: 262 KLFCISGHVNNPCTVEEEMSIPLRELIERHCGGVRGGWDNLLCVIPGGSSVPLIPKEICD 321 Query: 300 GAIMDYDGLKEMGSGLGTAAVIVMDRSTDIIKAIWRLSVFYKHESCGQCTPCREGTGWMM 359 +MD+D LKE+ SGLGTAAVIVMD+STDII AI RLS FYKHESCGQCTPCREGT W++ Sbjct: 322 NVLMDFDALKEVQSGLGTAAVIVMDKSTDIIDAILRLSKFYKHESCGQCTPCREGTPWLV 381 Query: 360 RVMERLVKGIAQKREIDLLYEVSKNIEGRTICALGDAAAWPIQGLIKNFRPLIEERIDQY 419 ++MER V G A K EID L EVSK IEG TICALGDAAAWP+QGLI++FRP IEERI++Y Sbjct: 382 KMMERFVVGNADKEEIDTLEEVSKQIEGHTICALGDAAAWPVQGLIRHFRPEIEERIERY 441 Query: 420 HRCNFQ 425 N Sbjct: 442 WEANPH 447 >gnl|CDD|131014 TIGR01959, nuoF_fam, NADH-quinone oxidoreductase, F subunit. This model describes the F chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. Length = 411 Score = 697 bits (1801), Expect = 0.0 Identities = 258/412 (62%), Positives = 306/412 (74%), Gaps = 3/412 (0%) Query: 7 RIFTNLYGLQGKSLSDSMSRGHWDNVDKILEK-GRDWIINEVKASGLRGRGGAGFSTGMK 65 + TNL + +L + RG +D + K LE+ D II EVK SGLRGRGGAGF TG+K Sbjct: 1 VLTTNLDNPESWTLEEYEKRGGYDALRKALEEMSPDDIIEEVKDSGLRGRGGAGFPTGLK 60 Query: 66 WSFMPKVCSDRPHYLVVNADESEPGTCKDRDIMRHEPHTLIEGCVIASFAIGAHCAFIYV 125 WSFMPK S +P YLV NADESEPGTCKDRD+M +PH LIEG +IA++AIGAH +IY+ Sbjct: 61 WSFMPKDDSPKPKYLVCNADESEPGTCKDRDLMEFDPHQLIEGMIIAAYAIGAHRGYIYI 120 Query: 126 RGEFIRERESLQAAVDECYASGLLGSNS-KLGYDVDIIVHHGAGAYICGEETALLESLEG 184 RGEFI+E E+L+AA+ E YA+GLLG N G+D ++ VH GAGAYICGEETALLESLEG Sbjct: 121 RGEFIKEAENLEAAIAEAYAAGLLGKNILGSGFDFELFVHRGAGAYICGEETALLESLEG 180 Query: 185 KKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPTILRRGASWYSGFGRENNRGTKLFSI 244 K+GQPRLKPPFPA GLYG PT +NNVE++A VP ILRRGA WY G+E + GTKLFS+ Sbjct: 181 KRGQPRLKPPFPAVFGLYGKPTVINNVETLASVPAILRRGADWYRKLGKEKSPGTKLFSV 240 Query: 245 SGHVNYPCTVEESMSITFDELIEKHCGGIRGGWDNLLAVIPGGSSVPCLPAGQMRGAIMD 304 SGHVN P E + EL+E + GG+RGGW L AVIPGGSS P LPA Q A MD Sbjct: 241 SGHVNKPGNYELPLGTPLRELLEDYAGGMRGGW-KLKAVIPGGSSTPVLPAEQHLDAPMD 299 Query: 305 YDGLKEMGSGLGTAAVIVMDRSTDIIKAIWRLSVFYKHESCGQCTPCREGTGWMMRVMER 364 YD L GS LGT AVIVMD ST ++KA+ RLS FY HESCGQCTPCREGTGWM++++ER Sbjct: 300 YDSLAAAGSMLGTGAVIVMDESTCMVKAVRRLSEFYAHESCGQCTPCREGTGWMVKILER 359 Query: 365 LVKGIAQKREIDLLYEVSKNIEGRTICALGDAAAWPIQGLIKNFRPLIEERI 416 + +G K +IDLL V K IEG+TICALGDAAAWP+Q IK+FR E I Sbjct: 360 IEEGEGTKEDIDLLLSVCKQIEGKTICALGDAAAWPVQSAIKHFRDEFEAHI 411 >gnl|CDD|183071 PRK11278, PRK11278, NADH dehydrogenase I subunit F; Provisional. Length = 448 Score = 336 bits (863), Expect = 8e-93 Identities = 165/382 (43%), Positives = 230/382 (60%), Gaps = 6/382 (1%) Query: 41 DWIINEVKASGLRGRGGAGFSTGMKWSFMPKVCSDRPHYLVVNADESEPGTCKDRDIMRH 100 D I+N+VK +GL+GRGGAGFSTG+KWS MPK S YL+ NADE EPGT KDR +M Sbjct: 49 DEIVNQVKDAGLKGRGGAGFSTGLKWSLMPKDESMNIRYLLCNADEMEPGTYKDRLLMEQ 108 Query: 101 EPHTLIEGCVIASFAIGAHCAFIYVRGEFIRERESLQAAVDECYASGLLGSN-SKLGYDV 159 PH L+EG +I++FA+ A+ +I++RGE+I +L+ A+ E +GLLG N G+D Sbjct: 109 LPHLLVEGMLISAFALKAYRGYIFLRGEYIEAAVNLRRAIAEATEAGLLGKNIMGTGFDF 168 Query: 160 DIIVHHGAGAYICGEETALLESLEGKKGQPRLKPPFPANVGLYGCPTTVNNVESIAVVPT 219 ++ VH GAG YICGEETAL+ SLEG++ PR KPPFPA G++G PT VNNVE++ VP Sbjct: 169 ELFVHTGAGRYICGEETALINSLEGRRANPRSKPPFPATSGVWGKPTCVNNVETLCNVPA 228 Query: 220 ILRRGASWYSGFGRE--NNRGTKLFSISGHVNYPCTVEESMSITFDELIEKHCGGIRGGW 277 IL G WY + + GTKL SG V P E T E++E + GG+R G Sbjct: 229 ILANGVEWYQNISKGKSKDAGTKLMGFSGRVKNPGLWELPFGTTAREILEDYAGGMRDGL 288 Query: 278 DNLLAVIPGGSSVPCLPAGQMRGAIMDYDGLKEMGSGLGTAAVIVMDRSTDIIKAIWRLS 337 A PGG+ L + M+++ + + GS LGTA + +D +++ + L Sbjct: 289 -KFKAWQPGGAGTDFLTEAHL-DLPMEFESIGKAGSRLGTALAMAVDHEINMVSLVRNLE 346 Query: 338 VFYKHESCGQCTPCREGTGWMMRVMERLVKGIAQKREIDLLYEVSKNI-EGRTICALGDA 396 F+ ESCG CTPCR+G W ++++ L +G Q +I+ L ++ + + G+T CA Sbjct: 347 EFFARESCGWCTPCRDGLPWSVKILRALERGEGQPGDIETLEQLCRFLGPGKTFCAHAPG 406 Query: 397 AAWPIQGLIKNFRPLIEERIDQ 418 A P+Q IK FR E I Q Sbjct: 407 AVEPLQSAIKYFREEFEAGIKQ 428 >gnl|CDD|151119 pfam10589, NADH_4Fe-4S, NADH-ubiquinone oxidoreductase-F iron-sulfur binding region. Length = 46 Score = 76.4 bits (189), Expect = 1e-14 Identities = 19/46 (41%), Positives = 32/46 (69%) Query: 330 IKAIWRLSVFYKHESCGQCTPCREGTGWMMRVMERLVKGIAQKREI 375 + RL+ F+ HESCG+CTPCREGT W+ +++R+ +G + ++ Sbjct: 1 VAVARRLAEFFAHESCGKCTPCREGTKWLAEILDRIEEGKGTEEDL 46 >gnl|CDD|162617 TIGR01945, rnfC, electron transport complex, RnfABCDGE type, C subunit. The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation. A closely related complex in E. coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized. This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This model describes the C subunit. Length = 435 Score = 48.1 bits (115), Expect = 4e-06 Identities = 58/264 (21%), Positives = 103/264 (39%), Gaps = 57/264 (21%) Query: 36 LEKGRDW-------IINEVKASGLRGRGGAGFSTGMKWSFMPKVCSDRPHYLVVNADESE 88 LE D+ I+ +++A+G+ G GGA F T +K + P+ + L++N E E Sbjct: 110 LEPIPDFENLSPEEILEKIRAAGIVGLGGATFPTHVKLNPPPE---KKIETLIINGAECE 166 Query: 89 PG-TCKDRDIMRHEPHTLIEGCVIASFAIGAHCAFIYVRGEFIRERESLQAAVDECYASG 147 P TC DR +MR +I G I +G I + +L+ A+ Sbjct: 167 PYLTCDDR-LMRERAEEIIGGIRILLKILGVKKVVIGIEDNKPEAIAALKKALG------ 219 Query: 148 LLGSNSKLGYDVDIIVHHGAGAYICGEETALLESLEGK---KGQPRLKPPFPANVGLYGC 204 GY++ + V Y G E L+ +L G+ G PA++G+ Sbjct: 220 --------GYNIKVRVL--PTKYPQGGEKQLIYALTGREVPSGG------LPADIGV--- 260 Query: 205 PTTVNNVESIAVVPTILRRGASWYSGFGRENNRGTKLFSISGH-VNYPCTVEESMSITFD 263 V NV + + + G ++ +++G + P + + Sbjct: 261 --VVQNVGTAFAIYEAVVNGKPLIE----------RVVTVTGDAIRRPKNLWVLIGTPVS 308 Query: 264 ELIEKHCGGIRGGWDNLLAVIPGG 287 +++ CGG R + +I GG Sbjct: 309 DILA-FCGGFR---EKPERLIMGG 328 >gnl|CDD|179919 PRK05035, PRK05035, electron transport complex protein RnfC; Provisional. Length = 695 Score = 46.9 bits (112), Expect = 9e-06 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 6/85 (7%) Query: 40 RDWIINEVKASGLRGRGGAGFSTGMKWSFMPKVCSDRPHYLVVNADESEPG-TCKDRDIM 98 + +I ++ +G+ G GGAGF T +K P D+ L++N E EP T DR +M Sbjct: 127 PEELIERIRQAGIAGLGGAGFPTAVKLQ--PG--GDKIETLIINGAECEPYITADDR-LM 181 Query: 99 RHEPHTLIEGCVIASFAIGAHCAFI 123 R +IEG I + + I Sbjct: 182 RERADEIIEGIRILAHLLQPKEVLI 206 >gnl|CDD|151080 pfam10531, SLBB, SLBB domain. Length = 52 Score = 45.3 bits (108), Expect = 3e-05 Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 1/53 (1%) Query: 241 LFSISGHVNYPCTVEESMSITFDELIEKHCGGIRGGWDNLLAVIPGGSSVPCL 293 + ++SG V P E + T +LIE+ G L VIPGG + L Sbjct: 1 VVTVSGEVKRPGNYEVPIGTTLSDLIEQAGGLTDDA-RRLKRVIPGGPMMGIL 52 >gnl|CDD|178409 PLN02813, PLN02813, pfkB-type carbohydrate kinase family protein. Length = 426 Score = 29.4 bits (66), Expect = 1.7 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 5/50 (10%) Query: 105 LIEGCVIASFAIGAHCAFIYVRGEFIRERESLQAAVDEC-----YASGLL 149 L C + S GA ++I V+GE + S VD C YA+G+L Sbjct: 312 LSHFCPLVSVTDGARGSYIGVKGEAVYIPPSPCVPVDTCGAGDAYAAGIL 361 >gnl|CDD|181191 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated. Length = 647 Score = 29.5 bits (67), Expect = 1.8 Identities = 11/34 (32%), Positives = 14/34 (41%), Gaps = 15/34 (44%) Query: 345 CGQCTPCREGTGWMMRVMERLVKGIAQKREIDLL 378 CG+C CRE I Q R +DL+ Sbjct: 73 CGECDNCRE---------------IEQGRFVDLI 91 >gnl|CDD|183453 PRK12338, PRK12338, hypothetical protein; Provisional. Length = 319 Score = 29.3 bits (66), Expect = 2.1 Identities = 10/33 (30%), Positives = 19/33 (57%) Query: 360 RVMERLVKGIAQKREIDLLYEVSKNIEGRTICA 392 + ++RL + +K ++ LY +S N+ ICA Sbjct: 261 KFIKRLNENPKKKEDLKRLYSLSNNVHSHRICA 293 >gnl|CDD|162683 TIGR02071, PBP_1b, penicillin-binding protein 1B. Bacterial that synthesize a cell wall of peptidoglycan (murein) generally have several transglycosylases and transpeptidases for the task. This family consists of a particular bifunctional transglycosylase/transpeptidase in E. coli and other Proteobacteria, designated penicillin-binding protein 1B. Length = 730 Score = 28.1 bits (63), Expect = 3.9 Identities = 11/31 (35%), Positives = 15/31 (48%), Gaps = 10/31 (32%) Query: 226 SWYSGF----------GRENNRGTKLFSISG 246 SW++G GR++N TKL SG Sbjct: 640 SWFAGIDGKEVTIIWLGRDDNGPTKLTGASG 670 >gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau; Validated. Length = 451 Score = 28.2 bits (63), Expect = 4.1 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 4/36 (11%) Query: 342 HESCGQCTPCRE-GTGWMMRVMERLVKGIAQKREID 376 E C QC C+E +G + V+E + G A R I+ Sbjct: 72 QEPCNQCASCKEISSGTSLDVLE--IDG-ASHRGIE 104 >gnl|CDD|178499 PLN02911, PLN02911, inositol-phosphate phosphatase. Length = 296 Score = 27.8 bits (62), Expect = 5.0 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 8/46 (17%) Query: 142 ECYASGLLGSNSKLGYDVDIIVHHGAGAYICGEETALLESLEGKKG 187 +CYA GLL S G+ VD++V G Y + AL+ +EG G Sbjct: 215 DCYAYGLLAS----GH-VDLVVESGLKPY---DYLALVPVVEGAGG 252 >gnl|CDD|185649 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein; Provisional. Length = 522 Score = 27.8 bits (62), Expect = 5.2 Identities = 15/46 (32%), Positives = 24/46 (52%) Query: 17 GKSLSDSMSRGHWDNVDKILEKGRDWIINEVKASGLRGRGGAGFST 62 G ++ DS+ + K ++GRDW+ N +K S G G + F T Sbjct: 111 GLTIIDSLDTLKIMGLKKEYKEGRDWVANNLKQSKDTGLGVSVFET 156 >gnl|CDD|149700 pfam08724, Rep_N, Rep protein catalytic domain like. Adeno-associated virus (AAV) Replication (Rep) protein is essential for viral replication and integration. The catalytic domain has DNA binding and endonuclease activity. Length = 186 Score = 27.8 bits (62), Expect = 5.7 Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 8/60 (13%) Query: 78 HYLV-VNADESEPGTCK-----DRDIMRHEPHTLIEGCVIASFAIGAHCAFIYVRGEFIR 131 +LV NA P CK ++ H LI G + +G + + I G+F + Sbjct: 57 IFLVKWNAKLKVPEGCKYFLQAEKGEEGFHLHVLIGGPGVNPRVLGRYTSQI--EGKFNK 114 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.321 0.139 0.437 Gapped Lambda K H 0.267 0.0679 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 7,067,435 Number of extensions: 459892 Number of successful extensions: 835 Number of sequences better than 10.0: 1 Number of HSP's gapped: 821 Number of HSP's successfully gapped: 21 Length of query: 425 Length of database: 5,994,473 Length adjustment: 96 Effective length of query: 329 Effective length of database: 3,920,105 Effective search space: 1289714545 Effective search space used: 1289714545 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 58 (26.2 bits)