BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254781212|ref|YP_003065625.1| hypothetical protein
CLIBASIA_05595 [Candidatus Liberibacter asiaticus str. psy62]
         (109 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done



>gi|254781212|ref|YP_003065625.1| hypothetical protein CLIBASIA_05595 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040889|gb|ACT57685.1| hypothetical protein CLIBASIA_05595 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120677|gb|ADV02500.1| hypothetical protein SC1_gp100 [Liberibacter phage SC1]
 gi|317120821|gb|ADV02642.1| hypothetical protein SC1_gp100 [Candidatus Liberibacter asiaticus]
          Length = 109

 Score =  223 bits (567), Expect = 9e-57,   Method: Composition-based stats.
 Identities = 109/109 (100%), Positives = 109/109 (100%)

Query: 1   MVNFRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVE 60
           MVNFRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVE
Sbjct: 1   MVNFRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVE 60

Query: 61  GGLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSDNPRY 109
           GGLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSDNPRY
Sbjct: 61  GGLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSDNPRY 109


>gi|315122899|ref|YP_004063388.1| hypothetical protein CKC_05775 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496301|gb|ADR52900.1| hypothetical protein CKC_05775 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 108

 Score =  124 bits (312), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 62/104 (59%), Positives = 80/104 (76%)

Query: 1   MVNFRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVE 60
           M++F+ +A+ IK+  L RGYTVD   LA +LEEDE+R+R YK V++T  G+ VL DLMVE
Sbjct: 1   MIDFKAIAEKIKNTALGRGYTVDPVVLAERLEEDEKRLRLYKSVFATEAGKEVLIDLMVE 60

Query: 61  GGLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYS 104
           GGLLSS   D A +LA  EGKR++AV IAS+ GL+FE+IVQMYS
Sbjct: 61  GGLLSSPEIDDALKLAHCEGKRAMAVRIASSLGLNFEQIVQMYS 104


>gi|315121937|ref|YP_004062426.1| hypothetical protein CKC_00935 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495339|gb|ADR51938.1| hypothetical protein CKC_00935 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 104

 Score =  115 bits (288), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 59/98 (60%), Positives = 74/98 (75%)

Query: 7   LADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSS 66
           +A+ IK+  L RGYTVD   LA +LEEDE+R+R YK V++T  G+ VL DLMVEGGLLS 
Sbjct: 3   IAEKIKNTALGRGYTVDPVVLAERLEEDEKRLRLYKSVFATEAGKEVLIDLMVEGGLLSY 62

Query: 67  VSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYS 104
              D A +LA  EGKR++AV IAS+ GL+FE+IVQMYS
Sbjct: 63  PEIDDALKLAHCEGKRAMAVRIASSLGLNFEQIVQMYS 100


>gi|317120720|gb|ADV02542.1| hypothetical protein SC2_gp100 [Liberibacter phage SC2]
 gi|317120781|gb|ADV02602.1| hypothetical protein SC2_gp100 [Candidatus Liberibacter asiaticus]
          Length = 90

 Score = 58.9 bits (141), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/73 (39%), Positives = 46/73 (63%)

Query: 33  EDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHIASNC 92
           E   ++R YK V++T EGR+VL D+M EGGLL++  ++    LA  EGKR++A++I    
Sbjct: 12  EKVEQVRRYKSVFATFEGRWVLLDIMREGGLLATELSNDPIALARREGKRTIALYITDLI 71

Query: 93  GLSFERIVQMYSD 105
            L  E ++  Y +
Sbjct: 72  ALEAEELISAYRE 84


>gi|167041084|gb|ABZ05845.1| hypothetical protein ALOHA_HF400048F7ctg1g12 [uncultured marine
           microorganism HF4000_48F7]
          Length = 97

 Score = 38.1 bits (87), Expect = 0.41,   Method: Composition-based stats.
 Identities = 25/87 (28%), Positives = 46/87 (52%), Gaps = 8/87 (9%)

Query: 29  RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSS--VSNDSAHQLALLEGKRSLAV 86
           R  E++++R+  Y+ ++  P+G+ VL+DL    G+     V  D  +  A  +G+RS+ +
Sbjct: 2   RLSEKEKKRLADYRTIFEGPQGQRVLSDLCHRHGIFDPCHVPGD-PYSTAYNDGRRSVII 60

Query: 87  HIASNCGLSFER----IVQMYSD-NPR 108
            +    G   ER    ++Q Y D +PR
Sbjct: 61  DLLRYLGTDLERLDNLLIQPYGDYDPR 87


>gi|293571151|ref|ZP_06682189.1| CBS domain protein [Enterococcus faecium E980]
 gi|291608764|gb|EFF38048.1| CBS domain protein [Enterococcus faecium E980]
          Length = 443

 Score = 37.4 bits (85), Expect = 0.60,   Method: Composition-based stats.
 Identities = 26/106 (24%), Positives = 51/106 (48%), Gaps = 23/106 (21%)

Query: 6   KLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEG---- 61
           ++AD I  ++    +T+D+D    +LE               P+ +FV+T  MV G    
Sbjct: 314 QIADTISDQISGNIHTIDTDREGNRLE--------------NPKFKFVVTPQMVNGVGTI 359

Query: 62  --GLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSD 105
             G+LS + +D A +  ++  KR++ +       L + R++Q+ S+
Sbjct: 360 SFGVLSEIISDVAQKTMVMNQKRNILIE---QVNLHYLRLIQLESE 402


>gi|257899794|ref|ZP_05679447.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium Com15]
 gi|257837706|gb|EEV62780.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium Com15]
          Length = 443

 Score = 37.4 bits (85), Expect = 0.60,   Method: Composition-based stats.
 Identities = 26/106 (24%), Positives = 51/106 (48%), Gaps = 23/106 (21%)

Query: 6   KLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEG---- 61
           ++AD I  ++    +T+D+D    +LE               P+ +FV+T  MV G    
Sbjct: 314 QIADTISDQISGNIHTIDTDREGNRLE--------------NPKFKFVVTPQMVNGVGTI 359

Query: 62  --GLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSD 105
             G+LS + +D A +  ++  KR++ +       L + R++Q+ S+
Sbjct: 360 SFGVLSEIISDVAQKTMVMNQKRNILIE---QVNLHYLRLIQLESE 402


>gi|227552401|ref|ZP_03982450.1| CBS domain transcriptional regulator [Enterococcus faecium TX1330]
 gi|257888358|ref|ZP_05668011.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,141,733]
 gi|257896752|ref|ZP_05676405.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium Com12]
 gi|293378067|ref|ZP_06624243.1| DRTGG domain protein [Enterococcus faecium PC4.1]
 gi|227178455|gb|EEI59427.1| CBS domain transcriptional regulator [Enterococcus faecium TX1330]
 gi|257824412|gb|EEV51344.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,141,733]
 gi|257833317|gb|EEV59738.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium Com12]
 gi|292643322|gb|EFF61456.1| DRTGG domain protein [Enterococcus faecium PC4.1]
          Length = 443

 Score = 37.4 bits (85), Expect = 0.60,   Method: Composition-based stats.
 Identities = 26/106 (24%), Positives = 51/106 (48%), Gaps = 23/106 (21%)

Query: 6   KLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEG---- 61
           ++AD I  ++    +T+D+D    +LE               P+ +FV+T  MV G    
Sbjct: 314 QIADTISDQISGNIHTIDTDREGNRLE--------------NPKFKFVVTPQMVNGVGTI 359

Query: 62  --GLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSD 105
             G+LS + +D A +  ++  KR++ +       L + R++Q+ S+
Sbjct: 360 SFGVLSEIISDVAQKTMVMNQKRNILIE---QVNLHYLRLIQLESE 402


>gi|317152044|ref|YP_004120092.1| hypothetical protein Daes_0321 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942295|gb|ADU61346.1| hypothetical protein Daes_0321 [Desulfovibrio aespoeensis Aspo-2]
          Length = 76

 Score = 37.4 bits (85), Expect = 0.72,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query: 39 RHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          R Y+ ++ + +G+ V+ DL   G  L S  +    + AL EG+RSL +H+
Sbjct: 10 RAYRRLFESTDGQTVMEDLEQRGSFLRSTFSTDPGRTALNEGRRSLVLHV 59


>gi|84500627|ref|ZP_00998876.1| hypothetical protein OB2597_11731 [Oceanicola batsensis HTCC2597]
 gi|84391580|gb|EAQ03912.1| hypothetical protein OB2597_11731 [Oceanicola batsensis HTCC2597]
          Length = 443

 Score = 37.4 bits (85), Expect = 0.76,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 11/62 (17%)

Query: 10  MIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSN 69
           M   ++ ++G+T       RQ+  +ERR+ HY H+  TP+ R V       GGL+SS +N
Sbjct: 275 MSGEELAAQGWTT------RQMAYEERRLLHYFHL--TPDNRMVFGQ---RGGLISSAAN 323

Query: 70  DS 71
           +S
Sbjct: 324 ES 325


>gi|291515047|emb|CBK64257.1| Fic/DOC family [Alistipes shahii WAL 8301]
          Length = 327

 Score = 36.2 bits (82), Expect = 1.7,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 4   FRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPE 49
           FR  A+ I  + L +GY V++ A A QLEE ++ +R   HV +  E
Sbjct: 105 FRIWANKILKEYLIKGYAVNTQAKAEQLEELKKTVRLLSHVLAAKE 150


>gi|291334463|gb|ADD94117.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161]
 gi|291334660|gb|ADD94307.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695]
 gi|291334714|gb|ADD94360.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890]
 gi|291336440|gb|ADD95995.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073]
          Length = 78

 Score = 36.2 bits (82), Expect = 1.7,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 42/65 (64%), Gaps = 4/65 (6%)

Query: 29 RQLEEDERRIR-HYKHVYSTPEGRFVLTDLMVEGGLLSS--VSNDSAHQLALLEGKRSLA 85
          +QLE   +++R +Y+++++T EG+ VL+DL       S+  V  DS H+ A +EG+RS+ 
Sbjct: 5  KQLESLVKKLRENYQYIFNTDEGKEVLSDLEKRCHYHSTTNVKGDS-HESAYMEGQRSVL 63

Query: 86 VHIAS 90
          + I S
Sbjct: 64 LFIKS 68


>gi|269793671|ref|YP_003313126.1| DNA/RNA helicase [Sanguibacter keddieii DSM 10542]
 gi|269095856|gb|ACZ20292.1| DNA/RNA helicase, superfamily II, SNF2 family [Sanguibacter
           keddieii DSM 10542]
          Length = 1028

 Score = 35.8 bits (81), Expect = 2.2,   Method: Composition-based stats.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 12/95 (12%)

Query: 3   NFRKLADMIKSKVLSRGYTVDSDA--------LARQLEEDERRIRHYKHVYSTP----EG 50
           +F  L +MI ++  SRG TVDSDA        L R ++E   R R  K +   P    + 
Sbjct: 296 SFTALLEMIDNRRFSRGATVDSDALKDVTVRRLKRDIKEKNFRARELKTITFEPGSDEQE 355

Query: 51  RFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLA 85
            F   D ++E    ++    S   +++L  KR L+
Sbjct: 356 AFERLDAILEASARANGQKRSGDIVSMLLKKRFLS 390


>gi|69245236|ref|ZP_00603314.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium DO]
 gi|257880121|ref|ZP_05659774.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,230,933]
 gi|257882353|ref|ZP_05662006.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,502]
 gi|257885550|ref|ZP_05665203.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,501]
 gi|257891212|ref|ZP_05670865.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,410]
 gi|257894024|ref|ZP_05673677.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,408]
 gi|258614545|ref|ZP_05712315.1| CBS domain-containing protein [Enterococcus faecium DO]
 gi|260560270|ref|ZP_05832446.1| thioesterase superfamily protein [Enterococcus faecium C68]
 gi|261208206|ref|ZP_05922879.1| thioesterase superfamily protein [Enterococcus faecium TC 6]
 gi|289566578|ref|ZP_06446999.1| CBS domain-containing protein [Enterococcus faecium D344SRF]
 gi|293552927|ref|ZP_06673582.1| CBS domain protein [Enterococcus faecium E1039]
 gi|293560636|ref|ZP_06677123.1| CBS domain protein [Enterococcus faecium E1162]
 gi|293570170|ref|ZP_06681248.1| CBS domain protein [Enterococcus faecium E1071]
 gi|294615820|ref|ZP_06695663.1| CBS domain protein [Enterococcus faecium E1636]
 gi|294617809|ref|ZP_06697421.1| CBS domain protein [Enterococcus faecium E1679]
 gi|294623457|ref|ZP_06702309.1| CBS domain protein [Enterococcus faecium U0317]
 gi|314940202|ref|ZP_07847375.1| DRTGG domain protein [Enterococcus faecium TX0133a04]
 gi|314941740|ref|ZP_07848619.1| DRTGG domain protein [Enterococcus faecium TX0133C]
 gi|314947616|ref|ZP_07851025.1| DRTGG domain protein [Enterococcus faecium TX0082]
 gi|314950602|ref|ZP_07853682.1| DRTGG domain protein [Enterococcus faecium TX0133A]
 gi|314992531|ref|ZP_07857952.1| DRTGG domain protein [Enterococcus faecium TX0133B]
 gi|314995314|ref|ZP_07860423.1| DRTGG domain protein [Enterococcus faecium TX0133a01]
 gi|68195911|gb|EAN10345.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium DO]
 gi|257814349|gb|EEV43107.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,230,933]
 gi|257818011|gb|EEV45339.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,502]
 gi|257821406|gb|EEV48536.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,501]
 gi|257827572|gb|EEV54198.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,410]
 gi|257830403|gb|EEV57010.1| CBS:Thioesterase superfamily:DRTGG [Enterococcus faecium 1,231,408]
 gi|260073615|gb|EEW61941.1| thioesterase superfamily protein [Enterococcus faecium C68]
 gi|260077463|gb|EEW65181.1| thioesterase superfamily protein [Enterococcus faecium TC 6]
 gi|289161623|gb|EFD09502.1| CBS domain-containing protein [Enterococcus faecium D344SRF]
 gi|291587319|gb|EFF19205.1| CBS domain protein [Enterococcus faecium E1071]
 gi|291591310|gb|EFF22976.1| CBS domain protein [Enterococcus faecium E1636]
 gi|291595920|gb|EFF27201.1| CBS domain protein [Enterococcus faecium E1679]
 gi|291597130|gb|EFF28329.1| CBS domain protein [Enterococcus faecium U0317]
 gi|291602903|gb|EFF33100.1| CBS domain protein [Enterococcus faecium E1039]
 gi|291605387|gb|EFF34834.1| CBS domain protein [Enterococcus faecium E1162]
 gi|313590471|gb|EFR69316.1| DRTGG domain protein [Enterococcus faecium TX0133a01]
 gi|313592991|gb|EFR71836.1| DRTGG domain protein [Enterococcus faecium TX0133B]
 gi|313597149|gb|EFR75994.1| DRTGG domain protein [Enterococcus faecium TX0133A]
 gi|313599512|gb|EFR78355.1| DRTGG domain protein [Enterococcus faecium TX0133C]
 gi|313640522|gb|EFS05102.1| DRTGG domain protein [Enterococcus faecium TX0133a04]
 gi|313645857|gb|EFS10437.1| DRTGG domain protein [Enterococcus faecium TX0082]
          Length = 443

 Score = 35.4 bits (80), Expect = 2.3,   Method: Composition-based stats.
 Identities = 25/106 (23%), Positives = 50/106 (47%), Gaps = 23/106 (21%)

Query: 6   KLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEG---- 61
           ++AD I  ++    +T+D+D    +LE               P+ +FV+   MV G    
Sbjct: 314 QIADTISDQISGNIHTIDTDREGNRLE--------------NPKFKFVVAPQMVNGVGTI 359

Query: 62  --GLLSSVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSD 105
             G+LS + +D A +  ++  KR++ +       L + R++Q+ S+
Sbjct: 360 SFGVLSEIISDVAQKTMVMNQKRNILIE---QVNLHYLRLIQLESE 402


>gi|301046407|ref|ZP_07193567.1| conserved hypothetical protein [Escherichia coli MS 185-1]
 gi|300301633|gb|EFJ58018.1| conserved hypothetical protein [Escherichia coli MS 185-1]
          Length = 98

 Score = 35.4 bits (80), Expect = 2.3,   Method: Composition-based stats.
 Identities = 21/81 (25%), Positives = 46/81 (56%), Gaps = 4/81 (4%)

Query: 29  RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVH- 87
           +Q +  +R I   + V S+ +GR V+  ++ +G + S++S   A  +A  EG+R+LA+  
Sbjct: 16  KQRDMAQREIDDIRFVMSSEQGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75

Query: 88  ---IASNCGLSFERIVQMYSD 105
              + ++C   + ++V+  S+
Sbjct: 76  FQRVMAHCPEQYLKMVKEASE 96


>gi|186687127|ref|YP_001870270.1| helicase domain-containing protein [Nostoc punctiforme PCC 73102]
 gi|186469430|gb|ACC85229.1| helicase domain protein [Nostoc punctiforme PCC 73102]
          Length = 946

 Score = 35.0 bits (79), Expect = 3.0,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 7   LADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEG 61
           +AD ++ K+  +G  +   A+  +L EDER IR  + + S P+   V TD + EG
Sbjct: 502 VADALRQKLQKKGSQIRVIAITGELSEDEREIR-LEELKSYPQRVLVATDCLSEG 555


>gi|167752499|ref|ZP_02424626.1| hypothetical protein ALIPUT_00750 [Alistipes putredinis DSM 17216]
 gi|167659568|gb|EDS03698.1| hypothetical protein ALIPUT_00750 [Alistipes putredinis DSM 17216]
          Length = 343

 Score = 35.0 bits (79), Expect = 3.2,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 4   FRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPE 49
           FR  A+ +  + L +GY V++ A A QLEE ++ +R   HV +  E
Sbjct: 121 FRIWANKVLKEYLIKGYAVNNQAKAEQLEELKKTVRLLSHVLAAKE 166


>gi|117624711|ref|YP_853624.1| hypothetical protein APECO1_4042 [Escherichia coli APEC O1]
 gi|298381717|ref|ZP_06991316.1| conserved hypothetical protein [Escherichia coli FVEC1302]
 gi|301019344|ref|ZP_07183530.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|331648175|ref|ZP_08349265.1| conserved hypothetical protein [Escherichia coli M605]
 gi|115513835|gb|ABJ01910.1| conserved hypothetical protein [Escherichia coli APEC O1]
 gi|294491431|gb|ADE90187.1| conserved hypothetical protein [Escherichia coli IHE3034]
 gi|298279159|gb|EFI20673.1| conserved hypothetical protein [Escherichia coli FVEC1302]
 gi|299882261|gb|EFI90472.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|309702811|emb|CBJ02142.1| hypothetical phage protein [Escherichia coli ETEC H10407]
 gi|320175045|gb|EFW50158.1| hypothetical protein SDB_02406 [Shigella dysenteriae CDC 74-1112]
 gi|323156132|gb|EFZ42291.1| hypothetical protein ECEPECA14_1907 [Escherichia coli EPECa14]
 gi|324008559|gb|EGB77778.1| hypothetical protein HMPREF9532_01746 [Escherichia coli MS 57-2]
 gi|327252183|gb|EGE63855.1| hypothetical protein ECSTEC7V_3030 [Escherichia coli STEC_7v]
 gi|331043035|gb|EGI15175.1| conserved hypothetical protein [Escherichia coli M605]
 gi|332344353|gb|AEE57687.1| conserved hypothetical protein [Escherichia coli UMNK88]
          Length = 98

 Score = 35.0 bits (79), Expect = 3.4,   Method: Composition-based stats.
 Identities = 18/60 (30%), Positives = 36/60 (60%)

Query: 29 RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          +Q +  +R I   + V S+ +GR V+  ++ +G + S++S   A  +A  EG+R+LA+ +
Sbjct: 16 KQRDMAQREIDDIRFVMSSEQGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75


>gi|303239064|ref|ZP_07325594.1| peptidase S1 and S6 chymotrypsin/Hap [Acetivibrio cellulolyticus
           CD2]
 gi|302593402|gb|EFL63120.1| peptidase S1 and S6 chymotrypsin/Hap [Acetivibrio cellulolyticus
           CD2]
          Length = 424

 Score = 35.0 bits (79), Expect = 3.5,   Method: Composition-based stats.
 Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 9/101 (8%)

Query: 14  KVLSRGYT-----VDSDALARQLEEDERRIRHYKHVYSTPEGRFV---LTDLMVEGGLLS 65
           ++L RGY      ++ ++  R+  + + R R  + VY T     V   LT LMV GGL  
Sbjct: 7   RLLKRGYDNYEKFMNFNSAYRRDGKYDDRSRDGRRVYRTVALVLVCCILTSLMVGGGLYM 66

Query: 66  SVSNDSAHQLALLEGKRSLAVHIASNCGLSFERIVQMYSDN 106
            +SND   +L +L   ++ A  +  N  ++ E  +++ SD+
Sbjct: 67  KLSND-IKELTMLSANQAKADTVIGNRAVNLESALKLASDS 106


>gi|300898428|ref|ZP_07116769.1| conserved hypothetical protein [Escherichia coli MS 198-1]
 gi|300357895|gb|EFJ73765.1| conserved hypothetical protein [Escherichia coli MS 198-1]
          Length = 98

 Score = 34.7 bits (78), Expect = 4.6,   Method: Composition-based stats.
 Identities = 18/60 (30%), Positives = 36/60 (60%)

Query: 29 RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          +Q +  +R I   + V S+ +GR V+  ++ +G + S++S   A  +A  EG+R+LA+ +
Sbjct: 16 KQRDMAKREIDDIRFVMSSEQGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75


>gi|89152429|ref|YP_512262.1| hypothetical protein PhiV10p08 [Escherichia phage phiV10]
 gi|74055452|gb|AAZ95901.1| hypothetical protein PhiV10p08 [Escherichia phage phiV10]
          Length = 98

 Score = 34.3 bits (77), Expect = 5.3,   Method: Composition-based stats.
 Identities = 18/60 (30%), Positives = 35/60 (58%)

Query: 29 RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          +Q +  +R I   + V S  +GR V+  ++ +G + S++S   A  +A  EG+R+LA+ +
Sbjct: 16 KQRDMAQREIDDIRFVMSCEQGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75


>gi|323948684|gb|EGB44589.1| hypothetical protein ERKG_04907 [Escherichia coli H252]
          Length = 98

 Score = 34.3 bits (77), Expect = 6.1,   Method: Composition-based stats.
 Identities = 18/60 (30%), Positives = 35/60 (58%)

Query: 29 RQLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          +Q +  +R I   + V S+  GR V+  ++ +G + S++S   A  +A  EG+R+LA+ +
Sbjct: 16 KQRDMAQREIDDIRFVMSSEHGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75


>gi|218700989|ref|YP_002408618.1| hypothetical protein ECIAI39_2679 [Escherichia coli IAI39]
 gi|218370975|emb|CAR18802.1| conserved hypothetical protein from phage origin [Escherichia
          coli IAI39]
          Length = 98

 Score = 34.3 bits (77), Expect = 6.1,   Method: Composition-based stats.
 Identities = 18/59 (30%), Positives = 35/59 (59%)

Query: 30 QLEEDERRIRHYKHVYSTPEGRFVLTDLMVEGGLLSSVSNDSAHQLALLEGKRSLAVHI 88
          Q +  +R I   + V S+ +GR V+  ++ +G + S++S   A  +A  EG+R+LA+ +
Sbjct: 17 QRDMAQREIDDIRFVMSSEQGRRVVWSVLEKGRVFSAISPMDAMAMAFNEGQRNLALEL 75


>gi|307297844|ref|ZP_07577650.1| protein of unknown function DUF795 [Thermotogales bacterium
           mesG1.Ag.4.2]
 gi|306917104|gb|EFN47486.1| protein of unknown function DUF795 [Thermotogales bacterium
           mesG1.Ag.4.2]
          Length = 427

 Score = 34.3 bits (77), Expect = 6.5,   Method: Composition-based stats.
 Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 4   FRKLADMIKSKVLSRGYTVDSDALARQLEEDERRIRHYKHVYSTPEGRFVLTDLM 58
           +RK+ D      L +   +D+D L  QLE D   +R Y  ++  PE R   +DL+
Sbjct: 367 WRKVVD----GALKKEMKIDADLLEAQLERDFAAVRFYASLFKYPESRSRCSDLL 417


>gi|289617430|emb|CBI55918.1| unnamed protein product [Sordaria macrospora]
          Length = 570

 Score = 33.9 bits (76), Expect = 8.4,   Method: Composition-based stats.
 Identities = 17/37 (45%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 20  YTVDSDALARQLEEDERRIR----HYKHVYSTPEGRF 52
           YT     LAR + E+ RR R    HY H   TP GRF
Sbjct: 532 YTKSGKVLARDVHEESRRRRHVHNHYMHHRRTPAGRF 568


>gi|303389012|ref|XP_003072739.1| chromosome segregation ATPase [Encephalitozoon intestinalis ATCC
           50506]
 gi|303301881|gb|ADM11379.1| chromosome segregation ATPase [Encephalitozoon intestinalis ATCC
           50506]
          Length = 1159

 Score = 33.5 bits (75), Expect = 8.9,   Method: Composition-based stats.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 6   KLADMIKSKVLSRGYT-VDSDALARQLEEDERRIRHYKHVYS 46
           +L  MIK K + RG T V  DAL R  EE  R+I H++  YS
Sbjct: 412 ELESMIKEKKMIRGNTGVKIDALERSSEELGRKILHHEEKYS 453


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.320    0.134    0.372 

Lambda     K      H
   0.267   0.0426    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,109,027,210
Number of Sequences: 14124377
Number of extensions: 43486059
Number of successful extensions: 127063
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 127048
Number of HSP's gapped (non-prelim): 26
length of query: 109
length of database: 4,842,793,630
effective HSP length: 77
effective length of query: 32
effective length of database: 3,755,216,601
effective search space: 120166931232
effective search space used: 120166931232
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.8 bits)